RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6892
(190 letters)
>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
This family contains, amongst other G-protein-coupled
receptors (GCPRs), members of the opsin family, which
have been considered to be typical members of the
rhodopsin superfamily. They share several motifs,
mainly the seven transmembrane helices, GCPRs of the
rhodopsin superfamily. All opsins bind a chromophore,
such as 11-cis-retinal. The function of most opsins
other than the photoisomerases is split into two steps:
light absorption and G-protein activation.
Photoisomerases, on the other hand, are not coupled to
G-proteins - they are thought to generate and supply
the chromophore that is used by visual opsins.
Length = 251
Score = 63.1 bits (154), Expect = 2e-12
Identities = 15/40 (37%), Positives = 24/40 (60%)
Query: 37 LAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLV 76
L I K+L+ TN FL++LA+ DL+ + + P + LV
Sbjct: 1 LVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLV 40
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 47.7 bits (113), Expect = 1e-06
Identities = 24/37 (64%), Positives = 29/37 (78%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEK 139
G D EEEEEEEEEEEEEEE+E++E EE+ENE+
Sbjct: 856 GGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 43.8 bits (103), Expect = 2e-05
Identities = 22/36 (61%), Positives = 27/36 (75%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
+ EEEEEEEEEEEEE+E++E EE+E E E L E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897
Score = 42.7 bits (100), Expect = 5e-05
Identities = 23/69 (33%), Positives = 40/69 (57%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
A D ++E E+E E + E +QDE + + EE+E E+EE++EEEE+++
Sbjct: 823 ADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEE 882
Query: 164 GEEKRVEDK 172
EE+ E++
Sbjct: 883 EEEEEEENE 891
Score = 41.9 bits (98), Expect = 9e-05
Identities = 20/37 (54%), Positives = 26/37 (70%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
+E EEEEEEEEEEEEEEE+E +E L + E ++
Sbjct: 868 EEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQK 904
Score = 41.9 bits (98), Expect = 1e-04
Identities = 22/44 (50%), Positives = 30/44 (68%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
+ EEEEEEEEEEE+E++E EE+E E+E P E E +K+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 40.4 bits (94), Expect = 3e-04
Identities = 23/72 (31%), Positives = 39/72 (54%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E Q +DET +E E + E +++E+ D D + E EE+E E+EE+
Sbjct: 820 ETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEE 879
Query: 156 KEEEEDKQGEEK 167
+EEEE+++ E +
Sbjct: 880 EEEEEEEEEENE 891
Score = 40.4 bits (94), Expect = 3e-04
Identities = 20/36 (55%), Positives = 25/36 (69%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER 141
+E EEEEEEEEEEEEEEE E+ LE E +++
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 39.6 bits (92), Expect = 6e-04
Identities = 20/52 (38%), Positives = 29/52 (55%)
Query: 89 LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
L E + + G + + EEEEEEEEEE+E++E EE+E E+E
Sbjct: 837 LNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEE 888
Score = 38.4 bits (89), Expect = 0.001
Identities = 25/84 (29%), Positives = 45/84 (53%), Gaps = 4/84 (4%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED----EEKKE 157
AG +DE + E + ++ E ++E EQ+ E K+ + G D EE++E
Sbjct: 809 AGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEE 868
Query: 158 EEEDKQGEEKRVEDKIGDEKRLEE 181
EEE+++ EE+ E++ +E+ EE
Sbjct: 869 EEEEEEEEEEEEEEEEEEEEENEE 892
Score = 38.1 bits (88), Expect = 0.002
Identities = 24/94 (25%), Positives = 47/94 (50%), Gaps = 12/94 (12%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE--------- 153
G + E E++E E + E Q D ++++ E+E + E +DE
Sbjct: 799 GKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGS 858
Query: 154 ---EKKEEEEDKQGEEKRVEDKIGDEKRLEENKK 184
+ +EEEE+++ EE+ E++ +E+ EEN++
Sbjct: 859 DGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 35.7 bits (82), Expect = 0.013
Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 4/81 (4%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE----EEEDK 162
E +E + + E E EE E E + E E+E E EE E EDE + E E +
Sbjct: 698 EIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVET 757
Query: 163 QGEEKRVEDKIGDEKRLEENK 183
+G+ K E + E +E++
Sbjct: 758 EGDRKETEHEGETEAEGKEDE 778
Score = 35.4 bits (81), Expect = 0.014
Identities = 22/69 (31%), Positives = 31/69 (44%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
A + E E E E+E E E ++ EDE E E E E D ++ E E + +
Sbjct: 712 AEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETE 771
Query: 164 GEEKRVEDK 172
E K ED+
Sbjct: 772 AEGKEDEDE 780
Score = 35.4 bits (81), Expect = 0.014
Identities = 25/73 (34%), Positives = 33/73 (45%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
EE EE E+E E E E + + E D E E E E EDE++ E + + GE K
Sbjct: 734 TGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKG 793
Query: 169 VEDKIGDEKRLEE 181
E G + E
Sbjct: 794 DEGAEGKVEHEGE 806
Score = 35.4 bits (81), Expect = 0.015
Identities = 27/87 (31%), Positives = 38/87 (43%), Gaps = 4/87 (4%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQE-QDEYLEEDENEKERRLPEEDERGEDEE---KK 156
+ G E+E E E + E E E E E E E E E E D +GE E +
Sbjct: 658 ENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEH 717
Query: 157 EEEEDKQGEEKRVEDKIGDEKRLEENK 183
E E + +G E E + G+E E++
Sbjct: 718 EGETEAEGTEDEGEIETGEEGEEVEDE 744
Score = 35.0 bits (80), Expect = 0.020
Identities = 22/61 (36%), Positives = 29/61 (47%), Gaps = 2/61 (3%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
E + E+E E E E +E D E + E E E E EDE + E E+ GEE E
Sbjct: 688 ERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEE--GEEVEDEG 745
Query: 172 K 172
+
Sbjct: 746 E 746
Score = 35.0 bits (80), Expect = 0.021
Identities = 23/96 (23%), Positives = 43/96 (44%), Gaps = 7/96 (7%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+AG E +E E + E E E E + + E + +DE GE E E +
Sbjct: 784 QAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQG 843
Query: 161 DKQGEEKRVEDKIG-------DEKRLEENKKKKTQK 189
+ + +EK V+ G +E+ EE ++++ ++
Sbjct: 844 EAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEE 879
Score = 34.6 bits (79), Expect = 0.028
Identities = 18/68 (26%), Positives = 30/68 (44%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
G E + E E+E E E + E + + E E + E+E E + + + + E EE +
Sbjct: 657 GENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVE 716
Query: 163 QGEEKRVE 170
E E
Sbjct: 717 HEGETEAE 724
Score = 34.2 bits (78), Expect = 0.036
Identities = 18/83 (21%), Positives = 34/83 (40%), Gaps = 3/83 (3%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+ T E E E E E EE + E + K E + E + ++E E + + +
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAK 702
Query: 166 EKRVEDKIGDEKRLEENKKKKTQ 188
E + G+ + E + +T+
Sbjct: 703 EADHK---GETEAEEVEHEGETE 722
Score = 33.8 bits (77), Expect = 0.054
Identities = 30/88 (34%), Positives = 40/88 (45%), Gaps = 5/88 (5%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE-- 153
E+ + G E E E E EE E EQE E + ENE E +P E +GE E
Sbjct: 639 EHTGERTGEEGERPTEAEGENGEESGGEAEQE-GETETKGENESEGEIPAE-RKGEQEGE 696
Query: 154 -EKKEEEEDKQGEEKRVEDKIGDEKRLE 180
E + +E D +GE + E + E E
Sbjct: 697 GEIEAKEADHKGETEAEEVEHEGETEAE 724
Score = 33.0 bits (75), Expect = 0.081
Identities = 18/73 (24%), Positives = 32/73 (43%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
EE E E+E E + + E E E ER+ +E E + ++ + + + + EE
Sbjct: 657 GENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVE 716
Query: 169 VEDKIGDEKRLEE 181
E + E +E
Sbjct: 717 HEGETEAEGTEDE 729
Score = 32.7 bits (74), Expect = 0.11
Identities = 22/84 (26%), Positives = 46/84 (54%), Gaps = 3/84 (3%)
Query: 103 GARDETAREEEEEEEEEEEEEE---EQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
G R ET E E E E +E+E+E + +D ++ DE + + E + ++++ E +
Sbjct: 759 GDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQ 818
Query: 160 EDKQGEEKRVEDKIGDEKRLEENK 183
+ Q ++ V+D+ G+++ EN+
Sbjct: 819 SETQADDTEVKDETGEQELNAENQ 842
Score = 32.7 bits (74), Expect = 0.11
Identities = 22/72 (30%), Positives = 33/72 (45%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
A + E E E+E E E +E +E +E E E E + E E E + E E +
Sbjct: 712 AEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETE 771
Query: 162 KQGEEKRVEDKI 173
+G+E E +I
Sbjct: 772 AEGKEDEDEGEI 783
Score = 32.7 bits (74), Expect = 0.13
Identities = 20/84 (23%), Positives = 39/84 (46%), Gaps = 3/84 (3%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK---E 157
+ E +E+E E + E+ E + + + E+E E E+DE E +
Sbjct: 767 EGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDT 826
Query: 158 EEEDKQGEEKRVEDKIGDEKRLEE 181
E +D+ GE++ + G+ K+ E+
Sbjct: 827 EVKDETGEQELNAENQGEAKQDEK 850
Score = 32.3 bits (73), Expect = 0.17
Identities = 20/87 (22%), Positives = 43/87 (49%), Gaps = 1/87 (1%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE-RGEDEEKKEEEED 161
G + + E E E + +E E E E + +EDE+E E + E+ E +G++ + + E +
Sbjct: 745 GEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHE 804
Query: 162 KQGEEKRVEDKIGDEKRLEENKKKKTQ 188
+ E ++ G + ++ + K +
Sbjct: 805 GETEAGEKDEHEGQSETQADDTEVKDE 831
Score = 31.5 bits (71), Expect = 0.29
Identities = 23/88 (26%), Positives = 38/88 (43%), Gaps = 4/88 (4%)
Query: 103 GARDETAREEEEEE---EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
G + E E E E + E+E E E + + + E E E + E E ++E
Sbjct: 671 GETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEG 730
Query: 160 EDKQGEE-KRVEDKIGDEKRLEENKKKK 186
E + GEE + VED+ E + + +
Sbjct: 731 EIETGEEGEEVEDEGEGEAEGKHEVETE 758
Score = 30.0 bits (67), Expect = 0.78
Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 4/86 (4%)
Query: 101 KAGARDETAREEEEEEEE-EEEEEEEQEQDEYLEEDENEKERRLPE---EDERGEDEEKK 156
+ G E +E E E E + E E E ++ E E E E E + E E + GED E K
Sbjct: 733 ETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMK 792
Query: 157 EEEEDKQGEEKRVEDKIGDEKRLEEN 182
+E + E E + G++ E
Sbjct: 793 GDEGAEGKVEHEGETEAGEKDEHEGQ 818
Score = 29.6 bits (66), Expect = 1.3
Identities = 26/89 (29%), Positives = 37/89 (41%), Gaps = 10/89 (11%)
Query: 103 GARDET-AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE-RGEDEEKKEEEE 160
G E E EE E E E E E E+ E E E +GE+E + E
Sbjct: 633 GDVAEAEHTGERTGEEGERPTEAEGENGE-----ESGGEAEQEGETETKGENESEGEIPA 687
Query: 161 DKQGE---EKRVEDKIGDEKRLEENKKKK 186
+++GE E +E K D K E ++ +
Sbjct: 688 ERKGEQEGEGEIEAKEADHKGETEAEEVE 716
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 43.8 bits (104), Expect = 2e-05
Identities = 24/72 (33%), Positives = 47/72 (65%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E+EEEEE+ ++E+E++ D+ LEE+E + + EEDE ED + +++EE+++ E+
Sbjct: 317 GQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEK 376
Query: 167 KRVEDKIGDEKR 178
++ + K + R
Sbjct: 377 EKKKKKSAESTR 388
Score = 38.4 bits (90), Expect = 0.001
Identities = 21/76 (27%), Positives = 42/76 (55%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
+ EE+EEEEE+ ++DE ++D+ E+E + + EDEE ++ +++ EE+ E
Sbjct: 316 LGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEE 375
Query: 171 DKIGDEKRLEENKKKK 186
+ +K E + +
Sbjct: 376 KEKKKKKSAESTRSEL 391
Score = 38.4 bits (90), Expect = 0.001
Identities = 18/64 (28%), Positives = 42/64 (65%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
++ EE+ ++E+EE++++ ++E + D +++E +ED ED+E++EEEE ++
Sbjct: 319 GEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK 378
Query: 164 GEEK 167
++K
Sbjct: 379 KKKK 382
Score = 36.5 bits (85), Expect = 0.005
Identities = 17/68 (25%), Positives = 44/68 (64%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+E ++E+EE+++++ EE+E+D L ++E ++E ++++ E+EE+++E++ K+
Sbjct: 325 EEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSA 384
Query: 166 EKRVEDKI 173
E +
Sbjct: 385 ESTRSELP 392
Score = 35.7 bits (83), Expect = 0.009
Identities = 18/67 (26%), Positives = 45/67 (67%), Gaps = 4/67 (5%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+ +E + ++E+EE+++++ +E++E ++ + E++ EEDE +DE+ +EEEE
Sbjct: 318 QGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEED----EEDEDSDDEDDEEEEE 373
Query: 161 DKQGEEK 167
+++ ++K
Sbjct: 374 EEKEKKK 380
Score = 35.7 bits (83), Expect = 0.010
Identities = 25/86 (29%), Positives = 44/86 (51%), Gaps = 10/86 (11%)
Query: 106 DETAREEEE-----EEE-----EEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
+E A+EE E E E EEE++E+E+D D+ + E ++D G + +
Sbjct: 261 EELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGE 320
Query: 156 KEEEEDKQGEEKRVEDKIGDEKRLEE 181
++EEE++ G + E+ D+ EE
Sbjct: 321 EDEEEEEDGVDDEDEEDDDDDLEEEE 346
Score = 34.6 bits (80), Expect = 0.027
Identities = 31/93 (33%), Positives = 53/93 (56%), Gaps = 14/93 (15%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYL------------EEDENEKERRLPEEDER--GEDEEKK 156
EE++EEEE+ +E + DE+ EEDE E+E + +EDE +D E++
Sbjct: 286 EEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEE 345
Query: 157 EEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
EE+ D EE+ ED+ D++ EE ++++ +K
Sbjct: 346 EEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK 378
Score = 34.2 bits (79), Expect = 0.033
Identities = 19/52 (36%), Positives = 30/52 (57%)
Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
EEE +E+ E L++ E E+ RR+ E+E E+EE +E D +E +D
Sbjct: 260 EEELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDD 311
Score = 33.4 bits (77), Expect = 0.066
Identities = 18/69 (26%), Positives = 41/69 (59%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
+E ++E+EE+++++ E+E+++ DE E E +DE E+EE++E+E+ K+
Sbjct: 323 EEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKK 382
Query: 165 EEKRVEDKI 173
+ ++
Sbjct: 383 SAESTRSEL 391
Score = 33.0 bits (76), Expect = 0.091
Identities = 21/80 (26%), Positives = 42/80 (52%), Gaps = 8/80 (10%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
+ EE+EEEEE+ + E +E ++D EE+E D +EE+E+ + + +
Sbjct: 317 GQGEEDEEEEEDGVDDEDEEDDDDDL--------EEEEEDVDLSDEEEDEEDEDSDDEDD 368
Query: 171 DKIGDEKRLEENKKKKTQKR 190
++ +E++ ++ KK R
Sbjct: 369 EEEEEEEKEKKKKKSAESTR 388
Score = 32.3 bits (74), Expect = 0.15
Identities = 20/85 (23%), Positives = 49/85 (57%), Gaps = 3/85 (3%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
D+ ++++ + EE++E++E +DE+E++ ++D E+E+ +E++ E
Sbjct: 303 DDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDD---DDDLEEEEEDVDLSDEEEDEE 359
Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
++ +D+ +E+ EE +KKK +
Sbjct: 360 DEDSDDEDDEEEEEEEKEKKKKKSA 384
Score = 26.9 bits (60), Expect = 8.1
Identities = 12/40 (30%), Positives = 27/40 (67%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
+ D +EEE+EE+E+ ++E+ E++E E+++ +K+
Sbjct: 343 EEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKK 382
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 43.6 bits (103), Expect = 2e-05
Identities = 19/80 (23%), Positives = 37/80 (46%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
K AR + EE +E+E+EEEE+ ++ E E +++E+E ED E
Sbjct: 352 SKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSD 411
Query: 156 KEEEEDKQGEEKRVEDKIGD 175
+ + + +++ D +
Sbjct: 412 VGSDSESKADKESASDSDSE 431
Score = 35.5 bits (82), Expect = 0.013
Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 14/74 (18%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
+ EE +E+E+EEEEQ DE E+E GED E++ + + G + D
Sbjct: 366 DFEEVDEDEDEEEEQRSDE--------------HEEEEGEDSEEEGSQSREDGSSESSSD 411
Query: 172 KIGDEKRLEENKKK 185
D + + +
Sbjct: 412 VGSDSESKADKESA 425
Score = 31.2 bits (71), Expect = 0.29
Identities = 13/72 (18%), Positives = 27/72 (37%), Gaps = 4/72 (5%)
Query: 123 EEEQEQDEY----LEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKR 178
+E + + L+ + E+ +E+E +E +EEE + EE + G +
Sbjct: 349 TKESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSES 408
Query: 179 LEENKKKKTQKR 190
+ K
Sbjct: 409 SSDVGSDSESKA 420
Score = 30.1 bits (68), Expect = 0.68
Identities = 20/79 (25%), Positives = 30/79 (37%), Gaps = 19/79 (24%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
+ EE +E+E+EEEE++ + EE E+ EEE Q E
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDS-------------------EEEGSQSREDGSS 406
Query: 171 DKIGDEKRLEENKKKKTQK 189
+ D E+K K
Sbjct: 407 ESSSDVGSDSESKADKESA 425
Score = 28.9 bits (65), Expect = 1.8
Identities = 14/75 (18%), Positives = 24/75 (32%)
Query: 77 RVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDE 136
+ R L E + ++ +E E+ EEE + E+ E + D
Sbjct: 357 KRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDS 416
Query: 137 NEKERRLPEEDERGE 151
K + D E
Sbjct: 417 ESKADKESASDSDSE 431
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 43.5 bits (102), Expect = 3e-05
Identities = 31/110 (28%), Positives = 54/110 (49%), Gaps = 23/110 (20%)
Query: 72 ILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEY 131
I+ R +L+ + + N + ++ DE E++++EE++EEEEEE+E+ +
Sbjct: 130 IIIKRRRARHLAEEDMSPRDNFVIDDD-------DEDEDEDDDDEEDDEEEEEEEEEIKG 182
Query: 132 LEEDENEKERRLPEEDERGEDEEKKEEEEDKQ--------GEEKRVEDKI 173
++++ EEDE GED ++ E DK G V DKI
Sbjct: 183 FDDED--------EEDEGGEDFTYEKSEVDKTDCFKFIEAGAGDDVFDKI 224
Score = 33.5 bits (76), Expect = 0.057
Identities = 16/48 (33%), Positives = 31/48 (64%)
Query: 133 EEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLE 180
EED + ++ + ++D+ EDE+ +EE+D++ EE+ E K D++ E
Sbjct: 142 EEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEE 189
Score = 33.5 bits (76), Expect = 0.069
Identities = 21/67 (31%), Positives = 43/67 (64%), Gaps = 1/67 (1%)
Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLE 180
EE+ +D ++ +D++E E ++DE ++EE++EEEE K +++ ED+ G++ E
Sbjct: 141 AEEDMSPRDNFVIDDDDEDED-EDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYE 199
Query: 181 ENKKKKT 187
+++ KT
Sbjct: 200 KSEVDKT 206
Score = 30.8 bits (69), Expect = 0.50
Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 11/77 (14%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK------EEEEDK 162
AR EE+ + + DE +ED++++ E+DE E+EE++ E+EED+
Sbjct: 137 ARHLAEEDMSPRDNFVIDDDDEDEDEDDDDE-----EDDEEEEEEEEEIKGFDDEDEEDE 191
Query: 163 QGEEKRVEDKIGDEKRL 179
GE+ E D+
Sbjct: 192 GGEDFTYEKSEVDKTDC 208
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 42.6 bits (100), Expect = 5e-05
Identities = 21/90 (23%), Positives = 41/90 (45%), Gaps = 1/90 (1%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
K G D+ + ++ ++E EE+ D E+E +L E + E+ E+ E
Sbjct: 262 KRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSE 321
Query: 161 DKQGEEKRVEDKIGDE-KRLEENKKKKTQK 189
+ + E+ E + + K+L++ K KK
Sbjct: 322 ESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351
Score = 35.7 bits (82), Expect = 0.010
Identities = 17/81 (20%), Positives = 36/81 (44%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
R +++ +E + ++ + E E ++ PEE E E + E +Q E+
Sbjct: 263 RGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEE 322
Query: 170 EDKIGDEKRLEENKKKKTQKR 190
++ +E+ +KK K K+
Sbjct: 323 SEEEKNEEEGGLSKKGKKLKK 343
Score = 34.2 bits (78), Expect = 0.036
Identities = 19/85 (22%), Positives = 42/85 (49%), Gaps = 4/85 (4%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
E + E E++E+ EE EE++ +E E ++K ++L + + +K + + +
Sbjct: 306 PEIPAKPEIEQDEDSEESEEEKNEE--EGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSD 363
Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
+ ++ + D L KK+K K+
Sbjct: 364 DSDIDGE--DSVSLVTAKKQKEPKK 386
Score = 33.0 bits (75), Expect = 0.085
Identities = 13/71 (18%), Positives = 27/71 (38%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+ + EE E++ E + + +ED E E EE+ + KK ++ +
Sbjct: 290 SSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKN 349
Query: 166 EKRVEDKIGDE 176
+D +
Sbjct: 350 GLDKDDSDSGD 360
Score = 31.5 bits (71), Expect = 0.23
Identities = 19/88 (21%), Positives = 45/88 (51%), Gaps = 3/88 (3%)
Query: 105 RDETAREEEE---EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
+D +E++ ++ E+ +EE+ + + + +N+K+ ++ +RG D++ E + D
Sbjct: 216 KDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSD 275
Query: 162 KQGEEKRVEDKIGDEKRLEENKKKKTQK 189
+E R ED I D + +++ K
Sbjct: 276 DGDDEGREEDYISDSSASGNDPEEREDK 303
Score = 26.5 bits (58), Expect = 9.6
Identities = 16/67 (23%), Positives = 33/67 (49%)
Query: 120 EEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRL 179
+E+EE + + ++ K + L +DE DE K E+ + + K+ + K+ K+
Sbjct: 195 DEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKK 254
Query: 180 EENKKKK 186
++ KK
Sbjct: 255 LDDDKKG 261
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 42.8 bits (101), Expect = 5e-05
Identities = 22/60 (36%), Positives = 34/60 (56%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK 167
+ REEEE + E EE E EQ E EED +E EDE +++E +E + +++ E +
Sbjct: 390 SQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEAR 449
Score = 34.0 bits (78), Expect = 0.040
Identities = 20/73 (27%), Positives = 33/73 (45%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
D+ A + + E + EEEE Q E E E E+ R E+ E E+E + + E
Sbjct: 377 DQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDE 436
Query: 166 EKRVEDKIGDEKR 178
+ + + +E R
Sbjct: 437 LEEAQPEEEEEAR 449
Score = 33.6 bits (77), Expect = 0.043
Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 1/76 (1%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP-EEDERGEDEEKKEEEEDKQGEEKRV 169
E ++E E+ + E Q E E + E E RL E+ ER E++ + E +E
Sbjct: 374 AEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFED 433
Query: 170 EDKIGDEKRLEENKKK 185
ED++ + + EE + +
Sbjct: 434 EDELEEAQPEEEEEAR 449
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 42.6 bits (100), Expect = 5e-05
Identities = 33/81 (40%), Positives = 44/81 (54%), Gaps = 2/81 (2%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E+ + +E EEEEEEEEEEEEEEQE +E EE E+E+E E D E+E +
Sbjct: 430 ESPSMASQESEEEESVEEEEEEEEEEEEEEQESEE--EEGEDEEEEEEVEADNGSEEEME 487
Query: 156 KEEEEDKQGEEKRVEDKIGDE 176
E D GEE + + +
Sbjct: 488 GSSEGDGDGEEPEEDAERRNS 508
Score = 40.7 bits (95), Expect = 3e-04
Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 5/90 (5%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP-----EEDERGEDEEK 155
A E EEEEEEEEEEEEE+++ E E ++ E+E + EE+ G E
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGD 493
Query: 156 KEEEEDKQGEEKRVEDKIGDEKRLEENKKK 185
+ EE ++ E+R + G + E + +
Sbjct: 494 GDGEEPEEDAERRNSEMAGISRMSEGQQPR 523
Score = 39.5 bits (92), Expect = 5e-04
Identities = 24/95 (25%), Positives = 45/95 (47%), Gaps = 2/95 (2%)
Query: 95 CENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
+ + + + + +E EEEE EEEE+E++E EE++ +E +E+E E E
Sbjct: 419 SDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEA 478
Query: 155 KKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
EE+ +G + D G+E + ++
Sbjct: 479 DNGSEEEMEGSSEGDGD--GEEPEEDAERRNSEMA 511
Score = 39.1 bits (91), Expect = 7e-04
Identities = 23/78 (29%), Positives = 40/78 (51%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+E EEEEE+E EEEE E++E++E +E D +E + G+ EE +E+ E + E
Sbjct: 450 EEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSE 509
Query: 166 EKRVEDKIGDEKRLEENK 183
+ ++ +
Sbjct: 510 MAGISRMSEGQQPRGSSV 527
Score = 28.7 bits (64), Expect = 2.1
Identities = 23/97 (23%), Positives = 39/97 (40%), Gaps = 11/97 (11%)
Query: 96 ENQLFKAGARDETAREEEEEE--EEEEEEEEEQEQDEYLEEDENEKER---------RLP 144
E Q + ++ EEE E EEE E E D EE E + ER R+
Sbjct: 458 EEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMS 517
Query: 145 EEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
E + + E +++ + + ++ + E+ EE
Sbjct: 518 EGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEE 554
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 41.9 bits (99), Expect = 1e-04
Identities = 16/80 (20%), Positives = 36/80 (45%), Gaps = 4/80 (5%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
+ EEE ++E ++ E ++D+ E + EEDE E E+++D ++
Sbjct: 105 DIEEELQDEPRYDDAYRDLEEDDDDDEESD----EEDEESSKSEDDEDDDDDDDDDDIAT 160
Query: 171 DKIGDEKRLEENKKKKTQKR 190
+ E+R + ++ +
Sbjct: 161 RERSLERRRRRREWEEKRAE 180
Score = 40.7 bits (96), Expect = 2e-04
Identities = 22/90 (24%), Positives = 45/90 (50%), Gaps = 5/90 (5%)
Query: 92 NILCENQ--LFKAGARDETAREEEEEEEEEEEEEEEQEQDEYL-EEDENEKERRLPEEDE 148
N+ +Q +F G +E ++E ++ + EE+ + DE EEDE + E+D+
Sbjct: 91 NVFGSDQVVIFDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDD 150
Query: 149 RGEDEEKKEEEEDKQGEEKRVEDKIGDEKR 178
+D++ + ++ E+R + +EKR
Sbjct: 151 DDDDDD--DIATRERSLERRRRRREWEEKR 178
Score = 39.2 bits (92), Expect = 7e-04
Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 6/78 (7%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
+ EEE ++E + LEED+++ +E+ EDEE + E+D+ ++ +D
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEEDDDD------DEESDEEDEESSKSEDDEDDDDDDDDDD 157
Query: 173 IGDEKRLEENKKKKTQKR 190
I +R E ++++ +
Sbjct: 158 IATRERSLERRRRRREWE 175
Score = 34.6 bits (80), Expect = 0.022
Identities = 11/67 (16%), Positives = 34/67 (50%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
D EE+++++EE +EE++E + ++++++ + + R E++ + +
Sbjct: 117 DDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEE 176
Query: 165 EEKRVED 171
+ +E
Sbjct: 177 KRAELEF 183
Score = 32.3 bits (74), Expect = 0.15
Identities = 20/59 (33%), Positives = 33/59 (55%), Gaps = 4/59 (6%)
Query: 104 ARDETAREEEEEEEEEEE----EEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
+E ++EE +EE+EE E++E + D+ ++D +ER L R E EEK+ E
Sbjct: 122 DLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAE 180
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 41.7 bits (97), Expect = 1e-04
Identities = 28/90 (31%), Positives = 52/90 (57%), Gaps = 6/90 (6%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE----RGEDEEKKEEEE 160
+ E A+ + EE ++ EEE+++ EQ + E +E +K L + +E + +E KK EE+
Sbjct: 1614 KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
K+ EE + ++ DEK+ E KK+ ++
Sbjct: 1674 KKKAEEAKKAEE--DEKKAAEALKKEAEEA 1701
Score = 41.3 bits (96), Expect = 2e-04
Identities = 28/101 (27%), Positives = 56/101 (55%)
Query: 90 KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
++N + + K D+ EE ++ EE+E++ E + E E + E+ ++ E+++
Sbjct: 1657 EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKK 1716
Query: 150 GEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+E KK EEE+K E+ ++ D+K+ EE KK + +K+
Sbjct: 1717 KAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757
Score = 37.4 bits (86), Expect = 0.003
Identities = 27/93 (29%), Positives = 50/93 (53%), Gaps = 1/93 (1%)
Query: 98 QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
QL K A ++ EE ++ EEE + + +E + E+ + +E + EEDE+ E K+
Sbjct: 1637 QLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696
Query: 158 EEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
E E+ + E+ ++ K +EK+ E KK ++
Sbjct: 1697 EAEEAKKAEE-LKKKEAEEKKKAEELKKAEEEN 1728
Score = 37.4 bits (86), Expect = 0.003
Identities = 26/93 (27%), Positives = 56/93 (60%), Gaps = 1/93 (1%)
Query: 98 QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
+L KA + + +++E EE+++ EE ++ E + + +E + EED++ +E KK
Sbjct: 1624 ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKA 1683
Query: 158 EEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
EE++K+ E ++ + + K+ EE KKK+ +++
Sbjct: 1684 EEDEKKAAEA-LKKEAEEAKKAEELKKKEAEEK 1715
Score = 37.0 bits (85), Expect = 0.004
Identities = 26/80 (32%), Positives = 49/80 (61%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
EEE++ + EE ++ E+ + + E + E+E++ E+ ++ E EEKK+ EE K+ EE+
Sbjct: 1602 EEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKI 1661
Query: 171 DKIGDEKRLEENKKKKTQKR 190
+ K+ EE+KKK + +
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAK 1681
Score = 37.0 bits (85), Expect = 0.004
Identities = 22/88 (25%), Positives = 51/88 (57%), Gaps = 6/88 (6%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE----RGEDEEKKEEEED 161
+E ++ E ++E EE ++ E+ + E +E +K L + +E + E+ +K+ EE+
Sbjct: 1684 EEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDK 1743
Query: 162 KQGEEKRVEDKIGDEKRLEENKKKKTQK 189
K+ EE + ++ ++K++ KK++ +K
Sbjct: 1744 KKAEEAKKDE--EEKKKIAHLKKEEEKK 1769
Score = 36.7 bits (84), Expect = 0.006
Identities = 30/103 (29%), Positives = 49/103 (47%), Gaps = 8/103 (7%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E A + E A++ EE EE + E+E+ EE + +E ++ E+ + +EEK
Sbjct: 1573 EEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEK 1632
Query: 156 K--------EEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
K E EE K+ EE + ++ K EE KK + K+
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK 1675
Score = 36.3 bits (83), Expect = 0.008
Identities = 22/93 (23%), Positives = 50/93 (53%)
Query: 98 QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
++ K ++ + EE ++ EE + + E+ + E+ + E+ ++ E+++ +E KK
Sbjct: 1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA 1655
Query: 158 EEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
EEE+K + + D+K+ EE KK + ++
Sbjct: 1656 EEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688
Score = 35.5 bits (81), Expect = 0.014
Identities = 24/85 (28%), Positives = 49/85 (57%), Gaps = 2/85 (2%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
DE ++ EE+++ +E +++ +E + E + +E + EE ++ +E KK +E K+ E
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAE 1480
Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
E + D+ +K+ EE KKK + +
Sbjct: 1481 EAKKADEA--KKKAEEAKKKADEAK 1503
Score = 35.1 bits (80), Expect = 0.018
Identities = 21/87 (24%), Positives = 46/87 (52%), Gaps = 3/87 (3%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
DE + +++ +E +++ +E+ + E + +E + +E ++ +E KK EE K+ E
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467
Query: 166 EKRVED---KIGDEKRLEENKKKKTQK 189
E + D K +E + + KKK ++
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEE 1494
Score = 35.1 bits (80), Expect = 0.021
Identities = 26/92 (28%), Positives = 50/92 (54%), Gaps = 7/92 (7%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER-----RLPEEDERGEDEEKKEE 158
A DE EE+ E E+++EE +++ + ++ EK++ + EED++ DE KK
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414
Query: 159 EEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
K+ +E + K ++K+ +E KKK + +
Sbjct: 1415 AAKKKADE--AKKKAEEKKKADEAKKKAEEAK 1444
Score = 34.7 bits (79), Expect = 0.024
Identities = 29/102 (28%), Positives = 54/102 (52%), Gaps = 7/102 (6%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
L K + A E +++E EE+++ EE ++ E + + E+ ++ EED++ +E K
Sbjct: 1691 AEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAK 1750
Query: 156 KEEEE-------DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
K+EEE K+ E+K E + E +EE ++ +KR
Sbjct: 1751 KDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKR 1792
Score = 34.7 bits (79), Expect = 0.027
Identities = 25/94 (26%), Positives = 52/94 (55%), Gaps = 6/94 (6%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE----RRLPEEDERGEDEEKK 156
+A E A+ E E +E E EE+ + +++E +K+ ++ EE ++ ++ +KK
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
Query: 157 EEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
EE+ K+ +E ++ +K+ +E KKK +K+
Sbjct: 1400 AEEDKKKADE--LKKAAAAKKKADEAKKKAEEKK 1431
Score = 34.3 bits (78), Expect = 0.031
Identities = 26/94 (27%), Positives = 55/94 (58%), Gaps = 6/94 (6%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
KA A + A E+++ +E +++ EE++++ + L++ K++ +E ++ +E+KK +E
Sbjct: 1379 KADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK--ADEAKKKAEEKKKADEA 1436
Query: 161 DKQGEEKRVED----KIGDEKRLEENKKKKTQKR 190
K+ EE + D K + K+ EE KKK + +
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470
Score = 34.0 bits (77), Expect = 0.041
Identities = 20/89 (22%), Positives = 49/89 (55%), Gaps = 4/89 (4%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
E +EE +++ + +++ E+++ + + E++++ +E ++ +KK +E K+ E
Sbjct: 1369 AEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAE 1428
Query: 166 EKRVED----KIGDEKRLEENKKKKTQKR 190
EK+ D K + K+ +E KKK + +
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAK 1457
Score = 34.0 bits (77), Expect = 0.044
Identities = 20/85 (23%), Positives = 47/85 (55%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
DE ++ EE +++ +E ++ E + +E + +E + +E ++ E+ +K +E + + +
Sbjct: 1486 DEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEK 1545
Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
+K E K +E + E KKK + +
Sbjct: 1546 KKADELKKAEELKKAEEKKKAEEAK 1570
Score = 33.6 bits (76), Expect = 0.063
Identities = 21/90 (23%), Positives = 49/90 (54%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+A + E A+++ +E ++ E +++ ++ + EE + E + EE ++ ++ +K EE++
Sbjct: 1487 EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKK 1546
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+K E K +EK+ E KK + +
Sbjct: 1547 KADELKKAEELKKAEEKKKAEEAKKAEEDK 1576
Score = 33.2 bits (75), Expect = 0.071
Identities = 26/92 (28%), Positives = 47/92 (51%), Gaps = 2/92 (2%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
KA + A E ++ E +++ +E ++ +E + DE +K + DE + EEKK+ +E
Sbjct: 1491 KAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADE 1550
Query: 161 DKQGEE--KRVEDKIGDEKRLEENKKKKTQKR 190
K+ EE K E K +E + E K ++
Sbjct: 1551 LKKAEELKKAEEKKKAEEAKKAEEDKNMALRK 1582
Score = 33.2 bits (75), Expect = 0.082
Identities = 25/88 (28%), Positives = 50/88 (56%), Gaps = 1/88 (1%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
A + +E ++ EE ++ +E ++ +E + DE +K L + +E+ + EE K+ EEDK
Sbjct: 1518 AEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKN 1577
Query: 164 GEEKRVED-KIGDEKRLEENKKKKTQKR 190
++ E+ K +E R+EE K +++
Sbjct: 1578 MALRKAEEAKKAEEARIEEVMKLYEEEK 1605
Score = 33.2 bits (75), Expect = 0.086
Identities = 23/87 (26%), Positives = 46/87 (52%), Gaps = 2/87 (2%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+E + EE +++ EE ++ ++ + + E + ++ ++ EE ++ DE KK E K+ +
Sbjct: 1454 EEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKAD 1513
Query: 166 E--KRVEDKIGDEKRLEENKKKKTQKR 190
E K E K DE + E KK + +
Sbjct: 1514 EAKKAEEAKKADEAKKAEEAKKADEAK 1540
Score = 32.4 bits (73), Expect = 0.13
Identities = 21/94 (22%), Positives = 50/94 (53%), Gaps = 6/94 (6%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
KA + A E++++ +E ++ +++ + E + +E++ +E ++ +E KK +E
Sbjct: 1392 KADEAKKKAEEDKKKADELKKAAAAKKKAD--EAKKKAEEKKKADEAKKKAEEAKKADEA 1449
Query: 161 DKQGEEKR----VEDKIGDEKRLEENKKKKTQKR 190
K+ EE + + K + K+ +E KKK + +
Sbjct: 1450 KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAK 1483
Score = 32.4 bits (73), Expect = 0.14
Identities = 25/96 (26%), Positives = 45/96 (46%), Gaps = 6/96 (6%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
KA + A +++ EE ++ E + + DE E E E+ ++E KK+ +
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382
Query: 161 -DKQGEEKRVEDKIG-----DEKRLEENKKKKTQKR 190
K+ EEK+ D+ D+K+ +E KK K+
Sbjct: 1383 AKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKK 1418
Score = 32.4 bits (73), Expect = 0.15
Identities = 25/90 (27%), Positives = 46/90 (51%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
KA A + EE ++ EE E+ Y EE + + E E+ + + EE K+ EE
Sbjct: 1571 KAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEE 1630
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+K+ E+ + + ++K+ EE KK + + +
Sbjct: 1631 EKKKVEQLKKKEAEEKKKAEELKKAEEENK 1660
Score = 32.0 bits (72), Expect = 0.19
Identities = 25/102 (24%), Positives = 53/102 (51%)
Query: 89 LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
LKK + + +A + E ++ +E +++ EE ++ E + EE + +E + E+
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEA 1469
Query: 149 RGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+ DE KK+ EE K+ +E + + + +K E K + +K+
Sbjct: 1470 KKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
Score = 32.0 bits (72), Expect = 0.19
Identities = 18/86 (20%), Positives = 42/86 (48%), Gaps = 3/86 (3%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
A+ + +++ + +++ EE ++ E + + +E+ E EKK+EE K+
Sbjct: 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Query: 164 GEEKRVEDKIGDEKRLEENKKKKTQK 189
+ + K +EK+ + KKK ++
Sbjct: 1380 ADAAK---KKAEEKKKADEAKKKAEE 1402
Score = 32.0 bits (72), Expect = 0.21
Identities = 24/94 (25%), Positives = 46/94 (48%), Gaps = 4/94 (4%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+A + E A++ +E +++ EE ++ +E + EE + E + E+ + DE KK+ EE
Sbjct: 1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEE 1494
Query: 161 DKQGEE----KRVEDKIGDEKRLEENKKKKTQKR 190
K+ + K DE + E KK + +
Sbjct: 1495 AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAK 1528
Score = 32.0 bits (72), Expect = 0.21
Identities = 26/87 (29%), Positives = 41/87 (47%), Gaps = 3/87 (3%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+E + EEE EE + EE + K + DE + EEKK+ +E K+ E
Sbjct: 1240 EEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAE 1299
Query: 166 EKRVED---KIGDEKRLEENKKKKTQK 189
EK+ D K +E + + KKK ++
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEE 1326
Score = 31.6 bits (71), Expect = 0.22
Identities = 30/95 (31%), Positives = 49/95 (51%), Gaps = 5/95 (5%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEK----ERRLPEEDERGEDEEKK 156
KA + A E ++ +E +++ EE ++ DE ++ E K E + E ++ DE KK
Sbjct: 1458 KAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKK 1517
Query: 157 EEEEDKQGEEKRVED-KIGDEKRLEENKKKKTQKR 190
EE K E K+ E+ K DE + E KKK + +
Sbjct: 1518 AEEAKKADEAKKAEEAKKADEAKKAEEKKKADELK 1552
Score = 31.3 bits (70), Expect = 0.33
Identities = 17/80 (21%), Positives = 41/80 (51%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
EE+++ +E +++ EE ++ + ++ E +++ ++ E+ +K E + E E
Sbjct: 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358
Query: 171 DKIGDEKRLEENKKKKTQKR 190
+ +EK KKK+ K+
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKK 1378
Score = 30.9 bits (69), Expect = 0.44
Identities = 16/80 (20%), Positives = 40/80 (50%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E ++ +E ++ EE+++ ++ + + E + ++ ++ EE ++ D KK+ EE K+ E
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAE 1346
Query: 167 KRVEDKIGDEKRLEENKKKK 186
+ E ++K
Sbjct: 1347 AAKAEAEAAADEAEAAEEKA 1366
Score = 30.9 bits (69), Expect = 0.47
Identities = 22/90 (24%), Positives = 46/90 (51%), Gaps = 4/90 (4%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+A + E A++ +E +++ EE +++ + + E+ + E E DE + EE+
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+ E+K+ E K K+ + KKK +K+
Sbjct: 1366 AEAAEKKKEEAK----KKADAAKKKAEEKK 1391
Score = 30.5 bits (68), Expect = 0.60
Identities = 27/82 (32%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
A E+++ EE ++ EE++ EE + +E R+ E + E+E+K + EE K+ EE +
Sbjct: 1560 AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAK 1619
Query: 169 VEDKIGDEKRLEENKKKKTQKR 190
+ K + K+ EE KKK Q +
Sbjct: 1620 I--KAEELKKAEEEKKKVEQLK 1639
Score = 30.5 bits (68), Expect = 0.63
Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 1/81 (1%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+E + +E +++ EE ++ ++ + + E + + E+ + E K E E E
Sbjct: 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358
Query: 166 EKRVEDKI-GDEKRLEENKKK 185
+ E+K EK+ EE KKK
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKK 1379
Score = 30.5 bits (68), Expect = 0.67
Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 4/90 (4%)
Query: 105 RDETAREEEEEEEEEE----EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+ E AR+ EEE + EE E+ ++ E + EE + + E E+ER +E +K EE
Sbjct: 1201 KAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEA 1260
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+R +E R + KK +K+
Sbjct: 1261 RMAHFARRQAAIKAEEARKADELKKAEEKK 1290
Score = 30.5 bits (68), Expect = 0.70
Identities = 22/83 (26%), Positives = 44/83 (53%), Gaps = 1/83 (1%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPE-EDERGEDEEKKEEEEDKQGEEK 167
A E +++ +E ++ EE ++ DE + +E +K + E+++ DE KK EE K E+K
Sbjct: 1505 AAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKK 1564
Query: 168 RVEDKIGDEKRLEENKKKKTQKR 190
+ E+ E+ +K + +
Sbjct: 1565 KAEEAKKAEEDKNMALRKAEEAK 1587
Score = 30.5 bits (68), Expect = 0.70
Identities = 25/93 (26%), Positives = 54/93 (58%), Gaps = 1/93 (1%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E KA + A E++++ EE +++EEE+++ +L+++E +K + +E E +EE
Sbjct: 1726 EENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785
Query: 156 KEEEEDKQGE-EKRVEDKIGDEKRLEENKKKKT 187
EE+E ++ E +K+++D + + E K+
Sbjct: 1786 DEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGN 1818
Score = 30.1 bits (67), Expect = 0.71
Identities = 26/99 (26%), Positives = 46/99 (46%), Gaps = 8/99 (8%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
F A+++ +E EE + EE ++ + EE +E + ED R +E +K E+
Sbjct: 1079 FDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAED 1138
Query: 160 EDKQGEEKRVED--------KIGDEKRLEENKKKKTQKR 190
K E ++ ED K D ++ EE +K + K+
Sbjct: 1139 ARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKK 1177
Score = 29.0 bits (64), Expect = 1.7
Identities = 18/76 (23%), Positives = 39/76 (51%)
Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIG 174
EE + +E ++ E+ + +E + +E++ +E ++ +E KK +E K+ EE + +
Sbjct: 1275 EEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAA 1334
Query: 175 DEKRLEENKKKKTQKR 190
+K E K + K
Sbjct: 1335 KKKAEEAKKAAEAAKA 1350
Score = 28.6 bits (63), Expect = 2.6
Identities = 20/95 (21%), Positives = 46/95 (48%), Gaps = 8/95 (8%)
Query: 101 KAGARDETAREEEEEEEEEE----EEEEEQEQDEYLEE--DENEKERRLPEEDERGEDEE 154
+A + E AR+ +E ++ EE +E ++ E+ + +E + E+ ++ E ++ E+ +
Sbjct: 1269 QAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328
Query: 155 KKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
KK + K+ EE + + K E + +
Sbjct: 1329 KKADAAKKKAEEAKKAAEA--AKAEAEAAADEAEA 1361
Score = 28.6 bits (63), Expect = 2.6
Identities = 20/84 (23%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+ A+++ EE ++ +E +++ E+ + EE + + E ++ + + EE K+ +E K+ E
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE 1493
Query: 167 KRVEDKIGDEKRLEENKKKKTQKR 190
+ + K + K+ E KKK + +
Sbjct: 1494 E-AKKKADEAKKAAEAKKKADEAK 1516
Score = 28.2 bits (62), Expect = 3.4
Identities = 29/94 (30%), Positives = 47/94 (50%), Gaps = 6/94 (6%)
Query: 98 QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
+L K A ++ EE ++ EEE + + E+ + E E+ + +E + EE+++ KKE
Sbjct: 1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKE 1765
Query: 158 EEEDKQGEEKRVEDKIG------DEKRLEENKKK 185
EE+ + K E I DEKR E KK
Sbjct: 1766 EEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKK 1799
Score = 28.2 bits (62), Expect = 3.7
Identities = 22/89 (24%), Positives = 42/89 (47%), Gaps = 3/89 (3%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE--RRLPEEDERGEDEEKKEEEEDK 162
+ E ++ +E +++ EE ++ E + EE + + + ++ EE ++ + K E E
Sbjct: 1297 KAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAA 1356
Query: 163 QGEEKRVEDKIGDEKRLEENKKK-KTQKR 190
E E EK+ EE KKK K+
Sbjct: 1357 DEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385
Score = 28.2 bits (62), Expect = 3.8
Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 10/89 (11%)
Query: 106 DETAREEEEEEEEEE----EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE--- 158
DE + E +++ +E EE ++ ++ + EE + E + EE ++ ++ +K EE
Sbjct: 1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKK 1559
Query: 159 -EEDKQGEEKRV--EDKIGDEKRLEENKK 184
EE K+ EE + EDK ++ EE KK
Sbjct: 1560 AEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588
Score = 27.8 bits (61), Expect = 4.1
Identities = 25/86 (29%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE-KKEEEEDKQGEEK 167
A E ++ EE++ + E+ + EE E+ +L EE+++ + EE KK EE + EE
Sbjct: 1566 AEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEEL 1625
Query: 168 RVED----KIGDEKRLEENKKKKTQK 189
+ + K+ K+ E +KKK ++
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEE 1651
Score = 27.8 bits (61), Expect = 4.5
Identities = 20/86 (23%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
+ + ++ + + +E+ +E ++ + + +E +K E+ R +E KK+ E+ ++
Sbjct: 1071 GLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKA 1130
Query: 165 EEKRVEDKIGDEKRLEENKKKKTQKR 190
EE R K D ++ EE +K + KR
Sbjct: 1131 EEAR---KAEDARKAEEARKAEDAKR 1153
Score = 27.8 bits (61), Expect = 4.9
Identities = 27/96 (28%), Positives = 47/96 (48%), Gaps = 9/96 (9%)
Query: 104 ARDETAREEEEEEEEEEEEEEE---QEQDEYLEEDENEKERRLPEEDERGED---EEKKE 157
A D E ++ EE +++ EE E++ EE +E R+ R EE ++
Sbjct: 1220 AEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARK 1279
Query: 158 EEEDKQGEEKRVED---KIGDEKRLEENKKKKTQKR 190
+E K+ EEK+ D K ++K+ +E KKK + +
Sbjct: 1280 ADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAK 1315
Score = 27.4 bits (60), Expect = 6.9
Identities = 27/98 (27%), Positives = 48/98 (48%), Gaps = 11/98 (11%)
Query: 101 KAGARDETAREEEEEEEEEE----EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
+A + E AR+ EE + E+ EE + E + +E ++ R EE + ED +K
Sbjct: 1119 EAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKA 1178
Query: 157 EE----EEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
E EE ++ EE R K D ++ E +K + +++
Sbjct: 1179 EAARKAEEVRKAEELR---KAEDARKAEAARKAEEERK 1213
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 40.9 bits (96), Expect = 2e-04
Identities = 15/42 (35%), Positives = 27/42 (64%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
EEE+EEEEEEE+E+E E+ +++E E++ + ++
Sbjct: 388 RDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYED 429
Score = 34.7 bits (80), Expect = 0.021
Identities = 17/57 (29%), Positives = 33/57 (57%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
F A ++ E ++EEEE+EEEEEE+++DE ++ ++ E ++ E ++
Sbjct: 376 FSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432
Score = 34.3 bits (79), Expect = 0.034
Identities = 19/79 (24%), Positives = 37/79 (46%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
E +++ + +E EQ+ ++ +DE EDEE++EEE++ +G K
Sbjct: 354 ETLNKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHS 413
Query: 171 DKIGDEKRLEENKKKKTQK 189
D E+ E+K + +
Sbjct: 414 DDEEFEEDDVESKYEDSDG 432
Score = 34.0 bits (78), Expect = 0.047
Identities = 19/59 (32%), Positives = 35/59 (59%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
D + E E+ E ++EEE++++E EEDE+E + +DE E+++ + + ED G
Sbjct: 374 DAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432
Score = 32.0 bits (73), Expect = 0.17
Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 1/81 (1%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLE-EDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
ET +++ + +E E++ + LE ED N + EEDE E+EE ++E K+
Sbjct: 354 ETLNKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHS 413
Query: 166 EKRVEDKIGDEKRLEENKKKK 186
+ ++ E + E++
Sbjct: 414 DDEEFEEDDVESKYEDSDGNS 434
Score = 32.0 bits (73), Expect = 0.18
Identities = 17/45 (37%), Positives = 25/45 (55%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
E+ + +E EEEEEE+E+E +E DE EED+ E +
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESK 426
>gnl|CDD|162039 TIGR00790, fnt, formate/nitrite transporter. The Formate-Nitrite
Transporter (FNT) Family (TC 2.A.44)The prokaryotic
proteins of the FNT family probably function in the
transport of the structurally related compounds, formate
and nitrite. The homologous yeast protein may function
as a short chain aliphatic carboxylate H+
symporter,transporting formate, acetate and propionate,
and functioning primarily as an acetate uptake permease.
The putative formate efflux transporters (FocA) of
bacteria associated with pyruvate-formate lyase (pfl)
comprise cluster I; the putative formate uptake
permeases (FdhC) of bacteria and archaea associated with
formate dehydrogenase comprise cluster II; the putative
nitrite uptake permeases (NirC) of bacteria comprise
cluster III, and the single yeast protein, the putative
acetate:H+ symporter alone comprises cluster IV. The
energy coupling mechanisms for proteins of the FNT
family have not been extensively characterized. HCO2 -,
CH3CO2 - and NO2 - uptakes are probably coupled to
H+symport. HCO2 - efflux may be driven by the membrane
potential by a uniport mechanism or by H+ antiport
[Transport and binding proteins, Anions].
Length = 239
Score = 40.0 bits (94), Expect = 3e-04
Identities = 17/57 (29%), Positives = 20/57 (35%), Gaps = 14/57 (24%)
Query: 21 TLILVFCTATG-NILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLV 76
T + F N LVCLA+ W MSL+ DL M I V
Sbjct: 136 TFVEAFFLGILCNWLVCLAV-W------------MSLSGKDLAGKFFAMWFPIFAFV 179
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 40.4 bits (94), Expect = 3e-04
Identities = 24/86 (27%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE---EEDK 162
D A +E +EE + E++E Q++D E + +++ + + + ED+EK E EE+
Sbjct: 4012 DMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENV 4071
Query: 163 QGEEKRVEDKIGDEKRLEENKKKKTQ 188
Q E+ ED + ++ LE+ + + Q
Sbjct: 4072 QENEESTEDGVKSDEELEQGEVPEDQ 4097
Score = 38.8 bits (90), Expect = 0.001
Identities = 21/87 (24%), Positives = 40/87 (45%), Gaps = 5/87 (5%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEE-EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
EN+ +DE ++E+ EE +E+ +Q+ L ED+ + EE+ + +E
Sbjct: 4018 ENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEES 4077
Query: 155 KKE----EEEDKQGEEKRVEDKIGDEK 177
++ +EE +QGE + K
Sbjct: 4078 TEDGVKSDEELEQGEVPEDQAIDNHPK 4104
Score = 38.1 bits (88), Expect = 0.002
Identities = 23/84 (27%), Positives = 40/84 (47%), Gaps = 4/84 (4%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRL---PEEDERGEDEEKKEEEED 161
+D E E + EEE ++ ++E ++ E+ N +L P E++ E E+K E+
Sbjct: 3869 KDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSA 3928
Query: 162 KQGEEKRVEDKIGDEKRLEENKKK 185
E V K D K LE+ ++
Sbjct: 3929 ANNESDLV-SKEDDNKALEDKDRQ 3951
Score = 36.1 bits (83), Expect = 0.010
Identities = 17/84 (20%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED----ERGEDEEKKEEEEDKQG 164
+E + ++ + E+ + + DE EE + EK+ + +ED DE+ ++++
Sbjct: 3997 EKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLA 4056
Query: 165 EEKRVEDKIGDEKRLEENKKKKTQ 188
E+ ++ G E+ ++EN++
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTED 4080
Score = 35.7 bits (82), Expect = 0.012
Identities = 16/63 (25%), Positives = 31/63 (49%), Gaps = 3/63 (4%)
Query: 116 EEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE---EDKQGEEKRVEDK 172
E+++ EE E+D + D +E E R E D G ++ E D + E + ++++
Sbjct: 3832 NEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEE 3891
Query: 173 IGD 175
+ D
Sbjct: 3892 VND 3894
Score = 35.4 bits (81), Expect = 0.015
Identities = 18/76 (23%), Positives = 36/76 (47%), Gaps = 4/76 (5%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEE---DENEKERRLPEEDERGEDEEKKEEEE 160
+ +E +E EE+ E E++ EQ E E + + E+ +R +++E +EE
Sbjct: 3902 SLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDR-QEKEDEEEMS 3960
Query: 161 DKQGEEKRVEDKIGDE 176
D G + ++ I +
Sbjct: 3961 DDVGIDDEIQPDIQEN 3976
Score = 34.2 bits (78), Expect = 0.042
Identities = 17/80 (21%), Positives = 40/80 (50%), Gaps = 4/80 (5%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE----EKKEEEEDKQGE 165
++ +E+E+EEE ++ DE + + + PE ++ E ++KE + K +
Sbjct: 3948 KDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSD 4007
Query: 166 EKRVEDKIGDEKRLEENKKK 185
+ ++ + DE + E + +K
Sbjct: 4008 LEDMDMEAADENKEEADAEK 4027
Score = 31.1 bits (70), Expect = 0.34
Identities = 18/85 (21%), Positives = 36/85 (42%), Gaps = 5/85 (5%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
A D+ E+ EE +E EE E +E E E+ ED+ ++ K + +
Sbjct: 4056 AEDDEKMNEDGFEENVQENEESTEDGVKSDE---ELEQGEVPEDQAIDNHPKMDAKSTFA 4112
Query: 164 GEEKRVEDKIGDEKRLEENKKKKTQ 188
+++ D+ + EN++ +
Sbjct: 4113 --SAEADEENTDKGIVGENEELGEE 4135
Score = 31.1 bits (70), Expect = 0.37
Identities = 20/89 (22%), Positives = 42/89 (47%), Gaps = 11/89 (12%)
Query: 106 DETAREEEEEEEEEEEEEEEQE--QDEYLEEDENEKERR---------LPEEDERGEDEE 154
D A E+++ +E+E+EEE + D+ ++ D E + LPE+ + E E
Sbjct: 3941 DNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEG 4000
Query: 155 KKEEEEDKQGEEKRVEDKIGDEKRLEENK 183
++ D + + D+ +E E+++
Sbjct: 4001 DVSKDSDLEDMDMEAADENKEEADAEKDE 4029
Score = 28.8 bits (64), Expect = 2.4
Identities = 16/67 (23%), Positives = 30/67 (44%), Gaps = 2/67 (2%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEE-DENEKERRLPEEDERG-EDEEKKEEEEDKQG 164
E E+ E + +++ ++ LE+ D EKE D+ G +DE + + +E+
Sbjct: 3920 EQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQ 3979
Query: 165 EEKRVED 171
ED
Sbjct: 3980 PPPENED 3986
Score = 27.7 bits (61), Expect = 4.6
Identities = 18/93 (19%), Positives = 37/93 (39%), Gaps = 5/93 (5%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E+ L + DE ++++ + E++E+ ++ EE+ E E + + E+ E+
Sbjct: 4034 EDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDG---FEENVQENEESTEDGVKSDEELEQ 4090
Query: 156 KEEEEDK--QGEEKRVEDKIGDEKRLEENKKKK 186
E ED+ K +E K
Sbjct: 4091 GEVPEDQAIDNHPKMDAKSTFASAEADEENTDK 4123
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 39.6 bits (92), Expect = 4e-04
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 1/75 (1%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
EEEE EE + EE+E +E + E R L EE+E E E + E+ E
Sbjct: 172 PDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEE-EAEGSDDGEDVVDYEG 230
Query: 167 KRVEDKIGDEKRLEE 181
+R++ K G+E+ +EE
Sbjct: 231 ERIDKKQGEEEEMEE 245
Score = 38.1 bits (88), Expect = 0.001
Identities = 27/90 (30%), Positives = 47/90 (52%), Gaps = 4/90 (4%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDE--NEKERRLPEEDERGED--EEKKEE 158
+E EE + EEEE+EE D Y E + NE+E E + GED + + E
Sbjct: 173 DVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGER 232
Query: 159 EEDKQGEEKRVEDKIGDEKRLEENKKKKTQ 188
+ KQGEE+ +E+++ + +E ++ ++
Sbjct: 233 IDKKQGEEEEMEEEVINLFEIEWEEESPSE 262
Score = 28.8 bits (64), Expect = 1.5
Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 5/66 (7%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQD-----EYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
++ E E EEEEEE E D +Y E ++K+ E +E + + E EE
Sbjct: 198 SDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEE 257
Query: 161 DKQGEE 166
+ EE
Sbjct: 258 ESPSEE 263
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 39.9 bits (94), Expect = 4e-04
Identities = 18/60 (30%), Positives = 39/60 (65%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
+ EEE++E++++ ++++E EE++ +KE EE+E E+E+++EEE+ K+
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 38.7 bits (91), Expect = 0.001
Identities = 19/60 (31%), Positives = 42/60 (70%), Gaps = 1/60 (1%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK 167
+ E++ EEE++E++++ +E+E E+E+ +E+E+ E+EE+ EEE++++ E+K
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE-KKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 38.4 bits (90), Expect = 0.001
Identities = 22/73 (30%), Positives = 44/73 (60%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
F G++ T + ++ E+ E++ EEE+++ + +KE EE E+ E+E+++EEE
Sbjct: 401 FLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460
Query: 160 EDKQGEEKRVEDK 172
E ++ +E+ E K
Sbjct: 461 EAEEEKEEEEEKK 473
Score = 38.0 bits (89), Expect = 0.002
Identities = 23/82 (28%), Positives = 47/82 (57%), Gaps = 7/82 (8%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK 167
T ++ ++ ++ E+ E++++E +E + + +E+E E++EKKEEE++++ EE
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Query: 168 RVEDKIGDEKRLEENKKKKTQK 189
E + EE +KKK Q
Sbjct: 463 EEEKE-------EEEEKKKKQA 477
Score = 34.9 bits (81), Expect = 0.016
Identities = 23/81 (28%), Positives = 44/81 (54%), Gaps = 9/81 (11%)
Query: 83 SGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
S + K I+ + + + + E ++ +++EEEEEEE+E+ E E +
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE---------EEK 455
Query: 143 LPEEDERGEDEEKKEEEEDKQ 163
EE+E E++E++EE++ KQ
Sbjct: 456 EEEEEEAEEEKEEEEEKKKKQ 476
Score = 34.9 bits (81), Expect = 0.020
Identities = 15/77 (19%), Positives = 44/77 (57%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
EEE E ++ + ++ EK+R +++++ + K++EE+++ E+++ E++
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 173 IGDEKRLEENKKKKTQK 189
+E+ E +K++ ++
Sbjct: 455 KEEEEEEAEEEKEEEEE 471
Score = 33.0 bits (76), Expect = 0.087
Identities = 13/80 (16%), Positives = 46/80 (57%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
EEE E ++ ++ + + ++ +E + ++ + ++K+EEEE+++ +++ +
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Query: 171 DKIGDEKRLEENKKKKTQKR 190
++ +E E+ ++++ +K+
Sbjct: 456 EEEEEEAEEEKEEEEEKKKK 475
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 39.4 bits (92), Expect = 5e-04
Identities = 20/93 (21%), Positives = 44/93 (47%), Gaps = 9/93 (9%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQ-EQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
+ A+ + +++E+++ EE +++Q + E L++ E E+ E+ ++ E+ K+
Sbjct: 71 QKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQ-EQKKQAEEAAKQAAL 129
Query: 160 EDKQGEEKRVEDKI-------GDEKRLEENKKK 185
+ KQ EE + + KR KK
Sbjct: 130 KQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKK 162
Score = 32.9 bits (75), Expect = 0.081
Identities = 15/97 (15%), Positives = 38/97 (39%), Gaps = 7/97 (7%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E + K + E ++++ E+E ++ E++ ++ + E+ + ++ +E
Sbjct: 79 EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAA 138
Query: 156 KEEEEDKQ----GEEKRVEDKIGDEKRLEENKKKKTQ 188
+ + E KR K+ KKK +
Sbjct: 139 AKAAAAAKAKAEAEAKRAAAA---AKKAAAEAKKKAE 172
Score = 27.8 bits (62), Expect = 4.2
Identities = 14/77 (18%), Positives = 30/77 (38%), Gaps = 2/77 (2%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE--KR 168
+E++++ EE ++ ++ E + + E E K+ K+ K+
Sbjct: 110 RLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK 169
Query: 169 VEDKIGDEKRLEENKKK 185
+ +K E KKK
Sbjct: 170 KAEAEAAKKAAAEAKKK 186
Score = 27.1 bits (60), Expect = 7.0
Identities = 19/103 (18%), Positives = 41/103 (39%), Gaps = 5/103 (4%)
Query: 89 LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE---RRLPE 145
L++ E + K ++ A +E++++ EE ++ +Q + E +
Sbjct: 92 LQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEA 151
Query: 146 EDERGEDEEKK-EEEEDKQGEEKRVEDKIGD-EKRLEENKKKK 186
E +R KK E K+ E + + + +K+ E K
Sbjct: 152 EAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAK 194
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 39.5 bits (92), Expect = 5e-04
Identities = 20/68 (29%), Positives = 37/68 (54%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
E E+EE+ ++++ E EQ+E E + +E P+E E + E + EEE Q ++
Sbjct: 206 DTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDD 265
Query: 169 VEDKIGDE 176
+ D+ D+
Sbjct: 266 LPDESDDD 273
Score = 32.6 bits (74), Expect = 0.11
Identities = 18/61 (29%), Positives = 28/61 (45%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
D+ E E+EE+ E E E QE E + ++E EE+ D++ +E D E
Sbjct: 216 DDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDSE 275
Query: 166 E 166
Sbjct: 276 T 276
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 39.6 bits (92), Expect = 6e-04
Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 10/77 (12%)
Query: 102 AGARDETAREEEEEEE----------EEEEEEEEQEQDEYLEEDENEKERRLPEEDERGE 151
G+ DE+ EEE E E +E+EE E E L EDE+E + E+ E +
Sbjct: 925 VGSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWD 984
Query: 152 DEEKKEEEEDKQGEEKR 168
+ E K + + G+ ++
Sbjct: 985 ELESKAAYDSRPGKRRK 1001
Score = 33.1 bits (75), Expect = 0.081
Identities = 21/77 (27%), Positives = 33/77 (42%), Gaps = 6/77 (7%)
Query: 98 QLFKAGARDETAREEEEEEEEEEEEEEEQE------QDEYLEEDENEKERRLPEEDERGE 151
F+ G ++E +E EEE E E DE E++E+++ EDE
Sbjct: 913 SFFEDGGWSFLMVGSDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESEN 972
Query: 152 DEEKKEEEEDKQGEEKR 168
D +E+ ED E +
Sbjct: 973 DSSDEEDGEDWDELESK 989
Score = 32.7 bits (74), Expect = 0.10
Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 9/65 (13%)
Query: 117 EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
++E +E E+E EY ++E DE EDEE E ED +E E+ DE
Sbjct: 927 SDDESDESEEEVSEYEASSDDES-------DETDEDEESDESSEDLSEDES--ENDSSDE 977
Query: 177 KRLEE 181
+ E+
Sbjct: 978 EDGED 982
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 39.4 bits (92), Expect = 7e-04
Identities = 21/75 (28%), Positives = 40/75 (53%)
Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIG 174
EEE+ + E+E +E LE +++ EDE++ EE++++ + E
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60
Query: 175 DEKRLEENKKKKTQK 189
++ RL+E KK+K Q+
Sbjct: 61 EKARLKELKKQKKQE 75
Score = 28.2 bits (63), Expect = 3.0
Identities = 18/89 (20%), Positives = 37/89 (41%), Gaps = 3/89 (3%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E + A +E EE E + + ++ +EDE+E++ EE E E
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEED---DEEAESPAKAEI 57
Query: 156 KEEEEDKQGEEKRVEDKIGDEKRLEENKK 184
+ E+ + E K+ + + + ++N
Sbjct: 58 SKREKARLKELKKQKKQEIQKILEQQNAA 86
Score = 27.1 bits (60), Expect = 7.6
Identities = 10/50 (20%), Positives = 25/50 (50%)
Query: 97 NQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
+L++ R ++ + ++ + E +E + L +E E++ +L EE
Sbjct: 773 TELYEKEVRYLMQAHQKGQLKDTIDVAEPEEPGDPLTAEEQEEKEQLLEE 822
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 38.6 bits (90), Expect = 8e-04
Identities = 22/81 (27%), Positives = 41/81 (50%)
Query: 92 NILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGE 151
+ + A DE ++ +++E E EE ++E+ E ++ E + +EDE E
Sbjct: 33 QLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE 92
Query: 152 DEEKKEEEEDKQGEEKRVEDK 172
+E ++E+EE EK E+K
Sbjct: 93 EENEEEDEESSDENEKETEEK 113
Score = 36.6 bits (85), Expect = 0.004
Identities = 23/74 (31%), Positives = 42/74 (56%), Gaps = 2/74 (2%)
Query: 101 KAGARDETAREEEEEEEEEE--EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
K ETA EE +EEE+E E+++++ + +EDE +E E++E ++ EK+ E
Sbjct: 52 KKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETE 111
Query: 159 EEDKQGEEKRVEDK 172
E+ + EK + +
Sbjct: 112 EKTESNVEKEITNP 125
Score = 32.4 bits (74), Expect = 0.087
Identities = 26/84 (30%), Positives = 45/84 (53%), Gaps = 4/84 (4%)
Query: 98 QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
QLF + + A +E+E ++ +++E + + E E EKE E+ E D EK++
Sbjct: 33 QLFFPSSPSDQAAADEQEAKKSDDQETAEIE----EVKEEEKEAANSEDKEDKGDAEKED 88
Query: 158 EEEDKQGEEKRVEDKIGDEKRLEE 181
EE +++ EE+ E +EK EE
Sbjct: 89 EESEEENEEEDEESSDENEKETEE 112
Score = 29.3 bits (66), Expect = 0.93
Identities = 20/73 (27%), Positives = 40/73 (54%), Gaps = 1/73 (1%)
Query: 118 EEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE-EDKQGEEKRVEDKIGDE 176
++ +EQE + +++ E E EE E E+K+++ +K+ EE E++ DE
Sbjct: 41 SDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE 100
Query: 177 KRLEENKKKKTQK 189
+ +EN+K+ +K
Sbjct: 101 ESSDENEKETEEK 113
Score = 29.3 bits (66), Expect = 0.95
Identities = 17/57 (29%), Positives = 31/57 (54%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
E A E++E++ + E+E+E+ ++E EEDE + E +E+ E +KE
Sbjct: 70 KEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 38.6 bits (90), Expect = 9e-04
Identities = 19/77 (24%), Positives = 41/77 (53%)
Query: 85 AVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
+V+ + K + L K + EEEE+ +E+ E++ ++ E+ + E E+
Sbjct: 118 SVMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEE 177
Query: 145 EEDERGEDEEKKEEEED 161
EE+E E++E ++++D
Sbjct: 178 EEEEEEEEDEDFDDDDD 194
Score = 33.6 bits (77), Expect = 0.040
Identities = 14/30 (46%), Positives = 26/30 (86%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
+EE+E++EEEEEEE+E+DE ++D+++ +
Sbjct: 168 VDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
Score = 33.2 bits (76), Expect = 0.045
Identities = 18/49 (36%), Positives = 33/49 (67%)
Query: 89 LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDEN 137
+K + E +L + A D +E++EEEEEEEEEE+++ D+ ++D++
Sbjct: 150 DEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198
Score = 32.8 bits (75), Expect = 0.065
Identities = 19/68 (27%), Positives = 37/68 (54%), Gaps = 6/68 (8%)
Query: 99 LFKAGARDETAR-EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
LF D + E++ +E E E+ E+DE EE+E E+E E+++ +D++ +
Sbjct: 141 LFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEE---EEDEDFDDDDD--D 195
Query: 158 EEEDKQGE 165
+++D E
Sbjct: 196 DDDDYNAE 203
Score = 30.9 bits (70), Expect = 0.34
Identities = 14/57 (24%), Positives = 28/57 (49%), Gaps = 3/57 (5%)
Query: 129 DEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK---QGEEKRVEDKIGDEKRLEEN 182
EED +EK L ++ + E E+ EE+E + EE+ ++ D+ +++
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198
>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
protein; Provisional.
Length = 335
Score = 38.6 bits (90), Expect = 9e-04
Identities = 19/69 (27%), Positives = 38/69 (55%), Gaps = 3/69 (4%)
Query: 4 EVSFGLDAASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMV 63
+++ G D S + VV + F GNI+V ++ K ++ + +L++LA++DL +
Sbjct: 31 DLNIGYDTNSTIL-IVVYSTIFFFGLVGNIIVIYVLTKTK-IKTPMDIYLLNLAVSDL-L 87
Query: 64 AVLVMPLGI 72
V+ +P I
Sbjct: 88 FVMTLPFQI 96
>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 486
Score = 38.7 bits (90), Expect = 0.001
Identities = 30/114 (26%), Positives = 60/114 (52%), Gaps = 9/114 (7%)
Query: 85 AVLRLK--KNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
A+L+L K+I+ ++L K G++ ET + E++EEE+EE + ++ + + ++E +
Sbjct: 347 AILKLLYVKDIVPISELLKEGSKQETKEQPEKKEEEKEELDIDKIILQ-IIKNEGKIISA 405
Query: 143 LPEEDERGEDEEK------KEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+ + E E+E K K EE+ E K ++ + E K+K+ +K
Sbjct: 406 ILKNAEIKEEEGKITIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKE 459
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 38.5 bits (90), Expect = 0.001
Identities = 25/97 (25%), Positives = 51/97 (52%), Gaps = 3/97 (3%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQD---EYLEEDENEKERRLPEEDERGED 152
E + K A +++++EEEE E EE+++ D E+++ + +++ EDE +D
Sbjct: 80 EEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKD 139
Query: 153 EEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
E K+ +ED E +++ E+ E +K+K +
Sbjct: 140 EAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASE 176
Score = 29.2 bits (66), Expect = 1.3
Identities = 21/75 (28%), Positives = 38/75 (50%), Gaps = 5/75 (6%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
E+ +EEE +++E EQ + ++DE E+ E E ED + + E D + +++
Sbjct: 76 EKWKEEERKKKEAEQGLESDDDDDEEEEW-----EVEEDEDSDDEGEWIDVESDKEIESS 130
Query: 172 KIGDEKRLEENKKKK 186
DE+ +E KK
Sbjct: 131 DSEDEEEKDEAAKKA 145
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 38.4 bits (89), Expect = 0.001
Identities = 30/109 (27%), Positives = 42/109 (38%), Gaps = 13/109 (11%)
Query: 87 LRLKKNILCENQLFK-AGARDETAREEEEEEEEEEE---------EEEEQEQDEYLEEDE 136
LR N+ L K G E+ EE E E++ D+ L ++
Sbjct: 313 LRFLDNLDFLQFLSKYIGCLWRNLNNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEK 372
Query: 137 NEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKK 185
E E R PEE E EE+ E+ ED+ KR+ D E +K
Sbjct: 373 MEIENRNPEESE---HEEEVEDYEDENDHSKRICDDDELENHFRAADEK 418
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 38.2 bits (89), Expect = 0.001
Identities = 20/87 (22%), Positives = 36/87 (41%), Gaps = 6/87 (6%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E E++EEEEEEEE+E E E++RR ++++ +DEE + +
Sbjct: 174 EALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGS 233
Query: 167 KR------VEDKIGDEKRLEENKKKKT 187
+ K ++ + K
Sbjct: 234 SSPAKPTSILKKSAAKRSEAPSSSKAK 260
Score = 27.4 bits (61), Expect = 4.3
Identities = 28/108 (25%), Positives = 40/108 (37%), Gaps = 32/108 (29%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQ---------------------DEYLEEDENEKER 141
R+ A + +EE+EE E+E E+E E LEE + + R
Sbjct: 104 ATRNYEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSR 163
Query: 142 R--------LPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
R L R + EE++EEEED E G E +
Sbjct: 164 RADVDVNSMLEALFRREKKEEEEEEEED---EALIKSLSFGPETEEDR 208
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 38.0 bits (89), Expect = 0.002
Identities = 19/90 (21%), Positives = 38/90 (42%), Gaps = 4/90 (4%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+A R + E E+ E + + EE+E ++ RR + ++R + E+ E
Sbjct: 135 EAARRGAARKAGEGGEQPATEARADAAERT--EEEERDERRRRGDREDRQAEAERGERGR 192
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+ E R D R E+ +++ + R
Sbjct: 193 --REERGRDGDDRDRRDRREQGDRREERGR 220
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 36.7 bits (85), Expect = 0.002
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
AG T +EEEEEEEE E++Q+ DE
Sbjct: 85 AGTTSGTGHTRQEEEEEEEENEKQQQSDE 113
Score = 26.3 bits (58), Expect = 7.6
Identities = 10/26 (38%), Positives = 16/26 (61%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQ 128
G EEEEEEE E++++ ++ Q
Sbjct: 90 GTGHTRQEEEEEEEENEKQQQSDEAQ 115
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 37.4 bits (87), Expect = 0.003
Identities = 17/85 (20%), Positives = 35/85 (41%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
E +E + E++ +E E +E+ E + + E D++ +EE +D+
Sbjct: 200 QEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAW 259
Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
E + K + +K K Q+
Sbjct: 260 EGFESEYEPINKPVRPKRKTKAQRN 284
Score = 32.0 bits (73), Expect = 0.15
Identities = 17/85 (20%), Positives = 37/85 (43%), Gaps = 8/85 (9%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER------GEDEEKKEEEEDKQGE 165
E++ +E E EE++ E+ +E L EE + ++ + E + +
Sbjct: 210 EKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPI 269
Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
K V K + + + NK+K+ ++
Sbjct: 270 NKPVRPKR--KTKAQRNKEKRRKEL 292
Score = 28.1 bits (63), Expect = 3.4
Identities = 21/85 (24%), Positives = 39/85 (45%), Gaps = 9/85 (10%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEY--------LEEDENEKERRLPEEDERGED 152
KA E R+E E E +EE++ +++ + + E + R+ + ERGE
Sbjct: 280 KAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEK 339
Query: 153 EEKKEEEEDK-QGEEKRVEDKIGDE 176
++ K + K + E +E + DE
Sbjct: 340 KKLKRRKLGKHKYPEPPLEVVLPDE 364
>gnl|CDD|222193 pfam13520, AA_permease_2, Amino acid permease.
Length = 425
Score = 37.3 bits (87), Expect = 0.003
Identities = 9/62 (14%), Positives = 26/62 (41%), Gaps = 10/62 (16%)
Query: 14 WNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGIL 73
W + + IL+ ++ I ++QN+ ++ L + +++ + LG++
Sbjct: 116 WLTYGIAIAILIIFA----LINIRGIKESAKIQNIL--GIVKLLLPLILIIL----LGLV 165
Query: 74 TL 75
Sbjct: 166 LA 167
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 37.2 bits (86), Expect = 0.003
Identities = 26/106 (24%), Positives = 55/106 (51%), Gaps = 1/106 (0%)
Query: 85 AVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
AV R++K A++ +EEE+E+E+ +EE+++ + + E K+R+
Sbjct: 76 AVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKK-EKPKEEPKDRKPK 134
Query: 145 EEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
EE + ++KE+E++K+ EE R ++ +R+ + K +
Sbjct: 135 EEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180
Score = 35.6 bits (82), Expect = 0.010
Identities = 21/91 (23%), Positives = 49/91 (53%), Gaps = 3/91 (3%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED---ERGEDEEKKEEE 159
G +E +E+ +EE+++++E+ ++E + ++E +++R E++ E+ +E + EE
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREE 162
Query: 160 EDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
E K+ + K+ NKKK+ +
Sbjct: 163 EKKRERVRAKSRPKKPPKKKPPNKKKEPPEE 193
Score = 32.6 bits (74), Expect = 0.10
Identities = 23/90 (25%), Positives = 46/90 (51%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
K ++E +++E+ +EE ++ + ++E E E EKE+ E+ R +EEKK E
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV 169
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+ K+ K K+ E +++K ++
Sbjct: 170 RAKSRPKKPPKKKPPNKKKEPPEEEKQRQA 199
Score = 29.9 bits (67), Expect = 0.76
Identities = 21/86 (24%), Positives = 44/86 (51%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
K A+++ +E+E+E+E++ EE ++E E K R ++ +++K+ EE
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEE 193
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKK 186
+KQ + R K E+ +++K
Sbjct: 194 EKQRQAAREAVKGKPEEPDVNEEREK 219
Score = 29.1 bits (65), Expect = 1.4
Identities = 19/86 (22%), Positives = 44/86 (51%), Gaps = 1/86 (1%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK-KEEEEDKQG 164
DE + E+ + + + ++ E + E++ + ++E+ + +EK KEE +D++
Sbjct: 74 DEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKP 133
Query: 165 EEKRVEDKIGDEKRLEENKKKKTQKR 190
+E+ E + EK E+ KK + +
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRD 159
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 37.4 bits (86), Expect = 0.003
Identities = 17/48 (35%), Positives = 27/48 (56%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
A ++ REE E+ + E E++ +E+ E +E E E+ER E ER
Sbjct: 573 ASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAER 620
Score = 35.0 bits (80), Expect = 0.019
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 9/67 (13%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
A++ EE E+ + E E++ + EE E EKE+ E ER E E ++E E +
Sbjct: 578 AKKREEAVEKAKREAEQKAR----EEREREKEK----EKER-EREREREAERAAKASSSS 628
Query: 169 VEDKIGD 175
E ++ +
Sbjct: 629 HESRMSE 635
Score = 33.9 bits (77), Expect = 0.050
Identities = 19/66 (28%), Positives = 32/66 (48%)
Query: 80 TNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEK 139
T L+ + L K+ E +A + RE E+E+E+E E E E+E + + +
Sbjct: 570 TPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSH 629
Query: 140 ERRLPE 145
E R+ E
Sbjct: 630 ESRMSE 635
Score = 32.0 bits (72), Expect = 0.21
Identities = 14/48 (29%), Positives = 24/48 (50%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGE 151
A+ E ++ EE E E+E+E+E+E++ E + K E E
Sbjct: 588 AKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHESRMSE 635
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 36.3 bits (84), Expect = 0.003
Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 3/82 (3%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
+ +EEE+E +E + +E+EQ EE E EKE PE R + E++E+EE Q
Sbjct: 34 KKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPE---REDIAEQQEKEEIAQE 90
Query: 165 EEKRVEDKIGDEKRLEENKKKK 186
EEK E + ++ + K+KK
Sbjct: 91 EEKEEEAEDVKQQEVFSFKRKK 112
Score = 28.3 bits (63), Expect = 2.0
Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQD--EYLEEDENEKERRLPEEDERGEDE 153
EN +++E + E EE E+E+E E + +D E E++E +E EE E + +
Sbjct: 44 ENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQ 103
Query: 154 EKKEEEEDKQGEEKRVEDKI 173
E + K +E +E+KI
Sbjct: 104 EVFSFKRKKPFKEMNLEEKI 123
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 36.9 bits (86), Expect = 0.004
Identities = 14/72 (19%), Positives = 33/72 (45%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
+ E+E+ E EEE+ + + L+E+ ++ + + +E ++ E K + G + E
Sbjct: 16 RKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEE 75
Query: 171 DKIGDEKRLEEN 182
++ E
Sbjct: 76 EEAKAEAAEFRA 87
Score = 29.2 bits (66), Expect = 1.3
Identities = 23/85 (27%), Positives = 39/85 (45%), Gaps = 1/85 (1%)
Query: 99 LFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
L + E E+ E + +EE ++ + + LEE +E E + E G EE++EE
Sbjct: 18 LTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEE 77
Query: 159 EEDKQGEEKRVEDKIGDEKRLEENK 183
+ + E + GD+ EE K
Sbjct: 78 AKAEAAEFRAYLRG-GDDALAEERK 101
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 37.0 bits (85), Expect = 0.004
Identities = 16/71 (22%), Positives = 32/71 (45%), Gaps = 5/71 (7%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+ T ++++ +E + EE +E +D E PEE+ER + E D + E
Sbjct: 407 ESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEER-----QLREFRDMEKE 461
Query: 166 EKRVEDKIGDE 176
++ D+ +
Sbjct: 462 DREFPDEAELQ 472
Score = 29.3 bits (65), Expect = 1.3
Identities = 16/72 (22%), Positives = 29/72 (40%), Gaps = 1/72 (1%)
Query: 120 EEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKR- 178
E+EE+E + +E+ + E +E +EE E E +E+R
Sbjct: 392 WAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQ 451
Query: 179 LEENKKKKTQKR 190
L E + + + R
Sbjct: 452 LREFRDMEKEDR 463
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 36.9 bits (86), Expect = 0.004
Identities = 24/73 (32%), Positives = 34/73 (46%), Gaps = 11/73 (15%)
Query: 106 DETAREEEEEEEEEEEEE-------EEQEQDEYLEED----ENEKERRLPEEDERGEDEE 154
ET+ EE EEE + E E +QEQ+ EED EKE ED +
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286
Query: 155 KKEEEEDKQGEEK 167
K++++E+ EK
Sbjct: 287 KEKKDEELFWFEK 299
Score = 34.6 bits (80), Expect = 0.025
Identities = 15/55 (27%), Positives = 26/55 (47%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
D E + ++E+E E+D L +E E + + D+ +EKK+EE
Sbjct: 239 TDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDEE 293
Score = 33.4 bits (77), Expect = 0.058
Identities = 24/73 (32%), Positives = 33/73 (45%), Gaps = 11/73 (15%)
Query: 116 EEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ--GEEKRVEDKI 173
+E E EE E+E D +E E + + E++ EED EEK DK
Sbjct: 227 KETSETEEREEETDVEIETT---------SETKGTKQEQEGSTEEDPSLFSEEKEDPDKT 277
Query: 174 GDEKRLEENKKKK 186
D +LE K+KK
Sbjct: 278 EDLDKLEILKEKK 290
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 36.7 bits (85), Expect = 0.004
Identities = 21/81 (25%), Positives = 48/81 (59%), Gaps = 5/81 (6%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+ ++ +E +E ++ Q+ + L E ++E + E + + ++E+ K+EE +K+ +
Sbjct: 359 NTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDAS-EEAEAKAKEEKLKQEENEKKQK 417
Query: 166 EKRVEDKIGDEKRLEENKKKK 186
E+ EDK EKR +++++KK
Sbjct: 418 EQADEDK---EKR-QKDERKK 434
Score = 30.9 bits (70), Expect = 0.42
Identities = 13/52 (25%), Positives = 31/52 (59%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
+ ++E + EE E + ++E+ + E ++ +KE+ ++++R +DE KK
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 36.2 bits (84), Expect = 0.006
Identities = 19/43 (44%), Positives = 26/43 (60%)
Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
E E EEEEEE +E +E LEED +++ R +E E EE +
Sbjct: 146 ELELPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELR 188
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 36.2 bits (84), Expect = 0.007
Identities = 23/80 (28%), Positives = 43/80 (53%), Gaps = 2/80 (2%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+ R E+E E+E +E QEQ L+E E+ + E+ G+ EE +EE E+ +
Sbjct: 820 KLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI--ENLNGKKEELEEELEELEAAL 877
Query: 167 KRVEDKIGDEKRLEENKKKK 186
+ +E ++GD K+ + + +
Sbjct: 878 RDLESRLGDLKKERDELEAQ 897
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 35.7 bits (83), Expect = 0.009
Identities = 9/90 (10%), Positives = 45/90 (50%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+ + +++++++++++ + D+ ++D++E + ++D+ +++E+K+E +
Sbjct: 117 DLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAK 176
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+ + + ++ + +K K
Sbjct: 177 ELEKLSDDDDFVWDEDDSEALRQARKDAKL 206
Score = 31.1 bits (71), Expect = 0.36
Identities = 11/77 (14%), Positives = 36/77 (46%), Gaps = 1/77 (1%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
NQ D+ ++++ ++++++E++ E D+ ++ ++E E + ++ ++
Sbjct: 127 LNQADDDDDDDDDDDLDDDDIDDDDDDEDDDE-DDDDDDVDDEDEEKKEAKELEKLSDDD 185
Query: 156 KEEEEDKQGEEKRVEDK 172
++ E R K
Sbjct: 186 DFVWDEDDSEALRQARK 202
Score = 30.3 bits (69), Expect = 0.60
Identities = 11/77 (14%), Positives = 39/77 (50%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
K +DE ++ E++ +++++ + ++ + ++D+ +D++ ++++
Sbjct: 93 KKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDD 152
Query: 161 DKQGEEKRVEDKIGDEK 177
D+ +E +D + DE
Sbjct: 153 DEDDDEDDDDDDVDDED 169
Score = 28.0 bits (63), Expect = 3.1
Identities = 12/81 (14%), Positives = 43/81 (53%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
A +D+ ++ + + ++++ + D+ +D++ + E+D+ +D++ ++E+
Sbjct: 110 NALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDED 169
Query: 161 DKQGEEKRVEDKIGDEKRLEE 181
+++ E K +E D+ + +
Sbjct: 170 EEKKEAKELEKLSDDDDFVWD 190
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 34.7 bits (80), Expect = 0.009
Identities = 18/57 (31%), Positives = 32/57 (56%), Gaps = 3/57 (5%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
++ +EEEE EEE +E EQE + + EK++R + + E++K + E K+
Sbjct: 93 SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKT---EKEKPKTEPKKP 146
Score = 31.6 bits (72), Expect = 0.13
Identities = 23/85 (27%), Positives = 36/85 (42%), Gaps = 23/85 (27%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
R+ ++ ++EEEE EEE +E +++D E K E ++K+
Sbjct: 89 RELSSSDDEEEETEEESTDETEQEDP----------------------PETKTESKEKKK 126
Query: 165 EEKRVEDKIGDEKRLEENKKKKTQK 189
E + K EK E KK K K
Sbjct: 127 REVP-KPKTEKEKPKTEPKKPKPSK 150
Score = 26.3 bits (58), Expect = 7.7
Identities = 10/47 (21%), Positives = 21/47 (44%)
Query: 144 PEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+DE E EE+ +E +++ + + +KR K + +K
Sbjct: 93 SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKP 139
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 35.9 bits (83), Expect = 0.010
Identities = 26/54 (48%), Positives = 35/54 (64%), Gaps = 5/54 (9%)
Query: 136 ENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
E E E+ +P+E+E E EEKKEEEE +E+ V DE+ +E KKKKT+K
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEV-----DEEEEKEEKKKKTKK 76
Score = 34.7 bits (80), Expect = 0.022
Identities = 19/44 (43%), Positives = 28/44 (63%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
E+E E+E +EEE+E+ E +E+E + + E DE E EEKK
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKK 71
Score = 34.4 bits (79), Expect = 0.029
Identities = 18/50 (36%), Positives = 31/50 (62%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+EEEEEE+EE++EEE++ + EE + E+E+ ++ + E E E
Sbjct: 37 DEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86
Score = 29.7 bits (67), Expect = 1.0
Identities = 16/51 (31%), Positives = 31/51 (60%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
+E +EE++EEEE+ ++E+E DE E++E +K+ + +E + K
Sbjct: 39 EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 35.4 bits (82), Expect = 0.011
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRL 143
R E+++EEEE+E+EEE+E++E EE E E+ L
Sbjct: 338 RCTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPPL 376
Score = 34.7 bits (80), Expect = 0.024
Identities = 27/86 (31%), Positives = 37/86 (43%), Gaps = 32/86 (37%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDE--------------------------------NEKE 140
EEEEEEEEEEE E+DE E + N ++
Sbjct: 287 LEEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQ 346
Query: 141 RRLPEEDERGEDEEKKEEEEDKQGEE 166
+ EE E E+EE++EE E+ + EE
Sbjct: 347 KDEEEEQEDEEEEEEEEEPEEPEPEE 372
Score = 34.7 bits (80), Expect = 0.024
Identities = 22/44 (50%), Positives = 25/44 (56%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERG 150
E E EEEEEEEE E E+DE EE+E E E EE+ G
Sbjct: 164 EVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTG 207
Score = 31.2 bits (71), Expect = 0.29
Identities = 26/90 (28%), Positives = 39/90 (43%), Gaps = 26/90 (28%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENE-------------------------KERRLPEE 146
EEEEEEEEEEE E+DE E + E + + E
Sbjct: 287 LEEEEEEEEEEEPA-ERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPE 345
Query: 147 DERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
+ E+E++ EEEE+++ E + E + G
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPEPEEGPP 375
Score = 28.5 bits (64), Expect = 2.5
Identities = 15/35 (42%), Positives = 18/35 (51%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
+E EEEEEEEEE EE E +E L +
Sbjct: 351 EEQEDEEEEEEEEEPEEPEPEEGPPLLTPISEDAP 385
Score = 26.9 bits (60), Expect = 6.6
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEE 125
K ++ EEEEEEEE EE E E
Sbjct: 347 KDEEEEQEDEEEEEEEEEPEEPEPE 371
Score = 26.9 bits (60), Expect = 7.4
Identities = 17/56 (30%), Positives = 29/56 (51%)
Query: 84 GAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEK 139
G +L +K+ + G +E E E +EE+E EEEE+E+ E + ++E
Sbjct: 151 GKILGTQKDYYVAEVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGT 206
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 34.7 bits (80), Expect = 0.012
Identities = 25/66 (37%), Positives = 42/66 (63%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E F+ G DE +E++EEEE++E + ++++ +ED++E L E+DE EDE+
Sbjct: 106 EVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDEDD 165
Query: 156 KEEEED 161
E+EED
Sbjct: 166 DEDEED 171
Score = 28.2 bits (63), Expect = 2.0
Identities = 21/57 (36%), Positives = 34/57 (59%), Gaps = 2/57 (3%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
DE EEE++E + ++E + E ++ E E E +EDE +D+E +E+EEDK
Sbjct: 121 DEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDE--DDDEDEEDEEDK 175
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 35.5 bits (83), Expect = 0.012
Identities = 19/87 (21%), Positives = 44/87 (50%)
Query: 99 LFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
F +E E EE +++E+E+E+++ ++ E LPE+ K+
Sbjct: 172 GFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQY 231
Query: 159 EEDKQGEEKRVEDKIGDEKRLEENKKK 185
++ ++ +EK+VE ++ K+ + ++K
Sbjct: 232 KKLRKAQEKKVEGRLAQHKKYAKLREK 258
Score = 30.9 bits (71), Expect = 0.36
Identities = 17/78 (21%), Positives = 37/78 (47%), Gaps = 5/78 (6%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
+ EE+ + E L++DE+E+E E++ +D +E E + ++ +
Sbjct: 173 FVDPNAEEDPAHVGSELEELDDDEDEEE-----EEDENDDSLAADESELPEKVLEKFKAL 227
Query: 173 IGDEKRLEENKKKKTQKR 190
K+L + ++KK + R
Sbjct: 228 AKQYKKLRKAQEKKVEGR 245
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 35.3 bits (81), Expect = 0.013
Identities = 12/69 (17%), Positives = 21/69 (30%), Gaps = 3/69 (4%)
Query: 102 AGARDETAR---EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
A +ET+ E E+ + + + D E+ L E + E E+
Sbjct: 360 TPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEK 419
Query: 159 EEDKQGEEK 167
K
Sbjct: 420 AAPIPDPAK 428
Score = 31.5 bits (71), Expect = 0.22
Identities = 14/78 (17%), Positives = 25/78 (32%), Gaps = 3/78 (3%)
Query: 104 ARDETAREEEEEEEEEEEEE-EEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED- 161
+ A + + E EE + + E + E+ P D DE D
Sbjct: 381 GQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPGDD 440
Query: 162 -KQGEEKRVEDKIGDEKR 178
+ + R +D +R
Sbjct: 441 PAEPDGIRRQDDFSSRRR 458
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 35.2 bits (82), Expect = 0.013
Identities = 19/85 (22%), Positives = 33/85 (38%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+E +EE + E EEE + E +E E+E + E E + E + E
Sbjct: 41 EEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAE 100
Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
++ +R + K K +K
Sbjct: 101 DEAARPAEAAARRPKAKKAAKKKKG 125
>gnl|CDD|150923 pfam10327, 7TM_GPCR_Sri, Serpentine type 7TM GPCR chemoreceptor
Sri. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Sri is part of the Str
superfamily of chemoreceptors. Chemoperception is one of
the central senses of soil nematodes like C. elegans
which are otherwise 'blind' and 'deaf'.
Length = 303
Score = 34.8 bits (81), Expect = 0.016
Identities = 15/78 (19%), Positives = 30/78 (38%), Gaps = 13/78 (16%)
Query: 14 WNIWAVVTLILVFCTATGNILVCLAISWE-----KRLQN---VTNYFLMSLAITDLMV-- 63
+N W +V IL + L+ + ++ + KRL+ Y A+ L+
Sbjct: 187 YNPWLIVFGILAIFGGFLSFLILVYLTVDMFRMLKRLRKKISAQTYQRHKAALRSLVAQF 246
Query: 64 ---AVLVMPLGILTLVRV 78
V ++P +V +
Sbjct: 247 ATSPVCILPPFAFVVVVL 264
>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
Srsx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of
proteins which are of the serpentine type. Srsx is a
solo family amongst the superfamilies of
chemoreceptors. Chemoperception is one of the central
senses of soil nematodes like C. elegans which are
otherwise 'blind' and 'deaf'.
Length = 257
Score = 34.9 bits (81), Expect = 0.016
Identities = 8/46 (17%), Positives = 22/46 (47%)
Query: 30 TGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTL 75
GN+++ + +K+L++ +Y + + DL+ + +L
Sbjct: 6 FGNVIMIILTFKKKKLRSKCSYLICVQCLADLLCLSGEIIFVVLLF 51
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 35.2 bits (81), Expect = 0.018
Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 1/77 (1%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE-DERGEDEEKKEEEEDKQGEEKR 168
RE+ E E+EEEEE+E+ + D +EDE+E EE E + +E+D++ E+
Sbjct: 132 REDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDAD 191
Query: 169 VEDKIGDEKRLEENKKK 185
+ K E+ K +
Sbjct: 192 LSKKDVLEEPKMFKKPE 208
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 34.8 bits (80), Expect = 0.018
Identities = 18/66 (27%), Positives = 29/66 (43%), Gaps = 1/66 (1%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
D E EE E +E E+++ E + + E E E E E +E +E+ +
Sbjct: 239 DQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDE-SDEDTET 297
Query: 163 QGEEKR 168
GE+ R
Sbjct: 298 PGEDAR 303
Score = 34.0 bits (78), Expect = 0.034
Identities = 13/66 (19%), Positives = 30/66 (45%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
D+ E+ +EE+ ++++ + E E E+ + +E E + E ++ E + +
Sbjct: 224 DDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEAS 283
Query: 166 EKRVED 171
E D
Sbjct: 284 EDSESD 289
Score = 31.4 bits (71), Expect = 0.31
Identities = 18/65 (27%), Positives = 31/65 (47%), Gaps = 3/65 (4%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+ET + EE+ +EE+ +++Q + ED DE EDE + + E ++GE
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDN---NEDSEAGREESEGSDESEEDEAEATDGEGEEGE 276
Query: 166 EKRVE 170
E
Sbjct: 277 MDAAE 281
Score = 29.8 bits (67), Expect = 0.80
Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 1/74 (1%)
Query: 98 QLFKAGARDETAREEEEEE-EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
Q F RD + EE ++ EE+ E+D ++ +N ++ E+ G DE ++
Sbjct: 203 QAFARVVRDMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEE 262
Query: 157 EEEEDKQGEEKRVE 170
+E E GE + E
Sbjct: 263 DEAEATDGEGEEGE 276
Score = 28.3 bits (63), Expect = 3.3
Identities = 21/73 (28%), Positives = 26/73 (35%), Gaps = 18/73 (24%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP------------------EED 147
+ T E EE E + E E+ E DE E+ E E P E D
Sbjct: 266 EATDGEGEEGEMDAAEASEDSESDESDEDTETPGEDARPATPFTELMEEVDYKVFTREFD 325
Query: 148 ERGEDEEKKEEEE 160
E EE +E E
Sbjct: 326 EIVLAEELCDEAE 338
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 34.8 bits (80), Expect = 0.018
Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 7/88 (7%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+G R ++ E EE+E+E+ + ++ +D EDE+E+ +PE +EE+ EE E
Sbjct: 276 PSGERSDSEEETEEKEKEKRKRLKKMMED----EDEDEEMEIVPESP---VEEEESEEPE 328
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQ 188
+K E + ++ + +
Sbjct: 329 PPPLPKKEEEKEEVTVSPDGGRRRGRRR 356
Score = 33.7 bits (77), Expect = 0.045
Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 2/88 (2%)
Query: 95 CENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
+ + K A + T +EE EEE + + E E E DE+E E ER + EE
Sbjct: 226 TKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEE 285
Query: 155 KKEEEEDKQGEEKR--VEDKIGDEKRLE 180
+ EE+E ++ + + +ED+ DE+
Sbjct: 286 ETEEKEKEKRKRLKKMMEDEDEDEEMEI 313
Score = 32.1 bits (73), Expect = 0.16
Identities = 26/92 (28%), Positives = 48/92 (52%), Gaps = 5/92 (5%)
Query: 90 KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
K NI+ + FK +++ ++E E +EE EEE + + + EDE+ + L E++
Sbjct: 214 KSNIM--SSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDE-- 269
Query: 150 GEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
+++E K E EE+ E + KRL++
Sbjct: 270 -DEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 34.5 bits (79), Expect = 0.019
Identities = 24/83 (28%), Positives = 37/83 (44%), Gaps = 3/83 (3%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYL---EEDENEKERRLPEEDERGEDEEKKEEEEDK 162
DE + E EE ++E E E YL E D KE+ E E D+ E E +
Sbjct: 204 DEVTSYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIE 263
Query: 163 QGEEKRVEDKIGDEKRLEENKKK 185
E E++ G+E+ E+ ++
Sbjct: 264 DESEDEYEEEAGEEQENEDKGEE 286
Score = 27.2 bits (60), Expect = 5.6
Identities = 14/52 (26%), Positives = 23/52 (44%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED 147
+ K R E + ++ E E E++ +DEY EE E+E E+
Sbjct: 235 NDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEEEAGEEQENEDKGEE 286
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 35.0 bits (81), Expect = 0.020
Identities = 19/37 (51%), Positives = 22/37 (59%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
R E EEEEEEE+E E EE+E E+ER EE
Sbjct: 297 RPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 34.2 bits (79), Expect = 0.033
Identities = 17/28 (60%), Positives = 20/28 (71%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEE 134
E EEEEE EEEEEEEE+E+ + EE
Sbjct: 306 EEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 33.1 bits (76), Expect = 0.078
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDE 136
E EEEEEEE EEEEEE+E++E E+E
Sbjct: 304 EPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 28.5 bits (64), Expect = 2.7
Identities = 14/25 (56%), Positives = 17/25 (68%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDE 130
+E E EEEEEEEEEEE ++E
Sbjct: 309 EEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 28.1 bits (63), Expect = 3.4
Identities = 15/39 (38%), Positives = 21/39 (53%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
R E EEEEE+E++ EE+E E+E E+E
Sbjct: 295 RRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 27.7 bits (62), Expect = 3.8
Identities = 14/22 (63%), Positives = 16/22 (72%)
Query: 106 DETAREEEEEEEEEEEEEEEQE 127
+E EE EEEEEEEEEEE+
Sbjct: 308 EEEEEEEVPEEEEEEEEEEERT 329
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 34.8 bits (80), Expect = 0.020
Identities = 21/88 (23%), Positives = 45/88 (51%), Gaps = 3/88 (3%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQD-EYLEEDENEKERRLPEEDERGEDEE--KKEEEEDK 162
+ ++++ ++E+E +++ EQ E E+ ++ R E ++R E+ K+ E+ K
Sbjct: 53 NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112
Query: 163 QGEEKRVEDKIGDEKRLEENKKKKTQKR 190
Q EEK+ + + K+ E K K +
Sbjct: 113 QAEEKQKQAEEAKAKQAAEAKAKAEAEA 140
Score = 34.4 bits (79), Expect = 0.026
Identities = 22/85 (25%), Positives = 41/85 (48%), Gaps = 3/85 (3%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ---GE 165
E++ EE E++ E+ Q E + EK + E+ + +E++K+ EE K E
Sbjct: 72 KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAE 131
Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
K + ++K EE KK+ ++
Sbjct: 132 AKAKAEAEAEKKAKEEAKKQAEEEA 156
Score = 34.0 bits (78), Expect = 0.028
Identities = 21/87 (24%), Positives = 44/87 (50%), Gaps = 1/87 (1%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER-GEDEEKKEEE 159
KA + E A ++ EE++++ EE + ++ E + E E E++ EE ++ E+E K +
Sbjct: 102 KAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAA 161
Query: 160 EDKQGEEKRVEDKIGDEKRLEENKKKK 186
+ + + + K E + + K K
Sbjct: 162 AEAKKKAAEAKKKAEAEAKAKAEAKAK 188
Score = 31.0 bits (70), Expect = 0.36
Identities = 23/91 (25%), Positives = 38/91 (41%), Gaps = 3/91 (3%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK-KEEE 159
K + E++ E+ ++E EQ + E+ + EE ++ +E K K+
Sbjct: 71 KKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAA 130
Query: 160 EDKQGEEKRVEDKIGDE--KRLEENKKKKTQ 188
E K E E K +E K+ EE K K
Sbjct: 131 EAKAKAEAEAEKKAKEEAKKQAEEEAKAKAA 161
Score = 27.1 bits (60), Expect = 5.8
Identities = 19/93 (20%), Positives = 38/93 (40%), Gaps = 3/93 (3%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQ---EQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
KA E + E E E++ +EE ++ E+ + E +K+ ++ E + K E
Sbjct: 125 KAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAE 184
Query: 158 EEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+ + EE + + + K E K +
Sbjct: 185 AKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEA 217
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 34.6 bits (79), Expect = 0.024
Identities = 17/64 (26%), Positives = 38/64 (59%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
+E EEE EEE+++E++ D+ +EDE++ + E+DE +D++ ++ + +
Sbjct: 43 KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETD 102
Query: 171 DKIG 174
++ G
Sbjct: 103 NEAG 106
Score = 33.4 bits (76), Expect = 0.054
Identities = 19/64 (29%), Positives = 34/64 (53%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
EEE EEE+++EE+ + D+ +ED+++ + +EDE +D ++ G E E
Sbjct: 46 EEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEA 105
Query: 172 KIGD 175
D
Sbjct: 106 GFAD 109
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 34.5 bits (79), Expect = 0.027
Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 5/72 (6%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER-----GEDEE 154
+A E+ E + E+ EE+E+E++ E +E + + R LP+ G D E
Sbjct: 665 CEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSE 724
Query: 155 KKEEEEDKQGEE 166
+E +D+Q
Sbjct: 725 DEEGMDDEQMMR 736
Score = 33.7 bits (77), Expect = 0.047
Identities = 20/85 (23%), Positives = 34/85 (40%), Gaps = 8/85 (9%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDE-------NEKERRLPEEDERGEDEEKKEEE 159
ET +E+E E E+ E E + E EEDE NE + + R +
Sbjct: 657 ETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRR-ALPKVLNLP 715
Query: 160 EDKQGEEKRVEDKIGDEKRLEENKK 184
+ G + E+ + DE+ + +
Sbjct: 716 DALDGGDSEDEEGMDDEQMMRLDTY 740
Score = 33.3 bits (76), Expect = 0.060
Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 7/71 (9%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
EEE+E++ EE ++++++ E +E+ E+E E GED E+ E+E+D + E V
Sbjct: 648 EEEDEDDLEETDDDEDECEAIEDSESESE-------SDGEDGEEDEQEDDAEANEGVVPI 700
Query: 172 KIGDEKRLEEN 182
+ L +
Sbjct: 701 DKAVRRALPKV 711
Score = 29.5 bits (66), Expect = 1.1
Identities = 18/68 (26%), Positives = 28/68 (41%), Gaps = 5/68 (7%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
L K + + E+EE ++E+ D YL + EK+ R+ E K
Sbjct: 705 RRALPKVLNLPDALDGGDSEDEEGMDDEQMMRLDTYLAQIFKEKKERIQAGGET-----K 759
Query: 156 KEEEEDKQ 163
KE + KQ
Sbjct: 760 KEAQSQKQ 767
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 34.0 bits (78), Expect = 0.027
Identities = 15/53 (28%), Positives = 29/53 (54%)
Query: 99 LFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGE 151
F G + ++ E+EE E++ ++++++ +E+E E EEDE GE
Sbjct: 20 TFGNGKGPFASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
Score = 32.8 bits (75), Expect = 0.084
Identities = 30/101 (29%), Positives = 44/101 (43%), Gaps = 27/101 (26%)
Query: 61 LMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEE 120
L++ +L P +LT + A A+D T EE E++ +E
Sbjct: 7 LLLTLLAFPAVLLTFGNGKG-------------------PFASAQDLTEDEEAEDDVVDE 47
Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
+EE+E +EEDENE L EE+E E E K + D
Sbjct: 48 DEEDEAV----VEEDENE----LTEEEEDEEGEVKASPDAD 80
>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
Length = 323
Score = 34.1 bits (78), Expect = 0.030
Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 3/72 (4%)
Query: 2 ETEVSFGLDAASWNIWAVVTLILVFCTA-TGNILVCLAISWEKRLQNVTNYFLMSLAITD 60
E E F + + N + +V IL+F GN+LV +A+ KR V + +L ++A++D
Sbjct: 15 EEECDFEMVNSDVNYFVIVFYILLFIFGLIGNVLV-IAVLIVKRFMFVVDVYLFNIAMSD 73
Query: 61 LMVAVLVMPLGI 72
LM+ V P I
Sbjct: 74 LML-VFSFPFII 84
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 34.1 bits (78), Expect = 0.031
Identities = 14/20 (70%), Positives = 17/20 (85%)
Query: 106 DETAREEEEEEEEEEEEEEE 125
++ EE+EEEEEEEEEEEE
Sbjct: 224 SDSESEEDEEEEEEEEEEEE 243
Score = 33.4 bits (76), Expect = 0.057
Identities = 15/19 (78%), Positives = 17/19 (89%)
Query: 111 EEEEEEEEEEEEEEEQEQD 129
E EE+EEEEEEEEEE+E D
Sbjct: 227 ESEEDEEEEEEEEEEEEID 245
Score = 32.2 bits (73), Expect = 0.15
Identities = 14/39 (35%), Positives = 21/39 (53%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
+ E EE+EEEEEE+E++E ++ EK R
Sbjct: 222 SGSDSESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKA 260
Score = 31.1 bits (70), Expect = 0.31
Identities = 13/22 (59%), Positives = 16/22 (72%)
Query: 105 RDETAREEEEEEEEEEEEEEEQ 126
+ EE+EEEEEEEEEEE+
Sbjct: 222 SGSDSESEEDEEEEEEEEEEEE 243
Score = 30.3 bits (68), Expect = 0.60
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 97 NQLFKAGARDETAREEEEEEEEEEEEE 123
N +G+ E+ +EEEEEEEEEEEE
Sbjct: 217 NSSSSSGSDSESEEDEEEEEEEEEEEE 243
>gnl|CDD|177154 MTH00091, ND2, NADH dehydrogenase subunit 2; Provisional.
Length = 273
Score = 33.7 bits (78), Expect = 0.034
Identities = 13/62 (20%), Positives = 26/62 (41%), Gaps = 13/62 (20%)
Query: 14 WNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLM--SLAITDLMVAVLVMPLG 71
W+++ ++TL+ VF ++ NYF++ SL + L+ +
Sbjct: 17 WSVFLLMTLLFVFL-----------NKGMGSYSSLFNYFVLQESLGLLFLLFSFGYFQFL 65
Query: 72 IL 73
IL
Sbjct: 66 IL 67
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 33.5 bits (77), Expect = 0.038
Identities = 22/84 (26%), Positives = 42/84 (50%), Gaps = 9/84 (10%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
GA D E E E E+E + + E E+E +KE++ +E ++ + E+K ++E+
Sbjct: 119 GAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMV 178
Query: 163 QGEEKRVEDKIGDEKRLEENKKKK 186
+ + K+ ++ KKKK
Sbjct: 179 EPKG---------SKKKKKKKKKK 193
Score = 27.0 bits (60), Expect = 5.3
Identities = 17/60 (28%), Positives = 27/60 (45%)
Query: 131 YLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
Y D E E E K E+E + + EEK+ + K + K+ ++ KK K +K
Sbjct: 118 YGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177
Score = 26.2 bits (58), Expect = 9.3
Identities = 20/62 (32%), Positives = 34/62 (54%), Gaps = 3/62 (4%)
Query: 132 LEEDENEKERRLPEEDERGEDEEKKEEEEDK---QGEEKRVEDKIGDEKRLEENKKKKTQ 188
E + +EKE E E +EE+K+E++ K + E+K +DK + +KKKK +
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189
Query: 189 KR 190
K+
Sbjct: 190 KK 191
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 33.5 bits (76), Expect = 0.041
Identities = 26/73 (35%), Positives = 40/73 (54%), Gaps = 8/73 (10%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDE--------NEKERRLPEEDERGED 152
KAG + A +E+E +E+EEEEEEE+E++ +EE+E N E G++
Sbjct: 121 KAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDN 180
Query: 153 EEKKEEEEDKQGE 165
E+ EEE + E
Sbjct: 181 GEEGEEESVTEAE 193
Score = 26.6 bits (58), Expect = 8.2
Identities = 18/42 (42%), Positives = 27/42 (64%), Gaps = 1/42 (2%)
Query: 98 QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEK 139
QL K +E+E +E+EEEEEE+E++E E +ENE+
Sbjct: 117 QLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEA-EVEENEQ 157
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 33.7 bits (77), Expect = 0.045
Identities = 17/65 (26%), Positives = 32/65 (49%)
Query: 126 QEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKK 185
+ + E E E ER+ +E++ E E KK + K+ + K + D + + +K
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64
Query: 186 KTQKR 190
K++KR
Sbjct: 65 KSRKR 69
Score = 30.6 bits (69), Expect = 0.52
Identities = 22/93 (23%), Positives = 45/93 (48%), Gaps = 8/93 (8%)
Query: 83 SGAVLRLKKNILCENQLFKAGARDETAREEEE------EEEEEEEEEEEQEQDEYLEEDE 136
S +K IL E +L + ++E A+E+E ++E + + + +Q D +
Sbjct: 2 SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61
Query: 137 NEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
+EK+ R + D E+ E + + G++KR+
Sbjct: 62 SEKKSR--KRDVEDENPEDFIDPDTPFGQKKRL 92
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 33.9 bits (78), Expect = 0.046
Identities = 15/77 (19%), Positives = 34/77 (44%), Gaps = 9/77 (11%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
+ Q A R+ E E+ ++E+++ + E++RR +E + + E K
Sbjct: 645 QAQQQTAETRESQQAEVTEKARTQDEQQQAPRR---------ERQRRRNDEKRQAQQEAK 695
Query: 156 KEEEEDKQGEEKRVEDK 172
E++ +E E++
Sbjct: 696 ALNVEEQSVQETEQEER 712
Score = 32.3 bits (74), Expect = 0.14
Identities = 17/90 (18%), Positives = 41/90 (45%), Gaps = 2/90 (2%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE--EKKEEEE 160
+ET +E+ + E + E +Q++ + + + ++ R D R E E +EE
Sbjct: 581 SGGEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENR 640
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+ + ++ + + ++ E +K +TQ
Sbjct: 641 RNRRQAQQQTAETRESQQAEVTEKARTQDE 670
Score = 29.6 bits (67), Expect = 1.2
Identities = 14/85 (16%), Positives = 37/85 (43%), Gaps = 4/85 (4%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
R+E R + +++ E E Q+ E E + + ++ R E + ++ +E+ +
Sbjct: 635 NREENRRNRRQAQQQTAETRESQQA----EVTEKARTQDEQQQAPRRERQRRRNDEKRQA 690
Query: 164 GEEKRVEDKIGDEKRLEENKKKKTQ 188
+E + + + E +++ Q
Sbjct: 691 QQEAKALNVEEQSVQETEQEERVQQ 715
Score = 27.3 bits (61), Expect = 5.5
Identities = 14/63 (22%), Positives = 31/63 (49%), Gaps = 2/63 (3%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
E AR ++E+++ E + + DE + + K L E++ ++ E++E + Q
Sbjct: 662 TEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKA--LNVEEQSVQETEQEERVQQVQPR 719
Query: 166 EKR 168
K+
Sbjct: 720 RKQ 722
>gnl|CDD|222060 pfam13347, MFS_2, MFS/sugar transport protein. This family is part
of the major facilitator superfamily of membrane
transport proteins.
Length = 425
Score = 33.7 bits (78), Expect = 0.046
Identities = 21/74 (28%), Positives = 31/74 (41%), Gaps = 11/74 (14%)
Query: 5 VSFGLDAASWNIWAVVTLILVFCTATG-NI-----LVCLAISWEKR--LQNVTNYFLMSL 56
FGL + I+A +T IL+ T NI + L ++R L +
Sbjct: 90 TPFGLSMTAKLIYAFITYILLGLAYTFVNIPYWSLIPVLTRDPKERTSLASYRRIGAT-- 147
Query: 57 AITDLMVAVLVMPL 70
I L+VAV+ PL
Sbjct: 148 -IGGLLVAVIAPPL 160
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 33.7 bits (77), Expect = 0.048
Identities = 18/65 (27%), Positives = 35/65 (53%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
ET++++++EE E E E + + E+ R L +E E E+ E+ ++ED+ GE+
Sbjct: 206 ETSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGED 265
Query: 167 KRVED 171
+
Sbjct: 266 SEDGE 270
Score = 26.4 bits (58), Expect = 9.6
Identities = 25/82 (30%), Positives = 35/82 (42%), Gaps = 3/82 (3%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE--RGEDEEKKEEEE 160
G D E +EE + E +QD+ EE ERRL E E EE++ +
Sbjct: 185 GGPDSDEDENPLDEEAPDMTPETSKQDQ-KEERRAAVERRLAELVEMINWNLEERRRDLR 243
Query: 161 DKQGEEKRVEDKIGDEKRLEEN 182
+Q E+ VE DE E+
Sbjct: 244 KEQELEENVERDSDDEDEHGED 265
>gnl|CDD|217749 pfam03824, NicO, High-affinity nickel-transport protein. High
affinity nickel transporters involved in the
incorporation of nickel into H2-uptake hydrogenase and
urease enzymes. Essential for the expression of
catalytically active hydrogenase and urease. Ion uptake
is dependent on proton motive force. HoxN in Alcaligenes
eutrophus is thought to be an integral membrane protein
with seven transmembrane helices. The family also
includes a cobalt transporter.
Length = 278
Score = 33.1 bits (76), Expect = 0.053
Identities = 17/80 (21%), Positives = 28/80 (35%), Gaps = 2/80 (2%)
Query: 8 GLDAASWNIW-AVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVL 66
GL + A+ + C +L+ A W MS+ T + VA+L
Sbjct: 170 GLSFDGILVLPALFAAGMRPCDGADGVLMLAAYLWAFLGG-RKLVLNMSITGTSITVALL 228
Query: 67 VMPLGILTLVRVRTNLSGAV 86
+ L L+ + SG
Sbjct: 229 AGLIETLGLLAKKLAGSGTF 248
>gnl|CDD|216374 pfam01226, Form_Nir_trans, Formate/nitrite transporter.
Length = 249
Score = 33.2 bits (77), Expect = 0.054
Identities = 16/67 (23%), Positives = 25/67 (37%), Gaps = 22/67 (32%)
Query: 11 AASWNIWAVVTLIL-VFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMP 69
+ S+ + + C N LVCLA+ W ++LA D +L +
Sbjct: 146 SLSF----WEAFLRGILC----NWLVCLAV-W------------LALAAKDTAGKILAIW 184
Query: 70 LGILTLV 76
L I V
Sbjct: 185 LPIFAFV 191
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 33.3 bits (77), Expect = 0.055
Identities = 16/53 (30%), Positives = 26/53 (49%), Gaps = 3/53 (5%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
EE EE E E+ E D+Y E E E E+D+ +D+ ++++E
Sbjct: 334 ENPREEAEEAEAPEKVEFMWDDYHREQLEEVE---EEDDDDWDDDWDEDDDEG 383
Score = 33.3 bits (77), Expect = 0.056
Identities = 16/53 (30%), Positives = 26/53 (49%)
Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
EE EE EE E + D+ +E+ E+E +D + +E+D +G E
Sbjct: 333 EENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEGVE 385
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 33.2 bits (76), Expect = 0.056
Identities = 16/65 (24%), Positives = 31/65 (47%), Gaps = 5/65 (7%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE-EDKQGE 165
+ A E +E E + E+ E E D +ED+++ E EE+ D ++ + ++ E
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGE----EEESGSSDSLSEDSDASSEEME 262
Query: 166 EKRVE 170
+E
Sbjct: 263 SGEME 267
Score = 32.9 bits (75), Expect = 0.075
Identities = 14/54 (25%), Positives = 28/54 (51%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
DE + E+ E+E++ +E+++ D+ EE+ + + D E+ E E E
Sbjct: 214 DEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEME 267
Score = 30.2 bits (68), Expect = 0.57
Identities = 15/59 (25%), Positives = 27/59 (45%), Gaps = 5/59 (8%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
+ EE +E E +D E+D E E +D++GE+EE + + + E+
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDE-----DDDQGEEEESGSSDSLSEDSDASSEE 260
Score = 29.8 bits (67), Expect = 0.77
Identities = 12/57 (21%), Positives = 25/57 (43%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
DE +E+E++++ EEEE + D + +E E + + + +D
Sbjct: 225 EDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPDSDD 281
Score = 28.6 bits (64), Expect = 1.9
Identities = 17/76 (22%), Positives = 39/76 (51%), Gaps = 2/76 (2%)
Query: 85 AVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
A R+ +++L + + + + + E+ E+E++ +E+ E D+ EE+E+ L
Sbjct: 194 AFARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKED--EDDDQGEEEESGSSDSLS 251
Query: 145 EEDERGEDEEKKEEEE 160
E+ + +E + E E
Sbjct: 252 EDSDASSEEMESGEME 267
Score = 28.6 bits (64), Expect = 2.1
Identities = 14/61 (22%), Positives = 31/61 (50%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
D E+E++ +E+E++++ +E++ + +E EE E GE E + +D
Sbjct: 220 DSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPDS 279
Query: 166 E 166
+
Sbjct: 280 D 280
Score = 26.3 bits (58), Expect = 9.7
Identities = 12/55 (21%), Positives = 22/55 (40%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
D+ +E++++ EEEE E + E E E E D+ ++
Sbjct: 228 DDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPDSDDA 282
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 33.2 bits (76), Expect = 0.059
Identities = 17/56 (30%), Positives = 31/56 (55%), Gaps = 2/56 (3%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+ + +++ ++++ E E L E E K+ R E++ E+EEK EEE + EE
Sbjct: 127 DSDSSDDDSDDDDSEDETAALLRELEKIKKER--AEEKEREEEEKAAEEEKAREEE 180
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 33.4 bits (76), Expect = 0.060
Identities = 9/81 (11%), Positives = 25/81 (30%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
+ E+ EE E + + + ++E P ++ + + +Q
Sbjct: 162 QPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQ 221
Query: 164 GEEKRVEDKIGDEKRLEENKK 184
+ + E E ++
Sbjct: 222 APSPNTQQAVEHEDEPTEPER 242
Score = 33.4 bits (76), Expect = 0.062
Identities = 8/60 (13%), Positives = 13/60 (21%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+ + E N + P + E+ EE E E
Sbjct: 125 GSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSE 184
Score = 33.0 bits (75), Expect = 0.067
Identities = 9/71 (12%), Positives = 23/71 (32%), Gaps = 2/71 (2%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+ + ++ + E+ +E + E E + P + E + D+ GE
Sbjct: 156 NPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPG--PPQSETPTSSPPPQSPPDEPGE 213
Query: 166 EKRVEDKIGDE 176
+ +
Sbjct: 214 PQSPTPQQAPS 224
Score = 33.0 bits (75), Expect = 0.073
Identities = 13/77 (16%), Positives = 22/77 (28%), Gaps = 1/77 (1%)
Query: 101 KAGARDETAREEEEEEEEEEEEE-EEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
+A A + E + E EE E++E+ + + P G EE
Sbjct: 59 RAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGL 118
Query: 160 EDKQGEEKRVEDKIGDE 176
+ E
Sbjct: 119 SPENTSGSSPESPASHS 135
Score = 32.6 bits (74), Expect = 0.11
Identities = 8/74 (10%), Positives = 12/74 (16%), Gaps = 2/74 (2%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
T EE E E + P E E
Sbjct: 99 SVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPP--PSPPSHPGPHEPAPPESHN 156
Query: 163 QGEEKRVEDKIGDE 176
++ +
Sbjct: 157 PSPNQQPSSFLQPS 170
Score = 32.2 bits (73), Expect = 0.14
Identities = 8/70 (11%), Positives = 12/70 (17%), Gaps = 2/70 (2%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+A E E E P E E +
Sbjct: 109 GSAEELASGLSPENTSGSSPESPA--SHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSF 166
Query: 167 KRVEDKIGDE 176
+ + E
Sbjct: 167 LQPSHEDSPE 176
Score = 31.9 bits (72), Expect = 0.18
Identities = 11/56 (19%), Positives = 17/56 (30%), Gaps = 4/56 (7%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E E Q+ +L+ + PEE E E + + Q E
Sbjct: 146 PHEPAPPESHNPSPNQQPSSFLQPSHEDS----PEEPEPPTSEPEPDSPGPPQSET 197
Score = 31.5 bits (71), Expect = 0.27
Identities = 12/73 (16%), Positives = 16/73 (21%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
A + + E + E EE E E + E EE G
Sbjct: 62 AEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPE 121
Query: 169 VEDKIGDEKRLEE 181
E
Sbjct: 122 NTSGSSPESPASH 134
Score = 31.1 bits (70), Expect = 0.34
Identities = 6/69 (8%), Positives = 15/69 (21%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
+ + A E E N++ + EE + +
Sbjct: 126 SSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEP 185
Query: 163 QGEEKRVED 171
+ +
Sbjct: 186 EPDSPGPPQ 194
Score = 30.7 bits (69), Expect = 0.40
Identities = 7/69 (10%), Positives = 13/69 (18%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
G E E E + + + E+ +
Sbjct: 117 GLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPE 176
Query: 163 QGEEKRVED 171
+ E E
Sbjct: 177 EPEPPTSEP 185
Score = 29.2 bits (65), Expect = 1.3
Identities = 14/83 (16%), Positives = 27/83 (32%), Gaps = 4/83 (4%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
F + +++ E E E E + Q E + DE GE + ++
Sbjct: 166 FLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQS----PPDEPGEPQSPTPQQ 221
Query: 160 EDKQGEEKRVEDKIGDEKRLEEN 182
++ VE + + E
Sbjct: 222 APSPNTQQAVEHEDEPTEPEREG 244
Score = 29.2 bits (65), Expect = 1.3
Identities = 6/69 (8%), Positives = 11/69 (15%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
G + E E+ E+ + P +
Sbjct: 90 GGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEP 149
Query: 163 QGEEKRVED 171
E
Sbjct: 150 APPESHNPS 158
Score = 26.9 bits (59), Expect = 8.8
Identities = 10/76 (13%), Positives = 22/76 (28%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
EE E E E + + + + P+E + ++ + ED
Sbjct: 176 EEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHED 235
Query: 172 KIGDEKRLEENKKKKT 187
+ + +R
Sbjct: 236 EPTEPEREGPPFPGHR 251
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 33.0 bits (75), Expect = 0.061
Identities = 30/83 (36%), Positives = 47/83 (56%), Gaps = 3/83 (3%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
ET E EEE+E EEEEE+ ++E + E++ E RL R +D +E E + +E
Sbjct: 1 ETEVLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKE 60
Query: 167 KRVEDKIGDEKRLEENKKKKTQK 189
K +E+ + KR E +K++T K
Sbjct: 61 KALEE---EAKRKAEERKRETLK 80
>gnl|CDD|225027 COG2116, FocA, Formate/nitrite family of transporters [Inorganic
ion transport and metabolism].
Length = 265
Score = 33.0 bits (76), Expect = 0.070
Identities = 13/52 (25%), Positives = 20/52 (38%), Gaps = 17/52 (32%)
Query: 25 VFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLV 76
+ C N LVCLA+ W MS + D + + L ++ V
Sbjct: 166 ILC----NWLVCLAV-W------------MSYSAKDGAAKIFAIILPVMLFV 200
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 33.1 bits (76), Expect = 0.075
Identities = 24/85 (28%), Positives = 39/85 (45%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
++E E EE E E EEE E++ +N R+ E+ E E KK ++E+K
Sbjct: 387 KEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNE 446
Query: 165 EEKRVEDKIGDEKRLEENKKKKTQK 189
+++ E +E EE K +
Sbjct: 447 FKEKKESDEEEELEDEEEAKVEKVA 471
Score = 32.3 bits (74), Expect = 0.13
Identities = 20/91 (21%), Positives = 43/91 (47%), Gaps = 8/91 (8%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED--------EEKKEEE 159
R E ++EE + E EE ++ EE+ +E+E P + G +E + ++
Sbjct: 379 MQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKK 438
Query: 160 EDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
K+ + + E K DE+ E++++ ++
Sbjct: 439 LKKENKNEFKEKKESDEEEELEDEEEAKVEK 469
Score = 30.8 bits (70), Expect = 0.40
Identities = 22/95 (23%), Positives = 42/95 (44%), Gaps = 1/95 (1%)
Query: 96 ENQLFKAGARDETAREEEE-EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
EN R E EEE EEE EE ++ + ++ E+ ++ + E + +
Sbjct: 389 ENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFK 448
Query: 155 KKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
+K+E ++++ E E K+ + +K QK
Sbjct: 449 EKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQK 483
Score = 30.0 bits (68), Expect = 0.68
Identities = 25/96 (26%), Positives = 46/96 (47%), Gaps = 16/96 (16%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED----------------E 153
EE+E+E+ + EEE+E + ++ + + R+ + + GED E
Sbjct: 324 SEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAE 383
Query: 154 EKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
+K+EE D + EE R E + +E EEN++ +
Sbjct: 384 ARKKEENDAEIEELRRELEGEEESDEEENEEPSKKN 419
Score = 29.3 bits (66), Expect = 1.5
Identities = 22/89 (24%), Positives = 47/89 (52%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+ A + ++E + E +E++E +E+E+ E EE + EK + +E++EEE
Sbjct: 431 EKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEEL 490
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
D++ + +G + +++KKK + K
Sbjct: 491 DEENPWLKTTSSVGKSAKKQDSKKKSSSK 519
Score = 28.5 bits (64), Expect = 2.6
Identities = 18/89 (20%), Positives = 48/89 (53%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+ G ++ +++ ++E + E +E++E +++E LE++E K ++ + + ++ +KEEEE
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEE 487
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
++ EE + + + K +
Sbjct: 488 EELDEENPWLKTTSSVGKSAKKQDSKKKS 516
Score = 27.3 bits (61), Expect = 6.1
Identities = 19/84 (22%), Positives = 35/84 (41%), Gaps = 3/84 (3%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
+ E R + E + EE+E+E E +ED++E + + K +E ED +
Sbjct: 309 RQGEELRRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDE 368
Query: 164 GEEKRVEDKI---GDEKRLEENKK 184
+ K + ++ EEN
Sbjct: 369 ENSGLLSMKFMQRAEARKKEENDA 392
Score = 26.9 bits (60), Expect = 7.6
Identities = 23/102 (22%), Positives = 40/102 (39%), Gaps = 8/102 (7%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E+ + + +E E++EE ++++ E K+ E E E +
Sbjct: 343 EDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELE 402
Query: 156 KEEEEDKQGEEKRVEDKIG--------DEKRLEENKKKKTQK 189
EEE D++ E+ + +G EK E K KK K
Sbjct: 403 GEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENK 444
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 32.4 bits (74), Expect = 0.076
Identities = 13/81 (16%), Positives = 42/81 (51%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK 167
E +++++E EE E+ + EY E+ + + +++ ++ + + ++K ++++DK ++
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114
Query: 168 RVEDKIGDEKRLEENKKKKTQ 188
E + E + + +
Sbjct: 115 EKEAEDKLEDLTKSYSETLST 135
Score = 31.6 bits (72), Expect = 0.16
Identities = 13/80 (16%), Positives = 43/80 (53%), Gaps = 3/80 (3%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
+++E EE E+ ++E +E+ ++ + + K+ ++ ++ + ++KK+++ +K+ E++
Sbjct: 62 KKKKELAEEIEKVKKEYEEKQKWKWKKKKSKK---KKDKDKDKKDDKKDDKSEKKDEKEA 118
Query: 169 VEDKIGDEKRLEENKKKKTQ 188
+ K E ++
Sbjct: 119 EDKLEDLTKSYSETLSTLSE 138
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 32.1 bits (73), Expect = 0.077
Identities = 25/84 (29%), Positives = 43/84 (51%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
++ E++E++EEEEEE+EE+ ++ EDE E EE+E ED ++ E K
Sbjct: 41 EDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNIN 100
Query: 166 EKRVEDKIGDEKRLEENKKKKTQK 189
+ + + + L KK +K
Sbjct: 101 DIFNSTQDDNAQNLISKNYKKNEK 124
Score = 26.4 bits (58), Expect = 8.8
Identities = 21/58 (36%), Positives = 37/58 (63%)
Query: 133 EEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
E ++ + E++ +E+E EDEE+ EE ED + EE+ VED+ +E+ E+N K ++
Sbjct: 39 ENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEK 96
>gnl|CDD|233471 TIGR01569, A_tha_TIGR01569, plant integral membrane protein
TIGR01569. This model describes a region of ~160
residues found exclusively in plant proteins, generally
as the near complete length of the protein. At least 24
different members are found in Arabidopsis thaliana.
Members have four predicted transmembrane regions, the
last of which is preceded by an invariant CXXXXX[FY]C
motif. The family is not functionally characterized.
Length = 154
Score = 32.4 bits (74), Expect = 0.077
Identities = 18/70 (25%), Positives = 33/70 (47%), Gaps = 4/70 (5%)
Query: 2 ETEVSFGLDA---ASWN-IWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLA 57
ET+V F A ++ + A V ++ A G L+ L +S L+ + L++L
Sbjct: 23 ETKVVFVQLITFKAKFSDLPAFVYFVVANAIACGYSLLSLVVSIFGLLKRRVFFKLIALF 82
Query: 58 ITDLMVAVLV 67
DL++ L+
Sbjct: 83 FLDLVMLALL 92
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 33.1 bits (75), Expect = 0.077
Identities = 20/95 (21%), Positives = 43/95 (45%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
AG D+ EE + ++E E E + + E E D++ + + + E+EEK
Sbjct: 483 VRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEK 542
Query: 156 KEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
K + ++K++ K+ +E + + +K+
Sbjct: 543 KLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKK 577
Score = 28.1 bits (62), Expect = 3.6
Identities = 17/81 (20%), Positives = 33/81 (40%), Gaps = 2/81 (2%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
D EE + E++ E+ E ++D+ E + +KE E + QG
Sbjct: 454 DSYDPRASLMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQG- 512
Query: 166 EKRVEDKIGDEKRLEENKKKK 186
+ + +K + ++K KK
Sbjct: 513 -IKYSETSEADKDVNKSKNKK 532
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 32.3 bits (74), Expect = 0.078
Identities = 19/49 (38%), Positives = 32/49 (65%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
++E+EEEEEEE E E+ +E ++ EKE + ++R E+E K++E
Sbjct: 98 KKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEI 146
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 33.2 bits (76), Expect = 0.081
Identities = 13/58 (22%), Positives = 24/58 (41%), Gaps = 1/58 (1%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
E + + E + + E Q+ E E + R+P D E ++ +E +DK
Sbjct: 885 ENTVKGQYSELQNHHKTEALSQNNKYNE-EKSAQERMPGADSPEELMKRAKEYQDKHK 941
Score = 28.6 bits (64), Expect = 2.7
Identities = 13/70 (18%), Positives = 24/70 (34%), Gaps = 2/70 (2%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
G + R EE + + E + + E L ++ E + ER + EE +
Sbjct: 875 GDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEK--SAQERMPGADSPEELMKR 932
Query: 163 QGEEKRVEDK 172
E +
Sbjct: 933 AKEYQDKHKG 942
>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein. Bacillus subtilis stage V
sporulation protein R is involved in spore cortex
formation. Little is known about cortex biosynthesis,
except that it depends on several sigma E controlled
genes, including spoVR.
Length = 427
Score = 32.9 bits (76), Expect = 0.082
Identities = 13/39 (33%), Positives = 25/39 (64%)
Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
EEE+ ++E++EYL+ N+ R +P + E+ E+E +
Sbjct: 178 EEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARR 216
Score = 31.0 bits (71), Expect = 0.29
Identities = 14/50 (28%), Positives = 23/50 (46%), Gaps = 8/50 (16%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEY--------LEEDENEKERRLPEEDER 149
+ EE+ +EE EE + Q D + E+E + +R PEE +
Sbjct: 176 SLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQE 225
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 32.4 bits (74), Expect = 0.085
Identities = 21/54 (38%), Positives = 35/54 (64%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
RE EEEE EE ++ EE+ + E EE+E E+ER +E+E ++ E++ +E +
Sbjct: 20 QREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQE 73
Score = 31.2 bits (71), Expect = 0.23
Identities = 23/66 (34%), Positives = 41/66 (62%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
K A+ E + ++ E EEEE EE+++ E E E ++E L EE E+ ++EE+++E E
Sbjct: 6 KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKERE 65
Query: 161 DKQGEE 166
++ +E
Sbjct: 66 EQARKE 71
Score = 30.1 bits (68), Expect = 0.57
Identities = 17/74 (22%), Positives = 43/74 (58%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
++ + EE++ ++Q E EE+ E+++ + + ++EE+ EEE +K+ EE+ ++
Sbjct: 4 GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63
Query: 172 KIGDEKRLEENKKK 185
+ ++ +E +K
Sbjct: 64 REEQARKEQEEYEK 77
Score = 26.2 bits (58), Expect = 8.4
Identities = 20/56 (35%), Positives = 34/56 (60%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
++E EEE E+++EEEE +E+E+ E++E EK + +E G D+ +EE
Sbjct: 43 KEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKLSADEES 98
>gnl|CDD|184899 PRK14906, PRK14906, DNA-directed RNA polymerase subunit beta'/alpha
domain fusion protein; Provisional.
Length = 1460
Score = 32.9 bits (75), Expect = 0.089
Identities = 21/85 (24%), Positives = 41/85 (48%), Gaps = 3/85 (3%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
D+ AREE+ ++ +E + +E D + E RRL + +DE + +D++
Sbjct: 142 DKEAREEDADDLRDELAADLEELDAE-RDRLIEATRRLSVDYVPEDDEFVDDIGDDERLT 200
Query: 166 EKRVEDKIGDEKRLEENKKKKTQKR 190
+ V ++ D EE ++K +R
Sbjct: 201 AEEVRAEVADIY--EEYNERKALRR 223
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 32.3 bits (73), Expect = 0.090
Identities = 30/105 (28%), Positives = 50/105 (47%), Gaps = 4/105 (3%)
Query: 68 MPLGILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQE 127
+PL I ++R+ + + K I+ + Q+ KA REE EE E E EE+ +
Sbjct: 97 VPLAIRIRDQLRSVKKDDIKKKKSLIIRQEQIEKA----RQEREELEERMEWERREEKID 152
Query: 128 QDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
+ E EE E E+E + EE + E E++E + + +K
Sbjct: 153 EREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKTEK 197
Score = 30.8 bits (69), Expect = 0.27
Identities = 22/73 (30%), Positives = 38/73 (52%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
+E+ E+ +E EE +E+ E+ +E ER EE ER +E+ EE+ D E +D
Sbjct: 125 QEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQD 184
Query: 172 KIGDEKRLEENKK 184
+ E ++ +K
Sbjct: 185 ESETESDDDKTEK 197
Score = 26.6 bits (58), Expect = 8.4
Identities = 22/86 (25%), Positives = 46/86 (53%), Gaps = 1/86 (1%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
+D+ +++ +E+ E+ QE++E E E E+ +E E E++E++ EE+ +
Sbjct: 112 KDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEE 171
Query: 165 EEKRVEDKIGDEKRLE-ENKKKKTQK 189
+ E +I ++ E E+ KT+K
Sbjct: 172 QSDDSEHEIIEQDESETESDDDKTEK 197
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 32.6 bits (74), Expect = 0.090
Identities = 26/89 (29%), Positives = 40/89 (44%), Gaps = 1/89 (1%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
AG E E+ EE E+ +Q+ E E ENE + E+ ER + E EED
Sbjct: 152 AGKVHE-ETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEED 210
Query: 162 KQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
E+ VED ++ L + ++ Q+
Sbjct: 211 YDEEDNPVEDSKAIKEELAKEPVEEQQEV 239
Score = 28.4 bits (63), Expect = 2.3
Identities = 16/71 (22%), Positives = 30/71 (42%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+ +E E + EE E+ +E E + + E E + +E E+ E + E
Sbjct: 144 DDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAET 203
Query: 167 KRVEDKIGDEK 177
V ++ DE+
Sbjct: 204 DDVTEEDYDEE 214
Score = 28.0 bits (62), Expect = 3.0
Identities = 15/81 (18%), Positives = 34/81 (41%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
R + EEE +E+ + E+ + E+ ++ + ++ ED + ++E +
Sbjct: 95 RKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGK 154
Query: 165 EEKRVEDKIGDEKRLEENKKK 185
+ ED E+ E K+
Sbjct: 155 VHEETEDSYHVEETASEQYKQ 175
Score = 27.2 bits (60), Expect = 5.4
Identities = 14/91 (15%), Positives = 37/91 (40%), Gaps = 3/91 (3%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE---KKE 157
K +E + + E+ ++E++ + + +K + D+R E E E
Sbjct: 98 KQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHE 157
Query: 158 EEEDKQGEEKRVEDKIGDEKRLEENKKKKTQ 188
E ED E+ ++ + + + ++++
Sbjct: 158 ETEDSYHVEETASEQYKQDMKEKASEQENED 188
>gnl|CDD|218954 pfam06246, Isy1, Isy1-like splicing family. Isy1 protein is
important in the optimisation of splicing.
Length = 253
Score = 32.3 bits (74), Expect = 0.094
Identities = 25/88 (28%), Positives = 39/88 (44%), Gaps = 3/88 (3%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
G RDE E+ E E E E+E + E E + E EEDER ED +++ +
Sbjct: 168 GYRDE---EDGVLLEYEAEREKELIEKLVEEWKEIQDEDLQAEEDEREEDIYIPLDDDAE 224
Query: 163 QGEEKRVEDKIGDEKRLEENKKKKTQKR 190
V + E+ L E +K++ +
Sbjct: 225 FVAHVPVPTQEEVEEWLLEKRKRELLAK 252
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 31.9 bits (73), Expect = 0.096
Identities = 13/32 (40%), Positives = 25/32 (78%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
+++EE+EEEE++EE+ ++DE EE+ K+ +
Sbjct: 115 DDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
Score = 30.4 bits (69), Expect = 0.29
Identities = 12/29 (41%), Positives = 22/29 (75%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEK 139
+E +++EE+EEEE+++E D+ E +E E
Sbjct: 112 DESDDDEEDEEEEDDEEDDDEDESEEEES 140
Score = 29.2 bits (66), Expect = 0.73
Identities = 12/31 (38%), Positives = 23/31 (74%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDEN 137
E +++EE+EEEE++EE + ++ EE+E+
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEES 140
Score = 28.8 bits (65), Expect = 1.1
Identities = 14/30 (46%), Positives = 25/30 (83%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
EE+E +++EE+EEE++ +E +EDE+E+E
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEE 138
Score = 28.4 bits (64), Expect = 1.4
Identities = 11/31 (35%), Positives = 22/31 (70%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDE 130
+ D+ +EEEE++EE+++E+E E++E
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEE 139
Score = 28.1 bits (63), Expect = 1.6
Identities = 12/33 (36%), Positives = 23/33 (69%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
EE+E +++EE+EE+E DE ++++ +E P
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESP 141
>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA. This model
describes flagellar biosynthesis protein FlhA, one of a
large number of genes associated with the biosynthesis
of functional bacterial flagella. Homologs of many such
proteins, including FlhA, function in type III protein
secretion systems. A separate model describes InvA
(Salmonella enterica), LcrD (Yersinia enterocolitica),
HrcV (Xanthomonas), etc., all of which score below the
noise cutoff for this model [Cellular processes,
Chemotaxis and motility].
Length = 678
Score = 32.6 bits (75), Expect = 0.098
Identities = 28/117 (23%), Positives = 49/117 (41%), Gaps = 6/117 (5%)
Query: 24 LVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLS 83
L+ TATG ++V A S + + + L+VA ++ L ++ + L
Sbjct: 240 LIISTATG-LIVTRASSEGSFGKAIVTQLGAN-PRALLIVAAVLGLLALVPGLPTFPFLF 297
Query: 84 GAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
A L + ++E A E + +EE EEEE+ ++ L D+ E E
Sbjct: 298 LAGALAFLAW----YLRRRSKQEEEAAAEAAKAQEEAAEEEEESINDILALDDLELE 350
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 32.6 bits (75), Expect = 0.10
Identities = 19/59 (32%), Positives = 32/59 (54%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK 167
EEE EE EEE +E + + E E ++ E++ + ++G+DEE EE + E+
Sbjct: 65 EIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 31.9 bits (71), Expect = 0.10
Identities = 25/65 (38%), Positives = 35/65 (53%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
K GA D + ++E EEEEE E E+E +E E +E +E PEE E E + E E
Sbjct: 49 KVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEP 108
Query: 161 DKQGE 165
+ + E
Sbjct: 109 EPEPE 113
Score = 29.9 bits (66), Expect = 0.41
Identities = 24/63 (38%), Positives = 32/63 (50%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
G + EEEEE E EEE EEE E +E EE+ +E E + E E + E E +
Sbjct: 57 GTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEP 116
Query: 163 QGE 165
+ E
Sbjct: 117 EPE 119
Score = 27.6 bits (60), Expect = 2.7
Identities = 24/62 (38%), Positives = 32/62 (51%), Gaps = 2/62 (3%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
D+ EEEE E EEE EEE + ++E E+E E E EE E + E + E E +
Sbjct: 60 PDDEPEEEEEPEPEEEGEEEPEPEEE--GEEEPEPEETGEEEPEPEPEPEPEPEPEPEPE 117
Query: 165 EE 166
E
Sbjct: 118 PE 119
Score = 26.8 bits (58), Expect = 5.5
Identities = 22/57 (38%), Positives = 30/57 (52%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
E E E EEE EEE E E+E +E E +E +E PE + E E + E E + +
Sbjct: 65 EEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEPEPE 121
Score = 26.1 bits (56), Expect = 8.8
Identities = 22/59 (37%), Positives = 29/59 (49%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
+E E EEE EEE E EEE E++ EE E+ PE + E E + E E +
Sbjct: 63 EPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEPEPE 121
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 32.7 bits (74), Expect = 0.11
Identities = 19/107 (17%), Positives = 48/107 (44%), Gaps = 6/107 (5%)
Query: 90 KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRL---PEE 146
+ ++ + + ++ E+ + + + E+ + +E EEDE+ ++ + E
Sbjct: 79 VEFLINKEHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEA 138
Query: 147 DERGEDEEKKEEEEDKQ---GEEKRVEDKIGDEKRLEENKKKKTQKR 190
D E E E+ D + EEK++E++ E++ + + +K
Sbjct: 139 DLFNESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKF 185
Score = 32.7 bits (74), Expect = 0.11
Identities = 20/79 (25%), Positives = 39/79 (49%)
Query: 90 KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
KK+ + +L E ++E + + EEE+EE+++++D+ EE+ E + DE
Sbjct: 266 KKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEP 325
Query: 150 GEDEEKKEEEEDKQGEEKR 168
+ E ++ EKR
Sbjct: 326 VLEGVDLESPKELSSFEKR 344
Score = 31.1 bits (70), Expect = 0.30
Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 1/78 (1%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
+ D EEEEEE+E E+E ++DE +E+E D+ ED+E+K+ EE++
Sbjct: 110 DSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLE-DLSDDETEDDEEKKMEEEE 168
Query: 163 QGEEKRVEDKIGDEKRLE 180
GEEK ++ EK+ +
Sbjct: 169 AGEEKESVEQATREKKFD 186
Score = 28.0 bits (62), Expect = 3.8
Identities = 20/81 (24%), Positives = 37/81 (45%), Gaps = 10/81 (12%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK----------KEEEE 160
E E E+ ++E E ++++ +EE+E +E+ E+ R + +K E E
Sbjct: 143 ESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNE 202
Query: 161 DKQGEEKRVEDKIGDEKRLEE 181
+ E E +GDE E+
Sbjct: 203 FLEATEAEEEAALGDEDDFED 223
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 32.2 bits (74), Expect = 0.11
Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 27/84 (32%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
REEEEE+ + EEE QE E ++++EE K+ E +
Sbjct: 264 TREEEEEKILKAAEEERQE-------------------------EAQEKKEEKKKEEREA 298
Query: 169 VEDKIGDE--KRLEENKKKKTQKR 190
K+ E ++LEE ++KK ++
Sbjct: 299 KLAKLSPEEQRKLEEKERKKQARK 322
Score = 31.5 bits (72), Expect = 0.19
Identities = 16/62 (25%), Positives = 33/62 (53%), Gaps = 2/62 (3%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE--KKEEEEDKQGEE 166
+ EEEEE+ + E+E+ E +E + EK++ E E +K EE++++ +
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320
Query: 167 KR 168
++
Sbjct: 321 RK 322
Score = 30.7 bits (70), Expect = 0.36
Identities = 14/52 (26%), Positives = 25/52 (48%)
Query: 91 KNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
+ IL + + E E+++EE E + + E+ LEE E +K+ R
Sbjct: 270 EKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQAR 321
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 32.7 bits (74), Expect = 0.11
Identities = 23/61 (37%), Positives = 33/61 (54%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E EEEE+EE EE EE E+ E + + E + + R EE ++ E E + EEEE +
Sbjct: 107 ENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGS 166
Query: 167 K 167
Sbjct: 167 L 167
Score = 26.9 bits (59), Expect = 6.9
Identities = 17/56 (30%), Positives = 29/56 (51%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
REE + EEEE+ +QE+ + +E EK R E + R + +K ++ + G
Sbjct: 223 REERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDGL 278
Score = 26.9 bits (59), Expect = 7.7
Identities = 21/75 (28%), Positives = 36/75 (48%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
AR EE +E E E Q++ + D++ E +++ G + E EEEE ++ E+R
Sbjct: 64 ARREERRDERFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREER 123
Query: 169 VEDKIGDEKRLEENK 183
E + + E K
Sbjct: 124 EEVEETEGVTKSEQK 138
Score = 26.9 bits (59), Expect = 8.6
Identities = 22/80 (27%), Positives = 44/80 (55%), Gaps = 3/80 (3%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE- 154
EN + GA +E E+ +++++E + E L++ E+ + L EE++R + EE
Sbjct: 183 ENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEA 242
Query: 155 --KKEEEEDKQGEEKRVEDK 172
K EEE+K+ ++ +E +
Sbjct: 243 DRKSREEEEKRRLKEEIERR 262
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 32.6 bits (75), Expect = 0.11
Identities = 27/102 (26%), Positives = 52/102 (50%), Gaps = 10/102 (9%)
Query: 89 LKKNILCENQLFKAGARDETAREEEEEEEE-EEEEEEEQEQDEYLEEDENEKERRLPEED 147
LKK L E A++E + E E E +E E Q + L + E +R++ D
Sbjct: 48 LKKEALLE-------AKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLD 100
Query: 148 ERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
++ E+ EKKE+E + EK +++K + + L ++++ ++
Sbjct: 101 KKEENLEKKEKELSNK--EKNLDEKEEELEELIAEQREELER 140
Score = 28.4 bits (64), Expect = 2.5
Identities = 29/90 (32%), Positives = 47/90 (52%), Gaps = 12/90 (13%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENE---KERRLPEE----DERGEDEEKKEEE 159
ET ++E E +EE + E + L+E NE ERRL + D + E +KKEE
Sbjct: 46 ETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEEN 105
Query: 160 EDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
+K+ EK + +K EK L+E +++ +
Sbjct: 106 LEKK--EKELSNK---EKNLDEKEEELEEL 130
>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
Length = 356
Score = 32.2 bits (73), Expect = 0.12
Identities = 18/34 (52%), Positives = 23/34 (67%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQD 129
+ QLF A DE EE +E++EEEE EEE+E D
Sbjct: 318 QEQLFNPFAIDEEMLEETQEQQEEEENEEEEEND 351
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 32.1 bits (74), Expect = 0.15
Identities = 32/141 (22%), Positives = 69/141 (48%), Gaps = 9/141 (6%)
Query: 58 ITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEE 117
+ +++A++ + +G + VR ++ A ++ + + L +A E ++E E
Sbjct: 4 LIIILIALVALVVGAVIGYFVRKKIAEAKIKEAEE-EAKRILEEAKKEAEAIKKEALLEA 62
Query: 118 EEEEEEEEQEQDEYLEEDENE---KERRLPEE----DERGEDEEKKEEEEDKQGEEKRVE 170
+EE + E ++ L E NE E+RL ++ D + E EK+EEE +K+ +E +
Sbjct: 63 KEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQK 122
Query: 171 DKIGDEKRLE-ENKKKKTQKR 190
+ ++K E E ++ +
Sbjct: 123 QQELEKKEEELEELIEEQLQE 143
Score = 27.4 bits (62), Expect = 5.9
Identities = 16/57 (28%), Positives = 35/57 (61%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
R ++EE + + E ++++E LE+ E E E++ E +++ E+ E+ EE+ ++ E
Sbjct: 90 RLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 31.6 bits (72), Expect = 0.15
Identities = 19/80 (23%), Positives = 39/80 (48%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
E ++ + E EE+ +E + E+ E++ E ++ E+ +E E K E K
Sbjct: 4 ECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDEN 63
Query: 171 DKIGDEKRLEENKKKKTQKR 190
+K+ +E + EN+ + + R
Sbjct: 64 NKLKEENKKLENELEALKDR 83
Score = 30.1 bits (68), Expect = 0.46
Identities = 18/67 (26%), Positives = 36/67 (53%)
Query: 120 EEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRL 179
E+E + + E +EED ++ E+ + ED E +E E+++ E+ +++ E+
Sbjct: 1 MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60
Query: 180 EENKKKK 186
+EN K K
Sbjct: 61 DENNKLK 67
Score = 30.1 bits (68), Expect = 0.51
Identities = 24/83 (28%), Positives = 48/83 (57%), Gaps = 2/83 (2%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E +EE++ +EE+ E EE E++E +E+ E E + E+ + E+ + KEE + + E
Sbjct: 20 ENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVK--IEELKDENNKLKEENKKLENEL 77
Query: 167 KRVEDKIGDEKRLEENKKKKTQK 189
+ ++D++ +N +K+T K
Sbjct: 78 EALKDRLLRTVAEYDNYRKRTAK 100
Score = 29.3 bits (66), Expect = 0.87
Identities = 24/76 (31%), Positives = 45/76 (59%), Gaps = 2/76 (2%)
Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIG 174
E+E ++ + E +++ +E+EN++E + EED E+ EK+E ED + + +++
Sbjct: 1 MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60
Query: 175 DE--KRLEENKKKKTQ 188
DE K EENKK + +
Sbjct: 61 DENNKLKEENKKLENE 76
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 31.9 bits (73), Expect = 0.16
Identities = 26/82 (31%), Positives = 45/82 (54%), Gaps = 10/82 (12%)
Query: 85 AVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
A+ +K+ I + FK E +EE+ EE E+ EE+E+++ E E+E E
Sbjct: 248 ALEAIKEVIKKKGGDFKVKGEPEVVGGDEEDLEELLEKAEEEEEEDDYSESEDEDE---- 303
Query: 145 EEDERGEDEEKKEEEEDKQGEE 166
EDE+++EEE+D +G++
Sbjct: 304 ------EDEDEEEEEDDDEGDK 319
Score = 26.5 bits (59), Expect = 9.3
Identities = 15/44 (34%), Positives = 26/44 (59%)
Query: 133 EEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
EED E + EE+E + E ++E+E+ + EE+ +D GD+
Sbjct: 276 EEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 31.8 bits (73), Expect = 0.16
Identities = 17/33 (51%), Positives = 21/33 (63%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLE 133
A+ + A EEEEEEEEEEEEEE ++E
Sbjct: 291 VLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAAA 323
Score = 28.7 bits (65), Expect = 1.7
Identities = 14/29 (48%), Positives = 21/29 (72%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEED 135
+A+ + EEEEEEEEE+E++E EE+
Sbjct: 292 LSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 32.1 bits (73), Expect = 0.17
Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 6/83 (7%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE----- 166
E+ E+E E++ + ++ L+E E K+ E +E E E K+ E +K+ +E
Sbjct: 518 EKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKAL 577
Query: 167 -KRVEDKIGDEKRLEENKKKKTQ 188
K VE I + K + +K K+ +
Sbjct: 578 KKEVESIIRELKEKKIHKAKEIK 600
Score = 31.7 bits (72), Expect = 0.22
Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 6/87 (6%)
Query: 104 ARDETAREEEE-EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
E + ++E E+E EE +E E+ + LE++ E + L +E E E KE++ K
Sbjct: 537 LLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVES-IIRELKEKKIHK 595
Query: 163 QGEEKRVEDKIGDEKRLEENKKKKTQK 189
E K +ED +L+E K+K QK
Sbjct: 596 AKEIKSIEDL----VKLKETKQKIPQK 618
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 31.9 bits (72), Expect = 0.17
Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 7/84 (8%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDE------YLEEDENEKERRLPEEDER 149
E+ + + DE + ++E+ + E +E +Q+E Y + E + E E
Sbjct: 714 EDDFSDSTSDDEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEE 773
Query: 150 GE-DEEKKEEEEDKQGEEKRVEDK 172
E EE+KEEEE+K+ KR + K
Sbjct: 774 DESSEEEKEEEENKEVSAKRAKKK 797
Score = 28.9 bits (64), Expect = 1.7
Identities = 16/89 (17%), Positives = 36/89 (40%), Gaps = 3/89 (3%)
Query: 96 ENQLFKAGARDETAREEEEEEEE---EEEEEEEQEQDEYLEEDENEKERRLPEEDERGED 152
EN+++ A + E++ ++ E E++ + D+ + D + E E + +
Sbjct: 687 ENEIWSALVKSRPDVEDDSDDSELDFAEDDFSDSTSDDEPKLDAIDDEDAKSEGSQESDQ 746
Query: 153 EEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
EE +E E+ D + +E
Sbjct: 747 EEGLDEIFYSFDGEQDNSDSFAESSEEDE 775
Score = 28.1 bits (62), Expect = 3.1
Identities = 16/74 (21%), Positives = 34/74 (45%), Gaps = 9/74 (12%)
Query: 106 DETAREEEEEEEEEEE--------EEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
DE A+ E +E ++EE + EQ+ + + +E++ EE E E++E
Sbjct: 733 DEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFA-ESSEEDESSEEEKEEEENKEVSA 791
Query: 158 EEEDKQGEEKRVED 171
+ K+ + ++
Sbjct: 792 KRAKKKQRKNMLKS 805
Score = 28.1 bits (62), Expect = 4.0
Identities = 16/83 (19%), Positives = 37/83 (44%), Gaps = 2/83 (2%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED--ERGEDEEKKEE 158
K ++ + + ++E +E E + +E+D ++ E E+D + D+E K +
Sbjct: 670 KGKKSNKASFDSDDEMDENEIWSALVKSRPDVEDDSDDSELDFAEDDFSDSTSDDEPKLD 729
Query: 159 EEDKQGEEKRVEDKIGDEKRLEE 181
D + + + E+ L+E
Sbjct: 730 AIDDEDAKSEGSQESDQEEGLDE 752
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 31.2 bits (71), Expect = 0.18
Identities = 18/59 (30%), Positives = 30/59 (50%), Gaps = 2/59 (3%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
E E ++ EE+EE +E ++ DE E + E+DE + EE E+ + EE+
Sbjct: 3 SESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEE--EQIFVTRQEEEVD 59
Score = 30.0 bits (68), Expect = 0.50
Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 1/60 (1%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
E ++E +EEE + ++++ +E E+E R EE+ E E + + E +K
Sbjct: 14 EELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTR-QEEEVDPEAEAEFDREFEK 72
Score = 29.3 bits (66), Expect = 0.92
Identities = 19/82 (23%), Positives = 44/82 (53%), Gaps = 2/82 (2%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE--RGEDEEKKEEEEDK 162
D+ +EE EE+E++E ++E+ + +++++E+ E+ R E+E E E +
Sbjct: 7 SDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEAEAEF 66
Query: 163 QGEEKRVEDKIGDEKRLEENKK 184
E +++ + + ++ E KK
Sbjct: 67 DREFEKMMAESLESRKFESRKK 88
>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2). Surfeit locus
protein 2 is part of a group of at least six sequence
unrelated genes (Surf-1 to Surf-6). The six Surfeit
genes have been classified as housekeeping genes, being
expressed in all tissue types tested and not containing
a TATA box in their promoter region. The exact function
of SURF2 is unknown.
Length = 244
Score = 31.4 bits (71), Expect = 0.19
Identities = 16/81 (19%), Positives = 33/81 (40%), Gaps = 7/81 (8%)
Query: 116 EEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED------EEKKEEEEDKQGEEKRV 169
+E++ + E+ D Y E K + + +D +E + E + Q + +
Sbjct: 160 SDEDDSDSEDSMSDLYPPELFTLKNPGKEQNGDEDDDFETDDEDEMEVESPELQQKRSKK 219
Query: 170 EDKIGDEKRLEENKKKKTQKR 190
+ K+ ++N KKK R
Sbjct: 220 QSG-SLTKKFKKNHKKKGPFR 239
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 31.3 bits (71), Expect = 0.21
Identities = 17/73 (23%), Positives = 27/73 (36%), Gaps = 2/73 (2%)
Query: 110 REEEEEEEEEEEEEEE-QEQDEYLEEDENEKERRLPEEDERGEDEEK-KEEEEDKQGEEK 167
RE E +EE E EEE E + R E + + K ++E K ++
Sbjct: 67 RERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDP 126
Query: 168 RVEDKIGDEKRLE 180
R + +K
Sbjct: 127 RGGTQDVVDKSQA 139
Score = 28.6 bits (64), Expect = 1.6
Identities = 15/70 (21%), Positives = 26/70 (37%), Gaps = 7/70 (10%)
Query: 117 EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
E E +EE E E EE E + + K+ E++ + K DK +
Sbjct: 68 ERELKEEAEAE-----EEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDK--ER 120
Query: 177 KRLEENKKKK 186
K ++ +
Sbjct: 121 KSAKDPRGGT 130
>gnl|CDD|218241 pfam04750, Far-17a_AIG1, FAR-17a/AIG1-like protein. This family
includes the hamster androgen-induced FAR-17a protein,
and its human homologue, the AIG1 protein. The function
of these proteins is unknown. This family also includes
homologous regions from a number of other metazoan
proteins.
Length = 203
Score = 31.1 bits (71), Expect = 0.21
Identities = 15/78 (19%), Positives = 32/78 (41%), Gaps = 12/78 (15%)
Query: 1 METEVSFGLDAASWNIWAVVTLILVFCTATGNILVCLA--ISWEKRLQNVTNYFLMSLAI 58
+ +++A + +T I + A L LA + ++L+ +
Sbjct: 20 DHQLPTLEIESAYGGKFKYLTFIGLVLQALYFTLALLADVVRLSRKLRK----------L 69
Query: 59 TDLMVAVLVMPLGILTLV 76
D ++AVL P+G++ V
Sbjct: 70 RDFLLAVLAFPVGVIVSV 87
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 31.2 bits (71), Expect = 0.21
Identities = 25/74 (33%), Positives = 39/74 (52%)
Query: 117 EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
EEE EE+E EE+E+++E + EE+E E + ++ ED + E E+ +
Sbjct: 19 EEELEEDEFFWTYLLFEEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKEL 78
Query: 177 KRLEENKKKKTQKR 190
+R E KKKK K
Sbjct: 79 QREERLKKKKRVKT 92
Score = 30.0 bits (68), Expect = 0.56
Identities = 26/80 (32%), Positives = 43/80 (53%), Gaps = 7/80 (8%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
EE++EE E EEEEEE+E D ++ E+ +E E ++EE ++E + ++ +K+
Sbjct: 37 EEDDEEFEIEEEEEEEEVDSDFDDSED-------DEPESDDEEEGEKELQREERLKKKKR 89
Query: 171 DKIGDEKRLEENKKKKTQKR 190
K K + KKKK
Sbjct: 90 VKTKAYKEPTKKKKKKDPTA 109
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 31.3 bits (71), Expect = 0.21
Identities = 9/44 (20%), Positives = 19/44 (43%)
Query: 122 EEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+ ++ LE + +E D+ E E+ ++E K+ E
Sbjct: 149 KARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESSKKKE 192
>gnl|CDD|220112 pfam09110, HAND, HAND. The HAND domain adopts a secondary
structure consisting of four alpha helices, three of
which (H2, H3, H4) form an L-like configuration. Helix
H2 runs antiparallel to helices H3 and H4, packing
closely against helix H4, whilst helix H1 reposes in the
concave surface formed by these three helices and runs
perpendicular to them. The domain confers DNA and
nucleosome binding properties to the protein.
Length = 109
Score = 30.3 bits (69), Expect = 0.21
Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 4/50 (8%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQ----DEYLEEDENEKERRLPE 145
+K D +EE+EEE E + + EQE+ + EE+E EK+R L E
Sbjct: 59 YKVPLDDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLEE 108
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 31.6 bits (72), Expect = 0.22
Identities = 16/24 (66%), Positives = 17/24 (70%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDE 130
E EEEEEEEEEEEEEE+ E
Sbjct: 400 ERWAAEEEEEEEEEEEEEEEPVAE 423
Score = 30.4 bits (69), Expect = 0.61
Identities = 17/37 (45%), Positives = 20/37 (54%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
R EEEEEEEEEEEE+E+ E +LP
Sbjct: 399 VERWAAEEEEEEEEEEEEEEEPVAEVMMMPAPEMQLP 435
>gnl|CDD|216235 pfam00999, Na_H_Exchanger, Sodium/hydrogen exchanger family. Na/H
antiporters are key transporters in maintaining the pH
of actively metabolising cells. The molecular mechanisms
of antiport are unclear. These antiporters contain 10-12
transmembrane regions (M) at the amino-terminus and a
large cytoplasmic region at the carboxyl terminus. The
transmembrane regions M3-M12 share identity with other
members of the family. The M6 and M7 regions are highly
conserved. Thus, this is thought to be the region that
is involved in the transport of sodium and hydrogen
ions. The cytoplasmic region has little similarity
throughout the family.
Length = 370
Score = 31.4 bits (72), Expect = 0.22
Identities = 12/77 (15%), Positives = 25/77 (32%), Gaps = 10/77 (12%)
Query: 5 VSFGL----DAASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITD 60
VS GL + ++ +V L+LV G +L ++ L +
Sbjct: 271 VSVGLSLDLSSLLLSLLLLVLLLLVAILL-GKLLGVFLLARLLGLSLREALIVGF----- 324
Query: 61 LMVAVLVMPLGILTLVR 77
+ + L + +
Sbjct: 325 GGLQRGAVSLALAAIGL 341
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 31.6 bits (71), Expect = 0.23
Identities = 29/89 (32%), Positives = 46/89 (51%), Gaps = 2/89 (2%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
EN + G EEE ++EEEE+E+ E+ E +EE E +E +ED +DE++
Sbjct: 85 ENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKE 144
Query: 156 KEEE--EDKQGEEKRVEDKIGDEKRLEEN 182
+ E E++ E +E+ D L EN
Sbjct: 145 SDAEGDENELAGEYIIEEVDDDVAILNEN 173
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 31.6 bits (71), Expect = 0.23
Identities = 26/98 (26%), Positives = 41/98 (41%), Gaps = 13/98 (13%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER-------------RLPEED 147
K D EEE +++E+ E+ E +E ++E E+ R + E
Sbjct: 582 KDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYET 641
Query: 148 ERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKK 185
ER E+ KKEE E+R + + D E K+K
Sbjct: 642 EREENARKKEELRGNFELEERGDPEKKDVDWYTEEKRK 679
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 31.3 bits (71), Expect = 0.24
Identities = 15/30 (50%), Positives = 17/30 (56%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDE 136
E EEEEEEEEEEEEE + + E
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVP 428
Score = 30.2 bits (68), Expect = 0.72
Identities = 16/30 (53%), Positives = 18/30 (60%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENE 138
R EEEEEEEEEEEEE + E E+
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVP 428
Score = 29.8 bits (67), Expect = 0.91
Identities = 13/30 (43%), Positives = 15/30 (50%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLE 133
A +E EEEEEEE E E +E E
Sbjct: 402 AEEEEEEEEEEEEEAAEAEAPMEEPVPGFE 431
Score = 27.9 bits (62), Expect = 3.3
Identities = 17/52 (32%), Positives = 27/52 (51%)
Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
EE+ E E+L+E + R EE+E E+EE++E E + E+ V
Sbjct: 377 EEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPVP 428
Score = 27.9 bits (62), Expect = 4.0
Identities = 15/33 (45%), Positives = 19/33 (57%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPE 145
EEEEEEEEEEEE+ + +E +PE
Sbjct: 402 AEEEEEEEEEEEEEAAEAEAPMEEPVPGFEVPE 434
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 29.9 bits (68), Expect = 0.24
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLE 133
A A A EE++EEEEEEEE+E+ ++E
Sbjct: 68 AAAAAAAAAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 29.2 bits (66), Expect = 0.46
Identities = 15/29 (51%), Positives = 20/29 (68%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
A A A EE++EEEEEEEE++E +E
Sbjct: 67 AAAAAAAAAAAEEKKEEEEEEEEKEESEE 95
Score = 28.8 bits (65), Expect = 0.80
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
A A A EE++EEEEEEEE+E+ E
Sbjct: 66 AAAAAAAAAAAAEEKKEEEEEEEEKEESE 94
Score = 26.8 bits (60), Expect = 3.6
Identities = 15/34 (44%), Positives = 20/34 (58%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEED 135
A A A EE++EEEEEE+E++E EE
Sbjct: 64 AAAAAAAAAAAAAAEEKKEEEEEEEEKEESEEEA 97
Score = 26.5 bits (59), Expect = 4.6
Identities = 14/34 (41%), Positives = 19/34 (55%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEED 135
A A A EE++EEEEE+E+ E EE+
Sbjct: 63 AAAAAAAAAAAAAAAEEKKEEEEEEEEKEESEEE 96
>gnl|CDD|129888 TIGR00806, rfc, RFC reduced folate carrier. The Reduced Folate
Carrier (RFC) Family (TC 2.A.48) Members of the RFC
family mediate the uptake of folate, reduce folate,
derivatives of reduced folate and the drug,
methotrexate. Proteins of the RFC family are so-far
restricted to animals. RFC proteins possess 12 putative
transmembrane a-helical spanners (TMSs) and evidence for
a 12 TMS topology has been published for the human RFC.
The RFC transporters appear to transport reduced folate
by an energy-dependent, pH-dependent, Na+-independent
mechanism. Folate:H+ symport, folate:OH- antiport and
folate:anion antiport mechanisms have been proposed, but
the energetic mechanism is not well defined [Transport
and binding proteins, Carbohydrates, organic alcohols,
and acids].
Length = 511
Score = 31.4 bits (71), Expect = 0.25
Identities = 37/172 (21%), Positives = 60/172 (34%), Gaps = 40/172 (23%)
Query: 14 WNIW-----AVVTLI---LVFCTA-TGNILVC---------------------LAISWEK 43
W W AVV+ I LVF + + +I V +A S K
Sbjct: 325 WARWSKLLIAVVSAIQAGLVFWMSQSHDIWVLYVTYVLFRGIYQFLVPIATFQIASSLSK 384
Query: 44 RLQNVT---NYFLMSLAITDLMVAVLVMPLGILTLVRVR------TNLSGAVLRLKKNIL 94
L + N F+ + A+ ++ AV+ G+ VR + L +++ L
Sbjct: 385 ELCALVFGINTFV-ATALKTIITAVVSDKRGLGLQVRKQFRIYSVYFLVLSIICFFGAGL 443
Query: 95 CENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
+ K G A +E EE + QD L + + LPE+
Sbjct: 444 DGLRYCKRGTHQPLAPAQELRSPLEESVQAISLQDRDLRGPQPSAPQLLPED 495
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 31.5 bits (72), Expect = 0.26
Identities = 10/40 (25%), Positives = 19/40 (47%), Gaps = 2/40 (5%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
EEE E+++ ++ EE++++ E R EE
Sbjct: 262 PQPPEEEPPPPPPPPEDDDDPPED--EEEQDDAEDRALEE 299
Score = 30.8 bits (70), Expect = 0.46
Identities = 9/47 (19%), Positives = 16/47 (34%), Gaps = 7/47 (14%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
+ EEE +D+ ++ EE + ED +E
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPED-------EEEQDDAEDRALEE 299
Score = 28.8 bits (65), Expect = 1.7
Identities = 8/41 (19%), Positives = 17/41 (41%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
A + EEE ++ D+ E++E + + +E
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300
Score = 26.9 bits (60), Expect = 7.3
Identities = 8/35 (22%), Positives = 17/35 (48%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENE 138
+E E++++ E+E+EQD+ + E
Sbjct: 265 PEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 30.3 bits (68), Expect = 0.26
Identities = 20/67 (29%), Positives = 29/67 (43%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
A A E EE E + +E+ E E DE ++E E + DE E EE +
Sbjct: 64 AAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADE 123
Query: 164 GEEKRVE 170
E++ E
Sbjct: 124 EEDEEAE 130
Score = 30.3 bits (68), Expect = 0.31
Identities = 19/75 (25%), Positives = 33/75 (44%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E R E+ E E+ E + DE+ E + + ++EE+ +E D+ +E
Sbjct: 54 EAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADE 113
Query: 167 KRVEDKIGDEKRLEE 181
E + DE+ EE
Sbjct: 114 ADAEAEEADEEEDEE 128
Score = 30.3 bits (68), Expect = 0.34
Identities = 16/65 (24%), Positives = 30/65 (46%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
A + E + +E+ +E E + + +E+E E DE + E+ +EEED
Sbjct: 67 TAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEED 126
Query: 162 KQGEE 166
++ E
Sbjct: 127 EEAEA 131
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 31.1 bits (70), Expect = 0.28
Identities = 8/41 (19%), Positives = 19/41 (46%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED 152
+++E + +EE++ D+ ED ++ +ED
Sbjct: 274 DDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNGA 314
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 31.2 bits (70), Expect = 0.31
Identities = 27/80 (33%), Positives = 35/80 (43%), Gaps = 24/80 (30%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
EEEEE E EE+EE E +E + DE+ +EEEE E K ++
Sbjct: 140 LEEEEEVEMEEDEEYYE-----------------KEPGKVVDEKSEEEEE---EELKTMK 179
Query: 171 DKIGDEKRLEENKKKKTQKR 190
D I LE KK + K
Sbjct: 180 DFI----DLESQTKKPSVKD 195
Score = 31.2 bits (70), Expect = 0.32
Identities = 18/69 (26%), Positives = 34/69 (49%), Gaps = 1/69 (1%)
Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK-QGEEKRVEDKIGDEK 177
+++EE E D ++ +P+ E+E + EE+E+ + E +V D+ +E+
Sbjct: 111 DDDEENLPSSIAPPEIDPEPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEE 170
Query: 178 RLEENKKKK 186
EE K K
Sbjct: 171 EEEELKTMK 179
Score = 30.4 bits (68), Expect = 0.53
Identities = 27/93 (29%), Positives = 40/93 (43%), Gaps = 7/93 (7%)
Query: 102 AGARDETAREEEEEEEEEEEEEE-EQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
D R+ + EEEEE E E+DE E E E + + E+ E E+EE K ++
Sbjct: 123 PPEIDPEPRKPIVPDLVLEEEEEVEMEEDE--EYYEKEPGKVVDEKSEEEEEEELKTMKD 180
Query: 161 ----DKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
+ Q ++ V+D KK QK
Sbjct: 181 FIDLESQTKKPSVKDNGKSFWSAASVFSKKLQK 213
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 31.2 bits (70), Expect = 0.31
Identities = 17/74 (22%), Positives = 41/74 (55%)
Query: 117 EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
E E +E E ++++ EE EN + + + +R ++ +++EE+D+ E +R + + DE
Sbjct: 283 EIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADE 342
Query: 177 KRLEENKKKKTQKR 190
E ++ +++
Sbjct: 343 LNELEKGIEEKRRQ 356
Score = 29.3 bits (65), Expect = 1.3
Identities = 14/65 (21%), Positives = 29/65 (44%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E Q + + E + + +E EE+E++ + E E E E+G +E++
Sbjct: 295 EQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKR 354
Query: 156 KEEEE 160
++ E
Sbjct: 355 RQMES 359
Score = 27.0 bits (59), Expect = 7.2
Identities = 21/73 (28%), Positives = 34/73 (46%), Gaps = 3/73 (4%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+ A EE EE EE+ + + E E E +KE+ +++E E KEE + + +
Sbjct: 261 KQGAEEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQ---QQEEVENAEAHKEEVQSDRPD 317
Query: 166 EKRVEDKIGDEKR 178
E E + DE
Sbjct: 318 EIGEEKEEDDENE 330
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 30.6 bits (70), Expect = 0.33
Identities = 27/72 (37%), Positives = 48/72 (66%), Gaps = 3/72 (4%)
Query: 117 EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
+ E+ + E +++ + LEE++ E E+R+ E + + E EK+EEEE +Q EEKR D+I
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEE-RQIEEKRHADEIAFL 175
Query: 177 KRLEENKKKKTQ 188
K+ +N++ K+Q
Sbjct: 176 KK--QNQQLKSQ 185
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 30.8 bits (70), Expect = 0.33
Identities = 20/64 (31%), Positives = 36/64 (56%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E A E EEEE E ++Q Q E E + +RL E + R +E+++ +++DK+ ++
Sbjct: 142 EQALLEVLEEEELAELRQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQ 201
Query: 167 KRVE 170
+ E
Sbjct: 202 REKE 205
>gnl|CDD|215369 PLN02685, PLN02685, iron superoxide dismutase.
Length = 299
Score = 30.7 bits (69), Expect = 0.36
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
RE+EE E+EEE + + + YL+ D + E
Sbjct: 267 REQEETRTEDEEEPDSEAVEMYLDSDIDVSE 297
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 29.1 bits (66), Expect = 0.36
Identities = 14/28 (50%), Positives = 18/28 (64%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQD 129
A A A EEE++EEEEEEE++ D
Sbjct: 55 AAAAAAAAAAAAEEEKKEEEEEEEEDDD 82
Score = 27.2 bits (61), Expect = 2.1
Identities = 10/20 (50%), Positives = 15/20 (75%)
Query: 111 EEEEEEEEEEEEEEEQEQDE 130
EEE++EEEEE+E+D+
Sbjct: 62 AAAAAEEEKKEEEEEEEEDD 81
Score = 26.8 bits (60), Expect = 2.3
Identities = 14/28 (50%), Positives = 17/28 (60%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQD 129
A A A EEE++EEEEEE+E D
Sbjct: 54 AAAAAAAAAAAAAEEEKKEEEEEEEEDD 81
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 31.1 bits (71), Expect = 0.37
Identities = 14/62 (22%), Positives = 30/62 (48%)
Query: 125 EQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKK 184
E E++E + + ER+ + +R +++ ++EE +K +K+ E K + K
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454
Query: 185 KK 186
K
Sbjct: 455 KV 456
Score = 26.8 bits (60), Expect = 8.9
Identities = 11/63 (17%), Positives = 27/63 (42%), Gaps = 2/63 (3%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E + + R + E +EE E+ +++ E + + + ++ + D GE +
Sbjct: 410 ERKKLRKKQRKAEKKAE-KEEAEKAAAKKKAEAAAKKAKGPDGETKK-VDPDPLGEKLAR 467
Query: 156 KEE 158
E+
Sbjct: 468 TED 470
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 30.8 bits (70), Expect = 0.39
Identities = 10/26 (38%), Positives = 11/26 (42%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQE 127
A A A EEEEEEE +
Sbjct: 278 AAAAAAAAAAAAAAAPAEEEEEEEDD 303
Score = 29.6 bits (67), Expect = 0.78
Identities = 11/39 (28%), Positives = 13/39 (33%)
Query: 91 KNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQD 129
K L A A A EEEEEE++
Sbjct: 266 KEALANPSAAAAAAAAAAAAAAAAAAAPAEEEEEEEDDM 304
Score = 27.3 bits (61), Expect = 4.6
Identities = 7/22 (31%), Positives = 10/22 (45%)
Query: 109 AREEEEEEEEEEEEEEEQEQDE 130
A EEEE+E+D+
Sbjct: 282 AAAAAAAAAAAPAEEEEEEEDD 303
Score = 26.9 bits (60), Expect = 6.9
Identities = 7/29 (24%), Positives = 11/29 (37%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
A A A EEE+E+++
Sbjct: 274 AAAAAAAAAAAAAAAAAAAPAEEEEEEED 302
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 31.0 bits (71), Expect = 0.39
Identities = 26/106 (24%), Positives = 49/106 (46%), Gaps = 23/106 (21%)
Query: 97 NQLFKAGARDETAREEEEEEE-------EEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
N + A+D+ +E++ ++E E ++ +EY ED+ KER + +
Sbjct: 475 NGILTVSAKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKER----IEAK 530
Query: 150 GEDEE-------KKEEEEDK--QGEEKRVEDKIGDEKRLEENKKKK 186
E EE +EE DK + ++K+VE+ I L+E + +
Sbjct: 531 NEAEEYVYSLEKSLKEEGDKLPEADKKKVEEAIEW---LKEELEGE 573
Score = 27.6 bits (62), Expect = 4.5
Identities = 22/90 (24%), Positives = 42/90 (46%), Gaps = 6/90 (6%)
Query: 106 DETAREEEEEE--EEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
++E E ++ EE E ++ + E +NE E + ++ ++E K E DK+
Sbjct: 498 SSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKK 557
Query: 164 GEEKRVED----KIGDEKRLEENKKKKTQK 189
E+ +E G++K E K ++ QK
Sbjct: 558 KVEEAIEWLKEELEGEDKEEIEAKTEELQK 587
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 31.0 bits (70), Expect = 0.40
Identities = 23/92 (25%), Positives = 42/92 (45%), Gaps = 5/92 (5%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED-ERGEDEEKKEEEEDK 162
R E EE E+ +E + E+E ++ +E+ E+E E G E +++EE
Sbjct: 364 ERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFL 423
Query: 163 QG---EEKRVEDKIGDE-KRLEENKKKKTQKR 190
+ EEK + +K+ E +E+ + R
Sbjct: 424 KELSKEEKELLEKLKMEASEVEKLFGRALPVR 455
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 29.3 bits (66), Expect = 0.41
Identities = 24/50 (48%), Positives = 36/50 (72%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
EEEEEEEEE++ E+ ++DE+++E E E +RR D R E EE+++ EE
Sbjct: 11 EEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEE 60
Score = 25.9 bits (57), Expect = 6.1
Identities = 19/60 (31%), Positives = 36/60 (60%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
E E ++EEEEEEEE++ E L +++ + E+D R +++ E+E+++ E+ E
Sbjct: 4 TEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEELAE 63
>gnl|CDD|224712 COG1799, COG1799, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 167
Score = 30.1 bits (68), Expect = 0.43
Identities = 12/47 (25%), Positives = 17/47 (36%), Gaps = 3/47 (6%)
Query: 112 EEEEEEEEEEEEEEQEQDE---YLEEDENEKERRLPEEDERGEDEEK 155
++EE+ EE+ QE + EE K E R E
Sbjct: 15 PLDDEEDYYEEDPRQEPRDPAVEPEEYAPPKYNFRKREPARMEMRGN 61
>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
hydrophilic C-term. This domain is a hydrophilic region
found at the C-terminus of plant and metazoan
pre-mRNA-splicing factor 38 proteins. The function is
not known.
Length = 97
Score = 29.4 bits (66), Expect = 0.44
Identities = 12/50 (24%), Positives = 21/50 (42%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
+EEEE EEEE++EE + + D + R + ++
Sbjct: 11 DEEEESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRR 60
Score = 28.2 bits (63), Expect = 1.1
Identities = 19/65 (29%), Positives = 31/65 (47%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+A EE+ +EEEE EEEE+ E+ E + ++ RR P R +K + ++
Sbjct: 3 VSALEEDLDEEEESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRRRR 62
Query: 167 KRVED 171
R
Sbjct: 63 DRDRA 67
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 30.9 bits (70), Expect = 0.44
Identities = 8/34 (23%), Positives = 14/34 (41%)
Query: 133 EEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
++E E R + E + R E E ++Q
Sbjct: 142 TDEEKELLREVEELESRLATEPSPAPELEEQLAL 175
Score = 27.8 bits (62), Expect = 3.8
Identities = 14/50 (28%), Positives = 21/50 (42%)
Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
E + +EE+E + E LE + PE +E+ EK E K
Sbjct: 137 YEYPKTDEEKELLREVEELESRLATEPSPAPELEEQLALMEKSYELAAKY 186
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 30.4 bits (69), Expect = 0.44
Identities = 24/83 (28%), Positives = 42/83 (50%), Gaps = 13/83 (15%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+ E + EE EE E+ EE+E +E E+ENE E E+ E E + + E+ + +
Sbjct: 6 QKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEE------EQQEIAELEAQLEELKDKY 59
Query: 167 KRVEDKIGDEKRLEENKKKKTQK 189
R + + EN +K+T++
Sbjct: 60 LRAQAEF-------ENLRKRTER 75
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 30.6 bits (69), Expect = 0.45
Identities = 19/80 (23%), Positives = 39/80 (48%), Gaps = 10/80 (12%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED-----EEKKEEEEDKQG 164
R EE + + ++ +++DE E E ++ER+ P + + ++ K + ++
Sbjct: 538 RNSTEERIDLDADDWTEDEDE--NEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEE- 594
Query: 165 EEKRVEDKIGD--EKRLEEN 182
EE ++ K+ D EK L N
Sbjct: 595 EEAALKMKMTDTSEKPLWRN 614
Score = 29.5 bits (66), Expect = 1.1
Identities = 13/61 (21%), Positives = 26/61 (42%), Gaps = 15/61 (24%)
Query: 104 ARDETAREEEEEEEEEEEEE---------------EEQEQDEYLEEDENEKERRLPEEDE 148
A D T E+E E E +EE + ++ + + EE+E + ++ + E
Sbjct: 549 ADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAALKMKMTDTSE 608
Query: 149 R 149
+
Sbjct: 609 K 609
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 30.7 bits (69), Expect = 0.46
Identities = 28/108 (25%), Positives = 54/108 (50%), Gaps = 4/108 (3%)
Query: 83 SGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEED--ENEKE 140
GAV++ I + K G + +EE+ EE + ++ EQE+ + LE++ + +++
Sbjct: 57 PGAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQ 116
Query: 141 RRLPEEDERGEDEEKK--EEEEDKQGEEKRVEDKIGDEKRLEENKKKK 186
++ EE E+ E+K EE+ K E++ + + K E K K
Sbjct: 117 QKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLK 164
Score = 27.6 bits (61), Expect = 4.7
Identities = 20/82 (24%), Positives = 37/82 (45%), Gaps = 5/82 (6%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERG-----EDEEKKEEEEDKQ 163
A E+E ++ E+E + QEQ + EE E + + +++E+ E ++K E + K
Sbjct: 97 AAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKA 156
Query: 164 GEEKRVEDKIGDEKRLEENKKK 185
E + K+ E K
Sbjct: 157 AAEAAKLKAAAEAKKKAEEAAK 178
Score = 27.2 bits (60), Expect = 5.9
Identities = 19/89 (21%), Positives = 36/89 (40%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
K A E ++ EE + EE + + E ++ E E + + E + K E++
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKA 221
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
+ EEK +K + + +K K
Sbjct: 222 EAAAEEKAAAEKKKAAAKAKADKAAAAAK 250
>gnl|CDD|226458 COG3949, COG3949, Uncharacterized membrane protein [Function
unknown].
Length = 349
Score = 30.4 bits (69), Expect = 0.46
Identities = 15/70 (21%), Positives = 30/70 (42%), Gaps = 12/70 (17%)
Query: 12 ASWNIWAVVTLIL---VFCTATGNILVCLAISWEKRLQNVTN--YFLMSLAITDLMVAVL 66
S I V+++I+ ++ T G I RL + Y++ +A L++A
Sbjct: 254 FSPLIGLVMSVIIWLEIYTTTVGLIY-----GLASRLTSFFPRRYWI--IAAIILVIAYP 306
Query: 67 VMPLGILTLV 76
+ G + L+
Sbjct: 307 LSFFGFINLI 316
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 30.4 bits (69), Expect = 0.49
Identities = 16/71 (22%), Positives = 37/71 (52%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
++EEE++ + E E L+ EN E+ ++++KK+ + K+ K ++
Sbjct: 129 DDEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELK 188
Query: 171 DKIGDEKRLEE 181
++ EK+L++
Sbjct: 189 ERKEREKKLKK 199
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 30.4 bits (69), Expect = 0.50
Identities = 19/96 (19%), Positives = 37/96 (38%), Gaps = 9/96 (9%)
Query: 104 ARDETAREEEEEEEEEEEEE------EEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
R + + + ++E++ EE+ + ++ DE ++ E R P E E +
Sbjct: 309 RRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDPQ 368
Query: 158 EEEDKQGEEKRVEDKIGDEKRLE---ENKKKKTQKR 190
EE D QG I + L + + +R
Sbjct: 369 EEADGQGSSTDPAGDIFRIRVLAPPQARARGASGRR 404
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 30.6 bits (69), Expect = 0.51
Identities = 17/50 (34%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 123 EEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
EE++ + E L ++E E+ER+ E++R +EEK E D+ + VE +
Sbjct: 252 EEKRRELEKLAKEEAERERQ--AEEQRRREEEKAAMEADRAQAKAEVEKR 299
Score = 29.8 bits (67), Expect = 1.0
Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 4/60 (6%)
Query: 128 QDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKT 187
+D LEE E E+ EE ER E++ EE+ ++ EEK + + + E K+++
Sbjct: 247 EDFLLEEKRRELEKLAKEEAER----ERQAEEQRRREEEKAAMEADRAQAKAEVEKRREK 302
Score = 28.7 bits (64), Expect = 2.4
Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 2/62 (3%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE--EKKEEEEDKQGEEKR 168
EE E E+E E EQ Q + L+ KE + + + +D+ ++K + E + ++
Sbjct: 69 SAVEEVEAEDEIEVEQNQSDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENLRKRE 128
Query: 169 VE 170
+E
Sbjct: 129 IE 130
Score = 27.1 bits (60), Expect = 6.2
Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 2/52 (3%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
EE+ E E+ +EE E++ EE +E + E +R + K E E+ ++
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADR--AQAKAEVEKRRE 301
>gnl|CDD|203554 pfam06965, Na_H_antiport_1, Na+/H+ antiporter 1. This family
contains a number of bacterial Na+/H+ antiporter 1
proteins. These are integral membrane proteins that
catalyze the exchange of H+ for Na+ in a manner that is
highly dependent on the pH.
Length = 379
Score = 30.3 bits (69), Expect = 0.51
Identities = 16/55 (29%), Positives = 25/55 (45%), Gaps = 6/55 (10%)
Query: 17 WAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLG 71
WA+ T L LA+ R+ FL++LAI D + A+L++ L
Sbjct: 122 WAIPM-----ATDIAFALGVLAL-LGSRVPASLKIFLLALAIVDDLGAILIIALF 170
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 30.1 bits (68), Expect = 0.52
Identities = 9/57 (15%), Positives = 21/57 (36%)
Query: 134 EDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
E E K + E +++ +K + + +K + KK++ +K
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKP 73
Score = 29.8 bits (67), Expect = 0.73
Identities = 16/62 (25%), Positives = 30/62 (48%), Gaps = 4/62 (6%)
Query: 131 YLEEDENEKERRLPEEDERGEDEEK---KEEEEDKQGEEKRVEDKIGDEKRLEENKKKKT 187
E+ + K++ PE + ++K K + K +RV K+ +++ EE KK K
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERV-KKLHSQEKKEEKKKPKK 75
Query: 188 QK 189
+K
Sbjct: 76 KK 77
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 30.4 bits (69), Expect = 0.52
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 8/88 (9%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER--GEDEEKKEE 158
K + +E E EEE+E + E EKE+RL E ++ E +E +
Sbjct: 291 KLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSK 350
Query: 159 EEDKQGEEKRVEDKIGDEKRLEENKKKK 186
E+ Q K++ED EKRLE+ K K
Sbjct: 351 LEEIQ---KKLEDL---EKRLEKLKSNK 372
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 30.3 bits (69), Expect = 0.53
Identities = 18/84 (21%), Positives = 30/84 (35%), Gaps = 11/84 (13%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
D + E E +E + +D L+ E EK L EE + + + E D+
Sbjct: 3 LDALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLR 62
Query: 165 EEKR-----------VEDKIGDEK 177
EE V + + D +
Sbjct: 63 EEIERLKEPPLIVGTVLEVLDDGR 86
>gnl|CDD|234333 TIGR03727, urea_t_UrtC_arc, urea ABC transporter, permease protein
UrtC, archaeal type. Members of this protein family are
ABC transporter permease subunits restricted to the
Archaea. Several lines of evidence suggest this protein
is functionally analogous, as well as homologous, to the
UrtC subunit of the Corynebacterium glutamicum urea
transporter. All members of the operon show sequence
similarity to urea transport subunits, the gene is
located near the urease structural subunits in two of
three species, and partial phylogenetic profiling
identifies this permease subunit as closely matching the
profile of urea utilization.
Length = 369
Score = 30.2 bits (68), Expect = 0.54
Identities = 16/50 (32%), Positives = 23/50 (46%), Gaps = 1/50 (2%)
Query: 31 GNILVCLAISW-EKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVR 79
G I+ +AI W K F + LMV +LV+P GI+ +R
Sbjct: 295 GAIVATVAIEWLRKWFSINAGEFAIVFVGLLLMVTILVIPRGIVPGLREG 344
>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2). All proteins
in this family for which functions are known are flap
endonucleases that generate the 3' incision next to DNA
damage as part of nucleotide excision repair. This
family is related to many other flap endonuclease
families including the fen1 family. This family is based
on the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1034
Score = 30.6 bits (69), Expect = 0.55
Identities = 18/79 (22%), Positives = 38/79 (48%), Gaps = 6/79 (7%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQ----EQDEYLEEDENEKERRLPEEDERGEDEEKK 156
K + + R+E E+ +E EE+ E+++ ++ +NE + EE E E
Sbjct: 691 KVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEELEALEANLLA 750
Query: 157 EEEEDK--QGEEKRVEDKI 173
E+ K + ++KR+ ++
Sbjct: 751 EQNSLKAQKQQQKRIAAEV 769
Score = 29.1 bits (65), Expect = 1.8
Identities = 20/80 (25%), Positives = 38/80 (47%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
EE E + E E +++ E L DE+E++ + +E + ++ K E +D EE
Sbjct: 685 EENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEELEAL 744
Query: 171 DKIGDEKRLEENKKKKTQKR 190
+ ++ +K+ QKR
Sbjct: 745 EANLLAEQNSLKAQKQQQKR 764
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 28.9 bits (65), Expect = 0.57
Identities = 19/32 (59%), Positives = 23/32 (71%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLE 133
A A A EEEEEEEEEEEEEEE+ ++E +
Sbjct: 67 AAAAAAAAAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 28.9 bits (65), Expect = 0.62
Identities = 18/29 (62%), Positives = 20/29 (68%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
A A A EEEEEEEEEEEEE+E+ E
Sbjct: 65 AAAAAAAAAAAEEEEEEEEEEEEEEEESE 93
Score = 28.5 bits (64), Expect = 0.91
Identities = 19/29 (65%), Positives = 21/29 (72%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
A A A EEEEEEEEEEEEEE+E +E
Sbjct: 66 AAAAAAAAAAEEEEEEEEEEEEEEEESEE 94
Score = 26.9 bits (60), Expect = 3.2
Identities = 16/27 (59%), Positives = 19/27 (70%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEED 135
A EEEEEEEEEEE+E++E EE
Sbjct: 70 AAAAAAEEEEEEEEEEEEEEEESEEEA 96
Score = 26.2 bits (58), Expect = 5.8
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEED 135
A EEEEEEEEEE+E++E EE+
Sbjct: 69 AAAAAAAEEEEEEEEEEEEEEEESEEE 95
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 30.3 bits (67), Expect = 0.59
Identities = 25/102 (24%), Positives = 52/102 (50%), Gaps = 2/102 (1%)
Query: 90 KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
KK L + + ARD + E E+EE++ E+E+++ E + + + +E ++
Sbjct: 126 KKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQK 185
Query: 150 GEDEEKKEEEEDKQGE--EKRVEDKIGDEKRLEENKKKKTQK 189
E E++K E+E ++ + ++ EK+ EN+K+ K
Sbjct: 186 TEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIK 227
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 30.2 bits (68), Expect = 0.59
Identities = 23/102 (22%), Positives = 46/102 (45%), Gaps = 8/102 (7%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEED--------ENEKERRLPEED 147
++ ++ + DE+ EE E++ ++EEEE+ + ++Y + D E E E
Sbjct: 169 KDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINY 228
Query: 148 ERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
E + +E + + E + V D ++ K+KKT
Sbjct: 229 NEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSS 270
Score = 29.4 bits (66), Expect = 1.2
Identities = 17/76 (22%), Positives = 38/76 (50%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
A+++ ++ +++E +E + DE E E+E + ED+ ++EE+ + ED +
Sbjct: 149 GKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYD 208
Query: 167 KRVEDKIGDEKRLEEN 182
+ D +E+ E
Sbjct: 209 GMLVDSSDEEEGEEAP 224
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 30.5 bits (69), Expect = 0.59
Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 1/42 (2%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
+ E E EEE E D+ + D+ E+ ++PEE
Sbjct: 269 PEPQPPPPPPPPEPPEPEEEPDEPDQ-TDPDDGEETDQIPEE 309
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 30.4 bits (68), Expect = 0.59
Identities = 16/84 (19%), Positives = 41/84 (48%), Gaps = 1/84 (1%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
A + +E+++ E ++ E E+ + + ++E K + D + E + ++E EDK+
Sbjct: 264 ADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKE 323
Query: 164 GEEKR-VEDKIGDEKRLEENKKKK 186
E ++ E D ++ + + +
Sbjct: 324 LEAQKKREPVAEDLQKTKPQVEAQ 347
Score = 29.6 bits (66), Expect = 1.0
Identities = 18/84 (21%), Positives = 42/84 (50%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+ A +E+++ E +++E ++ E + E L +D + D +++ + +K+ E+
Sbjct: 262 KPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAED 321
Query: 167 KRVEDKIGDEKRLEENKKKKTQKR 190
K +E + E E+ +K K Q
Sbjct: 322 KELEAQKKREPVAEDLQKTKPQVE 345
Score = 29.2 bits (65), Expect = 1.2
Identities = 19/91 (20%), Positives = 49/91 (53%), Gaps = 7/91 (7%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
RD T +E E +E+ + ++ +E+ + + D ++ +++ + + D+++ E + KQ
Sbjct: 197 RRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKA-DFAQDNADKQRDEVRQ-KQ 254
Query: 164 GEEKRVEDKIG-----DEKRLEENKKKKTQK 189
E K + ++K++ EN+K++ +K
Sbjct: 255 QEAKNLPKPADTSSPKEDKQVAENQKREIEK 285
Score = 29.2 bits (65), Expect = 1.3
Identities = 17/102 (16%), Positives = 45/102 (44%), Gaps = 12/102 (11%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEE-----EQEQDEYLEEDENEKERRLPEEDERGEDEE 154
F D+ E ++++E + + ++D+ + E++ + + E ++ ++E
Sbjct: 238 FAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEA 297
Query: 155 KKEEE-------EDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
K ++ ++ + EK EDK + ++ E + QK
Sbjct: 298 LKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQK 339
Score = 28.0 bits (62), Expect = 3.2
Identities = 18/90 (20%), Positives = 40/90 (44%), Gaps = 1/90 (1%)
Query: 98 QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
+ A + R+E ++++E + + +ED+ E + E E+ + E KK
Sbjct: 235 KADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQ-KREIEKAQIEIKKN 293
Query: 158 EEEDKQGEEKRVEDKIGDEKRLEENKKKKT 187
+EE + ++ + D + K E+ + K
Sbjct: 294 DEEALKAKDHKAFDLKQESKASEKEAEDKE 323
>gnl|CDD|234300 TIGR03662, Chlor_Arch_YYY, Chlor_Arch_YYY domain. Members of this
highly hydrophobic probable integral membrane family
belong to two classes. In one, a single copy of the
region modeled by This model represents essentially the
full length of a strongly hydrophobic protein of about
700 to 900 residues (variable because of long inserts in
some). The domain architecture of the other class
consists of an additional N-terminal region, two copies
of the region represented by this model, and three to
four repeats of TPR, or tetratricopeptide repeat. The
unusual species range includes several Archaea, several
Chloroflexi, and Clostridium phytofermentans. An unusual
motif YYYxG is present, and we suggest the name
Chlor_Arch_YYY protein. The function is unknown.
Length = 723
Score = 30.4 bits (69), Expect = 0.61
Identities = 21/99 (21%), Positives = 35/99 (35%), Gaps = 24/99 (24%)
Query: 5 VSFGLDAASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVA 64
L S I + L+ + T +W R ++ F LA+ L +
Sbjct: 441 FIALLLGLSLGIPLLALLLPLLVT-----------TWLLRRRDAPVGFETLLALGGLGLL 489
Query: 65 VLVMPLGILTLVRVRTNLSGAVLRLKKNILCENQLFKAG 103
+LV ++ VR N+ G R+ N +FK
Sbjct: 490 LLV------EIIYVRDNIGGDYYRM-------NTVFKFY 515
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 29.9 bits (67), Expect = 0.62
Identities = 18/43 (41%), Positives = 26/43 (60%), Gaps = 3/43 (6%)
Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
++EEE+ Q E +EEDE E E ED E++E+ EE E+
Sbjct: 18 KDEEEDWDSQAEEVEEDEEEME---DWEDSLDEEDEEAEEVEE 57
Score = 29.1 bits (65), Expect = 1.1
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
EE EE+EEE E+ E DE EE E +E
Sbjct: 29 EEVEEDEEEMEDWEDSLDEEDEEAEEVEE 57
Score = 28.3 bits (63), Expect = 2.0
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 116 EEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
++EEE+ + + E+ E EE+ + E L EEDE E+ E EE
Sbjct: 18 KDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVE--EETAASS 63
Score = 27.6 bits (61), Expect = 2.9
Identities = 12/35 (34%), Positives = 21/35 (60%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
D A E EE+EEE E+ E+ ++++ E+ E+
Sbjct: 25 DSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59
Score = 26.4 bits (58), Expect = 7.8
Identities = 11/39 (28%), Positives = 18/39 (46%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED 147
EE+EEE E+ E+ ++E +E E +E
Sbjct: 30 EVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAPSS 68
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 30.2 bits (68), Expect = 0.63
Identities = 20/63 (31%), Positives = 33/63 (52%), Gaps = 1/63 (1%)
Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKI 173
E E E ++ D + E+DE E R ED E+ E+ EE+E+++GEE+ E +
Sbjct: 46 ELAEPETSDDPYGNPDPFGEDDEGRIEVR-ISEDGEEEEVEEGEEDEEEEGEEESEEFEP 104
Query: 174 GDE 176
+
Sbjct: 105 LGD 107
Score = 29.8 bits (67), Expect = 0.84
Identities = 23/68 (33%), Positives = 33/68 (48%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
G DE E E+ EEEE EE E+DE E +E +E + E ++ EE +
Sbjct: 64 GEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTEASEQLEEHE 123
Query: 163 QGEEKRVE 170
+G + VE
Sbjct: 124 EGFQAMVE 131
Score = 29.4 bits (66), Expect = 1.0
Identities = 16/56 (28%), Positives = 26/56 (46%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E E ++ + E DE E ++ E +E EDEE++ EEE ++ E
Sbjct: 49 EPETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEP 104
Score = 27.1 bits (60), Expect = 5.8
Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
A ET+ + + E++E + + + ED E+E EEDE E EE+ EE E
Sbjct: 48 AEPETSDDPYGNPDPFGEDDEGRIEVR-ISEDGEEEEVEEGEEDEEEEGEEESEEFE 103
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 29.9 bits (68), Expect = 0.63
Identities = 20/77 (25%), Positives = 39/77 (50%), Gaps = 1/77 (1%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE-EEEDKQGEEK 167
+ E+ ++E EE +Q E + E K+ R E+ EE K+ E+E K+ + +
Sbjct: 66 KLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAE 125
Query: 168 RVEDKIGDEKRLEENKK 184
+ + D +R+E+ K+
Sbjct: 126 LEKYEKNDPERIEKLKE 142
>gnl|CDD|225962 COG3428, COG3428, Predicted membrane protein [Function unknown].
Length = 494
Score = 30.1 bits (68), Expect = 0.63
Identities = 36/171 (21%), Positives = 52/171 (30%), Gaps = 24/171 (14%)
Query: 20 VTLILVFCTATGNILVCLAISWE-KRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRV 78
LI+ A +V L S+ Y A L+V V+ + L I+ +
Sbjct: 10 RALIVGIIHAILRAIVVLIGSFSFVLGVATNGYSFWGGAA--LVVLVIFLILQIIKWITF 67
Query: 79 RTNLSGAVLRLKKNILCE--------------------NQLFK-AGARDETAREEEEEEE 117
+ +RL+ I +LF R ETA E E
Sbjct: 68 TYRIEDEEVRLQTGIFSRKKRYIPIDRIQSVDTTAGLVARLFGLVVLRIETAGGGGEAEA 127
Query: 118 EEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
E E E E + E +K L + +EE E Q R
Sbjct: 128 ELVEREIAFELAALVREARVKKLDALELAEADTPEEEVAEVLARSQSSVLR 178
Score = 27.4 bits (61), Expect = 4.7
Identities = 12/45 (26%), Positives = 15/45 (33%), Gaps = 2/45 (4%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
K E A E + EEE E + Q L N+ E
Sbjct: 143 REARVKKLDALELA--EADTPEEEVAEVLARSQSSVLRYPMNKGE 185
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 30.5 bits (68), Expect = 0.64
Identities = 18/76 (23%), Positives = 38/76 (50%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
E+E E+ EEE +E L DE+ +E + E+ D+E E + + E++ +
Sbjct: 432 EDESVEDNEEEHSGDANEEELSVDEHVEEHNADDSGEQQSDDESGEHQSVNEIVEEQSVN 491
Query: 172 KIGDEKRLEENKKKKT 187
+ +E + + +++T
Sbjct: 492 EHVEEPTVADIVEQET 507
Score = 29.0 bits (64), Expect = 1.7
Identities = 20/85 (23%), Positives = 43/85 (50%), Gaps = 3/85 (3%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEE---DENEKERRLPEEDERGEDEEKKEEEEDK 162
++ + E+ EEE + EEE DE++EE D++ +++ E E E EE+
Sbjct: 432 EDESVEDNEEEHSGDANEEELSVDEHVEEHNADDSGEQQSDDESGEHQSVNEIVEEQSVN 491
Query: 163 QGEEKRVEDKIGDEKRLEENKKKKT 187
+ E+ I +++ ++E+ ++
Sbjct: 492 EHVEEPTVADIVEQETVDEHVEEPA 516
Score = 27.4 bits (60), Expect = 5.4
Identities = 23/64 (35%), Positives = 29/64 (45%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E E EE EE EE +E E +EE+ E EE+ EE EE +D+ EE
Sbjct: 1070 EEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEE 1129
Query: 167 KRVE 170
E
Sbjct: 1130 HNEE 1133
Score = 27.4 bits (60), Expect = 6.2
Identities = 25/79 (31%), Positives = 40/79 (50%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
EE EE EE +EE E+ E E+ +E+ EE+ EE EE ++ EE E
Sbjct: 1012 EENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEE 1071
Query: 171 DKIGDEKRLEENKKKKTQK 189
+ E+ +EEN ++ ++
Sbjct: 1072 IEENIEENIEENVEENVEE 1090
Score = 27.4 bits (60), Expect = 6.6
Identities = 32/113 (28%), Positives = 52/113 (46%), Gaps = 3/113 (2%)
Query: 78 VRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDEN 137
V N+ V + + E + +E E EE EE EE E+ +EY EE+
Sbjct: 1011 VEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVE 1070
Query: 138 EKERRLPEEDERGEDEEKKEE-EEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
E E + E E +E +E EE ++ E+ VE+ E+ EEN ++ ++
Sbjct: 1071 EIEENIEENIEENVEENVEENVEEIEENVEENVEENA--EENAEENAEENAEE 1121
Score = 27.1 bits (59), Expect = 8.0
Identities = 20/61 (32%), Positives = 29/61 (47%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+E E EE EE EE +E +E +EE+ E EE+ EE +E ++ E
Sbjct: 1073 EENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEEHNE 1132
Query: 166 E 166
E
Sbjct: 1133 E 1133
Score = 26.7 bits (58), Expect = 9.0
Identities = 20/60 (33%), Positives = 29/60 (48%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+E E EE EE EE E+ +E +EE+ E EE+ D+E EE ++ E
Sbjct: 1077 EENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEEHNEEYDE 1136
Score = 26.7 bits (58), Expect = 9.4
Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 3/80 (3%)
Query: 106 DETAREEEEEEEEEEEEEE-EQEQDEYLEEDENEKERRLPEEDERG--EDEEKKEEEEDK 162
+E E EE EE EE E+ +EY EE+ E E + E DE E EE EE ++
Sbjct: 996 EENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEE 1055
Query: 163 QGEEKRVEDKIGDEKRLEEN 182
EE E + + +EEN
Sbjct: 1056 NIEENIEEYDEENVEEIEEN 1075
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 28.8 bits (65), Expect = 0.67
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQD 129
A A A E ++EE++EEEEEE + D
Sbjct: 70 AAAAAAAAAAEAKKEEKKEEEEEESDDD 97
Score = 26.9 bits (60), Expect = 3.0
Identities = 11/24 (45%), Positives = 17/24 (70%)
Query: 101 KAGARDETAREEEEEEEEEEEEEE 124
A A E +EE++EEEEEE +++
Sbjct: 74 AAAAAAEAKKEEKKEEEEEESDDD 97
Score = 26.5 bits (59), Expect = 4.1
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
A A A E ++EE++EEEEE+ D+
Sbjct: 69 AAAAAAAAAAAEAKKEEKKEEEEEESDDD 97
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 30.0 bits (67), Expect = 0.69
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDE 136
T ++ +EE+EEEE EEE +Q+E D+
Sbjct: 215 TEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
Score = 28.9 bits (64), Expect = 1.4
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEE 134
E + +EE+EEEE EEE +QE+ + ++
Sbjct: 216 EEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
Score = 27.7 bits (61), Expect = 3.1
Identities = 15/56 (26%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+ + + ++E+ + EYL+E + E+++ E++E +EE K+EE QG +
Sbjct: 189 SVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE--GQGTD 242
Score = 27.3 bits (60), Expect = 5.0
Identities = 11/36 (30%), Positives = 23/36 (63%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
E +E + EE++ +EE+EE+E +E +++E +
Sbjct: 206 LQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGT 241
Score = 26.9 bits (59), Expect = 7.3
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
+ EE++ +EE+EEEE E++ EE + +
Sbjct: 212 EHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 29.7 bits (67), Expect = 0.73
Identities = 25/63 (39%), Positives = 33/63 (52%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
+L + +E+A EEEE+ EEE EEEEE+ Q E+E E E E EE
Sbjct: 70 RLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPEEREEL 129
Query: 156 KEE 158
+EE
Sbjct: 130 REE 132
Score = 28.5 bits (64), Expect = 1.7
Identities = 22/62 (35%), Positives = 33/62 (53%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
+ EE E E++EE+E E EED E+ EE +RG E++EE+ D E E
Sbjct: 65 DREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPE 124
Query: 171 DK 172
++
Sbjct: 125 ER 126
Score = 27.4 bits (61), Expect = 3.9
Identities = 24/71 (33%), Positives = 38/71 (53%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E+ L + AR E ++EEEE EEEEE+ +E++E EE+ EE++ +E
Sbjct: 61 ESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDEL 120
Query: 156 KEEEEDKQGEE 166
EE ++ E
Sbjct: 121 PTPEEREELRE 131
>gnl|CDD|236973 PRK11767, PRK11767, SpoVR family protein; Provisional.
Length = 498
Score = 29.8 bits (68), Expect = 0.74
Identities = 13/38 (34%), Positives = 24/38 (63%)
Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
+EE+ ++E++EYL+ N+ R LP+ +E EE +
Sbjct: 188 QEEKARQKEREEYLQSQVNDLWRTLPKREEEKAVEEAR 225
Score = 28.3 bits (64), Expect = 2.5
Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 6/47 (12%)
Query: 108 TAREEEEEEEEEEEEEEEQEQD------EYLEEDENEKERRLPEEDE 148
+ +EE+ ++E EE + Q D + EE E+ RR P E +
Sbjct: 186 SLQEEKARQKEREEYLQSQVNDLWRTLPKREEEKAVEEARRFPSEPQ 232
>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
Length = 568
Score = 29.9 bits (67), Expect = 0.74
Identities = 14/91 (15%), Positives = 27/91 (29%), Gaps = 3/91 (3%)
Query: 90 KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDEN--EKERRLPEED 147
+ GA ++ E EEE + E + + E+E+ + R E
Sbjct: 471 GGAQFSSANSLERGADEDYLIVNGTNEPYEEEVIKTNENENFPLENESPCDGISRGTSEL 530
Query: 148 E-RGEDEEKKEEEEDKQGEEKRVEDKIGDEK 177
+D + E D + +
Sbjct: 531 PVNDDDNNNLNRGNTLRSEGNNCFDNLRCAR 561
Score = 27.6 bits (61), Expect = 5.2
Identities = 15/69 (21%), Positives = 27/69 (39%), Gaps = 3/69 (4%)
Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKI 173
+E+ E ++E ++ +ENE P E+E D + E ++
Sbjct: 486 DEDYLIVNGTNEPYEEEVIKTNENEN---FPLENESPCDGISRGTSELPVNDDDNNNLNR 542
Query: 174 GDEKRLEEN 182
G+ R E N
Sbjct: 543 GNTLRSEGN 551
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 30.2 bits (68), Expect = 0.75
Identities = 22/138 (15%), Positives = 46/138 (33%), Gaps = 29/138 (21%)
Query: 76 VRVRTNLSGAVLRLKKNILCENQLFKAGARDE---TAREEEEEEEEEEEEEEEQEQDEYL 132
V RT+ NIL + L + + + ++++++E + E DE
Sbjct: 47 VDERTSGVLRDEGKHANILYNSILCNQ--KKHASFLNQRKSLDDDDDDEFDFLYEDDEDD 104
Query: 133 --------EEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK----------------R 168
+++ LP+ DE + + ++ D ++
Sbjct: 105 AGNATSGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSS 164
Query: 169 VEDKIGDEKRLEENKKKK 186
E I ++K EN K
Sbjct: 165 AEKLIEEKKGQTENTFKF 182
>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
terminus. The C terminus of the plasma membrane Nha1
antiporter plays an important role in the immediate cell
response to hypo-osmotic shock which prevents an
execessive loss of ions and water. This domain is found
with pfam00999.
Length = 430
Score = 29.8 bits (67), Expect = 0.77
Identities = 13/52 (25%), Positives = 21/52 (40%), Gaps = 5/52 (9%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP-----EEDERGEDEE 154
+R + E+E+ E + + EE E+ RRL +DEE
Sbjct: 379 PSRPRSRRDSEDEDTERDDSDSDDEEETPAERRRRLAALGELPSARDDDDEE 430
Score = 29.0 bits (65), Expect = 1.5
Identities = 16/77 (20%), Positives = 28/77 (36%), Gaps = 1/77 (1%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
KA ARDE + + + EE+ ++ + ERR E +
Sbjct: 327 KAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSLLGSEASERRKNREPSSSSPPSRPRSRR 386
Query: 161 D-KQGEEKRVEDKIGDE 176
D + + +R + DE
Sbjct: 387 DSEDEDTERDDSDSDDE 403
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 30.1 bits (67), Expect = 0.79
Identities = 19/64 (29%), Positives = 29/64 (45%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
R E E E E E E E+ + E LE + E++R + +R E E E D+
Sbjct: 462 RLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLERDRLE 521
Query: 165 EEKR 168
+ +R
Sbjct: 522 KARR 525
Score = 27.8 bits (61), Expect = 4.1
Identities = 22/71 (30%), Positives = 38/71 (53%), Gaps = 6/71 (8%)
Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKI 173
E E EE E E+E+ E +E + E+ER E ER E + + + D+ E+ D++
Sbjct: 461 ERLEREERERLERERMERIERERLERERL---ERERLERDRLERDRLDRL--ERERVDRL 515
Query: 174 GDEKRLEENKK 184
+ RLE+ ++
Sbjct: 516 -ERDRLEKARR 525
>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
Provisional.
Length = 417
Score = 30.0 bits (67), Expect = 0.79
Identities = 11/47 (23%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 19 VVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAV 65
+ +I+ GN + + I + K+++ +T+ ++ +LAI+DL+ +
Sbjct: 103 IFYIIIFILGLFGNAAI-IMILFCKKIKTITDIYIFNLAISDLIFVI 148
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 29.5 bits (67), Expect = 0.81
Identities = 26/80 (32%), Positives = 33/80 (41%), Gaps = 14/80 (17%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
+ +EEEEE E E E +K DE DEE EEE+D+ EE R
Sbjct: 15 PKPPSPKEEEEEALELAIQEAHENALEKK-----LLDEL--DEELDEEEDDRFLEEYR-- 65
Query: 171 DKIGDEKRLEENKKKKTQKR 190
KRL E K + +
Sbjct: 66 -----RKRLAEMKALAEKSK 80
Score = 27.2 bits (61), Expect = 4.9
Identities = 20/60 (33%), Positives = 26/60 (43%), Gaps = 5/60 (8%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYL-----EEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
EEE E +E E + + L E DE E +R L E + E K E+ K GE
Sbjct: 24 EEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEMKALAEKSKFGE 83
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 29.6 bits (67), Expect = 0.81
Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 13/86 (15%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+ E+ EEE EE E EE +DE EE E+ E E EE+ E
Sbjct: 1 MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEES----------EAELLDEEQAKIAELE 50
Query: 167 KRVEDKIGDEKRLE---ENKKKKTQK 189
++++ RL+ EN K++ QK
Sbjct: 51 AKLDELEERYLRLQADFENYKRRIQK 76
Score = 27.7 bits (62), Expect = 3.1
Identities = 14/33 (42%), Positives = 22/33 (66%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENE 138
+ET EE E+E EEE EE+ + E L+E++ +
Sbjct: 13 EETEVEEAVEDEVEEETVEEESEAELLDEEQAK 45
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 28.6 bits (63), Expect = 0.82
Identities = 11/31 (35%), Positives = 21/31 (67%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDE 130
F G D + +EEE++++E+EE++ E D+
Sbjct: 5 FLEGEEDSDSDSDEEEDDDDEDEEDDDEDDD 35
Score = 28.2 bits (62), Expect = 1.3
Identities = 12/42 (28%), Positives = 27/42 (64%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
E + + + +EEE++++E E+D+ ++DE++ E +P E
Sbjct: 7 EGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDEVPVPSFGE 48
Score = 27.0 bits (59), Expect = 3.2
Identities = 9/37 (24%), Positives = 24/37 (64%)
Query: 98 QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEE 134
+ ++ +EEE++++E+EE+++++ DE +E
Sbjct: 4 HFLEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
Score = 25.9 bits (56), Expect = 6.9
Identities = 13/36 (36%), Positives = 22/36 (61%)
Query: 131 YLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+LE +E+ EED+ EDEE +E++D+ +E
Sbjct: 5 FLEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
>gnl|CDD|237158 PRK12650, PRK12650, putative monovalent cation/H+ antiporter
subunit A; Reviewed.
Length = 962
Score = 30.0 bits (68), Expect = 0.82
Identities = 12/30 (40%), Positives = 19/30 (63%), Gaps = 1/30 (3%)
Query: 58 ITDLMVAVLVMPLGILTLVRVRTNLSGAVL 87
DL++ ++ LG++ LVR RT L+ VL
Sbjct: 599 WIDLVLLAIIA-LGVIALVRTRTRLAAVVL 627
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 29.7 bits (68), Expect = 0.85
Identities = 19/32 (59%), Positives = 21/32 (65%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEE 134
R A EEEEEE EEEEEEEE+E+ E E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEA 257
Score = 28.6 bits (65), Expect = 1.9
Identities = 18/34 (52%), Positives = 21/34 (61%)
Query: 97 NQLFKAGARDETAREEEEEEEEEEEEEEEQEQDE 130
Q A A +E E EEEEEEEEEEE E+ + E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.8 bits (63), Expect = 2.9
Identities = 20/34 (58%), Positives = 23/34 (67%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEE 134
+ G E A EEEEE EEEEEEEEE+E +E E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.8 bits (63), Expect = 3.0
Identities = 16/33 (48%), Positives = 19/33 (57%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDE 136
EEEEEE EEEEEEE+E++ E E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.8 bits (63), Expect = 3.3
Identities = 17/32 (53%), Positives = 21/32 (65%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
R E EEEEEE EEE+E++E E +E E E
Sbjct: 227 GRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.1 bits (61), Expect = 5.9
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDEN 137
G + A EEEEEE EEEEEE+E++E E +
Sbjct: 224 GRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.1 bits (61), Expect = 5.9
Identities = 19/35 (54%), Positives = 22/35 (62%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENE 138
R E EEEEEE EEEEE+E++E EE E E
Sbjct: 224 GRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 29.6 bits (66), Expect = 0.86
Identities = 20/92 (21%), Positives = 36/92 (39%), Gaps = 13/92 (14%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGE----------- 151
G+ D + +++ EEE+E+ Q + + E + P ER
Sbjct: 159 GSDDSDSEKKKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTPSSSERSNIGGRGPRGHKR 218
Query: 152 --DEEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
+EE +E+ E + E ED+ G +E
Sbjct: 219 KLNEEDEEDAESESSFESSNEDEEGSSSEADE 250
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 29.7 bits (67), Expect = 0.86
Identities = 14/57 (24%), Positives = 18/57 (31%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
E +E E E E E E E PEE E +EE + +
Sbjct: 199 EPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPVP 255
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 29.7 bits (67), Expect = 0.91
Identities = 20/85 (23%), Positives = 41/85 (48%), Gaps = 1/85 (1%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+EEE E+E EEEE+ + E+ + E + +E + +++ EE ++ E+
Sbjct: 395 SKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEK 454
Query: 167 KRVE-DKIGDEKRLEENKKKKTQKR 190
E ++ E R + K ++ + R
Sbjct: 455 LESELERFRREVRDKVRKDREIRAR 479
Score = 29.3 bits (66), Expect = 1.4
Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 2/97 (2%)
Query: 86 VLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPE 145
V R+K ++ L +A ++ + E+E EEEE E E + E RL E
Sbjct: 377 VERIKALVIRGYPLAEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEE 436
Query: 146 EDE--RGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLE 180
E+ + E EE K E E + E +R ++ D+ R +
Sbjct: 437 ENSELKRELEELKREIEKLESELERFRREVRDKVRKD 473
Score = 27.4 bits (61), Expect = 6.4
Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 8/75 (10%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
RE EE + E E+ E E E+ D+ K+R + D R E EK+ EE+ K+ EE
Sbjct: 443 RELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEEL-- 500
Query: 170 EDKIGDEKRLEENKK 184
E++L E +K
Sbjct: 501 ------ERKLAELRK 509
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 29.5 bits (67), Expect = 0.91
Identities = 10/44 (22%), Positives = 19/44 (43%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
E E ++E E ++D+ +++ EE + G D E
Sbjct: 35 AAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPE 78
Score = 28.7 bits (65), Expect = 1.4
Identities = 17/74 (22%), Positives = 40/74 (54%), Gaps = 5/74 (6%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENE-KER--RLPEEDERGEDEEKKEEEEDK 162
+ + E++++++E+E+E+ E++ L D E +ER L E+ E+ +K K
Sbjct: 47 ELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEARERFAELREQYEKTRKAIEKHGRGSK 106
Query: 163 QGEEKRVEDKIGDE 176
+ ++ R + + +E
Sbjct: 107 KAQKAR--EALAEE 118
Score = 27.9 bits (63), Expect = 2.7
Identities = 16/46 (34%), Positives = 26/46 (56%), Gaps = 1/46 (2%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER-RLPEE 146
A A TA E E +EE+ E+++ + ++ E+DE E + PEE
Sbjct: 34 AAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 29.6 bits (66), Expect = 0.93
Identities = 29/116 (25%), Positives = 54/116 (46%), Gaps = 12/116 (10%)
Query: 86 VLRLKKNILCENQL---FKAGARDETAREEEEEEEEEEEEEEEQEQD--EYLEEDENEK- 139
+LRLK I + L + T RE+ ++E+E EEEE+ E + +D ++
Sbjct: 166 MLRLKSGIYGDTPLNVREHLWNKAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEK 225
Query: 140 ------ERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
E+ L + E ++EE + + +E ED G ++ + + KK++K
Sbjct: 226 TKKKDLEKWLGSDQSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRK 281
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 29.3 bits (66), Expect = 0.93
Identities = 13/64 (20%), Positives = 25/64 (39%), Gaps = 2/64 (3%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE--KKEEEEDKQGEE 166
A E + E +E+ + + EE + L E++ E + E ++ EE
Sbjct: 108 ASGESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEE 167
Query: 167 KRVE 170
+ E
Sbjct: 168 EIAE 171
>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
represents the eukaryotic large ribosomal protein P2.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P2 is located in the L12 stalk, with proteins
P1, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers). Bacteria may have four or six copies
of L7/L12 (two or three homodimers) depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plants have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site in the C-terminal region
of P2. TCS inactivates the ribosome by depurinating a
specific adenine in the sarcin-ricin loop of 28S rRNA.
Length = 109
Score = 28.4 bits (64), Expect = 0.95
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQD 129
A A A ++EE++EE EEE + D
Sbjct: 75 AAAAAAAAAAAAKKEEKKEESEEESDDD 102
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 29.8 bits (67), Expect = 0.98
Identities = 14/64 (21%), Positives = 31/64 (48%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+ E + EE + +E+ E+L E E+DE ++E +K+++ K+
Sbjct: 306 SDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPR 365
Query: 166 EKRV 169
+++V
Sbjct: 366 KRKV 369
Score = 29.4 bits (66), Expect = 1.0
Identities = 18/113 (15%), Positives = 37/113 (32%), Gaps = 27/113 (23%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENE------------------------K 139
++E + ++E + E E EE E +++ +
Sbjct: 240 VQEEESIDDELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMR 299
Query: 140 ERRLPEEDERGEDEEKKEEEEDKQGEEKRVE---DKIGDEKRLEENKKKKTQK 189
R DE E EE K E+ E G ++ E+ +++++
Sbjct: 300 PVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKE 352
Score = 29.0 bits (65), Expect = 1.5
Identities = 25/109 (22%), Positives = 34/109 (31%), Gaps = 28/109 (25%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENE----------------KERRLPEEDERGEDE 153
RE E EE EEE + ++ + R DE E
Sbjct: 254 REIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSLPE 313
Query: 154 EKKEEEEDKQGEEKRVE---DKIG---------DEKRLEENKKKKTQKR 190
EE K E+ E G DE+ EE +KK+ K+
Sbjct: 314 SDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKK 362
>gnl|CDD|234350 TIGR03766, TIGR03766, conserved hypothetical integral membrane
protein. Models TIGR03110, TIGR03111, and TIGR03112
describe a three-gene system found in several
Gram-positive bacteria, where TIGR03110 (XrtG) is
distantly related to a putative transpeptidase,
exosortase (TIGR02602). This model describes a small
clade that correlates by both gene clustering and
phyletic pattern, although imperfectly, to the three
gene system. Both this narrow clade, and the larger set
of full-length homologous integral membrane proteins,
have an especially well-conserved region near the
C-terminus with an invariant tyrosine. The function is
unknown.
Length = 483
Score = 29.6 bits (67), Expect = 1.0
Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 6/69 (8%)
Query: 13 SWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNY-FLMSLAITDLMVA----VLV 67
SW + VV ++LV +A + + K+ V Y F++ LA++ ++ V
Sbjct: 137 SWLFFDVVNIVLVDLSALILYKAVKKV-FNKKKAFVALYLFVLLLALSPYILIPYTDTWV 195
Query: 68 MPLGILTLV 76
+P L L
Sbjct: 196 LPFVSLFLF 204
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 28.8 bits (65), Expect = 1.0
Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 14/69 (20%)
Query: 107 ETAREEEEEEEEEEEEEEE------------QEQDEYLEEDENEKERRLP--EEDERGED 152
E REE+E E+EE+E +EE +E+++ LEE E K + L +DE ++
Sbjct: 15 EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDE 74
Query: 153 EEKKEEEED 161
E K++E D
Sbjct: 75 ELKEQERWD 83
Score = 26.9 bits (60), Expect = 4.8
Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 4/53 (7%)
Query: 141 RRLPEEDERGEDEEKKEEEEDKQGEEKR----VEDKIGDEKRLEENKKKKTQK 189
R + E++R E E +KEE+E K+ +EK + K EKRLEE +K K +
Sbjct: 10 RIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKP 62
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 29.5 bits (66), Expect = 1.0
Identities = 14/63 (22%), Positives = 27/63 (42%), Gaps = 8/63 (12%)
Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK---KEEEEDKQGEEKRVEDKIGD 175
+++E+ E + +ED +K +E++R DE+K +D+
Sbjct: 165 KQKEKRRVEDSQKHKEDRRKK-----QEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPPK 219
Query: 176 EKR 178
EKR
Sbjct: 220 EKR 222
Score = 28.7 bits (64), Expect = 1.5
Identities = 22/85 (25%), Positives = 37/85 (43%), Gaps = 7/85 (8%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPE------EDERGEDEEKKEEEEDKQGE 165
E+ E+E ++ DE+ + +KE+R E ED R + EEK+ +EDK+
Sbjct: 140 AEKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPG 199
Query: 166 EKRVEDKIGDEKRLEEN-KKKKTQK 189
++ K+K QK
Sbjct: 200 GGGGSSGGQSGLSTKDEPPKEKRQK 224
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 29.5 bits (66), Expect = 1.0
Identities = 18/44 (40%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 106 DETAREEEEEEEE-EEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
DE +EEE +E EE E+E+ + D L E + EKE+ EE +
Sbjct: 73 DELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQ 116
>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D. SpoVID, the
stage VI sporulation protein D, is restricted to
endospore-forming members of the bacteria, all of which
are found among the Firmicutes. It is widely distributed
but not quite universal in this group. Between
well-conserved N-terminal and C-terminal domains is a
poorly conserved, low-complexity region of variable
length, rich enough in glutamic acid to cause spurious
BLAST search results unless a filter is used. The seed
alignment for this model was trimmed, in effect, by
choosing member sequences in which these regions are
relatively short. SpoVID is involved in spore coat
assembly by the mother cell compartment late in the
process of sporulation [Cellular processes, Sporulation
and germination].
Length = 338
Score = 29.5 bits (66), Expect = 1.1
Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 5/75 (6%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE-----RRLPEEDERG 150
+ +K A + E E E EEE EE E + LE ++ EK +ED
Sbjct: 196 DPDEWKISASEPFQLESEVEASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALA 255
Query: 151 EDEEKKEEEEDKQGE 165
D +K EEE+++GE
Sbjct: 256 GDAKKALEEEEEKGE 270
Score = 27.9 bits (62), Expect = 3.6
Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 7/82 (8%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
E ARE+EEEEE E E Q+E E+E E ++ E E ++ E+
Sbjct: 140 NNLDAEPAREDEEEEESFSAEFEHPAQEETAGEEERTDEPKV-------EHEAHEQHEQP 192
Query: 162 KQGEEKRVEDKIGDEKRLEENK 183
+ + + +LE
Sbjct: 193 ADDDPDEWKISASEPFQLESEV 214
>gnl|CDD|145361 pfam02161, Prog_receptor, Progesterone receptor.
Length = 456
Score = 29.6 bits (66), Expect = 1.1
Identities = 10/35 (28%), Positives = 16/35 (45%)
Query: 118 EEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED 152
+ +EE +E+ QD+ D E + RGE
Sbjct: 54 QGQEEPDEKTQDQQSLSDVERAEPEVEASRGRGEG 88
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 29.0 bits (65), Expect = 1.2
Identities = 12/67 (17%), Positives = 33/67 (49%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
+++ EE++ E ++++ EE + + EE + +E ++E+ ++ + K
Sbjct: 73 KEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDIDNEDDMDDSDLKLA 132
Query: 165 EEKRVED 171
E ++D
Sbjct: 133 TENIIKD 139
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 29.6 bits (67), Expect = 1.2
Identities = 23/106 (21%), Positives = 47/106 (44%), Gaps = 13/106 (12%)
Query: 89 LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQ-EQDEYLEEDENEK----ERRL 143
L++ + L + + ++EE++++E E+ E+ + E+ + E++ E L
Sbjct: 553 LQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGL 612
Query: 144 PEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
P D GE+ KK E K++ + + +L E K K
Sbjct: 613 PTHDADGEEISKK--------ERKKLSKEYDKQAKLHEEYLAKGGK 650
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 29.6 bits (66), Expect = 1.2
Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 3/49 (6%)
Query: 129 DEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEK 177
+E+L E E+E E D +DE++K EE K E V+ +I EK
Sbjct: 92 EEFLSESESEASL---EIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEK 137
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 29.2 bits (66), Expect = 1.2
Identities = 24/102 (23%), Positives = 54/102 (52%), Gaps = 7/102 (6%)
Query: 93 ILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED 152
ILC +Q + +++ + +E+ + + + + ++ L E ++ +R+L + ER D
Sbjct: 263 ILCNHQRTVSKTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFER--D 320
Query: 153 EEKKEEEEDKQGEEKRVEDKIGD-----EKRLEENKKKKTQK 189
EK + E ++ +EK+ E+K E+R+E+ + + T K
Sbjct: 321 NEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDK 362
>gnl|CDD|223097 COG0018, ArgS, Arginyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 577
Score = 29.5 bits (67), Expect = 1.2
Identities = 18/70 (25%), Positives = 32/70 (45%)
Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIG 174
E+ E E D YL E + + L E+ E+E ++E E+ + G+E+ +
Sbjct: 178 EKRGREALGLTPEPDGYLGEYYVKIAKDLEEDPGNDEEEAREEVEKLESGDEEAELWRKF 237
Query: 175 DEKRLEENKK 184
+ LE K+
Sbjct: 238 VDLSLEGIKE 247
>gnl|CDD|222508 pfam14033, DUF4246, Protein of unknown function (DUF4246). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and fungi.
Proteins in this family are typically between 392 and
644 amino acids in length.
Length = 496
Score = 29.1 bits (66), Expect = 1.3
Identities = 12/47 (25%), Positives = 19/47 (40%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGE 151
+ EE + E E+E+ E + E + +ERR E G
Sbjct: 264 HPKEKEPEEPKPEPGEDEDYEDYWERLEEYLDWREERRKAIHPEPGT 310
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 29.2 bits (66), Expect = 1.3
Identities = 16/50 (32%), Positives = 27/50 (54%)
Query: 132 LEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
+EE E+E E E E+EE +EE+++K G + + EK +E+
Sbjct: 16 VEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 29.4 bits (66), Expect = 1.3
Identities = 13/40 (32%), Positives = 21/40 (52%)
Query: 98 QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDEN 137
+L + + + EE + EE +EE EE+E+ Y E E
Sbjct: 278 ELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEK 317
Score = 27.9 bits (62), Expect = 4.7
Identities = 13/46 (28%), Positives = 30/46 (65%)
Query: 139 KERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKK 184
KE++ +++E+ E+ + +E +E+ + EEK ++ ++ +E NKK
Sbjct: 280 KEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKK 325
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 29.3 bits (66), Expect = 1.3
Identities = 11/44 (25%), Positives = 19/44 (43%)
Query: 122 EEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
EE E++ E+D + EE E E ++ + +E K
Sbjct: 127 PNEEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKR 170
>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
Provisional.
Length = 719
Score = 29.4 bits (66), Expect = 1.3
Identities = 16/83 (19%), Positives = 35/83 (42%)
Query: 71 GILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDE 130
G + V N++ A + KKN + + +R +E+ +++EE Q +
Sbjct: 397 GAPPRIPVMVNMASASVSSKKNQKPQESSPSLDQTNAASRNSVMMDEDSDDDEEFQIPES 456
Query: 131 YLEEDENEKERRLPEEDERGEDE 153
E + + E + +E +D+
Sbjct: 457 EQEPETTKNETKDTAMEEEPQDK 479
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 28.0 bits (62), Expect = 1.3
Identities = 13/26 (50%), Positives = 20/26 (76%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQE 127
AGA A+ E ++EE++EEEEEE++
Sbjct: 79 AGAATAGAKAEAKKEEKKEEEEEEED 104
Score = 26.8 bits (59), Expect = 3.5
Identities = 13/22 (59%), Positives = 19/22 (86%)
Query: 102 AGARDETAREEEEEEEEEEEEE 123
AGA+ E +EE++EEEEEEE++
Sbjct: 84 AGAKAEAKKEEKKEEEEEEEDD 105
Score = 26.8 bits (59), Expect = 3.8
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
A TA + E ++EE++EEEE+E+D+
Sbjct: 77 PAAGAATAGAKAEAKKEEKKEEEEEEEDD 105
>gnl|CDD|217940 pfam04177, TAP42, TAP42-like family. The TOR signalling pathway
activates a cell-growth program in response to
nutrients. TIP41 (pfam04176) interacts with TAP42 and
negatively regulates the TOR signaling pathway.
Length = 335
Score = 29.2 bits (66), Expect = 1.4
Identities = 11/32 (34%), Positives = 16/32 (50%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
+ E EEEE+ E DE +++E K R
Sbjct: 295 QGGGEAAASAEEEEDDEDDEEDDDEETLKARA 326
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 29.3 bits (66), Expect = 1.4
Identities = 16/87 (18%), Positives = 37/87 (42%), Gaps = 4/87 (4%)
Query: 91 KNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE-- 148
+ I E + K + E + +E EEE + E + E E+ + + + + ++
Sbjct: 837 ELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIK 896
Query: 149 --RGEDEEKKEEEEDKQGEEKRVEDKI 173
DE K++E+ K+ ++ +
Sbjct: 897 LVISSDEIKQDEKTTKKKKKDLEKTDE 923
Score = 28.9 bits (65), Expect = 1.9
Identities = 20/98 (20%), Positives = 40/98 (40%), Gaps = 4/98 (4%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E +L + E EE E EE++ +Q+ LE E ++ + ++E
Sbjct: 816 EERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFV 875
Query: 156 KEEEEDKQGEE----KRVEDKIGDEKRLEENKKKKTQK 189
+ + + K+ + K V K+ + +K T+K
Sbjct: 876 EFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKK 913
Score = 28.1 bits (63), Expect = 3.2
Identities = 12/73 (16%), Positives = 31/73 (42%), Gaps = 6/73 (8%)
Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKI 173
E ++ E EEE+ + ++ +E E E ++++ ++ + + E +
Sbjct: 806 ELIDQIIESEEERLKALRIQREEMLMRP------EELELINEEQKNLKQEIKLELSEIQE 859
Query: 174 GDEKRLEENKKKK 186
+E+ N+ K
Sbjct: 860 AEEEIQNINENKN 872
Score = 27.7 bits (62), Expect = 4.3
Identities = 23/87 (26%), Positives = 37/87 (42%), Gaps = 6/87 (6%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE--DKQGEE 166
R EE E EE++ +QE L E + +E + + E E K + + +K
Sbjct: 831 MRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIA 890
Query: 167 KRVEDK----IGDEKRLEENKKKKTQK 189
K V K + K+ E+ KKK +
Sbjct: 891 KDVLIKLVISSDEIKQDEKTTKKKKKD 917
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 28.3 bits (63), Expect = 1.4
Identities = 22/81 (27%), Positives = 44/81 (54%), Gaps = 3/81 (3%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED---EEKKEEEEDKQGEEKR 168
E+E E+E E++ E+ + EK+++L E+ E + E++++E +Q E ++
Sbjct: 43 EKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEK 102
Query: 169 VEDKIGDEKRLEENKKKKTQK 189
+E +K E +KK T+K
Sbjct: 103 IELSKKKQKERERRRKKLTKK 123
>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B. Escherichia coli
stringent starvation protein B (SspB), is thought to
enhance the specificity of degradation of tmRNA-tagged
proteins by the ClpXP protease. The tmRNA tag, also
known as ssrA, is an 11-aa peptide added to the C
terminus of proteins stalled during translation, targets
proteins for degradation by ClpXP and ClpAP. SspB a
cytoplasmic protein that specifically binds to residues
1-4 and 7 of the tag. Binding of SspB enhances
degradation of tagged proteins by ClpX, and masks
sequence elements important for ClpA interactions,
inhibiting degradation by ClpA. However, more recent
work has cast doubt on the importance of SspB in
wild-type cells. SspB is encoded in an operon whose
synthesis is stimulated by carbon, amino acid, and
phosphate starvation. SspB may play a special role
during nutrient stress, for example by ensuring rapid
degradation of the products of stalled translation,
without causing a global increase in degradation of all
ClpXP substrates.
Length = 153
Score = 28.3 bits (64), Expect = 1.4
Identities = 13/43 (30%), Positives = 22/43 (51%), Gaps = 3/43 (6%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE 153
E EE +EEEEEE+++ + E+ E + P+ R +
Sbjct: 111 EPEEADEEEEEEDDDADDSPEDEDPEPKDP---PKTKGRPSLK 150
>gnl|CDD|215283 PLN02518, PLN02518, pheophorbide a oxygenase.
Length = 539
Score = 29.1 bits (65), Expect = 1.4
Identities = 14/84 (16%), Positives = 29/84 (34%), Gaps = 6/84 (7%)
Query: 62 MVAVLVMPLGILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEE------EE 115
M +L + LTL + + + + + G R E
Sbjct: 1 MAVLLGIACNSLTLTSSTPKSTPFFIPARTIPFVSSSRPRRGKIFTPLRVAAPPSVPSEA 60
Query: 116 EEEEEEEEEEQEQDEYLEEDENEK 139
+++E EE++ + E +E + K
Sbjct: 61 ALQQDEGEEQRVEQELGQESSDSK 84
>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein.
Length = 283
Score = 29.1 bits (65), Expect = 1.5
Identities = 15/70 (21%), Positives = 31/70 (44%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E R ++ +E+ EE YL E+ ++ E ED++G + E+ + +
Sbjct: 24 EEQRADKSSTFQEDSVEEHTSPSYYLAEELSDSETEPSIEDDQGLYTQLPPAEQVEGFIQ 83
Query: 167 KRVEDKIGDE 176
++D D+
Sbjct: 84 GPLDDIADDD 93
Score = 28.3 bits (63), Expect = 2.4
Identities = 14/67 (20%), Positives = 27/67 (40%), Gaps = 4/67 (5%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED---KQGE 165
A E + E E E+++ + ++ E + P D+ +D+ EED K
Sbjct: 50 AEELSDSETEPSIEDDQGLYTQLPPAEQVEGFIQGPL-DDIADDDIDVVFEEDRPWKPPV 108
Query: 166 EKRVEDK 172
+K +
Sbjct: 109 QKSDNGR 115
>gnl|CDD|220379 pfam09750, DRY_EERY, Alternative splicing regulator. This entry
represents the conserved N-terminal region of SWAP
(suppressor-of-white-apricot protein) proteins. This
region contains two highly conserved motifs, viz: DRY
and EERY, which appear to be the sites for alternative
splicing of exons 2 and 3 of the SWAP mRNA. These
proteins are thus thought to be involved in
auto-regulation of pre-mRNA splicing. Most family
members are associated with two Surp domains pfam01805
and an Arginine- serine-rich binding region towards the
C-terminus.
Length = 121
Score = 28.2 bits (63), Expect = 1.5
Identities = 16/59 (27%), Positives = 29/59 (49%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
R R + +EE+E EE ++ Y + +N+ E EE+E+ E +KK+ +
Sbjct: 53 RGSRNRRPYKTDEEDEAEERFLNEERYRDLIKNDFEEAEKEEEEKRELAQKKKSKGGYN 111
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 28.9 bits (65), Expect = 1.5
Identities = 15/76 (19%), Positives = 30/76 (39%), Gaps = 9/76 (11%)
Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKI 173
E EEE + ++ + +K ++ E+ + ++K+++E +GE K
Sbjct: 37 EILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKT- 95
Query: 174 GDEKRLEENKKKKTQK 189
KK K K
Sbjct: 96 --------PKKSKKTK 103
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 29.3 bits (66), Expect = 1.5
Identities = 16/58 (27%), Positives = 27/58 (46%), Gaps = 4/58 (6%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDE-NEKERRLPEEDERGEDEE---KKEEEE 160
E A + EE EE EEE++Q + + N ++ + + E + K+EE E
Sbjct: 838 ENAGKMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEERE 895
>gnl|CDD|235551 PRK05667, dnaG, DNA primase; Validated.
Length = 580
Score = 29.0 bits (66), Expect = 1.6
Identities = 21/81 (25%), Positives = 34/81 (41%), Gaps = 1/81 (1%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
+ EE E +E EED E+ L + E+ D+ +E E+ +E+ +E
Sbjct: 499 RDAGLEELAALLESLAVWEEISEEDIAALEKELKDALEKLRDQLLEERLEELIAKERLLE 558
Query: 171 DKIGD-EKRLEENKKKKTQKR 190
E+RLE + KR
Sbjct: 559 GHGLSSEERLELLQLLIELKR 579
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 27.8 bits (62), Expect = 1.6
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDE 130
A A E A E +E EEEE+EEE E+E D+
Sbjct: 73 AAAGAEAAAEADEAEEEEKEEEAEEESDD 101
Score = 27.8 bits (62), Expect = 1.8
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQD 129
AGA +E EEEE+EEE EEE + D
Sbjct: 74 AAGAEAAAEADEAEEEEKEEEAEEESDDD 102
Score = 27.4 bits (61), Expect = 2.4
Identities = 14/35 (40%), Positives = 19/35 (54%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDE 136
AGA A E E +E EEEE+E++ E D+
Sbjct: 67 AGAAAAAAAGAEAAAEADEAEEEEKEEEAEEESDD 101
Score = 26.2 bits (58), Expect = 6.4
Identities = 13/30 (43%), Positives = 18/30 (60%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDE 130
A A A E +E EEEE+EEE +E+ +
Sbjct: 71 AAAAAGAEAAAEADEAEEEEKEEEAEEESD 100
Score = 25.8 bits (57), Expect = 8.5
Identities = 14/32 (43%), Positives = 19/32 (59%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYL 132
A + A +E EEEE+EEE EE+ D+ L
Sbjct: 73 AAAGAEAAAEADEAEEEEKEEEAEEESDDDML 104
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 28.9 bits (65), Expect = 1.6
Identities = 18/60 (30%), Positives = 36/60 (60%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
E+EE+EEE+ + + + + L+ EKE+ E++E+G E +++ ED+ E+ R+
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100
Score = 27.7 bits (62), Expect = 3.6
Identities = 15/77 (19%), Positives = 43/77 (55%), Gaps = 1/77 (1%)
Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK-KEEEEDKQGEEKRVEDKI 173
++E+E+++ ++ +E EE E EK + + + + K +E+E+ K+ +E++ ++
Sbjct: 25 DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84
Query: 174 GDEKRLEENKKKKTQKR 190
++ +E +K ++
Sbjct: 85 EEDTPEDELAEKLRLRK 101
Score = 27.7 bits (62), Expect = 3.7
Identities = 9/36 (25%), Positives = 20/36 (55%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
E+E+ + E+EE+ + +++ E++ EK R
Sbjct: 64 AKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRL 99
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 28.9 bits (65), Expect = 1.7
Identities = 24/86 (27%), Positives = 43/86 (50%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
R+ A EEE E+ + EE E+E +E EE E +ER E+E EE + +++
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIE 782
Query: 165 EEKRVEDKIGDEKRLEENKKKKTQKR 190
E + + +E E + ++ ++R
Sbjct: 783 ELEEKRQALQEELEELEEELEEAERR 808
Score = 28.9 bits (65), Expect = 1.8
Identities = 22/84 (26%), Positives = 37/84 (44%), Gaps = 2/84 (2%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E + E EE E E EE +E+ E L+E + L E + E+ E+ E ++ EE
Sbjct: 303 EISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEE 362
Query: 167 KRVEDKIGDEKRLEENKKKKTQKR 190
E+K+ E + ++
Sbjct: 363 L--EEKLSALLEELEELFEALREE 384
Score = 28.5 bits (64), Expect = 2.3
Identities = 26/90 (28%), Positives = 39/90 (43%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
R+E A E E E E EE + + E LEE RL + E ++ E + EE
Sbjct: 378 FEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEEL 437
Query: 161 DKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+ EE E + +E+ E + K +R
Sbjct: 438 QTELEELNEELEELEEQLEELRDRLKELER 467
Score = 28.5 bits (64), Expect = 2.3
Identities = 21/95 (22%), Positives = 42/95 (44%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEK 155
E+ L + + E + EE + E EE+ + +E E+E EE+ E
Sbjct: 701 EDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERL 760
Query: 156 KEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+E EE+ + E+ + + + LEE ++ ++
Sbjct: 761 EELEEELESLEEALAKLKEEIEELEEKRQALQEEL 795
Score = 28.5 bits (64), Expect = 2.4
Identities = 27/95 (28%), Positives = 45/95 (47%), Gaps = 1/95 (1%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE-RRLPEEDERGEDEE 154
+ +L +A E + E EE EE EE +E+ + E +E E E L E E E+E
Sbjct: 259 QEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENEL 318
Query: 155 KKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
++ EE ++ +EK K E+R ++ +
Sbjct: 319 EELEERLEELKEKIEALKEELEERETLLEELEQLL 353
Score = 28.5 bits (64), Expect = 2.9
Identities = 23/109 (21%), Positives = 42/109 (38%), Gaps = 2/109 (1%)
Query: 82 LSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER 141
L + LK+ I + + E+ E EE +EE E++ LEE E E
Sbjct: 321 LEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEA 380
Query: 142 RLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
E+ + E E + + ++ +E +RL E + ++
Sbjct: 381 L--REELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEEL 427
Score = 27.8 bits (62), Expect = 4.1
Identities = 22/75 (29%), Positives = 34/75 (45%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
+E EEE E E E E +E+ E L E E E +L + + E++ EEE +
Sbjct: 882 LEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYED 941
Query: 164 GEEKRVEDKIGDEKR 178
E +E +I +
Sbjct: 942 TLETELEREIERLEE 956
Score = 27.8 bits (62), Expect = 4.9
Identities = 29/110 (26%), Positives = 51/110 (46%)
Query: 81 NLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
L + L++ + + + A +EE EE EEE EE E+ D E E+ ++
Sbjct: 762 ELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQ 821
Query: 141 RRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
RR E E E EE+ EE E+K E + +++ E + + ++ +
Sbjct: 822 RRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAE 871
>gnl|CDD|140234 PTZ00207, PTZ00207, hypothetical protein; Provisional.
Length = 591
Score = 29.0 bits (65), Expect = 1.7
Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 14/67 (20%)
Query: 18 AVVTLILVFCTATGNILVCL--------AISWEKRL----QNVTNYF--LMSLAITDLMV 63
AVVT++ VF + G ++ + AI +L N + YF LMS A+ ++
Sbjct: 141 AVVTVLSVFPSNRGAVVAIMKTFTGLGSAILGSIQLAFFSDNTSAYFFFLMSFALVVGIL 200
Query: 64 AVLVMPL 70
A++ M L
Sbjct: 201 AIVFMRL 207
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 28.9 bits (65), Expect = 1.8
Identities = 24/84 (28%), Positives = 36/84 (42%), Gaps = 16/84 (19%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ----GE 165
+E E EE E EEE + E E + K+ E++ EED E
Sbjct: 243 KETSETEERVESEEETDVEIETASETKGTKQ------------EQEGSTEEDPSPSLFSE 290
Query: 166 EKRVEDKIGDEKRLEENKKKKTQK 189
E+ DKI + + + N K+K +K
Sbjct: 291 EREDPDKIDETEEIRVNGKEKIKK 314
Score = 28.1 bits (63), Expect = 3.1
Identities = 9/36 (25%), Positives = 21/36 (58%)
Query: 145 EEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLE 180
E + + E+K +++ K+ ++K E K ++ R+E
Sbjct: 733 AEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768
Score = 28.1 bits (63), Expect = 3.9
Identities = 17/63 (26%), Positives = 30/63 (47%), Gaps = 2/63 (3%)
Query: 125 EQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK-QGEEKRVE-DKIGDEKRLEEN 182
E+ E+ N++E+ + E D+EKK E +EK +E D + + +N
Sbjct: 1512 SNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKN 1571
Query: 183 KKK 185
KK+
Sbjct: 1572 KKQ 1574
Score = 27.4 bits (61), Expect = 6.8
Identities = 14/50 (28%), Positives = 27/50 (54%), Gaps = 11/50 (22%)
Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
++ E + + +EE +K+++ EKK+EEE K+ E+ R+E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKK-----------EKKKEEEYKREEKARIE 768
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 29.0 bits (65), Expect = 1.9
Identities = 10/25 (40%), Positives = 19/25 (76%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQE 127
D+ A E+E+++E+E+E+E+E E
Sbjct: 886 ENEDDDADEDEDQDEDEDEDEDEDE 910
Score = 26.7 bits (59), Expect = 9.0
Identities = 10/34 (29%), Positives = 20/34 (58%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQD 129
E L +E +E+E+++E+E+E+E E +
Sbjct: 877 EEYLISQFQENEDDDADEDEDQDEDEDEDEDEDE 910
>gnl|CDD|235622 PRK05846, PRK05846, NADH:ubiquinone oxidoreductase subunit M;
Reviewed.
Length = 497
Score = 28.6 bits (65), Expect = 1.9
Identities = 14/63 (22%), Positives = 24/63 (38%), Gaps = 11/63 (17%)
Query: 6 SFGLDAASWNIWAVV-TLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVA 64
G+D S + VV T +L + + SWE+ + V Y L + ++
Sbjct: 77 HLGVDGIS--LLLVVLTALLT--------PLAVLASWEEIKKRVKFYMAALLWLEGGVIG 126
Query: 65 VLV 67
V
Sbjct: 127 VFA 129
>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
domain 1. The small mechanosensitive channel, MscS, is
a part of the turgor-driven solute efflux system that
protects bacteria from lysis in the event of osmotic
shock. The MscS protein alone is sufficient to form a
functional mechanosensitive channel gated directly by
tension in the lipid bilayer. The MscS proteins are
heptamers of three transmembrane subunits with seven
converging M3 domains, and this domain is one of the
inner membrane domains.
Length = 339
Score = 28.7 bits (65), Expect = 1.9
Identities = 16/42 (38%), Positives = 19/42 (45%), Gaps = 3/42 (7%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRL 143
A + A EEEE E E EE E D LE + + RL
Sbjct: 268 AEILAQRAEEEEESSEGAAETIEEPELD--LET-ISAQSLRL 306
>gnl|CDD|235887 PRK06925, PRK06925, flagellar motor protein MotS; Reviewed.
Length = 230
Score = 28.6 bits (64), Expect = 1.9
Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 6/45 (13%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEE-----DENE-KERRLPEEDERG 150
E E E+EE++E + DE L++ +ENE K + DERG
Sbjct: 67 ENPAESSEDEEKQEDQLDELLKKVNEYIEENELKSVITAKRDERG 111
>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2. This model
describes a tightly conserved subfamily of the larger
family of sugar (and other) transporters described by
PFAM model pfam00083. Members of this subfamily include
closely related forms SV2A and SV2B of synaptic vesicle
protein from vertebrates and a more distantly related
homolog (below trusted cutoff) from Drosophila
melanogaster. Members are predicted to have two sets of
six transmembrane helices.
Length = 742
Score = 28.8 bits (64), Expect = 2.0
Identities = 16/79 (20%), Positives = 34/79 (43%), Gaps = 11/79 (13%)
Query: 104 ARDETAREEEEEEEEEEEEEE-----------EQEQDEYLEEDENEKERRLPEEDERGED 152
A+DE ++ E+E+++++ E DE E ++ E+DE E
Sbjct: 37 AQDEYSQRSYSRFEDEDDDDDFPAPADGYSRGEAADDEEEGEASSDATEGHDEDDEIYEG 96
Query: 153 EEKKEEEEDKQGEEKRVED 171
E + + G+++ + D
Sbjct: 97 EYQGIPRAENGGKDEIMAD 115
Score = 27.2 bits (60), Expect = 5.9
Identities = 21/85 (24%), Positives = 30/85 (35%), Gaps = 22/85 (25%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEY--LEEDENE-------------------KER 141
A DE E + E +E++E + EY + EN K+
Sbjct: 70 AADDEEEGEASSDATEGHDEDDEIYEGEYQGIPRAENGGKDEIMADGAPKAGVRDELKDG 129
Query: 142 RLPEEDERGEDEEKKEEEEDKQGEE 166
P GE E + +EEE Q E
Sbjct: 130 EGP-PGGEGEAERRADEEELAQQYE 153
>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22. The
mitochondrial protein translocase family, which is
responsible for movement of nuclear encoded pre-proteins
into mitochondria, is very complex with at least 19
components. These proteins include several chaperone
proteins, four proteins of the outer membrane
translocase (Tom) import receptor, five proteins of the
Tom channel complex, five proteins of the inner membrane
translocase (Tim) and three "motor" proteins. This
family represents the Tom22 proteins. The N terminal
region of Tom22 has been shown to have chaperone-like
activity, and the C terminal region faces the
intermembrane face.
Length = 136
Score = 28.0 bits (63), Expect = 2.0
Identities = 11/46 (23%), Positives = 23/46 (50%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER 141
E K A A+EE ++++E++ + + D+ E+E +R
Sbjct: 10 ETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFENETLYDR 55
>gnl|CDD|217956 pfam04194, PDCD2_C, Programmed cell death protein 2, C-terminal
putative domain.
Length = 165
Score = 28.1 bits (63), Expect = 2.0
Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 1/55 (1%)
Query: 119 EEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKI 173
EEEE E L E+ E+E EE E+ KE+EE + E+ + +DK
Sbjct: 8 PEEEEYETLHAPSLPEEPEEREIDAFEELEKEGGG-GKEDEETDESEKAKSDDKT 61
>gnl|CDD|206316 pfam14147, Spore_YhaL, Sporulation protein YhaL. This family of
proteins is involved in sporulation. In B. subtilis its
expression is regulated by the early
mother-cell-specific transcription factor sigma-E.
Length = 52
Score = 26.6 bits (59), Expect = 2.0
Identities = 14/28 (50%), Positives = 20/28 (71%)
Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERR 142
EE E+E+E E+E + Y+E E E+ERR
Sbjct: 25 EEREKEQEFIEKEGEVYMERLEEERERR 52
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
Length = 193
Score = 28.2 bits (62), Expect = 2.0
Identities = 25/85 (29%), Positives = 45/85 (52%), Gaps = 1/85 (1%)
Query: 97 NQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
N K A++ R+ +E E E+ ++E +E E ED E+ PEE ++EE +
Sbjct: 3 NYCRKNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPE-QEEENE 61
Query: 157 EEEEDKQGEEKRVEDKIGDEKRLEE 181
E++ ++ + E+K DEK L++
Sbjct: 62 EQKPKEEIDYPIQENKSFDEKNLDD 86
Score = 28.2 bits (62), Expect = 2.3
Identities = 22/82 (26%), Positives = 43/82 (52%), Gaps = 5/82 (6%)
Query: 95 CENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
C K R + E E+ ++E EE +++ E + E+ ++ PE++E E+++
Sbjct: 5 CRKNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQK 64
Query: 155 KKEE-----EEDKQGEEKRVED 171
KEE +E+K +EK ++D
Sbjct: 65 PKEEIDYPIQENKSFDEKNLDD 86
Score = 26.2 bits (57), Expect = 8.7
Identities = 22/82 (26%), Positives = 37/82 (45%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+ EE E++ E+ EE ++Q E E E E E + P+E+ +E K +E +
Sbjct: 28 KKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKPKEEIDYPIQENKSFDEKNLDDL 87
Query: 167 KRVEDKIGDEKRLEENKKKKTQ 188
+R I E +N K +
Sbjct: 88 ERSNSDIYSESHKYDNASDKLE 109
>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
Length = 249
Score = 28.5 bits (64), Expect = 2.0
Identities = 12/43 (27%), Positives = 15/43 (34%), Gaps = 4/43 (9%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE 153
EE EE+EE E E + E E E +E
Sbjct: 208 EEAEEKEEAAAETAGVEDAAAAAAEAEEGE----EWVAAANNE 246
Score = 27.7 bits (62), Expect = 3.2
Identities = 12/57 (21%), Positives = 18/57 (31%), Gaps = 15/57 (26%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
R+ EE EE+EE E ++ + E+ EE E
Sbjct: 205 RDPEEAEEKEEAAAETAGVEDAAAAA---------------AEAEEGEEWVAAANNE 246
Score = 26.9 bits (60), Expect = 6.5
Identities = 11/37 (29%), Positives = 13/37 (35%), Gaps = 1/37 (2%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQ-DEYLEEDENEKE 140
RD EE+EE E E+ EE E
Sbjct: 205 RDPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVA 241
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 28.8 bits (64), Expect = 2.0
Identities = 22/83 (26%), Positives = 43/83 (51%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+ + E EEEEEE+ E+ ++ E+E +++L E + K+EE E K EE
Sbjct: 346 KELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEE 405
Query: 167 KRVEDKIGDEKRLEENKKKKTQK 189
K + + ++ E+ K++ ++
Sbjct: 406 KEAKLLLELSEQEEDLLKEEKKE 428
Score = 28.8 bits (64), Expect = 2.2
Identities = 28/106 (26%), Positives = 50/106 (47%)
Query: 81 NLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
L + + K+ I + K A EEEEE+ E+ +E+ EQ ++E L + + E E
Sbjct: 326 KLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESE 385
Query: 141 RRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKK 186
R + E+ E K EEE + + ++ D + E+ ++ K
Sbjct: 386 RLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELK 431
Score = 27.6 bits (61), Expect = 4.5
Identities = 28/101 (27%), Positives = 47/101 (46%)
Query: 89 LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
L L E ++ +EE E ++E E+EEE E E EKE++L EE+
Sbjct: 230 LDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEEL 289
Query: 149 RGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
+ +E++E + + E+R D K E+ KK ++
Sbjct: 290 KLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKE 330
Score = 27.6 bits (61), Expect = 5.3
Identities = 27/106 (25%), Positives = 49/106 (46%), Gaps = 4/106 (3%)
Query: 85 AVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLP 144
A+ K + + E +EEE EE E +EEQ+ ++ EE+ E +
Sbjct: 806 ALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEIT 865
Query: 145 EEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
+E+ E K+EE E E+K ++ E++ +E KK+ ++
Sbjct: 866 KEELLQELLLKEEELE----EQKLKDELESKEEKEKEEKKELEEES 907
>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 379
Score = 28.8 bits (64), Expect = 2.0
Identities = 9/83 (10%), Positives = 26/83 (31%), Gaps = 2/83 (2%)
Query: 101 KAGARDETAREEEEEEEEEE--EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
+ + + + E++ + + + Q EE + +K E+
Sbjct: 210 LSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAELSKALAALEAANAADED 269
Query: 159 EEDKQGEEKRVEDKIGDEKRLEE 181
+D+ + +G + L
Sbjct: 270 PQDRDAAVEAAARLMGAARALVP 292
>gnl|CDD|220098 pfam09057, Smac_DIABLO, Second Mitochondria-derived Activator of
Caspases. Second Mitochondria-derived Activator of
Caspases promotes apoptosis by activating caspases in
the cytochrome c/Apaf-1/caspase-9 pathway, and by
opposing the inhibitory activity of inhibitor of
apoptosis proteins (XIAP-BIR3). The protein assumes an
elongated three-helix bundle structure, and forms a
dimer in solution.
Length = 234
Score = 28.3 bits (63), Expect = 2.1
Identities = 14/62 (22%), Positives = 26/62 (41%), Gaps = 2/62 (3%)
Query: 76 VRVRTNLSGAVLRLK--KNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLE 133
V R +L A +++ + + E + A ++ E + E + E E + YL
Sbjct: 173 VAARNHLQVAQSQVEEVRQLSKEAEKKLAESKAEEIQRMAEYASSIDLSELEDIPEAYLR 232
Query: 134 ED 135
ED
Sbjct: 233 ED 234
>gnl|CDD|234456 TIGR04074, bacter_Hen1, 3' terminal RNA ribose
2'-O-methyltransferase Hen1. Members of this protein
family are bacterial Hen1, a 3' terminal RNA ribose
2'-O-methyltransferase that acts in bacterial RNA
repair. All members of the seed alignment belong to a
cassette with the RNA repair enzyme polynucleotide
kinase-phosphatase (Pnkp). Chemically similar Hen1 in
eukaryotes acts instead on small regulatory RNAs
[Transcription, RNA processing, Protein synthesis, tRNA
and rRNA base modification].
Length = 462
Score = 28.4 bits (64), Expect = 2.1
Identities = 17/40 (42%), Positives = 23/40 (57%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRL 143
ARD AR E +E E EE E E+ Q+E E+ + +RL
Sbjct: 232 ARDALARLAEADEAEPEEAETEEAQEEAAEKPPSLNRQRL 271
>gnl|CDD|177543 PHA03146, PHA03146, helicase-primase primase subunit; Provisional.
Length = 1075
Score = 28.8 bits (64), Expect = 2.1
Identities = 16/58 (27%), Positives = 25/58 (43%), Gaps = 4/58 (6%)
Query: 95 CENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEY--LEEDE--NEKERRLPEEDE 148
C +A A E + + + EE E D+Y +EEDE ++ E + E E
Sbjct: 693 CPPAAARALAAGEASSLSDYVDGVEECYMESDFLDDYAAMEEDEGWDDYEMAVAREGE 750
>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein. This family consists of several
eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
to suppress GCR2 and is highly expressed in the muscle
and heart. The function of this family is unknown
although it has been speculated that SGT1 may be
functionally analogous to the Gcr2p protein of
Saccharomyces cerevisiae which is known to be a
regulatory factor of glycolytic gene expression.
Length = 557
Score = 28.5 bits (64), Expect = 2.1
Identities = 12/71 (16%), Positives = 30/71 (42%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+ A E+++E+++E+E+ E LEE ++ + L + D +
Sbjct: 480 DLPDDSDDADEDDDEDDDEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSS 539
Query: 161 DKQGEEKRVED 171
+ ++ +E
Sbjct: 540 GAEDDDDDIEG 550
>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
Length = 869
Score = 28.8 bits (65), Expect = 2.2
Identities = 13/34 (38%), Positives = 19/34 (55%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
E EE EEE EE EE +D E+D ++++
Sbjct: 829 EPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDED 862
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 27.7 bits (62), Expect = 2.2
Identities = 13/56 (23%), Positives = 37/56 (66%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
+E +EE EE+ ++++ ++++ D++ + + +D+ E++ +E++ED++ EE
Sbjct: 74 DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129
Score = 26.5 bits (59), Expect = 5.5
Identities = 7/53 (13%), Positives = 29/53 (54%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
+ ++ + + +++ + +++ +++ D++ EED +E++ E++E
Sbjct: 77 DEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129
>gnl|CDD|222800 PHA00458, PHA00458, single-stranded DNA-binding protein.
Length = 233
Score = 28.2 bits (63), Expect = 2.3
Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 15/75 (20%)
Query: 87 LRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEE 146
L+L+ +L E F G D+ A E+EE+ E + + QD EE
Sbjct: 171 LQLESVMLVELAEFGGGGEDDWADEDEEDGYVASEAKARKPQD---------------EE 215
Query: 147 DERGEDEEKKEEEED 161
+ +DEE+ EE+ED
Sbjct: 216 EWDEDDEEESEEDED 230
>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
Length = 432
Score = 28.5 bits (64), Expect = 2.4
Identities = 11/42 (26%), Positives = 21/42 (50%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
ET +EE+ + + +E+ LE +E + R E++E
Sbjct: 97 ETQQEEKAVNPFTPQPGQREERRPTLESEEEWRARLKREQEE 138
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 28.7 bits (64), Expect = 2.4
Identities = 32/133 (24%), Positives = 53/133 (39%), Gaps = 10/133 (7%)
Query: 65 VLVMPLGILTLVRV------RTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEE 118
+L M +G LTL +V R L+ V LKK L A ++ + ++E+ +
Sbjct: 1115 LLSMAIGTLTLEKVQELCADRDKLNIEVEDLKKTTPKSLWLKDLDALEKELDKLDKEDAK 1174
Query: 119 EEEEEEEQEQDEYLEEDENEKE--RRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
EE E+ ++ E K+ R+ P++ + +K E E EE + E
Sbjct: 1175 AEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETT--EETYGSSAMETE 1232
Query: 177 KRLEENKKKKTQK 189
E K K
Sbjct: 1233 NVAEVVKPKGRAG 1245
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
kDa subunit. This family contains several DNA-directed
RNA polymerase 19 kDa polypeptides. The Poxvirus
DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
DNA-template-directed extension of the 3'-end of an RNA
strand by one nucleotide at a time.
Length = 167
Score = 27.8 bits (62), Expect = 2.4
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 2/66 (3%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
E ++++ EE EEE+E +E E E+ L + + + E E ED K+V +K
Sbjct: 11 ESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQSNYKIES--ASENIEDATSNPKQVSEK 68
Query: 173 IGDEKR 178
I KR
Sbjct: 69 ISAIKR 74
>gnl|CDD|236568 PRK09560, nhaA, pH-dependent sodium/proton antiporter; Reviewed.
Length = 389
Score = 28.3 bits (64), Expect = 2.4
Identities = 8/19 (42%), Positives = 15/19 (78%)
Query: 52 FLMSLAITDLMVAVLVMPL 70
FL++LAI D + A++++ L
Sbjct: 155 FLLALAIIDDLGAIVIIAL 173
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 28.5 bits (64), Expect = 2.4
Identities = 12/69 (17%), Positives = 30/69 (43%), Gaps = 5/69 (7%)
Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLE 180
E E + ++ +N+++ ++ + KK+ ++D + + + E +
Sbjct: 1 AMNENGENEVKQQKQQNKQK-----GTKKKNKKSKKDVDDDDAFLAELISENQEAENKQN 55
Query: 181 ENKKKKTQK 189
KKKK +K
Sbjct: 56 NKKKKKKKK 64
>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
Length = 232
Score = 28.3 bits (63), Expect = 2.5
Identities = 18/52 (34%), Positives = 27/52 (51%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
+ET E E+E EEEEEE+ + E EK L +E+ R + ++E
Sbjct: 63 EETEAEVEDEGGEEEEEEDADVETELRPRGLTEKTPELSDEEARLLTQRRRE 114
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 28.5 bits (64), Expect = 2.5
Identities = 28/121 (23%), Positives = 53/121 (43%), Gaps = 35/121 (28%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEY------------------LEEDENEKERRL-- 143
A +E EE++E ++E++EEE Y L + +KE+ L
Sbjct: 1061 AEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPIWSLTKEKVEKLNAELEKKEKELEK 1120
Query: 144 -----PEE---------DERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
P++ +E E++E+ EE+E + + + + K G +L + K KK +K
Sbjct: 1121 LKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTK-GKASKLRKPKLKKKEK 1179
Query: 190 R 190
+
Sbjct: 1180 K 1180
>gnl|CDD|221342 pfam11960, DUF3474, Domain of unknown function (DUF3474). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria and eukaryotes. This domain
is typically between 126 to 140 amino acids in length.
This domain is found associated with pfam00487.
Length = 135
Score = 27.7 bits (62), Expect = 2.5
Identities = 12/18 (66%), Positives = 14/18 (77%)
Query: 112 EEEEEEEEEEEEEEQEQD 129
EEEEEE +EEEE+E D
Sbjct: 74 EEEEEENGNDEEEEEEFD 91
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 28.5 bits (64), Expect = 2.6
Identities = 12/58 (20%), Positives = 26/58 (44%), Gaps = 5/58 (8%)
Query: 133 EEDENEKER-----RLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKK 185
+E++N ++E+ ++ E E+ G + ++K+ K L+E KK
Sbjct: 220 KEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKK 277
Score = 27.0 bits (60), Expect = 7.0
Identities = 15/88 (17%), Positives = 34/88 (38%)
Query: 90 KKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDER 149
+ NI E + + + + +EE++ + E ++ E + EK ++ + E+
Sbjct: 215 EININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEK 274
Query: 150 GEDEEKKEEEEDKQGEEKRVEDKIGDEK 177
+ D + K KI +E
Sbjct: 275 AKKLGISIILFDDMTKNKTTNYKIQNED 302
>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
Length = 2033
Score = 28.7 bits (63), Expect = 2.6
Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 3/53 (5%)
Query: 105 RDETAREEEEEEEEEEEEEEEQ---EQDEYLEEDENEKERRLPEEDERGEDEE 154
R E AR + E ++EEE E Q+ + ++ E R E + EDE+
Sbjct: 315 RKEAARRSPDAERNDDEEEYESLPCAQEFFTSQELAEINARAIAESDEAEDED 367
>gnl|CDD|215107 PLN00220, PLN00220, tubulin beta chain; Provisional.
Length = 447
Score = 28.3 bits (63), Expect = 2.7
Identities = 14/22 (63%), Positives = 17/22 (77%)
Query: 105 RDETAREEEEEEEEEEEEEEEQ 126
+D TA EE E E+EEEEEE E+
Sbjct: 426 QDATADEEGEYEDEEEEEEYEE 447
>gnl|CDD|217526 pfam03385, DUF288, Protein of unknown function, DUF288.
Length = 390
Score = 28.2 bits (63), Expect = 2.7
Identities = 10/20 (50%), Positives = 12/20 (60%)
Query: 36 CLAISWEKRLQNVTNYFLMS 55
C+ + E L NV YFLMS
Sbjct: 278 CVTLVKELGLNNVEGYFLMS 297
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 28.0 bits (63), Expect = 2.8
Identities = 22/86 (25%), Positives = 43/86 (50%), Gaps = 5/86 (5%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK-EEEEDKQG 164
E A+ E E E+E E+++E+++ +E E + + + E+ E E +K E+++
Sbjct: 208 AERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERML 267
Query: 165 EEKRVEDKIGDEKRLEENKKKKTQKR 190
E K E E+ L+E K + +
Sbjct: 268 EHKLQE----QEELLKEGFKTEAESL 289
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 28.2 bits (63), Expect = 2.9
Identities = 24/85 (28%), Positives = 40/85 (47%), Gaps = 7/85 (8%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
+D + +E+ E+EE EE E E E E++RRL + E E+E++ + ++KQ
Sbjct: 130 KDVIQKNKEKSTREQEELEEALEF-----EKEEEEQRRLLLQKE--EEEQQMNKRKNKQA 182
Query: 165 EEKRVEDKIGDEKRLEENKKKKTQK 189
+E L KK + K
Sbjct: 183 LLDELETSTLPAAELIAQHKKNSVK 207
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 27.8 bits (62), Expect = 3.0
Identities = 18/84 (21%), Positives = 36/84 (42%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEE 166
E EE + E E +EQ + E E E P+ E+ + E+K ++ + K +
Sbjct: 45 AAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPK 104
Query: 167 KRVEDKIGDEKRLEENKKKKTQKR 190
+ + K+ + + ++ K K
Sbjct: 105 PKPKPKVKPQPKPKKPPSKTAAKA 128
>gnl|CDD|225340 COG2719, SpoVR, Uncharacterized conserved protein [Function
unknown].
Length = 495
Score = 28.2 bits (63), Expect = 3.1
Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 7/60 (11%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
R + EEE +EE EE Q Q N+ R LP++ ++ + + +
Sbjct: 177 KRPKKISSEEERARQEEREEYLQSQ-------VNDLWRTLPKKPGEAAVKDARRFPSEPE 229
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or more
acidic residues. The function of members of this family
is unknown.
Length = 104
Score = 26.9 bits (60), Expect = 3.1
Identities = 10/61 (16%), Positives = 33/61 (54%)
Query: 101 KAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEE 160
+A A D +++E+EE+E++ + D+ ++D+ + + +D+ +++++
Sbjct: 44 RAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDD 103
Query: 161 D 161
+
Sbjct: 104 E 104
Score = 26.5 bits (59), Expect = 3.8
Identities = 12/62 (19%), Positives = 35/62 (56%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
K+ A A + +++E+EE+E++ D+ ++D+++ ++ + +D++ E+
Sbjct: 41 AKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLED 100
Query: 160 ED 161
+D
Sbjct: 101 DD 102
Score = 26.1 bits (58), Expect = 6.6
Identities = 12/58 (20%), Positives = 32/58 (55%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
A E+ +++E+EE+E+ + ++D+++ + ++D+ D++ + ED E
Sbjct: 47 AADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104
>gnl|CDD|227574 COG5249, RER1, Golgi protein involved in Golgi-to-ER retrieval
[Intracellular trafficking and secretion].
Length = 180
Score = 27.5 bits (61), Expect = 3.2
Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 5/29 (17%)
Query: 124 EEQEQDEYLEEDENEKE-----RRLPEED 147
E+ E D+ +EE E + E RRLPE
Sbjct: 85 EQIEDDDEIEEGEKDNEFRPFIRRLPEFK 113
>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
Length = 586
Score = 28.2 bits (63), Expect = 3.2
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 98 QLFKAGARDETAREEEEEEEEEEEEEEEQ 126
QL + E EE +E+EEE+ E+E
Sbjct: 50 QLLEQKEEKEEDAGEEGDEDEEEQGEDEH 78
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 26.7 bits (59), Expect = 3.4
Identities = 15/28 (53%), Positives = 23/28 (82%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQD 129
A A +E A E+EEE+++EEE+EEE+E+
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEEA 98
Score = 26.3 bits (58), Expect = 5.0
Identities = 14/28 (50%), Positives = 22/28 (78%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQD 129
A A E EE+EEE+++EEE+EE+E++
Sbjct: 70 AAAAAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 25.9 bits (57), Expect = 6.7
Identities = 14/32 (43%), Positives = 23/32 (71%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQEQDEYLE 133
A A + EE+EEE+++EEE+E+E++E L
Sbjct: 69 AAAAAAEEKAEEKEEEKKKEEEKEEEEEEALA 100
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 27.9 bits (62), Expect = 3.4
Identities = 12/68 (17%), Positives = 21/68 (30%), Gaps = 7/68 (10%)
Query: 123 EEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEEN 182
E + + E K R+ E +K+ E + E R E +
Sbjct: 328 ENAEIKKTRTAEKNEAKARK-------KEIAQKRRAAEREINREARQERAAAMARARARR 380
Query: 183 KKKKTQKR 190
K +K+
Sbjct: 381 AAVKAKKK 388
>gnl|CDD|221323 pfam11931, DUF3449, Domain of unknown function (DUF3449). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 181 to 207 amino acids in length. This domain
has two conserved sequence motifs: PIP and CEICG. The
domain carries a zinc-finger domain of the C2H2-type.
Length = 187
Score = 27.2 bits (61), Expect = 3.8
Identities = 9/32 (28%), Positives = 15/32 (46%)
Query: 108 TAREEEEEEEEEEEEEEEQEQDEYLEEDENEK 139
T E + +E E+ E D+ ++DE E
Sbjct: 32 TEEERQASADESSEDASEDGSDDDSDDDEEEP 63
>gnl|CDD|224559 COG1645, COG1645, Uncharacterized Zn-finger containing protein
[General function prediction only].
Length = 131
Score = 27.0 bits (60), Expect = 3.8
Identities = 25/78 (32%), Positives = 32/78 (41%), Gaps = 8/78 (10%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
E EEEEEE E + E+ RR E DE KKE +ED E ++
Sbjct: 51 GYREVVVEEEEEEV------EAEVQEQLRRSRPELPDDSDELKKEIDEDPASE--KLLSV 102
Query: 173 IGDEKRLEENKKKKTQKR 190
I KR+E + R
Sbjct: 103 IETLKRIEARRDLSRAGR 120
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 27.5 bits (61), Expect = 3.8
Identities = 15/56 (26%), Positives = 38/56 (67%), Gaps = 4/56 (7%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
D+ + + + ++ ++EE++ + +Y+EE E +++ EE+E E+EE++E+E++
Sbjct: 234 DDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKED----EEEEETEEEEEEEDEDE 285
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 27.8 bits (62), Expect = 3.9
Identities = 28/99 (28%), Positives = 53/99 (53%), Gaps = 5/99 (5%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPE-EDERGEDEE 154
E + +A A+ E AR E EE + +E+ +QEQ +DE ++R E E +R ++EE
Sbjct: 74 EYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEE 133
Query: 155 --KKEEEEDKQGEE--KRVEDKIGDEKRLEENKKKKTQK 189
K +EE + E + E++I + +R ++ + ++
Sbjct: 134 LLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELER 172
>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
motility and secretion / Intracellular trafficking and
secretion].
Length = 696
Score = 28.0 bits (63), Expect = 4.0
Identities = 32/122 (26%), Positives = 52/122 (42%), Gaps = 15/122 (12%)
Query: 24 LVFCTATGNILVCLAISWEKRLQNVTNYFLMS---LAITDLMVAVL-VMP-LGILTLVRV 78
L+ TA G I+V + E Q + + L I ++ +L ++P + L + +
Sbjct: 254 LLISTAAG-IIVTRVSTDESLGQQLFTQLFANPKVLLIAAGVLFLLGLIPGMPHLPFLLL 312
Query: 79 RTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENE 138
L+ L K E Q + + EE ++EEEEEEE+ D+ L D E
Sbjct: 313 AGLLAFLAYLLSKRK--EQQAEEKE-------KPAEEAKKEEEEEEEESVDDVLLIDPIE 363
Query: 139 KE 140
E
Sbjct: 364 LE 365
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 27.7 bits (62), Expect = 4.0
Identities = 15/59 (25%), Positives = 28/59 (47%), Gaps = 2/59 (3%)
Query: 132 LEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
L + E + ++ + ++K+E EED + E R E K ++ E KKK + +
Sbjct: 211 LLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKE--EIKKKKPKESK 267
Score = 26.5 bits (59), Expect = 7.8
Identities = 14/75 (18%), Positives = 31/75 (41%), Gaps = 4/75 (5%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
+ +E + + + E++K++R EED E + E K+ ++ ++
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEED----VEAAESRAEKKRKSKEEIKK 260
Query: 172 KIGDEKRLEENKKKK 186
K E + + KK
Sbjct: 261 KKPKESKGVKALKKV 275
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 27.8 bits (62), Expect = 4.1
Identities = 22/80 (27%), Positives = 40/80 (50%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
E E EE EE E+E +E +E L E +E E +E+ E E EE ++ E
Sbjct: 491 SREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKE 550
Query: 166 EKRVEDKIGDEKRLEENKKK 185
+ +++ + ++LE+ ++
Sbjct: 551 KLQLQQLKEELRQLEDRLQE 570
Score = 26.6 bits (59), Expect = 9.9
Identities = 23/80 (28%), Positives = 37/80 (46%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
EE E EEE+E E+ + LE + E E E+E E E EE E+K + +E
Sbjct: 238 EERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELE 297
Query: 171 DKIGDEKRLEENKKKKTQKR 190
+I + + E + ++
Sbjct: 298 REIEELEEELEGLRALLEEL 317
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 27.6 bits (61), Expect = 4.3
Identities = 15/45 (33%), Positives = 22/45 (48%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
EE +EE E+E E DE + E+ E E + D+ E + E
Sbjct: 343 VTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERLAETE 387
Score = 27.2 bits (60), Expect = 5.6
Identities = 11/35 (31%), Positives = 17/35 (48%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
D E +EE E+E E +E +E E + E +
Sbjct: 341 DVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQ 375
Score = 27.2 bits (60), Expect = 6.0
Identities = 11/53 (20%), Positives = 24/53 (45%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
+ E++E + EE ++E ++ +E DE+ E E + ++ E
Sbjct: 330 EVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAER 382
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 27.7 bits (62), Expect = 4.4
Identities = 16/50 (32%), Positives = 30/50 (60%)
Query: 116 EEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+E E E++EE+ ++++ ++ + L E + + E K EEEE+KQ E
Sbjct: 94 KELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQME 143
>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
modification].
Length = 429
Score = 27.6 bits (61), Expect = 4.4
Identities = 18/79 (22%), Positives = 30/79 (37%), Gaps = 3/79 (3%)
Query: 99 LFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
LF+ G + E EE +EE+ + Y ED E R+ + G E
Sbjct: 345 LFEVGPETAEGELLQFEAEEIYNDEEQYTRPVYENEDFEE---RVDDVSNGGLAEFAPAA 401
Query: 159 EEDKQGEEKRVEDKIGDEK 177
E E+ +V + ++
Sbjct: 402 VEVAPKEDAKVTKRGKAKR 420
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 27.7 bits (62), Expect = 4.5
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 2/75 (2%)
Query: 115 EEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIG 174
EEEEE E +++++E L + E E RL ++ E + + E E +K+ K+ E+K+
Sbjct: 185 EEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLR 244
Query: 175 DEKRLEENKKKKTQK 189
E LE + QK
Sbjct: 245 QE--LERQAEAHEQK 257
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 27.7 bits (61), Expect = 4.5
Identities = 30/99 (30%), Positives = 50/99 (50%), Gaps = 14/99 (14%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEEE--QEQDEYLEED-----ENEKERRLPEEDERGED 152
K G R +E++ E E +E ++ E++ D E E+ + +EDE ED
Sbjct: 165 MKRGTRITLHLKEDQMEYLEPRRLKELIKKHSEFIGYDIELMVEKTTEKEVTDEDE--ED 222
Query: 153 EEKKEEEEDKQGEEKRVED-KIGDEKRLEENKKKKTQKR 190
+K +E+ GEE +VE+ K GDE + ++ KK K +
Sbjct: 223 TKKADED----GEEPKVEEVKEGDEGKKKKTKKVKEVTK 257
>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
synthase complex, beta subunit. Nomenclature follows
the description for Methanosarcina thermophila. The
CO-methylating acetyl-CoA synthase is considered the
defining enzyme of the Wood-Ljungdahl pathway, used for
acetate catabolism by sulfate reducing bacteria but for
acetate biosynthesis by acetogenic bacteria such as
oorella thermoacetica (f. Clostridium thermoaceticum)
[Energy metabolism, Chemoautotrophy].
Length = 458
Score = 27.5 bits (61), Expect = 4.5
Identities = 14/39 (35%), Positives = 19/39 (48%), Gaps = 3/39 (7%)
Query: 89 LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQE 127
LK+ + + K R+ E EEEEE + EE E E
Sbjct: 388 LKEK---GHPVVKRVVREVDEEEIEEEEEAMQPEEMEME 423
Score = 27.1 bits (60), Expect = 6.5
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 109 AREEEEEEEEEEEEEEEQEQDE 130
RE +EEE EEEEE + E+ E
Sbjct: 400 VREVDEEEIEEEEEAMQPEEME 421
Score = 27.1 bits (60), Expect = 7.3
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLE 133
+ E +EEE EEEEE Q ++ +E
Sbjct: 397 KRVVREVDEEEIEEEEEAMQPEEMEME 423
>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 508
Score = 27.7 bits (61), Expect = 4.7
Identities = 18/74 (24%), Positives = 30/74 (40%), Gaps = 6/74 (8%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
AR+E +E + + + E+ EEDE+E +ED + + EE D Q
Sbjct: 439 ARNEEVQERLQTDLGSISDSEDINMGSAGEEDESE------DEDFQMVSDSDVAEEYDLQ 492
Query: 164 GEEKRVEDKIGDEK 177
E +E+
Sbjct: 493 AALSDAEGGSDEER 506
>gnl|CDD|238818 cd01640, IPPase, IPPase; Inositol polyphosphate-1-phosphatase, a
member of the Mg++ dependent family of inositol
monophosphatase-like domains, hydrolyzes the 1' position
phosphate from inositol 1,3,4-trisphosphate and inositol
1,4-bisphosphate. Members in this group may also exhibit
3'-phosphoadenosine 5'-phosphate phosphatase activity,
and they all appear to be inhibited by lithium. IPPase
is one of the proposed targets of Li+ therapy in
manic-depressive illness.
Length = 293
Score = 27.3 bits (61), Expect = 4.7
Identities = 16/38 (42%), Positives = 22/38 (57%), Gaps = 1/38 (2%)
Query: 111 EEEEEEEEEEEEEEEQEQDEY-LEEDENEKERRLPEED 147
EE+ E E +E+E + + DE LEE + LPEED
Sbjct: 69 EEDNEFENQEDESRDVDLDEEILEESCPSPSKDLPEED 106
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 27.6 bits (61), Expect = 4.9
Identities = 16/86 (18%), Positives = 44/86 (51%), Gaps = 1/86 (1%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
R+ + +E+++E+++++ + E E+ ++KE + ++ +E+ +E K
Sbjct: 7 REANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKE-EVVTDNVDSPVKEQSSQENLKIA 65
Query: 165 EEKRVEDKIGDEKRLEENKKKKTQKR 190
+E + K ++ LE K K+ ++
Sbjct: 66 DEVKKSTKEESKQLLEVLKTKEEHQK 91
Score = 27.6 bits (61), Expect = 5.1
Identities = 17/80 (21%), Positives = 38/80 (47%), Gaps = 5/80 (6%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
R+ + +E+++E+Q+Q +E KE+ L +++E D +E
Sbjct: 4 RNRREANINNNDRMQEKDDEKQDQKNRME----LKEKVLDKKEEVVTDNVDSPVKEQSSQ 59
Query: 165 EEKRVEDKIGDEKRLEENKK 184
E ++ D++ + EE+K+
Sbjct: 60 ENLKIADEVKKSTK-EESKQ 78
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 27.7 bits (61), Expect = 5.0
Identities = 16/50 (32%), Positives = 25/50 (50%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGED 152
R +E+ E++ E E + E+ E++ ENE E L E + GED
Sbjct: 139 KRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDDGED 188
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 27.7 bits (61), Expect = 5.0
Identities = 16/72 (22%), Positives = 27/72 (37%), Gaps = 3/72 (4%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLE---EDENEKERRLPEEDERGEDEEKKEEE 159
GAR EE D+ +E DE +++ P +R + +E +E
Sbjct: 400 GARGSPMAAPEEGRAAAAAAAANAPADQDVEAEAHDEFDQDPGAPAHADRADSDEDDMDE 459
Query: 160 EDKQGEEKRVED 171
++ E ED
Sbjct: 460 QESGDERADGED 471
>gnl|CDD|185626 PTZ00447, PTZ00447, apical membrane antigen 1-like protein;
Provisional.
Length = 508
Score = 27.6 bits (61), Expect = 5.1
Identities = 17/60 (28%), Positives = 33/60 (55%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
+++ E E ++ +++ LEEDE + P+E E +EK+ E ++ Q +E + E K
Sbjct: 412 QKDSLETTAGERDEFKNDVLEEDEISGDNIGPDEIENDHYQEKEIENDNIQYDENKNEGK 471
>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
Length = 653
Score = 27.5 bits (61), Expect = 5.1
Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Query: 137 NEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDE 176
N+K R + +R +E +K + ED+ E RVE K G E
Sbjct: 507 NDKGRLSKADIDRMVNEAEKYKAEDEANRE-RVEAKNGLE 545
>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
Length = 470
Score = 27.5 bits (62), Expect = 5.3
Identities = 13/65 (20%), Positives = 30/65 (46%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVED 171
EE EE E + DE E+ E+ + E + ++E+ ++ +G ++ +
Sbjct: 381 EELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLEGLDRELAF 440
Query: 172 KIGDE 176
K+ ++
Sbjct: 441 KLAEK 445
Score = 26.8 bits (60), Expect = 8.6
Identities = 15/61 (24%), Positives = 28/61 (45%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
E E +EE EE + E LE + +E +L ++ E +++ + + K +E
Sbjct: 392 LEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLEGLDRELAFKLAEKGIKTLE 451
Query: 171 D 171
D
Sbjct: 452 D 452
>gnl|CDD|153125 cd02434, Nodulin-21_like_3, Nodulin-21 and CCC1-related protein
family. Nodulin-21_like_3: This is a family of proteins
closely related to nodulin-21, a plant nodule-specific
protein that may be involved in symbiotic nitrogen
fixation. This family is also related to CCC1, a yeast
vacuole transmembrane protein that functions as an iron
and manganese transporter. .
Length = 225
Score = 26.9 bits (60), Expect = 5.4
Identities = 9/39 (23%), Positives = 19/39 (48%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENE 138
A + + + E + E++ EE E + Y E +++E
Sbjct: 47 ISMAAGEYVSTKAERDFLHSEKKREEWEIENYPEGEKSE 85
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 27.3 bits (61), Expect = 5.4
Identities = 10/37 (27%), Positives = 24/37 (64%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
++EEEE +E++ ++D+ + D++++E PE +
Sbjct: 226 DDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVER 262
Score = 26.9 bits (60), Expect = 7.5
Identities = 14/54 (25%), Positives = 28/54 (51%), Gaps = 4/54 (7%)
Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEK 167
++ ++EEEE ++D+ E+D + + ++DE D E + K+ EK
Sbjct: 223 DDIDDEEEERVLADEDDDDEDDMFDMD----DDDEEESDPEVERTSTIKEVSEK 272
Score = 26.9 bits (60), Expect = 7.8
Identities = 7/36 (19%), Positives = 19/36 (52%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKE 140
+E R +E++++E++ + + D+ E D +
Sbjct: 227 DEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVER 262
>gnl|CDD|178354 PLN02753, PLN02753, triacylglycerol lipase.
Length = 531
Score = 27.4 bits (60), Expect = 5.4
Identities = 13/38 (34%), Positives = 19/38 (50%)
Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
E E + Q ++E E E L ED R D++ +EE
Sbjct: 63 ERERRERQGLLIDEAEGAGELWLTAEDIRRRDKKTEEE 100
>gnl|CDD|224400 COG1483, COG1483, Predicted ATPase (AAA+ superfamily) [General
function prediction only].
Length = 774
Score = 27.4 bits (61), Expect = 5.5
Identities = 25/100 (25%), Positives = 35/100 (35%), Gaps = 4/100 (4%)
Query: 88 RLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED 147
RL + +L + + E E EE EEE E + KE+
Sbjct: 583 RLLELLLGFIASGRRTYSNTVLVEYEAEEILSILEEETARVLACDEVMKTIKEK----YK 638
Query: 148 ERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKT 187
+ GED E+ E E ++ EK VED K
Sbjct: 639 KYGEDVEEISENEVRKIGEKAVEDLEEQFKDSFGRVAYPV 678
>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
Length = 196
Score = 26.8 bits (59), Expect = 5.5
Identities = 25/84 (29%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
Query: 106 DETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+E +E +EEE++ EQ E EDE ++ + P++ E EE K++ + K+ E
Sbjct: 2 EEVEKEINKEEEKDVNNLSSNEQMEGPPEDEQAQQNQ-PQQQTVDEIEELKQKLQQKEVE 60
Query: 166 EKRVEDKIGDEKRLEENKKKKTQK 189
+ D K EN +K+T+K
Sbjct: 61 AQEYLDIAQRLKAEFENYRKRTEK 84
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 26.5 bits (59), Expect = 5.6
Identities = 19/75 (25%), Positives = 39/75 (52%), Gaps = 12/75 (16%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQ----DEYLEEDE------NEKERRLPEEDERGEDE 153
ARD ++EE+E+ + ++ + E +Y +EDE + R+ ++ R +
Sbjct: 25 ARDGVDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERM--DERRKKRR 82
Query: 154 EKKEEEEDKQGEEKR 168
E+KE+EE ++ E+
Sbjct: 83 EQKEKEEIEKYREEN 97
>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
Length = 852
Score = 27.4 bits (61), Expect = 5.6
Identities = 7/32 (21%), Positives = 17/32 (53%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERR 142
+ E+ ++++++ LEE+E+ RR
Sbjct: 36 DSEDPASLDDDDDDRDSGGGMLEEEEDGNMRR 67
Score = 26.6 bits (59), Expect = 9.5
Identities = 11/70 (15%), Positives = 24/70 (34%), Gaps = 10/70 (14%)
Query: 110 REEEEEEEEEEEEEE-------EQEQDEYLEEDENEKERRLPEEDERGEDE---EKKEEE 159
EEE + + +++ E++ E +D+ + + EEE
Sbjct: 1 MEEESSGGIGSSSNGSGGLVGTDSGESRWVDGSEDDSEDPASLDDDDDDRDSGGGMLEEE 60
Query: 160 EDKQGEEKRV 169
ED + +
Sbjct: 61 EDGNMRRRLI 70
>gnl|CDD|240321 PTZ00230, PTZ00230, variable surface protein Vir7; Provisional.
Length = 364
Score = 27.1 bits (60), Expect = 5.7
Identities = 12/48 (25%), Positives = 20/48 (41%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
ET + ++ EE +E E +Q + E E+ +P DE
Sbjct: 211 ETDPQSKQLEEGDEGERVQQPTLTAIPGLEGEESLSVPPSDEXSPGST 258
>gnl|CDD|216095 pfam00748, Calpain_inhib, Calpain inhibitor. This region is found
multiple times in calpain inhibitor proteins.
Length = 131
Score = 26.7 bits (59), Expect = 5.8
Identities = 21/46 (45%), Positives = 25/46 (54%), Gaps = 6/46 (13%)
Query: 144 PEEDERGED---EEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKK 186
PEE + ED E+ KEE+ +K GE ED I E RL E K K
Sbjct: 78 PEEKKPVEDKVKEKAKEEKLEKLGER---EDTIPPEYRLLEAKDKD 120
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 26.5 bits (59), Expect = 5.9
Identities = 17/50 (34%), Positives = 30/50 (60%)
Query: 141 RRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
RR E DE+ K+E ED++ ++KR E K DE++ + + K+ +K+
Sbjct: 45 RRREYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKK 94
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 27.5 bits (62), Expect = 5.9
Identities = 25/97 (25%), Positives = 43/97 (44%), Gaps = 12/97 (12%)
Query: 97 NQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKK 156
N+L + E E++ EE E +E E+ ++E E+ E +E ED+ +
Sbjct: 519 NELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE---------EDKLLE 569
Query: 157 EEEEDKQ---GEEKRVEDKIGDEKRLEENKKKKTQKR 190
E E++ Q E K+ D+I E R + + K
Sbjct: 570 EAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKA 606
>gnl|CDD|145910 pfam03012, PP_M1, Phosphoprotein. This family includes the M1
phosphoprotein non-structural RNA polymerase alpha
subunit, which is thought to be a component of the
active polymerase, and may be involved in template
binding.
Length = 298
Score = 27.0 bits (60), Expect = 6.0
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDEN 137
+ EEE+E+E E+D Y+ E E+
Sbjct: 64 SQFTEEEDEDEGSSEEDFYMTEGED 88
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 27.4 bits (61), Expect = 6.1
Identities = 20/92 (21%), Positives = 41/92 (44%), Gaps = 5/92 (5%)
Query: 98 QLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKE 157
+ + E +E ++E EE+ E E E L + N++ + R E EE +
Sbjct: 27 KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEA-----RREREELQR 81
Query: 158 EEEDKQGEEKRVEDKIGDEKRLEENKKKKTQK 189
EEE +E++++ + LE +++ +
Sbjct: 82 EEERLVQKEEQLDARAEKLDNLENQLEEREKA 113
>gnl|CDD|214923 smart00937, PCRF, This domain is found in peptide chain release
factors.
Length = 116
Score = 26.3 bits (59), Expect = 6.1
Identities = 15/37 (40%), Positives = 21/37 (56%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRL 143
E EEE++E E EEE +E + LEE+ E + L
Sbjct: 1 ELLEEEEDDEMRELAEEELEELEAELEEELEELLKLL 37
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 27.2 bits (61), Expect = 6.1
Identities = 29/106 (27%), Positives = 44/106 (41%), Gaps = 11/106 (10%)
Query: 89 LKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDE 148
LKK +L + G E RE +EE EE E+E E ++ L+E + L E
Sbjct: 197 LKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEE--LKELAKKYLEELLALYE 254
Query: 149 RGEDEEKKEEEEDKQGEEKR-------VEDKIGDEKRLEENKKKKT 187
E E ++ E K + + V + K+L+E K T
Sbjct: 255 YLEIELERAEALSKFLKTDKTFAIEGWVPED--RVKKLKELIDKAT 298
>gnl|CDD|234070 TIGR02956, TMAO_torS, TMAO reductase sytem sensor TorS. This
protein, TorS, is part of a regulatory system for the
torCAD operon that encodes the pterin molybdenum
cofactor-containing enzyme trimethylamine-N-oxide (TMAO)
reductase (TorA), a cognate chaperone (TorD), and a
penta-haem cytochrome (TorC). TorS works together with
the inducer-binding protein TorT and the response
regulator TorR. TorS contains histidine kinase ATPase
(pfam02518), HAMP (pfam00672), phosphoacceptor
(pfam00512), and phosphotransfer (pfam01627) domains and
a response regulator receiver domain (pfam00072) [Signal
transduction, Two-component systems].
Length = 968
Score = 27.4 bits (61), Expect = 6.1
Identities = 18/63 (28%), Positives = 30/63 (47%)
Query: 104 ARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
RD A + + +E + +E QE E LE+ ++ + L E +ER E K + +
Sbjct: 397 FRDTAAHNLKLQADERQVAQELQEHKESLEQLVAQRTQELAETNERLNAEVKNHAKARAE 456
Query: 164 GEE 166
EE
Sbjct: 457 AEE 459
>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
Provisional.
Length = 337
Score = 27.1 bits (60), Expect = 6.1
Identities = 9/62 (14%), Positives = 24/62 (38%), Gaps = 3/62 (4%)
Query: 103 GARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDK 162
G + + +E++++ + + + + + EED+RGE E + +
Sbjct: 277 GEAIDEDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTN---EEDDRGEKESNGAKSNEL 333
Query: 163 QG 164
Sbjct: 334 HL 335
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 27.5 bits (61), Expect = 6.3
Identities = 19/74 (25%), Positives = 34/74 (45%), Gaps = 3/74 (4%)
Query: 87 LRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKER---RL 143
L +K L E + ++ E E +E+ +EE+ ++Q E L+++E +K RL
Sbjct: 2561 LENEKLKLLEKAKIEESRKERERIESETQEDNTDEEQINRQQQERLQKEEEQKAYSQERL 2620
Query: 144 PEEDERGEDEEKKE 157
E +D K
Sbjct: 2621 NREVSGTDDTNKNH 2634
>gnl|CDD|218967 pfam06273, eIF-4B, Plant specific eukaryotic initiation factor 4B.
This family consists of several plant specific
eukaryotic initiation factor 4B proteins.
Length = 496
Score = 27.4 bits (60), Expect = 6.3
Identities = 17/71 (23%), Positives = 34/71 (47%), Gaps = 3/71 (4%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
R+ + E E + E E+++ L+E+ N L +E+ + + +E+ +G + +
Sbjct: 350 RKIDLELEHRRVDRPETEEEKNLKEEINLLRVDLKKEEAIAPESKGSGQEQSHKGLSELI 409
Query: 170 EDKIGDEKRLE 180
K EK LE
Sbjct: 410 RQK---EKELE 417
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 27.3 bits (61), Expect = 6.5
Identities = 21/78 (26%), Positives = 32/78 (41%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKR 168
R++ E E E+ EE+ E E E E EE +EE E E + + E +
Sbjct: 731 LRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQ 790
Query: 169 VEDKIGDEKRLEENKKKK 186
+E + K L E +
Sbjct: 791 IEQLKEELKALREALDEL 808
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 27.3 bits (61), Expect = 6.7
Identities = 16/76 (21%), Positives = 32/76 (42%), Gaps = 3/76 (3%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
+E + E E + EE + ++ +EE + E RL E DE E E+ ++
Sbjct: 84 PEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEE---LEDLEDLLEELE 140
Query: 165 EEKRVEDKIGDEKRLE 180
++ + + L+
Sbjct: 141 PLAYLDFDLSLLRGLK 156
>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
modification, protein turnover, chaperones].
Length = 623
Score = 27.2 bits (61), Expect = 6.8
Identities = 14/42 (33%), Positives = 23/42 (54%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDEN 137
E + FK+ + + E EEE+E + EEE++E LE +
Sbjct: 472 EGKPFKSITKGDLDLELLEEEDEADSEEEKKEFKPLLERVKE 513
>gnl|CDD|215349 PLN02647, PLN02647, acyl-CoA thioesterase.
Length = 437
Score = 27.1 bits (60), Expect = 7.1
Identities = 16/46 (34%), Positives = 22/46 (47%), Gaps = 6/46 (13%)
Query: 136 ENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEE 181
E E+E+ L EE E KK+ E K+ E G+ +RLE
Sbjct: 225 ETEEEKLLFEEAEARNKLRKKKRGEQKREFEN------GEAERLEA 264
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 26.7 bits (59), Expect = 7.2
Identities = 11/44 (25%), Positives = 23/44 (52%)
Query: 147 DERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKKTQKR 190
E + + + +K+ ++K+ ED +K+ +E KKKK +
Sbjct: 127 PEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHS 170
>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
chaperones [Posttranslational modification, protein
turnover, chaperones].
Length = 786
Score = 27.2 bits (61), Expect = 7.3
Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 8/93 (8%)
Query: 102 AGARDETAREEEEEEEEEEEEEEEQE---QDEYLEEDENEKERRLPEEDERGEDEEKKEE 158
AGAR ++ EE +E E E + E + E+DE EK+ + + E+
Sbjct: 389 AGARVRLEIDKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEK 448
Query: 159 EEDKQGEEKRVEDKIGD-----EKRLEENKKKK 186
E + + +E + + + +L E++K+K
Sbjct: 449 ELEAEVDEDDIAEVVARWTGIPVAKLLEDEKEK 481
>gnl|CDD|216096 pfam00749, tRNA-synt_1c, tRNA synthetases class I (E and Q),
catalytic domain. Other tRNA synthetase sub-families
are too dissimilar to be included. This family includes
only glutamyl and glutaminyl tRNA synthetases. In some
organisms, a single glutamyl-tRNA synthetase
aminoacylates both tRNA(Glu) and tRNA(Gln).
Length = 314
Score = 26.9 bits (60), Expect = 7.3
Identities = 20/53 (37%), Positives = 24/53 (45%), Gaps = 6/53 (11%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEED-ERGEDEEKK 156
EE EEE EE+E E+ Y EE RL EE+ +GE E K
Sbjct: 96 VCFCTPEELEEEREEQEALGSPERPRYDEEC-----LRLFEEEMRKGEAEGGK 143
>gnl|CDD|226246 COG3723, RecT, Recombinational DNA repair protein (RecE pathway)
[DNA replication, recombination, and repair].
Length = 276
Score = 26.7 bits (59), Expect = 7.4
Identities = 14/55 (25%), Positives = 25/55 (45%)
Query: 118 EEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDK 172
E+E E + D EED + + E E E +E+ E+ D E++ + +
Sbjct: 222 TEDEAERREPLDVTPEEDSEQLDSEPSGEPEAQETDEENEKPIDVSEEKEPITES 276
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 27.0 bits (60), Expect = 7.4
Identities = 7/41 (17%), Positives = 17/41 (41%), Gaps = 7/41 (17%)
Query: 121 EEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEED 161
++E + D+ E+ + + +D+ E+ ED
Sbjct: 250 DDEGDGSDDDDDEDAIES-------DLDDSDDDVSDEDGED 283
Score = 26.6 bits (59), Expect = 8.6
Identities = 7/59 (11%), Positives = 25/59 (42%)
Query: 96 ENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
+ Q + R + + ++E + +++ ++D + ++ + E+ E D +
Sbjct: 230 KKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDDDVSDEDGEDLFDTD 288
>gnl|CDD|113857 pfam05102, Holin_BlyA, holin, BlyA family. BlyA, a small holin
found in Borrelia circular plasmids that is encoded by
a prophage. BlyA contains two largely hydrophobic
helices and a highly charged C-terminus and is membrane
associated.
Length = 61
Score = 25.1 bits (55), Expect = 7.6
Identities = 14/55 (25%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 39 ISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNI 93
IS L + N L+ L I +++ ++ LG++ ++ LS + ++KKN
Sbjct: 4 ISINDFLLQLINIKLIKLMI---FISIFILSLGLIFKPVIKDMLSILISKIKKND 55
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 27.1 bits (60), Expect = 7.7
Identities = 16/62 (25%), Positives = 26/62 (41%)
Query: 100 FKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
F+ AR+E + E E E + E E E + +R L EE+ +E +
Sbjct: 728 FREAAREEQQLTQRESRLESLEAQLEGVAAEAYELSASLDQRELKEEELALLEEAIDALD 787
Query: 160 ED 161
E+
Sbjct: 788 EE 789
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 25.9 bits (57), Expect = 7.7
Identities = 10/22 (45%), Positives = 16/22 (72%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEE 134
++EEE+EEE E +++E L E
Sbjct: 96 KKEEEKEEEIPEPTKEEELLGE 117
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 26.5 bits (59), Expect = 7.8
Identities = 15/51 (29%), Positives = 30/51 (58%), Gaps = 3/51 (5%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQ 163
E+ E EEEE+ +E+E++ L ++ E+ ++ E +KEE+E+ +
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQ---KAKEMQKEEDEEMR 167
>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
Length = 520
Score = 27.0 bits (60), Expect = 7.9
Identities = 16/77 (20%), Positives = 30/77 (38%), Gaps = 3/77 (3%)
Query: 105 RDETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQG 164
+ R E E+E + E P++D+ G+DE+ + E+
Sbjct: 387 GEFAERRRPRGPPENEDEADGGGGLAPPPEPPRPPPPPFPDDDDEGDDEDDDDWEDLGFD 446
Query: 165 EEKRVEDKIGDEKRLEE 181
+ ED + D K ++E
Sbjct: 447 LD---EDDVYDLKDVDE 460
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 27.0 bits (59), Expect = 8.0
Identities = 18/90 (20%), Positives = 38/90 (42%), Gaps = 5/90 (5%)
Query: 76 VRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEEQEQDEYLEED 135
+RV T A +K L + + AR+ + + ++ ++E + +QDE +
Sbjct: 612 IRVPTRRELASYGIK---LPSQRAAEEKAREAQRNQYDSGDQYNDDEIDAMQQDELARQF 668
Query: 136 ENEKERRLPEEDERGEDEEKKEEEEDKQGE 165
+++R E+ + D E+ D E
Sbjct: 669 AQTQQQRYGEQYQ--HDVPVNAEDADAAAE 696
>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
Length = 325
Score = 26.8 bits (60), Expect = 8.3
Identities = 7/47 (14%), Positives = 20/47 (42%)
Query: 107 ETAREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDE 153
+ E+ + +E E ++ +EE+ ++ + E +G +
Sbjct: 3 LKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQ 49
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 26.7 bits (60), Expect = 8.6
Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 1/32 (3%)
Query: 114 EEEEEEEEEEEEQEQDEYLEEDENEKERRLPE 145
EE+ E EE E E EE + E+++ LP
Sbjct: 63 LEEQRERLEELAPELLEEEEEKK-EEKKGLPP 93
>gnl|CDD|227268 COG4932, COG4932, Predicted outer membrane protein [Cell envelope
biogenesis, outer membrane].
Length = 1531
Score = 26.7 bits (59), Expect = 8.7
Identities = 12/63 (19%), Positives = 20/63 (31%)
Query: 124 EEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENK 183
E ++ E E + ++ +E + E K EKRL+ N
Sbjct: 1435 EFEDSIGNQLNAEEHTGNVGEEYVFKAKNPGHYKEGDQPITFEPTEPPKPDPEKRLDSNN 1494
Query: 184 KKK 186
K
Sbjct: 1495 KID 1497
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 26.5 bits (59), Expect = 8.8
Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 15/72 (20%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKER-----------RLP----EEDERGEDEEK 155
EEE+E+E E+E++ +E E E + N ++ LP EE E EE
Sbjct: 18 EEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSFLPDKAREEKEAELREEL 77
Query: 156 KEEEEDKQGEEK 167
+EE KQ K
Sbjct: 78 REEFLKKQEAVK 89
>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306). This
family of proteobacterial species proteins has no known
function.
Length = 115
Score = 25.7 bits (57), Expect = 8.9
Identities = 12/45 (26%), Positives = 15/45 (33%)
Query: 110 REEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEE 154
EE E E E+E E+E EDE +E
Sbjct: 11 LAVRAEEPAEPAETAEEEAAAAAPAPAPEEEEEAELEDEELLEEL 55
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 26.6 bits (58), Expect = 9.0
Identities = 13/48 (27%), Positives = 24/48 (50%)
Query: 123 EEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRVE 170
EEE+ + LE+D+ EK++R E+ R E ++ + V+
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVD 132
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 26.2 bits (58), Expect = 9.1
Identities = 11/39 (28%), Positives = 19/39 (48%)
Query: 148 ERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLEENKKKK 186
ER + ++K+E +++ E K + E KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|225427 COG2872, COG2872, Predicted metal-dependent hydrolases related to
alanyl-tRNA synthetase HxxxH domain [General function
prediction only].
Length = 241
Score = 26.6 bits (59), Expect = 9.4
Identities = 12/57 (21%), Positives = 25/57 (43%)
Query: 113 EEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
+ E+ E+E EE E +E ++E+ +P E+ K + + E ++
Sbjct: 132 DGEDTEDEIEEVEALANELVKENLPVIIYFIPREEAEKLPGLVKLKNKVPPDVEGKI 188
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 26.6 bits (59), Expect = 9.4
Identities = 18/48 (37%), Positives = 25/48 (52%)
Query: 112 EEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEE 159
EEE + E E +E +E E LE++ NE E L DE ++ EE
Sbjct: 73 EEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEE 120
>gnl|CDD|176518 cd08576, GDPD_like_SMaseD_PLD, Glycerophosphodiester
phosphodiesterase-like domain of spider venom
sphingomyelinases D, bacterial phospholipase D, and
similar proteins. This subfamily corresponds to the
glycerophosphodiester phosphodiesterase-like domain
(GDPD-like) present in sphingomyelinases D (SMases D)
(sphingomyelin phosphodiesterase D, EC 3.1.4.4) from
spider venom, the Corynebacterium pseudotuberculosis
Phospholipase D (PLD)-like protein from pathogenic
bacteria, and the Ajellomyces capsulatus H143 PLD-like
protein from ascomycetes. Spider SMases D and bacterial
PLD proteins catalyze the Mg2+-dependent hydrolysis of
sphingomyelin producing choline and ceramide 1-phosphate
(C1P), which possess a number of biological functions,
such as regulating cell proliferation and apoptosis,
participating in inflammatory responses, and playing a
key role in phagocytosis. In the presence of Mg2+,
SMases D can function as lysophospholipase D and
hydrolyze lysophosphatidylcholine (LPC) to choline and
lysophosphatidic acid (LPA), which is a multifunctional
phospholipid involved in platelet aggregation,
endothelial hyperpermeability, and pro-inflammatory
responses. Loxosceles spider venoms' SMases D are the
principal toxins responsible for dermonecrosis and
complement dependent haemolysis induced by spider venom.
Due to amino acid substitutions at the entrance to the
active-site pocket, some members lack activity. The
typical GDPD domain consists of a TIM barrel and a small
insertion domain named as the GDPD-insertion (GDPD-I)
domain, which is specific for GDPD proteins. Although
proteins in this family contain a non-typical GDPD
domain which lacks the GDPD-I, their catalytic
mechanisms are based on Mg2+-dependent acid-base
reactions similar to GDPD proteins. They might be
divergent members of the GDPD family. Moreover, this
family does not belong to phospholipase D (PLD)
superfamily, since it lacks the conserved HKD sequence
motif that characterizes the catalytic center of the PLD
superfamily. It belongs to the superfamily of PLC-like
phosphodiesterases.
Length = 265
Score = 26.5 bits (59), Expect = 9.6
Identities = 11/31 (35%), Positives = 11/31 (35%)
Query: 73 LTLVRVRTNLSGAVLRLKKNILCENQLFKAG 103
T R NL L LK LC AG
Sbjct: 67 GTTPGFRENLIFVWLDLKNPDLCGECSINAG 97
>gnl|CDD|182633 PRK10669, PRK10669, putative cation:proton antiport protein;
Provisional.
Length = 558
Score = 26.6 bits (59), Expect = 9.7
Identities = 20/62 (32%), Positives = 32/62 (51%)
Query: 62 MVAVLVMPLGILTLVRVRTNLSGAVLRLKKNILCENQLFKAGARDETAREEEEEEEEEEE 121
++A L M L +L L+GA+L + N + L + A+ ET E+ EE EEE
Sbjct: 350 ILAGLGMALNLLPQAGQNLVLAGAILSIMLNPVLFTLLERYLAKTETLEEQTLEEAIEEE 409
Query: 122 EE 123
++
Sbjct: 410 KQ 411
>gnl|CDD|224112 COG1191, FliA, DNA-directed RNA polymerase specialized sigma
subunit [Transcription].
Length = 247
Score = 26.5 bits (59), Expect = 9.7
Identities = 23/100 (23%), Positives = 42/100 (42%), Gaps = 6/100 (6%)
Query: 87 LRLKKNILCENQLFKAGARDETAREEEEEEEEEEEEEEE------QEQDEYLEEDENEKE 140
LR ++ L + G R E A +E E+E E +EE +++EY+E
Sbjct: 93 LRKNDSVKVPRSLRELGRRIEEAIDELEQELGREPTDEEIAEELGIDKEEYIEALLAING 152
Query: 141 RRLPEEDERGEDEEKKEEEEDKQGEEKRVEDKIGDEKRLE 180
+L DE ++ + ++ + + VE + E E
Sbjct: 153 SQLLSLDEDVLKDDDDDVDDQIENPDDGVEKEELLEILKE 192
>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
mechanisms].
Length = 733
Score = 26.7 bits (59), Expect = 9.9
Identities = 14/59 (23%), Positives = 27/59 (45%)
Query: 111 EEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERGEDEEKKEEEEDKQGEEKRV 169
+E+ ++ + E + +EDE +R+ + E G EE E ++ E K+V
Sbjct: 385 KEKLKKALGKGREVVENAKFCPKEDEPGLKRKERVDVEDGPQEELIELISEEVKEGKKV 443
>gnl|CDD|220634 pfam10220, DUF2146, Uncharacterized conserved protein (DUF2146).
This is a family of proteins conserved from plants to
humans. In Dictyostelium it is annotated as Mss11p but
this could not be confirmed. Mss11p is required for the
activation of pseudo-hyphal and invasive growth by
Ste12p in yeast.
Length = 890
Score = 26.7 bits (59), Expect = 9.9
Identities = 19/87 (21%), Positives = 33/87 (37%), Gaps = 6/87 (6%)
Query: 109 AREEEEEEEEEEEEEEEQEQDEYLEEDENEKERRLPEEDERG----EDEEKKEEEED--K 162
A++ E+ E+E +EE +QEQ E LE + + E EE D
Sbjct: 559 AKKMEQAEDELADEETDQEQPESLEPQLQGSSTSPSDASDLNFSTASSSEASSEESDNYA 618
Query: 163 QGEEKRVEDKIGDEKRLEENKKKKTQK 189
+ + D+ K E + + +
Sbjct: 619 RPTSRSGTDEEEASKTAREKRPQALAR 645
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.308 0.129 0.346
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,075,149
Number of extensions: 1046012
Number of successful extensions: 19094
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9893
Number of HSP's successfully gapped: 2518
Length of query: 190
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 99
Effective length of database: 6,901,388
Effective search space: 683237412
Effective search space used: 683237412
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 56 (25.2 bits)