RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7729
(167 letters)
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 55.7 bits (135), Expect = 1e-09
Identities = 32/66 (48%), Positives = 51/66 (77%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+K E+ E++ EEEK++ + + +KKEEEEE++K+KKEEE EEEEEE + +K+EEEE
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Query: 146 EEEKEK 151
+++K+
Sbjct: 472 KKKKQA 477
Score = 51.8 bits (125), Expect = 3e-08
Identities = 28/76 (36%), Positives = 57/76 (75%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E L ++ ++ +K E+ E++ E+EK E ++K K++EEE++++K+++E E+EEEE
Sbjct: 400 EFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459
Query: 136 QKNKKKEEEEEEEKEK 151
++ ++++EEEEE+K+K
Sbjct: 460 EEAEEEKEEEEEKKKK 475
Score = 46.5 bits (111), Expect = 2e-06
Identities = 26/70 (37%), Positives = 49/70 (70%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEE + ++ ++ K+ E EKKRE++++E++K+ +++ EEEEEE++ K++E+
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Query: 144 EEEEEKEKVE 153
EEEEE+ + E
Sbjct: 456 EEEEEEAEEE 465
Score = 46.5 bits (111), Expect = 2e-06
Identities = 23/65 (35%), Positives = 50/65 (76%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
+ +E E + EE++KE++++ +K+++E+EE+K +K+EE+EE++++ +EE+ EEEE+
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
Query: 135 EQKNK 139
++K
Sbjct: 473 KKKQA 477
Score = 44.9 bits (107), Expect = 5e-06
Identities = 25/66 (37%), Positives = 48/66 (72%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
+K ++ ++ EK + + EE+K+EKK++ +K ++EEE E+E++E++ +++EEE EE
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Query: 148 EKEKVE 153
EKE+ E
Sbjct: 465 EKEEEE 470
Score = 44.9 bits (107), Expect = 5e-06
Identities = 28/79 (35%), Positives = 52/79 (65%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
+ LE E E E ++ ++ ++ + +++++ EKKE++++ KK+EE EEEE+
Sbjct: 389 AAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEK 448
Query: 135 EQKNKKKEEEEEEEKEKVE 153
E+K ++KEEEEEE +E+ E
Sbjct: 449 EKKEEEKEEEEEEAEEEKE 467
Score = 42.2 bits (100), Expect = 5e-05
Identities = 19/66 (28%), Positives = 47/66 (71%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
K+ ++ ++ ++ ++ E+++++K+++ K K+EEE EE+E++++ K++EEEE E
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 148 EKEKVE 153
E+++ E
Sbjct: 464 EEKEEE 469
Score = 40.3 bits (95), Expect = 2e-04
Identities = 16/59 (27%), Positives = 42/59 (71%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
+ I + E ++E+K+K +++EEE+E++++++++ +++EEEE +++ ++EEE
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Score = 39.5 bits (93), Expect = 3e-04
Identities = 20/55 (36%), Positives = 42/55 (76%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
A + E + E+++K +++EEEEE+EK++ EE+K E++EE EE++++++E++
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 35.3 bits (82), Expect = 0.011
Identities = 22/64 (34%), Positives = 38/64 (59%), Gaps = 5/64 (7%)
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
EEE E K+ + +K EK E++ E++K +K+++ + KK+EEEEEE+++
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGK-----KKEEEEEEEKEK 450
Query: 151 KVEM 154
K E
Sbjct: 451 KEEE 454
Score = 32.6 bits (75), Expect = 0.070
Identities = 22/45 (48%), Positives = 32/45 (71%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
A + E EEEEKEK+EEE+EEEEE+ ++E EE++ +KK+
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 50.0 bits (119), Expect = 1e-07
Identities = 27/77 (35%), Positives = 45/77 (58%), Gaps = 2/77 (2%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK--DKKEEEVEEEEEEQ 136
E + + + ++E E+E +++ E K+ EK + D +EEE EEEEEE+
Sbjct: 816 EGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEE 875
Query: 137 KNKKKEEEEEEEKEKVE 153
+ +++EEEEEEE+E E
Sbjct: 876 EEEEEEEEEEEEEENEE 892
Score = 48.8 bits (116), Expect = 3e-07
Identities = 28/88 (31%), Positives = 53/88 (60%), Gaps = 4/88 (4%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE----EKQKDKK 125
+ +E E +E E + + + ++ E ++E E++ + E + E K++E+ D
Sbjct: 802 EHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGG 861
Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ E EEEEEE++ +++EEEEEEE+E+ E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 46.9 bits (111), Expect = 1e-06
Identities = 31/80 (38%), Positives = 46/80 (57%), Gaps = 5/80 (6%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
A +T +E E+E E + E +++EK D EEEEE++++++EEE EEEE
Sbjct: 823 ADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEE 882
Query: 134 EEQKNKKKEEEEEEEKEKVE 153
EE+ EEEE EE +E
Sbjct: 883 EEE-----EEEENEEPLSLE 897
Score = 43.8 bits (103), Expect = 2e-05
Identities = 25/84 (29%), Positives = 50/84 (59%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
ET E E + E +++E+ + D + ++ E++EEEEE++++++EEE EEEEE
Sbjct: 831 ETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEEN 890
Query: 136 QKNKKKEEEEEEEKEKVEMYRNPV 159
++ E E +K+ + ++ P+
Sbjct: 891 EEPLSLEWPETRQKQAIYLFLLPI 914
Score = 38.8 bits (90), Expect = 7e-04
Identities = 23/73 (31%), Positives = 38/73 (52%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E E E + K E E E E E+ ++E + E + +E + + + + EEVE E E + +
Sbjct: 668 EQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTE 727
Query: 142 EEEEEEEKEKVEM 154
+E E E E+ E
Sbjct: 728 DEGEIETGEEGEE 740
Score = 38.8 bits (90), Expect = 8e-04
Identities = 25/79 (31%), Positives = 43/79 (54%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
+E E E E + KE + + E E +E + + E + E E+E E + ++ EEVE+E E
Sbjct: 687 AERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGE 746
Query: 135 EQKNKKKEEEEEEEKEKVE 153
+ K E E E ++++ E
Sbjct: 747 GEAEGKHEVETEGDRKETE 765
Score = 38.8 bits (90), Expect = 8e-04
Identities = 27/80 (33%), Positives = 43/80 (53%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
+E +E+ E E ++E E E EE E+ EDE + + + E E + D+KE E E E E
Sbjct: 712 AEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETE 771
Query: 135 EQKNKKKEEEEEEEKEKVEM 154
+ + ++E E + E EM
Sbjct: 772 AEGKEDEDEGEIQAGEDGEM 791
Score = 38.4 bits (89), Expect = 9e-04
Identities = 21/71 (29%), Positives = 43/71 (60%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
Q ++A E + E+ + + EE+E++E+EE++ E++EEEEE+++++ EE +
Sbjct: 835 QELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPL 894
Query: 130 EEEEEEQKNKK 140
E E + K+
Sbjct: 895 SLEWPETRQKQ 905
Score = 38.4 bits (89), Expect = 0.001
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 6/82 (7%)
Query: 75 SETLENNEWEEEEKEKEEE------EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
S E E E K + E E + E+E E + + ++ K E E E+ + + E E
Sbjct: 663 SGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETE 722
Query: 129 VEEEEEEQKNKKKEEEEEEEKE 150
E E+E + + EE EE E E
Sbjct: 723 AEGTEDEGEIETGEEGEEVEDE 744
Score = 38.4 bits (89), Expect = 0.001
Identities = 21/72 (29%), Positives = 38/72 (52%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EN E E E+E E E + E + + E +++ ++E E E + + + + E E EE ++
Sbjct: 658 ENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEH 717
Query: 139 KKKEEEEEEEKE 150
+ + E E E E
Sbjct: 718 EGETEAEGTEDE 729
Score = 37.7 bits (87), Expect = 0.002
Identities = 20/67 (29%), Positives = 34/67 (50%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
E+ EE E E E E E+ + E++ E E K +++ E E+ E + ++ + E E +
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAK 702
Query: 147 EEKEKVE 153
E K E
Sbjct: 703 EADHKGE 709
Score = 37.3 bits (86), Expect = 0.002
Identities = 22/72 (30%), Positives = 37/72 (51%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E E E KE + + E E EE E + + E + + E E E ++ +E E E E E + +
Sbjct: 696 EGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEV 755
Query: 142 EEEEEEEKEKVE 153
E E + ++ + E
Sbjct: 756 ETEGDRKETEHE 767
Score = 36.9 bits (85), Expect = 0.003
Identities = 27/73 (36%), Positives = 36/73 (49%), Gaps = 3/73 (4%)
Query: 84 EEEEKEKEEEEEEEEE---EKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
EE E+ E E E EE E E++ + E K E + E E + K E+E E E E ++
Sbjct: 647 EEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADH 706
Query: 141 KEEEEEEEKEKVE 153
K E E EE E
Sbjct: 707 KGETEAEEVEHEG 719
Score = 35.7 bits (82), Expect = 0.007
Identities = 28/80 (35%), Positives = 41/80 (51%), Gaps = 1/80 (1%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKD-EDEEKKREKKEEEEEKQKDKKEEEVE 130
I A E E E EE E E E E E E E + E E+ E ++E E + + K E E E
Sbjct: 699 IEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETE 758
Query: 131 EEEEEQKNKKKEEEEEEEKE 150
+ +E +++ + E E +E E
Sbjct: 759 GDRKETEHEGETEAEGKEDE 778
Score = 35.7 bits (82), Expect = 0.008
Identities = 20/69 (28%), Positives = 32/69 (46%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E E+ EE E E E + EE E ++E E + K + E E E E + ++ E E
Sbjct: 639 EHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGE 698
Query: 145 EEEEKEKVE 153
E ++ +
Sbjct: 699 IEAKEADHK 707
Score = 35.4 bits (81), Expect = 0.009
Identities = 24/81 (29%), Positives = 38/81 (46%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
E + E E EE E E E E E EDE + +E EE + + + E E + E
Sbjct: 695 GEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHE 754
Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
E + ++K+ E E E + + +
Sbjct: 755 VETEGDRKETEHEGETEAEGK 775
Score = 35.4 bits (81), Expect = 0.010
Identities = 30/85 (35%), Positives = 45/85 (52%), Gaps = 3/85 (3%)
Query: 75 SETLENNEWEEE---EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
+ET NE E E E++ E+E E E E KE D E + E+ E E E + + E+E E
Sbjct: 673 TETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEI 732
Query: 132 EEEEQKNKKKEEEEEEEKEKVEMYR 156
E E+ + ++E E E + K E+
Sbjct: 733 ETGEEGEEVEDEGEGEAEGKHEVET 757
Score = 35.0 bits (80), Expect = 0.014
Identities = 25/82 (30%), Positives = 37/82 (45%), Gaps = 1/82 (1%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE-EEEEKQKDKKEEE 128
+A E E E E EE E +++ E E K + E E ++K ++E E
Sbjct: 637 EAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGE 696
Query: 129 VEEEEEEQKNKKKEEEEEEEKE 150
E E +E +K + E EE E E
Sbjct: 697 GEIEAKEADHKGETEAEEVEHE 718
Score = 34.2 bits (78), Expect = 0.030
Identities = 22/70 (31%), Positives = 36/70 (51%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E +E E E E E E E + + E E +E E+E + + + + E E ++K + E
Sbjct: 709 ETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEG 768
Query: 144 EEEEEKEKVE 153
E E E ++ E
Sbjct: 769 ETEAEGKEDE 778
Score = 33.8 bits (77), Expect = 0.035
Identities = 22/66 (33%), Positives = 36/66 (54%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E +E EE E+E E E E + E + ++KE E E + + + +E E+E E Q + E +
Sbjct: 733 ETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMK 792
Query: 145 EEEEKE 150
+E E
Sbjct: 793 GDEGAE 798
Score = 33.0 bits (75), Expect = 0.067
Identities = 27/82 (32%), Positives = 40/82 (48%), Gaps = 10/82 (12%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDE---DEEKKREKKEEEEEKQKDKKEEEVEEEE----- 133
E EE +E E+E E E E K + E D ++ + E E E ++D+ E E++ E
Sbjct: 733 ETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMK 792
Query: 134 --EEQKNKKKEEEEEEEKEKVE 153
E + K + E E E EK E
Sbjct: 793 GDEGAEGKVEHEGETEAGEKDE 814
Score = 31.9 bits (72), Expect = 0.14
Identities = 27/109 (24%), Positives = 47/109 (43%), Gaps = 32/109 (29%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE------------- 131
E+ E + +E E + + + E E ++++ E + E Q D E + E
Sbjct: 786 GEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEA 845
Query: 132 -------------------EEEEQKNKKKEEEEEEEKEKVEMYRNPVPM 161
EEEE++ +++EEEEEEE+E+ E N P+
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPL 894
Score = 31.1 bits (70), Expect = 0.26
Identities = 22/78 (28%), Positives = 37/78 (47%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
T E EE + E+E E E + E + + E E+K E+E + + + +E + E E
Sbjct: 652 PTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETE 711
Query: 136 QKNKKKEEEEEEEKEKVE 153
+ + E E E E + E
Sbjct: 712 AEEVEHEGETEAEGTEDE 729
Score = 30.7 bits (69), Expect = 0.35
Identities = 23/78 (29%), Positives = 36/78 (46%), Gaps = 3/78 (3%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEE---KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
E E E E+ E E+E E E + + E E K E+E E + + KE + + E
Sbjct: 650 ERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGE 709
Query: 133 EEEQKNKKKEEEEEEEKE 150
E ++ + + E E E E
Sbjct: 710 TEAEEVEHEGETEAEGTE 727
Score = 29.6 bits (66), Expect = 0.85
Identities = 19/88 (21%), Positives = 40/88 (45%), Gaps = 4/88 (4%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
QA E + E + + + E E E++E E + + + ++E +Q+ E +
Sbjct: 784 QAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQG 843
Query: 130 EEEEEEQKNKKKEE----EEEEEKEKVE 153
E +++E+ + EEE+E+ E
Sbjct: 844 EAKQDEKGVDGGGGSDGGDSEEEEEEEE 871
Score = 27.3 bits (60), Expect = 4.9
Identities = 15/59 (25%), Positives = 26/59 (44%)
Query: 92 EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
+ + E E + EE +R + E E ++ E E E E E + + E E E++
Sbjct: 632 KGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERK 690
Score = 26.5 bits (58), Expect = 8.1
Identities = 17/64 (26%), Positives = 30/64 (46%), Gaps = 1/64 (1%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
K + E E + E+ E+ E + E E+ + E+E E E + +N+ + E E K
Sbjct: 632 KGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQE-GETETKGENESEGEIPAERK 690
Query: 150 EKVE 153
+ E
Sbjct: 691 GEQE 694
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 45.5 bits (108), Expect = 2e-06
Identities = 23/79 (29%), Positives = 46/79 (58%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+A E ++E E EE +EEE+E ++ E+K +KE+EE ++++++E+E
Sbjct: 42 DQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101
Query: 130 EEEEEEQKNKKKEEEEEEE 148
+E E++ ++K E E+
Sbjct: 102 SSDENEKETEEKTESNVEK 120
Score = 42.4 bits (100), Expect = 2e-05
Identities = 25/79 (31%), Positives = 42/79 (53%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
A ++ + E EE ++EE+E E+KE D EK+ E+ EEE E++ ++ +E
Sbjct: 46 ADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDE 105
Query: 129 VEEEEEEQKNKKKEEEEEE 147
E+E EE+ E+E
Sbjct: 106 NEKETEEKTESNVEKEITN 124
Score = 41.6 bits (98), Expect = 5e-05
Identities = 23/88 (26%), Positives = 51/88 (57%)
Query: 66 VILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
V++ I A + + ++ E+E ++ ++++ + +E K+ EK+ E ++DK
Sbjct: 23 VVVLIIIVAYQLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKG 82
Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ E E+EE E++N++++EE +E EK
Sbjct: 83 DAEKEDEESEEENEEEDEESSDENEKET 110
Score = 38.9 bits (91), Expect = 4e-04
Identities = 25/80 (31%), Positives = 43/80 (53%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
Q S+ E E EE ++E++E E++E + D ++E + ++E EEE ++ E E
Sbjct: 49 QEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEK 108
Query: 130 EEEEEEQKNKKKEEEEEEEK 149
E EE+ + N +KE K
Sbjct: 109 ETEEKTESNVEKEITNPSWK 128
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 44.6 bits (106), Expect = 7e-06
Identities = 19/68 (27%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD---KKEEEVEEEEEEQKNKKKEE 143
++ E EE EEE + E + E K K+E+E ++D EE+ + ++ E +K +
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286
Query: 144 EEEEEKEK 151
+E++++E
Sbjct: 287 KEKKDEEL 294
Score = 40.8 bits (96), Expect = 2e-04
Identities = 22/80 (27%), Positives = 39/80 (48%), Gaps = 1/80 (1%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
K+ +E E EE E+E D + E E K ++E Q+ EE+ EE+++ K E+ ++
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQE-QEGSTEEDPSLFSEEKEDPDKTEDLDK 282
Query: 148 EKEKVEMYRNPVPMYRNPVP 167
+ E + + P P
Sbjct: 283 LEILKEKKDEELFWFEKPWP 302
Score = 40.4 bits (95), Expect = 2e-04
Identities = 20/71 (28%), Positives = 35/71 (49%), Gaps = 4/71 (5%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
ET E E EEE + E E + K++ E ++ EE++ DK E+ ++
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKT----EDLDK 282
Query: 135 EQKNKKKEEEE 145
+ K+K++EE
Sbjct: 283 LEILKEKKDEE 293
Score = 38.4 bits (90), Expect = 9e-04
Identities = 14/66 (21%), Positives = 32/66 (48%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
SET E E + E E E + ++E+E +E+ +E+E+ + + ++ +E
Sbjct: 229 TSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKE 288
Query: 134 EEQKNK 139
++ +
Sbjct: 289 KKDEEL 294
Score = 29.6 bits (67), Expect = 0.75
Identities = 17/57 (29%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 63 RLTVILPQAISASETLENN----EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
R+ L + SE ++ W E++KE + + EE+ K++++ E+ KRE+KE
Sbjct: 622 RIEKTLFFSFDISELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
Score = 28.4 bits (64), Expect = 1.9
Identities = 15/48 (31%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 73 SASETLENNEWEEEEKEKEEE-EEEEEEEKEKDEDEEKKREKKEEEEE 119
+ SET + +E E++ EE+E+ +K ED +K KE+++E
Sbjct: 245 TTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292
Score = 27.3 bits (61), Expect = 5.4
Identities = 13/33 (39%), Positives = 23/33 (69%), Gaps = 1/33 (3%)
Query: 122 KDKKEEEVEEEEEEQKNK-KKEEEEEEEKEKVE 153
K K+ + ++ EE+ K + KKE+ + EEKE++E
Sbjct: 649 KKKEFKILDYTEEKTKKEEKKEKNKREEKERIE 681
Score = 26.5 bits (59), Expect = 8.7
Identities = 9/50 (18%), Positives = 26/50 (52%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
I + + + E+E +E+ EE+++ D+ E+ + + +E++ +
Sbjct: 243 IETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292
Score = 26.5 bits (59), Expect = 9.5
Identities = 12/31 (38%), Positives = 23/31 (74%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
EK+KE + + EEK K E++++K +++E+E
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 44.4 bits (105), Expect = 9e-06
Identities = 31/71 (43%), Positives = 48/71 (67%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
E+E E++ DE+EE+++E+K+EEEEK DK+EE EEEE+E+K KK ++ +E E +
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELL 87
Query: 155 YRNPVPMYRNP 165
+ RNP
Sbjct: 88 NKTKPIWTRNP 98
Score = 40.1 bits (94), Expect = 2e-04
Identities = 25/58 (43%), Positives = 40/58 (68%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
E +EE+E+E+EE++EEEEK D++EE E+++EE++K+ K +E E E K K
Sbjct: 34 EVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTK 91
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 44.1 bits (105), Expect = 1e-05
Identities = 20/70 (28%), Positives = 28/70 (40%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEEE KEE + E EEE + + +E E +EE + + E E
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARP 98
Query: 144 EEEEEKEKVE 153
E+E E
Sbjct: 99 AEDEAARPAE 108
Score = 41.8 bits (99), Expect = 6e-05
Identities = 22/88 (25%), Positives = 35/88 (39%), Gaps = 3/88 (3%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
S S T+E EE KE+ + E EEE + E +E + E++ + E EE E
Sbjct: 33 SHSSTVEE---EEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAA 89
Query: 133 EEEQKNKKKEEEEEEEKEKVEMYRNPVP 160
+ + E+E + R
Sbjct: 90 AAAEAAARPAEDEAARPAEAAARRPKAK 117
Score = 36.0 bits (84), Expect = 0.006
Identities = 14/63 (22%), Positives = 26/63 (41%)
Query: 105 EDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYRN 164
E+EE ++E+ + E E++ + EE E E++ K + +E E R
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARP 98
Query: 165 PVP 167
Sbjct: 99 AED 101
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 44.2 bits (105), Expect = 1e-05
Identities = 27/75 (36%), Positives = 49/75 (65%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
+ E+ E EE+ + E+EE+++++ +E++ED + E+++EE+E D+ +EE EEEE+
Sbjct: 317 GQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEK 376
Query: 135 EQKNKKKEEEEEEEK 149
E+K KK E E
Sbjct: 377 EKKKKKSAESTRSEL 391
Score = 43.1 bits (102), Expect = 2e-05
Identities = 25/72 (34%), Positives = 45/72 (62%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEEE ++E+EE++++ ++E+E+ +EE+EE + E++ EEEEEE++ KKK+
Sbjct: 324 EEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKS 383
Query: 144 EEEEEKEKVEMY 155
E E +
Sbjct: 384 AESTRSELPFTF 395
Score = 42.7 bits (101), Expect = 3e-05
Identities = 23/80 (28%), Positives = 50/80 (62%), Gaps = 5/80 (6%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
++N + +E EEEEE+ +++++++D++ E++E+ + +++ EE+ + ++E
Sbjct: 307 EPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDE 366
Query: 135 EQKNKKKEEEEEEEKEKVEM 154
+ EEEEEEEKEK +
Sbjct: 367 DD-----EEEEEEEKEKKKK 381
Score = 34.2 bits (79), Expect = 0.026
Identities = 19/70 (27%), Positives = 41/70 (58%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEE+ E+EE+ +E ++ + + + + + E+ ++++E+ V++E+EE + EE
Sbjct: 285 EEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEE 344
Query: 144 EEEEEKEKVE 153
EEE+ E
Sbjct: 345 EEEDVDLSDE 354
Score = 33.8 bits (78), Expect = 0.028
Identities = 19/70 (27%), Positives = 48/70 (68%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+EEE++ +E ++ ++E E D+D+ + EE+EE+++D ++E EE++++ +++E+
Sbjct: 289 DEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEED 348
Query: 144 EEEEEKEKVE 153
+ ++E+ E
Sbjct: 349 VDLSDEEEDE 358
Score = 33.4 bits (77), Expect = 0.040
Identities = 20/74 (27%), Positives = 41/74 (55%), Gaps = 11/74 (14%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE-----------Q 136
K +EE +EE E +K E E +R + EEE++++++ +E ++ ++E
Sbjct: 258 KTEEELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLG 317
Query: 137 KNKKKEEEEEEEKE 150
+ ++ EEEEE+ +
Sbjct: 318 QGEEDEEEEEDGVD 331
Score = 33.4 bits (77), Expect = 0.048
Identities = 17/69 (24%), Positives = 43/69 (62%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
++E E ++++ + E+DE+EE+ E+EE+ D +EEE + + +++ +++E+
Sbjct: 303 DDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDED 362
Query: 145 EEEEKEKVE 153
++E ++ E
Sbjct: 363 SDDEDDEEE 371
Score = 29.6 bits (67), Expect = 0.81
Identities = 21/62 (33%), Positives = 38/62 (61%), Gaps = 2/62 (3%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
E +++ EEEE +EEE+EE+++ D+++++ E++EEE+EK+K K E
Sbjct: 331 DDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDE--EEEEEEKEKKKKKSAESTR 388
Query: 131 EE 132
E
Sbjct: 389 SE 390
Score = 29.6 bits (67), Expect = 0.89
Identities = 24/79 (30%), Positives = 42/79 (53%), Gaps = 9/79 (11%)
Query: 84 EEEEKEKEEEEE------EEEEEKEKDEDEEKKREKKEEEEEKQKDKK---EEEVEEEEE 134
EE E+ K+ E E EEE+ E++ED ++ + ++E E D + E+EEE
Sbjct: 266 EEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEE 325
Query: 135 EQKNKKKEEEEEEEKEKVE 153
E+ E+EE+++ + E
Sbjct: 326 EEDGVDDEDEEDDDDDLEE 344
Score = 28.8 bits (65), Expect = 1.7
Identities = 17/83 (20%), Positives = 43/83 (51%), Gaps = 2/83 (2%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE--E 127
Q++S + ++ +++ + + + ++ E E +EE+ E++K KKE +
Sbjct: 121 QSLSEIDKDDDVRDDDDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMK 180
Query: 128 EVEEEEEEQKNKKKEEEEEEEKE 150
EV + + K ++++ +EE+E
Sbjct: 181 EVIAKSKFYKAERQKAKEEDEDL 203
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 43.3 bits (102), Expect = 2e-05
Identities = 21/77 (27%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 63 RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEK---EKDED-----EEKKREKK 114
RL L + L+ ++W E+E E E E +EE ++ K + ++ K K
Sbjct: 532 RLCWSLRNSTEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKL 591
Query: 115 EEEEEKQKDKKEEEVEE 131
EEEE K + E
Sbjct: 592 TEEEEAALKMKMTDTSE 608
Score = 40.6 bits (95), Expect = 1e-04
Identities = 18/75 (24%), Positives = 34/75 (45%), Gaps = 16/75 (21%)
Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEE---------------EQKNKKKEEEEEEEKEKV 152
E++ + ++ + +D+ E E +EE + +Q K EEEE K+
Sbjct: 542 EERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAALKM 601
Query: 153 EMY-RNPVPMYRNPV 166
+M + P++RN V
Sbjct: 602 KMTDTSEKPLWRNVV 616
Score = 29.8 bits (67), Expect = 0.74
Identities = 8/39 (20%), Positives = 17/39 (43%), Gaps = 7/39 (17%)
Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
R EE + + ++ E++ + E +EE +K
Sbjct: 538 RNSTEERIDL-------DADDWTEDEDENEMETDEERKK 569
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 43.0 bits (101), Expect = 3e-05
Identities = 36/73 (49%), Positives = 46/73 (63%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E EEEE +EEEEEEEEEE+E+ E EE++ E +EEEEE + D EE E E +
Sbjct: 438 ESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGE 497
Query: 142 EEEEEEEKEKVEM 154
E EE+ E+ EM
Sbjct: 498 EPEEDAERRNSEM 510
Score = 42.6 bits (100), Expect = 4e-05
Identities = 29/86 (33%), Positives = 46/86 (53%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
+ EEE+ EEEEEEEEEE+E++++ E++ + EEEEE+ + E E E + +
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDG 496
Query: 142 EEEEEEEKEKVEMYRNPVPMYRNPVP 167
EE EE+ + + M P
Sbjct: 497 EEPEEDAERRNSEMAGISRMSEGQQP 522
Score = 39.1 bits (91), Expect = 5e-04
Identities = 26/73 (35%), Positives = 45/73 (61%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E++E E EEEE+E+EE+E EEEE ++++E+EE + + EEE + + + + EE EE+
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEED 502
Query: 136 QKNKKKEEEEEEE 148
+ + E
Sbjct: 503 AERRNSEMAGISR 515
Score = 38.4 bits (89), Expect = 0.001
Identities = 23/67 (34%), Positives = 38/67 (56%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
+ +E E+E+ +EE++ E++EEEEE++ +++E E EEEEEE +
Sbjct: 421 PSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADN 480
Query: 141 KEEEEEE 147
EEE E
Sbjct: 481 GSEEEME 487
Score = 37.2 bits (86), Expect = 0.002
Identities = 33/92 (35%), Positives = 49/92 (53%), Gaps = 6/92 (6%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
P S E + EEEE+E+EEEEEE+E E+E+ EDEE EEEE + + EEE
Sbjct: 432 PSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEE------EEEEVEADNGSEEE 485
Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVP 160
+E E + ++ EE+ E + + +
Sbjct: 486 MEGSSEGDGDGEEPEEDAERRNSEMAGISRMS 517
Score = 36.4 bits (84), Expect = 0.005
Identities = 27/76 (35%), Positives = 41/76 (53%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
+ E E E EEEE+E+E EEEE E+E+E++E E ++E E + D EE EE+
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEED 502
Query: 133 EEEQKNKKKEEEEEEE 148
E + ++ E
Sbjct: 503 AERRNSEMAGISRMSE 518
Score = 36.0 bits (83), Expect = 0.007
Identities = 26/79 (32%), Positives = 38/79 (48%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E E E EEEE+E EEEE E+EEE+E+ E + E+ E E D +E E + E
Sbjct: 447 EEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERR 506
Query: 135 EQKNKKKEEEEEEEKEKVE 153
+ E ++ +
Sbjct: 507 NSEMAGISRMSEGQQPRGS 525
Score = 35.7 bits (82), Expect = 0.008
Identities = 28/77 (36%), Positives = 44/77 (57%), Gaps = 7/77 (9%)
Query: 84 EEEEKEKEEEEEEE-------EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
EEEE+ K +E E + + K E +E EE++ ++EEE EEEEEE+
Sbjct: 399 EEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEE 458
Query: 137 KNKKKEEEEEEEKEKVE 153
+ + +EEE E+E+E+ E
Sbjct: 459 EQESEEEEGEDEEEEEE 475
Score = 31.4 bits (71), Expect = 0.23
Identities = 22/74 (29%), Positives = 36/74 (48%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E E E+E +E+E E+EEEEEE E D E++ E E + ++ +E+ E
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGI 513
Query: 139 KKKEEEEEEEKEKV 152
+ E ++ V
Sbjct: 514 SRMSEGQQPRGSSV 527
Score = 29.5 bits (66), Expect = 0.86
Identities = 19/79 (24%), Positives = 37/79 (46%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E E E +E E+E+ E+EEEEEE + + EE+ E + + ++ +++ E E
Sbjct: 451 EEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEM 510
Query: 135 EQKNKKKEEEEEEEKEKVE 153
++ E ++
Sbjct: 511 AGISRMSEGQQPRGSSVQP 529
Score = 29.1 bits (65), Expect = 1.4
Identities = 17/81 (20%), Positives = 36/81 (44%)
Query: 68 LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
+ SE E + EEEE+ + + EEE E + D + + +++ E +
Sbjct: 455 EEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGIS 514
Query: 128 EVEEEEEEQKNKKKEEEEEEE 148
+ E ++ + + + E +EE
Sbjct: 515 RMSEGQQPRGSSVQPESPQEE 535
Score = 28.7 bits (64), Expect = 1.9
Identities = 19/69 (27%), Positives = 37/69 (53%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+++ E+EE + +E E++ K + +E EEEE ++ +++EEE
Sbjct: 395 QDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEE 454
Query: 145 EEEEKEKVE 153
EEEE+++ E
Sbjct: 455 EEEEEQESE 463
Score = 28.3 bits (63), Expect = 2.4
Identities = 25/83 (30%), Positives = 47/83 (56%)
Query: 68 LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
L + IS +++ EEE ++++E E + + D + + Q+ ++EE
Sbjct: 384 LEEVISKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEE 443
Query: 128 EVEEEEEEQKNKKKEEEEEEEKE 150
VEEEEEE++ +++EE+E EE+E
Sbjct: 444 SVEEEEEEEEEEEEEEQESEEEE 466
Score = 27.2 bits (60), Expect = 5.4
Identities = 23/82 (28%), Positives = 37/82 (45%), Gaps = 4/82 (4%)
Query: 73 SASETLENNEWEEEEKEKEEEEEE----EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
E + +E EE E E+EEEE E EEE E + + E+ EE+ E++ +
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGI 513
Query: 129 VEEEEEEQKNKKKEEEEEEEKE 150
E +Q + E ++E
Sbjct: 514 SRMSEGQQPRGSSVQPESPQEE 535
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 42.2 bits (100), Expect = 4e-05
Identities = 23/90 (25%), Positives = 49/90 (54%)
Query: 62 ARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
L +L + + L N + E K ++ EEEEEK EE+++E+ +E++E++
Sbjct: 232 EDLLPLLNLVLYLVDKLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEK 291
Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
K ++ E + ++ +K EE+E +++ +
Sbjct: 292 KKEEREAKLAKLSPEEQRKLEEKERKKQAR 321
Score = 36.1 bits (84), Expect = 0.005
Identities = 16/56 (28%), Positives = 32/56 (57%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
+ E +EE +EK+EE+++EE E + + +++ K EE+E K++ +K
Sbjct: 267 EEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 34.2 bits (79), Expect = 0.020
Identities = 16/48 (33%), Positives = 30/48 (62%)
Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
E ++ K EEE++K K E E +EE Q+ K+++++EE E + ++
Sbjct: 256 EVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKL 303
Score = 31.5 bits (72), Expect = 0.17
Identities = 16/45 (35%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 110 KREKKEEEEEKQKDKKEEEVEEE-EEEQKNKKKEEEEEEEKEKVE 153
+ ++EEEE+ K +EE EE E++++ KK+E E + K E
Sbjct: 262 DKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPE 306
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 42.1 bits (99), Expect = 4e-05
Identities = 19/78 (24%), Positives = 49/78 (62%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E +++N +K +E +E ++ ++ + + +++ + EE + KEE++++EE E
Sbjct: 354 EEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENE 413
Query: 136 QKNKKKEEEEEEEKEKVE 153
+K K++ +E++E+++K E
Sbjct: 414 KKQKEQADEDKEKRQKDE 431
Score = 41.3 bits (97), Expect = 8e-05
Identities = 18/82 (21%), Positives = 51/82 (62%), Gaps = 1/82 (1%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ + ++ L ++ +E KE ++ ++ ++ + +DE ++ E + K++ K+EE
Sbjct: 354 EEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDE-TDASEEAEAKAKEEKLKQEEN 412
Query: 130 EEEEEEQKNKKKEEEEEEEKEK 151
E++++EQ ++ KE+ +++E++K
Sbjct: 413 EKKQKEQADEDKEKRQKDERKK 434
Score = 33.2 bits (76), Expect = 0.053
Identities = 21/92 (22%), Positives = 49/92 (53%), Gaps = 8/92 (8%)
Query: 70 QAISASETLENNEW--------EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
+AI+ + L++N+ EE K + ++ +E K+ +++ + K+ E K
Sbjct: 331 EAINIARNLDDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKD 390
Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ EE E + +E+K K++E E++++++ E
Sbjct: 391 ETDASEEAEAKAKEEKLKQEENEKKQKEQADE 422
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 41.6 bits (98), Expect = 4e-05
Identities = 22/70 (31%), Positives = 51/70 (72%), Gaps = 2/70 (2%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E +K+++E EE EK K E EEK++ K ++++ K+K K+++ ++++++ +K +++
Sbjct: 56 EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKD--KKDDKKDDKSEKK 113
Query: 144 EEEEEKEKVE 153
+E+E ++K+E
Sbjct: 114 DEKEAEDKLE 123
Score = 37.7 bits (88), Expect = 8e-04
Identities = 21/69 (30%), Positives = 44/69 (63%), Gaps = 1/69 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE EK K+E EE+++ + +K + ++KK + K+++++K+ DK E++ +E+E E K + +
Sbjct: 69 EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK-DEKEAEDKLEDLTK 127
Query: 144 EEEEEKEKV 152
E +
Sbjct: 128 SYSETLSTL 136
Score = 36.2 bits (84), Expect = 0.003
Identities = 15/58 (25%), Positives = 37/58 (63%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
W++++ +K++++++++++ +KD+ EKK EK+ E++ + K E E K +K
Sbjct: 86 WKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRK 143
Score = 35.8 bits (83), Expect = 0.003
Identities = 18/67 (26%), Positives = 43/67 (64%), Gaps = 6/67 (8%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQ------KDKKEEEVEEEEEEQKNKKKEEEEE 146
E E ++K+K+ EE ++ KKE EE+++ K KK+++ ++++++ K K E+++
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114
Query: 147 EEKEKVE 153
E++ + +
Sbjct: 115 EKEAEDK 121
Score = 35.1 bits (81), Expect = 0.009
Identities = 20/84 (23%), Positives = 52/84 (61%), Gaps = 1/84 (1%)
Query: 82 EWEEEEKEKEEEE-EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E+EE++K K +++ +++++K+KD+ ++KK +K E+++EK+ + K E++ + E +
Sbjct: 77 EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTL 136
Query: 141 KEEEEEEEKEKVEMYRNPVPMYRN 164
E + + ++Y++ + R
Sbjct: 137 SELKPRKYALHKDIYQSRLDRKRR 160
Score = 27.7 bits (62), Expect = 2.2
Identities = 9/56 (16%), Positives = 33/56 (58%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
+ +++K+K++++++++++ + ++ +EK+ E K E+ K + + E +
Sbjct: 85 KWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 41.3 bits (97), Expect = 5e-05
Identities = 21/63 (33%), Positives = 39/63 (61%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + ++EE+E EEE +E E+++ E + + +EKK+ E K K +KE+ E ++ + +
Sbjct: 90 ELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPS 149
Query: 139 KKK 141
K K
Sbjct: 150 KPK 152
Score = 38.2 bits (89), Expect = 5e-04
Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 5/83 (6%)
Query: 48 QVGP--AQYHKPIR---ANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKE 102
QVGP QY +R A ++ L + E E +E E+E E + E +EK+
Sbjct: 66 QVGPMVQQYGPLVRNLPAMWKIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKK 125
Query: 103 KDEDEEKKREKKEEEEEKQKDKK 125
K E + K EK++ + E +K K
Sbjct: 126 KREVPKPKTEKEKPKTEPKKPKP 148
Score = 35.9 bits (83), Expect = 0.003
Identities = 17/64 (26%), Positives = 33/64 (51%), Gaps = 9/64 (14%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
+EEEE EEE + E ++E+ + K + +E+ + E + K +K++ + E
Sbjct: 93 SSDDEEEETEEESTD---------ETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEP 143
Query: 148 EKEK 151
+K K
Sbjct: 144 KKPK 147
Score = 35.5 bits (82), Expect = 0.004
Identities = 14/52 (26%), Positives = 29/52 (55%)
Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
DE+EE + E +E E++ + + E +E+++ + K K E+E+ + E
Sbjct: 93 SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144
Score = 29.7 bits (67), Expect = 0.42
Identities = 11/40 (27%), Positives = 23/40 (57%)
Query: 114 KEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+EEEE +++ +E +E+ E K + KE+++ E +
Sbjct: 95 DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKT 134
Score = 29.0 bits (65), Expect = 0.70
Identities = 11/41 (26%), Positives = 25/41 (60%)
Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
RE ++E+++ ++E E E+E+ K E +E++++E
Sbjct: 89 RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREV 129
Score = 25.9 bits (57), Expect = 9.3
Identities = 14/46 (30%), Positives = 23/46 (50%)
Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
E EEEE +++ E E E+ E + K++++ E K K E
Sbjct: 90 ELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTE 135
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 41.9 bits (99), Expect = 5e-05
Identities = 20/102 (19%), Positives = 50/102 (49%), Gaps = 7/102 (6%)
Query: 55 HKP-----IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK 109
H+P + + ++ + I E L++ ++ EE+++++EE +++++E
Sbjct: 84 HRPWNLDNVFGSDQVVIFDDGDIE--EELQDEPRYDDAYRDLEEDDDDDEESDEEDEESS 141
Query: 110 KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
K E E++++ D E E ++ +++ EE+ E E
Sbjct: 142 KSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEF 183
Score = 35.0 bits (81), Expect = 0.012
Identities = 10/58 (17%), Positives = 29/58 (50%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
+ EE+E+ + E++E++++ + D+D + E +++ +++ E Q
Sbjct: 130 DEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEFYYYQ 187
Score = 31.5 bits (72), Expect = 0.17
Identities = 14/74 (18%), Positives = 32/74 (43%), Gaps = 12/74 (16%)
Query: 91 EEEEEEEEEEKEK-----------DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
EEE ++E + +E +E+ E + E+++ D +++ + E+ +
Sbjct: 107 EEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLE 166
Query: 140 KKEEEEE-EEKEKV 152
++ E EEK
Sbjct: 167 RRRRRREWEEKRAE 180
Score = 26.9 bits (60), Expect = 6.0
Identities = 10/56 (17%), Positives = 31/56 (55%)
Query: 98 EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ +E+ +DE + + + EE D +E + E+EE + ++++++++ + +
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIA 159
Score = 26.9 bits (60), Expect = 7.3
Identities = 10/60 (16%), Positives = 28/60 (46%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
+ EEE + + + EE+++ ++ EE+ E + E +++++++ E
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRER 163
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 41.9 bits (98), Expect = 8e-05
Identities = 24/80 (30%), Positives = 44/80 (55%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
A E E + E++E ++E+ EE ++D ++ + E++E+ +D EE V+E E
Sbjct: 4016 ADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENE 4075
Query: 134 EEQKNKKKEEEEEEEKEKVE 153
E ++ K +EE E+ E E
Sbjct: 4076 ESTEDGVKSDEELEQGEVPE 4095
Score = 36.9 bits (85), Expect = 0.004
Identities = 15/72 (20%), Positives = 36/72 (50%), Gaps = 2/72 (2%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE--EKQKDKKEEEVEEEEEEQ 136
E +E ++E EE +E+ ++ D + + ++K E+ E+ + EE E+ +
Sbjct: 4026 EKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSD 4085
Query: 137 KNKKKEEEEEEE 148
+ ++ E E++
Sbjct: 4086 EELEQGEVPEDQ 4097
Score = 36.1 bits (83), Expect = 0.006
Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 6/77 (7%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDED-----EEKKREKKEEEEEKQKD-KKEEEV 129
E E + E E+K E+ E + ED E+K R++KE+EEE D ++E+
Sbjct: 3910 EPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEI 3969
Query: 130 EEEEEEQKNKKKEEEEE 146
+ + +E ++ E E+
Sbjct: 3970 QPDIQENNSQPPPENED 3986
Score = 36.1 bits (83), Expect = 0.007
Identities = 16/79 (20%), Positives = 36/79 (45%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
S +L W+E +E E E++ E+ +E K+++ + + ++E+ +EEE
Sbjct: 3900 SNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEM 3959
Query: 135 EQKNKKKEEEEEEEKEKVE 153
+E + + +E
Sbjct: 3960 SDDVGIDDEIQPDIQENNS 3978
Score = 35.4 bits (81), Expect = 0.011
Identities = 21/97 (21%), Positives = 43/97 (44%), Gaps = 22/97 (22%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEE-----------------KEKDEDEEKKREKKEE 116
T ++ E E + EEE ++ +EE E +E++ + E+K
Sbjct: 3865 NGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSN 3924
Query: 117 EE-----EKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
E+ E KE++ + E++ + +K++EEE +
Sbjct: 3925 EQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSD 3961
Score = 35.4 bits (81), Expect = 0.013
Identities = 20/92 (21%), Positives = 40/92 (43%), Gaps = 15/92 (16%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDE-------EKKREKKEEEEEKQKDKKEEEVE 130
+++ E E EEE ++ +E+ D E EK ++ EE+ + ++K E
Sbjct: 3868 TKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQS 3927
Query: 131 EEEEEQKNKKKEE--------EEEEEKEKVEM 154
E KE+ + +E++++ EM
Sbjct: 3928 AANNESDLVSKEDDNKALEDKDRQEKEDEEEM 3959
Score = 35.0 bits (80), Expect = 0.013
Identities = 15/74 (20%), Positives = 37/74 (50%), Gaps = 2/74 (2%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
ENN +E+ ++ + + E++EK ++ E+ +E EE + K +E E+ E ++
Sbjct: 4039 ENNTLDEDIQQDDFSDLAEDDEKMNEDGFEE--NVQENEESTEDGVKSDEELEQGEVPED 4096
Query: 139 KKKEEEEEEEKEKV 152
+ + + + +
Sbjct: 4097 QAIDNHPKMDAKST 4110
Score = 34.6 bits (79), Expect = 0.019
Identities = 20/86 (23%), Positives = 35/86 (40%), Gaps = 3/86 (3%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEE---EEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
E+ E+ +EE E+ E E++ K + E EE +K +
Sbjct: 4070 NVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEADEENTDKGIVGEN 4129
Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEKV 152
EE+ EE+ + N + E E+ +E
Sbjct: 4130 EELGEEDGVRGNGTADGEFEQVQEDT 4155
Score = 34.6 bits (79), Expect = 0.019
Identities = 18/66 (27%), Positives = 31/66 (46%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
+ E+ + EE EE+ + D DE + RE + + KD E E + E++N
Sbjct: 3826 DITNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEEN 3885
Query: 139 KKKEEE 144
+ +EE
Sbjct: 3886 QDLDEE 3891
Score = 33.0 bits (75), Expect = 0.068
Identities = 14/61 (22%), Positives = 29/61 (47%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
++ + LE EE+ + + +E E E E D + K E E +++ ++++E
Sbjct: 3831 LNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDE 3890
Query: 132 E 132
E
Sbjct: 3891 E 3891
Score = 33.0 bits (75), Expect = 0.074
Identities = 21/84 (25%), Positives = 43/84 (51%), Gaps = 19/84 (22%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEE-------------EEEKQKDKKEEEVEEE 132
E+K+++E+E+EEE + D+E + + +E E+ + D+KE +V ++
Sbjct: 3946 EDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKD 4005
Query: 133 EE------EQKNKKKEEEEEEEKE 150
+ E ++ KEE + E+ E
Sbjct: 4006 SDLEDMDMEAADENKEEADAEKDE 4029
Score = 32.3 bits (73), Expect = 0.10
Identities = 19/74 (25%), Positives = 37/74 (50%), Gaps = 3/74 (4%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE---E 143
E E +EE + E+ E +DE+ E +E+ Q+D + E++E+ ++ +E E
Sbjct: 4014 EAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQE 4073
Query: 144 EEEEEKEKVEMYRN 157
EE ++ V+
Sbjct: 4074 NEESTEDGVKSDEE 4087
Score = 31.5 bits (71), Expect = 0.21
Identities = 14/75 (18%), Positives = 43/75 (57%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E+ + +E+E + ++ + E+ + E ++ +++ + +++E + +D EE +E+ Q++
Sbjct: 3991 EDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQD 4050
Query: 139 KKKEEEEEEEKEKVE 153
+ E++EK +
Sbjct: 4051 DFSDLAEDDEKMNED 4065
Score = 31.1 bits (70), Expect = 0.32
Identities = 20/84 (23%), Positives = 33/84 (39%), Gaps = 9/84 (10%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE---------EE 128
L ++ + E EE +E EE E +++ E+ E E++ D
Sbjct: 4055 LAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASA 4114
Query: 129 VEEEEEEQKNKKKEEEEEEEKEKV 152
+EE K E EE E++ V
Sbjct: 4115 EADEENTDKGIVGENEELGEEDGV 4138
Score = 30.7 bits (69), Expect = 0.34
Identities = 16/70 (22%), Positives = 29/70 (41%), Gaps = 1/70 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E+ E+++ EE E+D + + E E + D + E +N EE
Sbjct: 3825 EDITNTLNEDDDLEELANEEDTANQSD-LDESEARELESDMNGVTKDSVVSENENSDSEE 3883
Query: 144 EEEEEKEKVE 153
E ++ E+V
Sbjct: 3884 ENQDLDEEVN 3893
Score = 30.4 bits (68), Expect = 0.46
Identities = 18/76 (23%), Positives = 41/76 (53%), Gaps = 3/76 (3%)
Query: 76 ETLENNEWEEEEKEKEEEE--EEEEEEKEKDED-EEKKREKKEEEEEKQKDKKEEEVEEE 132
E E E+ + E+ +E+E + KD D E+ E +E +E+ +K+E +++E
Sbjct: 3975 ENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDE 4034
Query: 133 EEEQKNKKKEEEEEEE 148
+ ++N +E+ +++
Sbjct: 4035 DPLEENNTLDEDIQQD 4050
Score = 30.4 bits (68), Expect = 0.55
Identities = 17/86 (19%), Positives = 43/86 (50%), Gaps = 7/86 (8%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDED----EEKKREKKEEEEEKQKDKK 125
+ E +++ +++ + + +E + ++ED E+ K ++KE + K D +
Sbjct: 3950 RQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLE 4009
Query: 126 EEEVE---EEEEEQKNKKKEEEEEEE 148
+ ++E E +EE +K E ++E+
Sbjct: 4010 DMDMEAADENKEEADAEKDEPMQDED 4035
Score = 30.0 bits (67), Expect = 0.64
Identities = 15/84 (17%), Positives = 39/84 (46%), Gaps = 4/84 (4%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEE----EEKEKDEDEEKKREKKEEEEEKQKDKKE 126
+ + TL+ + +++ + E++E+ EE ++ +E + K +EE +Q + E
Sbjct: 4036 PLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPE 4095
Query: 127 EEVEEEEEEQKNKKKEEEEEEEKE 150
++ + + K E ++E
Sbjct: 4096 DQAIDNHPKMDAKSTFASAEADEE 4119
Score = 28.8 bits (64), Expect = 1.4
Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 15/73 (20%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQKD----------KKEEEVEEEEEEQKNKKK--- 141
E+ EEE EKD DE ++EK ++ + KD E E EEE+ E ++
Sbjct: 2927 EQIEEEDEKD-DEYGEQEKVSDKNDYLKDFIDYDDDGSISTESEPEEEKRECESGFDGWN 2985
Query: 142 -EEEEEEEKEKVE 153
EEEE++ EK E
Sbjct: 2986 DYEEEEDDDEKNE 2998
Score = 28.0 bits (62), Expect = 2.9
Identities = 15/85 (17%), Positives = 40/85 (47%)
Query: 65 TVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
+ LP+ + E + + + ++ + E +E +E+ E +E +++ EE D+
Sbjct: 3986 DLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDE 4045
Query: 125 KEEEVEEEEEEQKNKKKEEEEEEEK 149
++ + + + ++K E+ EE
Sbjct: 4046 DIQQDDFSDLAEDDEKMNEDGFEEN 4070
Score = 27.3 bits (60), Expect = 5.7
Identities = 12/69 (17%), Positives = 34/69 (49%), Gaps = 1/69 (1%)
Query: 82 EWEEEEKEKEEEEEEEE-EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
EWE+ + + + ++ E ++DE+E+ + E+++ + ++E + + +
Sbjct: 4187 EWEDLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESANQNPDSMNSTN 4246
Query: 141 KEEEEEEEK 149
E+E +E
Sbjct: 4247 IAEDEADEV 4255
Score = 26.9 bits (59), Expect = 7.8
Identities = 27/100 (27%), Positives = 45/100 (45%), Gaps = 15/100 (15%)
Query: 76 ETLENNEWEEEEKEKEEE--------EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
+ + E E EE+++E E EEEE++ EK+E E + + E E +K E
Sbjct: 2961 DGSISTESEPEEEKRECESGFDGWNDYEEEEDDDEKNEKECELEQNAEILIEDEKIGSEG 3020
Query: 128 EVEEEEEE-------QKNKKKEEEEEEEKEKVEMYRNPVP 160
+E EE + N+ E EE++ + E Y +
Sbjct: 3021 SLESGGEELVCSNLSEGNESSEIEEDDVESGGEFYADSRD 3060
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 41.4 bits (97), Expect = 8e-05
Identities = 16/75 (21%), Positives = 44/75 (58%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + E+E E ++++EE+ KEK+ + K +K+ + + + + + ++ E+K+
Sbjct: 7 EAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKS 66
Query: 139 KKKEEEEEEEKEKVE 153
+K++ E+E ++ ++
Sbjct: 67 RKRDVEDENPEDFID 81
Score = 29.9 bits (67), Expect = 0.79
Identities = 14/42 (33%), Positives = 26/42 (61%)
Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
E + +K EEE ++ KK+EE +E+E +K K ++E + +
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46
Score = 29.5 bits (66), Expect = 1.0
Identities = 15/42 (35%), Positives = 27/42 (64%)
Query: 105 EDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
E E +K+ EEE E++K K+E+ E+E ++ K +KE + +
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 41.3 bits (98), Expect = 1e-04
Identities = 21/71 (29%), Positives = 47/71 (66%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E E E ++ E+ ++EE ++ + +KRE++ E++EK+ ++K++E+E++EEE +
Sbjct: 76 ELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEE 135
Query: 139 KKKEEEEEEEK 149
+E+ +E E+
Sbjct: 136 LIEEQLQELER 146
Score = 39.0 bits (92), Expect = 5e-04
Identities = 24/69 (34%), Positives = 46/69 (66%), Gaps = 3/69 (4%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK---DKKEEEVEEEEEEQKNKK 140
EE K + E E+E E + + + EK+ +KEE +++ +K+EEE+E++E+E + K+
Sbjct: 64 EEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQ 123
Query: 141 KEEEEEEEK 149
+E E++EE+
Sbjct: 124 QELEKKEEE 132
Score = 36.3 bits (85), Expect = 0.004
Identities = 24/81 (29%), Positives = 49/81 (60%), Gaps = 1/81 (1%)
Query: 74 ASETLENNEWEEEEKEKEEEEE-EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
A LE + E E +KE E +EE K ++E E++ RE++ E ++ +K ++E +
Sbjct: 40 AKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLD 99
Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
+ + +K+EEE E++++++E
Sbjct: 100 RKLELLEKREEELEKKEKELE 120
Score = 35.5 bits (83), Expect = 0.009
Identities = 29/91 (31%), Positives = 47/91 (51%), Gaps = 15/91 (16%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE------------- 126
+ + E EK +EE E++E+E ++K ++ EKK E+ EE E+Q + E
Sbjct: 99 DRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEI 158
Query: 127 --EEVEEEEEEQKNKKKEEEEEEEKEKVEMY 155
E+VEEE + +E EEE KE+ +
Sbjct: 159 LLEKVEEEARHEAAVLIKEIEEEAKEEADKK 189
Score = 35.1 bits (82), Expect = 0.009
Identities = 22/70 (31%), Positives = 38/70 (54%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEE K EE ++E E +K+ E K E + E +K+ +E E ++ E++ +KEE
Sbjct: 37 EEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEE 96
Query: 144 EEEEEKEKVE 153
+ + E +E
Sbjct: 97 NLDRKLELLE 106
Score = 34.8 bits (81), Expect = 0.014
Identities = 23/75 (30%), Positives = 49/75 (65%), Gaps = 6/75 (8%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE-----QKN 138
E +E+ + E E+E +E+ +E +K EK+ ++E+ D+K E +E+ EEE ++
Sbjct: 61 EAKEEIHKLRNEFEKELRER-RNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKEL 119
Query: 139 KKKEEEEEEEKEKVE 153
++K++E E+++E++E
Sbjct: 120 EQKQQELEKKEEELE 134
Score = 27.8 bits (63), Expect = 2.6
Identities = 19/53 (35%), Positives = 31/53 (58%)
Query: 99 EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
E K K+ +EE KR +E ++E + KKE +E +EE K + + E+E E+
Sbjct: 30 EAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRN 82
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 40.2 bits (94), Expect = 2e-04
Identities = 16/67 (23%), Positives = 40/67 (59%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
EE+++K+E+++ EE ++++ ++E+ ++ ++E Q+ KK+ E ++ K K+ EE
Sbjct: 78 EEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEA 137
Query: 145 EEEEKEK 151
+
Sbjct: 138 AAKAAAA 144
Score = 40.2 bits (94), Expect = 2e-04
Identities = 14/68 (20%), Positives = 39/68 (57%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E+ +K+++++ EE ++++ +++ K+ EK+ ++QK + EE ++ +QK ++
Sbjct: 79 EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAA 138
Query: 144 EEEEEKEK 151
+ K
Sbjct: 139 AKAAAAAK 146
Score = 37.1 bits (86), Expect = 0.002
Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 7/93 (7%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEE---EEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
+ +E L+ + E+E+ K+ E+E +E++K+ +E ++ K+++ EE
Sbjct: 83 KKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAA 142
Query: 126 EEEVEEEEEEQKN----KKKEEEEEEEKEKVEM 154
+ E E K KK E ++K + E
Sbjct: 143 AAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEA 175
Score = 35.2 bits (81), Expect = 0.012
Identities = 13/72 (18%), Positives = 42/72 (58%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+E+++ EE ++++ E+E+ + EK+R +E++++ ++ ++ ++++ ++ K
Sbjct: 84 KEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAA 143
Query: 145 EEEEKEKVEMYR 156
+ K + E R
Sbjct: 144 AAKAKAEAEAKR 155
Score = 34.0 bits (78), Expect = 0.024
Identities = 17/78 (21%), Positives = 40/78 (51%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
A E + E ++ E+ ++++ E+E K+ +++ +E+K++ EE K ++ + EE
Sbjct: 77 AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEE 136
Query: 134 EEQKNKKKEEEEEEEKEK 151
K + + E + K
Sbjct: 137 AAAKAAAAAKAKAEAEAK 154
Score = 33.6 bits (77), Expect = 0.032
Identities = 15/67 (22%), Positives = 39/67 (58%), Gaps = 3/67 (4%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+++++K + EE+ +K++ + E+ ++K+ E+E+ K+ E E +++ K+ EE
Sbjct: 67 QQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERL---KQLEKERLAAQEQKKQAEEA 123
Query: 145 EEEEKEK 151
++ K
Sbjct: 124 AKQAALK 130
Score = 30.9 bits (70), Expect = 0.29
Identities = 15/67 (22%), Positives = 28/67 (41%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+K E +++ E E + + +KK E E K E + + E E +K E +
Sbjct: 157 AAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAK 216
Query: 145 EEEEKEK 151
++ E
Sbjct: 217 KKAAAEA 223
Score = 30.2 bits (68), Expect = 0.47
Identities = 17/70 (24%), Positives = 28/70 (40%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E K+K E E ++ E + E + K E K+K + E + + E +K E
Sbjct: 164 AAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEA 223
Query: 144 EEEEEKEKVE 153
+ K E
Sbjct: 224 KAAAAKAAAE 233
Score = 29.8 bits (67), Expect = 0.62
Identities = 21/86 (24%), Positives = 34/86 (39%), Gaps = 16/86 (18%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKRE-----KKEEEEEKQ-----------KDKKEE 127
+E+K++ EE ++ K+K +E + K E E K+ KK E
Sbjct: 113 AQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAE 172
Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVE 153
++ + KKK E E K E
Sbjct: 173 AEAAKKAAAEAKKKAEAEAAAKAAAE 198
Score = 29.4 bits (66), Expect = 0.96
Identities = 14/70 (20%), Positives = 25/70 (35%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
+ E E K E +++ E E + + +KK E K K + E+ K
Sbjct: 186 KAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAK 245
Query: 142 EEEEEEEKEK 151
E+ +
Sbjct: 246 AAEKAAAAKA 255
Score = 29.0 bits (65), Expect = 1.3
Identities = 12/61 (19%), Positives = 39/61 (63%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
E+ +++++K +++ KK+E+++ ++ ++++ E+E +Q K++ +E++K+
Sbjct: 62 EQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAE 121
Query: 153 E 153
E
Sbjct: 122 E 122
Score = 29.0 bits (65), Expect = 1.3
Identities = 15/78 (19%), Positives = 31/78 (39%), Gaps = 10/78 (12%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKR----------EKKEEEEEKQKDKKEEEVEEEE 133
++K+ EE + + + E KR E K++ E + K E +++
Sbjct: 128 ALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKA 187
Query: 134 EEQKNKKKEEEEEEEKEK 151
E + K E +++ E
Sbjct: 188 EAEAAAKAAAEAKKKAEA 205
Score = 28.2 bits (63), Expect = 2.2
Identities = 19/83 (22%), Positives = 37/83 (44%), Gaps = 10/83 (12%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR------EKKEEEEEKQ----KDKKEE 127
LE +E++K+ EE ++ ++ + EE + K E E K+ K
Sbjct: 106 LEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAA 165
Query: 128 EVEEEEEEQKNKKKEEEEEEEKE 150
E +++ E + KK E +++ E
Sbjct: 166 EAKKKAEAEAAKKAAAEAKKKAE 188
Score = 27.8 bits (62), Expect = 2.6
Identities = 18/66 (27%), Positives = 25/66 (37%), Gaps = 2/66 (3%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
K K E E + K E K KK E E +K E + + E E E +
Sbjct: 143 AAAKAKAEAEAKRAAAAAKKAAAEAK--KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAK 200
Query: 145 EEEEKE 150
++ E E
Sbjct: 201 KKAEAE 206
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 40.0 bits (94), Expect = 2e-04
Identities = 19/72 (26%), Positives = 35/72 (48%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ E EK E EEE+ E + ++E K + + + E+ D+ E + E +++EE
Sbjct: 17 KLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEE 76
Query: 144 EEEEEKEKVEMY 155
E + E + Y
Sbjct: 77 EAKAEAAEFRAY 88
Score = 39.6 bits (93), Expect = 3e-04
Identities = 19/70 (27%), Positives = 32/70 (45%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+ E E+ E EEE+ E D +E+ + E + ++ E E + + +EEE
Sbjct: 16 RKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEE 75
Query: 145 EEEEKEKVEM 154
EE + E E
Sbjct: 76 EEAKAEAAEF 85
Score = 36.9 bits (86), Expect = 0.002
Identities = 16/79 (20%), Positives = 33/79 (41%), Gaps = 8/79 (10%)
Query: 79 ENNEWEEEEKEKEEEEEEE--------EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
E EEEK + + +EE + +E ++ E K E +++++E + E
Sbjct: 22 EKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAKAE 81
Query: 131 EEEEEQKNKKKEEEEEEEK 149
E + ++ EE+
Sbjct: 82 AAEFRAYLRGGDDALAEER 100
Score = 36.6 bits (85), Expect = 0.004
Identities = 17/63 (26%), Positives = 31/63 (49%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
KE E E KE + E ++ + EEE+ + D +EE+++ + E ++ +E E
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60
Query: 149 KEK 151
K
Sbjct: 61 KPA 63
Score = 35.8 bits (83), Expect = 0.006
Identities = 21/75 (28%), Positives = 33/75 (44%), Gaps = 4/75 (5%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE----EEVEEEEEEQKNK 139
E E +E + E+++ E EE+K E +EE K E EE+ +E E +
Sbjct: 5 REALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAA 64
Query: 140 KKEEEEEEEKEKVEM 154
E EE+E+
Sbjct: 65 SGEGGGGEEEEEEAK 79
Score = 35.0 bits (81), Expect = 0.012
Identities = 15/69 (21%), Positives = 35/69 (50%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+E+ E EEE+ E + KE+ + + + ++ EE ++ + K E E++ ++ +
Sbjct: 21 DEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAKA 80
Query: 144 EEEEEKEKV 152
E E + +
Sbjct: 81 EAAEFRAYL 89
Score = 34.3 bits (79), Expect = 0.018
Identities = 21/81 (25%), Positives = 35/81 (43%), Gaps = 1/81 (1%)
Query: 68 LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
L + L +E E +E++ E + +EE +K D E R ++ +E + K
Sbjct: 8 LAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDK-LDAEIDRLEELLDELEAKPAASG 66
Query: 128 EVEEEEEEQKNKKKEEEEEEE 148
E EEE++ K E E
Sbjct: 67 EGGGGEEEEEEAKAEAAEFRA 87
Score = 27.3 bits (61), Expect = 4.8
Identities = 13/48 (27%), Positives = 24/48 (50%)
Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
++ RE E ++ + E+E E EE+K + +EE +K E+
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEI 48
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 40.2 bits (95), Expect = 3e-04
Identities = 25/79 (31%), Positives = 45/79 (56%), Gaps = 3/79 (3%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE---EKQKDKKEEEVEEE 132
+LE E E E+K +E E +E EK K+E EEKK + +EEE+ E+ + + ++ ++E
Sbjct: 523 ASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEA 582
Query: 133 EEEQKNKKKEEEEEEEKEK 151
++E KE + ++
Sbjct: 583 KKEADEIIKELRQLQKGGY 601
Score = 37.1 bits (87), Expect = 0.003
Identities = 23/73 (31%), Positives = 47/73 (64%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE E+E E++ EE E ++ E +++ E+K+E+ ++++DK EE E+E ++ + K+E
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKE 585
Query: 144 EEEEEKEKVEMYR 156
+E KE ++ +
Sbjct: 586 ADEIIKELRQLQK 598
Score = 36.0 bits (84), Expect = 0.006
Identities = 18/82 (21%), Positives = 42/82 (51%), Gaps = 1/82 (1%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
+ E E++ E+ E+E ++ ++ K E +E +E ++ ++ K E+ E + N
Sbjct: 559 KLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRL-N 617
Query: 139 KKKEEEEEEEKEKVEMYRNPVP 160
K E++E+++K++ E
Sbjct: 618 KANEKKEKKKKKQKEKQEELKV 639
Score = 33.3 bits (77), Expect = 0.047
Identities = 24/76 (31%), Positives = 45/76 (59%), Gaps = 6/76 (7%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE---EEKQKDKKEEEVEEEEEEQKN-- 138
EE +K E++E+ E E+ E++ E+K EE K+ +K +EE+EE++E+ +
Sbjct: 505 EEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEE 564
Query: 139 -KKKEEEEEEEKEKVE 153
K EE E+E ++ ++
Sbjct: 565 DKLLEEAEKEAQQAIK 580
Score = 31.7 bits (73), Expect = 0.14
Identities = 21/90 (23%), Positives = 45/90 (50%), Gaps = 6/90 (6%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE-- 127
+ E E +E EK +EE EE++EK ++E+++ E ++E ++ K+ K+E
Sbjct: 528 LERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEAD 587
Query: 128 ----EVEEEEEEQKNKKKEEEEEEEKEKVE 153
E+ + ++ K E E ++++
Sbjct: 588 EIIKELRQLQKGGYASVKAHELIEARKRLN 617
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 39.6 bits (93), Expect = 3e-04
Identities = 21/68 (30%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE-EEEEKQKDKKEEEVEEEEEEQKNKKKE 142
+++ K+ +EEE+EE+E+EK + K + KK + + ++K+K + E EE+ + +
Sbjct: 30 DDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTP 89
Query: 143 EEEEEEKE 150
E+E EK
Sbjct: 90 EDELAEKL 97
Score = 38.5 bits (90), Expect = 6e-04
Identities = 18/68 (26%), Positives = 45/68 (66%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
K+K ++E+E+++ K+ ++EE + +++E+ + K K ++ ++ + EE++ K+E+EE+
Sbjct: 21 KDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKG 80
Query: 148 EKEKVEMY 155
+E E
Sbjct: 81 LRELEEDT 88
Score = 36.9 bits (86), Expect = 0.002
Identities = 17/73 (23%), Positives = 40/73 (54%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
+ ++ + EE+E+++EE+ + + K K + K EK++ + EK++ E E+ E+
Sbjct: 32 DVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPED 91
Query: 136 QKNKKKEEEEEEE 148
+ +K + +E
Sbjct: 92 ELAEKLRLRKLQE 104
Score = 35.8 bits (83), Expect = 0.005
Identities = 19/69 (27%), Positives = 41/69 (59%), Gaps = 4/69 (5%)
Query: 83 WEEEEKEKEEEEEEEEE----EKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
W+EEE E++EEE+ + K+ + + +++EK + E+E++ ++ EE E+E +
Sbjct: 37 WDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEK 96
Query: 139 KKKEEEEEE 147
+ + +EE
Sbjct: 97 LRLRKLQEE 105
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 39.9 bits (93), Expect = 3e-04
Identities = 20/91 (21%), Positives = 50/91 (54%)
Query: 60 ANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
A AR+ + +++ +E + ++ E+E++ ++++ E E++E+ E E + +E +E
Sbjct: 185 AFARVVREMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQES 244
Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
+ D++ E EEE + +E +++ E
Sbjct: 245 EATDRESESGEEEMVQSDQDDLPDESDDDSE 275
Score = 34.5 bits (79), Expect = 0.020
Identities = 19/80 (23%), Positives = 40/80 (50%), Gaps = 3/80 (3%)
Query: 77 TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE---EEVEEEE 133
++E E ++ E E+EE+ ++++ ++E EE+ + E +E + E E E E
Sbjct: 196 SMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGE 255
Query: 134 EEQKNKKKEEEEEEEKEKVE 153
EE +++ +E + E
Sbjct: 256 EEMVQSDQDDLPDESDDDSE 275
Score = 29.5 bits (66), Expect = 0.87
Identities = 11/76 (14%), Positives = 36/76 (47%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
+E+ + + ++++ + E+EE+ E E E E + + + E + +++ + ++++
Sbjct: 206 DTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDD 265
Query: 134 EEQKNKKKEEEEEEEK 149
++ E E
Sbjct: 266 LPDESDDDSETPGEGA 281
Score = 28.8 bits (64), Expect = 1.5
Identities = 18/64 (28%), Positives = 32/64 (50%), Gaps = 6/64 (9%)
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
EE ++ E E E+D D+++ E ++EE+ E E E +E + + + E E
Sbjct: 201 EEMGDDTESEDEEDGDDDQPTENEQEEQ------GEGEGEGQEGSAPQESEATDRESESG 254
Query: 151 KVEM 154
+ EM
Sbjct: 255 EEEM 258
Score = 28.0 bits (62), Expect = 2.8
Identities = 17/69 (24%), Positives = 33/69 (47%), Gaps = 10/69 (14%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E E+ ++ E E+E++ D+D+ + E++E+ E E E +E ++ E
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEG----------EGEGQEGSAPQESEAT 247
Query: 146 EEEKEKVEM 154
+ E E E
Sbjct: 248 DRESESGEE 256
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 38.9 bits (90), Expect = 4e-04
Identities = 20/63 (31%), Positives = 47/63 (74%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+E+ EK +E EE EE+ + E E+K +++E++EE++++++E+ +EE+ ++ +++ E++
Sbjct: 125 QEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQD 184
Query: 145 EEE 147
E E
Sbjct: 185 ESE 187
Score = 36.2 bits (83), Expect = 0.004
Identities = 19/61 (31%), Positives = 39/61 (63%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E E EWE E++ +E E++EE+E+E++E +++ E E ++D+ E E ++++ E
Sbjct: 137 ELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKTE 196
Query: 136 Q 136
+
Sbjct: 197 K 197
Score = 35.8 bits (82), Expect = 0.004
Identities = 21/62 (33%), Positives = 40/62 (64%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
EE +E+ E E EE+ E+++ EE++RE++E+ E+Q D E E+ E++E + ++
Sbjct: 136 EELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKT 195
Query: 145 EE 146
E+
Sbjct: 196 EK 197
Score = 35.8 bits (82), Expect = 0.004
Identities = 27/96 (28%), Positives = 53/96 (55%)
Query: 58 IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
IR +L + I ++L + + E+ +E EE EE E E+ E++ +RE +EE+
Sbjct: 101 IRIRDQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQ 160
Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
E +++++ EE ++ E + ++ E E E + +K E
Sbjct: 161 EREREEQTIEEQSDDSEHEIIEQDESETESDDDKTE 196
Score = 33.5 bits (76), Expect = 0.026
Identities = 23/86 (26%), Positives = 50/86 (58%)
Query: 68 LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
+P AI + L + + ++ +K+K +E+ EK + E EE + + E E++ D++E+
Sbjct: 97 VPLAIRIRDQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDERED 156
Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ E+E E ++ +E+ ++ E E +E
Sbjct: 157 QEEQEREREEQTIEEQSDDSEHEIIE 182
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 39.0 bits (91), Expect = 4e-04
Identities = 19/69 (27%), Positives = 36/69 (52%), Gaps = 2/69 (2%)
Query: 93 EEEEEEEEKEKDEDEEKKREK-KEEEEEKQKD-KKEEEVEEEEEEQKNKKKEEEEEEEKE 150
E E+ + K+K E + +++ EK K K + E ++ +KKEE+++ +K+
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKK 76
Query: 151 KVEMYRNPV 159
KV + NP
Sbjct: 77 KVPLQVNPA 85
Score = 30.5 bits (69), Expect = 0.27
Identities = 13/42 (30%), Positives = 24/42 (57%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEK 120
N++ + EK K + + E +K +EKK EKK+ +++K
Sbjct: 36 INSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77
Score = 30.1 bits (68), Expect = 0.35
Identities = 12/72 (16%), Positives = 33/72 (45%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
E + +++ EK + K + + ++ ++KKEE+ + ++++ + + +
Sbjct: 31 EIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVPLQVNPAQLFVD 90
Query: 149 KEKVEMYRNPVP 160
E +PVP
Sbjct: 91 DEYYVWIYDPVP 102
Score = 30.1 bits (68), Expect = 0.41
Identities = 16/72 (22%), Positives = 32/72 (44%), Gaps = 3/72 (4%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
E E+ + KD+ + K +++ + K + E K K +E+++EK +
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVK---KLHSQEKKEEKKKP 73
Query: 155 YRNPVPMYRNPV 166
+ VP+ NP
Sbjct: 74 KKKKVPLQVNPA 85
Score = 28.6 bits (64), Expect = 1.3
Identities = 10/51 (19%), Positives = 26/51 (50%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
N E + +++ E+ + + E K+ +E++E++K K+++V
Sbjct: 29 NPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVP 79
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 39.2 bits (92), Expect = 4e-04
Identities = 10/78 (12%), Positives = 42/78 (53%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
N+ ++++ + ++++ ++++ + D+DE+ + +++ + + ++K+E E E+ +
Sbjct: 127 LNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDD 186
Query: 140 KKEEEEEEEKEKVEMYRN 157
+E++ E +
Sbjct: 187 FVWDEDDSEALRQARKDA 204
Score = 35.0 bits (81), Expect = 0.013
Identities = 12/94 (12%), Positives = 47/94 (50%), Gaps = 3/94 (3%)
Query: 60 ANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
L + +++ ++ + + ++++++ + D+ ++ + +++E+
Sbjct: 96 LKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDED 155
Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+D +++V++E+EE KK+ +E E+ + +
Sbjct: 156 DDEDDDDDDVDDEDEE---KKEAKELEKLSDDDD 186
Score = 29.6 bits (67), Expect = 0.77
Identities = 8/83 (9%), Positives = 45/83 (54%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
P + L++++ E++ +++++ + ++ + +++++ D +++
Sbjct: 91 PAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDD 150
Query: 129 VEEEEEEQKNKKKEEEEEEEKEK 151
++E++++ + + ++E+E++K
Sbjct: 151 DDDEDDDEDDDDDDVDDEDEEKK 173
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 39.1 bits (92), Expect = 5e-04
Identities = 26/93 (27%), Positives = 47/93 (50%), Gaps = 13/93 (13%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKK----------REKKEEEEEKQ 121
I+AS L + +E E+ ++ EE E+K++ E E K + +EE +K
Sbjct: 495 ITASSGLSD---DEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKL 551
Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
+ +++VEE E K + + E++EE + K E
Sbjct: 552 PEADKKKVEEAIEWLKEELEGEDKEEIEAKTEE 584
Score = 36.8 bits (86), Expect = 0.003
Identities = 20/82 (24%), Positives = 38/82 (46%), Gaps = 1/82 (1%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
+SA + E + ++E E KD +E +KK +E + K++ EE
Sbjct: 480 VSAKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEA-EEYVY 538
Query: 132 EEEEQKNKKKEEEEEEEKEKVE 153
E+ ++ ++ E +K+KVE
Sbjct: 539 SLEKSLKEEGDKLPEADKKKVE 560
Score = 28.7 bits (65), Expect = 1.7
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 6/83 (7%)
Query: 79 ENNEWEEEEKEKEEEEEEEEE-----EKEKDEDEEKK-REKKEEEEEKQKDKKEEEVEEE 132
E +++ KE+ E + E EE EK E+ +K K++ EE + KEE E+
Sbjct: 515 EYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVEEAIEWLKEELEGED 574
Query: 133 EEEQKNKKKEEEEEEEKEKVEMY 155
+EE + K +E ++ + MY
Sbjct: 575 KEEIEAKTEELQKVVQPIGERMY 597
Score = 26.8 bits (60), Expect = 7.5
Identities = 19/62 (30%), Positives = 31/62 (50%), Gaps = 2/62 (3%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ I A E + E+ KEE ++ E +K+K E+ KEE E + K++ E +
Sbjct: 525 ERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVEE--AIEWLKEELEGEDKEEIEAKT 582
Query: 130 EE 131
EE
Sbjct: 583 EE 584
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 39.1 bits (92), Expect = 5e-04
Identities = 21/71 (29%), Positives = 38/71 (53%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E E E + E + EE ++ + K+E+ E++EK+ KE+ ++E+EEE +
Sbjct: 70 ELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEE 129
Query: 139 KKKEEEEEEEK 149
E+ EE E+
Sbjct: 130 LIAEQREELER 140
Score = 33.7 bits (78), Expect = 0.030
Identities = 19/70 (27%), Positives = 36/70 (51%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE K EE ++E E +K+ E K E + E +++ KE E + E++ ++EE
Sbjct: 31 EELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREE 90
Query: 144 EEEEEKEKVE 153
+ + E ++
Sbjct: 91 TLDRKMESLD 100
Score = 32.6 bits (75), Expect = 0.082
Identities = 31/85 (36%), Positives = 49/85 (57%), Gaps = 7/85 (8%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE-------EKQKDKKEEE 128
E+L+ E E+KEKE +E+ +++++E EE E++EE E E+ K+ EE
Sbjct: 97 ESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEEAKEILLEE 156
Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVE 153
VEEE + K +E EEE KE+ +
Sbjct: 157 VEEEARHEAAKLIKEIEEEAKEEAD 181
Score = 31.8 bits (73), Expect = 0.15
Identities = 24/81 (29%), Positives = 44/81 (54%), Gaps = 6/81 (7%)
Query: 74 ASETLENNEWEEEEKEKE---EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
A +E + E E +KE E +EE + + + E E K+R + + E++ ++EE ++
Sbjct: 34 AKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLD 93
Query: 131 EEEEEQKNKKKEEEEEEEKEK 151
+ E + K+EE E+KEK
Sbjct: 94 RKME---SLDKKEENLEKKEK 111
Score = 29.5 bits (67), Expect = 0.81
Identities = 24/70 (34%), Positives = 37/70 (52%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE K E +++EE EK E E +EK +E+E++ ++ E EE E +EE
Sbjct: 89 EETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEE 148
Query: 144 EEEEEKEKVE 153
+E E+VE
Sbjct: 149 AKEILLEEVE 158
Score = 28.4 bits (64), Expect = 1.8
Identities = 22/74 (29%), Positives = 40/74 (54%), Gaps = 9/74 (12%)
Query: 86 EEKEKEEEEEEEEEEKE-KDEDEEKKRE-----KKEEEEEKQKDKK---EEEVEEEEEEQ 136
EE +KE E ++E E K+E + + E K+ E ++ +++ EE + + E
Sbjct: 39 EEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMES 98
Query: 137 KNKKKEEEEEEEKE 150
+KK+E E++EKE
Sbjct: 99 LDKKEENLEKKEKE 112
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 39.4 bits (91), Expect = 6e-04
Identities = 27/75 (36%), Positives = 51/75 (68%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
+ + E K++ EE ++ EE K+K+ +E+KK E+ ++ EE+ K K EE +E EE++K
Sbjct: 1686 DEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKK 1745
Query: 139 KKKEEEEEEEKEKVE 153
++ +++EEEK+K+
Sbjct: 1746 AEEAKKDEEEKKKIA 1760
Score = 39.0 bits (90), Expect = 6e-04
Identities = 28/87 (32%), Positives = 52/87 (59%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ + E E + EE +K +EE + + EE +K E+++KK E+ ++ EE +K E
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVEMYR 156
+E EE +K ++ +++E EEK+K E +
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELK 1722
Score = 37.8 bits (87), Expect = 0.002
Identities = 25/69 (36%), Positives = 49/69 (71%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
EE K+ EEE+++ E+ K+K+ +E+KK E+ ++ EE+ K K EE ++ EE++K ++ ++
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682
Query: 145 EEEEKEKVE 153
EE+++K
Sbjct: 1683 AEEDEKKAA 1691
Score = 37.4 bits (86), Expect = 0.002
Identities = 25/72 (34%), Positives = 46/72 (63%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EN EE +K EE++++ EE +K E++EKK + ++E ++ K EE ++E EE+K
Sbjct: 1658 ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717
Query: 139 KKKEEEEEEEKE 150
++ ++ EEE +
Sbjct: 1718 AEELKKAEEENK 1729
Score = 36.7 bits (84), Expect = 0.004
Identities = 24/78 (30%), Positives = 49/78 (62%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + E+ +K++ EE+++ EE K+ +E+ + K ++ ++ E+ K K EE + EE+E+K
Sbjct: 1631 EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690
Query: 139 KKKEEEEEEEKEKVEMYR 156
+ ++E EE +K E +
Sbjct: 1691 AEALKKEAEEAKKAEELK 1708
Score = 35.9 bits (82), Expect = 0.007
Identities = 26/81 (32%), Positives = 48/81 (59%), Gaps = 8/81 (9%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV--------EEEEEE 135
EE+K+K EE ++ EE+++K + KK ++ ++ E+ K K+ EE EEE +
Sbjct: 1670 AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729
Query: 136 QKNKKKEEEEEEEKEKVEMYR 156
K ++ ++E EE+K+K E +
Sbjct: 1730 IKAEEAKKEAEEDKKKAEEAK 1750
Score = 35.9 bits (82), Expect = 0.007
Identities = 25/73 (34%), Positives = 48/73 (65%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + + E+ K+ EE + + E+ K +EEKK+ ++ +++E ++ KK EE+++ EEE K
Sbjct: 1602 EEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKI 1661
Query: 139 KKKEEEEEEEKEK 151
K EE ++ E++K
Sbjct: 1662 KAAEEAKKAEEDK 1674
Score = 34.0 bits (77), Expect = 0.031
Identities = 19/70 (27%), Positives = 40/70 (57%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE +K+ + +++ EE K+ E + + E +E E ++K E +++EE +K +
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384
Query: 144 EEEEEKEKVE 153
++ EEK+K +
Sbjct: 1385 KKAEEKKKAD 1394
Score = 32.8 bits (74), Expect = 0.090
Identities = 28/84 (33%), Positives = 54/84 (64%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E E + EE +K++ EE+++ EE K+ +E+ + K E+ ++E E+ K K EE ++EEE+
Sbjct: 1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEK 1756
Query: 136 QKNKKKEEEEEEEKEKVEMYRNPV 159
+K ++EEE++ E++ + V
Sbjct: 1757 KKIAHLKKEEEKKAEEIRKEKEAV 1780
Score = 32.4 bits (73), Expect = 0.11
Identities = 22/72 (30%), Positives = 44/72 (61%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
EE K+K +E ++ E K+K ++ +K E K+ +E K+ ++ ++ E ++ E+K K E +
Sbjct: 1493 EEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELK 1552
Query: 145 EEEEKEKVEMYR 156
+ EE +K E +
Sbjct: 1553 KAEELKKAEEKK 1564
Score = 32.0 bits (72), Expect = 0.16
Identities = 21/70 (30%), Positives = 40/70 (57%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE +K E +++ EE +K+ D ++K E K+ E + + + E E E+K + E+
Sbjct: 1312 EEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371
Query: 144 EEEEEKEKVE 153
++EE K+K +
Sbjct: 1372 KKEEAKKKAD 1381
Score = 31.6 bits (71), Expect = 0.17
Identities = 19/73 (26%), Positives = 41/73 (56%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + +E +K+ EE +++ + K+K E+ +K E + E E D+ E E+ E +K
Sbjct: 1313 EAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKK 1372
Query: 139 KKKEEEEEEEKEK 151
K++ +++ + +K
Sbjct: 1373 KEEAKKKADAAKK 1385
Score = 31.6 bits (71), Expect = 0.21
Identities = 22/84 (26%), Positives = 52/84 (61%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+A +A++ E E + E EK++EE +++ + K + EEKK+ + +++ ++ KK +E+
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADEL 1410
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
++ +K + +++ EEK+K +
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKAD 1434
Score = 31.3 bits (70), Expect = 0.23
Identities = 27/84 (32%), Positives = 53/84 (63%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+A A E + +E ++ E+ K+ EE+++ EE +K E+++ +K EE +K ++ + EEV
Sbjct: 1538 EAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEV 1597
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
+ EE+K K EE ++ E+ K++
Sbjct: 1598 MKLYEEEKKMKAEEAKKAEEAKIK 1621
Score = 31.3 bits (70), Expect = 0.25
Identities = 20/68 (29%), Positives = 46/68 (67%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE++K+ +E ++ +K+ DE ++K EKK+ +E K+K ++ ++ +E +++ + KK E
Sbjct: 1401 EEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAE 1460
Query: 144 EEEEEKEK 151
E +++ E+
Sbjct: 1461 EAKKKAEE 1468
Score = 30.9 bits (69), Expect = 0.31
Identities = 19/75 (25%), Positives = 46/75 (61%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E+ + +E K+ +++ +E K+K E+++K E K++ EE +K + ++ EE ++ +
Sbjct: 1402 EDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEE 1461
Query: 139 KKKEEEEEEEKEKVE 153
KK+ EE ++ ++ +
Sbjct: 1462 AKKKAEEAKKADEAK 1476
Score = 30.9 bits (69), Expect = 0.32
Identities = 28/91 (30%), Positives = 52/91 (57%), Gaps = 4/91 (4%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK----DKK 125
+A A + E + +E K+K EE ++ EE K+K E+ +K E K++ EE +K KK
Sbjct: 1432 KADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491
Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
EE +++ +E K + +++ +E +K E +
Sbjct: 1492 AEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522
Score = 30.5 bits (68), Expect = 0.40
Identities = 22/82 (26%), Positives = 48/82 (58%), Gaps = 4/82 (4%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK----DKKEEEVEEEEE 134
+ + +K+ EE+++ +E +K+ +ED++K E K+ K+K KK EE ++ +E
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435
Query: 135 EQKNKKKEEEEEEEKEKVEMYR 156
+K ++ ++ +E K+K E +
Sbjct: 1436 AKKKAEEAKKADEAKKKAEEAK 1457
Score = 30.5 bits (68), Expect = 0.44
Identities = 26/75 (34%), Positives = 49/75 (65%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EN EE K++ EE++++ EE +KDE+E+KK ++EEEK+ ++ +E E EE+ +
Sbjct: 1727 ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786
Query: 139 KKKEEEEEEEKEKVE 153
++ E+ E +K++
Sbjct: 1787 EEDEKRRMEVDKKIK 1801
Score = 30.1 bits (67), Expect = 0.55
Identities = 19/68 (27%), Positives = 39/68 (57%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+E +K EE+++ +E K+K E+ +K E K++ EE +K + + EE ++ + + E
Sbjct: 1293 DEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA 1352
Query: 146 EEEKEKVE 153
E ++ E
Sbjct: 1353 EAAADEAE 1360
Score = 30.1 bits (67), Expect = 0.58
Identities = 21/74 (28%), Positives = 47/74 (63%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ E +K+ +E ++ EE +K ++ +K E K+ +E K+ ++K++ E ++ E+ K +E+
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEK 1563
Query: 144 EEEEEKEKVEMYRN 157
++ EE +K E +N
Sbjct: 1564 KKAEEAKKAEEDKN 1577
Score = 30.1 bits (67), Expect = 0.59
Identities = 26/82 (31%), Positives = 50/82 (60%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
+E +N + E+ K+ EE EE + E+E+K + ++ ++ E+ K K EE + EEE
Sbjct: 1572 AEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEE 1631
Query: 135 EQKNKKKEEEEEEEKEKVEMYR 156
++K ++ +++E EEK+K E +
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEELK 1653
Score = 30.1 bits (67), Expect = 0.67
Identities = 25/70 (35%), Positives = 45/70 (64%), Gaps = 1/70 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE K K EE ++ EEEK+K E + KK+E +E+++ ++ K EEE + + E+ K +E+
Sbjct: 1615 AEEAKIKAEELKKAEEEKKKVE-QLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673
Query: 144 EEEEEKEKVE 153
+++ E+ K
Sbjct: 1674 KKKAEEAKKA 1683
Score = 30.1 bits (67), Expect = 0.67
Identities = 25/75 (33%), Positives = 44/75 (58%), Gaps = 3/75 (4%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK---DKKEEEVEEEEEEQKNKKK 141
EE K+ EE +++ EE K+ DE ++K E K+ +E K+K KK+ + ++ E K K
Sbjct: 1454 EEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKAD 1513
Query: 142 EEEEEEEKEKVEMYR 156
E ++ EE +K + +
Sbjct: 1514 EAKKAEEAKKADEAK 1528
Score = 30.1 bits (67), Expect = 0.70
Identities = 23/73 (31%), Positives = 45/73 (61%), Gaps = 3/73 (4%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK---DKKEEEVEEEEEEQKNKK 140
+K+ +E +++ EE+K+ DE ++K E K+ +E K+K KK EE +++ EE K
Sbjct: 1414 AAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD 1473
Query: 141 KEEEEEEEKEKVE 153
+ +++ EE +K +
Sbjct: 1474 EAKKKAEEAKKAD 1486
Score = 30.1 bits (67), Expect = 0.71
Identities = 21/69 (30%), Positives = 47/69 (68%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
EE+K+ +E +++ EE K+ DE ++K E K+ EE K+K ++ ++ +E +++ + KK +E
Sbjct: 1428 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADE 1487
Query: 145 EEEEKEKVE 153
+++ E+ +
Sbjct: 1488 AKKKAEEAK 1496
Score = 29.7 bits (66), Expect = 0.73
Identities = 20/72 (27%), Positives = 46/72 (63%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E ++ + + E E +E E E++ + EKK+EE +K+ D +++ EE+++ + KKK
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
Query: 142 EEEEEEEKEKVE 153
EE++++ ++++
Sbjct: 1400 AEEDKKKADELK 1411
Score = 29.7 bits (66), Expect = 0.75
Identities = 21/73 (28%), Positives = 45/73 (61%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + +E +K+ EE ++ +E +K+ +E ++K E K+ E K+K + ++ EE ++ +
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527
Query: 139 KKKEEEEEEEKEK 151
KK EE ++ ++ K
Sbjct: 1528 KKAEEAKKADEAK 1540
Score = 29.7 bits (66), Expect = 0.80
Identities = 22/82 (26%), Positives = 52/82 (63%)
Query: 68 LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
L +A +A + + + + EEK+K +E +++ EE +K ++ +KK E+ ++ EE +K +E
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEA 1469
Query: 128 EVEEEEEEQKNKKKEEEEEEEK 149
+ +E +++ + K+ +E ++K
Sbjct: 1470 KKADEAKKKAEEAKKADEAKKK 1491
Score = 29.7 bits (66), Expect = 0.83
Identities = 25/80 (31%), Positives = 47/80 (58%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+A A + E + +E K+K EE ++ +E K+K E+ +K E K++ EE +K + ++
Sbjct: 1419 KADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478
Query: 130 EEEEEEQKNKKKEEEEEEEK 149
EE ++ KK+ EE ++K
Sbjct: 1479 AEEAKKADEAKKKAEEAKKK 1498
Score = 29.7 bits (66), Expect = 0.88
Identities = 24/70 (34%), Positives = 45/70 (64%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE+K+K EE +++EEEK+K +K+ EKK EE K+K+ EE +EE+E++ + ++
Sbjct: 1739 AEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDK 1798
Query: 144 EEEEEKEKVE 153
+ ++ +
Sbjct: 1799 KIKDIFDNFA 1808
Score = 29.3 bits (65), Expect = 1.1
Identities = 26/83 (31%), Positives = 53/83 (63%), Gaps = 2/83 (2%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE--KKEEEEEKQKDKKEE 127
+A +A + E + +E K+K EE++++ +E +K +KK + KK+ EE+K+ D+ ++
Sbjct: 1379 KADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438
Query: 128 EVEEEEEEQKNKKKEEEEEEEKE 150
+ EE ++ + KKK EE ++ +E
Sbjct: 1439 KAEEAKKADEAKKKAEEAKKAEE 1461
Score = 29.0 bits (64), Expect = 1.4
Identities = 32/82 (39%), Positives = 50/82 (60%), Gaps = 9/82 (10%)
Query: 84 EEEEKEKEEEEEEEEEE---------KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E EEK+K EE ++ EEE KE +ED++K E K++EEEK+K ++ EE++
Sbjct: 1711 EAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKA 1770
Query: 135 EQKNKKKEEEEEEEKEKVEMYR 156
E+ K+KE EEE ++ + R
Sbjct: 1771 EEIRKEKEAVIEEELDEEDEKR 1792
Score = 29.0 bits (64), Expect = 1.5
Identities = 23/78 (29%), Positives = 45/78 (57%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + EE +K +E++ + +E + EE + E+ + E++K K EE ++ EE +
Sbjct: 1562 EKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIK 1621
Query: 139 KKKEEEEEEEKEKVEMYR 156
++ ++ EEEK+KVE +
Sbjct: 1622 AEELKKAEEEKKKVEQLK 1639
Score = 27.4 bits (60), Expect = 4.4
Identities = 24/81 (29%), Positives = 50/81 (61%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+A A E + +E ++ E++K+ +E ++ EE +K E+++K E K+ EE+K ++ E
Sbjct: 1526 EAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEE 1585
Query: 130 EEEEEEQKNKKKEEEEEEEKE 150
++ EE + ++ + EEEK+
Sbjct: 1586 AKKAEEARIEEVMKLYEEEKK 1606
Score = 27.4 bits (60), Expect = 4.7
Identities = 21/79 (26%), Positives = 50/79 (63%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+A A + E + +E K+ EE ++ +E +K ++ + + +K EE+++ + KK EE+
Sbjct: 1498 KADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEEL 1557
Query: 130 EEEEEEQKNKKKEEEEEEE 148
++ EE++K ++ ++ EE++
Sbjct: 1558 KKAEEKKKAEEAKKAEEDK 1576
Score = 27.0 bits (59), Expect = 5.7
Identities = 23/78 (29%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + +E K+ E +++ +E K K E+ +K E K+ EE K+ D+ ++ E+++ ++
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAK-KAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELK 1552
Query: 139 KKKEEEEEEEKEKVEMYR 156
K +E ++ EEK+K E +
Sbjct: 1553 KAEELKKAEEKKKAEEAK 1570
Score = 27.0 bits (59), Expect = 5.8
Identities = 19/72 (26%), Positives = 36/72 (50%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
+++ + KE +E EE K E+ +K K EE K ++ K++ + + E+ K +
Sbjct: 1078 DFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAE 1137
Query: 142 EEEEEEEKEKVE 153
+ + EE K E
Sbjct: 1138 DARKAEEARKAE 1149
Score = 26.6 bits (58), Expect = 8.1
Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 9/77 (11%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK----DKKEEEVEEEEEEQKN--- 138
EEK+K +E ++ EE+K+ DE ++K E K+ +E K+K KK + +++ EE K
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAE 1346
Query: 139 --KKKEEEEEEEKEKVE 153
K + E +E E E
Sbjct: 1347 AAKAEAEAAADEAEAAE 1363
>gnl|CDD|214653 smart00410, IG_like, Immunoglobulin like. IG domains that cannot
be classified into one of IGv1, IGc1, IGc2, IG.
Length = 85
Score = 36.7 bits (85), Expect = 7e-04
Identities = 10/44 (22%), Positives = 13/44 (29%), Gaps = 2/44 (4%)
Query: 23 GDSSHGVYNLRIRNVSLTDDAEFQCQVGPAQYHKPIRANARLTV 66
S L I NV+ D + C + LTV
Sbjct: 44 VSRSGSTSTLTISNVTPEDSGTYTCAATN--SSGSASSGTTLTV 85
>gnl|CDD|214652 smart00409, IG, Immunoglobulin.
Length = 85
Score = 36.7 bits (85), Expect = 7e-04
Identities = 10/44 (22%), Positives = 13/44 (29%), Gaps = 2/44 (4%)
Query: 23 GDSSHGVYNLRIRNVSLTDDAEFQCQVGPAQYHKPIRANARLTV 66
S L I NV+ D + C + LTV
Sbjct: 44 VSRSGSTSTLTISNVTPEDSGTYTCAATN--SSGSASSGTTLTV 85
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 38.5 bits (89), Expect = 8e-04
Identities = 22/83 (26%), Positives = 44/83 (53%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
+ E + + E+ E+ E +E D+D+E+ + +KE E E Q K E E
Sbjct: 462 LMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEA 521
Query: 132 EEEEQKNKKKEEEEEEEKEKVEM 154
+++ K+K K+ + +EE+E+ ++
Sbjct: 522 DKDVNKSKNKKRKVDEEEEEKKL 544
Score = 31.2 bits (70), Expect = 0.22
Identities = 25/84 (29%), Positives = 45/84 (53%), Gaps = 4/84 (4%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
Q I SET E ++ + K K+ + +EEEEEK+ ++KK ++ K + K
Sbjct: 511 QGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAK---- 566
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
+EE+ E KKK++ +++K +
Sbjct: 567 KEEQAENLKKKKKQIAKQKKLDSK 590
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 38.4 bits (90), Expect = 9e-04
Identities = 19/66 (28%), Positives = 33/66 (50%), Gaps = 4/66 (6%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E E+E+ E E++K +++K EKK E+EE +K +++ E K K +
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKA----EAAAKKAKGPD 450
Query: 145 EEEEKE 150
E +K
Sbjct: 451 GETKKV 456
Score = 34.5 bits (80), Expect = 0.019
Identities = 16/68 (23%), Positives = 31/68 (45%), Gaps = 1/68 (1%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
A E EN E++K +++ + E+K + E E +K K++ E K K +
Sbjct: 392 LLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKE-EAEKAAAKKKAEAAAKKAKGPD 450
Query: 129 VEEEEEEQ 136
E ++ +
Sbjct: 451 GETKKVDP 458
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 38.1 bits (89), Expect = 0.001
Identities = 27/73 (36%), Positives = 40/73 (54%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E EWE EE E ++E E + + E E E +EE++E K KE+ EE EE +
Sbjct: 100 EEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEE 159
Query: 139 KKKEEEEEEEKEK 151
+ EEEE E +++
Sbjct: 160 EAAEEEEAEAEKE 172
Score = 36.9 bits (86), Expect = 0.002
Identities = 22/81 (27%), Positives = 49/81 (60%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
A + LE+++ ++EE+E E EE+E+ +++ + D E +E + + E +++K E + +
Sbjct: 87 EAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAK 146
Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
E+ + +E+EEE +E+
Sbjct: 147 EDSDEELSEEDEEEAAEEEEA 167
Score = 36.9 bits (86), Expect = 0.002
Identities = 24/79 (30%), Positives = 51/79 (64%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
++++ EEEE E EE+E+ ++E + D + +K+ E + E+E++KD+ ++ +E+ +
Sbjct: 91 GLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSD 150
Query: 135 EQKNKKKEEEEEEEKEKVE 153
E+ +++ EEE EE+E
Sbjct: 151 EELSEEDEEEAAEEEEAEA 169
Score = 33.9 bits (78), Expect = 0.025
Identities = 24/61 (39%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
+ EW + E +KE E + E+E+EKDE KK ++ +EE ++D++E EEE E +K
Sbjct: 114 DEGEWIDVESDKEIESSDSEDEEEKDE-AAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172
Query: 139 K 139
K
Sbjct: 173 K 173
Score = 33.5 bits (77), Expect = 0.036
Identities = 27/78 (34%), Positives = 47/78 (60%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E + E E+ + ++++EEEE E E+DED + + E + E +K+ + + E EEE++E
Sbjct: 81 EERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDE 140
Query: 136 QKNKKKEEEEEEEKEKVE 153
K KE+ +EE E+ E
Sbjct: 141 AAKKAKEDSDEELSEEDE 158
Score = 28.5 bits (64), Expect = 1.6
Identities = 17/50 (34%), Positives = 31/50 (62%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
++E EEE+ E ++ +E+ +E+ +EDEE+ E++E E EK+K
Sbjct: 124 DKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173
Score = 27.7 bits (62), Expect = 2.6
Identities = 13/50 (26%), Positives = 33/50 (66%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
S E ++ +EEEK++ ++ +E+ ++E E++E++ ++EE E +++
Sbjct: 123 SDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 37.8 bits (88), Expect = 0.001
Identities = 21/79 (26%), Positives = 49/79 (62%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + + E E++ +E E +E++K ++E+ + E+ E+EE + ++ EV+ EE + +N
Sbjct: 4 ECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDEN 63
Query: 139 KKKEEEEEEEKEKVEMYRN 157
K +EE ++ + ++E ++
Sbjct: 64 NKLKEENKKLENELEALKD 82
Score = 35.9 bits (83), Expect = 0.004
Identities = 23/76 (30%), Positives = 44/76 (57%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + + + EE+ +E E KE+D+ +E+ E +E E+E+ + EE E + EE K+
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKD 61
Query: 139 KKKEEEEEEEKEKVEM 154
+ + +EE +K + E+
Sbjct: 62 ENNKLKEENKKLENEL 77
Score = 32.8 bits (75), Expect = 0.058
Identities = 26/74 (35%), Positives = 43/74 (58%), Gaps = 5/74 (6%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK-----N 138
E E++ +E E +EE+K K+ED E + +KEE E ++ E ++EE ++E N
Sbjct: 11 ENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEEN 70
Query: 139 KKKEEEEEEEKEKV 152
KK E E E K+++
Sbjct: 71 KKLENELEALKDRL 84
Score = 32.4 bits (74), Expect = 0.063
Identities = 22/69 (31%), Positives = 43/69 (62%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E++ K+ + E EE+ K+ + +++ + KEE+ E ++ +KEE +E+ EE + K +E +
Sbjct: 1 MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60
Query: 145 EEEEKEKVE 153
+E K K E
Sbjct: 61 DENNKLKEE 69
Score = 29.3 bits (66), Expect = 0.70
Identities = 18/53 (33%), Positives = 28/53 (52%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
E E E E+ ++EE E+ EE + + EE K E + +EE +K + E E
Sbjct: 26 EDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKLENELE 78
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 38.1 bits (89), Expect = 0.001
Identities = 15/77 (19%), Positives = 41/77 (53%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
+ +++ E E ++ E EK + +DE+++ ++E + + +K++ + E + + +
Sbjct: 643 RRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQ 702
Query: 140 KKEEEEEEEKEKVEMYR 156
+E E+EE+ + R
Sbjct: 703 SVQETEQEERVQQVQPR 719
Score = 32.3 bits (74), Expect = 0.12
Identities = 11/58 (18%), Positives = 29/58 (50%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
+ E EK ++E+++ ++ + EK++ ++E + EE+ +E E++
Sbjct: 653 TRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQE 710
Score = 31.5 bits (72), Expect = 0.18
Identities = 14/63 (22%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKD-EDEEKKREKKEEEEEKQKDKKEEEVEEE 132
+ET E+ + E EK + ++E+++ +E+ ++KR+ ++E + +++ + E+
Sbjct: 650 TAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSVQETEQ 709
Query: 133 EEE 135
EE
Sbjct: 710 EER 712
Score = 31.2 bits (71), Expect = 0.27
Identities = 22/99 (22%), Positives = 35/99 (35%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
PQ I+ S+ E E + E ++E + +E E E + E + E
Sbjct: 928 PQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEV 987
Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYRNPVP 167
E E + + + + VE PM R P P
Sbjct: 988 AAEVETVTAVEPEVAPAQVPEATVEHNHATAPMTRAPAP 1026
Score = 30.4 bits (69), Expect = 0.45
Identities = 16/78 (20%), Positives = 39/78 (50%), Gaps = 7/78 (8%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDE-EKKREKKEEEEEKQKDKKEEEVEEEEE 134
+ + E ++ E E+ +++++ E++R + +E+ + Q++ K VEE+
Sbjct: 645 QAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQSV 704
Query: 135 EQKNKKKEEEEEEEKEKV 152
+ E E+EE ++V
Sbjct: 705 Q------ETEQEERVQQV 716
Score = 28.5 bits (64), Expect = 1.8
Identities = 16/70 (22%), Positives = 36/70 (51%), Gaps = 5/70 (7%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEK-----KREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EE K +E+ + E + E+ +D K +R++ E + + + E E EE ++N
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRN 642
Query: 139 KKKEEEEEEE 148
+++ +++ E
Sbjct: 643 RRQAQQQTAE 652
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 37.8 bits (88), Expect = 0.001
Identities = 14/65 (21%), Positives = 32/65 (49%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ EE +++E+EEEE+ E +E+E + E++ + + + + E K K+
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESA 425
Query: 144 EEEEE 148
+ +
Sbjct: 426 SDSDS 430
Score = 37.8 bits (88), Expect = 0.001
Identities = 15/70 (21%), Positives = 38/70 (54%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
L+ ++EE +++++EEEE+ +E E++E E+ + E + E+ + + + E +
Sbjct: 362 LDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKAD 421
Query: 138 NKKKEEEEEE 147
+ + + E
Sbjct: 422 KESASDSDSE 431
Score = 37.8 bits (88), Expect = 0.001
Identities = 16/65 (24%), Positives = 33/65 (50%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE E E+EEEE+ ++ ++E+ E E+ + E + +V + E + +K+
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESAS 426
Query: 144 EEEEE 148
+ + E
Sbjct: 427 DSDSE 431
Score = 36.6 bits (85), Expect = 0.004
Identities = 18/69 (26%), Positives = 35/69 (50%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
+ + E+ +E+E+EEEE+ DE EE++ E EEE + ++ E + K
Sbjct: 361 RLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKA 420
Query: 142 EEEEEEEKE 150
++E + +
Sbjct: 421 DKESASDSD 429
Score = 35.5 bits (82), Expect = 0.008
Identities = 15/66 (22%), Positives = 34/66 (51%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+ E+ E+E+EEEE+ ++ E+EE + ++E + ++ E + + + KE
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESA 425
Query: 145 EEEEKE 150
+ + E
Sbjct: 426 SDSDSE 431
Score = 34.7 bits (80), Expect = 0.013
Identities = 16/67 (23%), Positives = 28/67 (41%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
P + E+ E E+ E EEEE E+ EE+ E+ E + + K ++E
Sbjct: 364 PIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKE 423
Query: 129 VEEEEEE 135
+ +
Sbjct: 424 SASDSDS 430
Score = 33.2 bits (76), Expect = 0.049
Identities = 17/50 (34%), Positives = 29/50 (58%)
Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
E K R+K+ + ++ +E E+EEEEQ++ + EEEE E+ E+
Sbjct: 348 STKESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEG 397
Score = 33.2 bits (76), Expect = 0.051
Identities = 15/65 (23%), Positives = 30/65 (46%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
+ + EE E +++EE++R + EEEE + ++E E+ ++ + E
Sbjct: 359 RARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSES 418
Query: 149 KEKVE 153
K E
Sbjct: 419 KADKE 423
Score = 31.6 bits (72), Expect = 0.15
Identities = 15/60 (25%), Positives = 26/60 (43%), Gaps = 1/60 (1%)
Query: 76 ETLENNEWEEEEKEKEEEEE-EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E + E + ++ +EEE E EEE + ED + + + K KE + + E
Sbjct: 372 EDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 37.7 bits (88), Expect = 0.001
Identities = 25/73 (34%), Positives = 42/73 (57%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E ++ E++ E E+EK+E +K EK +E E+Q + E++++ E Q + + E
Sbjct: 214 ESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREF 273
Query: 145 EEEEKEKVEMYRN 157
+E KEKVE RN
Sbjct: 274 NKEIKEKVEEERN 286
Score = 32.3 bits (74), Expect = 0.085
Identities = 23/70 (32%), Positives = 37/70 (52%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEEE E+ +E+ EE + +E+ + E KE EKQ + E +EE ++ +K +
Sbjct: 186 EEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQ 245
Query: 144 EEEEEKEKVE 153
E E + E E
Sbjct: 246 ELERQAEAHE 255
Score = 30.8 bits (70), Expect = 0.36
Identities = 18/64 (28%), Positives = 33/64 (51%), Gaps = 2/64 (3%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
E + EEEEE E KEK E+ K E++ + K+ ++ E + +K+E ++
Sbjct: 181 ELKAEEEEELERALKEKREELLSKLEEELLAR--LESKEAALEKQLRLEFEREKEELRKK 238
Query: 147 EEKE 150
E++
Sbjct: 239 YEEK 242
Score = 30.4 bits (69), Expect = 0.44
Identities = 15/56 (26%), Positives = 30/56 (53%)
Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
E++E E +EK+EE K +++ +E +E + + + E E E++E + Y
Sbjct: 186 EEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEE 241
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 37.8 bits (88), Expect = 0.002
Identities = 22/55 (40%), Positives = 32/55 (58%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
A LE + E ++EEE+EEEEEE+++DE K+ EE EE + K E+
Sbjct: 375 AFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYED 429
Score = 37.4 bits (87), Expect = 0.002
Identities = 27/78 (34%), Positives = 45/78 (57%), Gaps = 7/78 (8%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
ETL +W + KE E++ + + ED +R+ +EEE+E EEE EE+E+E
Sbjct: 354 ETLNKQKWTKA-KETEQDYILDAFSALEIEDANTERDDEEEEDE------EEEEEEDEDE 406
Query: 136 QKNKKKEEEEEEEKEKVE 153
+K+ ++EE E++ VE
Sbjct: 407 GPSKEHSDDEEFEEDDVE 424
Score = 33.2 bits (76), Expect = 0.051
Identities = 18/47 (38%), Positives = 29/47 (61%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
T ++E EE+E+E+EEE+E+E KE +DEE + + E + E
Sbjct: 385 NTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSD 431
Score = 31.3 bits (71), Expect = 0.22
Identities = 18/67 (26%), Positives = 39/67 (58%), Gaps = 4/67 (5%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
+ E E+ E+++EEEE+EEE+E+++++E ++ ++EE + E++VE +
Sbjct: 372 ILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFE----EDDVESKY 427
Query: 134 EEQKNKK 140
E+
Sbjct: 428 EDSDGNS 434
Score = 28.6 bits (64), Expect = 1.7
Identities = 14/58 (24%), Positives = 30/58 (51%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E E E ++EEE++++E+EE+ ++ +E ++ EE+ E + E +
Sbjct: 377 SALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNS 434
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 38.0 bits (88), Expect = 0.002
Identities = 14/55 (25%), Positives = 26/55 (47%), Gaps = 3/55 (5%)
Query: 97 EEEEKEK-DEDEEKKREKKEEEEEKQKDKKE--EEVEEEEEEQKNKKKEEEEEEE 148
+++ KEK D+E K E E E ++ + EE + N KK++ + +
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852
Score = 34.9 bits (80), Expect = 0.017
Identities = 18/85 (21%), Positives = 36/85 (42%), Gaps = 2/85 (2%)
Query: 51 PAQYHKPIRANARL--TVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEE 108
P Y + + R T +L I A+ + E+ +KE + E E E +
Sbjct: 768 PRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTI 827
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEE 133
K+RE+K + + K+++ + +
Sbjct: 828 KQREEKGIDAPAILNVKKKKPYKVD 852
Score = 31.8 bits (72), Expect = 0.14
Identities = 12/48 (25%), Positives = 23/48 (47%), Gaps = 2/48 (4%)
Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
K ++K +EK ++E + + VE E E K++EE+ +
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIES--PSVETEGERCTIKQREEKGIDA 837
Score = 28.0 bits (62), Expect = 2.5
Identities = 10/45 (22%), Positives = 22/45 (48%)
Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
+ K++ +EK + KE ++E E + ++ ++ EEK
Sbjct: 790 TIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKG 834
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 37.3 bits (87), Expect = 0.002
Identities = 21/74 (28%), Positives = 41/74 (55%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
+ E +E + EEE E K+K ++ +++EK+ EE E+ ++ +E+ + EE +
Sbjct: 297 NSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQK 356
Query: 139 KKKEEEEEEEKEKV 152
K ++ E+ EK K
Sbjct: 357 KLEDLEKRLEKLKS 370
Score = 31.9 bits (73), Expect = 0.13
Identities = 15/66 (22%), Positives = 39/66 (59%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E + EEE E +++ EK ++EK+ E+ E+ +++ + ++EE +++ ++ +K E+
Sbjct: 308 EARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEK 367
Query: 146 EEEKEK 151
+ +
Sbjct: 368 LKSNKS 373
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 36.6 bits (85), Expect = 0.002
Identities = 14/62 (22%), Positives = 31/62 (50%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E + +EE E ++E E+++K +K +E+ +Q++ K + E E K +
Sbjct: 36 IEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKATK 95
Query: 144 EE 145
++
Sbjct: 96 KQ 97
Score = 27.3 bits (61), Expect = 2.6
Identities = 11/48 (22%), Positives = 25/48 (52%)
Query: 99 EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
++KD + ++E E +K+ EE+ + ++ K K ++EE +
Sbjct: 30 RYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELK 77
Score = 27.0 bits (60), Expect = 3.8
Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 3/60 (5%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEE---EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
E + + + K EKK + + EE + DK+ EE+++ K KE+ E+EE +
Sbjct: 19 ENDGKLQVKPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKN 78
Score = 26.6 bits (59), Expect = 4.4
Identities = 15/72 (20%), Positives = 30/72 (41%), Gaps = 8/72 (11%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEE--------KKREKKEEEEEKQKDKKEE 127
+ L E +++ ++ +E+ E+EE + E K +K+ E ++
Sbjct: 51 KELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKATKKQLFSSEYEQTSSSS 110
Query: 128 EVEEEEEEQKNK 139
E EEE +K
Sbjct: 111 ESTSEEETKKTS 122
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 37.5 bits (87), Expect = 0.002
Identities = 16/71 (22%), Positives = 32/71 (45%), Gaps = 1/71 (1%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK-EEEVEEEEEEQK 137
E ++ E+E + E+ E E E + E E++E++ + + E DK + +
Sbjct: 657 ETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPD 716
Query: 138 NKKKEEEEEEE 148
+ E+EE
Sbjct: 717 ALDGGDSEDEE 727
Score = 36.4 bits (84), Expect = 0.005
Identities = 20/74 (27%), Positives = 37/74 (50%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
++ E EEE+E++ EE + DEDE + E E E E + EE+ +E++ E +
Sbjct: 642 QQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPID 701
Query: 145 EEEEKEKVEMYRNP 158
+ + ++ P
Sbjct: 702 KAVRRALPKVLNLP 715
Score = 33.7 bits (77), Expect = 0.034
Identities = 17/60 (28%), Positives = 37/60 (61%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
+ L+ +E + ++ E EEE+E++ E+ +D+E + E E+ E + + E+ E+E+E+
Sbjct: 630 DVLKADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQED 689
Score = 31.4 bits (71), Expect = 0.20
Identities = 14/82 (17%), Positives = 37/82 (45%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E +E++E E E ++ EE+E+E++ E +E + K + + + E
Sbjct: 665 CEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSE 724
Query: 135 EQKNKKKEEEEEEEKEKVEMYR 156
+++ E+ + ++++
Sbjct: 725 DEEGMDDEQMMRLDTYLAQIFK 746
Score = 26.8 bits (59), Expect = 6.3
Identities = 17/68 (25%), Positives = 30/68 (44%), Gaps = 1/68 (1%)
Query: 70 QAISASETLENNEWE-EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
+AI SE+ ++ E EE E+E++ E E D+ + K + E+
Sbjct: 666 EAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSED 725
Query: 129 VEEEEEEQ 136
E ++EQ
Sbjct: 726 EEGMDDEQ 733
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 36.6 bits (85), Expect = 0.002
Identities = 21/75 (28%), Positives = 48/75 (64%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
+ +E E + E+E + E++ + E+EEKK +KK++E +K+K +K+++ E+
Sbjct: 118 YGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177
Query: 133 EEEQKNKKKEEEEEE 147
E + +KKK++++++
Sbjct: 178 VEPKGSKKKKKKKKK 192
Score = 35.5 bits (82), Expect = 0.005
Identities = 22/66 (33%), Positives = 41/66 (62%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E E E EK+ + ++E + EEEEK++ KK++EV++E++E+K+KK++ E
Sbjct: 120 APDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVE 179
Query: 146 EEEKEK 151
+ +K
Sbjct: 180 PKGSKK 185
Score = 35.1 bits (81), Expect = 0.007
Identities = 18/68 (26%), Positives = 43/68 (63%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E + E E+E K + E E ++ EKKE++++K+ K+++E ++++E+ K +
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSK 184
Query: 144 EEEEEKEK 151
+++++K+K
Sbjct: 185 KKKKKKKK 192
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 37.4 bits (86), Expect = 0.003
Identities = 19/42 (45%), Positives = 27/42 (64%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
K +K+EE EK K + E++ EE E +K K+KE E E E+E
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKERERERERE 617
Score = 35.4 bits (81), Expect = 0.010
Identities = 20/48 (41%), Positives = 30/48 (62%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
K ++ EE EK K E E+K RE++E E+EK+K+++ E E E K
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623
Score = 34.3 bits (78), Expect = 0.027
Identities = 19/51 (37%), Positives = 32/51 (62%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
AS L E EK K E E++ EE+E+++++EK+RE++ E E ++ K
Sbjct: 573 ASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623
Score = 33.1 bits (75), Expect = 0.056
Identities = 16/38 (42%), Positives = 26/38 (68%)
Query: 114 KEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
K+ EE +K K+E E + EE ++ K+KE+E E E+E+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKERERERER 616
Score = 32.4 bits (73), Expect = 0.12
Identities = 15/41 (36%), Positives = 30/41 (73%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
K++EE E+ + E E+ EE++REK++E+E +++ ++E E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619
Score = 29.3 bits (65), Expect = 0.97
Identities = 16/42 (38%), Positives = 30/42 (71%), Gaps = 1/42 (2%)
Query: 109 KKREKKEEEEEKQKDKK-EEEVEEEEEEQKNKKKEEEEEEEK 149
KKRE+ E+ +++ ++K EE E E+E++K +++E E E E+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAER 620
Score = 29.3 bits (65), Expect = 1.1
Identities = 16/52 (30%), Positives = 35/52 (67%), Gaps = 1/52 (1%)
Query: 72 ISASETLENNEWEEEEKEKEEEEE-EEEEEKEKDEDEEKKREKKEEEEEKQK 122
+++S+ + E E+ ++E E++ EE E+EK++++E++RE++ E E K
Sbjct: 572 LASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623
Score = 27.0 bits (59), Expect = 6.1
Identities = 13/46 (28%), Positives = 31/46 (67%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD 123
L +++ ++ +E E+ + E E+K ++E E +K ++KE E E++++
Sbjct: 572 LASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKERERERERE 617
Score = 27.0 bits (59), Expect = 6.5
Identities = 15/39 (38%), Positives = 27/39 (69%), Gaps = 1/39 (2%)
Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
E+ EK + E E QK ++E E E+E+E+++ +++E E E
Sbjct: 582 EEAVEKAKREAE-QKAREEREREKEKEKEREREREREAE 619
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 36.7 bits (85), Expect = 0.003
Identities = 21/56 (37%), Positives = 37/56 (66%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
EEEE+ EK EKK ++ E E+ ++D+K+EE EEEEEE+ +++++++
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
Score = 33.2 bits (76), Expect = 0.033
Identities = 14/55 (25%), Positives = 39/55 (70%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EEEE E+ E++ KE + ++ + ++K+EEEE+++++++E+ ++++++ +
Sbjct: 144 EEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198
Score = 27.8 bits (62), Expect = 2.5
Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
E+EE EK E+K K+ + E+V+EE+E+ + + EEEEEEE E +
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEE--EEEEEEEDEDFD 190
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 36.7 bits (85), Expect = 0.003
Identities = 18/64 (28%), Positives = 32/64 (50%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E + EEEE + E E+ + E+ +E+ ++ E+ E+E+E + + EEE
Sbjct: 386 AELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEE 445
Query: 145 EEEE 148
EE
Sbjct: 446 EEAR 449
Score = 33.6 bits (77), Expect = 0.039
Identities = 19/70 (27%), Positives = 36/70 (51%), Gaps = 2/70 (2%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+E E K E + EE+E + E ++R + E+ E ++D + E+ E++ E+
Sbjct: 378 QEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFED--ED 435
Query: 144 EEEEEKEKVE 153
E EE + + E
Sbjct: 436 ELEEAQPEEE 445
Score = 32.8 bits (75), Expect = 0.073
Identities = 17/69 (24%), Positives = 36/69 (52%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E ++ E+ + E +++E+E +RE +E E +Q ++ EE+ E + + E+
Sbjct: 374 AEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFED 433
Query: 145 EEEEKEKVE 153
E+E +E
Sbjct: 434 EDELEEAQP 442
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 36.7 bits (85), Expect = 0.003
Identities = 20/73 (27%), Positives = 40/73 (54%), Gaps = 3/73 (4%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE---EEVEEEEEEQKNKK 140
++ +E +EE+ ++++ + +ED E RE+ E +E ++D+ E E EE E +
Sbjct: 224 DDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEAS 283
Query: 141 KEEEEEEEKEKVE 153
++ E +E E E
Sbjct: 284 EDSESDESDEDTE 296
Score = 34.8 bits (80), Expect = 0.013
Identities = 16/60 (26%), Positives = 29/60 (48%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
+ ++ E+ E EE E +E E+DE E E +E E + + ++ E +E +E
Sbjct: 234 EDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDE 293
Score = 33.7 bits (77), Expect = 0.035
Identities = 22/77 (28%), Positives = 37/77 (48%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
+ E++E EE E +E EE+E E E EE + + E E+ + D+ +E+ E
Sbjct: 240 QPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTETPG 299
Query: 134 EEQKNKKKEEEEEEEKE 150
E+ + E EE +
Sbjct: 300 EDARPATPFTELMEEVD 316
Score = 32.1 bits (73), Expect = 0.13
Identities = 16/62 (25%), Positives = 35/62 (56%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
E+ ++ EE+ E+D D+++ ++ E +++ + +E EE+E E + + EE E +
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDA 279
Query: 149 KE 150
E
Sbjct: 280 AE 281
Score = 31.0 bits (70), Expect = 0.29
Identities = 19/81 (23%), Positives = 39/81 (48%), Gaps = 2/81 (2%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE--EE 133
+ L + + EE + EE+ +EE+ + D+ + + + EE + D+ EE+ E +
Sbjct: 211 DMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDG 270
Query: 134 EEQKNKKKEEEEEEEKEKVEM 154
E ++ + E E+ E E
Sbjct: 271 EGEEGEMDAAEASEDSESDES 291
Score = 29.0 bits (65), Expect = 1.3
Identities = 17/71 (23%), Positives = 34/71 (47%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + +++ E+ E EE + DE EE + E + E E+ + E E+ E ++ +
Sbjct: 233 EEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESD 292
Query: 139 KKKEEEEEEEK 149
+ E E+ +
Sbjct: 293 EDTETPGEDAR 303
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 36.7 bits (85), Expect = 0.004
Identities = 20/67 (29%), Positives = 39/67 (58%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EKE E++ E E+ ++ E +K+ E++ EE ++++ K+ E+E+E +E K+E
Sbjct: 521 SALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKE 580
Query: 144 EEEEEKE 150
E +E
Sbjct: 581 VESIIRE 587
Score = 34.0 bits (78), Expect = 0.030
Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 5/97 (5%)
Query: 61 NARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKE-----KDEDEEKKREKKE 115
N L +L + + LE E +E+E+ ++ E E+E +E K E E RE KE
Sbjct: 531 NEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKE 590
Query: 116 EEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
++ K K+ K E + +E K K ++ + +K+
Sbjct: 591 KKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKI 627
Score = 33.6 bits (77), Expect = 0.036
Identities = 23/88 (26%), Positives = 50/88 (56%), Gaps = 1/88 (1%)
Query: 64 LTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD 123
+ V++ + + + LE E+ KE+E+ ++E E+E +E +E++R KK E E++ ++
Sbjct: 513 INVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQE 572
Query: 124 KKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
+ +E E + + KE++ + KE
Sbjct: 573 ALKALKKEVESIIR-ELKEKKIHKAKEI 599
Score = 33.6 bits (77), Expect = 0.039
Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 5/73 (6%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEK-KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E KE+ E+ EK+ +++ EK +E+EK K +E+E+E EE K +++ ++
Sbjct: 508 EFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLK----KELEQEMEELKERERNKK 563
Query: 145 EEEEKEKVEMYRN 157
E EKE E +
Sbjct: 564 LELEKEAQEALKA 576
Score = 32.5 bits (74), Expect = 0.10
Identities = 18/72 (25%), Positives = 41/72 (56%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
+N E+ KE+E+ ++E E+E E+ ++ E+ ++ + E+E ++ K ++ E +
Sbjct: 529 QKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIREL 588
Query: 138 NKKKEEEEEEEK 149
+KK + +E K
Sbjct: 589 KEKKIHKAKEIK 600
>gnl|CDD|239285 cd02987, Phd_like_Phd, Phosducin (Phd)-like family, Phd subfamily;
Phd is a cytosolic regulator of G protein functions. It
specifically binds G protein betagamma (Gbg)-subunits
with high affinity, resulting in the solubilization of
Gbg from the plasma membrane. This impedes the formation
of a functional G protein trimer (G protein
alphabetagamma), thereby inhibiting G protein-mediated
signal transduction. Phd also inhibits the GTPase
activity of G protein alpha. Phd can be phosphorylated
by protein kinase A and G protein-coupled receptor
kinase 2, leading to its inactivation. Phd was
originally isolated from the retina, where it is highly
expressed and has been implicated to play an important
role in light adaptation. It is also found in the pineal
gland, liver, spleen, striated muscle and the brain. The
C-terminal domain of Phd adopts a thioredoxin fold, but
it does not contain a CXXC motif. Phd interacts with G
protein beta mostly through the N-terminal helical
domain.
Length = 175
Score = 35.7 bits (83), Expect = 0.004
Identities = 11/45 (24%), Positives = 30/45 (66%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
N+W + ++ KE E+E+++++++K+E ++ RE++ +E +
Sbjct: 14 NDWRKFKQLKESEQEDDDDDEDKEEFLQQYREQRMQEMHAKLPFG 58
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 36.4 bits (84), Expect = 0.004
Identities = 22/71 (30%), Positives = 40/71 (56%), Gaps = 1/71 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE-EKQKDKKEEEVEEEEEEQKNKKKE 142
E+ K+ EE++++ EE K K E K + + E E+ K++ KK+ E E + + KK+
Sbjct: 108 EQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKK 167
Query: 143 EEEEEEKEKVE 153
E ++K + E
Sbjct: 168 AAEAKKKAEAE 178
Score = 34.0 bits (78), Expect = 0.021
Identities = 16/69 (23%), Positives = 39/69 (56%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E+ E+ ++ E+ ++ E+++K+ E+ + ++ + K E E++ +++ KK+ EE
Sbjct: 95 EQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEE 154
Query: 145 EEEEKEKVE 153
E + K E
Sbjct: 155 EAKAKAAAE 163
Score = 33.3 bits (76), Expect = 0.044
Identities = 17/70 (24%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E++ E+ E++ E+ ++K+ ++ EK ++ E+ K+ EE +++ EE K K+ E
Sbjct: 73 LEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAA-KQAEEKQKQAEEAKAKQAAE 131
Query: 144 EEEEEKEKVE 153
+ + + + E
Sbjct: 132 AKAKAEAEAE 141
Score = 32.9 bits (75), Expect = 0.054
Identities = 14/66 (21%), Positives = 40/66 (60%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
++K ++E+E +K+ ++ E+ +++ E+ +QK+ ++ E+ +Q + ++ EE+
Sbjct: 58 QKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEK 117
Query: 148 EKEKVE 153
+K+ E
Sbjct: 118 QKQAEE 123
Score = 32.1 bits (73), Expect = 0.12
Identities = 13/67 (19%), Positives = 42/67 (62%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+++ ++E+E +++ +++ E+ EK+R ++ +++ + + E ++ EQ K+ EE+
Sbjct: 58 QKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEK 117
Query: 145 EEEEKEK 151
+++ +E
Sbjct: 118 QKQAEEA 124
Score = 31.3 bits (71), Expect = 0.17
Identities = 21/78 (26%), Positives = 41/78 (52%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
A + E + EE K K+ E + + E E ++ +++ +K+ EEE K K E + + E
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170
Query: 134 EEQKNKKKEEEEEEEKEK 151
++K + + + + E K K
Sbjct: 171 AKKKAEAEAKAKAEAKAK 188
Score = 30.6 bits (69), Expect = 0.35
Identities = 18/72 (25%), Positives = 37/72 (51%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E E E+K KEE +++ EEE + E K++ E +++ + + K + + + + + K
Sbjct: 137 EAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKA 196
Query: 142 EEEEEEEKEKVE 153
+ E + K E
Sbjct: 197 KAEAAKAKAAAE 208
Score = 28.6 bits (64), Expect = 1.5
Identities = 11/85 (12%), Positives = 44/85 (51%)
Query: 67 ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
+ P+ E ++ + ++ +++++ +++E++++ +++ EE +K +
Sbjct: 27 VKPEPGGGGEIIQAVLVDPGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAA 86
Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEK 151
E+ ++E EQ+ ++ ++ E+
Sbjct: 87 EQARQKELEQRAAAEKAAKQAEQAA 111
Score = 26.7 bits (59), Expect = 6.7
Identities = 16/72 (22%), Positives = 32/72 (44%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E E + K E +++ E K+K E E K + + + + + ++ K + E + + + K
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAK 212
Query: 142 EEEEEEEKEKVE 153
E E E
Sbjct: 213 AEAEAAAAAAAE 224
Score = 26.3 bits (58), Expect = 9.5
Identities = 17/71 (23%), Positives = 33/71 (46%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+EE K++ EEE + + E + + ++K E E + + + K + EE + + K +
Sbjct: 145 KEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204
Query: 144 EEEEEKEKVEM 154
E K E
Sbjct: 205 AAAEAAAKAEA 215
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 36.6 bits (85), Expect = 0.004
Identities = 26/73 (35%), Positives = 39/73 (53%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E +E EE EE +E+ E ++E ++RE EE E+ + EE EE EE+ +E
Sbjct: 315 ENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEEL 374
Query: 144 EEEEEKEKVEMYR 156
EE E + E+
Sbjct: 375 EELFEALREELAE 387
Score = 34.7 bits (80), Expect = 0.016
Identities = 22/70 (31%), Positives = 37/70 (52%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E + +E EE EE +++ + +E+ E++ EE ++ E E +EE E+K E
Sbjct: 313 ELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLE 372
Query: 144 EEEEEKEKVE 153
E EE E +
Sbjct: 373 ELEELFEALR 382
Score = 34.3 bits (79), Expect = 0.021
Identities = 29/99 (29%), Positives = 42/99 (42%)
Query: 59 RANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
L L + E L+ E E +E E+ E E+ K+E EEK EE E
Sbjct: 316 NELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELE 375
Query: 119 EKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
E + +EE E E E + + + EE + E E +E
Sbjct: 376 ELFEALREELAELEAELAEIRNELEELKREIESLEERLE 414
Score = 33.1 bits (76), Expect = 0.060
Identities = 28/81 (34%), Positives = 42/81 (51%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
+E E E EE KEK E +EE EE+E +E ++ + EE +++ ++K + EE
Sbjct: 314 LENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEE 373
Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
EE +EE E E E E
Sbjct: 374 LEELFEALREELAELEAELAE 394
Score = 33.1 bits (76), Expect = 0.064
Identities = 21/72 (29%), Positives = 40/72 (55%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
E EE + E EE EE EE +E+ + +++ E+ E E +++ EE E EE ++
Sbjct: 266 EKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERL 325
Query: 140 KKEEEEEEEKEK 151
++ +E+ E ++
Sbjct: 326 EELKEKIEALKE 337
Score = 32.4 bits (74), Expect = 0.096
Identities = 26/86 (30%), Positives = 43/86 (50%), Gaps = 4/86 (4%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE----VEE 131
E LEN E EE+ +E +E+ E ++E +E E E ++ E ++ K+E E
Sbjct: 312 EELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALL 371
Query: 132 EEEEQKNKKKEEEEEEEKEKVEMYRN 157
EE E+ + EE E + ++ RN
Sbjct: 372 EELEELFEALREELAELEAELAEIRN 397
Score = 31.6 bits (72), Expect = 0.19
Identities = 22/84 (26%), Positives = 35/84 (41%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
E L E EE + EE E E E E + E K E E ++ +
Sbjct: 359 AKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSE 418
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
E+ +++ K+ E E EE + ++E
Sbjct: 419 RLEDLKEELKELEAELEELQTELE 442
Score = 31.6 bits (72), Expect = 0.20
Identities = 22/68 (32%), Positives = 37/68 (54%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE E E E EE + + E+ E++ E+ +E+ E K++ EE EE ++ + E
Sbjct: 298 EELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELE 357
Query: 144 EEEEEKEK 151
E +EE E+
Sbjct: 358 EAKEELEE 365
Score = 31.6 bits (72), Expect = 0.21
Identities = 18/82 (21%), Positives = 35/82 (42%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ + EE E++ EE EE E +E + E + E + ++ + E+
Sbjct: 347 EELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREI 406
Query: 130 EEEEEEQKNKKKEEEEEEEKEK 151
E EE + + E+ +E+ K
Sbjct: 407 ESLEERLERLSERLEDLKEELK 428
Score = 31.2 bits (71), Expect = 0.26
Identities = 23/84 (27%), Positives = 40/84 (47%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
Q E E E E E EE E E E+ E+ ++ ++K E +++ +++E +
Sbjct: 287 QEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLL 346
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
EE E+ ++ +EE EEK
Sbjct: 347 EELEQLLAELEEAKEELEEKLSAL 370
Score = 31.2 bits (71), Expect = 0.27
Identities = 30/80 (37%), Positives = 47/80 (58%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
E E E+E +E +EE EE E EKE+ EDE K+ E+++EE E++ + E E+ E +
Sbjct: 845 DELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELK 904
Query: 134 EEQKNKKKEEEEEEEKEKVE 153
EE + ++ EE E K +
Sbjct: 905 EEIEKLRERLEELEAKLERL 924
Score = 30.5 bits (69), Expect = 0.39
Identities = 21/93 (22%), Positives = 41/93 (44%)
Query: 61 NARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEK 120
L + L E +EE E++ EE E+ + E+ E + E E
Sbjct: 336 KEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEI 395
Query: 121 QKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ + +E + E E E++ ++ E E+ KE+++
Sbjct: 396 RNELEELKREIESLEERLERLSERLEDLKEELK 428
Score = 30.5 bits (69), Expect = 0.41
Identities = 31/73 (42%), Positives = 44/73 (60%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E LE + E E++ KE EEE+EE E+E E E + E KEE E+ ++ +E E + E E
Sbjct: 866 EELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLE 925
Query: 136 QKNKKKEEEEEEE 148
+ + EEE EEE
Sbjct: 926 VELPELEEELEEE 938
Score = 30.5 bits (69), Expect = 0.46
Identities = 31/81 (38%), Positives = 45/81 (55%), Gaps = 1/81 (1%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDE-EKKREKKEEEEEKQKDKKEEEVEEE 132
A+ E + + +E EEE EE EEE E+ ++ E+ E+ E EE KEE E E
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELE 785
Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
E+ Q +++ EE EEE E+ E
Sbjct: 786 EKRQALQEELEELEEELEEAE 806
Score = 30.5 bits (69), Expect = 0.49
Identities = 29/104 (27%), Positives = 46/104 (44%)
Query: 50 GPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK 109
AQ + L + Q E L++ + E E EE + E+ + + EE
Sbjct: 662 SLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEEL 721
Query: 110 KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
KRE EEE ++ + E EEE E+ ++ EE +E +E E
Sbjct: 722 KRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEE 765
Score = 30.1 bits (68), Expect = 0.55
Identities = 29/84 (34%), Positives = 41/84 (48%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
LE + E EE E E+EE E+E ++ ++E EE + E +E E E + K+E E E E+
Sbjct: 858 LEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEEL 917
Query: 138 NKKKEEEEEEEKEKVEMYRNPVPM 161
K E E E E E
Sbjct: 918 EAKLERLEVELPELEEELEEEYED 941
Score = 30.1 bits (68), Expect = 0.56
Identities = 26/74 (35%), Positives = 49/74 (66%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E E E +E +E+ EE E+E EE +++ +E + +++ E+E K+ ++++EE+EEE E
Sbjct: 836 EIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRE 895
Query: 136 QKNKKKEEEEEEEK 149
+++ E +EE EK
Sbjct: 896 LESELAELKEEIEK 909
Score = 30.1 bits (68), Expect = 0.65
Identities = 25/77 (32%), Positives = 40/77 (51%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
LE E + EE ++E EEE E+ + EE + E +E EEE ++ ++ E EEE
Sbjct: 707 LRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEE 766
Query: 134 EEQKNKKKEEEEEEEKE 150
E + + +EE +E
Sbjct: 767 LESLEEALAKLKEEIEE 783
Score = 29.7 bits (67), Expect = 0.88
Identities = 31/72 (43%), Positives = 47/72 (65%), Gaps = 1/72 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE-EEEEEQKNKKKE 142
E EE+ +E EE+ +E E+E +E E++ E KEE EE + +K+E E E +E EE+K + +E
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEE 891
Query: 143 EEEEEEKEKVEM 154
E E E E E+
Sbjct: 892 ELRELESELAEL 903
Score = 29.3 bits (66), Expect = 0.94
Identities = 25/81 (30%), Positives = 38/81 (46%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ L E EE ++E E EE E+ + E+ K E KE E E ++ + E E
Sbjct: 384 ELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEE 443
Query: 130 EEEEEEQKNKKKEEEEEEEKE 150
EE E+ ++ EE + KE
Sbjct: 444 LNEELEELEEQLEELRDRLKE 464
Score = 29.3 bits (66), Expect = 0.98
Identities = 29/81 (35%), Positives = 44/81 (54%), Gaps = 3/81 (3%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD---KKEEEVEEEEEE 135
E E EEE +E +E EE EEE E E+ K +++ EE E+++ ++ EE+EEE EE
Sbjct: 745 ELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEE 804
Query: 136 QKNKKKEEEEEEEKEKVEMYR 156
+ + E E E + R
Sbjct: 805 AERRLDALERELESLEQRRER 825
Score = 29.3 bits (66), Expect = 1.0
Identities = 22/71 (30%), Positives = 41/71 (57%), Gaps = 3/71 (4%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE---EEKQKDKKEEEVEEEEEEQKNKK 140
EE ++E EE E+E EE K + E+ ++ E+ +EE +++ ++ E E+ E + +
Sbjct: 256 EELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELE 315
Query: 141 KEEEEEEEKEK 151
E EE EE+ +
Sbjct: 316 NELEELEERLE 326
Score = 28.5 bits (64), Expect = 1.8
Identities = 27/70 (38%), Positives = 42/70 (60%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
+ +E KE EE EEE +E+ E+ +++ E+ E+E E+ K + EE EE EE Q+ +
Sbjct: 233 KLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLE 292
Query: 142 EEEEEEEKEK 151
+EE EE E
Sbjct: 293 LKEEIEELEG 302
Score = 28.5 bits (64), Expect = 1.9
Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 2/87 (2%)
Query: 70 QAISASETLENNEWEEEEKEKEEE--EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
Q L E E E+ + E EEE EE +E+ E+ +++ E+ EEE E ++ +
Sbjct: 717 QLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAK 776
Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVEM 154
EE EE ++ ++ +EE EE E+
Sbjct: 777 LKEEIEELEEKRQALQEELEELEEELE 803
Score = 28.1 bits (63), Expect = 2.6
Identities = 22/76 (28%), Positives = 40/76 (52%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E E +EE E +EE EE E E + ++ E + EE E++ ++ +E++E +EE +
Sbjct: 282 ELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEE 341
Query: 139 KKKEEEEEEEKEKVEM 154
++ EE E+
Sbjct: 342 RETLLEELEQLLAELE 357
Score = 28.1 bits (63), Expect = 3.0
Identities = 23/78 (29%), Positives = 46/78 (58%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E+LE + +E+ +E EE+ + ++E +E EE+ E + + +++ + E E E
Sbjct: 768 ESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLE 827
Query: 136 QKNKKKEEEEEEEKEKVE 153
Q+ ++ EEE EE +EK++
Sbjct: 828 QEIEELEEEIEELEEKLD 845
Score = 27.8 bits (62), Expect = 3.3
Identities = 28/71 (39%), Positives = 42/71 (59%), Gaps = 1/71 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDE-EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
E E E E E+ +E+ E E E+ E+ EE EEK + +EE E E+E ++ K++
Sbjct: 806 ERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEEL 865
Query: 143 EEEEEEKEKVE 153
EE E EKE++E
Sbjct: 866 EELEAEKEELE 876
Score = 27.4 bits (61), Expect = 5.3
Identities = 22/84 (26%), Positives = 46/84 (54%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
E E E + + E+ E E+E +E EE+ E +E+ +E +++ +E E E EE
Sbjct: 801 ELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEE 860
Query: 134 EEQKNKKKEEEEEEEKEKVEMYRN 157
+++ ++ E E+EE +++++
Sbjct: 861 LKEELEELEAEKEELEDELKELEE 884
Score = 26.6 bits (59), Expect = 7.4
Identities = 27/82 (32%), Positives = 48/82 (58%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ A E E EE+ + +EE EE EEE E+ E E++ E E+++++ E+E+
Sbjct: 771 EEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830
Query: 130 EEEEEEQKNKKKEEEEEEEKEK 151
EE EEE + +++ +E EE+ +
Sbjct: 831 EELEEEIEELEEKLDELEEELE 852
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 36.5 bits (85), Expect = 0.004
Identities = 14/39 (35%), Positives = 29/39 (74%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
++EE ++E+EE+E +E+++ +K+ EKK++E EK +
Sbjct: 549 DKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLE 587
Score = 35.8 bits (83), Expect = 0.008
Identities = 20/57 (35%), Positives = 35/57 (61%), Gaps = 8/57 (14%)
Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV---EMYRN 157
D+ EE +REK+E+E K++ + + ++EE KKK+E E+ EK K+ E ++
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEE-----KKKKELEKLEKAKIPPAEFFKR 599
Score = 34.6 bits (80), Expect = 0.017
Identities = 13/38 (34%), Positives = 26/38 (68%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
E + E++EKE +E++ K K ++E+KK+E ++ E+
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 32.7 bits (75), Expect = 0.073
Identities = 15/41 (36%), Positives = 32/41 (78%)
Query: 94 EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
+++EE ++EK+E E K +K+ + +KQ++KK++E+E+ E+
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 31.5 bits (72), Expect = 0.19
Identities = 17/47 (36%), Positives = 34/47 (72%), Gaps = 1/47 (2%)
Query: 113 KKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE-EEEEKEKVEMYRNP 158
K +++EE Q++K+E+E +E++ + KK+EE +++E EK+E + P
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIP 592
Score = 31.5 bits (72), Expect = 0.20
Identities = 13/43 (30%), Positives = 30/43 (69%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
K +++EE + EKE+ E ++++ ++ +++++K KKE E E+
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 30.4 bits (69), Expect = 0.51
Identities = 14/41 (34%), Positives = 29/41 (70%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
+KEE + E+EE++ E + ++ KK+EE++K++ +K E+
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 35.8 bits (83), Expect = 0.005
Identities = 13/39 (33%), Positives = 22/39 (56%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEK 120
EW++ + +E EE E E+E + EKKE +++K
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 33.5 bits (77), Expect = 0.031
Identities = 13/32 (40%), Positives = 15/32 (46%)
Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
KD K E EE + +EEE E EK E
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKE 191
Score = 31.9 bits (73), Expect = 0.082
Identities = 14/37 (37%), Positives = 19/37 (51%)
Query: 105 EDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
+D + E EE E KEEE E E+++ KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 31.1 bits (71), Expect = 0.15
Identities = 13/39 (33%), Positives = 19/39 (48%)
Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E + + + EE E K+EE E E++E K KK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 30.8 bits (70), Expect = 0.22
Identities = 9/35 (25%), Positives = 18/35 (51%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
E ++ + +E +E + +EEE + +KKE
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKES 192
Score = 30.4 bits (69), Expect = 0.33
Identities = 13/33 (39%), Positives = 15/33 (45%)
Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
E K EE EE E K+EE E EK+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKK 190
Score = 29.6 bits (67), Expect = 0.59
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
+ K E+ EE + +EE E E++++KKK+
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 29.6 bits (67), Expect = 0.61
Identities = 8/39 (20%), Positives = 23/39 (58%)
Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
E K+ + ++ +E E +E+++ + E++E ++K+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 29.2 bits (66), Expect = 0.76
Identities = 9/36 (25%), Positives = 18/36 (50%)
Query: 94 EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
E ++ + E+ EE + +EEE + +K+E
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESK 193
Score = 29.2 bits (66), Expect = 0.81
Identities = 12/36 (33%), Positives = 17/36 (47%)
Query: 117 EEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
E + K +E EE E K++E E E+KE
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESK 193
Score = 28.1 bits (63), Expect = 1.8
Identities = 10/38 (26%), Positives = 20/38 (52%), Gaps = 1/38 (2%)
Query: 115 EEEEEKQKDK-KEEEVEEEEEEQKNKKKEEEEEEEKEK 151
E ++ K ++ EE E ++ + E E++E K+K
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 35.9 bits (83), Expect = 0.005
Identities = 15/54 (27%), Positives = 28/54 (51%)
Query: 65 TVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
++ + E+EE E + +E+EE+E +EDE + E++E+EE
Sbjct: 17 VLLTFGNGKGPFASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEE 70
Score = 30.5 bits (69), Expect = 0.30
Identities = 14/40 (35%), Positives = 26/40 (65%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK 109
Q ++ E E++ +E+E+++ EE+E E E++EDEE
Sbjct: 32 QDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEG 71
Score = 29.7 bits (67), Expect = 0.54
Identities = 15/45 (33%), Positives = 27/45 (60%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
SA + E+ E E++ +++EE+E EE E + EE++ E+ E
Sbjct: 28 FASAQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
Score = 29.0 bits (65), Expect = 1.2
Identities = 17/37 (45%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 116 EEEEKQKDKKEEEVEEE--EEEQKNKKKEEEEEEEKE 150
E+EE + D +E+ E+E EE +N+ EEEE+EE E
Sbjct: 36 EDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
Score = 28.2 bits (63), Expect = 1.8
Identities = 12/34 (35%), Positives = 23/34 (67%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
E++E E++ +E+EE E +E++ +EEE+E
Sbjct: 36 EDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDE 69
Score = 27.4 bits (61), Expect = 3.6
Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 7/49 (14%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
+ E+EE E++ E +EDE EE + + EEE +EE E
Sbjct: 31 AQDLTEDEEAEDDVVDEDEEDE-------AVVEEDENELTEEEEDEEGE 72
Score = 27.0 bits (60), Expect = 5.3
Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 1/53 (1%)
Query: 58 IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKK 110
+ L + A + + + + EE+E EE+E E EE+E +E E K
Sbjct: 24 GKGPFASAQDLTEDEEAEDDVVDED-EEDEAVVEEDENELTEEEEDEEGEVKA 75
Score = 26.3 bits (58), Expect = 8.7
Identities = 12/41 (29%), Positives = 24/41 (58%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
++ E EE E++ +E E+DE ++ E + EEE+ ++
Sbjct: 31 AQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEG 71
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 35.9 bits (83), Expect = 0.005
Identities = 19/70 (27%), Positives = 35/70 (50%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
+ L + + EE ++ E + E+ E E D E++ ++ EEEE D E+ + EE
Sbjct: 201 DMLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEE 260
Query: 136 QKNKKKEEEE 145
++ + E E
Sbjct: 261 MESGEMEAAE 270
Score = 30.2 bits (68), Expect = 0.40
Identities = 13/62 (20%), Positives = 27/62 (43%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
+ EE ++ + D E ++ + +E++ D+ EEE + EE E ++
Sbjct: 207 DMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEM 266
Query: 153 EM 154
E
Sbjct: 267 EA 268
Score = 29.8 bits (67), Expect = 0.64
Identities = 13/62 (20%), Positives = 31/62 (50%), Gaps = 1/62 (1%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEE-EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
E+ ++ + E+E+ KE+E++++ EEE+ D + EE + + + E +
Sbjct: 215 EPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASAD 274
Query: 133 EE 134
+
Sbjct: 275 DT 276
Score = 27.1 bits (60), Expect = 4.6
Identities = 12/66 (18%), Positives = 29/66 (43%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
A S E++ E+E+ ++ EEEE + ++ + E + E E + ++
Sbjct: 217 ESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDT 276
Query: 129 VEEEEE 134
+ ++
Sbjct: 277 PDSDDA 282
>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
chaperones [Posttranslational modification, protein
turnover, chaperones].
Length = 786
Score = 36.1 bits (84), Expect = 0.005
Identities = 33/129 (25%), Positives = 54/129 (41%), Gaps = 12/129 (9%)
Query: 36 NVSLTDDAEFQCQVGPAQY-------HKPIR----ANARLTVILPQAISASETLENNEWE 84
V +TD+A +Y K I A AR+ + + + E
Sbjct: 354 GVRITDEALVAAVTLSDRYIPDRFLPDKAIDLLDEAGARVRLEIDKPEELDELERELAQL 413
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE-EKQKDKKEEEVEEEEEEQKNKKKEE 143
E EKE E E++E+E+K DE + K + E E E + + E+++ E +
Sbjct: 414 EIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEAEVDEDDIAEVVARWTGIPVAK 473
Query: 144 EEEEEKEKV 152
E+EKEK+
Sbjct: 474 LLEDEKEKL 482
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 35.7 bits (82), Expect = 0.006
Identities = 19/68 (27%), Positives = 43/68 (63%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E++ K+KEE+ EE + K+ E E K+ +KE + +++ K+ EE E++ + ++ +++E+
Sbjct: 78 EQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQ 137
Query: 144 EEEEEKEK 151
+ E+
Sbjct: 138 ARKAAAEQ 145
Score = 33.8 bits (77), Expect = 0.029
Identities = 21/70 (30%), Positives = 41/70 (58%), Gaps = 1/70 (1%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK-NKKKEE 143
EE K K+ E+E ++ EK+ + ++++K+ EE EKQ ++++ EE+ + +KK+
Sbjct: 90 EELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKA 149
Query: 144 EEEEEKEKVE 153
E + K E
Sbjct: 150 EAAKAKAAAE 159
Score = 31.8 bits (72), Expect = 0.13
Identities = 14/76 (18%), Positives = 45/76 (59%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
+E L+ + E+E+ K+ E+E + ++++ + EE +++ + E++++++ ++ E+++
Sbjct: 88 VAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKK 147
Query: 134 EEQKNKKKEEEEEEEK 149
+ + K K E +
Sbjct: 148 KAEAAKAKAAAEAAKL 163
Score = 30.3 bits (68), Expect = 0.47
Identities = 16/70 (22%), Positives = 35/70 (50%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ E+K++EE+ + E++K + K + E + K + +++ EE + + K +
Sbjct: 128 QLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKA 187
Query: 144 EEEEEKEKVE 153
E K+K E
Sbjct: 188 EAAAAKKKAE 197
Score = 28.4 bits (63), Expect = 1.9
Identities = 13/70 (18%), Positives = 43/70 (61%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
++ + ++ ++ E++ K+K+E ++ + K+ E+++ + E+E + +E+QK ++
Sbjct: 63 QYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEE 122
Query: 142 EEEEEEEKEK 151
E++ + ++K
Sbjct: 123 AEKQAQLEQK 132
Score = 28.4 bits (63), Expect = 2.2
Identities = 22/81 (27%), Positives = 34/81 (41%), Gaps = 1/81 (1%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK-KREKKEEEEEKQKDKKEEE 128
+A A+E + K+K E E + EK K E E K K EKK E ++K E++
Sbjct: 175 EAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKK 234
Query: 129 VEEEEEEQKNKKKEEEEEEEK 149
+ + + E K
Sbjct: 235 KAAAKAKADKAAAAAKAAERK 255
Score = 28.0 bits (62), Expect = 2.7
Identities = 16/78 (20%), Positives = 32/78 (41%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E + E K K E + + + + + ++ K EE + + + + + E E
Sbjct: 141 AAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEA 200
Query: 136 QKNKKKEEEEEEEKEKVE 153
+ +K + E E K K E
Sbjct: 201 KAAAEKAKAEAEAKAKAE 218
Score = 26.8 bits (59), Expect = 6.5
Identities = 21/105 (20%), Positives = 40/105 (38%), Gaps = 3/105 (2%)
Query: 52 AQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
K AR + + + + + K E +++ EE + E+ + K
Sbjct: 128 QLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKA 187
Query: 112 EKKEEEEEKQKD---KKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
E +++ + + E+ E E + K +KK E EEK E
Sbjct: 188 EAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAE 232
Score = 26.5 bits (58), Expect = 9.3
Identities = 17/69 (24%), Positives = 38/69 (55%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E++++ +E E++ + E+K+++E K +++++ E K K E + + + KKK
Sbjct: 113 AQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKK 172
Query: 142 EEEEEEEKE 150
EE + E
Sbjct: 173 AEEAAKAAE 181
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 35.8 bits (82), Expect = 0.006
Identities = 20/60 (33%), Positives = 32/60 (53%)
Query: 94 EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ EEE+ EK + +K +K +E+ E Q + E EE E + + E EEE + EK +
Sbjct: 126 SDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDD 185
Score = 27.7 bits (61), Expect = 3.1
Identities = 15/62 (24%), Positives = 32/62 (51%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ EE++ E++ + + K+ DED E + + E + ++ + E EEE +K +
Sbjct: 127 DVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDDG 186
Query: 144 EE 145
E+
Sbjct: 187 ED 188
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 35.4 bits (82), Expect = 0.007
Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 12/63 (19%)
Query: 72 ISASET-----------LENNEW-EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
++AS T L+ W E + K++EE EE KE KK ++ E
Sbjct: 133 LAASNTILLFVLVGVLVLQAGLWYAERKDAKQKEEFAAEERKEALAAAAKKSATPQKVET 192
Query: 120 KQK 122
K+K
Sbjct: 193 KKK 195
Score = 30.4 bits (69), Expect = 0.27
Identities = 10/37 (27%), Positives = 18/37 (48%)
Query: 115 EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
E ++ KQK++ E +E KK ++ E +K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKK 194
Score = 30.0 bits (68), Expect = 0.43
Identities = 9/36 (25%), Positives = 15/36 (41%)
Query: 117 EEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
E + K K+E EE +E K+ ++ E
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETK 193
Score = 29.3 bits (66), Expect = 0.84
Identities = 8/37 (21%), Positives = 18/37 (48%)
Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
KD ++++ +E +E K+ ++ E +K K
Sbjct: 160 KDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 28.1 bits (63), Expect = 1.6
Identities = 9/39 (23%), Positives = 19/39 (48%)
Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E+ + ++K+ EE +E ++ ++ E K KK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 27.3 bits (61), Expect = 3.3
Identities = 8/39 (20%), Positives = 20/39 (51%)
Query: 94 EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
E ++ ++KE+ EE+K ++ +K E +++
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 27.0 bits (60), Expect = 4.6
Identities = 8/37 (21%), Positives = 18/37 (48%)
Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
+ K++E+ EE K+ + ++ + KKK+
Sbjct: 160 KDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 36.1 bits (83), Expect = 0.007
Identities = 18/66 (27%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK-KEEEEEEE 148
+E+++E++++K + E +EK +KKEE D +E +E + + K+ +EE
Sbjct: 17 MQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKEES 76
Query: 149 KEKVEM 154
K+ +E+
Sbjct: 77 KQLLEV 82
Score = 33.0 bits (75), Expect = 0.056
Identities = 15/65 (23%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
+E + +EKD++++ ++ + E +E K DKKEE V + + ++ +E +
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKE-KVLDKKEEVVTDNVDSPVKEQSSQENLK 63
Query: 148 EKEKV 152
++V
Sbjct: 64 IADEV 68
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 35.5 bits (82), Expect = 0.007
Identities = 13/69 (18%), Positives = 31/69 (44%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ E + EE+ + E + E K ++K + E+K K K + + + + K K + +
Sbjct: 57 QPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPK 116
Query: 144 EEEEEKEKV 152
++ +
Sbjct: 117 PKKPPSKTA 125
Score = 28.2 bits (63), Expect = 1.8
Identities = 11/62 (17%), Positives = 29/62 (46%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ + + E + KEK + E+K ++ K + + K K K + + + + ++ +K +
Sbjct: 68 KPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAK 127
Query: 144 EE 145
Sbjct: 128 AP 129
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 35.3 bits (82), Expect = 0.007
Identities = 16/66 (24%), Positives = 32/66 (48%), Gaps = 3/66 (4%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK---EEEE 145
K +EEEEE + +E E++ + D++ +E E++ ++ ++K E +
Sbjct: 15 PKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEMKA 74
Query: 146 EEEKEK 151
EK K
Sbjct: 75 LAEKSK 80
Score = 31.5 bits (72), Expect = 0.13
Identities = 17/52 (32%), Positives = 27/52 (51%), Gaps = 2/52 (3%)
Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK--VEMYRN 157
K KEEEEE + +E E E++ + +EE +EE++ +E YR
Sbjct: 15 PKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRR 66
Score = 26.8 bits (60), Expect = 4.8
Identities = 16/69 (23%), Positives = 32/69 (46%), Gaps = 1/69 (1%)
Query: 66 VILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
++ P+ S E E E +E E E++ E DE+ +++ + + EE ++K
Sbjct: 12 ILPPKPPSPKEEEEEAL-ELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLA 70
Query: 126 EEEVEEEEE 134
E + E+
Sbjct: 71 EMKALAEKS 79
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 35.8 bits (83), Expect = 0.007
Identities = 16/78 (20%), Positives = 38/78 (48%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
+ EEE + +++E + E + EE++ K+E ++E E ++ ++ + E +
Sbjct: 813 ESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKN 872
Query: 150 EKVEMYRNPVPMYRNPVP 167
E VE +P + + +
Sbjct: 873 EFVEFKNDPKKLNKLIIA 890
Score = 35.8 bits (83), Expect = 0.008
Identities = 18/75 (24%), Positives = 32/75 (42%), Gaps = 1/75 (1%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
L + EEE + E + E E + D + K ++ K +E ++E+
Sbjct: 850 IKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEK 909
Query: 134 EEQKNKKK-EEEEEE 147
+K KK E+ +EE
Sbjct: 910 TTKKKKKDLEKTDEE 924
Score = 35.0 bits (81), Expect = 0.014
Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 7/73 (9%)
Query: 84 EEEEKEKEEEEEEEE-------EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
E E ++ EEE + E + D + K ++ K +E ++E+ +
Sbjct: 853 ELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTK 912
Query: 137 KNKKKEEEEEEEK 149
K KK E+ +EE
Sbjct: 913 KKKKDLEKTDEEA 925
Score = 31.9 bits (73), Expect = 0.13
Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 1/73 (1%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKE-KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
+ EE EE E EE+K K E + + E +E EEE Q + + E +
Sbjct: 822 LRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDP 881
Query: 138 NKKKEEEEEEEKE 150
K + ++
Sbjct: 882 KKLNKLIIAKDVL 894
Score = 29.6 bits (67), Expect = 0.94
Identities = 18/79 (22%), Positives = 38/79 (48%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
+A + L E + + + +E ++ + E++R K + ++ + EE+E
Sbjct: 780 NAEKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELI 839
Query: 133 EEEQKNKKKEEEEEEEKEK 151
EEQKN K+E + E + +
Sbjct: 840 NEEQKNLKQEIKLELSEIQ 858
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 34.7 bits (80), Expect = 0.008
Identities = 21/70 (30%), Positives = 43/70 (61%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
N + EE +++++E +E EEKE + +++K+EKKE++ ++ KK+ + +K K
Sbjct: 89 NLFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATK 148
Query: 141 KEEEEEEEKE 150
K ++EE +
Sbjct: 149 KTTTKKEEGK 158
Score = 29.3 bits (66), Expect = 0.57
Identities = 16/54 (29%), Positives = 30/54 (55%)
Query: 99 EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
E+ + E +E +E +E+E K K +K+E+ E++ E+ KKK + + K
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKK 145
Score = 27.4 bits (61), Expect = 2.5
Identities = 18/65 (27%), Positives = 40/65 (61%), Gaps = 2/65 (3%)
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV--EEEEEEQKNKKKEEEEEEE 148
EE E+E++E K+ +E++ + K+E++EK++ K E++ ++ + KN K+ ++
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151
Query: 149 KEKVE 153
+K E
Sbjct: 152 TKKEE 156
Score = 27.4 bits (61), Expect = 2.8
Identities = 13/61 (21%), Positives = 34/61 (55%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E E +E +EKE + ++E++++K++ +K KK+ + + K+ + ++++
Sbjct: 98 EQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEG 157
Query: 139 K 139
K
Sbjct: 158 K 158
Score = 26.6 bits (59), Expect = 5.1
Identities = 18/69 (26%), Positives = 37/69 (53%)
Query: 67 ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
+ P+ + + E EE+E K ++E++E++EK+ E KK+ K + +K K+
Sbjct: 90 LFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKK 149
Query: 127 EEVEEEEEE 135
++EE +
Sbjct: 150 TTTKKEEGK 158
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 35.5 bits (82), Expect = 0.008
Identities = 26/108 (24%), Positives = 51/108 (47%), Gaps = 3/108 (2%)
Query: 52 AQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
+ A L +A E E +E+ ++ +E E+E+ + EE++
Sbjct: 27 KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERL 86
Query: 112 EKKEEEEEKQKDK---KEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
+KEE+ + + +K E ++EE E+ ++ E EE E++ E+YR
Sbjct: 87 VQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYR 134
Score = 29.3 bits (66), Expect = 1.0
Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 2/75 (2%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK--K 141
EE+ + E+ + E + E+ E RE + EE EKQ D + V EQ K K
Sbjct: 90 EEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLK 149
Query: 142 EEEEEEEKEKVEMYR 156
+ E E+EK + +
Sbjct: 150 LLDAELEEEKAQRVK 164
Score = 28.1 bits (63), Expect = 2.7
Identities = 27/100 (27%), Positives = 41/100 (41%)
Query: 52 AQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
+ + + L + LEN E E+ E E EE EK+ D + +
Sbjct: 77 EELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVA 136
Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
E+ K K + EEE+ Q+ KK EEE + E E+
Sbjct: 137 GLTPEQARKLLLKLLDAELEEEKAQRVKKIEEEADLEAER 176
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 35.4 bits (81), Expect = 0.009
Identities = 19/78 (24%), Positives = 38/78 (48%), Gaps = 4/78 (5%)
Query: 76 ETLENNEW---EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
L N +W E E K+ + EK+ D+ ++ + E ++ + EEEVE+
Sbjct: 334 RNLNNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEH-EEEVEDY 392
Query: 133 EEEQKNKKKEEEEEEEKE 150
E+E + K+ +++E +
Sbjct: 393 EDENDHSKRICDDDELEN 410
Score = 31.1 bits (70), Expect = 0.28
Identities = 12/65 (18%), Positives = 30/65 (46%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
L+++ E++ + E+ E + ++ +E + E+ E+ E++ K ++E E
Sbjct: 353 LDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELENHF 412
Query: 138 NKKKE 142
E
Sbjct: 413 RAADE 417
Score = 30.7 bits (69), Expect = 0.37
Identities = 17/51 (33%), Positives = 25/51 (49%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
P + L + E E + EE E EEE E +DE++ KR ++E E
Sbjct: 359 PLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELE 409
Score = 28.4 bits (63), Expect = 2.4
Identities = 14/55 (25%), Positives = 27/55 (49%)
Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMY 155
EK +EE +R+ + +K+ + E+ + + + EE E +E+VE Y
Sbjct: 338 NEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDY 392
Score = 27.6 bits (61), Expect = 4.0
Identities = 16/47 (34%), Positives = 20/47 (42%), Gaps = 7/47 (14%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEK-------DEDEEKKREKKE 115
E +E EE E EEE E+ E+E + DE E R E
Sbjct: 371 EKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELENHFRAADE 417
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 35.4 bits (81), Expect = 0.010
Identities = 15/52 (28%), Positives = 40/52 (76%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
++E++++++++EE++EE+E++E+E K + ++EE+E +D E+ E ++ +
Sbjct: 156 DDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTD 207
Score = 33.9 bits (77), Expect = 0.032
Identities = 18/65 (27%), Positives = 40/65 (61%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E+ ++++E++++D+D+E+ E++EEEEE+ K +E+ E+E E +K E
Sbjct: 143 EDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSE 202
Query: 144 EEEEE 148
++ +
Sbjct: 203 VDKTD 207
Score = 32.3 bits (73), Expect = 0.13
Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 6/69 (8%)
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
EE+ + D+D+E + E ++EE+ +EE EEEEEE K E+EE+E E
Sbjct: 141 AEEDMSPRDNFVIDDDDEDEDEDDDDEED------DEEEEEEEEEIKGFDDEDEEDEGGE 194
Query: 151 KVEMYRNPV 159
++ V
Sbjct: 195 DFTYEKSEV 203
Score = 31.9 bits (72), Expect = 0.15
Identities = 16/71 (22%), Positives = 42/71 (59%), Gaps = 3/71 (4%)
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV---EEEEEEQKNKKKEEEEEE 147
EE+ + D+DE++ + +EE++++++++EEE+ ++E+EE + + E+
Sbjct: 142 EEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKS 201
Query: 148 EKEKVEMYRNP 158
E +K + ++
Sbjct: 202 EVDKTDCFKFI 212
Score = 27.3 bits (60), Expect = 4.9
Identities = 13/57 (22%), Positives = 30/57 (52%)
Query: 97 EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ E + D K+R + EE + +++++E++ +EE++EE+E+ E
Sbjct: 121 KAEIGDLDMIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEE 177
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 35.5 bits (82), Expect = 0.010
Identities = 24/85 (28%), Positives = 42/85 (49%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
+E L + EE +EKE EEEE E E KK E+ E E++ + + E+EE
Sbjct: 389 PLAEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEEL 448
Query: 133 EEEQKNKKKEEEEEEEKEKVEMYRN 157
+ E + + E E + + ++ ++
Sbjct: 449 KREIEKLESELERFRREVRDKVRKD 473
Score = 28.9 bits (65), Expect = 1.5
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 1/75 (1%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDE-DEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
ET+E E E E ++E EE + E EK + E + ++ + + ++++ ++ +E E+
Sbjct: 429 ETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEK 488
Query: 135 EQKNKKKEEEEEEEK 149
E + KKK EE E K
Sbjct: 489 ELEEKKKRVEELERK 503
Score = 26.6 bits (59), Expect = 8.7
Identities = 14/70 (20%), Positives = 42/70 (60%), Gaps = 1/70 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE ++E E+ E E E + + ++ +++++ +++ ++ E+E+EE+++ + +++
Sbjct: 446 EELKREIEKLESELERFRRE-VRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKL 504
Query: 144 EEEEEKEKVE 153
E + K+E
Sbjct: 505 AELRKMRKLE 514
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 34.7 bits (80), Expect = 0.010
Identities = 27/76 (35%), Positives = 50/76 (65%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
E ++ +++ E EEEE EE++K E++ + K+EEE E++++KK+EE E +E E++
Sbjct: 8 RAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQ 67
Query: 138 NKKKEEEEEEEKEKVE 153
+K++EE E+ K
Sbjct: 68 ARKEQEEYEKLKSSFV 83
Score = 34.3 bits (79), Expect = 0.013
Identities = 31/79 (39%), Positives = 51/79 (64%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
I A + + E + +++E EEEE EE K+ +E E +R+++EE EE+++ KKEEE +
Sbjct: 3 IGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERK 62
Query: 132 EEEEQKNKKKEEEEEEEKE 150
E EEQ K++EE E+ +
Sbjct: 63 EREEQARKEQEEYEKLKSS 81
Score = 31.6 bits (72), Expect = 0.13
Identities = 21/72 (29%), Positives = 51/72 (70%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+++ K EE++ +++E +E+E ++R+K EE+ E ++ ++EE EE E++++ ++++E
Sbjct: 4 GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63
Query: 144 EEEEEKEKVEMY 155
EE+ +++ E Y
Sbjct: 64 REEQARKEQEEY 75
Score = 31.6 bits (72), Expect = 0.14
Identities = 29/72 (40%), Positives = 48/72 (66%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E EE E+ K+ EE+ E E KE++E EE++ +KKEEEE K+++++ + +EE E+ K+
Sbjct: 24 EEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFV 83
Query: 142 EEEEEEEKEKVE 153
EEE +K +
Sbjct: 84 VEEEGTDKLSAD 95
Score = 30.1 bits (68), Expect = 0.36
Identities = 29/95 (30%), Positives = 53/95 (55%)
Query: 56 KPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
K I A R + QA E E EE +K +E+ E E +EE+E +E+ EKK+E++E
Sbjct: 1 KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60
Query: 116 EEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
+E +++ +KE+E E+ + ++E ++ +
Sbjct: 61 RKEREEQARKEQEEYEKLKSSFVVEEEGTDKLSAD 95
Score = 30.1 bits (68), Expect = 0.41
Identities = 28/76 (36%), Positives = 44/76 (57%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E + E + E + KEEEE EEE EK+K+E+E K+RE++ +E+++ +K + EEE
Sbjct: 29 EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEG 88
Query: 136 QKNKKKEEEEEEEKEK 151
+EE E E
Sbjct: 89 TDKLSADEESNELLED 104
Score = 29.3 bits (66), Expect = 0.73
Identities = 23/75 (30%), Positives = 51/75 (68%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
+ + + E+++ ++ E +E++ +E KK E+K E E K++++ EEE E+++EE++
Sbjct: 2 KIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEER 61
Query: 139 KKKEEEEEEEKEKVE 153
K++EE+ +E+E+ E
Sbjct: 62 KEREEQARKEQEEYE 76
Score = 26.2 bits (58), Expect = 7.2
Identities = 18/60 (30%), Positives = 42/60 (70%)
Query: 97 EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
+++ K E+++ +R+++E EEE+++++K+ E + E E ++ ++ EEE E++KE+ E
Sbjct: 4 GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 35.1 bits (81), Expect = 0.011
Identities = 18/74 (24%), Positives = 36/74 (48%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
EEE + ++KK +K E ++K + KK+++ E++E + + + K + +K K
Sbjct: 42 FSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKK 101
Query: 153 EMYRNPVPMYRNPV 166
+ P P V
Sbjct: 102 TKKKPPKPKPNEDV 115
Score = 33.9 bits (78), Expect = 0.029
Identities = 15/71 (21%), Positives = 35/71 (49%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E+K E +++ E+++K+K E +E K E + + K K ++ ++ + + N+ +
Sbjct: 60 EDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAF 119
Query: 146 EEEKEKVEMYR 156
+ E E
Sbjct: 120 NKIAELAEKSN 130
Score = 33.5 bits (77), Expect = 0.036
Identities = 21/68 (30%), Positives = 37/68 (54%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E EEE + +KD+ E+K E K++ E+K+K KKE++ + E E K K
Sbjct: 37 EILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTP 96
Query: 144 EEEEEKEK 151
++ ++ +K
Sbjct: 97 KKSKKTKK 104
Score = 33.5 bits (77), Expect = 0.041
Identities = 16/71 (22%), Positives = 36/71 (50%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
KE EEE + ++ K+E K E +K+ +KK+++ +E++E + + + +
Sbjct: 36 KEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKT 95
Query: 148 EKEKVEMYRNP 158
K+ + + P
Sbjct: 96 PKKSKKTKKKP 106
Score = 32.8 bits (75), Expect = 0.069
Identities = 15/69 (21%), Positives = 34/69 (49%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E N ++ EK+K++++E++E + E + K KK ++ +K+ K + + +
Sbjct: 59 KEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNA 118
Query: 135 EQKNKKKEE 143
K + E
Sbjct: 119 FNKIAELAE 127
Score = 32.4 bits (74), Expect = 0.091
Identities = 20/88 (22%), Positives = 43/88 (48%), Gaps = 1/88 (1%)
Query: 64 LTVILPQAIS-ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
++IL +S S T + ++ + +E +EE K +++K++
Sbjct: 1 FSLILCMILSFFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKE 60
Query: 123 DKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
DK E ++ E+++K KK+++E + E E
Sbjct: 61 DKNNESKKKSEKKKKKKKEKKEPKSEGE 88
Score = 31.6 bits (72), Expect = 0.17
Identities = 13/68 (19%), Positives = 32/68 (47%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ + + ++ + E K +KDKKE++ E +++ + KKK++
Sbjct: 18 KSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKK 77
Query: 144 EEEEEKEK 151
+E++E +
Sbjct: 78 KEKKEPKS 85
Score = 30.8 bits (70), Expect = 0.32
Identities = 17/86 (19%), Positives = 42/86 (48%), Gaps = 6/86 (6%)
Query: 77 TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
T E + +++++E++ + K + E+KK++KKE++E K + + + + ++ +
Sbjct: 41 TFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSK 100
Query: 137 KNK------KKEEEEEEEKEKVEMYR 156
K K K E+ + K+
Sbjct: 101 KTKKKPPKPKPNEDVDNAFNKIAELA 126
Score = 30.1 bits (68), Expect = 0.49
Identities = 13/73 (17%), Positives = 36/73 (49%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
+ + E+ + E++K+K++E++E + E E + ++ K+ +++ K K E+
Sbjct: 55 KKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114
Query: 129 VEEEEEEQKNKKK 141
V+ + +
Sbjct: 115 VDNAFNKIAELAE 127
Score = 28.5 bits (64), Expect = 1.7
Identities = 10/64 (15%), Positives = 29/64 (45%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
++ + + K+ EEE + + +++E++ KK+ E++++K
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK 76
Query: 150 EKVE 153
+K +
Sbjct: 77 KKEK 80
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 35.2 bits (82), Expect = 0.011
Identities = 16/58 (27%), Positives = 30/58 (51%)
Query: 92 EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
+ + E + E+ EK +KEEE+EK + E +++ E E+K + K + +E
Sbjct: 157 GILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214
Score = 31.7 bits (73), Expect = 0.15
Identities = 21/79 (26%), Positives = 41/79 (51%), Gaps = 8/79 (10%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ +E+ ++ E ++EEE+++ ++ + +K E E++KQ + + E + K KEE
Sbjct: 167 DSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPK-EGPVQIGKKIDKEE 225
Query: 144 -------EEEEEKEKVEMY 155
EEE + VE Y
Sbjct: 226 ITPMKEINEEERRVVVEGY 244
Score = 29.0 bits (66), Expect = 1.1
Identities = 17/74 (22%), Positives = 41/74 (55%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
++++ E E+ E ++EEE+E+ KE E +K +K+++ + K+ ++ +++
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEE 225
Query: 139 KKKEEEEEEEKEKV 152
+E EE+ +V
Sbjct: 226 ITPMKEINEEERRV 239
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 35.0 bits (80), Expect = 0.011
Identities = 18/66 (27%), Positives = 39/66 (59%), Gaps = 2/66 (3%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE--EEEQKNKKKEEEEE 146
+E+++ E +K + E + + +K +EE K KD K ++++E E++ + KE E +
Sbjct: 268 SPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQ 327
Query: 147 EEKEKV 152
+++E V
Sbjct: 328 KKREPV 333
Score = 32.3 bits (73), Expect = 0.11
Identities = 19/67 (28%), Positives = 39/67 (58%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
E +K E E+ + E K+ DE+ K ++ K + +++ E+E E++E E + K++ E+
Sbjct: 277 ENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAED 336
Query: 147 EEKEKVE 153
+K K +
Sbjct: 337 LQKTKPQ 343
Score = 31.5 bits (71), Expect = 0.17
Identities = 22/81 (27%), Positives = 42/81 (51%), Gaps = 6/81 (7%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDED------EEKKREKKEEEEEKQKDKKEEEVEEE 132
EN + E E+ + E ++ +EE K KD E K EK+ E++E + KK E V E+
Sbjct: 277 ENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAED 336
Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
++ K + + + ++ ++
Sbjct: 337 LQKTKPQVEAQPTSLNEDAID 357
Score = 30.7 bits (69), Expect = 0.30
Identities = 19/99 (19%), Positives = 50/99 (50%), Gaps = 7/99 (7%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEE-------EEEKEKDEDEEKKREKKEEEEEKQK 122
+A A + + E +K++E + + +E+K+ E+++++ EK + E +K
Sbjct: 235 KADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKND 294
Query: 123 DKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPM 161
++ + + + + K + K E+E E +++E + P+
Sbjct: 295 EEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPV 333
Score = 30.0 bits (67), Expect = 0.65
Identities = 18/68 (26%), Positives = 37/68 (54%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E +E + ++ +EE +K + + K ++K + + DK+ +EV ++++E KN K
Sbjct: 205 ERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPA 264
Query: 144 EEEEEKEK 151
+ KE
Sbjct: 265 DTSSPKED 272
Score = 29.2 bits (65), Expect = 1.1
Identities = 15/62 (24%), Positives = 30/62 (48%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
E + + K+E + E+E ++K+ + +KKRE E+ +K K + E +
Sbjct: 293 NDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSLN 352
Query: 134 EE 135
E+
Sbjct: 353 ED 354
Score = 28.0 bits (62), Expect = 2.8
Identities = 16/89 (17%), Positives = 47/89 (52%), Gaps = 5/89 (5%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE-----EKQKDK 124
+A E L+ + + ++ +++ + ++ +K++DE +K++E K + ++DK
Sbjct: 214 RAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDK 273
Query: 125 KEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ E ++ E E+ + ++ +EE + +
Sbjct: 274 QVAENQKREIEKAQIEIKKNDEEALKAKD 302
Score = 26.9 bits (59), Expect = 6.5
Identities = 18/91 (19%), Positives = 49/91 (53%), Gaps = 11/91 (12%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE---- 133
L+ E +E+ K ++ +EE ++++ + ++K + ++ +KQ+D+ ++ +E +
Sbjct: 203 LKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPK 262
Query: 134 -------EEQKNKKKEEEEEEEKEKVEMYRN 157
+E K + ++ E EK ++E+ +N
Sbjct: 263 PADTSSPKEDKQVAENQKREIEKAQIEIKKN 293
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 35.1 bits (81), Expect = 0.012
Identities = 17/71 (23%), Positives = 43/71 (60%), Gaps = 1/71 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE-EKQKDKKEEEVEEEEEEQKNKKKE 142
E + E ++++++ K+K ++ K+ KK+ ++ + + E +E E +Q NKKK+
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60
Query: 143 EEEEEEKEKVE 153
++++++K+K
Sbjct: 61 KKKKKKKKKNL 71
Score = 33.5 bits (77), Expect = 0.040
Identities = 14/71 (19%), Positives = 40/71 (56%), Gaps = 3/71 (4%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ---KNKK 140
E E E ++++++ +++ + + KK +K ++++ + E +E E +Q K KK
Sbjct: 2 MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKK 61
Query: 141 KEEEEEEEKEK 151
K+++++++
Sbjct: 62 KKKKKKKKNLG 72
Score = 30.8 bits (70), Expect = 0.28
Identities = 13/75 (17%), Positives = 41/75 (54%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EN +++++ K++ +++ ++ +KD D++ + E ++ + K+ +++++++K
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66
Query: 139 KKKEEEEEEEKEKVE 153
KKK E +
Sbjct: 67 KKKNLGEAYDLAYDL 81
Score = 27.8 bits (62), Expect = 3.5
Identities = 14/91 (15%), Positives = 36/91 (39%), Gaps = 2/91 (2%)
Query: 48 QVGPAQYHKPIRANARLTVILPQAISASETLEN--NEWEEEEKEKEEEEEEEEEEKEKDE 105
+ + K + L + IS ++ EN N ++++K+K+++++ E +
Sbjct: 20 KGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAY 79
Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
D ++ K E+E ++
Sbjct: 80 DLPVVWSSAAFQDNSHIRKLGNWPEQEWKQT 110
Score = 26.2 bits (58), Expect = 9.9
Identities = 9/68 (13%), Positives = 33/68 (48%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
+N + +++ + + E E ++ E+++ ++KK+++++K+K E + +
Sbjct: 25 KNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVV 84
Query: 139 KKKEEEEE 146
++
Sbjct: 85 WSSAAFQD 92
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 34.9 bits (81), Expect = 0.013
Identities = 17/64 (26%), Positives = 27/64 (42%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
EE + +E + E E E K + K + EK+K E + E E K+E +
Sbjct: 1 EELDALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDR 60
Query: 146 EEEK 149
E+
Sbjct: 61 LREE 64
Score = 28.4 bits (64), Expect = 1.7
Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 3/64 (4%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK---KREKKEEEEEKQKDKKEEEVEEE 132
+ L+ + E E +E + E+ K K ++EK E++ E E + K+E + E
Sbjct: 4 DALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLRE 63
Query: 133 EEEQ 136
E E+
Sbjct: 64 EIER 67
Score = 26.5 bits (59), Expect = 8.7
Identities = 12/46 (26%), Positives = 24/46 (52%)
Query: 77 TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
LE+ + + EKEK EE+ + + +++ ++ EE E+ K
Sbjct: 24 KLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLREEIERLK 69
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 34.7 bits (80), Expect = 0.014
Identities = 20/73 (27%), Positives = 40/73 (54%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
SE L ++ E+E ++E E E E + + R K++ + ++ K+K+ +E+E E +
Sbjct: 238 SEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAK 297
Query: 135 EQKNKKKEEEEEE 147
E+K KK+ +
Sbjct: 298 EEKQLKKKLAQLA 310
Score = 33.5 bits (77), Expect = 0.041
Identities = 18/73 (24%), Positives = 34/73 (46%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+E + EK+ +E E EEK+ ++ + E E ++ ++ EE ++E + E
Sbjct: 205 KEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFES 264
Query: 144 EEEEEKEKVEMYR 156
E E + V R
Sbjct: 265 EYEPINKPVRPKR 277
Score = 31.6 bits (72), Expect = 0.15
Identities = 19/72 (26%), Positives = 36/72 (50%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
++E E+E + E++ +E + EEKK EK E + + E +EE +++ + + +E
Sbjct: 200 QEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAW 259
Query: 147 EEKEKVEMYRNP 158
E E N
Sbjct: 260 EGFESEYEPINK 271
Score = 31.2 bits (71), Expect = 0.24
Identities = 17/58 (29%), Positives = 33/58 (56%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E+ KE E E +EE++ K+K + +E +E +K+K + ++ + +E +K K K
Sbjct: 286 EKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLK 343
Score = 30.4 bits (69), Expect = 0.36
Identities = 18/62 (29%), Positives = 38/62 (61%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
N E +E E+E +EE++ ++K K+ K+ ++EK + +K+E+ +E E++K K
Sbjct: 284 NKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLK 343
Query: 140 KK 141
++
Sbjct: 344 RR 345
Score = 28.5 bits (64), Expect = 1.9
Identities = 18/91 (19%), Positives = 41/91 (45%), Gaps = 3/91 (3%)
Query: 60 ANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
+ + + E +++ EE + E E E E E ++ + ++K + +
Sbjct: 227 MAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEP---INKPVRPKRKTKAQR 283
Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
++ +++E E +EE++ KKK + KE
Sbjct: 284 NKEKRRKELEREAKEEKQLKKKLAQLARLKE 314
Score = 27.4 bits (61), Expect = 4.5
Identities = 15/57 (26%), Positives = 34/57 (59%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
E+ +E E++ EEK+ +KK + + K+ +E ++E+ + K++ +E E+K+
Sbjct: 286 EKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKL 342
Score = 27.0 bits (60), Expect = 5.9
Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 23/109 (21%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK--------------------- 109
A AS E +E EE + + EEE ++E + + E
Sbjct: 228 APEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEK 287
Query: 110 --KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
K ++E +EEKQ KK ++ +E K ++E+ K++ R
Sbjct: 288 RRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKER 336
Score = 26.6 bits (59), Expect = 7.3
Identities = 19/72 (26%), Positives = 40/72 (55%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
N +E+ ++ +EE E+E + ++K ++ E+ EKK E+ + + +E E EE +
Sbjct: 189 NPSFEDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDD 248
Query: 140 KKEEEEEEEKEK 151
+EE ++E +
Sbjct: 249 GEEESDDESAWE 260
Score = 26.6 bits (59), Expect = 7.8
Identities = 16/69 (23%), Positives = 32/69 (46%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E + E+K +E E EE++ ++ + + E E E+ D EEE ++E + + +
Sbjct: 206 EVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESE 265
Query: 142 EEEEEEEKE 150
E +
Sbjct: 266 YEPINKPVR 274
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 35.0 bits (81), Expect = 0.014
Identities = 21/66 (31%), Positives = 38/66 (57%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E + +E E + EE E + ++ +E + E+K EE +++ + E E+EE E E + +
Sbjct: 315 ERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRL 374
Query: 144 EEEEEK 149
EE EE+
Sbjct: 375 EELEEQ 380
Score = 34.3 bits (79), Expect = 0.025
Identities = 23/78 (29%), Positives = 38/78 (48%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
LE + E+ E + EE E + +E E K E EE E ++ +E + E E E
Sbjct: 298 SRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLE 357
Query: 136 QKNKKKEEEEEEEKEKVE 153
+ ++ E E EE + ++E
Sbjct: 358 AELEELEAELEELESRLE 375
Score = 34.3 bits (79), Expect = 0.027
Identities = 24/72 (33%), Positives = 42/72 (58%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E + +E E + EE E + DE E+ E +E+ EE +++ + E E EE E + ++ E
Sbjct: 315 ERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRL 374
Query: 146 EEEKEKVEMYRN 157
EE +E++E R+
Sbjct: 375 EELEEQLETLRS 386
Score = 32.7 bits (75), Expect = 0.073
Identities = 24/100 (24%), Positives = 47/100 (47%), Gaps = 1/100 (1%)
Query: 59 RANARLTVILPQAISASETLENNEWEEEEKEKEEEE-EEEEEEKEKDEDEEKKREKKEEE 117
RL + Q LE E + +E +E E EE+ EE +++ + + ++ E
Sbjct: 306 ILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEA 365
Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
E ++ + + EE+EE+ E ++K + E + E+ R
Sbjct: 366 ELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERL 405
Score = 32.0 bits (73), Expect = 0.16
Identities = 19/81 (23%), Positives = 35/81 (43%), Gaps = 2/81 (2%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE--EEEEEQ 136
E E E++++ E E + ++ E + EE E + D+ EE+ EE+ E+
Sbjct: 289 ELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEE 348
Query: 137 KNKKKEEEEEEEKEKVEMYRN 157
++ E E E +E
Sbjct: 349 LKEELESLEAELEELEAELEE 369
Score = 30.4 bits (69), Expect = 0.49
Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 82 EWEEEEKEKEEEEEEEEEEK--------EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
+ EE E E EEE EE + E E++K+ +E ++ +E E + EE
Sbjct: 268 KLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEE 327
Query: 134 EEQKNKKKEEEEEEEKEKVEM 154
E K + EE E +EK+E
Sbjct: 328 LESKLDELAEELAELEEKLEE 348
Score = 29.3 bits (66), Expect = 0.97
Identities = 25/94 (26%), Positives = 44/94 (46%), Gaps = 3/94 (3%)
Query: 63 RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEE---EEKEKDEDEEKKREKKEEEEE 119
RL + Q SE +E+ E EE E+ EE E E E+ EE + E EE
Sbjct: 839 RLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEE 898
Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
++ +E E + E ++ ++ E+ + + ++E
Sbjct: 899 LSEELRELESKRSELRRELEELREKLAQLELRLE 932
Score = 28.1 bits (63), Expect = 3.0
Identities = 25/76 (32%), Positives = 42/76 (55%)
Query: 77 TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
L E EE +E +EE +E EEE E+ E ++ E+K EE + + EEE+EE ++E
Sbjct: 231 VLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKEL 290
Query: 137 KNKKKEEEEEEEKEKV 152
E E+++++
Sbjct: 291 YALANEISRLEQQKQI 306
Score = 27.7 bits (62), Expect = 3.1
Identities = 22/71 (30%), Positives = 35/71 (49%), Gaps = 1/71 (1%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV-EEEEEEQKNK 139
E EE+ +E +EE E E E E+ E E ++ E + EE E+Q + +V + E +
Sbjct: 340 AELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLN 399
Query: 140 KKEEEEEEEKE 150
+ E E E
Sbjct: 400 NEIERLEARLE 410
Score = 27.7 bits (62), Expect = 3.7
Identities = 23/83 (27%), Positives = 37/83 (44%)
Query: 60 ANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
A V + A + E E E E +E EE EE EEE + E E ++ E + E+ +
Sbjct: 736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLK 795
Query: 120 KQKDKKEEEVEEEEEEQKNKKKE 142
++ E ++E E +E
Sbjct: 796 EELKALREALDELRAELTLLNEE 818
Score = 27.7 bits (62), Expect = 3.7
Identities = 15/61 (24%), Positives = 29/61 (47%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
K+ E E E+ +E+ K+ + E E E+ +++ EE EE E + ++ E
Sbjct: 730 ALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEA 789
Query: 145 E 145
+
Sbjct: 790 Q 790
Score = 27.3 bits (61), Expect = 5.2
Identities = 21/68 (30%), Positives = 36/68 (52%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
E EE E++ EE EE+ E ++ + K+ E+ EEE E+ + + EE + +K+
Sbjct: 676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDL 735
Query: 140 KKEEEEEE 147
+ E E E
Sbjct: 736 ARLEAEVE 743
Score = 26.9 bits (60), Expect = 6.0
Identities = 21/84 (25%), Positives = 40/84 (47%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+A++ EE E+ ++E EE + +D + + E+ EE+ +E
Sbjct: 698 KALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELT 757
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
E E E ++ +++ EE EEE + E
Sbjct: 758 ELEAEIEELEERLEEAEEELAEAE 781
Score = 26.9 bits (60), Expect = 6.5
Identities = 25/75 (33%), Positives = 36/75 (48%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E + E E KE EE EEE E+ + E EE R+ ++ + + E E EE
Sbjct: 691 EKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIA 750
Query: 136 QKNKKKEEEEEEEKE 150
Q +K+ E E E +E
Sbjct: 751 QLSKELTELEAEIEE 765
Score = 26.9 bits (60), Expect = 6.9
Identities = 19/94 (20%), Positives = 38/94 (40%)
Query: 57 PIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE 116
+ +++ + +E E + KE E E E E+ ++ EE + E E
Sbjct: 721 LEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEA 780
Query: 117 EEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
E E ++ + + E +EE + + +E E
Sbjct: 781 EAEIEELEAQIEQLKEELKALREALDELRAELTL 814
Score = 26.6 bits (59), Expect = 9.1
Identities = 23/82 (28%), Positives = 39/82 (47%), Gaps = 9/82 (10%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK---------DKKE 126
E L+ E EE+ +E E +E E++ +E + E +EE EE QK + E
Sbjct: 242 EELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLE 301
Query: 127 EEVEEEEEEQKNKKKEEEEEEE 148
++ + E N +++ EE E
Sbjct: 302 QQKQILRERLANLERQLEELEA 323
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 34.6 bits (79), Expect = 0.014
Identities = 29/84 (34%), Positives = 45/84 (53%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
A ++N E E E+ EEEE+EE E + +EEEEE + E+V + E
Sbjct: 170 AQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYE 229
Query: 134 EEQKNKKKEEEEEEEKEKVEMYRN 157
E+ +KK+ EEEE E+E + ++
Sbjct: 230 GERIDKKQGEEEEMEEEVINLFEI 253
Score = 28.0 bits (62), Expect = 2.3
Identities = 34/86 (39%), Positives = 42/86 (48%), Gaps = 16/86 (18%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK----------------D 123
+NE EE +E + EEEE+EE D E RE EEEEE+ + D
Sbjct: 175 DNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERID 234
Query: 124 KKEEEVEEEEEEQKNKKKEEEEEEEK 149
KK+ E EE EEE N + E EEE
Sbjct: 235 KKQGEEEEMEEEVINLFEIEWEEESP 260
Score = 27.3 bits (60), Expect = 4.0
Identities = 16/73 (21%), Positives = 38/73 (52%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
+ +E + + + EEEE E+ + E++ +++ + + +E EEEEE + +
Sbjct: 162 EDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDD 221
Query: 141 KEEEEEEEKEKVE 153
E+ + E E+++
Sbjct: 222 GEDVVDYEGERID 234
Score = 26.1 bits (57), Expect = 8.8
Identities = 19/64 (29%), Positives = 35/64 (54%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E E E+EEEE E ++ E D E +R K++ EE++ +++ + E E E+++
Sbjct: 200 SYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEES 259
Query: 139 KKKE 142
+E
Sbjct: 260 PSEE 263
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 35.1 bits (81), Expect = 0.015
Identities = 18/49 (36%), Positives = 30/49 (61%), Gaps = 5/49 (10%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
W ++ E + + EE+ K+K KK+EKK+EEE K+++K E+ E
Sbjct: 728 WNGKDAEFKISDSVEEKTKKK-----KKKEKKKEEEYKREEKARIEIAE 771
Score = 34.3 bits (79), Expect = 0.021
Identities = 18/70 (25%), Positives = 38/70 (54%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
N + E + + +EE+E + E + D+EKK + ++K+ +E+ E + +++KN
Sbjct: 1512 SNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKN 1571
Query: 139 KKKEEEEEEE 148
KK+ + E
Sbjct: 1572 KKQYKSNTEA 1581
Score = 33.2 bits (76), Expect = 0.057
Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 5/68 (7%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
+EN EE+E + E E ++E+K E +EK EE+ + D K + + +++
Sbjct: 1520 IENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIK-----KRKNKKQ 1574
Query: 138 NKKKEEEE 145
K E E
Sbjct: 1575 YKSNTEAE 1582
Score = 33.2 bits (76), Expect = 0.058
Identities = 20/70 (28%), Positives = 33/70 (47%), Gaps = 2/70 (2%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREK-KEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EK+ E E +EEKE E + +K K+ E +E+ +EE+ E KK +
Sbjct: 1512 SNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESD-IKKRK 1570
Query: 144 EEEEEKEKVE 153
+++ K E
Sbjct: 1571 NKKQYKSNTE 1580
Score = 29.3 bits (66), Expect = 1.1
Identities = 11/42 (26%), Positives = 27/42 (64%)
Query: 94 EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
++ E + +E+ K++KK+E++++++ K+EE+ E E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771
Score = 28.9 bits (65), Expect = 1.4
Identities = 12/43 (27%), Positives = 26/43 (60%)
Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
R ++ E + EE +++++++ KK+EE + EEK ++E
Sbjct: 726 RNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768
Score = 28.5 bits (64), Expect = 2.2
Identities = 10/41 (24%), Positives = 24/41 (58%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
N ++ E + + EE+ ++K+K E ++++ K+EE+
Sbjct: 725 FRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765
Score = 28.1 bits (63), Expect = 2.5
Identities = 21/71 (29%), Positives = 32/71 (45%), Gaps = 7/71 (9%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK-------KEEEEEKQKDKKEEE 128
ET E E E E+E + E E E K +++E E+ EE E+ K + EE
Sbjct: 244 ETSETEERVESEEETDVEIETASETKGTKQEQEGSTEEDPSPSLFSEEREDPDKIDETEE 303
Query: 129 VEEEEEEQKNK 139
+ +E+ K
Sbjct: 304 IRVNGKEKIKK 314
Score = 28.1 bits (63), Expect = 2.8
Identities = 21/94 (22%), Positives = 43/94 (45%), Gaps = 3/94 (3%)
Query: 67 ILPQAISASETLEN-NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
I P I L N N E K+ ++++ EK E + R ++E+E Q + +
Sbjct: 1478 IKPWIIPIKLLLLNLNGNENVNKKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELE 1537
Query: 126 EEEVEEEEEEQK--NKKKEEEEEEEKEKVEMYRN 157
++ ++ E N++K EE+ + ++ +N
Sbjct: 1538 SDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKN 1571
Score = 27.4 bits (61), Expect = 4.3
Identities = 21/62 (33%), Positives = 29/62 (46%), Gaps = 7/62 (11%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
KE E EE E E+E D + E E K K+E+E EE+ + EE E+
Sbjct: 243 KETSETEERVESEEETDVEIETASETK-------GTKQEQEGSTEEDPSPSLFSEEREDP 295
Query: 148 EK 149
+K
Sbjct: 296 DK 297
Score = 26.6 bits (59), Expect = 8.7
Identities = 13/34 (38%), Positives = 24/34 (70%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEE 108
S+++E ++++KEK++EEE + EEK + E E
Sbjct: 738 SDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 33.4 bits (77), Expect = 0.019
Identities = 22/65 (33%), Positives = 42/65 (64%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE+E+EKEE+E +EE+EKE + +K E+++ EE +K K + ++E +++ +E
Sbjct: 19 EEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDEELKE 78
Query: 144 EEEEE 148
+E +
Sbjct: 79 QERWD 83
Score = 28.8 bits (65), Expect = 0.78
Identities = 17/45 (37%), Positives = 31/45 (68%)
Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
EEK+ EK+ E+EEK++ +++E+ + QK ++++ EE EK K
Sbjct: 15 EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAK 59
Score = 28.4 bits (64), Expect = 1.1
Identities = 14/57 (24%), Positives = 38/57 (66%)
Query: 92 EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
EE+ EE+E ++++++ ++++EK+ + QK+++E+ +EE E+ + ++E+
Sbjct: 15 EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDED 71
Score = 28.0 bits (63), Expect = 1.5
Identities = 21/72 (29%), Positives = 43/72 (59%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
+E+ EE+E E++EK+ EEK++E + +K++ +K E E+ + + + ++E+ ++
Sbjct: 15 EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDE 74
Query: 150 EKVEMYRNPVPM 161
E E R PM
Sbjct: 75 ELKEQERWDDPM 86
Score = 27.3 bits (61), Expect = 2.6
Identities = 14/48 (29%), Positives = 32/48 (66%)
Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
D+ + EE+ E+++ +KEE+ +EE+E++ K ++EE ++++E
Sbjct: 6 DKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLE 53
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 33.8 bits (78), Expect = 0.019
Identities = 19/62 (30%), Positives = 37/62 (59%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
+ L+ E EEEE+E+ E EE +EEE+ + E++ + K E+ + + K++E ++
Sbjct: 89 KVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148
Query: 131 EE 132
E+
Sbjct: 149 EQ 150
Score = 31.9 bits (73), Expect = 0.067
Identities = 18/54 (33%), Positives = 37/54 (68%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
+E+E+E+EEE E EE ++E+ DE ++E + + EK+++ + ++ E +E+ K
Sbjct: 99 KEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152
Score = 31.5 bits (72), Expect = 0.10
Identities = 21/58 (36%), Positives = 37/58 (63%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
++KEKEEEEEEE E +E DE+E+ ++E + +++K+ E +++E K + K
Sbjct: 95 GLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152
Score = 28.4 bits (64), Expect = 1.3
Identities = 15/50 (30%), Positives = 33/50 (66%)
Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
+ +KKE+EEE++++ + EE++EEE+ + +KE + + +++ E R
Sbjct: 92 KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENER 141
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 34.7 bits (80), Expect = 0.019
Identities = 16/61 (26%), Positives = 30/61 (49%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
+EE + EE ++ +E EE EK ++ E+ E +EE +K K+ ++ E
Sbjct: 444 EEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEV 503
Query: 150 E 150
+
Sbjct: 504 D 504
Score = 33.9 bits (78), Expect = 0.027
Identities = 21/101 (20%), Positives = 45/101 (44%), Gaps = 2/101 (1%)
Query: 54 YHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK 113
H+ L +L + + L++ E ++ KE EE +EK ++ ++ +
Sbjct: 425 THEVGAEEIILYFLLKKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCE 484
Query: 114 KEEEEEKQKDKKEEEVEEEEEEQ--KNKKKEEEEEEEKEKV 152
+EE +K+ KK +++ E + + EE E+ E +
Sbjct: 485 NQEEVKKEVPKKVKKIPEVDTYLLLEELGINEETYEKLEAL 525
Score = 29.3 bits (66), Expect = 1.1
Identities = 13/64 (20%), Positives = 30/64 (46%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
EEE + E+ + K EE ++ E+ + E +++ KK+ ++ ++ +V
Sbjct: 444 EEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEV 503
Query: 153 EMYR 156
+ Y
Sbjct: 504 DTYL 507
Score = 28.5 bits (64), Expect = 1.8
Identities = 11/54 (20%), Positives = 24/54 (44%)
Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPV 159
+EE + EE ++ + + EE E+ ++ E+ E ++E + V
Sbjct: 444 EEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKV 497
>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
Length = 250
Score = 34.0 bits (78), Expect = 0.020
Identities = 18/87 (20%), Positives = 44/87 (50%), Gaps = 1/87 (1%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDED-EEKKREKKEEEEEKQKDKKEEE 128
Q + + N W++ E+ ++E +E E ++K + E+++ + +E ++++
Sbjct: 32 QVMKKRQQRIANRWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQHL 91
Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVEMY 155
+ E E+ + E E+ E+EK E +
Sbjct: 92 LNEAREDVATARDEWLEQLEREKQEFF 118
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 34.4 bits (80), Expect = 0.022
Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 7/87 (8%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKD------EDEEKKREK-KEEEEEKQKDKKE 126
A E + E EE + +E+EEEEE+E + E EK EK K ++ +K +K
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKA 237
Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+E + E ++KK + E+ KE+++
Sbjct: 238 QEKKVEGRLAQHKKYAKLREKLKEELK 264
Score = 33.2 bits (77), Expect = 0.052
Identities = 16/67 (23%), Positives = 28/67 (41%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E ++ E E E + E + ++ ++E EEEEE++ + +E
Sbjct: 154 EWYDRLENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADE 213
Query: 146 EEEKEKV 152
E EKV
Sbjct: 214 SELPEKV 220
Score = 31.7 bits (73), Expect = 0.16
Identities = 16/73 (21%), Positives = 35/73 (47%), Gaps = 1/73 (1%)
Query: 77 TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ-KDKKEEEVEEEEEE 135
+ EE+ E EE ++ E +E+EE + + +E + +K E+ + ++
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQ 230
Query: 136 QKNKKKEEEEEEE 148
K +K +E++ E
Sbjct: 231 YKKLRKAQEKKVE 243
Score = 26.3 bits (59), Expect = 9.5
Identities = 9/38 (23%), Positives = 16/38 (42%)
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
EE + ++ ++E + + E EEE E E
Sbjct: 61 EEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVESE 98
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 34.2 bits (78), Expect = 0.022
Identities = 17/120 (14%), Positives = 36/120 (30%), Gaps = 8/120 (6%)
Query: 37 VSLTDDAEFQCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEE------EEKEK 90
++ A VGP +P + + + + ++ + + E
Sbjct: 303 LAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPA 362
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
EE E + E+E+ D + + + E + + E E EK
Sbjct: 363 VEETSEADIEREQPGDLAG--QAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKA 420
Score = 32.3 bits (73), Expect = 0.12
Identities = 6/52 (11%), Positives = 17/52 (32%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
+ + + E +E E +E E + +K + + ++
Sbjct: 380 AGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDE 431
Score = 31.5 bits (71), Expect = 0.19
Identities = 9/60 (15%), Positives = 20/60 (33%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
E+E+ + + + + E +E + E E E E+ + +E
Sbjct: 372 EREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDE 431
Score = 31.1 bits (70), Expect = 0.25
Identities = 6/51 (11%), Positives = 14/51 (27%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
+ + + EE + DE + ++ K +E
Sbjct: 388 QVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPG 438
Score = 31.1 bits (70), Expect = 0.29
Identities = 7/74 (9%), Positives = 23/74 (31%), Gaps = 1/74 (1%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
++ +E + + + + + E EE E +E + + E+
Sbjct: 366 TSEADIEREQPGDLAGQAPAAHQVDAEA-ASAAPEEPAALASEAHDETEPEVPEKAAPIP 424
Query: 133 EEEQKNKKKEEEEE 146
+ + ++
Sbjct: 425 DPAKPDELAVAGPG 438
Score = 29.2 bits (65), Expect = 0.97
Identities = 11/69 (15%), Positives = 19/69 (27%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E + E E+ + + + EE E E E E +
Sbjct: 365 ETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIP 424
Query: 144 EEEEEKEKV 152
+ + E
Sbjct: 425 DPAKPDELA 433
Score = 29.2 bits (65), Expect = 1.2
Identities = 11/94 (11%), Positives = 28/94 (29%)
Query: 52 AQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
A+ K A V + ++ + EE + + E E+ + +
Sbjct: 330 AEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQV 389
Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+ + ++ +E E + +K
Sbjct: 390 DAEAASAAPEEPAALASEAHDETEPEVPEKAAPI 423
Score = 27.6 bits (61), Expect = 3.2
Identities = 8/77 (10%), Positives = 19/77 (24%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
+ ET E + E+ + + + + + E + + E +
Sbjct: 360 TPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEK 419
Query: 131 EEEEEQKNKKKEEEEEE 147
K E
Sbjct: 420 AAPIPDPAKPDELAVAG 436
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 34.5 bits (79), Expect = 0.023
Identities = 18/54 (33%), Positives = 36/54 (66%), Gaps = 1/54 (1%)
Query: 86 EEKEKEEEE-EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EEK +E E+ +EE E+E+ +E+++RE+++ E + + + EVE+ E+ +N
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQN 305
Score = 31.8 bits (72), Expect = 0.16
Identities = 14/72 (19%), Positives = 31/72 (43%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E+ +EE + + + + EE + E + E E+ Q D + +EE +
Sbjct: 50 EKTSNAKEEIQVDFQHNSESAVEEVEAEDEIEVEQNQSDVLKSSSIVKEESISTDMDGID 109
Query: 145 EEEEKEKVEMYR 156
++ K+++ R
Sbjct: 110 DDSLDRKLKLER 121
Score = 30.6 bits (69), Expect = 0.45
Identities = 20/55 (36%), Positives = 30/55 (54%), Gaps = 4/55 (7%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
EE+ E EK EE +RE++ EE+ + +EEE E ++ K E E+ EK
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRR----REEEKAAMEADRAQAKAEVEKRREK 302
Score = 27.1 bits (60), Expect = 5.8
Identities = 14/45 (31%), Positives = 27/45 (60%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
LE EE E+E++ EE+ EE++ + ++ + K E E+ ++K
Sbjct: 258 LEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREK 302
Score = 26.8 bits (59), Expect = 6.7
Identities = 18/52 (34%), Positives = 30/52 (57%), Gaps = 3/52 (5%)
Query: 107 EEKKRE-KKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
EEK+RE +K +EE +++++ EE EEE+ E + + K +VE R
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEK--AAMEADRAQAKAEVEKRRE 301
Score = 26.4 bits (58), Expect = 9.0
Identities = 15/59 (25%), Positives = 27/59 (45%), Gaps = 5/59 (8%)
Query: 67 ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
I+P I E W EEE + + +EE +E+ + ++ + + E K+K K
Sbjct: 417 IVPNNIP-----EELYWVEEEHQIYRKLQEERRLREEAIRAKAEKTARMKAEMKEKTLK 470
Score = 26.4 bits (58), Expect = 9.4
Identities = 16/78 (20%), Positives = 31/78 (39%), Gaps = 4/78 (5%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD----KKEEEVEEEEEEQ 136
EK +EE + + + E ++ E ++E E +Q K V+EE
Sbjct: 44 LSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEIEVEQNQSDVLKSSSIVKEESIST 103
Query: 137 KNKKKEEEEEEEKEKVEM 154
+++ + K K+E
Sbjct: 104 DMDGIDDDSLDRKLKLER 121
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 33.8 bits (78), Expect = 0.024
Identities = 22/75 (29%), Positives = 45/75 (60%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
+ E + EE E+ ++ EEEE EEE+ ++E+E ++ +++ E E Q ++ +++ +
Sbjct: 5 EQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQA 64
Query: 135 EQKNKKKEEEEEEEK 149
E +N +K E E E+
Sbjct: 65 EFENLRKRTEREREE 79
Score = 31.1 bits (71), Expect = 0.16
Identities = 22/57 (38%), Positives = 39/57 (68%)
Query: 96 EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
++E++ E+ + EE + +K EEEE ++++ EEE E EEE+Q+ + E + EE K+K
Sbjct: 3 DKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKY 59
Score = 28.4 bits (64), Expect = 1.4
Identities = 26/65 (40%), Positives = 43/65 (66%), Gaps = 1/65 (1%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE-EEVEEEEEEQKNKKKEE 143
++E++ EE + EE EE EK E+EE + E+ EEE E +++++E E+E + EE K+K
Sbjct: 3 DKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRA 62
Query: 144 EEEEE 148
+ E E
Sbjct: 63 QAEFE 67
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 33.8 bits (78), Expect = 0.024
Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
E+ E+EEEE+ +EEE+E+ K R E+ KQK K+ ++ E+EE +
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRS--EQSRLKQKAKEMQKEEDEEMRHR 169
Score = 32.7 bits (75), Expect = 0.050
Identities = 16/53 (30%), Positives = 33/53 (62%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
+ E+ E EEEE+++++E E R K E+ + +K +E+++EE+E+ +
Sbjct: 117 RFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169
Score = 29.6 bits (67), Expect = 0.61
Identities = 14/45 (31%), Positives = 27/45 (60%)
Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
+ + +EE+KR+++E E + K E +++ K +KEE+EE
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165
Score = 26.5 bits (59), Expect = 7.1
Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 5/44 (11%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREK-----KEEEEEKQK 122
EEEEK EEE E + ++ + ++K KEE+EE +
Sbjct: 125 EEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRH 168
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 33.9 bits (78), Expect = 0.025
Identities = 18/55 (32%), Positives = 25/55 (45%), Gaps = 6/55 (10%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
+K+KEEEE E E E E+ KQK K+E++ E EE +
Sbjct: 116 RFLAQKQKEEEERRVERR------RELGLEDPEQLRLKQKAKEEQKAESEETRHR 164
Score = 30.1 bits (68), Expect = 0.47
Identities = 13/43 (30%), Positives = 28/43 (65%)
Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
+K++EEE+++ ++ E+ E+ EQ K++ +EE++ E E
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEET 161
Score = 28.1 bits (63), Expect = 2.2
Identities = 12/47 (25%), Positives = 26/47 (55%), Gaps = 6/47 (12%)
Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
+K ++EE++R ++ E + E+ +QK K++++ E EE
Sbjct: 120 QKQKEEEERRVERRRELGLEDP------EQLRLKQKAKEEQKAESEE 160
Score = 27.7 bits (62), Expect = 2.4
Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 2/48 (4%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE--KKEEEEEKQKD 123
L + EEEE+ E E E+ E+ ++K +E K E EE + +
Sbjct: 118 LAQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRHRA 165
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 34.3 bits (79), Expect = 0.025
Identities = 17/74 (22%), Positives = 36/74 (48%), Gaps = 1/74 (1%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE-EEVEEEEEEQKNKKKEEEE 145
K+ + + + + + ++ E + EK + KK EE+ EE E+ K ++++
Sbjct: 372 YKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKK 431
Query: 146 EEEKEKVEMYRNPV 159
++KE E +R V
Sbjct: 432 RKKKEWFEKFRWFV 445
Score = 30.4 bits (69), Expect = 0.37
Identities = 18/66 (27%), Positives = 31/66 (46%), Gaps = 4/66 (6%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ + E +E E + +K E K+ EE +EE +EE + K KK+++
Sbjct: 379 KVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEE----IREELIEEGLLKSKKKKRKK 434
Query: 144 EEEEEK 149
+E EK
Sbjct: 435 KEWFEK 440
Score = 30.1 bits (68), Expect = 0.63
Identities = 16/65 (24%), Positives = 33/65 (50%), Gaps = 1/65 (1%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
+ + EK E +E+ E + + + +K E+E +K + K E+ E+E EE + +
Sbjct: 259 AYADLEKLFNEALDEKFERDKIKQLASELEKKLEKELKKLE-NKLEKQEDELEELEKAAE 317
Query: 142 EEEEE 146
E ++
Sbjct: 318 ELRQK 322
Score = 29.3 bits (66), Expect = 1.00
Identities = 14/79 (17%), Positives = 34/79 (43%), Gaps = 8/79 (10%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE-------- 134
+ E+ + E ++ + ++ K+ KK + + D++ E++E
Sbjct: 343 ADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTA 402
Query: 135 EQKNKKKEEEEEEEKEKVE 153
+K + K+ EE +E +E
Sbjct: 403 LEKAEGKKAIEEIREELIE 421
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 34.1 bits (78), Expect = 0.026
Identities = 23/77 (29%), Positives = 51/77 (66%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E + EE++K+KE+ +EE ++ K K+E +EK+ K++E+E+++K ++ + EEE++
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKK 165
Query: 135 EQKNKKKEEEEEEEKEK 151
++ + K ++ K+K
Sbjct: 166 RERVRAKSRPKKPPKKK 182
Score = 33.3 bits (76), Expect = 0.045
Identities = 26/68 (38%), Positives = 48/68 (70%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
++ K E +EEE+EKE+ ++E+KK+++K +EE K + KEE E+ ++K K+KE++
Sbjct: 95 AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKV 154
Query: 146 EEEKEKVE 153
EE +++ E
Sbjct: 155 EEPRDREE 162
Score = 33.3 bits (76), Expect = 0.046
Identities = 24/85 (28%), Positives = 52/85 (61%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E E + +EE+K+K+E+ +EE ++++ E+ ++KR KE+E+EK+K +E EEE++
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKK 165
Query: 136 QKNKKKEEEEEEEKEKVEMYRNPVP 160
++ + + ++ +K + P
Sbjct: 166 RERVRAKSRPKKPPKKKPPNKKKEP 190
Score = 32.9 bits (75), Expect = 0.068
Identities = 21/65 (32%), Positives = 46/65 (70%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
K +E + E +E+EK++++ K+ +KK++E+ K++ K + EE +E++ K+KE+E+E
Sbjct: 92 TKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKE 151
Query: 147 EEKEK 151
++ E+
Sbjct: 152 KKVEE 156
Score = 32.9 bits (75), Expect = 0.070
Identities = 23/70 (32%), Positives = 48/70 (68%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E KE+E+E+E+ +EEK+K +++ K+ K + +E+ K+K+ + +E+E+E+K ++ +
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159
Query: 144 EEEEEKEKVE 153
EEE+K +
Sbjct: 160 REEEKKRERV 169
Score = 32.6 bits (74), Expect = 0.074
Identities = 20/70 (28%), Positives = 50/70 (71%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+E + E +EEE+E+E+ ++++ ++K++ K+E ++ K K++ +E+ +E+E++ +KK E
Sbjct: 96 KEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVE 155
Query: 144 EEEEEKEKVE 153
E + +E+ +
Sbjct: 156 EPRDREEEKK 165
Score = 32.6 bits (74), Expect = 0.093
Identities = 28/79 (35%), Positives = 54/79 (68%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
++ +N +EEEKEKE+ +EE++++KEK ++E K R+ KEE +EK+ K++E+ +E+
Sbjct: 93 KPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEK 152
Query: 133 EEEQKNKKKEEEEEEEKEK 151
+ E+ ++EE++ E
Sbjct: 153 KVEEPRDREEEKKRERVRA 171
Score = 31.8 bits (72), Expect = 0.14
Identities = 24/81 (29%), Positives = 49/81 (60%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
K K +E + E KE+++++E+ +E+K++++EK K++ ++ +EE ++K KE+E+E
Sbjct: 90 AKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKE 149
Query: 147 EEKEKVEMYRNPVPMYRNPVP 167
+EK+ E R V
Sbjct: 150 KEKKVEEPRDREEEKKRERVR 170
Score = 31.8 bits (72), Expect = 0.17
Identities = 18/75 (24%), Positives = 40/75 (53%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEEK++E + +K + K+++ EEE++++ +E + EE N+++E+
Sbjct: 160 REEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREK 219
Query: 144 EEEEEKEKVEMYRNP 158
EE++ K++
Sbjct: 220 EEDDGKDRETTTSPM 234
Score = 29.9 bits (67), Expect = 0.63
Identities = 20/70 (28%), Positives = 42/70 (60%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ +E+ K+ + +EE +EK +++EK++EKK EE ++++K+ E + K K++
Sbjct: 123 KPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKK 182
Query: 144 EEEEEKEKVE 153
++KE E
Sbjct: 183 PPNKKKEPPE 192
Score = 29.5 bits (66), Expect = 0.88
Identities = 20/69 (28%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREK-KEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
E ++++ +EE +E+ KEK++++EKK E+ ++ EEEK++++ + ++ +K +
Sbjct: 127 EPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNK 186
Query: 143 EEEEEEKEK 151
++E E+EK
Sbjct: 187 KKEPPEEEK 195
Score = 27.9 bits (62), Expect = 3.1
Identities = 20/78 (25%), Positives = 42/78 (53%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
A +E K E+ + K K E K KEEE+EK++ K+E++ ++E+
Sbjct: 64 AKCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEK 123
Query: 134 EEQKNKKKEEEEEEEKEK 151
+++ K ++ +EE ++++
Sbjct: 124 PKEEPKDRKPKEEAKEKR 141
Score = 27.5 bits (61), Expect = 3.5
Identities = 15/70 (21%), Positives = 43/70 (61%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E+E++++++ EE + E+EK + + + + ++ +K+ K++E EEE++++ ++
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAV 204
Query: 144 EEEEEKEKVE 153
+ + E+ V
Sbjct: 205 KGKPEEPDVN 214
Score = 27.2 bits (60), Expect = 5.6
Identities = 21/87 (24%), Positives = 47/87 (54%), Gaps = 5/87 (5%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK-----EEEEEKQKD 123
P+ + E+ +++E+E+E++ E+ +D +EEKKRE+ ++ K+K
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKP 183
Query: 124 KKEEEVEEEEEEQKNKKKEEEEEEEKE 150
+++ EEE+Q+ +E + + +E
Sbjct: 184 PNKKKEPPEEEKQRQAAREAVKGKPEE 210
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 34.3 bits (79), Expect = 0.026
Identities = 17/71 (23%), Positives = 38/71 (53%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
+ + +E+ K E+E E +KE+ E+E ++ E + E + ++E +E E + +
Sbjct: 841 QRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRE 900
Query: 139 KKKEEEEEEEK 149
+++ EE E +
Sbjct: 901 LERKIEELEAQ 911
Score = 31.2 bits (71), Expect = 0.27
Identities = 26/89 (29%), Positives = 44/89 (49%), Gaps = 1/89 (1%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE-EEVEE 131
S+ L + + E EKE E+ E+EEEK K+ EE + + E+E + K E +E+E
Sbjct: 706 ELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEA 765
Query: 132 EEEEQKNKKKEEEEEEEKEKVEMYRNPVP 160
EE + + EE + + + +P
Sbjct: 766 RIEELEEDLHKLEEALNDLEARLSHSRIP 794
Score = 29.7 bits (67), Expect = 0.92
Identities = 19/81 (23%), Positives = 34/81 (41%), Gaps = 2/81 (2%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK--NK 139
E EEE +E E + E + E + E + E E++ ++ E ++E++ +
Sbjct: 865 ELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKA 924
Query: 140 KKEEEEEEEKEKVEMYRNPVP 160
K E EEE E +
Sbjct: 925 KLEALEEELSEIEDPKGEDEE 945
Score = 29.3 bits (66), Expect = 1.2
Identities = 24/68 (35%), Positives = 41/68 (60%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E E E ++ EE + E E +K E ++ E++ EEE K++DK EE E +EE ++ +
Sbjct: 312 EKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRA 371
Query: 142 EEEEEEEK 149
E EE +++
Sbjct: 372 ELEEVDKE 379
Score = 28.1 bits (63), Expect = 2.8
Identities = 17/65 (26%), Positives = 31/65 (47%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
E E E++ + E EK+ ++ +E+ K++ + +E E K++ EEE
Sbjct: 811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEEL 870
Query: 147 EEKEK 151
EE E
Sbjct: 871 EELEA 875
Score = 28.1 bits (63), Expect = 3.0
Identities = 23/71 (32%), Positives = 34/71 (47%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E E E E++ + ED E++ K E E +K + EE E EEE+K + K
Sbjct: 297 GELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT 356
Query: 144 EEEEEKEKVEM 154
EE E ++
Sbjct: 357 EEYAELKEELE 367
Score = 27.3 bits (61), Expect = 4.7
Identities = 21/79 (26%), Positives = 39/79 (49%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E + E E+EK E + +EK + E E +EK+ E +K+ +++ EEE
Sbjct: 194 DEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEEL 253
Query: 135 EQKNKKKEEEEEEEKEKVE 153
E+ ++ E E+ +E +
Sbjct: 254 EKLTEEISELEKRLEEIEQ 272
Score = 26.6 bits (59), Expect = 8.0
Identities = 17/64 (26%), Positives = 33/64 (51%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E + E EEE+E++ E +K E + ++ + E++ +EE + E+E ++E
Sbjct: 433 EAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQREL 492
Query: 144 EEEE 147
E E
Sbjct: 493 AEAE 496
Score = 26.6 bits (59), Expect = 8.8
Identities = 18/73 (24%), Positives = 34/73 (46%), Gaps = 1/73 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+EEEK KE EE EE+ +++ E + + +E E + ++ EE++ + EE + +
Sbjct: 730 QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEAR-IEELEEDLHKLEEALNDLEARL 788
Query: 144 EEEEEKEKVEMYR 156
E
Sbjct: 789 SHSRIPEIQAELS 801
Score = 26.6 bits (59), Expect = 9.1
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 5/77 (6%)
Query: 80 NNEWEEEEKEKEEEE----EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE-EEEE 134
E + E+E E E E++ ++E ++ +E +E++ D KE+ E+E
Sbjct: 797 QAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI 856
Query: 135 EQKNKKKEEEEEEEKEK 151
E N KKEE EEE +E
Sbjct: 857 ENLNGKKEELEEELEEL 873
Score = 26.6 bits (59), Expect = 9.1
Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 4/88 (4%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE----KKEEEEEKQKDKK 125
+ I E E EKE +E +E+ + KE+ + EK+ E KKEE EE+ ++ +
Sbjct: 815 REIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELE 874
Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ E KK+ +E E + ++E
Sbjct: 875 AALRDLESRLGDLKKERDELEAQLRELE 902
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 34.2 bits (78), Expect = 0.026
Identities = 18/66 (27%), Positives = 35/66 (53%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E++E EEE E E ++E +E ++DE+ ++ E E+E + EE+ E+ +E +
Sbjct: 930 ESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESK 989
Query: 139 KKKEEE 144
+
Sbjct: 990 AAYDSR 995
Score = 33.1 bits (75), Expect = 0.062
Identities = 17/63 (26%), Positives = 35/63 (55%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+ E +E EEE E + +DE + ++ EE +E +D E+E E + ++++ + +E
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDEL 986
Query: 146 EEE 148
E +
Sbjct: 987 ESK 989
Score = 33.1 bits (75), Expect = 0.070
Identities = 17/68 (25%), Positives = 33/68 (48%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E EE+ E E ++E E DEDEE ++ E++ ++ +E + E+ ++ K
Sbjct: 933 ESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKAAY 992
Query: 144 EEEEEKEK 151
+ K +
Sbjct: 993 DSRPGKRR 1000
Score = 32.3 bits (73), Expect = 0.099
Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 1/71 (1%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDED-EEKKREKKEEEEEKQKDKKEEEVEE 131
S E +E+E ++ +E +E+EE E ED E + E +EE +D E E +
Sbjct: 931 SDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKA 990
Query: 132 EEEEQKNKKKE 142
+ + K+++
Sbjct: 991 AYDSRPGKRRK 1001
Score = 31.5 bits (71), Expect = 0.22
Identities = 15/65 (23%), Positives = 31/65 (47%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
++ +E EEE E E D+E ++EE ++ + E+ E + + ++ +E E
Sbjct: 929 DESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELES 988
Query: 149 KEKVE 153
K +
Sbjct: 989 KAAYD 993
Score = 30.8 bits (69), Expect = 0.32
Identities = 19/75 (25%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE--EEEEKQKDKKEEEVEEEEEEQ 136
++E +E E+E E E ++E ++ +++E+ E E E+E + D +EE E+ +E
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDEL 986
Query: 137 KNKKKEEEEEEEKEK 151
++K + ++ K
Sbjct: 987 ESKAAYDSRPGKRRK 1001
Score = 29.6 bits (66), Expect = 0.76
Identities = 16/65 (24%), Positives = 33/65 (50%)
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
++E +E EEE + E + +E+E+ + E+ E+E E + +++ E+ +E E
Sbjct: 928 DDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELE 987
Query: 151 KVEMY 155
Y
Sbjct: 988 SKAAY 992
>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
Length = 470
Score = 34.1 bits (79), Expect = 0.028
Identities = 17/67 (25%), Positives = 31/67 (46%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E+ KE E E ++EK D+ E + E + +K + E+ EQ + +
Sbjct: 404 ELRERAKEALETEALAQEEKLADDLLSLEGLDRELAFKLAEKGIKTLEDLAEQAVDELID 463
Query: 144 EEEEEKE 150
E +E++
Sbjct: 464 IEGDEEK 470
Score = 32.9 bits (76), Expect = 0.058
Identities = 17/85 (20%), Positives = 38/85 (44%)
Query: 66 VILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
+++ + S+ E L EE + + +EE EE +E+ ++ + +EE+
Sbjct: 371 LLVEEGFSSLEELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLS 430
Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKE 150
E ++ E + +K + E+ E
Sbjct: 431 LEGLDRELAFKLAEKGIKTLEDLAE 455
Score = 30.2 bits (69), Expect = 0.52
Identities = 20/79 (25%), Positives = 33/79 (41%), Gaps = 1/79 (1%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
I + E E KE E E +EEK D+ + E K +K + +E
Sbjct: 393 EIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLE-GLDRELAFKLAEKGIKTLE 451
Query: 131 EEEEEQKNKKKEEEEEEEK 149
+ E+ ++ + E +EEK
Sbjct: 452 DLAEQAVDELIDIEGDEEK 470
Score = 28.7 bits (65), Expect = 1.7
Identities = 15/68 (22%), Positives = 32/68 (47%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
EE EE E E +E E+ RE+ +E E + +EE++ ++ + +E
Sbjct: 381 EELAYVPVEELLEIEGFDEETVEELRERAKEALETEALAQEEKLADDLLSLEGLDRELAF 440
Query: 146 EEEKEKVE 153
+ ++ ++
Sbjct: 441 KLAEKGIK 448
Score = 27.5 bits (62), Expect = 3.3
Identities = 15/84 (17%), Positives = 34/84 (40%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+A + + + + + +EE + E+ ++ EE + + +EE
Sbjct: 342 EASEKRQAEFDAILDLFMEALDIDEEIAQLLVEEGFSSLEELAYVPVEELLEIEGFDEET 401
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
EE E+ + E E ++EK+
Sbjct: 402 VEELRERAKEALETEALAQEEKLA 425
Score = 27.1 bits (61), Expect = 4.6
Identities = 17/80 (21%), Positives = 33/80 (41%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E L+ +E + +E EE E+ + EE E+ +++ +E +E E
Sbjct: 359 MEALDIDEEIAQLLVEEGFSSLEELAYVPVEELLEIEGFDEETVEELRERAKEALETEAL 418
Query: 135 EQKNKKKEEEEEEEKEKVEM 154
Q+ K ++ E E+
Sbjct: 419 AQEEKLADDLLSLEGLDREL 438
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 34.0 bits (78), Expect = 0.028
Identities = 14/75 (18%), Positives = 27/75 (36%), Gaps = 6/75 (8%)
Query: 84 EEEEKEKEEEEEEEEEEKE-KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
E E + + EE K + E + K E + EE + Q+
Sbjct: 869 EYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEE-----KSAQERMPGA 923
Query: 143 EEEEEEKEKVEMYRN 157
+ EE ++ + Y++
Sbjct: 924 DSPEELMKRAKEYQD 938
Score = 33.6 bits (77), Expect = 0.035
Identities = 9/56 (16%), Positives = 20/56 (35%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
+ + N + E + + EE+ +E+ + E + +E Q K
Sbjct: 886 NTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941
Score = 33.2 bits (76), Expect = 0.055
Identities = 12/60 (20%), Positives = 24/60 (40%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
K + + E + D + +E + Q + + + E Q NK EE+ +E+
Sbjct: 860 KHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQER 919
Score = 32.4 bits (74), Expect = 0.10
Identities = 13/58 (22%), Positives = 24/58 (41%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
EE + + E + + E K E+K +E EE+ + +E ++K K
Sbjct: 884 EENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941
Score = 28.9 bits (65), Expect = 1.6
Identities = 9/51 (17%), Positives = 20/51 (39%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
L+N+ E + + EE+ ++ + + K +E + K K
Sbjct: 892 YSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHKG 942
Score = 27.8 bits (62), Expect = 3.4
Identities = 9/57 (15%), Positives = 23/57 (40%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
+I E ++ E + + E + + +++ +++ + EE K KE
Sbjct: 879 NSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKE 935
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 33.9 bits (78), Expect = 0.030
Identities = 17/46 (36%), Positives = 30/46 (65%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
NE+E E E+EEEE EE EE+ +++ ++ K+ EE K++++
Sbjct: 143 NEFELELPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELR 188
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 33.7 bits (77), Expect = 0.030
Identities = 17/74 (22%), Positives = 33/74 (44%), Gaps = 7/74 (9%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKE-EEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
K++E+ E+ ++ ED KK+E+K +E+K+ Q ++E +E
Sbjct: 165 KQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGG----GGSSGGQSGLSTKDEPPKE 220
Query: 149 KEK--VEMYRNPVP 160
K + + R P
Sbjct: 221 KRQKHHDPERRLEP 234
Score = 33.3 bits (76), Expect = 0.033
Identities = 13/65 (20%), Positives = 31/65 (47%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E+ +K +E+ +++E+++ DE+K+ Q ++ +E+ QK+ E
Sbjct: 173 EDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPPKEKRQKHHDPERRL 232
Query: 146 EEEKE 150
E +
Sbjct: 233 EPQSH 237
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 33.9 bits (78), Expect = 0.031
Identities = 18/70 (25%), Positives = 37/70 (52%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+++E + E EE E E +E+ +++ ++ ++ + K E E+E E K KKE
Sbjct: 384 ARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKEN 443
Query: 144 EEEEEKEKVE 153
+ E +++K
Sbjct: 444 KNEFKEKKES 453
Score = 32.7 bits (75), Expect = 0.080
Identities = 23/76 (30%), Positives = 37/76 (48%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + E KEK+E +EEEE E E++ EK K + EK + ++EEE +EE
Sbjct: 439 LKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLK 498
Query: 139 KKKEEEEEEEKEKVEM 154
+ +K+ +
Sbjct: 499 TTSSVGKSAKKQDSKK 514
Score = 32.7 bits (75), Expect = 0.083
Identities = 23/79 (29%), Positives = 44/79 (55%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E E E E EEE +EEE E+ ++ +++ E E++ + KK ++ + E +
Sbjct: 389 ENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFK 448
Query: 136 QKNKKKEEEEEEEKEKVEM 154
+K + EEEE E++E+ ++
Sbjct: 449 EKKESDEEEELEDEEEAKV 467
Score = 30.0 bits (68), Expect = 0.60
Identities = 18/71 (25%), Positives = 42/71 (59%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E E+E E ++ ++E + E ++ ++ +++ E ++EEE K + + ++ E+ QK +++
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEE 487
Query: 142 EEEEEEEKEKV 152
EE +EE
Sbjct: 488 EELDEENPWLK 498
Score = 30.0 bits (68), Expect = 0.66
Identities = 19/67 (28%), Positives = 41/67 (61%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
+ E+E E ++ +KE + ++K+E EEEE + +++ + E + ++++K ++EEE
Sbjct: 427 PENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEE 486
Query: 147 EEKEKVE 153
EE+ E
Sbjct: 487 EEELDEE 493
Score = 29.6 bits (67), Expect = 0.83
Identities = 24/71 (33%), Positives = 44/71 (61%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E +E+ +EEE EE K+ + E E+E E +K KKE + E +E+++ ++++E
Sbjct: 400 ELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEEL 459
Query: 144 EEEEEKEKVEM 154
E+EEE + ++
Sbjct: 460 EDEEEAKVEKV 470
Score = 28.9 bits (65), Expect = 1.6
Identities = 18/71 (25%), Positives = 41/71 (57%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEE ++EE EE ++ + + + EK+ E ++ +K+ K E E++E +++ + ++E
Sbjct: 403 GEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDE 462
Query: 144 EEEEEKEKVEM 154
EE + ++
Sbjct: 463 EEAKVEKVANK 473
Score = 28.1 bits (63), Expect = 2.7
Identities = 19/74 (25%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
+ EE+E E + EEE+E++ E D+D E ++ K K+ +++E K +
Sbjct: 322 SVSEEDEDEDSDSEEEDEDDDEDDDDGENPW-MLRKKLGKLKEGEDDEENSGLLSMKFMQ 380
Query: 141 KEEEEEEEKEKVEM 154
+ E ++E+ E+
Sbjct: 381 RAEARKKEENDAEI 394
Score = 27.3 bits (61), Expect = 4.1
Identities = 20/75 (26%), Positives = 44/75 (58%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
LE E +EE+ +E ++ K E+ EK+ E K+ ++E + + KE++ +EEEE +
Sbjct: 401 LEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELE 460
Query: 138 NKKKEEEEEEEKEKV 152
++++ + E+ + +
Sbjct: 461 DEEEAKVEKVANKLL 475
Score = 26.6 bits (59), Expect = 8.5
Identities = 17/66 (25%), Positives = 33/66 (50%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
++K + +E E+++E K ++ E +K+++ E E E E + + EEE EE
Sbjct: 355 RKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEE 414
Query: 148 EKEKVE 153
+K
Sbjct: 415 PSKKNV 420
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 32.7 bits (75), Expect = 0.032
Identities = 22/74 (29%), Positives = 53/74 (71%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
L + +W++E +++E +++ EE++++ +E KKR K++++++K+K KK+ + ++EE++
Sbjct: 54 LMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKE 113
Query: 138 NKKKEEEEEEEKEK 151
K EE +E+E+
Sbjct: 114 GSKSSEESSDEEEE 127
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 33.7 bits (78), Expect = 0.033
Identities = 18/51 (35%), Positives = 31/51 (60%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
EE +EE EE E EK + ++ RE+ EE EE+ D +++ +E+++E
Sbjct: 333 EENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEG 383
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 32.7 bits (75), Expect = 0.036
Identities = 17/72 (23%), Positives = 44/72 (61%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+ + ++EE+E+ + + ++E + K ++E+++ D+ E ++E +E++ K++E++
Sbjct: 26 RDGVDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQK 85
Query: 145 EEEEKEKVEMYR 156
E+EE EK
Sbjct: 86 EKEEIEKYREEN 97
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 33.7 bits (77), Expect = 0.037
Identities = 15/41 (36%), Positives = 30/41 (73%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
+E +W +E KEK+EE++EE+ E+ K E+ +++ E++E+
Sbjct: 270 IEGWDWLKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGF 310
Score = 28.3 bits (63), Expect = 2.3
Identities = 17/47 (36%), Positives = 29/47 (61%)
Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
+E KE++EEK ++K EE EE +E+ ++++ E EKV + N
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEAN 323
Score = 27.9 bits (62), Expect = 3.3
Identities = 17/44 (38%), Positives = 25/44 (56%)
Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
EKK EK EE+ E+ K ++ +E EEEE+ + E+ E K
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANK 324
Score = 26.7 bits (59), Expect = 7.8
Identities = 13/46 (28%), Positives = 27/46 (58%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
E++E++ EE + EE +E+ ++E++ E E+ + K +EV
Sbjct: 285 EKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKKFIKEV 330
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 32.9 bits (76), Expect = 0.037
Identities = 13/34 (38%), Positives = 21/34 (61%)
Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
D+ E K EK E++ EK K K E++ ++EE +
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 30.6 bits (70), Expect = 0.23
Identities = 19/48 (39%), Positives = 28/48 (58%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
NE ++ KE E +++E + E EKK EK + + EK+K KKEE
Sbjct: 48 NEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95
Score = 27.5 bits (62), Expect = 2.2
Identities = 9/34 (26%), Positives = 20/34 (58%)
Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
+E K +K E++ ++ K E+ + ++EE K +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 26.7 bits (60), Expect = 4.3
Identities = 15/53 (28%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 100 EKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
E+ KD E + +++E K +K E+ + E+ + K +KK+ ++EE K ++
Sbjct: 49 EEYKDLKESLEAALLDKKELKAWEKAEK--KAEKAKAKAEKKKAKKEEPKPRL 99
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 33.5 bits (77), Expect = 0.038
Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 15/85 (17%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE---------------KKEEEEEKQKDKKE 126
E EEEE+E+EE E +E E+ D + + RE + +++
Sbjct: 288 EEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQK 347
Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEK 151
+E EE+E+E++ +++EE EE E E+
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEE 372
Score = 32.3 bits (74), Expect = 0.092
Identities = 23/89 (25%), Positives = 42/89 (47%), Gaps = 27/89 (30%)
Query: 92 EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD---------------------------K 124
EEEEEEEE+E+ + ++ E + E + ++ +
Sbjct: 287 LEEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRCTWVNPEQ 346
Query: 125 KEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
K+EE E+E+EE++ +++E EE E +E
Sbjct: 347 KDEEEEQEDEEEEEEEEEPEEPEPEEGPP 375
Score = 28.5 bits (64), Expect = 1.6
Identities = 14/28 (50%), Positives = 23/28 (82%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEE 108
N +++E+E++E+EEEEEEE+E +E E
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEP 370
Score = 27.7 bits (62), Expect = 3.0
Identities = 18/37 (48%), Positives = 27/37 (72%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEK 120
E EE+E+EEE E +EE E +E+EE++ E+ +EEE
Sbjct: 169 EGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEG 205
Score = 27.7 bits (62), Expect = 3.6
Identities = 15/33 (45%), Positives = 25/33 (75%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE 112
N E ++EE+E+E+EEEEEEEE+ ++ + E+
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPP 375
Score = 26.9 bits (60), Expect = 5.3
Identities = 21/43 (48%), Positives = 29/43 (67%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
E E E EEEEEEE+ + DEE + E++EEEE ++ K+EE
Sbjct: 164 EVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGT 206
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 33.6 bits (77), Expect = 0.040
Identities = 16/75 (21%), Positives = 27/75 (36%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
+ LE +E + K+ +E K + + EE+ K +K E
Sbjct: 39 DELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHP 98
Query: 136 QKNKKKEEEEEEEKE 150
+ E EEEE+
Sbjct: 99 KSKAPSTESEEEEEP 113
>gnl|CDD|211344 cd02867, PseudoU_synth_TruB_4, Pseudouridine synthase homolog 4.
This group consists of Eukaryotic TruB proteins similar
to Saccharomyces cerevisiae Pus4. S. cerevisiae Pus4,
makes psi55 in the T loop of both cytoplasmic and
mitochondrial tRNAs. Psi55 is almost universally
conserved. Pseudouridine synthases catalyze the
isomerization of specific uridines in an RNA molecule to
pseudouridines (5-ribosyluracil, psi).
Length = 312
Score = 33.2 bits (76), Expect = 0.040
Identities = 29/103 (28%), Positives = 40/103 (38%), Gaps = 9/103 (8%)
Query: 56 KPIRANARLTVILPQAISASET----LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
K + AR LP+ I + L +W E EEE ++ E+ + +
Sbjct: 152 KRLYEYAREGKPLPRPIERRQVVVSELLVKDWIEPGPLFTRTVEEEGKQYERSVVKMLGK 211
Query: 112 EKKEEEEEKQKDKK-----EEEVEEEEEEQKNKKKEEEEEEEK 149
E K E + EEVE EE K K + EEE EK
Sbjct: 212 ELKTFAEVTELTATAEGDPVEEVEATHEESKRKSEVEEEANEK 254
>gnl|CDD|143165 cd00096, Ig, Immunoglobulin domain. Ig: immunoglobulin (Ig)
domain found in the Ig superfamily. The Ig superfamily
is a heterogenous group of proteins, built on a common
fold comprised of a sandwich of two beta sheets.
Members of this group are components of immunoglobulin,
neuroglia, cell surface glycoproteins, such as, T-cell
receptors, CD2, CD4, CD8, and membrane glycoproteins,
such as, butyrophilin and chondroitin sulfate
proteoglycan core protein. A predominant feature of
most Ig domains is a disulfide bridge connecting the
two beta-sheets with a tryptophan residue packed
against the disulfide bond.
Length = 74
Score = 31.7 bits (71), Expect = 0.040
Identities = 9/34 (26%), Positives = 12/34 (35%)
Query: 16 FPSYMRVGDSSHGVYNLRIRNVSLTDDAEFQCQV 49
+S G L I NV+L D + C
Sbjct: 29 LTRVRSSRGTSSGSSTLTISNVTLEDSGTYTCVA 62
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 33.5 bits (77), Expect = 0.041
Identities = 18/45 (40%), Positives = 29/45 (64%)
Query: 92 EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
E E++EEE +EK + ++ ++ EE + K E +VEEEEE+Q
Sbjct: 97 EREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQ 141
Score = 32.3 bits (74), Expect = 0.088
Identities = 29/97 (29%), Positives = 50/97 (51%), Gaps = 15/97 (15%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDE--DEEKKREKKEEEEEKQKDKKE------E 127
+ LE + EEE +EK ++ E+ ++ +E K K EEEEEKQ + E
Sbjct: 94 KELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQMEILEKALKSYL 153
Query: 128 EVEEEEEEQKNK-----KKEEEE--EEEKEKVEMYRN 157
++ +EE + + +KE EE ++E + +E YR+
Sbjct: 154 KIVKEENKSLQRLAKALQKESEERTQDETKMIEEYRD 190
>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA. This model
describes flagellar biosynthesis protein FlhA, one of a
large number of genes associated with the biosynthesis
of functional bacterial flagella. Homologs of many such
proteins, including FlhA, function in type III protein
secretion systems. A separate model describes InvA
(Salmonella enterica), LcrD (Yersinia enterocolitica),
HrcV (Xanthomonas), etc., all of which score below the
noise cutoff for this model [Cellular processes,
Chemotaxis and motility].
Length = 678
Score = 33.4 bits (77), Expect = 0.041
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEE 135
++R K+EEE + K +EE EEEEE
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEE 336
Score = 28.8 bits (65), Expect = 1.5
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
+++ K+EEE E + + + EEEEE +
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEESIND 340
Score = 28.4 bits (64), Expect = 2.1
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
+R K+EEE + K +E EEEE+
Sbjct: 309 LRRRSKQEEEAAAEAAKAQEEAAEEEEES 337
Score = 28.0 bits (63), Expect = 2.6
Identities = 10/30 (33%), Positives = 17/30 (56%)
Query: 124 KKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
++ + EEE + K +EE EEE+E +
Sbjct: 310 RRRSKQEEEAAAEAAKAQEEAAEEEEESIN 339
Score = 26.8 bits (60), Expect = 7.1
Identities = 8/27 (29%), Positives = 14/27 (51%)
Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVEM 154
++EEE K +EE E+E+ +
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEEESI 338
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 33.3 bits (77), Expect = 0.042
Identities = 13/46 (28%), Positives = 27/46 (58%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
A+ + + +EE KE + + +E++E+EE++ E++ EEE
Sbjct: 275 AAALADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 31.4 bits (72), Expect = 0.18
Identities = 18/37 (48%), Positives = 22/37 (59%)
Query: 64 LTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEE 100
L L + +SA E EEEE+E+EEEEE EEE
Sbjct: 284 LDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 31.0 bits (71), Expect = 0.21
Identities = 14/41 (34%), Positives = 27/41 (65%)
Query: 92 EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
+++ +EE KE + + +EEEEE++++++EE EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 30.6 bits (70), Expect = 0.36
Identities = 14/40 (35%), Positives = 22/40 (55%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
K EE +E + + EEEEE++ +++EEE EE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319
Score = 29.4 bits (67), Expect = 0.83
Identities = 11/38 (28%), Positives = 22/38 (57%)
Query: 113 KKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
K+ +E+ K+ + + E++ +++EEEEEEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPS 317
Score = 28.3 bits (64), Expect = 1.7
Identities = 14/41 (34%), Positives = 23/41 (56%)
Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
D++ E+ +E Q E EEEEEE++ +++ EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 27.5 bits (62), Expect = 3.0
Identities = 14/42 (33%), Positives = 22/42 (52%)
Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+ + E KE + + EE EEEEEE++ ++ EEE
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 27.5 bits (62), Expect = 3.3
Identities = 12/34 (35%), Positives = 21/34 (61%)
Query: 117 EEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
EE K+ + + EEE++ +++EEEEE +E
Sbjct: 286 EELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319
Score = 27.5 bits (62), Expect = 3.6
Identities = 15/44 (34%), Positives = 26/44 (59%), Gaps = 3/44 (6%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
++ +EE +E + + E++EEEEE +++EEE EEE
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEE---EEEEEEPSEEE 320
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 33.3 bits (76), Expect = 0.043
Identities = 17/77 (22%), Positives = 40/77 (51%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
S+ +E + +E + E +E ++EEE ++ E+ EK+ + + E + +++KE+ EE +
Sbjct: 57 SQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQ 116
Query: 135 EQKNKKKEEEEEEEKEK 151
+ + + E
Sbjct: 117 YLREYNLFDRNNLQLED 133
Score = 32.9 bits (75), Expect = 0.051
Identities = 21/83 (25%), Positives = 38/83 (45%)
Query: 67 ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
L E + E +E +KEEE +E E+ + ED++ E E +EEK++ + E
Sbjct: 54 KLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENE 113
Query: 127 EEVEEEEEEQKNKKKEEEEEEEK 149
E E ++ + E+ +
Sbjct: 114 ELQYLREYNLFDRNNLQLEDNLQ 136
Score = 31.0 bits (70), Expect = 0.22
Identities = 16/70 (22%), Positives = 41/70 (58%), Gaps = 1/70 (1%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE-EEEEQKNKKKEE 143
E+E++ ++ + E + + + + + +E +K++++ +E+EE E+E+ +
Sbjct: 42 EKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELV 101
Query: 144 EEEEEKEKVE 153
E +EEKE++E
Sbjct: 102 ELQEEKEQLE 111
Score = 31.0 bits (70), Expect = 0.26
Identities = 24/84 (28%), Positives = 40/84 (47%), Gaps = 5/84 (5%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDE---DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
+ E + E E + E DE +EE+ ++ EE E++ D E VE +EE K +
Sbjct: 53 SKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEE--KEQL 110
Query: 141 KEEEEEEEKEKVEMYRNPVPMYRN 164
+ EE + +E RN + + N
Sbjct: 111 ENEELQYLREYNLFDRNNLQLEDN 134
Score = 28.3 bits (63), Expect = 1.8
Identities = 18/82 (21%), Positives = 37/82 (45%), Gaps = 3/82 (3%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKD---EDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
L+ + EEE E EE E+E++ E +E+K + + EE + ++ + +
Sbjct: 72 LDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQL 131
Query: 135 EQKNKKKEEEEEEEKEKVEMYR 156
E + E + E +++ R
Sbjct: 132 EDNLQSLELQYEYSLNQLDKLR 153
Score = 27.9 bits (62), Expect = 2.7
Identities = 17/89 (19%), Positives = 36/89 (40%), Gaps = 8/89 (8%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E + E + E+E + K+ E + + E + E + EEE + ++ E+
Sbjct: 32 ERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEK 91
Query: 144 EE--------EEEKEKVEMYRNPVPMYRN 164
E+ E ++EK ++ + R
Sbjct: 92 EDDDLDGELVELQEEKEQLENEELQYLRE 120
Score = 27.9 bits (62), Expect = 2.9
Identities = 14/70 (20%), Positives = 38/70 (54%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E ++ K+EEE +E +E +++++ + E +E+++ + EE++ E + +
Sbjct: 71 ELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQ 130
Query: 145 EEEEKEKVEM 154
E+ + +E+
Sbjct: 131 LEDNLQSLEL 140
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 33.1 bits (76), Expect = 0.045
Identities = 17/71 (23%), Positives = 37/71 (52%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
++ E + + K K + E + E +EK+K+KK+EE ++++ E+ ++ E
Sbjct: 294 KRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIE 353
Query: 143 EEEEEEKEKVE 153
+ E + +K E
Sbjct: 354 KLEVQATDKEE 364
Score = 32.3 bits (74), Expect = 0.087
Identities = 24/87 (27%), Positives = 46/87 (52%), Gaps = 3/87 (3%)
Query: 59 RANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
+ ++ + Q + + E + K K + + E + EK E +EKK+EKK+EE+
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEK 340
Query: 119 EKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+K K+ E +EE E+ + + ++EE
Sbjct: 341 KK---KQIERLEERIEKLEVQATDKEE 364
Score = 32.3 bits (74), Expect = 0.092
Identities = 20/70 (28%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ + K ++ E D + K K E + EK + +E+ +E+++E+K KK+ E
Sbjct: 288 ALKYQLKRLKKMILLFEMISDLKRKLKS-KFERDNEKLDAEVKEKKKEKKKEEKKKKQIE 346
Query: 144 EEEEEKEKVE 153
EE EK+E
Sbjct: 347 RLEERIEKLE 356
Score = 31.6 bits (72), Expect = 0.17
Identities = 15/73 (20%), Positives = 37/73 (50%), Gaps = 1/73 (1%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
+ + K ++ E + + K E+ E+ + + +K++E ++EE+++K + +
Sbjct: 288 ALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK-QIE 346
Query: 142 EEEEEEEKEKVEM 154
EE EK +V+
Sbjct: 347 RLEERIEKLEVQA 359
Score = 29.6 bits (67), Expect = 0.71
Identities = 16/65 (24%), Positives = 39/65 (60%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
E + + + + K + ++E+ E KE+++EK+K++K+++ E EE+ K + + +
Sbjct: 302 LFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATD 361
Query: 147 EEKEK 151
+E+ K
Sbjct: 362 KEENK 366
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 33.5 bits (77), Expect = 0.045
Identities = 13/56 (23%), Positives = 24/56 (42%)
Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
KE + ++E+K K+ KEE+ + + E +KEK+E +
Sbjct: 214 KEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIK 269
Score = 30.5 bits (69), Expect = 0.40
Identities = 16/68 (23%), Positives = 32/68 (47%), Gaps = 1/68 (1%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
K KE +EE+ + + + KEE++ + E++ E +K+K E+ ++
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEY-DKEKLEKIKD 270
Query: 148 EKEKVEMY 155
KEK +
Sbjct: 271 LKEKAKKL 278
Score = 27.8 bits (62), Expect = 3.7
Identities = 15/73 (20%), Positives = 33/73 (45%), Gaps = 3/73 (4%)
Query: 66 VILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK---KEEEEEKQK 122
+IL I + E N E + + +E++K D + E E +K+K
Sbjct: 205 IILDTLIKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEK 264
Query: 123 DKKEEEVEEEEEE 135
+K ++++E+ ++
Sbjct: 265 LEKIKDLKEKAKK 277
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 33.4 bits (76), Expect = 0.045
Identities = 17/82 (20%), Positives = 41/82 (50%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
IS S N+ E E+K E + E E+++D +E ++ + +EE +K KK ++++
Sbjct: 287 ISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKG 346
Query: 132 EEEEQKNKKKEEEEEEEKEKVE 153
++ + ++ + ++
Sbjct: 347 KKNGLDKDDSDSGDDSDDSDID 368
Score = 32.2 bits (73), Expect = 0.092
Identities = 17/76 (22%), Positives = 41/76 (53%), Gaps = 2/76 (2%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK--EEEEEKQKDKKEEEVEEEEEE 135
+ +E++ ++ + E EE+ + ++ ++RE K E K + +++E+ EE EEE
Sbjct: 267 DDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEE 326
Query: 136 QKNKKKEEEEEEEKEK 151
+ ++ ++ +K K
Sbjct: 327 KNEEEGGLSKKGKKLK 342
Score = 31.1 bits (70), Expect = 0.26
Identities = 15/69 (21%), Positives = 37/69 (53%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E EE++ EEE ++ K+ + + KK +++ + D + +++ E+ K++
Sbjct: 322 ESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQ 381
Query: 144 EEEEEKEKV 152
+E +++E V
Sbjct: 382 KEPKKEEPV 390
Score = 28.4 bits (63), Expect = 2.0
Identities = 19/80 (23%), Positives = 43/80 (53%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
E+ + E +EEK K+++++ + +K+ D+D++ KR ++ +E D ++E E
Sbjct: 224 DDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGRE 283
Query: 132 EEEEQKNKKKEEEEEEEKEK 151
E+ + + EE ++K
Sbjct: 284 EDYISDSSASGNDPEEREDK 303
Score = 27.6 bits (61), Expect = 3.6
Identities = 13/70 (18%), Positives = 34/70 (48%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+++ E + ++ ++E +E + EE E + + E E+++ +++ EE
Sbjct: 266 DDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEE 325
Query: 144 EEEEEKEKVE 153
E+ EE+ +
Sbjct: 326 EKNEEEGGLS 335
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 32.7 bits (75), Expect = 0.046
Identities = 18/56 (32%), Positives = 31/56 (55%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
+ E+ E+E EE E EE E + +EE E+ E E ++ K E+E + +E + +
Sbjct: 4 KNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEER 59
Score = 28.8 bits (65), Expect = 1.0
Identities = 16/60 (26%), Positives = 36/60 (60%)
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
EE+ E+ EE+ ++ + E+ E + EEE +++ + E ++EE+ + + + +E EE+
Sbjct: 1 MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERY 60
Score = 26.1 bits (58), Expect = 8.9
Identities = 21/77 (27%), Positives = 41/77 (53%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
E +E E EE +++ EEE EEE + + DEE+ + + E + + +++ ++ +
Sbjct: 7 QVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERYLRLQAD 66
Query: 133 EEEQKNKKKEEEEEEEK 149
E K + ++E E EK
Sbjct: 67 FENYKRRIQKENEAAEK 83
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 32.7 bits (74), Expect = 0.052
Identities = 18/79 (22%), Positives = 39/79 (49%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
I+ T+ N+ + E+ EE + + + + ++E+ K +E ++E
Sbjct: 153 IAEHPTITENQSFQPWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQE 212
Query: 132 EEEEQKNKKKEEEEEEEKE 150
+ E+K ++E+EEEE +E
Sbjct: 213 HKTEEKQPQEEQEEEEVEE 231
Score = 30.8 bits (69), Expect = 0.23
Identities = 16/73 (21%), Positives = 37/73 (50%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E L NN E + + + + +++ + ++ +E K ++K+ +E +EEEE
Sbjct: 170 ERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEV 229
Query: 136 QKNKKKEEEEEEE 148
++ K+EE + +
Sbjct: 230 EEEAKQEEGQGTD 242
Score = 26.5 bits (58), Expect = 6.8
Identities = 15/73 (20%), Positives = 42/73 (57%)
Query: 63 RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
RL + + + +S +L + + K K+E+ + +E ++ E+K+ ++E+EEE+ +
Sbjct: 171 RLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVE 230
Query: 123 DKKEEEVEEEEEE 135
++ ++E + ++
Sbjct: 231 EEAKQEEGQGTDD 243
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 33.1 bits (76), Expect = 0.052
Identities = 20/70 (28%), Positives = 27/70 (38%), Gaps = 4/70 (5%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV----EEEEEEQKNKKKEE 143
K E EE EK E E K E+ + EK + EE E E + + E
Sbjct: 74 KGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLE 133
Query: 144 EEEEEKEKVE 153
+ EE E +
Sbjct: 134 DLLEELEPLA 143
Score = 30.8 bits (70), Expect = 0.31
Identities = 15/68 (22%), Positives = 26/68 (38%), Gaps = 2/68 (2%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
E E E + E + E+ + ++E E + + E E+ EE E E E
Sbjct: 216 ELYEVPEFDGGPSELISELEEVIAEIQDELESLRSELEALAEKIAEELLA--VREILEIE 273
Query: 149 KEKVEMYR 156
K ++
Sbjct: 274 KALGDVLS 281
Score = 29.3 bits (66), Expect = 1.0
Identities = 13/50 (26%), Positives = 19/50 (38%), Gaps = 3/50 (6%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKK---REKKEEEEEKQ 121
SE + E E + E E E E ++ E+ RE E E+
Sbjct: 228 SELISELEEVIAEIQDELESLRSELEALAEKIAEELLAVREILEIEKALG 277
Score = 27.3 bits (61), Expect = 4.5
Identities = 20/82 (24%), Positives = 32/82 (39%), Gaps = 14/82 (17%)
Query: 56 KPIRANARLTVILPQAISASETLEN--NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK 113
+ ++ ILP+ + E LE EE K E+ E EE R+
Sbjct: 71 RSVKGLEGRLFILPEEV---EKLEAELKSLEEVIKPAEKFSSEVEEL---------TRKL 118
Query: 114 KEEEEEKQKDKKEEEVEEEEEE 135
+E E ++ ++ E EE E
Sbjct: 119 EERLSELDEELEDLEDLLEELE 140
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 32.8 bits (75), Expect = 0.053
Identities = 16/68 (23%), Positives = 39/68 (57%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
+ A ++ +++ + + + ++++ E+E E E+ K+E+ EE+E ++++K EE
Sbjct: 114 ARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEE 173
Query: 129 VEEEEEEQ 136
+ EEE
Sbjct: 174 EKAREEEI 181
Score = 30.5 bits (69), Expect = 0.30
Identities = 20/75 (26%), Positives = 35/75 (46%), Gaps = 5/75 (6%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ + + + ++ D+D E + E EK K ++ EE E EEEE+
Sbjct: 117 SDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKA-----A 171
Query: 144 EEEEEKEKVEMYRNP 158
EEE+ +E+ + NP
Sbjct: 172 EEEKAREEEILTGNP 186
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 32.9 bits (75), Expect = 0.054
Identities = 25/87 (28%), Positives = 45/87 (51%), Gaps = 7/87 (8%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK-------DKKE 126
E E E E ++E EEE + + +++ + E+E+E + D +E
Sbjct: 226 TKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEE 285
Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEKVE 153
E E+E+E++K KK E+E+E E++E
Sbjct: 286 ETEEKEKEKRKRLKKMMEDEDEDEEME 312
Score = 32.5 bits (74), Expect = 0.085
Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 8/87 (9%)
Query: 76 ETLENNEWEEEEKEKE--------EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
E+ E +E+E E E + EEE EE++++ KK + E+E+E+ + E
Sbjct: 258 ESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPES 317
Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVEM 154
VEEEE E+ ++EEEKE+V +
Sbjct: 318 PVEEEESEEPEPPPLPKKEEEKEEVTV 344
Score = 29.8 bits (67), Expect = 0.75
Identities = 20/73 (27%), Positives = 38/73 (52%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
++KE E +EE E++ + + E E D+ E+E E + +++ +EE EE+
Sbjct: 231 EKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEK 290
Query: 148 EKEKVEMYRNPVP 160
EKEK + + +
Sbjct: 291 EKEKRKRLKKMME 303
Score = 29.8 bits (67), Expect = 0.75
Identities = 17/73 (23%), Positives = 33/73 (45%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
SE E E E +E+E E + +R EEE E+++ +K + +++ E
Sbjct: 244 SEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMME 303
Query: 135 EQKNKKKEEEEEE 147
++ ++ E E
Sbjct: 304 DEDEDEEMEIVPE 316
Score = 29.0 bits (65), Expect = 1.3
Identities = 20/74 (27%), Positives = 39/74 (52%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E EE+EKEK + ++ E++++DE+ E E EEEE ++ + ++EEE+++
Sbjct: 285 EETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTV 344
Query: 141 KEEEEEEEKEKVEM 154
+ + M
Sbjct: 345 SPDGGRRRGRRRVM 358
Score = 29.0 bits (65), Expect = 1.3
Identities = 24/82 (29%), Positives = 37/82 (45%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
S ET E + + + +K E+E+E+EE E + + E+ EE E KKEEE EE
Sbjct: 283 SEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEV 342
Query: 133 EEEQKNKKKEEEEEEEKEKVEM 154
++ K+K
Sbjct: 343 TVSPDGGRRRGRRRVMKKKTFK 364
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 32.9 bits (75), Expect = 0.055
Identities = 29/81 (35%), Positives = 48/81 (59%), Gaps = 4/81 (4%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
LE E + EE EK +E + E+E +++ EK KEEE ++ EE ++E E++
Sbjct: 361 PLLERLEKDVEEGEKTIVKEARQIEEELEKEVEKLG--KEEESLFKRVALEEGLKELEQD 418
Query: 136 QKNKKKE--EEEEEEKEKVEM 154
++N KE +EE+E EK++M
Sbjct: 419 EENFLKELSKEEKELLEKLKM 439
Score = 32.9 bits (75), Expect = 0.058
Identities = 24/70 (34%), Positives = 38/70 (54%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE EK +E + EEE EK+ ++ K E+ + ++ +E ++EE K KEE
Sbjct: 371 EEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKELSKEE 430
Query: 144 EEEEEKEKVE 153
+E EK K+E
Sbjct: 431 KELLEKLKME 440
Score = 32.9 bits (75), Expect = 0.061
Identities = 28/82 (34%), Positives = 44/82 (53%), Gaps = 4/82 (4%)
Query: 76 ETLENNEWEEEE---KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
E LE + E E+ KE + EEE E+E EK EE+ K+ EE K+ +++E
Sbjct: 364 ERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFL 423
Query: 133 EEEQKNKKKEEEE-EEEKEKVE 153
+E K +K+ E+ + E +VE
Sbjct: 424 KELSKEEKELLEKLKMEASEVE 445
Score = 29.0 bits (65), Expect = 1.1
Identities = 20/63 (31%), Positives = 34/63 (53%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
EE EKE E+ +EEE ++ EE +E +++EE K+ +EE E E+ + +
Sbjct: 385 EEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKELSKEEKELLEKLKMEASEV 444
Query: 143 EEE 145
E+
Sbjct: 445 EKL 447
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 33.1 bits (75), Expect = 0.058
Identities = 21/55 (38%), Positives = 35/55 (63%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
EKE +E+EE+ ++ ++D +E K E KE +E K+K K+ + +E E +NK K
Sbjct: 212 EKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHK 266
Score = 30.8 bits (69), Expect = 0.38
Identities = 22/65 (33%), Positives = 38/65 (58%), Gaps = 4/65 (6%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
E+ E++ DEDEE ++ E+ EE K EEV+E +E +K K K+ +E ++ +V+
Sbjct: 208 EKTTEKEVTDEDEEDTKKADEDGEE----PKVEEVKEGDEGKKKKTKKVKEVTKEYEVQN 263
Query: 155 YRNPV 159
P+
Sbjct: 264 KHKPL 268
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 32.7 bits (74), Expect = 0.059
Identities = 25/72 (34%), Positives = 42/72 (58%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + E+E +EEEE + E E D+ E++K +KK+ E+ D+ E E EEEE
Sbjct: 194 EKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSE 253
Query: 139 KKKEEEEEEEKE 150
+ +E+E+E+ +
Sbjct: 254 SESDEDEDEDNK 265
Score = 27.7 bits (61), Expect = 2.9
Identities = 16/65 (24%), Positives = 34/65 (52%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ E + EEEE E E ++DEDE+ K + ++ + + K ++ + E E+++ +K
Sbjct: 239 QSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQERENEKIP 298
Query: 144 EEEEE 148
+
Sbjct: 299 AVQHS 303
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 33.2 bits (76), Expect = 0.059
Identities = 14/52 (26%), Positives = 25/52 (48%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
EE +E EEE+++ K +K ++K K+ K +++EE E
Sbjct: 844 EEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEERE 895
Score = 26.6 bits (59), Expect = 9.4
Identities = 12/52 (23%), Positives = 22/52 (42%)
Query: 94 EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
EE EE E+++D+ + K + + +E + K+EE E
Sbjct: 844 EEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEERE 895
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 33.0 bits (76), Expect = 0.061
Identities = 22/63 (34%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E EEE +E E EEE +E E EK+ E +K+ +++ ++ K K+EE EE + ++
Sbjct: 65 EIEEELRE-EYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
Query: 142 EEE 144
E
Sbjct: 124 PEP 126
Score = 32.6 bits (75), Expect = 0.080
Identities = 21/67 (31%), Positives = 37/67 (55%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
+E EEE EE EE+ K+ + EK+ + E++ ++K KK + ++EE + E E EE
Sbjct: 64 EEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
Query: 148 EKEKVEM 154
+ +
Sbjct: 124 PEPPLRP 130
Score = 29.9 bits (68), Expect = 0.65
Identities = 18/56 (32%), Positives = 32/56 (57%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
E EE +E+ +E E E+E E ++ +K+ KK ++ K ++ EE+ E E E+
Sbjct: 68 EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 32.8 bits (75), Expect = 0.061
Identities = 19/67 (28%), Positives = 39/67 (58%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E E E+EEE+E+++++ DEDE++ EE + + V+E++EE ++ ++
Sbjct: 136 ESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKK 195
Query: 145 EEEEKEK 151
+ E+ K
Sbjct: 196 DVLEEPK 202
Score = 32.1 bits (73), Expect = 0.14
Identities = 19/62 (30%), Positives = 38/62 (61%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+EEE+E E+++++ +++++DED E+ E D+ +E+ EE+E+ +KK
Sbjct: 139 DEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKKDVL 198
Query: 144 EE 145
EE
Sbjct: 199 EE 200
Score = 28.2 bits (63), Expect = 2.2
Identities = 22/83 (26%), Positives = 41/83 (49%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
N ++E++ E+ E E+EE+E+DED++ E+E+E EE E
Sbjct: 121 NKQFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRV 180
Query: 140 KKEEEEEEEKEKVEMYRNPVPMY 162
K+++EE+E+ + + P
Sbjct: 181 KEDDEEDEDADLSKKDVLEEPKM 203
Score = 27.1 bits (60), Expect = 5.9
Identities = 19/73 (26%), Positives = 42/73 (57%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
NN+ E++ + E+ E E+E+E++++++ +E+E++ EEV E ++
Sbjct: 120 NNKQFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDR 179
Query: 140 KKEEEEEEEKEKV 152
KE++EE+E +
Sbjct: 180 VKEDDEEDEDADL 192
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 32.9 bits (76), Expect = 0.061
Identities = 20/65 (30%), Positives = 41/65 (63%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E L+N + +K +E +EE++ EE+EKDE K + +E+++++++EE ++E EE
Sbjct: 498 EELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKEEKDKETEE 557
Query: 136 QKNKK 140
+ +
Sbjct: 558 DEPEG 562
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 33.1 bits (75), Expect = 0.061
Identities = 23/59 (38%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDED-EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
EEE+E E EE+EE EKE + +EK E++EEE + KD + E + ++ K+ K
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNGK 198
Score = 31.9 bits (72), Expect = 0.14
Identities = 19/56 (33%), Positives = 33/56 (58%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
+E+EE E EE+EE + E + EK EEEEE++ ++ ++ E + +K K+
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKD 195
Score = 31.2 bits (70), Expect = 0.24
Identities = 27/100 (27%), Positives = 43/100 (43%), Gaps = 3/100 (3%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK---KREKKEEEEEKQKDKK 125
PQ S + W + EE E D + K EEEEE + ++
Sbjct: 92 PQRRSCDVRSRSTLWSLFHDDDEENLPSSIAPPEIDPEPRKPIVPDLVLEEEEEVEMEED 151
Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYRNP 165
EE E+E + ++K EEEEEEE + ++ + + + P
Sbjct: 152 EEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKP 191
Score = 26.9 bits (59), Expect = 5.5
Identities = 21/63 (33%), Positives = 32/63 (50%), Gaps = 7/63 (11%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
EEEEE E +EDEE E++ +E+ EEEEEE+ K+ + E + K
Sbjct: 140 LEEEEEVEMEEDEEY-------YEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPS 192
Query: 155 YRN 157
++
Sbjct: 193 VKD 195
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 33.1 bits (75), Expect = 0.062
Identities = 15/84 (17%), Positives = 40/84 (47%), Gaps = 3/84 (3%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE---EEEEKQKDKKEEEV 129
A + E + + ++E +++ K+ D E+ +E ++ E ++ EE
Sbjct: 389 QAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEE 448
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
E + E ++ +KE+ E ++ +++
Sbjct: 449 ERQLREFRDMEKEDREFPDEAELQ 472
Score = 31.2 bits (70), Expect = 0.27
Identities = 21/78 (26%), Positives = 42/78 (53%), Gaps = 4/78 (5%)
Query: 77 TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE----EEEEKQKDKKEEEVEEE 132
+ E+EE+E + +EE D+D+ K+ + +E EE ++ EE+ E
Sbjct: 387 FYQAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPE 446
Query: 133 EEEQKNKKKEEEEEEEKE 150
EEE++ ++ + E+E++E
Sbjct: 447 EEERQLREFRDMEKEDRE 464
Score = 28.1 bits (62), Expect = 2.5
Identities = 17/77 (22%), Positives = 36/77 (46%), Gaps = 6/77 (7%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE------EEQ 136
++ + E EEEE+ + ++E + KE + E+ +E +++ E E+
Sbjct: 388 YQAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEE 447
Query: 137 KNKKKEEEEEEEKEKVE 153
+ ++ E + EKE E
Sbjct: 448 EERQLREFRDMEKEDRE 464
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 33.0 bits (75), Expect = 0.064
Identities = 19/58 (32%), Positives = 28/58 (48%), Gaps = 1/58 (1%)
Query: 76 ETLENNEWEEEEKEKEE-EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
E +E E +E + EE +EE E+E E DE ++ + E E + D E E E
Sbjct: 330 EVVEVVEKQEGDVVTEESTDEESEDEVEIDESVIEEVAEMELLEVQVDDLAERLAETE 387
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 32.9 bits (76), Expect = 0.064
Identities = 14/45 (31%), Positives = 25/45 (55%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
NE +E KE+ + +++E + +KK+EK+E + K K K
Sbjct: 46 NEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKP 90
Score = 29.4 bits (67), Expect = 0.73
Identities = 12/40 (30%), Positives = 23/40 (57%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
E+ +E +EE + D+ E K K ++++EKQ+ K +
Sbjct: 47 EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKA 86
Score = 26.7 bits (60), Expect = 6.9
Identities = 11/45 (24%), Positives = 26/45 (57%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
E+ +E +E+ K +KK K + +K+K+K+E + + + + +
Sbjct: 47 EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 32.7 bits (74), Expect = 0.066
Identities = 16/59 (27%), Positives = 34/59 (57%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEE +EE+++EE+++ + DEDE+ + +E++E + D ++ + N+ E
Sbjct: 46 EEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNE 104
Score = 30.3 bits (68), Expect = 0.46
Identities = 14/66 (21%), Positives = 35/66 (53%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
++ +E EEE E+E D++E+ + E+E++ D E++ +E++++ ++
Sbjct: 39 RKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDG 98
Query: 148 EKEKVE 153
+ E
Sbjct: 99 NETDNE 104
Score = 27.3 bits (60), Expect = 5.0
Identities = 15/74 (20%), Positives = 42/74 (56%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E E E++++E ++++++E+E+ + D+D+E ++ +++ D ++ E + E
Sbjct: 48 EAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGF 107
Query: 139 KKKEEEEEEEKEKV 152
++E ++ E V
Sbjct: 108 ADSDDESDDGSEYV 121
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 32.5 bits (74), Expect = 0.069
Identities = 17/70 (24%), Positives = 39/70 (55%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+++ K EE E+E + +D+ K + ++ ++ +++ + EE++ + +EE
Sbjct: 212 QDKNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEE 271
Query: 144 EEEEEKEKVE 153
EE EE+E+ E
Sbjct: 272 EETEEEEEEE 281
Score = 32.1 bits (73), Expect = 0.11
Identities = 18/71 (25%), Positives = 40/71 (56%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
++K K EE E+E + ++ + + + +K D+++++ + + E+K K++
Sbjct: 209 DSSQDKNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKED 268
Query: 143 EEEEEEKEKVE 153
EEEEE +E+ E
Sbjct: 269 EEEEETEEEEE 279
Score = 30.2 bits (68), Expect = 0.41
Identities = 14/58 (24%), Positives = 37/58 (63%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
A S + + ++ + ++ + EE+++++ + E+ E +E + E++ EEEE+++D+ E
Sbjct: 228 DAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285
Score = 28.6 bits (64), Expect = 1.4
Identities = 12/60 (20%), Positives = 41/60 (68%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ +A+ + + + + +K ++EE+++++ + E++E K +++EEE E+++++++E+
Sbjct: 226 SSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 31.9 bits (73), Expect = 0.076
Identities = 16/36 (44%), Positives = 24/36 (66%), Gaps = 3/36 (8%)
Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
EK+E E +KR +KE E+ KKEEE+ E + +Q+
Sbjct: 82 EKEERELRKRAEKEALEQA---KKEEELREAKRQQR 114
Score = 26.8 bits (60), Expect = 3.9
Identities = 14/34 (41%), Positives = 22/34 (64%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK 114
NE EE E K E+E E+ K+++E E KR+++
Sbjct: 81 NEKEERELRKRAEKEALEQAKKEEELREAKRQQR 114
Score = 26.1 bits (58), Expect = 6.4
Identities = 13/37 (35%), Positives = 24/37 (64%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
K+ E+EE E + EK+ E+ K+E++ E ++Q+ K
Sbjct: 79 KKNEKEERELRKRAEKEALEQAKKEEELREAKRQQRK 115
Score = 26.1 bits (58), Expect = 6.7
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 9/41 (21%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
KK EK+E E K+ E+E + KKEEE E K
Sbjct: 79 KKNEKEERELRKR---------AEKEALEQAKKEEELREAK 110
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 32.5 bits (74), Expect = 0.086
Identities = 20/78 (25%), Positives = 37/78 (47%), Gaps = 6/78 (7%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE---- 130
E N + EEE E+E E + D+DE ++ E+EE+ ++ + + E
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60
Query: 131 --EEEEEQKNKKKEEEEE 146
+E K +KK+E ++
Sbjct: 61 EKARLKELKKQKKQEIQK 78
Score = 32.5 bits (74), Expect = 0.10
Identities = 21/71 (29%), Positives = 33/71 (46%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEE+ + EEE+EEE E +E + +D+ EE+ EE E K + +
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60
Query: 144 EEEEEKEKVEM 154
E+ KE +
Sbjct: 61 EKARLKELKKQ 71
Score = 30.9 bits (70), Expect = 0.34
Identities = 16/89 (17%), Positives = 43/89 (48%), Gaps = 8/89 (8%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD--------KK 125
E + EEE++E+ E D++ + E ++EE++++ + K+
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60
Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
E+ +E ++QK ++ ++ E++ ++
Sbjct: 61 EKARLKELKKQKKQEIQKILEQQNAAIDA 89
Score = 30.5 bits (69), Expect = 0.42
Identities = 15/72 (20%), Positives = 41/72 (56%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
+ + A ++ +++E E+E+E+EE+++E + + + K+E+ K+ K++++
Sbjct: 15 EEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLKELKKQKKQ 74
Query: 129 VEEEEEEQKNKK 140
++ EQ+N
Sbjct: 75 EIQKILEQQNAA 86
Score = 29.0 bits (65), Expect = 1.4
Identities = 11/41 (26%), Positives = 22/41 (53%)
Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
+E + + QK + ++ ++ E E+ EE+EEKE+
Sbjct: 778 KEVRYLMQAHQKGQLKDTIDVAEPEEPGDPLTAEEQEEKEQ 818
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 32.3 bits (74), Expect = 0.087
Identities = 24/53 (45%), Positives = 39/53 (73%), Gaps = 1/53 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE-EEEEKQKDKKEEEVEEEEEE 135
EE+ +E+EEEE E EE+E++EDEE+K+EK ++ K + KE+++E+ EE
Sbjct: 17 EEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIEDLLEE 69
Score = 28.4 bits (64), Expect = 2.0
Identities = 21/45 (46%), Positives = 32/45 (71%)
Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
+K +EK + EK ++K EEE EEE E + +++EE+EEE+KEK
Sbjct: 3 DKLKEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKP 47
Score = 28.0 bits (63), Expect = 2.8
Identities = 16/31 (51%), Positives = 26/31 (83%)
Query: 123 DKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+K EE++EEEEEE+ + +EEEEEE++E+ +
Sbjct: 14 EKVEEKIEEEEEEEAPEAEEEEEEEDEEEKK 44
Score = 27.6 bits (62), Expect = 3.3
Identities = 23/67 (34%), Positives = 46/67 (68%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
KEK + E+ EEK ++E+EE+ E +EEEEE+ +++K+E+ ++ + + KE++ E+
Sbjct: 6 KEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65
Query: 148 EKEKVEM 154
E++E+
Sbjct: 66 LLEELEL 72
Score = 26.1 bits (58), Expect = 9.0
Identities = 20/48 (41%), Positives = 28/48 (58%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEK 120
E E E E E+E+EEE+EEE++EK D+ K E KE++ E
Sbjct: 18 EKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 32.3 bits (74), Expect = 0.087
Identities = 18/34 (52%), Positives = 24/34 (70%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
E EEEEEEEEE E++E+EE++ E+ EEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 30.8 bits (70), Expect = 0.35
Identities = 15/34 (44%), Positives = 23/34 (67%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
+ +EEEEEEEE +E++E+EE++ EEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 30.8 bits (70), Expect = 0.36
Identities = 18/33 (54%), Positives = 21/33 (63%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDE 107
S E EEEE+ EEEEEEEEEE+ E+E
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 30.4 bits (69), Expect = 0.47
Identities = 18/27 (66%), Positives = 22/27 (81%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEE 108
E EEEE+E EEEEEEEEE+E+ +EE
Sbjct: 307 EEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 28.8 bits (65), Expect = 1.4
Identities = 18/33 (54%), Positives = 21/33 (63%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDE 105
S E E E EEE E+EEEEEEEEE ++E
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 28.5 bits (64), Expect = 1.7
Identities = 16/34 (47%), Positives = 24/34 (70%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
+ EEEEEEEEE ++E+EE++ E++ EEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 28.1 bits (63), Expect = 2.7
Identities = 20/49 (40%), Positives = 28/49 (57%)
Query: 63 RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
RL + + + E EEEE+E+ EEEEEEEE+E+ EE+ R
Sbjct: 287 RLPLRRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEEVR 335
Score = 26.9 bits (60), Expect = 6.1
Identities = 19/43 (44%), Positives = 25/43 (58%), Gaps = 8/43 (18%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
E EEEEEEEE+ +E+ EEEEE+++ EEEV
Sbjct: 300 PSPPEPEEEEEEEEEVPEEE--------EEEEEEEERTFEEEV 334
Score = 26.5 bits (59), Expect = 7.8
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKE 150
E EEEEEE++ ++EEEEEEE+E
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEE 327
Score = 26.5 bits (59), Expect = 9.1
Identities = 15/25 (60%), Positives = 19/25 (76%)
Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKE 150
EE EEEEEE +++EEEEEEE+
Sbjct: 305 PEEEEEEEEEVPEEEEEEEEEEERT 329
Score = 26.5 bits (59), Expect = 9.2
Identities = 17/40 (42%), Positives = 25/40 (62%), Gaps = 6/40 (15%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
++R E ++++++EEEV EEEEE EEEEEE
Sbjct: 295 RRRPPPSPPEPEEEEEEEEEVPEEEEE------EEEEEER 328
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 32.1 bits (73), Expect = 0.089
Identities = 20/73 (27%), Positives = 44/73 (60%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E L + +E+ +E + + +E+++ +E E++ E ++E++ +K +E+E ++E EE
Sbjct: 132 EILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEE 191
Query: 136 QKNKKKEEEEEEE 148
K + +E EE E
Sbjct: 192 LKEELEELLEELE 204
Score = 29.8 bits (67), Expect = 0.47
Identities = 20/64 (31%), Positives = 41/64 (64%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
+E+ EE + + +EK++ +E +K ++ E+E+ +K EE E+++E ++ K++ EE
Sbjct: 141 GQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELL 200
Query: 147 EEKE 150
EE E
Sbjct: 201 EELE 204
Score = 28.7 bits (64), Expect = 1.2
Identities = 18/60 (30%), Positives = 39/60 (65%)
Query: 94 EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+E EE + E+K+R ++ E+E ++ + +++ +E+ EE++ KK+ EE +EE E++
Sbjct: 141 GQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELL 200
Score = 27.1 bits (60), Expect = 4.3
Identities = 22/70 (31%), Positives = 41/70 (58%), Gaps = 1/70 (1%)
Query: 68 LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
L + E +E+KE+ EE E+E EE E ++D +K ++ +E++K+ ++ +E
Sbjct: 136 LLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEE-KEKKKELEELKE 194
Query: 128 EVEEEEEEQK 137
E+EE EE +
Sbjct: 195 ELEELLEELE 204
>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
Length = 869
Score = 32.7 bits (75), Expect = 0.089
Identities = 12/47 (25%), Positives = 23/47 (48%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
+ ++ E +E EEE EE +E D ++ K + ++E+ Q
Sbjct: 819 TYKVKTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQ 865
Score = 28.0 bits (63), Expect = 2.7
Identities = 13/51 (25%), Positives = 24/51 (47%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
++ + NE E E+ EEE EE E+ ED + + + +E+ +
Sbjct: 817 LTTYKVKTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQLG 867
Score = 26.9 bits (60), Expect = 6.5
Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
Query: 56 KPIRANA-RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKK 110
K I+A RLT + I+ E L E EEE E+ EE + E+ + E D D+E
Sbjct: 808 KGIKAKGNRLTTYKVKTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDT 863
Score = 26.5 bits (59), Expect = 8.6
Identities = 11/40 (27%), Positives = 19/40 (47%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD 123
E E EE EEE E+ ++ D E + + + ++E
Sbjct: 826 NELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQ 865
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 32.2 bits (73), Expect = 0.093
Identities = 22/79 (27%), Positives = 40/79 (50%), Gaps = 2/79 (2%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDE--DEEKKREKKEEEEEKQKDKKEEEVEEE 132
+ + E EE + E E E ++ E D E+ +RE +E + +K + E +E
Sbjct: 207 TSYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDES 266
Query: 133 EEEQKNKKKEEEEEEEKEK 151
E+E + + EE+E E+K +
Sbjct: 267 EDEYEEEAGEEQENEDKGE 285
Score = 32.2 bits (73), Expect = 0.10
Identities = 23/85 (27%), Positives = 39/85 (45%)
Query: 67 ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
+ T N+ E EE + E E E K+ + D E K + + E E + DK
Sbjct: 197 SYLNELDDEVTSYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKAN 256
Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEK 151
E E+E +++ +EE EE++ +
Sbjct: 257 TNETEIEDESEDEYEEEAGEEQENE 281
Score = 29.1 bits (65), Expect = 0.95
Identities = 19/84 (22%), Positives = 37/84 (44%), Gaps = 1/84 (1%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ T NE ++E + E EE + E + + + E + E K+K E
Sbjct: 189 DELDEEVTSYLNELDDEVTSYFNDGENEENDDELEAEVISYLKDGENDNEV-KEKIRREY 247
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
E + ++ N + E E+E +++ E
Sbjct: 248 REWKGDKANTNETEIEDESEDEYE 271
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 32.4 bits (73), Expect = 0.094
Identities = 27/88 (30%), Positives = 39/88 (44%), Gaps = 4/88 (4%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ I E + E+EE + E EE E+EE E E E ++ +K+EE+
Sbjct: 594 ELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARKKEEL 653
Query: 130 ----EEEEEEQKNKKKEEEEEEEKEKVE 153
E EE KK + EEK K+E
Sbjct: 654 RGNFELEERGDPEKKDVDWYTEEKRKIE 681
Score = 29.7 bits (66), Expect = 0.74
Identities = 23/83 (27%), Positives = 37/83 (44%), Gaps = 4/83 (4%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E EE E + E E EE ++K E+ R E EE +KK+ + EE+ + ++
Sbjct: 628 AENEESADEVDYETEREENARKK----EELRGNFELEERGDPEKKDVDWYTEEKRKIEEQ 683
Query: 141 KEEEEEEEKEKVEMYRNPVPMYR 163
+ E + V R + YR
Sbjct: 684 LKINRSEFETMVPESRVVIEGYR 706
Score = 29.3 bits (65), Expect = 0.99
Identities = 18/72 (25%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E +EK E ++ +E+ KD++E+++ E + +E++ KKE +E E+ + +
Sbjct: 1009 PECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKE---KEREQRIRKTIHDNY 1065
Query: 145 EEEEKEKVEMYR 156
+E K++++ R
Sbjct: 1066 KEMAKKRLKKKR 1077
Score = 29.3 bits (65), Expect = 1.1
Identities = 19/83 (22%), Positives = 38/83 (45%), Gaps = 4/83 (4%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK----KREKKEEEEEKQKDKKEEEVEEEEE 134
E+ E EE + E EE ++E+ + E++E +K++ + E+EE +
Sbjct: 605 EDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARKKEELRGNFELEERGD 664
Query: 135 EQKNKKKEEEEEEEKEKVEMYRN 157
+K EE+ K + ++ N
Sbjct: 665 PEKKDVDWYTEEKRKIEEQLKIN 687
Score = 27.4 bits (60), Expect = 4.4
Identities = 19/65 (29%), Positives = 28/65 (43%)
Query: 92 EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
E EEE ++ EK E+ + E+ + E ++ + E EE E E EE K
Sbjct: 590 EGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYETEREENARK 649
Query: 152 VEMYR 156
E R
Sbjct: 650 KEELR 654
Score = 27.0 bits (59), Expect = 5.8
Identities = 20/79 (25%), Positives = 42/79 (53%), Gaps = 4/79 (5%)
Query: 63 RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
R+ + +P +++ +E K++EE+E E ++ K E++ KKE+E E++
Sbjct: 1002 RIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAK----EEEIGKKEKEREQRI 1057
Query: 123 DKKEEEVEEEEEEQKNKKK 141
K + +E +++ KKK
Sbjct: 1058 RKTIHDNYKEMAKKRLKKK 1076
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 32.2 bits (74), Expect = 0.095
Identities = 28/123 (22%), Positives = 50/123 (40%), Gaps = 31/123 (25%)
Query: 68 LPQAISASETLENNEWEEEEKEKEEEE--EEEEEEKEKDEDEEKK-REKKEEEEEKQK-- 122
L + + SE + + E EE EKE E EE +E +K +E E E E E+ +
Sbjct: 207 LEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEAL 266
Query: 123 -----------------DKKEEEVEEEEEEQKNKK---KEEEEEEEKEKVEMYRNPVPMY 162
+ + ++++E ++ + E +EE+E+V P
Sbjct: 267 SKFLKTDKTFAIEGWVPEDRVKKLKELIDKATGGSAYVEFVEPDEEEEEV-----PT-KL 320
Query: 163 RNP 165
+NP
Sbjct: 321 KNP 323
Score = 31.4 bits (72), Expect = 0.20
Identities = 26/80 (32%), Positives = 38/80 (47%), Gaps = 8/80 (10%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE- 142
+E E+E ++ E E +E+ E RE KEE EE +K++ E + EE +E K E
Sbjct: 191 DEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKER--ESLLEELKELAKKYLEE 248
Query: 143 -----EEEEEEKEKVEMYRN 157
E E E E+ E
Sbjct: 249 LLALYEYLEIELERAEALSK 268
Score = 30.3 bits (69), Expect = 0.43
Identities = 13/56 (23%), Positives = 28/56 (50%)
Query: 94 EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
+ E+++K K EE K +++ E++E+E +E + + E E E ++
Sbjct: 63 RSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKE 118
Score = 29.5 bits (67), Expect = 0.76
Identities = 18/59 (30%), Positives = 31/59 (52%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
K EE+++ + E+ K E++ E+ EK+ + EEE+ E E E K ++E E
Sbjct: 67 PKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIER 125
Score = 28.7 bits (65), Expect = 1.5
Identities = 21/67 (31%), Positives = 30/67 (44%), Gaps = 4/67 (5%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
E E EEE E E +EE +E EK+RE EE ++ K EE+ E +
Sbjct: 202 FERLELEEEGTPSELIREIKEELEEI----EKERESLLEELKELAKKYLEELLALYEYLE 257
Query: 138 NKKKEEE 144
+ + E
Sbjct: 258 IELERAE 264
Score = 28.4 bits (64), Expect = 2.3
Identities = 23/108 (21%), Positives = 40/108 (37%), Gaps = 26/108 (24%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEE-KEKD------EDEEKKREKKEEEEEKQKD 123
+ + EE K+ EEE E+ E+E KE + E+E K+ E++ E E +
Sbjct: 72 LREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGN 131
Query: 124 -------------------KKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
E+ EE + + + + E +K V
Sbjct: 132 FDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYV 179
Score = 27.2 bits (61), Expect = 5.5
Identities = 17/85 (20%), Positives = 34/85 (40%), Gaps = 10/85 (11%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEE--------EKEKDEDEEKKREKKEEEEEKQKDKKE 126
E E + E E ++ KE ++ E++ +K E + +++
Sbjct: 153 PEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGT 212
Query: 127 --EEVEEEEEEQKNKKKEEEEEEEK 149
E + E +EE + +KE E E+
Sbjct: 213 PSELIREIKEELEEIEKERESLLEE 237
>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
topoisomerase IV), A subunit [DNA replication,
recombination, and repair].
Length = 804
Score = 32.2 bits (74), Expect = 0.099
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 2/79 (2%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE--KKEEEEEKQKDKKEEEVEEEEEE 135
L E E+ EKE +E E+E + ++ EE+ + KKE E K+K E E EEE
Sbjct: 431 LTGLEEEKIEKELKELEKEIADLEKILASEERLLDIIKKELLEIKKKFGDERRTEIVEEE 490
Query: 136 QKNKKKEEEEEEEKEKVEM 154
+ + E+ EE V +
Sbjct: 491 EDEIEDEDLIAEEDVVVTL 509
>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein. A number of members of this
family are annotated as being transmembrane proteins
induced by tumour necrosis factor alpha, but no
literature was found to support this.
Length = 330
Score = 32.0 bits (73), Expect = 0.10
Identities = 18/79 (22%), Positives = 41/79 (51%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
EW+E EKE ++ +E K+K E+ EK +E+ +Q+ + ++ + ++ +K+
Sbjct: 4 EEWKELEKEFQQLQETHRLYKQKLEEVEKLQEQCTSAIARQRKRLKQLIVSLKKLKKSLT 63
Query: 141 KEEEEEEEKEKVEMYRNPV 159
E+ E E+ + ++
Sbjct: 64 PEDSELVEQLEEQIKERKN 82
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 32.0 bits (73), Expect = 0.11
Identities = 19/56 (33%), Positives = 30/56 (53%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
E E++EEEEE+E+DE K E EE ++ +E+ E++EE+ N
Sbjct: 174 EALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSP 229
Score = 31.3 bits (71), Expect = 0.19
Identities = 24/92 (26%), Positives = 39/92 (42%), Gaps = 22/92 (23%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE---------- 134
E +K +EE+EE EKE++E+ KK E ++ E +E EE
Sbjct: 107 NYEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSRRAD 166
Query: 135 ------------EQKNKKKEEEEEEEKEKVEM 154
+K +++EEEEE+E +
Sbjct: 167 VDVNSMLEALFRREKKEEEEEEEEDEALIKSL 198
Score = 26.2 bits (58), Expect = 9.2
Identities = 14/68 (20%), Positives = 32/68 (47%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E+++E+EEEEE+E K E + +++ ++E +D +E+ + + +
Sbjct: 179 REKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAK 238
Query: 144 EEEEEKEK 151
K+
Sbjct: 239 PTSILKKS 246
Score = 26.2 bits (58), Expect = 9.5
Identities = 17/72 (23%), Positives = 31/72 (43%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E L E +EEE+E+EE+E + E EE +R +E+ E ++ + +
Sbjct: 174 EALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGS 233
Query: 136 QKNKKKEEEEEE 147
K ++
Sbjct: 234 SSPAKPTSILKK 245
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 32.3 bits (74), Expect = 0.11
Identities = 13/62 (20%), Positives = 30/62 (48%), Gaps = 12/62 (19%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
NE ++ + E+EE +E++ ++D+ + ++++ EEE + E E
Sbjct: 219 QNELDDIDDEEEERVLADEDDDDEDDMFD------MDDDD------EEESDPEVERTSTI 266
Query: 140 KK 141
K+
Sbjct: 267 KE 268
Score = 28.8 bits (65), Expect = 1.3
Identities = 13/49 (26%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
L++ + EEEE+ +E++++E++ +D+++ E+ + E E+ KE
Sbjct: 222 LDDIDDEEEERVLADEDDDDEDDMFDMDDDDE--EESDPEVERTSTIKE 268
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 32.1 bits (73), Expect = 0.12
Identities = 22/75 (29%), Positives = 37/75 (49%), Gaps = 2/75 (2%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKR--EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
EK E+EE EE + + E+EE++R +KEEEE++ +K ++ +E E E
Sbjct: 138 EKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQALLDELETSTLPAAEL 197
Query: 145 EEEEKEKVEMYRNPV 159
+ K+ V
Sbjct: 198 IAQHKKNSVKLEMQV 212
Score = 30.5 bits (69), Expect = 0.37
Identities = 21/74 (28%), Positives = 45/74 (60%), Gaps = 5/74 (6%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEE-----KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
L NN E K+K E ++E ++ KEK E+++ E+ E E+++++++ +++E
Sbjct: 109 LTNNIDLENTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKE 168
Query: 133 EEEQKNKKKEEEEE 146
EEEQ+ K++ ++
Sbjct: 169 EEEQQMNKRKNKQA 182
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 31.7 bits (72), Expect = 0.12
Identities = 20/73 (27%), Positives = 43/73 (58%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
+K +E+EEE+E ++ +++ +K E EE +++ + E E+ EQ+ K++ +EEE
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEE 92
Query: 148 EKEKVEMYRNPVP 160
++E+ E +
Sbjct: 93 KEEEAEDVKQQEV 105
Score = 30.2 bits (68), Expect = 0.38
Identities = 23/85 (27%), Positives = 52/85 (61%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
+ +T + +E+E+EKE +E + ++E+ + E ++ EK++E E +++ E+
Sbjct: 22 EAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQ 81
Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVE 153
E+EE Q+ +K+EE E+ ++++V
Sbjct: 82 QEKEEIAQEEEKEEEAEDVKQQEVF 106
Score = 29.4 bits (66), Expect = 0.75
Identities = 22/65 (33%), Positives = 40/65 (61%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
+ N E + ++EE++ E EE +++ E E +RE E++EK++ +EEE EEE E+ K
Sbjct: 43 KENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQ 102
Query: 139 KKKEE 143
++
Sbjct: 103 QEVFS 107
Score = 28.6 bits (64), Expect = 1.3
Identities = 21/71 (29%), Positives = 43/71 (60%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E E +E + +EEE++ E E+ + E E + E+++ E+++K++ +E E+EEE +
Sbjct: 41 EEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDV 100
Query: 139 KKKEEEEEEEK 149
K++E + K
Sbjct: 101 KQQEVFSFKRK 111
Score = 28.6 bits (64), Expect = 1.3
Identities = 18/75 (24%), Positives = 48/75 (64%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+EKE+E+E +E + +++E + + E ++E+E + ++++ ++E+EE ++++EE
Sbjct: 36 ADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEE 95
Query: 145 EEEEKEKVEMYRNPV 159
E E+ ++ E++
Sbjct: 96 EAEDVKQQEVFSFKR 110
Score = 28.3 bits (63), Expect = 1.4
Identities = 24/85 (28%), Positives = 53/85 (62%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
+ S+ ++ EE++ E EE E+E+E + + ED +++EK+E +E++K+++ E+
Sbjct: 40 EEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAED 99
Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVE 153
V+++E +KK +E +EK++
Sbjct: 100 VKQQEVFSFKRKKPFKEMNLEEKID 124
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 31.5 bits (72), Expect = 0.13
Identities = 16/72 (22%), Positives = 36/72 (50%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
+ +E + E + +K E ++KKRE +E+ E + +++ +EE ++K K+
Sbjct: 206 DLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKE 265
Query: 142 EEEEEEEKEKVE 153
+ + K+ V
Sbjct: 266 SKGVKALKKVVA 277
Score = 29.6 bits (67), Expect = 0.76
Identities = 18/59 (30%), Positives = 32/59 (54%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
+ +E K E K +K +E +K+K + EE+VE E + K+K +EE ++K+
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKP 263
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 30.5 bits (69), Expect = 0.14
Identities = 14/65 (21%), Positives = 33/65 (50%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+K K + E +E+ K ++++ K + KE + +K + E +K K K++++
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61
Query: 146 EEEKE 150
+ +K
Sbjct: 62 DCKKG 66
Score = 28.2 bits (63), Expect = 1.0
Identities = 13/69 (18%), Positives = 35/69 (50%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
++ + + E +E+ + +K ED+ K + K+ + ++ + E E+ KK+++
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61
Query: 145 EEEEKEKVE 153
+ ++ K E
Sbjct: 62 DCKKGCKWE 70
Score = 27.0 bits (60), Expect = 2.1
Identities = 8/46 (17%), Positives = 23/46 (50%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
+W+++E + + + +E + +K + + E+ K K K++
Sbjct: 17 KWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKD 62
>gnl|CDD|219514 pfam07686, V-set, Immunoglobulin V-set domain. This domain is
found in antibodies as well as neural protein P0 and
CTL4 amongst others.
Length = 114
Score = 30.7 bits (69), Expect = 0.14
Identities = 14/44 (31%), Positives = 20/44 (45%), Gaps = 1/44 (2%)
Query: 23 GDSSHGVYNLRIRNVSLTDDAEFQCQVGPAQYHKPIRANARLTV 66
G+ S ++L I N+ L+D + C V A RLTV
Sbjct: 72 GNGSKNDFSLTISNLRLSDSGTYTCAVS-NPNELVFGAGTRLTV 114
>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
Length = 1560
Score = 32.1 bits (73), Expect = 0.15
Identities = 13/63 (20%), Positives = 22/63 (34%), Gaps = 2/63 (3%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKD--EDEEKKREKKEEEEEKQKDKKEEEVEE 131
A+E EN + +E + + E E + D K+ E + D +
Sbjct: 878 AAELKENKDSKEGDADAEVAEVDAAPGGAVDHEPPVAKQEGNAEGGDGAALDAAAGRLMT 937
Query: 132 EEE 134
EE
Sbjct: 938 REE 940
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
Length = 193
Score = 31.2 bits (70), Expect = 0.16
Identities = 22/56 (39%), Positives = 38/56 (67%)
Query: 98 EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+E K +D DE +REK ++E E+ ++K E++ EE E+Q + E+EEE E++K +
Sbjct: 11 KEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKPK 66
Score = 27.0 bits (59), Expect = 4.0
Identities = 24/65 (36%), Positives = 39/65 (60%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+E ++ + +E E EK K E EE +K E+ E+ +D+ EE E+EEE ++ K KEE +
Sbjct: 11 KEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKPKEEID 70
Query: 146 EEEKE 150
+E
Sbjct: 71 YPIQE 75
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 31.5 bits (72), Expect = 0.17
Identities = 13/33 (39%), Positives = 24/33 (72%)
Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
KE++E E++K+ KKE +EE++ + + +EE E
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNKSIEEAE 423
Score = 26.5 bits (59), Expect = 8.1
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
KE+EE+E+E++ +KE E+ + K EE
Sbjct: 391 KEREEKEKEKKAKKEGKEERRIHFQNKSIEE 421
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 31.7 bits (71), Expect = 0.17
Identities = 31/78 (39%), Positives = 40/78 (51%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E +E N E E+ EE EE EE ++ EE E EE E+ ++ EEVEE EE
Sbjct: 977 ENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEE 1036
Query: 136 QKNKKKEEEEEEEKEKVE 153
+ EE EE +E VE
Sbjct: 1037 YDEENVEEIEENAEENVE 1054
Score = 30.9 bits (69), Expect = 0.28
Identities = 28/70 (40%), Positives = 40/70 (57%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE +E +EE EE EE ++ DEE E +E EE ++ EE +EE +EE + +E
Sbjct: 1016 EENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEEN 1075
Query: 144 EEEEEKEKVE 153
EE +E VE
Sbjct: 1076 IEENIEENVE 1085
Score = 30.9 bits (69), Expect = 0.30
Identities = 29/72 (40%), Positives = 40/72 (55%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E EE +E +EE EE EE ++ EE E EE +E+ ++ EE +EE EE +
Sbjct: 1029 EVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEENVEENV 1088
Query: 142 EEEEEEEKEKVE 153
EE EE +E VE
Sbjct: 1089 EENVEEIEENVE 1100
Score = 30.9 bits (69), Expect = 0.31
Identities = 29/78 (37%), Positives = 42/78 (53%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E +E N E E+ EE EE EE ++ EE E EE +E+ ++ EE VEE +EE
Sbjct: 981 ENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEE 1040
Query: 136 QKNKKKEEEEEEEKEKVE 153
+ +E EE +E +E
Sbjct: 1041 NVEEIEENAEENVEENIE 1058
Score = 30.9 bits (69), Expect = 0.32
Identities = 30/77 (38%), Positives = 40/77 (51%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EN E EE +E EE EE E++ +E + +E EE ++ EE VEE EE
Sbjct: 961 ENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVE 1020
Query: 139 KKKEEEEEEEKEKVEMY 155
+ EE EE +E VE Y
Sbjct: 1021 EYDEENVEEVEENVEEY 1037
Score = 30.5 bits (68), Expect = 0.39
Identities = 28/70 (40%), Positives = 36/70 (51%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE +E EE EE EE ++ EE E EE EE ++ EE VEE EE + +E
Sbjct: 1039 EENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEEN 1098
Query: 144 EEEEEKEKVE 153
EE +E E
Sbjct: 1099 VEENVEENAE 1108
Score = 30.5 bits (68), Expect = 0.40
Identities = 28/70 (40%), Positives = 38/70 (54%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE +E EE EE +E+ +E EE E EE E+ ++ EE VEE EE + EE
Sbjct: 1008 EENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEE 1067
Query: 144 EEEEEKEKVE 153
EE +E +E
Sbjct: 1068 NVEEIEENIE 1077
Score = 30.5 bits (68), Expect = 0.45
Identities = 29/78 (37%), Positives = 39/78 (50%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E + EE E+ EE EE EE ++ DEE E +E EE ++ EE VEE EE
Sbjct: 1035 EEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEENVEENVEENVEE 1094
Query: 136 QKNKKKEEEEEEEKEKVE 153
+ +E EE +E E
Sbjct: 1095 IEENVEENVEENAEENAE 1112
Score = 30.5 bits (68), Expect = 0.49
Identities = 26/68 (38%), Positives = 36/68 (52%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE ++E EE EE EE ++ EE E EE EE ++ EE EE EE + EE
Sbjct: 1062 EEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEE 1121
Query: 144 EEEEEKEK 151
++E E+
Sbjct: 1122 YDDENPEE 1129
Score = 29.8 bits (66), Expect = 0.69
Identities = 29/78 (37%), Positives = 39/78 (50%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E +E N E E+ EE EE EE ++ EE E EE E+ ++ +EE EE EE
Sbjct: 973 ENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEE 1032
Query: 136 QKNKKKEEEEEEEKEKVE 153
+ EE EE +E E
Sbjct: 1033 NVEEYDEENVEEIEENAE 1050
Score = 29.8 bits (66), Expect = 0.72
Identities = 28/78 (35%), Positives = 41/78 (52%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E +E N E E+ EE EE EE ++ EE E EE E+ ++ EE +EE E
Sbjct: 969 ENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVE 1028
Query: 136 QKNKKKEEEEEEEKEKVE 153
+ + EE +EE E++E
Sbjct: 1029 EVEENVEEYDEENVEEIE 1046
Score = 29.8 bits (66), Expect = 0.76
Identities = 29/80 (36%), Positives = 40/80 (50%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E +E N E E+ EE EE EE ++ EE E EE E+ ++ EE EE EE
Sbjct: 985 ENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEE 1044
Query: 136 QKNKKKEEEEEEEKEKVEMY 155
+ +E EE +E +E Y
Sbjct: 1045 IEENAEENVEENIEENIEEY 1064
Score = 29.4 bits (65), Expect = 1.1
Identities = 31/78 (39%), Positives = 42/78 (53%), Gaps = 1/78 (1%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E +E N E E+ EE +EE EE E++ EE E EE EE ++ EE +EE EE
Sbjct: 1005 ENVEENVEENIEENVEEYDEENVEEVEENV-EEYDEENVEEIEENAEENVEENIEENIEE 1063
Query: 136 QKNKKKEEEEEEEKEKVE 153
+ EE EE +E +E
Sbjct: 1064 YDEENVEEIEENIEENIE 1081
Score = 29.0 bits (64), Expect = 1.5
Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 1/82 (1%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE +E EE EE +E+ +E EE E EE E+ ++ EE+EE EE + EE
Sbjct: 1050 EENVEENIEENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEE 1109
Query: 144 EEEEE-KEKVEMYRNPVPMYRN 164
EE +E E Y + P N
Sbjct: 1110 NAEENAEENAEEYDDENPEEHN 1131
Score = 28.6 bits (63), Expect = 1.8
Identities = 30/81 (37%), Positives = 41/81 (50%), Gaps = 3/81 (3%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK---EEEVEEE 132
E +E N E E+ EE EE EE ++ DEE E +E EE ++ EE EE
Sbjct: 993 ENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEEN 1052
Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
EE + EE +EE E++E
Sbjct: 1053 VEENIEENIEEYDEENVEEIE 1073
Score = 28.2 bits (62), Expect = 2.4
Identities = 26/73 (35%), Positives = 38/73 (52%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EN E + EE +E EE EE E++ +E + +E EE ++ EE +EE EE
Sbjct: 953 ENVEEDAEENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVE 1012
Query: 139 KKKEEEEEEEKEK 151
+ EE EE E+
Sbjct: 1013 ENIEENVEEYDEE 1025
Score = 28.2 bits (62), Expect = 2.5
Identities = 27/81 (33%), Positives = 39/81 (48%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EN E EE +E EE EE E++ +E + +E EE ++ EE +EE EE
Sbjct: 965 ENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDE 1024
Query: 139 KKKEEEEEEEKEKVEMYRNPV 159
+ EE EE +E E +
Sbjct: 1025 ENVEEVEENVEEYDEENVEEI 1045
Score = 28.2 bits (62), Expect = 2.7
Identities = 31/81 (38%), Positives = 41/81 (50%), Gaps = 2/81 (2%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDED--EEKKREKKEEEEEKQKDKKEEEVE 130
+ E EN E EE +E EE +EE E+ E+ EE E EE E+ ++ EE VE
Sbjct: 1041 NVEEIEENAEENVEENIEENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVE 1100
Query: 131 EEEEEQKNKKKEEEEEEEKEK 151
E EE + EE EE E+
Sbjct: 1101 ENVEENAEENAEENAEENAEE 1121
Score = 28.2 bits (62), Expect = 2.8
Identities = 28/75 (37%), Positives = 38/75 (50%), Gaps = 2/75 (2%)
Query: 79 ENNEWEEEEKEKEEEEEEEE--EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
EN E +EE +E EE EE EE ++ EE E +E EE ++ EE EE EE
Sbjct: 1059 ENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEEN 1118
Query: 137 KNKKKEEEEEEEKEK 151
+ +E EE E+
Sbjct: 1119 AEEYDDENPEEHNEE 1133
Score = 27.4 bits (60), Expect = 4.7
Identities = 26/72 (36%), Positives = 38/72 (52%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
N E + EE +E+ EE EE E++ +E + +E EE ++ EE VEE EE +
Sbjct: 946 NIEEDAEENVEEDAEENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEE 1005
Query: 140 KKEEEEEEEKEK 151
EE EE E+
Sbjct: 1006 NVEENVEENIEE 1017
Score = 27.1 bits (59), Expect = 6.5
Identities = 31/84 (36%), Positives = 42/84 (50%), Gaps = 9/84 (10%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDED---------EEKKREKKEEEEEKQKDKKEEEV 129
EN E EE ++E EE EE +E DE+ EE E EE E+ ++ EE+
Sbjct: 1013 ENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEI 1072
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
EE EE + EE EE E++E
Sbjct: 1073 EENIEENIEENVEENVEENVEEIE 1096
Score = 26.3 bits (57), Expect = 9.7
Identities = 21/60 (35%), Positives = 31/60 (51%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E +E N E E+ EE EE EE E++ +E + +E EE ++ +E EE EE
Sbjct: 1074 ENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEEHNEE 1133
>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
Length = 249
Score = 31.2 bits (71), Expect = 0.18
Identities = 13/44 (29%), Positives = 18/44 (40%), Gaps = 1/44 (2%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
W + E E EE+EE E ++ E +E EE E
Sbjct: 204 WRDPE-EAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246
Score = 30.0 bits (68), Expect = 0.42
Identities = 11/42 (26%), Positives = 16/42 (38%)
Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
++ +E EEK+ E + E EE EE E
Sbjct: 205 RDPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 31.5 bits (72), Expect = 0.19
Identities = 26/90 (28%), Positives = 44/90 (48%), Gaps = 11/90 (12%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDE-----------DEEKKREKKEEEEEK 120
I+AS L E E KE E EE+++ KE+ E E+ +E ++ +
Sbjct: 492 ITASSGLSEEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAE 551
Query: 121 QKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
+K+K E+ V E +E K + EE + + +E
Sbjct: 552 EKEKIEKAVAELKEALKGEDVEEIKAKTEE 581
Score = 26.5 bits (59), Expect = 8.3
Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 10/59 (16%)
Query: 106 DEEKKREKKEEEEEKQKDKKEEEVEE----------EEEEQKNKKKEEEEEEEKEKVEM 154
+EE +R KE E ++DKK +E E + E+ + ++ EEKEK+E
Sbjct: 500 EEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEEKEKIEK 558
>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
Length = 653
Score = 31.7 bits (72), Expect = 0.19
Identities = 27/90 (30%), Positives = 41/90 (45%), Gaps = 13/90 (14%)
Query: 84 EEEEKEKEEEEEEEE--EEKEKDE----------DEEKKREKKEEEEEKQKDKKEEEVEE 131
E EK K E+E E E K E +EK + K + ++ +K +E E
Sbjct: 522 NEAEKYKAEDEANRERVEAKNGLENYCYSMKNTLQDEKVKGKLSDSDKATIEKAIDEALE 581
Query: 132 EEEEQKNKKKEEEEEEEKEKVEMYRNPVPM 161
E+ + +KEE E ++KE VE NP+
Sbjct: 582 WLEKNQLAEKEEFEHKQKE-VESVCNPIMT 610
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 30.6 bits (70), Expect = 0.20
Identities = 12/62 (19%), Positives = 25/62 (40%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
QA S + L + E E E + + + + E+ + + +E E + + E E
Sbjct: 45 QAKSIKQHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAES 104
Query: 130 EE 131
+
Sbjct: 105 DH 106
Score = 28.3 bits (64), Expect = 1.5
Identities = 14/67 (20%), Positives = 24/67 (35%), Gaps = 7/67 (10%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
E + E E + E + K +K+K E +E E K K+ E E +
Sbjct: 55 ELAADLLELEAAAPRAELQAKI-------ARYKKEKARYRSEAKELEAKAKEAEAESDHA 107
Query: 149 KEKVEMY 155
+ +
Sbjct: 108 LHQHHRF 114
Score = 26.8 bits (60), Expect = 4.4
Identities = 7/40 (17%), Positives = 18/40 (45%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
E + + ++E+ E ++ E + K+ E + + Q
Sbjct: 71 ELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQ 110
Score = 26.8 bits (60), Expect = 4.7
Identities = 5/38 (13%), Positives = 18/38 (47%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
+ + ++E+ + + ++ E K ++ E E + +
Sbjct: 73 QAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQ 110
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 30.6 bits (70), Expect = 0.21
Identities = 15/61 (24%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 98 EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK-KKEEEEEEEKEKVEMYR 156
+ E+ K E E++ ++ +EE+EE +K E E + E E++ + +++ EE+ +++ +
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLK 176
Query: 157 N 157
Sbjct: 177 K 177
Score = 28.3 bits (64), Expect = 1.5
Identities = 16/40 (40%), Positives = 26/40 (65%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
E ++ E+EKEE E+ E + K E EK+ E++ + EEK+
Sbjct: 128 EIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKR 167
Score = 27.6 bits (62), Expect = 2.5
Identities = 21/61 (34%), Positives = 33/61 (54%), Gaps = 8/61 (13%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E+ + E E+E ++ EEEKE+ E + E K E EK+ EEEE + + K+ +
Sbjct: 119 EQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKR--------EEEERQIEEKRHAD 170
Query: 144 E 144
E
Sbjct: 171 E 171
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 31.2 bits (71), Expect = 0.21
Identities = 14/43 (32%), Positives = 31/43 (72%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
E+ E E+ E+EEEE++ E +++DE++E + E+++++E +
Sbjct: 277 EDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 30.8 bits (70), Expect = 0.29
Identities = 15/46 (32%), Positives = 31/46 (67%)
Query: 105 EDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
DEE E E+ EE++++ E E+E+EE +++++EE+++E +
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 29.6 bits (67), Expect = 0.65
Identities = 14/34 (41%), Positives = 25/34 (73%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK 109
E E E E++ E E+E+EE+E+E+E+++D+E
Sbjct: 284 EKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEG 317
Score = 29.2 bits (66), Expect = 0.97
Identities = 16/44 (36%), Positives = 30/44 (68%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
E + E EK +EEEEE++ E E +++E++ E++E+++E K
Sbjct: 276 EEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 29.2 bits (66), Expect = 1.0
Identities = 15/46 (32%), Positives = 31/46 (67%)
Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
+ED E+ EK EEEEE+ + E+ +EE+E+++ ++ ++E ++
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 29.2 bits (66), Expect = 1.1
Identities = 18/43 (41%), Positives = 31/43 (72%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
E EE EK EEEEEE++ E ++++E+ +++EEE++ + DK
Sbjct: 277 EDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 28.9 bits (65), Expect = 1.4
Identities = 16/39 (41%), Positives = 26/39 (66%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK 114
E LE E EEEE + E E+E+EE+++++E+E+ K
Sbjct: 281 ELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 28.1 bits (63), Expect = 2.4
Identities = 19/46 (41%), Positives = 32/46 (69%)
Query: 92 EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
+EE+ EE EK E+EE++ + E E+E ++D+ EEE E+++E K
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 28.1 bits (63), Expect = 2.5
Identities = 15/51 (29%), Positives = 33/51 (64%), Gaps = 5/51 (9%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
++E EE E+ EE+E+++D + ++ EE+E+ +EEE +++E ++
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDED-----EEEEEDDDEGDK 319
Score = 27.7 bits (62), Expect = 2.7
Identities = 16/45 (35%), Positives = 31/45 (68%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
+EE+ E+ E+ EEEEE++ + E + E+ E+EEE++ D + ++
Sbjct: 275 DEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 31.4 bits (72), Expect = 0.22
Identities = 15/55 (27%), Positives = 26/55 (47%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
+E K+ + + E + E + +KEE E+ + E E EE+EE+ N
Sbjct: 826 YKEYKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEELN 880
Score = 29.1 bits (66), Expect = 1.3
Identities = 18/71 (25%), Positives = 29/71 (40%), Gaps = 8/71 (11%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
E +E ++ + K E E + +KEE + + +K EE+EEE
Sbjct: 824 ERYKEYKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEE----- 878
Query: 153 EMYRNPVPMYR 163
NP+P Y
Sbjct: 879 ---LNPLPPYT 886
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 30.8 bits (70), Expect = 0.22
Identities = 18/91 (19%), Positives = 46/91 (50%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
+ L + + E+ +E+++K ++K++ KE +E K+++KK ++VE+ E
Sbjct: 146 TPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLE 205
Query: 135 EQKNKKKEEEEEEEKEKVEMYRNPVPMYRNP 165
Q+ K+ + +++K + V ++
Sbjct: 206 LQRELMKKGKGKKKKIVKDKDGKVVYKWKKE 236
Score = 26.2 bits (58), Expect = 8.5
Identities = 12/67 (17%), Positives = 36/67 (53%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
++E+E++ + E + + + ++ + + EK E++ ++ ++++ KE +
Sbjct: 129 DDEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELK 188
Query: 145 EEEEKEK 151
E +E+EK
Sbjct: 189 ERKEREK 195
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 31.1 bits (71), Expect = 0.23
Identities = 16/70 (22%), Positives = 36/70 (51%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
+E E E + EE K++ + E+K E+ E E EK++++ E+ + ++ K ++ +
Sbjct: 88 EENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSN 147
Query: 148 EKEKVEMYRN 157
E ++
Sbjct: 148 LSEALKGLNY 157
Score = 28.1 bits (63), Expect = 3.0
Identities = 14/48 (29%), Positives = 26/48 (54%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
E +E+ + + E+K EK EK+ + E E++ E+E K +K+
Sbjct: 404 AELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQ 451
Score = 27.3 bits (61), Expect = 5.2
Identities = 19/56 (33%), Positives = 34/56 (60%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E E + EE ++E ++ +EK E E + EKKEEE EK K+K ++ ++ ++ +
Sbjct: 92 EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSN 147
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 30.7 bits (70), Expect = 0.23
Identities = 21/59 (35%), Positives = 31/59 (52%)
Query: 96 EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
+ EK K E EE K+ E + + +K KK E EE E + K+ E+E +K K E+
Sbjct: 68 KTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAEL 126
Score = 29.5 bits (67), Expect = 0.52
Identities = 20/78 (25%), Positives = 41/78 (52%), Gaps = 3/78 (3%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
E+ +KE EE ++ E + + E +K RE+ EE E ++ K+ E+E ++
Sbjct: 68 KTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQ---LEKELKKLKA 124
Query: 140 KKEEEEEEEKEKVEMYRN 157
+ E+ E+ + E++E +
Sbjct: 125 ELEKYEKNDPERIEKLKE 142
Score = 28.4 bits (64), Expect = 1.3
Identities = 20/56 (35%), Positives = 31/56 (55%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E+ K+ EE EE E E+ + EK+ +K + E EK + E +E+ +EE K K
Sbjct: 93 EKLKKGREETEERTELLEELKQLEKELKKLKAELEKYEKNDPERIEKLKEETKVAK 148
Score = 28.0 bits (63), Expect = 2.1
Identities = 18/68 (26%), Positives = 41/68 (60%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE ++ E + + E+ K+ E+ E++ E EE ++ +K+ K+ + E E+ E+ + ++ E
Sbjct: 79 EELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAELEKYEKNDPERIE 138
Query: 144 EEEEEKEK 151
+ +EE +
Sbjct: 139 KLKEETKV 146
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 31.5 bits (71), Expect = 0.23
Identities = 26/86 (30%), Positives = 46/86 (53%)
Query: 68 LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
+ E +EEEKEK+ +EEE + +++E+ + + K E + ++K +E
Sbjct: 260 ELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKE 319
Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVE 153
+E ++ +K KKE+EE EE EK
Sbjct: 320 SEKELKKLEKELKKEKEEIEELEKEL 345
Score = 30.7 bits (69), Expect = 0.37
Identities = 29/79 (36%), Positives = 49/79 (62%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E E + E KEKE EEEE+ EK K E+E++++ K +EEE + +++ +E E E
Sbjct: 760 KEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLE 819
Query: 135 EQKNKKKEEEEEEEKEKVE 153
E++ ++EE+ +E+E E
Sbjct: 820 EEQLLIEQEEKIKEEELEE 838
Score = 30.3 bits (68), Expect = 0.58
Identities = 22/71 (30%), Positives = 37/71 (52%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EE +E +E+EE E + E +K E+ + K+ ++E+E + +EEE K
Sbjct: 232 YLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKL 291
Query: 139 KKKEEEEEEEK 149
KEEEE + +
Sbjct: 292 LAKEEEELKSE 302
Score = 29.9 bits (67), Expect = 0.65
Identities = 31/71 (43%), Positives = 48/71 (67%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E ++K EE + E++ KEK+E+EEK R KKEEEEE++ + +E E EEE+K +K +
Sbjct: 730 EAQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKV 789
Query: 144 EEEEEKEKVEM 154
EEE+E++
Sbjct: 790 EEEKEEKLKAQ 800
Score = 29.9 bits (67), Expect = 0.78
Identities = 26/82 (31%), Positives = 46/82 (56%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
E + E + + EE+ +E E+E +K E E KK +++ EE EK+ + E +
Sbjct: 293 AKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKR 352
Query: 130 EEEEEEQKNKKKEEEEEEEKEK 151
E EEEE++ +K +E+ E+ E+
Sbjct: 353 EAEEEEEEQLEKLQEKLEQLEE 374
Score = 29.6 bits (66), Expect = 0.98
Identities = 29/81 (35%), Positives = 44/81 (54%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ E E EEE K +EEEE + E K E + E+K +E EK+ K E+E+
Sbjct: 272 LKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKEL 331
Query: 130 EEEEEEQKNKKKEEEEEEEKE 150
++E+EE + +KE +E E K
Sbjct: 332 KKEKEEIEELEKELKELEIKR 352
Score = 29.2 bits (65), Expect = 1.1
Identities = 29/88 (32%), Positives = 50/88 (56%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ +E E E + E+EKEE+ + +EEE E+E K+ + EEE+ +++E+
Sbjct: 773 KEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIK 832
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
EEE EE + KEE++ E+ + E+ R
Sbjct: 833 EEELEELALELKEEQKLEKLAEEELERL 860
Score = 29.2 bits (65), Expect = 1.1
Identities = 22/78 (28%), Positives = 43/78 (55%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E E+EE E ++E E+EEE +E K E+KE++ ++++ K + EEE +
Sbjct: 242 LLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKS 301
Query: 136 QKNKKKEEEEEEEKEKVE 153
+ K + + ++E++ E
Sbjct: 302 ELLKLERRKVDDEEKLKE 319
Score = 29.2 bits (65), Expect = 1.3
Identities = 27/83 (32%), Positives = 45/83 (54%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
A A L+ EE +E +E+E+EE+ +++E+E KR +EE +
Sbjct: 925 AEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEF 984
Query: 131 EEEEEQKNKKKEEEEEEEKEKVE 153
EE+EE+ NK + ++E E+EK E
Sbjct: 985 EEKEERYNKDELKKERLEEEKKE 1007
Score = 28.8 bits (64), Expect = 1.4
Identities = 27/81 (33%), Positives = 42/81 (51%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
E + E++ K+E+EE EE EKE E E K+ ++EEEE+ +K +++ E EEE K
Sbjct: 320 SEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAK 379
Query: 138 NKKKEEEEEEEKEKVEMYRNP 158
K + E + E
Sbjct: 380 KKLESERLSSAAKLKEEELEL 400
Score = 28.8 bits (64), Expect = 1.4
Identities = 29/82 (35%), Positives = 50/82 (60%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
S E EEEEK ++ + EEE+EEK K ++EE + ++E +EE + ++E+ + E
Sbjct: 767 KSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIE 826
Query: 132 EEEEQKNKKKEEEEEEEKEKVE 153
+EE+ K ++ EE E KE+ +
Sbjct: 827 QEEKIKEEELEELALELKEEQK 848
Score = 28.8 bits (64), Expect = 1.7
Identities = 27/71 (38%), Positives = 47/71 (66%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E+K KE+EEEEE+ +K+E+EE+K E +E+E +++++ E + EEE++ K K +
Sbjct: 741 LLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQ 800
Query: 144 EEEEEKEKVEM 154
EEE + E+
Sbjct: 801 EEELRALEEEL 811
Score = 28.8 bits (64), Expect = 1.8
Identities = 22/68 (32%), Positives = 41/68 (60%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEE + EEE +EE E E+++ ++ EK +EEE ++ + +E ++ E+ + + +
Sbjct: 801 EEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERL 860
Query: 144 EEEEEKEK 151
EEE KE+
Sbjct: 861 EEEITKEE 868
Score = 28.4 bits (63), Expect = 1.9
Identities = 22/96 (22%), Positives = 54/96 (56%)
Query: 58 IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
I L + I +S+ E E + +E +EEE+E+K ++E+ + +++EE
Sbjct: 240 IDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEEL 299
Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ + + +V++EE+ ++++K+ ++ E+E +K +
Sbjct: 300 KSELLKLERRKVDDEEKLKESEKELKKLEKELKKEK 335
Score = 28.4 bits (63), Expect = 2.5
Identities = 26/73 (35%), Positives = 49/73 (67%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E EE + E + E + +++EK ++ EK+ +K E+E +K+K++ EE +E +E + ++
Sbjct: 294 KEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKRE 353
Query: 141 KEEEEEEEKEKVE 153
EEEEEE+ EK++
Sbjct: 354 AEEEEEEQLEKLQ 366
Score = 28.0 bits (62), Expect = 2.6
Identities = 31/124 (25%), Positives = 58/124 (46%), Gaps = 5/124 (4%)
Query: 39 LTDDAEFQCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEE 98
L D + + +R Q + E + +E ++E++E++ +EE
Sbjct: 228 LYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEE 287
Query: 99 EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK-----NKKKEEEEEEEKEKVE 153
E K ++EE+ + + + E ++ D +E+ E E+E +K K+KEE EE EKE E
Sbjct: 288 ELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKE 347
Query: 154 MYRN 157
+
Sbjct: 348 LEIK 351
Score = 28.0 bits (62), Expect = 2.6
Identities = 27/79 (34%), Positives = 49/79 (62%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E E E E+ E+EE+ +EEE E+ E +E+++ +K EEE ++ ++E EE +
Sbjct: 812 KEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQ 871
Query: 135 EQKNKKKEEEEEEEKEKVE 153
E K++E EE++ K+++E
Sbjct: 872 ELLLKEEELEEQKLKDELE 890
Score = 28.0 bits (62), Expect = 2.8
Identities = 29/82 (35%), Positives = 50/82 (60%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ S LE + ++EEK KE E+E ++ EKE +++E+ E ++E +E + ++ EE
Sbjct: 298 ELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEE 357
Query: 130 EEEEEEQKNKKKEEEEEEEKEK 151
EEE+ E+ +K E+ EEE K
Sbjct: 358 EEEQLEKLQEKLEQLEEELLAK 379
Score = 28.0 bits (62), Expect = 2.8
Identities = 28/79 (35%), Positives = 44/79 (55%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E E+EEK K +EEE E+E E+ E E++ E+++K K+EE E E
Sbjct: 784 TEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALEL 843
Query: 136 QKNKKKEEEEEEEKEKVEM 154
++ +K E+ EEE E++E
Sbjct: 844 KEEQKLEKLAEEELERLEE 862
Score = 27.6 bits (61), Expect = 3.7
Identities = 31/82 (37%), Positives = 46/82 (56%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ + E E EE EKE +E E + E E+E++E EK +EK E+ EE+ KK+ E
Sbjct: 325 KKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLES 384
Query: 130 EEEEEEQKNKKKEEEEEEEKEK 151
E K K++E E + E+EK
Sbjct: 385 ERLSSAAKLKEEELELKNEEEK 406
Score = 27.6 bits (61), Expect = 4.1
Identities = 16/80 (20%), Positives = 35/80 (43%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
A+ + E + +E K+ EE E E E+ + + K +E+ +K + + + + E
Sbjct: 163 AAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLEL 222
Query: 134 EEQKNKKKEEEEEEEKEKVE 153
EE+ + + E+
Sbjct: 223 EEENLLYLDYLKLNEERIDL 242
Score = 27.6 bits (61), Expect = 4.3
Identities = 26/77 (33%), Positives = 46/77 (59%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E E E +KE+EEEE+ E KEK+ EE+++ +K + EE++++K + + EE
Sbjct: 747 KEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRA 806
Query: 135 EQKNKKKEEEEEEEKEK 151
++ K+E E EE++
Sbjct: 807 LEEELKEEAELLEEEQL 823
Score = 27.2 bits (60), Expect = 4.9
Identities = 22/70 (31%), Positives = 40/70 (57%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+EEE EEE +EE E ++E ++E+K +EEE ++ E + E++ E+ ++ E
Sbjct: 800 QEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELER 859
Query: 144 EEEEEKEKVE 153
EEE ++
Sbjct: 860 LEEEITKEEL 869
Score = 27.2 bits (60), Expect = 5.0
Identities = 19/73 (26%), Positives = 38/73 (52%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
+ + +EE+ ++EEEE + K+E E EEK++ ++E+++E E++
Sbjct: 946 ADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEK 1005
Query: 139 KKKEEEEEEEKEK 151
K+ E EE +
Sbjct: 1006 KELLREIIEETCQ 1018
Score = 27.2 bits (60), Expect = 5.0
Identities = 21/73 (28%), Positives = 31/73 (42%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + EE E E + EE + E + K++ KK E + K+K E E E
Sbjct: 174 ERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYL 233
Query: 139 KKKEEEEEEEKEK 151
K EE + +E
Sbjct: 234 KLNEERIDLLQEL 246
Score = 27.2 bits (60), Expect = 5.2
Identities = 17/70 (24%), Positives = 33/70 (47%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
+EK +++E ++ E+ E+ + EE + ++ KE+ + E Q +K E EEE
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226
Query: 148 EKEKVEMYRN 157
+ N
Sbjct: 227 LLYLDYLKLN 236
Score = 27.2 bits (60), Expect = 5.2
Identities = 21/79 (26%), Positives = 46/79 (58%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E L ++ + +E+ + +E +++++ + K+ +KEEE Q K+ +E E+E++
Sbjct: 225 ENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKL 284
Query: 136 QKNKKKEEEEEEEKEKVEM 154
Q+ + K +EEE+ K E+
Sbjct: 285 QEEELKLLAKEEEELKSEL 303
Score = 27.2 bits (60), Expect = 5.4
Identities = 28/96 (29%), Positives = 49/96 (51%)
Query: 58 IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
I Q I E EE +K +E +++ E+ K +++ K +++EEE
Sbjct: 695 ILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEEEEE 754
Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ + K ++EEE + E ++ + EEEE+ EK KVE
Sbjct: 755 KSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVE 790
Score = 26.5 bits (58), Expect = 8.9
Identities = 20/71 (28%), Positives = 38/71 (53%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E+ E E+E + + ++ + + ++E+EE + K+E E EEE Q K+
Sbjct: 215 QLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKE 274
Query: 141 KEEEEEEEKEK 151
+EEE+E+K +
Sbjct: 275 NKEEEKEKKLQ 285
Score = 26.5 bits (58), Expect = 9.1
Identities = 23/79 (29%), Positives = 40/79 (50%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
LE ++ E++E E EE +E + + E +E E+ +K++EE +EEEE++
Sbjct: 903 LEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEER 962
Query: 138 NKKKEEEEEEEKEKVEMYR 156
NK+ +EE M
Sbjct: 963 NKRLLLAKEELGNVNLMAI 981
>gnl|CDD|234271 TIGR03591, polynuc_phos, polyribonucleotide nucleotidyltransferase.
Members of this protein family are polyribonucleotide
nucleotidyltransferase, also called polynucleotide
phosphorylase. Some members have been shown also to have
additional functions as guanosine pentaphosphate
synthetase and as poly(A) polymerase (see model
TIGR02696 for an exception clade, within this family)
[Transcription, Degradation of RNA].
Length = 684
Score = 31.3 bits (72), Expect = 0.23
Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 3/73 (4%)
Query: 81 NEWEEE-EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE--EEVEEEEEEQK 137
E EE KEK E E +E+ K + +E E +E + +K+E ++ +EE
Sbjct: 211 EELAEEAGKEKWEFVAPEVDEELKAKVKELAYEAGLKEAYQITEKQERYAALDAIKEEVL 270
Query: 138 NKKKEEEEEEEKE 150
EEEEEEE+
Sbjct: 271 EALAEEEEEEEEA 283
Score = 28.2 bits (64), Expect = 2.5
Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 3/78 (3%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE---EEEEE 135
E E+ E ++E + + +E E E + +K+E KEE +E EEEEE
Sbjct: 220 EKWEFVAPEVDEELKAKVKELAYEAGLKEAYQITEKQERYAALDAIKEEVLEALAEEEEE 279
Query: 136 QKNKKKEEEEEEEKEKVE 153
++ +E+E +E + +E
Sbjct: 280 EEEADREKEIKEAFKDLE 297
>gnl|CDD|235551 PRK05667, dnaG, DNA primase; Validated.
Length = 580
Score = 31.4 bits (72), Expect = 0.23
Identities = 17/87 (19%), Positives = 33/87 (37%), Gaps = 3/87 (3%)
Query: 58 IRANARLTVILPQAISASETLENNE-WEEEEKEKEEEEEEEEEE--KEKDEDEEKKREKK 114
+ + R + A E EE+ E+E ++ E+ + E+ ++ K
Sbjct: 494 LLEHLRDAGLEELAALLESLAVWEEISEEDIAALEKELKDALEKLRDQLLEERLEELIAK 553
Query: 115 EEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E E EE +E + + K+K
Sbjct: 554 ERLLEGHGLSSEERLELLQLLIELKRK 580
Score = 29.4 bits (67), Expect = 0.87
Identities = 19/95 (20%), Positives = 36/95 (37%), Gaps = 2/95 (2%)
Query: 59 RANARLTVILPQAISASETLENNEWEEEEKEKEE--EEEEEEEEKEKDEDEEKKREKKEE 116
+ L + + + E E EE EE+ EKE + EK R++ E
Sbjct: 485 QPGLTTGSQLLEHLRDAGLEELAALLESLAVWEEISEEDIAALEKELKDALEKLRDQLLE 544
Query: 117 EEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
E ++ KE +E + + + + E ++
Sbjct: 545 ERLEELIAKERLLEGHGLSSEERLELLQLLIELKR 579
Score = 27.5 bits (62), Expect = 3.8
Identities = 17/85 (20%), Positives = 37/85 (43%), Gaps = 1/85 (1%)
Query: 69 PQAISASETLENNEWEEEEKEKEE-EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
P + S+ LE+ E+ E EE +++ ++E K+ E+ + EE
Sbjct: 486 PGLTTGSQLLEHLRDAGLEELAALLESLAVWEEISEEDIAALEKELKDALEKLRDQLLEE 545
Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKV 152
+EE +++ + EE+ ++
Sbjct: 546 RLEELIAKERLLEGHGLSSEERLEL 570
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 31.1 bits (71), Expect = 0.23
Identities = 9/40 (22%), Positives = 15/40 (37%)
Query: 59 RANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEE 98
+ L A +A+ +EEEEEE++
Sbjct: 264 KIKEALANPSAAAAAAAAAAAAAAAAAAAPAEEEEEEEDD 303
Score = 28.4 bits (64), Expect = 1.7
Identities = 11/48 (22%), Positives = 14/48 (29%), Gaps = 7/48 (14%)
Query: 66 VILPQAISASETLENNEWE-------EEEKEKEEEEEEEEEEKEKDED 106
P A E L N EEEE+E+D+
Sbjct: 257 FTFPLAEKIKEALANPSAAAAAAAAAAAAAAAAAAAPAEEEEEEEDDM 304
Score = 26.5 bits (59), Expect = 7.8
Identities = 6/20 (30%), Positives = 8/20 (40%)
Query: 129 VEEEEEEQKNKKKEEEEEEE 148
+EEEEEE+
Sbjct: 283 AAAAAAAAAAPAEEEEEEED 302
>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
Provisional.
Length = 606
Score = 31.3 bits (71), Expect = 0.24
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEE 108
NNE EE + E EE+E++ DED++
Sbjct: 578 NNEASEETETITVPENNEEDEEDDDEDDD 606
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 30.5 bits (69), Expect = 0.25
Identities = 14/61 (22%), Positives = 27/61 (44%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
K++ + ++ K+ + EKK + E +K E EE +++E EE +
Sbjct: 118 KDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAE 177
Query: 150 E 150
E
Sbjct: 178 E 178
Score = 28.6 bits (64), Expect = 1.3
Identities = 16/69 (23%), Positives = 31/69 (44%), Gaps = 4/69 (5%)
Query: 86 EEKEKEEEEEEE----EEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
++K + ++ + E EK+ +E + +K+ E +E EEEE ++ +
Sbjct: 118 KDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAE 177
Query: 142 EEEEEEEKE 150
E EE E
Sbjct: 178 EAPAEESAE 186
Score = 27.4 bits (61), Expect = 2.7
Identities = 15/61 (24%), Positives = 27/61 (44%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ +K E E++ E + + E+K E E+ +EEE EE E+ ++
Sbjct: 126 KAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPAEESA 185
Query: 144 E 144
E
Sbjct: 186 E 186
Score = 26.7 bits (59), Expect = 5.4
Identities = 18/66 (27%), Positives = 29/66 (43%), Gaps = 6/66 (9%)
Query: 92 EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK------EEEVEEEEEEQKNKKKEEEE 145
E + E + E K+E E K KK++ + +K K E++V E E +KK E
Sbjct: 96 EAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEA 155
Query: 146 EEEKEK 151
+
Sbjct: 156 AAVAAE 161
Score = 26.7 bits (59), Expect = 6.0
Identities = 13/53 (24%), Positives = 21/53 (39%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
+E N E EK+ E +E EE++ E+ EE ++ E
Sbjct: 134 EAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPAEESAE 186
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 31.1 bits (71), Expect = 0.25
Identities = 14/73 (19%), Positives = 27/73 (36%), Gaps = 6/73 (8%)
Query: 84 EEEEKEKEE------EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
E+EK+K E E + E+EK E + ++ E K KD + + ++
Sbjct: 441 IEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKA 500
Query: 138 NKKKEEEEEEEKE 150
+ +
Sbjct: 501 AATQPIVIKAGAR 513
Score = 28.4 bits (64), Expect = 2.4
Identities = 12/51 (23%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 102 EKDEDEEKKREKKEEEEEKQK-DKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
K E ++EKK+ EE K + + ++ +E E+ ++ + K+ E +
Sbjct: 434 AKAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKD 484
Score = 27.6 bits (62), Expect = 4.0
Identities = 11/67 (16%), Positives = 21/67 (31%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E EK E ++ E +D++ + K+ + V + N
Sbjct: 463 EREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAA 522
Query: 145 EEEEKEK 151
E K +
Sbjct: 523 REARKAQ 529
Score = 26.8 bits (60), Expect = 5.8
Identities = 14/48 (29%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 96 EEEEEKEKDEDEEKKR-EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
E+EK+K +E K R E ++ E++K +E ++ E + K K+
Sbjct: 440 AIEQEKKK-AEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKD 486
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 30.9 bits (70), Expect = 0.25
Identities = 13/57 (22%), Positives = 21/57 (36%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
+ + E +E E E + + + EE + +E E EEE K K
Sbjct: 196 DPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKP 252
Score = 30.5 bits (69), Expect = 0.43
Identities = 12/50 (24%), Positives = 19/50 (38%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
T++ E E + E E +E E ++ E EEE + K
Sbjct: 203 PTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKP 252
Score = 29.3 bits (66), Expect = 1.00
Identities = 15/55 (27%), Positives = 20/55 (36%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E E +E E E E E E EE E ++ E+ EEE +
Sbjct: 199 EPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPP 253
Score = 28.9 bits (65), Expect = 1.3
Identities = 14/55 (25%), Positives = 21/55 (38%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
L+ E E ++ E E E + E E E E+ +D EEE +
Sbjct: 195 LDPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKP 249
Score = 28.9 bits (65), Expect = 1.4
Identities = 10/58 (17%), Positives = 22/58 (37%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
+ E + +E E + + + + EE + ++ E+ EEE + K
Sbjct: 196 DPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPP 253
Score = 28.2 bits (63), Expect = 2.0
Identities = 15/62 (24%), Positives = 22/62 (35%)
Query: 99 EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNP 158
+ E E + + E E + E EE E ++ E+ EEE K P
Sbjct: 196 DPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPVP 255
Query: 159 VP 160
P
Sbjct: 256 KP 257
Score = 26.2 bits (58), Expect = 9.7
Identities = 14/59 (23%), Positives = 22/59 (37%)
Query: 94 EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
+ E E DE + + E + EE E EE ++ +EE + K V
Sbjct: 196 DPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNKPVKPPV 254
>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
Validated.
Length = 535
Score = 31.2 bits (71), Expect = 0.26
Identities = 17/85 (20%), Positives = 39/85 (45%), Gaps = 5/85 (5%)
Query: 61 NARLTVILPQAISASETLENNEWEEEEKEKEEE-----EEEEEEEKEKDEDEEKKREKKE 115
A++ + + + E++ ++K+ + E EEE K+ DE+ ++ + K+
Sbjct: 447 GAKIKIQKALKSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEVQETQLKEA 506
Query: 116 EEEEKQKDKKEEEVEEEEEEQKNKK 140
E ++ E+ +EE +E K
Sbjct: 507 TELQEFMINNEDLIEEAKELFGIKS 531
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 30.2 bits (69), Expect = 0.26
Identities = 10/33 (30%), Positives = 24/33 (72%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
++++++++++E E E+++KK EE +E QK
Sbjct: 83 RKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQK 115
Score = 28.7 bits (65), Expect = 1.1
Identities = 14/47 (29%), Positives = 28/47 (59%)
Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
+KK++KK++E E ++KK++ V EE ++ K + +KV +
Sbjct: 84 KKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAALLAAIKKVPL 130
Score = 27.5 bits (62), Expect = 2.2
Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 6/49 (12%)
Query: 102 EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
K D KR+KK+ K KK+E EE+++K EE +E++K
Sbjct: 74 PKLNDAYFKRKKKK------KKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 30.7 bits (69), Expect = 0.27
Identities = 10/46 (21%), Positives = 19/46 (41%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
N +E ++ E + +EEE D+D E + ++ D
Sbjct: 268 NYGYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNG 313
Score = 26.1 bits (57), Expect = 9.5
Identities = 7/40 (17%), Positives = 18/40 (45%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD 123
++E + +EEE+ + + +ED + +E+
Sbjct: 275 DDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCDNGA 314
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 31.3 bits (71), Expect = 0.27
Identities = 24/71 (33%), Positives = 38/71 (53%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
E E E+E E E EE KE+ E++ +K E EE E+ K+K + + +EE Q
Sbjct: 507 EKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLED 566
Query: 140 KKEEEEEEEKE 150
+ +E +E +E
Sbjct: 567 RLQELKELLEE 577
Score = 29.3 bits (66), Expect = 0.98
Identities = 23/79 (29%), Positives = 47/79 (59%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E LE E +EE E++ E+ E E+ ++ E+ + ++ +EE + +D+ +E E EE
Sbjct: 518 IELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEE 577
Query: 135 EQKNKKKEEEEEEEKEKVE 153
+ + ++EE EE +E+++
Sbjct: 578 LRLLRTRKEELEELRERLK 596
Score = 29.0 bits (65), Expect = 1.5
Identities = 26/88 (29%), Positives = 40/88 (45%), Gaps = 1/88 (1%)
Query: 67 ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEK-QKDKK 125
L + I E EE + E EE +EE +EK E E E+ EE +EK Q +
Sbjct: 498 ELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQL 557
Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+EE+ + E+ + K+ EE +
Sbjct: 558 KEELRQLEDRLQELKELLEELRLLRTRK 585
Score = 28.6 bits (64), Expect = 2.0
Identities = 29/92 (31%), Positives = 50/92 (54%)
Query: 62 ARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
L L Q ++LE +E E E +E EEE E E E+ + E+ + EE E++
Sbjct: 603 KELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEK 662
Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
++ E E+ E + +N+++ EE+ EE E++E
Sbjct: 663 VEELEAEIRRELQRIENEEQLEEKLEELEQLE 694
Score = 28.2 bits (63), Expect = 2.7
Identities = 26/69 (37%), Positives = 38/69 (55%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E EEE + EE EE EK + E++ EK EE+ EK + + EE EE+ E K ++
Sbjct: 302 ELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEE 361
Query: 142 EEEEEEEKE 150
+E EE+
Sbjct: 362 RLKELEERL 370
Score = 27.8 bits (62), Expect = 3.3
Identities = 26/75 (34%), Positives = 43/75 (57%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
EE EKE E E E +E +E+ +++E+ E EE ++ +KE EEE + + +E
Sbjct: 468 PEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEAL 527
Query: 145 EEEEKEKVEMYRNPV 159
+EE +EK+E N +
Sbjct: 528 KEELEEKLEKLENLL 542
Score = 27.4 bits (61), Expect = 4.8
Identities = 26/84 (30%), Positives = 40/84 (47%), Gaps = 1/84 (1%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
E E EE + +EE EE +E+ E KK+ K+ EE Q ++ + +
Sbjct: 561 LRQLEDRLQELKELLEELRLLRTRKEELEELRER-LKELKKKLKELEERLSQLEELLQSL 619
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
E E E + ++ EEE E E EK+
Sbjct: 620 ELSEAENELEEAEEELESELEKLN 643
Score = 27.0 bits (60), Expect = 5.1
Identities = 30/112 (26%), Positives = 46/112 (41%)
Query: 43 AEFQCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKE 102
E Q + + + I A L + E L+ E E E E + EEE +E
Sbjct: 219 EEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRE 278
Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
+ E+ EK E EE +++ +E E E E ++ EE E+ K E
Sbjct: 279 LERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEER 330
Score = 27.0 bits (60), Expect = 6.4
Identities = 26/70 (37%), Positives = 42/70 (60%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E++ E E E EE +E+ E+++ E +EE EE +K+ +E E E E + + +E
Sbjct: 470 EHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKE 529
Query: 144 EEEEEKEKVE 153
E EE+ EK+E
Sbjct: 530 ELEEKLEKLE 539
Score = 27.0 bits (60), Expect = 6.8
Identities = 23/84 (27%), Positives = 45/84 (53%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ E L++ E E+ E++ E+ E E E+ +E E + +E +E ++ +E E
Sbjct: 316 ELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEK 375
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
E E+ ++ K+ EE +E KE++
Sbjct: 376 ELEKALERLKQLEEAIQELKEELA 399
Score = 26.6 bits (59), Expect = 7.0
Identities = 22/75 (29%), Positives = 41/75 (54%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E E E+ E EEE +E ++ E+ ++E+++ E ++E E ++ E E E+E E+
Sbjct: 196 ELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELK 255
Query: 139 KKKEEEEEEEKEKVE 153
+ E E E E ++
Sbjct: 256 ARLLEIESLELEALK 270
Score = 26.6 bits (59), Expect = 7.2
Identities = 18/78 (23%), Positives = 40/78 (51%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E E + E+ E EE EE +EK + + +++ + E+ ++ K+ EE +++
Sbjct: 529 EELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEEL 588
Query: 139 KKKEEEEEEEKEKVEMYR 156
++ E +E K+K++
Sbjct: 589 EELRERLKELKKKLKELE 606
Score = 26.6 bits (59), Expect = 8.8
Identities = 24/68 (35%), Positives = 42/68 (61%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE E + E E+ E+ E E+E K+ +K EE +E+Q++++ E+ E EE+ + +EE
Sbjct: 188 EELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEE 247
Query: 144 EEEEEKEK 151
+E E+ K
Sbjct: 248 KERLEELK 255
Score = 26.3 bits (58), Expect = 9.7
Identities = 26/95 (27%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 62 ARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
+L+ +L E LE E +E +K +E +EE+EEEE E++ + ++R + EEE+++
Sbjct: 192 GQLSELLEDIEDLLEALEE-ELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKER 250
Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
++ + + E E + K EEE + + +
Sbjct: 251 LEELKARLLEIESLELEALKIREEELRELERLLEE 285
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 30.8 bits (69), Expect = 0.28
Identities = 22/68 (32%), Positives = 42/68 (61%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
NE E +++EEE+E+ EE +E +E E + E ++E+ ++ D+KE + E +E E +
Sbjct: 99 NEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENELAGEY 158
Query: 141 KEEEEEEE 148
EE +++
Sbjct: 159 IIEEVDDD 166
Score = 28.9 bits (64), Expect = 1.2
Identities = 19/65 (29%), Positives = 38/65 (58%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
++ + E + E E +EE+ +K+EEE+E ++ +E E E EE+ + ++ E+
Sbjct: 80 DKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEK 139
Query: 146 EEEKE 150
++EKE
Sbjct: 140 DDEKE 144
Score = 28.1 bits (62), Expect = 2.6
Identities = 21/61 (34%), Positives = 34/61 (55%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
++ E + + EE E+ EE E EEE + DED EK EK+ + E + + E + EE +
Sbjct: 105 TQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENELAGEYIIEEVD 164
Query: 135 E 135
+
Sbjct: 165 D 165
Score = 27.7 bits (61), Expect = 3.0
Identities = 24/83 (28%), Positives = 45/83 (54%), Gaps = 1/83 (1%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
A++ +N E E + EEE +K+E+E++ E+ EE EE + +EE ++E+
Sbjct: 78 ANDKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEV-VEEEYDDDED 136
Query: 134 EEQKNKKKEEEEEEEKEKVEMYR 156
E+ ++K+ + E +E E Y
Sbjct: 137 SEKDDEKESDAEGDENELAGEYI 159
Score = 27.3 bits (60), Expect = 3.7
Identities = 21/59 (35%), Positives = 33/59 (55%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
I E +EEE++++ EE EE EE E E+E E E+++EK+ D + +E E
Sbjct: 95 IIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENE 153
Score = 26.6 bits (58), Expect = 7.5
Identities = 21/65 (32%), Positives = 36/65 (55%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
E+E ++EEEE E E+ E+ E + EEE D+ E+ +E+E + + + E E
Sbjct: 100 EEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENELAGEYI 159
Query: 149 KEKVE 153
E+V+
Sbjct: 160 IEEVD 164
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 30.7 bits (70), Expect = 0.28
Identities = 24/76 (31%), Positives = 39/76 (51%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E E + EK+KEEE+ E +E+ E ++ EK E E EK ++E +E + +E
Sbjct: 214 EAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQE 273
Query: 136 QKNKKKEEEEEEEKEK 151
Q+ KE + E +
Sbjct: 274 QEELLKEGFKTEAESL 289
Score = 28.0 bits (63), Expect = 2.3
Identities = 14/64 (21%), Positives = 35/64 (54%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+E+ + E + E E E++ EK++E+++ E +++ +E + E+ + ++K
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLL 260
Query: 144 EEEE 147
E+E
Sbjct: 261 AEQE 264
Score = 27.3 bits (61), Expect = 3.7
Identities = 22/95 (23%), Positives = 47/95 (49%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
QA++A E E + E + E+E E++KE+++ E + +E ++ +K E E
Sbjct: 197 QALTAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAER 256
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYRN 164
E+ EQ+ + + +E+E+ E ++ +
Sbjct: 257 EKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQK 291
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 31.2 bits (71), Expect = 0.28
Identities = 23/73 (31%), Positives = 33/73 (45%), Gaps = 4/73 (5%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE + E EE+E EE EK E+ K+ E++ EE EK +EVE K
Sbjct: 493 EEAKTEFGEEKELLEELIEKLEEVRKELEEELEEVEK----LLDEVELLTGANSGGKTSL 548
Query: 144 EEEEEKEKVEMYR 156
E + + V +
Sbjct: 549 LELKAQIVVLAHM 561
>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
Length = 673
Score = 31.0 bits (70), Expect = 0.28
Identities = 15/68 (22%), Positives = 34/68 (50%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
KE + E E++ +++ D ++ + E+ KQ + + E EE E +K K + ++
Sbjct: 362 KEAQAEIEQKLAQEKAQLDAAVEKGELTPEQAKQIARAKLEPEERERLEKIDKAQAALKQ 421
Query: 148 EKEKVEMY 155
+ ++Y
Sbjct: 422 AQSAFDLY 429
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 30.5 bits (69), Expect = 0.29
Identities = 15/70 (21%), Positives = 29/70 (41%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
E E KE+ E EEE E + D + +E +Q+ ++ ++E+K+ K
Sbjct: 67 RERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDP 126
Query: 143 EEEEEEKEKV 152
++
Sbjct: 127 RGGTQDVVDK 136
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 30.9 bits (70), Expect = 0.30
Identities = 15/61 (24%), Positives = 31/61 (50%), Gaps = 3/61 (4%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
+ +E E D EE K +E+ + +E++E ++ ++ +EE E+K+KV
Sbjct: 304 KPSDEPSLPESDIHEE---IPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKV 360
Query: 153 E 153
+
Sbjct: 361 K 361
Score = 30.1 bits (68), Expect = 0.47
Identities = 15/60 (25%), Positives = 26/60 (43%), Gaps = 2/60 (3%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD--KKEEEVEEEEEEQKNKKK 141
+ ++ E + EE + DE + ++ + EE EE E++QK KKK
Sbjct: 304 KPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKK 363
Score = 27.1 bits (60), Expect = 4.6
Identities = 16/69 (23%), Positives = 32/69 (46%), Gaps = 6/69 (8%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
S +L ++ EE + +E+ E DE+ + E EE +E E E +++
Sbjct: 306 SDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKE------EVEKKQK 359
Query: 133 EEEQKNKKK 141
+++ K+K
Sbjct: 360 VKKKPRKRK 368
Score = 26.3 bits (58), Expect = 8.7
Identities = 13/58 (22%), Positives = 27/58 (46%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ ++ E + E+ DE+ E +D ++E+ EE +EE + K+K +
Sbjct: 304 KPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVK 361
>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
Toc86/159, G and M domains. The long precursor of the
86K protein originally described is proposed to have
three domains. The N-terminal A-domain is acidic,
repetitive, weakly conserved, readily removed by
proteolysis during chloroplast isolation, and not
required for protein translocation. The other domains
are designated G (GTPase) and M (membrane anchor); this
family includes most of the G domain and all of M
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 763
Score = 31.1 bits (70), Expect = 0.30
Identities = 20/86 (23%), Positives = 38/86 (44%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
+ E + E E + +EE E E D+ K K + + K++++ +EE + K
Sbjct: 384 DEEDSDIELEDSSDSDEESGEDEYDQLPPFKPLTKAQMAKLSKEQRKAYLEEYDYRVKLL 443
Query: 140 KKEEEEEEEKEKVEMYRNPVPMYRNP 165
+K++ EE K M + + P
Sbjct: 444 QKKQWREELKRMKMMKKFGKEIGELP 469
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 30.9 bits (70), Expect = 0.30
Identities = 20/62 (32%), Positives = 32/62 (51%)
Query: 99 EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNP 158
EE DE ++ K++ ++ +EEE EEEEEE++ + E EE E+ P
Sbjct: 377 EEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFEVPEMP 436
Query: 159 VP 160
+P
Sbjct: 437 MP 438
Score = 29.8 bits (67), Expect = 0.68
Identities = 15/27 (55%), Positives = 18/27 (66%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEK 109
W EEE+E+EEEEEEE E E +E
Sbjct: 401 WAEEEEEEEEEEEEEAAEAEAPMEEPV 427
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 31.0 bits (70), Expect = 0.31
Identities = 9/27 (33%), Positives = 21/27 (77%)
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEE 117
+E+E++ +E E D ++E+KR ++E++
Sbjct: 371 SDEDEDDSDEDEVDYEKERKRRREEDK 397
>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 379
Score = 30.7 bits (69), Expect = 0.32
Identities = 10/65 (15%), Positives = 21/65 (32%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
EE + E E ++ E++ K + + +E+ + KK E +
Sbjct: 198 SLEETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAELSKAL 257
Query: 149 KEKVE 153
Sbjct: 258 AALEA 262
Score = 26.1 bits (57), Expect = 9.8
Identities = 13/69 (18%), Positives = 24/69 (34%)
Query: 64 LTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD 123
L+ L A + LE + + K + +EE K + K E +D
Sbjct: 210 LSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAELSKALAALEAANAADED 269
Query: 124 KKEEEVEEE 132
++ + E
Sbjct: 270 PQDRDAAVE 278
>gnl|CDD|235668 PRK06007, fliF, flagellar MS-ring protein; Reviewed.
Length = 542
Score = 30.6 bits (70), Expect = 0.33
Identities = 19/69 (27%), Positives = 32/69 (46%)
Query: 67 ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
L + E E EE +E+EE E E DE+ E+++ ++E + E +
Sbjct: 461 RLRPLLPEEELAEEEAAAEEAALEEDEEAALEVELSDDEELEEEKAEEELKYEDLLKRLR 520
Query: 127 EEVEEEEEE 135
E +E+ EE
Sbjct: 521 ELAKEDPEE 529
Score = 28.7 bits (65), Expect = 1.8
Identities = 18/64 (28%), Positives = 38/64 (59%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E+E EEE EE ++++E + ++EE +++K EEE++ E+ ++ ++ +E
Sbjct: 466 LPEEELAEEEAAAEEAALEEDEEAALEVELSDDEELEEEKAEEELKYEDLLKRLRELAKE 525
Query: 145 EEEE 148
+ EE
Sbjct: 526 DPEE 529
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is
found in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 30.5 bits (69), Expect = 0.34
Identities = 17/56 (30%), Positives = 34/56 (60%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
A SAS+ L+ E ++E EEE E+++ E + +K+++++++EEKQ +
Sbjct: 60 TAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQVRE 115
Score = 28.2 bits (63), Expect = 1.8
Identities = 13/51 (25%), Positives = 31/51 (60%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
+E +++ E +++ E++ EK + E KQ +K++E +++EE+Q +
Sbjct: 65 ASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQVRE 115
Score = 26.7 bits (59), Expect = 6.4
Identities = 10/53 (18%), Positives = 29/53 (54%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
+E +++ + DEE E++ E+++ + K+ + ++E+++Q+ K
Sbjct: 59 STAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEK 111
Score = 26.7 bits (59), Expect = 7.1
Identities = 11/49 (22%), Positives = 28/49 (57%)
Query: 97 EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
++ + + E E+ EEE +KD+ E + +++Q+ +++EE++
Sbjct: 64 SASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQ 112
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 30.9 bits (70), Expect = 0.35
Identities = 15/64 (23%), Positives = 30/64 (46%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
K E +EE+ EE++++ + KR+K ++K K K + KK + E +
Sbjct: 185 KSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLESK 244
Query: 148 EKEK 151
+ +
Sbjct: 245 LEAQ 248
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 30.8 bits (70), Expect = 0.35
Identities = 17/63 (26%), Positives = 30/63 (47%), Gaps = 8/63 (12%)
Query: 58 IRANARLTVILPQA--ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
+ ARL V+ P+A + A + EEE E++++ +DE+E+ E +
Sbjct: 243 LALAARL-VLAPRATRLPAPP-----QPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRA 296
Query: 116 EEE 118
EE
Sbjct: 297 LEE 299
Score = 28.5 bits (64), Expect = 1.7
Identities = 11/44 (25%), Positives = 21/44 (47%), Gaps = 3/44 (6%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+ EEE +D+D+ + EEE++ +D+ EE+
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDP---PEDEEEQDDAEDRALEEI 300
Score = 27.3 bits (61), Expect = 4.5
Identities = 10/40 (25%), Positives = 18/40 (45%)
Query: 92 EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
+ EEE + + E+EE+Q D ++ +EE
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299
>gnl|CDD|225962 COG3428, COG3428, Predicted membrane protein [Function unknown].
Length = 494
Score = 30.5 bits (69), Expect = 0.35
Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 1/48 (2%)
Query: 88 KEKEEEEEEEEEEKEKDED-EEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E E E E E E E + +K + E + D EEEV E
Sbjct: 123 GEAEAELVEREIAFELAALVREARVKKLDALELAEADTPEEEVAEVLA 170
>gnl|CDD|233194 TIGR00930, 2a30, K-Cl cotransporter. [Transport and binding
proteins, Other].
Length = 953
Score = 30.8 bits (70), Expect = 0.37
Identities = 16/56 (28%), Positives = 22/56 (39%), Gaps = 6/56 (10%)
Query: 88 KEKEEEEEEEEE--EKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
K + E E EE + EK RE K+ + K K + E Q N +K
Sbjct: 835 KPQTESMEAFEEMIRPFRLHKTEKDREAKDPKMTWTKPWK----ITDAELQSNVRK 886
Score = 29.7 bits (67), Expect = 0.73
Identities = 20/121 (16%), Positives = 44/121 (36%), Gaps = 33/121 (27%)
Query: 64 LTVILPQAISASETLENNEWE----------EEEKEKEEEEEE------------EEEEK 101
LT++LP ++ + W+ +++ E+E+++ E
Sbjct: 774 LTLLLPYLLTTKKV-----WKKCKIRIFVGAQKDDRSEQEKKDMATLLYKFRIDAEVIVV 828
Query: 102 EKDEDEEKKREKKEEEEEKQ------KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMY 155
D + + + E E EE K +K+ E ++ + K + E + + Y
Sbjct: 829 LMDINAKPQTESMEAFEEMIRPFRLHKTEKDREAKDPKMTWTKPWKITDAELQSNVRKSY 888
Query: 156 R 156
R
Sbjct: 889 R 889
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease M16C
(pfam05193).
Length = 248
Score = 30.3 bits (69), Expect = 0.39
Identities = 11/45 (24%), Positives = 24/45 (53%)
Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
DE E+K ++E E+ ++K E ++E+ ++ + EE +
Sbjct: 4 DETLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQSTP 48
>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
Length = 568
Score = 30.7 bits (69), Expect = 0.39
Identities = 15/76 (19%), Positives = 29/76 (38%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
S++ +LE E+ E EEE + +E+E E + + + E V +
Sbjct: 475 FSSANSLERGADEDYLIVNGTNEPYEEEVIKTNENENFPLENESPCDGISRGTSELPVND 534
Query: 132 EEEEQKNKKKEEEEEE 147
++ N+ E
Sbjct: 535 DDNNNLNRGNTLRSEG 550
>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion
protein; Provisional.
Length = 230
Score = 30.3 bits (68), Expect = 0.41
Identities = 18/76 (23%), Positives = 36/76 (47%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
A + L + K K+ E +EK +++EEK + E+ + +++ + E E
Sbjct: 113 AKKFLSQFQSTTITKPKKVLINEVLQEKTINQNEEKTSLQNEKVLSPELSEEKSDSELET 172
Query: 134 EEQKNKKKEEEEEEEK 149
+ QK + KE++ E
Sbjct: 173 QPQKTQLKEKKPSIEH 188
>gnl|CDD|178439 PLN02847, PLN02847, triacylglycerol lipase.
Length = 633
Score = 30.6 bits (69), Expect = 0.42
Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 9/73 (12%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE--------EEEKQKDKKEEEVE-EEEE 134
E + ++EEEEE ED +EE E EK+ ++E EV+ + +E
Sbjct: 461 ESSSSDGSGHDDEEEEEPLLSEDRVITSSVEEEVTEGELWYELEKELQRQETEVDAQAQE 520
Query: 135 EQKNKKKEEEEEE 147
E+ KE EEE
Sbjct: 521 EEAAAAKEITEEE 533
Score = 26.4 bits (58), Expect = 8.8
Identities = 20/80 (25%), Positives = 39/80 (48%), Gaps = 1/80 (1%)
Query: 65 TVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
+V S+S+ +++ EEEE E+ +E+ + E E E+E ++Q+ +
Sbjct: 455 SVEHKSESSSSDGSGHDDEEEEEPLLSEDRVITSSVEEEVTEGELWYEL-EKELQRQETE 513
Query: 125 KEEEVEEEEEEQKNKKKEEE 144
+ + +EEE + EEE
Sbjct: 514 VDAQAQEEEAAAAKEITEEE 533
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 30.3 bits (67), Expect = 0.42
Identities = 19/68 (27%), Positives = 43/68 (63%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E E+EE++ E+E+++ E+E + + + K E+++++ E++K K E+E ++
Sbjct: 150 ELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAI 209
Query: 142 EEEEEEEK 149
E E+E++K
Sbjct: 210 ELEQEKQK 217
Score = 29.1 bits (64), Expect = 1.1
Identities = 23/95 (24%), Positives = 52/95 (54%), Gaps = 1/95 (1%)
Query: 64 LTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK- 122
+L A A + + E + +KE E + + + E EE+K E+++++ EK+
Sbjct: 111 AAALLLAACFADDQDKKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGI 170
Query: 123 DKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
+ +++ E+E+QK ++++++ E+EK+K N
Sbjct: 171 ELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIAN 205
Score = 28.3 bits (62), Expect = 2.0
Identities = 20/70 (28%), Positives = 45/70 (64%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E++E++ E+E+++ EKE E + + ++E+++ +++K++ E E+++ K
Sbjct: 150 ELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAI 209
Query: 144 EEEEEKEKVE 153
E E+EK+K E
Sbjct: 210 ELEQEKQKTE 219
Score = 27.9 bits (61), Expect = 3.1
Identities = 19/70 (27%), Positives = 45/70 (64%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E + E+E E + + E+EK+K E E++K E+++++ +K E+E+E+++ +N
Sbjct: 161 EKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTEN 220
Query: 139 KKKEEEEEEE 148
+K++ +E++
Sbjct: 221 EKQDLIKEQK 230
>gnl|CDD|224739 COG1826, TatA, Sec-independent protein secretion pathway components
[Intracellular trafficking and secretion].
Length = 94
Score = 28.9 bits (65), Expect = 0.43
Identities = 9/45 (20%), Positives = 26/45 (57%)
Query: 110 KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
K+ + + E ++ K EE++++++E + + +EE ++ E+
Sbjct: 40 KKAASDVKNELDEELKLEELDDKKKELTAELQATKEELDQLASEL 84
Score = 28.1 bits (63), Expect = 1.0
Identities = 16/55 (29%), Positives = 27/55 (49%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
K+ + + E E+ + EE +KKE E Q K+E + E ++ KKK +
Sbjct: 40 KKAASDVKNELDEELKLEELDDKKKELTAELQATKEELDQLASELKEDLKKKAKP 94
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 30.0 bits (68), Expect = 0.44
Identities = 22/67 (32%), Positives = 27/67 (40%), Gaps = 6/67 (8%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+E+ E +EE K+ + K EK E E EEEEE E
Sbjct: 118 NDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEI------AE 171
Query: 144 EEEEEKE 150
EEEE KE
Sbjct: 172 EEEEVKE 178
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 29.4 bits (66), Expect = 0.45
Identities = 13/32 (40%), Positives = 21/32 (65%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
+EEEEEEEE EK++ DE + ++ ++ E
Sbjct: 92 GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHE 123
Score = 27.1 bits (60), Expect = 3.2
Identities = 12/36 (33%), Positives = 23/36 (63%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
+E+EEEEEE E++++ DE + ++ ++ E E
Sbjct: 92 GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAE 127
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 30.4 bits (69), Expect = 0.46
Identities = 16/51 (31%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
E+L E +E + + EE +EEE ++++D + E K+ EE K + E
Sbjct: 52 ESLRE-EIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDE 101
Score = 30.0 bits (68), Expect = 0.52
Identities = 22/98 (22%), Positives = 36/98 (36%), Gaps = 6/98 (6%)
Query: 49 VGPAQYHKPIRANARLTVILPQAISASETLE------NNEWEEEEKEKEEEEEEEEEEKE 102
G I + R + P +LE N E EE EE ++
Sbjct: 3 GGSVNEDSSIPLSLRPLIDKPAEDEDPLSLEELIERINQERGSFRDLTEESLREEIAKEA 62
Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
D ++ +++E++EE D EE + EE K +
Sbjct: 63 AKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARD 100
Score = 26.9 bits (60), Expect = 6.6
Identities = 16/50 (32%), Positives = 26/50 (52%)
Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
+D EE RE+ +E K +E + EE++EE N EE ++ +E
Sbjct: 45 SFRDLTEESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEE 94
Score = 26.6 bits (59), Expect = 6.9
Identities = 14/42 (33%), Positives = 21/42 (50%)
Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
E EE K+ K + E +EEE + ++ EE K+ VE
Sbjct: 52 ESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVE 93
Score = 26.6 bits (59), Expect = 8.0
Identities = 9/44 (20%), Positives = 25/44 (56%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
++ + + EE +E+E DE+++ ++ ++ ++ K +EV
Sbjct: 59 AKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDEV 102
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 30.0 bits (68), Expect = 0.52
Identities = 19/77 (24%), Positives = 31/77 (40%), Gaps = 5/77 (6%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKR-----EKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EE E K EEE ++K+K + ++ K+ E + + E K
Sbjct: 167 EEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKK 226
Query: 139 KKKEEEEEEEKEKVEMY 155
KK+ + EK E+Y
Sbjct: 227 LKKKRSIAPDNEKSEVY 243
Score = 26.2 bits (58), Expect = 8.4
Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 3/59 (5%)
Query: 94 EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
EEE E K + E+E R KK+++++K+K KK E + E +V
Sbjct: 166 EEEVELLKARLEEE---RAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQV 221
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 30.1 bits (68), Expect = 0.52
Identities = 25/89 (28%), Positives = 41/89 (46%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
A +A+E EE++ KE E E E +++ DE E E E + K+
Sbjct: 246 AATAAAEVPPAEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNE 305
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVEMYRNP 158
E Q+N++ EEE + +K VE+ +
Sbjct: 306 ELTTVRQENEELEEEYKIKKRTVELLPDA 334
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 30.2 bits (68), Expect = 0.53
Identities = 14/105 (13%), Positives = 35/105 (33%), Gaps = 10/105 (9%)
Query: 56 KPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
+R N +I P E + K+ E+ E K + +E ++++
Sbjct: 306 WTLRRNRLRMIITPWRAPEL-------HAENAEIKKTRTAEKNEAKAR--KKEIAQKRRA 356
Query: 116 EEEEKQKDKKEEEVEE-EEEEQKNKKKEEEEEEEKEKVEMYRNPV 159
E E ++ ++E + + +++ + P
Sbjct: 357 AEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPS 401
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 29.9 bits (67), Expect = 0.53
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
S + ++ E EE E+EEEEEEEEEE + E+++ +
Sbjct: 216 PNSSSSSGSDSESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKA 260
Score = 28.7 bits (64), Expect = 1.3
Identities = 15/43 (34%), Positives = 25/43 (58%)
Query: 56 KPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEE 98
+ + + S+S + ++E EE+E+E+EEEEEEEE
Sbjct: 201 SKVASPTPRLGLRTPPNSSSSSGSDSESEEDEEEEEEEEEEEE 243
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 30.0 bits (67), Expect = 0.54
Identities = 22/61 (36%), Positives = 34/61 (55%)
Query: 77 TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
LE E E + + + ++ EEEE E DE+ E+ E K+E EK + +EE + +EE
Sbjct: 222 PLEKLEEEIDILKNDGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEI 281
Query: 137 K 137
K
Sbjct: 282 K 282
Score = 28.1 bits (62), Expect = 2.5
Identities = 18/51 (35%), Positives = 34/51 (66%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
+EEE E +EE +E +ED+++ EK E +++K K +EE++ E + K++
Sbjct: 242 EEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKDRN 292
Score = 27.7 bits (61), Expect = 3.9
Identities = 19/58 (32%), Positives = 35/58 (60%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
N+ ++ +E+E E +EE EE E+D+ E ++ E +EEK K +E + E + + +N
Sbjct: 235 NDGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKDRN 292
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 29.7 bits (67), Expect = 0.57
Identities = 19/52 (36%), Positives = 32/52 (61%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
+EE E++E++E +EE+ D+E+ E EEE+ ++EEEV+ E E
Sbjct: 13 DEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEAEA 64
>gnl|CDD|226111 COG3583, COG3583, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 309
Score = 29.8 bits (67), Expect = 0.59
Identities = 19/70 (27%), Positives = 31/70 (44%), Gaps = 2/70 (2%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE--EEEQKNKKKE 142
+ +KE E+ + E +D K +EK +E + + EV E +E + KE
Sbjct: 209 NSKTDKENEQVDFETVYVEDPSMNKGKEKVVQEGQPGTKEVTFEVVTENGKETSREVVKE 268
Query: 143 EEEEEEKEKV 152
EE K+ V
Sbjct: 269 VVVEEAKDAV 278
>gnl|CDD|215120 PLN02188, PLN02188, polygalacturonase/glycoside hydrolase family
protein.
Length = 404
Score = 29.8 bits (67), Expect = 0.59
Identities = 13/37 (35%), Positives = 17/37 (45%)
Query: 49 VGPAQYHKPIRANARLTVILPQAISASETLENNEWEE 85
+GP Q+H P + LT L A S N+W E
Sbjct: 81 IGPVQFHGPCTNVSSLTFTLKAATDLSRYGSGNDWIE 117
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 29.2 bits (66), Expect = 0.60
Identities = 18/42 (42%), Positives = 28/42 (66%), Gaps = 5/42 (11%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
EE+E + +EE+EEEE+++E D+++E EEEE K K
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDE-----SEEEESPVKKVK 146
Score = 28.4 bits (64), Expect = 0.97
Identities = 17/47 (36%), Positives = 31/47 (65%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
P IS + + E E ++ E++EEEE++EE+ ++DE EE++ K+
Sbjct: 98 PVHISGQHLVASEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKK 144
Score = 28.4 bits (64), Expect = 1.2
Identities = 12/37 (32%), Positives = 29/37 (78%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK 114
E +E +++E+++EEE++EE++++++ E+EE +K
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKV 145
Score = 28.1 bits (63), Expect = 1.4
Identities = 18/45 (40%), Positives = 33/45 (73%)
Query: 67 ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
I Q + ASE E+++ EE+E+E+++EE+++E+E E++E KK
Sbjct: 101 ISGQHLVASEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKV 145
Score = 27.3 bits (61), Expect = 3.0
Identities = 15/38 (39%), Positives = 27/38 (71%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
+E E +++EE+EE+E DE+++ + E +EEE +K K
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
Score = 26.5 bits (59), Expect = 4.5
Identities = 14/37 (37%), Positives = 25/37 (67%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
+E+E +++EE++E+++DEE E + EEEE K
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKV 145
>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein. Bacillus subtilis stage V
sporulation protein R is involved in spore cortex
formation. Little is known about cortex biosynthesis,
except that it depends on several sigma E controlled
genes, including spoVR.
Length = 427
Score = 29.9 bits (68), Expect = 0.60
Identities = 14/49 (28%), Positives = 24/49 (48%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
EEE+ +EE EE + + + +K EEE ++ + EE +E
Sbjct: 177 LEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQE 225
Score = 27.2 bits (61), Expect = 4.7
Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 4/55 (7%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE----EEEQKN 138
K EEE+ ++E++E + + Q++K EEE EE Q++
Sbjct: 172 PIKLSLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQED 226
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 29.9 bits (68), Expect = 0.61
Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 6/68 (8%)
Query: 87 EKEKEEEEEEEEEEKEKDE-DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+ E EKE ++ ++E R +K+ E K EEV E+E K+K E
Sbjct: 807 AGLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKE-----KEKLAEY 861
Query: 146 EEEKEKVE 153
+ + K+E
Sbjct: 862 QVKLAKLE 869
Score = 29.1 bits (66), Expect = 1.3
Identities = 27/116 (23%), Positives = 40/116 (34%), Gaps = 15/116 (12%)
Query: 49 VGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEK------- 101
VG + + AN I LE E + E+ E
Sbjct: 754 VGSEELEDRLEANED-------DIKGLANLEELEILSPDPEEPPVEVTAVVGGAELFLPL 806
Query: 102 EKDEDEEKKREKKEEEEEK-QKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
D + + E+E EK +K+ E + E K EE E+EKEK+ Y+
Sbjct: 807 AGLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQ 862
>gnl|CDD|227478 COG5149, TOA1, Transcription initiation factor IIA, large chain
[Transcription].
Length = 293
Score = 29.6 bits (66), Expect = 0.62
Identities = 15/81 (18%), Positives = 31/81 (38%), Gaps = 5/81 (6%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDE-----EKKREKKEEEEEKQKDKKEE 127
SA TL N +W E + + E + E + +D + E+ E + + ++
Sbjct: 163 SAITTLRNTDWRERLIDDTQSEWDGERMRRRDGKQGIHQYERLSEGPAHAFKGKPTTAKD 222
Query: 128 EVEEEEEEQKNKKKEEEEEEE 148
E + + + + E E
Sbjct: 223 EGMFSDLDDSDVDSGDSEIEG 243
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 29.9 bits (67), Expect = 0.63
Identities = 9/81 (11%), Positives = 16/81 (19%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
SA E E E E +++ +
Sbjct: 110 SAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQP 169
Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
E ++ E E +
Sbjct: 170 SHEDSPEEPEPPTSEPEPDSP 190
Score = 29.2 bits (65), Expect = 1.2
Identities = 14/84 (16%), Positives = 29/84 (34%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E+ +E E E E + E + + + + ++ N ++
Sbjct: 170 SHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQ 229
Query: 142 EEEEEEEKEKVEMYRNPVPMYRNP 165
E E+E + E P P +R+
Sbjct: 230 AVEHEDEPTEPEREGPPFPGHRSH 253
Score = 27.6 bits (61), Expect = 3.6
Identities = 12/77 (15%), Positives = 20/77 (25%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
A + + E + + EE E+EE+ + E EE
Sbjct: 60 AVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLS 119
Query: 134 EEQKNKKKEEEEEEEKE 150
E + E
Sbjct: 120 PENTSGSSPESPASHSP 136
Score = 27.6 bits (61), Expect = 3.7
Identities = 13/73 (17%), Positives = 23/73 (31%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
N+ + E+ EE E E E + E + +E E Q
Sbjct: 160 NQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTP 219
Query: 141 KEEEEEEEKEKVE 153
++ ++ VE
Sbjct: 220 QQAPSPNTQQAVE 232
Score = 27.2 bits (60), Expect = 5.3
Identities = 10/78 (12%), Positives = 17/78 (21%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
S SE++ + +E E E E E
Sbjct: 95 SGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPES 154
Query: 133 EEEQKNKKKEEEEEEEKE 150
N++ + E
Sbjct: 155 HNPSPNQQPSSFLQPSHE 172
Score = 26.9 bits (59), Expect = 5.7
Identities = 7/79 (8%), Positives = 16/79 (20%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
S + + E + ++ + + EE E
Sbjct: 124 SGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTS 183
Query: 135 EQKNKKKEEEEEEEKEKVE 153
E + + E
Sbjct: 184 EPEPDSPGPPQSETPTSSP 202
Score = 26.5 bits (58), Expect = 7.5
Identities = 10/73 (13%), Positives = 24/73 (32%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
+ + EE + E E + + E + +E + + + +
Sbjct: 170 SHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQ 229
Query: 139 KKKEEEEEEEKEK 151
+ E+E E E+
Sbjct: 230 AVEHEDEPTEPER 242
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 30.0 bits (67), Expect = 0.65
Identities = 25/70 (35%), Positives = 41/70 (58%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
EN EEEEKE+ EE EE EE E E+K + ++ EE ++++K+ E EEE+ +
Sbjct: 104 SGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPK 163
Query: 136 QKNKKKEEEE 145
+ + ++ E
Sbjct: 164 RGSLEENNGE 173
Score = 29.2 bits (65), Expect = 1.2
Identities = 23/72 (31%), Positives = 41/72 (56%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E++ E E E+EE+EE EE +E +E E + +++ + ++ ++EE E E EE++
Sbjct: 102 EDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEK 161
Query: 139 KKKEEEEEEEKE 150
K+ EE E
Sbjct: 162 PKRGSLEENNGE 173
Score = 28.8 bits (64), Expect = 1.4
Identities = 20/76 (26%), Positives = 42/76 (55%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E + +E+ E E ++E++E+ RE++EE EE + K E+ + + ++ +K+E+E
Sbjct: 94 SEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKE 153
Query: 145 EEEEKEKVEMYRNPVP 160
E E+E+ +
Sbjct: 154 PEPEEEEKPKRGSLEE 169
Score = 28.1 bits (62), Expect = 2.3
Identities = 22/70 (31%), Positives = 41/70 (58%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E + +E+ E E E++E EE + E++E EE + K E++ + + E+ K+++E
Sbjct: 94 SEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKE 153
Query: 144 EEEEEKEKVE 153
E EE+EK +
Sbjct: 154 PEPEEEEKPK 163
Score = 28.1 bits (62), Expect = 2.3
Identities = 27/90 (30%), Positives = 47/90 (52%)
Query: 68 LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
P ++ NE EEE+++E EE EE E+ + + +++ + EE QK++KE
Sbjct: 95 EPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEP 154
Query: 128 EVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
E EEEE+ ++ +E E K++ N
Sbjct: 155 EPEEEEKPKRGSLEENNGEFMTHKLKHTEN 184
Score = 27.3 bits (60), Expect = 4.3
Identities = 18/67 (26%), Positives = 43/67 (64%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E E+ +++++E E E+ K + EE+++ +EEE+ +++++ + + EEEE+++ K++
Sbjct: 199 GKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEE 258
Query: 142 EEEEEEE 148
E E
Sbjct: 259 IERRRAE 265
Score = 27.3 bits (60), Expect = 4.5
Identities = 25/67 (37%), Positives = 38/67 (56%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
EE K+K EE + EE+E+ +E+ K EEEEK++ K+E E E +K +K E+
Sbjct: 217 EELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPED 276
Query: 145 EEEEKEK 151
E +K
Sbjct: 277 GLSEDKK 283
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 29.7 bits (67), Expect = 0.66
Identities = 10/39 (25%), Positives = 25/39 (64%)
Query: 115 EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ E+EK+ E E+ +++++ + K+++E EE + V+
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQ 111
Score = 28.9 bits (65), Expect = 0.97
Identities = 21/61 (34%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E+EKE E E+ K+K+E E KK+++ EE + QK K ++ ++E++ + K E
Sbjct: 75 EDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQK-KIDQYIKEKQLSSSLQTKLTE 133
Query: 145 E 145
E
Sbjct: 134 E 134
Score = 28.9 bits (65), Expect = 1.0
Identities = 10/34 (29%), Positives = 22/34 (64%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
+ E+E+E + E+ ++K+E E +K+K+ +E
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEE 106
Score = 28.6 bits (64), Expect = 1.4
Identities = 13/49 (26%), Positives = 29/49 (59%)
Query: 101 KEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
K +DE E + E+ +K+++ + ++ +E EE + +KK ++ +EK
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEK 121
Score = 27.8 bits (62), Expect = 2.7
Identities = 7/37 (18%), Positives = 22/37 (59%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
+ E+E+E E ++ ++K+ + ++++E ++ K
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKA 109
Score = 27.8 bits (62), Expect = 3.0
Identities = 13/59 (22%), Positives = 32/59 (54%)
Query: 96 EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
+ E+EKE + + K++EE + K KKE E + +++ ++ +E++ + ++
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKL 131
Score = 26.2 bits (58), Expect = 9.5
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 121 QKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
QK + E+E+ E + KK+EE E +K+K
Sbjct: 72 QKPEDEKELSASSLEAEQAKKKEEAEAKKKK 102
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 30.0 bits (68), Expect = 0.69
Identities = 24/67 (35%), Positives = 46/67 (68%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E+E ++ EE +EE EE EK+ + + ++K EE+ ++ +++ EE+++E EE + K KE +
Sbjct: 227 EKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELK 286
Query: 145 EEEEKEK 151
E +EK +
Sbjct: 287 ELKEKAE 293
Score = 29.6 bits (67), Expect = 0.82
Identities = 24/72 (33%), Positives = 39/72 (54%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
EE +K+ +E E+ EE +E+ E E+ + KKEE E +K E+ E+E +
Sbjct: 337 EERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEEL 396
Query: 140 KKEEEEEEEKEK 151
+K +EE EE+
Sbjct: 397 EKAKEEIEEEIS 408
Score = 27.0 bits (60), Expect = 6.0
Identities = 26/73 (35%), Positives = 46/73 (63%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+E +E +EE EE E+E E E ++K E+K E E++ ++ ++E+EE EE+ K K+ +E
Sbjct: 231 KELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKE 290
Query: 145 EEEEKEKVEMYRN 157
+ EE K+ +
Sbjct: 291 KAEEYIKLSEFYE 303
>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
Length = 382
Score = 29.8 bits (68), Expect = 0.69
Identities = 16/78 (20%), Positives = 32/78 (41%), Gaps = 4/78 (5%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK--- 137
E EEEE+E EEEE + E + K+E+++ + ++ E+ + Q
Sbjct: 225 GEEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEAQPVNVQPAQF 284
Query: 138 -NKKKEEEEEEEKEKVEM 154
+ E +++
Sbjct: 285 QSFDPPPLATTEPRNLDL 302
Score = 28.3 bits (64), Expect = 2.1
Identities = 15/74 (20%), Positives = 33/74 (44%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEEE+E+E EEEE + + + ++E+++ +++E +E + + +
Sbjct: 226 EEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEAQPVNVQPAQFQ 285
Query: 144 EEEEEKEKVEMYRN 157
+ RN
Sbjct: 286 SFDPPPLATTEPRN 299
>gnl|CDD|227391 COG5058, LAG1, Protein transporter of the TRAM (translocating
chain-associating membrane) superfamily, longevity
assurance factor [Intracellular trafficking and
secretion].
Length = 395
Score = 29.8 bits (67), Expect = 0.72
Identities = 16/71 (22%), Positives = 24/71 (33%), Gaps = 18/71 (25%)
Query: 51 PAQYHK--PIRANARLTVILPQAIS-------------ASETLENNEWEEEEKEKEEEEE 95
Q +K + IL QA+ A + E ++E + E EEE
Sbjct: 328 ATQQYKCWISLP---IVFILLQALQLVNIYWLFLIVRVAYRVIWEGELKDERSDDESEEE 384
Query: 96 EEEEEKEKDED 106
+ E E D
Sbjct: 385 SDLESSEDKND 395
>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
secretion].
Length = 407
Score = 29.6 bits (67), Expect = 0.72
Identities = 12/72 (16%), Positives = 25/72 (34%), Gaps = 3/72 (4%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE---EEEEKQKDKKEEEVEEEEEE 135
+ EE K ++ E E+ E K ++ +EK ++ EE+ E
Sbjct: 83 PAEKKREERKAAKKIERSTPSLIERKTQEVKDSGEEIAEMMRDEKVPIRELEEIPPEFVA 142
Query: 136 QKNKKKEEEEEE 147
++ +
Sbjct: 143 LYKQEIQSPTRL 154
Score = 29.6 bits (67), Expect = 0.78
Identities = 16/86 (18%), Positives = 39/86 (45%), Gaps = 10/86 (11%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK--KEEEEEKQKDKKEEEVEEEE 133
+T E + ++ E + ++ E++ E+ K + ++ E + ++ KD EE E
Sbjct: 65 QTREAAKVKDAEAQLLKDPAEKKREERKAAKKIERSTPSLIERKTQEVKDSGEEIAEMMR 124
Query: 134 EEQKNKKKEEE--------EEEEKEK 151
+E+ ++ EE ++E +
Sbjct: 125 DEKVPIRELEEIPPEFVALYKQEIQS 150
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 29.9 bits (67), Expect = 0.74
Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 1/63 (1%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ-KNKKKEEEEEEEKEK 151
E + EK EEKKR+K+EEE + + EE E K K++ E E +K +
Sbjct: 487 SEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLE 546
Query: 152 VEM 154
++
Sbjct: 547 HDL 549
>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
trafficking and secretion].
Length = 340
Score = 29.5 bits (67), Expect = 0.75
Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 11/67 (16%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR-----------EKKEEEEEKQKDKKEEEV 129
E EE E EKE+ EEE+EEE+++ E K+ K +K K+K +E++
Sbjct: 9 EELEELEAEKEKIEEEDEEEEKEGWFERLKQGLSKTKKNFGKGIKGLFLKKIKEKLDEDL 68
Query: 130 EEEEEEQ 136
EE EE
Sbjct: 69 LEELEEL 75
Score = 29.1 bits (66), Expect = 1.1
Identities = 22/74 (29%), Positives = 38/74 (51%), Gaps = 3/74 (4%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ---KDKKEEEVEEEEEEQKNKK 140
++ EEE EE E EKEK E+E+++ EK+ E + K+ + + K
Sbjct: 1 MGFFEKLEEELEELEAEKEKIEEEDEEEEKEGWFERLKQGLSKTKKNFGKGIKGLFLKKI 60
Query: 141 KEEEEEEEKEKVEM 154
KE+ +E+ E++E
Sbjct: 61 KEKLDEDLLEELEE 74
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 29.5 bits (66), Expect = 0.75
Identities = 17/68 (25%), Positives = 33/68 (48%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E K++++EE E+ E + EE ++ +KE+E+EE E + + E E
Sbjct: 205 PETSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGE 264
Query: 146 EEEKEKVE 153
+ E + +
Sbjct: 265 DSEDGETK 272
Score = 28.3 bits (63), Expect = 2.1
Identities = 18/84 (21%), Positives = 36/84 (42%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E E EE+ +D +E++ E+ E + +D+ E+ E+ E + ++ E
Sbjct: 221 ERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPESYITSE 280
Query: 144 EEEEEKEKVEMYRNPVPMYRNPVP 167
E E +M + + +P
Sbjct: 281 YIERISEIRKMKDERLSSLASMMP 304
Score = 28.3 bits (63), Expect = 2.2
Identities = 12/68 (17%), Positives = 39/68 (57%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
N +EE + E ++++++E+ E++ + E +++ ++ +E+E ++N +
Sbjct: 194 NPLDEEAPDMTPETSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVE 253
Query: 141 KEEEEEEE 148
++ ++E+E
Sbjct: 254 RDSDDEDE 261
Score = 27.5 bits (61), Expect = 3.2
Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 2/75 (2%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDE--DEEKKREKKEEEEEKQKDKKEEE 128
+E +E W EE+ ++ +E+E EE + + DE++ E E+ E K + E
Sbjct: 221 ERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPESYITSE 280
Query: 129 VEEEEEEQKNKKKEE 143
E E + K E
Sbjct: 281 YIERISEIRKMKDER 295
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 29.5 bits (67), Expect = 0.75
Identities = 21/65 (32%), Positives = 37/65 (56%), Gaps = 4/65 (6%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEE----EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
E E E +KEKDE +++ E E+E EE+ D +E+ E+ Q ++ +EE E+
Sbjct: 418 EIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQIKEEIEQV 477
Query: 149 KEKVE 153
+ ++E
Sbjct: 478 RLELE 482
>gnl|CDD|216881 pfam02115, Rho_GDI, RHO protein GDP dissociation inhibitor.
Length = 200
Score = 29.2 bits (66), Expect = 0.76
Identities = 13/44 (29%), Positives = 26/44 (59%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
E+E ++ EE E+E+++D K +K +E ++ DK++E
Sbjct: 3 EQEPTAEQAEEIAAEDEEDEDSVNYKPPPQKSLKEIQELDKEDE 46
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 29.9 bits (68), Expect = 0.77
Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 3/73 (4%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE---EEEEQKNKK 140
E+ E ++E K + E + REK + + + ++E E E E+K ++
Sbjct: 136 PPEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQE 195
Query: 141 KEEEEEEEKEKVE 153
E + E+ +EK
Sbjct: 196 LEAQLEQLQEKAA 208
Score = 26.8 bits (60), Expect = 7.0
Identities = 15/84 (17%), Positives = 38/84 (45%), Gaps = 5/84 (5%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
P+ + + E +++ + + E+ + + E + E E E E+++ + E +
Sbjct: 140 PENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQ 199
Query: 129 VEEEEEEQKNKKKEEEEEEEKEKV 152
+E+ +E+ E +E K+K
Sbjct: 200 LEQLQEKA-----AETSQERKQKR 218
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 29.7 bits (66), Expect = 0.77
Identities = 21/82 (25%), Positives = 45/82 (54%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E E ++ E ++E++++E + E K E + + ++ ++KEE+ E EE E+ + +E
Sbjct: 283 EIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADE 342
Query: 145 EEEEKEKVEMYRNPVPMYRNPV 166
E ++ +E R + NP+
Sbjct: 343 LNELEKGIEEKRRQMESATNPI 364
Score = 29.3 bits (65), Expect = 0.88
Identities = 19/65 (29%), Positives = 32/65 (49%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E ++EE E E +EE + DE E+K E E EE ++ + + E E+ +K+
Sbjct: 295 EQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKR 354
Query: 142 EEEEE 146
+ E
Sbjct: 355 RQMES 359
Score = 27.4 bits (60), Expect = 3.9
Identities = 24/70 (34%), Positives = 38/70 (54%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E E EE EE+ + E E E E +E+++EE E + KEE + +E +K+
Sbjct: 265 EEEGEEGMSEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKE 324
Query: 142 EEEEEEEKEK 151
E++E EE E+
Sbjct: 325 EDDENEENER 334
Score = 26.2 bits (57), Expect = 10.0
Identities = 15/70 (21%), Positives = 37/70 (52%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
SE + + EE E + +EE + + ++ +E+++ ++ EE E + +E E E+
Sbjct: 290 SEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKG 349
Query: 135 EQKNKKKEEE 144
++ +++ E
Sbjct: 350 IEEKRRQMES 359
>gnl|CDD|179867 PRK04598, tatA, twin arginine translocase protein A; Provisional.
Length = 81
Score = 27.9 bits (62), Expect = 0.79
Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 3/41 (7%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKK---REKKEEEEEKQKDKKE 126
+K EEE + +KD D E K + K E EK+KDK++
Sbjct: 40 KKAMSEEESAKANKKDADFEPKNLEQAKTEAAAEKKKDKEQ 80
Score = 26.4 bits (58), Expect = 2.7
Identities = 11/36 (30%), Positives = 22/36 (61%)
Query: 116 EEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
EEE + +KK+ + E + EQ + E++++KE+
Sbjct: 45 EEESAKANKKDADFEPKNLEQAKTEAAAEKKKDKEQ 80
>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
Endonuclease that cleaves the 5'-overhanging flap
structure that is generated by displacement synthesis
when DNA polymerase encounters the 5'-end of a
downstream Okazaki fragment. Has 5'-endo-/exonuclease
and 5'-pseudo-Y-endonuclease activities. Cleaves the
junction between single and double-stranded regions of
flap DNA.
Length = 338
Score = 29.5 bits (67), Expect = 0.81
Identities = 13/30 (43%), Positives = 19/30 (63%)
Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
K E E++RE +EE EEK ++ E+ EE
Sbjct: 87 KAETLEERREIREEAEEKWEEALEKGDLEE 116
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 29.8 bits (67), Expect = 0.81
Identities = 20/81 (24%), Positives = 39/81 (48%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
LE + EE KE+E E E +E+ +E ++++++ ++EE+QK +E + E
Sbjct: 2569 LEKAKIEESRKERERIESETQEDNTDEEQINRQQQERLQKEEEQKAYSQERLNREVSGTD 2628
Query: 138 NKKKEEEEEEEKEKVEMYRNP 158
+ K ++ Y N
Sbjct: 2629 DTNKNHNTGHDESNYGRYSNK 2649
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 29.4 bits (66), Expect = 0.81
Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 5/75 (6%)
Query: 83 WEEEEKEKEEEEE-----EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
WEEE ++ + EE EE +++ D K+ K E+ E + K++ E + E +K
Sbjct: 10 WEEELRKAQSLEELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKK 69
Query: 138 NKKKEEEEEEEKEKV 152
KK ++E+ + + KV
Sbjct: 70 KKKFKKEKVDVRVKV 84
>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6. The VP6 protein a
minor protein in the core of the virion is probably the
viral helicase.
Length = 322
Score = 29.3 bits (65), Expect = 0.82
Identities = 13/49 (26%), Positives = 32/49 (65%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
+W E + KE+E ++E+ ++++ D E ++++K +EE K+ ++ V+
Sbjct: 26 DWFESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVD 74
Score = 28.5 bits (63), Expect = 1.7
Identities = 18/48 (37%), Positives = 30/48 (62%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E E +E++ KDED +++R E E+KQK KEE +E E+ +++
Sbjct: 29 ESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVDRR 76
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 29.0 bits (65), Expect = 0.82
Identities = 17/64 (26%), Positives = 34/64 (53%), Gaps = 5/64 (7%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
++ +K+++E+ EEEK+ + D++K EE E D EE + +E N + +
Sbjct: 65 TKKSKKKDKEKLTEEEKKPESDDDK-----TEENENDPDNNEESGDSQESASANSLSDID 119
Query: 145 EEEE 148
E++
Sbjct: 120 NEDD 123
Score = 29.0 bits (65), Expect = 0.93
Identities = 12/51 (23%), Positives = 26/51 (50%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
EEE+K + ++++ EE E + D +EE ++ D E+ ++ +
Sbjct: 78 EEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDIDNEDDMDDSD 128
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 29.7 bits (67), Expect = 0.83
Identities = 15/67 (22%), Positives = 29/67 (43%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
K K E +E K K+ KK+ + +E E ++ +++ K E + E
Sbjct: 346 KVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLE 405
Query: 148 EKEKVEM 154
E +++E
Sbjct: 406 ELKRLEN 412
Score = 27.0 bits (60), Expect = 5.6
Identities = 12/58 (20%), Positives = 28/58 (48%), Gaps = 2/58 (3%)
Query: 62 ARLTVILPQAISASETLENN--EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
R + L + S++ + N +W E +K +EE+ + + +E K+ E +++
Sbjct: 359 KRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGKQK 416
Score = 26.6 bits (59), Expect = 7.2
Identities = 13/71 (18%), Positives = 32/71 (45%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
N+W ++ + E +E ++++ +K+ + +E K E + +E++ K
Sbjct: 340 NDWARKKVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKL 399
Query: 141 KEEEEEEEKEK 151
+ + EE K
Sbjct: 400 EFIKLEELKRL 410
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 28.8 bits (65), Expect = 0.83
Identities = 14/55 (25%), Positives = 32/55 (58%)
Query: 94 EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
+E +EE +E +KK++K + ++ D ++ +EE +++ +E+EE+E
Sbjct: 74 DEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128
Score = 26.5 bits (59), Expect = 4.3
Identities = 18/71 (25%), Positives = 37/71 (52%), Gaps = 1/71 (1%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
L +N+W +E +EE E+ D++KK+ + +++ D + + +EE+
Sbjct: 60 LGDNKWGLRSWYPVDEIDEEIIPLEEKFDKKKKKFM-DGDDDIIDDDILPDDDFDEEDLD 118
Query: 138 NKKKEEEEEEE 148
+ E+EE+EE
Sbjct: 119 EEDDEDEEDEE 129
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 29.4 bits (66), Expect = 0.84
Identities = 15/86 (17%), Positives = 43/86 (50%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E+ + + K+ E+ +++ K+ ++ ++ K KK + ++K+K KK+ ++++ +
Sbjct: 35 LESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYD 94
Query: 135 EQKNKKKEEEEEEEKEKVEMYRNPVP 160
+ K ++ + + R P P
Sbjct: 95 NFFDSKNNSKQFAGPLAISLMRKPKP 120
Score = 27.5 bits (61), Expect = 4.2
Identities = 20/102 (19%), Positives = 50/102 (49%), Gaps = 7/102 (6%)
Query: 71 AISASET--LENNEWEEEEKEKEEEEEEEEEEKEK--DEDEEKKREKKEEEEEKQKDKKE 126
+IS+ T L+N + + K + E + K+ + D KK +K +++ K+ +K +
Sbjct: 8 SISSENTIDLKNPLFIKNLKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTD 67
Query: 127 EEVEEEEEEQKNKKKEEEE---EEEKEKVEMYRNPVPMYRNP 165
+ + ++ + K KKK +++ +++ + +N + P
Sbjct: 68 DLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGP 109
>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
Length = 444
Score = 29.3 bits (66), Expect = 0.87
Identities = 20/76 (26%), Positives = 31/76 (40%), Gaps = 3/76 (3%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
L E + KEE E ++E E+ EEEEEK +K E + +
Sbjct: 214 KSELVEYSSEIVNEIKEEVCETDDESAY---VSLDAEEEFEEEEEKPAEKSTESTFQLSK 270
Query: 135 EQKNKKKEEEEEEEKE 150
E K+ +E + +E
Sbjct: 271 ETSIAKESKELPDGEE 286
>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
Length = 222
Score = 29.2 bits (65), Expect = 0.92
Identities = 14/58 (24%), Positives = 35/58 (60%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
K K + +++E +EK++ +E + + +K+E+ +Q + + ++ ++ NK+K EE
Sbjct: 147 KTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNNKRKAMEE 204
Score = 26.1 bits (57), Expect = 9.0
Identities = 11/42 (26%), Positives = 30/42 (71%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
K + +K+E +EKQ+ +E E++ E++EQ +++ + + + +++
Sbjct: 149 KSKNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQ 190
Score = 26.1 bits (57), Expect = 9.9
Identities = 12/54 (22%), Positives = 33/54 (61%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
+N + E +EK+++ EE E + EK++ ++ + + + +++ +Q + + + EE
Sbjct: 151 KNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNNKRKAMEE 204
>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Exonuclease-1 (EXO1) is involved
in multiple, eukaryotic DNA metabolic pathways,
including DNA replication processes (5' flap DNA
endonuclease activity and double stranded DNA
5'-exonuclease activity), DNA repair processes (DNA
mismatch repair (MMR) and post-replication repair
(PRR)), recombination, and telomere integrity. EXO1
functions in the MMS2 error-free branch of the PRR
pathway in the maintenance and repair of stalled
replication forks. Studies also suggest that EXO1 plays
both structural and catalytic roles during MMR-mediated
mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
family of structure-specific, 5' nucleases. These
nucleases contain a PIN (PilT N terminus) domain with a
helical arch/clamp region (I domain) of variable length
(approximately 43 residues in EXO1 PIN domains) and a
H3TH (helix-3-turn-helix) domain, an atypical
helix-hairpin-helix-2-like region. Both the H3TH domain
(not included here) and the helical arch/clamp region
are involved in DNA binding. Nucleases within this group
also have a carboxylate-rich active site that is
involved in binding essential divalent metal ion
cofactors (Mg2+/Mn2+). EXO1 nucleases also have
C-terminal Mlh1- and Msh2-binding domains which allow
interaction with MMR and PRR proteins, respectively.
Length = 210
Score = 29.0 bits (66), Expect = 0.93
Identities = 10/33 (30%), Positives = 14/33 (42%)
Query: 124 KKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
KK E E E ++N +K E E + E
Sbjct: 85 KKGTEEERRERREENLEKALELLREGNRSEARE 117
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 29.3 bits (65), Expect = 0.98
Identities = 21/104 (20%), Positives = 48/104 (46%), Gaps = 5/104 (4%)
Query: 56 KPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
KP+ + +L V + S++ E + EK+ ++ +++E++EKEK+ D++KK+E +
Sbjct: 165 KPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEG 224
Query: 116 EEE-----EKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
+ + E +E N + E ++ +
Sbjct: 225 FKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKD 268
Score = 27.7 bits (61), Expect = 3.3
Identities = 14/39 (35%), Positives = 24/39 (61%)
Query: 97 EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
EEE + + E+ KREKK+ E+EK+ ++ + E +E
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDE 123
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 29.3 bits (66), Expect = 1.0
Identities = 14/20 (70%), Positives = 16/20 (80%)
Query: 83 WEEEEKEKEEEEEEEEEEKE 102
W EE+E+EEEEEEEEEE
Sbjct: 402 WAAEEEEEEEEEEEEEEEPV 421
>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
motility and secretion / Intracellular trafficking and
secretion].
Length = 696
Score = 29.1 bits (66), Expect = 1.0
Identities = 13/31 (41%), Positives = 21/31 (67%)
Query: 121 QKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
K K+++ E+E+ ++ KK+EEEEEEE
Sbjct: 324 SKRKEQQAEEKEKPAEEAKKEEEEEEEESVD 354
Score = 28.0 bits (63), Expect = 2.4
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
+ K+++ EEK+K +E + EEEEEE+++ +
Sbjct: 325 KRKEQQAEEKEKPAEEAKKEEEEEEEESVDDVLLIDP 361
Score = 28.0 bits (63), Expect = 2.8
Identities = 13/31 (41%), Positives = 22/31 (70%)
Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
K+K+++ EE E+ EE K +++EEEEE +
Sbjct: 325 KRKEQQAEEKEKPAEEAKKEEEEEEEESVDD 355
Score = 26.8 bits (60), Expect = 6.1
Identities = 16/40 (40%), Positives = 24/40 (60%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
+E++ EEKEK +E KK E++EEEE + +E E
Sbjct: 326 RKEQQAEEKEKPAEEAKKEEEEEEEESVDDVLLIDPIELE 365
Score = 26.8 bits (60), Expect = 7.3
Identities = 12/43 (27%), Positives = 25/43 (58%)
Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
KR++++ EE+++ ++ ++ EEEEEE+ + E E
Sbjct: 323 LSKRKEQQAEEKEKPAEEAKKEEEEEEEESVDDVLLIDPIELE 365
>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2). All proteins
in this family for which functions are known are flap
endonucleases that generate the 3' incision next to DNA
damage as part of nucleotide excision repair. This
family is related to many other flap endonuclease
families including the fen1 family. This family is based
on the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1034
Score = 29.5 bits (66), Expect = 1.0
Identities = 16/88 (18%), Positives = 36/88 (40%)
Query: 65 TVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
+ P E+ + + E + E + + + ++ E+ E K E + K
Sbjct: 642 MEVEPMESEKEESESDGSFIEVDSVSSTLELQVPSKSQPTDESEENAENKVASIEGEHRK 701
Query: 125 KEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
+ E++ +E E+ N EEE++ +
Sbjct: 702 EIEDLLFDESEEDNIVGMIEEEKDADDF 729
Score = 28.7 bits (64), Expect = 1.6
Identities = 25/90 (27%), Positives = 39/90 (43%), Gaps = 6/90 (6%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEE--EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
Q S S+ + +E E K E E +E E+ DE EE EEE+ D K E
Sbjct: 673 QVPSKSQPTDESEENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNE 732
Query: 128 ----EVEEEEEEQKNKKKEEEEEEEKEKVE 153
+EE E + N E+ + +++ +
Sbjct: 733 WQDISLEELEALEANLLAEQNSLKAQKQQQ 762
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 28.8 bits (65), Expect = 1.1
Identities = 24/71 (33%), Positives = 37/71 (52%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
KEKE EEEEE+ E+ +E K+ EE ++ + E EEE +++ KK +
Sbjct: 29 VLSIKEKERLREEEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTD 88
Query: 144 EEEEEKEKVEM 154
E EK++M
Sbjct: 89 EASSIIEKLQM 99
Score = 28.1 bits (63), Expect = 2.2
Identities = 25/91 (27%), Positives = 39/91 (42%)
Query: 63 RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
R E + + EE E++ E E E+E E+ KK ++ EK +
Sbjct: 39 REEEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQ 98
Query: 123 DKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
+ E E EE EEE + +E + E +E E
Sbjct: 99 MQIEREQEEWEEELERLIEEAKAEGYEEGYE 129
Score = 27.7 bits (62), Expect = 2.8
Identities = 24/80 (30%), Positives = 40/80 (50%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E E E EE E +E EE E++ E E E+ +EE K+ D+ +E+ + +
Sbjct: 41 EEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQ 100
Query: 136 QKNKKKEEEEEEEKEKVEMY 155
+ +++E EEE E+ E
Sbjct: 101 IEREQEEWEEELERLIEEAK 120
Score = 26.9 bits (60), Expect = 5.8
Identities = 21/70 (30%), Positives = 42/70 (60%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+E+E+ +EEEEE+ E+ +E+ DE K+ ++ EE+ + + EE +EE +K +
Sbjct: 33 KEKERLREEEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASS 92
Query: 144 EEEEEKEKVE 153
E+ + ++E
Sbjct: 93 IIEKLQMQIE 102
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 28.7 bits (65), Expect = 1.1
Identities = 14/47 (29%), Positives = 23/47 (48%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
N E E +EE+ ++ D+D+E + E EEE + D +E
Sbjct: 33 NAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79
Score = 27.9 bits (63), Expect = 2.2
Identities = 13/43 (30%), Positives = 27/43 (62%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK 113
A + L+ + E+++ + E+E+E++EEE + D E+ RE+
Sbjct: 41 AAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEARER 83
Score = 27.2 bits (61), Expect = 3.8
Identities = 14/40 (35%), Positives = 27/40 (67%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEE 108
P A +A+ T E E +E++ E++++++E+E E DE+E
Sbjct: 32 PNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71
Score = 26.8 bits (60), Expect = 5.3
Identities = 13/48 (27%), Positives = 22/48 (45%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
E E +EE +D+D++ + E +++EEE EE E
Sbjct: 35 AAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEARE 82
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 29.1 bits (66), Expect = 1.1
Identities = 28/83 (33%), Positives = 47/83 (56%), Gaps = 3/83 (3%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE---EEEKQKDKKEEEVEEEEE 134
++ E +EE KE EEE +E EEE EK + KK KK EK++ ++ +E+
Sbjct: 327 VDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKL 386
Query: 135 EQKNKKKEEEEEEEKEKVEMYRN 157
++ ++ EEE +E KE++E +
Sbjct: 387 SEELEELEEELKELKEELESLYS 409
>gnl|CDD|224273 COG1354, scpA, Rec8/ScpA/Scc1-like protein (kleisin family)
[Replication, recombination, and repair].
Length = 248
Score = 28.8 bits (65), Expect = 1.1
Identities = 21/79 (26%), Positives = 41/79 (51%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+EE+E E+EE EE ++ EE +R K+ E + +++ +V + + + KKE
Sbjct: 77 KEEEEAEDEELEEPRDELVARLEEYERYKEAAELLAELEEERRDVFSKIKPEIKIKKERR 136
Query: 145 EEEEKEKVEMYRNPVPMYR 163
EE ++++R + R
Sbjct: 137 PVEEISLIDLFRAYQKILR 155
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 29.0 bits (65), Expect = 1.2
Identities = 18/55 (32%), Positives = 32/55 (58%), Gaps = 1/55 (1%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK-EEEVEEEEEEQKNK 139
+E + EE+ + + +EKK+E+ + +EEKQ+D + E+ +E EE K K
Sbjct: 335 SSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEELKEK 389
Score = 29.0 bits (65), Expect = 1.4
Identities = 14/41 (34%), Positives = 26/41 (63%)
Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
E EK + +QK+KK+EE + +EE+Q++ + E+ +E
Sbjct: 341 ENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 28.9 bits (65), Expect = 1.2
Identities = 20/76 (26%), Positives = 40/76 (52%), Gaps = 8/76 (10%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ--------KDKKEEEVEEEEEE 135
++EE++++E E+E++ KE E +E +++ K DK EE E E E
Sbjct: 16 DDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSFLPDKAREEKEAELRE 75
Query: 136 QKNKKKEEEEEEEKEK 151
+ ++ +++E KE+
Sbjct: 76 ELREEFLKKQEAVKEE 91
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 28.7 bits (64), Expect = 1.2
Identities = 21/75 (28%), Positives = 35/75 (46%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
E E E E +E +E + E E + E+ KK + + +K+ D+ E VE EEE +
Sbjct: 157 ELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPEEELIS 216
Query: 139 KKKEEEEEEEKEKVE 153
+E + +E
Sbjct: 217 DLVKETLNLAPKDIE 231
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 28.6 bits (65), Expect = 1.2
Identities = 14/36 (38%), Positives = 24/36 (66%)
Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
E ++ + +++EEE EEEEEE++ ++ EE E E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 28.6 bits (65), Expect = 1.5
Identities = 14/33 (42%), Positives = 23/33 (69%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
+ + EEEEEE E+E++E+EE++ E+ E E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 28.6 bits (65), Expect = 1.6
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE 116
+ E EEEEEE EE+E++E+EE+ E + E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 28.2 bits (64), Expect = 1.9
Identities = 21/57 (36%), Positives = 32/57 (56%), Gaps = 5/57 (8%)
Query: 58 IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK 114
IR+ T + AI LE + E +EEEEE EEEE+E++E+E ++ E +
Sbjct: 207 IRSIKLYTSKIADAI-----LEGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.8 bits (63), Expect = 2.7
Identities = 15/36 (41%), Positives = 23/36 (63%)
Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
E +Q E EEEEE ++ +++EEEEE E+ + E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.8 bits (63), Expect = 2.8
Identities = 13/33 (39%), Positives = 23/33 (69%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
+ + E EEEEEE +E++E+EE++ ++ E E
Sbjct: 226 QGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.1 bits (61), Expect = 4.5
Identities = 12/36 (33%), Positives = 23/36 (63%)
Query: 99 EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E ++ E + E++E EEE++++++EE E E E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.1 bits (61), Expect = 5.1
Identities = 13/36 (36%), Positives = 24/36 (66%)
Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E ++ + E E+++++ EEE EEEEEE+ + + E
Sbjct: 223 EGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 27.1 bits (61), Expect = 5.3
Identities = 15/31 (48%), Positives = 21/31 (67%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
E EEEEEE E++E+EE++ E +E E E
Sbjct: 228 RLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain. This family consists
of several ATP synthase E chain sequences which are
components of the CF(0) subunit.
Length = 83
Score = 27.4 bits (61), Expect = 1.2
Identities = 9/34 (26%), Positives = 22/34 (64%)
Query: 113 KKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
KK+EE+ ++ + +E+ +E+ + KK+ ++E
Sbjct: 33 KKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEA 66
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 29.2 bits (65), Expect = 1.2
Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 2/50 (4%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDE--DEEKKREKKEEEEEKQKDKKEEE 128
N EE E E E E + KDE ++++ E KQ+ E+E
Sbjct: 89 NNGEEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKE 138
>gnl|CDD|235604 PRK05778, PRK05778, 2-oxoglutarate ferredoxin oxidoreductase
subunit beta; Validated.
Length = 301
Score = 28.7 bits (65), Expect = 1.3
Identities = 12/51 (23%), Positives = 25/51 (49%), Gaps = 1/51 (1%)
Query: 113 KKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV-EMYRNPVPMY 162
++ ++ K K EE+ + + ++ +K EEE K + Y+N P +
Sbjct: 221 REYYKKRVYKLKLEEDYDPTDRDKAAEKMLEEELGGKIPIGVFYKNERPTF 271
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 28.9 bits (65), Expect = 1.3
Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 1/62 (1%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
EW E+ ++K++ E E + +K ++ EE KKEEE ++ EE+ +
Sbjct: 844 KEWIEDLRQKKKLIERLIEAINQYR-AKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNE 902
Query: 141 KE 142
+E
Sbjct: 903 EE 904
Score = 27.4 bits (61), Expect = 4.7
Identities = 14/73 (19%), Positives = 29/73 (39%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E + ++ + E+ E+ KK E+ ++ E++ +EE ++ K
Sbjct: 858 ERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGK 917
Query: 141 KEEEEEEEKEKVE 153
E E EK+
Sbjct: 918 LPENGTELVEKLA 930
Score = 26.6 bits (59), Expect = 8.4
Identities = 19/106 (17%), Positives = 43/106 (40%), Gaps = 11/106 (10%)
Query: 58 IRANARLTVILPQAISASETLENNEWEEEEKEKEEEE-----------EEEEEEKEKDED 106
+ N + S + L++ ++E E K+E ++ E ++
Sbjct: 778 LIDNLNDAREIKDKESKAIDLDDIDFELELIGKQEINIDYILELLQTFNDKNGAYESLKE 837
Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
++ K+ E+ +QK K E + E + + KK + E+ E+ +
Sbjct: 838 LIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYI 883
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 28.0 bits (63), Expect = 1.3
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEE 108
EE++EEEEEE+EK+E EE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEE 95
Score = 27.2 bits (61), Expect = 1.8
Identities = 13/24 (54%), Positives = 18/24 (75%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDE 107
E+++EEEEEEEEKE+ E+E
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEEE 96
Score = 26.8 bits (60), Expect = 2.5
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEE 108
+E++EEEEEEE++++ +EE
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 26.1 bits (58), Expect = 4.6
Identities = 12/25 (48%), Positives = 18/25 (72%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEE 108
E++KEEEEEEEE+E+ ++E
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEAAA 99
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 28.9 bits (65), Expect = 1.3
Identities = 13/65 (20%), Positives = 28/65 (43%), Gaps = 3/65 (4%)
Query: 92 EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE---EEQKNKKKEEEEEEE 148
+ + + K+ + ++ E E K+ ++ E +E+E EE N EE+ +
Sbjct: 56 DTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLD 115
Query: 149 KEKVE 153
+ E
Sbjct: 116 ENLEE 120
Score = 28.1 bits (63), Expect = 2.2
Identities = 18/63 (28%), Positives = 30/63 (47%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+ +E + EEE E E K+ E+ E EK+ ++ EE + +EE+ + EE
Sbjct: 61 PDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEE 120
Query: 146 EEE 148
E
Sbjct: 121 LSE 123
Score = 27.3 bits (61), Expect = 4.5
Identities = 17/68 (25%), Positives = 29/68 (42%), Gaps = 2/68 (2%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
+ K +E + +E+ D E E KE EE + +KE EE +++K
Sbjct: 55 KDTGGKPDVPPSKEFLDLEEEILDLEA--EIKEVEENLESLEKEINELEEWLNVLDEEKS 112
Query: 143 EEEEEEKE 150
+E +E
Sbjct: 113 FLDENLEE 120
>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain. This alpha
helical domain is found in a set of bacterial plasmid
replication proteins. The domain is found to the
C-terminus of the primase/polymerase domain. Mutants of
this domain are defective in template binding,
dinucleotide formation and conformation change prior to
DNA extension.
Length = 135
Score = 28.2 bits (63), Expect = 1.4
Identities = 20/57 (35%), Positives = 31/57 (54%)
Query: 99 EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMY 155
E K+++ED E+ EK +EE K K + VE EE K K+ EE +++K +
Sbjct: 1 EGKKEEEDSEEDFEKLKEEMAKYDRFKGKTVEAIREEVCKKIKKSLEELKEKKKAIL 57
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 29.0 bits (65), Expect = 1.4
Identities = 20/88 (22%), Positives = 39/88 (44%), Gaps = 8/88 (9%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE----KKEEEEEKQ----KD 123
I A E W+ E + + E+ E E+ ++E + ++E K E E + +
Sbjct: 270 ILAETEAEVAAWKAETRREAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIER 329
Query: 124 KKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
+KE E+E +E + + +EE +
Sbjct: 330 QKETELEPQERSYFINAAQRQAQEEAKA 357
Score = 26.7 bits (59), Expect = 7.8
Identities = 15/104 (14%), Positives = 34/104 (32%), Gaps = 5/104 (4%)
Query: 58 IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
A + A + + E++ + E+ E + E E + K E
Sbjct: 226 AENEAEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEAEVAAWKAET 285
Query: 118 EEKQKDKKEE-----EVEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
+ + + + E+ + EQ+ + + E E + R
Sbjct: 286 RREAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIER 329
Score = 26.3 bits (58), Expect = 8.4
Identities = 19/94 (20%), Positives = 34/94 (36%), Gaps = 5/94 (5%)
Query: 60 ANARLTVILPQAISASETLENNEWEEEEKEKEEEEE-----EEEEEKEKDEDEEKKREKK 114
+ T + PQ S + +EE K E E + E EE +R ++
Sbjct: 328 ERQKETELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQAEAAVETARETEEAERAEQ 387
Query: 115 EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
+ ++E+VE + K + E + E
Sbjct: 388 AALVAAAEAAEQEQVEIAVRAEAAKAEAEAQAAE 421
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 28.9 bits (65), Expect = 1.4
Identities = 19/112 (16%), Positives = 47/112 (41%), Gaps = 6/112 (5%)
Query: 46 QCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDE 105
+V QY P + + +S T + + E +++++ E+K +
Sbjct: 1275 PKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARK 1334
Query: 106 DEEKKREKKEEEEE------KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
+ K R K+ + + + KK + E++++ + E+E++E+ E
Sbjct: 1335 KKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDED 1386
Score = 28.5 bits (64), Expect = 2.2
Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 12/84 (14%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK-----------EEEVEEE 132
++ K+K E+ EEEE +ED+E E EEE +E+VE+
Sbjct: 1049 KDIIKKKSEKITAEEEEGA-EEDDEADDEDDEEELGAAVSYDYLLSMPIWSLTKEKVEKL 1107
Query: 133 EEEQKNKKKEEEEEEEKEKVEMYR 156
E + K+KE E+ + +M+
Sbjct: 1108 NAELEKKEKELEKLKNTTPKDMWL 1131
Score = 28.1 bits (63), Expect = 2.5
Identities = 26/93 (27%), Positives = 44/93 (47%), Gaps = 8/93 (8%)
Query: 62 ARLTVILPQAISASETLENNE----WEEE----EKEKEEEEEEEEEEKEKDEDEEKKREK 113
+L L + E L+N W E+ E+ EE+EE EE+E K++ + K +
Sbjct: 1105 EKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKG 1164
Query: 114 KEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
K + K K KK+E+ +++ K+KK
Sbjct: 1165 KASKLRKPKLKKKEKKKKKSSADKSKKASVVGN 1197
Score = 27.7 bits (62), Expect = 3.8
Identities = 22/100 (22%), Positives = 46/100 (46%), Gaps = 18/100 (18%)
Query: 76 ETLENNEWEEEEKEKEEEE-----------EEEEEEKEKDEDEEKKREKKEEEEEKQK-- 122
E +E E E+KEKE E+ E+ ++ +E E++E+ EK+ +E++ K
Sbjct: 1102 EKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSK 1161
Query: 123 -----DKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
K + +++E++K K ++ ++ R
Sbjct: 1162 TKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRV 1201
Score = 27.3 bits (61), Expect = 5.5
Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 1/74 (1%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E LE E E EEKE +E+ + + K K K + KK+E+++K+ + +
Sbjct: 1139 EALEEQE-EVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGN 1197
Query: 136 QKNKKKEEEEEEEK 149
K +E+ + +
Sbjct: 1198 SKRVDSDEKRKLDD 1211
Score = 26.5 bits (59), Expect = 7.7
Identities = 27/109 (24%), Positives = 47/109 (43%), Gaps = 26/109 (23%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEE------------------KEKDEDEEKKR 111
+ + E EE ++ ++E++EEE KEK E +
Sbjct: 1052 IKKKSEKITAEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPIWSLTKEKVEKLNAEL 1111
Query: 112 EKKEEEEEKQKDKKEE--------EVEEEEEEQKNKKKEEEEEEEKEKV 152
EKKE+E EK K+ + + EE EEQ+ +++E +E++ K
Sbjct: 1112 EKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKS 1160
>gnl|CDD|223363 COG0286, HsdM, Type I restriction-modification system
methyltransferase subunit [Defense mechanisms].
Length = 489
Score = 28.9 bits (65), Expect = 1.5
Identities = 17/73 (23%), Positives = 35/73 (47%), Gaps = 1/73 (1%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E++E E + EE E + + + + ++EEE+ D+ +E+ E + E +
Sbjct: 416 REFKEIEGFSKSVSLEEIRENDYNLNPGRYVAEEEEEDIDL-DELLQELSEIDAELAELE 474
Query: 141 KEEEEEEEKEKVE 153
KE EE ++ +
Sbjct: 475 KELEEILKELLLN 487
>gnl|CDD|235285 PRK04335, PRK04335, cell division protein ZipA; Provisional.
Length = 313
Score = 28.6 bits (64), Expect = 1.5
Identities = 16/78 (20%), Positives = 34/78 (43%), Gaps = 1/78 (1%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEE-EKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
+ E + + + + E+ E + E+ +DE+ ++ E + + + EEE
Sbjct: 99 EEEEDKFEQEEAPIPVQAQSQPQPEKVEPQVEEPRDEEVLEEPEPVAAKVPMAEVQPEEE 158
Query: 129 VEEEEEEQKNKKKEEEEE 146
E E +E + K E E +
Sbjct: 159 TEIEVDEPEEPKPEPELD 176
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 28.1 bits (63), Expect = 1.5
Identities = 14/57 (24%), Positives = 38/57 (66%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
K KE+ ++ + K+++ ++EK++E++ + + +K+ E+E+++ E K K+E++
Sbjct: 67 KAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEEKK 123
Score = 27.7 bits (62), Expect = 1.7
Identities = 23/74 (31%), Positives = 42/74 (56%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
+N +E E+E++E E E +K+ K++ +K K KKEE +E+E+E++
Sbjct: 36 SNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFM 95
Query: 140 KKEEEEEEEKEKVE 153
K E E+E+ ++E
Sbjct: 96 KALAEAEKERAELE 109
>gnl|CDD|223945 COG1013, PorB, Pyruvate:ferredoxin oxidoreductase and related
2-oxoacid:ferredoxin oxidoreductases, beta subunit
[Energy production and conversion].
Length = 294
Score = 28.4 bits (64), Expect = 1.5
Identities = 12/74 (16%), Positives = 29/74 (39%), Gaps = 6/74 (8%)
Query: 78 LENNEW-EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
+E W + + EEE+ ++ ++ + EE + E +++ +++
Sbjct: 224 VETGYWPLYRYEPGKAEEEKGKDIPIGIFYPVEEYLEYEERFKHLTKSNPELIQKLQKD- 282
Query: 137 KNKKKEEEEEEEKE 150
+E EE K
Sbjct: 283 ----VDERWEELKR 292
>gnl|CDD|201540 pfam00992, Troponin, Troponin. Troponin (Tn) contains three
subunits, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). this Pfam contains members of
the TnT subunit. Troponin is a complex of three
proteins, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). The troponin complex
regulates Ca++ induced muscle contraction. This family
includes troponin T and troponin I. Troponin I binds to
actin and troponin T binds to tropomyosin.
Length = 131
Score = 27.9 bits (63), Expect = 1.6
Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 3/68 (4%)
Query: 81 NEWEEEEKEKEEEEE---EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
E E E+++KEEE+E E + + ++ ++ + D+ +EE + EE+
Sbjct: 14 EELEFEQRKKEEEKEKYLAERCPPLRLSLSRAELQELCKKLHARIDRLDEERYDIEEKVA 73
Query: 138 NKKKEEEE 145
K KE E+
Sbjct: 74 KKDKEIED 81
>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
Length = 196
Score = 28.3 bits (63), Expect = 1.6
Identities = 17/59 (28%), Positives = 36/59 (61%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
+E E+E +EEE+ + ++ E E+E+ Q+++ +++ +E EE K K +++E E
Sbjct: 2 EEVEKEINKEEEKDVNNLSSNEQMEGPPEDEQAQQNQPQQQTVDEIEELKQKLQQKEVE 60
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 28.2 bits (63), Expect = 1.6
Identities = 9/51 (17%), Positives = 27/51 (52%)
Query: 100 EKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
E + + + + R+ + E QK+++E ++ + ++ ++E ++KE
Sbjct: 142 EADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESSKKKE 192
Score = 27.1 bits (60), Expect = 3.5
Identities = 11/53 (20%), Positives = 28/53 (52%)
Query: 99 EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
++ E D + K R +K + + K +EE+ + ++ +++ ++E K+K
Sbjct: 139 DKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESSKKK 191
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 28.0 bits (63), Expect = 1.6
Identities = 16/88 (18%), Positives = 52/88 (59%), Gaps = 3/88 (3%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
P +A + LE E ++ + E +++E+E +KE+ + +++ EE + ++ + +++
Sbjct: 31 PAGKAAQKQLEK---EFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQK 87
Query: 129 VEEEEEEQKNKKKEEEEEEEKEKVEMYR 156
+E +++Q+ ++E ++++++ +Y
Sbjct: 88 QQELQQKQQAAQQELQQKQQELLQPIYD 115
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 28.4 bits (64), Expect = 1.6
Identities = 16/59 (27%), Positives = 27/59 (45%)
Query: 92 EEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
E E+ K + KK+++KE ++++ K E E + EE + KK E E
Sbjct: 5 EALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63
>gnl|CDD|224106 COG1185, Pnp, Polyribonucleotide nucleotidyltransferase
(polynucleotide phosphorylase) [Translation, ribosomal
structure and biogenesis].
Length = 692
Score = 28.7 bits (65), Expect = 1.7
Identities = 14/64 (21%), Positives = 31/64 (48%), Gaps = 1/64 (1%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
K+K E E +E+ + + + E + +E ++K+E + ++K +++ EEE
Sbjct: 227 KKKWELEPPSLDEELEAKVRDL-AEDELKEAVGIREKQERSAALDAIKEKIEEELSGEEE 285
Query: 148 EKEK 151
K
Sbjct: 286 SSLK 289
>gnl|CDD|223997 COG1071, AcoA, Pyruvate/2-oxoglutarate dehydrogenase complex,
dehydrogenase (E1) component, eukaryotic type, alpha
subunit [Energy production and conversion].
Length = 358
Score = 28.4 bits (64), Expect = 1.7
Identities = 16/60 (26%), Positives = 26/60 (43%), Gaps = 6/60 (10%)
Query: 93 EEEEEEEEKEKD--EDEEKKREKK----EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
+EE EE K++D K + EEE E + + + EV+E E + + E
Sbjct: 277 SKEEVEEWKKRDPIVRLRKYLIEAGILSEEELEAIEAEAKAEVDEAVEFAEASPYPDVSE 336
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 28.7 bits (64), Expect = 1.7
Identities = 18/103 (17%), Positives = 41/103 (39%)
Query: 44 EFQCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEK 103
EF+ + P + + I + + ++ EE +
Sbjct: 352 EFEIYLYKIINANPDKLENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTL 411
Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
+ + +KK + K++D K+EE+ E+E +K++K + +E
Sbjct: 412 YKTKIFYHKKKINQNNKEQDIKKEELLEKEFVKKSEKIPKNDE 454
Score = 26.3 bits (58), Expect = 9.8
Identities = 15/77 (19%), Positives = 34/77 (44%), Gaps = 2/77 (2%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEE--KKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
+ E + ++K + K+ + EE K + + + ++++ + +EQ
Sbjct: 371 KKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNNKEQ 430
Query: 137 KNKKKEEEEEEEKEKVE 153
KK+E E+E +K E
Sbjct: 431 DIKKEELLEKEFVKKSE 447
>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
Length = 697
Score = 28.5 bits (65), Expect = 1.8
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEE 135
+KREKK E ++ ++EE E EEE
Sbjct: 327 RKREKKAAELAAEEAEEEEAAEPEEES 353
Score = 27.8 bits (63), Expect = 3.1
Identities = 10/26 (38%), Positives = 15/26 (57%)
Query: 120 KQKDKKEEEVEEEEEEQKNKKKEEEE 145
K++ K E EE EE++ + EEE
Sbjct: 328 KREKKAAELAAEEAEEEEAAEPEEES 353
>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
motility and secretion].
Length = 214
Score = 28.3 bits (63), Expect = 1.8
Identities = 18/63 (28%), Positives = 32/63 (50%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E+ +E+ EE + K D+E E E E+ +++ E+V+ EEE K+K E
Sbjct: 18 FALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIEEEISEKEKVMSE 77
Query: 146 EEE 148
+ +
Sbjct: 78 KLK 80
Score = 26.7 bits (59), Expect = 6.2
Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 1/83 (1%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
+ EE+ EK E+ E+ K++ EE + K+ ++E ++ E + +E K
Sbjct: 1 KKPYLEEKIEKAEKVTAFALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEK 60
Query: 140 KKE-EEEEEEKEKVEMYRNPVPM 161
EEE EKEKV + P
Sbjct: 61 VDSIEEEISEKEKVMSEKLKEPA 83
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 28.5 bits (64), Expect = 1.9
Identities = 15/76 (19%), Positives = 28/76 (36%), Gaps = 1/76 (1%)
Query: 41 DDAEFQCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEE-EE 99
D E A +PIR++A + + + + + E+E + EE E
Sbjct: 102 FDEEPDEPAETAGSLRPIRSSAAAYRDINRELGSFYEYPKTDEEKELLREVEELESRLAT 161
Query: 100 EKEKDEDEEKKREKKE 115
E + E++ E
Sbjct: 162 EPSPAPELEEQLALME 177
Score = 27.8 bits (62), Expect = 3.2
Identities = 14/68 (20%), Positives = 24/68 (35%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE ++ E + RE E + D+++E + E EE + E
Sbjct: 104 EEPDEPAETAGSLRPIRSSAAAYRDINRELGSFYEYPKTDEEKELLREVEELESRLATEP 163
Query: 144 EEEEEKEK 151
E E+
Sbjct: 164 SPAPELEE 171
Score = 27.4 bits (61), Expect = 4.5
Identities = 11/79 (13%), Positives = 34/79 (43%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
++ ++ + E+E E+++K+K + + E+ +++ +D + +EE +
Sbjct: 50 KDEGIIGDKMKAYEQEALEQKQKDKKRTLQDLADSFEKSDKESQDYSSSDSGFDEEPDEP 109
Query: 139 KKKEEEEEEEKEKVEMYRN 157
+ + YR+
Sbjct: 110 AETAGSLRPIRSSAAAYRD 128
>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
(Spore_III_AF). This family represents the stage III
sporulation protein AF (Spore_III_AF) of the bacterial
endospore formation program, which exists in some but
not all members of the Firmicutes (formerly called
low-GC Gram-positives). The C-terminal region of these
proteins is poorly conserved.
Length = 185
Score = 28.0 bits (63), Expect = 1.9
Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 6/77 (7%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE------EEEKQKDKKEEEVEEEEEEQKN 138
E++ ++E + ++ E E DED E +E EE K+K K + E + +
Sbjct: 93 EKKLKEEYGVKVKDVEVEIDEDLESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDTQTSK 152
Query: 139 KKKEEEEEEEKEKVEMY 155
K+EEEE EE EK++ +
Sbjct: 153 PKEEEEESEEAEKIKNF 169
>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
Length = 815
Score = 28.6 bits (64), Expect = 1.9
Identities = 18/80 (22%), Positives = 38/80 (47%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
S+ E + ++++ E +E E E+ + KK +K+++ E+ V+E EE
Sbjct: 7 SQAHAPAAEEGKGGTRKQKAENKEHEGEQSPKKAKKEKKQDDSGNGNGKSAEDAVKEFEE 66
Query: 135 EQKNKKKEEEEEEEKEKVEM 154
K ++ E+ +E +E
Sbjct: 67 FCKAIEEHLSIEQMREILEE 86
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 28.2 bits (63), Expect = 1.9
Identities = 27/61 (44%), Positives = 36/61 (59%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
LE E +EEE+ EEEEE+ EEE E++E+E ++ EEEEE E EE EE +
Sbjct: 71 LELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYDELPTPEEREELR 130
Query: 138 N 138
Sbjct: 131 E 131
Score = 27.4 bits (61), Expect = 3.4
Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
S LE++ EE + + E++EEEE E++E++ ++ ++EEEE + EE
Sbjct: 55 GLSGVLESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEY-----QRGPFGEE 109
Query: 133 EEEQKNKKKEEEEEEEKE 150
EEE + E EE+E
Sbjct: 110 EEEDGDSYDELPTPEERE 127
>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS). This family
consists of several mammalian selenoprotein S (SelS)
sequences. SelS is a plasma membrane protein and is
present in a variety of tissues and cell types. The
function of this family is unknown.
Length = 190
Score = 27.9 bits (62), Expect = 1.9
Identities = 17/61 (27%), Positives = 36/61 (59%), Gaps = 3/61 (4%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
E+E D D+ +R+ E E + + +E+++ + + K K+K+ EEE+ ++K+EM
Sbjct: 65 SSEKEITVFDPDKVVRRQ---EALEAARLRMQEDLDAKAAKFKEKQKQLEEEKRRQKIEM 121
Query: 155 Y 155
+
Sbjct: 122 W 122
>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
Length = 499
Score = 28.5 bits (63), Expect = 2.0
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 1/39 (2%)
Query: 67 ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDE 105
+ +A +E E+N+ E + E EEE+EEEE E E
Sbjct: 78 LFAEAGEEAEE-EDNDRECPDTEAEEEDEEEEIEAPDPE 115
Score = 27.3 bits (60), Expect = 4.1
Identities = 14/34 (41%), Positives = 19/34 (55%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDE 107
A E E + + E + E EEE+EEEE E + E
Sbjct: 82 AGEEAEEEDNDRECPDTEAEEEDEEEEIEAPDPE 115
Score = 27.3 bits (60), Expect = 4.8
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE 112
E EEE+ ++E + E EEE E++E E E
Sbjct: 85 EAEEEDNDRECPDTEAEEEDEEEEIEAPDPE 115
Score = 26.6 bits (58), Expect = 7.8
Identities = 11/30 (36%), Positives = 20/30 (66%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
E EE EEE+ ++E + E ++ +++EE E
Sbjct: 81 EAGEEAEEEDNDRECPDTEAEEEDEEEEIE 110
Score = 26.6 bits (58), Expect = 9.1
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEKVE 153
EE EEE+ +++ E EEE+E+E++E
Sbjct: 84 EEAEEEDNDRECPDTEAEEEDEEEEIE 110
Score = 26.2 bits (57), Expect = 9.4
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEE 119
EE EEE+ + D E + E +EEE E
Sbjct: 84 EEAEEEDNDRECPDTEAEEEDEEEEIE 110
>gnl|CDD|222508 pfam14033, DUF4246, Protein of unknown function (DUF4246). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and fungi.
Proteins in this family are typically between 392 and
644 amino acids in length.
Length = 496
Score = 28.4 bits (64), Expect = 2.0
Identities = 17/72 (23%), Positives = 34/72 (47%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E KE+E EE + E +DED E E+ EE + ++++++ E +
Sbjct: 262 EYHPKEKEPEEPKPEPGEDEDYEDYWERLEEYLDWREERRKAIHPEPGTFFPYEPTPPHI 321
Query: 146 EEEKEKVEMYRN 157
E+ + + +R+
Sbjct: 322 EKTVDLQKDFRD 333
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 26.8 bits (60), Expect = 2.0
Identities = 11/22 (50%), Positives = 16/22 (72%)
Query: 85 EEEKEKEEEEEEEEEEKEKDED 106
EEE++EEEEE+E+D+D
Sbjct: 61 AAAAAAEEEKKEEEEEEEEDDD 82
Score = 26.0 bits (58), Expect = 4.3
Identities = 11/17 (64%), Positives = 15/17 (88%)
Query: 84 EEEEKEKEEEEEEEEEE 100
EE++KEEEEEEEE++
Sbjct: 65 AAEEEKKEEEEEEEEDD 81
Score = 25.3 bits (56), Expect = 7.9
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 84 EEEEKEKEEEEEEEEEEKEKD 104
E+E++EEEEEEE++ D
Sbjct: 62 AAAAAEEEKKEEEEEEEEDDD 82
>gnl|CDD|189037 cd09867, PIN_FEN1, PIN domain of Flap Endonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Flap endonuclease-1 (FEN1) is
involved in multiple DNA metabolic pathways, including
DNA replication processes (5' flap DNA endonuclease
activity and double stranded DNA 5'-exonuclease
activity) and DNA repair processes (long-patch base
excision repair) in eukaryotes and archaea. Interaction
between FEN1 and PCNA (Proliferating cell nuclear
antigen) is an essential prerequisite to FEN1's DNA
replication functionality and stimulates FEN1 nuclease
activity by 10-50 fold. FEN1 belongs to the
FEN1-EXO1-like family of structure-specific, 5'
nucleases. These nucleases contain a PIN (PilT N
terminus) domain with a helical arch/clamp region (I
domain) of variable length (approximately 45 residues in
FEN1 PIN domains) and a H3TH (helix-3-turn-helix)
domain, an atypical helix-hairpin-helix-2-like region.
Both the H3TH domain (not included here) and the helical
arch/clamp region are involved in DNA binding. Nucleases
within this group also have a carboxylate-rich active
site that is involved in binding essential divalent
metal ion cofactors (Mg2+/Mn2+). FEN1 has a C-terminal
extension containing residues forming the consensus
PIP-box - Qxx(M/L/I)xxF(Y/F) which serves to anchor FEN1
to PCNA.
Length = 261
Score = 28.2 bits (64), Expect = 2.0
Identities = 15/30 (50%), Positives = 21/30 (70%)
Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
K + EK+RE++EE EEK ++ KEE EE
Sbjct: 88 KSGELEKRRERREEAEEKLEEAKEEGDAEE 117
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 28.2 bits (64), Expect = 2.0
Identities = 9/31 (29%), Positives = 17/31 (54%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
+EE +E+ + E +++EKK E+ E
Sbjct: 512 VEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 27.8 bits (63), Expect = 2.9
Identities = 11/45 (24%), Positives = 25/45 (55%)
Query: 110 KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
+ + +EE ++ + +E EE++K +K E ++ +KVE+
Sbjct: 509 RIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPEITIDDFDKVEL 553
Score = 27.8 bits (63), Expect = 3.1
Identities = 9/32 (28%), Positives = 19/32 (59%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE 116
+ E+E +E+ E K+ +E++K+ +K E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 27.5 bits (62), Expect = 4.4
Identities = 10/32 (31%), Positives = 19/32 (59%)
Query: 96 EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
+ EEE +++ + KE EE+++K +K E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 27.1 bits (61), Expect = 5.7
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
D +EE K++ E K+ +E+E + E+ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
Score = 26.7 bits (60), Expect = 7.6
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEEEEK 120
+ EEE +++ + K+ E+KE++ EK
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEK 540
Score = 26.3 bits (59), Expect = 9.4
Identities = 9/32 (28%), Positives = 18/32 (56%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
+ E+E +E+ E ++ E+K +K E+ E
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 28.4 bits (63), Expect = 2.0
Identities = 22/75 (29%), Positives = 42/75 (56%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
LE++E ++E +KE + + EEE++E DE E+ + E++ E K K +E V E +
Sbjct: 273 AELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDL 332
Query: 136 QKNKKKEEEEEEEKE 150
+ K+ E+ + +
Sbjct: 333 ESPKELSSFEKRQAK 347
Score = 26.9 bits (59), Expect = 7.2
Identities = 22/83 (26%), Positives = 41/83 (49%)
Query: 75 SETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE 134
E + + E ++EEEEEE+E +DE + + E E + + + E E++
Sbjct: 100 HEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDD 159
Query: 135 EQKNKKKEEEEEEEKEKVEMYRN 157
E+K ++EE EE++ + R
Sbjct: 160 EEKKMEEEEAGEEKESVEQATRE 182
>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
repeat [General function prediction only].
Length = 561
Score = 28.3 bits (63), Expect = 2.0
Identities = 22/59 (37%), Positives = 29/59 (49%), Gaps = 3/59 (5%)
Query: 97 EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE--EEEKEKVE 153
E+ + DED+ + KK E K++ EE E Q NK + EEE E KE VE
Sbjct: 503 EQWPEYSDEDKIRSLLKKLRAIEALKERMRSG-EELEVIQVNKIETEEEVLSELKELVE 560
>gnl|CDD|237200 PRK12778, PRK12778, putative bifunctional 2-polyprenylphenol
hydroxylase/glutamate synthase subunit beta;
Provisional.
Length = 752
Score = 28.6 bits (64), Expect = 2.1
Identities = 17/60 (28%), Positives = 24/60 (40%), Gaps = 1/60 (1%)
Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYR 163
D DE KR + E ++ K EE + K + +E E+V M P YR
Sbjct: 249 DFDEMLKRMGAYKTIEGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELD-PEYR 307
Score = 28.2 bits (63), Expect = 2.3
Identities = 16/66 (24%), Positives = 23/66 (34%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E EE K EE + + + K+R E + D + EE KE+
Sbjct: 264 EGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELDPEYRAHNRFEEVNLGLTKEQ 323
Query: 144 EEEEEK 149
E K
Sbjct: 324 AMTEAK 329
Score = 27.8 bits (62), Expect = 3.3
Identities = 9/53 (16%), Positives = 20/53 (37%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
E EE + E+ + R+ + +E ++ + E N+ +E
Sbjct: 262 TIEGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELDPEYRAHNRFEE 314
>gnl|CDD|226908 COG4531, ZnuA, ABC-type Zn2+ transport system, periplasmic
component/surface adhesin [Inorganic ion transport and
metabolism].
Length = 318
Score = 28.2 bits (63), Expect = 2.1
Identities = 24/101 (23%), Positives = 34/101 (33%), Gaps = 13/101 (12%)
Query: 23 GDSSHGVYNLRIRNVSLTDDAEFQCQVGP---AQYHKPIRANARLTVI-LPQAISASETL 78
G S H Y+LR +V A+ VGP A KP+ V+ L L
Sbjct: 57 GASPH-DYSLRPSDVKRLQSADLVVWVGPDLEAFLDKPLSGLPGAKVVTLADLPGVKPLL 115
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
+E + EEE + + + K E E
Sbjct: 116 F--------REAHDHEEEHDHGHDHAGADHKGDHDHHHEGE 148
>gnl|CDD|240416 PTZ00432, PTZ00432, falcilysin; Provisional.
Length = 1119
Score = 28.2 bits (63), Expect = 2.2
Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 4/80 (5%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E + EKE + ++E KE+ K+ + E E+ +K KKE E +++ E +
Sbjct: 575 ESSKYEKEFNKLVKDELKERLSHLTKE-QVDEMEKAYEKFKKEREADDDPEHLDSFPILS 633
Query: 144 EEEEEKEKVEMYRNPVPMYR 163
+ KE E P +Y+
Sbjct: 634 LSDLNKETEE---IPTKLYK 650
Score = 26.7 bits (59), Expect = 7.8
Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
Query: 97 EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
E E K E E K K E +E K E+V+E E+ + KKE E +++ E ++
Sbjct: 572 EAVESSKYEKEFNKLVKDELKERLSHLTK-EQVDEMEKAYEKFKKEREADDDPEHLD 627
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 28.3 bits (63), Expect = 2.2
Identities = 10/38 (26%), Positives = 28/38 (73%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK 109
++ E ++++E E + +E+E+++E++++DEDE++
Sbjct: 873 LTDLEEYLISQFQENEDDDADEDEDQDEDEDEDEDEDE 910
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 28.3 bits (63), Expect = 2.2
Identities = 20/79 (25%), Positives = 34/79 (43%), Gaps = 8/79 (10%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDED--------EEKKREKKEEEEEKQKDKKEEEV 129
L+ E E ++ +KE+ + EE EK + ++ R+ ++ K+ KK E
Sbjct: 1158 LDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASES 1217
Query: 130 EEEEEEQKNKKKEEEEEEE 148
E EE + E E E
Sbjct: 1218 ETTEETYGSSAMETENVAE 1236
Score = 27.1 bits (60), Expect = 6.5
Identities = 15/67 (22%), Positives = 31/67 (46%), Gaps = 6/67 (8%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E+E + ++E+ + EE +EK + + E ++ ++ K+ +K KK
Sbjct: 1162 EKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPA------PKKTTKKAS 1215
Query: 144 EEEEEKE 150
E E +E
Sbjct: 1216 ESETTEE 1222
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 27.1 bits (60), Expect = 2.2
Identities = 19/32 (59%), Positives = 24/32 (75%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEE 100
P A +A+ E E +EEEK+KEEE+EEEEEE
Sbjct: 66 PAAAAAAAAEEKAEEKEEEKKKEEEKEEEEEE 97
>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
ArcB; Provisional.
Length = 779
Score = 28.4 bits (64), Expect = 2.2
Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 4/42 (9%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ 121
N E E EK EE +E E+ ++E K+RE+ + E E+Q
Sbjct: 116 NQEIAEREKA----EEARQEAFEQLKNEIKEREETQIELEQQ 153
>gnl|CDD|205286 pfam13105, DUF3959, Protein of unknown function (DUF3959). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 260 amino acids in length.
Length = 242
Score = 27.9 bits (62), Expect = 2.3
Identities = 8/48 (16%), Positives = 20/48 (41%)
Query: 110 KREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
++ ++ K + EE V + + +E + + ++E RN
Sbjct: 195 RKPSNSKDPRKYEHNYEENVPMYHMTEPIDQYKEPTTQGQTRMERRRN 242
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 28.2 bits (63), Expect = 2.4
Identities = 16/47 (34%), Positives = 30/47 (63%), Gaps = 3/47 (6%)
Query: 105 EDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
+ EK+ + K ++ E+ +DK+E++ EE +E+ K K E+ EE +K
Sbjct: 11 KLAEKQAKSKLQQLERARDKQEKKAEEYQEQIK---KAIEKAEEAKK 54
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 26.9 bits (60), Expect = 2.4
Identities = 14/24 (58%), Positives = 19/24 (79%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDE 107
+E+EEEEEEEEEE+E+ E+E
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEEE 95
Score = 26.5 bits (59), Expect = 3.1
Identities = 14/24 (58%), Positives = 18/24 (75%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEE 108
+EEEEEEEEEE+E++E EE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEE 94
Score = 26.5 bits (59), Expect = 3.2
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEE 108
E+EEEEEEEEEE+E++ +EE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein. This
is the P3 protein section of the Potyviridae
polyproteins. The function is not known except that the
protein is essential to viral survival.
Length = 445
Score = 28.3 bits (64), Expect = 2.4
Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNP 158
E ++ K+E EEK+ +K+E+E+ + K + K EEE E +E +NP
Sbjct: 271 EHRKLKREAAEEKE-EKEEKEIRKLYLYSKLEGKLPTEEEFLEYIE-EKNP 319
Score = 27.9 bits (63), Expect = 2.5
Identities = 24/70 (34%), Positives = 35/70 (50%), Gaps = 12/70 (17%)
Query: 86 EEKEKEEEEEEEEEEK--EKDEDEEKKREKKEEEEE-----KQKDKK-----EEEVEEEE 133
+ K + EE+EE+EEK K K K EEE ++K+ + EE +EEEE
Sbjct: 274 KLKREAAEEKEEKEEKEIRKLYLYSKLEGKLPTEEEFLEYIEEKNPELLETAEELIEEEE 333
Query: 134 EEQKNKKKEE 143
E + K +E
Sbjct: 334 VEHQAKTADE 343
>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B. A
small number of taxonomically diverse prokaryotic
species, including Methanosarcina barkeri, have what
appears to be a second ATP synthase, in addition to the
normal F1F0 ATPase in bacteria and A1A0 ATPase in
archaea. These enzymes use ion gradients to synthesize
ATP, CC and in principle may run in either direction.
This model represents the F0 subunit B of this apparent
second ATP synthase.
Length = 246
Score = 27.7 bits (62), Expect = 2.4
Identities = 19/70 (27%), Positives = 36/70 (51%), Gaps = 3/70 (4%)
Query: 84 EEEEKEKEEEEEEEEEEK---EKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
+ + K++E E+E E E+ E D+ E K +EE + ++ + +E EE +E + K
Sbjct: 47 DADTKKREAEQERREYEEKNEELDQQREVLLTKAKEEAQAERQRLLDEAREEADEIREKW 106
Query: 141 KEEEEEEEKE 150
+E E+
Sbjct: 107 QEALRREQAA 116
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 26.4 bits (59), Expect = 2.4
Identities = 14/27 (51%), Positives = 22/27 (81%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEE 108
EWEEEE+ ++ E E+EE+E+E D+D+
Sbjct: 47 EWEEEEEGEDLESEDEEDEEEDDDDDM 73
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 25.8 bits (57), Expect = 2.5
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKR 111
E+EK E+EEE E+E+E +E E KR
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVKR 46
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 27.6 bits (62), Expect = 2.5
Identities = 17/39 (43%), Positives = 23/39 (58%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
KK+ KKEEE +K K K E V + E +KN K ++ E
Sbjct: 124 KKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLE 162
>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
N-terminal domain. Autophagocytosis is a
starvation-induced process responsible for transport of
cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
ubiquitin like modifier that is topologically similar to
the canonical E2 enzyme. It catalyzes the conjugation of
Atg8 and phosphatidylethanolamine.
Length = 146
Score = 27.3 bits (61), Expect = 2.6
Identities = 11/62 (17%), Positives = 32/62 (51%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E + EE E+E+E + R+K++++ ++D E ++++ + E+++
Sbjct: 83 EYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDDD 142
Query: 146 EE 147
++
Sbjct: 143 DD 144
Score = 26.9 bits (60), Expect = 4.2
Identities = 8/62 (12%), Positives = 30/62 (48%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
E EE E+E+++ K++++ ++ E +++++ + +E+++
Sbjct: 83 EYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADEDDD 142
Query: 147 EE 148
++
Sbjct: 143 DD 144
Score = 25.8 bits (57), Expect = 9.1
Identities = 9/68 (13%), Positives = 34/68 (50%), Gaps = 4/68 (5%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
+ E + +E E+E+E++ + ++K + +E+ E+ +++++ +
Sbjct: 80 KQMEYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIP----EIGDDDDDVVDSS 135
Query: 141 KEEEEEEE 148
+E++++
Sbjct: 136 DADEDDDD 143
Score = 25.8 bits (57), Expect = 9.8
Identities = 12/63 (19%), Positives = 30/63 (47%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
K+ E + EE +++DED+ +++K EE++ E ++ + + +E
Sbjct: 80 KQMEYGDGAEEIVEDEDEDDGWVTTHGNRDKQKDDIADEEDIPEIGDDDDDVVDSSDADE 139
Query: 148 EKE 150
+ +
Sbjct: 140 DDD 142
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 27.9 bits (62), Expect = 2.6
Identities = 20/98 (20%), Positives = 41/98 (41%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+A +S+ + E E E++ K EE E++ + E++ED + + + + +EE
Sbjct: 162 EAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGE 221
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYRNPVP 167
E E E +E ++ R+ + P
Sbjct: 222 EAPSINYNEDTSESESDESDSEISESRSVSDSEESSPP 259
Score = 27.5 bits (61), Expect = 4.1
Identities = 11/61 (18%), Positives = 37/61 (60%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
E + ++ + +++ +K K ++ ++ +K +EEE + +D+ + E E++ ++++ + E
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202
Query: 147 E 147
+
Sbjct: 203 D 203
Score = 26.3 bits (58), Expect = 9.9
Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 6/78 (7%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV------EEE 132
+ E +E + +EEE E E+E + +E E + +EEE+ +D + + +EE
Sbjct: 159 KKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEE 218
Query: 133 EEEQKNKKKEEEEEEEKE 150
E E+ E+ E E
Sbjct: 219 EGEEAPSINYNEDTSESE 236
>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 145
Score = 27.4 bits (61), Expect = 2.6
Identities = 7/38 (18%), Positives = 13/38 (34%), Gaps = 1/38 (2%)
Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
K + E + E + + K+ EE +K
Sbjct: 104 RKNDAHTHASILLSSNE-QNSTEALQLRAKKREEHRKK 140
Score = 25.9 bits (57), Expect = 8.4
Identities = 5/41 (12%), Positives = 14/41 (34%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE 119
+N+ E+ E + + ++ KK + +
Sbjct: 105 KNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKKWYQND 145
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 27.5 bits (61), Expect = 2.6
Identities = 24/68 (35%), Positives = 42/68 (61%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E +E+++E+EEE+EEE EE E EDEE+ E +EEEEE ++D + + E++
Sbjct: 46 EKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNS 105
Query: 142 EEEEEEEK 149
+++ +
Sbjct: 106 TQDDNAQN 113
Score = 27.1 bits (60), Expect = 4.0
Identities = 23/86 (26%), Positives = 53/86 (61%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E ++ W++E+ KE E+ ++E++++ +E+EE+ E+ EE E+ + +++ E EEEEEE
Sbjct: 24 EHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEE 83
Query: 136 QKNKKKEEEEEEEKEKVEMYRNPVPM 161
+ + ++ E+K +++ +
Sbjct: 84 DEEDNVDLKDIEKKNINDIFNSTQDD 109
>gnl|CDD|222863 PHA02540, 61, DNA primase; Provisional.
Length = 337
Score = 28.0 bits (63), Expect = 2.6
Identities = 13/50 (26%), Positives = 24/50 (48%)
Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKV 152
KD + + RE E +++ K V + + E K +KK E+ E++
Sbjct: 72 KDYEPDLYREYIMERFKERGTGKGRPVPKPKFEFKKEKKVIEKLPFCERL 121
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 26.9 bits (60), Expect = 2.6
Identities = 14/46 (30%), Positives = 26/46 (56%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
+ E ++ +E++EE+E+D++E +R KKEEE + D
Sbjct: 43 TDYLEVTDKVKEKKEEREEDKEELIERIKKEEETFEDVDLGSAGCG 88
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 27.4 bits (61), Expect = 2.7
Identities = 18/77 (23%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREK--KEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
+E + ++ + E E +K + E +K +K K +E + Q D K E + + + + KKE
Sbjct: 37 QESPQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKE 96
Query: 143 EEEEEEKEKVEMYRNPV 159
+ +K + Y +
Sbjct: 97 KLVNAFNKKQQEYEKDL 113
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 27.4 bits (61), Expect = 2.8
Identities = 16/46 (34%), Positives = 27/46 (58%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
E EE + E EE ++ ++K + E+K+ KK +E EK++ KK
Sbjct: 96 KKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 27.5 bits (62), Expect = 2.8
Identities = 19/51 (37%), Positives = 35/51 (68%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
+KEE + ++E EK E+ +++EK+ ++Q ++KEEE+EE EEQ+ +
Sbjct: 89 QKEETLDRKDESLEKKEESLEEKEKELAARQQQLEEKEEELEELIEEQQQE 139
>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
Length = 292
Score = 27.9 bits (63), Expect = 2.8
Identities = 15/30 (50%), Positives = 21/30 (70%)
Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
K E+ E++RE +EE EEK ++ KEE EE
Sbjct: 40 KAEEIEERREVREEAEEKYEEAKEEGDLEE 69
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 28.0 bits (63), Expect = 2.9
Identities = 10/45 (22%), Positives = 23/45 (51%), Gaps = 1/45 (2%)
Query: 114 KEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNP 158
+E+++ K+ +K ++K K+ + E+++ V Y NP
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH-VLHYHNP 127
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 27.8 bits (63), Expect = 3.0
Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 15/93 (16%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR----------EKK-EEEEEK 120
I+AS L + E E K+ E EE+++ KE E R EK +E +K
Sbjct: 494 ITASSGLSDEEIERMVKDAEANAEEDKKRKELVE----ARNQADSLIYQTEKTLKELGDK 549
Query: 121 QKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
++E++E +E K K E++E K K E
Sbjct: 550 VPADEKEKIEAAIKELKEALKGEDKEAIKAKTE 582
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 27.9 bits (63), Expect = 3.0
Identities = 15/62 (24%), Positives = 30/62 (48%), Gaps = 1/62 (1%)
Query: 58 IRANARLTV-ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEE 116
+R+ A+ + I+ + + +L E E +E E EEEEEK++++ E+
Sbjct: 40 LRSKAKEIIPIVKEVVEEVNSLSLEEQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEK 99
Query: 117 EE 118
+
Sbjct: 100 GK 101
Score = 27.5 bits (62), Expect = 4.1
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
EE++E+ EE E E+E+++ EEKK EK K
Sbjct: 63 LEEQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEKGK 101
Score = 27.1 bits (61), Expect = 4.8
Identities = 11/36 (30%), Positives = 20/36 (55%)
Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
E+Q+++ EE E EE++ KK+E++ E
Sbjct: 63 LEEQRERLEELAPELLEEEEEKKEEKKGLPPLPNAE 98
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 27.2 bits (61), Expect = 3.0
Identities = 20/56 (35%), Positives = 36/56 (64%)
Query: 96 EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
EE+EE+E+ EDEE+ E++ EE E++ ++ ++ + E +N +K E E E+ K
Sbjct: 1 EEKEEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAK 56
>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein. This is a family
of proteins which regulate checkpoint kinases. In
Schizosaccharomyces pombe this protein is called Rad26
and in Saccharomyces cerevisiae it is called LCD1.
Length = 648
Score = 27.9 bits (62), Expect = 3.0
Identities = 13/38 (34%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 114 KEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
K+EE KQK + E++E+ +++ K K+E + E E+
Sbjct: 13 KQEERNKQK-SRVNELKEKHDQELQKLKQELQSLEDER 49
>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein. This family consists of several
eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
to suppress GCR2 and is highly expressed in the muscle
and heart. The function of this family is unknown
although it has been speculated that SGT1 may be
functionally analogous to the Gcr2p protein of
Saccharomyces cerevisiae which is known to be a
regulatory factor of glycolytic gene expression.
Length = 557
Score = 27.8 bits (62), Expect = 3.1
Identities = 10/67 (14%), Positives = 29/67 (43%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ +E + E+++E+E+ + +E ++ + + E KQ D + ++
Sbjct: 487 DADEDDDEDDDEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSSGAEDDDD 546
Query: 144 EEEEEKE 150
+ E +
Sbjct: 547 DIEGVEP 553
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 27.8 bits (62), Expect = 3.1
Identities = 17/77 (22%), Positives = 29/77 (37%), Gaps = 4/77 (5%)
Query: 66 VILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE-EEKQKDK 124
V+LP+A + E E E EEE E D+ + E+ ++ EE D
Sbjct: 258 VLLPRA---TRLPEPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPEELMFDA 314
Query: 125 KEEEVEEEEEEQKNKKK 141
E ++ + +
Sbjct: 315 VEADLPDNILATLQTVQ 331
>gnl|CDD|178493 PLN02905, PLN02905, beta-amylase.
Length = 702
Score = 28.0 bits (62), Expect = 3.1
Identities = 10/25 (40%), Positives = 19/25 (76%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREK 113
E+++EEE + + KE+D+ + + REK
Sbjct: 13 EEDDEEEMDMDVKEEDDGDRRNREK 37
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 27.7 bits (61), Expect = 3.1
Identities = 16/103 (15%), Positives = 39/103 (37%), Gaps = 9/103 (8%)
Query: 64 LTVILPQAISAS---------ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKK 114
+TV+ ISA+ L + ++ ++ + E + E + E++
Sbjct: 46 VTVLTGMLISAATLAILFLLNRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETERE 105
Query: 115 EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
E QK ++E E +E + + ++E + ++
Sbjct: 106 AARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQD 148
>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
insulinase-like [General function prediction only].
Length = 978
Score = 27.7 bits (62), Expect = 3.1
Identities = 14/47 (29%), Positives = 26/47 (55%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
E EE+ +++E E ++ + DED EK + ++ +E+Q EE
Sbjct: 460 ELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEE 506
Score = 27.3 bits (61), Expect = 4.3
Identities = 11/49 (22%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 104 DEDEEKKREKKEEEE--EKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
+ E+K EK+E E ++ + +E++E+ ++ K K+ +++ + +E
Sbjct: 458 SPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEE 506
Score = 26.5 bits (59), Expect = 9.1
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
EE++E+EE E K+ E +E+ EK+
Sbjct: 462 EEKLEKEERELLQKRSSELTDEDLEKII 489
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 27.6 bits (61), Expect = 3.2
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 130 EEEEEEQKNKKKEEEEEEEKEKVE 153
EEE+E++KN KEE+E ++KE+ +
Sbjct: 94 EEEKEKKKNLNKEEKEAKKKERAK 117
>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
Length = 194
Score = 27.5 bits (61), Expect = 3.2
Identities = 19/59 (32%), Positives = 34/59 (57%), Gaps = 4/59 (6%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEE----EQKNKKKEEE 144
KEE E++ +E+ DE K+E KE +E+ ++K+ + E+E + KNK E++
Sbjct: 3 KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKEIADLKAKNKDLEDK 61
>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 486
Score = 27.5 bits (61), Expect = 3.5
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 26/100 (26%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEK--------------------EKDEDEEKKREKKEEEE 118
E +E+ ++KEEE+EE + +K E E+E K K E+ E
Sbjct: 368 SKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVEKSE 427
Query: 119 EKQKDKKEEEVE------EEEEEQKNKKKEEEEEEEKEKV 152
E D + + ++ E EE +K K+KE+E E EKV
Sbjct: 428 EDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVEKV 467
>gnl|CDD|221514 pfam12297, EVC2_like, Ellis van Creveld protein 2 like protein.
This family of proteins is found in eukaryotes. Proteins
in this family are typically between 571 and 1310 amino
acids in length. There are two conserved sequence
motifs: LPA and ELH. EVC2 is implicated in Ellis van
Creveld chondrodysplastic dwarfism in humans. Mutations
in this protein can give rise to this congenital
condition. LIMBIN is a protein which shares around 80%
sequence homology with EVC2 and it is implicated in a
similar condition in bovine chondrodysplastic dwarfism.
Length = 429
Score = 27.5 bits (61), Expect = 3.5
Identities = 17/79 (21%), Positives = 35/79 (44%), Gaps = 5/79 (6%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKRE-----KKEEEEEKQKDKK 125
A++A LE + E + ++E E E+ EE K E + E E++ ++
Sbjct: 221 ALTAECNLETRKKMEAQHQREMAEMEQAEELLKRAPEREAVECSSLLDTLHGLEQEHLQR 280
Query: 126 EEEVEEEEEEQKNKKKEEE 144
+++EE+ K ++
Sbjct: 281 SLLLQQEEDFAKAHRQLAV 299
>gnl|CDD|227371 COG5038, COG5038, Ca2+-dependent lipid-binding protein, contains C2
domain [General function prediction only].
Length = 1227
Score = 27.8 bits (62), Expect = 3.5
Identities = 12/54 (22%), Positives = 22/54 (40%)
Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
+E + +K E+ K + +K E+ + EE E +E +MY
Sbjct: 847 SLEEVRYVDKVSSEKRKSEKRKSALDEKTISLVDKEDSVEESIEVEELTDMYSL 900
Score = 27.0 bits (60), Expect = 5.2
Identities = 16/56 (28%), Positives = 25/56 (44%), Gaps = 4/56 (7%)
Query: 91 EEEEEEEEEEKEKDEDEEKK---REKKEEEEEKQKDKKEE-EVEEEEEEQKNKKKE 142
EE ++ EK + E++K EK +K+ +E EVEE + K K
Sbjct: 849 EEVRYVDKVSSEKRKSEKRKSALDEKTISLVDKEDSVEESIEVEELTDMYSLKPKL 904
>gnl|CDD|215406 PLN02761, PLN02761, lipase class 3 family protein.
Length = 527
Score = 27.7 bits (61), Expect = 3.6
Identities = 9/37 (24%), Positives = 22/37 (59%)
Query: 96 EEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
++++ ++ + KRE++ EEE ++K+ E+ E
Sbjct: 57 QQKQSNKQTHVSDNKREEEPEEELEEKEVSLREIWRE 93
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 27.2 bits (61), Expect = 3.6
Identities = 13/41 (31%), Positives = 25/41 (60%), Gaps = 6/41 (14%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQK------DKKEEEV 129
E E+E+K+ EEKK K+E+++ ++ D ++E+V
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEPYGYCLVDGRKEKV 136
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 26.8 bits (60), Expect = 3.7
Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 6/65 (9%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDED----EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
E+ +K +EE +EE + E DED + K++ + K EE + +E++K K
Sbjct: 28 EKYDKREEEAKEEAKARSEPDEDGWTTVTRGGRKRKAGASRN--KAAEERRKLKEKKKKK 85
Query: 140 KKEEE 144
KKE E
Sbjct: 86 KKELE 90
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
MW N terminal. This domain is found in eukaryotes. This
domain is about 90 amino acids in length. This domain is
found associated with pfam00076. This domain is part of
U1 snRNP, which is the pre-mRNA binding protein of the
penta-snRNP spliceosome complex. It extends over a
distance of 180 A from its RNA binding domain, wraps
around the core domain of U1 snRNP consisting of the
seven Sm proteins and finally contacts U1-C, which is
crucial for 5'-splice-site recognition.
Length = 94
Score = 26.5 bits (59), Expect = 3.7
Identities = 16/47 (34%), Positives = 28/47 (59%), Gaps = 4/47 (8%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
+E+++ + E E E EK E +KREKKE+ E+K +++ +E
Sbjct: 45 SEFKDYKDEPPPEPTETWLEKR----EREKREKKEKLEKKLEEELKE 87
Score = 25.3 bits (56), Expect = 7.6
Identities = 13/40 (32%), Positives = 23/40 (57%)
Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
KD +E E E EK++ +K E+ E+ E++ + + KE
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87
>gnl|CDD|167284 PRK01833, tatA, twin arginine translocase protein A; Provisional.
Length = 74
Score = 26.0 bits (57), Expect = 3.8
Identities = 15/43 (34%), Positives = 24/43 (55%)
Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
D E K KK ++K KD + E+VE +E +K +E+E+
Sbjct: 31 DLGESVKGFKKAMADDKPKDAEFEKVEAKEAASTEQKAKEKEQ 73
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 27.3 bits (61), Expect = 3.8
Identities = 17/101 (16%), Positives = 35/101 (34%), Gaps = 25/101 (24%)
Query: 62 ARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEK------------ 109
A L ++ + +A+ T + E E E + ++E E+K
Sbjct: 108 ADLLALMEEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEKKSGMGPLLLVLAV 167
Query: 110 -------------KREKKEEEEEKQKDKKEEEVEEEEEEQK 137
+ K++E+ D E + +E+EE+
Sbjct: 168 ALIGGGAYYYFKFYKPKQQEKGAPDDDLDEYDYGDEDEEED 208
Score = 26.9 bits (60), Expect = 4.9
Identities = 10/42 (23%), Positives = 15/42 (35%)
Query: 97 EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKN 138
EEE+ E + + E EEE E+K+
Sbjct: 115 EEEDAAAPTCSCCPVCSVNMTECTGPEPEPEPEPEEEPEKKS 156
>gnl|CDD|239500 cd03406, Band_7_3, A subgroup of the band 7 domain of flotillin
(reggie) like proteins. This subgroup contains proteins
similar to stomatin, prohibitin, flotillin, HlfK/C and
podicin. Many of these band 7 domain-containing
proteins are lipid raft-associated. Individual proteins
of this band 7 domain family may cluster to form
membrane microdomains which may in turn recruit
multiprotein complexes. Microdomains formed from
flotillin proteins may in addition be dynamic units with
their own regulatory functions. Flotillins have been
implicated in signal transduction, vesicle trafficking,
cytoskeleton rearrangement and are known to interact
with a variety of proteins. Stomatin interacts with and
regulates members of the degenerin/epithelia Na+ channel
family in mechanosensory cells of Caenorhabditis elegans
and vertebrate neurons and participates in trafficking
of Glut1 glucose transporters. Prohibitin may act as a
chaperone for the stabilization of mitochondrial
proteins. Prokaryotic HflK/C plays a role in the
decision between lysogenic and lytic cycle growth during
lambda phage infection. Flotillins have been implicated
in the progression of prion disease, in the pathogenesis
of neurodegenerative diseases such as Parkinson's and
Alzheimer's disease and, in cancer invasion and
metastasis. Mutations in the podicin gene give rise to
autosomal recessive steroid resistant nephritic
syndrome.
Length = 280
Score = 27.4 bits (61), Expect = 3.8
Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ--KDKKEEEVEEEEEEQK 137
N E E EK K ++++ EK+ + E+K+ E E+ Q K ++V E+E E++
Sbjct: 164 NYELMEAEKTKLLIAIQKQKVVEKEAETERKKAVIEAEKVAQVAKILFGQKVMEKETEKR 223
Query: 138 NKKKEEEEEEEKEK 151
+ E+E +EK
Sbjct: 224 ISEIEDEAFLAREK 237
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 27.0 bits (60), Expect = 3.9
Identities = 5/45 (11%), Positives = 10/45 (22%)
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E + + KK++ E + E
Sbjct: 107 NAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAA 151
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 27.6 bits (62), Expect = 3.9
Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 5/66 (7%)
Query: 96 EEEEEKEKDEDEEKKREKKEEEEE-----KQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
+EE K E EE + E+ E +E K+ + EE+ E +E K K KE E ++
Sbjct: 35 DEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKLKELEAALDEL 94
Query: 151 KVEMYR 156
+ E+
Sbjct: 95 EAELDT 100
>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 27.1 bits (60), Expect = 3.9
Identities = 15/73 (20%), Positives = 30/73 (41%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E E++ + E E+ + D+ +K+ E+ K + E ++ E++
Sbjct: 40 EDAAAELAEKKAAAQSEIEKFCANIADAAADQLYALQKELLEKLKDLAEVNERLKALEKK 99
Query: 136 QKNKKKEEEEEEE 148
+ K EEE E
Sbjct: 100 KAELKDLEEEREG 112
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
partitioning].
Length = 324
Score = 27.5 bits (61), Expect = 3.9
Identities = 17/103 (16%), Positives = 42/103 (40%), Gaps = 10/103 (9%)
Query: 51 PAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKE-------- 102
P ++P A+A++ + +PQ S+ + + + ++E+ EE +
Sbjct: 86 PQHQYQPEYASAQIKIPVPQPPQISDPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQ 145
Query: 103 --KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ + + +Q+ + EE VE E ++ +K+
Sbjct: 146 PVHSAAPQPAVQTVQPAVPEQQVQPEEVVEPAPEVKRPPRKDT 188
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 27.6 bits (61), Expect = 3.9
Identities = 15/48 (31%), Positives = 28/48 (58%)
Query: 97 EEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+E +K++ +E+ + E++ +KDKK+EE E+ EE K + E
Sbjct: 3 DESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAMLKGDLE 50
Score = 26.4 bits (58), Expect = 8.2
Identities = 11/44 (25%), Positives = 27/44 (61%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE 127
+E +K+++ +E+ + EK +K ++K+EEE+ ++D +
Sbjct: 3 DESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAMLK 46
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 27.7 bits (61), Expect = 4.0
Identities = 16/65 (24%), Positives = 33/65 (50%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+E + + ++E+ + E ++ + EE E + +Q + EE+E ++KEE
Sbjct: 724 DEPKLDAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDESSEEEKEE 783
Query: 144 EEEEE 148
EE +E
Sbjct: 784 EENKE 788
Score = 27.3 bits (60), Expect = 4.3
Identities = 15/40 (37%), Positives = 22/40 (55%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
+ ++ EE E EEE+EEEE KE KK+++K
Sbjct: 762 DNSDSFAESSEEDESSEEEKEEEENKEVSAKRAKKKQRKN 801
Score = 27.3 bits (60), Expect = 4.6
Identities = 15/73 (20%), Positives = 40/73 (54%), Gaps = 4/73 (5%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEE----EKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
+ +++ + + E ++ ++EE +E + ++ + E EE+E +++K+EEE +E
Sbjct: 729 DAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDESSEEEKEEEENKE 788
Query: 132 EEEEQKNKKKEEE 144
++ KK+ +
Sbjct: 789 VSAKRAKKKQRKN 801
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 27.8 bits (61), Expect = 4.0
Identities = 18/65 (27%), Positives = 35/65 (53%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
E LE E E E+E+ E E E E+E+ E E +R++ E + + +++ + E +
Sbjct: 461 ERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLERDRL 520
Query: 136 QKNKK 140
+K ++
Sbjct: 521 EKARR 525
Score = 27.4 bits (60), Expect = 4.6
Identities = 17/64 (26%), Positives = 34/64 (53%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E E+EE E E E E+ E E +RE+ E E ++ + + ++ E E+ ++ + +
Sbjct: 461 ERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLERDRL 520
Query: 146 EEEK 149
E+ +
Sbjct: 521 EKAR 524
Score = 26.6 bits (58), Expect = 7.5
Identities = 19/70 (27%), Positives = 39/70 (55%), Gaps = 2/70 (2%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
E+ + E+E + E E+K+ E+ E EE ++ ++ E +E E E+ +++ E E
Sbjct: 436 ERARIEKENAHRKALEMKILEKKRIERLEREERERLER--ERMERIERERLERERLERER 493
Query: 147 EEKEKVEMYR 156
E++++E R
Sbjct: 494 LERDRLERDR 503
Score = 26.6 bits (58), Expect = 8.1
Identities = 15/64 (23%), Positives = 35/64 (54%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E+K E E EE E E++ E +RE+ E E +++ + + +E + ++ +++ +
Sbjct: 456 EKKRIERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRL 515
Query: 146 EEEK 149
E ++
Sbjct: 516 ERDR 519
Score = 26.6 bits (58), Expect = 8.5
Identities = 19/71 (26%), Positives = 38/71 (53%), Gaps = 1/71 (1%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE-EEVEEEEEEQKNKKKEEEE 145
EKE + E + EK E +RE++E E ++ ++ E E +E E E++ +++ E
Sbjct: 441 EKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERERLERDRLE 500
Query: 146 EEEKEKVEMYR 156
+ +++E R
Sbjct: 501 RDRLDRLERER 511
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 26.7 bits (59), Expect = 4.0
Identities = 12/36 (33%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEE--EQKNKKKE 142
K+E+++EEE + K+EE + E + +Q+N K+
Sbjct: 95 SKKEEEKEEEIPEPTKEEELLGEIRDLLKQQNSSKD 130
Score = 25.6 bits (56), Expect = 9.8
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 88 KEKEEEEEEEEEEKEKDED 106
+KEEE+EEE E K+E+
Sbjct: 95 SKKEEEKEEEIPEPTKEEE 113
>gnl|CDD|130844 TIGR01784, T_den_put_tspse, conserved hypothetical protein
(putative transposase or invertase). Several lines of
evidence suggest that members of this family (loaded as
a fragment mode model to find part-length matches) are
associated with transposition, inversion, or
recombination. Members are found in small numbers of
genomes, but in large copy numbers in many of those
species, including over 30 full length and fragmentary
members in Treponema denticola. The strongest
similarities are usually within rather than between
species. PSI-BLAST shows similarity to proteins
designated as possible transposases, DNA invertases
(resolvases), and recombinases. In the oral pathogenic
spirochete Treponema denticola, full-length members are
often found near transporters or other membrane
proteins. This family includes members of the putative
transposase family pfam04754.
Length = 270
Score = 27.4 bits (61), Expect = 4.2
Identities = 16/89 (17%), Positives = 34/89 (38%), Gaps = 4/89 (4%)
Query: 70 QAISASETLENNEWEEEEKEKEEE----EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKK 125
I + W K K+ + + +K E+E K +E E + ++
Sbjct: 154 AIIDERLLDVLDRWLLLLKGKDNQSVNLIFLQIPFIQKAEEEIKTLLLSSKELELYEQRE 213
Query: 126 EEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
+ + E++ K++ EE + K+E
Sbjct: 214 KYMTDAISAEEEGKEEGIEEGILEAKLET 242
>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator. This
family of trans-activating transcriptional regulator
(TATR), also known as intermediate early protein 1, are
common to the Nucleopolyhedroviruses.
Length = 571
Score = 27.6 bits (61), Expect = 4.2
Identities = 25/126 (19%), Positives = 42/126 (33%), Gaps = 10/126 (7%)
Query: 36 NVSLTDDAEFQCQVGPAQYHKPIRANARLTVILPQAI-SASETLENNEWEEEEKEKEEEE 94
N SLT A + +N L +PQ S E+ + E
Sbjct: 63 NASLTFFALYIP---------LTDSNKPLPRAVPQDYERESAVSPIVEYPQALSPGESLP 113
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
E E K+ + K E + + K ++ V + ++ QK KK + + E
Sbjct: 114 ESLSETKQVTVSKRKGSELDSDSDSSDSSKGKKLVNKPKKRQKYKKATIQSPTSLTEEEN 173
Query: 155 YRNPVP 160
Y +
Sbjct: 174 YNTEIC 179
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 27.4 bits (61), Expect = 4.2
Identities = 9/55 (16%), Positives = 22/55 (40%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
E E+ E E++ ++ K + +++ K ++ E + K +E
Sbjct: 174 AAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEE 228
>gnl|CDD|218954 pfam06246, Isy1, Isy1-like splicing family. Isy1 protein is
important in the optimisation of splicing.
Length = 253
Score = 27.3 bits (61), Expect = 4.3
Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 11/76 (14%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV--------- 129
E+ E E E+E+E E+ E+ K+ +E + +++E EE +++
Sbjct: 173 EDGVLLEYEAEREKELIEKLVEEWKEIQDEDLQAEEDEREEDIYIPLDDDAEFVAHVPVP 232
Query: 130 --EEEEEEQKNKKKEE 143
EE EE K+K E
Sbjct: 233 TQEEVEEWLLEKRKRE 248
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 27.6 bits (61), Expect = 4.3
Identities = 11/81 (13%), Positives = 29/81 (35%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
+ E + N + + E+ E E+ E R+ + E++ + ++ +E
Sbjct: 581 RSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQE 640
Query: 133 EEEQKNKKKEEEEEEEKEKVE 153
+ + +E+V
Sbjct: 641 LALKLTALHALQLTLTQERVR 661
>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding. This region
tends to appear at the N-terminus of proteins also
containing DNA-binding HMG (high mobility group) boxes
(pfam00505) and appears to bind the armadillo repeat of
CTNNB1 (beta-catenin), forming a stable complex.
Signaling by Wnt through TCF/LCF is involved in
developmental patterning, induction of neural tissues,
cell fate decisions and stem cell differentiation.
Isoforms of HMG T-cell factors lacking the N-terminal
CTNNB1-binding domain cannot fulfill their role as
transcriptional activators in T-cell differentiation.
Length = 200
Score = 27.2 bits (60), Expect = 4.4
Identities = 15/66 (22%), Positives = 25/66 (37%), Gaps = 1/66 (1%)
Query: 103 KDE-DEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPM 161
KDE ++E+K + E D K V E E + + + + + E YR+
Sbjct: 22 KDEGEQEEKISENVGAERDLADVKSSLVNESESSESSSSEAISRRDHPPRYESYRDKSRE 81
Query: 162 YRNPVP 167
V
Sbjct: 82 KLEDVS 87
>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
and repair].
Length = 1187
Score = 27.3 bits (61), Expect = 4.4
Identities = 9/33 (27%), Positives = 14/33 (42%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKD 104
I L + E +E+ E E E E+K +
Sbjct: 243 IKLRRKLYGEKRAERVREELREVEREREKKRRK 275
>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D. SpoVID, the
stage VI sporulation protein D, is restricted to
endospore-forming members of the bacteria, all of which
are found among the Firmicutes. It is widely distributed
but not quite universal in this group. Between
well-conserved N-terminal and C-terminal domains is a
poorly conserved, low-complexity region of variable
length, rich enough in glutamic acid to cause spurious
BLAST search results unless a filter is used. The seed
alignment for this model was trimmed, in effect, by
choosing member sequences in which these regions are
relatively short. SpoVID is involved in spore coat
assembly by the mother cell compartment late in the
process of sporulation [Cellular processes, Sporulation
and germination].
Length = 338
Score = 27.2 bits (60), Expect = 4.4
Identities = 22/72 (30%), Positives = 40/72 (55%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE 131
ISASE + E E+E EE E+E + + EDEEK +++ E+ +++ + +
Sbjct: 202 ISASEPFQLESEVEASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALAGDAKKA 261
Query: 132 EEEEQKNKKKEE 143
EEE++ ++ E
Sbjct: 262 LEEEEEKGERPE 273
Score = 26.8 bits (59), Expect = 6.3
Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 3/83 (3%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE 130
A + + + E + E E E EEE E+ EDE E + E+EEK D++ E+ +
Sbjct: 193 ADDDPDEWKISASEPFQLESEVEASPEEENYEEYEDET---ELEVEDEEKALDEQTEDPQ 249
Query: 131 EEEEEQKNKKKEEEEEEEKEKVE 153
+E+ + KK EEEEEK +
Sbjct: 250 QEDALAGDAKKALEEEEEKGERP 272
>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD. YqfD is part of
the sigma-E regulon in the sporulation program of
endospore-forming Gram-positive bacteria. Mutation
results in a sporulation defect in Bacillus subtilis.
Members are found in all currently known
endospore-forming bacteria, including the genera
Bacillus, Symbiobacterium, Carboxydothermus,
Clostridium, and Thermoanaerobacter [Cellular processes,
Sporulation and germination].
Length = 382
Score = 27.3 bits (61), Expect = 4.6
Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 7/69 (10%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEE 146
KE E +E+ E+ K+E EK +EK E+E +K+ D + V + K E E
Sbjct: 312 VKETYYEVKEKVEKVTKEEAIEKAKEKAEKELKKELDPNAKIVSD-------KILSERVE 364
Query: 147 EEKEKVEMY 155
K KV ++
Sbjct: 365 GGKVKVTVH 373
>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2). Surfeit locus
protein 2 is part of a group of at least six sequence
unrelated genes (Surf-1 to Surf-6). The six Surfeit
genes have been classified as housekeeping genes, being
expressed in all tissue types tested and not containing
a TATA box in their promoter region. The exact function
of SURF2 is unknown.
Length = 244
Score = 27.2 bits (60), Expect = 4.7
Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 77 TLENNEWEEEEKEKEEEEEEEEEEKEKDEDE-EKKREKKEEEEEKQKDKK 125
TL+N E+ E ++ E ++E+E E + E ++KR KK+ +K KK
Sbjct: 181 TLKNPGKEQNGDEDDDFETDDEDEMEVESPELQQKRSKKQSGSLTKKFKK 230
>gnl|CDD|220113 pfam09111, SLIDE, SLIDE. The SLIDE domain adopts a secondary
structure comprising a main core of three alpha-helices.
It has a role in DNA binding, contacting DNA target
sites similar to c-Myb (pfam00249) repeats or
homeodomains.
Length = 118
Score = 26.4 bits (59), Expect = 4.8
Identities = 9/27 (33%), Positives = 19/27 (70%)
Query: 132 EEEEQKNKKKEEEEEEEKEKVEMYRNP 158
E E+K +K++ +++ + K+E Y+NP
Sbjct: 9 ERGEEKIEKRKNQQKALRRKLEQYKNP 35
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 27.2 bits (60), Expect = 4.8
Identities = 17/72 (23%), Positives = 34/72 (47%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E ++ K+K EE++ +K E K +K + + KK +V++ + + KK
Sbjct: 3 ETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKK 62
Query: 142 EEEEEEEKEKVE 153
+ E+ E V+
Sbjct: 63 VTVKFEKTESVK 74
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 27.0 bits (60), Expect = 4.9
Identities = 16/72 (22%), Positives = 46/72 (63%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
+ E +EE +K E + ++ K+++++++++++++ +++EEE++ +K EE ++
Sbjct: 169 AKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQKWM 228
Query: 150 EKVEMYRNPVPM 161
+ V+ PVP+
Sbjct: 229 KNVKQRPKPVPL 240
Score = 27.0 bits (60), Expect = 5.5
Identities = 15/49 (30%), Positives = 31/49 (63%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
+ EE++++E EKE++E E ++R KE+ EE + K ++ ++ + K
Sbjct: 106 KLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHK 154
Score = 26.2 bits (58), Expect = 8.4
Identities = 13/61 (21%), Positives = 40/61 (65%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ ++K ++ EE++++E+EK+ +E + R++ +E+ ++ +++ + ++ K+KK+
Sbjct: 99 QRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAA 158
Query: 144 E 144
E
Sbjct: 159 E 159
>gnl|CDD|118278 pfam09746, Membralin, Tumour-associated protein. Membralin is
evolutionarily highly conserved; though it seems to
represent a unique protein family. The protein appears
to contain several transmembrane regions. In humans it
is expressed in certain cancers, particularly ovarian
cancers. Membralin-like gene homologues have been
identified in plants including grape, cotton and tomato.
Length = 375
Score = 27.1 bits (60), Expect = 4.9
Identities = 7/56 (12%), Positives = 19/56 (33%), Gaps = 8/56 (14%)
Query: 120 KQKDKKEEEVEEEEEEQKNKKK-----EEEEEEEKEKVEMYRNP---VPMYRNPVP 167
+++ ++E + K + + E +E Y+N + + P
Sbjct: 88 EKRAIWLGIKRDQEIVRSLKDSYYYGIGPQTRQNHETLERYQNILGKLGLPVRPTF 143
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 27.2 bits (60), Expect = 5.0
Identities = 26/76 (34%), Positives = 42/76 (55%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
E E EEE++ EEEEEE EEE+E D +++ + K K+ +E E E ++K
Sbjct: 2 TEVLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEK 61
Query: 138 NKKKEEEEEEEKEKVE 153
++E + + E+ K E
Sbjct: 62 ALEEEAKRKAEERKRE 77
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 26.7 bits (59), Expect = 5.0
Identities = 21/78 (26%), Positives = 38/78 (48%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
+ E +K ++ E+ E K ++KK ++++ K+ KK++E EE E +
Sbjct: 26 STTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEEL 85
Query: 141 KEEEEEEEKEKVEMYRNP 158
+ EE EE +K Y P
Sbjct: 86 SDSEENEENDKKVDYELP 103
Score = 26.3 bits (58), Expect = 7.7
Identities = 10/46 (21%), Positives = 24/46 (52%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+E+E+ E EE + +E +E+++K + + + + E+V
Sbjct: 72 KEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEVNHEDV 117
Score = 25.9 bits (57), Expect = 9.1
Identities = 14/42 (33%), Positives = 27/42 (64%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
KK +KK++E+E+ + EE+ + EE ++N KK + E + +
Sbjct: 65 KKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQ 106
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 26.5 bits (59), Expect = 5.0
Identities = 18/64 (28%), Positives = 36/64 (56%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E+ KEE + E +++ E+ ++ + E+ +K+ ++ EE++ E EE+ + KE
Sbjct: 47 AEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEA 106
Query: 144 EEEE 147
E E
Sbjct: 107 AEAE 110
>gnl|CDD|145361 pfam02161, Prog_receptor, Progesterone receptor.
Length = 456
Score = 27.3 bits (60), Expect = 5.1
Identities = 10/34 (29%), Positives = 18/34 (52%)
Query: 93 EEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
+ +EE +EK +D+ E+ E E E + + E
Sbjct: 54 QGQEEPDEKTQDQQSLSDVERAEPEVEASRGRGE 87
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 27.3 bits (60), Expect = 5.1
Identities = 11/55 (20%), Positives = 26/55 (47%)
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
+ +E+ +E++K +D + ++ K E+ + +V K KK+E+
Sbjct: 220 DKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEK 274
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 26.5 bits (59), Expect = 5.2
Identities = 24/59 (40%), Positives = 35/59 (59%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
EE +KE+EEE+EE+E+E E DED KK + + + E+E++K KKKE
Sbjct: 28 EEYDKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKE 86
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 26.5 bits (59), Expect = 5.3
Identities = 12/34 (35%), Positives = 24/34 (70%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
E+ E + +EEK+ DE+E K++ ++ +E E++K
Sbjct: 91 ERAEAQRARDEEKKLDEEEAKRQHEEAKEREREK 124
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 27.0 bits (59), Expect = 5.3
Identities = 14/51 (27%), Positives = 28/51 (54%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E+ E E E++ +E + EK E+E + K++ V EE ++ +E++
Sbjct: 349 EDIEYETEENREEGTQAEKYEQEIARLKERLRVSVRRLEEYERRLLGQEQQ 399
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 27.0 bits (59), Expect = 5.3
Identities = 14/60 (23%), Positives = 29/60 (48%), Gaps = 3/60 (5%)
Query: 109 KKREKK---EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMYRNP 165
K++EK+ E+E ++Q ++ + + E+ E+Q + EE +E + V P
Sbjct: 96 KRKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPTQPAGVNVANNPQP 155
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 27.0 bits (60), Expect = 5.4
Identities = 18/83 (21%), Positives = 28/83 (33%), Gaps = 2/83 (2%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E E E E E E ++ K E K + + K K K ++VEE+ + + K E
Sbjct: 69 EPVVEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPK--REVKPVEP 126
Query: 145 EEEEKEKVEMYRNPVPMYRNPVP 167
+ P
Sbjct: 127 RPASPFENTAPARPTSSTATAAA 149
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 26.8 bits (59), Expect = 5.5
Identities = 14/68 (20%), Positives = 36/68 (52%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E +EE + + K+K E+E K++ + E+ ++E+ +E + ++K
Sbjct: 81 EAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLA 140
Query: 144 EEEEEKEK 151
E+ +++++
Sbjct: 141 EDSDDRQE 148
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 26.4 bits (58), Expect = 5.5
Identities = 15/49 (30%), Positives = 29/49 (59%)
Query: 112 EKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVP 160
+ +EE+ + Q ++ EE+ EE E+ + + +E+EE EE E+ + P
Sbjct: 18 KDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAP 66
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 26.4 bits (59), Expect = 5.5
Identities = 15/45 (33%), Positives = 26/45 (57%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
E E + E EE+EK++ + E ++R+ EE K D ++ EE +
Sbjct: 77 EAERKAEREEDEKKRLKRENERRKALGEEPLKSLDDLPKDKEEPD 121
>gnl|CDD|223642 COG0568, RpoD, DNA-directed RNA polymerase, sigma subunit
(sigma70/sigma32) [Transcription].
Length = 342
Score = 26.9 bits (60), Expect = 5.6
Identities = 11/73 (15%), Positives = 23/73 (31%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
+ + E ++ E+E+ E E + K E ++ ++
Sbjct: 23 LLVREAEVELAKQLEDEQLLVELGEDLTDLKLGREPSERARRPAGRLSFYIRAIEAAPLL 82
Query: 144 EEEEEKEKVEMYR 156
EEEK +
Sbjct: 83 TPEEEKALARRLK 95
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 27.1 bits (60), Expect = 5.6
Identities = 20/64 (31%), Positives = 31/64 (48%), Gaps = 9/64 (14%)
Query: 110 KREKKEE-------EEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRNPVPMY 162
KR+ KEE E + K+E + + K +KK EE++E ++VE R+ P
Sbjct: 3 KRKIKEEMLQILAPEIYGPRPVKDEAKPRKIKRVKKRKKREEKDELDDEVEFVRSFAP-- 60
Query: 163 RNPV 166
R V
Sbjct: 61 RRRV 64
>gnl|CDD|184539 PRK14150, PRK14150, heat shock protein GrpE; Provisional.
Length = 193
Score = 26.4 bits (59), Expect = 5.7
Identities = 16/56 (28%), Positives = 28/56 (50%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
S E E EE E E E++EE +E E +++ ++ + E E + + + EE
Sbjct: 2 SNEEQKTPQEQVSEEIEAEVEQQEEADEAELEDELDEADARIAELEAQLAEAQAEE 57
Score = 26.0 bits (58), Expect = 9.3
Identities = 16/59 (27%), Positives = 33/59 (55%), Gaps = 2/59 (3%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
EE+K +E+ EE E + E +E+ E + E+E + D + E+E + E + ++++
Sbjct: 3 NEEQKTPQEQVSEEIEAEV--EQQEEADEAELEDELDEADARIAELEAQLAEAQAEERD 59
>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive
type) [DNA replication, recombination, and repair].
Length = 1444
Score = 27.3 bits (61), Expect = 5.7
Identities = 21/94 (22%), Positives = 39/94 (41%), Gaps = 3/94 (3%)
Query: 58 IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEE 117
+ +L+V P L + + + E+ E + +++ + +KK +
Sbjct: 99 KKIIEQLSVKSP---IFKSLLNKLKLKVKGNNILIEQVLNNPEFDHFKNKSPELQKKLQS 155
Query: 118 EEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEK 151
+ E EV + EEQ+ +K EE EE EK
Sbjct: 156 FGFPQLLIEFEVNDISEEQEFEKFEEAINEEVEK 189
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 26.7 bits (59), Expect = 5.7
Identities = 24/93 (25%), Positives = 46/93 (49%), Gaps = 5/93 (5%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEV 129
+A A LE E EE+ K +E+ ++E++ +E R++ ++E E+Q+ + EE +
Sbjct: 76 EAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEELL 135
Query: 130 EEEEE-----EQKNKKKEEEEEEEKEKVEMYRN 157
+ +EE E + EEE E + +
Sbjct: 136 KMQEESVLRQEAMRRATEEEILEMRRETIEEEA 168
>gnl|CDD|206307 pfam14138, COX16, Cytochrome c oxidase assembly protein COX16.
This family represents homologues of COX16 which has
been shown to be involved in assembly of cytochrome
oxidase. Protein in this family are typically between
106 and 134 amino acids in length.
Length = 79
Score = 25.6 bits (57), Expect = 5.8
Identities = 12/41 (29%), Positives = 23/41 (56%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
+K ++ EEEE + KK +V+ EEE + + + ++ E
Sbjct: 31 RKVQQLEEEEALKLLKKRRKVDLEEEYYRLQGLDIDDWENV 71
>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
modification].
Length = 429
Score = 26.9 bits (59), Expect = 5.9
Identities = 15/71 (21%), Positives = 28/71 (39%), Gaps = 2/71 (2%)
Query: 79 ENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEE--EEEEQ 136
E E E + E EE +EE+ + E E+ ++ + E +E+
Sbjct: 351 ETAEGELLQFEAEEIYNDEEQYTRPVYENEDFEERVDDVSNGGLAEFAPAAVEVAPKEDA 410
Query: 137 KNKKKEEEEEE 147
K K+ + + E
Sbjct: 411 KVTKRGKAKRE 421
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 25.7 bits (57), Expect = 5.9
Identities = 10/20 (50%), Positives = 16/20 (80%)
Query: 87 EKEKEEEEEEEEEEKEKDED 106
E ++EE++EEEE+E D+D
Sbjct: 78 AAEAKKEEKKEEEEEESDDD 97
Score = 25.7 bits (57), Expect = 6.5
Identities = 9/17 (52%), Positives = 15/17 (88%)
Query: 84 EEEEKEKEEEEEEEEEE 100
E +++EK+EEEEEE ++
Sbjct: 80 EAKKEEKKEEEEEESDD 96
Score = 25.7 bits (57), Expect = 6.7
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 85 EEEKEKEEEEEEEEEEKEKD 104
E +KEE++EEEEEE + D
Sbjct: 78 AAEAKKEEKKEEEEEESDDD 97
Score = 25.4 bits (56), Expect = 7.9
Identities = 14/32 (43%), Positives = 21/32 (65%), Gaps = 1/32 (3%)
Query: 69 PQAISASETLENNEWEEEEKEKEEEEEEEEEE 100
P A +A+ E ++EEK KEEEEEE +++
Sbjct: 67 PAAAAAAAAAAAAEAKKEEK-KEEEEEESDDD 97
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 26.6 bits (59), Expect = 6.1
Identities = 21/69 (30%), Positives = 42/69 (60%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEE 144
E+K+ + +++E + +E DEDE E KEEEE+ + D ++E ++E+E+ +
Sbjct: 95 TEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILI 154
Query: 145 EEEEKEKVE 153
E++E ++ E
Sbjct: 155 EDDEVDEDE 163
>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1. This is
the N-terminal 200 residues of a set of proteins
conserved from yeasts to humans. Most of the proteins in
this entry have an RhoGEF pfam00621 domain at their
C-terminal end.
Length = 158
Score = 26.2 bits (58), Expect = 6.2
Identities = 26/91 (28%), Positives = 48/91 (52%), Gaps = 3/91 (3%)
Query: 67 ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
++P I E L++ E +E E E E E+ E+ + E REK+ ++ +QK +
Sbjct: 27 LMPLVIRVLENLDSLASENQELEVELELLREDNERLSTQYE---REKELRKQAEQKLLEF 83
Query: 127 EEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
E+ E+E+++ KK E+ EE ++ +N
Sbjct: 84 EDELEQEKKELQKKIEDLEENVRQLELKAKN 114
>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3). This family
consists of several optic atrophy 3 (OPA3) proteins.
OPA3 deficiency causes type III 3-methylglutaconic
aciduria (MGA) in humans. This disease manifests with
early bilateral optic atrophy, spasticity,
extrapyramidal dysfunction, ataxia, and cognitive
deficits, but normal longevity.
Length = 134
Score = 26.1 bits (58), Expect = 6.3
Identities = 10/33 (30%), Positives = 22/33 (66%)
Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
+E K+E++ ++E ++ + + E+E E E QK +
Sbjct: 102 KEAKKEEELQQELEELEARVGELELEIERQKAR 134
>gnl|CDD|184060 PRK13453, PRK13453, F0F1 ATP synthase subunit B; Provisional.
Length = 173
Score = 26.4 bits (58), Expect = 6.3
Identities = 16/68 (23%), Positives = 38/68 (55%)
Query: 80 NNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNK 139
N + ++ E+ K ++ EEE K+K ++ +++ +K E+ + Q +++E++ E + N
Sbjct: 55 NRDIDDAEQAKLNAQKLEEENKQKLKETQEEVQKILEDAKVQARQQQEQIIHEANVRANG 114
Query: 140 KKEEEEEE 147
E + E
Sbjct: 115 MIETAQSE 122
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 26.1 bits (58), Expect = 6.3
Identities = 24/68 (35%), Positives = 39/68 (57%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEEK ++E ++ KEK + E+K+ EKK E E+ K EE+ E E E+ + + +E
Sbjct: 2 PEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEEL 61
Query: 144 EEEEEKEK 151
E E + +
Sbjct: 62 EAELARRE 69
Score = 26.1 bits (58), Expect = 6.6
Identities = 21/63 (33%), Positives = 39/63 (61%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
E EEE+ ++E +K +EK + +K++EE+K + +K ++ EE+ + +K E+E EE
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60
Query: 149 KEK 151
E
Sbjct: 61 LEA 63
>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
protein [Cell motility and secretion / Intracellular
trafficking and secretion].
Length = 234
Score = 26.6 bits (59), Expect = 6.3
Identities = 24/88 (27%), Positives = 38/88 (43%), Gaps = 1/88 (1%)
Query: 63 RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEE-EKEKDEDEEKKREKKEEEEEKQ 121
R + Q I + EK EEEE E+ E +++E E +E +E EE
Sbjct: 9 RDDALPFQVIEFKDLANAGPKVAPEKAVLEEEELEQALEAKEEELESAAQELQEGIEEGA 68
Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEK 149
++ EE + EE + +EE E+
Sbjct: 69 REGYEEGFQLGYEEGFEEGQEEGRVLER 96
Score = 26.2 bits (58), Expect = 9.1
Identities = 17/69 (24%), Positives = 31/69 (44%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
E ++ + E+ ++E+ E+ E KEEE E + +E +EE E + +
Sbjct: 18 IEFKDLANAGPKVAPEKAVLEEEELEQALEAKEEELESAAQELQEGIEEGAREGYEEGFQ 77
Query: 143 EEEEEEKEK 151
EE E+
Sbjct: 78 LGYEEGFEE 86
>gnl|CDD|236356 PRK08944, motB, flagellar motor protein MotB; Reviewed.
Length = 302
Score = 26.9 bits (60), Expect = 6.3
Identities = 12/69 (17%), Positives = 32/69 (46%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+ ++ E + E + +++ +++ E + K +K E+ E+ K ++ E+
Sbjct: 98 DMTQQTLEFQAGESDSAGGTQQQRGKQRGGEASAEVKAEKAAAQEQIEDLVKKLAQQLEK 157
Query: 146 EEEKEKVEM 154
E E +E+
Sbjct: 158 EIEDGAIEI 166
Score = 26.1 bits (58), Expect = 9.7
Identities = 15/74 (20%), Positives = 36/74 (48%), Gaps = 1/74 (1%)
Query: 64 LTVILPQAISASE-TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
+ VI+ Q I ++ TLE E + +++ ++ E + + ++ +E+ E
Sbjct: 89 IEVIMQQTIDMTQQTLEFQAGESDSAGGTQQQRGKQRGGEASAEVKAEKAAAQEQIEDLV 148
Query: 123 DKKEEEVEEEEEEQ 136
K +++E+E E+
Sbjct: 149 KKLAQQLEKEIEDG 162
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 26.9 bits (60), Expect = 6.3
Identities = 14/34 (41%), Positives = 23/34 (67%)
Query: 103 KDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQ 136
K + EK+RE++EE EE+ + EE +EE ++Q
Sbjct: 95 KSGELEKRRERREEAEEELEKAIEEGDDEEIKKQ 128
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 27.1 bits (60), Expect = 6.3
Identities = 16/72 (22%), Positives = 37/72 (51%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E E EE+ ++ ++EEE + DE E+K + +K++ + + +++ +
Sbjct: 399 EDEAEEEARQRLADDEEEVRAGDEAREEKIAANSQVIDKEEVCNLYDRRDTAWQKQRFLR 458
Query: 142 EEEEEEEKEKVE 153
E++ E++K E
Sbjct: 459 EKQTAFERQKTE 470
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 26.8 bits (59), Expect = 6.4
Identities = 14/63 (22%), Positives = 33/63 (52%), Gaps = 3/63 (4%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
+EK E E+ E +++ K + E RE+K +EE ++ + E + + + + +++
Sbjct: 9 REKLEREQRERKQRAKLKLE---RERKAKEEAAKQREAIEAAQRSRRLDAIEAQIKADQQ 65
Query: 148 EKE 150
+E
Sbjct: 66 MQE 68
Score = 26.4 bits (58), Expect = 10.0
Identities = 12/42 (28%), Positives = 25/42 (59%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
E + +E+ E E+ E K++ + + ++ K +EE KQ++ E
Sbjct: 4 ELRRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIE 45
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 25.8 bits (57), Expect = 6.4
Identities = 18/49 (36%), Positives = 24/49 (48%)
Query: 58 IRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDED 106
I+ A A +A+ E E E EEEE+EEE E+E D+D
Sbjct: 54 IKNAAEAPAAAAAAGAAAAAAAGAEAAAEADEAEEEEKEEEAEEESDDD 102
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 26.6 bits (59), Expect = 6.4
Identities = 9/43 (20%), Positives = 20/43 (46%)
Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKE 150
E+KR+++ K+K K + +E ++ K + K+
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198
>gnl|CDD|173635 cd05054, PTKc_VEGFR, Catalytic domain of the Protein Tyrosine
Kinases, Vascular Endothelial Growth Factor Receptors.
Protein Tyrosine Kinase (PTK) family; Vascular
Endothelial Growth Factor Receptor (VEGFR) subfamily;
catalytic (c) domain. The VEGFR subfamily consists of
VEGFR1 (Flt1), VEGFR2 (Flk1), VEGFR3 (Flt4), and similar
proteins. The PTKc family is part of a larger
superfamily that includes the catalytic domains of other
kinases such as protein serine/threonine kinases, RIO
kinases, and phosphoinositide 3-kinase (PI3K). PTKs
catalyze the transfer of the gamma-phosphoryl group from
ATP to tyrosine (tyr) residues in protein substrates.
VEGFR subfamily members are receptor tyr kinases (RTKs)
containing an extracellular ligand-binding region with
seven immunoglobulin (Ig)-like domains, a transmembrane
segment, and an intracellular catalytic domain. In
VEGFR3, the fifth Ig-like domain is replaced by a
disulfide bridge. The binding of VEGFRs to their
ligands, the VEGFs, leads to receptor dimerization,
activation, and intracellular signaling. There are five
VEGF ligands in mammals, which bind, in an overlapping
pattern to the three VEGFRs, which can form homo or
heterodimers. VEGFRs regulate the cardiovascular system.
They are critical for vascular development during
embryogenesis and blood vessel formation in adults. They
induce cellular functions common to other growth factor
receptors such as cell migration, survival, and
proliferation. VEGFR1 binds VEGFA, VEGFB, and placenta
growth factor (PLGF). It regulates monocyte and
macrophage migration, vascular permeability,
haematopoiesis, and the recruitment of haematopietic
progenitor cells from the bone marrow.
Length = 337
Score = 26.7 bits (59), Expect = 6.5
Identities = 9/34 (26%), Positives = 23/34 (67%)
Query: 71 AISASETLENNEWEEEEKEKEEEEEEEEEEKEKD 104
++S+S++ ++ + E++ + EE+EE +E K+
Sbjct: 135 SVSSSQSSASSGFIEDKSLSDVEEDEEGDELYKE 168
>gnl|CDD|221288 pfam11882, DUF3402, Domain of unknown function (DUF3402). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 350 to 473 amino acids in length. This domain is
found associated with pfam07923.
Length = 402
Score = 26.5 bits (59), Expect = 6.7
Identities = 21/79 (26%), Positives = 35/79 (44%), Gaps = 4/79 (5%)
Query: 83 WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
WEE EK + E + E D + + KE + E E++ ++EE E+ + E
Sbjct: 25 WEEREKFMKYERGWDTESLVGDPLDISE-SVKELKLEMYTSLAEDQPKKEEIERLSTDSE 83
Query: 143 EEEEEEKEKVE-MYRNPVP 160
E ++VE Y +P
Sbjct: 84 EIPRL--DRVESFYSEALP 100
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 26.9 bits (60), Expect = 6.9
Identities = 25/69 (36%), Positives = 38/69 (55%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
E E++ + EE E+ E+ E E EK+E EE ++ E E E EE+ + + EE
Sbjct: 506 EAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEE 565
Query: 144 EEEEEKEKV 152
E EE +E+V
Sbjct: 566 EAEEAREEV 574
Score = 26.9 bits (60), Expect = 6.9
Identities = 18/65 (27%), Positives = 31/65 (47%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EE +E EE E E E E+ + + R ++ EE E++ ++ E + + N +
Sbjct: 355 EERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFL 414
Query: 144 EEEEE 148
EE E
Sbjct: 415 EELRE 419
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 26.8 bits (59), Expect = 6.9
Identities = 16/82 (19%), Positives = 40/82 (48%)
Query: 73 SASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
S +ET E E + ++ E E K E ++ E+E + ++ + E E
Sbjct: 646 SLAETQLKCMVESYESLELRAKDLEAEHKSLQEKISSLEDELEKERQNHQELIAKCRELE 705
Query: 133 EEEQKNKKKEEEEEEEKEKVEM 154
E+ ++ +++E ++ ++E+ ++
Sbjct: 706 EKIERAEQEENMQKLDEEEQKI 727
>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
Length = 621
Score = 26.6 bits (59), Expect = 6.9
Identities = 23/92 (25%), Positives = 45/92 (48%), Gaps = 13/92 (14%)
Query: 72 ISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR---------EKK-EEEEEKQ 121
I + TL +E E+ +E E+ E+KEK E + K EK+ +E ++K
Sbjct: 496 IQGASTLPK---DEVERMVKEAEKNAAEDKEKREKIDLKNQAESLCYQAEKQLKELKDKI 552
Query: 122 KDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
++K+E++E ++ + + + E K +E
Sbjct: 553 SEEKKEKIENLIKKLRQALQNDNYESIKSLLE 584
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 26.9 bits (59), Expect = 6.9
Identities = 18/70 (25%), Positives = 32/70 (45%), Gaps = 3/70 (4%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVE---EEEEEQKNKK 140
E++ E+E +E + E + D E+E + D +E + E ++ +
Sbjct: 454 EDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGGVGQA 513
Query: 141 KEEEEEEEKE 150
EEEEEEE+
Sbjct: 514 IEEEEEEERA 523
>gnl|CDD|223336 COG0258, Exo, 5'-3' exonuclease (including N-terminal domain of
PolI) [DNA replication, recombination, and repair].
Length = 310
Score = 26.5 bits (59), Expect = 7.1
Identities = 9/39 (23%), Positives = 15/39 (38%)
Query: 109 KKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
K + E E+ K +E+E+ +E Q E
Sbjct: 70 KPPTFRHELLEEYKANREKEMPDELAPQIPILTELLVAL 108
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
kDa subunit. This family contains several DNA-directed
RNA polymerase 19 kDa polypeptides. The Poxvirus
DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
DNA-template-directed extension of the 3'-end of an RNA
strand by one nucleotide at a time.
Length = 167
Score = 26.2 bits (58), Expect = 7.1
Identities = 14/62 (22%), Positives = 30/62 (48%), Gaps = 5/62 (8%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEE-----KQKDKKEEEVEEEEEEQKNK 139
E+ + + E ++++ +E +E+EE + + + E KQ + K E E E+ +
Sbjct: 2 EDSDDIIDYESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQSNYKIESASENIEDATSN 61
Query: 140 KK 141
K
Sbjct: 62 PK 63
>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex. Med21 has been
known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
Drosophila. The heterodimer of the two subunits Med7 and
Med21 appears to act as a hinge between the middle and
the tail regions of Mediator.
Length = 132
Score = 26.1 bits (58), Expect = 7.2
Identities = 14/39 (35%), Positives = 26/39 (66%), Gaps = 2/39 (5%)
Query: 115 EEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVE 153
EEE+ ++ + EEE+ E E E+ ++ +E+E+ +KVE
Sbjct: 87 EEEQLRRIKELEEELREVEAER--EEAVKEKEKLLKKVE 123
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 26.8 bits (59), Expect = 7.2
Identities = 15/68 (22%), Positives = 35/68 (51%)
Query: 88 KEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEE 147
K+ ++ E+ E +KE + +K+ K E + + ++E+E ++ + ++ EE
Sbjct: 111 KKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEER 170
Query: 148 EKEKVEMY 155
KE +E +
Sbjct: 171 FKEYLENF 178
>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus. This family contains the N
terminal region of several LUC7 protein homologues and
only contains eukaryotic proteins. LUC7 has been shown
to be a U1 snRNA associated protein with a role in
splice site recognition. The family also contains human
and mouse LUC7 like (LUC7L) proteins and human cisplatin
resistance-associated overexpressed protein (CROP).
Length = 252
Score = 26.6 bits (59), Expect = 7.3
Identities = 23/71 (32%), Positives = 37/71 (52%)
Query: 78 LENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQK 137
+ E +EE+ K + + EE E DE+ K + E E+ K + ++ +E EE K
Sbjct: 103 KQRLELTQEEQTKIAADSKAEELAELDEEIGKLLAEAEALGEEGKVDEAMKLMKEVEELK 162
Query: 138 NKKKEEEEEEE 148
KKKE E+ +E
Sbjct: 163 AKKKELEDSDE 173
>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423). This
family represents a conserved region approximately 500
residues long within a number of Arabidopsis thaliana
proteins of unknown function.
Length = 446
Score = 26.6 bits (59), Expect = 7.3
Identities = 29/126 (23%), Positives = 48/126 (38%), Gaps = 34/126 (26%)
Query: 68 LPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEE-----------------KK 110
L +A SA + + E EEK +E E + E +++K + +E K
Sbjct: 317 LKKARSALDACDR---ELEEKAREVSELKMERQRKKPQIDELESIVRLKQAEADMFQLKA 373
Query: 111 REKKEEEEEKQ------KDKKEEEVE--------EEEEEQKNKKKEEEEEEEKEKVEMYR 156
E + E E Q +K EEE E EE++ EE + +E+ +
Sbjct: 374 DEARREAERLQRIALAKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKLQEESHRDSDS 433
Query: 157 NPVPMY 162
+ M
Sbjct: 434 MKMRME 439
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 26.6 bits (59), Expect = 7.4
Identities = 18/69 (26%), Positives = 36/69 (52%)
Query: 81 NEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E E +E + + EE+ KDE E + + + +++ + E+E+ E EE + +
Sbjct: 64 LESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLE 123
Query: 141 KEEEEEEEK 149
KE E+ +E+
Sbjct: 124 KEIEDLKER 132
>gnl|CDD|233935 TIGR02577, cas_TM1794_Cmr2, CRISPR-associated protein Cas10/Cmr2,
subtype III-B. This model represent a Crm2 family of
the CRISPR-associated RAMP module, a set of six genes
recurring found together in prokaryotic genomes. This
gene cluster is found only in species with CRISPR
repeats, usually near the repeats themselves. Because
most of the six (but not this family) contain RAMP
domains, and because its appearance in a genome appears
to depend on other CRISPR-associated Cas genes, the set
is designated the CRISPR RAMP module. This protein,
typified by TM1794 from Thermotoga maritima, is
designated Crm2, for CRISPR RAMP Module protein 2
[Mobile and extrachromosomal element functions, Other].
Length = 483
Score = 26.6 bits (59), Expect = 7.4
Identities = 18/107 (16%), Positives = 37/107 (34%), Gaps = 6/107 (5%)
Query: 56 KPIRANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEE-----KEKDEDEEKK 110
R + L + + EK +E + ++E K+ +D +
Sbjct: 71 LEFRVKGSEEYAELPNL-FLAILPAKDEKILEKLQETIRIKIKDELAELLKKAVKDLIEG 129
Query: 111 REKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
+ EE K K+E +E E+ + E++E + +N
Sbjct: 130 PAVITDLEEGLKQKEEALKKELEKYADKRLSAPSLLEKEEVYKDVKN 176
>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK). This
family consists of several plant specific phytosulfokine
precursor proteins. Phytosulfokines, are active as
either a pentapeptide or a C-terminally truncated
tetrapeptide. These compounds were first isolated
because of their ability to stimulate cell division in
somatic embryo cultures of Asparagus officinalis.
Length = 74
Score = 25.0 bits (55), Expect = 7.7
Identities = 9/34 (26%), Positives = 17/34 (50%)
Query: 74 ASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDE 107
E+++ E E K +E+E E + + E+E
Sbjct: 20 TDTGHESSKIEAAESSKVDEDEVVSECEGEGEEE 53
>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B. Escherichia coli
stringent starvation protein B (SspB), is thought to
enhance the specificity of degradation of tmRNA-tagged
proteins by the ClpXP protease. The tmRNA tag, also
known as ssrA, is an 11-aa peptide added to the C
terminus of proteins stalled during translation, targets
proteins for degradation by ClpXP and ClpAP. SspB a
cytoplasmic protein that specifically binds to residues
1-4 and 7 of the tag. Binding of SspB enhances
degradation of tagged proteins by ClpX, and masks
sequence elements important for ClpA interactions,
inhibiting degradation by ClpA. However, more recent
work has cast doubt on the importance of SspB in
wild-type cells. SspB is encoded in an operon whose
synthesis is stimulated by carbon, amino acid, and
phosphate starvation. SspB may play a special role
during nutrient stress, for example by ensuring rapid
degradation of the products of stalled translation,
without causing a global increase in degradation of all
ClpXP substrates.
Length = 153
Score = 26.0 bits (58), Expect = 7.7
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 76 ETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
E E +E EEEE + ++ E+E+ + KD + K R
Sbjct: 111 EPEEADEEEEEEDDDADDSPEDEDPEPKDPPKTKGR 146
Score = 25.6 bits (57), Expect = 9.7
Identities = 12/40 (30%), Positives = 23/40 (57%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDK 124
E E+ EEEEEE+++ + EDE+ + + + + + K
Sbjct: 111 EPEEADEEEEEEDDDADDSPEDEDPEPKDPPKTKGRPSLK 150
>gnl|CDD|220821 pfam10602, RPN7, 26S proteasome subunit RPN7. RPN7 (known as the
non ATPase regulatory subunit 6 in higher eukaryotes) is
one of the lid subunits of the 26S proteasome and has
been shown in Saccharomyces cerevisiae to be required
for structural integrity. The 26S proteasome is is
involved in the ATP-dependent degradation of
ubiquitinated proteins.
Length = 174
Score = 26.0 bits (58), Expect = 7.7
Identities = 12/34 (35%), Positives = 19/34 (55%)
Query: 99 EEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
+E +E +K E+ EE + + KD KE +EE
Sbjct: 1 DEAWVEELRKKNEEELEELDAELKDAKENLGKEE 34
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 26.5 bits (59), Expect = 7.8
Identities = 11/53 (20%), Positives = 24/53 (45%), Gaps = 2/53 (3%)
Query: 107 EEKKREK--KEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEMYRN 157
KKR K + EEE+ K++ + + +++K ++E + + N
Sbjct: 12 ALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRAN 64
>gnl|CDD|234032 TIGR02855, spore_yabG, sporulation peptidase YabG. Members of this
family are the protein YabG, demonstrated for Bacillus
subtilis to be an endopeptidase able to release
N-terminal peptides from a number of sporulation
proteins, including CotT, CotF, and SpoIVA. It appears
to be expressed under control of sigma-K [Cellular
processes, Sporulation and germination].
Length = 283
Score = 26.5 bits (59), Expect = 7.8
Identities = 11/51 (21%), Positives = 23/51 (45%)
Query: 104 DEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEEKEKVEM 154
DE E KKR+K+ +E + + + + +E+K + + + M
Sbjct: 53 DESERKKRKKRFKERMETSYRLFRQDYKLMKEKKEYRATGGYKNNQLYFGM 103
>gnl|CDD|226123 COG3595, COG3595, Uncharacterized conserved protein [Function
unknown].
Length = 318
Score = 26.3 bits (58), Expect = 7.9
Identities = 14/76 (18%), Positives = 24/76 (31%), Gaps = 5/76 (6%)
Query: 94 EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE-----EEEQKNKKKEEEEEEE 148
E E EKDE + K+ + +EE EE + E + E ++
Sbjct: 19 TFEAGELIEKDEKDVKEISEDKEEINVDVSGSEETTVPDVEKNRPELLELSANVESSVQK 78
Query: 149 KEKVEMYRNPVPMYRN 164
+ V +
Sbjct: 79 ENDVSLDGFEGDTLIV 94
>gnl|CDD|147685 pfam05663, DUF809, Protein of unknown function (DUF809). This
family consists of several proteins of unknown function
Raphanus sativus (Radish) and Brassica napus (Rape).
Length = 138
Score = 25.9 bits (56), Expect = 7.9
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
E +EE+KE + E E +EE+KE + E K EKKE E +K
Sbjct: 98 EGKEEKKEGKGEIEGKEEKKEGKGEIEGKEEKKEVENGPRK 138
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or more
acidic residues. The function of members of this family
is unknown.
Length = 104
Score = 25.7 bits (57), Expect = 7.9
Identities = 11/55 (20%), Positives = 32/55 (58%), Gaps = 3/55 (5%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEE---EVEEEEEE 135
E+ +K+E+EE+E++ D+D++ + + ++ D ++ +E++++E
Sbjct: 50 AEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim
have been identified in both mice and humans (mTim and
hTIM, respectively). Mammalian TIM is not the true
orthologue of Drosophila TIM, but is the likely
orthologue of a fly gene, timeout (also called tim-2).
mTim has been shown to be essential for embryonic
development, but does not have substantiated circadian
function. Some family members contain a SANT domain in
this region.
Length = 507
Score = 26.6 bits (58), Expect = 8.0
Identities = 19/87 (21%), Positives = 43/87 (49%)
Query: 59 RANARLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEE 118
R A+L+ + +AS + E E E+E K+ + E+ +E E E + R++K
Sbjct: 361 RIPAKLSSTQLRRRAASLSGEEEEPEDELKDDVDGEQADESEHETLALRKNARQRKAGLA 420
Query: 119 EKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+++ EE ++ ++K ++ ++
Sbjct: 421 SPEEEALGEEEQKAPPKKKQLNQKNKQ 447
>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
Length = 535
Score = 26.4 bits (59), Expect = 8.1
Identities = 18/75 (24%), Positives = 34/75 (45%), Gaps = 13/75 (17%)
Query: 87 EKEKEEEEEEEEEEKEK-------------DEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
K KEEE +++ + EK +++E E + K+ +++E E EEEE
Sbjct: 461 RKRKEEEAKKKAKTFEKYIPEIARKLAEILEKEEPDIEEVLAKLIGKKLEEEEVEEEEEE 520
Query: 134 EEQKNKKKEEEEEEE 148
E + + E++
Sbjct: 521 EAVVVESAKNYTEKD 535
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 26.5 bits (58), Expect = 8.2
Identities = 15/65 (23%), Positives = 32/65 (49%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEEEEE 148
+++E+E ++ + E++ K + + + K+K E E E KKK +E ++
Sbjct: 236 KEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKK 295
Query: 149 KEKVE 153
K+E
Sbjct: 296 ALKME 300
>gnl|CDD|224712 COG1799, COG1799, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 167
Score = 26.2 bits (58), Expect = 8.2
Identities = 11/45 (24%), Positives = 18/45 (40%)
Query: 91 EEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEE 135
++EE+ EE + E + E +E K +K E E
Sbjct: 16 LDDEEDYYEEDPRQEPRDPAVEPEEYAPPKYNFRKREPARMEMRG 60
>gnl|CDD|131942 TIGR02896, spore_III_AF, stage III sporulation protein AF. This
family represents the stage III sporulation protein AF
of the bacterial endospore formation program, which
exists in some but not all members of the Firmicutes
(formerly called low-GC Gram-positives). The C-terminal
region of this protein is poorly conserved, so only the
N-terminal region, which includes two predicted
transmembrane domains, is included in the seed alignment
[Cellular processes, Sporulation and germination].
Length = 106
Score = 25.3 bits (56), Expect = 8.3
Identities = 10/47 (21%), Positives = 22/47 (46%)
Query: 67 ILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREK 113
I ASE N+ EE++K+ + + E+ + E++ ++
Sbjct: 59 IDIYVFKASELSVKNQIEEQKKKIQAKTRAYILEEMAVQLEKQVEKE 105
>gnl|CDD|237259 PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed.
Length = 830
Score = 26.6 bits (60), Expect = 8.3
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 3/66 (4%)
Query: 70 QAISASETLENNEWEEEEKEKEEE---EEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKE 126
I E+ + E E+ + + E EE E+ DEE+ RE+ E E+ ++KE
Sbjct: 672 AYIPPGSYEEDWDLEGLEEALKTDFGLELPIEEWLEEGLDEEELRERILEAAEEAYEEKE 731
Query: 127 EEVEEE 132
EE+ EE
Sbjct: 732 EELGEE 737
>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking. Sec14-like
Golgi-trafficking domain The GOLD domain is always found
combined with lipid- or membrane-association domains.
Length = 136
Score = 25.8 bits (57), Expect = 8.3
Identities = 12/42 (28%), Positives = 22/42 (52%)
Query: 70 QAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKR 111
S + ++ +E +EE+E+E EEEE E + + + R
Sbjct: 41 PPTSDAVSVHVSESSDEEEEEEAEEEEAETGDVEAGSKSQSR 82
>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
Provisional.
Length = 719
Score = 26.7 bits (59), Expect = 8.6
Identities = 19/86 (22%), Positives = 40/86 (46%), Gaps = 7/86 (8%)
Query: 63 RLTVILPQAISASETLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQK 122
R+ V++ A ++ + +N + +E ++ DED + ++EE Q
Sbjct: 401 RIPVMVNMASASVSSKKNQKPQESSPSLDQTNAASRNSVMMDEDSD-------DDEEFQI 453
Query: 123 DKKEEEVEEEEEEQKNKKKEEEEEEE 148
+ E+E E + E K+ EEE +++
Sbjct: 454 PESEQEPETTKNETKDTAMEEEPQDK 479
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 26.1 bits (58), Expect = 8.6
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 95 EEEEEEKEKDEDEEKKREKKEEEEEKQK 122
EEE+E+K+ EEKK K+E+E+ ++
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 26.4 bits (59), Expect = 8.7
Identities = 5/27 (18%), Positives = 18/27 (66%)
Query: 108 EKKREKKEEEEEKQKDKKEEEVEEEEE 134
+++ ++ +E +K+++E+ +E+E
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQE 265
>gnl|CDD|217956 pfam04194, PDCD2_C, Programmed cell death protein 2, C-terminal
putative domain.
Length = 165
Score = 25.8 bits (57), Expect = 8.8
Identities = 21/66 (31%), Positives = 34/66 (51%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKK 141
E EE E EE EE+E D EE ++E +E+++ D+ E+ +++ QK KK+
Sbjct: 9 EEEEYETLHAPSLPEEPEEREIDAFEELEKEGGGGKEDEETDESEKAKSDDKTFQKFKKR 68
Query: 142 EEEEEE 147
E
Sbjct: 69 VARNPE 74
>gnl|CDD|223136 COG0058, GlgP, Glucan phosphorylase [Carbohydrate transport and
metabolism].
Length = 750
Score = 26.5 bits (59), Expect = 8.8
Identities = 12/51 (23%), Positives = 23/51 (45%)
Query: 82 EWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEE 132
EW + +E +++ ++ E KRE K+ E+ D+ EV+
Sbjct: 436 EWLNDLDILDELLWFADDKAFRELIAEIKRENKKRLAEEIADRTGIEVDPN 486
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex which
functions in messenger ribonucleoprotein metabolism and
plays a role in preventing the transcription-associated
genetic instability. Tho2, along with four other
subunits forms THO.
Length = 296
Score = 26.1 bits (58), Expect = 8.9
Identities = 17/59 (28%), Positives = 31/59 (52%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
+KE E E++ +E +KK+++K+ + K +EE + E +K KK+ EE
Sbjct: 30 DKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEE 88
>gnl|CDD|220634 pfam10220, DUF2146, Uncharacterized conserved protein (DUF2146).
This is a family of proteins conserved from plants to
humans. In Dictyostelium it is annotated as Mss11p but
this could not be confirmed. Mss11p is required for the
activation of pseudo-hyphal and invasive growth by
Ste12p in yeast.
Length = 890
Score = 26.4 bits (58), Expect = 9.1
Identities = 21/124 (16%), Positives = 40/124 (32%), Gaps = 8/124 (6%)
Query: 30 YNLRIRNVSLTDDAEFQCQVGPAQYHKPIRANARLTVILPQAISASETLENNEWEEEEKE 89
+ ++ N F+C V +Y PI +A + ++ ++
Sbjct: 510 FTIKQANYDFYQHIAFKCCVKLWRYQFPI-----FQPSSSDIRAADFENNSLSAAKKMEQ 564
Query: 90 KEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKD---KKEEEVEEEEEEQKNKKKEEEEE 146
E+E +EE ++E+ E E + + D E EE N +
Sbjct: 565 AEDELADEETDQEQPESLEPQLQGSSTSPSDASDLNFSTASSSEASSEESDNYARPTSRS 624
Query: 147 EEKE 150
E
Sbjct: 625 GTDE 628
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 26.6 bits (58), Expect = 9.1
Identities = 11/44 (25%), Positives = 22/44 (50%)
Query: 85 EEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEE 128
+E E ++ E+ ++ K D E+ + EEE+ K K+ +
Sbjct: 94 SQESEVQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGK 137
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 25.5 bits (55), Expect = 9.2
Identities = 14/39 (35%), Positives = 24/39 (61%)
Query: 77 TLENNEWEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKE 115
TL E EE+ +EEE++++E E+D+DE+ + E
Sbjct: 2 TLHFLEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
>gnl|CDD|187810 cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Multidomain protein with permuted HD nuclease domain,
palm domain and Zn-ribbon; MTH326-like has inactivated
polymerase catalytic domain; alr1562 and slr7011 -
predicted only on the basis of size, presence of HD
domain, and location with RAMPs in one operon; signature
gene for type III; also known as Crm2 family.
Length = 475
Score = 26.3 bits (58), Expect = 9.4
Identities = 21/78 (26%), Positives = 39/78 (50%), Gaps = 3/78 (3%)
Query: 84 EEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEE 143
EEE +E +EE EEE E ++ +E + E + D E + E+ +++++ E
Sbjct: 90 EEEAEELKEEIEEETWESILNKVKEYLDIENVEPFLLKSDALERISDRIEDGNEDREEAE 149
Query: 144 EEEEEKEKVEMYRNPVPM 161
E E ++ V R+ P+
Sbjct: 150 ELEAARKNV---RDFAPV 164
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 25.6 bits (57), Expect = 9.5
Identities = 18/69 (26%), Positives = 44/69 (63%)
Query: 86 EEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKEEEE 145
E++ K+ + E E+ EKE + +EK ++ E ++KKE+E++++ +E + K+++ ++
Sbjct: 24 EKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQ 83
Query: 146 EEEKEKVEM 154
+ +K + E
Sbjct: 84 DLQKRQQEE 92
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family. This model
represents a subfamily of RNA splicing factors including
the Pad-1 protein (N. crassa), CAPER (M. musculus) and
CC1.3 (H.sapiens). These proteins are characterized by
an N-terminal arginine-rich, low complexity domain
followed by three (or in the case of 4 H. sapiens
paralogs, two) RNA recognition domains (rrm: pfam00706).
These splicing factors are closely related to the U2AF
splicing factor family (TIGR01642). A homologous gene
from Plasmodium falciparum was identified in the course
of the analysis of that genome at TIGR and was included
in the seed.
Length = 457
Score = 26.4 bits (58), Expect = 9.5
Identities = 8/68 (11%), Positives = 22/68 (32%), Gaps = 3/68 (4%)
Query: 87 EKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQ---KDKKEEEVEEEEEEQKNKKKEE 143
E+ + + + +D D + R + + + + ++ KE
Sbjct: 22 ERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYYRPRGDRSYRRDDRRSGRNTKEP 81
Query: 144 EEEEEKEK 151
E E++
Sbjct: 82 LTEAERDD 89
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 26.4 bits (59), Expect = 9.6
Identities = 14/36 (38%), Positives = 24/36 (66%)
Query: 107 EEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKKKE 142
E++ ++ +EE +K K+++EEE EEEE + KE
Sbjct: 513 EDEWQKIREEFLQKHKNEEEEEEEEEELPLIPEAKE 548
>gnl|CDD|143202 cd05725, Ig3_Robo, Third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Ig3_Robo: domain similar to
the third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Robo receptors play a role in
the development of the central nervous system (CNS),
and are receptors of Slit protein. Slit is a repellant
secreted by the neural cells in the midline. Slit acts
through Robo to prevent most neurons from crossing the
midline from either side. Three mammalian Robo homologs
(robo1, -2, and -3), and three mammalian Slit homologs
(Slit-1,-2, -3), have been identified. Commissural
axons, which cross the midline, express low levels of
Robo; longitudinal axons, which avoid the midline,
express high levels of Robo. robo1, -2, and -3 are
expressed by commissural neurons in the vertebrate
spinal cord and Slits 1, -2, -3 are expressed at the
ventral midline. Robo-3 is a divergent member of the
Robo family which instead of being a positive regulator
of slit responsiveness, antagonizes slit responsiveness
in precrossing axons. The Slit-Robo interaction is
mediated by the second leucine-rich repeat (LRR) domain
of Slit and the two N-terminal Ig domains of Robo, Ig1
and Ig2. The primary Robo binding site for Slit2 has
been shown by surface plasmon resonance experiments and
mutational analysis to be is the Ig1 domain, while the
Ig2 domain has been proposed to harbor a weak secondary
binding site.
Length = 69
Score = 24.7 bits (54), Expect = 9.7
Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 10/39 (25%)
Query: 32 LRIRNVSLTDDAEFQCQ----VGPAQYHKPIRANARLTV 66
L+IRNV+ D+ + C+ VG I A+A LTV
Sbjct: 37 LKIRNVTAGDEGSYTCEAENMVG------KIEASASLTV 69
>gnl|CDD|224510 COG1594, RPB9, DNA-directed RNA polymerase, subunit M/Transcription
elongation factor TFIIS [Transcription].
Length = 113
Score = 25.5 bits (56), Expect = 9.8
Identities = 13/52 (25%), Positives = 23/52 (44%)
Query: 89 EKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEEEEQKNKK 140
E+E ++ K+ E+KK E+E Q K +E+ + NK+
Sbjct: 31 EEEASNKKVYRYSVKEAVEKKKEVVLVVEDETQGAKTLPTAKEKCPKCGNKE 82
>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
family 4 of glycosyl hydrolases [Carbohydrate transport
and metabolism].
Length = 442
Score = 26.1 bits (58), Expect = 9.9
Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 5/64 (7%)
Query: 76 ETLENNE--WEEEEKEKEEEEEEEEEEKEKDEDEEKKREKKEEEEEKQKDKKEEEVEEEE 133
L ++ + + +E++ +E E+ +E KREK+ E K K E + + EE
Sbjct: 251 PVLPSSYHRYYYYPYDMKEKKPDEIEKFGTRANEVMKREKELFELYK---KPELKEKPEE 307
Query: 134 EEQK 137
E++
Sbjct: 308 LEKR 311
>gnl|CDD|224553 COG1638, DctP, TRAP-type C4-dicarboxylate transport system,
periplasmic component [Carbohydrate transport and
metabolism].
Length = 332
Score = 26.1 bits (58), Expect = 10.0
Identities = 19/76 (25%), Positives = 31/76 (40%), Gaps = 3/76 (3%)
Query: 67 ILPQAISASETLENNEWEEEEKE-KEEEEEEEEEEKEKDEDEEKK--REKKEEEEEKQKD 123
IL +A + + EE E E E+ +E E E D E + + +E K+
Sbjct: 257 ILLEAAKEAAEEQRKLVEELEDELLEKLKEAGVEVVEPDAAEAFREAAKPVYDEFAKKAG 316
Query: 124 KKEEEVEEEEEEQKNK 139
+ E+ E E + K
Sbjct: 317 EIGAELIEAIEAARKK 332
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.299 0.123 0.324
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,921,396
Number of extensions: 903211
Number of successful extensions: 34604
Number of sequences better than 10.0: 1
Number of HSP's gapped: 16476
Number of HSP's successfully gapped: 5512
Length of query: 167
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 77
Effective length of database: 6,945,742
Effective search space: 534822134
Effective search space used: 534822134
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 44 (22.0 bits)
S2: 55 (24.8 bits)