RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17567
(362 letters)
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 54.7 bits (132), Expect = 3e-09
Identities = 34/93 (36%), Positives = 55/93 (59%), Gaps = 4/93 (4%)
Query: 10 EKKKRIEKRNKEE-EKEENEQEEKEKEDVEEEKNKKKKKK--KKKKKKKKKKKKKKKEEQ 66
E+ + ++++ K+E E EE +Q+ +EK+ +EEK KK+ K KKK+KKKKKKK KK ++
Sbjct: 50 ERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKK 109
Query: 67 SVKEIEESLKERKEKTLTEDSQCDTMIPVFKKG 99
KE +S +E ++ E + PV
Sbjct: 110 EEKEGSKSSEESSDEEE-EGEEDKQEEPVEIME 141
Score = 35.8 bits (83), Expect = 0.010
Identities = 22/80 (27%), Positives = 45/80 (56%), Gaps = 12/80 (15%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK----------KKKKKKKKKKKKKKEEQS 67
R +E E+ E E+ +KE E++ ++K+++ KK+ K++KKK+KKK+++
Sbjct: 45 RRREYERLELMDEKWKKE--TEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKK 102
Query: 68 VKEIEESLKERKEKTLTEDS 87
K+ + ++ K+ E S
Sbjct: 103 AKKGNKKEEKEGSKSSEESS 122
Score = 33.5 bits (77), Expect = 0.070
Identities = 23/74 (31%), Positives = 44/74 (59%), Gaps = 3/74 (4%)
Query: 11 KKKRIEKRNKEEEKEEN--EQEEKEKEDVEE-EKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
++KR EK+ K+EEK + +K+K+ ++ +K KK KK++K+ K ++ +EE+
Sbjct: 69 QQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEG 128
Query: 68 VKEIEESLKERKEK 81
++ +E E EK
Sbjct: 129 EEDKQEEPVEIMEK 142
Score = 27.7 bits (62), Expect = 5.1
Identities = 13/60 (21%), Positives = 37/60 (61%), Gaps = 4/60 (6%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R +R +KKK+ +K+ K+ +K ++E++ + EE +++++ ++ K+++ + +K
Sbjct: 87 RAKR----QKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 57.6 bits (139), Expect = 4e-09
Identities = 22/79 (27%), Positives = 46/79 (58%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K G E+K++ + + ++++K+E +EE + +EE +K+ K+K+K+K+KK ++ +
Sbjct: 99 KNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPR 158
Query: 63 KEEQSVKEIEESLKERKEK 81
E+ K K R +K
Sbjct: 159 DREEEKKRERVRAKSRPKK 177
Score = 52.6 bits (126), Expect = 1e-07
Identities = 23/81 (28%), Positives = 47/81 (58%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
K + K + + +EE++E EQ ++EK+ +E+ ++ K +K K++ K+K+
Sbjct: 84 GSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143
Query: 61 KKKEEQSVKEIEESLKERKEK 81
K+KE++ K++EE +EK
Sbjct: 144 KEKEKEKEKKVEEPRDREEEK 164
Score = 52.2 bits (125), Expect = 2e-07
Identities = 26/82 (31%), Positives = 47/82 (57%), Gaps = 1/82 (1%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
E K EK ++ ++E+ +++EK KE+ ++ K K++ K+K+ K+K+K+K+KK EE
Sbjct: 98 PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEP 157
Query: 67 SVKEIEESLKERKEKTLTEDSQ 88
+E EE +ER
Sbjct: 158 RDRE-EEKKRERVRAKSRPKKP 178
Score = 50.3 bits (120), Expect = 1e-06
Identities = 20/87 (22%), Positives = 49/87 (56%), Gaps = 3/87 (3%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK-- 60
+ +++ + EK+ K+E+ +E ++ K KE+ +E++ K+K+K+K+KK ++ + +
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREE 162
Query: 61 -KKKEEQSVKEIEESLKERKEKTLTED 86
KK+E K + ++K ++
Sbjct: 163 EKKRERVRAKSRPKKPPKKKPPNKKKE 189
Score = 49.5 bits (118), Expect = 2e-06
Identities = 21/73 (28%), Positives = 40/73 (54%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
+++ K K + + KE E +++K+K++ K++KKKKK+K KEE ++ +E
Sbjct: 77 VKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEE 136
Query: 75 LKERKEKTLTEDS 87
KE++ E
Sbjct: 137 AKEKRPPKEKEKE 149
Score = 45.6 bits (108), Expect = 2e-05
Identities = 22/72 (30%), Positives = 43/72 (59%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K++ EKR +E+++E E++ +E D EEEK +++ + K + KK KKK K+++ +E
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEE 193
Query: 71 IEESLKERKEKT 82
++ R+
Sbjct: 194 EKQRQAAREAVK 205
Score = 44.9 bits (106), Expect = 4e-05
Identities = 16/73 (21%), Positives = 39/73 (53%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
++K++ + ++ E+E+ + + K ++ KK KKK+ +++K+++ +E
Sbjct: 147 EKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKG 206
Query: 69 KEIEESLKERKEK 81
K E + E +EK
Sbjct: 207 KPEEPDVNEEREK 219
Score = 44.1 bits (104), Expect = 7e-05
Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 2/76 (2%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
RK + A EK+ EK ++E+K E + +E+ + E+ + K + KK KKK KKK
Sbjct: 131 RKPKEEAKEKRPPKEKEKEKEKKVEEPR--DREEEKKRERVRAKSRPKKPPKKKPPNKKK 188
Query: 62 KKEEQSVKEIEESLKE 77
+ E+ +
Sbjct: 189 EPPEEEKQRQAAREAV 204
Score = 44.1 bits (104), Expect = 8e-05
Identities = 24/75 (32%), Positives = 45/75 (60%), Gaps = 3/75 (4%)
Query: 13 KRIEKRNKEEE--KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
KR+EK + K + +E K + EEEK +K++ K++KKKKK+K K++ K+ + +E
Sbjct: 78 KRVEKGGSKGPAAKTKPAKEPKNESGKEEEK-EKEQVKEEKKKKKEKPKEEPKDRKPKEE 136
Query: 71 IEESLKERKEKTLTE 85
+E ++++ E
Sbjct: 137 AKEKRPPKEKEKEKE 151
Score = 34.1 bits (78), Expect = 0.12
Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 8/90 (8%)
Query: 3 KRRGGAGEK---KKRIEKRNKEEEKEENEQ--EEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
K+ ++ KKR R K K+ ++ K+KE EEE K+++ ++ K K
Sbjct: 152 KKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEE---KQRQAAREAVKGKP 208
Query: 58 KKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
++ EE+ +E + +E + ED
Sbjct: 209 EEPDVNEEREKEEDDGKDRETTTSPMEEDE 238
Score = 32.9 bits (75), Expect = 0.23
Identities = 17/86 (19%), Positives = 40/86 (46%), Gaps = 2/86 (2%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R R +K + + NK++E E+E++ + E K K ++ ++++K++ K
Sbjct: 168 RVRAKSRPKKPPKKKPPNKKKEP--PEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGK 225
Query: 62 KKEEQSVKEIEESLKERKEKTLTEDS 87
+E + E+ ++ E + S
Sbjct: 226 DRETTTSPMEEDESRQSSEISRRSSS 251
Score = 30.6 bits (69), Expect = 1.3
Identities = 15/65 (23%), Positives = 35/65 (53%), Gaps = 2/65 (3%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
+ E + +E ++ ++ + K K K+ K + K+++K+KE+ VKE ++ KE
Sbjct: 65 KCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQ--VKEEKKKKKE 122
Query: 78 RKEKT 82
+ ++
Sbjct: 123 KPKEE 127
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 55.1 bits (133), Expect = 5e-09
Identities = 28/74 (37%), Positives = 42/74 (56%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
G + ++ EKE + EKE E EEEK +KKKKK+ KK+KK+KK KK+K
Sbjct: 119 GAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMV 178
Query: 66 QSVKEIEESLKERK 79
+ ++ K++K
Sbjct: 179 EPKGSKKKKKKKKK 192
Score = 53.6 bits (129), Expect = 2e-08
Identities = 28/77 (36%), Positives = 43/77 (55%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
G E ++ E E+ + EKE EE+ KK+KKKKK+ KK+KK+KK KKE+
Sbjct: 117 GYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEK 176
Query: 66 QSVKEIEESLKERKEKT 82
+ + K++K+K
Sbjct: 177 MVEPKGSKKKKKKKKKK 193
Score = 52.8 bits (127), Expect = 3e-08
Identities = 30/81 (37%), Positives = 46/81 (56%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
MR G G + E E E E K +++ E E+ +KK+KKKKK+ KK+KK+K
Sbjct: 111 MRFFPTGYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170
Query: 61 KKKEEQSVKEIEESLKERKEK 81
K K+E+ V+ K++K+K
Sbjct: 171 KDKKEKMVEPKGSKKKKKKKK 191
Score = 45.9 bits (109), Expect = 7e-06
Identities = 25/61 (40%), Positives = 40/61 (65%), Gaps = 6/61 (9%)
Query: 5 RGGAGEKKKRIEKRNK--EEEKEENEQEEKEKEDVEEEKNKKKK----KKKKKKKKKKKK 58
E ++EK + EEEK+E +++++ K++ +E+K+KK+K K KKKKKKKKK
Sbjct: 133 ETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Query: 59 K 59
K
Sbjct: 193 K 193
Score = 44.3 bits (105), Expect = 2e-05
Identities = 21/52 (40%), Positives = 36/52 (69%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
+ + E++++ EK+ K+E K+E ++++ +KE + E K KKKKKKKKKK
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 37.8 bits (88), Expect = 0.003
Identities = 13/62 (20%), Positives = 34/62 (54%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQC 89
+ + E + + K +K+ + ++++KK+KK+++ VK+ ++ K++KEK +
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSK 184
Query: 90 DT 91
Sbjct: 185 KK 186
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 53.9 bits (130), Expect = 1e-08
Identities = 22/73 (30%), Positives = 42/73 (57%), Gaps = 2/73 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + IEK KE E+++ + + +K+ +++K+K K KK KK K +KK +K+ E ++
Sbjct: 66 ELAEEIEKVKKEYEEKQ--KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123
Query: 70 EIEESLKERKEKT 82
++ +S E
Sbjct: 124 DLTKSYSETLSTL 136
Score = 47.0 bits (112), Expect = 2e-06
Identities = 24/64 (37%), Positives = 40/64 (62%), Gaps = 3/64 (4%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK---KKKKKEEQSVKEIEESLKE 77
E K++ ++ +E E V++E +K+K K KKKK KKKK K KK +++ K ++ KE
Sbjct: 58 TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKE 117
Query: 78 RKEK 81
++K
Sbjct: 118 AEDK 121
Score = 42.0 bits (99), Expect = 1e-04
Identities = 20/57 (35%), Positives = 37/57 (64%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
E + +K+K+++ EE K KK+ ++K+K K KKKK KK++ K+ ++ K+ K +
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSE 111
Score = 41.2 bits (97), Expect = 2e-04
Identities = 19/69 (27%), Positives = 37/69 (53%), Gaps = 1/69 (1%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
KK E++ K + K++ +++K+K+ D +++K K +KK +K+ + K + K
Sbjct: 75 KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLS 134
Query: 71 IEESLKERK 79
LK RK
Sbjct: 135 TLSELKPRK 143
Score = 33.1 bits (76), Expect = 0.13
Identities = 16/55 (29%), Positives = 34/55 (61%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
E E +K+ E + +K KK+ ++K+K K KKKK +++ K+ ++ ++ +K+
Sbjct: 56 EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKS 110
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 52.9 bits (127), Expect = 5e-08
Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 1/73 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
++ + E EE+ + + E E + +K K +KK KK K K K K K + VK
Sbjct: 53 TEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVK 112
Query: 70 EIEESLKERKEKT 82
+ K+ KT
Sbjct: 113 PQPKP-KKPPSKT 124
Score = 49.4 bits (118), Expect = 7e-07
Identities = 20/71 (28%), Positives = 29/71 (40%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
E + + E EE + E E E K K+K K +KK KK K K K + K +
Sbjct: 55 EPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQ 114
Query: 76 KERKEKTLTED 86
+ K+
Sbjct: 115 PKPKKPPSKTA 125
Score = 39.4 bits (92), Expect = 0.001
Identities = 20/81 (24%), Positives = 27/81 (33%), Gaps = 5/81 (6%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK-----KKKKKKKKKKKKKKE 64
E + K E E K KE + EK KK K K K K K K + K KK
Sbjct: 61 EPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKP 120
Query: 65 EQSVKEIEESLKERKEKTLTE 85
+ + + +
Sbjct: 121 PSKTAAKAPAAPNQPARPPSA 141
Score = 32.8 bits (75), Expect = 0.22
Identities = 11/60 (18%), Positives = 23/60 (38%), Gaps = 5/60 (8%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKK-----KKKKKKEEQSVKEIEESLKERKEKT 82
E +E + E ++ K + + + K+K K E++ K + + K K
Sbjct: 50 EAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKP 109
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 53.8 bits (130), Expect = 6e-08
Identities = 26/77 (33%), Positives = 52/77 (67%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
KK +K K EK E ++EE++KE ++ KKK+++++++K+KK+++K++EE+ +E
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Query: 71 IEESLKERKEKTLTEDS 87
+E +E+K+K T
Sbjct: 465 EKEEEEEKKKKQATLFD 481
Score = 53.4 bits (129), Expect = 8e-08
Identities = 25/79 (31%), Positives = 48/79 (60%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ + + K +K + E+ EK+ EE+K KKKK KKK+++++++K+K+E+ +
Sbjct: 397 EEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456
Query: 70 EIEESLKERKEKTLTEDSQ 88
E EE +E KE+ + +
Sbjct: 457 EEEEEAEEEKEEEEEKKKK 475
Score = 49.1 bits (118), Expect = 2e-06
Identities = 19/79 (24%), Positives = 56/79 (70%), Gaps = 4/79 (5%)
Query: 5 RGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
A +K K+I ++ +++ +EE ++++K+ ++++ +++++K+KK+++K++++++ +E
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Query: 65 EQSVKEIEESLKERKEKTL 83
E + EE K++K+ TL
Sbjct: 465 E----KEEEEEKKKKQATL 479
Score = 47.2 bits (113), Expect = 8e-06
Identities = 24/55 (43%), Positives = 40/55 (72%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
+KR EKKK+ K+EE+EE E+E+KE+E EEE+ +++K+++++KKKK
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 46.1 bits (110), Expect = 2e-05
Identities = 20/67 (29%), Positives = 41/67 (61%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
EEE E +K + +++ K +KK++++KK+KKKK K+++ +E E+ KE ++
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Query: 81 KTLTEDS 87
+ E++
Sbjct: 456 EEEEEEA 462
Score = 44.1 bits (105), Expect = 7e-05
Identities = 15/57 (26%), Positives = 43/57 (75%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
+++ +K+K+ + ++++EE E+E+++KE+ +EE+ ++ +++K+++++KKKK
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 42.6 bits (101), Expect = 2e-04
Identities = 16/69 (23%), Positives = 33/69 (47%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQC 89
E+E E + K KK KK +K +KK++++KKE++ + +E +E+ + +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 90 DTMIPVFKK 98
+
Sbjct: 455 KEEEEEEAE 463
Score = 42.2 bits (100), Expect = 3e-04
Identities = 22/54 (40%), Positives = 38/54 (70%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
R+ +KK K+ +EEE+EE E++E+EKE+ EEE ++K+++++KKKK
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 40.3 bits (95), Expect = 0.001
Identities = 20/68 (29%), Positives = 38/68 (55%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
EE+ E K+ ++ +K +KK++++KK+KKKK +++ +E EE K+ +E
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 81 KTLTEDSQ 88
K E+
Sbjct: 455 KEEEEEEA 462
Score = 38.0 bits (89), Expect = 0.007
Identities = 16/68 (23%), Positives = 36/68 (52%)
Query: 33 EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTM 92
+E++E KK KK KK +K +KK+++E++ K+ + K+++E+ E + +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 93 IPVFKKGV 100
++
Sbjct: 455 KEEEEEEA 462
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 51.6 bits (124), Expect = 1e-07
Identities = 25/64 (39%), Positives = 38/64 (59%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+E E +++EE EKE EE+ KKKK+ K K K+ KKKKKK+ + K + + K+K
Sbjct: 64 DEPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKK 123
Query: 82 TLTE 85
+
Sbjct: 124 SERI 127
Score = 45.4 bits (108), Expect = 1e-05
Identities = 23/68 (33%), Positives = 34/68 (50%), Gaps = 1/68 (1%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK-KEEQSVKEIEESLKERK 79
E E ++ E+ EKE + E K KK+ K K K+ KKKKKK +S K K++
Sbjct: 65 EPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKS 124
Query: 80 EKTLTEDS 87
E+ +
Sbjct: 125 ERISWAPT 132
Score = 30.8 bits (70), Expect = 0.98
Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 16/75 (21%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE----------------QSVK 69
+N++ E+ E + KK + K +K+K+KKK+K+ +EE + +
Sbjct: 148 QNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELTQEERLAEAKETERINLKSLERYE 207
Query: 70 EIEESLKERKEKTLT 84
E EE K+ K + L
Sbjct: 208 EQEEEKKKAKIQALK 222
Score = 30.0 bits (68), Expect = 1.8
Identities = 17/88 (19%), Positives = 38/88 (43%), Gaps = 11/88 (12%)
Query: 5 RGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+ E++ + +KR K + +E +++K+K+ + K + KKK ++ +
Sbjct: 76 KELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLD 135
Query: 65 E-----------QSVKEIEESLKERKEK 81
Q+ + E LKER+ +
Sbjct: 136 SPRRKSSRSSTVQNKEATHERLKEREIR 163
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 50.4 bits (121), Expect = 4e-07
Identities = 19/69 (27%), Positives = 41/69 (59%), Gaps = 4/69 (5%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE----QSVKEIEESLKER 78
++EE+E++E+EK V + KK K K ++K+K K++K+++ + +E ++
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKL 97
Query: 79 KEKTLTEDS 87
+ + L E+S
Sbjct: 98 RLRKLQEES 106
Score = 49.7 bits (119), Expect = 5e-07
Identities = 21/65 (32%), Positives = 37/65 (56%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
++ K+ ++EE E+++ E+ K K K KK K K ++K+K K E+ K + E ++ E
Sbjct: 31 DDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPE 90
Query: 81 KTLTE 85
L E
Sbjct: 91 DELAE 95
Score = 48.9 bits (117), Expect = 1e-06
Identities = 19/65 (29%), Positives = 39/65 (60%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
E+++++ ++ ++E+ EE++ +K K K K KK K K +++E++ +E EE E+
Sbjct: 27 EDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86
Query: 82 TLTED 86
ED
Sbjct: 87 DTPED 91
Score = 48.5 bits (116), Expect = 1e-06
Identities = 21/78 (26%), Positives = 41/78 (52%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K + K ++E E+++ ++ E+ +EEK ++K K K K KK K K +E++ K
Sbjct: 15 PAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74
Query: 71 IEESLKERKEKTLTEDSQ 88
+E R+ + T + +
Sbjct: 75 EKEEKGLRELEEDTPEDE 92
Score = 34.6 bits (80), Expect = 0.048
Identities = 10/34 (29%), Positives = 13/34 (38%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
EK K + + KKKK K K K +
Sbjct: 193 EKLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDD 226
Score = 34.2 bits (79), Expect = 0.062
Identities = 10/32 (31%), Positives = 11/32 (34%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
EK E+ KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223
Score = 33.9 bits (78), Expect = 0.10
Identities = 12/39 (30%), Positives = 16/39 (41%), Gaps = 4/39 (10%)
Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
NE+ + EK K KKKK K K K ++
Sbjct: 192 NEKLKAEKAA----KGGKKKKGKAKAKLNVGGANDDDDD 226
Score = 32.3 bits (74), Expect = 0.28
Identities = 10/41 (24%), Positives = 24/41 (58%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK 50
K +IE++ K + ++E + + +ED E++ +K + +K
Sbjct: 61 ALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101
Score = 31.2 bits (71), Expect = 0.64
Identities = 13/44 (29%), Positives = 28/44 (63%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
KK ++ + +E+EK + E+EEK ++EE+ + + +K + +K
Sbjct: 58 AKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101
Score = 29.2 bits (66), Expect = 2.8
Identities = 13/58 (22%), Positives = 33/58 (56%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
+ K + + K+ K + E++EK K + EE+ ++ ++ + + +K + +K +E+S
Sbjct: 49 KAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRKLQEES 106
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 51.5 bits (123), Expect = 4e-07
Identities = 24/65 (36%), Positives = 33/65 (50%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K E K E +++ + EE E+E EEE KK KK KK K KK K + S
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGD 360
Query: 70 EIEES 74
+ ++S
Sbjct: 361 DSDDS 365
Score = 46.5 bits (110), Expect = 1e-05
Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 5/92 (5%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK----KKK 58
++ GG K +I+ ++E + +E ++ ED +EEK+KKKKKK K KKK KK
Sbjct: 203 EKGGGGRGKDLKIKDLEGDDEDDGDESDKGG-EDGDEEKSKKKKKKLAKNKKKLDDDKKG 261
Query: 59 KKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
K+ ++ + ++ E +E+ DS
Sbjct: 262 KRGGDDDADEYDSDDGDDEGREEDYISDSSAS 293
Score = 41.1 bits (96), Expect = 7e-04
Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 10 EKKKRIEKRNKEE-EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+ ++R +K + E K E EQ+E +E EEEKN+++ KK KK KK K KK
Sbjct: 296 DPEEREDKLSPEIPAKPEIEQDEDSEES-EEEKNEEEGGLSKKGKKLKKLKGKKNG 350
Score = 34.9 bits (80), Expect = 0.068
Identities = 11/50 (22%), Positives = 27/50 (54%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
+ KK K+N ++ + + ++ + D++ E + KK+K+ KK++
Sbjct: 340 KLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKEPKKEEP 389
Score = 32.2 bits (73), Expect = 0.45
Identities = 14/102 (13%), Positives = 42/102 (41%), Gaps = 12/102 (11%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEE----------NEQEEKEK--EDVEEEKNKKKKKKK 49
+ +RGG + + +E +EE N+ EE+E K + ++ +
Sbjct: 260 KGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDED 319
Query: 50 KKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
++ +++K +++ + K++++ ++ + D
Sbjct: 320 SEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDD 361
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 51.3 bits (122), Expect = 6e-07
Identities = 17/81 (20%), Positives = 44/81 (54%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E KK+ E+ K+ + + + EE +K + + + + ++K + +KK+E++
Sbjct: 1317 ADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA 1376
Query: 68 VKEIEESLKERKEKTLTEDSQ 88
K+ + + K+ +EK ++++
Sbjct: 1377 KKKADAAKKKAEEKKKADEAK 1397
Score = 48.6 bits (115), Expect = 4e-06
Identities = 26/90 (28%), Positives = 47/90 (52%), Gaps = 3/90 (3%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK---KKKK 58
A E+K ++ KEE K++ + +K+ E+ ++ KKK ++ KKK KK
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414
Query: 59 KKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
KKK +++ K+ EE K + K E+++
Sbjct: 1415 AAKKKADEAKKKAEEKKKADEAKKKAEEAK 1444
Score = 48.2 bits (114), Expect = 5e-06
Identities = 29/83 (34%), Positives = 49/83 (59%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E KK+ E+ K EE E+ E +KE E +K ++ KKK+ ++KKK ++ KK +EE
Sbjct: 1670 AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729
Query: 68 VKEIEESLKERKEKTLTEDSQCD 90
+K E + ++K E+++ D
Sbjct: 1730 IKAEEAKKEAEEDKKKAEEAKKD 1752
Score = 47.8 bits (113), Expect = 7e-06
Identities = 26/84 (30%), Positives = 49/84 (58%), Gaps = 3/84 (3%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE---EEKNKKKKKKKKKKKKKKKKKKKKKE 64
A E KK+ E+ K +E ++ +E K+ E+ + EE K + KKK ++ KK + KKK
Sbjct: 1433 ADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKA 1492
Query: 65 EQSVKEIEESLKERKEKTLTEDSQ 88
E++ K+ +E+ K + K ++++
Sbjct: 1493 EEAKKKADEAKKAAEAKKKADEAK 1516
Score = 47.8 bits (113), Expect = 7e-06
Identities = 24/81 (29%), Positives = 40/81 (49%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E K + +E + E+ E ++ EE K K KKK ++KKK + KKK E+
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEED 1403
Query: 68 VKEIEESLKERKEKTLTEDSQ 88
K+ +E K K ++++
Sbjct: 1404 KKKADELKKAAAAKKKADEAK 1424
Score = 47.8 bits (113), Expect = 7e-06
Identities = 19/81 (23%), Positives = 39/81 (48%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E KK+ E+ K +E ++ +E K+K D ++K ++ KK + K + + + E
Sbjct: 1304 ADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAE 1363
Query: 68 VKEIEESLKERKEKTLTEDSQ 88
K K+ + K + ++
Sbjct: 1364 EKAEAAEKKKEEAKKKADAAK 1384
Score = 47.8 bits (113), Expect = 8e-06
Identities = 18/79 (22%), Positives = 34/79 (43%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E KK E + K EE ++ K+K + ++ + K + + + + ++K E
Sbjct: 1311 AEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAE 1370
Query: 68 VKEIEESLKERKEKTLTED 86
K+ E K K E+
Sbjct: 1371 KKKEEAKKKADAAKKKAEE 1389
Score = 47.4 bits (112), Expect = 1e-05
Identities = 27/79 (34%), Positives = 46/79 (58%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E+KK++E+ K+E +E+ + EE +K + E + ++ KK ++ KKK ++ KK EE
Sbjct: 1628 AEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE 1687
Query: 68 VKEIEESLKERKEKTLTED 86
K E KE +E E+
Sbjct: 1688 KKAAEALKKEAEEAKKAEE 1706
Score = 47.1 bits (111), Expect = 1e-05
Identities = 28/78 (35%), Positives = 44/78 (56%), Gaps = 2/78 (2%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
++ A EKKK E + EEE + EE +K EE+K K ++ KK ++ +KK + KK
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKA--EEDKKKAEEAKKAEEDEKKAAEALKK 1696
Query: 64 EEQSVKEIEESLKERKEK 81
E + K+ EE K+ E+
Sbjct: 1697 EAEEAKKAEELKKKEAEE 1714
Score = 47.1 bits (111), Expect = 1e-05
Identities = 24/81 (29%), Positives = 48/81 (59%), Gaps = 2/81 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
+KKK E + K++ ++ +K+ E+ + +E KK ++ KK + KKK ++ KK E++
Sbjct: 1403 DKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEA 1462
Query: 68 VKEIEESLKERKEKTLTEDSQ 88
K+ EE+ K + K E+++
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAK 1483
Score = 46.7 bits (110), Expect = 2e-05
Identities = 24/72 (33%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K E + EE+K++ E+ +K +ED EK + KK+ ++ KK ++ KKKE + K+
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEED---EKKAAEALKKEAEEAKKAEELKKKEAEEKKKA 1718
Query: 72 EESLKERKEKTL 83
EE K +E +
Sbjct: 1719 EELKKAEEENKI 1730
Score = 46.7 bits (110), Expect = 2e-05
Identities = 25/77 (32%), Positives = 45/77 (58%), Gaps = 6/77 (7%)
Query: 10 EKKKRIEKRNKEEEK------EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E+KK+ E+ K EE+ EE ++ E++K+ EE K ++ +KK + KK+ ++ KK
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703
Query: 64 EEQSVKEIEESLKERKE 80
E+ K+ E K+ +E
Sbjct: 1704 AEELKKKEAEEKKKAEE 1720
Score = 46.3 bits (109), Expect = 2e-05
Identities = 21/81 (25%), Positives = 40/81 (49%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E KK+ + K+ E+ + E + E + ++K + +KKK++ KKK + +
Sbjct: 1324 AEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAA 1383
Query: 68 VKEIEESLKERKEKTLTEDSQ 88
K+ EE K + K E+ +
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDK 1404
Score = 46.3 bits (109), Expect = 2e-05
Identities = 27/79 (34%), Positives = 42/79 (53%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K++ A +KK K+ E K E E E E EE+ +KKK++ KKK KKK
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKA 1387
Query: 63 KEEQSVKEIEESLKERKEK 81
+E++ E ++ +E K+K
Sbjct: 1388 EEKKKADEAKKKAEEDKKK 1406
Score = 46.3 bits (109), Expect = 2e-05
Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 4/78 (5%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDV----EEEKNKKKKKKKKKKKKKKKKKKKKK 63
A E KK E + K EE ++ E++EK+ + EE K ++ KKK+ ++KKK ++ KK
Sbjct: 1664 AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723
Query: 64 EEQSVKEIEESLKERKEK 81
E+ K E K+ E+
Sbjct: 1724 AEEENKIKAEEAKKEAEE 1741
Score = 46.3 bits (109), Expect = 2e-05
Identities = 25/74 (33%), Positives = 48/74 (64%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E +K+ + K+E +E + EE +K++ EE+K ++ KK +++ K K ++ KK+ E+
Sbjct: 1683 AEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEED 1742
Query: 68 VKEIEESLKERKEK 81
K+ EE+ K+ +EK
Sbjct: 1743 KKKAEEAKKDEEEK 1756
Score = 45.9 bits (108), Expect = 3e-05
Identities = 27/83 (32%), Positives = 52/83 (62%), Gaps = 2/83 (2%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKKKKKKKKKKKKKKKKKEE 65
A KKK E + K EEK++ ++ +K+ E+ + +E KK ++ KK ++ KKK ++ KK +
Sbjct: 1414 AAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD 1473
Query: 66 QSVKEIEESLKERKEKTLTEDSQ 88
++ K+ EE+ K + K E+++
Sbjct: 1474 EAKKKAEEAKKADEAKKKAEEAK 1496
Score = 45.9 bits (108), Expect = 3e-05
Identities = 25/91 (27%), Positives = 51/91 (56%), Gaps = 2/91 (2%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEK--EKEDVEEEKNKKKKKKKKKKKKKKKKK 59
R++ E+ ++ ++ K EEK++ ++ +K EK+ +E K K ++ KK + KKK ++
Sbjct: 1267 RRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1326
Query: 60 KKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
KKK + + K+ EE+ K + ++ D
Sbjct: 1327 AKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357
Score = 45.9 bits (108), Expect = 3e-05
Identities = 19/78 (24%), Positives = 39/78 (50%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
KKK + K EE ++ + K + + ++ + ++K + +KKK++ KKK + K
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKA 1387
Query: 71 IEESLKERKEKTLTEDSQ 88
E+ + +K ED +
Sbjct: 1388 EEKKKADEAKKKAEEDKK 1405
Score = 45.9 bits (108), Expect = 3e-05
Identities = 22/78 (28%), Positives = 46/78 (58%), Gaps = 4/78 (5%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK----KKKKKKKKKK 63
A EKKK E + K EE ++ ++ +K+ E+ +++ + KKK ++ KK K + +
Sbjct: 1298 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357
Query: 64 EEQSVKEIEESLKERKEK 81
E ++ +E E+ +++KE+
Sbjct: 1358 EAEAAEEKAEAAEKKKEE 1375
Score = 45.9 bits (108), Expect = 3e-05
Identities = 28/86 (32%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEK---EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
A E KK E + K EE ++ E+ +K E + +E K K ++ KK + KKK ++ KKK
Sbjct: 1440 AEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKA 1499
Query: 65 EQSVKEIEESLKERKEKTLTEDSQCD 90
+++ K E K + K E + D
Sbjct: 1500 DEAKKAAEAKKKADEAKKAEEAKKAD 1525
Score = 45.5 bits (107), Expect = 3e-05
Identities = 26/75 (34%), Positives = 47/75 (62%), Gaps = 2/75 (2%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--KKKKKKKKKEE 65
A EKKK E + K EE ++ ++ +K+ E+ ++ + KKK ++ KK KKK ++ KK +
Sbjct: 1427 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKAD 1486
Query: 66 QSVKEIEESLKERKE 80
++ K+ EE+ K+ E
Sbjct: 1487 EAKKKAEEAKKKADE 1501
Score = 45.5 bits (107), Expect = 4e-05
Identities = 24/80 (30%), Positives = 50/80 (62%), Gaps = 2/80 (2%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
KKK EK+ +E K++ E+++K+ ++++ KK + KKK ++KKK + KK+ +
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEA 1443
Query: 69 KEIEESLKERKEKTLTEDSQ 88
K+ +E+ K+ +E E+++
Sbjct: 1444 KKADEAKKKAEEAKKAEEAK 1463
Score = 45.5 bits (107), Expect = 4e-05
Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 2/77 (2%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEK--EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
A EKKK E + EE+K+ +E ++K E + +E K K ++ KKK KKK ++ KK
Sbjct: 1286 AEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAA 1345
Query: 66 QSVKEIEESLKERKEKT 82
++ K E+ + E
Sbjct: 1346 EAAKAEAEAAADEAEAA 1362
Score = 45.1 bits (106), Expect = 4e-05
Identities = 28/79 (35%), Positives = 51/79 (64%), Gaps = 1/79 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EKK + E+ K EE + + EE +K + E++K ++ KKK+ ++KKK ++ KK +EE +K
Sbjct: 1604 EKKMKAEEAKKAEE-AKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIK 1662
Query: 70 EIEESLKERKEKTLTEDSQ 88
EE+ K ++K E+++
Sbjct: 1663 AAEEAKKAEEDKKKAEEAK 1681
Score = 44.7 bits (105), Expect = 6e-05
Identities = 25/90 (27%), Positives = 42/90 (46%), Gaps = 7/90 (7%)
Query: 8 AGEKKKRIEKRNKEEEK---EENEQEEKEKEDVEEEKNK----KKKKKKKKKKKKKKKKK 60
A E KK EK+ +E K EE ++ ++ K+ EE K K KKK ++ KK + K +
Sbjct: 1292 ADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAE 1351
Query: 61 KKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
+ + EE + ++K + D
Sbjct: 1352 AEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381
Score = 44.7 bits (105), Expect = 6e-05
Identities = 28/95 (29%), Positives = 55/95 (57%), Gaps = 4/95 (4%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDV--EEEKNKKKKKKKKKKKKKKKKKK 60
+++ KK+ E + EE K++ +E+K+ E++ EE+NK K ++ KK+ ++ KKK
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA 1746
Query: 61 K--KKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
+ KK+E+ K+I KE ++K + + +I
Sbjct: 1747 EEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVI 1781
Score = 43.6 bits (102), Expect = 2e-04
Identities = 26/82 (31%), Positives = 49/82 (59%), Gaps = 3/82 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK---KKKEEQ 66
KKK + K EEK++ ++ +K+ E+ +++ ++ KK KKK + KKK KKK ++
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435
Query: 67 SVKEIEESLKERKEKTLTEDSQ 88
+ K+ EE+ K + K E+++
Sbjct: 1436 AKKKAEEAKKADEAKKKAEEAK 1457
Score = 43.6 bits (102), Expect = 2e-04
Identities = 26/81 (32%), Positives = 45/81 (55%), Gaps = 3/81 (3%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE---EEKNKKKKKKKKKKKKKKKKKKKKKE 64
A E KK+ E+ K EE ++ +E K+ ++ + EE K + KKK ++ KKK + KK
Sbjct: 1446 ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505
Query: 65 EQSVKEIEESLKERKEKTLTE 85
++ K+ +E+ K + K E
Sbjct: 1506 AEAKKKADEAKKAEEAKKADE 1526
Score = 43.2 bits (101), Expect = 2e-04
Identities = 29/87 (33%), Positives = 43/87 (49%), Gaps = 4/87 (4%)
Query: 8 AGEKKKRIEKRNKEEEK----EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
A EKKK E + K EE +E ++ K+ +E K K ++KKK + KKK ++ KK
Sbjct: 1387 AEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKA 1446
Query: 64 EEQSVKEIEESLKERKEKTLTEDSQCD 90
+E K E E +K E + D
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKAD 1473
Score = 43.2 bits (101), Expect = 2e-04
Identities = 32/90 (35%), Positives = 55/90 (61%), Gaps = 10/90 (11%)
Query: 4 RRGGAGEKKKRIEKRNKEEE--------KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
++ A EKKK E + EEE K+E E+++K+ E+ ++++ +KKK KK+++
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEE 1767
Query: 56 KKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
KK ++ +KE+++V IEE L E EK E
Sbjct: 1768 KKAEEIRKEKEAV--IEEELDEEDEKRRME 1795
Score = 43.2 bits (101), Expect = 2e-04
Identities = 25/80 (31%), Positives = 45/80 (56%), Gaps = 4/80 (5%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE----Q 66
KKK E + E K++ ++ +K +E + ++ KK ++ KK + KK ++KKK +E +
Sbjct: 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAE 1555
Query: 67 SVKEIEESLKERKEKTLTED 86
+K+ EE K + K ED
Sbjct: 1556 ELKKAEEKKKAEEAKKAEED 1575
Score = 42.4 bits (99), Expect = 3e-04
Identities = 26/85 (30%), Positives = 50/85 (58%), Gaps = 5/85 (5%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEK-----EKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
A E KK E + K EE ++ E+E+K +K++ EE+K ++ KK +++ K K ++ K
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAK 1668
Query: 63 KEEQSVKEIEESLKERKEKTLTEDS 87
K E+ K+ EE+ K +++ ++
Sbjct: 1669 KAEEDKKKAEEAKKAEEDEKKAAEA 1693
Score = 42.4 bits (99), Expect = 3e-04
Identities = 26/73 (35%), Positives = 45/73 (61%), Gaps = 1/73 (1%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
++ RIE+ K E+E+ + E+ K+ EE K K ++ KK +++KKK ++ KKKE + K+
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKA-EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648
Query: 71 IEESLKERKEKTL 83
EE K +E +
Sbjct: 1649 AEELKKAEEENKI 1661
Score = 42.4 bits (99), Expect = 4e-04
Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 4/86 (4%)
Query: 8 AGEKKKRIEKRNKEEEK---EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
A E KK E + K EE +E +++ +E + +E K K ++ KKK + KK + KKK
Sbjct: 1453 AEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA 1512
Query: 65 EQSVKEIEESLKERKEKTLTEDSQCD 90
++ K+ EE+ K + K E + D
Sbjct: 1513 DE-AKKAEEAKKADEAKKAEEAKKAD 1537
Score = 41.7 bits (97), Expect = 6e-04
Identities = 26/79 (32%), Positives = 48/79 (60%), Gaps = 5/79 (6%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE---EEKNKKKKKKKKKKKKKKKKKKKKKE 64
A E KK+ E++ K +E ++ +E K+ ++ + EE K ++ KKK ++ KK + KKK
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKA 1479
Query: 65 EQSVK--EIEESLKERKEK 81
E++ K E ++ +E K+K
Sbjct: 1480 EEAKKADEAKKKAEEAKKK 1498
Score = 41.7 bits (97), Expect = 6e-04
Identities = 25/76 (32%), Positives = 47/76 (61%), Gaps = 2/76 (2%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E KK+ ++ K EE ++ ++ +K +E + ++ KK ++KKK + KK ++ KK EE
Sbjct: 1505 AAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE-- 1562
Query: 68 VKEIEESLKERKEKTL 83
K+ EE+ K ++K +
Sbjct: 1563 KKKAEEAKKAEEDKNM 1578
Score = 41.7 bits (97), Expect = 6e-04
Identities = 29/100 (29%), Positives = 49/100 (49%), Gaps = 19/100 (19%)
Query: 8 AGEKKKRIEKRNKEEEKEENE---------------QEEKEKEDVEEEKNKKKKKKKKKK 52
A E KK E E E +E +E K+K D ++K ++KKK + K
Sbjct: 1338 AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAK 1397
Query: 53 KKKKKKKKK----KKEEQSVKEIEESLKERKEKTLTEDSQ 88
KK ++ KKK KK + K+ +E+ K+ +EK ++++
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437
Score = 41.3 bits (96), Expect = 8e-04
Identities = 24/80 (30%), Positives = 43/80 (53%), Gaps = 4/80 (5%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKED----VEEEKNKKKKKKKKKKKKKKKKKKKKK 63
A E KK+ E+ K +E ++ +E K+K D E K K + KK ++ KK + KK +
Sbjct: 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531
Query: 64 EEQSVKEIEESLKERKEKTL 83
E + E +++ +++K L
Sbjct: 1532 EAKKADEAKKAEEKKKADEL 1551
Score = 41.3 bits (96), Expect = 9e-04
Identities = 31/83 (37%), Positives = 47/83 (56%), Gaps = 11/83 (13%)
Query: 10 EKKKRIEKRNKEEEKEENEQ----EEKEKEDVE-----EEKNKKKKKKKKKKKKKKKKKK 60
K K E + EEEK++ EQ E +EK+ E EE+NK K ++ KK ++ KKK
Sbjct: 1618 AKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKA 1677
Query: 61 K--KKEEQSVKEIEESLKERKEK 81
+ KK E+ K+ E+LK+ E+
Sbjct: 1678 EEAKKAEEDEKKAAEALKKEAEE 1700
Score = 40.9 bits (95), Expect = 0.001
Identities = 25/84 (29%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK-KKKKKKKKKKKKKKKKKKKKEEQ 66
A E KK E + K EE ++ ++ +K+ E+ +++ ++ KK + KKK + KK ++ K+
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKAD 1525
Query: 67 SVKEIEESLKERKEKTLTEDSQCD 90
K+ EE+ K + K E + D
Sbjct: 1526 EAKKAEEAKKADEAKKAEEKKKAD 1549
Score = 40.1 bits (93), Expect = 0.002
Identities = 23/78 (29%), Positives = 41/78 (52%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E KK E + K EE ++ E K+ + +++ ++ KK ++ KK + KK ++ K+
Sbjct: 1479 AEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE 1538
Query: 68 VKEIEESLKERKEKTLTE 85
K+ EE K + K E
Sbjct: 1539 AKKAEEKKKADELKKAEE 1556
Score = 39.7 bits (92), Expect = 0.002
Identities = 21/76 (27%), Positives = 47/76 (61%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K + E+ KE E+++ + EE +K++ E++K KK+++KK ++ +K+K+ E+ +
Sbjct: 1727 ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786
Query: 70 EIEESLKERKEKTLTE 85
E +E + +K + +
Sbjct: 1787 EEDEKRRMEVDKKIKD 1802
Score = 39.7 bits (92), Expect = 0.002
Identities = 24/82 (29%), Positives = 46/82 (56%), Gaps = 4/82 (4%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
KKK E + K +E ++ + +K+ ++ ++ + KK + KK ++ KK + KK E+ K
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKA 1548
Query: 71 IE----ESLKERKEKTLTEDSQ 88
E E LK+ +EK E+++
Sbjct: 1549 DELKKAEELKKAEEKKKAEEAK 1570
Score = 39.0 bits (90), Expect = 0.004
Identities = 22/80 (27%), Positives = 43/80 (53%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+RK R + K EE + ++ +K +E + ++ KK ++KKK + KKK ++
Sbjct: 1254 IRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEE 1313
Query: 61 KKKEEQSVKEIEESLKERKE 80
KK +++ K+ EE+ K+
Sbjct: 1314 AKKADEAKKKAEEAKKKADA 1333
Score = 38.6 bits (89), Expect = 0.005
Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A EKKK E + EE+K ++ +E + EE + ++ K +++KK K ++ KK EE
Sbjct: 1560 AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAK 1619
Query: 68 VKEIEESLKERKEKTLTE 85
+K EE K +EK E
Sbjct: 1620 IKA-EELKKAEEEKKKVE 1636
Score = 37.8 bits (87), Expect = 0.009
Identities = 22/78 (28%), Positives = 45/78 (57%), Gaps = 2/78 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ K+ ++ K EE ++ ++ +K +E + ++ KK ++ KK ++KKK ++ KK EE
Sbjct: 1519 EEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM 1578
Query: 70 EIE--ESLKERKEKTLTE 85
+ E K+ +E + E
Sbjct: 1579 ALRKAEEAKKAEEARIEE 1596
Score = 36.7 bits (84), Expect = 0.025
Identities = 26/89 (29%), Positives = 49/89 (55%), Gaps = 8/89 (8%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E KK E + EE K+ +E ++ E++ +E K ++ KK ++KKK ++ KK +E+++
Sbjct: 1518 AEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKN 1577
Query: 68 V--------KEIEESLKERKEKTLTEDSQ 88
+ K+ EE+ E K E+ +
Sbjct: 1578 MALRKAEEAKKAEEARIEEVMKLYEEEKK 1606
Score = 35.5 bits (81), Expect = 0.045
Identities = 23/85 (27%), Positives = 46/85 (54%), Gaps = 6/85 (7%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVE------EEKNKKKKKKKKKKKKKKKKKKKKK 63
++ K+ E+ K EEK++ E+ +K +ED EE K ++ + ++ K +++KK K
Sbjct: 1549 DELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMK 1608
Query: 64 EEQSVKEIEESLKERKEKTLTEDSQ 88
E++ K E +K + K E+ +
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAEEEKK 1633
Score = 35.5 bits (81), Expect = 0.048
Identities = 18/72 (25%), Positives = 34/72 (47%), Gaps = 2/72 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K+ EE + E+ +K + EE K ++ KKK + +K ++ +K E+ +
Sbjct: 1083 AKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAED--AR 1140
Query: 70 EIEESLKERKEK 81
+ EE+ K K
Sbjct: 1141 KAEEARKAEDAK 1152
Score = 35.1 bits (80), Expect = 0.065
Identities = 25/89 (28%), Positives = 44/89 (49%), Gaps = 8/89 (8%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK--------KKK 59
A E KK E + EE+K+ +E ++ E+ EEK K ++ KK ++ K KK
Sbjct: 1530 AEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKA 1589
Query: 60 KKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
++ + E+ +K EE K + E+ +
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618
Score = 34.0 bits (77), Expect = 0.16
Identities = 21/72 (29%), Positives = 37/72 (51%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + R + K E ++ E + EE+K + KK ++KKK + KKK +E +
Sbjct: 1259 EARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKAD 1318
Query: 70 EIEESLKERKEK 81
E ++ +E K+K
Sbjct: 1319 EAKKKAEEAKKK 1330
Score = 33.2 bits (75), Expect = 0.25
Identities = 17/79 (21%), Positives = 49/79 (62%), Gaps = 1/79 (1%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
K + +K+ +K+ EE K++ E+E+K+ +++E+ KK ++ +K+K+ +++
Sbjct: 1728 NKIKAEEAKKEAEEDKKKAEEAKKD-EEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786
Query: 62 KKEEQSVKEIEESLKERKE 80
+++E+ E+++ +K+ +
Sbjct: 1787 EEDEKRRMEVDKKIKDIFD 1805
Score = 32.8 bits (74), Expect = 0.30
Identities = 31/97 (31%), Positives = 49/97 (50%), Gaps = 16/97 (16%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE------------EEKNK----KKKKKKKK 51
A E KK E+ K EE+ NE+ K +E EE K KK ++KKK
Sbjct: 1232 AEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKK 1291
Query: 52 KKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
+ KK ++KKK +++ K+ EE+ K + K E+++
Sbjct: 1292 ADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328
Score = 32.8 bits (74), Expect = 0.35
Identities = 22/72 (30%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 8 AGEKKKRIEKRNKEE-EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
A E +K + R E K E E++ +E E+ K + KK ++ KK ++ KK +EE+
Sbjct: 1190 AEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEER 1249
Query: 67 SVKEIEESLKER 78
+ +EI + + R
Sbjct: 1250 NNEEIRKFEEAR 1261
Score = 32.4 bits (73), Expect = 0.45
Identities = 22/86 (25%), Positives = 44/86 (51%), Gaps = 2/86 (2%)
Query: 5 RGGAGEKKKRIEKRNKEEEK--EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+ A E KK E+RN EE + EE + + + +K + KK ++KKK +
Sbjct: 1236 KKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEA 1295
Query: 63 KEEQSVKEIEESLKERKEKTLTEDSQ 88
K+ + K+ +E+ K+ +E ++++
Sbjct: 1296 KKAEEKKKADEAKKKAEEAKKADEAK 1321
Score = 30.9 bits (69), Expect = 1.4
Identities = 22/84 (26%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
K G E K K + KE+N +E +E + + KK + K ++ +K ++ K
Sbjct: 1062 AKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAK 1121
Query: 62 KKEEQSVKEIEESLKERKEKTLTE 85
KK E ++ EE+ K + E
Sbjct: 1122 KKAED-ARKAEEARKAEDARKAEE 1144
Score = 29.7 bits (66), Expect = 3.4
Identities = 21/72 (29%), Positives = 38/72 (52%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ ++ E+ K E+ + E K +E+ + E+ +K + KK + KK ++ KK E++ K
Sbjct: 1185 EEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKK 1244
Query: 70 EIEESLKERKEK 81
EE E K
Sbjct: 1245 AEEERNNEEIRK 1256
Score = 29.3 bits (65), Expect = 3.6
Identities = 22/73 (30%), Positives = 36/73 (49%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E +K E R E+ ++ + E+E EE K + KK + KK ++ KK EE
Sbjct: 1184 AEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAK 1243
Query: 68 VKEIEESLKERKE 80
E E + +E ++
Sbjct: 1244 KAEEERNNEEIRK 1256
Score = 29.0 bits (64), Expect = 6.0
Identities = 22/98 (22%), Positives = 45/98 (45%), Gaps = 4/98 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQ----EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+KKK E + EEEK++ EEK+ E++ +EK +++ ++ +K++ + KK +
Sbjct: 1742 DKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIK 1801
Query: 66 QSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYK 103
++ KE L + + K+ K
Sbjct: 1802 DIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSK 1839
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 47.3 bits (113), Expect = 1e-06
Identities = 19/59 (32%), Positives = 41/59 (69%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+K + KE+E+EE E+ E E+ D EE+ ++ +K+ K K++K+++ ++K+++ +KE
Sbjct: 91 RKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKE 149
Score = 45.7 bits (109), Expect = 4e-06
Identities = 21/59 (35%), Positives = 37/59 (62%), Gaps = 3/59 (5%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
G +K+K E+ EEE E E +E+E+ D EK K K++K+++ ++K+K+ KE+
Sbjct: 95 GLDKKEKEEEE---EEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQ 150
Score = 45.4 bits (108), Expect = 6e-06
Identities = 17/55 (30%), Positives = 36/55 (65%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+ +K +EEE+EE E EE ++E+ +E +K+ K K++K+++ ++K+K+
Sbjct: 92 KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEI 146
Score = 39.6 bits (93), Expect = 6e-04
Identities = 15/56 (26%), Positives = 39/56 (69%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+K +++ +K+E++EE E+E + +E EEE+ + +K+ K K++K+++ ++++
Sbjct: 88 KKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQ 143
Score = 38.4 bits (90), Expect = 0.001
Identities = 19/56 (33%), Positives = 39/56 (69%), Gaps = 2/56 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK--KKKKKKKKKKKKK 63
+KK++ E+ +E E EE ++EE+ E +E+E K K++K++ ++K+K+ K++ K
Sbjct: 97 DKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152
Score = 28.4 bits (64), Expect = 3.8
Identities = 11/50 (22%), Positives = 31/50 (62%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
K ++ V + KK+K+++++++ + ++ +EEQ + +E+ L + K +
Sbjct: 85 KWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKRE 134
Score = 27.3 bits (61), Expect = 8.2
Identities = 11/44 (25%), Positives = 26/44 (59%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
+K +K KK+K+++++++ + EE +E + L E++ L
Sbjct: 88 KKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKL 131
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 49.8 bits (119), Expect = 1e-06
Identities = 27/89 (30%), Positives = 41/89 (46%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E KK+++ N + + E + DV E KK KKKKKKKKK++K + E +
Sbjct: 242 EVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEA 301
Query: 70 EIEESLKERKEKTLTEDSQCDTMIPVFKK 98
E S K + E++ P +K
Sbjct: 302 EGLGSSDSGSRKDVEEENARLEDSPKKRK 330
Score = 35.5 bits (82), Expect = 0.041
Identities = 26/90 (28%), Positives = 49/90 (54%), Gaps = 15/90 (16%)
Query: 2 RKRRGGAGEKKKRIEKRNK-EEEKEENEQEEK--------EKEDVEE------EKNKKKK 46
+KR+ A K+KR E R K + +E+ E+ K E +D ++ +K+KK++
Sbjct: 43 KKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQ 102
Query: 47 KKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
KKK+ ++KK +K++E++ + E L
Sbjct: 103 KKKEAERKKALLLDEKEKERAAEYTSEDLA 132
Score = 35.1 bits (81), Expect = 0.059
Identities = 24/103 (23%), Positives = 42/103 (40%), Gaps = 24/103 (23%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--------------- 54
K+ R + +K + E E K K + EK K ++K+++ K
Sbjct: 30 TKESRDAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDD 89
Query: 55 ---------KKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
KK++KKK+ E + ++E KER + +ED
Sbjct: 90 DTKAWLKKSKKRQKKKEAERKKALLLDEKEKERAAEYTSEDLA 132
Score = 34.7 bits (80), Expect = 0.077
Identities = 15/69 (21%), Positives = 32/69 (46%), Gaps = 3/69 (4%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKE--KEDVEEEKNKKKKKKKK-KKKKKKKKKKKKKEE 65
G K + KE + E K+ EE + K+K+++ + K K ++K+++ +
Sbjct: 16 GLKPLPVPGAKPGSTKESRDAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKL 75
Query: 66 QSVKEIEES 74
+K + E
Sbjct: 76 GGIKTLGED 84
Score = 32.0 bits (73), Expect = 0.51
Identities = 23/109 (21%), Positives = 49/109 (44%), Gaps = 33/109 (30%)
Query: 11 KKKRIEKRNKEEEKEENEQE---------------EKEKEDVEEEKNKKKKKKK------ 49
KKK+ +K+ + ++ +E+E E + E+E+ E + KK+K++
Sbjct: 279 KKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDDF 338
Query: 50 ------------KKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
K+++ +KK+KK + E ++I E E +E+ +
Sbjct: 339 VEDDDDLQASLAKQRRLAQKKRKKLRPEDIARQIAEERSEDEEEAAENN 387
Score = 30.1 bits (68), Expect = 2.3
Identities = 17/82 (20%), Positives = 37/82 (45%), Gaps = 12/82 (14%)
Query: 15 IEKRNK----------EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
IE+ NK + ++ ++ E KK++++ + K+K+++ ++K +
Sbjct: 5 IEETNKLRAKLGLKPLPVPGAKPGSTKESRDAAAYENWKKRQEEAEAKRKREELREKIAK 64
Query: 65 EQSVKEIEESLKERKEKTLTED 86
+ +E L KTL ED
Sbjct: 65 AREKRERNSKLG--GIKTLGED 84
Score = 29.3 bits (66), Expect = 3.4
Identities = 23/100 (23%), Positives = 37/100 (37%), Gaps = 29/100 (29%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKK----------------------------- 51
+++K +N ++E KK+++ KKK
Sbjct: 216 KKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKF 275
Query: 52 KKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
KK KKKKKKKKK + + E E + + S+ D
Sbjct: 276 KKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDV 315
Score = 28.6 bits (64), Expect = 5.8
Identities = 13/84 (15%), Positives = 37/84 (44%), Gaps = 3/84 (3%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+ + +KR +E+E ++ +++ + + ++ + +KK+KK + + ++
Sbjct: 312 RKDVEEENARLEDSPKKRKEEQEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRPEDIARQ 371
Query: 61 KKKEEQSVKEIEESLKERKEKTLT 84
+E E EE E + +
Sbjct: 372 IAEERS---EDEEEAAENNDNGIV 392
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 49.6 bits (119), Expect = 1e-06
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 5/66 (7%)
Query: 12 KKRI----EKRNKEE-EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
K R+ + K EK+ + +K E+K K+K+K K KK+ + K K+ +
Sbjct: 369 KARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKP 428
Query: 67 SVKEIE 72
S E
Sbjct: 429 SGTSEE 434
Score = 44.5 bits (106), Expect = 5e-05
Identities = 19/76 (25%), Positives = 40/76 (52%), Gaps = 2/76 (2%)
Query: 14 RIEKRNKEEEKEE--NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+IE+ +E K +E K K E++ K KK K+ +KK+K+K+K + +
Sbjct: 359 KIERYIEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRD 418
Query: 72 EESLKERKEKTLTEDS 87
+++ +R++ + T +
Sbjct: 419 TKNIGKRRKPSGTSEE 434
Score = 38.8 bits (91), Expect = 0.004
Identities = 16/54 (29%), Positives = 28/54 (51%), Gaps = 3/54 (5%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+K+ K +K+ + E++EKEKE K KK+ + K K++K +E
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKP---KVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 38.4 bits (90), Expect = 0.005
Identities = 16/64 (25%), Positives = 24/64 (37%), Gaps = 15/64 (23%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K+ G +K KE+EK EK K KK+ + K K++K
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEK---------------EKPKVKKRHRDTKNIGKRRKPSG 430
Query: 63 KEEQ 66
E+
Sbjct: 431 TSEE 434
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 48.7 bits (116), Expect = 2e-06
Identities = 24/92 (26%), Positives = 44/92 (47%), Gaps = 9/92 (9%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEE------KEKEDVEEEKNKKKKKKKKKKKKKK 56
++R A + K+ E+ K+ E+++ + EE E + E + +KK K++ KK+ ++
Sbjct: 95 EQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEE 154
Query: 57 KKKKKKKEEQSVKEIEESLK---ERKEKTLTE 85
+ K K E K E K E K K +
Sbjct: 155 EAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAK 186
Score = 46.0 bits (109), Expect = 1e-05
Identities = 15/70 (21%), Positives = 36/70 (51%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K+ ++R K+ E++ E E++ + +K +++ +K K+ ++ K+ EE+ +
Sbjct: 63 AKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAE 122
Query: 72 EESLKERKEK 81
E K+ E
Sbjct: 123 EAKAKQAAEA 132
Score = 44.4 bits (105), Expect = 5e-05
Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 1/80 (1%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEK-EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
A E K + K + + E E++ KE+ + EE+ K K + KKK + KKK + E +
Sbjct: 121 AEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAK 180
Query: 67 SVKEIEESLKERKEKTLTED 86
+ E + K + K E
Sbjct: 181 AKAEAKAKAKAEEAKAKAEA 200
Score = 41.4 bits (97), Expect = 5e-04
Identities = 19/77 (24%), Positives = 38/77 (49%), Gaps = 6/77 (7%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKE-KEDVEEEKNKKKKKKKKKKKKKKKK-----KKKKKE 64
KK + E+++ + +KE ++ EK K+ ++ K+ ++K+K K K+
Sbjct: 71 KKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAA 130
Query: 65 EQSVKEIEESLKERKEK 81
E K E+ K+ KE+
Sbjct: 131 EAKAKAEAEAEKKAKEE 147
Score = 40.6 bits (95), Expect = 7e-04
Identities = 23/85 (27%), Positives = 35/85 (41%), Gaps = 6/85 (7%)
Query: 2 RKRRGGAGEKKKRIEKRNK--EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
K + A +KK E+ K EEE + E +K+ E K K + + K K + K K K
Sbjct: 132 AKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKK-AAEAKKKAEAEAKAKAEAKAKAK 190
Query: 60 ---KKKKEEQSVKEIEESLKERKEK 81
K K E + + + E
Sbjct: 191 AEEAKAKAEAAKAKAAAEAAAKAEA 215
Score = 40.2 bits (94), Expect = 0.001
Identities = 22/92 (23%), Positives = 43/92 (46%), Gaps = 11/92 (11%)
Query: 1 MRKRRGGAGEKKKRIEKRNK-EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK- 58
K+R ++K +E+R E+ ++ EQ K+ E+ +++ + K K+ + K K +
Sbjct: 80 AEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAE 139
Query: 59 ---------KKKKKEEQSVKEIEESLKERKEK 81
KK+ +EE K E+ K+ E
Sbjct: 140 AEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEA 171
Score = 39.8 bits (93), Expect = 0.001
Identities = 20/78 (25%), Positives = 41/78 (52%), Gaps = 7/78 (8%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+KK K+ +E +K+ +Q E+ ++ E+ ++K+ +++ +K K + E+ + K+
Sbjct: 58 QKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAK---QAEQAA-KQ 113
Query: 71 IEESLK---ERKEKTLTE 85
EE K E K K E
Sbjct: 114 AEEKQKQAEEAKAKQAAE 131
Score = 39.8 bits (93), Expect = 0.001
Identities = 17/80 (21%), Positives = 30/80 (37%), Gaps = 2/80 (2%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK--KKKEE 65
+K+ E + K + + + E +K+ E K K + K K K ++ K K + K K
Sbjct: 147 EAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAA 206
Query: 66 QSVKEIEESLKERKEKTLTE 85
E+ E
Sbjct: 207 AEAAAKAEAEAAAAAAAEAE 226
Score = 39.0 bits (91), Expect = 0.003
Identities = 20/71 (28%), Positives = 30/71 (42%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
K + A KKK E + K E + + + E K K EE K K + K K + K +
Sbjct: 156 AKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEA 215
Query: 62 KKEEQSVKEIE 72
+ + E E
Sbjct: 216 EAAAAAAAEAE 226
Score = 31.3 bits (71), Expect = 0.81
Identities = 13/70 (18%), Positives = 38/70 (54%), Gaps = 8/70 (11%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK--------KKKKKEEQSVKEI 71
++ +Q++K E+E+ KK +++ ++ +K++ +++ E++ K+
Sbjct: 48 VAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQA 107
Query: 72 EESLKERKEK 81
E++ K+ +EK
Sbjct: 108 EQAAKQAEEK 117
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 48.4 bits (116), Expect = 2e-06
Identities = 20/72 (27%), Positives = 46/72 (63%), Gaps = 6/72 (8%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++++K +EEE++ K E+ +E+ ++KK++KKK++++ K K EEQ +
Sbjct: 256 EVLRKVDKTREEEEEKIL----KAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQ--R 309
Query: 70 EIEESLKERKEK 81
++EE ++++ +
Sbjct: 310 KLEEKERKKQAR 321
Score = 42.2 bits (100), Expect = 2e-04
Identities = 17/50 (34%), Positives = 34/50 (68%), Gaps = 3/50 (6%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKK---KKKKKKKKKKKKKKKKK 63
+ EEE++E QE+KE++ EE + K K ++++K ++K++KK+ +K
Sbjct: 273 LKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 38.4 bits (90), Expect = 0.004
Identities = 11/47 (23%), Positives = 32/47 (68%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
+++R E+ +++E+++ E+ E + + E+ +K ++K++KK+ +K
Sbjct: 276 AEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 29.1 bits (66), Expect = 3.5
Identities = 14/52 (26%), Positives = 28/52 (53%), Gaps = 8/52 (15%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
+R+ A EKK+ +K E + + K EE++ ++K++KK+ +K
Sbjct: 279 ERQEEAQEKKEEKKK--------EEREAKLAKLSPEEQRKLEEKERKKQARK 322
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 48.5 bits (116), Expect = 2e-06
Identities = 18/67 (26%), Positives = 36/67 (53%)
Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
E + EE+ ++E + E E NK + K+K K ++ K+K++K+ E+ KE ++
Sbjct: 242 LEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301
Query: 74 SLKERKE 80
K+ +
Sbjct: 302 LKKKLAQ 308
Score = 42.0 bits (99), Expect = 4e-04
Identities = 19/82 (23%), Positives = 39/82 (47%), Gaps = 15/82 (18%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEE----KNKKKKKKKKKKKKKKK---------KKKKK 62
+ EEE ++ E + + E + K+K K ++ K+K++K K+ KK
Sbjct: 245 SDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKK 304
Query: 63 KEEQ--SVKEIEESLKERKEKT 82
K Q +KEI + + ++++
Sbjct: 305 KLAQLARLKEIAKEVAQKEKAR 326
Score = 40.1 bits (94), Expect = 0.001
Identities = 17/61 (27%), Positives = 32/61 (52%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
EE +++ +EE + E E + + K + K+K K ++ +E+ KE+E KE K+
Sbjct: 242 LEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301
Query: 81 K 81
Sbjct: 302 L 302
Score = 38.5 bits (90), Expect = 0.004
Identities = 14/70 (20%), Positives = 39/70 (55%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
++K + ++ ++ KE + ++EK+ ++ + K+ K+ +K+K + +K+EQ +
Sbjct: 276 KRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKE 335
Query: 70 EIEESLKERK 79
E+ +R+
Sbjct: 336 RGEKKKLKRR 345
Score = 36.6 bits (85), Expect = 0.015
Identities = 16/61 (26%), Positives = 39/61 (63%), Gaps = 8/61 (13%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKE--------KEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+K+R E + +E+++ +++ + KE ++EK + +KK+++K++ +KKK K++
Sbjct: 286 EKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLKRR 345
Query: 63 K 63
K
Sbjct: 346 K 346
Score = 32.0 bits (73), Expect = 0.41
Identities = 23/90 (25%), Positives = 46/90 (51%), Gaps = 13/90 (14%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEEN-------EQEEKEKEDVEEEKNKKKKKKKKKKKK 54
+++R E++ + EK+ K++ + E +KEK +K ++K++ +KKK K
Sbjct: 285 KEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKA-RARKKEQRKERGEKKKLK 343
Query: 55 KKKKKKKKKEEQSVK-----EIEESLKERK 79
++K K K E ++ E+ SL+ K
Sbjct: 344 RRKLGKHKYPEPPLEVVLPDELSGSLRTLK 373
Score = 30.8 bits (70), Expect = 1.2
Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 4/74 (5%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK----KKKKKKEEQSVKEI 71
E NK + + ++ KE +E ++ K++K+ KKK + K+ K+ Q K
Sbjct: 267 EPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKAR 326
Query: 72 EESLKERKEKTLTE 85
++RKE+ +
Sbjct: 327 ARKKEQRKERGEKK 340
Score = 29.7 bits (67), Expect = 2.3
Identities = 15/54 (27%), Positives = 30/54 (55%)
Query: 35 EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
E+ E+E +KK+++ ++ ++KK +K E + E+ E L E + E+S
Sbjct: 201 EEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESD 254
Score = 28.1 bits (63), Expect = 6.8
Identities = 12/50 (24%), Positives = 23/50 (46%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+E++ E E+ +++ + + KKK++KK KK K E
Sbjct: 21 LEEQRDERITGGVLAEKPDEDLFVVDTTGDEATIKKKRRKKLKKPLKSDE 70
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 48.2 bits (115), Expect = 4e-06
Identities = 34/132 (25%), Positives = 52/132 (39%), Gaps = 16/132 (12%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+ K+ K++K++ +++ + + +E +NK+ KKKKKKKKKKKKK +
Sbjct: 16 QNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAY 75
Query: 66 Q---------SVKEIEESLKERKEKTLTEDSQCDTMIP------VFKKGVLYKGIYCPSL 110
S +++ RK E T P FK G G
Sbjct: 76 DLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWKQTQPPTIPVSKQFKDGEYPVGEIQEYP 135
Query: 111 -TNSFREPSLEM 121
NS R S E
Sbjct: 136 GENSSRTSSEEK 147
Score = 35.8 bits (83), Expect = 0.029
Identities = 15/57 (26%), Positives = 26/57 (45%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
NE E E + +++ +K KKK KK KK + + E +E+ ++ K
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNK 57
Score = 31.2 bits (71), Expect = 0.91
Identities = 13/49 (26%), Positives = 25/49 (51%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
E + K++K++ K+K KKK KK ++ V + + L E + +
Sbjct: 3 NENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAE 51
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 47.8 bits (114), Expect = 5e-06
Identities = 33/107 (30%), Positives = 56/107 (52%), Gaps = 24/107 (22%)
Query: 20 KEEEKE----ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
KE EKE E E+E++EK++ EE+ K+++ ++++K++KKKK KK +++ E E
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88
Query: 76 KE-----RKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREP 117
K R K +T++ Y Y SLTN + +P
Sbjct: 89 KTKPIWTRNPKDVTKEE--------------YAAFY-KSLTNDWEDP 120
Score = 40.9 bits (96), Expect = 8e-04
Identities = 23/79 (29%), Positives = 45/79 (56%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E++++ EK+ +EE+ + E+E E+E+ EE+K K KK K+ + + K K ++ K
Sbjct: 40 EEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTKPIWTRNPK 99
Query: 70 EIEESLKERKEKTLTEDSQ 88
++ + K+LT D +
Sbjct: 100 DVTKEEYAAFYKSLTNDWE 118
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 47.7 bits (114), Expect = 5e-06
Identities = 26/53 (49%), Positives = 38/53 (71%), Gaps = 3/53 (5%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
K+ + KR EE KE+ + K+K EE+K +K+KKKKK+KKK KK+KKK ++
Sbjct: 365 KEELNKR-IEEIKEKYPKPPKKKR--EEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 39.6 bits (93), Expect = 0.002
Identities = 18/45 (40%), Positives = 31/45 (68%), Gaps = 5/45 (11%)
Query: 27 NEQEEKEKEDVEEE-----KNKKKKKKKKKKKKKKKKKKKKKEEQ 66
E+ K E+++E+ K K+++KK +K+KKKKK+KKK K+ +
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRK 409
Score = 33.0 bits (76), Expect = 0.23
Identities = 16/50 (32%), Positives = 34/50 (68%), Gaps = 1/50 (2%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
E+ + +EE K +K K KKK+++KK +K+KK+++ K+ ++ K+ ++
Sbjct: 366 EELNKRIEEIK-EKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 32.6 bits (75), Expect = 0.32
Identities = 16/53 (30%), Positives = 35/53 (66%), Gaps = 2/53 (3%)
Query: 31 EKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
++ KE++ + E+ K+K K KKK+++KK +K+K+++ K+ + K++ K
Sbjct: 362 DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 45.6 bits (108), Expect = 7e-06
Identities = 25/61 (40%), Positives = 36/61 (59%), Gaps = 4/61 (6%)
Query: 11 KKKRIEKRNKEEEKEENEQE---EKEKEDVEEEKNKKKKKKK-KKKKKKKKKKKKKKEEQ 66
KKK K K ++ +E + E E+K+KKKK + K++KKKKK+KKKKK+
Sbjct: 110 KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169
Query: 67 S 67
S
Sbjct: 170 S 170
Score = 42.9 bits (101), Expect = 6e-05
Identities = 22/56 (39%), Positives = 34/56 (60%), Gaps = 3/56 (5%)
Query: 17 KRNKEEEKEENEQEEKEKE---DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K++K + K+ Q+ +E D E K +KK KKKK + K++KKKKKE++ K
Sbjct: 111 KKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166
Score = 30.6 bits (69), Expect = 0.74
Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 5/66 (7%)
Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK-----KKKKKEEQSV 68
I+ K+ + + + ++ E + + K +KK KKKK +KKKK+E+
Sbjct: 105 HIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKK 164
Query: 69 KEIEES 74
K+ S
Sbjct: 165 KKKRHS 170
Score = 30.6 bits (69), Expect = 0.79
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 15/55 (27%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
E K EK++K+++ E +K++KKKKK+KKKKKK+ +
Sbjct: 135 EGLKGHEKKHKKKKH---------------EDDKERKKKKKEKKKKKKRHSPEHP 174
Score = 28.3 bits (63), Expect = 5.1
Identities = 16/61 (26%), Positives = 27/61 (44%), Gaps = 18/61 (29%)
Query: 39 EEKNKKKKKKKKKKKKK------------------KKKKKKKKEEQSVKEIEESLKERKE 80
+ KK K K KK + + KK KKKK E+ ++ ++ K++K+
Sbjct: 107 QPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166
Query: 81 K 81
K
Sbjct: 167 K 167
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 45.1 bits (107), Expect = 8e-06
Identities = 21/47 (44%), Positives = 31/47 (65%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
++EEE EE +E E+ED E K + K+KKK++ K K +K+K K E
Sbjct: 96 DEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142
Score = 44.4 bits (105), Expect = 2e-05
Identities = 19/68 (27%), Positives = 35/68 (51%), Gaps = 1/68 (1%)
Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
+I + +E E+ E+E D E+++ + K + K+KKK++ K K E++ K +
Sbjct: 86 KIFRELSSS-DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144
Query: 74 SLKERKEK 81
K K K
Sbjct: 145 KPKPSKPK 152
Score = 38.6 bits (90), Expect = 0.001
Identities = 16/57 (28%), Positives = 29/57 (50%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
E + E+E+ EEE + +++ + K + K+KKK+E K +E K +K
Sbjct: 90 ELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146
Score = 34.3 bits (79), Expect = 0.036
Identities = 16/63 (25%), Positives = 33/63 (52%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
R+ E+++ E+ E E+E+ + + E ++ ++ + K K +K+K K + KK K
Sbjct: 88 FRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPK 147
Query: 61 KKK 63
K
Sbjct: 148 PSK 150
Score = 29.0 bits (65), Expect = 2.3
Identities = 11/43 (25%), Positives = 22/43 (51%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK 52
E++ E + + +EK++ E + + E + + KK K K K
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 45.0 bits (107), Expect = 1e-05
Identities = 23/54 (42%), Positives = 37/54 (68%), Gaps = 1/54 (1%)
Query: 11 KKKRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
K+K+ +K KE +E++E +++K + + E+N KK+K KK KKKKK KKK +
Sbjct: 151 KEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKGR 204
Score = 39.2 bits (92), Expect = 0.001
Identities = 24/59 (40%), Positives = 41/59 (69%), Gaps = 7/59 (11%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK------KKKKKKKKKKKKKKKKE 64
KK ++++ K+++K + E +E+ KE VE++K +++KK K+K KK KKKKK KK+
Sbjct: 145 KKALKRKEKQKKKSKKEWKER-KEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKK 202
Score = 37.7 bits (88), Expect = 0.004
Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 9/80 (11%)
Query: 10 EKKKRIEKR--NKEEEKEENEQEEKEKEDVEEEKNK-------KKKKKKKKKKKKKKKKK 60
+KK++E+ +K E EE E+ K E K K K K+K+K+KKK KK+
Sbjct: 103 ARKKKLEELDEDKAAEIEEKEKWTKALAKAEGVKVKDDEKLLKKALKRKEKQKKKSKKEW 162
Query: 61 KKKEEQSVKEIEESLKERKE 80
K+++E+ K+ E K+R+E
Sbjct: 163 KERKEKVEKKKAERQKKREE 182
Score = 36.9 bits (86), Expect = 0.008
Identities = 29/98 (29%), Positives = 50/98 (51%), Gaps = 18/98 (18%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK----------- 50
++RR K ++ +KR + ++KE+ ++ E E+ EE K+KKK +
Sbjct: 10 QRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSKV 69
Query: 51 -------KKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
KK K KKKKKK++ K++ + L+ RK+K
Sbjct: 70 EFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKK 107
Score = 35.7 bits (83), Expect = 0.019
Identities = 13/45 (28%), Positives = 30/45 (66%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
+ +++K++++K +KK+K+K+ KK+E + K E +K + K+
Sbjct: 6 ALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKS 50
Score = 35.0 bits (81), Expect = 0.031
Identities = 16/56 (28%), Positives = 33/56 (58%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
+E + E++ +K++++K +KK+K+K+ KKK++ Q + E +E K K
Sbjct: 2 SSREALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPI 57
Score = 32.3 bits (74), Expect = 0.23
Identities = 14/58 (24%), Positives = 29/58 (50%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
E +++ E++ +KK+K+K+ KKK+ +K + EE +E + K +
Sbjct: 1 PSSREALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIE 58
Score = 30.0 bits (68), Expect = 1.6
Identities = 15/57 (26%), Positives = 33/57 (57%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+E+R ++ E+ + +++K KE ++E +K + ++ K ++ K KKK E + I
Sbjct: 8 LEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNI 64
Score = 28.4 bits (64), Expect = 4.1
Identities = 10/58 (17%), Positives = 34/58 (58%)
Query: 45 KKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLY 102
+++++K++++K +KK+K+KE + ++ ++S E + + + I + +++
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVF 66
Score = 28.0 bits (63), Expect = 5.9
Identities = 11/55 (20%), Positives = 31/55 (56%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
E+++R ++ K +K++ ++ +K+++ + E + K ++ K KKK + +
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 46.3 bits (111), Expect = 2e-05
Identities = 19/92 (20%), Positives = 49/92 (53%), Gaps = 8/92 (8%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--------KKKKKKK 61
E KK E KE E E+ K + + E+E +++ + +K +K+ +K +
Sbjct: 46 EAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELL 105
Query: 62 KKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
+K E+ +++ E+ L++++++ ++ + + +I
Sbjct: 106 EKREEELEKKEKELEQKQQELEKKEEELEELI 137
Score = 44.4 bits (106), Expect = 7e-05
Identities = 19/76 (25%), Positives = 44/76 (57%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ + EK +E E + E++ + E K + +K++++ +KK+K+ ++++Q ++
Sbjct: 68 KLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELE 127
Query: 70 EIEESLKERKEKTLTE 85
+ EE L+E E+ L E
Sbjct: 128 KKEEELEELIEEQLQE 143
Score = 40.9 bits (97), Expect = 7e-04
Identities = 18/62 (29%), Positives = 42/62 (67%), Gaps = 1/62 (1%)
Query: 13 KRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+++EKR + EE + + E EK + E EK +K+ ++K+++ +KK+++ ++ E+ ++E+
Sbjct: 85 QKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQEL 144
Query: 72 EE 73
E
Sbjct: 145 ER 146
Score = 40.5 bits (96), Expect = 0.001
Identities = 22/81 (27%), Positives = 44/81 (54%), Gaps = 4/81 (4%)
Query: 4 RRGGAGEKKKRIEKRNK---EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
R+ A K K E+ K EE K+E E +KE +E+ K + + +K+ ++++ +
Sbjct: 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNEL 84
Query: 61 KKKEEQSVKEIEESLKERKEK 81
+K E++ + + EE+L + E
Sbjct: 85 QKLEKR-LLQKEENLDRKLEL 104
Score = 36.7 bits (86), Expect = 0.016
Identities = 17/79 (21%), Positives = 49/79 (62%), Gaps = 5/79 (6%)
Query: 8 AGEKKKRIEKRNKEEEKE-ENEQEEKE---KEDVEEEKNKKKKKKKKKKKKKKKKKKK-K 62
E ++ ++ +E +KE E ++E KE++ + +N+ +K+ ++++ + +K +K+
Sbjct: 33 IKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLL 92
Query: 63 KEEQSVKEIEESLKERKEK 81
++E+++ E L++R+E+
Sbjct: 93 QKEENLDRKLELLEKREEE 111
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 43.0 bits (102), Expect = 2e-05
Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 14/62 (22%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+++R G KK+RI R + E +E EK K+ +K ++KK K+++K+K
Sbjct: 83 KKRKRPG---KKRRIALRLRRERTKE-----------RAEKEKRTRKNREKKFKRRQKEK 128
Query: 61 KK 62
+K
Sbjct: 129 EK 130
Score = 37.6 bits (88), Expect = 0.002
Identities = 16/58 (27%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
K+ N + EKE +E+K K+K+ KK++ + ++++ KE + + + K R++K
Sbjct: 65 KKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKE--KRTRKNREKK 120
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 42.0 bits (99), Expect = 2e-05
Identities = 22/48 (45%), Positives = 26/48 (54%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
K KK KKKKKKKKKK K K++ +E E+S E K ED
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDED 60
Score = 35.5 bits (82), Expect = 0.005
Identities = 18/62 (29%), Positives = 30/62 (48%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLY 102
K K KK KKKKKKKKKK K ++ V +E ++ ++ ++ + D ++
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDG 70
Query: 103 KG 104
Sbjct: 71 MN 72
Score = 34.7 bits (80), Expect = 0.009
Identities = 20/45 (44%), Positives = 25/45 (55%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
K K K KK KKKKKKKK++ KE + KE +EK+ E
Sbjct: 7 GGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESD 51
Score = 34.3 bits (79), Expect = 0.013
Identities = 22/61 (36%), Positives = 32/61 (52%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
K K + K KKKKKKKK K K++ +K++EE+S E + E E + + D
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDG 70
Query: 92 M 92
M
Sbjct: 71 M 71
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 43.7 bits (104), Expect = 2e-05
Identities = 21/65 (32%), Positives = 33/65 (50%), Gaps = 1/65 (1%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
++ + + I N EE K+ E E D +E K +K +KK +K K K +KKK
Sbjct: 32 AQKKKGKKGELEITDLN-EEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKA 90
Query: 63 KEEQS 67
K+E+
Sbjct: 91 KKEEP 95
Score = 37.9 bits (89), Expect = 0.002
Identities = 17/48 (35%), Positives = 26/48 (54%)
Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
NE+ + KE +E KK+ K +K +KK +K K K E+ + EE
Sbjct: 48 NEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95
Score = 28.6 bits (65), Expect = 2.7
Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 16/50 (32%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
KK ++ K E+K E K K K +KKK KK++ K +
Sbjct: 65 KKELKAWEKAEKKAE----------------KAKAKAEKKKAKKEEPKPR 98
Score = 28.6 bits (65), Expect = 3.0
Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 17/50 (34%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
++ K EK K+ EK K K +KKK KK++ K +
Sbjct: 66 KELKAWEKAEKKAEKA-----------------KAKAEKKKAKKEEPKPR 98
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 43.6 bits (103), Expect = 3e-05
Identities = 24/64 (37%), Positives = 34/64 (53%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
E+ E EQ+E KE E+E K KK+KK+KK+KK +K KK+ + +K
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151
Query: 82 TLTE 85
T E
Sbjct: 152 TKKE 155
Score = 40.9 bits (96), Expect = 2e-04
Identities = 20/64 (31%), Positives = 35/64 (54%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+K++ E + EEKE + ++++KE E++ +K KKK K K KK K+ + K
Sbjct: 95 SEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKK 154
Query: 70 EIEE 73
E +
Sbjct: 155 EEGK 158
Score = 40.1 bits (94), Expect = 4e-04
Identities = 22/62 (35%), Positives = 37/62 (59%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
EE ++E ++ KE E+ E K KK+KK+KK+KK +K KKK + + +++ K+
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTT 152
Query: 81 KT 82
K
Sbjct: 153 KK 154
Score = 39.7 bits (93), Expect = 5e-04
Identities = 21/62 (33%), Positives = 33/62 (53%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
KE+++ E EEKE ++EK +KK+KK +K KKK K K + + K+ +
Sbjct: 97 KEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEE 156
Query: 80 EK 81
K
Sbjct: 157 GK 158
Score = 37.8 bits (88), Expect = 0.003
Identities = 18/64 (28%), Positives = 35/64 (54%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
++ ++E +E E++E K E+++ K+KK +K KKK K K +++ K+
Sbjct: 95 SEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKK 154
Query: 76 KERK 79
+E K
Sbjct: 155 EEGK 158
Score = 36.7 bits (85), Expect = 0.006
Identities = 12/58 (20%), Positives = 30/58 (51%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
E+ +++ +E + ++K+ K KK+KK+KK+KK + + + + + +
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKK 149
Score = 33.6 bits (77), Expect = 0.061
Identities = 20/63 (31%), Positives = 34/63 (53%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFK 97
E EK +K+ K+ ++K+ K KK+KKE++ K E+ K++ KT ++ T K
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTK 153
Query: 98 KGV 100
K
Sbjct: 154 KEE 156
Score = 32.0 bits (73), Expect = 0.24
Identities = 13/56 (23%), Positives = 35/56 (62%)
Query: 33 EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
E+ + E+++ K+ ++K+ K KK+KK+KK+++ + K ++ + + T + ++
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATK 148
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 45.1 bits (107), Expect = 3e-05
Identities = 20/72 (27%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K K+ +K N E +K+ ++++K+KE E + + K K KK KK KKK + + ++
Sbjct: 56 KDKKEDK-NNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114
Query: 71 IEESLKERKEKT 82
++ + + E
Sbjct: 115 VDNAFNKIAELA 126
Score = 42.0 bits (99), Expect = 3e-04
Identities = 18/64 (28%), Positives = 36/64 (56%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+K +K++++++K++ E++E + E + K KK KK KKK K K + + +
Sbjct: 61 DKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFN 120
Query: 70 EIEE 73
+I E
Sbjct: 121 KIAE 124
Score = 42.0 bits (99), Expect = 3e-04
Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
Query: 20 KEEEKEENEQEEKEKED--VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
KE +E+E K +++K K + KKK +KKKKKKK+KKE +S E + K
Sbjct: 36 KEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKT 95
Query: 78 RKEK 81
K+
Sbjct: 96 PKKS 99
Score = 40.9 bits (96), Expect = 8e-04
Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 5 RGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
K+ + ++EE K +K+K+ E+KN + KKK +KKKKKKK+KK+ K
Sbjct: 29 SNVLVLSKEILSTFSEEENKVATTSTKKDKK---EDKNNESKKKSEKKKKKKKEKKEPKS 85
Query: 65 EQSVKEIEESLKERKEK 81
E K ++ K+ K+
Sbjct: 86 EGETKLGFKTPKKSKKT 102
Score = 40.5 bits (95), Expect = 9e-04
Identities = 18/69 (26%), Positives = 28/69 (40%), Gaps = 1/69 (1%)
Query: 21 EEEKEENEQEEKEKED-VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
KE EE+NK KK KK+ K + KK+ + K+ ++ KE K
Sbjct: 25 SYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPK 84
Query: 80 EKTLTEDSQ 88
+ T+
Sbjct: 85 SEGETKLGF 93
Score = 40.5 bits (95), Expect = 0.001
Identities = 20/75 (26%), Positives = 43/75 (57%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
E+ NK + ++++K + ++K++KKKKKKK+KK+ K + + K ++ K+
Sbjct: 40 STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKS 99
Query: 72 EESLKERKEKTLTED 86
+++ K+ + ED
Sbjct: 100 KKTKKKPPKPKPNED 114
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 45.4 bits (108), Expect = 4e-05
Identities = 20/73 (27%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
E + I + + E + E E+ EKE V+E + K++ ++ +K+ + + K+K E+ +
Sbjct: 203 EEVLREINEISSELPELREELEKLEKE-VKELEELKEEIEELEKELESLEGSKRKLEEKI 261
Query: 69 KEIEESLKERKEK 81
+E+EE ++E K++
Sbjct: 262 RELEERIEELKKE 274
Score = 44.3 bits (105), Expect = 9e-05
Identities = 25/87 (28%), Positives = 47/87 (54%), Gaps = 8/87 (9%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK-------KKKKKKKK 54
+ EK + +E+R +E +KE E EEK KE ++E K K ++ ++ +
Sbjct: 251 EGSKRKLEEKIRELEERIEELKKEIEELEEKVKE-LKELKEKAEEYIKLSEFYEEYLDEL 309
Query: 55 KKKKKKKKKEEQSVKEIEESLKERKEK 81
++ +K+ + E+ + IEE +KE +EK
Sbjct: 310 REIEKRLSRLEEEINGIEERIKELEEK 336
Score = 39.7 bits (93), Expect = 0.002
Identities = 26/72 (36%), Positives = 48/72 (66%), Gaps = 1/72 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ +++EK KE E+ + E EE EKE E +K+K ++K ++ +++ ++ KK+ E+ ++
Sbjct: 221 EELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEE-LE 279
Query: 70 EIEESLKERKEK 81
E + LKE KEK
Sbjct: 280 EKVKELKELKEK 291
Score = 38.9 bits (91), Expect = 0.003
Identities = 15/72 (20%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
+ + + + K E+ E EE EK+ EE + +++ + ++ + + ++ E+
Sbjct: 630 KAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRR 689
Query: 69 KEIEESLKERKE 80
+EI+++L++ KE
Sbjct: 690 EEIKKTLEKLKE 701
Score = 38.1 bits (89), Expect = 0.007
Identities = 24/68 (35%), Positives = 44/68 (64%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E +++++K N EE +++ E+ EK KE + + K + K KK+ +K ++ KKK + E+ +
Sbjct: 507 ELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLD 566
Query: 70 EIEESLKE 77
E+EE L E
Sbjct: 567 ELEEELAE 574
Score = 37.7 bits (88), Expect = 0.008
Identities = 19/87 (21%), Positives = 49/87 (56%), Gaps = 2/87 (2%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+ G E+ K +E++ + E+ + + +E EK E E+ + ++ K KK++ ++ KK
Sbjct: 320 EEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKK 379
Query: 62 KKEEQSVKEIEESLK--ERKEKTLTED 86
+ + +++E+ L+ E+ ++ + E+
Sbjct: 380 RLTGLTPEKLEKELEELEKAKEEIEEE 406
Score = 37.0 bits (86), Expect = 0.016
Identities = 19/73 (26%), Positives = 49/73 (67%), Gaps = 1/73 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE-EQSV 68
E +KR+E+ KE E+ E + E+E E++ EE + ++ + + ++ +K+++E ++++
Sbjct: 637 ETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTL 696
Query: 69 KEIEESLKERKEK 81
++++E L+ER++
Sbjct: 697 EKLKEELEEREKA 709
Score = 36.2 bits (84), Expect = 0.029
Identities = 20/74 (27%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE-EQSVK 69
++ E R E+ E+E E+ +E +K+++ ++ KKK K+ +K+ +E E+ +
Sbjct: 303 EEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHE 362
Query: 70 EIEE--SLKERKEK 81
EE + KE E+
Sbjct: 363 LYEEAKAKKEELER 376
Score = 35.8 bits (83), Expect = 0.037
Identities = 21/76 (27%), Positives = 44/76 (57%), Gaps = 4/76 (5%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEK--EDVEEEKNKKKKKKKKKKKKKKKKKKKKKE--E 65
E+ + ++K+ KE EK E EE+ + E+ + +K + ++ KK+ +K +K+ E E
Sbjct: 338 ERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELE 397
Query: 66 QSVKEIEESLKERKEK 81
++ +EIEE + + +
Sbjct: 398 KAKEEIEEEISKITAR 413
Score = 35.8 bits (83), Expect = 0.039
Identities = 21/58 (36%), Positives = 35/58 (60%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
KKR+ E+ ++E E+ EK KE++EEE +K + + KK+ K+ KK +E + K
Sbjct: 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAK 435
Score = 35.0 bits (81), Expect = 0.064
Identities = 21/91 (23%), Positives = 44/91 (48%), Gaps = 5/91 (5%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+ + + E +K +E + K E ++ +E E+ EE K+ ++ ++K K+ K+ K+
Sbjct: 233 LEELKEEIEELEKELESLEGSKRKLE--EKIRELEERIEELKKEIEELEEKVKELKELKE 290
Query: 61 KKKEEQSVKEIEESLKERK---EKTLTEDSQ 88
K +E + E E + EK L+ +
Sbjct: 291 KAEEYIKLSEFYEEYLDELREIEKRLSRLEE 321
Score = 35.0 bits (81), Expect = 0.066
Identities = 18/72 (25%), Positives = 40/72 (55%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
++ EE +E + EK +EEE N +++ K+ ++K+++ ++ KK+ + ++
Sbjct: 292 AEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELE 351
Query: 70 EIEESLKERKEK 81
+ E L+ER E
Sbjct: 352 KRLEELEERHEL 363
Score = 33.9 bits (78), Expect = 0.14
Identities = 20/73 (27%), Positives = 43/73 (58%), Gaps = 4/73 (5%)
Query: 11 KKKRIEKRNKEEE----KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
++ R E E + E E+ EK +E++++ K K++ ++++K KK+ +K +K +
Sbjct: 662 EELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALE 721
Query: 67 SVKEIEESLKERK 79
V+E+ E +K+ K
Sbjct: 722 RVEELREKVKKYK 734
Score = 33.5 bits (77), Expect = 0.17
Identities = 17/64 (26%), Positives = 39/64 (60%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
++ E E++E ++ E+E + EE + +K+ ++ +K+ ++ +KK EE+ + EE L+
Sbjct: 611 EKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLE 670
Query: 77 ERKE 80
+E
Sbjct: 671 LSRE 674
Score = 33.5 bits (77), Expect = 0.19
Identities = 23/90 (25%), Positives = 47/90 (52%), Gaps = 5/90 (5%)
Query: 1 MRKRRGGAGEKKKRIEKRNKE--EEKEENEQEEKEKEDVEEE-KNKKKKKKKKKKKKKKK 57
+R+ ++ K +E+ +E E ++E E E K +EE+ + +++ ++ KK+ ++
Sbjct: 219 LREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEEL 278
Query: 58 KKKKK--KEEQSVKEIEESLKERKEKTLTE 85
++K K KE + E L E E+ L E
Sbjct: 279 EEKVKELKELKEKAEEYIKLSEFYEEYLDE 308
Score = 33.1 bits (76), Expect = 0.26
Identities = 17/72 (23%), Positives = 40/72 (55%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ + K +E E E+ E+ +++E N+ + K +K+ ++++K+ KK E+ +
Sbjct: 570 EELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELD 629
Query: 70 EIEESLKERKEK 81
+ E L E +++
Sbjct: 630 KAFEELAETEKR 641
Score = 33.1 bits (76), Expect = 0.28
Identities = 21/77 (27%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKE----ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
+ + G + + +EKR +E +K + E EE+EK E EK +K ++ ++ ++K
Sbjct: 671 LSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKV 730
Query: 57 KKKKKKKEEQSVKEIEE 73
KK K +E+++ ++ E
Sbjct: 731 KKYKALLKERALSKVGE 747
Score = 32.3 bits (74), Expect = 0.39
Identities = 13/65 (20%), Positives = 35/65 (53%), Gaps = 1/65 (1%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEE-KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
+ + E E+ K E++EE K K+K+ ++ ++ + + + + ++++E+ +
Sbjct: 171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEV 230
Query: 76 KERKE 80
KE +E
Sbjct: 231 KELEE 235
Score = 32.3 bits (74), Expect = 0.41
Identities = 14/74 (18%), Positives = 42/74 (56%), Gaps = 1/74 (1%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
GE K I++R E ++ ++ E +E ++E++ + ++ ++ + + + ++E +
Sbjct: 167 LGEVIKEIKRR-IERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEK 225
Query: 68 VKEIEESLKERKEK 81
+++ + L+E KE+
Sbjct: 226 LEKEVKELEELKEE 239
Score = 31.2 bits (71), Expect = 0.89
Identities = 22/76 (28%), Positives = 44/76 (57%), Gaps = 3/76 (3%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV---KEIE 72
K EE E ++ EKE +++EE++ K +K+ ++ +K KK+ + K ++ KE+E
Sbjct: 450 RKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELE 509
Query: 73 ESLKERKEKTLTEDSQ 88
E LK+ + L + ++
Sbjct: 510 EKLKKYNLEELEKKAE 525
Score = 31.2 bits (71), Expect = 1.0
Identities = 21/65 (32%), Positives = 39/65 (60%), Gaps = 3/65 (4%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV---KEIEESLKE 77
E KE E+ E + +E+E + ++K++K +K+ ++ +K K+E + KE+ E LKE
Sbjct: 448 EHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKE 507
Query: 78 RKEKT 82
+EK
Sbjct: 508 LEEKL 512
Score = 30.8 bits (70), Expect = 1.3
Identities = 18/70 (25%), Positives = 37/70 (52%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+K K +EKR +E E+ EE + + E E+ KK+ +K +K+ ++ +K ++ ++
Sbjct: 345 KKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIE 404
Query: 70 EIEESLKERK 79
E + R
Sbjct: 405 EEISKITARI 414
Score = 30.8 bits (70), Expect = 1.5
Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A KK+ +E+ K E+ EKE E++E+ K + +++ K + + KK+ KE +
Sbjct: 367 AKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELK- 425
Query: 68 VKEIEESLKERKEK 81
K IEE LK+ K K
Sbjct: 426 -KAIEE-LKKAKGK 437
Score = 30.0 bits (68), Expect = 2.0
Identities = 27/93 (29%), Positives = 47/93 (50%), Gaps = 13/93 (13%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEE--KEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
K EK++++ K +E EK ++ E K KE E+ K ++K KK ++ +KK
Sbjct: 465 EKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKA 524
Query: 60 KK-----------KKEEQSVKEIEESLKERKEK 81
++ K E +S+K+ E L+E K+K
Sbjct: 525 EEYEKLKEKLIKLKGEIKSLKKELEKLEELKKK 557
Score = 28.9 bits (65), Expect = 5.0
Identities = 13/64 (20%), Positives = 38/64 (59%), Gaps = 2/64 (3%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE--QSVKEIEE 73
E ++ ++E E+ E+ +K+ ++E++ ++ +++ + K+ ++ + EE + +KE+E
Sbjct: 540 EIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEP 599
Query: 74 SLKE 77
E
Sbjct: 600 FYNE 603
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 43.5 bits (103), Expect = 4e-05
Identities = 24/78 (30%), Positives = 54/78 (69%), Gaps = 4/78 (5%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK----KKK 62
GA ++ K EK+ + +++E E+E +E++ +EE++ ++K++++ +++++KKK +K+
Sbjct: 4 GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63
Query: 63 KEEQSVKEIEESLKERKE 80
+EEQ+ KE EE K +
Sbjct: 64 REEQARKEQEEYEKLKSS 81
Score = 41.6 bits (98), Expect = 2e-04
Identities = 20/62 (32%), Positives = 45/62 (72%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
K+ K E +Q +++ + EEE+ +++KK ++K++ ++K++++ +EE+ K+ EE KER+
Sbjct: 6 KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKERE 65
Query: 80 EK 81
E+
Sbjct: 66 EQ 67
Score = 39.3 bits (92), Expect = 0.001
Identities = 23/77 (29%), Positives = 46/77 (59%), Gaps = 2/77 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+KK EKR E ++EE +EE+EK+ EEE+ +++++ +K++++ +K K E+
Sbjct: 30 ERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEE-- 87
Query: 70 EIEESLKERKEKTLTED 86
++ + + L ED
Sbjct: 88 GTDKLSADEESNELLED 104
Score = 37.0 bits (86), Expect = 0.007
Identities = 16/78 (20%), Positives = 51/78 (65%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+++R + + EE+++ E++ + + EEE ++++KKK+++++K+++++ +KE++ ++
Sbjct: 18 RQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEK 77
Query: 71 IEESLKERKEKTLTEDSQ 88
++ S +E T +
Sbjct: 78 LKSSFVVEEEGTDKLSAD 95
Score = 30.4 bits (69), Expect = 1.1
Identities = 19/76 (25%), Positives = 42/76 (55%), Gaps = 2/76 (2%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+++ + +R E+ EEE+E+ ++EE+ KE EE+ +K++++ +K K +++
Sbjct: 31 RKKLEEKREGERKEEEELEEEREKKKEEEERKE--REEQARKEQEEYEKLKSSFVVEEEG 88
Query: 63 KEEQSVKEIEESLKER 78
++ S E L E
Sbjct: 89 TDKLSADEESNELLED 104
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 45.3 bits (107), Expect = 4e-05
Identities = 27/72 (37%), Positives = 47/72 (65%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++ K NKEEEKE+ QEE+ K +EE+ K + K +++K ++K K+ E+ +K
Sbjct: 266 EILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELK 325
Query: 70 EIEESLKERKEK 81
++E+ LK+ KE+
Sbjct: 326 KLEKELKKEKEE 337
Score = 43.8 bits (103), Expect = 1e-04
Identities = 23/72 (31%), Positives = 44/72 (61%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E++K + + +E + E+EE + E ++ E+ K ++K K+ +K+ KK +K+ ++ +
Sbjct: 277 EEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKE 336
Query: 70 EIEESLKERKEK 81
EIEE KE KE
Sbjct: 337 EIEELEKELKEL 348
Score = 43.0 bits (101), Expect = 2e-04
Identities = 24/80 (30%), Positives = 50/80 (62%), Gaps = 1/80 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK-KKKKKKKKKKKKKEEQSV 68
EK+K++++ + +E E+ + E +E K ++K K+ +K+ KK +K+ KKE++ +
Sbjct: 279 EKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEI 338
Query: 69 KEIEESLKERKEKTLTEDSQ 88
+E+E+ LKE + K E+ +
Sbjct: 339 EELEKELKELEIKREAEEEE 358
Score = 41.9 bits (98), Expect = 5e-04
Identities = 23/92 (25%), Positives = 45/92 (48%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
++ +++K+ E + EE +E E ++ + + +EE K+ K K K + KK +
Sbjct: 415 SEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDL 474
Query: 62 KKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
KE + VK +E+ + L E SQ ++
Sbjct: 475 LKETKLVKLLEQLELLLLRQKLEEASQKESKA 506
Score = 41.9 bits (98), Expect = 5e-04
Identities = 19/74 (25%), Positives = 42/74 (56%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
E+ + +EN++EEKEK+ EEE K++++ K + K +++K +++ + E
Sbjct: 263 KEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEK 322
Query: 75 LKERKEKTLTEDSQ 88
++ EK L ++ +
Sbjct: 323 ELKKLEKELKKEKE 336
Score = 41.9 bits (98), Expect = 5e-04
Identities = 20/71 (28%), Positives = 41/71 (57%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
++++ IE +E EKEE + KE+ EEEK KK ++++ K K++++ K + + +
Sbjct: 249 DEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLER 308
Query: 70 EIEESLKERKE 80
+ ++ KE
Sbjct: 309 RKVDDEEKLKE 319
Score = 41.5 bits (97), Expect = 6e-04
Identities = 21/76 (27%), Positives = 38/76 (50%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K E+ + +E +EQEE E E EK ++ + K+ K+++K+KK +EE+
Sbjct: 232 YLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKL 291
Query: 70 EIEESLKERKEKTLTE 85
+E + + E E
Sbjct: 292 LAKEEEELKSELLKLE 307
Score = 41.5 bits (97), Expect = 7e-04
Identities = 17/83 (20%), Positives = 38/83 (45%), Gaps = 1/83 (1%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
G+ EK+K+ ++R K+ +E E + E + + K K++ KK + + K+K E
Sbjct: 164 AGSREKRKK-KERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLEL 222
Query: 66 QSVKEIEESLKERKEKTLTEDSQ 88
+ + + E+ + +
Sbjct: 223 EEENLLYLDYLKLNEERIDLLQE 245
Score = 41.1 bits (96), Expect = 0.001
Identities = 22/81 (27%), Positives = 45/81 (55%), Gaps = 2/81 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
E+K R++K +EEEK E +EKE + EE EK K +++K++K K ++++ + +EE
Sbjct: 753 EEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELK 812
Query: 68 VKEIEESLKERKEKTLTEDSQ 88
+ ++ + + +
Sbjct: 813 EEAELLEEEQLLIEQEEKIKE 833
Score = 39.6 bits (92), Expect = 0.002
Identities = 23/81 (28%), Positives = 48/81 (59%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E++++ + +EEE+E++E KEKE EEE+ +K K +++K++K K ++++ +
Sbjct: 751 EEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEE 810
Query: 70 EIEESLKERKEKTLTEDSQCD 90
EE+ +E+ L E +
Sbjct: 811 LKEEAELLEEEQLLIEQEEKI 831
Score = 39.6 bits (92), Expect = 0.003
Identities = 19/77 (24%), Positives = 45/77 (58%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K+ E +++ + E + +++EK E++ KK +K+ KK+K++ ++ +K+ +E +K
Sbjct: 294 KEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKRE 353
Query: 72 EESLKERKEKTLTEDSQ 88
E +E + + L E +
Sbjct: 354 AEEEEEEQLEKLQEKLE 370
Score = 39.6 bits (92), Expect = 0.003
Identities = 22/83 (26%), Positives = 42/83 (50%), Gaps = 4/83 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK----KKKKKKKKKKKKKKKKEE 65
EKK + E+ ++EE + E K + + +++K K + KK +K+ KK+K++ EE
Sbjct: 281 EKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEE 340
Query: 66 QSVKEIEESLKERKEKTLTEDSQ 88
+ E +K E+ E +
Sbjct: 341 LEKELKELEIKREAEEEEEEQLE 363
Score = 38.4 bits (89), Expect = 0.005
Identities = 19/79 (24%), Positives = 41/79 (51%), Gaps = 2/79 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EK+ E+ E+ K E E+EEK K EE + +++ K++ + ++++ ++EE+
Sbjct: 774 EKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEK--I 831
Query: 70 EIEESLKERKEKTLTEDSQ 88
+ EE + E + +
Sbjct: 832 KEEELEELALELKEEQKLE 850
Score = 38.0 bits (88), Expect = 0.007
Identities = 21/73 (28%), Positives = 44/73 (60%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
E++ ++ + +EE K E + E+ K D EE+ + +K+ KK +K+ KK+K++ +E +
Sbjct: 285 QEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKE 344
Query: 69 KEIEESLKERKEK 81
+ E +E +E+
Sbjct: 345 LKELEIKREAEEE 357
Score = 36.9 bits (85), Expect = 0.016
Identities = 19/80 (23%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK- 69
++R ++ ++ E E ++ EKE +E++ ++ +K+ K+ + K++ ++++EEQ K
Sbjct: 306 LERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKL 365
Query: 70 -EIEESLKERKEKTLTEDSQ 88
E E L+E +S+
Sbjct: 366 QEKLEQLEEELLAKKKLESE 385
Score = 35.3 bits (81), Expect = 0.048
Identities = 16/56 (28%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
E+ KR+ +E E E+ EE NK + KK++ +++KK+ ++ EE
Sbjct: 961 ERNKRLLLAKEELGNVNLMAIA-EFEEKEERYNKDELKKERLEEEKKELLREIIEE 1015
Score = 35.3 bits (81), Expect = 0.049
Identities = 19/77 (24%), Positives = 45/77 (58%), Gaps = 2/77 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ + +R K +++E+ ++ EKE + + EK KK+K++ ++ +K+ K+ + K E +
Sbjct: 300 KSELLKLERRKVDDEEKLKESEKELK--KLEKELKKEKEEIEELEKELKELEIKREAEEE 357
Query: 70 EIEESLKERKEKTLTED 86
E E+ K +++ E+
Sbjct: 358 EEEQLEKLQEKLEQLEE 374
Score = 34.6 bits (79), Expect = 0.096
Identities = 23/83 (27%), Positives = 43/83 (51%), Gaps = 7/83 (8%)
Query: 10 EKKKRIEKRNKEEEKEENE-------QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
EK+ + EK EE ++E + EE+E+E +E+ + K ++ +++ KKK + ++
Sbjct: 328 EKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERL 387
Query: 63 KEEQSVKEIEESLKERKEKTLTE 85
+KE E LK +EK
Sbjct: 388 SSAAKLKEEELELKNEEEKEAKL 410
Score = 34.2 bits (78), Expect = 0.12
Identities = 22/82 (26%), Positives = 40/82 (48%), Gaps = 1/82 (1%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK-KKKEEQ 66
E K+ KR EEE+EE ++ +EK + EE+ KKK + ++ K K ++ E +
Sbjct: 342 EKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELK 401
Query: 67 SVKEIEESLKERKEKTLTEDSQ 88
+ +E E L + + +
Sbjct: 402 NEEEKEAKLLLELSEQEEDLLK 423
Score = 34.2 bits (78), Expect = 0.14
Identities = 22/83 (26%), Positives = 43/83 (51%), Gaps = 4/83 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKED----VEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+ E++ KEEE EE E KE++ EEE + +++ K++ ++ K+++ E
Sbjct: 822 QLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELE 881
Query: 66 QSVKEIEESLKERKEKTLTEDSQ 88
+ + E KE KEK ++ +
Sbjct: 882 EQKLKDELESKEEKEKEEKKELE 904
Score = 33.8 bits (77), Expect = 0.14
Identities = 15/76 (19%), Positives = 34/76 (44%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + E EE + E + KE+ E + K+K + +++ K E+ +
Sbjct: 182 ETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERID 241
Query: 70 EIEESLKERKEKTLTE 85
++E L++ +E+ +
Sbjct: 242 LLQELLRDEQEEIESS 257
Score = 33.4 bits (76), Expect = 0.19
Identities = 19/79 (24%), Positives = 43/79 (54%), Gaps = 6/79 (7%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EK+ + ++ ++EKEE E+ EKE K + K++ +++++++ +K ++K EQ +
Sbjct: 321 EKELKKLEKELKKEKEEIEELEKEL------KELEIKREAEEEEEEQLEKLQEKLEQLEE 374
Query: 70 EIEESLKERKEKTLTEDSQ 88
E+ K E+ +
Sbjct: 375 ELLAKKKLESERLSSAAKL 393
Score = 33.4 bits (76), Expect = 0.23
Identities = 20/89 (22%), Positives = 43/89 (48%), Gaps = 13/89 (14%)
Query: 13 KRIEKRNKEEEKEENE-QEEKEKEDVEEE------------KNKKKKKKKKKKKKKKKKK 59
K E + KE+ K+ E + KEK ++EEE + ++ + ++++ +
Sbjct: 197 KLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIES 256
Query: 60 KKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
K++ E+ + + + LKE KE+ + Q
Sbjct: 257 SKQELEKEEEILAQVLKENKEEEKEKKLQ 285
Score = 31.5 bits (71), Expect = 0.78
Identities = 11/64 (17%), Positives = 29/64 (45%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ EE++E ++E +KE +EEEK + ++ ++ ++ K+ + +
Sbjct: 974 GNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELFVSINRG 1033
Query: 70 EIEE 73
+
Sbjct: 1034 LNKV 1037
Score = 31.1 bits (70), Expect = 1.1
Identities = 10/79 (12%), Positives = 36/79 (45%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + +E + +E + +E+ K+ +E + K+K + +++ K +E +
Sbjct: 184 ENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLL 243
Query: 70 EIEESLKERKEKTLTEDSQ 88
+ ++ + ++ ++ +
Sbjct: 244 QELLRDEQEEIESSKQELE 262
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 43.6 bits (103), Expect = 5e-05
Identities = 12/59 (20%), Positives = 35/59 (59%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
+++ + KEEEKE E+KE + E+++++ +++ +++ ++ + +K+ E+
Sbjct: 55 DDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEK 113
Score = 42.8 bits (101), Expect = 9e-05
Identities = 20/80 (25%), Positives = 43/80 (53%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
+++E K+ ++QE E E+V+EE+ + + K+ K +K+ ++ EE++
Sbjct: 37 PSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENE 96
Query: 69 KEIEESLKERKEKTLTEDSQ 88
+E EES E +++T +
Sbjct: 97 EEDEESSDENEKETEEKTES 116
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 44.2 bits (105), Expect = 5e-05
Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 1/55 (1%)
Query: 10 EKKKRIEKRNKEEEKE-ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E K + + ++K+ E E++ + E E+K K K++ KKKK K+ K K K
Sbjct: 219 EFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALK 273
Score = 41.9 bits (99), Expect = 3e-04
Identities = 18/49 (36%), Positives = 24/49 (48%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
+KR EE+ E E ++K +EE KKK K+ K K KK K
Sbjct: 232 KKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGM 280
Score = 38.5 bits (90), Expect = 0.004
Identities = 18/54 (33%), Positives = 35/54 (64%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K + KE +++ + +EDVE +++ +KK+K K++ KKKK K+ K +++K
Sbjct: 220 FKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALK 273
Score = 35.8 bits (83), Expect = 0.025
Identities = 19/59 (32%), Positives = 34/59 (57%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K + K +K E ++K++E E+ + + + +KK+K K++ KKKK KE + VK
Sbjct: 212 LKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVK 270
Score = 33.8 bits (78), Expect = 0.091
Identities = 14/55 (25%), Positives = 29/55 (52%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+ +E ++ ++E +E + E + +KK+K K++ KKKK K+ K +
Sbjct: 216 LIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVK 270
Score = 28.8 bits (65), Expect = 4.0
Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 6/54 (11%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
R+ + R EK+ K +E E K+K+ E + K KK K KK
Sbjct: 235 RETEEDVEAAESRAEKKRKSKE------EIKKKKPKESKGVKALKKVVAKGMKK 282
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 43.1 bits (102), Expect = 6e-05
Identities = 21/53 (39%), Positives = 33/53 (62%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
K +R++K+ + ++ E +++K+ + EEEK KKKKK KKKK KK K
Sbjct: 152 KVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204
Score = 31.5 bits (72), Expect = 0.50
Identities = 13/48 (27%), Positives = 26/48 (54%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
+K+ I++ +K+++ E+EK+ +++ KKKK KK K
Sbjct: 158 KKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 30.4 bits (69), Expect = 1.0
Identities = 15/40 (37%), Positives = 26/40 (65%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
++KE + E E + KKK +++KKKKKKK +++ +K
Sbjct: 157 KKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLK 196
Score = 30.0 bits (68), Expect = 1.6
Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 28 EQEEKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
E+ +K++ D++E KKK +++KKKKKKK K+++ K
Sbjct: 154 ERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAV 201
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 43.6 bits (103), Expect = 1e-04
Identities = 13/51 (25%), Positives = 35/51 (68%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
K++ + E + E + + ++E +++E+N+KK+K++ + K+K++K ++K
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433
Score = 42.9 bits (101), Expect = 2e-04
Identities = 13/49 (26%), Positives = 33/49 (67%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
+ E K+E + E+ + +EEK K+++ +KK+K++ + K+K+++++
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431
Score = 42.5 bits (100), Expect = 2e-04
Identities = 15/51 (29%), Positives = 32/51 (62%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+ + ++ + EE E +EEK K++ E+K K++ + K+K++K ++KK
Sbjct: 384 KLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 42.5 bits (100), Expect = 2e-04
Identities = 15/52 (28%), Positives = 30/52 (57%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
KK K + +E + ++EK EE + K+K++ + K+K++K ++KK
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 42.1 bits (99), Expect = 3e-04
Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 16 EKRNKEEEKEENEQEEKEKED--VEEEKNKKKKKKKKKKKKKKKKKKKKK 63
K+E E E K KE+ +EE KK+K++ + K+K++K ++KK
Sbjct: 385 LGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 39.8 bits (93), Expect = 0.002
Identities = 13/52 (25%), Positives = 31/52 (59%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+K ++ E+ E + +E++ + E EK +K++ + K+K++K ++KK
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 37.5 bits (87), Expect = 0.008
Identities = 16/62 (25%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQ-EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
E K K+ ++ K+ E +E + + E K K++K K+++ +KK+K++ + +E+
Sbjct: 369 ELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQ 428
Query: 69 KE 70
K+
Sbjct: 429 KD 430
Score = 36.3 bits (84), Expect = 0.019
Identities = 17/55 (30%), Positives = 31/55 (56%), Gaps = 2/55 (3%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
+K E E K +E E+ +QEE EK+ +E+ ++ K+K++K ++KK
Sbjct: 382 TKKLGEVKDETDASEEAEAKAKE-EKLKQEENEKKQ-KEQADEDKEKRQKDERKK 434
Score = 35.5 bits (82), Expect = 0.034
Identities = 14/63 (22%), Positives = 33/63 (52%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
+ ++ E +K ++ ++ K + ++ + K K++K K+E++ K+ +E E
Sbjct: 364 GDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADED 423
Query: 79 KEK 81
KEK
Sbjct: 424 KEK 426
Score = 32.1 bits (73), Expect = 0.53
Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 2/71 (2%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K K N + ++ ++ KE ++ KK + K + ++ + K KEE+ +
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKL--K 408
Query: 71 IEESLKERKEK 81
EE+ K++KE+
Sbjct: 409 QEENEKKQKEQ 419
Score = 31.3 bits (71), Expect = 0.80
Identities = 17/71 (23%), Positives = 34/71 (47%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
EE K + +++++ +E NKK + KK + K + +E ++ + E+ +E
Sbjct: 353 YEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEEN 412
Query: 80 EKTLTEDSQCD 90
EK E + D
Sbjct: 413 EKKQKEQADED 423
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 42.2 bits (100), Expect = 1e-04
Identities = 15/47 (31%), Positives = 30/47 (63%)
Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
+KKK KK KK+K+K++K+ +K + E++++ +E K + D +
Sbjct: 13 DKKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRE 59
Score = 39.1 bits (92), Expect = 0.001
Identities = 14/45 (31%), Positives = 30/45 (66%)
Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
V+++K KK KK+K+K++K+ +K +++ + EE+ E+ E+
Sbjct: 12 VDKKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAER 56
Score = 34.1 bits (79), Expect = 0.054
Identities = 14/65 (21%), Positives = 34/65 (52%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
KK+ +K KE+ K+ + + + +E K ++ K +K ++ ++ ++++ E K I
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKAI 73
Query: 72 EESLK 76
+K
Sbjct: 74 VAQIK 78
Score = 32.2 bits (74), Expect = 0.21
Identities = 14/70 (20%), Positives = 41/70 (58%), Gaps = 2/70 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+K+KR K+ K+ K ++ +++ K+ EE K +K ++ ++ ++++ + ++K +K
Sbjct: 21 KKEKR--KQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKAIVAQIK 78
Query: 70 EIEESLKERK 79
++ E + +
Sbjct: 79 QLIEQNRIDR 88
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 43.9 bits (104), Expect = 1e-04
Identities = 18/59 (30%), Positives = 34/59 (57%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
++ +E++ + EEKE +++ + + + +K K KKK+KKKKK K ++ SV
Sbjct: 1138 EEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVG 1196
Score = 38.5 bits (90), Expect = 0.005
Identities = 23/92 (25%), Positives = 43/92 (46%), Gaps = 11/92 (11%)
Query: 10 EKKKRIEK-RNKEEE----------KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
+K+K +EK +N + +E E++E+ +E ++ + K K K K K +K
Sbjct: 1113 KKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKP 1172
Query: 59 KKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
K KKKE++ K + K+ ++ D
Sbjct: 1173 KLKKKEKKKKKSSADKSKKASVVGNSKRVDSD 1204
Score = 36.2 bits (84), Expect = 0.028
Identities = 21/82 (25%), Positives = 38/82 (46%), Gaps = 21/82 (25%)
Query: 10 EKKKRIEKRNKEEEKEENE---------------------QEEKEKEDVEEEKNKKKKKK 48
EK +++ +++EKE + +E++E E+ E K ++ K K
Sbjct: 1102 EKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSK 1161
Query: 49 KKKKKKKKKKKKKKKEEQSVKE 70
K K K +K K KK+E+ K+
Sbjct: 1162 TKGKASKLRKPKLKKKEKKKKK 1183
Score = 34.3 bits (79), Expect = 0.13
Identities = 23/79 (29%), Positives = 37/79 (46%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
KR + + K + ++K K+ +E KKKKK +KK +KKK K + +Q+
Sbjct: 1289 SKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQ 1348
Query: 72 EESLKERKEKTLTEDSQCD 90
L R K ++ S D
Sbjct: 1349 SSRLLRRPRKKKSDSSSED 1367
Score = 32.7 bits (75), Expect = 0.37
Identities = 18/85 (21%), Positives = 30/85 (35%), Gaps = 1/85 (1%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+ G KR+ + + + KKK KK+ + KKKK
Sbjct: 1266 SKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKK 1325
Query: 63 KEEQSVKEIEESLKERKEKTLTEDS 87
K E+ ++S K R ++ S
Sbjct: 1326 KSEKKTARKKKS-KTRVKQASASQS 1349
Score = 30.0 bits (68), Expect = 2.4
Identities = 19/89 (21%), Positives = 37/89 (41%), Gaps = 7/89 (7%)
Query: 9 GEKKKRIEKRNK---EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK----KKKKKK 61
G K K+ E +++K+ E K K K + K+ + ++ +K
Sbjct: 1299 GSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRK 1358
Query: 62 KKEEQSVKEIEESLKERKEKTLTEDSQCD 90
KK + S ++ ++S + E ED + D
Sbjct: 1359 KKSDSSSEDDDDSEVDDSEDEDDEDDEDD 1387
Score = 29.2 bits (66), Expect = 4.5
Identities = 12/80 (15%), Positives = 32/80 (40%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
KK+ ++K+++E++ K+ + + + + ++ +KKK +
Sbjct: 1308 KKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSED 1367
Query: 68 VKEIEESLKERKEKTLTEDS 87
+ E E ++ ED
Sbjct: 1368 DDDSEVDDSEDEDDEDDEDD 1387
Score = 28.9 bits (65), Expect = 5.4
Identities = 17/78 (21%), Positives = 31/78 (39%), Gaps = 5/78 (6%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK---KKKKKKKKKKKKKEEQSVK 69
+ ++ +K E KKKK +KK KKK K + K+ + S
Sbjct: 1293 DGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSS-- 1350
Query: 70 EIEESLKERKEKTLTEDS 87
+ +++K + +ED
Sbjct: 1351 RLLRRPRKKKSDSSSEDD 1368
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 43.4 bits (103), Expect = 1e-04
Identities = 15/68 (22%), Positives = 30/68 (44%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
+ E E+ E E K +KK++K +KK +K++ +K ++ + + K
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449
Query: 79 KEKTLTED 86
+T D
Sbjct: 450 DGETKKVD 457
Score = 42.6 bits (101), Expect = 2e-04
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 3/50 (6%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
E+KK +K+ K E+K E E+ EK ++K + KK K + KK
Sbjct: 410 ERKKLRKKQRKAEKKAEKEEAEKAA---AKKKAEAAAKKAKGPDGETKKV 456
Score = 34.9 bits (81), Expect = 0.061
Identities = 11/44 (25%), Positives = 22/44 (50%)
Query: 45 KKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
++KK +KK++K +KK ++E++ K + E K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451
Score = 31.8 bits (73), Expect = 0.57
Identities = 15/56 (26%), Positives = 30/56 (53%), Gaps = 5/56 (8%)
Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
+++ + + EE +N ++KK +KK++K +KK E+ EE+ K +K
Sbjct: 388 HDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEK-----EEAEKAAAKKK 438
Score = 29.5 bits (67), Expect = 3.0
Identities = 10/59 (16%), Positives = 24/59 (40%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
E+ E + +K ++ + + K +K++ +K KKK + K+ +
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451
Score = 29.1 bits (66), Expect = 4.0
Identities = 11/44 (25%), Positives = 22/44 (50%), Gaps = 1/44 (2%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK 44
+RK++ A +K ++ E K K++ E K+ + + E K
Sbjct: 414 LRKKQRKAEKKAEKEEAE-KAAAKKKAEAAAKKAKGPDGETKKV 456
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 42.7 bits (101), Expect = 1e-04
Identities = 23/73 (31%), Positives = 35/73 (47%), Gaps = 11/73 (15%)
Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE-----------ESL 75
E+ E K +EEE+ KKKKKKKKKK KK E + E+
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225
Query: 76 KERKEKTLTEDSQ 88
K +K++++ D++
Sbjct: 226 KLKKKRSIAPDNE 238
Score = 38.5 bits (90), Expect = 0.003
Identities = 19/62 (30%), Positives = 25/62 (40%), Gaps = 1/62 (1%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
+EE E + E+ E K KKKKKKKK KK + S E S +
Sbjct: 166 EEEVELLKARLEE-ERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEA 224
Query: 84 TE 85
+
Sbjct: 225 KK 226
Score = 30.0 bits (68), Expect = 1.7
Identities = 14/60 (23%), Positives = 26/60 (43%), Gaps = 5/60 (8%)
Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
+E+VE K + ++++ KKKKKKKKK+ + S + + +
Sbjct: 166 EEEVEL-----LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQ 220
Score = 28.5 bits (64), Expect = 4.7
Identities = 14/66 (21%), Positives = 25/66 (37%), Gaps = 2/66 (3%)
Query: 1 MRKRRGGAGEKK--KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
+ + R +KK K+ +K N E ++ + + KK KKK+
Sbjct: 176 LEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAP 235
Query: 59 KKKKKE 64
+K E
Sbjct: 236 DNEKSE 241
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 40.7 bits (96), Expect = 1e-04
Identities = 20/78 (25%), Positives = 40/78 (51%), Gaps = 1/78 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+K +K + +E + EK++E+ + E K K ++K + + +K +K+ E+
Sbjct: 4 EEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEA 63
Query: 70 EIEES-LKERKEKTLTED 86
E+ LK +K L+E
Sbjct: 64 ELARRELKAEAKKMLSEK 81
Score = 38.4 bits (90), Expect = 0.001
Identities = 19/61 (31%), Positives = 38/61 (62%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
E E+E+ +++ + + +EK K +KK+++KK + +K K EE++ E+E+ KE +E
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60
Query: 81 K 81
Sbjct: 61 L 61
Score = 33.8 bits (78), Expect = 0.039
Identities = 19/60 (31%), Positives = 34/60 (56%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
E E+E+ + ++ K+K K +KK+++KK + +K + S +E E E+ EK L E
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60
Score = 33.0 bits (76), Expect = 0.068
Identities = 21/70 (30%), Positives = 36/70 (51%), Gaps = 5/70 (7%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK-----KKKKKEEQSVKEIEES 74
EEEK ++E + E+ K +KK+++KK + +K K K + + E+ KE+EE
Sbjct: 2 PEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEEL 61
Query: 75 LKERKEKTLT 84
E + L
Sbjct: 62 EAELARRELK 71
Score = 32.2 bits (74), Expect = 0.12
Identities = 16/50 (32%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 10 EKKKRIEKRNKEEEKE-ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
+ +++ K + EE+ E E E+ EKE E++E E +++ K + KK +K
Sbjct: 33 SEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEAKKMLSEKG 82
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 43.3 bits (102), Expect = 1e-04
Identities = 16/70 (22%), Positives = 39/70 (55%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K+ E+R K+E+++ E ++K+ + E K +K++ +++KK+ ++ K+ K+
Sbjct: 75 KRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQA 134
Query: 72 EESLKERKEK 81
EE+ +
Sbjct: 135 EEAAAKAAAA 144
Score = 41.3 bits (97), Expect = 5e-04
Identities = 12/60 (20%), Positives = 25/60 (41%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
+K+ + E K++ E K+K E + K + K +K K E+ + + +
Sbjct: 200 KKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259
Score = 40.9 bits (96), Expect = 8e-04
Identities = 17/82 (20%), Positives = 34/82 (41%), Gaps = 10/82 (12%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK---------- 59
E+ K++EK ++++ + EE K+ ++K ++ K K K +
Sbjct: 101 ERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAA 160
Query: 60 KKKKEEQSVKEIEESLKERKEK 81
KK E K E+ K+ +
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAE 182
Score = 40.6 bits (95), Expect = 8e-04
Identities = 19/76 (25%), Positives = 43/76 (56%), Gaps = 2/76 (2%)
Query: 8 AGEKKKRIEKRNKEE--EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
A E++K+ E++ EE +K+ EQE ++ + E +++KK+ ++ K+ K+K+ EE
Sbjct: 77 AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEE 136
Query: 66 QSVKEIEESLKERKEK 81
+ K + + + +
Sbjct: 137 AAAKAAAAAKAKAEAE 152
Score = 37.5 bits (87), Expect = 0.009
Identities = 22/83 (26%), Positives = 40/83 (48%), Gaps = 4/83 (4%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK----KKKKKKKK 58
K R A E+KK+ E+ K+ ++ + EE + K K + + K+ KK +
Sbjct: 108 KERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEA 167
Query: 59 KKKKKEEQSVKEIEESLKERKEK 81
KKK + E + K E+ K+ + +
Sbjct: 168 KKKAEAEAAKKAAAEAKKKAEAE 190
Score = 36.7 bits (85), Expect = 0.014
Identities = 21/85 (24%), Positives = 34/85 (40%), Gaps = 6/85 (7%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK------KKKKKKKKKKKK 56
+R A KK E + K E + + + K+ E E K KK + + KKK
Sbjct: 153 AKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAA 212
Query: 57 KKKKKKKEEQSVKEIEESLKERKEK 81
+ KKK ++ ++ E K
Sbjct: 213 AEAKKKAAAEAKAAAAKAAAEAKAA 237
Score = 36.7 bits (85), Expect = 0.016
Identities = 19/83 (22%), Positives = 30/83 (36%), Gaps = 6/83 (7%)
Query: 8 AGEKKKRIE----KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
A E KK+ E + E K++ E E K+K E +K K + K K + K
Sbjct: 180 AAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKK--KAAAEAKAAAAKAAAEAKAA 237
Query: 64 EEQSVKEIEESLKERKEKTLTED 86
E++ + D
Sbjct: 238 AEKAAAAKAAEKAAAAKAAAEVD 260
Score = 35.6 bits (82), Expect = 0.036
Identities = 15/72 (20%), Positives = 31/72 (43%), Gaps = 7/72 (9%)
Query: 16 EKRNKEEEK-----EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK--KKKEEQSV 68
+K+ E+E+ +E +++K+ EE + K+K+ ++ K K K E
Sbjct: 94 QKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEA 153
Query: 69 KEIEESLKERKE 80
K + K+
Sbjct: 154 KRAAAAAKKAAA 165
Score = 34.4 bits (79), Expect = 0.083
Identities = 17/79 (21%), Positives = 29/79 (36%), Gaps = 10/79 (12%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKED------VEEEKNKKKKKKKKKKKKKKKK----K 59
+KK+ E + K++ +E K E E + KK + KKK
Sbjct: 116 QKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEA 175
Query: 60 KKKKEEQSVKEIEESLKER 78
KK ++ K+ E +
Sbjct: 176 AKKAAAEAKKKAEAEAAAK 194
Score = 33.6 bits (77), Expect = 0.14
Identities = 15/76 (19%), Positives = 41/76 (53%), Gaps = 4/76 (5%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
KR E++ K++E ++ +E ++K+ E+E+ K+ +K++ +++KK+ +E ++
Sbjct: 75 KRAEEQRKKKE-QQQAEELQQKQAAEQER---LKQLEKERLAAQEQKKQAEEAAKQAALK 130
Query: 73 ESLKERKEKTLTEDSQ 88
+ E ++
Sbjct: 131 QKQAEEAAAKAAAAAK 146
Score = 33.2 bits (76), Expect = 0.22
Identities = 19/76 (25%), Positives = 45/76 (59%), Gaps = 3/76 (3%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
G E+ R +++ K ++ E EQ +K+++ EE +K+ ++++ K+ +K++ +E+
Sbjct: 58 GAVVEQYNRQQQQQKSAKRAE-EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQ 116
Query: 66 QSVKEIEESLKERKEK 81
+ K+ EE+ K+ K
Sbjct: 117 K--KQAEEAAKQAALK 130
Score = 31.3 bits (71), Expect = 0.80
Identities = 18/83 (21%), Positives = 33/83 (39%), Gaps = 4/83 (4%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK--- 59
+ A ++K+ E K + + E + K K + KKK + + KK
Sbjct: 123 AAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE 182
Query: 60 -KKKKEEQSVKEIEESLKERKEK 81
KKK E ++ + K++ E
Sbjct: 183 AKKKAEAEAAAKAAAEAKKKAEA 205
Score = 30.9 bits (70), Expect = 0.99
Identities = 17/85 (20%), Positives = 35/85 (41%), Gaps = 1/85 (1%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+ A + K E + ++ E K+K + E K + KKK + + K
Sbjct: 138 AAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAA 197
Query: 62 KKEEQSVKEIEESLK-ERKEKTLTE 85
+ ++++ E ++ E K+K E
Sbjct: 198 EAKKKAEAEAKKKAAAEAKKKAAAE 222
Score = 27.8 bits (62), Expect = 9.0
Identities = 15/65 (23%), Positives = 38/65 (58%), Gaps = 13/65 (20%)
Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKK------------KEEQSVKEIE-ESLKERKEKTL 83
V E+ N++++++K K+ ++++KKK+ E++ +K++E E L +++K
Sbjct: 60 VVEQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ 119
Query: 84 TEDSQ 88
E++
Sbjct: 120 AEEAA 124
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 43.1 bits (101), Expect = 2e-04
Identities = 22/56 (39%), Positives = 34/56 (60%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
K E ++ EK E+K+KK KKK+KK+K+K++ K KKKE + K + +L
Sbjct: 177 KHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLAL 232
Score = 40.0 bits (93), Expect = 0.002
Identities = 19/53 (35%), Positives = 32/53 (60%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
+K N + K E + +KK KK KKK+KK+K+K++ K+++ E +SL
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSL 228
Score = 39.3 bits (91), Expect = 0.002
Identities = 17/62 (27%), Positives = 32/62 (51%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
+ + E +E + E E+ + KKK++K+K++KKKKKK + S E+
Sbjct: 254 VSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQP 313
Query: 75 LK 76
++
Sbjct: 314 VQ 315
Score = 38.5 bits (89), Expect = 0.005
Identities = 27/123 (21%), Positives = 45/123 (36%), Gaps = 12/123 (9%)
Query: 18 RNKEEEKEENEQEEKEKEDV----EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
RN E K EK DV ++ K KKK+KK+K+K++ K KKK+ E +
Sbjct: 179 RNAETSKS------PEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLAL 232
Query: 74 SLKERKEKTLTEDSQCDTMIPVFKKG--VLYKGIYCPSLTNSFREPSLEMSYQRYSHRQR 131
++ E + V + + P + QR ++
Sbjct: 233 DDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEK 292
Query: 132 QKS 134
+K
Sbjct: 293 KKK 295
Score = 37.4 bits (86), Expect = 0.011
Identities = 20/71 (28%), Positives = 34/71 (47%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+ E K+ E E +K+ K KKKK++K+K++KKKKKK + ++ +
Sbjct: 257 TAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQN 316
Query: 82 TLTEDSQCDTM 92
E+ M
Sbjct: 317 GAVEEEPLPPM 327
Score = 36.6 bits (84), Expect = 0.019
Identities = 23/103 (22%), Positives = 46/103 (44%), Gaps = 34/103 (33%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEK----------------------------------E 35
+K K+ EK+ KE+E+++++++E E
Sbjct: 200 KKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAP 259
Query: 36 DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
D E ++ K + ++ KK K KKKK++KE++ K+ ++ R
Sbjct: 260 DSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302
Score = 34.7 bits (79), Expect = 0.082
Identities = 9/41 (21%), Positives = 28/41 (68%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
E+E + ++ +E++K +KKK++K+K+ +++ + ++ +
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDI 125
Score = 33.9 bits (77), Expect = 0.15
Identities = 10/41 (24%), Positives = 25/41 (60%)
Query: 35 EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
E+ + + +K K++KKK++K+K+ +++ E +E +
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDI 125
Score = 30.8 bits (69), Expect = 1.2
Identities = 17/71 (23%), Positives = 33/71 (46%), Gaps = 4/71 (5%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
G K+ E EE ++ K K+ +K +K+K++KKKKKK + + +
Sbjct: 256 GTAPDSEPDEPKDAEAEETKKSPKHKK----KKQRKEKEEKKKKKKHHHHRCHHSDGGAE 311
Query: 69 KEIEESLKERK 79
+ ++ E +
Sbjct: 312 QPVQNGAVEEE 322
Score = 30.4 bits (68), Expect = 1.7
Identities = 10/50 (20%), Positives = 29/50 (58%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
K +++++ +++ +K K++KKK+++E+ + SL ++ + D
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVD 132
Score = 30.0 bits (67), Expect = 1.9
Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 1/82 (1%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
G + E ++ E E+ + + K+K+ +E++ KKKKKK + ++ Q
Sbjct: 256 GTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQ 315
Query: 67 SVKEIEESL-KERKEKTLTEDS 87
+ EE L K L E+S
Sbjct: 316 NGAVEEEPLPPMSSYKLLAENS 337
Score = 29.6 bits (66), Expect = 2.7
Identities = 9/27 (33%), Positives = 22/27 (81%)
Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKKK 63
V+ E+ ++ +++ +K K++KKK++K+K
Sbjct: 82 VKLEEERRHRQRLEKDKREKKKREKEK 108
Score = 28.9 bits (64), Expect = 4.4
Identities = 15/66 (22%), Positives = 33/66 (50%), Gaps = 4/66 (6%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
EEE+ ++ EK+K EK K++K+K+ +++ + ++ + ++ +E E
Sbjct: 85 EEERRHRQRLEKDKR----EKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPE 140
Query: 81 KTLTED 86
L D
Sbjct: 141 NALPSD 146
Score = 28.5 bits (63), Expect = 6.2
Identities = 12/58 (20%), Positives = 31/58 (53%), Gaps = 4/58 (6%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE----QSVKEIEESLKE 77
K E E+ +++ + ++ + KK++K+K+ +++ + E+ Q V + E + E
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPE 140
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 41.2 bits (97), Expect = 2e-04
Identities = 20/86 (23%), Positives = 44/86 (51%), Gaps = 6/86 (6%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK---KKKKEEQSV 68
KK IE N + +EE E +++ E++K +K K+K ++++ K + + E +V
Sbjct: 33 KKDIEI-NTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTV 91
Query: 69 KEIEESL--KERKEKTLTEDSQCDTM 92
K ++ L E ++ + + +S +
Sbjct: 92 KATKKQLFSSEYEQTSSSSESTSEEE 117
Score = 37.4 bits (87), Expect = 0.003
Identities = 18/78 (23%), Positives = 39/78 (50%), Gaps = 6/78 (7%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ-----SVKEI 71
K N+ E+K+ + +E+ E +K +++KK +K K+K ++EE S
Sbjct: 27 KPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENAT 86
Query: 72 E-ESLKERKEKTLTEDSQ 88
E ++K K++ + + +
Sbjct: 87 ENNTVKATKKQLFSSEYE 104
Score = 34.7 bits (80), Expect = 0.030
Identities = 15/63 (23%), Positives = 27/63 (42%), Gaps = 2/63 (3%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
K E+ + E EE + K+ +++KK +K K+K EQ +E++ L
Sbjct: 26 VKPNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQ--EELKNQLFSE 83
Query: 79 KEK 81
Sbjct: 84 NAT 86
Score = 30.0 bits (68), Expect = 0.89
Identities = 13/73 (17%), Positives = 28/73 (38%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+K+ ++ +E ++E+ K + E + K KK+ + ++ S
Sbjct: 54 FTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKATKKQLFSSEYEQTSSSSEST 113
Query: 71 IEESLKERKEKTL 83
EE K+ L
Sbjct: 114 SEEETKKTSSILL 126
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 42.9 bits (101), Expect = 2e-04
Identities = 19/93 (20%), Positives = 44/93 (47%), Gaps = 5/93 (5%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEE----KNKKKKKKKKKKKKKKKKKKKK 62
K+K+ +K E +E +EE K DV E + + + + + K ++
Sbjct: 223 KKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSD 282
Query: 63 KEEQSVKEIEESLKERKEKTLTEDSQCDTMIPV 95
EE++ ++ +E ++R +K + ++ + + M V
Sbjct: 283 SEEETEEKEKEK-RKRLKKMMEDEDEDEEMEIV 314
Score = 40.2 bits (94), Expect = 0.001
Identities = 25/85 (29%), Positives = 43/85 (50%), Gaps = 7/85 (8%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK-------KKKKKKKKKKKK 62
E K + K+ ++ KE ++ + K V+ K+ KK KKK K+KK+KK+
Sbjct: 175 EVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKE 234
Query: 63 KEEQSVKEIEESLKERKEKTLTEDS 87
E +VKE E +++ L ++S
Sbjct: 235 ASESTVKEESEEESGKRDVILEDES 259
Score = 37.1 bits (86), Expect = 0.013
Identities = 21/73 (28%), Positives = 31/73 (42%), Gaps = 10/73 (13%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEE-----KEENEQEE--KEKEDVEEEKNKKKKKKKKKKKK 54
+RRG KK+ K +EE K+ E E +++ + K K K
Sbjct: 349 GRRRGRRRVMKKKTFK---DEEGYLVTKKVYEWESFSEDEAEPPPTKPKPKVSTPAVPAA 405
Query: 55 KKKKKKKKKEEQS 67
KK K KK++QS
Sbjct: 406 AKKPKAPKKKKQS 418
Score = 35.6 bits (82), Expect = 0.034
Identities = 17/78 (21%), Positives = 41/78 (52%), Gaps = 3/78 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++ KR+ E E E ++++ E+E ++ +++ ++K+K+K+K +K
Sbjct: 243 ESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKR---LK 299
Query: 70 EIEESLKERKEKTLTEDS 87
++ E E +E + +S
Sbjct: 300 KMMEDEDEDEEMEIVPES 317
Score = 34.4 bits (79), Expect = 0.084
Identities = 19/61 (31%), Positives = 35/61 (57%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
E+E E+ + E+ D EEE +K+K+K+K+ KK + + + +E + V E +E +
Sbjct: 266 DEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESE 325
Query: 80 E 80
E
Sbjct: 326 E 326
Score = 32.5 bits (74), Expect = 0.33
Identities = 12/46 (26%), Positives = 24/46 (52%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
EE +E +KE+ +EE +++ +++ KKK K+E+
Sbjct: 322 EESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRRRVMKKKTFKDEE 367
Score = 31.3 bits (71), Expect = 0.78
Identities = 16/80 (20%), Positives = 31/80 (38%), Gaps = 21/80 (26%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEK---------------------EKEDVEEEK 41
EK+K KR K+ ++E+E EE K++ E+E+
Sbjct: 282 DSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEE 341
Query: 42 NKKKKKKKKKKKKKKKKKKK 61
+++ +++ KKK
Sbjct: 342 VTVSPDGGRRRGRRRVMKKK 361
Score = 31.0 bits (70), Expect = 1.0
Identities = 15/48 (31%), Positives = 25/48 (52%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
KK K+KK+KK+ + K++ EE+S K E E T ++ +
Sbjct: 222 FKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDE 269
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 40.7 bits (96), Expect = 2e-04
Identities = 19/87 (21%), Positives = 50/87 (57%), Gaps = 10/87 (11%)
Query: 9 GEKKKRIEKRNKEEEKE------ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+++ KR ++ + E + + E++E + + E +++ +++KK++++K+K++ +
Sbjct: 32 DDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKEKEEIE 91
Query: 63 KEEQSVKEIEES---LKERKEKTLTED 86
K + +I++ LK R T+TED
Sbjct: 92 KYREENPKIQQQFADLK-RNLATVTED 117
>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain.
Length = 225
Score = 41.7 bits (98), Expect = 2e-04
Identities = 19/70 (27%), Positives = 38/70 (54%), Gaps = 11/70 (15%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVE---EEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
+ RIE+R++ EK++ E EK +++++ E N +KK+K K + +K+ EQ
Sbjct: 120 RRDLRIEERDEASEKKKEELIEKAQKEIDDFYENYN--------EKKEKTKAQNRKEAEQ 171
Query: 67 SVKEIEESLK 76
+ E ++
Sbjct: 172 FLAERDDFSS 181
Score = 31.6 bits (72), Expect = 0.52
Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 14/77 (18%)
Query: 20 KEEEKEENE--QEEKEKEDVEEEKNKKKKKKKKKKKKKKKK------------KKKKKEE 65
E EE E ++ +E+ D+ E+ + +KKK++ +K + KK+K +
Sbjct: 104 DRVEGEEPESIRKWRERRDLRIEERDEASEKKKEELIEKAQKEIDDFYENYNEKKEKTKA 163
Query: 66 QSVKEIEESLKERKEKT 82
Q+ KE E+ L ER + +
Sbjct: 164 QNRKEAEQFLAERDDFS 180
Score = 29.3 bits (66), Expect = 2.4
Identities = 16/62 (25%), Positives = 37/62 (59%), Gaps = 4/62 (6%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE---ESLKERKEK 81
+ + E +E E + + + ++ + +++ + +KKK++ E+ + KEI+ E+ E+KEK
Sbjct: 102 QADRVEGEEPESIRKWRERRDLRIEERDEASEKKKEELIEK-AQKEIDDFYENYNEKKEK 160
Query: 82 TL 83
T
Sbjct: 161 TK 162
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 42.4 bits (100), Expect = 3e-04
Identities = 18/86 (20%), Positives = 35/86 (40%), Gaps = 1/86 (1%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R R G+ + K+ +K ++ +K + E + + +KK K K KK +K
Sbjct: 729 RAPRN-VGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKA 787
Query: 62 KKEEQSVKEIEESLKERKEKTLTEDS 87
KK ++I + K ++
Sbjct: 788 KKPSAKTQKIAAATKAKRAAKKKVAE 813
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 42.2 bits (99), Expect = 3e-04
Identities = 19/72 (26%), Positives = 46/72 (63%), Gaps = 1/72 (1%)
Query: 10 EKKKRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
++KK+ E+ +E + K+ EQE ++ + E K ++++K+ ++ +K+ + ++K++EEQ+
Sbjct: 80 QRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQAR 139
Query: 69 KEIEESLKERKE 80
K E K+ +
Sbjct: 140 KAAAEQKKKAEA 151
Score = 40.3 bits (94), Expect = 0.001
Identities = 25/85 (29%), Positives = 45/85 (52%), Gaps = 4/85 (4%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK- 59
+ K R A E++K+ E+ K+ + E+ +QEE+ ++ E+K K + K K + K K
Sbjct: 106 LEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKA 165
Query: 60 ---KKKKEEQSVKEIEESLKERKEK 81
KKK E++ K EE+ + +
Sbjct: 166 AAEAKKKAEEAAKAAEEAKAKAEAA 190
Score = 39.5 bits (92), Expect = 0.002
Identities = 19/84 (22%), Positives = 47/84 (55%), Gaps = 3/84 (3%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK---KKKKKKKK 57
++ ++ E+ K++EK + ++++ + EE EK+ E+K ++++ +K ++KKK +
Sbjct: 92 LKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEA 151
Query: 58 KKKKKKEEQSVKEIEESLKERKEK 81
K K E + + K++ E+
Sbjct: 152 AKAKAAAEAAKLKAAAEAKKKAEE 175
Score = 39.2 bits (91), Expect = 0.003
Identities = 14/78 (17%), Positives = 33/78 (42%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
+ K K K++ + E + ++ + E K K +KK + ++K +KKK
Sbjct: 177 AKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKA 236
Query: 64 EEQSVKEIEESLKERKEK 81
++ + + + E+
Sbjct: 237 AAKAKADKAAAAAKAAER 254
Score = 38.0 bits (88), Expect = 0.007
Identities = 19/80 (23%), Positives = 50/80 (62%), Gaps = 4/80 (5%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK----KKKKKKKKKKKKKKKKK 61
G ++ RI+ + +K E ++++KE++ EE K K+ ++ K+ +K++ K ++++
Sbjct: 58 GAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQ 117
Query: 62 KKEEQSVKEIEESLKERKEK 81
K+ E++ K+ + K+++E+
Sbjct: 118 KQAEEAEKQAQLEQKQQEEQ 137
Score = 35.3 bits (81), Expect = 0.039
Identities = 20/62 (32%), Positives = 28/62 (45%), Gaps = 1/62 (1%)
Query: 3 KRRGGAGEKKKRIEKRNK-EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
K + A KK+ E K EK + E E K K + + E ++K +KKK K K
Sbjct: 184 KAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKAD 243
Query: 62 KK 63
K
Sbjct: 244 KA 245
Score = 34.9 bits (80), Expect = 0.049
Identities = 21/97 (21%), Positives = 39/97 (40%), Gaps = 14/97 (14%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK-------- 59
A ++K+ E++ ++ E+ ++ E K E K K + KKK ++ K
Sbjct: 127 AQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAK 186
Query: 60 ------KKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
KKK E ++ E++ E + K E
Sbjct: 187 AEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEA 223
Score = 34.2 bits (78), Expect = 0.090
Identities = 12/74 (16%), Positives = 30/74 (40%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E +K E + + + + + + EK + ++K +KKK K K +++
Sbjct: 188 EAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAA 247
Query: 70 EIEESLKERKEKTL 83
+ + ++ L
Sbjct: 248 AAKAAERKAAAAAL 261
Score = 33.8 bits (77), Expect = 0.12
Identities = 20/74 (27%), Positives = 29/74 (39%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K + A KKK E EE + + K+ E E +K K + + K K +KK
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKA 221
Query: 63 KEEQSVKEIEESLK 76
+ K E K
Sbjct: 222 EAAAEEKAAAEKKK 235
Score = 33.0 bits (75), Expect = 0.26
Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 4/79 (5%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+ A E KK+ E+ K E+ + + E + E + K +K K + + K K +KK
Sbjct: 161 AKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKK 220
Query: 63 KEEQSVKEIEESLKERKEK 81
E EE K+K
Sbjct: 221 AE----AAAEEKAAAEKKK 235
Score = 31.5 bits (71), Expect = 0.63
Identities = 15/54 (27%), Positives = 23/54 (42%), Gaps = 2/54 (3%)
Query: 8 AGEKKKRIEKRNKE--EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
A +K + E K E+K E EEK + ++ K K K K ++K
Sbjct: 202 AAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERK 255
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 42.3 bits (99), Expect = 3e-04
Identities = 19/95 (20%), Positives = 41/95 (43%), Gaps = 13/95 (13%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK---------- 52
R KK +E E EE + ++ ++++KK ++ ++K
Sbjct: 74 TRSVKRATKKTVVEI---SEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEE 130
Query: 53 KKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
+K +KK +K+++ + + E E E + E+S
Sbjct: 131 EKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEES 165
Score = 37.7 bits (87), Expect = 0.008
Identities = 19/94 (20%), Positives = 46/94 (48%), Gaps = 14/94 (14%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKE------KEDVEEEKNKKKKKKKKKKK--------K 54
+K R KR ++ E + +E ED +K KK ++ ++K +
Sbjct: 70 KKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVE 129
Query: 55 KKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
++K +KK ++ + VK+++E ++++ ++ D +
Sbjct: 130 EEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVE 163
Score = 36.9 bits (85), Expect = 0.016
Identities = 23/80 (28%), Positives = 32/80 (40%), Gaps = 11/80 (13%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
KK R K DVEEEK +KK +K++K KK + + + E
Sbjct: 106 LDKESKKTPRRTRRK---------AAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSE 156
Query: 66 QSVKEIEESLKERKEKTLTE 85
V ++EES E E
Sbjct: 157 SEVSDVEES--EFVTSLENE 174
Score = 33.9 bits (77), Expect = 0.13
Identities = 22/81 (27%), Positives = 37/81 (45%), Gaps = 4/81 (4%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+ +R +NK + ++ + V+EE ++ KKK + K+ KK V E
Sbjct: 32 RGERWRLQNKFLLAAGSRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKT----VVE 87
Query: 71 IEESLKERKEKTLTEDSQCDT 91
I E L+E E + ED+ D
Sbjct: 88 ISEPLEEGSELVVNEDAALDK 108
Score = 31.9 bits (72), Expect = 0.48
Identities = 15/74 (20%), Positives = 30/74 (40%), Gaps = 8/74 (10%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
AG +KK +EE +EN +K + K+ KK + + +E
Sbjct: 46 AGSRKKIESALAVDEEPDEN--------GAVSKKKPTRSVKRATKKTVVEISEPLEEGSE 97
Query: 68 VKEIEESLKERKEK 81
+ E++ +++ K
Sbjct: 98 LVVNEDAALDKESK 111
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 42.1 bits (100), Expect = 3e-04
Identities = 21/79 (26%), Positives = 47/79 (59%), Gaps = 2/79 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++ +E++ +E E E E K KE++EE+K K ++++ K ++ +K+ ++ +E + K
Sbjct: 527 ELERELEQKAEEAEALLKEAE-KLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE-AKK 584
Query: 70 EIEESLKERKEKTLTEDSQ 88
E +E +KE ++ +
Sbjct: 585 EADEIIKELRQLQKGGYAS 603
Score = 37.5 bits (88), Expect = 0.010
Identities = 21/89 (23%), Positives = 43/89 (48%), Gaps = 27/89 (30%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEK---------------------------NKK 44
+++ EK +EE+K E E++ ++ ++E K +
Sbjct: 554 EEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEAR 613
Query: 45 KKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
K+ K +KK+KKKKK+K++++ +K +E
Sbjct: 614 KRLNKANEKKEKKKKKQKEKQEELKVGDE 642
Score = 36.3 bits (85), Expect = 0.027
Identities = 24/74 (32%), Positives = 48/74 (64%), Gaps = 3/74 (4%)
Query: 9 GEKKKRIEK--RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
GE K+++ + + EE + E EQ+ +E E + +E K K++ ++KK+K ++++ K EE
Sbjct: 512 GEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEA 571
Query: 67 SVKEIEESLKERKE 80
KE ++++KE K+
Sbjct: 572 E-KEAQQAIKEAKK 584
Score = 35.2 bits (82), Expect = 0.060
Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 14 RIEKRNKEEEKEEN-EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
I K ++ +K + E + + NK +KK+KKKKK+K+K+++ K VK
Sbjct: 588 EIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVK 644
Score = 33.3 bits (77), Expect = 0.25
Identities = 20/68 (29%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++ I++ KE ++ E + +K K + + +K+ K +KK+KKK++Q K
Sbjct: 574 EAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQ--K 631
Query: 70 EIEESLKE 77
E +E LK
Sbjct: 632 EKQEELKV 639
Score = 32.9 bits (76), Expect = 0.29
Identities = 23/97 (23%), Positives = 42/97 (43%), Gaps = 31/97 (31%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEK-------------NKKKKK--------- 47
E +KR+ K N+++EK++ +Q+EK++E ++ + K
Sbjct: 611 EARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAIVQAGIM 670
Query: 48 ---------KKKKKKKKKKKKKKKKEEQSVKEIEESL 75
+K +K KKKKKKK K + + + L
Sbjct: 671 KMKVPLSDLEKIQKPKKKKKKKPKTVKPKPRTVSLEL 707
Score = 32.1 bits (74), Expect = 0.46
Identities = 20/70 (28%), Positives = 39/70 (55%), Gaps = 9/70 (12%)
Query: 17 KRNKEEEKEENEQ-----EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K+ E+KE+ + EE E+E ++ + + K+ +K K++ ++KK+K ++E
Sbjct: 508 KKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEK----LQEE 563
Query: 72 EESLKERKEK 81
E+ L E EK
Sbjct: 564 EDKLLEEAEK 573
Score = 31.3 bits (72), Expect = 0.89
Identities = 20/69 (28%), Positives = 40/69 (57%), Gaps = 8/69 (11%)
Query: 22 EEKEENEQEEKEK-----EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
EE ++ E+KEK +EE + + ++K ++ + K+ +K K+E ++E +E L+
Sbjct: 505 EEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEE---LEEKKEKLQ 561
Query: 77 ERKEKTLTE 85
E ++K L E
Sbjct: 562 EEEDKLLEE 570
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 42.1 bits (100), Expect = 4e-04
Identities = 13/71 (18%), Positives = 38/71 (53%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+ + KK +K+ + K++ + + + + K +++++ +++++KKKKK+
Sbjct: 107 AEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQ 166
Query: 62 KKEEQSVKEIE 72
K E+ +E+
Sbjct: 167 KPTEKIPREVV 177
Score = 41.0 bits (97), Expect = 9e-04
Identities = 14/74 (18%), Positives = 39/74 (52%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R +K + +K K ++K+ + + + + K +++++ +++++KKKKK+K
Sbjct: 109 AAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKP 168
Query: 62 KKEEQSVKEIEESL 75
++ I E++
Sbjct: 169 TEKIPREVVIPETI 182
Score = 37.9 bits (89), Expect = 0.007
Identities = 13/72 (18%), Positives = 24/72 (33%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E +K KR EEE + +E E EE K + ++ + + +
Sbjct: 42 EARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAED 101
Query: 70 EIEESLKERKEK 81
E + +
Sbjct: 102 EAARPAEAAARR 113
Score = 36.8 bits (86), Expect = 0.019
Identities = 12/74 (16%), Positives = 28/74 (37%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
G + K +EEE + E + + +E+ + E + + +++ K + EE +
Sbjct: 27 LGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAA 86
Query: 68 VKEIEESLKERKEK 81
R +
Sbjct: 87 EAAAAAEAAARPAE 100
Score = 36.8 bits (86), Expect = 0.019
Identities = 16/77 (20%), Positives = 35/77 (45%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+ + G K K+ + + K + + K + + ++KKKKK+K +K ++
Sbjct: 118 KAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREVV 177
Query: 62 KKEEQSVKEIEESLKER 78
E +V E+ E + +
Sbjct: 178 IPETITVAELAEKMAVK 194
Score = 33.7 bits (78), Expect = 0.15
Identities = 11/83 (13%), Positives = 44/83 (53%), Gaps = 2/83 (2%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R A ++ + +K K+++ + ++++ +++ K+ K K +++++ +++++
Sbjct: 103 AARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRK--AARGGKRGKGGKGRRRRRGRRRRR 160
Query: 62 KKEEQSVKEIEESLKERKEKTLT 84
KK+++ + + +T+T
Sbjct: 161 KKKKKQKPTEKIPREVVIPETIT 183
Score = 32.1 bits (74), Expect = 0.48
Identities = 14/81 (17%), Positives = 33/81 (40%), Gaps = 9/81 (11%)
Query: 11 KKKRIEKRNKE---------EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
K R+ + KE E+ +E E K EE+ +K++ K++ +++ K + +
Sbjct: 2 SKVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAE 61
Query: 62 KKEEQSVKEIEESLKERKEKT 82
+ +E ++
Sbjct: 62 EAAAAEAEEEAKAEAAAAAPA 82
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 41.9 bits (98), Expect = 4e-04
Identities = 13/50 (26%), Positives = 25/50 (50%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK 52
+ G +K+ +IE + E E E +++E++ ++ KKKK K
Sbjct: 801 PKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850
Score = 36.9 bits (85), Expect = 0.017
Identities = 9/50 (18%), Positives = 19/50 (38%)
Query: 48 KKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFK 97
K + K+K K+K +++ E E + T+ + + P
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAIL 841
Score = 35.3 bits (81), Expect = 0.054
Identities = 14/61 (22%), Positives = 27/61 (44%), Gaps = 7/61 (11%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK-------KKKKKKKKKKK 63
K K+ +E+ +++ + E VE E + K++++K KKKK K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKV 851
Query: 64 E 64
+
Sbjct: 852 D 852
Score = 31.8 bits (72), Expect = 0.70
Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 5/59 (8%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
++K + + N +E K E+ E E E+ K++++K KKK+ V
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEG-----ERCTIKQREEKGIDAPAILNVKKKKPYKV 851
Score = 31.5 bits (71), Expect = 0.93
Identities = 18/88 (20%), Positives = 30/88 (34%), Gaps = 4/88 (4%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+++ E EE + + + K+K K+K K+ K E SV
Sbjct: 761 EEEEFEYPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSV-- 818
Query: 71 IEESLKERKEKTLTEDSQCDTMIPVFKK 98
E+ ER E+ D + K
Sbjct: 819 --ETEGERCTIKQREEKGIDAPAILNVK 844
Score = 30.7 bits (69), Expect = 1.4
Identities = 12/60 (20%), Positives = 27/60 (45%), Gaps = 4/60 (6%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
+Q+ KEK ++E K + + + ++ K+ E+ + L +K+K D+
Sbjct: 798 KQKPKEKGPNDKEI----KIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVDN 853
Score = 30.7 bits (69), Expect = 1.5
Identities = 10/60 (16%), Positives = 27/60 (45%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
K ++K KE +K K + + + ++ K+++E+ ++K++K +
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKV 851
Score = 28.4 bits (63), Expect = 8.6
Identities = 15/57 (26%), Positives = 19/57 (33%), Gaps = 1/57 (1%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQE-EKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
R K+K + + E E E E+ EEK KKKK K
Sbjct: 794 AARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850
>gnl|CDD|237200 PRK12778, PRK12778, putative bifunctional 2-polyprenylphenol
hydroxylase/glutamate synthase subunit beta;
Provisional.
Length = 752
Score = 41.7 bits (98), Expect = 5e-04
Identities = 18/71 (25%), Positives = 31/71 (43%), Gaps = 1/71 (1%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE-SLKER 78
K E EE + E+ E K K K++ ++ + E ++ EE +L
Sbjct: 261 KTIEGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELDPEYRAHNRFEEVNLGLT 320
Query: 79 KEKTLTEDSQC 89
KE+ +TE +C
Sbjct: 321 KEQAMTEAKRC 331
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 41.2 bits (97), Expect = 5e-04
Identities = 19/72 (26%), Positives = 40/72 (55%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ KR++K E + + + + + + + + K+KKK+KKK++KKKK+ E+ +
Sbjct: 291 YQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEE 350
Query: 70 EIEESLKERKEK 81
IE+ + +K
Sbjct: 351 RIEKLEVQATDK 362
Score = 39.3 bits (92), Expect = 0.002
Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EK K ++ + K +K E + + ++ +K + K+KKK+KKKEE+ K
Sbjct: 284 EKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK 343
Query: 70 EIEESLKERKEK 81
+IE L+ER EK
Sbjct: 344 QIER-LEERIEK 354
Score = 34.6 bits (80), Expect = 0.069
Identities = 19/58 (32%), Positives = 35/58 (60%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
K+K K ++ EK + E +EK+KE +EEK KK+ ++ +++ +K + + KEE
Sbjct: 309 LKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366
Score = 31.2 bits (71), Expect = 0.80
Identities = 15/62 (24%), Positives = 36/62 (58%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
KR K E + E+ + E KEK+ ++++ KKKK+ ++ +++ +K + + +++ K
Sbjct: 307 SDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366
Query: 70 EI 71
+
Sbjct: 367 TV 368
Score = 28.1 bits (63), Expect = 8.5
Identities = 14/69 (20%), Positives = 29/69 (42%), Gaps = 3/69 (4%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL---KERKE 80
K + EK +E ++ K + K+ KK + K+K + + E L + K+
Sbjct: 273 KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKK 332
Query: 81 KTLTEDSQC 89
K ++ +
Sbjct: 333 KEKKKEEKK 341
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 41.5 bits (98), Expect = 5e-04
Identities = 19/77 (24%), Positives = 32/77 (41%), Gaps = 4/77 (5%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E KK+ ++ + + + + E + KKK K K K KKK +
Sbjct: 43 ESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLK----D 98
Query: 70 EIEESLKERKEKTLTED 86
E++ S K K+ L +D
Sbjct: 99 ELDSSKKAEKKNALDKD 115
Score = 38.4 bits (90), Expect = 0.004
Identities = 20/89 (22%), Positives = 33/89 (37%), Gaps = 10/89 (11%)
Query: 10 EKKKRIEKRNKEEEKEENEQEE--------KEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
E K+ +E + K E+ + + + + K K K K K KKK
Sbjct: 37 EIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKL 96
Query: 62 KKEEQSVK--EIEESLKERKEKTLTEDSQ 88
K E S K E + +L + + +D
Sbjct: 97 KDELDSSKKAEKKNALDKDDDLNYVKDID 125
Score = 36.9 bits (86), Expect = 0.014
Identities = 16/83 (19%), Positives = 30/83 (36%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
K E E E ++K K+ + K+K K++ K+ + KKK ++ V +
Sbjct: 4 ASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMV 63
Query: 77 ERKEKTLTEDSQCDTMIPVFKKG 99
+ + D K
Sbjct: 64 KDTDDATESDIPKKKTKTAAKAA 86
Score = 30.0 bits (68), Expect = 2.0
Identities = 11/63 (17%), Positives = 27/63 (42%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
++ E++ +K+ + + K KKK K + KK +K+ K+ + + +
Sbjct: 65 DTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDI 124
Query: 80 EKT 82
+
Sbjct: 125 DVL 127
Score = 30.0 bits (68), Expect = 2.3
Identities = 10/79 (12%), Positives = 25/79 (31%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
+ + E ++ + + + KKK K + KK +KK ++
Sbjct: 57 IFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDD 116
Query: 68 VKEIEESLKERKEKTLTED 86
+ + + +D
Sbjct: 117 DLNYVKDIDVLNQADDDDD 135
Score = 29.2 bits (66), Expect = 3.4
Identities = 10/57 (17%), Positives = 22/57 (38%), Gaps = 1/57 (1%)
Query: 31 EKEKEDVEEEKNKKKKKK-KKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
+ + KK K + KK +KK K ++ + + + L + + +D
Sbjct: 82 AAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDD 138
Score = 28.8 bits (65), Expect = 4.8
Identities = 13/65 (20%), Positives = 24/65 (36%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
+ + E E+ ++K KK K K K K++ K+ E K E+ + +
Sbjct: 1 MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLS 60
Query: 83 LTEDS 87
Sbjct: 61 GMVKD 65
Score = 28.4 bits (64), Expect = 7.0
Identities = 12/66 (18%), Positives = 33/66 (50%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
+++ ++++ +E + ED +++ + ++KK+ K+ +K + E + RK
Sbjct: 143 DDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARK 202
Query: 80 EKTLTE 85
+ LT
Sbjct: 203 DAKLTA 208
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 41.0 bits (97), Expect = 7e-04
Identities = 22/72 (30%), Positives = 48/72 (66%), Gaps = 3/72 (4%)
Query: 12 KKRIEKRNKEEEKEEN---EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
KKRI+ KEEE+ + E+E + EEE+ +K+K+++++ + +++ +++E++
Sbjct: 29 KKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQ 88
Query: 69 KEIEESLKERKE 80
+E EE L+ER++
Sbjct: 89 EEYEERLQEREQ 100
Score = 34.9 bits (81), Expect = 0.062
Identities = 15/60 (25%), Positives = 41/60 (68%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
EEK+ + EEKE+E +E ++++ K +++++++K+K++ + ++E ++ER+++
Sbjct: 27 EEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKR 86
Score = 33.0 bits (76), Expect = 0.21
Identities = 26/84 (30%), Positives = 52/84 (61%), Gaps = 6/84 (7%)
Query: 3 KRRGGAGEKKKRIEKR--NKEEEKEENEQEEKEK----EDVEEEKNKKKKKKKKKKKKKK 56
+RR G +++IE+R ++EE EE QE ++ E ++EE + ++K++K+KK +
Sbjct: 68 ERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLR 127
Query: 57 KKKKKKKEEQSVKEIEESLKERKE 80
++ + EE+ ++ EE +ER+E
Sbjct: 128 EEIDEFNEERIERKEEEKEREREE 151
Score = 32.6 bits (75), Expect = 0.33
Identities = 20/78 (25%), Positives = 50/78 (64%), Gaps = 8/78 (10%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEK---EDVEEEKNKKKKKKKKKKKKKKKKKKK-----KK 63
+ IE+ +E+E E E+ EK+K E+++E ++ ++K+++K+++++++ K ++
Sbjct: 102 DEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQRE 161
Query: 64 EEQSVKEIEESLKERKEK 81
+ + +E E +ERKE+
Sbjct: 162 KAEREEEREAERRERKEE 179
Score = 31.4 bits (72), Expect = 0.60
Identities = 20/83 (24%), Positives = 50/83 (60%), Gaps = 12/83 (14%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK------KKKKKKKKKKKKKKKK 63
E+++RI++ EEE+ + EE+E+E +E+ ++ + ++++K+++++ +++ +
Sbjct: 38 EEERRIDEM-MEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQ 96
Query: 64 EEQSVKEI-----EESLKERKEK 81
E + + EI EE E +EK
Sbjct: 97 EREQMDEIIERIQEEDEAEAQEK 119
Score = 31.4 bits (72), Expect = 0.64
Identities = 15/83 (18%), Positives = 41/83 (49%), Gaps = 13/83 (15%)
Query: 12 KKRIEKRNKEEEKEENE-----QEEKEKEDVEEEKNKKKKKKKKKK--------KKKKKK 58
K+ + + EE+ E E Q E E+ + E + ++ K+ + ++ K++++
Sbjct: 246 KEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRA 305
Query: 59 KKKKKEEQSVKEIEESLKERKEK 81
++++E + + + E ER+ +
Sbjct: 306 AEREEELEEGERLREEEAERQAR 328
Score = 31.4 bits (72), Expect = 0.77
Identities = 23/94 (24%), Positives = 54/94 (57%), Gaps = 14/94 (14%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEK---EDVEEEKNKKKKKKKKKKKKKKK 57
+RR EK++ + + ++E+ E+E+EE ++ + +EE +K+++K+K++ +K++
Sbjct: 170 EAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEKEEAEKRR 229
Query: 58 KKKKK-----------KEEQSVKEIEESLKERKE 80
++K++ KEE+ +E E ER+
Sbjct: 230 RQKQELQRAREEQIEEKEERLQEERAEEEAERER 263
Score = 31.0 bits (71), Expect = 0.94
Identities = 21/86 (24%), Positives = 51/86 (59%), Gaps = 14/86 (16%)
Query: 10 EKKKRIEKRNKE-----------EEKEENEQEEKEKEDVEEEKNKK--KKKKKKKKKKKK 56
++ + EKR K+ EE+ E ++EEKE+E EE K + ++K +++++++
Sbjct: 112 DEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREA 171
Query: 57 KKKKKKKE-EQSVKEIEESLKERKEK 81
+++++K+E E+ V + +E +++
Sbjct: 172 ERRERKEEKEREVARLRAQQEEAEDE 197
Score = 30.3 bits (69), Expect = 1.7
Identities = 15/85 (17%), Positives = 48/85 (56%), Gaps = 14/85 (16%)
Query: 11 KKKRIEKRNKEEEKEE------------NEQEEKEKEDVEEEKNK--KKKKKKKKKKKKK 56
+K++ E + +K+E E+ ++E+ + E E+ + +K+ + ++ +++
Sbjct: 219 QKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQEN 278
Query: 57 KKKKKKKEEQSVKEIEESLKERKEK 81
+K++ K + +E+E+ ++E++E+
Sbjct: 279 AEKRRMKRLEHRRELEQQIEEKEER 303
Score = 30.3 bits (69), Expect = 1.8
Identities = 15/73 (20%), Positives = 44/73 (60%), Gaps = 3/73 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK---KKKEEQ 66
E+ +R E+ + E +EE + E ++E E E+ ++ +++++K++K+++ + +++E +
Sbjct: 136 ERIERKEEEKEREREEELKILEYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAE 195
Query: 67 SVKEIEESLKERK 79
+E + L+
Sbjct: 196 DEREELDELRADL 208
Score = 29.9 bits (68), Expect = 2.0
Identities = 12/76 (15%), Positives = 46/76 (60%), Gaps = 2/76 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EK+ E+ +E ++ + + + ++E++ +K++++ +++++ ++ ++ +EE+
Sbjct: 266 EKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEELEEGERLREEE--A 323
Query: 70 EIEESLKERKEKTLTE 85
E + ++E +++ L E
Sbjct: 324 ERQARIEEERQRLLKE 339
Score = 29.5 bits (67), Expect = 2.9
Identities = 18/96 (18%), Positives = 48/96 (50%), Gaps = 27/96 (28%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKED-----------VEEEKNKKKKKKKKKKK-------- 53
+ +E+ + EE E+E K KE+ +EE + +++++ +++ +
Sbjct: 45 EMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEI 104
Query: 54 --------KKKKKKKKKKEEQSVKEIEESLKERKEK 81
+ + ++K++K+++ +EI+E +ER E+
Sbjct: 105 IERIQEEDEAEAQEKREKQKKLREEIDEFNEERIER 140
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 41.2 bits (97), Expect = 7e-04
Identities = 24/83 (28%), Positives = 43/83 (51%), Gaps = 9/83 (10%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK-------KKKKKKKKKKKKKKK 62
E IE+ +E E EE EE+ +E +KN ++K K+ + KK KK+ K +
Sbjct: 389 ENDAEIEELRRELEGEEESDEEENEE--PSKKNVGRRKFGPENGEKEAESKKLKKENKNE 446
Query: 63 KEEQSVKEIEESLKERKEKTLTE 85
+E+ + EE L++ +E + +
Sbjct: 447 FKEKKESDEEEELEDEEEAKVEK 469
Score = 40.8 bits (96), Expect = 0.001
Identities = 25/71 (35%), Positives = 38/71 (53%), Gaps = 2/71 (2%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+R E R KEE E E+ +E E EE E+ ++ KK ++K + +KE +S K
Sbjct: 380 QRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKL 439
Query: 71 IEESLKERKEK 81
+E+ E KEK
Sbjct: 440 KKENKNEFKEK 450
Score = 38.1 bits (89), Expect = 0.006
Identities = 20/61 (32%), Positives = 37/61 (60%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
+ K+++K NK E KE+ E +E+E+ + EEE +K K K+ +K +K++++EE
Sbjct: 432 KEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELD 491
Query: 68 V 68
Sbjct: 492 E 492
Score = 37.7 bits (88), Expect = 0.008
Identities = 25/74 (33%), Positives = 36/74 (48%), Gaps = 1/74 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEK-EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
+ KR EK KEEE+EE ++E K K+ KK+ KKK K K K + +V
Sbjct: 473 KLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAV 532
Query: 69 KEIEESLKERKEKT 82
K ++ KE+
Sbjct: 533 KVKKKKKKEKSIDL 546
Score = 37.0 bits (86), Expect = 0.015
Identities = 22/85 (25%), Positives = 44/85 (51%), Gaps = 10/85 (11%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK----------KEE 65
E+ ++EE +E +++ ++ E K+ + KK KK+ K + K+KK +EE
Sbjct: 405 EESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEE 464
Query: 66 QSVKEIEESLKERKEKTLTEDSQCD 90
V+++ L +R EK E+ + +
Sbjct: 465 AKVEKVANKLLKRSEKAQKEEEEEE 489
Score = 36.6 bits (85), Expect = 0.019
Identities = 17/75 (22%), Positives = 38/75 (50%), Gaps = 1/75 (1%)
Query: 17 KRNKEEEKEENEQEEKE-KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
+R + +KEEN+ E +E + ++E E+ +++ ++ KK ++K E + + L
Sbjct: 380 QRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKL 439
Query: 76 KERKEKTLTEDSQCD 90
K+ + E + D
Sbjct: 440 KKENKNEFKEKKESD 454
Score = 36.6 bits (85), Expect = 0.020
Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 3/81 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEE---KNKKKKKKKKKKKKKKKKKKKKKEEQ 66
E+ K + NK ++ E Q+E+E+E+++EE K KK+ KKK K ++
Sbjct: 463 EEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDK 522
Query: 67 SVKEIEESLKERKEKTLTEDS 87
+ +I ++ + K+K E S
Sbjct: 523 AANKISKAAVKVKKKKKKEKS 543
Score = 35.0 bits (81), Expect = 0.061
Identities = 21/75 (28%), Positives = 42/75 (56%), Gaps = 2/75 (2%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
RR E ++ + K +++ +NE +EK++ D EEE +++ K +K K K+ +K
Sbjct: 421 GRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEK 480
Query: 63 KEEQSVKEIEESLKE 77
+++ +E EE +E
Sbjct: 481 AQKE--EEEEELDEE 493
Score = 35.0 bits (81), Expect = 0.061
Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 14/57 (24%)
Query: 21 EEEKEENEQEEKEKE--------------DVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E+EK+E +EE KE +++ K K+K+K++ K + KK+K+K
Sbjct: 596 EKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRFLTKIEGVKKEKRK 652
Score = 33.5 bits (77), Expect = 0.17
Identities = 19/103 (18%), Positives = 43/103 (41%), Gaps = 19/103 (18%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK-------------- 55
E K +++ + +E+EE E EE+ K + K K+ +K +K++++
Sbjct: 442 ENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTS 501
Query: 56 -----KKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
KK+ KK+ S + + + + + + + I
Sbjct: 502 SVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSI 544
Score = 33.1 bits (76), Expect = 0.24
Identities = 22/78 (28%), Positives = 36/78 (46%), Gaps = 25/78 (32%)
Query: 23 EKEENEQE-EKEKEDVEEEKNKK----------------KKKKKKKKKKKKKKKKKKKEE 65
++ E EKEK++V EE++ K KK+KKK+K+K++ K +
Sbjct: 587 AGDDVVAEFEKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRFLTKIEGV 646
Query: 66 QSVKEIEESLKERKEKTL 83
+ K RK+K L
Sbjct: 647 KKEK--------RKDKKL 656
Score = 33.1 bits (76), Expect = 0.25
Identities = 16/94 (17%), Positives = 37/94 (39%), Gaps = 14/94 (14%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKED--------------VEEEKNKKKKKKKKKKK 53
+R E++ E+++ +KE ++ +EE+ K +K K K
Sbjct: 417 KKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLK 476
Query: 54 KKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
+ +K +K+++EE+ +E +
Sbjct: 477 RSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQ 510
Score = 32.7 bits (75), Expect = 0.30
Identities = 16/60 (26%), Positives = 27/60 (45%), Gaps = 1/60 (1%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK-EEQSVK 69
+ K K+++ ++ + +K + K K KKKKKK+K EE S+K
Sbjct: 499 TTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIK 558
Score = 32.0 bits (73), Expect = 0.53
Identities = 20/39 (51%), Positives = 24/39 (61%), Gaps = 2/39 (5%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
EE K K+ KK K KK + KK+K KEE KE EE +K
Sbjct: 216 EEAKAKRIKKIKSKKYHRVHKKEKLKEEL--KEFEELVK 252
Score = 32.0 bits (73), Expect = 0.57
Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 7/81 (8%)
Query: 10 EKKKRIEKRNKE-----EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
EK K K +E E E E+ E+ EE K K K K K + K +
Sbjct: 238 EKLKEELKEFEELVKADPEAALEELEKLERRRAEERMTLKHKNTSKWAKSMLKTGRAKYD 297
Query: 65 EQSVKEIEESLKERKEKTLTE 85
E++ K +EE L R+ + L
Sbjct: 298 EEARKALEEQL--RQGEELRR 316
Score = 31.6 bits (72), Expect = 0.73
Identities = 18/86 (20%), Positives = 41/86 (47%), Gaps = 20/86 (23%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK------------ 59
+++IE ++ EE E+ + + +E+++ ++E + + +KK K K
Sbjct: 315 RRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLL 374
Query: 60 --------KKKKEEQSVKEIEESLKE 77
+ +K+E++ EIEE +E
Sbjct: 375 SMKFMQRAEARKKEENDAEIEELRRE 400
Score = 29.6 bits (67), Expect = 2.7
Identities = 18/80 (22%), Positives = 40/80 (50%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
E K+ R K + ++ E +K E + K+KK+ ++++ + +++ K E+ +
Sbjct: 412 NEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVA 471
Query: 68 VKEIEESLKERKEKTLTEDS 87
K ++ S K +KE+ E
Sbjct: 472 NKLLKRSEKAQKEEEEEELD 491
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 40.3 bits (95), Expect = 7e-04
Identities = 14/64 (21%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K +++ + E++ ++ E ++++ + +K K K +KK+K + +K + + V
Sbjct: 105 EAKAKVQAQRAEQQAKKREAAGEKEK--APRRERKPKPKAPRKKRKPRAQKPEPQHTPVS 162
Query: 70 EIEE 73
+I E
Sbjct: 163 DISE 166
Score = 34.5 bits (80), Expect = 0.046
Identities = 9/49 (18%), Positives = 27/49 (55%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
EE + Q ++ ++ ++ + +K+K ++++K K K +K+ + +
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152
Score = 33.7 bits (78), Expect = 0.085
Identities = 13/53 (24%), Positives = 29/53 (54%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
EE K + + + E++ + E +K+K ++++K K K +KK + ++ E
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156
Score = 33.0 bits (76), Expect = 0.15
Identities = 8/46 (17%), Positives = 24/46 (52%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
E+ KK++ +K+K ++++K K + K+ + ++ + +
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPV 161
Score = 31.8 bits (73), Expect = 0.40
Identities = 7/55 (12%), Positives = 26/55 (47%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
+ E +++ +K+K ++++K K K ++++ + + + ++E
Sbjct: 112 AQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166
Score = 27.6 bits (62), Expect = 8.0
Identities = 14/64 (21%), Positives = 30/64 (46%), Gaps = 5/64 (7%)
Query: 11 KKKRIEKRNK--EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE---E 65
+ +R E++ K E E+ + +E++ + KK+K + +K + + E
Sbjct: 111 QAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVG 170
Query: 66 QSVK 69
Q+VK
Sbjct: 171 QAVK 174
Score = 27.6 bits (62), Expect = 9.2
Identities = 13/64 (20%), Positives = 33/64 (51%), Gaps = 1/64 (1%)
Query: 26 ENEQEEKEKEDVEEEKNK-KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
E E E ++ +EE K K + ++ +++ KK++ +K+K + ++ + +K K
Sbjct: 92 EEEHVEHARKQLEEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRA 151
Query: 85 EDSQ 88
+ +
Sbjct: 152 QKPE 155
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 41.0 bits (96), Expect = 8e-04
Identities = 20/71 (28%), Positives = 39/71 (54%), Gaps = 3/71 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+KKR +N+E K ENE++ +E ++E EK +++ + + K+ +++ +S
Sbjct: 141 IEKKRENNKNEERLKFENEKKLEESLELEREK---FEEQLHEANLDLEFKENEEQRESKW 197
Query: 70 EIEESLKERKE 80
I + LK R E
Sbjct: 198 AILKKLKRRAE 208
Score = 37.1 bits (86), Expect = 0.012
Identities = 20/76 (26%), Positives = 45/76 (59%), Gaps = 1/76 (1%)
Query: 11 KKKRIEKRNKEEEKE-ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K + +E N E++KE E ++E ++ E +K + + +KK++ K +++ + ++ K
Sbjct: 101 KLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEK 160
Query: 70 EIEESLKERKEKTLTE 85
++EESL+ +EK +
Sbjct: 161 KLEESLELEREKFEEQ 176
Score = 36.3 bits (84), Expect = 0.019
Identities = 17/72 (23%), Positives = 37/72 (51%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + E E + N +++KE E +E+E ++ K+ +K+ + + +KK+E +
Sbjct: 92 ELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNE 151
Query: 70 EIEESLKERKEK 81
E + E+K +
Sbjct: 152 ERLKFENEKKLE 163
Score = 32.9 bits (75), Expect = 0.24
Identities = 17/70 (24%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 10 EKKKRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
+KK++ N++ ++ E Q+E K + E N +K K+ + +K+ + K+ ++Q
Sbjct: 76 QKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQ 135
Query: 69 KEIEESLKER 78
E K+R
Sbjct: 136 NTAEIIEKKR 145
Score = 32.5 bits (74), Expect = 0.39
Identities = 14/79 (17%), Positives = 34/79 (43%), Gaps = 2/79 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ + +K + E++ Q++++K E+ + + + + K + + E+ K
Sbjct: 57 QYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKD--K 114
Query: 70 EIEESLKERKEKTLTEDSQ 88
E+E KE E + Q
Sbjct: 115 ELELLEKELDELSKELQKQ 133
Score = 31.3 bits (71), Expect = 0.75
Identities = 25/82 (30%), Positives = 42/82 (51%), Gaps = 8/82 (9%)
Query: 10 EKKKRIEKRNKEEEKEENEQE-------EKEKEDVEEE-KNKKKKKKKKKKKKKKKKKKK 61
E + RI +E EKE NE+ +KE +EE+ N+KK++K ++ K+ +
Sbjct: 35 EDQSRILNTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELA 94
Query: 62 KKEEQSVKEIEESLKERKEKTL 83
++E + E E L K+K L
Sbjct: 95 LQDEIAKLEALELLNLEKDKEL 116
Score = 29.8 bits (67), Expect = 2.5
Identities = 11/87 (12%), Positives = 41/87 (47%), Gaps = 8/87 (9%)
Query: 10 EKKKRIEKRNKEEEKEENEQEE---KEKEDVEEEKNKKKKKKKKKKKKKKK-----KKKK 61
E + E+ ++++++N E + + +++E K + + +K K+ K+
Sbjct: 65 ELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELD 124
Query: 62 KKEEQSVKEIEESLKERKEKTLTEDSQ 88
+ ++ K+++ + + ++K ++
Sbjct: 125 ELSKELQKQLQNTAEIIEKKRENNKNE 151
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 40.7 bits (96), Expect = 8e-04
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 1/63 (1%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNK-KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
+E E+E E+ E+E ++ E EK + +KK +KK KK KK K EE +E+ E E
Sbjct: 64 EEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
Query: 79 KEK 81
E
Sbjct: 124 PEP 126
Score = 38.0 bits (89), Expect = 0.005
Identities = 30/92 (32%), Positives = 44/92 (47%), Gaps = 12/92 (13%)
Query: 6 GGAGEKK-----------KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
+GE+K + IE+ +EE +EE ++ E EKE E EK K +KK KK K
Sbjct: 46 AESGERKSAVDKLAMKPLEEIEEELREEYEEELKEYEAEKEIWEAEK-KGLEKKAKKAIK 104
Query: 55 KKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
K K ++ EE E EE + + + D
Sbjct: 105 KGKDEEALAEELLELEAEEPEPPLRPRLIVND 136
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 40.9 bits (96), Expect = 9e-04
Identities = 22/88 (25%), Positives = 37/88 (42%), Gaps = 16/88 (18%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
E E ++ E +++E D EE+ KKK KKK+++K
Sbjct: 151 DSDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRK-------------PEST 197
Query: 71 IEESLKERKEKTLTEDSQCDTMIPVFKK 98
+E++ + K L + D P+FKK
Sbjct: 198 LEKNFEALNVKKLDLEFAVD---PLFKK 222
Score = 29.0 bits (65), Expect = 4.6
Identities = 9/35 (25%), Positives = 14/35 (40%)
Query: 44 KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
K +K+KK+ K+ E +EE E
Sbjct: 435 AAADTKSAAEKQKKRAKEPFEIDFGAPLEEIDFEV 469
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 40.8 bits (95), Expect = 0.001
Identities = 17/89 (19%), Positives = 41/89 (46%), Gaps = 8/89 (8%)
Query: 8 AGEKKKRIEKRNKEEEKEENE---QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
A +++ + ++ +E + KE + V E + ++ +K + + KK ++ K K+
Sbjct: 243 ADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKD 302
Query: 65 ---EQSVKEIEESLKERKEKTLTEDSQCD 90
+E + S KE ++K ++Q
Sbjct: 303 HKAFDLKQESKASEKEAEDK--ELEAQKK 329
Score = 40.0 bits (93), Expect = 0.002
Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 12/99 (12%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK-KKKKKKKKKKKKKKEE 65
E K+ E + +E EK Q E +K D E K K K K++ K +K+ + KE
Sbjct: 268 SPKEDKQVAENQKREIEKA---QIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKEL 324
Query: 66 QSVKEIEESLKE--------RKEKTLTEDSQCDTMIPVF 96
++ K+ E ++ + T + D+ PV+
Sbjct: 325 EAQKKREPVAEDLQKTKPQVEAQPTSLNEDAIDSSNPVY 363
Score = 39.2 bits (91), Expect = 0.002
Identities = 21/80 (26%), Positives = 39/80 (48%), Gaps = 3/80 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV- 68
K ++++ EN++ E EK +E +KN ++ K K K K++ K E+
Sbjct: 261 PKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAE 320
Query: 69 KEIEESLKERKEKTLTEDSQ 88
+ E+ K+R+ + ED Q
Sbjct: 321 DKELEAQKKREP--VAEDLQ 338
Score = 37.3 bits (86), Expect = 0.010
Identities = 18/82 (21%), Positives = 38/82 (46%), Gaps = 1/82 (1%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKK 60
++ + E K+ +K ++K + Q+ +K+ D +K ++ K K K+
Sbjct: 213 KRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKED 272
Query: 61 KKKEEQSVKEIEESLKERKEKT 82
K+ E +EIE++ E K+
Sbjct: 273 KQVAENQKREIEKAQIEIKKND 294
Score = 36.1 bits (83), Expect = 0.028
Identities = 12/74 (16%), Positives = 37/74 (50%), Gaps = 1/74 (1%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE-EEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
A + +++ + +K+ +E +K++E + K+ K+ + +K++ +K +
Sbjct: 229 ADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQI 288
Query: 67 SVKEIEESLKERKE 80
+K+ +E + K+
Sbjct: 289 EIKKNDEEALKAKD 302
Score = 35.7 bits (82), Expect = 0.034
Identities = 15/72 (20%), Positives = 42/72 (58%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
++K + N +++++E Q+++E +++ + + K+ K+ + +K++ +K + + K
Sbjct: 233 QQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKK 292
Query: 70 EIEESLKERKEK 81
EE+LK + K
Sbjct: 293 NDEEALKAKDHK 304
Score = 33.4 bits (76), Expect = 0.17
Identities = 13/70 (18%), Positives = 31/70 (44%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ +K + R + +E E +E K + ++ KK+ K ++K + ++
Sbjct: 189 DNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRD 248
Query: 70 EIEESLKERK 79
E+ + +E K
Sbjct: 249 EVRQKQQEAK 258
Score = 31.9 bits (72), Expect = 0.58
Identities = 18/74 (24%), Positives = 38/74 (51%), Gaps = 4/74 (5%)
Query: 12 KKRIEKRNKEEEKEENEQEE----KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
KK +E ++ EK N + + KE+E E+ K ++ K++ KK+ K ++K + +
Sbjct: 180 KKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFA 239
Query: 68 VKEIEESLKERKEK 81
++ E ++K
Sbjct: 240 QDNADKQRDEVRQK 253
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 39.5 bits (92), Expect = 0.001
Identities = 17/62 (27%), Positives = 40/62 (64%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
+E EE + + KEK++ EE K+ ++ + +K +K ++K++++ ++E++E L+E
Sbjct: 141 GQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELL 200
Query: 80 EK 81
E+
Sbjct: 201 EE 202
Score = 36.0 bits (83), Expect = 0.015
Identities = 18/61 (29%), Positives = 40/61 (65%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
EE K + +++++ E++E+E + + +K +K ++K+KKK+ E+ +E+EE L+E
Sbjct: 144 RLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELLEEL 203
Query: 79 K 79
+
Sbjct: 204 E 204
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 40.8 bits (96), Expect = 0.001
Identities = 27/79 (34%), Positives = 42/79 (53%), Gaps = 5/79 (6%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + +IE+ KE +K E + E+ E E E+K ++ +K K K K KK KK + ++
Sbjct: 92 EIEAQIEELKKELKKLEEKIEQLEAE--IEKKEEELEKAKNKFLDKAWKKLAKKYDSNLS 149
Query: 70 EIEESLKER---KEKTLTE 85
E + L + KEK L E
Sbjct: 150 EALKGLNYKKNFKEKLLKE 168
Score = 38.1 bits (89), Expect = 0.007
Identities = 20/69 (28%), Positives = 37/69 (53%), Gaps = 5/69 (7%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
GE+ IE + +E +KE + EEK E+ + +KK+++ +K K K K ++
Sbjct: 87 GEENIEIEAQIEELKKELKKLEEKI-----EQLEAEIEKKEEELEKAKNKFLDKAWKKLA 141
Query: 69 KEIEESLKE 77
K+ + +L E
Sbjct: 142 KKYDSNLSE 150
Score = 35.4 bits (82), Expect = 0.047
Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 11/84 (13%)
Query: 9 GEKKKRIEKRNKEEEK--EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
+ +++IE+ E EK EE E+ + + D +K KK + K KK +E+
Sbjct: 105 KKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNFKEK 164
Query: 67 SVKEIE---------ESLKERKEK 81
+KE++ SL+E K K
Sbjct: 165 LLKELKSVILNASSLLSLEELKAK 188
Score = 35.0 bits (81), Expect = 0.071
Identities = 16/76 (21%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEE-KNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
++I+ KE+ K + + +++E+ +K+KK +K +K+ K+ E +K
Sbjct: 381 NEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKA 440
Query: 71 IEESLKERKEKTLTED 86
+E+ +KE +++ +
Sbjct: 441 LEKEIKELEKQLTNIE 456
Score = 34.2 bits (79), Expect = 0.096
Identities = 33/151 (21%), Positives = 70/151 (46%), Gaps = 8/151 (5%)
Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKK----KKKEEQSVKEIEESLKERKEKTLTEDSQC 89
+E++E E ++ KK+ KK ++K ++ +KKEE+ K + L + +K +
Sbjct: 88 EENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKY--- 144
Query: 90 DTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRYSHRQRQKSLIIVNLVDMVLKAVL 149
D+ + KG+ YK + L + L S S + + + + + A+L
Sbjct: 145 DSNLSEALKGLNYKKNFKEKLLKELKSVILNASS-LLSLEELKAKIKTLFSSNKPELALL 203
Query: 150 TVAWISYNEVELSQIDSTKLTWTVYTMMSNL 180
T++ I ++E+E ++I + + +S L
Sbjct: 204 TLSVIDFDEIEQAEILEKSIIGSSDVPISEL 234
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 39.7 bits (93), Expect = 0.001
Identities = 16/75 (21%), Positives = 44/75 (58%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
++ K N+ +E+++ ++E+ E E++E+E+ + ++ + K ++ K + K ++
Sbjct: 11 ENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEEN 70
Query: 69 KEIEESLKERKEKTL 83
K++E L+ K++ L
Sbjct: 71 KKLENELEALKDRLL 85
Score = 37.0 bits (86), Expect = 0.007
Identities = 21/85 (24%), Positives = 44/85 (51%), Gaps = 9/85 (10%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEK-EDVEEEKNKKKKKKKKKKKK--------KKKKKK 60
+ K ++ EN++E+K K ED+E E+ +K++ + ++ K + K
Sbjct: 6 KDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNK 65
Query: 61 KKKEEQSVKEIEESLKERKEKTLTE 85
K+E + ++ E+LK+R +T+ E
Sbjct: 66 LKEENKKLENELEALKDRLLRTVAE 90
Score = 35.9 bits (83), Expect = 0.016
Identities = 16/65 (24%), Positives = 38/65 (58%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
++ ++ K EN +E+ KE+ +E++K K++ + ++ +K++ + EE + +IEE
Sbjct: 1 MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELK 60
Query: 76 KERKE 80
E +
Sbjct: 61 DENNK 65
Score = 34.7 bits (80), Expect = 0.038
Identities = 18/71 (25%), Positives = 39/71 (54%), Gaps = 5/71 (7%)
Query: 16 EKRNKEEEKEENEQE-EKEKEDVEEEKNKKKKKK----KKKKKKKKKKKKKKKEEQSVKE 70
EK K+ + E E++ KE E+ EE+K K++ + +K++ + ++ + + + +K+
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKD 61
Query: 71 IEESLKERKEK 81
LKE +K
Sbjct: 62 ENNKLKEENKK 72
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4;
Provisional.
Length = 191
Score = 39.4 bits (92), Expect = 0.001
Identities = 22/58 (37%), Positives = 32/58 (55%), Gaps = 4/58 (6%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK---KKKKKKKEEQSVKEIE 72
++ E ++ ++ EDVE K KKKK +KKK K KKKKKE++ V E+
Sbjct: 25 KSTTESVDKVN-DDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELA 81
Score = 33.6 bits (77), Expect = 0.098
Identities = 18/67 (26%), Positives = 29/67 (43%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
K E +K +D+ E + KK KKKK +KKK ++ K+ E+
Sbjct: 25 KSTTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEE 84
Query: 84 TEDSQCD 90
DS+ +
Sbjct: 85 LSDSEEN 91
Score = 29.8 bits (67), Expect = 1.4
Identities = 9/57 (15%), Positives = 26/57 (45%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
++ + K ++K++ ++E E E +++ ++ KK + K + E +
Sbjct: 57 TPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEVN 113
Score = 28.2 bits (63), Expect = 4.6
Identities = 15/64 (23%), Positives = 29/64 (45%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+K+ +K + K+ K+E++E E +E D EE + KK + K + +
Sbjct: 53 KKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEV 112
Query: 62 KKEE 65
E+
Sbjct: 113 NHED 116
>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
region. Members of this family have very varied
localisations within the eukaryotic cell. pinin is known
to localise at the desmosomes and is implicated in
anchoring intermediate filaments to the desmosomal
plaque. SDK2/3 is a dynamically localised nuclear
protein thought to be involved in modulation of
alternative pre-mRNA splicing. memA is a tumour marker
preferentially expressed in human melanoma cell lines. A
common feature of the members of this family is that
they may all participate in regulating protein-protein
interactions.
Length = 131
Score = 38.2 bits (89), Expect = 0.001
Identities = 17/74 (22%), Positives = 43/74 (58%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
R +++ IE++ +E+EK+E E+ KEK ++ EE+ +K+ + +K ++K + +K ++
Sbjct: 28 SRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKMEDEKLQE 87
Query: 63 KEEQSVKEIEESLK 76
+ + ++
Sbjct: 88 TWHEHNLALANFIR 101
Score = 34.8 bits (80), Expect = 0.022
Identities = 17/74 (22%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K + ++R + E+K E EQE++E+E++ +EK + +++++K+ + +K ++K
Sbjct: 22 KFSQEESRLTSKEKRRAEIEQKLE-EQEKQEREELRKEKRELFEERRRKQLELRKLEQKM 80
Query: 63 KEEQSVKEIEESLK 76
++E+ + E
Sbjct: 81 EDEKLQETWHEHNL 94
Score = 33.2 bits (76), Expect = 0.063
Identities = 17/74 (22%), Positives = 42/74 (56%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+K + E R +EK E E+K +E ++E+ + +K+K++ +++++K+ + ++ +
Sbjct: 21 QKFSQEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKM 80
Query: 70 EIEESLKERKEKTL 83
E E+ + E L
Sbjct: 81 EDEKLQETWHEHNL 94
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 38.4 bits (90), Expect = 0.001
Identities = 22/79 (27%), Positives = 48/79 (60%), Gaps = 2/79 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK--KKKKKKKKKKKKKKKKKKKEEQS 67
+K RI ++ E++E E+EEKE+++ +E++ K +K++++K+ ++ +K K K
Sbjct: 6 DKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLAR 65
Query: 68 VKEIEESLKERKEKTLTED 86
+ E+ +E KE+ +D
Sbjct: 66 YADDEDYDEELKEQERWDD 84
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 38.5 bits (90), Expect = 0.001
Identities = 21/35 (60%), Positives = 27/35 (77%)
Query: 29 QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
+E+K + V +EK K+KK+KKKKKKKKKKK KK
Sbjct: 96 EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130
Score = 37.7 bits (88), Expect = 0.002
Identities = 26/52 (50%), Positives = 34/52 (65%), Gaps = 2/52 (3%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
+ RI + EE++ +Q KEK +++K KKKKKKKKKKK KK KKKK
Sbjct: 86 RDRIVRAFLVEEQKIVKQVLKEKA--KQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 37.0 bits (86), Expect = 0.003
Identities = 20/37 (54%), Positives = 25/37 (67%), Gaps = 4/37 (10%)
Query: 37 VEEEK----NKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
VEE+K K+K K+KK+KKKKKKKKKKK +
Sbjct: 95 VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAA 131
Score = 36.6 bits (85), Expect = 0.005
Identities = 18/29 (62%), Positives = 21/29 (72%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K K K+KK+KKKKKKKKKKK K+ K
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 35.0 bits (81), Expect = 0.015
Identities = 18/37 (48%), Positives = 28/37 (75%)
Query: 33 EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E++ V++ +K K+KK+KKKKKKKKKKK ++ + K
Sbjct: 97 EQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKK 133
Score = 35.0 bits (81), Expect = 0.019
Identities = 15/38 (39%), Positives = 25/38 (65%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
EE+ K+ K+K K+KK+KKKKKK+++ + + K
Sbjct: 96 EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKK 133
Score = 31.6 bits (72), Expect = 0.25
Identities = 16/40 (40%), Positives = 29/40 (72%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
E+ K K+ K+K K+KK+KKKKK+++ K +++ K++K
Sbjct: 96 EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 28.1 bits (63), Expect = 4.2
Identities = 13/34 (38%), Positives = 23/34 (67%)
Query: 49 KKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
K+ K+K K+KK+KK+++ K+ + S K K+K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 27.3 bits (61), Expect = 7.1
Identities = 13/41 (31%), Positives = 25/41 (60%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
++K K+ K+K K+KK+KKK+++ K+ +K+K
Sbjct: 95 VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 39.8 bits (94), Expect = 0.001
Identities = 15/44 (34%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 27 NEQEEKEKEDVEEEK-NKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
NEQ ++ KE+++ +KK+ K K +KKK+K++ K + K
Sbjct: 46 NEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 39.0 bits (92), Expect = 0.002
Identities = 19/62 (30%), Positives = 32/62 (51%), Gaps = 3/62 (4%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
++++ GE + N E+ KE E+ + D +E K K +KKK+K++ K K K
Sbjct: 31 QRKKSQKGEL--EVTNLN-EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAK 87
Query: 62 KK 63
K
Sbjct: 88 SK 89
Score = 34.4 bits (80), Expect = 0.076
Identities = 13/54 (24%), Positives = 25/54 (46%), Gaps = 8/54 (14%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
E+ K +++ K ++ E + K +KKK+K++ K K K K +
Sbjct: 46 NEQYKEMKEELKAALLDKKELKAWHKA--------QKKKEKQEAKAAKAKSKPR 91
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 39.4 bits (92), Expect = 0.001
Identities = 15/60 (25%), Positives = 32/60 (53%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ K +K N E + N Q++ ++ K + ++ KK ++KK++KKK +++ V
Sbjct: 20 KYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVP 79
Score = 38.6 bits (90), Expect = 0.002
Identities = 17/50 (34%), Positives = 30/50 (60%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
G E +I ++K EK + + + E V++ +++KK++KKK KKKK
Sbjct: 28 GNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77
Score = 37.9 bits (88), Expect = 0.004
Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 1/58 (1%)
Query: 5 RGGAGEKKKRIEKRNKEEEKEE-NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+ A + K E NK +++ E+ + + E+ KK ++KK++KKK KKKK
Sbjct: 20 KYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77
Score = 33.2 bits (76), Expect = 0.13
Identities = 11/57 (19%), Positives = 20/57 (35%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
E E + K+K + K + K +K K + E K ++ E +
Sbjct: 16 LESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKK 72
Score = 31.7 bits (72), Expect = 0.44
Identities = 16/77 (20%), Positives = 33/77 (42%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K+ + E+ K ++ E + ++K +K K K + ++ KK Q KE
Sbjct: 10 KRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEE 69
Query: 72 EESLKERKEKTLTEDSQ 88
++ K++K +Q
Sbjct: 70 KKKPKKKKVPLQVNPAQ 86
Score = 29.0 bits (65), Expect = 3.2
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKK 62
K K KK KK KKKKKKK+
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217
Score = 28.2 bits (63), Expect = 5.0
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 42 NKKKKKKKKKKKKKKKKKKKKKEE 65
+ K K KK KK KKKKKKK
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217
Score = 27.5 bits (61), Expect = 9.9
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKK 61
+ +K KK KK KKKKKKK+
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many
hypothetical proteins.
Length = 98
Score = 37.4 bits (87), Expect = 0.001
Identities = 22/72 (30%), Positives = 38/72 (52%), Gaps = 13/72 (18%)
Query: 22 EEKEENEQEEKEKEDV-------EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
EEK+ + ++++ +D+ +++K + K + K KK +KK KKK+E E E
Sbjct: 2 EEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEF--PEFPEE 59
Query: 75 LKERKEKTLTED 86
K R+ TED
Sbjct: 60 SKRRR----TED 67
Score = 32.4 bits (74), Expect = 0.080
Identities = 18/70 (25%), Positives = 30/70 (42%), Gaps = 10/70 (14%)
Query: 17 KRNKEEEKEENE----------QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
K K+ EK + + +K+K+ + K KK +KK KKK + + EE
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60
Query: 67 SVKEIEESLK 76
+ E+ L
Sbjct: 61 KRRRTEDGLP 70
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The
function is unclear.
Length = 151
Score = 38.4 bits (89), Expect = 0.001
Identities = 22/53 (41%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--KKKKKKKKKEEQSVKEIEES 74
KE + +E K+ E K + +K KKKK+K +KK K KKKEE+ +I E
Sbjct: 2 KETSYEEALAKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGKIPEH 54
Score = 35.0 bits (80), Expect = 0.021
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 2/77 (2%)
Query: 20 KEEEKEEN-EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
KE EE ++ KE + +K+ KKKK+K +KK K KKK++K + E E +E
Sbjct: 2 KETSYEEALAKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGKIPE-HEPNQEV 60
Query: 79 KEKTLTEDSQCDTMIPV 95
E + + + + V
Sbjct: 61 TEVEVIIEKEPVPAVAV 77
Score = 30.7 bits (69), Expect = 0.60
Identities = 14/39 (35%), Positives = 25/39 (64%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
K+ KE K ++++ +K+K++ EK K KKK++K K
Sbjct: 12 KQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGK 50
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 39.3 bits (92), Expect = 0.001
Identities = 16/86 (18%), Positives = 40/86 (46%), Gaps = 9/86 (10%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKK---------KKKKKKKKKKKKKKKKK 62
K+ ++ ++ EE +++E+EKE E E ++ ++K ++ K++ K KK
Sbjct: 96 KQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKK 155
Query: 63 KEEQSVKEIEESLKERKEKTLTEDSQ 88
+ +S + + E+++
Sbjct: 156 EAAESASSSLSGSAKPERNVSQEEAK 181
Score = 37.0 bits (86), Expect = 0.011
Identities = 21/77 (27%), Positives = 44/77 (57%), Gaps = 13/77 (16%)
Query: 10 EKKKRIEKRNKEEEK------------EENEQEEKEKEDVEE-EKNKKKKKKKKKKKKKK 56
K+R K KE + E N +E+ K+ ++E E K K++++K++++++
Sbjct: 145 AAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERR 204
Query: 57 KKKKKKKEEQSVKEIEE 73
K++KK++EE+ K+ E
Sbjct: 205 KQRKKQQEEEERKQKAE 221
Score = 36.2 bits (84), Expect = 0.017
Identities = 14/54 (25%), Positives = 38/54 (70%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
K ++EE K+ ++ E +K +++K +++++K++KK+++++++K+K EE
Sbjct: 169 AKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEE 222
Score = 32.0 bits (73), Expect = 0.40
Identities = 13/50 (26%), Positives = 38/50 (76%), Gaps = 3/50 (6%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+KR +E E ++ +Q+++++ EEE+ K++KK+++++++K+K ++ ++
Sbjct: 180 AKKRLQEWELKKLKQQQQKR---EEERRKQRKKQQEEEERKQKAEEAWQK 226
Score = 31.6 bits (72), Expect = 0.55
Identities = 18/106 (16%), Positives = 54/106 (50%), Gaps = 26/106 (24%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKED-----------------------VE 38
K R A +++ +++ +E +++ +Q K++ V
Sbjct: 117 EKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVS 176
Query: 39 EEKNKKKKKK---KKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+E+ KK+ ++ KK K++++K+++++++++ ++ EE K++ E+
Sbjct: 177 QEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEE 222
Score = 29.3 bits (66), Expect = 2.5
Identities = 14/49 (28%), Positives = 31/49 (63%)
Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
++K ++++K E E+ ++ K+ EEE+ K+K ++ +K K K++ K
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQKWMKNVKQRPK 236
Score = 29.3 bits (66), Expect = 2.8
Identities = 16/68 (23%), Positives = 35/68 (51%)
Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
+ +E + E + +E E + K+ +++KK +K ++K+K++ E+ +E E
Sbjct: 66 LSDSSTYSRSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAEL 125
Query: 74 SLKERKEK 81
+ KEK
Sbjct: 126 RQRLAKEK 133
Score = 28.5 bits (64), Expect = 4.7
Identities = 16/58 (27%), Positives = 37/58 (63%), Gaps = 4/58 (6%)
Query: 10 EKKKRIE----KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E KKR++ K+ K+++++ E+ K+++ +EE+ +K+K ++ +K K K++ K
Sbjct: 179 EAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQKWMKNVKQRPK 236
>gnl|CDD|220839 pfam10661, EssA, WXG100 protein secretion system (Wss), protein
EssA. The WXG100 protein secretion system (Wss) is
responsible for the secretion of WXG100 proteins
(pfam06013) such as ESAT-6 and CFP-10 in Mycobacterium
tuberculosis or EsxA and EsxB in Staphylococcus aureus.
In S. aureus, the Wss seems to be encoded by a locus of
eight CDS, called ess (eSAT-6 secretion system). This
locus encodes, amongst several other proteins, EssA, a
protein predicted to possess one transmembrane domain.
Due to its predicted membrane location and its absolute
requirement for WXG100 protein secretion, it has been
speculated that EssA could form a secretion apparatus
in conjunction with the polytopic membrane protein
EsaA, YukC (pfam10140) and YukAB, which is a
membrane-bound ATPase containing Ftsk/SpoIIIE domains
(pfam01580) called EssC in S. aureus and Snm1/Snm2 in
Mycobacterium tuberculosis. Proteins homologous to
EssA, YukC, EsaA and YukD seem absent from
mycobacteria.
Length = 145
Score = 38.3 bits (89), Expect = 0.002
Identities = 20/74 (27%), Positives = 41/74 (55%), Gaps = 9/74 (12%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEK-------EDVEEEKNKKKKKKKKKKKKKKKK--KKK 61
K R++K ++E+ +++ + E +K + EEE NKKK ++K+ + + +
Sbjct: 18 KVDRLQKTDQEKNEKKLRETELDKLGIELFTTETEEEINKKKNAEQKEMEDIENSLFSED 77
Query: 62 KKEEQSVKEIEESL 75
K+ +VKE ++SL
Sbjct: 78 KEGNVAVKETKDSL 91
Score = 31.4 bits (71), Expect = 0.30
Identities = 18/86 (20%), Positives = 42/86 (48%), Gaps = 9/86 (10%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKN-----KKKKKKKKKKKKKKKKKKKKKE 64
E +++ + +K + E+ EK+ + E +K + +++ KKK ++K+ +
Sbjct: 10 EDDGKMQFKVDRLQKTDQEKNEKKLRETELDKLGIELFTTETEEEINKKKNAEQKEMEDI 69
Query: 65 EQSV----KEIEESLKERKEKTLTED 86
E S+ KE ++KE K+ + +
Sbjct: 70 ENSLFSEDKEGNVAVKETKDSLFSSE 95
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association
with pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 36.9 bits (86), Expect = 0.002
Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 2/51 (3%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK--KKKKKEEQ 66
EE EE++++ + + ++KKKKKKK KKK K +KKE+
Sbjct: 14 NGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQM 64
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 40.0 bits (94), Expect = 0.002
Identities = 19/43 (44%), Positives = 35/43 (81%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K ++KEE ++E++EKE ++E+K +K KK+++KKKK+ +K +K
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 39.2 bits (92), Expect = 0.003
Identities = 24/83 (28%), Positives = 44/83 (53%), Gaps = 12/83 (14%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
+KEE + E E++E K ++K+ +K KK+++KKKK+ +K E++ E K +
Sbjct: 549 DKEELQREKEEKEALK--------EQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQ 600
Query: 79 KEKTLTEDSQCDTMIPV-FKKGV 100
++K D +T +P G
Sbjct: 601 EDKYSAFD---ETGLPTHDADGE 620
Score = 36.2 bits (84), Expect = 0.027
Identities = 16/40 (40%), Positives = 30/40 (75%)
Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
K D +EE ++K++K+ K++K+ +K KK+EE+ KE+E+
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 35.4 bits (82), Expect = 0.051
Identities = 13/43 (30%), Positives = 31/43 (72%)
Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
++E+ +++K++K+ K++K+ +K KK+E+ K+ E L++ K
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 35.0 bits (81), Expect = 0.065
Identities = 16/49 (32%), Positives = 32/49 (65%), Gaps = 4/49 (8%)
Query: 22 EEKEENE----QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
E+K + ++KE+ E+E+ + K++K+ +K KK+++KKKKE +
Sbjct: 536 EDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELE 584
Score = 34.2 bits (79), Expect = 0.11
Identities = 12/48 (25%), Positives = 30/48 (62%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
++K + +++K++K+ K++K+ +K ++Q K+ +E K K K +
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595
Score = 33.8 bits (78), Expect = 0.14
Identities = 21/79 (26%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK-------KKKKKKKKKKEEQSV- 68
+R KEE++ EQ+ K +EEK KK+ +K +K K K+++ K +E +
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLP 613
Query: 69 ------KEIEESLKERKEK 81
+EI + +++ K
Sbjct: 614 THDADGEEISKKERKKLSK 632
Score = 32.3 bits (74), Expect = 0.38
Identities = 22/93 (23%), Positives = 39/93 (41%), Gaps = 23/93 (24%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK-------KKKKKKKKK------------ 52
+K K K K + ++E+K+KE + EK K K+++ K
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLPTHDAD 618
Query: 53 ----KKKKKKKKKKKEEQSVKEIEESLKERKEK 81
KK++KK K+ ++ K EE L + +
Sbjct: 619 GEEISKKERKKLSKEYDKQAKLHEEYLAKGGKS 651
Score = 29.6 bits (67), Expect = 2.9
Identities = 10/40 (25%), Positives = 26/40 (65%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
K K++ +++K++K+ K++K+ + K+ E+ KE ++
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 37.6 bits (87), Expect = 0.002
Identities = 19/48 (39%), Positives = 25/48 (52%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+N E + E +D E +K KKKKKK KK KK KK K++
Sbjct: 73 QNGECHLTTSSGEPSHPDDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 31.4 bits (71), Expect = 0.25
Identities = 13/20 (65%), Positives = 14/20 (70%)
Query: 47 KKKKKKKKKKKKKKKKKEEQ 66
K KKKKKK KK KK KK +
Sbjct: 99 KPKKKKKKSKKTKKPKKSSK 118
Score = 31.0 bits (70), Expect = 0.30
Identities = 13/21 (61%), Positives = 15/21 (71%)
Query: 46 KKKKKKKKKKKKKKKKKKEEQ 66
K KKKKKK KK KK KK ++
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKK 119
Score = 29.5 bits (66), Expect = 1.2
Identities = 13/22 (59%), Positives = 16/22 (72%)
Query: 49 KKKKKKKKKKKKKKKEEQSVKE 70
K KKKKKK KK KK ++ S K+
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 28.3 bits (63), Expect = 2.9
Identities = 12/20 (60%), Positives = 13/20 (65%)
Query: 48 KKKKKKKKKKKKKKKKEEQS 67
K KKKKKK KK KK K+
Sbjct: 99 KPKKKKKKSKKTKKPKKSSK 118
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 38.3 bits (89), Expect = 0.002
Identities = 19/82 (23%), Positives = 47/82 (57%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K ++ + EEE+EE+E+E +E ED+E+E+ + ++++++ ++ K E++++
Sbjct: 41 EDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNIN 100
Query: 70 EIEESLKERKEKTLTEDSQCDT 91
+I S ++ + L +
Sbjct: 101 DIFNSTQDDNAQNLISKNYKKN 122
Score = 36.8 bits (85), Expect = 0.007
Identities = 17/74 (22%), Positives = 33/74 (44%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ + + ++ E EE E++E+E+ +EE N K +KK + Q++
Sbjct: 56 EEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQNLI 115
Query: 70 EIEESLKERKEKTL 83
E+ +KT
Sbjct: 116 SKNYKKNEKSKKTA 129
Score = 33.7 bits (77), Expect = 0.059
Identities = 20/88 (22%), Positives = 37/88 (42%), Gaps = 6/88 (6%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEE------KNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ EEE EE E E E+E VE+E + K +KK +++ +
Sbjct: 54 EEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQN 113
Query: 70 EIEESLKERKEKTLTEDSQCDTMIPVFK 97
I ++ K+ ++ T + T+ +
Sbjct: 114 LISKNYKKNEKSKKTAEDIVKTLFGLLN 141
Score = 31.8 bits (72), Expect = 0.33
Identities = 15/65 (23%), Positives = 27/65 (41%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
E+ ++E+ + + E+K D+ + K KK +K KK E VK + L
Sbjct: 81 EEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQNLISKNYKKNEKSKKTAEDIVKTLFGLL 140
Query: 76 KERKE 80
+
Sbjct: 141 NGNNQ 145
Score = 27.1 bits (60), Expect = 9.4
Identities = 21/85 (24%), Positives = 47/85 (55%), Gaps = 12/85 (14%)
Query: 16 EKRNKEEEKEENEQEE------------KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
EK+ +EE+K+EN E KE EDV++EK + ++++++ +++ ++ + +
Sbjct: 10 EKKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIE 69
Query: 64 EEQSVKEIEESLKERKEKTLTEDSQ 88
+E+ + E EE +E +E +
Sbjct: 70 DEEEIVEDEEEEEEDEEDNVDLKDI 94
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 38.9 bits (91), Expect = 0.002
Identities = 16/52 (30%), Positives = 26/52 (50%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
E + E E ++ V K + K K K K K K KK +++ ++ VK +E
Sbjct: 75 EPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEP 126
Score = 35.8 bits (83), Expect = 0.021
Identities = 16/58 (27%), Positives = 24/58 (41%), Gaps = 1/58 (1%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
E KE K E + K K K K KK +++ K++ K E + E + R
Sbjct: 83 EPPKEAPVVIPKP-EPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPAR 139
Score = 30.8 bits (70), Expect = 0.83
Identities = 18/53 (33%), Positives = 22/53 (41%), Gaps = 2/53 (3%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
E E E E KE + K K K K K K KK EEQ +E++
Sbjct: 72 VEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPKPKP--VKKVEEQPKREVK 122
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 40.1 bits (94), Expect = 0.002
Identities = 15/72 (20%), Positives = 38/72 (52%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ + +E +E E+ E +EK + EE + ++ ++ ++ + ++ K++ E+ +
Sbjct: 309 ERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLS 368
Query: 70 EIEESLKERKEK 81
+ E L+E E
Sbjct: 369 ALLEELEELFEA 380
Score = 39.7 bits (93), Expect = 0.002
Identities = 18/72 (25%), Positives = 38/72 (52%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K++IE +E E+ E EE E+ E E+ K++ ++K ++ ++ + + +
Sbjct: 327 ELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELA 386
Query: 70 EIEESLKERKEK 81
E+E L E + +
Sbjct: 387 ELEAELAEIRNE 398
Score = 38.5 bits (90), Expect = 0.005
Identities = 22/85 (25%), Positives = 50/85 (58%), Gaps = 1/85 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++ +E+ KE E+ + E EE E E E E K+ +++K++ +++ ++ + + + +
Sbjct: 846 ELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKE 905
Query: 70 EIEESLKERKEKTLTEDSQCDTMIP 94
EIE+ L+ER E+ + + + +P
Sbjct: 906 EIEK-LRERLEELEAKLERLEVELP 929
Score = 38.5 bits (90), Expect = 0.006
Identities = 29/151 (19%), Positives = 54/151 (35%), Gaps = 33/151 (21%)
Query: 7 GAGEKKKRIEK-RNKEEEKEENEQE--------EKEKEDVEEEKNKKKKKKKKKKKKKK- 56
G + K+R E+ K E EEN + EK+ E +E + K ++ ++ K + ++
Sbjct: 166 GVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELREL 225
Query: 57 -------KKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPS 109
K K+ +KE + ++E L+E E+ E + + I K
Sbjct: 226 ELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKS----------- 274
Query: 110 LTNSFREPSLEMSYQRYSHRQRQKSLIIVNL 140
L + + I L
Sbjct: 275 -----ELEELREELEELQEELLELKEEIEEL 300
Score = 37.4 bits (87), Expect = 0.013
Identities = 15/75 (20%), Positives = 41/75 (54%), Gaps = 3/75 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKE---DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
E ++ +E+ +E + + E EE E E E + + + ++ +++ ++ K+K + ++
Sbjct: 278 ELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKE 337
Query: 67 SVKEIEESLKERKEK 81
++E E L+E ++
Sbjct: 338 ELEERETLLEELEQL 352
Score = 35.1 bits (81), Expect = 0.061
Identities = 20/90 (22%), Positives = 49/90 (54%), Gaps = 4/90 (4%)
Query: 1 MRKRRGGAGEKKKRIEKR--NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
+R+R + + +E+R +E+ E ++E +E+E + EE + + ++ K++ ++K
Sbjct: 307 LRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEK 366
Query: 59 KKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
EE ++E+ E+L+E + E ++
Sbjct: 367 LSALLEE--LEELFEALREELAELEAELAE 394
Score = 34.7 bits (80), Expect = 0.074
Identities = 22/83 (26%), Positives = 48/83 (57%), Gaps = 2/83 (2%)
Query: 10 EKKKRIEKRNKEEEKEENE--QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
E K+ +E+ E+E+ E+E + E+EKE++EEE + + + + K++ +K +++ +E ++
Sbjct: 860 ELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEA 919
Query: 68 VKEIEESLKERKEKTLTEDSQCD 90
E E E+ L E+ +
Sbjct: 920 KLERLEVELPELEEELEEEYEDT 942
Score = 34.3 bits (79), Expect = 0.10
Identities = 21/80 (26%), Positives = 54/80 (67%), Gaps = 1/80 (1%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK-KKKKKKKKKKK 61
+RR A E++ ++ +E ++E E+ E+E E++EE+ ++ +++ ++ +K+ ++ K++
Sbjct: 806 ERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEEL 865
Query: 62 KKEEQSVKEIEESLKERKEK 81
++ E +E+E+ LKE +E+
Sbjct: 866 EELEAEKEELEDELKELEEE 885
Score = 33.5 bits (77), Expect = 0.20
Identities = 16/85 (18%), Positives = 40/85 (47%), Gaps = 6/85 (7%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKE--DVEEEKNKKKKKKKKKKKKKK----KKKKKKK 63
++ + + E + NE EE ++E +EE + ++ + K++ K + ++ +
Sbjct: 380 ALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQT 439
Query: 64 EEQSVKEIEESLKERKEKTLTEDSQ 88
E + + E E L+E+ E+ +
Sbjct: 440 ELEELNEELEELEEQLEELRDRLKE 464
Score = 33.5 bits (77), Expect = 0.20
Identities = 18/79 (22%), Positives = 43/79 (54%), Gaps = 7/79 (8%)
Query: 10 EKKKRIEKRNKEEEK--EENEQEEKEKEDVEEEKNK---KKKKKKKKKKKKKK--KKKKK 62
E K +E+ +E E+ EE + ++E E++E E + + ++ + + ++ ++ ++ K+
Sbjct: 271 ELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKE 330
Query: 63 KEEQSVKEIEESLKERKEK 81
K E +E+EE +E
Sbjct: 331 KIEALKEELEERETLLEEL 349
Score = 33.1 bits (76), Expect = 0.22
Identities = 16/86 (18%), Positives = 37/86 (43%), Gaps = 1/86 (1%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK-KKKKKKKKK 60
K E ++ E +E E+ E++ + EE + + +++ + + + +
Sbjct: 335 LKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAE 394
Query: 61 KKKEEQSVKEIEESLKERKEKTLTED 86
+ E + +K ESL+ER E+
Sbjct: 395 IRNELEELKREIESLEERLERLSERL 420
Score = 33.1 bits (76), Expect = 0.25
Identities = 17/87 (19%), Positives = 44/87 (50%), Gaps = 11/87 (12%)
Query: 10 EKKKRIEKRNKEEEKEENEQE---------EKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
E ++++E+ +E E E E E+E E++EEE + +++ ++ +++ + ++
Sbjct: 713 ELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEE 772
Query: 61 KKKEEQSVKEIEESLKERKEKTLTEDS 87
+ + EIEE ++R+ +
Sbjct: 773 ALAKLKE--EIEELEEKRQALQEELEE 797
Score = 33.1 bits (76), Expect = 0.28
Identities = 21/79 (26%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + +++ +E+E+ E E E E E E K++ +K +++ ++ + K ++ E +
Sbjct: 874 ELEDELKELEEEKEELEEELRELESELAE----LKEEIEKLRERLEELEAKLERLEVELP 929
Query: 70 EIEESLKERKEKTLTEDSQ 88
E+EE L+E E TL + +
Sbjct: 930 ELEEELEEEYEDTLETELE 948
Score = 32.0 bits (73), Expect = 0.57
Identities = 17/73 (23%), Positives = 48/73 (65%), Gaps = 2/73 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK-KKKKKKKKKKKKKKKKKKKKEEQSV 68
E ++++ EE +E+ + ++E E++EEE + +++ +++ + ++++++ EQ +
Sbjct: 772 EALAKLKEEI-EELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830
Query: 69 KEIEESLKERKEK 81
+E+EE ++E +EK
Sbjct: 831 EELEEEIEELEEK 843
Score = 32.0 bits (73), Expect = 0.66
Identities = 17/84 (20%), Positives = 41/84 (48%), Gaps = 2/84 (2%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEE--KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
R ++++ KE E +EE + E+E E+++EE + +K+ ++ K + ++ +
Sbjct: 221 ELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELR 280
Query: 60 KKKKEEQSVKEIEESLKERKEKTL 83
++ +E Q + E E +
Sbjct: 281 EELEELQEELLELKEEIEELEGEI 304
Score = 31.6 bits (72), Expect = 0.77
Identities = 24/107 (22%), Positives = 50/107 (46%), Gaps = 17/107 (15%)
Query: 2 RKRRGGAGEKKKRIEKRNKE-----EEKEEN-EQEEKEKEDVEEE-----------KNKK 44
RK E+ R+E+ +E EE E+ E+ + E E++ EE K +
Sbjct: 238 RKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEI 297
Query: 45 KKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
++ + + +++ ++ + E + ++E E LKE+ E E + +T
Sbjct: 298 EELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERET 344
Score = 30.8 bits (70), Expect = 1.2
Identities = 16/68 (23%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE--QSVKEIEE 73
E R + EE E +E K + EE+ ++ + + ++ +++ ++ +++ EE + ++E+EE
Sbjct: 706 ELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEE 765
Query: 74 SLKERKEK 81
L+ +E
Sbjct: 766 ELESLEEA 773
Score = 30.8 bits (70), Expect = 1.2
Identities = 15/88 (17%), Positives = 46/88 (52%), Gaps = 16/88 (18%)
Query: 10 EKKKRIE-KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK-------------KKKKK 55
K +R+E + + EE+ E E E+ + ++E E + +++ + + +++
Sbjct: 919 AKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEER 978
Query: 56 KKKKKKKKE--EQSVKEIEESLKERKEK 81
++ K ++E E++ +++ E ++E ++
Sbjct: 979 YEELKSQREDLEEAKEKLLEVIEELDKE 1006
Score = 30.5 bits (69), Expect = 1.8
Identities = 18/71 (25%), Positives = 43/71 (60%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++R++ +E E E +E E+E E E+ ++ ++K + +++ ++ +K+ E+ +
Sbjct: 804 EAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKE 863
Query: 70 EIEESLKERKE 80
E+EE E++E
Sbjct: 864 ELEELEAEKEE 874
Score = 30.1 bits (68), Expect = 2.0
Identities = 20/97 (20%), Positives = 54/97 (55%), Gaps = 7/97 (7%)
Query: 2 RKRRGGAGEKKKRIEKR--NKEEEKEE-NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK- 57
++ E+ ++++ R EEE EE E+ E+ +E +EE + + + ++ K K++
Sbjct: 722 KRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEI 781
Query: 58 ---KKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
++K++ ++ ++E+EE L+E + + + + ++
Sbjct: 782 EELEEKRQALQEELEELEEELEEAERRLDALERELES 818
Score = 29.7 bits (67), Expect = 2.9
Identities = 17/80 (21%), Positives = 41/80 (51%), Gaps = 4/80 (5%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK----KKKKKKKKKKKKKKKKEE 65
E+ +++ +E ++E E+ E++EE +++ + + + + + ++ K+E
Sbjct: 347 EELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREI 406
Query: 66 QSVKEIEESLKERKEKTLTE 85
+S++E E L ER E E
Sbjct: 407 ESLEERLERLSERLEDLKEE 426
Score = 29.3 bits (66), Expect = 4.2
Identities = 13/82 (15%), Positives = 42/82 (51%), Gaps = 6/82 (7%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKE------KEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E ++ +E+ +E E+ + EE E +E + + K + ++ ++K++ +++ ++ ++
Sbjct: 741 ELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEE 800
Query: 64 EEQSVKEIEESLKERKEKTLTE 85
E + + ++L+ E
Sbjct: 801 ELEEAERRLDALERELESLEQR 822
Score = 28.5 bits (64), Expect = 7.0
Identities = 11/71 (15%), Positives = 36/71 (50%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++ + + + EK E E + + E E ++ +++ ++ +++ ++ K++ +
Sbjct: 671 ELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEE 730
Query: 70 EIEESLKERKE 80
E+E+ +E
Sbjct: 731 ELEQLQSRLEE 741
Score = 28.1 bits (63), Expect = 8.3
Identities = 13/82 (15%), Positives = 43/82 (52%), Gaps = 1/82 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
++K + EE E +EE + + E + + + ++ K++ + +++ ++ E+ ++
Sbjct: 363 LEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSER-LE 421
Query: 70 EIEESLKERKEKTLTEDSQCDT 91
+++E LKE + + ++ +
Sbjct: 422 DLKEELKELEAELEELQTELEE 443
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 39.7 bits (92), Expect = 0.002
Identities = 22/82 (26%), Positives = 35/82 (42%), Gaps = 3/82 (3%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKKKKKKKKKKKKK 59
R G+KK R E E+ E + D + K KKK ++ ++ K+ KK
Sbjct: 605 SMDRILHGQKK-RAEGAVVFEKPLEATENFNPWLDRKMRRIKRIKKKAYRRIRRDKRLKK 663
Query: 60 KKKKEEQSVKEIEESLKERKEK 81
K +EE + + S K+R
Sbjct: 664 KMPEEENTQENHLGSEKKRHGG 685
Score = 29.3 bits (65), Expect = 3.8
Identities = 14/86 (16%), Positives = 40/86 (46%), Gaps = 2/86 (2%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
KK+ +R + +++ +++ E+E+ +E +KK+ K+ + ++++ +
Sbjct: 648 KKKAYRRIRRDKR--LKKKMPEEENTQENHLGSEKKRHGGVPDILLKEIEVEDDEKTPIL 705
Query: 72 EESLKERKEKTLTEDSQCDTMIPVFK 97
E E+ L+ +Q + + F
Sbjct: 706 SPGGDEEVEEGLSIKTQEELVALAFA 731
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 38.2 bits (88), Expect = 0.002
Identities = 20/72 (27%), Positives = 45/72 (62%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K+R + E+E QEE+ + EE+ +K ++++K++++++++ +K+KEE +
Sbjct: 59 ELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEAR 118
Query: 70 EIEESLKERKEK 81
EE+ + R E+
Sbjct: 119 AREEAERMRLER 130
Score = 36.2 bits (83), Expect = 0.009
Identities = 16/65 (24%), Positives = 44/65 (67%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
++R E++E+ EQE +E+E+ + + ++ K++ +++ +++++ +++EE+ +E EE
Sbjct: 31 KRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKA 90
Query: 76 KERKE 80
K + E
Sbjct: 91 KRKAE 95
Score = 36.2 bits (83), Expect = 0.010
Identities = 20/80 (25%), Positives = 50/80 (62%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R RR +++ R KEE+ + +EE+++E E+E+ +K+K++ + + +++ ++ +
Sbjct: 68 RLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMR 127
Query: 62 KKEEQSVKEIEESLKERKEK 81
+ E+ ++IE+ ERK++
Sbjct: 128 LEREKHFQQIEQERLERKKR 147
Score = 36.2 bits (83), Expect = 0.010
Identities = 19/75 (25%), Positives = 55/75 (73%), Gaps = 3/75 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK---KKKKKKKKKKKKKKKKKKKKEEQ 66
EK+++ ++ ++EE+E EQEE+++ + EE K + ++ +++++ +++++++ ++KEE+
Sbjct: 30 EKRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEK 89
Query: 67 SVKEIEESLKERKEK 81
+ ++ EE K+ +E+
Sbjct: 90 AKRKAEEEEKQEQEE 104
Score = 35.5 bits (81), Expect = 0.021
Identities = 22/87 (25%), Positives = 56/87 (64%), Gaps = 4/87 (4%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK----KKKKKKKKKKKKKK 58
++R A E++++ E+ +E+E+++ + E+ K EE+ ++ +++++++ ++K++K
Sbjct: 30 EKRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEK 89
Query: 59 KKKKKEEQSVKEIEESLKERKEKTLTE 85
K+K EE+ +E EE + +K+K E
Sbjct: 90 AKRKAEEEEKQEQEEQERIQKQKEEAE 116
Score = 33.5 bits (76), Expect = 0.071
Identities = 15/73 (20%), Positives = 45/73 (61%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
E+ +EEE E+E +++ + ++ ++++K++++++++ +K+K++ E +E E +
Sbjct: 67 ERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERM 126
Query: 76 KERKEKTLTEDSQ 88
+ +EK + Q
Sbjct: 127 RLEREKHFQQIEQ 139
Score = 32.0 bits (72), Expect = 0.25
Identities = 19/71 (26%), Positives = 44/71 (61%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ + + K EE + E+E EEE+ ++K++K K+K ++++K++++++E+ K
Sbjct: 51 EQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQK 110
Query: 70 EIEESLKERKE 80
+ EE+ +E
Sbjct: 111 QKEEAEARARE 121
Score = 31.2 bits (70), Expect = 0.47
Identities = 20/80 (25%), Positives = 50/80 (62%), Gaps = 1/80 (1%)
Query: 4 RRGGAGEKKKRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
RR +++ EK ++ EE+E+ EQEE+E+ ++E+ + + +++ ++ + +++K +
Sbjct: 76 RRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMRLEREKHFQ 135
Query: 63 KEEQSVKEIEESLKERKEKT 82
+ EQ E ++ L+E ++T
Sbjct: 136 QIEQERLERKKRLEEIMKRT 155
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 39.7 bits (92), Expect = 0.002
Identities = 25/73 (34%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 15 IEKRNKEEEKEENEQEEKEK-EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
+EK ++E +E+E++ K+ ED EE K ++ K+ + KKKK KK K+ +E V+ +
Sbjct: 207 VEKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHK 266
Query: 74 SLKERKEKTLTED 86
L R K +T++
Sbjct: 267 PLWTRDPKDVTKE 279
Score = 31.6 bits (71), Expect = 0.82
Identities = 18/55 (32%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 12 KKRIEKRNKEEEKEENEQ-EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+K + ++E+ K+ +E EE + E+V+E KKKK KK K+ K+ + + K +
Sbjct: 212 EKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHK 266
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 39.6 bits (93), Expect = 0.002
Identities = 20/80 (25%), Positives = 44/80 (55%), Gaps = 7/80 (8%)
Query: 13 KRIEKRNKEEEKE-EN---EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
K++ + EEE+E E E+ E+ +EEE + + K+ +K+ + + ++++E+
Sbjct: 177 KKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELR 236
Query: 69 KEIEESLK---ERKEKTLTE 85
K+ EE L+ ER+ + +
Sbjct: 237 KKYEEKLRQELERQAEAHEQ 256
Score = 39.2 bits (92), Expect = 0.003
Identities = 12/77 (15%), Positives = 40/77 (51%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
++ ++++ +E + E+ E E K+ + + +++K++ +KK ++K Q ++
Sbjct: 191 ERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQ 250
Query: 72 EESLKERKEKTLTEDSQ 88
E+ +++ + L +
Sbjct: 251 AEAHEQKLKNELALQAI 267
Score = 33.1 bits (76), Expect = 0.24
Identities = 22/79 (27%), Positives = 42/79 (53%), Gaps = 3/79 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E++ +KE E+ + E E+E E K ++K +++ +++ + ++K K E +++
Sbjct: 206 EEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQ 265
Query: 70 EIE---ESLKERKEKTLTE 85
IE E KE KEK E
Sbjct: 266 AIELQREFNKEIKEKVEEE 284
Score = 30.8 bits (70), Expect = 1.3
Identities = 14/83 (16%), Positives = 36/83 (43%), Gaps = 4/83 (4%)
Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
+ K +K + E + + + + EE + + K + K ++ V+ +EE
Sbjct: 67 SVTKLSKGPKDEPKQVKIPRQSGLSEEVAEAEAKATSVAAEATTPKS---IQELVEALEE 123
Query: 74 SLKERKEKTLTEDSQCDTMIPVF 96
L+E ++T ++ ++ +F
Sbjct: 124 LLEELLKETASDPVV-QELVSIF 145
Score = 28.1 bits (63), Expect = 7.6
Identities = 14/85 (16%), Positives = 38/85 (44%), Gaps = 2/85 (2%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K + + I +E ++ + E + E+ EE + K+K+++ K +++ +
Sbjct: 153 KEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLAR 212
Query: 63 KEEQSVKEIEESLKE--RKEKTLTE 85
E + ++ E R+++ L +
Sbjct: 213 LESKEAALEKQLRLEFEREKEELRK 237
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 39.4 bits (92), Expect = 0.002
Identities = 24/81 (29%), Positives = 47/81 (58%), Gaps = 1/81 (1%)
Query: 6 GGAGEK-KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
G E+ IEK + E++ E + E EK E+EK KK+ +++ ++ K++++ KK +
Sbjct: 507 GEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLEL 566
Query: 65 EQSVKEIEESLKERKEKTLTE 85
E+ +E ++LK+ E + E
Sbjct: 567 EKEAQEALKALKKEVESIIRE 587
Score = 39.4 bits (92), Expect = 0.003
Identities = 19/68 (27%), Positives = 43/68 (63%), Gaps = 3/68 (4%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE---QSVKEIEE 73
++ E++ E E+ KE+E +++E ++ ++ K++++ KK + +K+ +E KE+E
Sbjct: 524 EKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVES 583
Query: 74 SLKERKEK 81
++E KEK
Sbjct: 584 IIRELKEK 591
Score = 37.1 bits (86), Expect = 0.014
Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 13/83 (15%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK------------KKKKKKKKKK 57
+K + +EK KE+EK + E E +E E+++E + KK + KK+ + ++
Sbjct: 529 QKNEHLEKLLKEQEKLKKELE-QEMEELKERERNKKLELEKEAQEALKALKKEVESIIRE 587
Query: 58 KKKKKKEEQSVKEIEESLKERKE 80
K+KK + + E L + KE
Sbjct: 588 LKEKKIHKAKEIKSIEDLVKLKE 610
Score = 36.0 bits (83), Expect = 0.030
Identities = 23/73 (31%), Positives = 46/73 (63%), Gaps = 3/73 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
+K +E++N+ EK EQE K K+++E+ E+ K++++ KK + +K+ ++ K ++
Sbjct: 522 ALEKELEQKNEHLEKLLKEQE-KLKKELEQEMEELKERERNKKLELEKEAQEALKALKKE 580
Query: 68 VKEIEESLKERKE 80
V+ I LKE+K
Sbjct: 581 VESIIRELKEKKI 593
Score = 33.3 bits (76), Expect = 0.21
Identities = 26/74 (35%), Positives = 42/74 (56%), Gaps = 1/74 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
KK+ ++ + +E+E N++ E EKE E K KK+ + ++ K+KK K KE +S+
Sbjct: 544 LKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSI- 602
Query: 70 EIEESLKERKEKTL 83
E LKE K+K
Sbjct: 603 EDLVKLKETKQKIP 616
Score = 28.6 bits (64), Expect = 5.5
Identities = 17/74 (22%), Positives = 34/74 (45%), Gaps = 1/74 (1%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
I ++ K E E+ E + + K+ ++K + +K K+++K K+E + E
Sbjct: 498 IIEQAKTFYGEFKEEINVLIEKLSALE-KELEQKNEHLEKLLKEQEKLKKELEQEMEELK 556
Query: 75 LKERKEKTLTEDSQ 88
+ER +K E
Sbjct: 557 ERERNKKLELEKEA 570
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 38.0 bits (89), Expect = 0.002
Identities = 19/78 (24%), Positives = 46/78 (58%), Gaps = 4/78 (5%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK----KKKKKKKKKKKKKKKEEQSV 68
+ ++ R K + ++ E+E ++ +E++ K+ KK KK + KK KKK E +
Sbjct: 89 QSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKS 148
Query: 69 KEIEESLKERKEKTLTED 86
K+++++L+++++K ++
Sbjct: 149 KQLDKALEKKRKKNAGKE 166
Score = 33.0 bits (76), Expect = 0.13
Identities = 24/88 (27%), Positives = 40/88 (45%), Gaps = 10/88 (11%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEE---------NEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
R E +K ++K EEKEE + + + +D E E K+ KK++K+ K
Sbjct: 61 REKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIK 120
Query: 55 KKKKKK-KKKEEQSVKEIEESLKERKEK 81
+ KK KK E +++ E K+
Sbjct: 121 EGKKPYYLKKSEIKKLVLKKKFDELKKS 148
Score = 30.7 bits (70), Expect = 0.60
Identities = 19/67 (28%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 16 EKRNKEEEKEENE-QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
+ R KE E+ E ++ K+ E+ EE K + K + K K K ++++ ++ K+ +E
Sbjct: 59 DYREKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKEL 118
Query: 75 LKERKEK 81
+KE K+
Sbjct: 119 IKEGKKP 125
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 39.7 bits (93), Expect = 0.002
Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 3/75 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ--- 66
+K IE N ++E+ E E EE E + E KK++ + + + ++ ++K E+
Sbjct: 851 SIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEA 910
Query: 67 SVKEIEESLKERKEK 81
+++ + L E K K
Sbjct: 911 QIEKKRKRLSELKAK 925
Score = 34.3 bits (79), Expect = 0.12
Identities = 17/91 (18%), Positives = 47/91 (51%), Gaps = 5/91 (5%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK----KKKKKKK 57
+ E ++++ + E+E E E +E + E + K + K +K+ KK++
Sbjct: 808 SRIEARLREIEQKLNRLTLEKEYLEKEIQELQ-EQRIDLKEQIKSIEKEIENLNGKKEEL 866
Query: 58 KKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
+++ ++ E +++++E L + K++ ++Q
Sbjct: 867 EEELEELEAALRDLESRLGDLKKERDELEAQ 897
Score = 33.9 bits (78), Expect = 0.16
Identities = 21/83 (25%), Positives = 45/83 (54%), Gaps = 3/83 (3%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKE--DVEEEK-NKKKKKKKKKKKKKKKKKKKKKEE 65
GE +K IE+ +EEEK + EE E++ +E+E N K + K+ + + ++ ++ K E
Sbjct: 719 GEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLE 778
Query: 66 QSVKEIEESLKERKEKTLTEDSQ 88
+++ ++E L + + +
Sbjct: 779 EALNDLEARLSHSRIPEIQAELS 801
Score = 32.7 bits (75), Expect = 0.35
Identities = 12/72 (16%), Positives = 37/72 (51%), Gaps = 3/72 (4%)
Query: 12 KKRIEKRNKEEEKEENE--QEEKEKEDVEEE-KNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
+ E+ +E E+ E E D+++E + + ++ ++K ++ + + +K+ + +
Sbjct: 860 NGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRL 919
Query: 69 KEIEESLKERKE 80
E++ L+ +E
Sbjct: 920 SELKAKLEALEE 931
Score = 32.0 bits (73), Expect = 0.53
Identities = 18/88 (20%), Positives = 45/88 (51%), Gaps = 7/88 (7%)
Query: 1 MRKRRGGAGEKKKRIEKRNKE------EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
+++ + +RIE R E + + + EKE E +E+E+ K K++ ++ ++
Sbjct: 686 LKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEED 745
Query: 55 -KKKKKKKKKEEQSVKEIEESLKERKEK 81
+++ + + +KE+E ++E +E
Sbjct: 746 LSSLEQEIENVKSELKELEARIEELEED 773
Score = 32.0 bits (73), Expect = 0.62
Identities = 20/87 (22%), Positives = 45/87 (51%), Gaps = 7/87 (8%)
Query: 2 RKRRGGAGEKKKRIEK-RNKEEEKEEN-EQEEKEKEDVEEEKNKKKKKKKKK-----KKK 54
K E ++ IE+ +EK + E+ +E+E E + K+K++ + K+K
Sbjct: 173 EKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEK 232
Query: 55 KKKKKKKKKEEQSVKEIEESLKERKEK 81
+ +++K+ E+ + +EE L++ E+
Sbjct: 233 EALERQKEAIERQLASLEEELEKLTEE 259
Score = 31.6 bits (72), Expect = 0.84
Identities = 23/82 (28%), Positives = 44/82 (53%), Gaps = 6/82 (7%)
Query: 10 EKKKRIEKRNKEEEKEENEQE-EKEKEDVEEE---KNKKKKKKKKKKKKKKKKKKKKKEE 65
EK++++E+ +E EK E Q KEK + E K K+ +++K+ +++ +++ E
Sbjct: 195 EKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELE 254
Query: 66 QSVKEIEESLKE--RKEKTLTE 85
+ +EI E K E+ L E
Sbjct: 255 KLTEEISELEKRLEEIEQLLEE 276
Score = 31.2 bits (71), Expect = 1.1
Identities = 18/80 (22%), Positives = 44/80 (55%), Gaps = 8/80 (10%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK----KKKKK 62
G E ++ EK EE EE E+ + + + +EK ++ ++ +++++K ++ K+K+
Sbjct: 164 GVAEFDRKKEK--ALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKR 221
Query: 63 KEEQS--VKEIEESLKERKE 80
+ E +KE E ++++
Sbjct: 222 EYEGYELLKEKEALERQKEA 241
Score = 30.8 bits (70), Expect = 1.6
Identities = 19/76 (25%), Positives = 40/76 (52%), Gaps = 7/76 (9%)
Query: 16 EKRNKEEEKEENEQEEKEK-EDVEEEKNKK-----KKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ E++ EN +KE+ E+ EE + KK++ + + + ++ E+ ++
Sbjct: 847 EQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIE 906
Query: 70 EIEESLKERKEKTLTE 85
E+E + E+K K L+E
Sbjct: 907 ELEAQI-EKKRKRLSE 921
Score = 29.3 bits (66), Expect = 3.8
Identities = 18/95 (18%), Positives = 40/95 (42%), Gaps = 18/95 (18%)
Query: 11 KKKRIEKRNKEEEKEENEQE----EKEKEDVEEEKNKKKKK-------------KKKKKK 53
++ + + E+E E + E E E++EE+ +K ++ + + +
Sbjct: 740 EELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAE 799
Query: 54 KKKKKKKKKKEEQSVKEIEESLKERK-EKTLTEDS 87
K +++ + E ++EIE+ L EK E
Sbjct: 800 LSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKE 834
Score = 29.3 bits (66), Expect = 4.4
Identities = 15/85 (17%), Positives = 40/85 (47%), Gaps = 9/85 (10%)
Query: 10 EKKKRIEKRNKEEEKEENE----QEEKEK--EDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
+ ++++EK +E + + E QEE ++ E++ + + K + +++K+ K
Sbjct: 389 DYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKAL 448
Query: 64 EEQSVKEIEESLKERKEKTLTEDSQ 88
E +K+ E L++ + +
Sbjct: 449 E---IKKQEWKLEQLAADLSKYEQE 470
Score = 28.9 bits (65), Expect = 4.9
Identities = 17/83 (20%), Positives = 31/83 (37%), Gaps = 7/83 (8%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK-------KKKKKKKK 56
R ++ E + E + E E+ +E+K + KK+ K +
Sbjct: 409 DRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYE 468
Query: 57 KKKKKKKEEQSVKEIEESLKERK 79
++ KEE E E S +R+
Sbjct: 469 QELYDLKEEYDRVEKELSKLQRE 491
Score = 28.9 bits (65), Expect = 5.2
Identities = 21/76 (27%), Positives = 40/76 (52%), Gaps = 11/76 (14%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ RIE R +E E++ N EKE +E+E + ++++ K++ K S++
Sbjct: 805 EEVSRIEARLREIEQKLNR-LTLEKEYLEKEIQELQEQRIDLKEQIK----------SIE 853
Query: 70 EIEESLKERKEKTLTE 85
+ E+L +KE+ E
Sbjct: 854 KEIENLNGKKEELEEE 869
Score = 28.9 bits (65), Expect = 5.6
Identities = 15/70 (21%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+R + EE E ++E ++ ++EE + + + + K ++K +K K+E +K
Sbjct: 348 ERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRE 407
Query: 72 EESLKERKEK 81
+ L+E ++
Sbjct: 408 LDRLQEELQR 417
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 38.5 bits (90), Expect = 0.002
Identities = 15/88 (17%), Positives = 40/88 (45%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
A + ++ + N E + + E E E + +K +K+ + K++ ++ +K +
Sbjct: 83 SAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAE 142
Query: 67 SVKEIEESLKERKEKTLTEDSQCDTMIP 94
+ +EE + E +E+ S+ + +
Sbjct: 143 AEARLEEEVAEIREEGQELSSKREELKE 170
Score = 32.7 bits (75), Expect = 0.19
Identities = 16/82 (19%), Positives = 37/82 (45%), Gaps = 15/82 (18%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK---------------KKKKKKKKKKKKK 60
E+ + EEK ++E+E + E K++ +K +K+ + K++
Sbjct: 73 ERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKER 132
Query: 61 KKKEEQSVKEIEESLKERKEKT 82
++ E+++ E E L+E +
Sbjct: 133 LERLEKNLAEAEARLEEEVAEI 154
Score = 31.2 bits (71), Expect = 0.73
Identities = 18/91 (19%), Positives = 42/91 (46%), Gaps = 14/91 (15%)
Query: 10 EKKKRIEKRNKEEEKEENEQE---------EKEKEDVEEEKNK-----KKKKKKKKKKKK 55
+ RI++ K +K + E E E E ED+E + ++ ++ +++ K+ ++
Sbjct: 21 RLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEE 80
Query: 56 KKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
K K + E IE + + + +L ++
Sbjct: 81 KLSAVKDERELRALNIEIQIAKERINSLEDE 111
Score = 29.6 bits (67), Expect = 1.9
Identities = 17/82 (20%), Positives = 34/82 (41%), Gaps = 9/82 (10%)
Query: 10 EKKKRIEKRNKEEEKEENE--------QEEKEKEDVEEEKNKKKKKKKKKKK-KKKKKKK 60
K +E E E EN+ QE +E+ EEK K +++ + + +
Sbjct: 42 ALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIA 101
Query: 61 KKKEEQSVKEIEESLKERKEKT 82
K++ E+ E ++E ++
Sbjct: 102 KERINSLEDELAELMEEIEKLE 123
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 39.4 bits (92), Expect = 0.002
Identities = 18/53 (33%), Positives = 33/53 (62%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
N QE +E + ++K+KKK++ K K++K++ + E+ KE+ E LKE+
Sbjct: 337 PNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEELKEK 389
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 39.3 bits (92), Expect = 0.003
Identities = 17/57 (29%), Positives = 28/57 (49%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
E K+ I + E + +K E++ EE ++ K KKKK+KKK+ +K
Sbjct: 386 LSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEKFR 442
Score = 38.9 bits (91), Expect = 0.004
Identities = 14/50 (28%), Positives = 27/50 (54%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
E E + ++ + K+ +EE + + ++ K KKKK+KKK+ E+
Sbjct: 391 EAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
Score = 31.6 bits (72), Expect = 0.72
Identities = 17/78 (21%), Positives = 31/78 (39%), Gaps = 3/78 (3%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKE---DVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+K K + + K+ + + + + E K + K +K + KK E
Sbjct: 354 ELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIE 413
Query: 65 EQSVKEIEESLKERKEKT 82
E + IEE L + K+K
Sbjct: 414 EIREELIEEGLLKSKKKK 431
Score = 30.4 bits (69), Expect = 1.7
Identities = 13/64 (20%), Positives = 31/64 (48%), Gaps = 2/64 (3%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE--IEESLKERKEK 81
K +++ +++E + K +K + KK ++ E+ ++E ++ K+RK+K
Sbjct: 376 KGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKK 435
Query: 82 TLTE 85
E
Sbjct: 436 EWFE 439
Score = 29.7 bits (67), Expect = 2.8
Identities = 17/76 (22%), Positives = 29/76 (38%), Gaps = 6/76 (7%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKK--KKKKKKKKKKKKKKKKKKEE-- 65
E K + + +E + +K K E + K KK K K ++ + KE
Sbjct: 335 EGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIA 394
Query: 66 --QSVKEIEESLKERK 79
+S K E + +K
Sbjct: 395 YYESAKTALEKAEGKK 410
Score = 29.3 bits (66), Expect = 3.3
Identities = 18/100 (18%), Positives = 38/100 (38%), Gaps = 28/100 (28%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKED----------------------VEEEKNKKKKK 47
++ K++E + +++E E E E+ +E + + ++ K
Sbjct: 293 KELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKSVRLADFYGNEEIK 352
Query: 48 KKKKKKKK------KKKKKKKKEEQSVKEIEESLKERKEK 81
+ K K + KK KK + + ++ L E KE
Sbjct: 353 IELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEA 392
Score = 28.5 bits (64), Expect = 6.6
Identities = 14/40 (35%), Positives = 23/40 (57%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
E +K K+ + +KK +K+ KK + K E+ E+EE K
Sbjct: 276 ERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKA 315
Score = 28.1 bits (63), Expect = 9.2
Identities = 12/45 (26%), Positives = 28/45 (62%)
Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
++E+ + K K+ + +KK +K+ KK E +++ E+ L+E ++
Sbjct: 271 LDEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKA 315
Score = 28.1 bits (63), Expect = 9.2
Identities = 12/53 (22%), Positives = 33/53 (62%), Gaps = 1/53 (1%)
Query: 33 EKEDVEEEKNKKKKK-KKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
E++ +++ ++ +KK +K+ KK + K +K++ E + +++ E L+++ E
Sbjct: 276 ERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYA 328
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 37.8 bits (88), Expect = 0.003
Identities = 17/69 (24%), Positives = 34/69 (49%), Gaps = 3/69 (4%)
Query: 16 EKRNKEEEKEENEQE---EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
E+ KEE + E E E + V E + +K+ +++ +K+K + KE +S K+
Sbjct: 68 ERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDPR 127
Query: 73 ESLKERKEK 81
++ +K
Sbjct: 128 GGTQDVVDK 136
Score = 34.7 bits (80), Expect = 0.028
Identities = 13/60 (21%), Positives = 21/60 (35%), Gaps = 1/60 (1%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
AG KE E++ Q+ K D E K+ K + + K + E+
Sbjct: 89 AGAVAGESSADRKEAEQQGAAQKRKSCRDK-ERKSAKDPRGGTQDVVDKSQASLDYGEEE 147
Score = 32.0 bits (73), Expect = 0.24
Identities = 15/92 (16%), Positives = 40/92 (43%), Gaps = 7/92 (7%)
Query: 1 MRKRR----GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
+R+R A E++ ++ ++++ + + K+K + K++K K
Sbjct: 66 LRERELKEEAEA-EEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAK 124
Query: 57 KKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
+ + V + + SL +E+T +++Q
Sbjct: 125 DPRGGT--QDVVDKSQASLDYGEEETQQQEAQ 154
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 37.4 bits (87), Expect = 0.003
Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 3/68 (4%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK---KKKKKEEQSVKEIE 72
EE E+ + E + E + K K+K ++ K K KK+ K+K+KEE+ +K +
Sbjct: 40 PGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALA 99
Query: 73 ESLKERKE 80
E+ KER E
Sbjct: 100 EAEKERAE 107
Score = 36.6 bits (85), Expect = 0.005
Identities = 18/64 (28%), Positives = 39/64 (60%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ KR E R K +EK ++ + +KE+ E+EK K+++ K + +K++ + +K++ K
Sbjct: 57 AESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAK 116
Query: 70 EIEE 73
++E
Sbjct: 117 LMKE 120
Score = 34.7 bits (80), Expect = 0.025
Identities = 17/65 (26%), Positives = 40/65 (61%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
KR G + K+++ + + +KEE E+E++++E + + +K++ + +KKK + K
Sbjct: 59 SKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLM 118
Query: 62 KKEEQ 66
K+E++
Sbjct: 119 KEEKK 123
Score = 30.8 bits (70), Expect = 0.58
Identities = 15/51 (29%), Positives = 29/51 (56%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
KEK ++ K KK++ +K+K+K+++ K + E+ E+E+ E K
Sbjct: 68 AKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLM 118
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 37.0 bits (86), Expect = 0.003
Identities = 14/57 (24%), Positives = 27/57 (47%), Gaps = 9/57 (15%)
Query: 19 NKEEEKEENEQE---------EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
+ + K N + E+EK + +E K ++++ +K +K+ KK KK E
Sbjct: 71 DIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPV 127
Score = 34.7 bits (80), Expect = 0.019
Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 3/58 (5%)
Query: 10 EKKKRIEKR---NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+ K I R N E E E+ E KE ++ + ++ +K +K+ KK KK + + +
Sbjct: 73 KVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQGQ 130
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 38.9 bits (91), Expect = 0.004
Identities = 24/93 (25%), Positives = 38/93 (40%), Gaps = 17/93 (18%)
Query: 10 EKKKRIEKRNKEEEKEENEQE------EKEKEDVEEEKNKKK-----------KKKKKKK 52
E +K +E+ +E EK +E E K + E K + ++ K +
Sbjct: 515 EVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAKVEA 574
Query: 53 KKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
+ K KKKK E +LK++K K L E
Sbjct: 575 VDEVKFLKKKKGILDAGAFESTLKDKKNKVLPE 607
Score = 32.3 bits (74), Expect = 0.50
Identities = 12/65 (18%), Positives = 23/65 (35%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
G K +E + + +E E ++ K KKKK + +++
Sbjct: 543 GGKTSLLELKAQIVVLAHMGLPVPAEEAKVEAVDEVKFLKKKKGILDAGAFESTLKDKKN 602
Query: 69 KEIEE 73
K + E
Sbjct: 603 KVLPE 607
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 38.8 bits (91), Expect = 0.004
Identities = 19/76 (25%), Positives = 42/76 (55%), Gaps = 4/76 (5%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEK--EKEDVEEEKNKKKKKKKKK--KKKKKKKKKKKKEE 65
+ + +E+ KE E E + ++E+ + K + KK++ KK+K+ K+K +
Sbjct: 62 KLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLD 121
Query: 66 QSVKEIEESLKERKEK 81
+ +E+EE + E++E+
Sbjct: 122 EKEEELEELIAEQREE 137
Score = 38.0 bits (89), Expect = 0.007
Identities = 26/97 (26%), Positives = 51/97 (52%), Gaps = 10/97 (10%)
Query: 1 MRKRRGGAGEKKKRIE---KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
K+ G A E KRI K+ E K+E E KE+ + +++ K+++ + ++ +
Sbjct: 23 AEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLE 82
Query: 58 KKKKKKEEQ------SVKEIEESLKERKEKTLTEDSQ 88
++ ++EE S+ + EE+L E+KEK L+ +
Sbjct: 83 RRLLQREETLDRKMESLDKKEENL-EKKEKELSNKEK 118
Score = 37.2 bits (87), Expect = 0.010
Identities = 21/82 (25%), Positives = 42/82 (51%), Gaps = 8/82 (9%)
Query: 8 AGEKKKRIEKRNKEE---EKEENEQEEKEKED---VEEEKNKKKKKKKKKKKKKKKKKKK 61
A KK KEE + E E+E KE+ + E + ++++ +K + KK++
Sbjct: 45 AETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEE 104
Query: 62 K--KEEQSVKEIEESLKERKEK 81
K+E+ + E++L E++E+
Sbjct: 105 NLEKKEKELSNKEKNLDEKEEE 126
Score = 34.1 bits (79), Expect = 0.12
Identities = 23/83 (27%), Positives = 43/83 (51%), Gaps = 7/83 (8%)
Query: 10 EKKKRIEKRNKEEEKE----ENEQEEKEKEDVEEEKNKKKKKKKKKKKKK---KKKKKKK 62
E K+ + K E E+E NE + E+ ++ E+ +K + KK++ KK+K+
Sbjct: 55 EAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELS 114
Query: 63 KEEQSVKEIEESLKERKEKTLTE 85
+E+++ E EE L+E + E
Sbjct: 115 NKEKNLDEKEEELEELIAEQREE 137
Score = 31.4 bits (72), Expect = 0.85
Identities = 22/85 (25%), Positives = 45/85 (52%), Gaps = 3/85 (3%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEE-EKNKKKKKKKKKKKKKKKKK 59
+ K+ +K+K + + K +++E E EE E EE E+ +++ K+ ++ +
Sbjct: 99 LDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEEAKEILLEEVE 158
Query: 60 KKKKEEQS--VKEIEESLKERKEKT 82
++ + E + +KEIEE KE +K
Sbjct: 159 EEARHEAAKLIKEIEEEAKEEADKK 183
Score = 31.0 bits (71), Expect = 1.0
Identities = 15/62 (24%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 13 KRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+R+E+R + EE + + E +K++ EK +K+ K+K +K+++ ++ + +E+
Sbjct: 79 QRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREEL 138
Query: 72 EE 73
E
Sbjct: 139 ER 140
Score = 29.5 bits (67), Expect = 2.7
Identities = 19/108 (17%), Positives = 43/108 (39%), Gaps = 19/108 (17%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKK---KKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
E+ + + EE K+ + KK+ K++ K + + E+ +KE L+ + + L
Sbjct: 28 GSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQ 87
Query: 85 EDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRYSHRQRQ 132
+ D + K +E +LE + S++++
Sbjct: 88 REETLDRKMESLDK----------------KEENLEKKEKELSNKEKN 119
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 38.1 bits (89), Expect = 0.005
Identities = 13/54 (24%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
++ +EKR E+ KE+ + + + ++K + + ++KKK+KK K +++ +
Sbjct: 343 REELEKR-IEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 32.4 bits (74), Expect = 0.32
Identities = 12/52 (23%), Positives = 27/52 (51%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
E+ EK E ++E+ K K K ++ KK++ + +++++ K E +
Sbjct: 344 EELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 32.4 bits (74), Expect = 0.41
Identities = 12/57 (21%), Positives = 31/57 (54%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
+ + +EE E+ ++ ++ + K K ++ KK++ + ++KKKE+++ E
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERR 391
Score = 31.6 bits (72), Expect = 0.56
Identities = 15/53 (28%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKK-KKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
EE E+ ++ KEK K K + KK++ + ++KKK+KK + +
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 29.3 bits (66), Expect = 3.6
Identities = 14/53 (26%), Positives = 27/53 (50%), Gaps = 9/53 (16%)
Query: 34 KEDVEE------EKNKKKKKKKKKKKKKKK---KKKKKKEEQSVKEIEESLKE 77
+E++E+ EK K K K ++ KK+ + ++KK+E+ K L+
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 28.5 bits (64), Expect = 6.7
Identities = 13/60 (21%), Positives = 28/60 (46%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
E + E+ ++ +E+ K K K K K ++ KK++ + + KE + + R +
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394
Score = 28.1 bits (63), Expect = 8.6
Identities = 6/48 (12%), Positives = 26/48 (54%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
++ ++++++ + + + +K++ + KK+KK K +++ +
Sbjct: 348 KRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 38.5 bits (90), Expect = 0.005
Identities = 18/77 (23%), Positives = 30/77 (38%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K + + + EE+ +N E K + + KK K K K E + +
Sbjct: 849 EIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDE 908
Query: 70 EIEESLKERKEKTLTED 86
+ + K+ EKT E
Sbjct: 909 KTTKKKKKDLEKTDEEA 925
Score = 32.3 bits (74), Expect = 0.48
Identities = 16/72 (22%), Positives = 34/72 (47%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E +I + +E K Q E+ EE + +++K K++ K + + ++ E+ ++
Sbjct: 806 ELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQ 865
Query: 70 EIEESLKERKEK 81
I E+ E E
Sbjct: 866 NINENKNEFVEF 877
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 35.4 bits (82), Expect = 0.005
Identities = 13/31 (41%), Positives = 18/31 (58%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
E + KKKKKKKKKK + K +++ E
Sbjct: 19 EADLVKKKKKKKKKKAEDTAATAKAKKATAE 49
Score = 34.6 bits (80), Expect = 0.008
Identities = 13/35 (37%), Positives = 17/35 (48%)
Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
KKKKKKKKKK + ++ K E + E
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56
Score = 34.3 bits (79), Expect = 0.011
Identities = 19/51 (37%), Positives = 25/51 (49%), Gaps = 6/51 (11%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE----IEESLKER 78
K + D+ K KKKKKKKK + K KK + V E +E L E+
Sbjct: 17 KTEADLV--KKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEGFLGEK 65
Score = 28.5 bits (64), Expect = 1.4
Identities = 13/43 (30%), Positives = 22/43 (51%)
Query: 44 KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
K + KKKKKKKKKK + + K + + ++ E + +
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFME 59
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97
family. This model family represents the major capsid
protein component of the heads (capsids) of
bacteriophage HK97, phi-105, P27, and related phage.
This model represents one of several analogous families
lacking detectable sequence similarity. The gene
encoding this component is typically located in an
operon encoding the small and large terminase subunits,
the portal protein and the prohead or maturation
protease [Mobile and extrachromosomal element
functions, Prophage functions].
Length = 384
Score = 38.1 bits (89), Expect = 0.006
Identities = 17/82 (20%), Positives = 38/82 (46%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+++ R E K + K ++E+ E E+E+ E + ++EE +K + + ++ + +
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60
Query: 61 KKKEEQSVKEIEESLKERKEKT 82
K EE +E K +
Sbjct: 61 KPAASGEGGGGEEEEEEAKAEA 82
Score = 31.9 bits (73), Expect = 0.52
Identities = 11/73 (15%), Positives = 31/73 (42%), Gaps = 3/73 (4%)
Query: 1 MRKRRGGAGEKKKRIEKRNKE---EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
+ + K+ I+K + E E+ +E E K E ++++++ K + + +
Sbjct: 27 AEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAKAEAAEFR 86
Query: 58 KKKKKKEEQSVKE 70
+ ++ +E
Sbjct: 87 AYLRGGDDALAEE 99
Score = 29.6 bits (67), Expect = 2.3
Identities = 10/66 (15%), Positives = 30/66 (45%), Gaps = 6/66 (9%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
KE E + +++ + +K + +++K + K++ + +++ + +R E+
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEE-----IDKLDAEI-DRLEEL 54
Query: 83 LTEDSQ 88
L E
Sbjct: 55 LDELEA 60
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 36.4 bits (85), Expect = 0.006
Identities = 19/77 (24%), Positives = 44/77 (57%), Gaps = 8/77 (10%)
Query: 12 KKRIEKRNKE----EEKEENEQEEKEKEDV---EEEKNKKKKKKKKKKKKKKKKKKKKKE 64
K RI K + E E +EE++ + V E E+ ++++ +KK+ K++ +++K
Sbjct: 45 KARIAK-DPEFQYLNEDIAWLKEERDDKTVSLNEAERKAEREEDEKKRLKRENERRKALG 103
Query: 65 EQSVKEIEESLKERKEK 81
E+ +K +++ K+++E
Sbjct: 104 EEPLKSLDDLPKDKEEP 120
Score = 30.6 bits (70), Expect = 0.50
Identities = 13/46 (28%), Positives = 24/46 (52%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
N+ E K E E++EK++ E E+ K ++ K K K++ +
Sbjct: 76 NEAERKAEREEDEKKRLKRENERRKALGEEPLKSLDDLPKDKEEPD 121
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 38.2 bits (89), Expect = 0.006
Identities = 18/71 (25%), Positives = 42/71 (59%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
++ E R EEE E + E+ ++ EEK +K + ++ ++ K+K + ++ ++ ++
Sbjct: 503 IEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELR 562
Query: 70 EIEESLKERKE 80
++E+ L+E KE
Sbjct: 563 QLEDRLQELKE 573
Score = 37.8 bits (88), Expect = 0.010
Identities = 16/72 (22%), Positives = 42/72 (58%), Gaps = 2/72 (2%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
+ ++ + +E ++ ENE++ +EK EE + +++ ++ +++ ++ KK + EQ +
Sbjct: 661 EKVEELEAEIRRELQRIENEEQLEEKL--EELEQLEEELEQLREELEELLKKLGEIEQLI 718
Query: 69 KEIEESLKERKE 80
+E+E E +E
Sbjct: 719 EELESRKAELEE 730
Score = 37.0 bits (86), Expect = 0.017
Identities = 17/87 (19%), Positives = 44/87 (50%), Gaps = 2/87 (2%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R+R +K K +E+R + E+ E E E E + +++ + + +K + + +
Sbjct: 592 RERLKELKKKLKELEERLSQLEELLQSLELSEAE--NELEEAEEELESELEKLNLQAELE 649
Query: 62 KKEEQSVKEIEESLKERKEKTLTEDSQ 88
+ + +++E+EE ++E + + E +
Sbjct: 650 ELLQAALEELEEKVEELEAEIRRELQR 676
Score = 36.7 bits (85), Expect = 0.020
Identities = 22/74 (29%), Positives = 41/74 (55%), Gaps = 2/74 (2%)
Query: 10 EKKKRIE--KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
EK +R+E +R EE +EE E E++EE K K +++ +K ++K +K + E +
Sbjct: 288 EKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEE 347
Query: 68 VKEIEESLKERKEK 81
+ E + L + E+
Sbjct: 348 LAEEKNELAKLLEE 361
Score = 36.3 bits (84), Expect = 0.026
Identities = 16/72 (22%), Positives = 43/72 (59%), Gaps = 1/72 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ + +E+ +E E++ EE E+E +EE + + + + ++ ++ +K K E+ ++
Sbjct: 274 EELRELERLLEELEEKIERLEELERE-IEELEEELEGLRALLEELEELLEKLKSLEERLE 332
Query: 70 EIEESLKERKEK 81
++EE L++ + +
Sbjct: 333 KLEEKLEKLESE 344
Score = 35.5 bits (82), Expect = 0.043
Identities = 21/73 (28%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K +E+R KE E+ EE EKE + + K+ ++ ++ K++ + E+ +
Sbjct: 354 ELAKLLEERLKELEERL---EELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQE 410
Query: 70 EIEESLKERKEKT 82
E+EE KE +E
Sbjct: 411 ELEELEKELEELE 423
Score = 35.5 bits (82), Expect = 0.043
Identities = 19/73 (26%), Positives = 39/73 (53%), Gaps = 2/73 (2%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
E K + K EE E E E + E +++ K+ KK ++ ++++++++ EQ
Sbjct: 176 LKEVIKEAKA--KIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQE 233
Query: 68 VKEIEESLKERKE 80
++ +EE L E +E
Sbjct: 234 IEALEERLAELEE 246
Score = 35.1 bits (81), Expect = 0.057
Identities = 16/72 (22%), Positives = 43/72 (59%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++++EK E+ E +E+ + + ++EE + + + ++ K+ ++ + + ++ ++
Sbjct: 530 ELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELE 589
Query: 70 EIEESLKERKEK 81
E+ E LKE K+K
Sbjct: 590 ELRERLKELKKK 601
Score = 35.1 bits (81), Expect = 0.059
Identities = 18/75 (24%), Positives = 36/75 (48%), Gaps = 1/75 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS-V 68
+ R EE E+ + E+ E +EE+ K + + ++ ++K + K +E +
Sbjct: 307 LEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKEL 366
Query: 69 KEIEESLKERKEKTL 83
+E E L++ EK L
Sbjct: 367 EERLEELEKELEKAL 381
Score = 35.1 bits (81), Expect = 0.063
Identities = 20/77 (25%), Positives = 42/77 (54%), Gaps = 2/77 (2%)
Query: 8 AGEKKKRIEKRNKEE--EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+K + I++ +EE E+E EE+ E EE++ ++ K + + + + + K E
Sbjct: 214 ELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIRE 273
Query: 66 QSVKEIEESLKERKEKT 82
+ ++E+E L+E +EK
Sbjct: 274 EELRELERLLEELEEKI 290
Score = 34.4 bits (79), Expect = 0.10
Identities = 15/77 (19%), Positives = 43/77 (55%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K++++ + +EE + E +E +++ EE + +K++ ++ +++ K+ KK+ + ++
Sbjct: 547 ELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELE 606
Query: 70 EIEESLKERKEKTLTED 86
E L+E + +
Sbjct: 607 ERLSQLEELLQSLELSE 623
Score = 34.4 bits (79), Expect = 0.11
Identities = 20/83 (24%), Positives = 40/83 (48%), Gaps = 11/83 (13%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE---- 65
E+ +++E R +E ++ E E E+ +++ K+ KKK K+ +++ + EE
Sbjct: 559 EELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQS 618
Query: 66 -------QSVKEIEESLKERKEK 81
++E EE L+ EK
Sbjct: 619 LELSEAENELEEAEEELESELEK 641
Score = 34.0 bits (78), Expect = 0.15
Identities = 22/76 (28%), Positives = 43/76 (56%), Gaps = 1/76 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++ + R KEE + E EE EKE E E+ + + ++ K++ ++K +K E ++
Sbjct: 485 ELEEEL-SREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLE 543
Query: 70 EIEESLKERKEKTLTE 85
E+EE ++ + + L E
Sbjct: 544 ELEELKEKLQLQQLKE 559
Score = 33.6 bits (77), Expect = 0.18
Identities = 21/72 (29%), Positives = 48/72 (66%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
++R+EK ++ EK E+E EE +E E K +++ K+ +++ ++ +K+ +K + +K
Sbjct: 326 SLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLK 385
Query: 70 EIEESLKERKEK 81
++EE+++E KE+
Sbjct: 386 QLEEAIQELKEE 397
Score = 33.6 bits (77), Expect = 0.20
Identities = 17/89 (19%), Positives = 49/89 (55%), Gaps = 6/89 (6%)
Query: 10 EKKKRIEKRNK---EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK---KKKKKKKKKKK 63
E+K+R+E+ E E E E + +E++ E + ++ ++K + + +++ ++ ++
Sbjct: 246 EEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEE 305
Query: 64 EEQSVKEIEESLKERKEKTLTEDSQCDTM 92
E + ++ + E L+E EK + + + + +
Sbjct: 306 ELEGLRALLEELEELLEKLKSLEERLEKL 334
Score = 33.2 bits (76), Expect = 0.24
Identities = 20/81 (24%), Positives = 45/81 (55%), Gaps = 2/81 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + R E E+E E E+E E ++EE ++ ++K +K + ++ ++ KE+ ++
Sbjct: 496 EAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQ 555
Query: 70 EIEESLK--ERKEKTLTEDSQ 88
+++E L+ E + + L E +
Sbjct: 556 QLKEELRQLEDRLQELKELLE 576
Score = 31.3 bits (71), Expect = 1.0
Identities = 20/73 (27%), Positives = 44/73 (60%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
E ++ +EK EE+ E +E+ EK + E E+ ++K + K +++ K+ +++ E+
Sbjct: 314 LEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEEL 373
Query: 68 VKEIEESLKERKE 80
KE+E++L+ K+
Sbjct: 374 EKELEKALERLKQ 386
Score = 31.3 bits (71), Expect = 1.0
Identities = 15/87 (17%), Positives = 43/87 (49%), Gaps = 8/87 (9%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEK--------EDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+K+ + + ++ EE QE KE+ E+++EE + +K+ ++ +++ ++ +++
Sbjct: 374 EKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEI 433
Query: 63 KEEQSVKEIEESLKERKEKTLTEDSQC 89
K+ + ES + + +C
Sbjct: 434 KKLEEQINQLESKELMIAELAGAGEKC 460
Score = 31.3 bits (71), Expect = 1.1
Identities = 15/76 (19%), Positives = 41/76 (53%), Gaps = 3/76 (3%)
Query: 9 GEKKKRIEKRNKEEE---KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
E +K + + + E +EE +E++E E EE + +K+ ++ +++ + + ++ +
Sbjct: 469 EEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALK 528
Query: 66 QSVKEIEESLKERKEK 81
+ ++E E L+ E+
Sbjct: 529 EELEEKLEKLENLLEE 544
Score = 30.5 bits (69), Expect = 1.5
Identities = 15/72 (20%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 13 KRIEKRNKE--EEKEENEQEEKE-KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ +E+R E EEKE E+ + E E K ++++ ++ ++ ++ +++ + ++
Sbjct: 235 EALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLE 294
Query: 70 EIEESLKERKEK 81
E+E ++E +E+
Sbjct: 295 ELEREIEELEEE 306
Score = 30.1 bits (68), Expect = 2.3
Identities = 18/65 (27%), Positives = 42/65 (64%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
+ +E EKE E E E E++EEE +++K++ + +++ ++ +K+ ++ E+ + E+ E +
Sbjct: 466 ELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEE 525
Query: 77 ERKEK 81
KE+
Sbjct: 526 ALKEE 530
Score = 29.7 bits (67), Expect = 3.0
Identities = 11/66 (16%), Positives = 35/66 (53%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
+ + EEK E + E+ + +EK + ++ K++ ++ + + ++ K+ + ++ +
Sbjct: 526 ALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRK 585
Query: 76 KERKEK 81
+E +E
Sbjct: 586 EELEEL 591
Score = 29.7 bits (67), Expect = 3.0
Identities = 17/73 (23%), Positives = 35/73 (47%), Gaps = 3/73 (4%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
++ E EE EE ++ E E + EE +K + ++ KK+ +K +K + ++E
Sbjct: 691 EQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALEL---LEE 747
Query: 71 IEESLKERKEKTL 83
+ E L + +
Sbjct: 748 LREKLGKAGLRAD 760
Score = 29.0 bits (65), Expect = 5.2
Identities = 15/73 (20%), Positives = 44/73 (60%), Gaps = 1/73 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ +++ + + EE + EE E++ VEE + + +++ ++ + +++ ++K ++ EQ +
Sbjct: 637 SELEKLNLQAELEELLQAALEELEEK-VEELEAEIRRELQRIENEEQLEEKLEELEQLEE 695
Query: 70 EIEESLKERKEKT 82
E+E+ +E +E
Sbjct: 696 ELEQLREELEELL 708
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
This presumed domain is found at the N-terminus of
Ribosomal L30 proteins and has been termed RL30NT or
NUC018.
Length = 71
Score = 34.9 bits (81), Expect = 0.006
Identities = 17/41 (41%), Positives = 22/41 (53%), Gaps = 2/41 (4%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
K KK+ KK +KKK+K K+ E+ VKE ER E
Sbjct: 14 AKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRA--AERDE 52
Score = 29.5 bits (67), Expect = 0.40
Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE--RKEKTLTE 85
++N+K + K+ KK+ KK +KKK + K E+ +KE E+
Sbjct: 7 KRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIR 54
Score = 26.8 bits (60), Expect = 4.4
Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
KK+K+ +K + K+ KK+ KK + K + +R EK
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAARKKKRKL--IFKRAEK 40
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 37.4 bits (86), Expect = 0.006
Identities = 19/81 (23%), Positives = 50/81 (61%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+KKK + R ++ EK E+EE E+ E + +K +++ +++++++++++ EEQS
Sbjct: 116 KKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDD 175
Query: 70 EIEESLKERKEKTLTEDSQCD 90
E +++ + +T ++D + +
Sbjct: 176 SEHEIIEQDESETESDDDKTE 196
Score = 29.3 bits (65), Expect = 2.3
Identities = 16/77 (20%), Positives = 49/77 (63%), Gaps = 8/77 (10%)
Query: 14 RIEKRNKEEEKEENEQEEK---EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
RI + + +K++ ++++ +E +E+ + ++++ +++ + +++++K ++E+Q
Sbjct: 102 RIRDQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQ---- 157
Query: 71 IEESLKERKEKTLTEDS 87
EE +ER+E+T+ E S
Sbjct: 158 -EEQEREREEQTIEEQS 173
Score = 28.1 bits (62), Expect = 5.1
Identities = 12/60 (20%), Positives = 36/60 (60%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A ++++ +E+R + E +EE E +++E+ E E+ ++ +++ + + ++ + E +S
Sbjct: 131 ARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETES 190
Score = 28.1 bits (62), Expect = 5.9
Identities = 11/53 (20%), Positives = 31/53 (58%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+R E++ E E +E ++ E+E++ +EE+ + + + ++ + + + K E+
Sbjct: 145 ERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESDDDKTEK 197
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region
covers both the Brf homology II and III regions. This
region is involved in binding TATA binding protein.
Length = 95
Score = 35.3 bits (82), Expect = 0.006
Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 3/53 (5%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE-IEESLKERK 79
EQEEKE + +E N KKKKK+ K KKK+ + + E +++ L+++K
Sbjct: 31 EQEEKELKQKADEGNNSGKKKKKR--KAKKKRDEAGPASTAAEAVKKMLQKKK 81
Score = 34.2 bits (79), Expect = 0.015
Identities = 20/72 (27%), Positives = 33/72 (45%), Gaps = 11/72 (15%)
Query: 27 NEQEEKEKEDV-----------EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
+E+E K KE + +EEK K+K + KKKKK+K K+++ +
Sbjct: 10 SEEEVKLKERIWDELNKDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTA 69
Query: 76 KERKEKTLTEDS 87
E +K L +
Sbjct: 70 AEAVKKMLQKKK 81
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 38.2 bits (89), Expect = 0.006
Identities = 16/85 (18%), Positives = 42/85 (49%), Gaps = 8/85 (9%)
Query: 5 RGGAGEKKKRIEKRNKE------EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK--K 56
+ A EK + KE ++ + + E+E+ EEE+ +K+++ + +K
Sbjct: 43 QKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDN 102
Query: 57 KKKKKKKEEQSVKEIEESLKERKEK 81
+ + ++ E+++ E L+E +++
Sbjct: 103 LENQLEEREKALSARELELEELEKQ 127
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 36.9 bits (86), Expect = 0.007
Identities = 22/91 (24%), Positives = 47/91 (51%), Gaps = 11/91 (12%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE----- 64
+ E+ K EE+E E+E +E+ ++EEE+ + + + + ++ K K + + E
Sbjct: 11 PDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQAEFENLR 70
Query: 65 EQSVKEIEESLKERKEKTLTEDSQCDTMIPV 95
+++ +E EE+ K EK + ++PV
Sbjct: 71 KRTEREREEAKKYAIEKFAKD------LLPV 95
Score = 35.0 bits (81), Expect = 0.028
Identities = 17/72 (23%), Positives = 43/72 (59%), Gaps = 2/72 (2%)
Query: 12 KKRIEKRNKEEEKEENEQEEKE--KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K++ + EE EE E+ E+E +E+ EE+N+ ++++++ + + + ++ K + +
Sbjct: 4 KEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQ 63
Query: 70 EIEESLKERKEK 81
E+L++R E+
Sbjct: 64 AEFENLRKRTER 75
Score = 31.5 bits (72), Expect = 0.41
Identities = 15/58 (25%), Positives = 40/58 (68%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
+ EQ+ +E + E E+ +K ++++ ++++ +++ + ++E+Q + E+E L+E K+K L
Sbjct: 3 DKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYL 60
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 35.7 bits (83), Expect = 0.007
Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 23/74 (31%)
Query: 16 EKRNKEEEKEENEQEEKEKED--------------VEEEKNKKKKKKKKKKKKKKKKKK- 60
+KR +EE++E E + ED EE + K K+K+KKKKKKK+ +
Sbjct: 31 DKREEEEKEEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELED 90
Query: 61 --------KKKEEQ 66
KKKEE
Sbjct: 91 FYRFQIREKKKEEL 104
Score = 27.2 bits (61), Expect = 5.7
Identities = 16/70 (22%), Positives = 33/70 (47%), Gaps = 19/70 (27%)
Query: 21 EEEKEENEQEEKEKEDVEEE--------KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
++ +EE ++E KE + +E +K K +++ + K K+K+KK+
Sbjct: 31 DKREEEEKEEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKK-------- 82
Query: 73 ESLKERKEKT 82
K++KE
Sbjct: 83 ---KKKKELE 89
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 37.1 bits (86), Expect = 0.007
Identities = 28/102 (27%), Positives = 45/102 (44%), Gaps = 15/102 (14%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES-----LKERKEKTLTE 85
+ K++ E+ + ++ K+KKKKK KKKK K + +EES + E+K +TL+
Sbjct: 73 QAGKKETEDWFSPNQENKQKKKKKDKKKKSPKSLLLNFVTVEESKPDDEVHEQKSETLSL 132
Query: 86 DSQCDTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRYS 127
+ + G Y P T P L Y+ S
Sbjct: 133 PIELEEQSM---------GRYLP-TTFKPGSPDLARHYKSAS 164
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 38.3 bits (89), Expect = 0.007
Identities = 18/60 (30%), Positives = 29/60 (48%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
K ++K +KE+ K E +E+ ++ E KK ++ KK KK KK +S E
Sbjct: 1163 KELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEE 1222
Score = 36.0 bits (83), Expect = 0.035
Identities = 21/97 (21%), Positives = 37/97 (38%), Gaps = 12/97 (12%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK------------KKKKKKK 50
RG +G KK + K+ ++ ++ E E EE K K +
Sbjct: 1186 AARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAG 1245
Query: 51 KKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
KKK K+K+EE + ++++ L + S
Sbjct: 1246 AKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQS 1282
Score = 34.5 bits (79), Expect = 0.099
Identities = 17/89 (19%), Positives = 36/89 (40%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
+EK + +KE+ + EE ++ + KK ++ KK KK + E E +
Sbjct: 1161 LEKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETT 1220
Query: 75 LKERKEKTLTEDSQCDTMIPVFKKGVLYK 103
+ + ++ + + P + G K
Sbjct: 1221 EETYGSSAMETENVAEVVKPKGRAGAKKK 1249
Score = 33.7 bits (77), Expect = 0.16
Identities = 14/62 (22%), Positives = 32/62 (51%), Gaps = 7/62 (11%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK-------KKKKKKKKKKKEEQSVKEIE 72
K+ + E E ++ +KED + E+ ++K ++ + K ++ KK +++ K+
Sbjct: 1156 KDLDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKAS 1215
Query: 73 ES 74
ES
Sbjct: 1216 ES 1217
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 37.7 bits (87), Expect = 0.008
Identities = 18/64 (28%), Positives = 36/64 (56%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
E + +EE + ++ E E + ++ + + K K K KK+K +++EE+ +K I S
Sbjct: 492 EDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSN 551
Query: 76 KERK 79
K++K
Sbjct: 552 KQKK 555
Score = 37.0 bits (85), Expect = 0.013
Identities = 21/78 (26%), Positives = 40/78 (51%), Gaps = 8/78 (10%)
Query: 14 RIEKRNKE----EEKEENEQEEKEKEDVEEE----KNKKKKKKKKKKKKKKKKKKKKKEE 65
R E E EE +++++E + ++++E E K + + K K K KK+K EE
Sbjct: 479 RFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEE 538
Query: 66 QSVKEIEESLKERKEKTL 83
+ K+++ + K+K L
Sbjct: 539 EEEKKLKMIMMSNKQKKL 556
Score = 36.6 bits (84), Expect = 0.021
Identities = 21/61 (34%), Positives = 36/61 (59%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
E +K+ N+ + K+++ EEE+ KK K K+KK KK K +E E+LK++K+
Sbjct: 520 EADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKK 579
Query: 81 K 81
+
Sbjct: 580 Q 580
Score = 32.7 bits (74), Expect = 0.33
Identities = 19/72 (26%), Positives = 37/72 (51%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K + K ++ K + E+EEK+ + + +KK KK K KK+++ + ++ K
Sbjct: 520 EADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKK 579
Query: 70 EIEESLKERKEK 81
+I + K +K
Sbjct: 580 QIAKQKKLDSKK 591
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 37.8 bits (87), Expect = 0.008
Identities = 19/77 (24%), Positives = 38/77 (49%), Gaps = 6/77 (7%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK------KKKKKKKKKKKEEQSVKEIEES 74
++K E E+ +++ E ++N+K+++ KK K K KK+ KK ++ IE +
Sbjct: 426 SDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERT 485
Query: 75 LKERKEKTLTEDSQCDT 91
++E L + D
Sbjct: 486 AASKEELELIKADDEDD 502
Score = 34.0 bits (77), Expect = 0.14
Identities = 19/68 (27%), Positives = 32/68 (47%), Gaps = 2/68 (2%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK--KKKKKKKEE 65
EK R + +E KE ++ K+ DV+ + K+ KK KK K + K+E
Sbjct: 433 TIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERTAASKEEL 492
Query: 66 QSVKEIEE 73
+ +K +E
Sbjct: 493 ELIKADDE 500
Score = 32.8 bits (74), Expect = 0.32
Identities = 13/62 (20%), Positives = 32/62 (51%), Gaps = 2/62 (3%)
Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
NE E +K + EK +K +++++ +K+++ KK K + S ++++ + +K
Sbjct: 422 NESESDDKLETTIEKLDRKLRERQENRKERQLKKTK--DDSDVDLKDKKESINKKNKKGK 479
Query: 87 SQ 88
Sbjct: 480 HA 481
Score = 31.7 bits (71), Expect = 0.67
Identities = 30/100 (30%), Positives = 45/100 (45%), Gaps = 11/100 (11%)
Query: 10 EKKKRIEKRNKEEE---------KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+KK+ I K+NK+ + KEE E + + ED E+ + K K +K KK +K
Sbjct: 466 DKKESINKKNKKGKHAIERTAASKEELELIKADDEDDEQLDHFDMKSILKAEKFKKNRKL 525
Query: 61 KKKEEQSVKEIEESLKERKEKTLTEDSQ--CDTMIPVFKK 98
KKK + K+ + + ED D P FKK
Sbjct: 526 KKKASNLEEGFVFDPKDPRFVAIFEDHNFAIDPTHPEFKK 565
Score = 28.2 bits (62), Expect = 8.6
Identities = 18/90 (20%), Positives = 45/90 (50%), Gaps = 9/90 (10%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K E+++ ++R ++ K++++ + K+K++ +KNKK K ++ K++ +
Sbjct: 436 KLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERTAASKEELELI 495
Query: 63 KEEQ---------SVKEIEESLKERKEKTL 83
K + +K I ++ K +K + L
Sbjct: 496 KADDEDDEQLDHFDMKSILKAEKFKKNRKL 525
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 37.7 bits (87), Expect = 0.008
Identities = 15/48 (31%), Positives = 33/48 (68%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
K +KR + EK + E E+K +E+ E EK K+K+++++++++ ++ K
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623
Score = 35.8 bits (82), Expect = 0.041
Identities = 17/46 (36%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
K+R A EK KR ++ EE+E +++EKE+E E E+ ++ K
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKERER-EREREAERAAK 623
Score = 33.9 bits (77), Expect = 0.14
Identities = 17/65 (26%), Positives = 36/65 (55%), Gaps = 2/65 (3%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+ KKR E E+ K E EQ+ +E+ + E+EK K+++++++++ ++ K E
Sbjct: 573 ASSKLAKKREEA--VEKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHE 630
Query: 66 QSVKE 70
+ E
Sbjct: 631 SRMSE 635
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 37.9 bits (89), Expect = 0.009
Identities = 17/59 (28%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKN---KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
+ +E+ E + +KE+ED + K KK + +KKK+ K K+ Q K+I++
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKE 224
Score = 36.4 bits (85), Expect = 0.026
Identities = 16/51 (31%), Positives = 25/51 (49%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
E + E+ EK + EEE K K+ + KK + +KKK+ + KE
Sbjct: 162 DFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKE 212
Score = 36.0 bits (84), Expect = 0.039
Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 7/71 (9%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK-------KK 62
E E+ K E ++E E E+ KE +E K + +KKK+ K K+ K
Sbjct: 164 EIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDK 223
Query: 63 KEEQSVKEIEE 73
+E +KEI E
Sbjct: 224 EEITPMKEINE 234
Score = 33.3 bits (77), Expect = 0.21
Identities = 14/63 (22%), Positives = 32/63 (50%), Gaps = 2/63 (3%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK--KKKKKKKKKKKKEEQSVKEIEESLKE 77
+ E + ++E E E +K ++ +K K+ + KK + +KKK+ ++ E ++
Sbjct: 158 ILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQI 217
Query: 78 RKE 80
K+
Sbjct: 218 GKK 220
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 37.7 bits (87), Expect = 0.009
Identities = 18/39 (46%), Positives = 28/39 (71%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK 44
GG G E+ +EEE+EE E+EE+E+E+ EEE+N++
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 33.8 bits (77), Expect = 0.14
Identities = 13/31 (41%), Positives = 24/31 (77%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKK 46
+ +EEE+EE E+EE+E+E+ EEE+ + ++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 33.4 bits (76), Expect = 0.19
Identities = 13/30 (43%), Positives = 26/30 (86%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
++EEE+EE E+EE+E+E+ EEE+ +++ ++
Sbjct: 863 SEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 32.7 bits (74), Expect = 0.36
Identities = 13/31 (41%), Positives = 26/31 (83%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKK 49
+ EEE+EE E+EE+E+E+ EEE+ ++++ ++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 31.5 bits (71), Expect = 0.83
Identities = 25/97 (25%), Positives = 46/97 (47%), Gaps = 1/97 (1%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+R G GE+ E N EE E EQE E E E +++ + ++K +++ + + +
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEG-ETETKGENESEGEIPAERKGEQEGEGEIEA 701
Query: 63 KEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKG 99
KE E E E + +T E ++ + I ++G
Sbjct: 702 KEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEG 738
Score = 30.3 bits (68), Expect = 1.8
Identities = 11/31 (35%), Positives = 26/31 (83%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKK 51
+ E+EE E+EE+E+E+ EEE+ ++++++ ++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 28.8 bits (64), Expect = 5.9
Identities = 13/39 (33%), Positives = 27/39 (69%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
E+++ E+ +EEE+EE E+EE+ +E + E + ++K+
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 28.8 bits (64), Expect = 6.0
Identities = 10/31 (32%), Positives = 26/31 (83%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
+ EE E+EE+E+E+ EEE+ ++++++++ ++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 28.4 bits (63), Expect = 8.4
Identities = 17/73 (23%), Positives = 37/73 (50%), Gaps = 2/73 (2%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS--VKEIEESLKER 78
E +++E+E E + ED E + ++ + K + + + + +K + E QS + E E
Sbjct: 773 EGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDET 832
Query: 79 KEKTLTEDSQCDT 91
E+ L ++Q +
Sbjct: 833 GEQELNAENQGEA 845
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association
with mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 34.7 bits (80), Expect = 0.009
Identities = 17/68 (25%), Positives = 41/68 (60%), Gaps = 2/68 (2%)
Query: 22 EEKEENEQEEKEKEDVEEEKN--KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
+E+EE E+EE + ED+ +E + + + ++ ++ ++++K+EE+ +E+ E L++R
Sbjct: 10 DEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEELAEYLRKRY 69
Query: 80 EKTLTEDS 87
D+
Sbjct: 70 GDEADADA 77
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 37.6 bits (87), Expect = 0.009
Identities = 16/56 (28%), Positives = 29/56 (51%), Gaps = 2/56 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
E+KK+ K K +EKE + + +KE + + ++ KK +KK +K+ E
Sbjct: 19 ERKKK--KEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVE 72
Score = 37.2 bits (86), Expect = 0.013
Identities = 18/42 (42%), Positives = 27/42 (64%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E E ++K EEE +KKKK++K K+K+ KK K ++E K
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46
Score = 37.2 bits (86), Expect = 0.014
Identities = 20/62 (32%), Positives = 34/62 (54%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
E EK+ +EE E++ +EEK K+K+ KK K +K+ K K + ++ S E+K
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65
Query: 80 EK 81
+
Sbjct: 66 SR 67
Score = 36.0 bits (83), Expect = 0.035
Identities = 20/82 (24%), Positives = 44/82 (53%), Gaps = 1/82 (1%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
R E +++ E E E K+K++ E+ K K+ KK K +K+ K K + ++ +
Sbjct: 2 SRTESEAEKKILTEEELERKKKKE-EKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPK 60
Query: 73 ESLKERKEKTLTEDSQCDTMIP 94
+S K+ +++ + +++ D + P
Sbjct: 61 KSEKKSRKRDVEDENPEDFIDP 82
Score = 32.9 bits (75), Expect = 0.29
Identities = 21/90 (23%), Positives = 41/90 (45%), Gaps = 9/90 (10%)
Query: 5 RGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK----- 59
E +K+I + E K++ E++ KEK E +K K +K+ K K + ++
Sbjct: 2 SRTESEAEKKILTEEELERKKKKEEKAKEK---ELKKLKAAQKEAKAKLQAQQASDGTNV 58
Query: 60 -KKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
KK +++ +++E+ E T Q
Sbjct: 59 PKKSEKKSRKRDVEDENPEDFIDPDTPFGQ 88
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 37.0 bits (86), Expect = 0.010
Identities = 18/42 (42%), Positives = 24/42 (57%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
K KKKKKKKKK KK K KK +Q ++ +R+E+
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294
Score = 33.9 bits (78), Expect = 0.10
Identities = 15/48 (31%), Positives = 31/48 (64%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
+ ++++K+K+ V +K K KK K++K + K KK ++ E+ K+I+
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300
Score = 33.1 bits (76), Expect = 0.17
Identities = 24/74 (32%), Positives = 31/74 (41%), Gaps = 6/74 (8%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
AG +E K+ K K K KKKKKK KK K KK K+ +
Sbjct: 230 AGVGPDSLEDMYKKAHAAIRANPSKVK------KKKKKKKKVVHKKYKTKKLTGKQRKAR 283
Query: 68 VKEIEESLKERKEK 81
VK + +ER +K
Sbjct: 284 VKAKKAQRRERLQK 297
Score = 29.7 bits (67), Expect = 2.2
Identities = 20/79 (25%), Positives = 37/79 (46%), Gaps = 8/79 (10%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKK------KKKKKKKKKKKKKKKE 64
K++++ + ++ K++ + K + E+ KK K KKKKKKKKK K+
Sbjct: 210 KQLKEEDPDKYKKQFSKYIKAGVGPDSLEDMYKKAHAAIRANPSKVKKKKKKKKKVVHKK 269
Query: 65 EQSVKEIEESLKERKEKTL 83
++ K + K R +
Sbjct: 270 YKTKKLTGKQRKARVKAKK 288
Score = 28.1 bits (63), Expect = 6.7
Identities = 13/59 (22%), Positives = 32/59 (54%), Gaps = 3/59 (5%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK---KKKKKKKKKKKKKKKKKKKEEQS 67
KK K + ++++K+K ++ K KK K++K + K KK +++++ +++
Sbjct: 242 KKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 37.3 bits (86), Expect = 0.010
Identities = 21/68 (30%), Positives = 44/68 (64%), Gaps = 1/68 (1%)
Query: 10 EKKKRIEKRNK-EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
KKKR E+R EEE++ +QEE +++ EEE+ ++ K++ ++++ + +K++K E +
Sbjct: 219 LKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDGL 278
Query: 69 KEIEESLK 76
E ++ K
Sbjct: 279 SEDKKPFK 286
Score = 36.9 bits (85), Expect = 0.015
Identities = 18/69 (26%), Positives = 41/69 (59%), Gaps = 2/69 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQ--EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
E++++ E R + EE EE E + ++K D + + +K++K+ + ++++K K+ EE +
Sbjct: 112 EEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENN 171
Query: 68 VKEIEESLK 76
+ + LK
Sbjct: 172 GEFMTHKLK 180
Score = 36.2 bits (83), Expect = 0.024
Identities = 32/126 (25%), Positives = 67/126 (53%), Gaps = 9/126 (7%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K++ ++ E E+ + ++EE+ K +EEE+ ++K+++ +K +++++K++ KEE +
Sbjct: 206 KQKQQEAALELEELKKKREERRKV-LEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRA 264
Query: 72 EESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRYSHRQR 131
E + K +K + ED + P K KG SL + R L S Q+ S ++
Sbjct: 265 EAAEKRQK---VPEDGLSEDKKPF--KCFTPKG---SSLKITERAEFLNRSIQKSSVKKS 316
Query: 132 QKSLII 137
L++
Sbjct: 317 HPPLVV 322
Score = 34.2 bits (78), Expect = 0.090
Identities = 20/68 (29%), Positives = 32/68 (47%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + E++N + EE ++EEKE E EEEK K+ ++ + K K + S
Sbjct: 130 EGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRG 189
Query: 70 EIEESLKE 77
E + E
Sbjct: 190 GAEGAQVE 197
Score = 31.9 bits (72), Expect = 0.58
Identities = 21/75 (28%), Positives = 43/75 (57%), Gaps = 1/75 (1%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
E E KE + ++K++E E + KKK+++++K +++++++K+EE K EE
Sbjct: 192 EGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEE 251
Query: 76 KER-KEKTLTEDSQC 89
K R KE+ ++
Sbjct: 252 KRRLKEEIERRRAEA 266
Score = 30.4 bits (68), Expect = 1.5
Identities = 20/90 (22%), Positives = 40/90 (44%), Gaps = 18/90 (20%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--------------- 54
+K + ++E++E E EE+EK + + K K
Sbjct: 137 QKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQV 196
Query: 55 ---KKKKKKKKKEEQSVKEIEESLKERKEK 81
K+ +K K+K++++ E+EE K+R+E+
Sbjct: 197 EAGKEFEKLKQKQQEAALELEELKKKREER 226
Score = 29.2 bits (65), Expect = 3.8
Identities = 19/79 (24%), Positives = 43/79 (54%), Gaps = 4/79 (5%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE---- 65
E +E+ K+ E+ EE+E+ +EE ++K +++++K++ K++ ++++ E
Sbjct: 211 EAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKR 270
Query: 66 QSVKEIEESLKERKEKTLT 84
Q V E S ++ K T
Sbjct: 271 QKVPEDGLSEDKKPFKCFT 289
Score = 28.1 bits (62), Expect = 8.7
Identities = 17/66 (25%), Positives = 33/66 (50%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
E E E E E++E+ +E+ EE + + K ++K + ++ +KEE+ + EE
Sbjct: 102 EDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEK 161
Query: 76 KERKEK 81
+R
Sbjct: 162 PKRGSL 167
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 37.5 bits (87), Expect = 0.010
Identities = 19/46 (41%), Positives = 30/46 (65%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
KKKKKKK K+KKKKK ++ K+ +++ IEE ++ + +E Q
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQ 730
Score = 35.2 bits (81), Expect = 0.059
Identities = 22/53 (41%), Positives = 33/53 (62%), Gaps = 2/53 (3%)
Query: 48 KKKKKKKKKKKKKKKKEEQSVKEIE--ESLKERKEKTLTEDSQCDTMIPVFKK 98
+KKKKKKK K+KKKKK ++ K+I E+++E E + E S + + KK
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736
Score = 30.6 bits (69), Expect = 1.5
Identities = 18/44 (40%), Positives = 29/44 (65%), Gaps = 1/44 (2%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE-IEESLKERKEKT 82
+K KKKK K+KKKKK ++ K+ ++++E IE S E+ +T
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSET 728
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 35.6 bits (82), Expect = 0.011
Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 7/87 (8%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK-------KKKKKKK 58
G G K+ ++ + EEE EE ++ +E E+ + K++K +++++ K KKK
Sbjct: 18 GMLGGKEDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIRDKYGIKKK 77
Query: 59 KKKKKEEQSVKEIEESLKERKEKTLTE 85
++ ++E Q+ E EE RK+KT E
Sbjct: 78 EEDEEEPQAAAEEEEGRLGRKKKTPEE 104
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 37.8 bits (88), Expect = 0.011
Identities = 24/79 (30%), Positives = 36/79 (45%), Gaps = 15/79 (18%)
Query: 9 GEKKKRIEKRNKEEEKEEN-EQEEKEKEDVEEEKNKKKKK--------KKKKK------K 53
+KK I K+ + EN QEEKE E ++ K+KK ++K +
Sbjct: 1504 QKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAE 1563
Query: 54 KKKKKKKKKKEEQSVKEIE 72
KK+K KK+ +S E E
Sbjct: 1564 SDIKKRKNKKQYKSNTEAE 1582
Score = 35.8 bits (83), Expect = 0.043
Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 9/75 (12%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK--------KKKKKKKKK 62
+KK IE N+ +E++E + + + D E++ N + ++K KK+K KK
Sbjct: 1514 EKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKK 1573
Query: 63 KEEQSV-KEIEESLK 76
+ + + E++ LK
Sbjct: 1574 QYKSNTEAELDFFLK 1588
Score = 34.7 bits (80), Expect = 0.092
Identities = 17/36 (47%), Positives = 26/36 (72%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
E + D EEK KKKKKK+KKK+++ K+++K + E
Sbjct: 733 AEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768
Score = 33.2 bits (76), Expect = 0.28
Identities = 15/47 (31%), Positives = 30/47 (63%)
Query: 29 QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
++ E + + + K KKKKKK+KKK+++ K+++K + E +S+
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAEAWDSI 776
Score = 33.2 bits (76), Expect = 0.28
Identities = 15/54 (27%), Positives = 26/54 (48%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
K G GE + EK+ E N+++ E++ E + K+K KK+ K +
Sbjct: 1528 KEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEA 1581
Score = 30.5 bits (69), Expect = 1.7
Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 4/54 (7%)
Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
+ ++ K ++K KKKKKK+KKKEE+ + EE K R E DS
Sbjct: 726 RNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEY--KREE--KARIEIAEAWDS 775
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 37.3 bits (87), Expect = 0.011
Identities = 13/75 (17%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE-EQ 66
+ R+E++ K+ +E E++ E++E + + + K + ++ + ++K +E ++
Sbjct: 293 LANEISRLEQQ-KQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351
Query: 67 SVKEIEESLKERKEK 81
++ +E L+E + +
Sbjct: 352 ELESLEAELEELEAE 366
Score = 30.4 bits (69), Expect = 1.8
Identities = 12/69 (17%), Positives = 33/69 (47%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+ K E + E EE +E KE+ + E + ++ + + ++ + + ++ +++ E K
Sbjct: 329 ESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV 388
Query: 71 IEESLKERK 79
+ L+
Sbjct: 389 AQLELQIAS 397
Score = 29.3 bits (66), Expect = 3.8
Identities = 20/83 (24%), Positives = 36/83 (43%), Gaps = 9/83 (10%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ---- 66
K++R E K E EN ED+ E ++ K +++ +K ++ K+ K E +
Sbjct: 171 KERRKETERKLERTRENLDR---LEDILNELERQLKSLERQAEKAERYKELKAELRELEL 227
Query: 67 --SVKEIEESLKERKEKTLTEDS 87
V +EE +E +E
Sbjct: 228 ALLVLRLEELREELEELQEELKE 250
Score = 28.5 bits (64), Expect = 7.0
Identities = 15/81 (18%), Positives = 34/81 (41%), Gaps = 7/81 (8%)
Query: 8 AGEKKKRIE--KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK-----KKK 60
++R+E + EE E+ E E E++EE + + + + ++
Sbjct: 833 IAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALL 892
Query: 61 KKKEEQSVKEIEESLKERKEK 81
+ + E+ +E+ E +R E
Sbjct: 893 RSELEELSEELRELESKRSEL 913
Score = 28.5 bits (64), Expect = 7.8
Identities = 24/93 (25%), Positives = 41/93 (44%), Gaps = 9/93 (9%)
Query: 2 RKRRGGAGEKKKRIEK-RNKEEEKEENEQE--------EKEKEDVEEEKNKKKKKKKKKK 52
K E+++ IE+ K EE EE E KE E++EEE + +K+ ++
Sbjct: 666 AKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELS 725
Query: 53 KKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
++ +K ++ E E + K LTE
Sbjct: 726 RQISALRKDLARLEAEVEQLEERIAQLSKELTE 758
Score = 28.1 bits (63), Expect = 9.8
Identities = 16/85 (18%), Positives = 42/85 (49%), Gaps = 6/85 (7%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK----KKKKKKEE 65
E+ + K +EE +E + ++ +E +EE + + + +++ ++ +K+ + + E
Sbjct: 242 EELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLE 301
Query: 66 QSVKEIEESLK--ERKEKTLTEDSQ 88
Q + + E L ER+ + L +
Sbjct: 302 QQKQILRERLANLERQLEELEAQLE 326
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 36.6 bits (85), Expect = 0.011
Identities = 15/66 (22%), Positives = 42/66 (63%), Gaps = 5/66 (7%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK-----E 80
++Q+ ++++++ + ++ KKK++ + KKKK+ ++ + K+I++ +KE++ +
Sbjct: 69 LSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQ 128
Query: 81 KTLTED 86
LTE+
Sbjct: 129 TKLTEE 134
Score = 34.3 bits (79), Expect = 0.066
Identities = 17/59 (28%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK--KEEQSVKEIEESLKE 77
E+EKE + + ++ ++E+ + KKKK+ ++ K +KK + KE+Q ++ L E
Sbjct: 75 EDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKLTE 133
Score = 30.1 bits (68), Expect = 1.6
Identities = 17/62 (27%), Positives = 33/62 (53%), Gaps = 2/62 (3%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK--KKKKKKKKKKKKKKEEQSVKEIEE 73
E+ K++ E E K+K+++EE K +KK + K+K+ + K EE + I +
Sbjct: 83 SSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKLTEEGLLITILD 142
Query: 74 SL 75
++
Sbjct: 143 NI 144
Score = 28.9 bits (65), Expect = 3.8
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
KE E E+ +K++ E K KK+ ++ K +KK
Sbjct: 78 KELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 36.9 bits (86), Expect = 0.012
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 3/67 (4%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K+K + K EEK E E+EE+ E EEE+ + +++KK+K K K + +E K+
Sbjct: 6 KEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKE---KD 62
Query: 71 IEESLKE 77
IE+ L+E
Sbjct: 63 IEDLLEE 69
Score = 36.5 bits (85), Expect = 0.015
Identities = 26/67 (38%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 12 KKRIEK-RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K +EK K EE+EE E E E+E+ EE++ +KK+K K K + K+K E ++E
Sbjct: 10 SKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIEDLLEE 69
Query: 71 IEESLKE 77
+E L E
Sbjct: 70 LELELLE 76
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 36.3 bits (84), Expect = 0.014
Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 7/76 (9%)
Query: 18 RNKEEEKEENEQEEKEKEDVE---EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
R+ E ++ ++ ED+E ++ KKK+ +K KK KKK +E K EE
Sbjct: 25 RSTTESVDKVR-DDIFPEDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKE---KLTEEE 80
Query: 75 LKERKEKTLTEDSQCD 90
K + TE+++ D
Sbjct: 81 KKPESDDDKTEENEND 96
Score = 30.1 bits (68), Expect = 1.1
Identities = 11/73 (15%), Positives = 32/73 (43%), Gaps = 4/73 (5%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
++ + KK+ +++ EEEK+ ++K EE +N ++ ++
Sbjct: 59 PRKPATTKKSKKKDKEKLTEEEKKPESDDDKT----EENENDPDNNEESGDSQESASANS 114
Query: 62 KKEEQSVKEIEES 74
+ + ++++S
Sbjct: 115 LSDIDNEDDMDDS 127
Score = 27.4 bits (61), Expect = 8.4
Identities = 19/71 (26%), Positives = 30/71 (42%), Gaps = 8/71 (11%)
Query: 29 QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK------KKEEQS--VKEIEESLKERKE 80
Q +K++ + KK KKK K+K +++KK K EE EES ++
Sbjct: 50 QPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQES 109
Query: 81 KTLTEDSQCDT 91
+ S D
Sbjct: 110 ASANSLSDIDN 120
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 36.7 bits (85), Expect = 0.014
Identities = 17/52 (32%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 10 EKKKRIEKRNKE---EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
E K EK E +E +E E ++ +E+ +KK+K KKK +K+K
Sbjct: 319 EIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370
Score = 33.6 bits (77), Expect = 0.13
Identities = 18/51 (35%), Positives = 31/51 (60%), Gaps = 3/51 (5%)
Query: 16 EKRNKEEEKEENE---QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E+ K +EK +E ED E+E +++ K++ +KK+K KKK +K+K
Sbjct: 318 EEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRK 368
Score = 33.6 bits (77), Expect = 0.16
Identities = 18/77 (23%), Positives = 33/77 (42%), Gaps = 11/77 (14%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK-----------KKKKKKEEQSV 68
+E E EE E+E V+ + + ++ KKK +K+ K +
Sbjct: 254 REIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSLPE 313
Query: 69 KEIEESLKERKEKTLTE 85
+I E + + EK+L+E
Sbjct: 314 SDIHEEIPKLDEKSLSE 330
Score = 32.5 bits (74), Expect = 0.32
Identities = 13/45 (28%), Positives = 26/45 (57%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
+ + +E +E+E +E+ KE+VE+++ KKK +K+K
Sbjct: 326 KSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
production and conversion].
Length = 108
Score = 34.7 bits (80), Expect = 0.015
Identities = 17/77 (22%), Positives = 41/77 (53%), Gaps = 3/77 (3%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE---QSV 68
+ IE+ +E E+ E E+ +E +EE + + +K ++ ++ +++ +++ EE ++
Sbjct: 20 DEEIEEAKEEAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAEEILAEAE 79
Query: 69 KEIEESLKERKEKTLTE 85
KE L + E + E
Sbjct: 80 KEASAILSKAAEGKVVE 96
Score = 29.7 bits (67), Expect = 0.76
Identities = 11/76 (14%), Positives = 34/76 (44%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++ I++ +E + E EE+ ++ +E + +++ +++ ++ + +K+ K
Sbjct: 29 EAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAEEILAEAEKEASAILSK 88
Query: 70 EIEESLKERKEKTLTE 85
E + E
Sbjct: 89 AAEGKVVEAALSEFLA 104
Score = 28.9 bits (65), Expect = 1.3
Identities = 15/65 (23%), Positives = 36/65 (55%), Gaps = 4/65 (6%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
E K+ ++E E+ +EE ++ K+ +++ ++ ++ EE++ K +E L+E +E
Sbjct: 10 REIKKAEISADEEIEEAKEE----AEQIIKEAREEAREIIEEAEEEAEKLAQEILEEARE 65
Query: 81 KTLTE 85
+ E
Sbjct: 66 EAEEE 70
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 37.0 bits (85), Expect = 0.015
Identities = 16/49 (32%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 17 KRNKEEEKEENEQEEKEKE--DVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
K +E+E+ E+ Q KE+E E+E+ ++ +K K+ KK+ KK
Sbjct: 1027 KDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKK 1075
Score = 34.7 bits (79), Expect = 0.093
Identities = 16/51 (31%), Positives = 26/51 (50%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K + EK E + E+E +KE E++ +K K+ KK+ KKK+
Sbjct: 1027 KDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077
Score = 33.6 bits (76), Expect = 0.20
Identities = 17/68 (25%), Positives = 41/68 (60%), Gaps = 4/68 (5%)
Query: 16 EKRNKEEEKE----ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
E R K E K+ E ++++EKE +E + K+++ KK+K+++++ +K + +
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMA 1069
Query: 72 EESLKERK 79
++ LK+++
Sbjct: 1070 KKRLKKKR 1077
Score = 33.6 bits (76), Expect = 0.20
Identities = 14/52 (26%), Positives = 31/52 (59%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
E +EK ++++ K++ K +++K++ + ++ K+EE KE E + RK
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTI 1061
Score = 32.4 bits (73), Expect = 0.49
Identities = 15/55 (27%), Positives = 33/55 (60%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
K R E+K+R+E + +E+E ++E++ ++ + + + K+ KK+ KKK+
Sbjct: 1023 KERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077
Score = 28.6 bits (63), Expect = 6.9
Identities = 20/97 (20%), Positives = 41/97 (42%), Gaps = 8/97 (8%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK-------KKKKKKKKKK 62
+E+ ENE E ++ D E E+ + +KK++ + + +KK
Sbjct: 613 SSDNEMEESRGSSVTAENE-ESADEVDYETEREENARKKEELRGNFELEERGDPEKKDVD 671
Query: 63 KEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKG 99
+ ++IEE LK + + T + +I ++ G
Sbjct: 672 WYTEEKRKIEEQLKINRSEFETMVPESRVVIEGYRAG 708
Score = 28.2 bits (62), Expect = 8.9
Identities = 19/75 (25%), Positives = 45/75 (60%), Gaps = 6/75 (8%)
Query: 13 KRIEKRNKEEEKEENE-QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV--- 68
+RIE E +E++E ++ KE +++++ +K++ + ++ K+++ KK++E+
Sbjct: 1001 RRIELPVPPECREKHEIKDRIVKERIKDQE--EKERMESLQRAKEEEIGKKEKEREQRIR 1058
Query: 69 KEIEESLKERKEKTL 83
K I ++ KE +K L
Sbjct: 1059 KTIHDNYKEMAKKRL 1073
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 35.9 bits (83), Expect = 0.017
Identities = 14/78 (17%), Positives = 30/78 (38%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
E K K KE + + + + + ++ + +KK + + + +KK E +
Sbjct: 98 QAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAA 157
Query: 68 VKEIEESLKERKEKTLTE 85
V E + E +E
Sbjct: 158 VAAEEAAAAEEEEAEEAP 175
Score = 32.8 bits (75), Expect = 0.16
Identities = 21/57 (36%), Positives = 29/57 (50%), Gaps = 2/57 (3%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE-EQSVKE-IEESLKERKE 80
E + E K + EE++ K KK K K KK KK E E+ V E E++ E+K
Sbjct: 96 EAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKA 152
Score = 31.7 bits (72), Expect = 0.39
Identities = 13/77 (16%), Positives = 33/77 (42%), Gaps = 1/77 (1%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
EK+ ++ K++ + + +K + E++ N+ + + +KK + +E +
Sbjct: 108 EEKEGKVNA-KKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAA 166
Query: 69 KEIEESLKERKEKTLTE 85
+E E +E E
Sbjct: 167 EEEEAEEAPAEEAPAEE 183
Score = 28.2 bits (63), Expect = 5.9
Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 3/54 (5%)
Query: 35 EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK---EIEESLKERKEKTLTE 85
E E K + K++K+ K KK K K ++ + K E E+ + E + + + E
Sbjct: 96 EAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAE 149
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 35.8 bits (83), Expect = 0.017
Identities = 13/39 (33%), Positives = 20/39 (51%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
E K+ ++EE E+ +E KK +K + KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 32.0 bits (73), Expect = 0.28
Identities = 11/39 (28%), Positives = 21/39 (53%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
E+++ +Q+E+ + +E KK +K + KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 30.4 bits (69), Expect = 0.99
Identities = 10/39 (25%), Positives = 21/39 (53%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
+++ K++ E +E+++ KK +K + KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 28.9 bits (65), Expect = 3.6
Identities = 11/34 (32%), Positives = 17/34 (50%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK 46
K+ E+ EE KE K+ ++ + KKKK
Sbjct: 163 KQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 28.5 bits (64), Expect = 4.1
Identities = 10/40 (25%), Positives = 17/40 (42%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
E+K +K E+ + K+ +K + KKKK
Sbjct: 157 AERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 28.1 bits (63), Expect = 5.5
Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 2/39 (5%)
Query: 27 NEQEEKEKEDV--EEEKNKKKKKKKKKKKKKKKKKKKKK 63
++ K+KE+ EE K KK +K + KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 28.1 bits (63), Expect = 6.4
Identities = 9/37 (24%), Positives = 19/37 (51%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
K K++E+ E+ ++ ++ +K + KKKK
Sbjct: 160 KDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 36.4 bits (84), Expect = 0.018
Identities = 19/45 (42%), Positives = 24/45 (53%)
Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
+ K KK K KKK KK KKK+ +E S K+ EE + E E
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEES 186
Score = 35.6 bits (82), Expect = 0.039
Identities = 22/89 (24%), Positives = 43/89 (48%), Gaps = 6/89 (6%)
Query: 11 KKKRIEKRNKEE--EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
K + + E +K + + +K +++ K KKK+ K+ K +++ + E++S
Sbjct: 123 KPVKNVLESIMEGLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESK 182
Query: 69 KEIEE---SLKERKEKTLTED-SQCDTMI 93
E S E +E + +ED SQ D M+
Sbjct: 183 SEESAEDDSDDEEEEDSDSEDYSQYDGML 211
Score = 30.2 bits (68), Expect = 1.6
Identities = 19/93 (20%), Positives = 41/93 (44%), Gaps = 10/93 (10%)
Query: 2 RKRRGGAGEKKKRIEKR--------NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
RK R G ++ EK+ KE EKE+ E+E ++ E E + K++ K
Sbjct: 306 RKNRRGQRARQAIWEKKYGSGAKHVKKEREKEQKEREGRQSE--WEARQAKREGGDAKAG 363
Query: 54 KKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
+ + + +++ + K+ K+ ++ +
Sbjct: 364 RAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKP 396
Score = 30.2 bits (68), Expect = 1.9
Identities = 12/56 (21%), Positives = 20/56 (35%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
GE+ I E E +E + + E +++ KK K+KK
Sbjct: 220 GEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPS 275
Score = 29.0 bits (65), Expect = 3.8
Identities = 17/71 (23%), Positives = 27/71 (38%), Gaps = 9/71 (12%)
Query: 2 RKRRGGAGEKK-KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK------ 54
R+ ++ KR K E +++ ++ +KKK KK K
Sbjct: 342 EGRQSEWEARQAKREGGDAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSW 401
Query: 55 --KKKKKKKKK 63
KKK K+KK
Sbjct: 402 EAKKKAKEKKA 412
Score = 29.0 bits (65), Expect = 4.4
Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 7/63 (11%)
Query: 4 RRGGAGEKKKRIEKRN--KEEEKEENEQEEKEKEDVEEEKN-----KKKKKKKKKKKKKK 56
AG + R K + + E+++ +K V++ + KKK K+KK K +
Sbjct: 358 GDAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKANAKFQ 417
Query: 57 KKK 59
KK
Sbjct: 418 GKK 420
Score = 28.3 bits (63), Expect = 6.6
Identities = 16/68 (23%), Positives = 24/68 (35%), Gaps = 16/68 (23%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKK----------------KKKKKKKKKEEQSVKEIEE 73
E E +D E+ + KK K+K K KK+ E+ KE E
Sbjct: 284 SEDEDDDDEDIDPDQVVKKPVKRKNRRGQRARQAIWEKKYGSGAKHVKKEREKEQKEREG 343
Query: 74 SLKERKEK 81
E + +
Sbjct: 344 RQSEWEAR 351
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 35.8 bits (83), Expect = 0.019
Identities = 11/39 (28%), Positives = 21/39 (53%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E + + E+ EE + K+++ + +KK+ KKKK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 35.0 bits (81), Expect = 0.031
Identities = 13/39 (33%), Positives = 21/39 (53%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
E K+ EE E+ E K+++ + +KK+ KKKK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 31.5 bits (72), Expect = 0.39
Identities = 14/37 (37%), Positives = 19/37 (51%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
K K E+ E+ E EEE + +KK+ KKKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 31.1 bits (71), Expect = 0.64
Identities = 14/40 (35%), Positives = 17/40 (42%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
E K E+ E KE+E E EK + KKKK
Sbjct: 157 AEWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 30.8 bits (70), Expect = 0.73
Identities = 11/39 (28%), Positives = 19/39 (48%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
++ K E +E E K ++ + +KK+ KKKK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 30.4 bits (69), Expect = 1.0
Identities = 12/38 (31%), Positives = 18/38 (47%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
+ EE EE E +E+ + +KK+ KKKK
Sbjct: 159 WKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 30.4 bits (69), Expect = 1.0
Identities = 14/35 (40%), Positives = 17/35 (48%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK 50
K +E EE E KE+ E KK+ KKKK
Sbjct: 162 AKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 30.4 bits (69), Expect = 1.2
Identities = 10/39 (25%), Positives = 20/39 (51%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
E + + +E EE K+++ + +KK+ KKK+
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
chaperones [Posttranslational modification, protein
turnover, chaperones].
Length = 786
Score = 36.5 bits (85), Expect = 0.019
Identities = 19/88 (21%), Positives = 40/88 (45%), Gaps = 19/88 (21%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK------------ 57
E ++ + + E+E E EQ+EKEK+ ++E K+ + + +K+ +
Sbjct: 405 ELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEAEVDEDDIAEVVA 464
Query: 58 -------KKKKKKEEQSVKEIEESLKER 78
K + E++ + +E LK+R
Sbjct: 465 RWTGIPVAKLLEDEKEKLLNLERRLKKR 492
Score = 34.2 bits (79), Expect = 0.12
Identities = 16/64 (25%), Positives = 34/64 (53%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+K + +++ +E + E E+E E+E E+EK + K K+ + + +K+ + E
Sbjct: 398 DKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEAEVDED 457
Query: 70 EIEE 73
+I E
Sbjct: 458 DIAE 461
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 36.1 bits (83), Expect = 0.019
Identities = 22/97 (22%), Positives = 41/97 (42%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EKK +K +E + ++ K +K K KK K K KK K ++ +SVK
Sbjct: 15 EKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVK 74
Query: 70 EIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIY 106
+ + K K++ ++ + + L K ++
Sbjct: 75 KESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLF 111
Score = 31.1 bits (70), Expect = 0.88
Identities = 16/54 (29%), Positives = 25/54 (46%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
+K+ EE+K KK K+ K KK K ++ K+ + K + KT T
Sbjct: 7 TTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60
Score = 28.8 bits (64), Expect = 3.9
Identities = 11/68 (16%), Positives = 31/68 (45%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A +K +++K + + + EK + ++++ KK KK+ + + K ++
Sbjct: 43 AAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKN 102
Query: 68 VKEIEESL 75
++ + L
Sbjct: 103 TSKLPKKL 110
Score = 28.8 bits (64), Expect = 4.7
Identities = 24/95 (25%), Positives = 34/95 (35%), Gaps = 18/95 (18%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK----KKKKKKKKKKKKKKEE 65
E KK +K+ EE+K +K +E K KK K K KK K KK K +
Sbjct: 3 ETKKTTKKKTTEEKKPA-----AKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVK 57
Query: 66 QSVK---------EIEESLKERKEKTLTEDSQCDT 91
+ K E + K+ E +
Sbjct: 58 TTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEV 92
Score = 28.4 bits (63), Expect = 5.6
Identities = 20/83 (24%), Positives = 30/83 (36%), Gaps = 6/83 (7%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK------KKKKKEEQSVKEIEESLKE 77
E ++ K+K E++ KK K+ K KK K KK + K
Sbjct: 1 MAETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60
Query: 78 RKEKTLTEDSQCDTMIPVFKKGV 100
+K E ++ V KK V
Sbjct: 61 KKVTVKFEKTESVKKESVAKKTV 83
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 36.1 bits (84), Expect = 0.020
Identities = 18/76 (23%), Positives = 41/76 (53%), Gaps = 3/76 (3%)
Query: 8 AGEKKKRIEKRNKEE---EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
A EK E+ E E+E +++KE+E + E + + ++ K+ +K + +++K
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLL 260
Query: 65 EQSVKEIEESLKERKE 80
+ + +E L+E++E
Sbjct: 261 AEQERMLEHKLQEQEE 276
Score = 31.5 bits (72), Expect = 0.65
Identities = 15/64 (23%), Positives = 38/64 (59%), Gaps = 1/64 (1%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
KE+ + E+ K + E + + ++K+K++++ + +++ +E VK++ E ++ +EK
Sbjct: 200 TAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEH-VKQLIEKMEAEREK 258
Query: 82 TLTE 85
L E
Sbjct: 259 LLAE 262
Score = 29.9 bits (68), Expect = 2.0
Identities = 15/58 (25%), Positives = 33/58 (56%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
E+ +E K+ E+ E E+E + E+ + + K +++++ K+ K + E KEI++
Sbjct: 238 ERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQKEIQD 295
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 36.6 bits (83), Expect = 0.022
Identities = 24/87 (27%), Positives = 50/87 (57%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
K+ E+E E E K++ EEE + K ++++ +K+++KK+ K+ Q+ K+ E+++K
Sbjct: 95 KKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVK 154
Query: 77 ERKEKTLTEDSQCDTMIPVFKKGVLYK 103
K + + + T PV ++ V+ K
Sbjct: 155 AEKAEKAEKTKKASTPAPVEEEIVVKK 181
Score = 34.7 bits (78), Expect = 0.084
Identities = 21/76 (27%), Positives = 43/76 (56%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
+K E R E E ++ +E+E + + E+ + +K+++KK+ KK + +KK+E++V
Sbjct: 94 AKKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAV 153
Query: 69 KEIEESLKERKEKTLT 84
K + E+ +K T
Sbjct: 154 KAEKAEKAEKTKKAST 169
Score = 34.3 bits (77), Expect = 0.093
Identities = 24/79 (30%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK---KKKKKKK 60
RR A KK+ ++ ++ K E E+ +KE+E E + K + +KKK+K +K +K +
Sbjct: 103 RRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAE 162
Query: 61 KKKEEQSVKEIEESLKERK 79
K K+ + +EE + +K
Sbjct: 163 KTKKASTPAPVEEEIVVKK 181
Score = 33.1 bits (74), Expect = 0.24
Identities = 20/76 (26%), Positives = 43/76 (56%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ +++EK K E+K+ +Q K+ E ++ KK +K+ ++ + + KK+ +EE+ +
Sbjct: 62 EQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQ 121
Query: 70 EIEESLKERKEKTLTE 85
E + +KE+ E
Sbjct: 122 WKAEQERIQKEQEKKE 137
Score = 33.1 bits (74), Expect = 0.25
Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 7/75 (9%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A EK+ R R + E K+ QEE+ K+ E++ +K+++KK+ KK + +KKKE
Sbjct: 97 AAEKEAR---RAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKE--- 150
Query: 68 VKEIEESLKERKEKT 82
K ++ E+ EKT
Sbjct: 151 -KAVKAEKAEKAEKT 164
Score = 30.0 bits (66), Expect = 2.0
Identities = 23/67 (34%), Positives = 30/67 (44%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EKK + KR E E E + VE + +KK KK KKK + +K EE
Sbjct: 256 EKKNKKNKRKSESENVPAASETPVEPVVETTPPASENQKKNKKDKKKSESEKVVEEPVQA 315
Query: 70 EIEESLK 76
E +S K
Sbjct: 316 EAPKSKK 322
Score = 29.6 bits (65), Expect = 2.8
Identities = 19/60 (31%), Positives = 34/60 (56%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K E+ KE+EK+E + ++ + E +E+ K +K +K +K KK +EE VK++
Sbjct: 123 KAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKASTPAPVEEEIVVKKV 182
Score = 28.5 bits (62), Expect = 7.1
Identities = 20/75 (26%), Positives = 32/75 (42%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
EK+NK+ +++ + + E + + +KK KK KKK E + V E
Sbjct: 256 EKKNKKNKRKSESENVPAASETPVEPVVETTPPASENQKKNKKDKKKSESEKVVEEPVQA 315
Query: 76 KERKEKTLTEDSQCD 90
+ K K T D D
Sbjct: 316 EAPKSKKPTADDNMD 330
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 34.9 bits (80), Expect = 0.022
Identities = 18/51 (35%), Positives = 35/51 (68%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
EKK+ ++R +E+ ++E + +KE E +E K K+K++++++KK KK K
Sbjct: 76 EKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKKLTKKTKS 126
Score = 34.5 bits (79), Expect = 0.031
Identities = 22/74 (29%), Positives = 44/74 (59%), Gaps = 3/74 (4%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKK---KKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
E E++ +Q + KED + EK KK KK+ K++K+++++K+ K ++ +++IE
Sbjct: 45 EGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIE 104
Query: 73 ESLKERKEKTLTED 86
S K++KE+
Sbjct: 105 LSKKKQKERERRRK 118
Score = 34.1 bits (78), Expect = 0.046
Identities = 22/75 (29%), Positives = 43/75 (57%), Gaps = 5/75 (6%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKED--VEEEKNKKKKKK---KKKKKKKKKKKKKKKEEQ 66
KK K ++E E+E EK+ +E++ +KKKK KK+ K++K+++++KE
Sbjct: 35 KKEYLKLLEKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELA 94
Query: 67 SVKEIEESLKERKEK 81
++ E ++ K+K
Sbjct: 95 KRQKELEKIELSKKK 109
Score = 32.9 bits (75), Expect = 0.096
Identities = 19/60 (31%), Positives = 40/60 (66%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
EKKK+++++ + ++ + EQ EKE ++E K + KKK+K++++++KK K+ +S
Sbjct: 67 KFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKKLTKKTKS 126
Score = 31.4 bits (71), Expect = 0.37
Identities = 18/59 (30%), Positives = 37/59 (62%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K E R E++K+ +E++E K+ E++ K+ K++K+ +K + KKK+KE + ++
Sbjct: 60 KSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRK 118
Score = 30.2 bits (68), Expect = 0.95
Identities = 18/64 (28%), Positives = 38/64 (59%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
K EKE EKE + + + +K+ +K +KKKK +KK+ K+ ++ +E E + ++++
Sbjct: 40 KLLEKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKE 99
Query: 80 EKTL 83
+ +
Sbjct: 100 LEKI 103
Score = 28.7 bits (64), Expect = 2.8
Identities = 17/60 (28%), Positives = 41/60 (68%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E +K +K+ +E+KE +Q ++E+ + E K +K+ +K + KKK+K++++++++ + K
Sbjct: 64 EDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKKLTKK 123
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 36.4 bits (84), Expect = 0.022
Identities = 16/88 (18%), Positives = 32/88 (36%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+K K K+E ++ E+E ++ + + ++ K KKK + S E
Sbjct: 339 EKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNED 398
Query: 72 EESLKERKEKTLTEDSQCDTMIPVFKKG 99
S E + + + T P +
Sbjct: 399 TPSENEESKGSPPQVEATTTAEPNREPS 426
Score = 33.7 bits (77), Expect = 0.15
Identities = 18/90 (20%), Positives = 35/90 (38%), Gaps = 7/90 (7%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK-----KKKKKKKKKKKKKKK 61
A KK R + K E K ++ +++ E E N++ + + + ++ K K
Sbjct: 329 NAEIKKTRTAE--KNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAK 386
Query: 62 KKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
KK E++ E +E + T
Sbjct: 387 KKGLIDASPNEDTPSENEESKGSPPQVEAT 416
Score = 33.3 bits (76), Expect = 0.22
Identities = 9/79 (11%), Positives = 32/79 (40%), Gaps = 2/79 (2%)
Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE--QSVKEI 71
+ N E +K ++ + K +E K++ +++ ++ ++++ +
Sbjct: 324 ELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAV 383
Query: 72 EESLKERKEKTLTEDSQCD 90
+ K + + ED+ +
Sbjct: 384 KAKKKGLIDASPNEDTPSE 402
Score = 32.9 bits (75), Expect = 0.23
Identities = 13/69 (18%), Positives = 26/69 (37%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+ A +KK I ++ + E+E N + +E+ ++ K KKK
Sbjct: 337 TAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPN 396
Query: 62 KKEEQSVKE 70
+ +E
Sbjct: 397 EDTPSENEE 405
Score = 32.9 bits (75), Expect = 0.26
Identities = 14/75 (18%), Positives = 28/75 (37%), Gaps = 1/75 (1%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
++K R + K +K ++ + + +E + + ++ K KKK
Sbjct: 332 IKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLI 391
Query: 61 KKKE-EQSVKEIEES 74
E + E EES
Sbjct: 392 DASPNEDTPSENEES 406
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 35.0 bits (81), Expect = 0.023
Identities = 17/74 (22%), Positives = 39/74 (52%), Gaps = 6/74 (8%)
Query: 10 EKKKRIE------KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E++ +I +R KEE + + E+E E+ E+ ++ ++ KK+ ++ ++ K +
Sbjct: 37 ERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEA 96
Query: 64 EEQSVKEIEESLKE 77
EE+ + E + E
Sbjct: 97 EEELERIKEAAEAE 110
Score = 29.2 bits (66), Expect = 1.8
Identities = 12/70 (17%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 15 IEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
+++R + + E E ++E + +++ ++ +++ + ++ KK+ EQ +EI+
Sbjct: 35 LDERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKA 94
Query: 74 SLKERKEKTL 83
+E E+
Sbjct: 95 EAEEELERIK 104
Score = 28.4 bits (64), Expect = 3.7
Identities = 12/77 (15%), Positives = 35/77 (45%), Gaps = 1/77 (1%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+ + A E+ +E ++ +E E+ K++ E+ + K + +++ ++ K+ +
Sbjct: 50 ERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEAAEA 109
Query: 62 KKEEQSVKEIEESLKER 78
+ E + E L+
Sbjct: 110 EI-EAEKERALEELRAE 125
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 34.5 bits (80), Expect = 0.023
Identities = 24/83 (28%), Positives = 49/83 (59%), Gaps = 7/83 (8%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVE-------EEKNKKKKKKKKKKKKKKKKKKKK 62
E+ +R E+ K+ E+E ++EE+ +E + E + K+++ +KK +++KK +K+K+
Sbjct: 21 EEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRKEKE 80
Query: 63 KEEQSVKEIEESLKERKEKTLTE 85
+E + +K E LK EK +
Sbjct: 81 EEIKELKAELEELKAEIEKLEEK 103
Score = 28.0 bits (63), Expect = 4.0
Identities = 19/74 (25%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK---KKKKKKKKKKKKEEQSVKEIE 72
KR + E +EE ++ +E+ + +EE+ ++ K K + + K+++ +KK E+ K +
Sbjct: 18 AKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRK 77
Query: 73 ESLKERKEKTLTED 86
E +E KE +
Sbjct: 78 EKEEEIKELKAELE 91
Score = 26.8 bits (60), Expect = 9.2
Identities = 16/77 (20%), Positives = 47/77 (61%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+++R +K++ +++ + +K E E K + ++ + +KK +K+K+++ K+ K +
Sbjct: 30 LKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRKEKEEEIKELKAE 89
Query: 61 KKKEEQSVKEIEESLKE 77
++ + ++++EE L+E
Sbjct: 90 LEELKAEIEKLEEKLEE 106
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 35.8 bits (83), Expect = 0.023
Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 3/66 (4%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+++R A EKK +EK K ++K + +E + D + K KKK +K + K
Sbjct: 155 LQER---AKEKKDMLEKIKKLKKKRKGGGDEGDLFDFLVDDAAKGGSKKKGRKGGAARGK 211
Query: 61 KKKEEQ 66
+ +
Sbjct: 212 PNAKRK 217
Score = 30.0 bits (68), Expect = 1.8
Identities = 20/88 (22%), Positives = 39/88 (44%), Gaps = 8/88 (9%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKE---DVEEEKNKKKKK---KKKKKKK 54
M+K R + + EE +++ E ++ K+ + +E+ K+KK K KK KK
Sbjct: 117 MQKVRKKLLAEAAAKKAS--EEARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKK 174
Query: 55 KKKKKKKKKEEQSVKEIEESLKERKEKT 82
K+K + + + + K+K
Sbjct: 175 KRKGGGDEGDLFDFLVDDAAKGGSKKKG 202
Score = 28.5 bits (64), Expect = 5.4
Identities = 22/85 (25%), Positives = 39/85 (45%), Gaps = 14/85 (16%)
Query: 11 KKKR-IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK-------------KKKKKK 56
+K+R ++K K+ + E+ ++ KEK+D+ E+ K KKK+K K
Sbjct: 138 RKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDEGDLFDFLVDDAAKGG 197
Query: 57 KKKKKKKEEQSVKEIEESLKERKEK 81
KKK +K + + K + K
Sbjct: 198 SKKKGRKGGAARGKPNAKRKAKDAK 222
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 36.0 bits (83), Expect = 0.024
Identities = 15/46 (32%), Positives = 20/46 (43%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
E E E ++ KK KK + K K K KK K++ SV
Sbjct: 128 EMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSV 173
Score = 34.1 bits (78), Expect = 0.097
Identities = 15/50 (30%), Positives = 19/50 (38%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
E E + +E K KK + K K K KK KKK S +
Sbjct: 128 EMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRS 177
Score = 32.2 bits (73), Expect = 0.40
Identities = 14/66 (21%), Positives = 23/66 (34%)
Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
R + E KE + ++ ++ K K K KK KKK + +
Sbjct: 127 REMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPR 186
Query: 74 SLKERK 79
S + K
Sbjct: 187 SKPDWK 192
Score = 29.9 bits (67), Expect = 2.0
Identities = 13/73 (17%), Positives = 26/73 (35%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
E +E +K + K K KK KKK + + + + + ++
Sbjct: 138 STAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKSQDLV 197
Query: 83 LTEDSQCDTMIPV 95
+ S + +PV
Sbjct: 198 INGVSMDISGMPV 210
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 35.8 bits (83), Expect = 0.027
Identities = 13/71 (18%), Positives = 34/71 (47%), Gaps = 4/71 (5%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+++ +++R K+ E+ E E+ +E E+ +++K+ K ++ +KK
Sbjct: 303 LKEIHEARLNEEERELKKKFTEKIREKEKRLEEL----EQNLIEERKELNSKLEEIQKKL 358
Query: 61 KKKEEQSVKEI 71
+ E++ K
Sbjct: 359 EDLEKRLEKLK 369
Score = 32.3 bits (74), Expect = 0.41
Identities = 16/79 (20%), Positives = 39/79 (49%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
K G K+ + N+EE + + + EK +E + + ++ +++K+ K ++
Sbjct: 295 LKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEI 354
Query: 62 KKEEQSVKEIEESLKERKE 80
+K+ + +++ E LK K
Sbjct: 355 QKKLEDLEKRLEKLKSNKS 373
Score = 31.9 bits (73), Expect = 0.49
Identities = 17/75 (22%), Positives = 38/75 (50%), Gaps = 9/75 (12%)
Query: 11 KKKRIEKR-NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ +++ N E + E + E+ E K KK +K ++K+K+ ++ E+ ++
Sbjct: 288 RTEKLSGLKNSGEPSLKEIHEARLNEEERELK-----KKFTEKIREKEKRLEELEQNLIE 342
Query: 70 EIEE---SLKERKEK 81
E +E L+E ++K
Sbjct: 343 ERKELNSKLEEIQKK 357
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid
layers. The innermost capsid (core) is made of VP2. The
genomic RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 36.1 bits (83), Expect = 0.028
Identities = 19/85 (22%), Positives = 45/85 (52%), Gaps = 8/85 (9%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEE-KNKKKKKKKKKKKKKKKKKK 60
R RR + I ++ +EK++ +Q++K + +++E+ +KK++ K++
Sbjct: 4 RNRR------EANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQS 57
Query: 61 KKKEEQSVKEIEESLKERKEKTLTE 85
++ + E+++S KE + K L E
Sbjct: 58 SQENLKIADEVKKSTKE-ESKQLLE 81
Score = 34.9 bits (80), Expect = 0.072
Identities = 7/48 (14%), Positives = 28/48 (58%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
+++ + ++K +K+++++ E++E + ++KE+ +T++
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSP 52
Score = 32.2 bits (73), Expect = 0.51
Identities = 24/105 (22%), Positives = 48/105 (45%), Gaps = 12/105 (11%)
Query: 2 RKRRGGAGEKKKRIEKRN-KEEEKEENEQEEK---EKEDVEEEKNKKKKKKKK-----KK 52
+R + EK + K+++K E +EK +KE+V + K++ K
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKI 64
Query: 53 KKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFK 97
+ KK K+E + + E+ ++ +E +++ E Q IP F+
Sbjct: 65 ADEV-KKSTKEESKQLLEVLKTKEEHQKEIQYEILQ--KTIPSFE 106
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 35.8 bits (83), Expect = 0.029
Identities = 15/62 (24%), Positives = 33/62 (53%), Gaps = 1/62 (1%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
+E + + + +KE E + E ++++K + KK K+ ++ +E++ EE + KE
Sbjct: 114 DEGEWIDVESDKEIESSDSE-DEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKE 172
Query: 81 KT 82
K
Sbjct: 173 KA 174
Score = 31.9 bits (73), Expect = 0.51
Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 19/90 (21%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK-------------------KKKKKK 49
E K+ ++ + EE EE+E+E E+E+ E EK K + + +K
Sbjct: 139 DEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPADFAKIQELRLEK 198
Query: 50 KKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
K K K++ ++ + +E +
Sbjct: 199 GVDKALGGKLKRRDKDAPERHSDELVDADD 228
Score = 31.5 bits (72), Expect = 0.60
Identities = 16/45 (35%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 20 KEEEKEEN-EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
K++ KEE ++ +ED E+ ++K KK K+ K K+K +KK
Sbjct: 236 KKQTKEERIATAKEGREDREKFGSRKGKKDKEGKSTTNKEKARKK 280
Score = 30.8 bits (70), Expect = 1.0
Identities = 13/83 (15%), Positives = 39/83 (46%), Gaps = 10/83 (12%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK----------KKKKEE 65
E+ E K+ E ++E + +EE+ ++++ + +K+K + K +E
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPADFAKIQEL 194
Query: 66 QSVKEIEESLKERKEKTLTEDSQ 88
+ K ++++L + ++ + +
Sbjct: 195 RLEKGVDKALGGKLKRRDKDAPE 217
Score = 30.4 bits (69), Expect = 1.4
Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 7/75 (9%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKK-------KKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
E E++E+ +E E DVE +K + ++K + KK K+ ++ EE + EE
Sbjct: 105 EVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEE 164
Query: 74 SLKERKEKTLTEDSQ 88
E +++ +E +
Sbjct: 165 EEAEAEKEKASELAT 179
Score = 29.2 bits (66), Expect = 2.9
Identities = 15/66 (22%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
+ E +KE + E E+ ++E KK K+ ++ ++ +++ +EE++ E E++ +
Sbjct: 120 DVESDKEIESSDS-EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELA 178
Query: 79 KEKTLT 84
+ LT
Sbjct: 179 TTRILT 184
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 36.2 bits (83), Expect = 0.029
Identities = 17/45 (37%), Positives = 29/45 (64%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
E+ N + E +E++E +E+ EEE+NK+ K+ KKK++K K
Sbjct: 760 EQDNSDSFAESSEEDESSEEEKEEEENKEVSAKRAKKKQRKNMLK 804
>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit. This
family includes the vacuolar ATP synthase E subunit, as
well as the archaebacterial ATP synthase E subunit.
Length = 195
Score = 35.0 bits (81), Expect = 0.029
Identities = 17/65 (26%), Positives = 35/65 (53%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
+E E++ E + +E+ E EK + ++ +KK ++ +KK+K+ E + I + E
Sbjct: 3 RQEAEEKAEEIRAEAEEEFEIEKAEAVEEAEKKIEEIYEKKEKQAEMEKQIIISNAKNEA 62
Query: 79 KEKTL 83
+ K L
Sbjct: 63 RLKVL 67
Score = 32.3 bits (74), Expect = 0.23
Identities = 25/106 (23%), Positives = 50/106 (47%), Gaps = 4/106 (3%)
Query: 5 RGGAGEKKKRIEKRNKEE-EKEENEQEEKEKEDVEEEKNKKKKK---KKKKKKKKKKKKK 60
R A EK + I +EE E E+ E E+ ++ +EE KK+K+ +K+ K +
Sbjct: 3 RQEAEEKAEEIRAEAEEEFEIEKAEAVEEAEKKIEEIYEKKEKQAEMEKQIIISNAKNEA 62
Query: 61 KKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIY 106
+ K + +E+ +S+ E ++ L S+ K ++ + +
Sbjct: 63 RLKVLNAREELLDSVFEEAKERLANLSEDKDEYKDLLKDLIKQALV 108
>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078). This
family is found from fungi to humans, but its exact
function is not known.
Length = 88
Score = 33.4 bits (77), Expect = 0.030
Identities = 16/48 (33%), Positives = 37/48 (77%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+++EE KE+ E+ EK +E+ E E+ +++++K+K+K+ ++++KK +E
Sbjct: 40 KDEEERKEQMEELEKAREETERERKEREERKEKRKRAIEERRKKIEER 87
Score = 32.2 bits (74), Expect = 0.077
Identities = 16/47 (34%), Positives = 35/47 (74%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
K+EE+ + + EE EK E E+ +K+++++K+K+K+ ++++KK E+
Sbjct: 40 KDEEERKEQMEELEKAREETERERKEREERKEKRKRAIEERRKKIEE 86
Score = 30.7 bits (70), Expect = 0.22
Identities = 12/47 (25%), Positives = 34/47 (72%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
E++++ + E+ +EE E+E KE+E+ +E++ + ++++KK ++++
Sbjct: 42 EEERKEQMEELEKAREETERERKEREERKEKRKRAIEERRKKIEERR 88
Score = 29.2 bits (66), Expect = 0.77
Identities = 15/46 (32%), Positives = 33/46 (71%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
E+R ++ E+ E +EE E+E E E+ K+K+K+ ++++KK ++++
Sbjct: 43 EERKEQMEELEKAREETERERKEREERKEKRKRAIEERRKKIEERR 88
Score = 28.4 bits (64), Expect = 1.4
Identities = 12/44 (27%), Positives = 32/44 (72%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
++E+ EE E+ +E E +E+ ++K+K+K+ ++++KK ++++
Sbjct: 45 RKEQMEELEKAREETERERKEREERKEKRKRAIEERRKKIEERR 88
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 35.9 bits (82), Expect = 0.032
Identities = 18/46 (39%), Positives = 20/46 (43%), Gaps = 2/46 (4%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKK--KKKKKEEQSVKEIEE 73
EE K + KKKKKKKKK K K+E E EE
Sbjct: 89 EEINKSGEFDNITDDDTPSKKKKKKKKKGWFWAKSKQESKTIETEE 134
Score = 28.2 bits (62), Expect = 7.7
Identities = 19/66 (28%), Positives = 34/66 (51%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
EE ++ + + +D K KKKKKKK K K++ K E + + ++ S+ + E
Sbjct: 89 EEINKSGEFDNITDDDTPSKKKKKKKKKGWFWAKSKQESKTIETEEIIDVGASVNQANET 148
Query: 82 TLTEDS 87
L +D+
Sbjct: 149 RLFDDT 154
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 35.0 bits (81), Expect = 0.034
Identities = 12/45 (26%), Positives = 29/45 (64%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
E+ E E+E++++E+ E K + ++ + K+K K+ +K+E++
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDE 164
Score = 33.1 bits (76), Expect = 0.14
Identities = 13/49 (26%), Positives = 29/49 (59%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
EQ E+E+E+ +E+ +++ + K + ++ + K+K +E +E EE
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRH 168
Score = 31.5 bits (72), Expect = 0.41
Identities = 12/51 (23%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
E+ +R E+ ++EE+ E K E+ + K+K K+ +K++ ++ + +
Sbjct: 120 EQLEREEEEKRDEEERERL-LRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169
Score = 30.4 bits (69), Expect = 1.2
Identities = 13/54 (24%), Positives = 28/54 (51%), Gaps = 9/54 (16%)
Query: 30 EEKEKED----VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
E+ E+E+ EEE+ + + K + ++ + K+K K+ ++ EE + R
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQK-----EEDEEMRH 168
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 35.9 bits (83), Expect = 0.036
Identities = 15/47 (31%), Positives = 20/47 (42%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
EE EE + + K N KK KKK K+ K K+E+
Sbjct: 847 VEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEE 893
Score = 32.0 bits (73), Expect = 0.56
Identities = 19/57 (33%), Positives = 24/57 (42%), Gaps = 3/57 (5%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
+K +E K EE E E E+ED + KK KKK KE +SV
Sbjct: 834 QKLMENAGKMEEYVE---ENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSV 887
Score = 28.2 bits (63), Expect = 9.7
Identities = 11/46 (23%), Positives = 24/46 (52%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
AG+ ++ +E+ +EE+ + + + +K+ KKK K+ K
Sbjct: 840 AGKMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGK 885
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 35.8 bits (83), Expect = 0.038
Identities = 20/86 (23%), Positives = 38/86 (44%), Gaps = 1/86 (1%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+ G K+ IE+ +E E+ Q++K E + E N+ + KK +K ++
Sbjct: 825 FNDKNGAYESLKELIERI-IKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYI 883
Query: 61 KKKEEQSVKEIEESLKERKEKTLTED 86
K+E+ K+ E +E+ D
Sbjct: 884 LAKKEEEFKQFAEEEGLNEEELAFYD 909
Score = 32.0 bits (73), Expect = 0.56
Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 2/52 (3%)
Query: 14 RIEKRNKEEEKEENEQE--EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
R+E EE +E +E E+ +E E+ K KKK K + + +
Sbjct: 452 RVELELIEESIKEEAEELDERIEEITEDILEKIKKKTKNLEFLAMLAVRLIR 503
Score = 30.8 bits (70), Expect = 1.4
Identities = 15/80 (18%), Positives = 31/80 (38%), Gaps = 9/80 (11%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK---------KKKKKKKKKKKKKKKKEEQ 66
+K E ++ + E ++KKK + + KK +K ++
Sbjct: 824 TFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYI 883
Query: 67 SVKEIEESLKERKEKTLTED 86
K+ EE + +E+ L E+
Sbjct: 884 LAKKEEEFKQFAEEEGLNEE 903
Score = 30.1 bits (68), Expect = 2.4
Identities = 11/51 (21%), Positives = 23/51 (45%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+ + + + +EE EE ++ +E + EK KKK K + + +
Sbjct: 453 VELELIEESIKEEAEELDERIEEITEDILEKIKKKTKNLEFLAMLAVRLIR 503
Score = 29.7 bits (67), Expect = 3.0
Identities = 13/71 (18%), Positives = 28/71 (39%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+ K+++ K EE ++E+E + EE+ +++ K + + E
Sbjct: 868 RAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELVE 927
Query: 71 IEESLKERKEK 81
K +EK
Sbjct: 928 KLAKEKSLREK 938
Score = 29.3 bits (66), Expect = 4.4
Identities = 11/54 (20%), Positives = 23/54 (42%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
G E++ + E + E + +EK+ ++K K K K++ + K
Sbjct: 899 GLNEEELAFYDDLALNGGKLPENGTELVEKLAKEKSLREKNKDDWKAKEEVEAK 952
Score = 28.9 bits (65), Expect = 5.8
Identities = 18/70 (25%), Positives = 29/70 (41%), Gaps = 11/70 (15%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKK-----------KKKKKKKKKKKKKKK 60
KK E + EE+ NE+E +D+ K K+K ++K K K
Sbjct: 886 KKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELVEKLAKEKSLREKNKDDWKA 945
Query: 61 KKKEEQSVKE 70
K++ E +K
Sbjct: 946 KEEVEAKLKA 955
Score = 28.1 bits (63), Expect = 8.4
Identities = 11/66 (16%), Positives = 24/66 (36%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
+ R K+ + E +E EEE + +++ +++ + E L
Sbjct: 866 QYRAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTEL 925
Query: 76 KERKEK 81
E+ K
Sbjct: 926 VEKLAK 931
>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
Length = 194
Score = 34.4 bits (79), Expect = 0.042
Identities = 13/70 (18%), Positives = 39/70 (55%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
++E +++ +E+ +E KK+ K+ K++ ++K+ + E+ + +++ K+ ++K
Sbjct: 3 KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKEIADLKAKNKDLEDKY 62
Query: 83 LTEDSQCDTM 92
L ++ M
Sbjct: 63 LRSQAEIQNM 72
Score = 27.5 bits (61), Expect = 8.9
Identities = 14/53 (26%), Positives = 25/53 (47%), Gaps = 5/53 (9%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK-----KKKKKKKEEQ 66
K+ +EE E +KE E K + ++K+ + +K K K K E++
Sbjct: 9 EKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKEIADLKAKNKDLEDK 61
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 35.4 bits (83), Expect = 0.045
Identities = 17/81 (20%), Positives = 39/81 (48%), Gaps = 11/81 (13%)
Query: 12 KKRIEKRNKEEEKEE-NEQEEKEKE---DVEEEKNKKKKKKKK--KKKKKKKKKKKKKEE 65
K+ + KR + +E E +E+ +E ++EE + ++ K+ + K+K + + E
Sbjct: 15 KEALAKRGFPLDVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAE 74
Query: 66 -----QSVKEIEESLKERKEK 81
+ +K +E L E + +
Sbjct: 75 VKELKEEIKALEAELDELEAE 95
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 35.1 bits (80), Expect = 0.045
Identities = 15/38 (39%), Positives = 23/38 (60%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEK 41
++ G KK E + E+E+EE E+EE+E E E E+
Sbjct: 120 KKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQ 157
Score = 32.4 bits (73), Expect = 0.31
Identities = 16/41 (39%), Positives = 28/41 (68%), Gaps = 1/41 (2%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEE-NEQEEKEKEDVEEEKN 42
K+ G AG+K + ++ +++EE+EE E+EE E E+ E+ N
Sbjct: 120 KKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTN 160
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim
have been identified in both mice and humans (mTim and
hTIM, respectively). Mammalian TIM is not the true
orthologue of Drosophila TIM, but is the likely
orthologue of a fly gene, timeout (also called tim-2).
mTim has been shown to be essential for embryonic
development, but does not have substantiated circadian
function. Some family members contain a SANT domain in
this region.
Length = 507
Score = 35.5 bits (81), Expect = 0.046
Identities = 12/76 (15%), Positives = 37/76 (48%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
E+ ++E K++ + E+ ++ + E +K +++K +++ +EEQ ++ L
Sbjct: 382 EEEPEDELKDDVDGEQADESEHETLALRKNARQRKAGLASPEEEALGEEEQKAPPKKKQL 441
Query: 76 KERKEKTLTEDSQCDT 91
++ ++ + D
Sbjct: 442 NQKNKQQTGSGTNSDE 457
Score = 29.3 bits (65), Expect = 3.7
Identities = 16/73 (21%), Positives = 33/73 (45%), Gaps = 2/73 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
KKR +K ++ E E+ +ED EEE+ + K ++ +K+ + + +
Sbjct: 225 GGKKRRKKLKPKQPNGEESGEDDFQEDPEEEEQ--LPESKPEETEKRVSAFQVEGSTLIS 282
Query: 70 EIEESLKERKEKT 82
+ ++EKT
Sbjct: 283 AENLRQQLKQEKT 295
Score = 28.5 bits (63), Expect = 6.1
Identities = 18/96 (18%), Positives = 43/96 (44%), Gaps = 10/96 (10%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK------ 55
R+ +GE+++ ++ + + E+ ++ E E + + ++K +++
Sbjct: 373 RRAASLSGEEEEPEDELKDDVDGEQADESEHETLALRKNARQRKAGLASPEEEALGEEEQ 432
Query: 56 ----KKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
KKK+ +K +Q S +ER + +L ED
Sbjct: 433 KAPPKKKQLNQKNKQQTGSGTNSDEERDDTSLDEDR 468
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 35.4 bits (82), Expect = 0.046
Identities = 10/72 (13%), Positives = 24/72 (33%), Gaps = 4/72 (5%)
Query: 3 KRRGGAGEKKK----RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
+ +GG G++ ++++KE+ + E E K +K K +
Sbjct: 745 RVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLT 804
Query: 59 KKKKKEEQSVKE 70
+ +
Sbjct: 805 AADPDAVAAKVD 816
Score = 32.7 bits (75), Expect = 0.34
Identities = 12/78 (15%), Positives = 27/78 (34%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R + G E + EK++ E+E K + + + K +K K +
Sbjct: 745 RVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLT 804
Query: 62 KKEEQSVKEIEESLKERK 79
+ +V + + +
Sbjct: 805 AADPDAVAAKVDGVSADR 822
Score = 30.0 bits (68), Expect = 2.1
Identities = 8/56 (14%), Positives = 22/56 (39%), Gaps = 1/56 (1%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE-ESLKERKEKT 82
EK+K++ E + + K K +K K + + + +++ + +
Sbjct: 763 NYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAADPDAVAAKVDGV 818
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex
which functions in messenger ribonucleoprotein
metabolism and plays a role in preventing the
transcription-associated genetic instability. Tho2,
along with four other subunits forms THO.
Length = 296
Score = 35.0 bits (81), Expect = 0.050
Identities = 17/57 (29%), Positives = 28/57 (49%)
Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
E +KE E +E++ + KKKK+KK+ K + ++E + E EKT
Sbjct: 26 KELYDKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTK 82
Score = 31.5 bits (72), Expect = 0.60
Identities = 19/63 (30%), Positives = 35/63 (55%), Gaps = 1/63 (1%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
+KE E+ E + +E + +K KKK+KK+ K KK +++ KK + ++ ++ L E
Sbjct: 30 DKEIERLEKQIKELDSSSSGIDK-KKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEE 88
Query: 79 KEK 81
K
Sbjct: 89 KSS 91
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 31.5 bits (72), Expect = 0.050
Identities = 11/25 (44%), Positives = 19/25 (76%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKK 45
EEEK E+E+E +++E+ EE+ K+
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVKR 46
Score = 26.9 bits (60), Expect = 2.2
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEK 41
K E++EENE EE+ +E E ++
Sbjct: 24 EKREDEEENEDEEEGEEQSEVKR 46
Score = 25.8 bits (57), Expect = 5.5
Identities = 8/23 (34%), Positives = 19/23 (82%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVE 38
E++ ++EE+ E+E+E +E+ +V+
Sbjct: 23 EEKREDEEENEDEEEGEEQSEVK 45
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein
7 homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 33.4 bits (77), Expect = 0.052
Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 13/56 (23%)
Query: 22 EEKEENEQEEKEKEDVEEE-------------KNKKKKKKKKKKKKKKKKKKKKKE 64
+++EE E+EEKEKE +E K +K+ K +K+KKKKKKKE
Sbjct: 31 DKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKE 86
Score = 30.3 bits (69), Expect = 0.56
Identities = 18/58 (31%), Positives = 32/58 (55%), Gaps = 6/58 (10%)
Query: 10 EKKKRIEKRNKEEEKEENE------QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+K++ EK KE+E E +E ++ + +++ K +K+KKKKKKK+ K
Sbjct: 31 DKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKELK 88
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 35.0 bits (80), Expect = 0.053
Identities = 23/77 (29%), Positives = 32/77 (41%), Gaps = 1/77 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDV-EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
E K+R K + +Q+ + E+V EEEK K K K K K K+K E +
Sbjct: 58 EAKRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTE 117
Query: 69 KEIEESLKERKEKTLTE 85
+ EE K K
Sbjct: 118 EVTEEEKAAAKAKAAAA 134
Score = 33.8 bits (77), Expect = 0.12
Identities = 24/79 (30%), Positives = 32/79 (40%), Gaps = 4/79 (5%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K + A K K ++ E E EE+E+ D +EK K K K K K+K
Sbjct: 128 KAKAAAAAKAKAAALAKQKREGTEEVTEEEEETD--KEKAKAKAAAAAKAKAAALAKQKA 185
Query: 63 KEEQSVKEIEESLKERKEK 81
E EE +E K K
Sbjct: 186 AEAGE--GTEEVTEEEKAK 202
Score = 32.6 bits (74), Expect = 0.32
Identities = 31/105 (29%), Positives = 37/105 (35%), Gaps = 26/105 (24%)
Query: 3 KRRGGAGEKKKRIE-KRNKEEEKEENEQEEKEK-------------------------ED 36
KRR A K K + K E EE +EEK K E
Sbjct: 60 KRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEV 119
Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
EEEK K K K K K+K E + + EE + KEK
Sbjct: 120 TEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEK 164
Score = 30.7 bits (69), Expect = 1.3
Identities = 19/74 (25%), Positives = 28/74 (37%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A K K + ++ E +E EEE+ K+K K K K K +Q
Sbjct: 125 AAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQK 184
Query: 68 VKEIEESLKERKEK 81
E E +E E+
Sbjct: 185 AAEAGEGTEEVTEE 198
Score = 30.3 bits (68), Expect = 1.6
Identities = 15/56 (26%), Positives = 33/56 (58%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
EK+ ED+++E ++ K++ +K+ K + K E+ +E E++L + + T+ E
Sbjct: 3 PEKDLEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEE 58
Score = 30.3 bits (68), Expect = 1.6
Identities = 18/68 (26%), Positives = 32/68 (47%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K E EEE E E+ + +D+ E+ K++ K K K+K++ + V E
Sbjct: 29 KHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALAKQKREGTEEVTEE 88
Query: 72 EESLKERK 79
E++ + K
Sbjct: 89 EKAKAKAK 96
Score = 30.3 bits (68), Expect = 1.8
Identities = 25/103 (24%), Positives = 40/103 (38%), Gaps = 21/103 (20%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEK-------EDVEEEKNKKKKKKKKKKKKKK 56
RR +K+ + K E K E E EKEK +EE K + K K
Sbjct: 16 RRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALA 75
Query: 57 KKKKKKKEEQSVKEIEE--------------SLKERKEKTLTE 85
K+K++ EE + +E + +L ++K + E
Sbjct: 76 KQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEE 118
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 34.7 bits (80), Expect = 0.055
Identities = 19/86 (22%), Positives = 45/86 (52%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
+ +E + + ++ ++ +KKK KKK+K K+ K++K+ E+ +K++E+ L+ ++E
Sbjct: 150 LDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLELQRE 209
Query: 81 KTLTEDSQCDTMIPVFKKGVLYKGIY 106
+ ++ V+YK
Sbjct: 210 LMKKGKGKKKKIVKDKDGKVVYKWKK 235
Score = 33.9 bits (78), Expect = 0.093
Identities = 19/62 (30%), Positives = 33/62 (53%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+K+R E K+R E+ K ++ E+ + ++E + K KK K K K K KK++
Sbjct: 178 KKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKKER 237
Query: 62 KK 63
K+
Sbjct: 238 KR 239
Score = 33.1 bits (76), Expect = 0.16
Identities = 19/54 (35%), Positives = 29/54 (53%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
KKKR + +E KE ++ +K ++ +E ++ KK K KKKK K K K
Sbjct: 176 SAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKV 229
Score = 32.7 bits (75), Expect = 0.19
Identities = 19/53 (35%), Positives = 30/53 (56%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
KK +KR +E +E ++ EK+ + VE+ +++ KK K KKKK K K
Sbjct: 174 KKSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKD 226
Score = 32.0 bits (73), Expect = 0.37
Identities = 15/63 (23%), Positives = 32/63 (50%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+ + K+ E++ +E++ ++ EQ + + ++ ++ KKKK K K K K
Sbjct: 174 KKSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKW 233
Query: 61 KKK 63
KK+
Sbjct: 234 KKE 236
Score = 30.0 bits (68), Expect = 1.5
Identities = 17/73 (23%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 19 NKEEEKEEN--EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE-SL 75
++EE+K + E + E ++ +N+ + + +K +K+KKK ++ K +E
Sbjct: 130 DEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKE 189
Query: 76 KERKEKTLTEDSQ 88
++ +EK L + Q
Sbjct: 190 RKEREKKLKKVEQ 202
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 34.8 bits (81), Expect = 0.056
Identities = 15/50 (30%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
E+ + +E+RN E +E+ Q E + D+E E K +++ ++ K + +K K
Sbjct: 1 ERLEALEERN-SELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLK 49
>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
This family consists of several eukaryotic ribosome
biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
protein that is essential for the maturation of 25 S
rRNA and the 60 S ribosomal subunit assembly in
Saccharomyces cerevisiae.
Length = 164
Score = 33.8 bits (78), Expect = 0.057
Identities = 23/77 (29%), Positives = 34/77 (44%), Gaps = 26/77 (33%)
Query: 11 KKKRIEKRNKE-----EEKEE----------NEQEEKE-----------KEDVEEEKNKK 44
KKK I+KR K E E N+ EK+ KED +K ++
Sbjct: 88 KKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKDWIIEVKPNDDPKEDPFAKKRRE 147
Query: 45 KKKKKKKKKKKKKKKKK 61
KK++ K +K++ K KK
Sbjct: 148 KKERVAKNEKRELKNKK 164
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 34.9 bits (79), Expect = 0.057
Identities = 21/73 (28%), Positives = 49/73 (67%), Gaps = 1/73 (1%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKK-KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
+EE+K E E+++ EKE +E ++ K +++K+K +++K+K +++K++ S + +++
Sbjct: 153 QEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELE 212
Query: 79 KEKTLTEDSQCDT 91
+EK TE+ + D
Sbjct: 213 QEKQKTENEKQDL 225
Score = 33.7 bits (76), Expect = 0.13
Identities = 17/70 (24%), Positives = 43/70 (61%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A + K +++ K E++++ ++EK+K KN + +++K+K + +K+ K+++
Sbjct: 173 ANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDF 232
Query: 68 VKEIEESLKE 77
+KE E++ +E
Sbjct: 233 IKEAEQNCQE 242
Score = 32.9 bits (74), Expect = 0.25
Identities = 20/70 (28%), Positives = 44/70 (62%), Gaps = 2/70 (2%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
E++ E+EK++ E+E E + + + ++K+K +++K+K +++K+K + IE L
Sbjct: 154 EEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIE--L 211
Query: 76 KERKEKTLTE 85
++ K+KT E
Sbjct: 212 EQEKQKTENE 221
Score = 31.0 bits (69), Expect = 1.1
Identities = 20/77 (25%), Positives = 41/77 (53%), Gaps = 6/77 (7%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNK------KKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
+++K E Q +KE E+ + NK ++++K +++K+K +K+ + +K +E
Sbjct: 124 QDKKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEK 183
Query: 75 LKERKEKTLTEDSQCDT 91
K +EK TE + T
Sbjct: 184 QKTEQEKQKTEQEKQKT 200
Score = 30.6 bits (68), Expect = 1.3
Identities = 17/79 (21%), Positives = 42/79 (53%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E++K+ ++ E + E+EK+ E+EK K +++K+K K + ++E+
Sbjct: 159 EQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKT 218
Query: 70 EIEESLKERKEKTLTEDSQ 88
E E+ +++K ++++
Sbjct: 219 ENEKQDLIKEQKDFIKEAE 237
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 34.8 bits (80), Expect = 0.059
Identities = 26/113 (23%), Positives = 54/113 (47%), Gaps = 15/113 (13%)
Query: 10 EKKKRI-EKRNKE-----EEKEENEQEE-KEKEDVEEEKNKKKKK--KKKKKKKKKKKKK 60
+KK +K NK+ +EK EQEE +E + E+E+ ++++ +K++++++ K+K
Sbjct: 119 KKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRK 178
Query: 61 KKKE-----EQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCP 108
K+ E S E + + K+ ++ + Q + K GI
Sbjct: 179 NKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKP-KPEKPNTFSTGIKMG 230
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 34.7 bits (80), Expect = 0.060
Identities = 20/85 (23%), Positives = 41/85 (48%), Gaps = 5/85 (5%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+RK E+ + +K+ ++ E + E E KK K + K+K K+++K
Sbjct: 205 LRKEADELHEE---FVELSKKIDELHEEFRNLQNE--LRELEKKIKALRAKEKAAKRREK 259
Query: 61 KKKEEQSVKEIEESLKERKEKTLTE 85
+++ ++ +EI E K ++ T E
Sbjct: 260 REELKERAEEIYEKFKRGEKLTTEE 284
Score = 33.5 bits (77), Expect = 0.13
Identities = 20/89 (22%), Positives = 44/89 (49%), Gaps = 3/89 (3%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+ G K + + E+K++ E+E +K K+ +K+ + KK ++ +K
Sbjct: 101 EFNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKL 160
Query: 63 KEEQS-VKEIEESLKERKEK--TLTEDSQ 88
KE ++ + E+++ +E EK L ++Q
Sbjct: 161 KELKAEIDELKKKAREIHEKIQELANEAQ 189
Score = 33.1 bits (76), Expect = 0.19
Identities = 24/106 (22%), Positives = 54/106 (50%), Gaps = 9/106 (8%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQE------EKEKEDVEEEKNKKKKKKKKKKKKK 55
+++R + I+ +E E+ E +Q+ E+E+E V++ K +K+ + KK +
Sbjct: 96 KEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALE 155
Query: 56 KKKKKK--KKEEQSVKEIEESLKERKEKTLTEDSQC-DTMIPVFKK 98
+ +K K K E +K+ + E+ ++ E + + MI +F++
Sbjct: 156 ENEKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIKLFEE 201
Score = 30.8 bits (70), Expect = 0.90
Identities = 17/71 (23%), Positives = 43/71 (60%), Gaps = 2/71 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K++ K EE KE+ ++ KE ++ E++++ K ++ ++K ++ ++++ E +
Sbjct: 12 ELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDE--INE 69
Query: 70 EIEESLKERKE 80
E++E ++R E
Sbjct: 70 EVQELKEKRDE 80
Score = 30.4 bits (69), Expect = 1.2
Identities = 17/73 (23%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 10 EKKKRIEKRNK-EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
E+++ ++K + +E E+ ++ +E E ++E K + + KKK ++ +K ++ E Q
Sbjct: 132 EERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEKIQELANEAQEY 191
Query: 69 KE-IEESLKERKE 80
E + + +E E
Sbjct: 192 HEEMIKLFEEADE 204
Score = 29.3 bits (66), Expect = 2.8
Identities = 17/77 (22%), Positives = 41/77 (53%), Gaps = 6/77 (7%)
Query: 10 EKKKRIEKR----NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+ KR + + +E+++E +E E + +E N K ++ ++K ++ ++++ + EE
Sbjct: 11 LELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEE 70
Query: 66 QSVKEIEESLKERKEKT 82
V+E++E E K
Sbjct: 71 --VQELKEKRDEINAKL 85
Score = 28.9 bits (65), Expect = 4.3
Identities = 21/86 (24%), Positives = 45/86 (52%), Gaps = 2/86 (2%)
Query: 3 KRRGGAGEKKKRIEKRN--KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
KR + ++ EK +EE E NE+ ++ KE +E K ++ +K+ ++ K+K+ +
Sbjct: 42 KRDELNAKVRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNE 101
Query: 61 KKKEEQSVKEIEESLKERKEKTLTED 86
+S+K +E ++ ++K T
Sbjct: 102 FNLGGRSIKSLEREIERLEKKQQTSV 127
Score = 28.5 bits (64), Expect = 5.6
Identities = 15/72 (20%), Positives = 40/72 (55%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EK+ + K E ++ +E K +E E+ + ++++ + ++ ++ K+K+ + ++
Sbjct: 27 EKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEEVQELKEKRDEINAKLQ 86
Query: 70 EIEESLKERKEK 81
E+ + +E KEK
Sbjct: 87 ELRKEYRELKEK 98
Score = 28.5 bits (64), Expect = 5.9
Identities = 13/68 (19%), Positives = 38/68 (55%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
++ E + ++ KE++EE K K+ + +K+ + +K+ + + + ++E + L+E ++
Sbjct: 6 DKLDELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERD 65
Query: 81 KTLTEDSQ 88
+ E +
Sbjct: 66 EINEEVQE 73
Score = 28.1 bits (63), Expect = 6.3
Identities = 17/79 (21%), Positives = 37/79 (46%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
R+ K+KR E + E+E + E ++ +++++ +K K+ +K+
Sbjct: 88 LRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKEL 147
Query: 63 KEEQSVKEIEESLKERKEK 81
++ + E E LKE K +
Sbjct: 148 EDAKKALEENEKLKELKAE 166
Score = 27.7 bits (62), Expect = 8.5
Identities = 19/68 (27%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE--QSVKEIEE 73
E K ++ +E +E KEK D ++ + +K+ + K ++ ++K +E + EI E
Sbjct: 10 ELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINE 69
Query: 74 SLKERKEK 81
++E KEK
Sbjct: 70 EVQELKEK 77
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 34.5 bits (79), Expect = 0.061
Identities = 23/91 (25%), Positives = 49/91 (53%), Gaps = 21/91 (23%)
Query: 16 EKRNKEEEKEENEQEE-------------------KEKEDVEEEKNKKKKKKKKKKKKKK 56
E + EEE+EE+E+EE K++ ++E + + K+K +++ K+
Sbjct: 10 EDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEEEAKR 69
Query: 57 KKKKKKKEEQSV--KEIEESLKERKEKTLTE 85
K +++K+E + +E+++ L+ +K TL E
Sbjct: 70 KAEERKRETLKIVEEEVKKELELKKRNTLLE 100
Score = 31.8 bits (72), Expect = 0.50
Identities = 21/72 (29%), Positives = 42/72 (58%), Gaps = 4/72 (5%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK----KKKKKKKKEEQSVKEI 71
E+ ++ E+EE E EE+E+ D E++ + K +KK + +++++ KE+ +E
Sbjct: 8 EEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEEEA 67
Query: 72 EESLKERKEKTL 83
+ +ERK +TL
Sbjct: 68 KRKAEERKRETL 79
Score = 28.0 bits (62), Expect = 7.7
Identities = 17/54 (31%), Positives = 28/54 (51%), Gaps = 7/54 (12%)
Query: 16 EKRNKEEEKEENEQEEKEKEDV-------EEEKNKKKKKKKKKKKKKKKKKKKK 62
E + + ++EE E+ E+EK ++ EEE+ + +K K K KK K K
Sbjct: 124 ELKRIKRDREEREEMEREKAEIEKMRNMTEEERRAELRKNPKVSTNKAKKGKYK 177
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 34.1 bits (79), Expect = 0.062
Identities = 18/85 (21%), Positives = 44/85 (51%), Gaps = 6/85 (7%)
Query: 2 RKRRGGAGEKKKRIEK-RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
K + + KK +E+ + + E + ++ K+ + EE+ + ++ K+ +K+ KK K
Sbjct: 65 NKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKA 124
Query: 61 K-----KKEEQSVKEIEESLKERKE 80
+ K + + +++++E K KE
Sbjct: 125 ELEKYEKNDPERIEKLKEETKVAKE 149
Score = 29.1 bits (66), Expect = 2.7
Identities = 21/64 (32%), Positives = 33/64 (51%), Gaps = 8/64 (12%)
Query: 10 EKKKRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK-------KKKKKKKK 61
E + +IEK K EE EE + +E + +E+E K K + +K +K K K++ K
Sbjct: 87 ELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAELEKYEKNDPERIEKLKEETKV 146
Query: 62 KKEE 65
KE
Sbjct: 147 AKEA 150
Score = 28.7 bits (65), Expect = 2.9
Identities = 11/65 (16%), Positives = 39/65 (60%), Gaps = 1/65 (1%)
Query: 12 KKRIEKRNKEEE-KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+ + + + E+ K+E E+ ++ +++ + K KK +++ +++ + ++ K+ E+ +K+
Sbjct: 62 QALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKK 121
Query: 71 IEESL 75
++ L
Sbjct: 122 LKAEL 126
Score = 28.0 bits (63), Expect = 5.6
Identities = 15/72 (20%), Positives = 40/72 (55%), Gaps = 2/72 (2%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
+ + + EK K+++EE K + + + + +K KK +++ ++ + ++E+++ KE K+
Sbjct: 62 QALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKK 121
Query: 81 --KTLTEDSQCD 90
L + + D
Sbjct: 122 LKAELEKYEKND 133
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 34.9 bits (81), Expect = 0.065
Identities = 17/66 (25%), Positives = 25/66 (37%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E K R E R E+E+ +E + K+ E K K + K KK +
Sbjct: 448 AEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIV 507
Query: 68 VKEIEE 73
+K
Sbjct: 508 IKAGAR 513
Score = 34.2 bits (79), Expect = 0.10
Identities = 16/69 (23%), Positives = 36/69 (52%), Gaps = 4/69 (5%)
Query: 14 RIEK---RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
R K R E+EK++ E E K + + + + +++K ++ + KK + + K++ +V
Sbjct: 432 RQAKAEIRAIEQEKKKAE-EAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAA 490
Query: 71 IEESLKERK 79
+K +K
Sbjct: 491 ALARVKAKK 499
Score = 29.1 bits (66), Expect = 3.7
Identities = 17/79 (21%), Positives = 32/79 (40%), Gaps = 12/79 (15%)
Query: 10 EKKKRIEKRNKEEEKEEN-EQEEKEKEDVEEEKNKKKKKKKKKKKKK-------KKKKKK 61
EKKK E + + E ++ E+E+ + E ++KK + + K K + K KK
Sbjct: 444 EKKKAEEAKARFEARQARLEREKAAR----EARHKKAAEARAAKDKDAVAAALARVKAKK 499
Query: 62 KKEEQSVKEIEESLKERKE 80
Q + + +
Sbjct: 500 AAATQPIVIKAGARPDNSA 518
>gnl|CDD|224237 COG1318, COG1318, Predicted transcriptional regulators
[Transcription].
Length = 182
Score = 34.1 bits (78), Expect = 0.066
Identities = 21/81 (25%), Positives = 32/81 (39%), Gaps = 4/81 (4%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK----KKKKKKK 59
+ GG + IEK KE K E +K E K+ ++K K K + ++
Sbjct: 100 KEGGLDAVEVEIEKLEKEGLKIRWAVEVLKKIKGEHFPMDKELLEEKLKGEVIKGEPLEE 159
Query: 60 KKKKEEQSVKEIEESLKERKE 80
K E V+ + L E K
Sbjct: 160 VIDKLEYPVESPADLLHEIKI 180
Score = 31.0 bits (70), Expect = 0.68
Identities = 14/79 (17%), Positives = 36/79 (45%), Gaps = 5/79 (6%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDV-----EEEKNKKKKKKKKKKKKKKKKKKKK 62
E +++++ + + E E+ EKE + +K K + K+ ++K K +
Sbjct: 92 VRETYEKLKEGGLDAVEVEIEKLEKEGLKIRWAVEVLKKIKGEHFPMDKELLEEKLKGEV 151
Query: 63 KEEQSVKEIEESLKERKEK 81
+ + ++E+ + L+ E
Sbjct: 152 IKGEPLEEVIDKLEYPVES 170
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 34.8 bits (81), Expect = 0.066
Identities = 8/42 (19%), Positives = 15/42 (35%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
EK E + + + K K +K K++ +K
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDG 856
Score = 33.2 bits (77), Expect = 0.20
Identities = 10/41 (24%), Positives = 16/41 (39%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
EK ++ K KK K +K K++ S K+
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855
Score = 32.8 bits (76), Expect = 0.31
Identities = 12/42 (28%), Positives = 17/42 (40%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
EE E E+ EK + K KK K +K K++
Sbjct: 808 EEALELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQY 849
Score = 32.5 bits (75), Expect = 0.37
Identities = 9/40 (22%), Positives = 14/40 (35%)
Query: 33 EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
EK + K KK K +K K++ E +
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855
Score = 29.4 bits (67), Expect = 3.4
Identities = 10/42 (23%), Positives = 17/42 (40%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ EK + + KK K +K K++ E+ K
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 34.1 bits (78), Expect = 0.068
Identities = 19/85 (22%), Positives = 34/85 (40%), Gaps = 10/85 (11%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK---------- 56
AG+ + E EE + ++ KE E++N+ K+ +K ++ K
Sbjct: 151 EAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEED 210
Query: 57 KKKKKKKEEQSVKEIEESLKERKEK 81
++ E S EE KE E+
Sbjct: 211 YDEEDNPVEDSKAIKEELAKEPVEE 235
Score = 34.1 bits (78), Expect = 0.080
Identities = 18/77 (23%), Positives = 34/77 (44%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
G K+K + E+ E EE+ + V + K K +++ K++ + +K K+E+
Sbjct: 66 GLKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDG 125
Query: 69 KEIEESLKERKEKTLTE 85
E L E K +
Sbjct: 126 PGKEPQLDEDKFLLAED 142
Score = 33.8 bits (77), Expect = 0.090
Identities = 17/82 (20%), Positives = 34/82 (41%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
G+ R K+ EEE +E Q EK V +++ K+ + + K + + Q
Sbjct: 88 EEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQ 147
Query: 67 SVKEIEESLKERKEKTLTEDSQ 88
E + +E ++ E++
Sbjct: 148 ETLEAGKVHEETEDSYHVEETA 169
Score = 29.5 bits (66), Expect = 2.0
Identities = 16/82 (19%), Positives = 35/82 (42%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
G EK EE + E+E + ++K +++ K++ + +K K++E+
Sbjct: 66 GLKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDG 125
Query: 67 SVKEIEESLKERKEKTLTEDSQ 88
KE + + ++D Q
Sbjct: 126 PGKEPQLDEDKFLLAEDSDDRQ 147
Score = 29.1 bits (65), Expect = 3.4
Identities = 15/71 (21%), Positives = 31/71 (43%), Gaps = 6/71 (8%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEE------EKNKKKKKKKKKKKKKKKKKKKKK 63
E+ + + +EK ++ E KE VE+ E + ++ ++ + K K
Sbjct: 166 EETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIK 225
Query: 64 EEQSVKEIEES 74
EE + + +EE
Sbjct: 226 EELAKEPVEEQ 236
Score = 28.7 bits (64), Expect = 3.7
Identities = 13/73 (17%), Positives = 33/73 (45%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
+K E + +E E + +E ED + ++ K+ K+K +++ + ++
Sbjct: 133 DEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEP 192
Query: 68 VKEIEESLKERKE 80
V++ E + E +
Sbjct: 193 VEKAERTKAETDD 205
Score = 28.7 bits (64), Expect = 4.0
Identities = 13/78 (16%), Positives = 40/78 (51%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+K + + +E+ + Q +++K + E+ + +++ + K ++ + EE + +
Sbjct: 112 LLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASE 171
Query: 70 EIEESLKERKEKTLTEDS 87
+ ++ +KE+ + EDS
Sbjct: 172 QYKQDMKEKASEQENEDS 189
Score = 28.7 bits (64), Expect = 4.6
Identities = 12/76 (15%), Positives = 35/76 (46%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K++++ ++ + E++ KE +E + +++ + K +E +
Sbjct: 104 EVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSY 163
Query: 70 EIEESLKERKEKTLTE 85
+EE+ E+ ++ + E
Sbjct: 164 HVEETASEQYKQDMKE 179
Score = 28.4 bits (63), Expect = 6.0
Identities = 20/85 (23%), Positives = 37/85 (43%), Gaps = 1/85 (1%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
G G++ + E + E ++ QE E V EE ++ ++ K+ K+K E
Sbjct: 124 DGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASE 183
Query: 66 QSVKEIEESLKERKEKTLTEDSQCD 90
Q ++ +E E+ E+T E
Sbjct: 184 QENEDSKEP-VEKAERTKAETDDVT 207
Score = 28.0 bits (62), Expect = 6.6
Identities = 14/67 (20%), Positives = 31/67 (46%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
+ + K + EE E EE + +K K+K +++ K++ + E+ V + EE
Sbjct: 65 LGLKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEED 124
Query: 75 LKERKEK 81
++ +
Sbjct: 125 GPGKEPQ 131
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 34.8 bits (80), Expect = 0.068
Identities = 17/55 (30%), Positives = 24/55 (43%), Gaps = 4/55 (7%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
E+K K KKK K K K KKK + + +++ K+R T MI
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSK----AKSKKTTKKRAGPLYTPSPALAAMI 798
Score = 31.7 bits (72), Expect = 0.71
Identities = 15/36 (41%), Positives = 19/36 (52%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E++ K K+ K K K KKK K K KK KK+
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783
Score = 29.8 bits (67), Expect = 3.0
Identities = 14/37 (37%), Positives = 19/37 (51%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
E + + K+K + K K KKK K K KK KK+
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784
Score = 28.2 bits (63), Expect = 7.7
Identities = 14/37 (37%), Positives = 17/37 (45%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
E+K K + K K K KKK K K KK K+
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784
Score = 28.2 bits (63), Expect = 7.8
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EK+ + ++ + K K K KKK K K KK ++++
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGP 786
>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin).
Length = 585
Score = 35.0 bits (80), Expect = 0.068
Identities = 14/66 (21%), Positives = 33/66 (50%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
RR ++K ++ N+ E N Q + + V EE K + K+K+++ +++K
Sbjct: 141 FPRRYEDNSEEKHSKRTNEIVEVFYNPQSQATLKSVFEEVGKLDGQSKQKRERSDEREKS 200
Query: 62 KKEEQS 67
++++
Sbjct: 201 YQDDED 206
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 32.9 bits (75), Expect = 0.071
Identities = 20/74 (27%), Positives = 38/74 (51%), Gaps = 11/74 (14%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK-----------KKKKKKKKK 58
++ EK+ KE + E + E+ E ++ +NKK + + + KK KKK+
Sbjct: 51 PAEEVKEKKKKEGTESEEDDEKMEVDNKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKR 110
Query: 59 KKKKKEEQSVKEIE 72
+KKKK+++ K +
Sbjct: 111 EKKKKKKKGAKNLA 124
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
This domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 32.1 bits (74), Expect = 0.071
Identities = 12/39 (30%), Positives = 21/39 (53%), Gaps = 10/39 (25%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+KKK+KK+ K + K+ KK +L++RK +
Sbjct: 61 RKKKEKKEVKAESKRYNAKK----------LTLEQRKAR 89
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 34.6 bits (79), Expect = 0.072
Identities = 16/72 (22%), Positives = 30/72 (41%), Gaps = 5/72 (6%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK-----KKKKKKKKEEQSVKEI 71
K KE+EKE + + E+E K K K + K K + ++ KE+
Sbjct: 233 KSFKEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEV 292
Query: 72 EESLKERKEKTL 83
+ + ++K +
Sbjct: 293 MKKALKMEKKAI 304
Score = 33.5 bits (76), Expect = 0.17
Identities = 17/61 (27%), Positives = 28/61 (45%), Gaps = 2/61 (3%)
Query: 1 MRKRRGGAGEKKKRIEK--RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
M+K R E R++ K + + +N+ E + + KKK K+ KK K +K
Sbjct: 242 MKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEK 301
Query: 59 K 59
K
Sbjct: 302 K 302
Score = 33.5 bits (76), Expect = 0.18
Identities = 17/74 (22%), Positives = 30/74 (40%), Gaps = 6/74 (8%)
Query: 10 EKKKRIEKRNKEE------EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E++K ++K K E K + K + + E + KKK K+ KK
Sbjct: 237 EQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKA 296
Query: 64 EEQSVKEIEESLKE 77
+ K I+ + K+
Sbjct: 297 LKMEKKAIKNAAKD 310
Score = 30.0 bits (67), Expect = 2.1
Identities = 17/62 (27%), Positives = 34/62 (54%), Gaps = 3/62 (4%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK---KKKKEEQSVKEIEESLKER 78
+E ++ EE++++ E KN++K+ K K + + K+ KK + +K +E KE
Sbjct: 183 DEDYPDDMEERDRKRYSEAKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQEKEM 242
Query: 79 KE 80
K+
Sbjct: 243 KK 244
Score = 29.2 bits (65), Expect = 3.4
Identities = 16/66 (24%), Positives = 30/66 (45%), Gaps = 5/66 (7%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ-----SVKEIEESLKERKEKT 82
EQE++ K+ + E+ + K K K + K K E + S +++ KE +K
Sbjct: 237 EQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKA 296
Query: 83 LTEDSQ 88
L + +
Sbjct: 297 LKMEKK 302
Score = 29.2 bits (65), Expect = 4.0
Identities = 15/50 (30%), Positives = 18/50 (36%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
A K K K E E E K+ +E K K +KK K K
Sbjct: 260 AALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEKKAIKNAAK 309
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 33.0 bits (76), Expect = 0.076
Identities = 17/52 (32%), Positives = 36/52 (69%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
KK I + ++ K E+E+ ++EK+ +EK +++K++K++ K++KKK K +
Sbjct: 79 KKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 29.9 bits (68), Expect = 0.96
Identities = 13/29 (44%), Positives = 26/29 (89%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
E E+NK++KKK+ K+KK+++K++K++ +Q
Sbjct: 94 EHERNKQEKKKRSKEKKEEEKERKRQLKQ 122
Score = 29.9 bits (68), Expect = 1.0
Identities = 12/32 (37%), Positives = 25/32 (78%)
Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
K + E K +KKK+ K+KK+++K++K++ K++
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEKERKRQLKQQ 123
Score = 28.0 bits (63), Expect = 3.8
Identities = 11/32 (34%), Positives = 24/32 (75%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
E ++KKK+ K+KK+++K++K++ ++Q K
Sbjct: 96 ERNKQEKKKRSKEKKEEEKERKRQLKQQKKKA 127
Score = 28.0 bits (63), Expect = 4.2
Identities = 11/39 (28%), Positives = 29/39 (74%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
K + ++ K++KKK+ K+KK++E++ +++++ K+ K +
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 26.8 bits (60), Expect = 9.7
Identities = 14/47 (29%), Positives = 27/47 (57%), Gaps = 12/47 (25%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
E+ K+ +K+ +E+KE EE++ K++ K++KKK K +
Sbjct: 96 ERNKQEKKKRSKEKKE------------EEKERKRQLKQQKKKAKHR 130
>gnl|CDD|227818 COG5531, COG5531, SWIB-domain-containing proteins implicated in
chromatin remodeling [Chromatin structure and dynamics].
Length = 237
Score = 34.0 bits (78), Expect = 0.077
Identities = 14/68 (20%), Positives = 24/68 (35%), Gaps = 4/68 (5%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R R ++ +N ++E++ E E + K+ KKKK K
Sbjct: 63 RVLRKEKYNITRKTTGKNDLPKEEDS----SLPSSKETENGDTEGKETDKKKKSSTISKN 118
Query: 62 KKEEQSVK 69
+ VK
Sbjct: 119 SPSGEKVK 126
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this
family (Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 32.2 bits (74), Expect = 0.077
Identities = 13/34 (38%), Positives = 23/34 (67%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
E++E+EK EEE + K K+++ ++K +K KK
Sbjct: 2 ERKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35
Score = 31.1 bits (71), Expect = 0.21
Identities = 12/36 (33%), Positives = 23/36 (63%)
Query: 29 QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+E KE+E + E+ K+ K K+++ ++K +K KK
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEKLEKIKKV 36
Score = 28.0 bits (63), Expect = 2.5
Identities = 7/35 (20%), Positives = 27/35 (77%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
+E+ +++K +++++ K+ K K+++ E+ +++I++
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35
Score = 26.8 bits (60), Expect = 6.6
Identities = 13/52 (25%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
+E++E E+ ++E+E ++ KN K+++ ++K +K KK + + S +++ +
Sbjct: 1 KERKEEEKAQREEE-LKRLKNLKREEIEEKLEKIKKVAGLRGADLSEEDLAD 51
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 34.4 bits (79), Expect = 0.079
Identities = 15/72 (20%), Positives = 41/72 (56%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+ ++ E +++E ++++++KE + +K KK+ K K++ + +K +E+ ++
Sbjct: 271 WESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQEERKEQ 330
Query: 71 IEESLKERKEKT 82
+ + KE KE+
Sbjct: 331 LAKLRKEEKERE 342
Score = 31.7 bits (72), Expect = 0.59
Identities = 21/87 (24%), Positives = 51/87 (58%), Gaps = 10/87 (11%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK---- 57
R++ + +K + + K + E KE +E KE+++ +E + +KKKKK+ K K++
Sbjct: 261 RRKLPPSTKKWESLVKFLRTERKE---KEAKEQQEKKELEQRKKKKKEMAPKVKQRFEAN 317
Query: 58 ---KKKKKKEEQSVKEIEESLKERKEK 81
+K +++ ++ + ++ + KER+++
Sbjct: 318 DPAQKLQEERKEQLAKLRKEEKEREKE 344
Score = 30.1 bits (68), Expect = 1.5
Identities = 14/38 (36%), Positives = 25/38 (65%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
EK++ E++ + +EEE NK +KK+++ + KKK K
Sbjct: 15 EKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKFKAN 52
Score = 29.4 bits (66), Expect = 3.1
Identities = 20/88 (22%), Positives = 33/88 (37%), Gaps = 2/88 (2%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEK--EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+RR EK K I +++ E EQ++ +++ KK K K K
Sbjct: 72 ERREEVREKSKAILLSSQKPFSFYEREEQKKAILPRKLRSSTSEREPKKFKAKPVPKSIY 131
Query: 61 KKKEEQSVKEIEESLKERKEKTLTEDSQ 88
+ ++E E K R + E Q
Sbjct: 132 IPLLKDKMQEEELKRKIRVQMRAQELLQ 159
Score = 29.0 bits (65), Expect = 4.2
Identities = 14/113 (12%), Positives = 38/113 (33%), Gaps = 13/113 (11%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
++ E+ + KK + K + ++K++E ++ ++ L++RK+K
Sbjct: 243 TYLDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKK 302
Query: 82 TLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRY-SHRQRQK 133
+ + +E E + ++R+K
Sbjct: 303 ------------KKEMAPKVKQRFEANDPAQKLQEERKEQLAKLRKEEKEREK 343
Score = 28.2 bits (63), Expect = 7.5
Identities = 15/71 (21%), Positives = 42/71 (59%), Gaps = 10/71 (14%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKN----------KKKKKKKKKKKKKKKKKK 60
+ +R EK KE+++++ ++ K+K+ K +K ++++K++ K +K++
Sbjct: 279 RTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQEERKEQLAKLRKEE 338
Query: 61 KKKEEQSVKEI 71
K++E++ +E+
Sbjct: 339 KEREKEYEQEL 349
Score = 27.8 bits (62), Expect = 9.0
Identities = 20/73 (27%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK-- 69
K K +K E+ + E E+E ++++KK+ +++KKKKK+ K +Q +
Sbjct: 258 KPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEAN 317
Query: 70 -EIEESLKERKEK 81
++ +ERKE+
Sbjct: 318 DPAQKLQEERKEQ 330
>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease. The Slx4 protein is a
heteromeric structure-specific endonuclease found in
fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
substrates, particularly simple-Y, 5'-flap, or
replication fork structures by cleaving the strand
bearing the 5' non-homologous arm at the branch junction
and thus generating ligatable nicked products from
5'-flap or replication fork substrates.
Length = 627
Score = 34.6 bits (79), Expect = 0.080
Identities = 17/82 (20%), Positives = 35/82 (42%), Gaps = 8/82 (9%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNK--------KKKKKKKKKKKKKKKKKKKKE 64
R++ +EE + ++ + E ++ K+ KKKK KK K +K+ K++
Sbjct: 30 GRLDDFEEEENQRLQLRQVISRFTSFESDDQANSGNVSGKRVPKKKKIKKPKLRKRTKRK 89
Query: 65 EQSVKEIEESLKERKEKTLTED 86
+ +K + +E E
Sbjct: 90 NKKIKSLTAFNEENFETDRAPS 111
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 33.7 bits (77), Expect = 0.080
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEE 65
E+K K K KK K K KK + KKK +E
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKAREG 116
Score = 33.3 bits (76), Expect = 0.13
Identities = 15/29 (51%), Positives = 16/29 (55%)
Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
EK KK K KK K K KK + KKK E
Sbjct: 87 TAVEKKKKGKSKKNKLKGKKDEDKKKARE 115
Score = 32.9 bits (75), Expect = 0.15
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
VE++K K KK K K KK + KKK ++
Sbjct: 89 VEKKKKGKSKKNKLKGKKDEDKKKAREG 116
Score = 30.2 bits (68), Expect = 1.1
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQS 67
KKKK K KK K K KK + KK+ +
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKARE 115
Score = 27.9 bits (62), Expect = 7.6
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKE 70
KKKK K KK K K KK ++++ +E
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKARE 115
>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
coiled-coil domain [Function unknown].
Length = 231
Score = 34.1 bits (78), Expect = 0.081
Identities = 21/100 (21%), Positives = 46/100 (46%), Gaps = 16/100 (16%)
Query: 10 EKKKRIEKRNKEE----------------EKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
E+ KR+ N E K Q+ K+DVEE + + ++++K+
Sbjct: 5 EEIKRVLLENPEILVDVLTQRPEILYEVLAKLTPWQQLATKQDVEELRKETEQRQKELAD 64
Query: 54 KKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
+K + +K+K ++ +K ++ +E T + + +T+I
Sbjct: 65 EKLEVRKQKATKEDLKLLQRFQEEEFRATKEDIKRLETII 104
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 34.5 bits (80), Expect = 0.081
Identities = 20/62 (32%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
+ EQ E+E + ++ + ++K+KKK+KKK KKKK + +I + LKE + L
Sbjct: 375 LQRVEQLEQELKQLKAQP-VGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQDL 433
Query: 84 TE 85
Sbjct: 434 EL 435
Score = 33.7 bits (78), Expect = 0.14
Identities = 20/67 (29%), Positives = 37/67 (55%), Gaps = 2/67 (2%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+R+E+ E+E ++ + + ++EK K+KKK KKKK K + K K +++ ++
Sbjct: 375 LQRVEQL--EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQD 432
Query: 72 EESLKER 78
E LK
Sbjct: 433 LELLKNV 439
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 33.5 bits (77), Expect = 0.082
Identities = 15/57 (26%), Positives = 33/57 (57%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+E+ Q +K D+E E K++K+ +K +K+ K K+ K +++ ++ + + K E
Sbjct: 37 QESPQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEI 93
Score = 31.2 bits (71), Expect = 0.52
Identities = 17/83 (20%), Positives = 40/83 (48%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+ + + + + +KR KE +K + E + KE + ++ K + + K + + KK+K
Sbjct: 41 QAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVN 100
Query: 63 KEEQSVKEIEESLKERKEKTLTE 85
+ +E E+ L R+ + +
Sbjct: 101 AFNKKQQEYEKDLNRREAEEEQK 123
Score = 27.4 bits (61), Expect = 8.3
Identities = 15/76 (19%), Positives = 37/76 (48%), Gaps = 8/76 (10%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEE--------EKNKKKKKKKKKKKKKKKKKKK 61
E KKR ++ K +++ + ++ + + + E E KK+K KK+++ +K
Sbjct: 52 EFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVNAFNKKQQEYEK 111
Query: 62 KKEEQSVKEIEESLKE 77
+ +E ++ L++
Sbjct: 112 DLNRREAEEEQKLLEK 127
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 33.9 bits (78), Expect = 0.082
Identities = 16/55 (29%), Positives = 27/55 (49%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
E ++++ + K+ K KKK KKK+K K+ E VKE ++ +E
Sbjct: 1 ELLLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIR 55
>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein. YkyA is a
family of proteins containing a lipoprotein signal and a
hydrolase domain. It is similar to cell wall binding
proteins and might also be recognisable by a host immune
defence system. It is thus likely to belong to pathways
important for pathogenicity.
Length = 205
Score = 33.9 bits (78), Expect = 0.085
Identities = 25/79 (31%), Positives = 43/79 (54%), Gaps = 8/79 (10%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEE--EKNKKKKKKKKKKKKK------KKKKKK 61
EK EK +EE+K+ E E+KE+E ++ + K + KK + K++K
Sbjct: 26 EKAAEQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQALANVDKREKLL 85
Query: 62 KKEEQSVKEIEESLKERKE 80
KKE++S+++ EE K K+
Sbjct: 86 KKEKESIEKSEEEFKSAKK 104
Score = 30.0 bits (68), Expect = 1.3
Identities = 17/74 (22%), Positives = 38/74 (51%), Gaps = 3/74 (4%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
G + + + ++ + ++ +KE EK++++ K KK +K + KK KK+
Sbjct: 60 GKDDNEAIKKLSDQALANVDKREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKK-- 117
Query: 67 SVKEIEESLKERKE 80
K++ + +KER +
Sbjct: 118 -AKQLVKVMKERYK 130
Score = 28.9 bits (65), Expect = 3.4
Identities = 22/88 (25%), Positives = 43/88 (48%), Gaps = 5/88 (5%)
Query: 6 GGAGEKKKRIEKRNKEEEK----EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
G KK E ++ EK E+ +EE++K + E+K ++ + + K + K
Sbjct: 9 TGCTTGKKPEESIYEQLEKAAEQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIK 68
Query: 62 KKEEQSVKEIEESLKE-RKEKTLTEDSQ 88
K +Q++ +++ K +KEK E S+
Sbjct: 69 KLSDQALANVDKREKLLKKEKESIEKSE 96
Score = 27.3 bits (61), Expect = 9.9
Identities = 15/77 (19%), Positives = 44/77 (57%), Gaps = 1/77 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQ-EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
+K++ + + + K++NE ++ + + ++K KK+K+ +K +++ K ++ +
Sbjct: 47 KKEQELYDQIIQLGKDDNEAIKKLSDQALANVDKREKLLKKEKESIEKSEEEFKSAKKYI 106
Query: 69 KEIEESLKERKEKTLTE 85
++IE+ ++K K L +
Sbjct: 107 EKIEDKKLKKKAKQLVK 123
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 33.1 bits (76), Expect = 0.087
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKK 50
+EE EE E +E++ ++ E E+ + KK K
Sbjct: 117 DEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
Score = 31.5 bits (72), Expect = 0.29
Identities = 10/32 (31%), Positives = 23/32 (71%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK 52
++++E+ E+E+ E++D E+E +++ KK K
Sbjct: 115 DDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
Score = 30.0 bits (68), Expect = 0.88
Identities = 9/32 (28%), Positives = 22/32 (68%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK 51
++E++E E++++E +D +E + ++ KK K
Sbjct: 115 DDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
Score = 29.6 bits (67), Expect = 1.3
Identities = 10/37 (27%), Positives = 27/37 (72%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
+E+E +++E++E+E++D E++ + ++++ KK K
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
Score = 28.1 bits (63), Expect = 4.2
Identities = 11/37 (29%), Positives = 28/37 (75%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
EE++ ++++E++E+ED EE+ ++ + ++++ KK K
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 33.7 bits (77), Expect = 0.087
Identities = 17/81 (20%), Positives = 42/81 (51%), Gaps = 1/81 (1%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
+ +E++EE E ++ + EEE+ + ++ +K+K+ + +++ E+Q +EI + +
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQE-E 91
Query: 77 ERKEKTLTEDSQCDTMIPVFK 97
E++E+ Q K
Sbjct: 92 EKEEEAEDVKQQEVFSFKRKK 112
Score = 29.8 bits (67), Expect = 1.7
Identities = 16/56 (28%), Positives = 32/56 (57%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
+K EK+K + +E+ E+ E+EE +E+ +EE+ + K+++ K+KK
Sbjct: 57 QKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKK 112
Score = 28.3 bits (63), Expect = 4.8
Identities = 14/77 (18%), Positives = 43/77 (55%), Gaps = 1/77 (1%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKK-KKKKKKKKKKKK 60
+ + + E K E+ K E +E +++E + + E+ +++K++ ++++K+++ +
Sbjct: 40 EEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAED 99
Query: 61 KKKEEQSVKEIEESLKE 77
K++E + ++ KE
Sbjct: 100 VKQQEVFSFKRKKPFKE 116
Score = 27.9 bits (62), Expect = 6.0
Identities = 18/68 (26%), Positives = 41/68 (60%), Gaps = 2/68 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+K E+ KE+E E E+E+ ++ +EE ++++K+++ + K+++ K ++ K
Sbjct: 56 EQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKPFK 115
Query: 70 E--IEESL 75
E +EE +
Sbjct: 116 EMNLEEKI 123
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 34.0 bits (78), Expect = 0.091
Identities = 12/31 (38%), Positives = 24/31 (77%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEE 40
+K K +E + +EE E++E+EE+E+E+ +E+
Sbjct: 158 KKLKELEAEDVDEEDEKDEEEEEEEEEEDED 188
Score = 30.1 bits (68), Expect = 1.2
Identities = 13/32 (40%), Positives = 23/32 (71%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEK 41
EK +EK+ KE E E+ ++E+++ E+ EEE+
Sbjct: 151 EKLSMLEKKLKELEAEDVDEEDEKDEEEEEEE 182
Score = 30.1 bits (68), Expect = 1.3
Identities = 10/40 (25%), Positives = 24/40 (60%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKN 42
K + E +++++EEE+EE E++E +D +++ +
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198
Score = 28.2 bits (63), Expect = 5.6
Identities = 14/46 (30%), Positives = 23/46 (50%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
NKK KK K K+K +EE+ + E L+++ ++ ED
Sbjct: 122 INKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAED 167
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 34.6 bits (80), Expect = 0.094
Identities = 17/67 (25%), Positives = 41/67 (61%), Gaps = 1/67 (1%)
Query: 16 EKRNKEEE-KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
E R + EE + E E E+E E+VEE + + + + + ++ +++++ E+ + E E+
Sbjct: 472 EDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRET 531
Query: 75 LKERKEK 81
++E++E+
Sbjct: 532 IEEKRER 538
Score = 32.3 bits (74), Expect = 0.48
Identities = 19/67 (28%), Positives = 42/67 (62%), Gaps = 1/67 (1%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
EE++E E+ E E ED+EEE + +++ ++ + + + + ++ E+ +++EE + ER+E
Sbjct: 471 EEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRE 530
Query: 81 KTLTEDS 87
T+ E
Sbjct: 531 -TIEEKR 536
Score = 31.5 bits (72), Expect = 0.76
Identities = 18/77 (23%), Positives = 37/77 (48%), Gaps = 1/77 (1%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
RR EK++R E+ + + E E EEK E E + + ++ +++ + K + K+
Sbjct: 528 RRETIEEKRERAEELRERAAELEAEAEEKR-EAAAEAEEEAEEAREEVAELNSKLAELKE 586
Query: 64 EEQSVKEIEESLKERKE 80
+S++ I L +
Sbjct: 587 RIESLERIRTLLAAIAD 603
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 33.8 bits (78), Expect = 0.098
Identities = 13/76 (17%), Positives = 38/76 (50%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+K + +++ + +E + ++E+ + + + K +K+K+ ++ ++E Q +
Sbjct: 182 KKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELT 241
Query: 70 EIEESLKERKEKTLTE 85
E+L +K + L E
Sbjct: 242 IAIEALTNKKSELLEE 257
>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
N-terminus. This is an N-terminal domain of DNA
polymerase III polC subunit A that is found only in
Firmicutes. DNA polymerase polC-type III enzyme
functions as the 'replicase' in low G + C Gram-positive
bacteria. Purine asymmetry is a characteristic of
organisms with a heterodimeric DNA polymerase III
alpha-subunit constituted by polC which probably plays a
direct role in the maintenance of strand-biased gene
distribution; since, among prokaryotic genomes, the
distribution of genes on the leading and lagging strands
of the replication fork is known to be biased. The
domain is associated with DNA_pol3_alpha pfam07733.
Length = 180
Score = 33.4 bits (77), Expect = 0.098
Identities = 15/48 (31%), Positives = 29/48 (60%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
+++ EE+ E +E+KE+E+ + + + KKK+ +KKKK+K
Sbjct: 132 DVEVDEDESSEEEIEEFEEQKEEEEAKLAEEALEALKKKEAEKKKKEK 179
Score = 31.9 bits (73), Expect = 0.27
Identities = 19/45 (42%), Positives = 28/45 (62%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
E+E E E EE E++ EEE ++ + KKK+ +KKKK+KE
Sbjct: 136 DEDESSEEEIEEFEEQKEEEEAKLAEEALEALKKKEAEKKKKEKE 180
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 34.3 bits (79), Expect = 0.098
Identities = 13/63 (20%), Positives = 27/63 (42%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EK N + + EQ+ + + E+ + + K+K +K K K+K ++ +
Sbjct: 222 EKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLGIS 281
Query: 70 EIE 72
I
Sbjct: 282 IIL 284
Score = 31.6 bits (72), Expect = 0.68
Identities = 18/83 (21%), Positives = 30/83 (36%), Gaps = 5/83 (6%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K K I +E+ N K + EE+K + + + K+K E+
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKI--- 268
Query: 71 IEESLKERKEKTLTEDSQCDTMI 93
+ LKE+ +K D M
Sbjct: 269 --KDLKEKAKKLGISIILFDDMT 289
Score = 28.1 bits (63), Expect = 9.5
Identities = 16/75 (21%), Positives = 25/75 (33%), Gaps = 12/75 (16%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK--------- 53
G K + E++ + E + E +E+ K K K+K KK
Sbjct: 227 SNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLGISIILFD 286
Query: 54 ---KKKKKKKKKKEE 65
K K K + E
Sbjct: 287 DMTKNKTTNYKIQNE 301
>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
This family consists of the N-terminal region of the
prokaryotic fibronectin-binding protein. Fibronectin
binding is considered to be an important virulence
factor in streptococcal infections. Fibronectin is a
dimeric glycoprotein that is present in a soluble form
in plasma and extracellular fluids; it is also present
in a fibrillar form on cell surfaces. Both the soluble
and cellular forms of fibronectin may be incorporated
into the extracellular tissue matrix. While fibronectin
has critical roles in eukaryotic cellular processes,
such as adhesion, migration and differentiation, it is
also a substrate for the attachment of bacteria. The
binding of pathogenic Streptococcus pyogenes and
Staphylococcus aureus to epithelial cells via
fibronectin facilitates their internalisation and
systemic spread within the host.
Length = 447
Score = 34.1 bits (79), Expect = 0.100
Identities = 20/49 (40%), Positives = 31/49 (63%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
K+ IE E + EN + ++ E++ EE ++ K+KK+KKKKKKKK
Sbjct: 399 KEEIEYLESVEAQLENAESLEDLEEIREELIEQGYLKEKKRKKKKKKKK 447
Score = 31.8 bits (73), Expect = 0.52
Identities = 21/66 (31%), Positives = 37/66 (56%), Gaps = 4/66 (6%)
Query: 2 RKRRGGAGEKKKRIEKRNKE----EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
+K + +++IE+ +E E E + + ED+EE + + ++ K+KK+KK
Sbjct: 382 KKLKRAVEAVEEQIEETKEEIEYLESVEAQLENAESLEDLEEIREELIEQGYLKEKKRKK 441
Query: 58 KKKKKK 63
KKKKKK
Sbjct: 442 KKKKKK 447
Score = 28.0 bits (63), Expect = 9.1
Identities = 11/55 (20%), Positives = 32/55 (58%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
+K E ++ ++++ D+E+ + +K +KK +K +K+ ++ + ++ + E L
Sbjct: 282 SKKAERDRVKQKRSDLEKRVENELEKLEKKLEKLEKELEEAENAENYRLYGELLT 336
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 32.2 bits (74), Expect = 0.10
Identities = 7/21 (33%), Positives = 15/21 (71%)
Query: 20 KEEEKEENEQEEKEKEDVEEE 40
E+++ E+EE+E+++ EE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEE 95
Score = 32.2 bits (74), Expect = 0.11
Identities = 13/21 (61%), Positives = 17/21 (80%)
Query: 20 KEEEKEENEQEEKEKEDVEEE 40
EEK+E E+EE+EKE+ EEE
Sbjct: 76 AAEEKKEEEEEEEEKEESEEE 96
Score = 28.0 bits (63), Expect = 3.0
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 20 KEEEKEENEQEEKEKEDVE 38
KEEE+EE E+EE E+E
Sbjct: 81 KEEEEEEEEKEESEEEAAA 99
Score = 27.2 bits (61), Expect = 4.9
Identities = 8/20 (40%), Positives = 15/20 (75%)
Query: 20 KEEEKEENEQEEKEKEDVEE 39
K+EE+EE E++E+ +E+
Sbjct: 80 KKEEEEEEEEKEESEEEAAA 99
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 34.2 bits (79), Expect = 0.11
Identities = 13/39 (33%), Positives = 28/39 (71%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
+EE++E +++ +++D EEE+ +K+KKKKK + + +
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
Score = 32.3 bits (74), Expect = 0.45
Identities = 12/43 (27%), Positives = 30/43 (69%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
++ ++EE++E+ + ++++ E+ EEE+ +KKKKK + + +
Sbjct: 349 VDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
Score = 31.5 bits (72), Expect = 0.83
Identities = 15/40 (37%), Positives = 27/40 (67%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+E+E+ E E+ + ED EEE+ ++K+KKKKK + + +
Sbjct: 352 SDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
Score = 31.1 bits (71), Expect = 1.2
Identities = 14/48 (29%), Positives = 29/48 (60%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E+ + +E E EE E D E+++ +++++K+KKKKK + + +
Sbjct: 344 EEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
Score = 28.8 bits (65), Expect = 5.0
Identities = 14/44 (31%), Positives = 32/44 (72%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
E+ +E+E++E ED ++E ++++++++K+KKKKK + + E
Sbjct: 347 EDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
Score = 28.8 bits (65), Expect = 5.2
Identities = 13/50 (26%), Positives = 35/50 (70%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EEE+E+ + ++E+++ +E+ + + +++++++K+KKKKK + +S
Sbjct: 342 LEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends
to by Lys/Arg-rich, and is followed by a
membrane-spanning region. This is followed by an acidic
low-complexity region of variable length and a
well-conserved C-terminal domain of two tandem regions
matched by pfam05036 (Sporulation related repeat), found
in several cell division and sporulation proteins. The
role of FtsN as a suppressor for other cell division
mutations is poorly understood; it may involve cell wall
hydrolysis [Cellular processes, Cell division].
Length = 298
Score = 33.9 bits (77), Expect = 0.11
Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 1/71 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS-V 68
E+ +E R + EK+ + E + E K K K+ K+K+K K+ E QS
Sbjct: 142 EQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALPKQTAETQSNS 201
Query: 69 KEIEESLKERK 79
K IE + K K
Sbjct: 202 KPIETAPKADK 212
Score = 28.9 bits (64), Expect = 4.4
Identities = 13/74 (17%), Positives = 30/74 (40%), Gaps = 6/74 (8%)
Query: 16 EKRNKEEEKEENEQE------EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+R E+ + + + E + +K+ +KK +K + + +K E
Sbjct: 117 EQRQLLEQMQADMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKI 176
Query: 70 EIEESLKERKEKTL 83
+ ++K+K L
Sbjct: 177 ASKVKEAKQKQKAL 190
>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
Length = 530
Score = 34.3 bits (79), Expect = 0.11
Identities = 14/61 (22%), Positives = 26/61 (42%), Gaps = 2/61 (3%)
Query: 30 EEKEKEDVEEEKNKKKKKKK--KKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
+E + EK KK KK K +K +++ K S+++ ++ L E+
Sbjct: 1 SSQEAVEESGEKISKKAAKKAAAKAEKLRREATAKAAAASLEDEDDPLASNYGDVPLEEL 60
Query: 88 Q 88
Q
Sbjct: 61 Q 61
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 33.9 bits (77), Expect = 0.11
Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 4/62 (6%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK----KKEEQSVKEI 71
++ + E EE E E E ++ E+E NK K +K+K KK +K ++E+ ++I
Sbjct: 238 DQSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQERENEKI 297
Query: 72 EE 73
Sbjct: 298 PA 299
Score = 30.0 bits (67), Expect = 1.7
Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 5/81 (6%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E++KR + E++ E+EE+ ++E + +K+K KKK +K + E S
Sbjct: 190 ATEREKRQD-----EKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETS 244
Query: 68 VKEIEESLKERKEKTLTEDSQ 88
E EES + ++ ED++
Sbjct: 245 ESEEEESSESESDEDEDEDNK 265
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 34.4 bits (78), Expect = 0.12
Identities = 21/70 (30%), Positives = 44/70 (62%), Gaps = 1/70 (1%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
+ N EE EEN +E E E+VEE + ++ ++ ++ ++ ++ +E++V+EIEE+ +
Sbjct: 992 EENVEENVEENIEENVE-ENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAE 1050
Query: 77 ERKEKTLTED 86
E E+ + E+
Sbjct: 1051 ENVEENIEEN 1060
Score = 31.3 bits (70), Expect = 0.94
Identities = 19/68 (27%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
EE EEN +E E E++EE + ++ ++ ++ ++ ++ E++V+EIEE+++E E
Sbjct: 1046 EENAEENVEENIE-ENIEEYDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVE 1104
Query: 81 KTLTEDSQ 88
+ E+++
Sbjct: 1105 ENAEENAE 1112
Score = 30.9 bits (69), Expect = 1.2
Identities = 24/93 (25%), Positives = 52/93 (55%), Gaps = 5/93 (5%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEK---EDVEE--EKNKKKKKKKKKKKKK 55
++ + E K + N EE+ EEN +E+ E+ E+VEE E+N ++ ++ ++
Sbjct: 928 LKNQNENVPEHLKEHAEANIEEDAEENVEEDAEENVEENVEENVEENVEENVEENVEENV 987
Query: 56 KKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
++ ++ EE + IEE+++E E+ + E+ +
Sbjct: 988 EENVEENVEENVEENIEENVEENVEENIEENVE 1020
Score = 29.4 bits (65), Expect = 4.1
Identities = 19/72 (26%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
+ N EE EEN +E E E+VEE + ++ ++ ++ ++ ++ EE + + +EE+++
Sbjct: 1000 EENIEENVEENVEENIE-ENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIE 1058
Query: 77 ERKEKTLTEDSQ 88
E E+ E+ +
Sbjct: 1059 ENIEEYDEENVE 1070
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 31.8 bits (73), Expect = 0.12
Identities = 21/79 (26%), Positives = 41/79 (51%), Gaps = 6/79 (7%)
Query: 12 KKRIEKRNKEEE-----KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
K+++ KR + E +E+ + + ++EE + ++ + K+ K KKKK+ +
Sbjct: 15 KEKLRKRGVDVLDVDELLELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAEALIA 74
Query: 67 SVKEIEESLKERKEKTLTE 85
VKE+++ LK E L E
Sbjct: 75 EVKELKDELKA-LEAELRE 92
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 31.8 bits (73), Expect = 0.12
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 12/66 (18%)
Query: 36 DVEEEKNKKKKKKKKKKKKKKKKK---KKKKEEQSVKEIEESLKERKEKTLTEDSQCDTM 92
V E+ N + K++KKKKKKK + +KKE++ V E +RK D+ +
Sbjct: 27 KVPEKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSE------QRKADQKAVDAA---L 77
Query: 93 IPVFKK 98
+ KK
Sbjct: 78 LAAIKK 83
Score = 28.4 bits (64), Expect = 2.4
Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 9/50 (18%)
Query: 33 EKEDVEEEKNKKKKKKKKK-------KKKKKKKKKKKKEEQSVKEIEESL 75
EK + E K +KKKKKKK KK+KK+ +++K +Q K ++ +L
Sbjct: 30 EKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQ--KAVDAAL 77
>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
Length = 815
Score = 34.0 bits (78), Expect = 0.13
Identities = 23/77 (29%), Positives = 36/77 (46%), Gaps = 1/77 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + + EE K +++ E ++ E E++ KK KK+ KK+ K E +VK
Sbjct: 4 ETRSQAHAPAAEEGKGGTRKQKAENKEHEGEQSPKKAKKE-KKQDDSGNGNGKSAEDAVK 62
Query: 70 EIEESLKERKEKTLTED 86
E EE K +E E
Sbjct: 63 EFEEFCKAIEEHLSIEQ 79
>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2). Surfeit locus
protein 2 is part of a group of at least six sequence
unrelated genes (Surf-1 to Surf-6). The six Surfeit
genes have been classified as housekeeping genes, being
expressed in all tissue types tested and not containing
a TATA box in their promoter region. The exact function
of SURF2 is unknown.
Length = 244
Score = 33.4 bits (76), Expect = 0.13
Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 3/63 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK---KKKKKKKKKKKKKKEEQ 66
K E+ E++ E + E++ + + E + K+ KK+ KK KK KKK
Sbjct: 182 LKNPGKEQNGDEDDDFETDDEDEMEVESPELQQKRSKKQSGSLTKKFKKNHKKKGPFRFS 241
Query: 67 SVK 69
SVK
Sbjct: 242 SVK 244
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 31.9 bits (72), Expect = 0.13
Identities = 15/72 (20%), Positives = 38/72 (52%), Gaps = 1/72 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
EK+ E EEN +E + E + EEE ++ ++++ + ++++ +++++ E +
Sbjct: 25 EKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATG 84
Query: 69 KEIEESLKERKE 80
K E ++ E
Sbjct: 85 KRAAEDEEDDAE 96
Score = 31.1 bits (70), Expect = 0.24
Identities = 16/52 (30%), Positives = 32/52 (61%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
E+ ++++E+EE E EE+E E+ EE + K+ + ++ + KK+K +E
Sbjct: 55 EEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKTDEDD 106
Score = 28.4 bits (63), Expect = 2.1
Identities = 14/50 (28%), Positives = 32/50 (64%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
E+++ +++ ++EEE E E+E +E+E+ E K+ + ++ + KK+K
Sbjct: 52 EEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQK 101
Score = 28.0 bits (62), Expect = 3.5
Identities = 13/50 (26%), Positives = 28/50 (56%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
E+ E+++EE + E+E+ + EEE K+ + ++ + KK+K +
Sbjct: 54 EEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKTD 103
>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
factor dissociates the posttermination complex, composed
of the ribosome, deacylated tRNA, and mRNA, after
termination of translation. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis. RRF is believed to bind the ribosome at the
A-site in a manner that mimics tRNA, but the specific
mechanisms remain unclear. RRF is essential for
bacterial growth. It is not necessary for cell growth
in archaea or eukaryotes, but is found in mitochondria
or chloroplasts of some eukaryotic species.
Length = 179
Score = 33.0 bits (76), Expect = 0.14
Identities = 20/66 (30%), Positives = 37/66 (56%), Gaps = 7/66 (10%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE------QSVKEIEESLKE 77
K+ E+ + ++ + N K KK +K+K+ + + KK +E+ + +K+I+E LK
Sbjct: 114 KKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAEEDLQKLTDEYIKKIDELLKS 173
Query: 78 RKEKTL 83
KEK L
Sbjct: 174 -KEKEL 178
>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
trafficking and secretion].
Length = 340
Score = 33.8 bits (78), Expect = 0.14
Identities = 21/71 (29%), Positives = 35/71 (49%), Gaps = 15/71 (21%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK---------------KKKKKKKKKKKE 64
K EE+ E + EKEK + E+E+ +K+ ++ K KK K+K +
Sbjct: 6 KLEEELEELEAEKEKIEEEDEEEEKEGWFERLKQGLSKTKKNFGKGIKGLFLKKIKEKLD 65
Query: 65 EQSVKEIEESL 75
E ++E+EE L
Sbjct: 66 EDLLEELEELL 76
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 33.8 bits (78), Expect = 0.14
Identities = 14/55 (25%), Positives = 24/55 (43%), Gaps = 2/55 (3%)
Query: 10 EKKKRIEKRN--KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
E IE + K ++E+ E++ EEK K + K + K+KK +
Sbjct: 238 ETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292
Score = 33.4 bits (77), Expect = 0.21
Identities = 13/71 (18%), Positives = 29/71 (40%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
KK + +E E+E + + E E ++ ++ ++ ++K+ K E K
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 71 IEESLKERKEK 81
K+ +E
Sbjct: 284 EILKEKKDEEL 294
Score = 33.1 bits (76), Expect = 0.25
Identities = 14/69 (20%), Positives = 29/69 (42%), Gaps = 3/69 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK---KKKKKKKKKKKKKKKKKEEQ 66
++ E+R +E + E E + E+E + ++ ++K+ K + K E
Sbjct: 227 KETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286
Query: 67 SVKEIEESL 75
K+ EE
Sbjct: 287 KEKKDEELF 295
Score = 31.5 bits (72), Expect = 0.83
Identities = 13/41 (31%), Positives = 26/41 (63%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK 51
K K+I + E++KE + E++ +EEK +K K+++K+
Sbjct: 638 KIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
Score = 28.0 bits (63), Expect = 9.0
Identities = 13/46 (28%), Positives = 29/46 (63%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
I + K ++ N E+K++ + + +K KK++KK+K K+++K+
Sbjct: 633 ISELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 32.8 bits (75), Expect = 0.14
Identities = 17/63 (26%), Positives = 25/63 (39%), Gaps = 4/63 (6%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE----QSVKEIEESLK 76
E++K E + E+E+E + K + K K K K E + I LK
Sbjct: 97 EKKKAELKDLEEEREGILRSKQAEDGKLVKIYSKMKPDAAAAILENLPDEEAAAILMKLK 156
Query: 77 ERK 79
RK
Sbjct: 157 PRK 159
Score = 27.4 bits (61), Expect = 8.3
Identities = 14/59 (23%), Positives = 29/59 (49%), Gaps = 6/59 (10%)
Query: 24 KEENEQEEKEKE--DVEEEKNKKKKKKKKKKKKKKKKKK----KKKEEQSVKEIEESLK 76
+ E EK K+ +V E +KKK + K +++++ K+ E+ + +I +K
Sbjct: 74 ALQKELLEKLKDLAEVNERLKALEKKKAELKDLEEEREGILRSKQAEDGKLVKIYSKMK 132
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 31.6 bits (72), Expect = 0.15
Identities = 13/65 (20%), Positives = 27/65 (41%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
K+ + E +++ + +K + K K K+ K KK + + E K +K+K
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61
Query: 83 LTEDS 87
+
Sbjct: 62 DCKKG 66
Score = 29.7 bits (67), Expect = 0.75
Identities = 13/54 (24%), Positives = 21/54 (38%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E + + + + +K + + EK K KK KK KK K + K
Sbjct: 22 EDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNTCK 75
Score = 28.9 bits (65), Expect = 1.1
Identities = 12/47 (25%), Positives = 21/47 (44%), Gaps = 3/47 (6%)
Query: 19 NKEEEKEE---NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+K+E+ + E + K+ + + +K K KK KK KK
Sbjct: 19 DKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKK 65
Score = 28.9 bits (65), Expect = 1.3
Identities = 10/54 (18%), Positives = 18/54 (33%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
++ K + + K+ + + K KK KK KK K +
Sbjct: 20 KKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73
Score = 28.6 bits (64), Expect = 1.6
Identities = 12/43 (27%), Positives = 16/43 (37%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
E K + + E +K K KK KK KK K +
Sbjct: 31 EGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73
Score = 28.6 bits (64), Expect = 1.7
Identities = 13/51 (25%), Positives = 22/51 (43%), Gaps = 4/51 (7%)
Query: 19 NKEEEKE----ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
KE++ + E + ++ + + +K K KK KK KK K E
Sbjct: 20 KKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWE 70
Score = 28.6 bits (64), Expect = 1.7
Identities = 10/57 (17%), Positives = 22/57 (38%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
++E + + ++ K + + +K K KK KK ++ K + K
Sbjct: 19 DKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNTCK 75
Score = 28.6 bits (64), Expect = 1.8
Identities = 14/65 (21%), Positives = 27/65 (41%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+K + + E KEK ++K K K K+ K KK + + E+ ++ +K
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61
Query: 82 TLTED 86
+
Sbjct: 62 DCKKG 66
Score = 28.2 bits (63), Expect = 2.0
Identities = 10/58 (17%), Positives = 27/58 (46%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+ K +++ + +K+E++ + K KE ++ + + +K K KK ++
Sbjct: 4 KNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61
>gnl|CDD|219911 pfam08572, PRP3, pre-mRNA processing factor 3 (PRP3). Pre-mRNA
processing factor 3 (PRP3) is a U4/U6-associated
splicing factor. The human PRP3 has been implicated in
autosomal retinitis pigmentosa.
Length = 222
Score = 33.1 bits (76), Expect = 0.16
Identities = 24/100 (24%), Positives = 41/100 (41%), Gaps = 9/100 (9%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R+RRG +K + ++ EE + E + EE+E + EE K +K KK+
Sbjct: 17 RRRRGLQFIEKGKFSQQ-AEELRREAKLEEEEAR-ISEEAQNAGLKSATDLAEKIPKKEL 74
Query: 62 KKEEQSVKEIE----ESLKERKEKTLTEDSQCDTMIPVFK 97
+ V +IE L + L ++S D +
Sbjct: 75 PDD---VPDIEWWDMPILDDPDYNDLEDESDEDIDEEMSS 111
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation
of 60S ribosomal subunits.
Length = 132
Score = 31.9 bits (73), Expect = 0.16
Identities = 15/54 (27%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS-VKEIEESLKERKEKTL 83
+ K+ ++ +K K +K +KK K++ + +K++ E+ VK + E KE+ L
Sbjct: 25 SRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDLE-KERRL 77
Score = 26.9 bits (60), Expect = 8.3
Identities = 16/43 (37%), Positives = 23/43 (53%)
Query: 44 KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
KK K KKK K +K +KK K + ++ + K R + L ED
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPED 70
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 32.9 bits (75), Expect = 0.17
Identities = 17/79 (21%), Positives = 35/79 (44%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
++R G ++K+R+E K +E +QEEK + D ++ + K +
Sbjct: 159 HRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPP 218
Query: 62 KKEEQSVKEIEESLKERKE 80
K++ Q + E L+ +
Sbjct: 219 KEKRQKHHDPERRLEPQSH 237
Score = 32.1 bits (73), Expect = 0.29
Identities = 18/86 (20%), Positives = 36/86 (41%), Gaps = 10/86 (11%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK-KKK 60
RK+ K+ K++EK E +K KED + KK+++K++ + K+
Sbjct: 152 RKKSDDEHRKRS-----GKQKEKRRVEDSQKHKED----RRKKQEEKRRNDEDKRPGGGG 202
Query: 61 KKKEEQSVKEIEESLKERKEKTLTED 86
QS ++ + K + +
Sbjct: 203 GSSGGQSGLSTKDEPPKEKRQKHHDP 228
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 33.8 bits (77), Expect = 0.17
Identities = 17/64 (26%), Positives = 34/64 (53%), Gaps = 1/64 (1%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
KE ++NE++ K + ++ + +++K++K + KK EE+ KE E+ K +K
Sbjct: 124 KESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEERF-KEYLENFKRKK 182
Query: 80 EKTL 83
K
Sbjct: 183 FKRK 186
>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
Length = 112
Score = 31.9 bits (71), Expect = 0.17
Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 2/73 (2%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
K +I N++E K +E +KE ED + E+ + +KK +K + + +KK +K E +
Sbjct: 40 KYDKIFISNEDEMKTTHEHIKKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDE 99
Query: 69 KEIEESLKERKEK 81
KE E+S K+K
Sbjct: 100 KEFEDSDASDKKK 112
Score = 26.9 bits (58), Expect = 8.4
Identities = 19/55 (34%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEK-EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E K+ + K+ EK ENE E+K EK + E+EK +K + + +K+ + KK
Sbjct: 57 EHIKKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEKEFEDSDASDKK 111
>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
Length = 381
Score = 33.5 bits (77), Expect = 0.18
Identities = 12/37 (32%), Positives = 21/37 (56%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
EK K +K K ++ +KK KK + + K + +S+K
Sbjct: 344 EKATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIK 380
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 33.6 bits (77), Expect = 0.18
Identities = 11/62 (17%), Positives = 24/62 (38%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+R + ++ +K E +N + +EK E ++ K+ K+ + K
Sbjct: 881 IRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKH 940
Query: 61 KK 62
K
Sbjct: 941 KG 942
Score = 29.3 bits (66), Expect = 3.5
Identities = 7/44 (15%), Positives = 16/44 (36%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
+ E Q K E+ ++ ++ K+ K+ + K
Sbjct: 897 NHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKH 940
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 31.8 bits (73), Expect = 0.19
Identities = 21/63 (33%), Positives = 33/63 (52%), Gaps = 9/63 (14%)
Query: 10 EKKKRIEKRNKEEEKEENEQEE---------KEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
EK + E+ KEE K +E +E K +NK ++++K K+KKKKKKK
Sbjct: 28 EKYDKREEEAKEEAKARSEPDEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKK 87
Query: 61 KKK 63
+ +
Sbjct: 88 ELE 90
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 32.2 bits (74), Expect = 0.19
Identities = 17/69 (24%), Positives = 40/69 (57%), Gaps = 5/69 (7%)
Query: 10 EKKKRIEKRNKEEE-KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
EK++ E ++EE +EE E+ E+E E++++ + + + + +K+ ++++EE
Sbjct: 2 EKEEEEELEDEEEALEEELEELEEEIEELKD----RLLRLLAEFENYRKRTEREREEAKK 57
Query: 69 KEIEESLKE 77
IE+ K+
Sbjct: 58 YAIEKFAKD 66
Score = 32.2 bits (74), Expect = 0.22
Identities = 19/79 (24%), Positives = 42/79 (53%), Gaps = 10/79 (12%)
Query: 20 KEEEKEE--NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
+E+E+EE ++EE +E++EE + + ++ K + + + + +K + +E EE+ K
Sbjct: 1 EEKEEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTE--REREEAKKY 58
Query: 78 RKEKTLTEDSQCDTMIPVF 96
EK + ++PV
Sbjct: 59 AIEKFAKD------LLPVL 71
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 33.1 bits (76), Expect = 0.19
Identities = 20/67 (29%), Positives = 41/67 (61%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
IEK + E+E+ E EE+ + +EE K + ++ +K +++ ++ ++ EQ K IE+
Sbjct: 94 IEKLQMQIEREQEEWEEELERLIEEAKAEGYEEGYEKGREEGLEEVRELIEQLNKIIEKL 153
Query: 75 LKERKEK 81
+K+R+E
Sbjct: 154 IKKREEI 160
Score = 28.4 bits (64), Expect = 5.2
Identities = 15/72 (20%), Positives = 37/72 (51%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ K EE +E+ + + E+ +E+ KK + +K + + ++++E+ +
Sbjct: 51 EEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEE 110
Query: 70 EIEESLKERKEK 81
E+E ++E K +
Sbjct: 111 ELERLIEEAKAE 122
Score = 28.4 bits (64), Expect = 5.2
Identities = 16/78 (20%), Positives = 41/78 (52%), Gaps = 4/78 (5%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQ----EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
A E + E+ KEE ++ ++ EK + +E E+ + +++ ++ ++ K + ++
Sbjct: 68 AFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEG 127
Query: 64 EEQSVKEIEESLKERKEK 81
E+ +E E ++E E+
Sbjct: 128 YEKGREEGLEEVRELIEQ 145
>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210). This is
a family of proteins conserved in yeasts. The function
is not known. The Schizosaccharomyces pombe member is
SPBC18E5.07 and the Saccharomyces cerevisiae member is
AIM21.
Length = 671
Score = 33.4 bits (76), Expect = 0.19
Identities = 15/62 (24%), Positives = 29/62 (46%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
KK + +EK+E +++ KEK + + + + + ++K +K E SV
Sbjct: 602 KKVLESPKEPSKEKKEEDEDTKEKAPLSDARKGRARGPARRKPATVAATEKLPEIPSVSI 661
Query: 71 IE 72
IE
Sbjct: 662 IE 663
Score = 30.7 bits (69), Expect = 1.3
Identities = 12/58 (20%), Positives = 28/58 (48%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
K+ E KE ++++++E ED +E+ +K + + ++K + + EI
Sbjct: 602 KKVLESPKEPSKEKKEEDEDTKEKAPLSDARKGRARGPARRKPATVAATEKLPEIPSV 659
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 33.1 bits (76), Expect = 0.19
Identities = 15/51 (29%), Positives = 33/51 (64%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+++++ E+R E E EE E+ EE++ +KK+ K++K+++K+ +K
Sbjct: 159 QQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAEK 209
>gnl|CDD|146145 pfam03357, Snf7, Snf7. This family of proteins are involved in
protein sorting and transport from the endosome to the
vacuole/lysosome in eukaryotic cells.
Vacuoles/lysosomes play an important role in the
degradation of both lipids and cellular proteins. In
order to perform this degradative function,
vacuoles/lysosomes contain numerous hydrolases which
have been transported in the form of inactive
precursors via the biosynthetic pathway and are
proteolytically activated upon delivery to the
vacuole/lysosome. The delivery of transmembrane
proteins, such as activated cell surface receptors to
the lumen of the vacuole/lysosome, either for
degradation/downregulation, or in the case of
hydrolases, for proper localisation, requires the
formation of multivesicular bodies (MVBs). These late
endosomal structures are formed by invaginating and
budding of the limiting membrane into the lumen of the
compartment. During this process, a subset of the
endosomal membrane proteins is sorted into the forming
vesicles. Mature MVBs fuse with the vacuole/lysosome,
thereby releasing cargo containing vesicles into its
hydrolytic lumen for degradation. Endosomal proteins
that are not sorted into the intralumenal MVB vesicles
are either recycled back to the plasma membrane or
Golgi complex, or remain in the limiting membrane of
the MVB and are thereby transported to the limiting
membrane of the vacuole/lysosome as a consequence of
fusion. Therefore, the MVB sorting pathway plays a
critical role in the decision between recycling and
degradation of membrane proteins. A few archaeal
sequences are also present within this family.
Length = 169
Score = 32.2 bits (74), Expect = 0.19
Identities = 16/58 (27%), Positives = 27/58 (46%), Gaps = 1/58 (1%)
Query: 19 NKEEEKEENEQEEKE-KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
K + + +QE E K E + KK KK K K+KK+ E+ + +++ L
Sbjct: 7 RKAIRELDKKQESLEKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEKQLDQLDGQL 64
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 33.3 bits (76), Expect = 0.20
Identities = 17/74 (22%), Positives = 32/74 (43%), Gaps = 3/74 (4%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDV---EEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
++KK+I+K+ ++ +N + K + +K K K +K KKK K
Sbjct: 77 LKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTN 136
Query: 66 QSVKEIEESLKERK 79
+ K++ S E
Sbjct: 137 KKKKKVLSSKDELI 150
Score = 32.9 bits (75), Expect = 0.33
Identities = 18/73 (24%), Positives = 32/73 (43%), Gaps = 6/73 (8%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK------KKKKKKEEQSV 68
I K+ E+ ++N++ K + + +K K K+KKK KKK K +Q
Sbjct: 48 INKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFA 107
Query: 69 KEIEESLKERKEK 81
+ SL + +
Sbjct: 108 GPLAISLMRKPKP 120
Score = 32.1 bits (73), Expect = 0.46
Identities = 13/45 (28%), Positives = 18/45 (40%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
K K E+ K K K KKKKK K + + +S+
Sbjct: 116 RKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKSI 160
Score = 30.6 bits (69), Expect = 1.4
Identities = 20/96 (20%), Positives = 34/96 (35%), Gaps = 9/96 (9%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK-------KKKKKKKKKKKKE 64
K + + E ++ + EK KK KK K KK K K+KKK K+
Sbjct: 26 LKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKK 85
Query: 65 EQSVKEIEESLKERKEKT--LTEDSQCDTMIPVFKK 98
+ + + ++ + K + M K
Sbjct: 86 KLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPK 121
Score = 30.6 bits (69), Expect = 1.7
Identities = 17/94 (18%), Positives = 38/94 (40%), Gaps = 3/94 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ K ++ + + + E ++++ K KKKKK K + K + +
Sbjct: 98 DSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKP 157
Query: 70 E---IEESLKERKEKTLTEDSQCDTMIPVFKKGV 100
+ I L ++ TL + + + +F KG+
Sbjct: 158 KSISIHSPLTIQELSTLLCIPETEIIKSLFLKGI 191
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 33.2 bits (76), Expect = 0.20
Identities = 13/52 (25%), Positives = 23/52 (44%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
E+R+ E E+EE E E+E E+ + + + K K+ + S
Sbjct: 379 EQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDS 430
Score = 32.4 bits (74), Expect = 0.36
Identities = 11/62 (17%), Positives = 29/62 (46%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
EE +E+E EE+E+ E E+ + + +++ + ++ + + ++ KE
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESAS 426
Query: 81 KT 82
+
Sbjct: 427 DS 428
Score = 30.5 bits (69), Expect = 1.7
Identities = 12/64 (18%), Positives = 27/64 (42%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ E +E+ +E+E+EE E + E ++++ + + K +E +
Sbjct: 368 EEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASD 427
Query: 70 EIEE 73
E
Sbjct: 428 SDSE 431
Score = 29.7 bits (67), Expect = 2.7
Identities = 9/60 (15%), Positives = 30/60 (50%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
I+ +E+++E E++ ++ + EE ++ +++ + ++ + + K +ES
Sbjct: 365 IDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKES 424
>gnl|CDD|227468 COG5139, COG5139, Uncharacterized conserved protein [Function
unknown].
Length = 397
Score = 33.1 bits (75), Expect = 0.20
Identities = 13/69 (18%), Positives = 32/69 (46%), Gaps = 3/69 (4%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+ K+ + E+ E+ ++E N + ++K+K + ++ ++SL+ +E
Sbjct: 53 DTKDPDNGEKNEEAAIDENSNVEAAERKRKHISTDFSDMSLLRK---RKNDQSLQPTREP 109
Query: 82 TLTEDSQCD 90
+ DS D
Sbjct: 110 MDSRDSGQD 118
Score = 32.4 bits (73), Expect = 0.37
Identities = 15/88 (17%), Positives = 36/88 (40%), Gaps = 3/88 (3%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK---KKKKKKKKKKKKKKK 58
+KR ++ + +E + + ++ E + + ++ K + +
Sbjct: 85 STDFSDMSLLRKRKNDQSLQPTREPMDSRDSGQDFTEAQSGELGDTGDRQLKAPAASRAR 144
Query: 59 KKKKKEEQSVKEIEESLKERKEKTLTED 86
+K+ EQ+V EI LK+R + +D
Sbjct: 145 RKEDLLEQTVDEISLRLKKRMQDAAKKD 172
>gnl|CDD|219748 pfam08214, KAT11, Histone acetylation protein. Histone acetylation
is required in many cellular processes including
transcription, DNA repair, and chromatin assembly. This
family contains the fungal KAT11 protein (previously
known as RTT109) which is required for H3K56
acetylation. Loss of KAT11 results in the loss of H3K56
acetylation, both on bulk histone and on chromatin.
KAT11 and H3K56 acetylation appear to correlate with
actively transcribed genes and associate with the
elongating form of Pol II in yeast. This family also
incorporates the p300/CBP histone acetyltransferase
domain which has different catalytic properties and
cofactor regulation to KAT11.
Length = 341
Score = 33.2 bits (76), Expect = 0.21
Identities = 15/68 (22%), Positives = 22/68 (32%), Gaps = 2/68 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEK--NKKKKKKKKKKKKKKKKKKKKKEEQS 67
E + + ++ E+ QE V K KK+ K+ KK EE
Sbjct: 238 EGRWKSVSLDQFWEELGFRQEFSLGRLVGFIGLVINPPKLVVSKKQYKRAKKAITGEEYD 297
Query: 68 VKEIEESL 75
SL
Sbjct: 298 TLSGAASL 305
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 31.2 bits (71), Expect = 0.22
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 20 KEEEKEENEQEEKEKEDVEEE 40
EE+EE E+EE+E+E+ EE
Sbjct: 74 AAEEEEEEEEEEEEEEEESEE 94
Score = 31.2 bits (71), Expect = 0.23
Identities = 13/20 (65%), Positives = 18/20 (90%)
Query: 21 EEEKEENEQEEKEKEDVEEE 40
EEE+EE E+EE+E+E+ EEE
Sbjct: 76 EEEEEEEEEEEEEEEESEEE 95
Score = 30.0 bits (68), Expect = 0.61
Identities = 12/20 (60%), Positives = 17/20 (85%)
Query: 20 KEEEKEENEQEEKEKEDVEE 39
+EEE+EE E+EE+E+E EE
Sbjct: 76 EEEEEEEEEEEEEEEESEEE 95
Score = 29.6 bits (67), Expect = 0.72
Identities = 12/33 (36%), Positives = 18/33 (54%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVE 38
A E+ +EEE+EE E+EE E+E +
Sbjct: 66 AAAAAAAAAAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 28.9 bits (65), Expect = 1.7
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 20 KEEEKEENEQEEKEKEDVEEE 40
E+EE E+EE+E+E+ E E
Sbjct: 73 AAAEEEEEEEEEEEEEEEESE 93
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 31.9 bits (73), Expect = 0.22
Identities = 24/83 (28%), Positives = 48/83 (57%), Gaps = 6/83 (7%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
KR ++R +++ +EE +++E+E E +EE K ++++K++ +K+ K++K+ K E E
Sbjct: 29 KRKQQR-RKKAQEEAKEKERE-ERIEERKRIREERKQELEKQLKERKEALKLL----EEE 82
Query: 73 ESLKERKEKTLTEDSQCDTMIPV 95
+E E TED + D
Sbjct: 83 NDDEEDAETEDTEDVEDDEWEGF 105
Score = 30.0 bits (68), Expect = 0.99
Identities = 20/56 (35%), Positives = 28/56 (50%), Gaps = 18/56 (32%)
Query: 43 KKKKKKKKKK------------------KKKKKKKKKKKEEQSVKEIEESLKERKE 80
KKKK KKK K+K++++KK +EE KE EE ++ERK
Sbjct: 1 KKKKAKKKGVEEVVFDKEARKEYLTGFHKRKQQRRKKAQEEAKEKEREERIEERKR 56
Score = 27.3 bits (61), Expect = 7.0
Identities = 19/76 (25%), Positives = 36/76 (47%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+K+ E+R +E EK+ E++E K EE +++ + + + + + + E
Sbjct: 53 ERKRIREERKQELEKQLKERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTD 112
Query: 70 EIEESLKERKEKTLTE 85
EE + E K KT
Sbjct: 113 YEEEYIDEDKYKTTVT 128
>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
Length = 621
Score = 33.2 bits (76), Expect = 0.22
Identities = 24/80 (30%), Positives = 45/80 (56%), Gaps = 9/80 (11%)
Query: 22 EEKEENEQEEKEKEDVEEEKNK------KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
+E E+N E+KEK + + KN+ + +K+ K+ K K ++KK+K E +K++ ++L
Sbjct: 512 KEAEKNAAEDKEKREKIDLKNQAESLCYQAEKQLKELKDKISEEKKEKIENLIKKLRQAL 571
Query: 76 KE---RKEKTLTEDSQCDTM 92
+ K+L E+ Q M
Sbjct: 572 QNDNYESIKSLLEELQKALM 591
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 32.8 bits (75), Expect = 0.24
Identities = 17/62 (27%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
++E+K + E +EE ++K + +K+ + K + KK +KE + E++E++ ER+
Sbjct: 44 QKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKE---IAELKENIVERQ 100
Query: 80 EK 81
E
Sbjct: 101 EL 102
Score = 28.9 bits (65), Expect = 3.5
Identities = 13/72 (18%), Positives = 27/72 (37%), Gaps = 1/72 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EK+ +E + + +NE E + + +K +K K+ +K + K
Sbjct: 162 EKQAALEDKLETLVALQNELETQLN-SLNSQKAEKNALIAALAAKEASALGEKAALEEQK 220
Query: 70 EIEESLKERKEK 81
+ E+ K
Sbjct: 221 ALAEAAAAEAAK 232
Score = 28.1 bits (63), Expect = 7.0
Identities = 14/72 (19%), Positives = 28/72 (38%), Gaps = 1/72 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E KK +E++ + +++ E + ++E + N +K +K K+ K
Sbjct: 155 EDKKSLEEK-QAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALGEK 213
Query: 70 EIEESLKERKEK 81
E K E
Sbjct: 214 AALEEQKALAEA 225
Score = 27.8 bits (62), Expect = 8.7
Identities = 13/75 (17%), Positives = 33/75 (44%), Gaps = 6/75 (8%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK------KKEE 65
KK +E++ ++++ E +Q E + + + + + +K +K +E
Sbjct: 147 KKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKE 206
Query: 66 QSVKEIEESLKERKE 80
S + +L+E+K
Sbjct: 207 ASALGEKAALEEQKA 221
>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 486
Score = 32.9 bits (75), Expect = 0.24
Identities = 22/88 (25%), Positives = 43/88 (48%), Gaps = 5/88 (5%)
Query: 8 AGEKKKRIEKRNKEEEKEEN-----EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
G+ I K + +E+E E+ E++ D+E + KK + ++ KK+K+K+K
Sbjct: 399 EGKIISAILKNAEIKEEEGKITIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEK 458
Query: 63 KEEQSVKEIEESLKERKEKTLTEDSQCD 90
+ + V+++ E K K T + D
Sbjct: 459 ERPEFVEKVLELFKGSKIITYKKKKGKD 486
Score = 28.3 bits (63), Expect = 6.8
Identities = 21/113 (18%), Positives = 46/113 (40%), Gaps = 17/113 (15%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEK-----------------EKEDVEEEKNKKKKKKKKK 51
G K++ E+ K+EE++E +K + +++EE+ K K +K
Sbjct: 367 GSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVEKS 426
Query: 52 KKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKG 104
++ + K ++ E +K+ KEK + ++ +FK +
Sbjct: 427 EEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVEKVLELFKGSKIITY 479
>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
motility and secretion].
Length = 214
Score = 32.5 bits (74), Expect = 0.25
Identities = 17/71 (23%), Positives = 38/71 (53%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
KK +E++ ++ EK E+ KE +EE + KK + ++ ++ ++ KE +++
Sbjct: 1 KKPYLEEKIEKAEKVTAFALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEK 60
Query: 71 IEESLKERKEK 81
++ +E EK
Sbjct: 61 VDSIEEEISEK 71
Score = 30.2 bits (68), Expect = 1.1
Identities = 22/84 (26%), Positives = 39/84 (46%), Gaps = 6/84 (7%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK------KKKKKKKKKKKK 62
E K++IE+ + +K ++E E+ E E+ K +K +K+K +K K
Sbjct: 21 EELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIEEEISEKEKVMSEKLK 80
Query: 63 KEEQSVKEIEESLKERKEKTLTED 86
+ Q EE K+ K + + ED
Sbjct: 81 EPAQMSSTSEEEEKKAKLEEIPED 104
Score = 27.9 bits (62), Expect = 7.0
Identities = 16/72 (22%), Positives = 35/72 (48%), Gaps = 1/72 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+++IEK K E +EK +E + K + ++ ++ ++ K+ E+ V
Sbjct: 4 YLEEKIEKAEKVTAFALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEK-VD 62
Query: 70 EIEESLKERKEK 81
IEE + E+++
Sbjct: 63 SIEEEISEKEKV 74
>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
production and conversion].
Length = 194
Score = 32.3 bits (74), Expect = 0.25
Identities = 11/79 (13%), Positives = 41/79 (51%), Gaps = 8/79 (10%)
Query: 8 AGEKKKRI----EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
A E+ + I + ++ ++E + E+ E++ + K+ ++++++ + ++K
Sbjct: 15 AEEEAEEILEEAREEAEKIKEEAKREAEEAIEEILRKAEKEAERERQRIISSALLEARRK 74
Query: 64 E----EQSVKEIEESLKER 78
E+ ++ + E+++E+
Sbjct: 75 LLEAKEEILESVFEAVEEK 93
Score = 32.0 bits (73), Expect = 0.28
Identities = 21/75 (28%), Positives = 43/75 (57%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
E +K I+K +E E+E E E+ +E+ E+ K + K++ ++ ++ +K +K+ E +
Sbjct: 2 MELEKLIKKILREAEEEAEEILEEAREEAEKIKEEAKREAEEAIEEILRKAEKEAERERQ 61
Query: 69 KEIEESLKERKEKTL 83
+ I +L E + K L
Sbjct: 62 RIISSALLEARRKLL 76
Score = 28.9 bits (65), Expect = 3.2
Identities = 9/51 (17%), Positives = 32/51 (62%)
Query: 35 EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
E + ++ ++ +++ ++ ++ +++ +K +E++ +E EE+++E K E
Sbjct: 5 EKLIKKILREAEEEAEEILEEAREEAEKIKEEAKREAEEAIEEILRKAEKE 55
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 32.9 bits (75), Expect = 0.25
Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 2/79 (2%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK--KKKKKKKKKKKKKKKKEEQSV 68
+KK I K+ N EE K V + K K KKK + K++ KKEE
Sbjct: 380 EKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNNKEQDIKKEELLE 439
Query: 69 KEIEESLKERKEKTLTEDS 87
KE + ++ + D+
Sbjct: 440 KEFVKKSEKIPKNDELLDN 458
Score = 32.5 bits (74), Expect = 0.39
Identities = 21/79 (26%), Positives = 31/79 (39%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E KK E++ N Q K+ E++ K K K KKK Q+ K
Sbjct: 369 ENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNNK 428
Query: 70 EIEESLKERKEKTLTEDSQ 88
E + +E EK + S+
Sbjct: 429 EQDIKKEELLEKEFVKKSE 447
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 31.7 bits (73), Expect = 0.26
Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 9/55 (16%)
Query: 44 KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKK 98
K+KKKKKKKKK+ + ++KK++ V E ERKE D+ ++ KK
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSE------ERKEDQKAVDAA---LLAAIKK 127
Score = 29.1 bits (66), Expect = 2.1
Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 11/62 (17%)
Query: 33 EKEDVEEEKNKKKKKKKKK-------KKKKKKKKKKKKEEQSVKEIEESLKE--RKEKTL 83
K + K KKKKKKKKK KKKK +++KE+Q K ++ +L +K L
Sbjct: 74 PKLNDAYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQ--KAVDAALLAAIKKVPLL 131
Query: 84 TE 85
Sbjct: 132 KG 133
Score = 28.7 bits (65), Expect = 3.2
Identities = 10/43 (23%), Positives = 26/43 (60%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
++K+K+ +E + ++KKKK +++K+ +K + + I+
Sbjct: 84 KKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAALLAAIK 126
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 33.0 bits (75), Expect = 0.26
Identities = 11/43 (25%), Positives = 24/43 (55%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
E ++ +E E +E++K + EE +K+ ++ ++KK K
Sbjct: 145 ESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDK 187
Score = 31.5 bits (71), Expect = 0.68
Identities = 15/81 (18%), Positives = 40/81 (49%), Gaps = 6/81 (7%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE- 65
G E + + + K+++K+ + E E ++E +K+ KK+ K +++ ++ ++E
Sbjct: 250 GNIEYEDFFDPKEKDKKKDAGDDAELE----DDEPDKEAVKKEADSKPEEEDEEDDEQED 305
Query: 66 -QSVKEIEESLKERKEKTLTE 85
Q +E E+ ++ +
Sbjct: 306 DQDEEEPPEAAMDKVKLDEPV 326
Score = 31.1 bits (70), Expect = 1.0
Identities = 14/64 (21%), Positives = 33/64 (51%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
EE ++ + EK K +++ ++++KKK+K++ KK K +++ E+ K+
Sbjct: 538 EEIYKKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKK 597
Query: 81 KTLT 84
+
Sbjct: 598 SSSA 601
Score = 30.3 bits (68), Expect = 1.8
Identities = 19/72 (26%), Positives = 37/72 (51%), Gaps = 3/72 (4%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
KK + + +EE++E++EQE+ + D EE K K + + + +E S E
Sbjct: 285 VKKEADSKPEEEDEEDDEQEDDQ--DEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFE 342
Query: 71 IEES-LKERKEK 81
++ LK++ E+
Sbjct: 343 KRQAKLKQQIEQ 354
Score = 30.0 bits (67), Expect = 2.4
Identities = 18/80 (22%), Positives = 42/80 (52%), Gaps = 4/80 (5%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
E E + E+ ++E E+ E++ +EEE+ ++K+ ++ ++KK K ++
Sbjct: 133 DDEDEADLFNESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDD 192
Query: 66 Q--SVKEIEESLK--ERKEK 81
+ + E+ E L+ E +E+
Sbjct: 193 KFFKLDEMNEFLEATEAEEE 212
Score = 29.6 bits (66), Expect = 2.7
Identities = 16/79 (20%), Positives = 39/79 (49%), Gaps = 3/79 (3%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEK---EDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
A K + ++ + E+E +++E+E E + +E + + K+ +K++ K
Sbjct: 289 ADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKL 348
Query: 65 EQSVKEIEESLKERKEKTL 83
+Q ++++E+ K TL
Sbjct: 349 KQQIEQLEKENLAPKSWTL 367
Score = 29.2 bits (65), Expect = 3.6
Identities = 17/62 (27%), Positives = 32/62 (51%), Gaps = 6/62 (9%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKK------KKKKKKKKKKKKKKEEQSVKEIEESL 75
E++E++E+ E ED + K K KKK + + K+ KK+ + + +E EE
Sbjct: 242 EDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDD 301
Query: 76 KE 77
++
Sbjct: 302 EQ 303
Score = 29.2 bits (65), Expect = 3.6
Identities = 14/76 (18%), Positives = 35/76 (46%), Gaps = 1/76 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + ++ K+E + E+E++E +D +E+ +++ + K K + +
Sbjct: 276 EDDEPDKEAVKKEADSKPEEEDEE-DDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLES 334
Query: 70 EIEESLKERKEKTLTE 85
E S E+++ L +
Sbjct: 335 PKELSSFEKRQAKLKQ 350
Score = 29.2 bits (65), Expect = 4.4
Identities = 15/66 (22%), Positives = 33/66 (50%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
E E + + E ED EE+K ++++ ++K+ ++ ++KK ++ V + L E E
Sbjct: 143 ESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNE 202
Query: 81 KTLTED 86
+
Sbjct: 203 FLEATE 208
Score = 28.4 bits (63), Expect = 7.2
Identities = 19/69 (27%), Positives = 30/69 (43%), Gaps = 7/69 (10%)
Query: 38 EEEKNKKKKKKKKK------KKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQC-D 90
EE K K +K K K ++++KKK+K + K + ++ KERK D
Sbjct: 538 EEIYKKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKK 597
Query: 91 TMIPVFKKG 99
+ F K
Sbjct: 598 SSSAFFAKL 606
Score = 28.0 bits (62), Expect = 8.8
Identities = 19/73 (26%), Positives = 34/73 (46%), Gaps = 1/73 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+K K IEK E ++ + +E ++K+ K K KK++K K+K ++ S
Sbjct: 542 KKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSA 601
Query: 70 EIEESLKERKEKT 82
+ L+E K
Sbjct: 602 FFAK-LQETKAVE 613
>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
Length = 673
Score = 32.9 bits (75), Expect = 0.27
Identities = 21/68 (30%), Positives = 36/68 (52%), Gaps = 3/68 (4%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE-IEESLKERKE 80
+E E+ +E+KEK D + KN+ + +K+ K+ K + VKE +E LKE K+
Sbjct: 549 QEAEKFAKEDKEKRDAVDTKNQADSVVYQTEKQLKELGDKVPAD--VKEKVEAKLKELKD 606
Query: 81 KTLTEDSQ 88
+ +Q
Sbjct: 607 AIASGSTQ 614
>gnl|CDD|218611 pfam05499, DMAP1, DNA methyltransferase 1-associated protein 1
(DMAP1). DNA methylation can contribute to
transcriptional silencing through several
transcriptionally repressive complexes, which include
methyl-CpG binding domain proteins (MBDs) and histone
deacetylases (HDACs). The chief enzyme that maintains
mammalian DNA methylation, DNMT1, can also establish a
repressive transcription complex. The non-catalytic
amino terminus of DNMT1 binds to HDAC2 and DMAP1 (for
DNMT1 associated protein), and can mediate
transcriptional repression. DMAP1 has intrinsic
transcription repressive activity, and binds to the
transcriptional co-repressor TSG101. DMAP1 is targeted
to replication foci through interaction with the far N
terminus of DNMT1 throughout S phase, whereas HDAC2
joins DNMT1 and DMAP1 only during late S phase,
providing a platform for how histones may become
deacetylated in heterochromatin following replication.
Length = 175
Score = 32.1 bits (73), Expect = 0.27
Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 7/65 (10%)
Query: 13 KRIEKRNKEEEKEE-------NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
++IE R KE EK+ +++ E K++KK KKK +K K+ +
Sbjct: 10 RKIEARKKEREKKAQDLQKLITAADQQAAGFSTAEARKREKKLPKKKIPQKPGPSKEDSK 69
Query: 66 QSVKE 70
+V E
Sbjct: 70 VNVPE 74
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 32.5 bits (74), Expect = 0.27
Identities = 14/73 (19%), Positives = 36/73 (49%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
E+ ++ + E + E + E ++ + KK++++ + ++ +KE+ + L
Sbjct: 44 ERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVEL 103
Query: 76 KERKEKTLTEDSQ 88
+E KE+ E+ Q
Sbjct: 104 QEEKEQLENEELQ 116
>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D. SpoVID, the
stage VI sporulation protein D, is restricted to
endospore-forming members of the bacteria, all of which
are found among the Firmicutes. It is widely distributed
but not quite universal in this group. Between
well-conserved N-terminal and C-terminal domains is a
poorly conserved, low-complexity region of variable
length, rich enough in glutamic acid to cause spurious
BLAST search results unless a filter is used. The seed
alignment for this model was trimmed, in effect, by
choosing member sequences in which these regions are
relatively short. SpoVID is involved in spore coat
assembly by the mother cell compartment late in the
process of sporulation [Cellular processes, Sporulation
and germination].
Length = 338
Score = 32.5 bits (74), Expect = 0.27
Identities = 18/68 (26%), Positives = 27/68 (39%), Gaps = 7/68 (10%)
Query: 19 NKEEEKEENEQEEKEKE---DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
E EE EE E E +VE+E+ ++ + +++ KK E EE
Sbjct: 213 EVEASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALAGDAKKAL----EEEEEK 268
Query: 76 KERKEKTL 83
ER E
Sbjct: 269 GERPENAT 276
Score = 32.5 bits (74), Expect = 0.33
Identities = 12/58 (20%), Positives = 29/58 (50%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
A +++ E+ E E E ++E+ E E+ + + KK +++++K ++ E
Sbjct: 216 ASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALAGDAKKALEEEEEKGERPE 273
Score = 29.1 bits (65), Expect = 3.7
Identities = 18/61 (29%), Positives = 30/61 (49%), Gaps = 1/61 (1%)
Query: 14 RIEKRNKEEEKEENEQE-EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
+E +EE EE E E E E ED E+ +++ + +++ KK +EE+ E
Sbjct: 213 EVEASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALAGDAKKALEEEEEKGERP 272
Query: 73 E 73
E
Sbjct: 273 E 273
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 32.2 bits (74), Expect = 0.27
Identities = 16/66 (24%), Positives = 36/66 (54%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++ I+K +E+E+ E E E + EK ++++++ ++K+ + KK+ Q +K
Sbjct: 124 ELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQQLK 183
Query: 70 EIEESL 75
E +
Sbjct: 184 SQLEQI 189
Score = 30.3 bits (69), Expect = 0.95
Identities = 14/53 (26%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 29 QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
Q E+ K ++E+E KK +++K++ +K+ + + K E K EE + +++
Sbjct: 117 QAEQGKSELEQEI--KKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKR 167
>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 594
Score = 32.7 bits (75), Expect = 0.27
Identities = 13/57 (22%), Positives = 22/57 (38%), Gaps = 1/57 (1%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
+K+K + ++K K +K KK + KE E + +E DS
Sbjct: 453 DKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFI-KEFETDALKLQEILEKVDS 508
>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
Length = 177
Score = 31.8 bits (73), Expect = 0.28
Identities = 19/63 (30%), Positives = 27/63 (42%), Gaps = 17/63 (26%)
Query: 10 EKKKRIEK------RNKEEEKEENEQEEKEKED-----------VEEEKNKKKKKKKKKK 52
K+K+ K R+ E + + + +K+D EEK K KKKK K K
Sbjct: 31 RKRKKKHKGLKSGSRHNEGNTQSKGKGQAQKKDPRIGSKKPIPLGVEEKVKPKKKKPKSK 90
Query: 53 KKK 55
K K
Sbjct: 91 KPK 93
Score = 29.9 bits (68), Expect = 1.2
Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 1/39 (2%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
KKKK K+ K K KKK +EE + E +++K K
Sbjct: 1 KKKKSSPKRSKGMAKSKKKTREELDAEARERK-RKKKHK 38
Score = 27.2 bits (61), Expect = 9.4
Identities = 22/108 (20%), Positives = 42/108 (38%), Gaps = 37/108 (34%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK----------- 49
+K+ K ++K++ +EE + E +E K+KKK K
Sbjct: 1 KKKKSSPKRSKGMA---KSKKKTREELDAEARE--------RKRKKKHKGLKSGSRHNEG 49
Query: 50 --KKKKKKKKKKK------KKK-----EEQSVKEIEESLKERKEKTLT 84
+ K K + +KK KK EE+ + ++ + K+ L+
Sbjct: 50 NTQSKGKGQAQKKDPRIGSKKPIPLGVEEKVKPKKKK--PKSKKPKLS 95
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 32.3 bits (73), Expect = 0.29
Identities = 25/127 (19%), Positives = 49/127 (38%), Gaps = 36/127 (28%)
Query: 2 RKRRGGAGEK-------KKRIEKRNKE------EEKEENEQEEKEKEDVEEEKNKKKKKK 48
RKR+ E K+ +E +KE E ++++ E+K+ EEE+ + + +
Sbjct: 125 RKRQPSMSETMPLYTLCKEDLESMDKEVDDILGEGSDDSDSEKKKPPGNEEEQEQAPQPR 184
Query: 49 KKKKKKKKK-----------------------KKKKKKEEQSVKEIEESLKERKEKTLTE 85
K++ ++ K+K +E++ E E S + E
Sbjct: 185 KQQPPGPRREPDEGTPSSSERSNIGGRGPRGHKRKLNEEDEEDAESESSFESSNEDEEGS 244
Query: 86 DSQCDTM 92
S+ D M
Sbjct: 245 SSEADEM 251
>gnl|CDD|220624 pfam10187, Nefa_Nip30_N, N-terminal domain of NEFA-interacting
nuclear protein NIP30. This is a the N-terminal 100
amino acids of a family of proteins conserved from
plants to humans. The full-length protein has
putatively been called NEFA-interacting nuclear protein
NIP30, however no reference could be found to confirm
this.
Length = 99
Score = 30.8 bits (70), Expect = 0.29
Identities = 24/90 (26%), Positives = 41/90 (45%), Gaps = 28/90 (31%)
Query: 10 EKKKRIEKRNKEEE-KEENE------------QEEKEKEDVE-EEKNKKKKK-------- 47
+K+R E+ + K E E QE K+K+ E EEK K K +
Sbjct: 10 ARKRRQEEVRAPRDPKAEPEEEYDGRSLYERLQENKDKKQEEFEEKFKLKNQFRGLDEDE 69
Query: 48 ----KKKKKKKKKKKKKKKKEEQSVKEIEE 73
+ ++ ++ +KK+K+EE +E+EE
Sbjct: 70 VEFLDEVEESRRAAEKKRKREEA--EELEE 97
>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
Length = 158
Score = 31.6 bits (72), Expect = 0.30
Identities = 9/69 (13%), Positives = 27/69 (39%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
K + + ++ + +K E+ + ++ + K ++ + +E +
Sbjct: 90 KITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESSSDESPKEVKLATKN 149
Query: 73 ESLKERKEK 81
++ K KEK
Sbjct: 150 KTKKHDKEK 158
Score = 31.2 bits (71), Expect = 0.43
Identities = 12/62 (19%), Positives = 25/62 (40%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
K + ++ ++ EE+ E+V E ++ + K+ K K K ++
Sbjct: 96 EIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESSSDESPKEVKLATKNKTKKHD 155
Query: 69 KE 70
KE
Sbjct: 156 KE 157
Score = 28.9 bits (65), Expect = 2.2
Identities = 14/64 (21%), Positives = 26/64 (40%), Gaps = 2/64 (3%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI--EESLKER 78
EE + E E + +++ KK ++ ++ E + +E +ES KE
Sbjct: 84 EELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESSSDESPKEV 143
Query: 79 KEKT 82
K T
Sbjct: 144 KLAT 147
Score = 27.8 bits (62), Expect = 6.7
Identities = 12/47 (25%), Positives = 19/47 (40%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
E+ N+E E +E ++ +E K K KK K+K
Sbjct: 112 EEDTIPNHLNEEVSNETKLTQESSSDESPKEVKLATKNKTKKHDKEK 158
Score = 27.8 bits (62), Expect = 6.8
Identities = 13/51 (25%), Positives = 18/51 (35%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
KK +E+ E E + K+ K K K KK K+K
Sbjct: 108 KKSLEEDTIPNHLNEEVSNETKLTQESSSDESPKEVKLATKNKTKKHDKEK 158
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 32.9 bits (75), Expect = 0.30
Identities = 11/44 (25%), Positives = 27/44 (61%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
E ++ ++E +K+ ++ +N + K++K++K+KK KK+
Sbjct: 119 ELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPV 162
Score = 30.9 bits (70), Expect = 1.1
Identities = 12/44 (27%), Positives = 24/44 (54%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
R + EKE ++ K ++++ K +K++K+KK KK+
Sbjct: 120 LDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163
Score = 29.4 bits (66), Expect = 3.0
Identities = 12/45 (26%), Positives = 26/45 (57%), Gaps = 4/45 (8%)
Query: 29 QEEKEKEDVEEEKNKKKKKK----KKKKKKKKKKKKKKKEEQSVK 69
+ ++ + +E K K K+ + K++K++K+KK KK++ V
Sbjct: 119 ELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163
Score = 29.4 bits (66), Expect = 3.4
Identities = 11/45 (24%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 23 EKEENEQEEKEKEDVEEEKNKK-KKKKKKKKKKKKKKKKKKKEEQ 66
E + + EKE + ++++ + K++K++K+KK KK+
Sbjct: 119 ELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163
Score = 28.6 bits (64), Expect = 5.4
Identities = 13/52 (25%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 30 EEKEKEDVEEEKNKKKKKKK---KKKKKKKKKKKKKKEEQSVKEIEESLKER 78
E E+E KK K+ + K++K++K+KK ++Q + + + +R
Sbjct: 119 ELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVLLDLHLDR 170
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 31.9 bits (73), Expect = 0.30
Identities = 21/74 (28%), Positives = 37/74 (50%), Gaps = 15/74 (20%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EK +++E+ +E E EE ++E E+E VEEE + +++ K +
Sbjct: 3 EKNEQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAK---------------IA 47
Query: 70 EIEESLKERKEKTL 83
E+E L E +E+ L
Sbjct: 48 ELEAKLDELEERYL 61
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap
is involved in recombination. Rap (recombination adept
with plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 31.9 bits (73), Expect = 0.30
Identities = 11/35 (31%), Positives = 22/35 (62%)
Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
K++K ++KK+K + + E + +K +E LK R +
Sbjct: 33 ALKREKAQEKKRKAEAQAERRELKARKEKLKTRSD 67
>gnl|CDD|173966 cd08047, TAF7, TATA Binding Protein (TBP) Associated Factor 7
(TAF7) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 7 (TAF7) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryotes. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
preinitiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs are named after their
electrophoretic mobility in polyacrylamide gels in
different species. A new, unified nomenclature has been
suggested for the pol II TAFs to show the relationship
between TAF orthologs and paralogs. Several hypotheses
are proposed for TAFs functions such as serving as
activator-binding sites, core-promoter recognition or a
role in essential catalytic activity. Each TAF, with the
help of a specific activator, is required only for
expression of subset of genes and is not universally
involved for transcription as are GTFs. TAF7 is involved
in the regulation of the transition from PIC assembly to
initiation and elongation. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; the HFD is the interaction motif involved
in heterodimerization of the core histones and their
assembly into nucleosome octamers.
Length = 162
Score = 31.9 bits (73), Expect = 0.30
Identities = 17/61 (27%), Positives = 31/61 (50%), Gaps = 11/61 (18%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKK-----------KKKKKKKKKKKKKKEEQSVKEIEESLK 76
E ++ EKE E + +KK K KK K +K++ +K K+ ++E+E+ +K
Sbjct: 86 EPDDSEKEAKEAKLDKKDKPKKFEYPHGLTPPMKNVRKRRFRKTPSKKIAEIEEVEKEVK 145
Query: 77 E 77
Sbjct: 146 R 146
Score = 27.2 bits (61), Expect = 8.5
Identities = 13/74 (17%), Positives = 35/74 (47%), Gaps = 7/74 (9%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNK-------KKKKKKKKKKKKKKKKKKKKEEQS 67
E + E+E +E + ++K+K E + +K++ +K KK + ++ E++
Sbjct: 85 YEPDDSEKEAKEAKLDKKDKPKKFEYPHGLTPPMKNVRKRRFRKTPSKKIAEIEEVEKEV 144
Query: 68 VKEIEESLKERKEK 81
+ ++E + +
Sbjct: 145 KRLLKEDTEAVSVE 158
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 31.0 bits (70), Expect = 0.31
Identities = 19/51 (37%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE-QSVKEIEESLKERKEK 81
K + +EK +K K K KKK KKKKKEE + ++E L ++ K
Sbjct: 2 APKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIRK 52
Score = 30.3 bits (68), Expect = 0.49
Identities = 15/48 (31%), Positives = 23/48 (47%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
+ K KK +K K K KKK KKK+++ + +E +K E
Sbjct: 6 QLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIRKE 53
>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
PREFRAN. This family is the long protein of a novel
CRISPR subtype, PREFRAN, which is most common in
Prevotella and Francisella, although widely distributed.
The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
helicase Cas3 and endonuclease Cas3-HD.
Length = 1287
Score = 32.9 bits (75), Expect = 0.31
Identities = 19/53 (35%), Positives = 30/53 (56%), Gaps = 1/53 (1%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE-IEESLKERKEK 81
EE K+ +E K K KK+K ++K KK KK+ Q + + +E+ +KE K
Sbjct: 378 EEYIKKQIEPRKGKSKKEKYEEKVKKVVKKQDYLSIQFINDLLEKYIKEFDSK 430
>gnl|CDD|232925 TIGR00335, primase_sml, DNA primase, eukaryotic-type, small
subunit, putative. Archaeal members differ
substantially from eukaryotic members and should be
considered putative pending experimental evidence. The
protein is universal and single copy among completed
archaeal and eukarotic genomes to date. DNA primase
creates RNA primers needed for DNA replication.This
model is named putative because the assignment is
putative for archaeal proteins. Eukaryotic proteins
scoring above the trusted cutoff can be considered
authentic [DNA metabolism, DNA replication,
recombination, and repair].
Length = 297
Score = 32.2 bits (73), Expect = 0.32
Identities = 13/62 (20%), Positives = 25/62 (40%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFK 97
EE+ K K +K K K+ +S K+ +L+ +K + +D I + +
Sbjct: 201 LEEEILTSKLKILPNDLRKWKLIKEVIFKSEKKDYSALEIYIDKIVLDDKVTLDRIRLLR 260
Query: 98 KG 99
Sbjct: 261 HP 262
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 32.4 bits (74), Expect = 0.32
Identities = 18/63 (28%), Positives = 36/63 (57%), Gaps = 3/63 (4%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
RN E +K + EQEE+ +++ EEE KK + + K++ + + ++E++E L+
Sbjct: 106 RNYEADKLDEEQEERVEKEREEE--LAGDAMKKLENRTADSKREMEVLERLEELKE-LQS 162
Query: 78 RKE 80
R+
Sbjct: 163 RRA 165
Score = 30.1 bits (68), Expect = 1.9
Identities = 27/116 (23%), Positives = 38/116 (32%), Gaps = 37/116 (31%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKE-------------------KEDVEEEKNKKKKKKKK 50
E R EK+ +EEE+EE+E K ++D E+ N K
Sbjct: 174 EALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGS 233
Query: 51 KKKKK-----KKKKKKKKEEQSVKEIE-------------ESLKERKEKTLTEDSQ 88
K KK K+ E S + + SL RK+ SQ
Sbjct: 234 SSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPESTSQ 289
>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease. The potyviridae
family positive stand RNA viruses with genome encoding a
polyprotein. members include zucchini yellow mosaic
virus, and turnip mosaic viruses which cause
considerable losses of crops worldwide. This family
consists of a C terminus region from various plant
potyvirus P1 proteins (found at the N terminus of the
polyprotein). The C terminus of P1 is a serine-type
protease responsible for autocatalytic cleavage between
P1 and the helper component protease pfam00851. The
entire P1 protein may be involved in virus-host
interactions.
Length = 245
Score = 32.3 bits (74), Expect = 0.32
Identities = 24/133 (18%), Positives = 56/133 (42%), Gaps = 14/133 (10%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
E K KE +++EK + +K + KK++K K E+ +
Sbjct: 2 DLEAKVAERLLRKEMSKIKQEKKGRIILRKLSPAQVAKKREKLKREE----------REE 51
Query: 80 EKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREPSLEMSYQRYSHRQRQKSLIIVN 139
+ L Q V K + + + SFR P + + ++ ++++K +++ +
Sbjct: 52 RQFL----QGAYASIVSKITPIGTDKVSKTESVSFRTPYYKRTTKKMKKKKKKKKVVMSD 107
Query: 140 LVDMVLKAVLTVA 152
++ +++ VL +A
Sbjct: 108 KINYLIRQVLKIA 120
Score = 31.9 bits (73), Expect = 0.43
Identities = 25/95 (26%), Positives = 38/95 (40%), Gaps = 14/95 (14%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKE--ENEQEEKE------------KEDVEEEKNKKKK 46
+RK KK+ KR + EE++ + K + + K
Sbjct: 29 LRKLSPAQVAKKREKLKREEREERQFLQGAYASIVSKITPIGTDKVSKTESVSFRTPYYK 88
Query: 47 KKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+ KK KKKKKKKK ++ I + LK K+K
Sbjct: 89 RTTKKMKKKKKKKKVVMSDKINYLIRQVLKIAKKK 123
>gnl|CDD|192718 pfam11176, DUF2962, Protein of unknown function (DUF2962). This
eukaryotic family of proteins has no known function.
Length = 155
Score = 31.5 bits (72), Expect = 0.33
Identities = 11/48 (22%), Positives = 24/48 (50%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
+ K +KKK ++ +K+ E + +E +KE +++ T + D
Sbjct: 26 ATLRDDKLEKKKAERAEKRSLELDRLFFFQEQIKEDEKEPFTLEEIKD 73
>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
replication, recombination, and repair / Transcription /
Translation, ribosomal structure and biogenesis].
Length = 513
Score = 32.5 bits (74), Expect = 0.34
Identities = 14/67 (20%), Positives = 27/67 (40%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
KRIEKR + + E ED + K + +++ + KK K ++ ++ +
Sbjct: 387 KRIEKRLERKLPSAVLLPLDEPEDAKLLKTTRPGLEEESDISDEIKKLKSSKKALLRGLG 446
Query: 73 ESLKERK 79
K
Sbjct: 447 VRFTLSK 453
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 32.0 bits (73), Expect = 0.35
Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 26/90 (28%)
Query: 10 EKKKRIEK--------RNKEEEKEENEQEEKEKEDVEEEK-------NKKKKKKKKKKKK 54
K+KRIE+ + ++ ++ + +D E+E K KK++ ++K++
Sbjct: 105 IKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKER 164
Query: 55 KKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
++++K E K R+E+ LT
Sbjct: 165 EEEEKA-----------AEEEKAREEEILT 183
Score = 27.8 bits (62), Expect = 8.8
Identities = 14/82 (17%), Positives = 34/82 (41%), Gaps = 5/82 (6%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK-----KKK 55
R+ G ++ ++ + R + EE E + +KE + E+ +K + ++
Sbjct: 43 YRQPGQGTEDELRKRDLRAELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEE 102
Query: 56 KKKKKKKKEEQSVKEIEESLKE 77
+ K+K+ EE + +
Sbjct: 103 DEIKRKRIEEDARNSDADDSDS 124
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria. Proteins
in this family are approximately 90 amino acids in
length. There are two completely conserved L residues
that may be functionally important.
Length = 70
Score = 29.6 bits (67), Expect = 0.35
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK 43
K K+ E + +EE++EE E+ K +E E EKNK
Sbjct: 23 KAKKKELKAEEEKREEEEEARKREERKEREKNK 55
Score = 25.8 bits (57), Expect = 9.6
Identities = 11/38 (28%), Positives = 27/38 (71%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
+K K +++K++++++ +K++++KE + K EE L E
Sbjct: 26 KKELKAEEEKREEEEEARKREERKEREKNKSFEELLNE 63
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 30.1 bits (68), Expect = 0.36
Identities = 8/33 (24%), Positives = 13/33 (39%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
E EE E +D +E+ + K K +
Sbjct: 46 EGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78
Score = 29.0 bits (65), Expect = 0.76
Identities = 5/31 (16%), Positives = 12/31 (38%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
E+ ++D ++E + K K +
Sbjct: 48 EEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78
Score = 28.2 bits (63), Expect = 1.8
Identities = 11/52 (21%), Positives = 19/52 (36%)
Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
R E +EE+ E E E+ E++ + + K K E+
Sbjct: 27 RTEALIEEEKVEIARSVMIEGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 32.4 bits (74), Expect = 0.36
Identities = 9/61 (14%), Positives = 26/61 (42%), Gaps = 2/61 (3%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+ E E+ E + E+ + ++ K + +++KK + + ++ L+E +
Sbjct: 175 AVRAEIAAEQAELTTLLSEQ--RAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRAN 232
Query: 82 T 82
Sbjct: 233 E 233
Score = 31.6 bits (72), Expect = 0.60
Identities = 10/61 (16%), Positives = 25/61 (40%), Gaps = 3/61 (4%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI---EESLK 76
E E+ E E ++ + +++KK + + +++ ++E+ E LK
Sbjct: 178 AEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLK 237
Query: 77 E 77
Sbjct: 238 N 238
Score = 30.8 bits (70), Expect = 1.0
Identities = 12/73 (16%), Positives = 29/73 (39%), Gaps = 2/73 (2%)
Query: 8 AGEKKKRIEKRNKEEEK--EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
A + + ++ + E+ Q+ K + +EE K + + +KK ++ + E
Sbjct: 174 AAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANE 233
Query: 66 QSVKEIEESLKER 78
+K S +
Sbjct: 234 SRLKNEIASAEAA 246
>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
(Spore_III_AF). This family represents the stage III
sporulation protein AF (Spore_III_AF) of the bacterial
endospore formation program, which exists in some but
not all members of the Firmicutes (formerly called
low-GC Gram-positives). The C-terminal region of these
proteins is poorly conserved.
Length = 185
Score = 31.5 bits (72), Expect = 0.36
Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 14/80 (17%)
Query: 12 KKRIEKRNKEEEK--------EENEQEEKEKEDVEE------EKNKKKKKKKKKKKKKKK 57
+K++EK+ KEE E +E E D++E E++K+K+K K +
Sbjct: 89 EKQVEKKLKEEYGVKVKDVEVEIDEDLESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDT 148
Query: 58 KKKKKKEEQSVKEIEESLKE 77
+ K KEE+ E E +K
Sbjct: 149 QTSKPKEEEEESEEAEKIKN 168
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 32.4 bits (75), Expect = 0.37
Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 13/75 (17%)
Query: 8 AGEKKKR---IEKRNKEE------EKEENEQEEK----EKEDVEEEKNKKKKKKKKKKKK 54
A E KKR +E RN+ + EK E +K EKE +E + K+ K + K+
Sbjct: 516 AEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAAIKELKEALKGEDKE 575
Query: 55 KKKKKKKKKEEQSVK 69
K K ++ + S K
Sbjct: 576 AIKAKTEELTQASQK 590
>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
(trigger factor) [Posttranslational modification,
protein turnover, chaperones].
Length = 441
Score = 32.3 bits (74), Expect = 0.38
Identities = 20/76 (26%), Positives = 41/76 (53%), Gaps = 5/76 (6%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK-----EEQS 67
K ++KR E +E ++ E++ +EE K K +K +++ K+ +K+K++ E +
Sbjct: 238 KEVKKRELPELDDEFAKKLGEEDTLEELKEKLRKNLERELKEATLEKRKEQLLDALVEAN 297
Query: 68 VKEIEESLKERKEKTL 83
++ ESL E + L
Sbjct: 298 DFDLPESLVEAEIDNL 313
Score = 29.2 bits (66), Expect = 4.0
Identities = 11/52 (21%), Positives = 21/52 (40%), Gaps = 4/52 (7%)
Query: 20 KEEEKEENEQEEKE----KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
EE + ++ K D+ EEK KKK +K+ ++ ++
Sbjct: 389 PEEVIKLYYNNQELLDALKADILEEKAVDLLLANKKKVTEKEVSFEELMNEA 440
>gnl|CDD|131972 TIGR02926, AhaH, ATP synthase archaeal, H subunit. he A1/A0 ATP
synthase is homologous to the V-type (V1/V0, vacuolar)
ATPase, but functions in the ATP synthetic direction as
does the F1/F0 ATPase of bacteria. The hydrophilic A1
"stalk" complex (AhaABCDEFG) is the site of ATP
generation and is coupled to the membrane-embedded
proton translocating A0 complex. It is unclear
precisely where AhaH fits into these complexes.
Length = 85
Score = 30.2 bits (68), Expect = 0.39
Identities = 16/61 (26%), Positives = 40/61 (65%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
EE ++ E++ E++ EE +++K++ + +++ ++ ++ EE++ K EE +KE +E+
Sbjct: 2 EEIKKAEEDAEELIEEAEEERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEEEIEK 61
Query: 85 E 85
E
Sbjct: 62 E 62
Score = 29.0 bits (65), Expect = 1.1
Identities = 17/66 (25%), Positives = 39/66 (59%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A E++K+ +EE +E E+ E+E + EE K+ +++ +K+ +K +++ +K+ E
Sbjct: 18 AEEERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEEEIEKEAEKIREEGEKEIEAM 77
Query: 68 VKEIEE 73
+ +E
Sbjct: 78 KSKAKE 83
Score = 27.5 bits (61), Expect = 3.1
Identities = 16/75 (21%), Positives = 40/75 (53%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ K+ E+ +E +E E+ ++ + EE + ++ +++ K ++ K+ EE+ K
Sbjct: 2 EEIKKAEEDAEELIEEAEEERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEEEIEK 61
Query: 70 EIEESLKERKEKTLT 84
E E+ +E +++
Sbjct: 62 EAEKIREEGEKEIEA 76
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 32.5 bits (74), Expect = 0.40
Identities = 19/62 (30%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 10 EKKKRIEKRNKEE---EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
EK++ +EK KEE E++ EQ +E+E E ++ + K + +K+++K + KK +
Sbjct: 253 EKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNLLKKASR 312
Query: 67 SV 68
S
Sbjct: 313 SA 314
Score = 30.2 bits (68), Expect = 2.2
Identities = 12/51 (23%), Positives = 31/51 (60%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+E++ +E EK E+ E+E++ E+ + +++K + + + K + +K+ E
Sbjct: 251 LEEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301
Score = 29.5 bits (66), Expect = 4.1
Identities = 29/116 (25%), Positives = 48/116 (41%), Gaps = 30/116 (25%)
Query: 21 EEEKE---ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI--EESL 75
EEE + + ++E + +E+ + K +K + K + K+K K S K I E L
Sbjct: 430 EEEHQIYRKLQEERRLREEAI----RAKAEKTARMKAEMKEKTLKMFLLSQKHIVYTEPL 485
Query: 76 KERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLT--NSFREPSLEMSYQRYSHR 129
+ + T+T VLY P+ T N E S+ R++HR
Sbjct: 486 EVQAGTTVT---------------VLYN----PANTVLNGKPEVWFRGSFNRWTHR 522
>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
function prediction only].
Length = 386
Score = 32.1 bits (73), Expect = 0.41
Identities = 11/32 (34%), Positives = 15/32 (46%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
E K+K +K K KK KK + E + E
Sbjct: 355 EAARKRKGDRKGVSHKAKKGGKKNQAETAGFE 386
>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090). This
family consists of several bacterial proteins of unknown
function and is known as YqjC in E. coli.
Length = 115
Score = 30.7 bits (70), Expect = 0.41
Identities = 18/52 (34%), Positives = 29/52 (55%), Gaps = 4/52 (7%)
Query: 24 KEENEQ--EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
+ E + EEKE+E E E+ K+ + K K +KK++K E Q E++E
Sbjct: 65 RAERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQ--AELQE 114
Score = 29.9 bits (68), Expect = 0.64
Identities = 14/50 (28%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 17 KRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+ ++ + EE EQE E+E +++E + K K +KK++K + + + +E
Sbjct: 65 RAERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114
Score = 27.6 bits (62), Expect = 4.1
Identities = 17/79 (21%), Positives = 37/79 (46%), Gaps = 11/79 (13%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQE----EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
G K++ IE + + + N+ EK ++V+ + +++ K ++K+++
Sbjct: 21 GCAAKRQAIETQIEYAKAHGNQHRIAGLEKALKEVQAHCTDAGLRAERQAKVEEKEQE-- 78
Query: 63 KEEQSVKEIEESLKERKEK 81
V E E+ LKE + K
Sbjct: 79 -----VAEREQELKEAQAK 92
Score = 27.2 bits (61), Expect = 5.6
Identities = 9/48 (18%), Positives = 29/48 (60%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
E++ ++E++ +E + E E +E + + ++ KK++K + + + ++
Sbjct: 67 ERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114
Score = 27.2 bits (61), Expect = 5.9
Identities = 18/57 (31%), Positives = 30/57 (52%), Gaps = 5/57 (8%)
Query: 8 AGEKKKRIEK-RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
AG + +R K KE+E E EQE KE + K K +KK++K + + + ++
Sbjct: 62 AGLRAERQAKVEEKEQEVAEREQELKEAQ----AKGDADKIEKKQRKLAEAQAELQE 114
>gnl|CDD|201540 pfam00992, Troponin, Troponin. Troponin (Tn) contains three
subunits, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). this Pfam contains members of
the TnT subunit. Troponin is a complex of three
proteins, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). The troponin complex
regulates Ca++ induced muscle contraction. This family
includes troponin T and troponin I. Troponin I binds to
actin and troponin T binds to tropomyosin.
Length = 131
Score = 31.0 bits (71), Expect = 0.42
Identities = 18/74 (24%), Positives = 33/74 (44%), Gaps = 9/74 (12%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK----KKKKKK---KKKKKKKKKKKKKE 64
RI++ +EE+ + E++ +K+ E+ KK + K KK KK +K K
Sbjct: 55 HARIDRL--DEERYDIEEKVAKKDKEIEDLKKKVNDLRGKFKKPTLKKVRKSADAMLKAL 112
Query: 65 EQSVKEIEESLKER 78
S ++ L+
Sbjct: 113 LGSKHKVSMDLRAN 126
Score = 28.3 bits (64), Expect = 3.3
Identities = 16/54 (29%), Positives = 27/54 (50%), Gaps = 10/54 (18%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKK----KKKK---KKKKKKK---KKKKKKE 64
++ ++ ++E+ D+EE+ KK K KKK + K KK KK +K
Sbjct: 52 KKLHARIDRLDEERYDIEEKVAKKDKEIEDLKKKVNDLRGKFKKPTLKKVRKSA 105
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 32.5 bits (74), Expect = 0.42
Identities = 15/74 (20%), Positives = 33/74 (44%), Gaps = 8/74 (10%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK--------KKKKKKKKEEQSVK 69
RN EE+ + + ++ +++ E E +++KK +K + K K E+
Sbjct: 538 RNSTEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEA 597
Query: 70 EIEESLKERKEKTL 83
++ + + EK L
Sbjct: 598 ALKMKMTDTSEKPL 611
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 32.1 bits (74), Expect = 0.43
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 20 KEEEKEENEQEEKEKEDVEEE 40
E+EE E+EE+E+E+ EE
Sbjct: 299 AAAEEEEEEEEEEEEEEPSEE 319
Score = 31.8 bits (73), Expect = 0.50
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 20 KEEEKEENEQEEKEKEDVEEE 40
EE+EE E+EE+E+E EEE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEE 320
>gnl|CDD|234706 PRK00269, zipA, cell division protein ZipA; Reviewed.
Length = 293
Score = 32.1 bits (73), Expect = 0.44
Identities = 19/100 (19%), Positives = 46/100 (46%), Gaps = 14/100 (14%)
Query: 1 MRKRRGGAGEKKKRIEKR--NKEEEKEENEQE-------EKEKEDVEEEKNK-----KKK 46
R+ RGG G+ K R+++ N +E+E + + + KE +E + + +
Sbjct: 25 WRRMRGGKGKLKFRLDRSYSNLPDEEEGSAELLGPARVLDTHKEPQLDEHDLPSMSARPR 84
Query: 47 KKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
++++ K K++K+ + E ++ L E + ++
Sbjct: 85 ERRRDTKTAKQQKRGRGSEPQQGDLNLDLDEVEPALFSDR 124
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 32.0 bits (73), Expect = 0.44
Identities = 11/85 (12%), Positives = 23/85 (27%), Gaps = 21/85 (24%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQ---------------------EEKEKEDVEE 39
M +R+ K++ K+ +E + + +
Sbjct: 127 MIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAK 186
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKE 64
+ +K K K KK K+
Sbjct: 187 LSSTASSGDEKSPKSKAAPKKAGKK 211
Score = 31.3 bits (71), Expect = 0.77
Identities = 11/44 (25%), Positives = 18/44 (40%), Gaps = 1/44 (2%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
K +E + +K + + +K K K KK KK +K
Sbjct: 172 KAKEGPSR-RAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214
Score = 27.8 bits (62), Expect = 7.9
Identities = 15/60 (25%), Positives = 27/60 (45%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
R E+E ++ + K EE K KK ++K K+ K+K+ + +E S +
Sbjct: 95 RELEKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPES 154
>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1. MAT1
is an assembly/targeting factor for cyclin-dependent
kinase-activating kinase (CAK), which interacts with the
transcription factor TFIIH. The domain found to the
N-terminal side of this domain is a C3HC4 RING finger.
Length = 200
Score = 31.7 bits (72), Expect = 0.44
Identities = 17/77 (22%), Positives = 42/77 (54%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+ + ++EE E+ +EEKE ++ + +K+++++K K+K K++ + E S
Sbjct: 83 RNKRRLTREQEELEQALEEEKEMKEEKRLHLQKEEQEQKMAKEKDKQEIIDELETSNLPA 142
Query: 72 EESLKERKEKTLTEDSQ 88
+ + K+++ +SQ
Sbjct: 143 NVIIAQHKKQSKQLESQ 159
Score = 29.0 bits (65), Expect = 3.5
Identities = 24/100 (24%), Positives = 42/100 (42%), Gaps = 19/100 (19%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK------------KKKKKKKK 57
E+ EK KEE++ ++EE+E ++ K+K K++ + K
Sbjct: 96 EQALEEEKEMKEEKRLHLQKEEQE-----QKMAKEKDKQEIIDELETSNLPANVIIAQHK 150
Query: 58 KKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFK 97
K+ K+ E Q E E K T + + + +PV K
Sbjct: 151 KQSKQLESQ--VEKLERKKRVTFSTGIKKGEHISFVPVIK 188
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 32.2 bits (73), Expect = 0.44
Identities = 21/107 (19%), Positives = 55/107 (51%), Gaps = 18/107 (16%)
Query: 1 MRKRRGGAGEKKKRIEK-------RNKEEEKEENE---------QEEKEKEDVEEEKNKK 44
M++ E++KR E+ R+ + K + QE +E+E VEEE+ ++
Sbjct: 393 MKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEE 452
Query: 45 KKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
+++++++++ ++++ + ++EE+ V+ S E + + +E
Sbjct: 453 EEEEEEEQESEEEEGEDEEEEEEVEADNGS--EEEMEGSSEGDGDGE 497
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 32.2 bits (74), Expect = 0.44
Identities = 16/72 (22%), Positives = 40/72 (55%), Gaps = 3/72 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K+ +++ +E ++ E E E+ +K +K KK + + +K+++ +K + ++ +
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKL---LKKLPKKARGQLPPEKREQLEKLLETKEKLS 387
Query: 70 EIEESLKERKEK 81
E E L+E ++
Sbjct: 388 EELEELEEELKE 399
Score = 31.5 bits (72), Expect = 0.82
Identities = 26/93 (27%), Positives = 48/93 (51%), Gaps = 15/93 (16%)
Query: 19 NKEEEKEENEQEEKEKEDVEEE-------------KNKKKKKKKKKKKKKKKKKKKKKEE 65
+ E KEE ++ E+E +++EEE K + + +K+++ +K + K+K
Sbjct: 328 DFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLS 387
Query: 66 QSVKEIEESLKERKEKTLTEDSQCDTMIPVFKK 98
+ ++E+EE LKE KE+ E + I V K
Sbjct: 388 EELEELEEELKELKEE--LESLYSEGKISVNKT 418
Score = 28.8 bits (65), Expect = 5.0
Identities = 12/61 (19%), Positives = 33/61 (54%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
KK ++K K+ + ++ ++ E + E K K ++ ++ +++ K+ K++ + S +I
Sbjct: 354 KKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEELEEELKELKEELESLYSEGKI 413
Query: 72 E 72
Sbjct: 414 S 414
>gnl|CDD|233573 TIGR01781, Trep_dent_lipo, Treponema denticola clustered
lipoprotein. This model represents a family of six
predicted lipoproteins from a region of about 20
tandemly arranged genes in the Treponema denticola
genome. Two other neighboring genes share the
lipoprotein signal peptide region but do not show more
extensive homology. The function of this locus is
unknown.
Length = 412
Score = 32.2 bits (73), Expect = 0.45
Identities = 16/55 (29%), Positives = 27/55 (49%), Gaps = 1/55 (1%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
G ++ R+EK +K EEK ENE K + + + ++ K+ K K + K
Sbjct: 29 GLREMRVEKFSKMEEK-ENEFLSKPEYNTHVKSPEQIKELSNKISGVSGKGESDK 82
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 31.7 bits (72), Expect = 0.45
Identities = 14/69 (20%), Positives = 35/69 (50%), Gaps = 10/69 (14%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
+ N+E++KE+ + + + ++ + K + + +K+K+++K + K
Sbjct: 106 EENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAA----------K 155
Query: 77 ERKEKTLTE 85
ERKE+ + E
Sbjct: 156 ERKERLVAE 164
Score = 29.8 bits (67), Expect = 2.0
Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 16/77 (20%)
Query: 2 RKRRGGAGEKKKRIEKRNKE------------EEKEENEQEEKEKEDVEEEKNKKKKKKK 49
RKR A K+R E+ E + + + ++KEKE EK K K+ K+
Sbjct: 145 RKREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKE----EKKKVKEAKR 200
Query: 50 KKKKKKKKKKKKKKEEQ 66
++K++K+ Q
Sbjct: 201 REKEEKRMAALVAAAAQ 217
Score = 27.9 bits (62), Expect = 6.7
Identities = 9/52 (17%), Positives = 28/52 (53%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
++++ EK + + +E + K + +K+K+++K + K++K++
Sbjct: 109 REQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERKER 160
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 30.4 bits (69), Expect = 0.47
Identities = 18/70 (25%), Positives = 45/70 (64%), Gaps = 2/70 (2%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+KR+EKR +++ + E+E K++++ E ++ + K+++ K++K++ +K + K++
Sbjct: 31 EKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKERYEKMAAKMHAKKV 90
Query: 72 EESLKERKEK 81
E +R+EK
Sbjct: 91 ER--LKRREK 98
Score = 29.7 bits (67), Expect = 0.78
Identities = 14/39 (35%), Positives = 24/39 (61%)
Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
KK + K K +K+ +K+ E+Q++K E+ LK+ KE
Sbjct: 17 EKKAFRPKSKLTSWEKRMEKRLEQQAIKAREKELKDEKE 55
Score = 27.4 bits (61), Expect = 5.6
Identities = 23/72 (31%), Positives = 34/72 (47%), Gaps = 8/72 (11%)
Query: 1 MRKRRGGAGEKKKRIE-KRNKEEEKEENEQEEKEKEDVEEEKN-------KKKKKKKKKK 52
M KR K + E K KE E++ Q KE+ +EEK K KK ++
Sbjct: 34 MEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKERYEKMAAKMHAKKVERL 93
Query: 53 KKKKKKKKKKKE 64
K+++K+ K KE
Sbjct: 94 KRREKRNKLLKE 105
>gnl|CDD|234635 PRK00104, scpA, segregation and condensation protein A; Reviewed.
Length = 242
Score = 31.7 bits (73), Expect = 0.47
Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 14/92 (15%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKK--------------KKKKKKKK 53
A E K++ E+R + K ++ EE + E E+E + +KK K +
Sbjct: 105 AEELKEQEEERAQYFSKPPSDLEEFDAELDEDELDGDVTLLDLFLAFSKLMERKKANKPR 164
Query: 54 KKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
K ++ + K E + I E L+ +K + +E
Sbjct: 165 HTKIERDEYKIEDRMLYILELLESKKRLSFSE 196
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 31.3 bits (71), Expect = 0.48
Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 1/67 (1%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K + +K +E + E E EKE E + + ++K + +++ K+K+E KE
Sbjct: 102 KDDQTKKL-EERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKE 160
Query: 71 IEESLKE 77
I E
Sbjct: 161 IARLKSE 167
Score = 30.1 bits (68), Expect = 1.4
Identities = 18/88 (20%), Positives = 40/88 (45%), Gaps = 3/88 (3%)
Query: 1 MRKRRGGAGEKKKRIEKR--NKEEEKEENEQEEKE-KEDVEEEKNKKKKKKKKKKKKKKK 57
++ + E++K +EK KE ++E + E E + + E + + K+KK+ + K+
Sbjct: 101 LKDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKE 160
Query: 58 KKKKKKEEQSVKEIEESLKERKEKTLTE 85
+ K E +K+ + + T
Sbjct: 161 IARLKSEASKIKQELVDAELEFQTTAAA 188
>gnl|CDD|145900 pfam02994, Transposase_22, L1 transposable element.
Length = 370
Score = 31.9 bits (72), Expect = 0.49
Identities = 22/97 (22%), Positives = 42/97 (43%), Gaps = 8/97 (8%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
++ ++ IE + KE + E EE + VE +K K+ + KKK ++ K+
Sbjct: 53 KKDNYSSLREDIETKGKEVQNFLKELEECITKQVEAHIENTEKCLKELMELKKKARELKE 112
Query: 64 EEQSVK--------EIEESLKERKEKTLTEDSQCDTM 92
E +S+K +IE K + + ++ M
Sbjct: 113 ECRSIKKEHMETTLDIENQKKRQGVVDTSFTNRIQEM 149
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 31.2 bits (71), Expect = 0.50
Identities = 9/48 (18%), Positives = 20/48 (41%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
EE + ++ + + + ++KK + +K +KK EQ
Sbjct: 88 SEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLAEQEA 135
Score = 29.6 bits (67), Expect = 1.9
Identities = 18/65 (27%), Positives = 26/65 (40%), Gaps = 5/65 (7%)
Query: 22 EEKEENEQEEKEK-EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
K+E+E + EE KK K+K K+ ++KK S E L+
Sbjct: 72 HPKDESEAAPDVVPKSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLR---- 127
Query: 81 KTLTE 85
K L E
Sbjct: 128 KKLAE 132
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 30.7 bits (70), Expect = 0.50
Identities = 31/97 (31%), Positives = 46/97 (47%), Gaps = 31/97 (31%)
Query: 16 EKRNKEEEKEENEQEEKEKED--------------VEEEKNKKKKKKKKKKKKKKKKKK- 60
E R +E ++E + ED EE K +++KK+K+KKKKKKK
Sbjct: 31 EAREEERKRERKRLRNEPDEDGFVTVVRGGRKGPAAGEEAGKAAEEEKKEKEKKKKKKKE 90
Query: 61 -----------KKKEEQS--VKEIEES---LKERKEK 81
KKKEEQ+ +K+ EE +++ KEK
Sbjct: 91 LEDFYRFQLREKKKEEQADLLKKFEEDKERVEKMKEK 127
Score = 29.2 bits (66), Expect = 1.6
Identities = 19/69 (27%), Positives = 38/69 (55%), Gaps = 7/69 (10%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKED-----VEEEKNKKKKKKKKKKKKKK 56
RK E K +++EKE+ ++++KE ED + E+K +++ KK ++ K
Sbjct: 61 RKGPAAGEEAGKA--AEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQADLLKKFEEDK 118
Query: 57 KKKKKKKEE 65
++ +K KE+
Sbjct: 119 ERVEKMKEK 127
Score = 27.2 bits (61), Expect = 6.4
Identities = 15/65 (23%), Positives = 34/65 (52%), Gaps = 8/65 (12%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQE--------EKEKEDVEEEKNKKKKKKKKKKK 53
AG+ + +K ++++K++ E E EK+KE+ + K ++ K++ +K
Sbjct: 64 PAAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQADLLKKFEEDKERVEK 123
Query: 54 KKKKK 58
K+K+
Sbjct: 124 MKEKR 128
>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
Length = 103
Score = 30.1 bits (68), Expect = 0.50
Identities = 25/72 (34%), Positives = 49/72 (68%), Gaps = 1/72 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E ++RIEK KEE K+ + ++E + +EEE KK +++ +K +KKKK+ +++ ++ ++
Sbjct: 14 EAEERIEKA-KEEAKKIIRKAKEEAKKIEEEIIKKAEEEAQKLIEKKKKEGEEEAKKILE 72
Query: 70 EIEESLKERKEK 81
E E+ ++E K K
Sbjct: 73 EGEKEIEELKVK 84
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 32.1 bits (73), Expect = 0.51
Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 8/94 (8%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK--------KKKKKK 61
EK + + + K+ E +E E ++E + +K K KK K+ KK K
Sbjct: 11 EKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSK 70
Query: 62 KKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPV 95
E + E+ K RK T S + P
Sbjct: 71 PTESSAASSEEKPAKPRKSAESTRSSHPKSKAPS 104
Score = 29.0 bits (65), Expect = 4.7
Identities = 15/92 (16%), Positives = 38/92 (41%), Gaps = 6/92 (6%)
Query: 3 KRRGGAGEKKKRIEKRNKE--EEKEENEQEEKEKEDVEEEKNKK----KKKKKKKKKKKK 56
++ G+ K + +++ E +E EQ EK K + ++ + + KK K +
Sbjct: 15 TKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTES 74
Query: 57 KKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
+++ ++ ES + K+ ++
Sbjct: 75 SAASSEEKPAKPRKSAESTRSSHPKSKAPSTE 106
>gnl|CDD|129587 TIGR00496, frr, ribosome recycling factor. This model finds only
eubacterial proteins. Mitochondrial and/or chloroplast
forms might be expected but are not currently known.
This protein was previously called ribosome releasing
factor. By releasing ribosomes from mRNA at the end of
protein biosynthesis, it prevents inappropriate
translation from 3-prime regions of the mRNA and frees
the ribosome for new rounds of translation.
EGAD|53116|YHR038W is part of the frr superfamily
[Protein synthesis, Translation factors].
Length = 176
Score = 31.3 bits (71), Expect = 0.51
Identities = 22/68 (32%), Positives = 39/68 (57%), Gaps = 7/68 (10%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE------QSVKEIEESLKE 77
K+ EQ + +V + N K KK +K K+ + ++++ +EE + +K+I+E LK+
Sbjct: 109 KKIAEQAKVAVRNVRRDANDKVKKLEKDKEISEDEERRLQEEIQKLTDEYIKKIDEILKD 168
Query: 78 RKEKTLTE 85
KEK L E
Sbjct: 169 -KEKELME 175
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents
the seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is
recognized by the pfam model tRNA-synt_2b. The
seryl-tRNA synthetases of two archaeal species,
Methanococcus jannaschii and Methanobacterium
thermoautotrophicum, differ considerably and are
included in a different model [Protein synthesis, tRNA
aminoacylation].
Length = 418
Score = 31.9 bits (73), Expect = 0.51
Identities = 19/79 (24%), Positives = 39/79 (49%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+ R +++ + E +K +E EE + + E K K K +KK K ++ KK+
Sbjct: 19 KARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKEL 78
Query: 62 KKEEQSVKEIEESLKERKE 80
K+ ++ + E+ +LK +
Sbjct: 79 KELKEELTELSAALKALEA 97
Score = 31.9 bits (73), Expect = 0.57
Identities = 18/79 (22%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K+ ++ R + + + + E + ++ + K+ + K+ K K +++ ++
Sbjct: 13 LVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIE 72
Query: 70 EIEESLKERKEKTLTEDSQ 88
EI++ LKE KE+ LTE S
Sbjct: 73 EIKKELKELKEE-LTELSA 90
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 31.8 bits (72), Expect = 0.52
Identities = 19/50 (38%), Positives = 31/50 (62%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K +++ K EK E E ++++ E ++ E +KKKKK KK+K + K KK
Sbjct: 38 KTLKRLYKVYEKREFELKKQQFEQLKAELSKKKKKFKKEKVDVRVKVVKK 87
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 31.2 bits (71), Expect = 0.53
Identities = 11/50 (22%), Positives = 23/50 (46%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQC 89
K ++K+K++ KKK K +K +++ K+ L + L +
Sbjct: 153 AKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKKFGYA 202
Score = 29.6 bits (67), Expect = 1.6
Identities = 10/43 (23%), Positives = 22/43 (51%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
EE++ ++ KKK K +K K+ +++ K + + K+
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198
Score = 28.9 bits (65), Expect = 2.9
Identities = 11/43 (25%), Positives = 25/43 (58%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
EEK K++ KKK K +K K+ ++++ ++ +++ +K
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198
>gnl|CDD|192488 pfam10228, DUF2228, Uncharacterized conserved protein (DUF2228).
This is a family of conserved proteins of approximately
700 residues found from worms to humans.
Length = 253
Score = 31.6 bits (72), Expect = 0.54
Identities = 14/50 (28%), Positives = 28/50 (56%)
Query: 35 EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
+ +E KKKK KK ++ + K+ S+++ + +K+RK+K +T
Sbjct: 94 KKSKEIPFKKKKISALKKIQELLTEWAKELGLSLEQKTKVMKQRKKKVVT 143
Score = 29.3 bits (66), Expect = 2.6
Identities = 16/51 (31%), Positives = 27/51 (52%)
Query: 53 KKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYK 103
KK K+ KKK+ ++K+I+E L E ++ Q ++ KK V+ K
Sbjct: 94 KKSKEIPFKKKKISALKKIQELLTEWAKELGLSLEQKTKVMKQRKKKVVTK 144
>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
protein [Cell motility and secretion / Intracellular
trafficking and secretion].
Length = 234
Score = 31.6 bits (72), Expect = 0.55
Identities = 20/81 (24%), Positives = 35/81 (43%), Gaps = 7/81 (8%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEK------EDVEEEKNK-KKKKKKKKKKKKKKKKKKK 62
E++ KEEE E QE +E E EE ++ ++ +++ + ++
Sbjct: 39 EEELEQALEAKEEELESAAQELQEGIEEGAREGYEEGFQLGYEEGFEEGQEEGRVLERLA 98
Query: 63 KEEQSVKEIEESLKERKEKTL 83
K + E+LKE EK L
Sbjct: 99 KLIAEFQAELEALKEVVEKQL 119
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.
This family includes the HABP4 family of
hyaluronan-binding proteins, and the PAI-1 mRNA-binding
protein, PAI-RBP1. HABP4 has been observed to bind
hyaluronan (a glucosaminoglycan), but it is not known
whether this is its primary role in vivo. It has also
been observed to bind RNA, but with a lower affinity
than that for hyaluronan. PAI-1 mRNA-binding protein
specifically binds the mRNA of type-1 plasminogen
activator inhibitor (PAI-1), and is thought to be
involved in regulation of mRNA stability. However, in
both cases, the sequence motifs predicted to be
important for ligand binding are not conserved
throughout the family, so it is not known whether
members of this family share a common function.
Length = 106
Score = 30.1 bits (68), Expect = 0.56
Identities = 21/82 (25%), Positives = 43/82 (52%), Gaps = 10/82 (12%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEE-EKNKKKKKKKKKKKKKKKKKKKK 62
+R GAG + N K+E + +E+ + EE + ++K+ +++ K+ ++++
Sbjct: 18 KREGAG-------RGNWGSVKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVVEEEE 70
Query: 63 KEEQSVKEI--EESLKERKEKT 82
EE+ KE+ EE K +EK
Sbjct: 71 VEEEEDKEMTLEEYEKALQEKR 92
Score = 27.8 bits (62), Expect = 3.8
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 5/60 (8%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK-----KKKKKKKKKKKKKKKKKKKKE 64
EK + EK+ EEE + E+E+ + EE+K +K ++K+K K +K KE
Sbjct: 46 EKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEMTLEEYEKALQEKRKALPALKVRKAKE 105
>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain. This alpha
helical domain is found in a set of bacterial plasmid
replication proteins. The domain is found to the
C-terminus of the primase/polymerase domain. Mutants of
this domain are defective in template binding,
dinucleotide formation and conformation change prior to
DNA extension.
Length = 135
Score = 30.6 bits (69), Expect = 0.57
Identities = 19/52 (36%), Positives = 32/52 (61%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
E K++E+ EE +K K++ K + K K + E+ K+I++SL+E KEK
Sbjct: 1 EGKKEEEDSEEDFEKLKEEMAKYDRFKGKTVEAIREEVCKKIKKSLEELKEK 52
Score = 28.2 bits (63), Expect = 3.6
Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 10/68 (14%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKK----------KKKKKKKKKKKKKEEQSVKEI 71
E K+E E E++ E ++EE K + K K KK KK ++ K+K++ +
Sbjct: 1 EGKKEEEDSEEDFEKLKEEMAKYDRFKGKTVEAIREEVCKKIKKSLEELKEKKKAILNTA 60
Query: 72 EESLKERK 79
+ + E K
Sbjct: 61 KGVICEGK 68
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 31.5 bits (72), Expect = 0.57
Identities = 19/55 (34%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 20 KEEEKEENEQEEK-EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
K KEE +K K D +++K+KKK K++KK KEE+ +++EE
Sbjct: 69 KILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEK--EKLEE 121
Score = 30.7 bits (70), Expect = 0.96
Identities = 14/28 (50%), Positives = 20/28 (71%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
EEEK KKK K++KK K++K+K +E
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 28.4 bits (64), Expect = 5.5
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKE 33
EKKK + K K+ KEE E+ E+
Sbjct: 99 EKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
Members of this family of small ribosomal proteins
possess one of three conserved blocks of sequence found
in proteins that stimulate the dissociation of guanine
nucleotides from G-proteins, leaving open the
possibility that MRP-S27 might be a functional partner
of GTP-binding ribosomal proteins.
Length = 417
Score = 31.7 bits (72), Expect = 0.57
Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 8/74 (10%)
Query: 9 GEKKKRIEKRNKEE----EKEENEQEEKEKEDVEEEKNKKK----KKKKKKKKKKKKKKK 60
E + +++ K E E EKED+E + +++ + +K ++KKK +++
Sbjct: 344 QEWRSKLQALAKVESRSLLSVEERLPTIEKEDLELYEQRQQLWFFENRKLWQRKKKLREQ 403
Query: 61 KKKEEQSVKEIEES 74
+E Q S
Sbjct: 404 ADEEYQQRHATRAS 417
>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein. PACS-1 is a
cytosolic sorting protein that directs the localisation
of membrane proteins in the trans-Golgi network
(TGN)/endosomal system. PACS-1 connects the clathrin
adaptor AP-1 to acidic cluster sorting motifs contained
in the cytoplasmic domain of cargo proteins such as
furin, the cation-independent mannose-6-phosphate
receptor and in viral proteins such as human
immunodeficiency virus type 1 Nef.
Length = 413
Score = 31.7 bits (72), Expect = 0.58
Identities = 17/62 (27%), Positives = 27/62 (43%), Gaps = 4/62 (6%)
Query: 44 KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYK 103
K+K KK KK K+KE +S + E + + + Q TM+ V GV +
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESKSQCIEGIS----RLICSAKQQQTMLRVSIDGVEWN 386
Query: 104 GI 105
+
Sbjct: 387 DV 388
Score = 28.2 bits (63), Expect = 7.3
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
K+K KK KK K+K+ E + IE
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESKSQCIE 360
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 31.8 bits (73), Expect = 0.60
Identities = 18/59 (30%), Positives = 36/59 (61%), Gaps = 4/59 (6%)
Query: 12 KKRIEKRNKE---EEKEENEQEE-KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
K+++EK +E E+ + Q+E E +D ++E + ++K +K K K+ K+K +KE +
Sbjct: 219 KEQMEKSQREYYLREQLKAIQKELGEDDDDKDEVEELREKIEKLKLPKEAKEKAEKELK 277
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 31.7 bits (72), Expect = 0.60
Identities = 17/67 (25%), Positives = 32/67 (47%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
G KKK+ + + + E+ +++E + +KN+KK + K K K KKE
Sbjct: 419 GSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKS 478
Query: 69 KEIEESL 75
+E ++
Sbjct: 479 QEDNNNI 485
Score = 29.8 bits (67), Expect = 2.9
Identities = 14/51 (27%), Positives = 20/51 (39%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
EK E K KKKK K K +S+ + EE ++ +K
Sbjct: 401 EKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKK 451
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 31.6 bits (71), Expect = 0.62
Identities = 13/76 (17%), Positives = 37/76 (48%), Gaps = 8/76 (10%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
G+K+++ E+ E +E Q ++ E EE K++ + ++ ++ +
Sbjct: 289 VSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEE--------KEEDDENEENERHTELLA 340
Query: 66 QSVKEIEESLKERKEK 81
+ E+E+ ++E++ +
Sbjct: 341 DELNELEKGIEEKRRQ 356
Score = 30.8 bits (69), Expect = 1.1
Identities = 17/73 (23%), Positives = 39/73 (53%), Gaps = 1/73 (1%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
E++ +++E EENE+ + D E K ++K+++ + +K+ E + + +
Sbjct: 320 GEEKEEDDENEENERHTELLADELNELEKGIEEKRRQMESATNPILQKRFESQLNVLLKE 379
Query: 75 LK-ERKEKTLTED 86
L+ +RK+ + ED
Sbjct: 380 LELKRKQLEMEED 392
Score = 29.3 bits (65), Expect = 3.9
Identities = 14/68 (20%), Positives = 32/68 (47%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
+ + + +E E+++ED E E+N++ + + + +K ++K Q L
Sbjct: 306 AHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKRRQMESATNPIL 365
Query: 76 KERKEKTL 83
++R E L
Sbjct: 366 QKRFESQL 373
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is
functionally equivalent to L7/L12 in bacteria and the
P1 and P2 proteins in eukaryotes. L12p is homologous to
P1 and P2 but is not homologous to bacterial L7/L12. It
is located in the L12 stalk, with proteins L10, L11,
and 23S rRNA. L12p is the only protein in the ribosome
to occur as multimers, always appearing as sets of
dimers. Recent data indicate that most archaeal species
contain six copies of L12p (three homodimers), while
eukaryotes have four copies (two heterodimers), and
bacteria may have four or six copies (two or three
homodimers), depending on the species. The organization
of proteins within the stalk has been characterized
primarily in bacteria, where L7/L12 forms either two or
three homodimers and each homodimer binds to the
extended C-terminal helix of L10. L7/L12 is attached to
the ribosome through L10 and is the only ribosomal
protein that does not directly interact with rRNA.
Archaeal L12p is believed to function in a similar
fashion. However, hybrid ribosomes containing the large
subunit from E. coli with an archaeal stalk are able to
bind archaeal and eukaryotic elongation factors but not
bacterial elongation factors. In several mesophilic and
thermophilic archaeal species, the binding of 23S rRNA
to protein L11 and to the L10/L12p pentameric complex
was found to be temperature-dependent and cooperative.
Length = 106
Score = 29.8 bits (67), Expect = 0.62
Identities = 13/25 (52%), Positives = 18/25 (72%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEE 40
K EEKEE +++E+EKE+ EEE
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 28.2 bits (63), Expect = 2.3
Identities = 11/27 (40%), Positives = 21/27 (77%)
Query: 49 KKKKKKKKKKKKKKKEEQSVKEIEESL 75
++K ++K+++KKKEE+ +E EE+L
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEEAL 99
Score = 28.2 bits (63), Expect = 2.5
Identities = 10/31 (32%), Positives = 19/31 (61%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVE 38
A + + +EE+K+E E+EE+E+E +
Sbjct: 70 AAAAAEEKAEEKEEEKKKEEEKEEEEEEALA 100
Score = 27.8 bits (62), Expect = 3.7
Identities = 8/25 (32%), Positives = 23/25 (92%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQS 67
++K ++K+++KKK+++K+++EE++
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEEEA 98
>gnl|CDD|226034 COG3503, COG3503, Predicted membrane protein [Function unknown].
Length = 323
Score = 31.6 bits (72), Expect = 0.63
Identities = 26/103 (25%), Positives = 33/103 (32%), Gaps = 20/103 (19%)
Query: 180 LAMCLLGWW----RCFANNYLQWAAVGTWIL-LNTQGWWRCFANNYLQWAA------VGT 228
LA LG W R A+++L A V + WR L+ AA T
Sbjct: 47 LATTALGLWRYFARLIASSFLFLAGVSLSLSHSRGLRRWRFLVKRGLKLAALALAITAVT 106
Query: 229 WILLNTQGFISQGIGLNNKEYLIWYILFIVFVTYAMLPLPLKW 271
W FI GI + I + A L LP
Sbjct: 107 WFAF-PDSFIFFGI--------LHAIGLASLLGAAFLWLPRAV 140
>gnl|CDD|220222 pfam09403, FadA, Adhesion protein FadA. FadA (Fusobacterium
adhesin A) is an adhesin which forms two alpha helices.
Length = 126
Score = 30.3 bits (68), Expect = 0.64
Identities = 16/71 (22%), Positives = 41/71 (57%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + EK+ KE ++E ++ ++ + EE K K++ K + K + KK K+ ++ +
Sbjct: 44 EARFEEEKQEKETAEKEVQELKERQLGREELYAKLKEESKTRFHKDEYKKLLKRYDEYLN 103
Query: 70 EIEESLKERKE 80
++E+ + ++++
Sbjct: 104 KLEQKITDKEQ 114
>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
Length = 196
Score = 31.0 bits (70), Expect = 0.64
Identities = 26/88 (29%), Positives = 47/88 (53%), Gaps = 14/88 (15%)
Query: 12 KKRIEKR-NKEEEK-----EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+ +EK NKEEEK NEQ E ED + ++N+ +++ + ++ K+K ++K+ E
Sbjct: 1 MEEVEKEINKEEEKDVNNLSSNEQMEGPPEDEQAQQNQPQQQTVDEIEELKQKLQQKEVE 60
Query: 66 --------QSVKEIEESLKERKEKTLTE 85
Q +K E+ ++R EK +E
Sbjct: 61 AQEYLDIAQRLKAEFENYRKRTEKEKSE 88
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 31.6 bits (72), Expect = 0.64
Identities = 11/44 (25%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 21 EEEKEENEQEEKEKEDVE--EEKNKKKKKKKKKKKKKKKKKKKK 62
E+EE E EE E+E E +++ +K+ ++ K++++ +++ +
Sbjct: 149 LPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRRRSQ 192
Score = 30.8 bits (70), Expect = 1.2
Identities = 14/55 (25%), Positives = 30/55 (54%), Gaps = 1/55 (1%)
Query: 16 EKRNKEEEKEENEQEE-KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K E E E E+EE +E E+ EE + +K+ ++ K++++ ++ Q ++
Sbjct: 141 PKNEFELELPEEEEEEPEEMEEELEEDAADRDARKRAAEEAKEQEELRRRSQVIQ 195
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 31.5 bits (72), Expect = 0.65
Identities = 11/70 (15%), Positives = 26/70 (37%), Gaps = 4/70 (5%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES- 74
E +K K+ + ++ E+E +E+ K K + +K E + +
Sbjct: 6 EPLSKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDETMDPTQYRANR 65
Query: 75 ---LKERKEK 81
++ + K
Sbjct: 66 LKKVEALRAK 75
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel,
four-helix bundle structure with a large hydrophobic
patch in which 23 residues of one monomer form van der
Waals contacts with the other monomer. This CENP-B
dimer configuration may be suitable for capturing two
distant CENP-B boxes during centromeric heterochromatin
formation.
Length = 101
Score = 29.7 bits (66), Expect = 0.66
Identities = 9/28 (32%), Positives = 19/28 (67%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNK 43
+ + EEE +++E EE + ED +E+ ++
Sbjct: 13 DSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
Length = 159
Score = 30.8 bits (70), Expect = 0.67
Identities = 24/93 (25%), Positives = 44/93 (47%), Gaps = 16/93 (17%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEE-----KNKKKKKKKKKKKKKKKKKKKKKE 64
K +R K KEE K+ E+ + + E+V EE + ++ K + +++K+K +
Sbjct: 60 LKNERELKNAKEEGKKIVEEYKSKAENVYEEIVKEAHEEADLIIERAKLEAQREKEKAEY 119
Query: 65 E-----------QSVKEIEESLKERKEKTLTED 86
E S K +EES+ E + + L +D
Sbjct: 120 EIKNQAVDLAVLLSSKALEESIDESEHRRLIKD 152
Score = 29.2 bits (66), Expect = 1.7
Identities = 20/70 (28%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 12 KKRIEKRNKE-EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K I+ R E + K E E+++K + KN+++ K K++ KK ++ K K E +E
Sbjct: 31 KAVIDSRQSEIDNKIEKADEDQKKARELKLKNERELKNAKEEGKKIVEEYKSKAENVYEE 90
Query: 71 IEESLKERKE 80
I + E +
Sbjct: 91 IVKEAHEEAD 100
>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361).
This is a region of 120 amino acids that is conserved
in a family of proteins found from plants to fungi. The
function is not known.
Length = 114
Score = 29.9 bits (68), Expect = 0.68
Identities = 26/89 (29%), Positives = 42/89 (47%), Gaps = 16/89 (17%)
Query: 13 KRIEKRNK--EEEKEENEQE----EKEKEDVEEEKNKKKKKKK---------KKKKKKKK 57
+R+ K+ + + E E+ +KE ED E K +KK KK KK ++ K
Sbjct: 10 ERLLKKKDLPADVRVEKERALEALKKELEDAERRKKRKKMAKKYHMVRFFERKKATRRLK 69
Query: 58 KKKKKKEEQSVKEIEESLKERKEKTLTED 86
+ +KK EE S ++ + LKE + D
Sbjct: 70 QLEKKLEETSDEKEIKDLKE-QLSKAEVD 97
>gnl|CDD|187861 cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Large proteins, some contain Zn-finger domain; signature
gene for I-A subtype; also known as TM1802 family.
Length = 579
Score = 31.6 bits (72), Expect = 0.69
Identities = 13/59 (22%), Positives = 24/59 (40%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
K +K K+ E+ + KK+ K K K +K E+ +K+I++
Sbjct: 87 TYLLKTTEPLNKKIKKWFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDIQDLKPGE 145
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 31.5 bits (72), Expect = 0.69
Identities = 10/35 (28%), Positives = 22/35 (62%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
+ +KKK++KK+++K+ K+ +E+ E E+
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDD 210
Score = 29.6 bits (67), Expect = 2.6
Identities = 8/37 (21%), Positives = 25/37 (67%)
Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
V + +KKK++KK+++K+ K+ ++++ ++ ++
Sbjct: 174 VPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDD 210
Score = 28.8 bits (65), Expect = 4.1
Identities = 9/36 (25%), Positives = 20/36 (55%)
Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
+KKK++KK+++K+ KE + + +E +
Sbjct: 175 PGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDD 210
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the
parasitophorous vacuole membrane. Members have an
initial hydrophobic, Phe/Tyr-rich, stretch long enough
to span the membrane, a highly charged region rich in
Lys, a second putative transmembrane region and a
second highly charged, low complexity sequence region.
Some members have up to 100 residues of additional
C-terminal sequence. These genes have been shown to be
found in the sub-telomeric regions of both Plasmodium
falciparum and P. yoelii chromosomes.
Length = 82
Score = 29.1 bits (66), Expect = 0.70
Identities = 13/25 (52%), Positives = 14/25 (56%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKK 63
K KK KK KK +KKKK KK
Sbjct: 28 VAKKKKLKKLKKIDDDLEKKKKNKK 52
Score = 28.8 bits (65), Expect = 0.91
Identities = 14/28 (50%), Positives = 17/28 (60%)
Query: 36 DVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
+V +K K KK KK +KKKK KKK
Sbjct: 26 NVVAKKKKLKKLKKIDDDLEKKKKNKKK 53
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 30.7 bits (70), Expect = 0.71
Identities = 14/64 (21%), Positives = 40/64 (62%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ + ++K+ KE +KEE + +++ EE + K+++ ++K+++ ++K++ ++E K
Sbjct: 46 KLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQELQQK 105
Query: 70 EIEE 73
+ E
Sbjct: 106 QQEL 109
Score = 28.4 bits (64), Expect = 3.7
Identities = 18/80 (22%), Positives = 47/80 (58%), Gaps = 1/80 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
K ++ KE +K + E ++KEKE EE+K +K+ ++ +K K+++ ++++Q +
Sbjct: 32 AGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQEL 91
Query: 69 KEIEESLKERKEKTLTEDSQ 88
++ +++ ++ ++ E Q
Sbjct: 92 QQKQQAAQQELQQKQQELLQ 111
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 31.5 bits (71), Expect = 0.71
Identities = 21/78 (26%), Positives = 32/78 (41%), Gaps = 2/78 (2%)
Query: 2 RKRRGGAGEKKKRIE-KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
R E R+ K + N EE E E E + + K +K+K +++
Sbjct: 63 RVSSSWNNESYNRLPIKTKDNVVADVNNGEEFLSES-ESEASLEIDSDIKDEKQKSLEEQ 121
Query: 61 KKKEEQSVKEIEESLKER 78
K E VK+ +S KER
Sbjct: 122 KIAPEIPVKQQIDSEKER 139
Score = 28.1 bits (62), Expect = 10.0
Identities = 16/61 (26%), Positives = 32/61 (52%), Gaps = 7/61 (11%)
Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKK------KKKKKKKKKKKKEEQSVKEIEESLKERKE 80
N ++EK E NK K++KK KK +K +K++ + + +++ E ++E E
Sbjct: 338 NSLRDEEKARKYWEANKPVSKREKKDIFHLSKKLRKIEKERLRIQSEMRD-AEDIEEAIE 396
Query: 81 K 81
+
Sbjct: 397 E 397
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 31.7 bits (73), Expect = 0.71
Identities = 17/67 (25%), Positives = 32/67 (47%), Gaps = 9/67 (13%)
Query: 21 EEEKEENEQEEKEKEDVEEE---------KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
E+E+EE ++ + E E K K+ KK +K ++KK + + + + ++
Sbjct: 196 EDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKL 255
Query: 72 EESLKER 78
E LKE
Sbjct: 256 REKLKEE 262
Score = 31.3 bits (72), Expect = 0.81
Identities = 15/82 (18%), Positives = 39/82 (47%), Gaps = 11/82 (13%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK-----------KKKKKK 58
++ E E+ +++++E+E+ED ++ + + +K KK +K
Sbjct: 177 NAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRK 236
Query: 59 KKKKKEEQSVKEIEESLKERKE 80
++KK E + + ++ K R++
Sbjct: 237 AQEKKVEGRLAQHKKYAKLREK 258
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 30.3 bits (69), Expect = 0.72
Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 4/37 (10%)
Query: 13 KRIEK----RNKEEEKEENEQEEKEKEDVEEEKNKKK 45
K+ EK K EKE EQ +KE+E E ++ ++K
Sbjct: 79 KKNEKEERELRKRAEKEALEQAKKEEELREAKRQQRK 115
>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777). This is
a family of eukaryotic proteins of unknown function.
Some of the proteins in this family are putative nucleic
acid binding proteins.
Length = 158
Score = 30.6 bits (69), Expect = 0.74
Identities = 19/92 (20%), Positives = 39/92 (42%), Gaps = 16/92 (17%)
Query: 2 RKRRGGAGEKKKRIEK-----RNKEEEKEENEQEEKEKEDVEEEKNKK---------KKK 47
R RR G + R E+ R++E ++ + ++ +++
Sbjct: 11 RSRRRGRSRSRDRRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRR 70
Query: 48 KKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
+K+++ K ++ KK+E Q K I+E E K
Sbjct: 71 DRKRERDKDAREPKKRERQ--KLIKEEDLEGK 100
Score = 27.5 bits (61), Expect = 7.9
Identities = 9/65 (13%), Positives = 33/65 (50%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R+ R + + +R + + + ++++ ++ ++ KK++++K K++ +
Sbjct: 40 RRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRDRKRERDKDAREPKKRERQKLIKEEDLEG 99
Query: 62 KKEEQ 66
K +E+
Sbjct: 100 KSDEE 104
Score = 27.1 bits (60), Expect = 9.5
Identities = 14/72 (19%), Positives = 34/72 (47%), Gaps = 2/72 (2%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R+RR + + R+ + + + ++ E K ++ KK++++K K++
Sbjct: 38 RRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRDRKRERD--KDAREPKKRERQKLIKEE 95
Query: 62 KKEEQSVKEIEE 73
E +S +E+E
Sbjct: 96 DLEGKSDEEVEM 107
>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated. This family of
proteins is associated with U3 snoRNA. U3 snoRNA is
required for nucleolar processing of pre-18S ribosomal
RNA.
Length = 87
Score = 29.3 bits (66), Expect = 0.75
Identities = 12/41 (29%), Positives = 24/41 (58%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+E++ E +E+++++K K K + KK +K+ K KK
Sbjct: 12 DEDRPPTPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
Score = 28.2 bits (63), Expect = 2.2
Identities = 13/43 (30%), Positives = 21/43 (48%)
Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+E E +++ KK+KKK K + KK +K + VK
Sbjct: 9 EAIDEDRPPTPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVK 51
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 31.6 bits (72), Expect = 0.77
Identities = 15/64 (23%), Positives = 26/64 (40%), Gaps = 4/64 (6%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
E+ E ++EK+++ + KKKK KK +K E+ K + +
Sbjct: 301 IILEQGETKILSKEEKQELLDLL----FKKKKLTYKKLRKLLGLSEDAIFKGLRYEGLDN 356
Query: 79 KEKT 82
EK
Sbjct: 357 AEKA 360
Score = 28.5 bits (64), Expect = 7.4
Identities = 10/36 (27%), Positives = 18/36 (50%)
Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
+ K+ K++ KK + K +E + +E LKE
Sbjct: 526 EQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKE 561
>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752). This
domain family is found in eukaryotes, and is typically
between 140 and 163 amino acids in length.
Length = 148
Score = 30.5 bits (69), Expect = 0.78
Identities = 20/71 (28%), Positives = 36/71 (50%), Gaps = 13/71 (18%)
Query: 10 EKKKR-------IEKRNKEEEKEENEQEEKEKEDVE--EEKNKKKKKK----KKKKKKKK 56
EK KR +E + +++KE +E N+KK+ K + +KK+KK
Sbjct: 41 EKAKRLQDEVLGVEAGASAPAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQHQKKQKK 100
Query: 57 KKKKKKKEEQS 67
KKK+++ ++ S
Sbjct: 101 KKKEEENDDPS 111
>gnl|CDD|217943 pfam04180, LTV, Low temperature viability protein. The
low-temperature viability protein LTV1 is involved in
ribosome biogenesis 40S subunit production.
Length = 426
Score = 31.5 bits (71), Expect = 0.78
Identities = 21/87 (24%), Positives = 34/87 (39%), Gaps = 4/87 (4%)
Query: 10 EKKKRIEKRNKEEEKEE---NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
E+ K+ EK+N + E +E E + D + K +K K K K
Sbjct: 255 ERFKKFEKKNDDVEIGAADNSELEGSIEVDSNRLQEVLNDYYKTEKAK-SKVKLGAMTPF 313
Query: 67 SVKEIEESLKERKEKTLTEDSQCDTMI 93
S + + S +R EK T D + +
Sbjct: 314 SDQSMSSSALDRTEKLTTLDDRFEKAE 340
>gnl|CDD|163426 TIGR03714, secA2, accessory Sec system translocase SecA2. Members
of this protein family are homologous to SecA and part
of the accessory Sec system. This system, including both
five core proteins for export and a variable number of
proteins for glycosylation, operates in certain
Gram-positive pathogens for the maturation and delivery
of serine-rich glycoproteins such as the cell surface
glycoprotein GspB in Streptococcus gordonii [Protein
fate, Protein and peptide secretion and trafficking].
Length = 762
Score = 31.5 bits (72), Expect = 0.79
Identities = 15/66 (22%), Positives = 30/66 (45%), Gaps = 6/66 (9%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK------EIEESLKERK 79
++ +K V++ K K K++ +K +K ++ E++ E EESL ++
Sbjct: 547 SWLKKYYKKYSVKDSKLKPSALFKRRFRKIVEKAQRASEDKGESAREQTNEFEESLSIQR 606
Query: 80 EKTLTE 85
E E
Sbjct: 607 ENIYAE 612
>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
modification].
Length = 429
Score = 31.5 bits (71), Expect = 0.80
Identities = 17/75 (22%), Positives = 30/75 (40%), Gaps = 1/75 (1%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+K+ +NK N+ E D + +KKK K K K K E+ K+
Sbjct: 7 RKRSGNNQNKNASVVNNKAEIAAMIDARRLE-QKKKGGVTNSKGKTNKVVDAKLEKEFKD 65
Query: 71 IEESLKERKEKTLTE 85
+ + + ++ T E
Sbjct: 66 VLQRFQVQENDTPKE 80
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 30.7 bits (70), Expect = 0.80
Identities = 27/89 (30%), Positives = 45/89 (50%), Gaps = 13/89 (14%)
Query: 10 EKKKRIEKRNKEEEKE--ENEQE--------EKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
E+ ++K + EK E QE +K +E+VEE + K K +K K+ K K
Sbjct: 34 EEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKA 93
Query: 60 KKKKEEQSVKEIE---ESLKERKEKTLTE 85
+ K+ E+ +K ++ E L++R EK E
Sbjct: 94 RLKELEKELKNLKWESEVLEQRFEKVERE 122
Score = 28.0 bits (63), Expect = 7.0
Identities = 15/62 (24%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 18 RNKEEEKEENEQE-EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
+ + E ++NE+ EK ++ +E + + KK +++ ++ +KK K+ + K+ ++LK
Sbjct: 33 KEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLK 92
Query: 77 ER 78
R
Sbjct: 93 AR 94
>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal. This domain
corresponds to the alpha helical C terminal domain of
phospholipase C beta.
Length = 181
Score = 30.5 bits (69), Expect = 0.80
Identities = 27/96 (28%), Positives = 50/96 (52%), Gaps = 18/96 (18%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK----KKKKKKKKKKKEE-- 65
K++I+K + +++ Q +K KE E+EK + KKK +K+ ++ K K+K +E
Sbjct: 36 KEQIQKLTELAREKQAAQLKKLKEISEKEKKELKKKLDRKRLERIQEAKTKEKHAQEREK 95
Query: 66 ------------QSVKEIEESLKERKEKTLTEDSQC 89
QS+K +EE+ K R+EK + ++
Sbjct: 96 TEINRSHIQEVVQSIKRLEEAQKRRQEKLEEKQAEI 131
>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
[Translation, ribosomal structure and biogenesis].
Length = 335
Score = 31.0 bits (71), Expect = 0.84
Identities = 14/72 (19%), Positives = 29/72 (40%), Gaps = 1/72 (1%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK- 69
KK I + + +E K K ++ KK K +++K+ +K
Sbjct: 10 AKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERKEVGALINELKK 69
Query: 70 EIEESLKERKEK 81
E+E+++ E +
Sbjct: 70 EVEDAITELTPE 81
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 31.4 bits (72), Expect = 0.86
Identities = 14/55 (25%), Positives = 26/55 (47%), Gaps = 1/55 (1%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKK-KKKKKKKKKEEQSVKEIEESLKERKEKTL 83
E +E+E K +K+ + +KK + K E+ V++ +E L E + K
Sbjct: 812 LAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLA 866
Score = 31.1 bits (71), Expect = 1.2
Identities = 17/63 (26%), Positives = 33/63 (52%), Gaps = 8/63 (12%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKK--------KKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
E + EKE E +E+E ++ +KK K ++ +K+K+K + + + ++EE L
Sbjct: 815 ELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAKLEERLAV 874
Query: 78 RKE 80
K
Sbjct: 875 LKA 877
Score = 28.0 bits (63), Expect = 9.2
Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 4/65 (6%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEK----EDVEEEKNKKKKKKKKKKKKKKKKKKK 61
G + + + KE EK E E + EK E + ++ +K+K+K + + K
Sbjct: 807 AGLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLA 866
Query: 62 KKEEQ 66
K EE+
Sbjct: 867 KLEER 871
>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
Length = 390
Score = 31.1 bits (71), Expect = 0.89
Identities = 17/38 (44%), Positives = 26/38 (68%)
Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
K+ + KK+KKKKKKK++K E +K++ E L+E K
Sbjct: 160 EKRLDELTKKRKKKKKKKEEKVELDVLKKVLEWLEEGK 197
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 30.0 bits (68), Expect = 0.92
Identities = 9/54 (16%), Positives = 15/54 (27%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
K + E E + + KKKK + + E + E
Sbjct: 99 KPSKLELFNAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAE 152
Score = 28.9 bits (65), Expect = 2.8
Identities = 6/48 (12%), Positives = 11/48 (22%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
E + KKKK + + + +E
Sbjct: 107 NAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEEE 154
>gnl|CDD|234537 TIGR04313, aro_clust_Mycop, aromatic cluster surface protein.
Members of this family are absolutely restricted to the
Mollicutes (Mycoplasma and Ureaplasma). All have a
signal peptide, usually of the lipoprotein type,
suggesting surface expression. Most members have lengths
of about 280 residues but some members have a nearly
full-length duplication. The mostly nearly invariant
residue, a Trp,is part of a strongly conserved 9-residue
motif, [ND]-W-[LY]-[WF]-X-[LF]-X-N-[LI], where X usually
is hydrophobic. Because the hydrophobic six-residue core
of this motif almost always contains three to four
aromatic residues, we name this family aromatic cluster
surface protein. Multiple paralogs may occur in a given
Mycoplasma, usually clustered on the genome.
Length = 297
Score = 30.8 bits (70), Expect = 0.94
Identities = 9/76 (11%), Positives = 29/76 (38%), Gaps = 6/76 (7%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEE------EKNKKKKKKKKKKKKKKKKKKKKK 63
+K + + K+++K++ + + + + ++ +K K K +K + K
Sbjct: 33 KKLIKNKFDVKKDKKDKKWKVFLNNKYINNLLNLVFKNDEDEKNKYIKSQKNINSELYLK 92
Query: 64 EEQSVKEIEESLKERK 79
+ E ++
Sbjct: 93 LLKLSLEYLNNINNSS 108
>gnl|CDD|223966 COG1035, FrhB, Coenzyme F420-reducing hydrogenase, beta subunit
[Energy production and conversion].
Length = 332
Score = 30.8 bits (70), Expect = 0.94
Identities = 12/36 (33%), Positives = 19/36 (52%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
E KE E+V +K KKK K+ K+ +++ E
Sbjct: 289 EVKEIEEVALGLRLVEKLAKKKLKRAAKEPEERVEF 324
>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal. This
domain family is found in eukaryotes, and is typically
between 99 and 114 amino acids in length. The family is
found in association with pfam08697, pfam01585. There
are two completely conserved residues (G and F) that may
be functionally important. TIP is involved in enamel
assembly by interacting with one of the major proteins
responsible for biomineralisation of enamel - tuftelin.
Length = 106
Score = 29.3 bits (66), Expect = 0.96
Identities = 12/53 (22%), Positives = 26/53 (49%), Gaps = 11/53 (20%)
Query: 2 RKRRGGAGEKKKR-----------IEKRNKEEEKEENEQEEKEKEDVEEEKNK 43
R R +KKK I++ KE E+++ E E ++ ++ +E+ ++
Sbjct: 52 RGRGRRRSKKKKDYSKPVAFVSGGIKQAAKEPEEDDKEDESEDDDESDEDDDE 104
>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20. This
family consists of several phage minor structural
protein GP20 sequences of around 180 residues in
length. The function of this family is unknown.
Length = 156
Score = 30.0 bits (68), Expect = 0.98
Identities = 19/65 (29%), Positives = 36/65 (55%), Gaps = 3/65 (4%)
Query: 20 KEEEKEENEQEEKEKE---DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
K++ K ++Q E K+ D EE + K +K K++ K K++ + K + + IE +LK
Sbjct: 33 KKQLKNRDKQIEDLKKKVKDNEELQKKIEKLKQQNKTAKEEYEAKLADTKLNNAIELALK 92
Query: 77 ERKEK 81
+ K +
Sbjct: 93 KAKAR 97
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 31.2 bits (71), Expect = 0.98
Identities = 18/71 (25%), Positives = 43/71 (60%), Gaps = 3/71 (4%)
Query: 12 KKRIEKRNKEEEKEENEQEE-KEKEDVEEEKNKKKKKKKKK--KKKKKKKKKKKKEEQSV 68
K+ +E+ +E EK E+E E + + + K+++ + + ++ + +K+ ++KKK+ E+
Sbjct: 442 KRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELE 501
Query: 69 KEIEESLKERK 79
+++ E K RK
Sbjct: 502 RKLAELRKMRK 512
Score = 30.8 bits (70), Expect = 1.2
Identities = 22/84 (26%), Positives = 43/84 (51%), Gaps = 5/84 (5%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK----KKKKKKK 57
+ R +KRI+K + E+ E E E ++E +EE K + +K + + +++ + K
Sbjct: 411 EEERREITVYEKRIKKLEETVERLEEENSELKRE-LEELKREIEKLESELERFRREVRDK 469
Query: 58 KKKKKKEEQSVKEIEESLKERKEK 81
+K ++ + IE KE +EK
Sbjct: 470 VRKDREIRARDRRIERLEKELEEK 493
Score = 30.8 bits (70), Expect = 1.2
Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 6/82 (7%)
Query: 9 GEKKKRIEKRNK------EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
E +R+E+ N EE K E E+ E E E E K +K ++ + + ++ ++ +
Sbjct: 428 EETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLE 487
Query: 63 KEEQSVKEIEESLKERKEKTLT 84
KE + K+ E L+ + +
Sbjct: 488 KELEEKKKRVEELERKLAELRK 509
Score = 30.4 bits (69), Expect = 1.6
Identities = 13/80 (16%), Positives = 38/80 (47%), Gaps = 12/80 (15%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK------------KKKKKKKKKK 60
K E+ E++ E+E +E E+ K ++ ++ ++ K++ +K
Sbjct: 396 KVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKL 455
Query: 61 KKKEEQSVKEIEESLKERKE 80
+ + E+ +E+ + +++ +E
Sbjct: 456 ESELERFRREVRDKVRKDRE 475
Score = 28.9 bits (65), Expect = 4.7
Identities = 19/61 (31%), Positives = 31/61 (50%), Gaps = 1/61 (1%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
+ KEE EKE EEE+ + +K+ KK ++ ++ ++E +K E LK E
Sbjct: 395 SKVKEEERPREKE-GTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIE 453
Query: 81 K 81
K
Sbjct: 454 K 454
>gnl|CDD|223731 COG0658, ComEC, Predicted membrane metal-binding protein [General
function prediction only].
Length = 453
Score = 30.9 bits (70), Expect = 0.99
Identities = 28/128 (21%), Positives = 43/128 (33%), Gaps = 14/128 (10%)
Query: 175 TMMSNLAM-CLLGWWRCFANNYLQWAAVGTWILLNTQGWWRCFANNYLQWAAVGTWILLN 233
+S LA+ L+ W + F L W T LL T L +LLN
Sbjct: 338 FWLSFLAVAALILWSQLFRLLGLSWPLPITLALLATLPLL-ILQFGRLGL----LGLLLN 392
Query: 234 TQGFISQGIGLNNKEYLIWYILFIVFVTYAMLPLPLKWCFIGGCTTSVLHVIITAQIKLH 293
+S I L + L+ L V++ ++PL L +I A ++
Sbjct: 393 --LIVSPLISLISLPGLLANFLLSPLVSFLIVPLILLG------LLLAPLLIGAALARVR 444
Query: 294 RGPLSPCS 301
G
Sbjct: 445 PGVFGLIQ 452
>gnl|CDD|220303 pfam09592, DUF2031, Protein of unknown function (DUF2031). This
protein is expressed in Plasmodium; its function is
unknown. It may be the product of gene family pyst-b.
Length = 228
Score = 30.4 bits (69), Expect = 1.0
Identities = 20/60 (33%), Positives = 26/60 (43%), Gaps = 11/60 (18%)
Query: 13 KRIEKRN-----------KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
KRI K++ K+ E EEN E + E KK K +K KK +KK K
Sbjct: 144 KRIIKKDENSSVSEHEDFKQLENEENFLETEYNEITSSNNYKKLKINRKLKKAEKKLIKS 203
>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 158 and 187 amino
acids in length. This family includes the PcfB protein.
Length = 137
Score = 29.9 bits (68), Expect = 1.0
Identities = 14/37 (37%), Positives = 25/37 (67%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
++E+ EK V ++ K K+++ KKK++K K K KK+
Sbjct: 101 KKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137
Score = 28.8 bits (65), Expect = 2.4
Identities = 15/45 (33%), Positives = 26/45 (57%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
K + ++E++E K+ + +EE+ KKK++K K K KKK
Sbjct: 93 KEFTAKKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 31.1 bits (71), Expect = 1.0
Identities = 16/85 (18%), Positives = 32/85 (37%), Gaps = 2/85 (2%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+++ E+ +E E+++KE +E + K K KK K + K+ +K E + E
Sbjct: 240 EQKAERLRQEAA--AYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEE 297
Query: 72 EESLKERKEKTLTEDSQCDTMIPVF 96
+ + V
Sbjct: 298 RPVEEGKPLAFRFPPPGKRLGKLVL 322
Score = 30.7 bits (70), Expect = 1.3
Identities = 13/62 (20%), Positives = 27/62 (43%), Gaps = 8/62 (12%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKK--------KKKKKKKKKKKKEEQSVKEIEESLKERK 79
+++ E+ E +K++K+ K K K KK K + ++++E L E +
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEER 298
Query: 80 EK 81
Sbjct: 299 PV 300
>gnl|CDD|224427 COG1510, COG1510, Predicted transcriptional regulators
[Transcription].
Length = 177
Score = 30.4 bits (69), Expect = 1.0
Identities = 15/73 (20%), Positives = 29/73 (39%), Gaps = 3/73 (4%)
Query: 3 KRRGGAGEKKKRIEKRNKE---EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
R EK KR KE + EE ++ +++ E K K K ++ K + +
Sbjct: 93 IFRTLFEEKWKREIDPTKEALKKLLEELNEDLDDRDLTERIKEIKSKLERLLKWSEDYYE 152
Query: 60 KKKKEEQSVKEIE 72
+ + ++ E
Sbjct: 153 LLTRLLEFLESEE 165
>gnl|CDD|223619 COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1
[Posttranslational modification, protein turnover,
chaperones].
Length = 205
Score = 30.5 bits (69), Expect = 1.0
Identities = 9/51 (17%), Positives = 18/51 (35%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
K E + EQ ++ + K+ + K +K K+K +
Sbjct: 50 KEAALEALQRAEQRLQQAAKAKRAKDAAANAAEGKAFLEKNAKEKGVKTLP 100
Score = 29.3 bits (66), Expect = 2.4
Identities = 9/54 (16%), Positives = 21/54 (38%), Gaps = 1/54 (1%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
A +K+ + + E+ + + ++ N + K +K K+K K
Sbjct: 46 ALVQKEAALEALQRAEQRLQQAAKAKRAKDAAA-NAAEGKAFLEKNAKEKGVKT 98
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 31.0 bits (71), Expect = 1.1
Identities = 16/76 (21%), Positives = 42/76 (55%), Gaps = 6/76 (7%)
Query: 12 KKRIEKRNKEEEKEEN--EQEEKEKE---DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
+++++KR + + E +E+ ++ ++EE + ++ + K+ + K+ + +E
Sbjct: 15 REKLKKRGGDALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELI 74
Query: 67 S-VKEIEESLKERKEK 81
+ VKE++E LKE +
Sbjct: 75 AEVKELKEKLKELEAA 90
Score = 28.3 bits (64), Expect = 7.3
Identities = 12/67 (17%), Positives = 34/67 (50%), Gaps = 3/67 (4%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
+++ ++ +E E + + E +E K+ + ++ + K+ K++ +KE+E +
Sbjct: 34 LDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEK---LKELEAA 90
Query: 75 LKERKEK 81
L E + +
Sbjct: 91 LDELEAE 97
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 31.2 bits (71), Expect = 1.1
Identities = 14/53 (26%), Positives = 26/53 (49%)
Query: 29 QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+EE + +EE K K+ ++ ++K ++ EQ + EE KE +K
Sbjct: 444 EEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKK 496
Score = 29.7 bits (67), Expect = 3.0
Identities = 16/75 (21%), Positives = 35/75 (46%), Gaps = 3/75 (4%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK---EEQSV 68
K EE +E+ ++ ++ + +EE K+ KK KK + ++ E++
Sbjct: 460 VKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVDTYLLLEELGINEETY 519
Query: 69 KEIEESLKERKEKTL 83
+++E L ++ TL
Sbjct: 520 EKLEALLAKKNNFTL 534
Score = 29.3 bits (66), Expect = 3.6
Identities = 10/59 (16%), Positives = 27/59 (45%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
E K E+ + ++ + EE +K ++ ++ + +++ KK+ ++ K E
Sbjct: 446 EAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVD 504
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 30.2 bits (69), Expect = 1.1
Identities = 14/32 (43%), Positives = 19/32 (59%)
Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
E E EK KKK K++KK K++K K +E
Sbjct: 92 YEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 28.7 bits (65), Expect = 4.4
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKE 33
EKKK + K K+ KEE ++ E+
Sbjct: 100 EKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 27.9 bits (63), Expect = 6.6
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
E K+KKK K++KK K++K K E+
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEE 122
Score = 27.9 bits (63), Expect = 7.5
Identities = 16/56 (28%), Positives = 29/56 (51%), Gaps = 2/56 (3%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
R +++ ++ EK D + +K+KKK K++KK KEE+ ++EE
Sbjct: 69 RKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEK--DKLEE 122
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This
family includes outer membrane proteins such as OmpH
among others. Skp (OmpH) has been characterized as a
molecular chaperone that interacts with unfolded
proteins as they emerge in the periplasm from the Sec
translocation machinery.
Length = 140
Score = 29.9 bits (68), Expect = 1.1
Identities = 18/67 (26%), Positives = 41/67 (61%), Gaps = 3/67 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDV---EEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
E KKR + K E++ + +E+ +K+ E + KK+K+ +KK ++ ++K++K +++
Sbjct: 26 EFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQDL 85
Query: 67 SVKEIEE 73
++ EE
Sbjct: 86 QKRQQEE 92
Score = 28.7 bits (65), Expect = 2.8
Identities = 18/72 (25%), Positives = 46/72 (63%), Gaps = 2/72 (2%)
Query: 12 KKRIEKRNKEEEKE-ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS-VK 69
+K++EK K+ + E E ++E +K + +K+ + ++KK+K+ +KK +E Q +
Sbjct: 20 QKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQ 79
Query: 70 EIEESLKERKEK 81
++++ L++R+++
Sbjct: 80 KLQQDLQKRQQE 91
Score = 27.5 bits (62), Expect = 6.3
Identities = 17/57 (29%), Positives = 29/57 (50%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
K +Q EKE + + E K +K+ +K K+K +K E K+ +E K+ +E
Sbjct: 17 KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQE 73
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This
is a family of uncharacterized proteins.
Length = 449
Score = 30.7 bits (69), Expect = 1.1
Identities = 8/40 (20%), Positives = 22/40 (55%)
Query: 51 KKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
KK+ +K K+ EE++++E ++ ++ + ++ D
Sbjct: 33 KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDD 72
Score = 30.7 bits (69), Expect = 1.3
Identities = 11/32 (34%), Positives = 21/32 (65%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKN 42
K+ I K KE E+E E+E+ ++ED +++ +
Sbjct: 34 KENAIRKLGKEAEEEAMEEEDDDEEDDDDDDD 65
Score = 30.3 bits (68), Expect = 1.7
Identities = 8/47 (17%), Positives = 25/47 (53%)
Query: 44 KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCD 90
KK+ +K K+ +++ ++E+ ++ ++ E ++ +D + D
Sbjct: 33 KKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDD 79
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 30.7 bits (70), Expect = 1.1
Identities = 21/43 (48%), Positives = 30/43 (69%), Gaps = 1/43 (2%)
Query: 20 KEEEKEENEQEE-KEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+E+EK +E E E+ D EE K+KKKKKKKKKKK++ ++
Sbjct: 63 QEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREW 105
Score = 29.9 bits (68), Expect = 1.8
Identities = 16/43 (37%), Positives = 28/43 (65%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+ KE+ +E+ + + + EE +KKKKKKKKKKK++ ++
Sbjct: 63 QEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREW 105
Score = 29.5 bits (67), Expect = 2.3
Identities = 16/42 (38%), Positives = 25/42 (59%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
EK +E E + E++ +EK KKKKKKKKK++ ++
Sbjct: 64 EKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREW 105
Score = 29.5 bits (67), Expect = 2.6
Identities = 14/38 (36%), Positives = 24/38 (63%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
N++ + E + + K+KKKKKKKKKKK++ ++
Sbjct: 68 LNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREW 105
Score = 29.1 bits (66), Expect = 3.1
Identities = 15/41 (36%), Positives = 26/41 (63%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
EK ++ E + + ++ K+KKKKKKKKKK+E+ + +
Sbjct: 66 EKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWM 106
Score = 28.7 bits (65), Expect = 4.3
Identities = 19/38 (50%), Positives = 25/38 (65%), Gaps = 3/38 (7%)
Query: 31 EKEKEDVEEEK---NKKKKKKKKKKKKKKKKKKKKKEE 65
EKEK + E + + ++ K+KKKKKKKKKKKEE
Sbjct: 64 EKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEE 101
>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
CCCH-type Zn-finger protein [General function
prediction only].
Length = 299
Score = 30.8 bits (69), Expect = 1.1
Identities = 16/73 (21%), Positives = 41/73 (56%), Gaps = 4/73 (5%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEE----KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
M K++ +G+K R ++ E++ K +N + + + E K++ +KK+K +
Sbjct: 6 MAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKKEKMRM 65
Query: 57 KKKKKKKEEQSVK 69
++K+++ E+Q ++
Sbjct: 66 EEKRREPEKQVIR 78
Score = 29.3 bits (65), Expect = 3.0
Identities = 24/103 (23%), Positives = 40/103 (38%), Gaps = 5/103 (4%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDV-----EEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+K R+E++ +E EK+ K V + K K K++ +K +K
Sbjct: 61 EKMRMEEKRREPEKQVIRAGVDPKTVVCALFLNKTCAKGDACKFAHGKEEARKTEKPDLY 120
Query: 66 QSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCP 108
V++ EE + K + D C I + G G CP
Sbjct: 121 SDVRDKEEDVPLGKRPWINTDRVCKFFIEAMESGKYGWGWTCP 163
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 30.9 bits (71), Expect = 1.1
Identities = 11/34 (32%), Positives = 20/34 (58%)
Query: 36 DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
DVEEE K++ + K+ ++K+KK E+ +
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPEIT 544
Score = 28.2 bits (64), Expect = 7.6
Identities = 7/30 (23%), Positives = 17/30 (56%)
Query: 33 EKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+E++ K + + K+ ++K+KK +K
Sbjct: 512 VEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541
Score = 28.2 bits (64), Expect = 9.1
Identities = 8/31 (25%), Positives = 17/31 (54%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKK 51
+ E+E +E+ + +E +K+KK +K
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 30.5 bits (69), Expect = 1.1
Identities = 17/76 (22%), Positives = 43/76 (56%), Gaps = 1/76 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE-EQSV 68
EKK + +EE E+E+E +E E ++ + K +++++KK+ + ++ +Q +
Sbjct: 53 EKKADELEEENRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQEL 112
Query: 69 KEIEESLKERKEKTLT 84
+E +E+ + +++ L
Sbjct: 113 REAQEAHERARQELLE 128
Score = 29.7 bits (67), Expect = 1.9
Identities = 13/75 (17%), Positives = 40/75 (53%), Gaps = 2/75 (2%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
++++ E + E+ EE + ++++EE + + ++K K+++++ + +K+ ++E
Sbjct: 4 EREQQELEERMEQMEE--DMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEE 61
Query: 71 IEESLKERKEKTLTE 85
L+E + E
Sbjct: 62 ENRRLEEEAAASEEE 76
>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain. This family
consists of several ATP synthase E chain sequences
which are components of the CF(0) subunit.
Length = 83
Score = 28.6 bits (64), Expect = 1.1
Identities = 13/33 (39%), Positives = 19/33 (57%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK 52
KEE+ E E +EK E + KKK+ K++ K
Sbjct: 35 KEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAK 67
Score = 28.6 bits (64), Expect = 1.3
Identities = 13/36 (36%), Positives = 23/36 (63%), Gaps = 1/36 (2%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
K++E++ E E +EK +E+ K KKK+ K++ K
Sbjct: 33 KKKEEKIREYEAQEKL-IEKAKAAYAKKKQAKQEAK 67
Score = 26.7 bits (59), Expect = 5.6
Identities = 13/50 (26%), Positives = 27/50 (54%), Gaps = 4/50 (8%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ----SVKEIEESLK 76
+K++E + E + ++K +K K KKK+ K++ + S +E+ K
Sbjct: 33 KKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKSLAEGSSAPFDETPK 82
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 31.0 bits (71), Expect = 1.1
Identities = 17/73 (23%), Positives = 41/73 (56%), Gaps = 4/73 (5%)
Query: 13 KRIEKRNKEE----EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
+R EK+ KEE + N++ ++ E+ + E+ +K ++ K K K++++EE+
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549
Query: 69 KEIEESLKERKEK 81
++ +E+ ++ E
Sbjct: 550 EKDKETEEDEPEG 562
>gnl|CDD|220774 pfam10477, EIF4E-T, Nucleocytoplasmic shuttling protein for mRNA
cap-binding EIF4E. EIF4E-T is the transporter protein
for shuttling the mRNA cap-binding protein EIF4E
protein, targeting it for nuclear import. EIF4E-T
contains several key binding domains including two
functional leucine-rich NESs (nuclear export signals)
between residues 438-447 and 613-638 in the human
protein. The other two binding domains are an
EIF4E-binding site, between residues 27-42 in Q9EST3,
and a bipartite NLS (nuclear localisation signals)
between 194-211, and these lie in family EIF4E-T_N.
EIF4E is the eukaryotic translation initiation factor 4E
that is the rate-limiting factor for cap-dependent
translation initiation.
Length = 520
Score = 31.1 bits (69), Expect = 1.1
Identities = 17/62 (27%), Positives = 28/62 (45%)
Query: 33 EKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTM 92
+K E++K K+K +K+ KK+ KE SL+ E L E++ D
Sbjct: 224 DKILEEDDKTSNGDGKQKGRKRTKKRTASVKEGHVECNGGVSLEPHDEVGLEEENAADQE 283
Query: 93 IP 94
+P
Sbjct: 284 VP 285
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 29.7 bits (67), Expect = 1.2
Identities = 15/72 (20%), Positives = 40/72 (55%), Gaps = 1/72 (1%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
+ +E ++ +E+ D +E + K K++KK+ K ++K+ ++ + + + E+ +
Sbjct: 1 KKARKEAQKALEEDPSIYDYDEVYDSIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAER 60
Query: 78 RK-EKTLTEDSQ 88
RK E+ + E+ +
Sbjct: 61 RKREREIAEERK 72
Score = 29.3 bits (66), Expect = 1.3
Identities = 13/31 (41%), Positives = 21/31 (67%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
+ + EE + E+EEKE+E++EEE + K K
Sbjct: 96 KKQLEENRKLEEEEKEREELEEENDVTKGKD 126
Score = 28.9 bits (65), Expect = 1.8
Identities = 22/89 (24%), Positives = 40/89 (44%), Gaps = 19/89 (21%)
Query: 11 KKKRIEKRNKEEEKEEN--------------EQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
K + EK+ + + + E+ ++E+E EE K +K+++K+ + K
Sbjct: 28 KAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREKEGDEFADK 87
Query: 57 KK-----KKKKKEEQSVKEIEESLKERKE 80
+K KK+ EE E EE +E E
Sbjct: 88 EKFVTSAYKKQLEENRKLEEEEKEREELE 116
Score = 27.8 bits (62), Expect = 4.4
Identities = 22/74 (29%), Positives = 40/74 (54%), Gaps = 11/74 (14%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKK------KKKKKKKKKKKKKKKKKEEQSVK 69
E+R +E E E + +KE+E +E K+K KK+ ++ +K ++++K++E
Sbjct: 59 ERRKREREIAEERKLQKEREKEGDEFADKEKFVTSAYKKQLEENRKLEEEEKERE----- 113
Query: 70 EIEESLKERKEKTL 83
E+EE K K L
Sbjct: 114 ELEEENDVTKGKDL 127
>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118). This is
the highly conserved family of the major tail subunit
protein.
Length = 121
Score = 29.3 bits (65), Expect = 1.2
Identities = 16/68 (23%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
R+E + +E +K E E+E+++KE EE ++++ K+ +++QS+ + +
Sbjct: 45 RVEMK-EERDKMETEREKRDKESKEERDKFISTMNEQQRLMDKQNDMMGQQQQSIDSLSK 103
Query: 74 SLKERKEK 81
S+ + K
Sbjct: 104 SVGKLAHK 111
>gnl|CDD|221007 pfam11158, DUF2938, Protein of unknown function (DUF2938). This
bacterial family of proteins has no known function. Some
members are thought to be membrane proteins however this
cannot be confirmed.
Length = 150
Score = 29.9 bits (68), Expect = 1.2
Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 5/61 (8%)
Query: 214 RCFANNYLQWAAVGTWILLNTQG-FISQGIG----LNNKEYLIWYILFIVFVTYAMLPLP 268
R F L +A VG W+ +G F I + + L W + + V +A L L
Sbjct: 20 RLFGVPSLNYAMVGRWLGHLPRGRFFHDSIAAAPPVPGERALGWLAHYAIGVLFAALLLL 79
Query: 269 L 269
+
Sbjct: 80 I 80
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 31.1 bits (70), Expect = 1.2
Identities = 16/33 (48%), Positives = 20/33 (60%)
Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
K VE K+KK KKKK KKK+ K + E+Q
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQIPPEQQ 147
Score = 29.2 bits (65), Expect = 4.3
Identities = 12/28 (42%), Positives = 17/28 (60%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
KN+ + K KK KKKK KKK+ + +
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQI 142
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 29.8 bits (67), Expect = 1.2
Identities = 13/63 (20%), Positives = 32/63 (50%), Gaps = 5/63 (7%)
Query: 20 KEEEKEENEQEEKEKEDVEE-----EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
EE+ +E+++ED + KK +K+++K +K + +++ E + +E
Sbjct: 73 TAGHGEEDTDDEEDEEDFATPSAVPQWGKKARKRQRKVIRKLLEAEEQLREDQYDDEDED 132
Query: 75 LKE 77
++E
Sbjct: 133 IEE 135
>gnl|CDD|217935 pfam04158, Sof1, Sof1-like domain. Sof1 is essential for cell
growth and is a component of the nucleolar rRNA
processing machinery.
Length = 88
Score = 28.8 bits (65), Expect = 1.2
Identities = 12/41 (29%), Positives = 23/41 (56%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
KK +K K++ K+ KK+K++ + + S ERK+ +
Sbjct: 48 KKAQKIKREMKEAKKRKEENRRKHSKPGSVPPKPERKKHVV 88
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 30.7 bits (70), Expect = 1.2
Identities = 19/80 (23%), Positives = 37/80 (46%), Gaps = 6/80 (7%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEK-----EDVEEEKNKKKKKKKKKKKK-KKKKKK 60
E +E+ K+ E E EE+ +++EE + +K+++ ++ K+ KK
Sbjct: 185 VLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKK 244
Query: 61 KKKEEQSVKEIEESLKERKE 80
+E ++ E E ER E
Sbjct: 245 YLEELLALYEYLEIELERAE 264
Score = 30.7 bits (70), Expect = 1.3
Identities = 9/77 (11%), Positives = 33/77 (42%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
++ +K + + +E + E+E E +E+E + +++ + + + K+ +++ + +
Sbjct: 69 LNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEP 128
Query: 68 VKEIEESLKERKEKTLT 84
+ L
Sbjct: 129 WGNFDLDLSLLLGFKYV 145
>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion
protein; Provisional.
Length = 230
Score = 30.3 bits (68), Expect = 1.2
Identities = 14/87 (16%), Positives = 35/87 (40%), Gaps = 4/87 (4%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+ + K ++ NE +++ + EEK + +K + ++K + E Q K
Sbjct: 119 QFQSTTITKPKKVLINEVLQEKTINQNEEKTSLQNEKVLSPELSEEKSDSELETQPQK-- 176
Query: 72 EESLKERKEKTLTEDSQCDTMIPVFKK 98
+ + K+ ++ ++ KK
Sbjct: 177 --TQLKEKKPSIEHIDLSKMLLKELKK 201
>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
insulinase-like [General function prediction only].
Length = 978
Score = 30.8 bits (70), Expect = 1.2
Identities = 11/56 (19%), Positives = 29/56 (51%), Gaps = 4/56 (7%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
++EE+ K++++ +K+ + + +K + K LKER+++ +E+
Sbjct: 456 LPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSK----KLKERQDQPDSEED 507
Score = 30.0 bits (68), Expect = 2.6
Identities = 18/96 (18%), Positives = 33/96 (34%), Gaps = 21/96 (21%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKK--KKKKKKKEEQSVKEIEESLK----------ER 78
E++ E E E +K+ + + +K K KK KE Q + EE L
Sbjct: 462 EEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEEDLATLPTLKLGDVPD 521
Query: 79 KEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNSF 114
+ + +++ V ++ TN
Sbjct: 522 PIEKTSLETEVSNEAKV---------LHHDLFTNGI 548
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 30.8 bits (70), Expect = 1.2
Identities = 12/53 (22%), Positives = 26/53 (49%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
KK+ ++E + E +K +EE+ K + K ++ K+ + K+K+
Sbjct: 365 LAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGKQKR 417
Score = 30.4 bits (69), Expect = 1.3
Identities = 16/75 (21%), Positives = 29/75 (38%), Gaps = 12/75 (16%)
Query: 1 MRKRRGGAGEKKKRIEKRN------------KEEEKEENEQEEKEKEDVEEEKNKKKKKK 48
RK+ A E K KR K +E + K+ +E+ K +
Sbjct: 343 ARKKVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFI 402
Query: 49 KKKKKKKKKKKKKKK 63
K ++ K+ + K+K+
Sbjct: 403 KLEELKRLENGKQKR 417
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 30.8 bits (70), Expect = 1.3
Identities = 17/66 (25%), Positives = 30/66 (45%), Gaps = 5/66 (7%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R+R+ +K K NKE K + KED E + K+ KK+ +K+ + ++
Sbjct: 43 RQRQFELDSLRKEFNKLNKEVAKLK-----IAKEDATELIAETKELKKEITEKEAEVQEA 97
Query: 62 KKEEQS 67
K +
Sbjct: 98 KAALDA 103
>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
Length = 985
Score = 30.9 bits (70), Expect = 1.3
Identities = 17/79 (21%), Positives = 33/79 (41%), Gaps = 1/79 (1%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
K K + ++ +++ E ++E ++ + K K K+ +E
Sbjct: 405 PAKLKGLFEKLYKQKYAEVLKKELDRSFYRDYLQSTDDDILKVHGGYFSKDNKEGDEAIE 464
Query: 69 KEIEESLKERKEKTLTEDS 87
EI+ LK+ KEK L+ D
Sbjct: 465 DEIDLILKD-KEKLLSFDE 482
>gnl|CDD|235551 PRK05667, dnaG, DNA primase; Validated.
Length = 580
Score = 30.6 bits (70), Expect = 1.3
Identities = 8/48 (16%), Positives = 21/48 (43%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
+ +++ + EE EE +E+ E ++ + + + K+K
Sbjct: 533 DALEKLRDQLLEERLEELIAKERLLEGHGLSSEERLELLQLLIELKRK 580
>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein. Bacillus subtilis stage V
sporulation protein R is involved in spore cortex
formation. Little is known about cortex biosynthesis,
except that it depends on several sigma E controlled
genes, including spoVR.
Length = 427
Score = 30.6 bits (70), Expect = 1.3
Identities = 7/47 (14%), Positives = 23/47 (48%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
K + K ++E E + + D+ +++K ++++ ++K+
Sbjct: 172 PIKLSLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKR 218
Score = 29.5 bits (67), Expect = 2.9
Identities = 12/55 (21%), Positives = 31/55 (56%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
K+ I+ +EE+ + E+EE + V + +++K ++++ ++K+ +E Q
Sbjct: 170 KRPIKLSLEEEKARQEEREEYLQSQVNDLWRTIPPQREKLEEERARRKRFPEEPQ 224
>gnl|CDD|217011 pfam02388, FemAB, FemAB family. The femAB operon codes for two
nearly identical approximately 50-kDa proteins involved
in the formation of the Staphylococcal pentaglycine
interpeptide bridge in peptidoglycan. These proteins are
also considered as a factor influencing the level of
methicillin resistance.
Length = 408
Score = 30.7 bits (70), Expect = 1.3
Identities = 11/59 (18%), Positives = 31/59 (52%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K+ +++ ++ EK E + + E++ + +KKK K + +++ +K+ E + +
Sbjct: 241 KEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIA 299
Score = 30.7 bits (70), Expect = 1.3
Identities = 15/59 (25%), Positives = 29/59 (49%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
E ++ +EK E +++ K + + EKN KKK K + +++ +K KE+
Sbjct: 242 EYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAK 300
Score = 29.5 bits (67), Expect = 3.1
Identities = 16/60 (26%), Positives = 32/60 (53%), Gaps = 2/60 (3%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE--EQSVKEIEESLKE 77
KE E E+ EK ++D+ + + K +K KKK K + +++ E+ + E +E + +
Sbjct: 241 KEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAK 300
Score = 28.0 bits (63), Expect = 7.9
Identities = 19/61 (31%), Positives = 31/61 (50%), Gaps = 5/61 (8%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
+E +E+ E+ EK+ +EE K +K KKK K + +++ K I+E KE
Sbjct: 245 DELQEKLEKLEKDLAKLEE----KLEKNPNSKKKNKLAELEQQLASLEKRIDE-AKELIA 299
Query: 81 K 81
K
Sbjct: 300 K 300
>gnl|CDD|129200 TIGR00092, TIGR00092, GTP-binding protein YchF. This predicted
GTP-binding protein is found in a single copy in every
complete bacterial genome, and is found in Eukaryotes. A
more distantly related protein, separated from this
model, is found in the archaea. It is known to bind GTP
and double-stranded nucleic acid. It is suggested to
belong to a nucleoprotein complex and act as a
translation factor [Unknown function, General].
Length = 368
Score = 30.5 bits (69), Expect = 1.3
Identities = 16/55 (29%), Positives = 21/55 (38%), Gaps = 5/55 (9%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
E EE K D + K+ + KK + K KKEE + EI L
Sbjct: 126 FEIIDEELLKADEFLVE-----KRIGRSKKSAEGGKDKKEELLLLEIILPLLNGG 175
>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
peptidase. Corresponds to Merops family C37.
Norwalk-like viruses (NLVs), including the Southampton
virus, cause acute non-bacterial gastroenteritis in
humans. The NLV genome encodes three open reading frames
(ORFs). ORF1 encodes a polyprotein, which is processed
by the viral protease into six proteins.
Length = 535
Score = 30.6 bits (69), Expect = 1.3
Identities = 17/65 (26%), Positives = 27/65 (41%), Gaps = 5/65 (7%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK-----EIEESLK 76
E + E E++D + + KK K KK + KK S + E +E K
Sbjct: 204 XEVTYYDGCESEEDDEGWYIEPQDATPEGKKGKNKKGRGKKHNAFSRRGLSDEEYDEYKK 263
Query: 77 ERKEK 81
R+E+
Sbjct: 264 IREER 268
Score = 30.2 bits (68), Expect = 2.1
Identities = 25/109 (22%), Positives = 46/109 (42%), Gaps = 19/109 (17%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK-----------KKKKKKKKKK 63
+ + E +E++E E +D E K K KK + KK ++ + KK +
Sbjct: 206 VTYYDGCESEEDDEGWYIEPQDATPEGKKGKNKKGRGKKHNAFSRRGLSDEEYDEYKKIR 265
Query: 64 EEQSVK-EIEESLKERK-------EKTLTEDSQCDTMIPVFKKGVLYKG 104
EE+ K I+E L++R+ E+ TE C+ ++ +
Sbjct: 266 EERGGKYSIQEYLEDRERYEEELAERQATEADFCEEEEAKIRQRIFGLR 314
Score = 28.7 bits (64), Expect = 5.2
Identities = 16/63 (25%), Positives = 34/63 (53%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+R+ RGG ++ +E R + EE+ Q + EEE +++ +K +K++K++
Sbjct: 264 IREERGGKYSIQEYLEDRERYEEELAERQATEADFCEEEEAKIRQRIFGLRKTRKQRKEE 323
Query: 61 KKK 63
+ K
Sbjct: 324 RAK 326
>gnl|CDD|115185 pfam06513, DUF1103, Repeat of unknown function (DUF1103). This
family consists of several repeats of around 30
residues in length which are found specifically in
mature-parasite-infected erythrocyte surface antigen
proteins from Plasmodium falciparum. This family often
found in conjunction with pfam00226.
Length = 215
Score = 30.2 bits (67), Expect = 1.3
Identities = 25/85 (29%), Positives = 47/85 (55%), Gaps = 7/85 (8%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK-------KK 61
GE K K + +K++ EK +++++E+ K+ K+K KKK KK K+ K
Sbjct: 4 GEGDKEDVKEKNDGKKDKVIGSEKTQKEIKEKVEKRVKEKCKKKVKKGIKENDTEGNDKV 63
Query: 62 KKEEQSVKEIEESLKERKEKTLTED 86
K E ++E++E +K++ E + E+
Sbjct: 64 KGPEIIIEEVKEEIKKQVEDGIKEN 88
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 30.7 bits (70), Expect = 1.3
Identities = 14/39 (35%), Positives = 16/39 (41%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
K KK K K KKKKKK E + K +K
Sbjct: 351 KPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKK 389
Score = 30.0 bits (68), Expect = 1.8
Identities = 26/73 (35%), Positives = 33/73 (45%), Gaps = 13/73 (17%)
Query: 22 EEKEENEQE-EKEKEDVEEEKNKK------------KKKKKKKKKKKKKKKKKKKEEQSV 68
+EK NE+ EK E +++ K KK KK KK K K KKKKKK S
Sbjct: 314 KEKNFNEERVEKYIERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGASA 373
Query: 69 KEIEESLKERKEK 81
E+ +E K
Sbjct: 374 VPKSETSQEAKSS 386
Score = 28.8 bits (65), Expect = 5.4
Identities = 14/41 (34%), Positives = 17/41 (41%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
K KK K K KKKKKK +S E +K+
Sbjct: 351 KPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKV 391
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 30.7 bits (70), Expect = 1.4
Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 20/90 (22%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKN----------KKKKKKKKKKKKK 55
+G IE+ K+ E+ E+K++++ E KN K K++ K +
Sbjct: 497 ASSGLSDDEIERMVKDAEE--YAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEA 554
Query: 56 KKKKKKKKEEQ--------SVKEIEESLKE 77
KKK ++ E +EIE +E
Sbjct: 555 DKKKVEEAIEWLKEELEGEDKEEIEAKTEE 584
Score = 30.3 bits (69), Expect = 1.8
Identities = 17/75 (22%), Positives = 41/75 (54%), Gaps = 9/75 (12%)
Query: 10 EKKKRIEKRNKEE------EKEENEQEEKEKEDVEEEKNKKK---KKKKKKKKKKKKKKK 60
++K+RIE +N+ E EK E+ +K E +++ + K++ + + K++ + K
Sbjct: 522 KRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVEEAIEWLKEELEGEDKEEIEAK 581
Query: 61 KKKEEQSVKEIEESL 75
++ ++ V+ I E +
Sbjct: 582 TEELQKVVQPIGERM 596
Score = 29.5 bits (67), Expect = 3.1
Identities = 17/76 (22%), Positives = 36/76 (47%), Gaps = 4/76 (5%)
Query: 15 IEKRNKEEEKEENEQEEKE----KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+ ++K KE+ +++E ++ + KK+K++ + K + E+ V
Sbjct: 480 VSAKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYS 539
Query: 71 IEESLKERKEKTLTED 86
+E+SLKE +K D
Sbjct: 540 LEKSLKEEGDKLPEAD 555
>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
[Translation, ribosomal structure and biogenesis].
Length = 98
Score = 28.9 bits (65), Expect = 1.4
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKK 63
K +KK+K +KK++ KK K
Sbjct: 76 PPDKLIRKKRKLPRKKRRPKKPK 98
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 29.7 bits (67), Expect = 1.4
Identities = 14/39 (35%), Positives = 23/39 (58%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK 46
G+ + +E+ K +K + EQ+E K+ E EK +KKK
Sbjct: 103 NGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141
Score = 28.5 bits (64), Expect = 2.9
Identities = 17/83 (20%), Positives = 41/83 (49%), Gaps = 9/83 (10%)
Query: 13 KRIEKRNKEEEKEENEQE--------EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK-KK 63
KRI+K K +EE E + ++ +++E + + ++ KK + K KK
Sbjct: 63 KRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEANGDSEGLLEELKKHRDKLKK 122
Query: 64 EEQSVKEIEESLKERKEKTLTED 86
E++ +++ + L++ ++K +
Sbjct: 123 EQKELRKKLDELEKEEKKKIWSV 145
Score = 28.5 bits (64), Expect = 3.8
Identities = 14/46 (30%), Positives = 27/46 (58%), Gaps = 2/46 (4%)
Query: 12 KKRIEKRNKEEEK--EENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
KK +E+ N + E EE ++ + + ++E KK + +K++KKK
Sbjct: 96 KKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141
Score = 27.4 bits (61), Expect = 8.8
Identities = 15/54 (27%), Positives = 34/54 (62%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E ++ + K+E +E N E E++++ ++K KK++K+ +KK + +K++K
Sbjct: 86 EMLAELQDQLKKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEK 139
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 30.0 bits (68), Expect = 1.4
Identities = 19/40 (47%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 47 KKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK-EKTLTE 85
KKKK KK+++ KK KKK E V ++E K K +K L E
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLE 162
Score = 29.2 bits (66), Expect = 2.6
Identities = 14/39 (35%), Positives = 22/39 (56%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K+K EEE K KKK + K + +KK K ++++ E
Sbjct: 124 KKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLE 162
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 27.9 bits (63), Expect = 1.4
Identities = 11/26 (42%), Positives = 19/26 (73%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEE 65
EK +K+K+KK +K+++KK +EE
Sbjct: 39 EKPSEKRKRKKAAARKRRRKKLAREE 64
Score = 26.4 bits (59), Expect = 5.2
Identities = 8/21 (38%), Positives = 15/21 (71%)
Query: 46 KKKKKKKKKKKKKKKKKKEEQ 66
K+K+KK +K+++KK E+
Sbjct: 44 KRKRKKAAARKRRRKKLAREE 64
Score = 26.0 bits (58), Expect = 7.0
Identities = 8/25 (32%), Positives = 16/25 (64%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQS 67
K +K+K+KK +K+++KK +
Sbjct: 40 KPSEKRKRKKAAARKRRRKKLAREE 64
>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17.
Length = 93
Score = 28.6 bits (64), Expect = 1.4
Identities = 22/68 (32%), Positives = 31/68 (45%), Gaps = 8/68 (11%)
Query: 11 KKKRIEKRNKEEEKEENEQEE-----KEKEDVEEEKNKKKKKKKKK---KKKKKKKKKKK 62
K+K K + K+E ++ K E K KK KKKK K KK K KK+
Sbjct: 2 KRKGEAKDAAKVTKQEPKRRSARLSAKPAPPKPEPKPKKAAAKKKKPAVKGKKGAKGKKE 61
Query: 63 KEEQSVKE 70
+++ KE
Sbjct: 62 TKQEEAKE 69
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 30.7 bits (69), Expect = 1.4
Identities = 11/56 (19%), Positives = 31/56 (55%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
+E ++ +Q E+ + EK KK KKK+++++ ++ + ++ + E +++
Sbjct: 3 DESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAMLKGDLELLVERIQD 58
Score = 29.5 bits (66), Expect = 3.1
Identities = 14/61 (22%), Positives = 30/61 (49%), Gaps = 3/61 (4%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE-KTLTEDSQCD 90
++ D +++ ++ + +K+ KK KKK+EE+ + EE + + + L E Q
Sbjct: 2 VDESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLS--EEDAMLKGDLELLVERIQDP 59
Query: 91 T 91
Sbjct: 60 D 60
>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD. This
family consists of several putative bacterial stage IV
sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
is known to be essential for efficient sporulation
although its exact function is unknown.
Length = 383
Score = 30.4 bits (69), Expect = 1.4
Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 15/91 (16%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKE------------KEDVEEEKNKKKKKKKKKKKKK 55
G KK ++ R E EK + E++EK +E E ++K K K++ +
Sbjct: 277 IGGKKLPLKFRKNEFEKYDKEEDEKPLFLKWKLPISIVRETYYEVQDKVVKLTKEEAVE- 335
Query: 56 KKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
K KK +++ KEI++ K + EK L E
Sbjct: 336 --KGKKLAKKKLEKEIDKDAKIKDEKVLHER 364
>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 379
Score = 30.3 bits (68), Expect = 1.4
Identities = 15/83 (18%), Positives = 32/83 (38%), Gaps = 4/83 (4%)
Query: 3 KRRGGAGEKKKRIEKRNKE-EEKEENEQEEKEKEDVEEEK---NKKKKKKKKKKKKKKKK 58
+R G G E + E E + + EK D E+ + K + K++
Sbjct: 188 RRIGVDGISLSLEETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELD 247
Query: 59 KKKKKEEQSVKEIEESLKERKEK 81
KK + +++ +E + ++
Sbjct: 248 AKKAELSKALAALEAANAADEDP 270
>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020). This
family of fungal proteins is conserved towards the
C-terminus of HMG domains. The function is not known.
Length = 49
Score = 27.6 bits (61), Expect = 1.4
Identities = 13/42 (30%), Positives = 20/42 (47%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
K E+ K++ + KKK KKK K+ K+K +E
Sbjct: 7 KWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDAYIDDE 48
Score = 26.8 bits (59), Expect = 2.6
Identities = 13/47 (27%), Positives = 25/47 (53%)
Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
N + +K+ + E+NK + + KKK KKK K+ ++ I++
Sbjct: 1 NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDAYIDD 47
>gnl|CDD|241471 cd13317, PH_PLEKHO1_PLEKHO2, Pleckstrin homology
domain-containing family O Pleckstrin homology domain.
The PLEKHO family members are PLEKHO1 (also called
CKIP-1/Casein kinase 2-interacting protein
1/CK2-interacting protein 1) and PLEKHO2 (PLEKHQ1/PH
domain-containing family Q member 1). They both contain
a single PH domain. PLEKHO1 acts as a scaffold protein
that functions in plasma membrane recruitment,
transcriptional activity modulation, and
posttranscriptional modification regulation. As an
adaptor protein it is involved in signaling pathways,
apoptosis, differentiation, cytoskeleton, and bone
formation. Not much is know about PLEKHO2. PH domains
have diverse functions, but in general are involved in
targeting proteins to the appropriate cellular location
or in the interaction with a binding partner. They
share little sequence conservation, but all have a
common fold, which is electrostatically polarized. Less
than 10% of PH domains bind phosphoinositide phosphates
(PIPs) with high affinity and specificity. PH domains
are distinguished from other PIP-binding domains by
their specific high-affinity binding to PIPs with two
vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
or PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved
across all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 101
Score = 28.6 bits (64), Expect = 1.5
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
E EQE + ED E + + +K + KKK+K +
Sbjct: 38 EKEQEVFDLEDYELCEYLRCQKSRSKKKRKSR 69
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 30.6 bits (69), Expect = 1.5
Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 18/83 (21%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKK-----KKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
K E + E +K E K KK+ K++ +K +++ +++ K E Q +K+IE L ER
Sbjct: 50 KAERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTER 109
Query: 79 -------------KEKTLTEDSQ 88
KEKTL Q
Sbjct: 110 ATSLDRKDENLSSKEKTLESKEQ 132
Score = 29.1 bits (65), Expect = 4.0
Identities = 24/93 (25%), Positives = 53/93 (56%), Gaps = 10/93 (10%)
Query: 5 RGGAGEKKKRIEKRNKEEEK--------EENEQEEKEKEDVEEE-KNKKKKKKKKKKKKK 55
RG A + I+K K E K E E+ K +E++E+E K+++++ K+ + +
Sbjct: 48 RGKAERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLT 107
Query: 56 KKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
++ ++++++ E++L E KE++LT+ S+
Sbjct: 108 ERATSLDRKDENLSSKEKTL-ESKEQSLTDKSK 139
>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein. This family includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 116
Score = 29.1 bits (65), Expect = 1.5
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK 47
E+R+ + E+E+E +E+ E+ K KKK K
Sbjct: 85 ERRSARMARLEDERELAREEEEEKRKKKKKNK 116
Score = 27.9 bits (62), Expect = 4.0
Identities = 12/39 (30%), Positives = 21/39 (53%), Gaps = 3/39 (7%)
Query: 21 EEEKEENEQEEK---EKEDVEEEKNKKKKKKKKKKKKKK 56
E ++ E+ ED E +++++K+KKKKK K
Sbjct: 78 FMEIQKEERRSARMARLEDERELAREEEEEKRKKKKKNK 116
Score = 27.5 bits (61), Expect = 4.3
Identities = 14/37 (37%), Positives = 21/37 (56%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK 52
E + +E + E E+E EE+ +K+KKKKK K
Sbjct: 80 EIQKEERRSARMARLEDERELAREEEEEKRKKKKKNK 116
>gnl|CDD|232933 TIGR00348, hsdR, type I site-specific deoxyribonuclease, HsdR
family. This gene is part of the type I restriction and
modification system which is composed of three
polypeptides R (restriction endonuclease), M
(modification) and S (specificity). This group of
enzymes recognize specific short DNA sequences and have
an absolute requirement for ATP (or dATP) and
S-adenosyl-L-methionine. They also catalyse the
reactions of EC 2.1.1.72 and EC 2.1.1.73, with similar
site specificity.(J. Mol. Biol. 271 (3), 342-348
(1997)). Members of this family are assumed to differ
from each other in DNA site specificity [DNA metabolism,
Restriction/modification].
Length = 667
Score = 30.4 bits (69), Expect = 1.5
Identities = 16/69 (23%), Positives = 33/69 (47%), Gaps = 3/69 (4%)
Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK--KKKKKKKKKKKEEQSVKEI 71
R+ + + +++K + ++ E + E + K+ K+K K KK + + E K+I
Sbjct: 444 RLPEDHLDKKKLDAF-FDEIFELLPERIREITKESLKEKLQKTKKILFNEDRLESIAKDI 502
Query: 72 EESLKERKE 80
E + KE
Sbjct: 503 AEHYAKFKE 511
>gnl|CDD|129660 TIGR00569, ccl1, cyclin ccl1. All proteins in this family for
which functions are known are cyclins that are
components of TFIIH, a complex that is involved in
nucleotide excision repair and transcription initiation.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, StanfordUniversity) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 305
Score = 30.2 bits (68), Expect = 1.5
Identities = 19/62 (30%), Positives = 31/62 (50%), Gaps = 1/62 (1%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE-QSVKEIEESLKERKEKTLT 84
N +E + D+ E KK ++ + +K KKK +E S++E S+K RK K
Sbjct: 236 GNREELPQLIDIMRELRILVKKYEEPRSEKVAALKKKLDECHSIEENLSSVKIRKRKGYE 295
Query: 85 ED 86
+D
Sbjct: 296 DD 297
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 30.7 bits (70), Expect = 1.5
Identities = 16/60 (26%), Positives = 35/60 (58%), Gaps = 8/60 (13%)
Query: 12 KKRIEKRNKE----EE----KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
++++EK +E E+ K+E E+ +K+++E+ K K ++ K ++ KK +K+ K
Sbjct: 216 EEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEVKKVIEKELTK 275
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 29.4 bits (66), Expect = 1.5
Identities = 11/34 (32%), Positives = 23/34 (67%)
Query: 45 KKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
K KK+++K+++ + KEE+ + EI + LK++
Sbjct: 92 KLTSKKEEEKEEEIPEPTKEEELLGEIRDLLKQQ 125
>gnl|CDD|206035 pfam13864, Enkurin, Calmodulin-binding. This is a family of
apparent calmodulin-binding proteins found at high
levels in the testis and vomeronasal organ and at lower
levels in certain other tissues. Enkurin is a scaffold
protein that binds PI3 kinase to sperm transient
receptor potential (canonical) (TRPC) channels. The
mammalian transient receptor potential (canonical)
channels are the primary candidates for the Ca(2+)
entry pathway activated by the hormones, growth
factors, and neurotransmitters that exert their effect
through activation of PLC. Calmodulin binds to the
C-terminus of all TRPC channels, and dissociation of
calmodulin from TRPC4 results in profound activation of
the channel.
Length = 98
Score = 28.7 bits (65), Expect = 1.5
Identities = 21/85 (24%), Positives = 39/85 (45%), Gaps = 23/85 (27%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDV---------EEEKNKKKKKKKKKKKKK---------- 55
+ KR +E +E+ E+E +E++ EEE+ + KK +
Sbjct: 7 LLKRKEEIAEEKEERERQEEDPDCPPGHRLLSEEERLELLNGLKKNWDELNKEYQKLPVV 66
Query: 56 ----KKKKKKKKEEQSVKEIEESLK 76
KK++K++ E+ + EIEE +K
Sbjct: 67 IDTPSKKRRKEELEKELAEIEEDIK 91
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 30.6 bits (70), Expect = 1.5
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEE 65
+K++ KKKKK+KK KK + +K E
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755
Score = 29.4 bits (67), Expect = 3.9
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 47 KKKKKKKKKKKKKKKKKEEQSV 68
+K++ KKKKK+KK KK E + +
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754
Score = 29.0 bits (66), Expect = 4.2
Identities = 10/21 (47%), Positives = 16/21 (76%)
Query: 41 KNKKKKKKKKKKKKKKKKKKK 61
+ ++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 29.0 bits (66), Expect = 4.7
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 40 EKNKKKKKKKKKKKKKKKKKK 60
EK + KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 28.7 bits (65), Expect = 6.5
Identities = 11/23 (47%), Positives = 17/23 (73%)
Query: 45 KKKKKKKKKKKKKKKKKKKEEQS 67
+K++ KKKKK+KK KK + E+
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755
>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain. FH proteins control
rearrangements of the actin cytoskeleton, especially in
the context of cytokinesis and cell polarisation.
Members of this family have been found to interact with
Rho-GTPases, profilin and other actin-assoziated
proteins. These interactions are mediated by the
proline-rich FH1 domain, usually located in front of FH2
(but not listed in SMART). Despite this cytosolic
function, vertebrate formins have been assigned
functions within the nucleus. A set of Formin-Binding
Proteins (FBPs) has been shown to bind FH1 with their WW
domain.
Length = 392
Score = 30.4 bits (69), Expect = 1.6
Identities = 22/83 (26%), Positives = 34/83 (40%), Gaps = 20/83 (24%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV--------------- 68
EE+E + E E++ K K K K +KK KKK +E +
Sbjct: 32 DEESEGDLDELEELFSAKEKTKSASKDVSEKKSILKKKASQEFKILDPKRSQNLAILLRK 91
Query: 69 -----KEIEESLKERKEKTLTED 86
+EI+E++ E E L+ D
Sbjct: 92 LHMSYEEIKEAILEGDEDVLSVD 114
Score = 28.9 bits (65), Expect = 4.3
Identities = 13/46 (28%), Positives = 25/46 (54%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
E K K +E K+E E+EE+ K+ V+E ++ ++K++
Sbjct: 325 EFLKEFSKAAEENIKKEEEEEERRKKLVKETTEYEQSSSRQKERNP 370
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of
70kDa MW N terminal. This domain is found in
eukaryotes. This domain is about 90 amino acids in
length. This domain is found associated with pfam00076.
This domain is part of U1 snRNP, which is the pre-mRNA
binding protein of the penta-snRNP spliceosome complex.
It extends over a distance of 180 A from its RNA
binding domain, wraps around the core domain of U1
snRNP consisting of the seven Sm proteins and finally
contacts U1-C, which is crucial for 5'-splice-site
recognition.
Length = 94
Score = 28.4 bits (64), Expect = 1.6
Identities = 11/41 (26%), Positives = 28/41 (68%)
Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
++ K++ + + +K++++K++K+E+ K++EE LKE
Sbjct: 47 FKDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87
Score = 28.4 bits (64), Expect = 1.8
Identities = 11/40 (27%), Positives = 27/40 (67%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
++ ++E E E K++++K++KK+K +KK +++ +E
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87
Score = 27.6 bits (62), Expect = 3.3
Identities = 11/43 (25%), Positives = 25/43 (58%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
E + + E E +K++++K++KK+K +KK ++E +
Sbjct: 45 SEFKDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87
Score = 27.6 bits (62), Expect = 3.3
Identities = 9/36 (25%), Positives = 25/36 (69%)
Query: 35 EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+D ++E + + +K++++K++KK+K E+ ++E
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEE 83
Score = 27.2 bits (61), Expect = 4.1
Identities = 11/40 (27%), Positives = 26/40 (65%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
K+ ++ E + EK +++K++KK+K +KK +++ K+
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87
Score = 26.5 bits (59), Expect = 7.6
Identities = 10/40 (25%), Positives = 26/40 (65%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+ ++E + E E++ ++K++KK+K +KK +++ K+
Sbjct: 48 KDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKE 87
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 30.4 bits (68), Expect = 1.6
Identities = 20/73 (27%), Positives = 41/73 (56%), Gaps = 5/73 (6%)
Query: 12 KKRIEKRNKEEEKEENE-----QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
K +EK +E + +N+ +EE+ + D E E+ ++ KK+ +K + ++K K ++
Sbjct: 220 KHPLEKLEEEIDILKNDGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDE 279
Query: 67 SVKEIEESLKERK 79
+K IE +K+R
Sbjct: 280 EIKFIEHDVKDRN 292
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 30.0 bits (67), Expect = 1.6
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKED 36
E+K+ E++ +EE +EE +QEE + D
Sbjct: 216 EEKQPQEEQEEEEVEEEAKQEEGQGTD 242
Score = 28.9 bits (64), Expect = 3.9
Identities = 15/75 (20%), Positives = 34/75 (45%), Gaps = 12/75 (16%)
Query: 20 KEEEKEENEQEE--------KEKEDVEEEKNKKK----KKKKKKKKKKKKKKKKKKEEQS 67
E+ N EE V+ K K++ K ++ ++ K ++K+ ++E++
Sbjct: 167 PWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEE 226
Query: 68 VKEIEESLKERKEKT 82
+ EE+ +E + T
Sbjct: 227 EEVEEEAKQEEGQGT 241
Score = 28.5 bits (63), Expect = 5.4
Identities = 11/62 (17%), Positives = 34/62 (54%), Gaps = 5/62 (8%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
G + + K+E+ QE ++E K ++K+ ++++++++ +++ K++E Q
Sbjct: 187 GGSVQVKAPKPKQEQLLSKLQEY-----LQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGT 241
Query: 69 KE 70
+
Sbjct: 242 DD 243
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 30.4 bits (69), Expect = 1.6
Identities = 17/83 (20%), Positives = 39/83 (46%), Gaps = 8/83 (9%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK--------KKKKKKKKKKKKKKK 59
A K+K+ E++ + E EQ+ E E K + +++ K K + +
Sbjct: 619 AVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAE 678
Query: 60 KKKKEEQSVKEIEESLKERKEKT 82
+K++ E +++++ LK+ E+
Sbjct: 679 RKQQAETQLRQLDAQLKQLLEQQ 701
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 30.4 bits (69), Expect = 1.6
Identities = 12/33 (36%), Positives = 22/33 (66%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEK 41
G ++ EEE++E+E+EE+E+E+ EE +
Sbjct: 337 GRCTWVNPEQKDEEEEQEDEEEEEEEEEPEEPE 369
>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
Length = 763
Score = 30.6 bits (69), Expect = 1.6
Identities = 15/31 (48%), Positives = 17/31 (54%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKK 55
E E EK + + E E K KK KK KKKK
Sbjct: 228 EGGELIEKPQAEAENEAGKSDKKDKKSKKKK 258
Score = 28.7 bits (64), Expect = 5.4
Identities = 13/25 (52%), Positives = 13/25 (52%)
Query: 33 EKEDVEEEKNKKKKKKKKKKKKKKK 57
EK E E K KK KK KKKK
Sbjct: 234 EKPQAEAENEAGKSDKKDKKSKKKK 258
>gnl|CDD|224234 COG1315, COG1315, Uncharacterized conserved protein [Function
unknown].
Length = 543
Score = 30.5 bits (69), Expect = 1.7
Identities = 19/105 (18%), Positives = 36/105 (34%), Gaps = 10/105 (9%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE- 65
G + K EE E+ +K + + K + KK K+
Sbjct: 403 GMDPEIVERLKELTEEISLHEERLKKLTKLLVALVKVKIESKKNILPPDKESLLTAVNNT 462
Query: 66 -----QSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGI 105
S+++I+ L+ +E+ L + V K +Y G+
Sbjct: 463 KITLRNSIEKIKAELEGLQEE-LEVVGIEGS---VSVKNQVYSGV 503
>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
Phosphatase-Interacting Proteins. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Vetebrates contain two
Proline-Serine-Threonine Phosphatase-Interacting
Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
mainly expressed in hematopoietic cells and are involved
in the regulation of cell adhesion and motility.
Mutations in PSTPIPs have been shown to cause
autoinflammatory disorders. PSTPIP1 contains an
N-terminal F-BAR domain, PEST motifs, and a C-terminal
SH3 domain, while PSTPIP2 contains only the N-terminal
F-BAR domain. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 239
Score = 29.8 bits (67), Expect = 1.7
Identities = 21/63 (33%), Positives = 34/63 (53%), Gaps = 5/63 (7%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+ R E EE +E+ ++E K+ ED+ K+ +K KK+ KK K KK EQ +E
Sbjct: 81 QSLREEAEKLEEFREKQKEERKKTEDI-----MKRSQKNKKELYKKTMKAKKSYEQKCRE 135
Query: 71 IEE 73
++
Sbjct: 136 KDK 138
Score = 29.0 bits (65), Expect = 3.1
Identities = 14/59 (23%), Positives = 29/59 (49%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K K+ + + K+ EQ+ +EK+ E+ K + K+ +K KKK + + + +
Sbjct: 113 KNKKELYKKTMKAKKSYEQKCREKDKAEQAYEKSSSGAQPKEAEKLKKKAAQCKTSAEE 171
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 30.5 bits (69), Expect = 1.7
Identities = 17/55 (30%), Positives = 32/55 (58%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
+ +E E+E E++E ++ E K + K++K + K+ KK+KK+E Q + E
Sbjct: 27 SDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLKELKKQKKQEIQKILE 81
Score = 28.6 bits (64), Expect = 6.2
Identities = 17/58 (29%), Positives = 35/58 (60%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
E+E E+ E +E+ + + E +K++K + K+ KK+KK++ +K EQ I+ + +
Sbjct: 37 EDEDEDEEDDEEAESPAKAEISKREKARLKELKKQKKQEIQKILEQQNAAIDADMNNK 94
Score = 28.2 bits (63), Expect = 9.8
Identities = 15/54 (27%), Positives = 32/54 (59%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
++ E ++E E E+ +EE K + K++K + K+ KK+++Q +++I E
Sbjct: 29 SDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLKELKKQKKQEIQKILEQ 82
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 30.0 bits (67), Expect = 1.7
Identities = 12/75 (16%), Positives = 38/75 (50%), Gaps = 1/75 (1%)
Query: 9 GEKKKRIEKRNKEEEK-EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
+K R+ + N + E+N++ + DV+ E+ ++ ++ +++++ ++ +
Sbjct: 145 LAEKPRVTRFNIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEG 204
Query: 68 VKEIEESLKERKEKT 82
+E+ E +E E +
Sbjct: 205 NRELNEEEEEEAEGS 219
Score = 29.2 bits (65), Expect = 2.8
Identities = 14/68 (20%), Positives = 33/68 (48%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+++R+E+ + EE+E+ E + E N++++++ + + + E K
Sbjct: 177 EEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKK 236
Query: 70 EIEESLKE 77
+ EE E
Sbjct: 237 QGEEEEME 244
Score = 28.0 bits (62), Expect = 7.8
Identities = 14/75 (18%), Positives = 37/75 (49%), Gaps = 8/75 (10%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--------KKKKKKKKKEEQS 67
+ EEE+EE E + ++ V+ E + KK+ ++++ + + +++ E+
Sbjct: 205 NRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEV 264
Query: 68 VKEIEESLKERKEKT 82
+ EES ++++
Sbjct: 265 PRNNEESPAKKQKVE 279
>gnl|CDD|233936 TIGR02578, cas_TM1811_Csm1, CRISPR-associated protein Cas10/Csm1,
subtype III-A/MTUBE. The family is designated Csm2, for
CRISPR/Cas Subtype Mtube Protein 2. A typical example is
TM1811 from Thermotoga maritima. CRISPR are Clustered
Regularly Interspaced Short Palindromic Repeats. This
protein family belongs to a conserved gene cluster
regularly found near CRISPR repeats [Mobile and
extrachromosomal element functions, Other].
Length = 648
Score = 30.4 bits (69), Expect = 1.7
Identities = 22/122 (18%), Positives = 43/122 (35%), Gaps = 13/122 (10%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEK----------EKEDVEEEKNKKKKKKKKKKKKKKKK 58
E R+E + EE+E +EE K K K + +
Sbjct: 82 LEIIDRLEAGDDREEEELGNEEEPLVSPFPKIPLGNSLRNAGGFKPKYPLKSLNRAVPPE 141
Query: 59 KKKKKEEQSVKEIEESLKERKEKTL-TEDSQCDTMIPVFKKGVLYKGIYCPSLTNSFREP 117
K +++ K + E EK + + + ++ +F+K L+ + + S +
Sbjct: 142 NDNKASQENYKRLMEKFLGDLEKLKPSYPDKIELLLTLFEK--LWSFVPAATNGGSKPDV 199
Query: 118 SL 119
SL
Sbjct: 200 SL 201
>gnl|CDD|153367 cd07683, F-BAR_srGAP1, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Protein 1. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs. srGAP1, also called Rho
GTPase-Activating Protein 13 (ARHGAP13), is a Cdc42- and
RhoA-specific GAP and is expressed later in the
development of CNS (central nervous system) tissues. It
is an important downstream signaling molecule of Robo1.
srGAP1 contains an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 253
Score = 30.0 bits (67), Expect = 1.7
Identities = 20/69 (28%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
E + KE EK+E +Q + + V + + + +++ KK +K K+K++ + S ++ +S+
Sbjct: 154 ESKLKEAEKQEEKQIGRSGDPVFHIRLEDRHQRRSSVKKIEKMKEKRQAKYSENKL-KSI 212
Query: 76 KERKEKTLT 84
K R E LT
Sbjct: 213 KARNEYLLT 221
>gnl|CDD|218481 pfam05177, RCSD, RCSD region. Proteins contain this region include
C.elegans UNC-89. This region is found repeated in
UNC-89 and shows conservation in prolines, lysines and
glutamic acids. Proteins with RCSD are involved in
muscle M-line assembly, but the function of this region
RCSD is not clear.
Length = 101
Score = 28.7 bits (63), Expect = 1.7
Identities = 27/93 (29%), Positives = 41/93 (44%), Gaps = 3/93 (3%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKED--VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
K K+ K + + EK +ED EE K+ +KK+K +K K K KKE+
Sbjct: 1 GVKSPGKKEKSPLRRSPSRTEKTEEDRATEEAKSPEKKEKSPEKVDGKPKSPTKKEKSPE 60
Query: 69 KEIEESLKE-RKEKTLTEDSQCDTMIPVFKKGV 100
K E +K K++ E + P K+ V
Sbjct: 61 KSATEEVKSPTKKEKSPEKVEEKPASPTKKERV 93
Score = 28.3 bits (62), Expect = 2.6
Identities = 19/61 (31%), Positives = 32/61 (52%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+K+K EK + + + +++ EK EE K+ KK+K +K ++K KKE K
Sbjct: 37 KKEKSPEKVDGKPKSPTKKEKSPEKSATEEVKSPTKKEKSPEKVEEKPASPTKKERVQKK 96
Query: 70 E 70
E
Sbjct: 97 E 97
Score = 27.9 bits (61), Expect = 3.2
Identities = 19/72 (26%), Positives = 35/72 (48%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+ + E+ E+KE++ ++ K +K K +K ++ K KK+K E+ K
Sbjct: 24 EEDRATEEAKSPEKKEKSPEKVDGKPKSPTKKEKSPEKSATEEVKSPTKKEKSPEKVEEK 83
Query: 70 EIEESLKERKEK 81
+ KER +K
Sbjct: 84 PASPTKKERVQK 95
>gnl|CDD|218637 pfam05558, DREPP, DREPP plasma membrane polypeptide. This family
contains several plant plasma membrane proteins termed
DREPPs as they are developmentally regulated plasma
membrane polypeptides.
Length = 201
Score = 29.6 bits (66), Expect = 1.7
Identities = 21/68 (30%), Positives = 27/68 (39%), Gaps = 3/68 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EKK E E EK+ KEKE V EE+ K+++ K + K EE K
Sbjct: 133 EKKPEEEAPAAETEKKP---AVKEKEIVVEEEKKEEEAVPVAAAPSTKAPETKVEETPAK 189
Query: 70 EIEESLKE 77
E
Sbjct: 190 APAAPAAE 197
>gnl|CDD|138835 PRK12279, PRK12279, 50S ribosomal protein L22/unknown domain fusion
protein; Provisional.
Length = 311
Score = 30.0 bits (67), Expect = 1.7
Identities = 16/74 (21%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 10 EKKKRIEK-RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
E++K + + K+ +K + + + E + K K + K +K + + K+E+Q +
Sbjct: 114 EREKELAAIKAKKSKKPLPVEPKAKVETKKVAKPSKVEVKPVEKDENVDPELLKREQQVL 173
Query: 69 KEIEESLKERKEKT 82
K +E++ +++E+T
Sbjct: 174 KVVEKTASQKEEET 187
Score = 29.3 bits (65), Expect = 3.5
Identities = 15/60 (25%), Positives = 30/60 (50%), Gaps = 5/60 (8%)
Query: 29 QEEKEKEDVEEEKNKKKK-----KKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
+ EKE ++ +K+KK K + KK K K + K + + ++ L +R+++ L
Sbjct: 114 EREKELAAIKAKKSKKPLPVEPKAKVETKKVAKPSKVEVKPVEKDENVDPELLKREQQVL 173
>gnl|CDD|217838 pfam04004, Leo1, Leo1-like protein. Members of this family are
part of the Paf1/RNA polymerase II complex. The Paf1
complex probably functions during the elongation phase
of transcription. The Leo1 subunit of the yeast
Paf1-complex binds RNA and contributes to complex
recruitment. The subunit acts by co-ordinating
co-transcriptional chromain modifications and helping
recruitment of mRNA 3prime-end processing factors.
Length = 312
Score = 30.2 bits (68), Expect = 1.7
Identities = 19/91 (20%), Positives = 38/91 (41%), Gaps = 7/91 (7%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+ R+ +G K I ++ E EK E E++E E+K + +++++ ++K K K
Sbjct: 216 LATRQRKSGGVKITIVGKDPEHEKREREKKE-------EQKLRARRRRQNREKMKNKPPN 268
Query: 61 KKKEEQSVKEIEESLKERKEKTLTEDSQCDT 91
+ + E S C+T
Sbjct: 269 RPGHGSGSDSNVAKAATTYSEDEDEGSDCET 299
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 30.4 bits (69), Expect = 1.7
Identities = 13/35 (37%), Positives = 22/35 (62%), Gaps = 3/35 (8%)
Query: 34 KEDV---EEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
++DV EEK K+KK KK+ K++++ + K E
Sbjct: 386 RDDVYKEREEKEKEKKAKKEGKEERRIHFQNKSIE 420
Score = 29.2 bits (66), Expect = 4.0
Identities = 15/38 (39%), Positives = 22/38 (57%), Gaps = 8/38 (21%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
E+EEKEKE KK KK+ K++++ + K EE
Sbjct: 392 EREEKEKE--------KKAKKEGKEERRIHFQNKSIEE 421
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 30.4 bits (69), Expect = 1.8
Identities = 15/66 (22%), Positives = 32/66 (48%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
IE+ N+E + EE +E++ +E K ++ +++ ++ E++ +EE
Sbjct: 36 IERINQERGSFRDLTEESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEF 95
Query: 75 LKERKE 80
K R E
Sbjct: 96 PKARDE 101
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 30.4 bits (69), Expect = 1.8
Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 2/75 (2%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK--KKKKKKEEQS 67
E R++K + EE E K + ++ + K K+ ++ + KK +
Sbjct: 928 ELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREG 987
Query: 68 VKEIEESLKERKEKT 82
K E +KE
Sbjct: 988 NKANSELKNFKKELA 1002
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 30.3 bits (68), Expect = 1.8
Identities = 25/97 (25%), Positives = 42/97 (43%), Gaps = 12/97 (12%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKN---------KKKKKKKKK 51
M KR + + EK+ EE+K + E+EE + K+ K+ +
Sbjct: 481 MEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEM 540
Query: 52 KKKKKKKKKKKKEEQSV---KEIEESLKERKEKTLTE 85
+ KK + K KEE+ KE +E K ++ + TE
Sbjct: 541 EIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETE 577
Score = 29.9 bits (67), Expect = 2.7
Identities = 22/90 (24%), Positives = 44/90 (48%), Gaps = 7/90 (7%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+KR+ E R + +E E ++ K+D+E E K + K K+++ + +K+
Sbjct: 503 KKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKE 562
Query: 62 ----KKEEQSVKEIE---ESLKERKEKTLT 84
+K ++S KE E +L+ ++K L
Sbjct: 563 AQELRKYQESEKETEVLMSALQAMQDKNLM 592
>gnl|CDD|203820 pfam07956, DUF1690, Protein of Unknown function (DUF1690). Family
of uncharacterized fungal proteins.
Length = 142
Score = 29.4 bits (66), Expect = 1.8
Identities = 25/97 (25%), Positives = 42/97 (43%), Gaps = 9/97 (9%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDV-----EEEKNKKKKKKKKKKKKKKKKKKKKKE 64
E K+ E ++ ++ + + E +K ++ K ++K+K KK K+
Sbjct: 50 ETLKKFEDTLNSSLLSDDSKDNENGLSSNLLNEKIESLTEKLEEFSKLEEKEKSKKLKEV 109
Query: 65 EQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVL 101
E + E+ E L E K K L C + FKK VL
Sbjct: 110 EDARSELTECLLENKGKPLN----CYDEVEAFKKLVL 142
>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
module that binds membranes and detects membrane
curvature. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions including organelle
biogenesis, membrane trafficking or remodeling, and cell
division and migration. Mutations in BAR containing
proteins have been linked to diseases and their
inactivation in cells leads to altered membrane
dynamics. A BAR domain with an additional N-terminal
amphipathic helix (an N-BAR) can drive membrane
curvature. These N-BAR domains are found in amphiphysins
and endophilins, among others. BAR domains are also
frequently found alongside domains that determine lipid
specificity, such as the Pleckstrin Homology (PH) and
Phox Homology (PX) domains which are present in beta
centaurins (ACAPs and ASAPs) and sorting nexins,
respectively. A FES-CIP4 Homology (FCH) domain together
with a coiled coil region is called the F-BAR domain and
is present in Pombe/Cdc15 homology (PCH) family
proteins, which include Fes/Fes tyrosine kinases, PACSIN
or syndapin, CIP4-like proteins, and srGAPs, among
others. The Inverse (I)-BAR or IRSp53/MIM homology
Domain (IMD) is found in multi-domain proteins, such as
IRSp53 and MIM, that act as scaffolding proteins and
transducers of a variety of signaling pathways that link
membrane dynamics and the underlying actin cytoskeleton.
BAR domains form dimers that bind to membranes, induce
membrane bending and curvature, and may also be involved
in protein-protein interactions. The I-BAR domain
induces membrane protrusions in the opposite direction
compared to classical BAR and F-BAR domains, which
produce membrane invaginations. BAR domains that also
serve as protein interaction domains include those of
arfaptin and OPHN1-like proteins, among others, which
bind to Rac and Rho GAP domains, respectively.
Length = 194
Score = 29.7 bits (67), Expect = 1.8
Identities = 19/80 (23%), Positives = 42/80 (52%), Gaps = 1/80 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ K I+KR K+ +K + + +E +++ + KKK K + +++ ++ K+K E+ +
Sbjct: 91 KDLKEIKKRRKKLDKARLDYDAA-REKLKKLRKKKKDSSKLAEAEEELQEAKEKYEELRE 149
Query: 70 EIEESLKERKEKTLTEDSQC 89
E+ E L + +EK
Sbjct: 150 ELIEDLNKLEEKRKELFLSL 169
>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
Provisional.
Length = 231
Score = 30.0 bits (67), Expect = 1.8
Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 1/69 (1%)
Query: 21 EEEKEENEQEE-KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
E ++E +QE+ E V K K++ KKK K K +K K + + ++ + +
Sbjct: 61 EPSEQECQQEQLPVPESVPPAPVKTPKRRTKKKAKADKPEKSPKAVEKLCPPDDRDDKNE 120
Query: 80 EKTLTEDSQ 88
EK TE++Q
Sbjct: 121 EKEPTEEAQ 129
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 30.0 bits (68), Expect = 1.9
Identities = 12/56 (21%), Positives = 21/56 (37%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
K I + + +K ++K+K KK KK KK+K++
Sbjct: 195 KNILLKLNDNDKTAASSASVVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPP 250
>gnl|CDD|220184 pfam09332, Mcm10, Mcm10 replication factor. Mcm10 is a eukaryotic
DNA replication factor that regulates the stability and
chromatin association of DNA polymerase alpha.
Length = 346
Score = 29.9 bits (67), Expect = 1.9
Identities = 18/89 (20%), Positives = 37/89 (41%), Gaps = 4/89 (4%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
KR+ K++ I++R ++ Q K +EE KK++ + ++ +K K
Sbjct: 158 KRKRSESGKRRAIDERVEKNASSAKRQ----KSPEDEEPAMKKRRYLESEEFQKILAAKS 213
Query: 63 KEEQSVKEIEESLKERKEKTLTEDSQCDT 91
++E E +E L + Q +
Sbjct: 214 SHTGLLEEAEREAQEEYFNPLEKKEQMEE 242
>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
domain [Chromatin structure and dynamics].
Length = 211
Score = 29.4 bits (66), Expect = 1.9
Identities = 20/77 (25%), Positives = 31/77 (40%), Gaps = 2/77 (2%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
GE K + ++ KE EE E KE E ++K++ KK K + E +
Sbjct: 99 GEVGKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEYNKKLPNKAPIGPFIENEPKI 158
Query: 69 KEIEESLKERKEKTLTE 85
+ E +K L E
Sbjct: 159 RPKVE--GPSPDKALVE 173
Score = 28.7 bits (64), Expect = 4.4
Identities = 20/95 (21%), Positives = 32/95 (33%), Gaps = 31/95 (32%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEK--------EDVEEEKNKKKKKKKK----------- 50
EK+ ++ N + E+ + E+EE K E + K + K +
Sbjct: 116 EKEPYYKEANSDRERYQREKEEYNKKLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEET 175
Query: 51 ------------KKKKKKKKKKKKKEEQSVKEIEE 73
KKKK K KK +E+ E
Sbjct: 176 KIISKAWSELDESKKKKYIDKYKKLKEEYDSFYPE 210
>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
Members of the NSCC2 family have been sequenced from
various yeast, fungal and animals species including
Saccharomyces cerevisiae, Drosophila melanogaster and
Homo sapiens. These proteins are the Sec62 proteins,
believed to be associated with the Sec61 and Sec63
constituents of the general protein secretary systems of
yeast microsomes. They are also the non-selective cation
(NS) channels of the mammalian cytoplasmic membrane. The
yeast Sec62 protein has been shown to be essential for
cell growth. The mammalian NS channel proteins has been
implicated in platelet derived growth factor(PGDF)
dependent single channel current in fibroblasts. These
channels are essentially closed in serum deprived
tissue-culture cells and are specifically opened by
exposure to PDGF. These channels are reported to exhibit
equal selectivity for Na+, K+ and Cs+ with low
permeability to Ca2+, and no permeability to anions
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 232
Score = 29.9 bits (67), Expect = 2.0
Identities = 13/26 (50%), Positives = 15/26 (57%)
Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQS 67
K K KKK K KK KKK+ K E +
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKREAN 232
Score = 28.7 bits (64), Expect = 4.3
Identities = 13/23 (56%), Positives = 16/23 (69%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKK 62
EK+K KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229
Score = 27.6 bits (61), Expect = 9.5
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKK 60
E++K KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229
>gnl|CDD|222855 PHA02529, O, capsid-scaffolding protein; Provisional.
Length = 278
Score = 29.7 bits (67), Expect = 2.0
Identities = 20/83 (24%), Positives = 38/83 (45%), Gaps = 4/83 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVE---EEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
EK K++ R KE++ +EQ E VE E+ + + + + ++ K EE
Sbjct: 176 EKVKKLFSRKKEDDAMTDEQFADVHEAVEGVAEKIDHTSAQVATTETAEPPQEGKDTEEN 235
Query: 67 SVKEIEES-LKERKEKTLTEDSQ 88
+ S LKE +K ++ ++
Sbjct: 236 GETAEQFSALKETLDKLASKFNE 258
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 30.0 bits (67), Expect = 2.0
Identities = 19/74 (25%), Positives = 36/74 (48%), Gaps = 6/74 (8%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK------KK 63
E +K I EE+EE E EE E+ +E +K +++++++ K K +
Sbjct: 129 EPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQT 188
Query: 64 EEQSVKEIEESLKE 77
++ SVK+ +S
Sbjct: 189 KKPSVKDNGKSFWS 202
>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 215
Score = 29.5 bits (66), Expect = 2.0
Identities = 11/51 (21%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
KK R++ R E + E+ + E+ + +++ ++++K+ K+ K + K+
Sbjct: 67 AKKSRVDAR--EAYRIAAEENKAEQLERDKQLSEQQKQAALAKEYKAQVKQ 115
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 30.4 bits (68), Expect = 2.0
Identities = 20/100 (20%), Positives = 41/100 (41%), Gaps = 15/100 (15%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKE---------DVEEEKNKKKKKKKKKKKKKKKK 58
E E+++ E+ NE + KE D +E++++++ + +
Sbjct: 3912 NEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQP 3971
Query: 59 KKKKKEEQSVKEIE-----ESLK-ERKEKTLTEDSQCDTM 92
++ Q E E E LK + KE +++DS + M
Sbjct: 3972 DIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDM 4011
Score = 28.4 bits (63), Expect = 8.2
Identities = 16/76 (21%), Positives = 35/76 (46%), Gaps = 3/76 (3%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQE-EKEKEDVEEEKNKKKKKK--KKKKKKKKKKKKKKKE 64
G K + N+ + EE Q+ ++E D+ E+ + +K + ++ + ++K
Sbjct: 3865 NGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSN 3924
Query: 65 EQSVKEIEESLKERKE 80
EQS E L +++
Sbjct: 3925 EQSAANNESDLVSKED 3940
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is
found in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 29.8 bits (67), Expect = 2.1
Identities = 16/58 (27%), Positives = 36/58 (62%), Gaps = 4/58 (6%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
+E ++ E ++E V EE+ +K + + K+ +K+++++++EE+ V+E LK R
Sbjct: 66 SQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQEEKQVRE----LKAR 119
>gnl|CDD|218675 pfam05648, PEX11, Peroxisomal biogenesis factor 11 (PEX11). This
family consists of several peroxisomal biogenesis factor
11 (PEX11) proteins from several eukaryotic species. The
PEX11 peroxisomal membrane proteins promote peroxisome
division in multiple eukaryotes.
Length = 225
Score = 29.5 bits (67), Expect = 2.1
Identities = 14/50 (28%), Positives = 27/50 (54%), Gaps = 3/50 (6%)
Query: 48 KKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT--LTEDSQCDTMIPV 95
++K+KK KK+K + + K +++ LK+R L ++ CD I +
Sbjct: 148 QEKEKKLKKEKDSEGESTSERKLLKKILKKRPAALLDLVKNL-CDLFIAL 196
>gnl|CDD|218274 pfam04801, Sin_N, Sin-like protein conserved region. Family of
higher eukaryotic proteins. SIN was identified as a
protein that interacts specifically with SXL (sex
lethal) in a yeast two-hybrid assay. The interaction is
mediated by one of the SXL RNA binding domains.
Length = 422
Score = 30.1 bits (68), Expect = 2.1
Identities = 15/59 (25%), Positives = 36/59 (61%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
+K++++++ E E ED +EE ++ K+ K + + +K++K+ EQS +++ + E
Sbjct: 140 DKKDKRKKEEDTADEDEDPDEEAEEELKQVTVKFSRPETEKQRKRREQSYNFLQKKIAE 198
Score = 28.9 bits (65), Expect = 4.7
Identities = 18/64 (28%), Positives = 33/64 (51%), Gaps = 1/64 (1%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
+KR KEE+ + +++ E E EE K K + + +K++K++++ K EE
Sbjct: 143 DKRKKEEDTADEDEDPDE-EAEEELKQVTVKFSRPETEKQRKRREQSYNFLQKKIAEEPW 201
Query: 76 KERK 79
E K
Sbjct: 202 IELK 205
>gnl|CDD|177400 PHA02571, a-gt.4, hypothetical protein; Provisional.
Length = 109
Score = 28.5 bits (64), Expect = 2.1
Identities = 10/43 (23%), Positives = 24/43 (55%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
+ +E ++E +E + +N+ + +KK K KK +++ K+
Sbjct: 6 TDVEELTDEEVEELLSELQARNEAEAEKKAAKILKKNRREIKR 48
>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A. This protein is encoded
by an open reading frame in plasmid borne DNA repeats of
Borrelia species. This protein is known as ORF-A. The
function of this putative protein is unknown.
Length = 285
Score = 29.7 bits (67), Expect = 2.2
Identities = 16/78 (20%), Positives = 38/78 (48%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K+K ++ + +E E++ +K+ +E+ K +K K K +K K K + +
Sbjct: 200 NKQKTLKNILYNLKDKELEKKGYKKKQIEKFFQKIYEKYKNKPHFIIEKNKYKDLNKIIG 259
Query: 70 EIEESLKERKEKTLTEDS 87
+ ++S K++K+ +
Sbjct: 260 KFKKSFKKKKKNSKKNYE 277
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 30.0 bits (68), Expect = 2.2
Identities = 18/70 (25%), Positives = 34/70 (48%), Gaps = 10/70 (14%)
Query: 10 EKKKRIEKRNKEE------EKEENEQEEK----EKEDVEEEKNKKKKKKKKKKKKKKKKK 59
++K+ IE RN + EK E +K EKE +E+ + K+ K + ++ K K
Sbjct: 519 KRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEEKEKIEKAVAELKEALKGEDVEEIKAK 578
Query: 60 KKKKEEQSVK 69
++ ++ K
Sbjct: 579 TEELQQALQK 588
Score = 29.6 bits (67), Expect = 3.4
Identities = 26/92 (28%), Positives = 50/92 (54%), Gaps = 11/92 (11%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNK------KKKKKKKKKKKKKKKK 59
+G ++ IE+ KE E N +E+K++++ E +N + +K K+ K +
Sbjct: 494 ASSGLSEEEIERMVKE--AEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAE 551
Query: 60 KKKKEEQSVKEIEESLKER---KEKTLTEDSQ 88
+K+K E++V E++E+LK + K TE+ Q
Sbjct: 552 EKEKIEKAVAELKEALKGEDVEEIKAKTEELQ 583
>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
hydrophilic C-term. This domain is a hydrophilic
region found at the C-terminus of plant and metazoan
pre-mRNA-splicing factor 38 proteins. The function is
not known.
Length = 97
Score = 28.2 bits (63), Expect = 2.2
Identities = 9/47 (19%), Positives = 31/47 (65%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
++EEE EE E +E+ + E + ++ ++ +++ +++ +++K+ ++
Sbjct: 11 DEEEESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKR 57
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 30.1 bits (68), Expect = 2.3
Identities = 21/71 (29%), Positives = 36/71 (50%), Gaps = 13/71 (18%)
Query: 20 KEEEKEENEQEEK-----EKEDVEEEKNKKKKKKKK------KKKKKKKKKKKKKEEQS- 67
KEE+ EE +QE K +K+ ++K+KK K + +KK + K E QS
Sbjct: 191 KEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLESKLEAQSK 250
Query: 68 -VKEIEESLKE 77
+ +++ LK+
Sbjct: 251 ELWSLKDDLKK 261
Score = 29.0 bits (65), Expect = 5.0
Identities = 20/82 (24%), Positives = 37/82 (45%), Gaps = 9/82 (10%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK--------KKKKKKKKKK 62
KK E + ++ E+ + E ++ K + + K KK K + +KK +
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLES 243
Query: 63 KEEQSVKEIEESLKERKEKTLT 84
K E KE+ SLK+ +K ++
Sbjct: 244 KLEAQSKEL-WSLKDDLKKHVS 264
>gnl|CDD|217145 pfam02616, ScpA_ScpB, ScpA/B protein. ScpA and ScpB participate in
chromosomal partition during cell division. It may act
via the formation of a condensin-like complex containing
smc that pull DNA away from mid-cell into both cell
halves. These proteins are part of the Kleisin
superfamily.
Length = 177
Score = 29.3 bits (66), Expect = 2.3
Identities = 16/80 (20%), Positives = 39/80 (48%), Gaps = 6/80 (7%)
Query: 12 KKRIEKRNKEEE--KEENEQEEKEKEDVEEEKNKK----KKKKKKKKKKKKKKKKKKKEE 65
+ +K +E K E+ + ++ V + + K+K+K + +KK + + + E
Sbjct: 56 RSFSKKPVTLDEFIKTLEEELVEVEKSVLDLIHAFNKLVKRKRKNRLRKKNETELEVSVE 115
Query: 66 QSVKEIEESLKERKEKTLTE 85
+ ++E+ L R ++ LT+
Sbjct: 116 EVLEELLAFLGVRLKEVLTK 135
>gnl|CDD|219083 pfam06548, Kinesin-related, Kinesin-related. This family
represents a region within kinesin-related proteins from
higher plants. Many family members also contain the
pfam00225 domain. Kinesins are ATP-driven microtubule
motor proteins that produce directed force. Some family
members are associated with the phragmoplast, a
structure composed mainly of microtubules that executes
cytokinesis in higher plants.
Length = 488
Score = 29.9 bits (67), Expect = 2.3
Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 1/35 (2%)
Query: 23 EKEENEQEEKEK-EDVEEEKNKKKKKKKKKKKKKK 56
E EE +E+ E+E K KK+ +K K+K K
Sbjct: 438 EAEEAATLAQERAMLAEQENEKLKKQIEKLKRKHK 472
>gnl|CDD|224559 COG1645, COG1645, Uncharacterized Zn-finger containing protein
[General function prediction only].
Length = 131
Score = 28.6 bits (64), Expect = 2.3
Identities = 10/58 (17%), Positives = 27/58 (46%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
E EE+E+E E + + ++ + + + KK+ E+ + +++ ++ K
Sbjct: 51 GYREVVVEEEEEEVEAEVQEQLRRSRPELPDDSDELKKEIDEDPASEKLLSVIETLKR 108
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 28.1 bits (63), Expect = 2.4
Identities = 10/32 (31%), Positives = 21/32 (65%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK 47
E+R+ + E+ +EE+ +++ E EK +K+K
Sbjct: 75 ERRSARLARLEDREEERLEKEEEREKRARKRK 106
Score = 27.3 bits (61), Expect = 4.7
Identities = 11/34 (32%), Positives = 23/34 (67%), Gaps = 2/34 (5%)
Query: 25 EENEQEEK--EKEDVEEEKNKKKKKKKKKKKKKK 56
EE + + ED EEE+ +K+++++K+ +K+K
Sbjct: 73 EEERRSARLARLEDREEERLEKEEEREKRARKRK 106
>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
This family consists of several brain acid soluble
protein 1 (BASP1) or neuronal axonal membrane protein
NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
calmodulin-binding protein of unknown function.
Length = 233
Score = 29.4 bits (65), Expect = 2.4
Identities = 25/95 (26%), Positives = 43/95 (45%), Gaps = 12/95 (12%)
Query: 6 GGAGEKKKRIEKRNKEEEKEEN--------EQEEKEKEDVEE----EKNKKKKKKKKKKK 53
GG KKK+ N E+ K+++ E+E KE+ E E + K+ K++K
Sbjct: 1 GGKLSKKKKGYNVNDEKAKDKDKKAEGAATEEEGTPKENEEAQAAAETTEVKEAKEEKPD 60
Query: 54 KKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
K + K EE+ ++ + KE K E ++
Sbjct: 61 KDAQDTANKTEEKEGEKEAAAAKEEAPKAEPEKTE 95
Score = 29.4 bits (65), Expect = 2.4
Identities = 18/72 (25%), Positives = 37/72 (51%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K+ GA +++ K N+E + E KE ++ + +K+ + K ++K+ +K+
Sbjct: 23 KKAEGAATEEEGTPKENEEAQAAAETTEVKEAKEEKPDKDAQDTANKTEEKEGEKEAAAA 82
Query: 63 KEEQSVKEIEES 74
KEE E E++
Sbjct: 83 KEEAPKAEPEKT 94
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 29.4 bits (67), Expect = 2.4
Identities = 25/93 (26%), Positives = 46/93 (49%), Gaps = 15/93 (16%)
Query: 8 AGEKKKRIEKRNKEE---EKEENEQEEKEKED---------VEEEKNKKKKK---KKKKK 52
A KK KEE + E E+E KE+ + +++E+ +K +KK++
Sbjct: 47 AEALKKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEE 106
Query: 53 KKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
++K+K+ +Q ++E EE L+E E+ E
Sbjct: 107 SLEEKEKELAARQQQLEEKEEELEELIEEQQQE 139
Score = 28.7 bits (65), Expect = 3.5
Identities = 21/92 (22%), Positives = 49/92 (53%), Gaps = 8/92 (8%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--------KKKKKKK 61
E KK E KE E E+ K + + E E +++ + ++++K+ +K +
Sbjct: 42 EAKKEAEALKKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESL 101
Query: 62 KKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
+K+E+S++E E+ L R+++ ++ + + +I
Sbjct: 102 EKKEESLEEKEKELAARQQQLEEKEEELEELI 133
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 28.1 bits (63), Expect = 2.4
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 20 KEEEKEENEQEEKEKEDVEEE 40
E +E E+EEKE+E EE
Sbjct: 79 AAAEADEAEEEEKEEEAEEES 99
Score = 28.1 bits (63), Expect = 2.7
Identities = 10/29 (34%), Positives = 15/29 (51%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKED 36
AG + +EEEKEE +EE + +
Sbjct: 75 AGAEAAAEADEAEEEEKEEEAEEESDDDM 103
>gnl|CDD|227479 COG5150, COG5150, Class 2 transcription repressor NC2, beta subunit
(Dr1) [Transcription].
Length = 148
Score = 28.8 bits (64), Expect = 2.4
Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 3/80 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E KK I + + E E EE + +EE +N K +K+K+ K K K E+ ++
Sbjct: 61 EAKKTIAYEHVIKALENLEFEEYIESCMEEHENYKSYQKQKESKISKFKDSGLSMEELLR 120
Query: 70 EIEESL---KERKEKTLTED 86
+ EE + R E + +D
Sbjct: 121 QQEELFQNARLRFEHSFEDD 140
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 27.9 bits (63), Expect = 2.4
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKK 63
++E+ +K KKK KK + ++ KKK
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKKVF 93
>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
Reviewed.
Length = 830
Score = 30.1 bits (68), Expect = 2.4
Identities = 9/44 (20%), Positives = 24/44 (54%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
K+++ +R ++ + ++ ++ E E+ K K K +K + K+
Sbjct: 19 VKQKVSRRRYRDDDDYDDYDDYEDEEPMPRKGKGKGRKPRGKRG 62
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 30.0 bits (67), Expect = 2.4
Identities = 16/49 (32%), Positives = 22/49 (44%), Gaps = 8/49 (16%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKE--------DVEEEKNKKKKKKKKKK 52
K IE RN EE + E E E+ E E D +E +N + +K
Sbjct: 372 KMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELENHFRAADEKNS 420
>gnl|CDD|215038 PLN00040, PLN00040, Protein MAK16 homolog; Provisional.
Length = 233
Score = 29.4 bits (66), Expect = 2.4
Identities = 9/37 (24%), Positives = 16/37 (43%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
K K +E +E E+ + +K KKK + +
Sbjct: 183 KSYNKVLEMEEVEEAEEELPKSDKNPNSKKKSRVHVE 219
>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
Length = 746
Score = 29.8 bits (67), Expect = 2.4
Identities = 12/57 (21%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 9 GEKKKRIEKRNKEEEKEENEQ--EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
G+ + R +K + +EN+Q E+ +E ++ NK+ + +++ ++K+K+
Sbjct: 679 GQVRVRYDKELLSKMYQENQQKLAEEAREKALKDANKRAPIVAATQAREEVREKRKQ 735
Score = 29.0 bits (65), Expect = 4.3
Identities = 6/48 (12%), Positives = 21/48 (43%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
K ++ ++ +E E+ K+ + +++ ++K+K+
Sbjct: 691 SKMYQENQQKLAEEAREKALKDANKRAPIVAATQAREEVREKRKQTSG 738
Score = 29.0 bits (65), Expect = 4.5
Identities = 7/43 (16%), Positives = 20/43 (46%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+ ++K E EK +D + + +++ ++K+K+
Sbjct: 696 ENQQKLAEEAREKALKDANKRAPIVAATQAREEVREKRKQTSG 738
>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
secretion].
Length = 407
Score = 29.6 bits (67), Expect = 2.4
Identities = 15/78 (19%), Positives = 28/78 (35%), Gaps = 5/78 (6%)
Query: 12 KKRIEK----RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
KK E + + E + + K K + + K +KK+++ +
Sbjct: 35 KKGGFLGLLFSKTAVEVTKLAAVDSELRKFQTREAAKVKDAEAQLLKDPAEKKREERKA- 93
Query: 68 VKEIEESLKERKEKTLTE 85
K+IE S E+ E
Sbjct: 94 AKKIERSTPSLIERKTQE 111
Score = 28.1 bits (63), Expect = 7.4
Identities = 10/77 (12%), Positives = 31/77 (40%), Gaps = 3/77 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EKK+ K K+ E+ E++ +E K+ ++ + + +K ++ ++
Sbjct: 85 EKKREERKAAKKIERSTPSLIERK---TQEVKDSGEEIAEMMRDEKVPIRELEEIPPEFV 141
Query: 70 EIEESLKERKEKTLTED 86
+ + + + +
Sbjct: 142 ALYKQEIQSPTRLNLIN 158
>gnl|CDD|219174 pfam06780, Erp_C, Erp protein C-terminus. This family represents
the C-terminus of bacterial Erp proteins that seem to be
specific to Borrelia burgdorferi (a causative agent of
Lyme disease). Borrelia Erp proteins are particularly
heterogeneous, which might enable them to interact with
a wide variety of host components.
Length = 141
Score = 28.9 bits (65), Expect = 2.5
Identities = 18/60 (30%), Positives = 30/60 (50%)
Query: 22 EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
EE+E++ + KE + + + K + K K + K KE V EI+E L++ K K
Sbjct: 58 EEEEDSGLGKLLKELEDARDSLRTKLNEGNKPYTKDNEPKLKENVKVSEIKEDLEKLKSK 117
>gnl|CDD|218584 pfam05422, SIN1, Stress-activated map kinase interacting protein 1
(SIN1). This family consists of several
stress-activated map kinase interacting protein 1
(MAPKAP1 OR SIN1) sequences. The fission yeast Sty1/Spc1
mitogen-activated protein (MAP) kinase is a member of
the eukaryotic stress-activated MAP kinase (SAPK)
family. Sin1 interacts with Sty1/Spc1. Cells lacking
Sin1 display many, but not all, of the phenotypes of
cells lacking the Sty1/Spc1 MAP kinase including
sterility, multiple stress sensitivity and a cell-cycle
delay. Sin1 is phosphorylated after stress but this is
not Sty1/Spc1-dependent.
Length = 482
Score = 29.6 bits (66), Expect = 2.5
Identities = 11/90 (12%), Positives = 30/90 (33%), Gaps = 9/90 (10%)
Query: 1 MRKRRGGAGEKKK-------RIEKRNKEEEKEENEQEEKEKEDVEEEKN--KKKKKKKKK 51
+ +R A E ++ K++ + K + + + E KK+
Sbjct: 93 IHRRSNTAQELERLDQAVNLTSAKQSAIKIKSSVSTDYDDLRSISELDFLFSKKELPLTT 152
Query: 52 KKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
K + + + ++ L+ + E+
Sbjct: 153 HNTVNKARSVSNAKAPISGLQSLLEHKLEE 182
>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
[Translation, ribosomal structure and biogenesis].
Length = 569
Score = 29.7 bits (66), Expect = 2.5
Identities = 17/78 (21%), Positives = 38/78 (48%), Gaps = 5/78 (6%)
Query: 11 KKKRIEKRNKEEEKEE-NEQEEKEKEDV----EEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+ K ++E ++EE + +K K D+ E+E N + + K +K++ + +
Sbjct: 226 SDVKDPKEDEELDEEEHDSAMDKVKLDLFADEEDEPNAEGVGEASDKNLSSFEKQQIEMD 285
Query: 66 QSVKEIEESLKERKEKTL 83
+ ++E+E+ L KE
Sbjct: 286 EQIEELEKELVAPKEWKY 303
Score = 28.1 bits (62), Expect = 8.9
Identities = 22/84 (26%), Positives = 34/84 (40%), Gaps = 19/84 (22%)
Query: 15 IEKRNK-----EEEKEENEQEEKE--------KEDVEEEKNK------KKKKKKKKKKKK 55
+EK N+ E+ E +E E E D EEE+ K K+ KK
Sbjct: 167 LEKFNRDTLAAEDSNEASEGSEDEDIDYFQDMPSDDEEEEAIYYEDFFDKPTKEPVKKHS 226
Query: 56 KKKKKKKKEEQSVKEIEESLKERK 79
K K+ EE +E + ++ + K
Sbjct: 227 DVKDPKEDEELDEEEHDSAMDKVK 250
>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein. The PcfJ-like protein family
includes the E. faecalis PcfJ protein, which is
functionally uncharacterized. It is found in bacteria
and viruses, and is typically between 159 and 170 amino
acids in length. There is a conserved HCV sequence
motif.
Length = 169
Score = 28.8 bits (65), Expect = 2.5
Identities = 9/35 (25%), Positives = 19/35 (54%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKK 47
+ + K ++EKE+ E+ K ++E+ + K K
Sbjct: 33 RAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKSK 67
Score = 28.8 bits (65), Expect = 2.8
Identities = 8/48 (16%), Positives = 25/48 (52%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K +++ + + +E++ +K +K+ + +K ++ K K E + +
Sbjct: 28 KAAHDRAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKSKFEFTDGDY 75
Score = 27.3 bits (61), Expect = 9.1
Identities = 10/42 (23%), Positives = 21/42 (50%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+ + K + E+EK +K +K+ + +K ++ K K E
Sbjct: 28 KAAHDRAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKSKFE 69
>gnl|CDD|226369 COG3851, UhpB, Signal transduction histidine kinase,
glucose-6-phosphate specific [Signal transduction
mechanisms].
Length = 497
Score = 29.7 bits (67), Expect = 2.6
Identities = 14/41 (34%), Positives = 19/41 (46%), Gaps = 2/41 (4%)
Query: 228 TWILLNTQGFISQGIGLNNKEYLIWYILFIVFVTYAMLPLP 268
TW+ L I I L K +L+WY+L + L LP
Sbjct: 162 TWLPLGPS-LIHHPINLRGK-HLVWYLLLFIIGLLLQLGLP 200
>gnl|CDD|217970 pfam04220, YihI, Der GTPase activator (YihI). YihI activates the
GTPase activity of Der, a 50S ribosomal subunit
stability factor. The stimulation is specific to Der as
YihI does not stimulate the GTPase activity of Era or
ObgE. The interaction of YihI with Der requires only
the C-terminal 78 amino acids of YihI. A yihI deletion
mutant is viable and shows a shorter lag period, but
the same post-lag growth rate as a wild-type strain.
yihI is expressed during the lag period. Overexpression
of yihI inhibits cell growth and biogenesis of the 50S
ribosomal subunit. YihI is an unusual, highly
hydrophilic protein with an uneven distribution of
charged residues, resulting in an N-terminal region
with high pI and a C-terminal region with low pI.
Length = 169
Score = 28.9 bits (65), Expect = 2.6
Identities = 19/66 (28%), Positives = 31/66 (46%), Gaps = 17/66 (25%)
Query: 10 EKKKRIEK------RNKEEEKEENEQEEKEKED-----------VEEEKNKKKKKKKKKK 52
K+K+ K R+ EE + + ++ +K+D + E K K +K+ KKK
Sbjct: 29 RKRKKKRKGLKSGSRHNEESESQKQKGAAQKKDPRIGSKKPVPLIVEFKVKPEKQAPKKK 88
Query: 53 KKKKKK 58
K KK K
Sbjct: 89 KVKKPK 94
>gnl|CDD|217452 pfam03250, Tropomodulin, Tropomodulin. Tropomodulin is a novel
tropomyosin regulatory protein that binds to the end of
erythrocyte tropomyosin and blocks head-to-tail
association of tropomyosin along actin filaments.
Limited proteolysis shows this protein is composed of
two domains. The amino terminal domain contains the
tropomyosin binding function.
Length = 147
Score = 28.9 bits (65), Expect = 2.7
Identities = 16/60 (26%), Positives = 29/60 (48%), Gaps = 2/60 (3%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV--KEIEESLKERKEKTLTE 85
E++ E +D E+ +K+ K K+K ++ EEQ E+EE+L + L +
Sbjct: 74 EKQALEHKDREDLVPFTGEKRGKVFVPKQKPRELPAEEQITLDPELEEALANATDAELCD 133
>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus. This family contains the N
terminal region of several LUC7 protein homologues and
only contains eukaryotic proteins. LUC7 has been shown
to be a U1 snRNA associated protein with a role in
splice site recognition. The family also contains human
and mouse LUC7 like (LUC7L) proteins and human cisplatin
resistance-associated overexpressed protein (CROP).
Length = 252
Score = 29.3 bits (66), Expect = 2.7
Identities = 17/85 (20%), Positives = 40/85 (47%), Gaps = 4/85 (4%)
Query: 10 EKKKRIEKRNKEEEKEE--NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
++K K+ E +EE + + E++ E +++ K + + ++ K + +
Sbjct: 96 DRKIDKAKQRLELTQEEQTKIAADSKAEEL-AELDEEIGKLLAEAEALGEEGKVDEAMKL 154
Query: 68 VKEIEESLKERKEKTLTEDSQCDTM 92
+KE+EE LK +K++ D +
Sbjct: 155 MKEVEE-LKAKKKELEDSDEVRNAA 178
>gnl|CDD|220121 pfam09135, Alb1, Alb1. Alb1 is a nuclear shuttling factor
involved in ribosome biogenesis.
Length = 112
Score = 28.2 bits (63), Expect = 2.7
Identities = 11/26 (42%), Positives = 21/26 (80%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKE 64
E +KKK++KKK K+ +K++K++K+
Sbjct: 44 EGVSKKKQRKKKPKRLTRKQRKRQKK 69
Score = 28.2 bits (63), Expect = 3.0
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E KK++KKK K+ +K++K++KK
Sbjct: 40 ASKNEGVSKKKQRKKKPKRLTRKQRKRQKK 69
Score = 27.5 bits (61), Expect = 5.1
Identities = 9/26 (34%), Positives = 19/26 (73%)
Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQS 67
KKK++KKK K+ +K++K++++
Sbjct: 45 GVSKKKQRKKKPKRLTRKQRKRQKKG 70
Score = 27.1 bits (60), Expect = 7.6
Identities = 10/25 (40%), Positives = 19/25 (76%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKE 64
+K ++KKK K+ +K++K++KK E
Sbjct: 48 KKKQRKKKPKRLTRKQRKRQKKGLE 72
Score = 27.1 bits (60), Expect = 7.9
Identities = 11/28 (39%), Positives = 21/28 (75%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSV 68
K K++KKK K+ +K++K++KK E++
Sbjct: 48 KKKQRKKKPKRLTRKQRKRQKKGLERAE 75
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 29.9 bits (67), Expect = 2.7
Identities = 18/75 (24%), Positives = 31/75 (41%), Gaps = 8/75 (10%)
Query: 7 GAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
G K+KR E E ED +E +K++K+++++ K + K +
Sbjct: 353 GPTRKRKRRRVPP------LPEYSSDEDEDDSDEDEVDYEKERKRRREEDKNFLRLKALE 406
Query: 67 SVKEIEESLKERKEK 81
K + ER EK
Sbjct: 407 LSKYA--GVNERMEK 419
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 29.3 bits (66), Expect = 2.8
Identities = 18/67 (26%), Positives = 28/67 (41%), Gaps = 9/67 (13%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQ---------EEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
E ++ EK KE+ K + +Q E +K E + + + + KKKK
Sbjct: 157 KLELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKKKK 216
Query: 59 KKKKKEE 65
K K EE
Sbjct: 217 AKSKYEE 223
Score = 27.8 bits (62), Expect = 9.2
Identities = 15/47 (31%), Positives = 24/47 (51%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
K K+KK++ K +KK+ +K EE K +E L + E + D
Sbjct: 164 KEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEYDR 210
>gnl|CDD|193580 cd09891, NGN_Bact_1, Bacterial N-Utilization Substance G (NusG)
N-terminal (NGN) domain, subgroup 1. The N-Utilization
Substance G (NusG) protein is involved in transcription
elongation and termination in bacteria. NusG is
essential in Escherichia coli and associates with RNA
polymerase elongation and Rho-termination. Homologs of
the NusG gene exist in all bacteria. The NusG
N-terminal domain (NGN) is similar in all NusG
homologs, but its C-terminal domain and the linker that
separates these two domains are different. The domain
organization of NusG suggests that the common
properties of NusG and its homologs are due to their
similar NGN domains.
Length = 107
Score = 28.2 bits (64), Expect = 2.9
Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 12 KKRIEKRNKEEEKEENEQE-EKEKEDVEEEKNKKKKKKKKK 51
K+ +EKR + E E+ E E+V E KN KKK K++K
Sbjct: 16 KENLEKRIESEGLEDYIGEVLVPTEEVVEVKNGKKKVKERK 56
>gnl|CDD|180161 PRK05609, nusG, transcription antitermination protein NusG;
Validated.
Length = 181
Score = 29.0 bits (66), Expect = 2.9
Identities = 15/41 (36%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 12 KKRIEKRNKEEEKEENEQE-EKEKEDVEEEKNKKKKKKKKK 51
K+ +E R + E+ E E+V E KN KKKK ++K
Sbjct: 22 KENLENRIETLGMEDLIGEVLVPTEEVVEVKNGKKKKVERK 62
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 29.8 bits (67), Expect = 2.9
Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 3/78 (3%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K + EEE E + +K +E ++ K+KK + + + K K +E K+
Sbjct: 351 KLLALSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDN 410
Query: 72 EE--SLKERKEKTLTEDS 87
+ S K + + +S
Sbjct: 411 KNKNSSFINKTENILTNS 428
Score = 28.6 bits (64), Expect = 6.5
Identities = 15/63 (23%), Positives = 27/63 (42%), Gaps = 3/63 (4%)
Query: 15 IEKRNKEEEKEENEQEEKEKED---VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
I+ + +E+K ENE + + K D +EE K K K K + + +
Sbjct: 376 IDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNSSFINKTENILTNSPLKDELL 435
Query: 72 EES 74
E++
Sbjct: 436 EKT 438
>gnl|CDD|148790 pfam07376, Prosystemin, Prosystemin. This family consists of
several plant specific prosystemin proteins.
Prosystemin is the precursor protein of the 18 amino
acid wound signal systemin which activates systemic
defence in plant leaves against insect herbivores.
Length = 193
Score = 28.9 bits (63), Expect = 2.9
Identities = 19/91 (20%), Positives = 47/91 (51%), Gaps = 4/91 (4%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK----KKKK 58
K +G ++K ++E +EKE+ ++E +D++ + + ++ K + ++ K+K
Sbjct: 9 KNKGDDVQEKTKLEHEKGGDEKEKIIEKETPSQDIKNKDDDAQEIPKVEHEEGGDEKEKI 68
Query: 59 KKKKKEEQSVKEIEESLKERKEKTLTEDSQC 89
+K+ + + +IE + K K E+ +C
Sbjct: 69 IEKETISKCIIKIEGDDAQEKIKVEYEEEEC 99
Score = 27.3 bits (59), Expect = 9.7
Identities = 21/87 (24%), Positives = 43/87 (49%), Gaps = 6/87 (6%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVE------EEKNKKKKKKKKKKKKKK 56
K G ++K ++E +E EKE+ ++E +D+ +EK K + ++ +K+
Sbjct: 80 KIEGDDAQEKIKVEYEEEECEKEKIVEKETPSQDIGNKGDDAQEKPKVEHEEDGDEKETP 139
Query: 57 KKKKKKKEEQSVKEIEESLKERKEKTL 83
+ K E + +EI + E +EK +
Sbjct: 140 SQDIIKIEGEDAQEIPKVECEEREKIV 166
>gnl|CDD|223831 COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase
[Posttranslational modification, protein turnover,
chaperones].
Length = 320
Score = 29.3 bits (65), Expect = 3.0
Identities = 9/53 (16%), Positives = 18/53 (33%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K K ++ +QE +K +E+ ++K K K+
Sbjct: 263 RDAKLDFPEVKAVKEISLKQELLQKAALEKLLKGEQKSFDKLLKEAAIDILSA 315
>gnl|CDD|225048 COG2137, OraA, Uncharacterized protein conserved in bacteria
[General function prediction only].
Length = 174
Score = 28.8 bits (65), Expect = 3.0
Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 3/64 (4%)
Query: 2 RKRRGGAGEKK---KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
+ R G G + + +K +E EE + E+++ E + +KK K++ K KK
Sbjct: 79 SRSRKGKGPARLKQELKQKGIDDEIIEEALELIDEEDEQERARKVLRKKFKRENKPPDKK 138
Query: 59 KKKK 62
+K K
Sbjct: 139 EKAK 142
>gnl|CDD|215822 pfam00257, Dehydrin, Dehydrin.
Length = 137
Score = 28.6 bits (64), Expect = 3.0
Identities = 14/58 (24%), Positives = 22/58 (37%), Gaps = 5/58 (8%)
Query: 6 GGAGEKKKRIEKRNKE-----EEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
G GE+KK I+++ KE + ++KK K K K+K
Sbjct: 73 GQGGERKKGIKEKIKEKLPGGHGGKGQATTTTTTGTGGGYGQHGHEEKKGKMDKIKEK 130
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 29.6 bits (66), Expect = 3.0
Identities = 12/41 (29%), Positives = 26/41 (63%)
Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
++K +++++ K K KK+++K +Q+ KE E +E+K
Sbjct: 55 YQRKGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTA 95
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 29.6 bits (67), Expect = 3.1
Identities = 9/36 (25%), Positives = 17/36 (47%)
Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
++KKK KK +K+ + + K E K++
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQR 119
Score = 29.6 bits (67), Expect = 3.2
Identities = 11/37 (29%), Positives = 21/37 (56%), Gaps = 1/37 (2%)
Query: 44 KKKKKKKKKKKKKKKKKKKKEEQSVKEI-EESLKERK 79
++KKK KK +K+ + + ++ KE+ S K+R
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120
>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein. This is a novel
repeat in Pneumocystis carinii Major surface
glycoprotein (MSG) some members of the alignment have
up to nine repeats of this family, the repeats
containing several conserved cysteines. The MSG of P.
carinii is an important protein in host-pathogen
interactions. Surface glycoprotein A from Pneumocystis
carinii is a main target for the host immune system,
this protein is implicated in the attachment of
Pneumocystis carinii to the host alveolar epithelial
cells, alveolar macrophages, host surfactant and
possibly accounts in part for the hypoxia seen in
Pneumocystis carinii pneumonia (PCP).
Length = 81
Score = 27.4 bits (61), Expect = 3.1
Identities = 21/63 (33%), Positives = 36/63 (57%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
++ + E++ E EE + E K+K +K K+K K KK K EE +KE++ LK+
Sbjct: 1 KDFKLEEKCKELEEYCEFLEEACPLLKEKCEKLKEKCYKLKKDLKLEELLLKELKGELKK 60
Query: 78 RKE 80
+K+
Sbjct: 61 KKK 63
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 29.6 bits (66), Expect = 3.1
Identities = 13/71 (18%), Positives = 37/71 (52%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
+ ++++ + + E + E ++ +++D E +K +++K++ K+ KK +
Sbjct: 883 RQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQDK 942
Query: 68 VKEIEESLKER 78
V +I+E +K
Sbjct: 943 VNDIKEKVKNI 953
>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
Length = 145
Score = 28.5 bits (64), Expect = 3.1
Identities = 13/23 (56%), Positives = 14/23 (60%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKK 63
K KKK KK K KK K KK+K
Sbjct: 6 KAKKKAKKAAKAAKKGVKVKKRK 28
Score = 28.5 bits (64), Expect = 3.5
Identities = 12/24 (50%), Positives = 13/24 (54%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKK 63
K KKK KK K KK K KK+
Sbjct: 4 AKKAKKKAKKAAKAAKKGVKVKKR 27
Score = 27.0 bits (60), Expect = 8.9
Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
+ KK KKK KK K KK K ++ +++ S++ + KTL
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKK---RKVRTSVRFFRPKTL 41
>gnl|CDD|240272 PTZ00109, PTZ00109, DNA gyrase subunit b; Provisional.
Length = 903
Score = 29.5 bits (66), Expect = 3.2
Identities = 14/44 (31%), Positives = 22/44 (50%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
K ++ + + D E+KN K KK K K K+ K++E S
Sbjct: 197 KFQDTFPKNSRSDKSEDKNDTKSSKKGKSSHVKGPKEAKEKESS 240
>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
Length = 111
Score = 28.0 bits (62), Expect = 3.2
Identities = 16/58 (27%), Positives = 33/58 (56%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
KE+ E +KE +++ E+ + K+ K K ++K KK +++ + E+ KE ++K
Sbjct: 16 KEKEESIDKEIANIKNEQENEIKEAKSKYEEKAKKTEEESLNMYNAALMEARKEAEKK 73
Score = 26.4 bits (58), Expect = 9.9
Identities = 17/74 (22%), Positives = 30/74 (40%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
+ K E E E KE + EEK KK +++ + +K+ E+ EI K
Sbjct: 23 DKEIANIKNEQENEIKEAKSKYEEKAKKTEEESLNMYNAALMEARKEAEKKAVEIINKAK 82
Query: 77 ERKEKTLTEDSQCD 90
+ + S+ +
Sbjct: 83 QEASLIKLKISEDE 96
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 29.4 bits (66), Expect = 3.2
Identities = 10/32 (31%), Positives = 22/32 (68%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
+ K + K +K K+ KK+KK++++++ E+E
Sbjct: 22 RPVKDEAKPRKIKRVKKRKKREEKDELDDEVE 53
Score = 28.6 bits (64), Expect = 5.8
Identities = 11/38 (28%), Positives = 23/38 (60%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
K++ K +K K+ KK+KK+++K + + V+ + R
Sbjct: 25 KDEAKPRKIKRVKKRKKREEKDELDDEVEFVRSFAPRR 62
Score = 28.6 bits (64), Expect = 5.9
Identities = 9/39 (23%), Positives = 21/39 (53%)
Query: 35 EDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
+ + E + K + K +K K+ KK+K+ + E+++
Sbjct: 12 QILAPEIYGPRPVKDEAKPRKIKRVKKRKKREEKDELDD 50
>gnl|CDD|217483 pfam03305, Lipoprotein_X, Mycoplasma MG185/MG260 protein. Most of
the aligned regions in this family are found towards the
middle of the member proteins.
Length = 237
Score = 28.9 bits (65), Expect = 3.2
Identities = 10/75 (13%), Positives = 27/75 (36%), Gaps = 6/75 (8%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKED------VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
K ++ K +K K + +KN + K K+ K + + + +
Sbjct: 153 KDLKAPTKNNQKGGALGSVTSKGNANLLGKFTLDKNSLTSPEVKAFKEADKSEDPQTDSK 212
Query: 67 SVKEIEESLKERKEK 81
+++ ++ + K
Sbjct: 213 TIELYKDKIPTDKNA 227
>gnl|CDD|218657 pfam05615, THOC7, Tho complex subunit 7. The Tho complex is
involved in transcription elongation and mRNA export
from the nucleus.
Length = 132
Score = 28.4 bits (64), Expect = 3.2
Identities = 19/66 (28%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 18 RNKEEEKEENEQEEKEKE-DVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
E E+E EQE++E E +E K++ ++ KK+ ++ K+ +K +K+ ++ K+I
Sbjct: 63 EMNERERENYEQEKEEIETTIENAKDEIEELKKELEEAKRIRKNRKEYDELAKKITSLPV 122
Query: 77 ERKEKT 82
R E
Sbjct: 123 TRAETE 128
>gnl|CDD|148701 pfam07246, Phlebovirus_NSM, Phlebovirus nonstructural protein NS-M.
This family consists of several Phlebovirus
nonstructural NS-M proteins which represent the
N-terminal region of the M polyprotein precursor. The
function of this family is unknown.
Length = 264
Score = 29.1 bits (64), Expect = 3.3
Identities = 21/97 (21%), Positives = 49/97 (50%), Gaps = 3/97 (3%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
K + A EK I+++ KE E++ Q + ++++KN+ +K + K+ ++ + K
Sbjct: 144 EKLKKNAEEKDATIQRKTKEMEEDSRNQIAHHE--IQQKKNEIQKLRNDLKRGQEHRDAK 201
Query: 62 KKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKK 98
+ E+S+ ++ LKE ++ E + + +K
Sbjct: 202 LRVEESMARVKG-LKEELKQLTDELQHAKSEMQSLRK 237
>gnl|CDD|221805 pfam12848, ABC_tran_2, ABC transporter. This domain is related
to pfam00005.
Length = 85
Score = 27.5 bits (62), Expect = 3.3
Identities = 14/67 (20%), Positives = 36/67 (53%), Gaps = 4/67 (5%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+++ E+ ++E+ E +Q+E +K EE ++ + K K K+ + + K +K + +
Sbjct: 20 EQKEERLEQQEKAYEKQQKEIKKL--EEFIDRFRAKASKAKQAQSRIKALEKMERIEKPV 77
Query: 70 EIEESLK 76
E + ++
Sbjct: 78 EKKPKIR 84
>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family. Members of this family are
short proteins that are rich in aspartate, glutamate,
lysine and arginine. Although the function of these
proteins is unknown, they are found to be ubiquitously
expressed.
Length = 38
Score = 26.0 bits (58), Expect = 3.3
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKK 63
E+N KK KKKKK K + K K
Sbjct: 12 ERNAKKAAKKKKKGAKSQLKAAAK 35
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 29.1 bits (65), Expect = 3.3
Identities = 17/79 (21%), Positives = 34/79 (43%), Gaps = 4/79 (5%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
++ K K+ + K ++ K+K K KK K K KK KK
Sbjct: 200 KKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAA----KTAVSAKKAAKTAAKAAKKAKKT 255
Query: 64 EEQSVKEIEESLKERKEKT 82
++++K+ +++K+ +K
Sbjct: 256 AKKALKKAAKAVKKAAKKA 274
Score = 29.1 bits (65), Expect = 3.7
Identities = 20/96 (20%), Positives = 33/96 (34%), Gaps = 3/96 (3%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A + +K K K + ++ K+ + K K KK KK K K K
Sbjct: 231 AAKTAVSAKKAAKTAAKAAKKAKKTAKKAL---KKAAKAVKKAAKKAAKAAAKAAKGAAK 287
Query: 68 VKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYK 103
+ + K++ K S+ K+G K
Sbjct: 288 ATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGK 323
Score = 28.3 bits (63), Expect = 6.3
Identities = 14/86 (16%), Positives = 31/86 (36%)
Query: 3 KRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
K+ + + + K + + ++ + +K KK KK KK K KK K
Sbjct: 213 KKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAK 272
Query: 63 KEEQSVKEIEESLKERKEKTLTEDSQ 88
K ++ + + + + +
Sbjct: 273 KAAKAAAKAAKGAAKATKGKAKAKKK 298
Score = 27.9 bits (62), Expect = 9.0
Identities = 19/80 (23%), Positives = 32/80 (40%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+K A K++ K+ + ++ K ++ K KK KK K KK K
Sbjct: 213 KKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAK 272
Query: 62 KKEEQSVKEIEESLKERKEK 81
K + + K + + K K K
Sbjct: 273 KAAKAAAKAAKGAAKATKGK 292
>gnl|CDD|219289 pfam07083, DUF1351, Protein of unknown function (DUF1351). This
family consists of several bacterial and phage proteins
of around 230 residues in length. The function of this
family is unknown.
Length = 220
Score = 28.9 bits (65), Expect = 3.3
Identities = 19/80 (23%), Positives = 33/80 (41%), Gaps = 8/80 (10%)
Query: 10 EKKKRIEKRNKEEEKEEN--------EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
K E++ KE ++ E+ +K D EE+ + KK K K KK +
Sbjct: 96 SGIKEFEEKQKELRRDTVKEYLSEKAEEYGLDKRDFEEKYDDYVKKSKWLNKSVSLKKIE 155
Query: 62 KKEEQSVKEIEESLKERKEK 81
+ + + + L+ERK
Sbjct: 156 ESIDTLILSEAQKLEERKAA 175
>gnl|CDD|219106 pfam06614, Neuromodulin, Neuromodulin. This family consists of
several neuromodulin (Axonal membrane protein GAP-43)
sequences and is found in conjunction with pfam00612.
GAP-43 is a neuronal calmodulin-binding phosphoprotein
that is concentrated in growth cones and pre-synaptic
terminals.
Length = 174
Score = 28.7 bits (63), Expect = 3.3
Identities = 13/38 (34%), Positives = 26/38 (68%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
E+E E+ QEE++K+ VEE K + ++++ K+++ K
Sbjct: 131 PEQETAESSQEEEKKDAVEETKPSESAQQEEAKEEEAK 168
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 29.1 bits (65), Expect = 3.4
Identities = 19/86 (22%), Positives = 45/86 (52%), Gaps = 2/86 (2%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEE-KNKKKKKKKKKKKKKKKKK 59
+++ EK + ++K KEE +E E+ E E E+V+E K + + + ++ KK
Sbjct: 133 LKEDYEELKEKLEELQKE-KEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPG 191
Query: 60 KKKKEEQSVKEIEESLKERKEKTLTE 85
+ ++ E+E ++ +E+ +++
Sbjct: 192 EVYDLKKRWDELEPGVELPEEELISD 217
>gnl|CDD|212057 cd11488, SLC5sbd_SGLT4, Na(+)/glucose cotransporter SGLT4 and
related proteins; solute-binding domain. Human SGLT4
(hSGLT4) has been reported to be a low-affinity glucose
transporter with unusual sugar selectivity: it
transports D-mannose but not galactose or
3-O-methyl-D-glucoside. It is encoded by the SLC5A9 gene
and is expressed in intestine, kidney, liver, brain,
lung, trachea, uterus, and pancreas. hSLGT4 is predicted
to contain 14 membrane-spanning regions. This subgroup
belongs to the solute carrier 5 (SLC5 )transporter
family.
Length = 605
Score = 29.5 bits (66), Expect = 3.4
Identities = 17/70 (24%), Positives = 35/70 (50%), Gaps = 6/70 (8%)
Query: 91 TMIPVFKKGVLYKGIYCP---SLTNSFREPSLEMS---YQRYSHRQRQKSLIIVNLVDMV 144
++PV +G++ I SLT+ F S + +QR ++ L++V V ++
Sbjct: 340 ELMPVGLRGLMIAVIMAALMSSLTSIFNSSSTLFTIDVWQRIRRNASEQELMVVGRVFIL 399
Query: 145 LKAVLTVAWI 154
+ V+++ WI
Sbjct: 400 ILVVISILWI 409
>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
Length = 444
Score = 29.3 bits (66), Expect = 3.5
Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 7/76 (9%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK-------KKKKK 63
I KEE E +++ D EEE ++++K +K + + K+ K
Sbjct: 220 YSSEIVNEIKEEVCETDDESAYVSLDAEEEFEEEEEKPAEKSTESTFQLSKETSIAKESK 279
Query: 64 EEQSVKEIEESLKERK 79
E +EIE +ER
Sbjct: 280 ELPDGEEIEFGNEERF 295
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 29.3 bits (66), Expect = 3.5
Identities = 10/34 (29%), Positives = 17/34 (50%)
Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+ + + +K++ KK+KKKK KK K
Sbjct: 341 HSPPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374
Score = 28.9 bits (65), Expect = 4.1
Identities = 12/45 (26%), Positives = 18/45 (40%), Gaps = 4/45 (8%)
Query: 22 EEKEENEQEEKEKEDVEEEKNK----KKKKKKKKKKKKKKKKKKK 62
+ E+ K+K++ KK+KKKK KK K
Sbjct: 330 STETPPPASLSHSPPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374
>gnl|CDD|219419 pfam07462, MSP1_C, Merozoite surface protein 1 (MSP1) C-terminus.
This family represents the C-terminal region of
merozoite surface protein 1 (MSP1) which are found in a
number of Plasmodium species. MSP-1 is a 200-kDa protein
expressed on the surface of the P. vivax merozoite.
MSP-1 of Plasmodium species is synthesised as a
high-molecular-weight precursor and then processed into
several fragments. At the time of red cell invasion by
the merozoite, only the 19-kDa C-terminal fragment
(MSP-119), which contains two epidermal growth
factor-like domains, remains on the surface. Antibodies
against MSP-119 inhibit merozoite entry into red cells,
and immunisation with MSP-119 protects monkeys from
challenging infections. Hence, MSP-119 is considered a
promising vaccine candidate.
Length = 574
Score = 29.5 bits (66), Expect = 3.5
Identities = 17/71 (23%), Positives = 34/71 (47%), Gaps = 1/71 (1%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
KK+ I + + + KE ++ ++ K K K ++ KK++ + + +K+
Sbjct: 44 KKELITALTETKVNALYLEIAHLKELLQHSYDRYYKYKLKLERLYNKKEQIGQSKMQIKK 103
Query: 71 IEESLKERKEK 81
+ LKER EK
Sbjct: 104 L-TLLKERLEK 113
>gnl|CDD|235841 PRK06606, PRK06606, branched-chain amino acid aminotransferase;
Validated.
Length = 306
Score = 29.0 bits (66), Expect = 3.6
Identities = 10/29 (34%), Positives = 17/29 (58%), Gaps = 9/29 (31%)
Query: 230 ILLNTQGFISQGIGLNNKEYLIWYILFIV 258
+LL+ +G++S+G G N +FIV
Sbjct: 183 LLLDVEGYVSEGSGEN---------IFIV 202
>gnl|CDD|223327 COG0249, MutS, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 843
Score = 29.6 bits (67), Expect = 3.6
Identities = 20/89 (22%), Positives = 33/89 (37%), Gaps = 15/89 (16%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKK----KKKKKKKKKKKKKEEQSVK-- 69
N +E + E EE+E+ ++ K K K K K Q++K
Sbjct: 439 LLNNAKEWIAKLELEERERTGIKSLKIKYNKVYGYYIEVTKSNAKLVPDDYIRRQTLKNA 498
Query: 70 ---------EIEESLKERKEKTLTEDSQC 89
E+EE L + +EK L + +
Sbjct: 499 ERFTTPELKELEEKLLDAEEKILALEYEL 527
>gnl|CDD|217010 pfam02387, IncFII_repA, IncFII RepA protein family. This protein
is plasmid encoded and found to be essential for plasmid
replication.
Length = 279
Score = 28.9 bits (65), Expect = 3.6
Identities = 15/62 (24%), Positives = 27/62 (43%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E KK +K +E + KE + ++K KK+++ +K K E+ K
Sbjct: 179 ENKKLKKKGLIPLTLDEARRRAKEFHIERAFSYRTERKAFGKKRRRARKLAKLDEKDIRK 238
Query: 70 EI 71
+I
Sbjct: 239 KI 240
>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain.
Length = 372
Score = 29.2 bits (66), Expect = 3.6
Identities = 19/42 (45%), Positives = 26/42 (61%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+EE E + + E E++ K KKKK KK +KK KKKKK+
Sbjct: 33 DEESFEKDLDLSELEELFSAKAKKKKSKKSEKKSSSKKKKKE 74
>gnl|CDD|223839 COG0768, FtsI, Cell division protein FtsI/penicillin-binding
protein 2 [Cell envelope biogenesis, outer membrane].
Length = 599
Score = 29.3 bits (66), Expect = 3.6
Identities = 13/77 (16%), Positives = 28/77 (36%), Gaps = 3/77 (3%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE---EQSVKEI 71
K E + ++ + + E+ +K+ K+ K K + +K + I
Sbjct: 88 PRKITDEPLEATKDELAAKLLALLEDLDKRALLKELKSKSRLFTPLARKTNLTPEQAARI 147
Query: 72 EESLKERKEKTLTEDSQ 88
E L + + DS+
Sbjct: 148 AELLLKLPGVDVETDSR 164
>gnl|CDD|220223 pfam09405, Btz, CASC3/Barentsz eIF4AIII binding. This domain is
found on CASC3 (cancer susceptibility candidate gene 3
protein) which is also known as Barentsz (Btz). CASC3
is a component of the EJC (exon junction complex) which
is a complex that is involved in post-transcriptional
regulation of mRNA in metazoa. The complex is formed by
the association of four proteins (eIF4AIII, Barentsz,
Mago, and Y14), mRNA, and ATP. This domain wraps around
eIF4AIII and stacks against the 5' nucleotide.
Length = 116
Score = 27.8 bits (62), Expect = 3.7
Identities = 7/39 (17%), Positives = 19/39 (48%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKK 57
E E++ E + +E+K +K+++ +K + +
Sbjct: 1 KVESERQSGRTPSAEPTEPKEDKERKRREHEKYDDEDDE 39
>gnl|CDD|181988 PRK09609, PRK09609, hypothetical protein; Provisional.
Length = 312
Score = 29.2 bits (66), Expect = 3.7
Identities = 20/65 (30%), Positives = 33/65 (50%), Gaps = 5/65 (7%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK-----KKKKKKKKKKEEQ 66
KK K ++ + + ++KE+ D E +K +K K+K KKKKK +K EE+
Sbjct: 105 KKFFGKESRIKRYDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTNEEK 164
Query: 67 SVKEI 71
S+
Sbjct: 165 SLLNF 169
>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII. Eukaryotic DNA topoisomerase
II, GyrB, ParE.
Length = 594
Score = 29.4 bits (67), Expect = 3.7
Identities = 13/55 (23%), Positives = 22/55 (40%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
E + +EK E E+ E + E+ K + KK ++ +KKK
Sbjct: 309 EVRFGVEKIVSECLLSFLEENPVEASKIVEKVLLAAKARAAAKKARELTRKKKLS 363
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is
a small protein module with a triple-stranded
beta-sheet fold. This is a family of WW domain binding
proteins.
Length = 78
Score = 27.2 bits (61), Expect = 3.8
Identities = 12/50 (24%), Positives = 26/50 (52%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
E+K K+ KK K +++ +++ K KK ++ + LK ++E +
Sbjct: 13 EQKKKELKKNKAERQARREAKLAKKNPDRLQRELDKLKAQEESGKLRPVE 62
>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain
pfam08565. Expression of a construct consisting of only
the N-terminal domain of Saccharomyces pombe Cdc37
results in cellular viability. This indicates that
interactions with the cochaperone Hsp90 may not be
essential for Cdc37 function.
Length = 172
Score = 28.6 bits (64), Expect = 3.8
Identities = 21/72 (29%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 16 EKRNKEEEKEE-NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
+K N E E+ NE E + V++E ++K ++ +K + K KK++++ +K+++E
Sbjct: 93 DKENVEPEQPTYNEMVEDLFDQVKDEVDEKNGAALIEELQKHRDKLKKEQKELLKKLDEL 152
Query: 75 LKERKEKTLTED 86
KE K+K + D
Sbjct: 153 EKEEKKKIWSVD 164
>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
Length = 3164
Score = 29.6 bits (66), Expect = 3.8
Identities = 16/85 (18%), Positives = 32/85 (37%)
Query: 1 MRKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
+ G K+K + E E+E K +++ +K+K K
Sbjct: 2118 LSNTVVILGVKEKNADDALSGTPGERLEREVKSVFVEAPRDMLFLLEEEVRKRKGSVMKF 2177
Query: 61 KKKEEQSVKEIEESLKERKEKTLTE 85
K ++ +V E + + K+ +L E
Sbjct: 2178 KSSKKPAVLEAVLFVYKIKKASLRE 2202
>gnl|CDD|232945 TIGR00376, TIGR00376, DNA helicase, putative. The gene product may
represent a DNA helicase. Eukaryotic members of this
family have been characterized as binding certain
single-stranded G-rich DNA sequences (GGGGT and GGGCT).
A number of related proteins are characterized as
helicases [DNA metabolism, DNA replication,
recombination, and repair].
Length = 637
Score = 29.4 bits (66), Expect = 3.8
Identities = 16/94 (17%), Positives = 43/94 (45%), Gaps = 1/94 (1%)
Query: 14 RIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
R+ K NK+ + + + + V + + K + +++ KK K +K++ +K + +
Sbjct: 236 RLLKSNKQHSLDYLIENHPKYQIVADIREKIDELIEERNKKLKPSPQKRRGLSDIKILRK 295
Query: 74 SLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYC 107
+LK+R+ + + E + +M + +
Sbjct: 296 ALKKREARGI-ESLKIASMAEWIETNKSIDRLLK 328
>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
Length = 801
Score = 29.4 bits (66), Expect = 3.8
Identities = 11/49 (22%), Positives = 22/49 (44%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
++ + + K+ ++ K K KKK+ KK K + + +EI
Sbjct: 590 DRAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAEREI 638
Score = 29.4 bits (66), Expect = 3.8
Identities = 11/48 (22%), Positives = 20/48 (41%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIE 72
+ + E + ++ KKK KKK+ KK + ++ E E
Sbjct: 590 DRAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAERE 637
Score = 29.0 bits (65), Expect = 4.3
Identities = 17/65 (26%), Positives = 26/65 (40%), Gaps = 5/65 (7%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
R AG + R + E K+ +Q +K+ + KKK+ KK K K +
Sbjct: 581 REKFAGSQADRAARAEAAEAKKLAKQLKKKAL-----SDGGKKKQGKKAGGGGKSKAAAE 635
Query: 64 EEQSV 68
E V
Sbjct: 636 REIDV 640
Score = 28.2 bits (63), Expect = 7.6
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 7/51 (13%)
Query: 20 KEEEKEE-------NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
K+EE E ++ + + + E K K+ KKK KKK+ KK
Sbjct: 573 KDEEVEAYREKFAGSQADRAARAEAAEAKKLAKQLKKKALSDGGKKKQGKK 623
>gnl|CDD|177973 PLN02339, PLN02339, NAD+ synthase (glutamine-hydrolysing).
Length = 700
Score = 29.3 bits (66), Expect = 3.9
Identities = 12/30 (40%), Positives = 14/30 (46%)
Query: 73 ESLKERKEKTLTEDSQCDTMIPVFKKGVLY 102
E L E LT+ CD +PV GV Y
Sbjct: 67 ECLAEILVGDLTDGILCDIGMPVIHGGVRY 96
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 28.6 bits (65), Expect = 3.9
Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 9/45 (20%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKE---------EQSVKEIEES 74
EE KK +K +K KKKKK++K + +QSVK I+++
Sbjct: 188 EELIKKLLEKPEKPKKKKKRRKGFSKDVRIAVNTIKQSVKMIKKT 232
>gnl|CDD|216597 pfam01601, Corona_S2, Coronavirus S2 glycoprotein. The coronavirus
spike glycoprotein forms the characteristic 'corona'
after which the group is named. The Spike glycoprotein
is translated as a large polypeptide that is
subsequently cleaved to S1 pfam01600 and S2.
Length = 609
Score = 29.2 bits (66), Expect = 4.0
Identities = 14/43 (32%), Positives = 17/43 (39%), Gaps = 9/43 (20%)
Query: 244 LNNKEYLI---WYI-LFIVFVTYAMLPLPLKWCFI----GGCT 278
LN E I WY+ L I A + L L +C GC
Sbjct: 536 LNRYETYIKWPWYVWLAIGAALIAFVLL-LLFCCCMTGCCGCC 577
>gnl|CDD|234032 TIGR02855, spore_yabG, sporulation peptidase YabG. Members of
this family are the protein YabG, demonstrated for
Bacillus subtilis to be an endopeptidase able to
release N-terminal peptides from a number of
sporulation proteins, including CotT, CotF, and SpoIVA.
It appears to be expressed under control of sigma-K
[Cellular processes, Sporulation and germination].
Length = 283
Score = 28.9 bits (65), Expect = 4.0
Identities = 11/36 (30%), Positives = 21/36 (58%)
Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+ + ++KK+KK+ K++ E S + + K KEK
Sbjct: 50 ELVDESERKKRKKRFKERMETSYRLFRQDYKLMKEK 85
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 28.9 bits (65), Expect = 4.0
Identities = 12/40 (30%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 13 KRIEKRNKEE-EKEENEQEEKEKEDVEEEKNKKKKKKKKK 51
+ + ++ KEE E+ + E ED E+ + K+K K+++K
Sbjct: 116 RFLAQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQK 155
Score = 28.1 bits (63), Expect = 6.8
Identities = 10/43 (23%), Positives = 24/43 (55%), Gaps = 1/43 (2%)
Query: 25 EENEQEEKEKEDVEE-EKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
+ ++EE+E+ E + ++ + K+K K+++K + EE
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEET 161
>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
subunit. CENP-Q is one of the components that assembles
onto the CENP-A-nucleosome distal (CAD) centromere. The
centromere, which is the basic element of chromosome
inheritance, is epigenetically determined in mammals.
CENP-A, the centromere-specific histone H3 variant,
assembles an array of nucleosomes and it is this that
seems to be the prime candidate for specifying
centromere identity. CENP-A nucleosomes directly recruit
a proximal CENP-A nucleosome associated complex (NAC)
comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
CENP-C and CENP-H. Assembly of the CENP-A NAC at
centromeres is dependent on CENP-M, CENP-N and CENP-T.
Additionally, there are seven other subunits which make
up the CENP-A-nucleosome distal (CAD) centromere,
CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
CENP-S, also assembling on the CENP-A NAC. Fta7 is the
equivalent component of the fission yeast Sim4 complex.
Length = 159
Score = 28.5 bits (64), Expect = 4.1
Identities = 22/77 (28%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEE-EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K++E +E +EE E E K+ E ++E EKN K +++++++ K + E V E
Sbjct: 40 LKQLELLEEEIRREEAELE-KDLEYLQELEKNAKALEREREEESKNLHPVLRLLESEVLE 98
Query: 71 IEESLKERKEKTLTEDS 87
E L + L+E +
Sbjct: 99 ENELL-QDSLLELSERN 114
>gnl|CDD|225350 COG2740, COG2740, Predicted nucleic-acid-binding protein
implicated in transcription termination
[Transcription].
Length = 95
Score = 27.3 bits (61), Expect = 4.1
Identities = 10/43 (23%), Positives = 19/43 (44%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
+++ +K KKKK + K K + E+ + L E +
Sbjct: 51 PDRECIEKAKKKKLFSRALKAKVSDSLYDELIQLLAELEPPRN 93
>gnl|CDD|225123 COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC
component [Carbohydrate transport and metabolism].
Length = 472
Score = 29.2 bits (66), Expect = 4.1
Identities = 10/44 (22%), Positives = 21/44 (47%), Gaps = 1/44 (2%)
Query: 29 QEEKEKEDVEEEKNK-KKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+ + ++D+E K K+ K + K + K + +VK+I
Sbjct: 338 SKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKKI 381
>gnl|CDD|191868 pfam07839, CaM_binding, Plant calmodulin-binding domain. The
sequences featured in this family are found repeated in
a number of plant calmodulin-binding proteins, and are
thought to constitute the calmodulin-binding domains.
Binding of the proteins to calmodulin depends on the
presence of calcium ions. These proteins are thought to
be involved in various processes, such as plant defence
responses and stolonisation or tuberization.
Length = 115
Score = 27.7 bits (62), Expect = 4.1
Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 2/52 (3%)
Query: 33 EKEDVEEEKNKKKKKKKKKKKKK--KKKKKKKKEEQSVKEIEESLKERKEKT 82
+E EE ++KK+ K +K+K K KK ++ VK +E+ K
Sbjct: 1 TEEKKEERRSKKEDPKPEKEKPKSWSNLKKVILLKRFVKSLEKVRKFNPRGP 52
>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
Endonuclease that cleaves the 5'-overhanging flap
structure that is generated by displacement synthesis
when DNA polymerase encounters the 5'-end of a
downstream Okazaki fragment. Has 5'-endo-/exonuclease
and 5'-pseudo-Y-endonuclease activities. Cleaves the
junction between single and double-stranded regions of
flap DNA.
Length = 338
Score = 28.8 bits (65), Expect = 4.1
Identities = 12/30 (40%), Positives = 19/30 (63%), Gaps = 1/30 (3%)
Query: 11 KKKRIEKRNKEEEKEENEQEE-KEKEDVEE 39
K + +E+R + E+ E + EE EK D+EE
Sbjct: 87 KAETLEERREIREEAEEKWEEALEKGDLEE 116
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 28.4 bits (64), Expect = 4.1
Identities = 26/69 (37%), Positives = 35/69 (50%), Gaps = 14/69 (20%)
Query: 9 GEKKKRIEKR-NKEEEKEE--NEQEEK-EKEDVEE-----EKNKKKKKKKKK-----KKK 54
KK R KR K ++K E E E+ E+E ++ +K KKKKKK+KK KK
Sbjct: 118 ARKKMRALKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRKKTYVVAKKG 177
Query: 55 KKKKKKKKK 63
K K + K
Sbjct: 178 NKGKAGRPK 186
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for
centrosome separation and maintenance of spindle
bi-polarity. TPX2 is a microtubule-associated protein
that mediates the binding of the C-terminal domain of
Xklp2 to microtubules. It is phosphorylated during
mitosis in a microtubule-dependent way.
Length = 57
Score = 26.6 bits (59), Expect = 4.2
Identities = 12/34 (35%), Positives = 22/34 (64%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK 46
K++E++ K E E+ E E ++KE+ EE + +K
Sbjct: 15 KKLEEKEKALEAEKEEAEARQKEEEEEAIKQLRK 48
>gnl|CDD|149587 pfam08586, Rsc14, RSC complex, Rsc14/Ldb7 subunit. RSC is an
ATP-dependent chromatin remodelling complex found in
yeast. The RSC components Rsc7/Npl6 and Rsc14/Ldb7
interact physically and/or functionally with Rsc3,
Rsc30, and Htl1 to form a module important for a broad
range of RSC functions.
Length = 101
Score = 27.5 bits (61), Expect = 4.2
Identities = 10/33 (30%), Positives = 21/33 (63%)
Query: 20 KEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK 52
KEE +E +QE+ +++D + + K+ +K K+
Sbjct: 23 KEELRELTQQEDDQRKDRDLPELKRNDTEKGKR 55
>gnl|CDD|224911 COG2000, COG2000, Predicted Fe-S protein [General function
prediction only].
Length = 226
Score = 28.7 bits (64), Expect = 4.2
Identities = 18/90 (20%), Positives = 34/90 (37%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
R E + ++ K + E + + + K K+ K +KE + V ++
Sbjct: 26 SRCDEFAEALLKKLVKIDDCPFLELERFAENYDRLKAILKELGKAEKEMKIVGVLDGYEA 85
Query: 77 ERKEKTLTEDSQCDTMIPVFKKGVLYKGIY 106
+ K L + C ++ FK L KG
Sbjct: 86 DFVLKPLPNECSCREILFPFKSKELEKGDI 115
>gnl|CDD|227463 COG5134, COG5134, Uncharacterized conserved protein [Function
unknown].
Length = 272
Score = 28.9 bits (64), Expect = 4.3
Identities = 15/96 (15%), Positives = 31/96 (32%), Gaps = 4/96 (4%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
N E+ E + E EK+ +++ K + K+ S + ++ +
Sbjct: 122 AENVEKVPESDAIEALEKQLTQQKSEKHNSSAINFIDELNKRLWSDPFVSSQRLRKQFRE 181
Query: 77 ERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTN 112
+K E Q + + + L I S
Sbjct: 182 RKK----IEKKQEAKDLSLKNRAALDIDILPSSSDK 213
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 28.4 bits (64), Expect = 4.4
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 42 NKKKKKKKKKKKKKKKKKKKK 62
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 28.0 bits (63), Expect = 5.0
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKK 61
E +KK K K+ +K + KKKKKK
Sbjct: 84 EVSKKAKGKRSRKNQTKKKKKK 105
Score = 27.7 bits (62), Expect = 6.3
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 43 KKKKKKKKKKKKKKKKKKKKK 63
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
>gnl|CDD|236413 PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated.
Length = 367
Score = 28.8 bits (65), Expect = 4.4
Identities = 19/42 (45%), Positives = 24/42 (57%), Gaps = 2/42 (4%)
Query: 54 KKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPV 95
KK KKK+ EQ VKE+ E K +K TLT D + +IP
Sbjct: 1 KKTKKKETTTLEQVVKELIE--KGKKRGTLTYDEIAEKLIPF 40
Score = 28.4 bits (64), Expect = 6.5
Identities = 10/38 (26%), Positives = 18/38 (47%)
Query: 40 EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
+ KK+ ++ K+ +K KK+ + EI E L
Sbjct: 2 KTKKKETTTLEQVVKELIEKGKKRGTLTYDEIAEKLIP 39
>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
Toc86/159, G and M domains. The long precursor of the
86K protein originally described is proposed to have
three domains. The N-terminal A-domain is acidic,
repetitive, weakly conserved, readily removed by
proteolysis during chloroplast isolation, and not
required for protein translocation. The other domains
are designated G (GTPase) and M (membrane anchor); this
family includes most of the G domain and all of M
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 763
Score = 29.2 bits (65), Expect = 4.4
Identities = 14/64 (21%), Positives = 35/64 (54%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K ++ K +KE+ K E+ + + +++++ +++ K+ K KK K+ + + S +
Sbjct: 417 TKAQMAKLSKEQRKAYLEEYDYRVKLLQKKQWREELKRMKMMKKFGKEIGELPDGYSEEV 476
Query: 71 IEES 74
EE+
Sbjct: 477 DEEN 480
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 28.6 bits (64), Expect = 4.4
Identities = 13/45 (28%), Positives = 23/45 (51%), Gaps = 2/45 (4%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKK 56
K R+ K E + QEE K ++ N+ +++ +K+ KKK
Sbjct: 149 KARLRKAQILLEGLQKNQEELFKL--LDKYNELREQVQKESSKKK 191
>gnl|CDD|216095 pfam00748, Calpain_inhib, Calpain inhibitor. This region is found
multiple times in calpain inhibitor proteins.
Length = 131
Score = 27.8 bits (62), Expect = 4.5
Identities = 17/53 (32%), Positives = 29/53 (54%), Gaps = 7/53 (13%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKED-------VEEEKNKKKKKKKKKKKKK 55
E+KK +E + KE+ KEE ++ E+ED + E K+K K K +++
Sbjct: 79 EEKKPVEDKVKEKAKEEKLEKLGEREDTIPPEYRLLEAKDKDGKPLLPKPEEE 131
Score = 27.8 bits (62), Expect = 5.2
Identities = 15/59 (25%), Positives = 26/59 (44%), Gaps = 14/59 (23%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKK--------------KKKKKKKKKKKKKKEEQS 67
++E + +E+K ED +EK K++K +K + K K K K E+
Sbjct: 73 QREPDPEEKKPVEDKVKEKAKEEKLEKLGEREDTIPPEYRLLEAKDKDGKPLLPKPEEE 131
>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
modification, protein turnover, chaperones].
Length = 623
Score = 29.2 bits (66), Expect = 4.5
Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 6/78 (7%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
EE++ ++E+E+KE + + E + K+ K K + + + L + E
Sbjct: 491 EEDEADSEEEKKEFKPLLE---RVKEILGDKVKDVRLSHRLTDSPACLTTDGADLSTQME 547
Query: 81 KTLTEDSQCDTMIPVFKK 98
+ L Q +P KK
Sbjct: 548 RLLKAQGQ---EVPESKK 562
>gnl|CDD|218370 pfam04997, RNA_pol_Rpb1_1, RNA polymerase Rpb1, domain 1. RNA
polymerases catalyze the DNA dependent polymerisation of
RNA. Prokaryotes contain a single RNA polymerase
compared to three in eukaryotes (not including
mitochondrial. and chloroplast polymerases). This
domain, domain 1, represents the clamp domain, which a
mobile domain involved in positioning the DNA,
maintenance of the transcription bubble and positioning
of the nascent RNA strand.
Length = 330
Score = 28.8 bits (65), Expect = 4.5
Identities = 14/52 (26%), Positives = 20/52 (38%), Gaps = 5/52 (9%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKK-----KKKKKKKKKKKEEQSVKEIE 72
NE + V + K KK+ KK KKK K ++ +K E
Sbjct: 105 LNESVKYFFLKVVIDPKGKNSKKRLKKINNLCKKKSICSKCGEDNGGLKAFE 156
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 28.8 bits (65), Expect = 4.6
Identities = 7/44 (15%), Positives = 30/44 (68%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+EE ++E +E + +D +++ +++ ++++++++ ++K+ E
Sbjct: 137 DEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAE 180
Score = 28.4 bits (64), Expect = 7.0
Identities = 6/46 (13%), Positives = 30/46 (65%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
+EE EE+ + E +++D +++ + +++ ++++++++ +++
Sbjct: 134 DEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRA 179
Score = 28.0 bits (63), Expect = 8.5
Identities = 10/53 (18%), Positives = 31/53 (58%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
EE EE+E+ K ++D +++ + +++ ++++++++ E+ E+E
Sbjct: 131 EESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEF 183
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45;
Provisional.
Length = 193
Score = 28.5 bits (63), Expect = 4.7
Identities = 18/70 (25%), Positives = 40/70 (57%), Gaps = 3/70 (4%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQE---EKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
R+ A E K+R E EK + E E E++ ED+ EE + ++ +++++ +++K
Sbjct: 6 RKNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKP 65
Query: 61 KKKEEQSVKE 70
K++ + ++E
Sbjct: 66 KEEIDYPIQE 75
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 29.0 bits (65), Expect = 4.7
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEE 40
EK + +R EEE+EE E+EE+E + E
Sbjct: 392 EKGHPVVERWAEEEEEEEEEEEEEAAEAEAP 422
>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
peptidyl-prolyl isomerase. This family is only found at
the amino terminus of pfam00254. This domain is of
unknown function.
Length = 124
Score = 27.6 bits (62), Expect = 4.7
Identities = 9/44 (20%), Positives = 23/44 (52%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+E K +E+ +K + +++ K++ K +KK ++ K +
Sbjct: 59 AGKELKLTDEEIQKALQAFQKKLQAKQQAKAEKKAEENKAAGEA 102
Score = 27.2 bits (61), Expect = 7.0
Identities = 14/44 (31%), Positives = 24/44 (54%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+EE K + +KK + K++ K +KK E++ E L E +K
Sbjct: 67 DEEIQKALQAFQKKLQAKQQAKAEKKAEENKAAGEAFLAENAKK 110
>gnl|CDD|211398 cd11560, W2_eIF5C_like, C-terminal W2 domain of the eukaryotic
translation initiation factor 5C and similar proteins.
eIF5C appears to be essential for the initiation of
protein translation; its actual function, and
specifically that of the C-terminal W2 domain, are not
well understood. The Drosophila ortholog, kra
(krasavietz) or exba (extra bases), may be involved in
translational inhibition in neural development. The
structure of this C-terminal domain resembles that of a
set of concatenated HEAT repeats.
Length = 194
Score = 28.3 bits (64), Expect = 4.8
Identities = 19/65 (29%), Positives = 40/65 (61%), Gaps = 2/65 (3%)
Query: 25 EENEQEEKEKEDVEE--EKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
EE+ E ++E ++E E +K+ ++ KK+ +++ K+ EE+ VKEI ++KE+ +K+
Sbjct: 10 EEHFAEHFKEEGLDELVEFYRKQASQEIKKELQQELKEMIAEEEPVKEIIAAVKEQMKKS 69
Query: 83 LTEDS 87
+
Sbjct: 70 SLPEH 74
>gnl|CDD|220265 pfam09491, RE_AlwI, AlwI restriction endonuclease. This family
includes the AlwI (recognises GGATC), Bsp6I (recognises
GC^NGC), BstNBI (recognises GASTC), PleI(recognises
GAGTC) and MlyI (recognises GAGTC) restriction
endonucleases.
Length = 429
Score = 28.9 bits (65), Expect = 4.8
Identities = 11/51 (21%), Positives = 25/51 (49%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+ + +++ K +K++ K K KK K+ E++ + S K++ E
Sbjct: 92 DNDIDEIVNRILKFRKERLKNKDNIKKLSKEYFEKECYSISDSSSKKKFET 142
>gnl|CDD|181784 PRK09334, PRK09334, 30S ribosomal protein S25e; Provisional.
Length = 86
Score = 26.9 bits (60), Expect = 4.9
Identities = 12/36 (33%), Positives = 24/36 (66%)
Query: 48 KKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
++KKKKKK+ K +K+ + ++ ++E L +R K +
Sbjct: 1 QQKKKKKKEGSKTEKEIKSTIITLDEELLKRVAKEV 36
Score = 26.5 bits (59), Expect = 7.3
Identities = 16/45 (35%), Positives = 28/45 (62%), Gaps = 5/45 (11%)
Query: 43 KKKKKKKKKKKKKKKKKKKKK---EEQSVKEIEESLKERKEKTLT 84
++KKKKKK+ K +K+ K +E+ +K + + +K KEK +T
Sbjct: 1 QQKKKKKKEGSKTEKEIKSTIITLDEELLKRVAKEVK--KEKIVT 43
>gnl|CDD|222346 pfam13728, TraF, F plasmid transfer operon protein. TraF protein
undergoes proteolytic processing associated with
export. The 19 amino acids at the amino terminus of the
polypeptides appear to constitute a typical membrane
leader peptide - not included in this family, while the
remainder of the molecule is predicted to be primarily
hydrophilic in character. F plasmid TraF and TraH are
required for F pilus assembly and F plasmid transfer,
and they are both localised to the outer membrane in
the presence of the complete F transfer region,
especially TraV, the putative anchor.
Length = 215
Score = 28.3 bits (64), Expect = 4.9
Identities = 6/39 (15%), Positives = 14/39 (35%)
Query: 43 KKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
+ K+ +K + + + + E L+E K
Sbjct: 8 PPEPKEPEKAPAPAAPAQSPSATEQMAWLREYLEEALAK 46
>gnl|CDD|221630 pfam12542, CWC25, Pre-mRNA splicing factor. This domain family
is found in eukaryotes, and is approximately 100 amino
acids in length. The family is found in association
with pfam10197. There is a single completely conserved
residue Y that may be functionally important. Cwc25 has
been identified to associate with pre-mRNA splicing
factor Cef1/Ntc85, a component of the Prp19-associated
complex (NTC) involved in spliceosome activation. Cwc25
is neither tightly associated with NTC nor required for
spliceosome activation, but is required for the first
catalytic reaction.
Length = 95
Score = 27.0 bits (60), Expect = 4.9
Identities = 13/64 (20%), Positives = 28/64 (43%), Gaps = 9/64 (14%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKK---------KKKKKKKKKK 62
KKR++K +E++ EE +Q+ + +E N K KK++++ ++
Sbjct: 31 KKRVDKLLEEKDAEEAKQKSSGEALIEGASNANSANDTWNKLREDPLLAIKKREQEARQA 90
Query: 63 KEEQ 66
Sbjct: 91 LMNN 94
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease
M16C (pfam05193).
Length = 248
Score = 28.3 bits (64), Expect = 5.0
Identities = 11/41 (26%), Positives = 23/41 (56%)
Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
+K +++++K +KKK E+ ++I E E +E T +
Sbjct: 9 EKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQSTPE 49
Score = 28.3 bits (64), Expect = 6.0
Identities = 13/44 (29%), Positives = 26/44 (59%)
Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
E +EE+ N+++++K +KKK ++ K+K + E+EE
Sbjct: 4 DETLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQST 47
>gnl|CDD|220897 pfam10883, DUF2681, Protein of unknown function (DUF2681). This
family of proteins is found in bacteria. Proteins in
this family are typically between 81 and 117 amino
acids in length.
Length = 87
Score = 27.0 bits (60), Expect = 5.0
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEE-KNKKKKKK 48
K K+ ++ N + + ENEQ EK E E KN K ++K
Sbjct: 24 KLKKAQREN-RKLQAENEQLATEKAVAETEVKNAKVRQK 61
>gnl|CDD|221550 pfam12366, Casc1, Cancer susceptibility candidate 1. This domain
family is found in eukaryotes, and is typically between
216 and 263 amino acids in length. Casc1 has many SNPs
associated with cancer susceptibility.
Length = 227
Score = 28.5 bits (64), Expect = 5.0
Identities = 10/51 (19%), Positives = 19/51 (37%), Gaps = 7/51 (13%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE--KTLTED 86
E+ + + K+ K K K EI + +K+ E + + D
Sbjct: 71 IAEEAEGARGYVTDIPKEYKAKPVKY-----LEIPKPIKDPPELPEGMFPD 116
Score = 28.5 bits (64), Expect = 5.1
Identities = 9/43 (20%), Positives = 15/43 (34%)
Query: 29 QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+ +E E K+ K K K + K K+ + E
Sbjct: 70 KIAEEAEGARGYVTDIPKEYKAKPVKYLEIPKPIKDPPELPEG 112
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 28.4 bits (64), Expect = 5.0
Identities = 15/62 (24%), Positives = 30/62 (48%), Gaps = 3/62 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E + N E+K +E +E+ E+ ++ ++ ++K+ + K +K K E V
Sbjct: 29 ELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEMKALAEKSKFGE---VY 85
Query: 70 EI 71
EI
Sbjct: 86 EI 87
>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
Length = 292
Score = 28.6 bits (65), Expect = 5.0
Identities = 13/30 (43%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
Query: 11 KKKRIEKRNKEEEKEENEQEE-KEKEDVEE 39
K + IE+R + E+ E + EE KE+ D+EE
Sbjct: 40 KAEEIEERREVREEAEEKYEEAKEEGDLEE 69
>gnl|CDD|191022 pfam04538, BEX, Brain expressed X-linked like family. This is a
family of transcription elongation factors which
includes those referred to as Bex proteins as well as
those named TCEAL7. Bex1 was shown to be a novel link
between neurotrophin signalling, the cell cycle, and
neuronal differentiation, suggesting it might function
by coordinating internal cellular states with the
ability of cells to respond to external signals. TCEAL7
has been shown negatively to regulate the NF-kappaB
pathway, hence being important in ovarian cancer as it
one of the genes frequently downregulated in this
cancer. A closely related protein, TFIIS/TCEA, found in
pfam07500 is involved in transcription elongation and
transcript fidelity. TFIIS/TCEA promotes 3'
endoribonuclease activity of RNA polymerase II (pol II)
and allows pol II to bypass transcript pause or
'arrest' during elongation process. It is thus possible
that BEX is also acting in this way.
Length = 97
Score = 27.3 bits (61), Expect = 5.0
Identities = 16/40 (40%), Positives = 25/40 (62%), Gaps = 2/40 (5%)
Query: 15 IEKRNKE-EEKEENEQ-EEKEKEDVEEEKNKKKKKKKKKK 52
+EK KE E K E+E EE+EK +EE + KK + +++
Sbjct: 1 MEKPCKENEGKPESEPKEEEEKRPLEEGEGKKPEGNFRRR 40
>gnl|CDD|153355 cd07671, F-BAR_PSTPIP1, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
Phosphatase-Interacting Protein 1. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Proline-Serine-Threonine
Phosphatase-Interacting Protein 1 (PSTPIP1), also known
as CD2 Binding Protein 1 (CD2BP1), is mainly expressed
in hematopoietic cells. It is a binding partner of the
cell surface receptor CD2 and PTP-PEST, a tyrosine
phosphatase which functions in cell motility and Rac1
regulation. It also plays a role in the activation of
the Wiskott-Aldrich syndrome protein (WASP), which
couples actin rearrangement and T cell activation.
Mutations in the gene encoding PSTPIP1 cause the
autoinflammatory disorder known as PAPA (pyogenic
sterile arthritis, pyoderma gangrenosum, and acne)
syndrome. PSTPIP1 contains an N-terminal F-BAR domain,
PEST motifs, and a C-terminal SH3 domain. F-BAR domains
form banana-shaped dimers with a positively-charged
concave surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 242
Score = 28.4 bits (63), Expect = 5.0
Identities = 19/80 (23%), Positives = 33/80 (41%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
KK K+ E+ E ++ E+ E N K+ +K + K K+ + + E +
Sbjct: 120 KKTMESKKTYEQRCREADEAEQTFERSSSTGNPKQSEKSQNKAKQCRDAATEAERVYKQN 179
Query: 71 IEESLKERKEKTLTEDSQCD 90
IE+ K R E C+
Sbjct: 180 IEQLDKARTEWETEHILTCE 199
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 29.1 bits (65), Expect = 5.0
Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 3/73 (4%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKK---KKKKKKKKKKKKEEQSVKEIEES 74
K EK+ +K+ EK+ K K+K K + K + KK++ EE+S S
Sbjct: 720 PPKPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSS 779
Query: 75 LKERKEKTLTEDS 87
+ ++S
Sbjct: 780 SSSHHHSSSNKES 792
>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
homology Domain (IMD), a dimerization module that binds
and bends membranes. Inverse (I)-BAR (or IMD) is a
member of the Bin/Amphiphysin/Rvs (BAR) domain family.
It is a dimerization and lipid-binding module that bends
membranes and induces membrane protrusions in the
opposite direction compared to classical BAR and F-BAR
domains, which produce membrane invaginations. IMD
domains are found in Insulin Receptor tyrosine kinase
Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
Brain-specific Angiogenesis Inhibitor 1-Associated
Protein 2-like (BAIAP2L) proteins. These are
multi-domain proteins that act as scaffolding proteins
and transducers of a variety of signaling pathways that
link membrane dynamics and the underlying actin
cytoskeleton. Most members contain an N-terminal IMD, an
SH3 domain, and a WASP homology 2 (WH2) actin-binding
motif at the C-terminus, exccept for MIM which does not
carry an SH3 domain. Some members contain additional
domains and motifs. The IMD domain binds and bundles
actin filaments, binds membranes and produces membrane
protrusions, and interacts with the small GTPase Rac.
Length = 223
Score = 28.5 bits (64), Expect = 5.0
Identities = 17/68 (25%), Positives = 42/68 (61%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EKK ++++ + +++ ++E K+K + ++ + KK +KK +K K ++K +Q+++
Sbjct: 100 EKKLELDQKVINKFEKDYKKEYKQKREDLDKARSELKKLQKKSQKSGTGKYQEKLDQALE 159
Query: 70 EIEESLKE 77
E+ + KE
Sbjct: 160 ELNDKQKE 167
>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9. CRISPR
(Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; Very large protein
containing McrA/HNH-nuclease related domain and a
RuvC-like nuclease domain; signature gene for type II.
Length = 799
Score = 28.9 bits (65), Expect = 5.1
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 37 VEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K K +KK++KK + K+ EQ +KE+
Sbjct: 524 NGTNKGTKNRKKRQKKNEDNIKEAASALEQKLKEL 558
Score = 28.9 bits (65), Expect = 5.6
Identities = 16/68 (23%), Positives = 30/68 (44%), Gaps = 4/68 (5%)
Query: 18 RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKE 77
R EE+ E ++EK+++ + KK K K+K+K KEE+ K + +
Sbjct: 299 RIIEEQGETKILSKEEKQELLDLL----FKKNKLTYKQKRKLLGLKEEEIFKGLRYEGLK 354
Query: 78 RKEKTLTE 85
++
Sbjct: 355 AEKNFNIS 362
>gnl|CDD|217360 pfam03087, DUF241, Arabidopsis protein of unknown function. This
family represents a number of Arabidopsis proteins.
Their functions are unknown.
Length = 230
Score = 28.5 bits (64), Expect = 5.1
Identities = 14/64 (21%), Positives = 31/64 (48%), Gaps = 6/64 (9%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
EK++K+ + + E ++EEK + ++ K K+ ++ E S++E+
Sbjct: 156 SLLSEKKSKKGGSQSSLSIRSELSRLDEEKRRDNEEVKDILKRLEEL------ENSIEEL 209
Query: 72 EESL 75
E+ L
Sbjct: 210 EDEL 213
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast)
and GNL3L (human) are putative GTPases which are
required for growth and play a role in processing of
nucleolar pre-rRNA. This family contains a potential
nuclear localisation signal.
Length = 80
Score = 26.8 bits (60), Expect = 5.1
Identities = 9/35 (25%), Positives = 27/35 (77%)
Query: 31 EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
E+ E++EE+K K++++K+++K+ +K ++ + ++
Sbjct: 45 EEILEEIEEKKRKQEEEKERRKEARKAERAEARKR 79
>gnl|CDD|184349 PRK13824, PRK13824, replication initiation protein RepC;
Provisional.
Length = 404
Score = 28.7 bits (65), Expect = 5.1
Identities = 11/20 (55%), Positives = 13/20 (65%)
Query: 315 INSAGMYTKYLTDRGQRRAF 334
INSAG Y + LT R +R F
Sbjct: 364 INSAGGYLRDLTRRAERGEF 383
>gnl|CDD|224023 COG1098, VacB, Predicted RNA binding protein (contains ribosomal
protein S1 domain) [Translation, ribosomal structure and
biogenesis].
Length = 129
Score = 27.6 bits (62), Expect = 5.2
Identities = 14/44 (31%), Positives = 25/44 (56%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
EE+ +K+ +K + K + K+ + + +K IEESL K +T
Sbjct: 77 EEEPEKQHRKPRFSKSRPKEGFETLLSRLLKWIEESLSTLKRQT 120
Score = 26.9 bits (60), Expect = 9.2
Identities = 10/66 (15%), Positives = 29/66 (43%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
G +++E+ +++ ++ + + KE E ++ K ++ K++ KK
Sbjct: 63 IDENGKISLSIRKLEEEPEKQHRKPRFSKSRPKEGFETLLSRLLKWIEESLSTLKRQTKK 122
Query: 62 KKEEQS 67
K+ +
Sbjct: 123 KRGGRG 128
>gnl|CDD|232985 TIGR00463, gltX_arch, glutamyl-tRNA synthetase, archaeal and
eukaryotic family. The glutamyl-tRNA synthetases of
the eukaryotic cytosol and of the Archaea are more
similar to glutaminyl-tRNA synthetases than to
bacterial glutamyl-tRNA synthetases. This model models
just the eukaryotic cytosolic and archaeal forms of the
enzyme. In some eukaryotes, the glutamyl-tRNA
synthetase is part of a longer, multifunctional
aminoacyl-tRNA ligase. In many species, the charging of
tRNA(gln) proceeds first through misacylation with Glu
and then transamidation. For this reason, glutamyl-tRNA
synthetases, including all known archaeal enzymes (as
of 2010) may act on both tRNA(gln) and tRNA(glu)
[Protein synthesis, tRNA aminoacylation].
Length = 556
Score = 28.6 bits (64), Expect = 5.2
Identities = 22/71 (30%), Positives = 30/71 (42%), Gaps = 8/71 (11%)
Query: 3 KRRGGAGEK-------KKRIEKRNKEEE-KEENEQEEKEKEDVEEEKNKKKKKKKKKKKK 54
K RG A K E R K +E E E +E + E+ K+ K+ K
Sbjct: 15 KYRGKANPKAVMGAVMSNNPELRKKAKEVLEAVEAAVEEVNSLSPEEQKELMKRLGLDIK 74
Query: 55 KKKKKKKKKEE 65
KK+KK+K E
Sbjct: 75 KKEKKRKGLRE 85
>gnl|CDD|217611 pfam03552, Cellulose_synt, Cellulose synthase. Cellulose, an
aggregate of unbranched polymers of beta-1,4-linked
glucose residues, is the major component of wood and
thus paper, and is synthesised by plants, most algae,
some bacteria and fungi, and even some animals. The
genes that synthesise cellulose in higher plants differ
greatly from the well-characterized genes found in
Acetobacter and Agrobacterium sp. More correctly
designated as 'cellulose synthase catalytic subunits',
plant cellulose synthase (CesA) proteins are integral
membrane proteins, approximately 1,000 amino acids in
length. There are a number of highly conserved residues,
including several motifs shown to be necessary for
processive glycosyltransferase activity.
Length = 716
Score = 28.9 bits (65), Expect = 5.2
Identities = 13/47 (27%), Positives = 27/47 (57%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
+ KK K K KKK K+K+ + ++++I+E ++ K +++ S
Sbjct: 312 RRKKPKSASKAKKKDSKRKESEAPIFNLEDIDEGDEDEKSSLMSQLS 358
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 28.8 bits (64), Expect = 5.2
Identities = 29/131 (22%), Positives = 56/131 (42%), Gaps = 17/131 (12%)
Query: 11 KKKRIEKRNKEEEK------EENEQEEKEKEDVEEEKNKKKKKKKKKKKK---------- 54
K RI+K + E K EE Q EK + ++ EK+ + + + +++
Sbjct: 369 DKDRIQKLSAELLKLEEWLQEERSQREKLEVELGIEKDCNRVQLSENRRELSELRSALRV 428
Query: 55 -KKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVLYKGIYCPSLTNS 113
+K+K++ ++E+Q + + L+ R +K E D K L++ Y L +
Sbjct: 429 LQKEKEQLQEEKQELLDYIRVLELRLDKEADEKWAEDAATCEDAKSYLHRAAYIDLLDSI 488
Query: 114 FREPSLEMSYQ 124
P E S +
Sbjct: 489 DESPLDESSSE 499
>gnl|CDD|181442 PRK08476, PRK08476, F0F1 ATP synthase subunit B'; Validated.
Length = 141
Score = 27.7 bits (62), Expect = 5.3
Identities = 21/83 (25%), Positives = 39/83 (46%), Gaps = 4/83 (4%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
K N + E + E ++ EE NK ++K K K++ +KK + K+ + + E K
Sbjct: 51 KTNSSDVSEIEHEIETILKNAREEANKIRQKAIAKAKEEAEKKIEAKKAELESKYEAFAK 110
Query: 77 --ERKEKTLTEDSQCDTMIPVFK 97
+++ L E + +P FK
Sbjct: 111 QLANQKQELKEQLL--SQMPEFK 131
>gnl|CDD|235516 PRK05582, PRK05582, DNA topoisomerase I; Validated.
Length = 650
Score = 28.6 bits (65), Expect = 5.3
Identities = 15/40 (37%), Positives = 23/40 (57%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
+ + E K +EDV+E + KKK K K KKK++K+
Sbjct: 211 YKGKKIELKNEEDVKEILAELKKKDFKVSKVKKKERKRNP 250
Score = 28.2 bits (64), Expect = 7.7
Identities = 14/57 (24%), Positives = 27/57 (47%), Gaps = 5/57 (8%)
Query: 31 EKEKEDVEEE---KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
+K K+ E KK + K ++ K+ + KK++ V ++++ KERK
Sbjct: 198 KKGKKKFEASFYGYKGKKIELKNEEDVKEILAELKKKDFKVSKVKK--KERKRNPPP 252
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 28.8 bits (63), Expect = 5.3
Identities = 15/53 (28%), Positives = 24/53 (45%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
+++ R + R +E K E ++E +ED K+K KK KK K
Sbjct: 166 DEQHRHKARKQELRKREKDRERARREDAAAAAAAKQKAAAKKAAAPSGKKSAK 218
Score = 28.4 bits (62), Expect = 6.7
Identities = 13/49 (26%), Positives = 30/49 (61%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
KRN E + + E+K++ + E+ K+ K ++ + K +K++ +K+E+
Sbjct: 135 KRNLMEHIHKVKNEKKKERQLAEQLAAKRLKDEQHRHKARKQELRKREK 183
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 28.6 bits (64), Expect = 5.3
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEE 39
+ EK KE+E+EE +EE+E+ED +E
Sbjct: 254 DQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285
>gnl|CDD|143630 cd07892, PolyPPase_VTC2-3_like, Polyphosphate(polyP) polymerase
domain of yeast vacuolar transport chaperone (VTC)
proteins VTC-2, and -3 , and similar proteins.
Saccharomyces cerevisiae VTC-1, -2, -3, and -4 comprise
the membrane-integral VTC complex. VTC-2, -3, and -4
contain polyP polymerase domains. S. cerevisiae
VTC-2,and -3 belong to this subgroup. For VTC4 it has
been shown that this domain generates polyP from ATP by
a phosphotransfer reaction releasing ADP. This activity
is metal ion-dependent. The ATP gamma phosphate may be
cleaved and then transferred to an acceptor phosphate to
form polyP. PolyP is ubiquitous. In prokaryotes, it is a
store of phosphate and energy. In eukaryotes, polyPs
have roles in bone calcification, and osmoregulation,
and in phosphate transport in the symbiosis of
mycorrhizal fungi and plants. This subgroup belongs to
the CYTH/triphosphate tunnel metalloenzyme (TTM)-like
superfamily, whose enzymes have a unique active site
located within an eight-stranded beta barrel.
Length = 303
Score = 28.5 bits (64), Expect = 5.3
Identities = 17/68 (25%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 15 IEKRNKEEEKEENEQEE---KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
+EK+ +E E+++ KEK + K K +KK +K +K+ + ++ V+ I
Sbjct: 98 VEKKTFDENTSSFEEDKLQLKEKY-INGFIFGKYKFEKKLQKMEKRGADLENLKKDVENI 156
Query: 72 EESLKERK 79
++ ++E K
Sbjct: 157 QDFIRENK 164
>gnl|CDD|203471 pfam06518, DUF1104, Protein of unknown function (DUF1104). This
family consists of several hypothetical proteins of
unknown function which appear to be found largely in
Helicobacter pylori.
Length = 93
Score = 26.9 bits (60), Expect = 5.4
Identities = 15/65 (23%), Positives = 29/65 (44%), Gaps = 2/65 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E KR+ K ++E K + + + + ++K +K+ +++ KK SVK
Sbjct: 29 EIDKRLGKMKEKEAKNFKAGFKAAMRKNLSKMSVEDRRKFRKEVREEMNKKVDA--MSVK 86
Query: 70 EIEES 74
E E
Sbjct: 87 EAREL 91
>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
Length = 405
Score = 28.6 bits (64), Expect = 5.5
Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 4/56 (7%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
K + + K KK+K KK+K+ K+E ++K ++ KT+ DS
Sbjct: 338 KTARRMALLAEAARVKAKKEKLAKKRKQVSKEEAAAIKAAGKAW----YKTMISDS 389
>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD. YqfD is part of
the sigma-E regulon in the sporulation program of
endospore-forming Gram-positive bacteria. Mutation
results in a sporulation defect in Bacillus subtilis.
Members are found in all currently known
endospore-forming bacteria, including the genera
Bacillus, Symbiobacterium, Carboxydothermus,
Clostridium, and Thermoanaerobacter [Cellular processes,
Sporulation and germination].
Length = 382
Score = 28.5 bits (64), Expect = 5.5
Identities = 16/73 (21%), Positives = 33/73 (45%), Gaps = 3/73 (4%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSV- 68
EK K+ E ++ E ++ E+ K++ +K K+K +K+ KK+ +
Sbjct: 295 EKTKKFGLWRWSLPIEIVKETYYEVKEKVEKVTKEEAIEKAKEKAEKELKKELDPNAKIV 354
Query: 69 --KEIEESLKERK 79
K + E ++ K
Sbjct: 355 SDKILSERVEGGK 367
Score = 28.5 bits (64), Expect = 5.9
Identities = 22/78 (28%), Positives = 36/78 (46%), Gaps = 3/78 (3%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
K EK K + E KE E K+K +K K++ +K K+K E++ K
Sbjct: 289 AKYDVEEKTKKFGLWRWSLPIEIVKETYYE---VKEKVEKVTKEEAIEKAKEKAEKELKK 345
Query: 70 EIEESLKERKEKTLTEDS 87
E++ + K +K L+E
Sbjct: 346 ELDPNAKIVSDKILSERV 363
>gnl|CDD|227003 COG4656, RnfC, Predicted NADH:ubiquinone oxidoreductase, subunit
RnfC [Energy production and conversion].
Length = 529
Score = 28.9 bits (65), Expect = 5.5
Identities = 13/70 (18%), Positives = 31/70 (44%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
EK + + +R + ++ EE + + + E + ++ + KK K+ ++
Sbjct: 429 EKAEILAQRQELKKAEEAKTRFEARTARLEREKAERIPRMKKAAVKRLSFVTAALIDAIA 488
Query: 70 EIEESLKERK 79
E +KE+K
Sbjct: 489 AALERVKEKK 498
>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6. The VP6 protein a
minor protein in the core of the virion is probably the
viral helicase.
Length = 322
Score = 28.5 bits (63), Expect = 5.5
Identities = 13/65 (20%), Positives = 35/65 (53%)
Query: 4 RRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
+R K+++I+ + + +E++ K++ +E+ + ++K+K K++ K+
Sbjct: 9 KRSSEELKQRQIQINLVDWFESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKET 68
Query: 64 EEQSV 68
E+ SV
Sbjct: 69 EDASV 73
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 28.5 bits (64), Expect = 5.5
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 33 EKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
E+ ++ EK K + KKK K KK +
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLRID 174
>gnl|CDD|222635 pfam14264, Glucos_trans_II, Glucosyl transferase GtrII. This
family includes glucosyl transferase II from the
Shigella phage SfII, which mediates seroconversion of S.
flexneri when the phage is integrated into the host
chromosome.
Length = 317
Score = 28.4 bits (64), Expect = 5.5
Identities = 13/73 (17%), Positives = 24/73 (32%), Gaps = 10/73 (13%)
Query: 218 NNYLQWAAVGTWILLNTQGFISQGIGLNNKEYLIWY--ILFIVFVTYAMLPLPLKWCFIG 275
Y W +G +L + + + L IL I+F+ + L I
Sbjct: 20 YGYGGWVGLGRPLL----DLLKRILFGTGLTDLPPLPLILSILFLALSA----LLLARIF 71
Query: 276 GCTTSVLHVIITA 288
++ V+I
Sbjct: 72 DLRNKLIAVLIAL 84
>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
All proteins in this family for whcih functions are
known are G-T or G-U mismatch glycosylases that function
in base excision repair. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University). Used 2pf model [DNA metabolism,
DNA replication, recombination, and repair].
Length = 328
Score = 28.5 bits (63), Expect = 5.5
Identities = 27/116 (23%), Positives = 46/116 (39%), Gaps = 20/116 (17%)
Query: 2 RKRRGGAGEKKKRIE-KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
RKR+ + E + +E K+ + +K + KEK++ K K K K+K +
Sbjct: 60 RKRKPRSNEPYRPVEPKKPSDSKKSGKSTKSKEKQE-------KITDKFKVKRKVDRFNG 112
Query: 61 KKKEEQSVKEIEESLKERKEKTLTEDSQCDTMIPVFKKGVL--YKGIYCPSLTNSF 114
+ E K + + L D +I G++ YKG + P N F
Sbjct: 113 VSEAELLTKTLPDILTF----------NLDIVIIGINPGLMAAYKGHHYPGPGNHF 158
>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein.
Length = 105
Score = 27.3 bits (61), Expect = 5.6
Identities = 11/30 (36%), Positives = 14/30 (46%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K +KK K K KKKK + VK+
Sbjct: 5 KQQKKAAKAAAASAGGKAKKKKWSKGKVKD 34
>gnl|CDD|113902 pfam05149, Flagellar_rod, Paraflagellar rod protein. This family
consists of several eukaryotic paraflagellar rod
component proteins. The eukaryotic flagellum represents
one of the most complex macromolecular structures found
in any organism and contains more than 250 proteins. In
addition to its locomotive role, the flagellum is
probably involved in nutrient uptake since receptors
for host low-density lipoproteins are localised on the
flagellar membrane as well as on the flagellar pocket
membrane.
Length = 289
Score = 28.5 bits (64), Expect = 5.6
Identities = 19/76 (25%), Positives = 42/76 (55%), Gaps = 4/76 (5%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E K R + EE E +E++E+ED + K +K+ ++++ +K ++ ++++ +
Sbjct: 9 EAKHRQKSAC--EEDLERCKEKREEEDAADAKQRKRYSAQRRESEKFLQQNVEQQQACWR 66
Query: 70 EIEESLKERKEKTLTE 85
I+E ER+ + L E
Sbjct: 67 AIQE--LERELRDLAE 80
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 28.1 bits (63), Expect = 5.6
Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 6/53 (11%)
Query: 20 KEEEKEENEQ-EEKEKEDVEE-----EKNKKKKKKKKKKKKKKKKKKKKKEEQ 66
K KEE ++ K D + + K++KK K++KK K +K+K E+
Sbjct: 69 KVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEE 121
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 28.6 bits (64), Expect = 5.7
Identities = 13/72 (18%), Positives = 36/72 (50%), Gaps = 1/72 (1%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
++ R EE+ + E +E+ E E+ + + K + + + + K++++ E+ + + +
Sbjct: 143 LRQEAMRRATEEEILEMRRETIEEEAELER-ENIRAKIEAEARGRAKEERENEDINREML 201
Query: 72 EESLKERKEKTL 83
+ E +E L
Sbjct: 202 KLKANEERETVL 213
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 28.3 bits (63), Expect = 5.7
Identities = 14/30 (46%), Positives = 22/30 (73%)
Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
K +EEEK KKK K++K+ KKK++ K++
Sbjct: 90 KNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 28.8 bits (65), Expect = 5.8
Identities = 22/72 (30%), Positives = 41/72 (56%), Gaps = 8/72 (11%)
Query: 14 RIEKRNKEEEKEENEQE-----EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE-EQS 67
IE+ ++EK+E +E EKE ++EEE +++ K +K + ++ K+E EQ
Sbjct: 418 EIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQIKEEIEQV 477
Query: 68 VKEIEESLKERK 79
E+E++ ER+
Sbjct: 478 RLELEQA--ERE 487
>gnl|CDD|216016 pfam00602, Flu_PB1, Influenza RNA-dependent RNA polymerase subunit
PB1. Two GTP binding sites exist in this protein.
Length = 740
Score = 28.7 bits (64), Expect = 5.9
Identities = 13/48 (27%), Positives = 26/48 (54%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQ 88
KN K++ K+ K K+K+ KE+ + E ++ T+T+D++
Sbjct: 186 KNNKRRYDDKETGKLLKRKQPSKEKDRITREEYLIRALTLNTMTKDAE 233
>gnl|CDD|212058 cd11489, SLC5sbd_SGLT5, Na(+)/glucose cotransporter SGLT5 and
related proteins; solute-binding domain. Human SGLT5 is
a glucose transporter, which also transports galactose.
It is encoded by the SLC5A10 gene, and is exclusively
expressed in the renal cortex. This subgroup belongs to
the solute carrier 5 (SLC5) transporter family.
Length = 604
Score = 28.8 bits (64), Expect = 5.9
Identities = 12/49 (24%), Positives = 24/49 (48%), Gaps = 3/49 (6%)
Query: 109 SLTNSFREPSLEMS---YQRYSHRQRQKSLIIVNLVDMVLKAVLTVAWI 154
SLT+ F S + ++R ++ L++V + V+ ++V WI
Sbjct: 364 SLTSIFNSSSTLFTMDIWRRLRPGASERELLLVGRLVTVILVGVSVVWI 412
>gnl|CDD|224520 COG1604, COG1604, CRISPR system related protein, RAMP superfamily
[Defense mechanisms].
Length = 257
Score = 28.2 bits (63), Expect = 5.9
Identities = 16/58 (27%), Positives = 24/58 (41%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKT 82
+E +K EEE K + + K K+ + EQ VK + +L K KT
Sbjct: 25 KEQIINLDKKNAFEEEIIKDFVNEIQSYLSKVLKRLGRSYEQYVKNTKITLNSFKLKT 82
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 28.5 bits (64), Expect = 6.0
Identities = 28/90 (31%), Positives = 47/90 (52%), Gaps = 15/90 (16%)
Query: 11 KKKRIEKRNKEEE---KEENEQEEKEKEDVEE-----------EKNKKKKKKKKKKKKKK 56
K K +E+ KEEE K + EK ED+EE E+ ++K+ + +K K
Sbjct: 92 KLKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQMEILEKALKS 151
Query: 57 KKKKKKKEEQSVKEIEESL-KERKEKTLTE 85
K K+E +S++ + ++L KE +E+T E
Sbjct: 152 YLKIVKEENKSLQRLAKALQKESEERTQDE 181
>gnl|CDD|227480 COG5151, SSL1, RNA polymerase II transcription
initiation/nucleotide excision repair factor TFIIH,
subunit SSL1 [Transcription / DNA replication,
recombination, and repair].
Length = 421
Score = 28.4 bits (63), Expect = 6.0
Identities = 13/55 (23%), Positives = 24/55 (43%)
Query: 29 QEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTL 83
Q+ + + E E +K + K + +K K EQ K + + + KE +L
Sbjct: 5 QKSFDSDKSESEDEQKNGRVKVRSRKTDDMNKGYSWEQEYKRSWDDVNDDKEGSL 59
>gnl|CDD|227621 COG5305, COG5305, Predicted membrane protein [Function unknown].
Length = 552
Score = 28.5 bits (64), Expect = 6.1
Identities = 19/123 (15%), Positives = 37/123 (30%), Gaps = 18/123 (14%)
Query: 171 WTVYTMMSNLAMCLLGWWRC-FANNYLQWAAVGTWILLNTQGWWRCFANNYLQWAAVGTW 229
T++S LL R + L + +L+ + + +
Sbjct: 173 AVATTLISAT--LLLRAIRLPTSRKLLPGWLIYALLLIL--SLLTHY--FFALTV-IAHG 225
Query: 230 ILLNTQGFISQGIGLNNKEYLIWYILFIVFVTYAMLPLPLKWCFIGGCTTSVLHVIITAQ 289
+ + + + +LIW++L A LP W + + V I
Sbjct: 226 VYV----LLWASLKNRQSLFLIWWLLATAAGLLAFLP----WLLVYF--ANSFRVGIPDL 275
Query: 290 IKL 292
I L
Sbjct: 276 ITL 278
>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
protein. This domain family is found in eukaryotes, and
is typically between 212 and 238 amino acids in length.
The family is found in association with pfam01805. There
are two completely conserved residues (W and H) that may
be functionally important. PRP21 is required for
assembly of the prespliceosome and it interacts with U2
snRNP and/or pre-mRNA in the prespliceosome. This family
also contains proteins similar to PRP21, such as the
mammalian SF3a. SF3a also interacts with U2 snRNP from
the prespliceosome, converting it to its active form.
Length = 230
Score = 28.2 bits (63), Expect = 6.2
Identities = 20/80 (25%), Positives = 32/80 (40%), Gaps = 11/80 (13%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK-------KKKKEEQSVKEIEESL 75
EE E+ E+E+ED++EEK ++ KK K+ K K +K+
Sbjct: 104 APEEEEEMEEEEEDIDEEKKEESAKKSTPPIKEMKIPAAGESRLKIRKDYVPS---GNKP 160
Query: 76 KERKEKTLTEDSQ-CDTMIP 94
+ K K +IP
Sbjct: 161 RAEKPKKKMIKCPITGELIP 180
Score = 27.4 bits (61), Expect = 9.5
Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 8/57 (14%)
Query: 19 NKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--------KKKKKKKKKEEQS 67
EEE+E E+EE E+ +EE KK K+ K K +K +
Sbjct: 104 APEEEEEMEEEEEDIDEEKKEESAKKSTPPIKEMKIPAAGESRLKIRKDYVPSGNKP 160
>gnl|CDD|236539 PRK09490, metH, B12-dependent methionine synthase; Provisional.
Length = 1229
Score = 28.6 bits (65), Expect = 6.2
Identities = 27/97 (27%), Positives = 43/97 (44%), Gaps = 22/97 (22%)
Query: 267 LPLKWCFIGGCTTSVLHVIITAQIKLH---RGPLSPCSVHEMGALCLVYIGINSAGMYTK 323
+PL IGG TTS H TA +K+ GP V+ A + G+ +
Sbjct: 833 IPL---LIGGATTSKAH---TA-VKIAPNYSGP----VVYVTDA-------SRAVGVVSS 874
Query: 324 YLTDRGQRRAFLETHRSTETRMKTQKENDQQEKLLLS 360
L+D QR A++ R+ +++ Q + K LL+
Sbjct: 875 LLSDE-QRDAYVAETRAEYEKVREQHARKKPRKPLLT 910
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 27.8 bits (62), Expect = 6.3
Identities = 14/45 (31%), Positives = 23/45 (51%), Gaps = 2/45 (4%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
KN+ K +K+ K+K +KKK I ++ K+KTL +
Sbjct: 6 KNRSSNYKVNRKRLKRKDRKKKINIIGNPIIAKNWD--KKKTLAQ 48
Score = 27.4 bits (61), Expect = 8.1
Identities = 8/24 (33%), Positives = 14/24 (58%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKE 64
K K + K +K+ K+K +KK+
Sbjct: 4 KKKNRSSNYKVNRKRLKRKDRKKK 27
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters
and repress their activity. by competing with the
binding of TATA binding protein. TMF1_TATA_bd is the
most conserved part of the TMFs. TMFs are
evolutionarily conserved golgins that bind Rab6, a
ubiquitous ras-like GTP-binding Golgi protein, and
contribute to Golgi organisation in animal and plant
cells. The Rab6-binding domain appears to be the same
region as this C-terminal family.
Length = 121
Score = 27.2 bits (61), Expect = 6.3
Identities = 17/73 (23%), Positives = 40/73 (54%), Gaps = 8/73 (10%)
Query: 18 RNKEEE----KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
R E E K+E + E E+++ +E K ++ ++ ++ KK+ ++ +KE + +++ E
Sbjct: 27 RRLEGELASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKELEDLEQRYE 86
Query: 74 S----LKERKEKT 82
+ L E+ E+
Sbjct: 87 TTLELLGEKSERV 99
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 28.7 bits (64), Expect = 6.4
Identities = 21/80 (26%), Positives = 39/80 (48%), Gaps = 1/80 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ-SV 68
++ +E R K E+ E + E + E EE K K K + K + +K+ + KE + S+
Sbjct: 107 DRNLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESI 166
Query: 69 KEIEESLKERKEKTLTEDSQ 88
I+ L E + + D++
Sbjct: 167 SRIKNDLSEMQCRAQNADTE 186
Score = 28.4 bits (63), Expect = 7.3
Identities = 19/86 (22%), Positives = 33/86 (38%), Gaps = 5/86 (5%)
Query: 9 GEKKKRIEKRNKEEEKEENEQEEKEKEDV---EEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
EK I K+ E + + + ++ V E K K + K ++ +++ KE
Sbjct: 356 KEKNGSISSSAKQLETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKER 415
Query: 66 QSVKEIEESLKERKEKTLTEDSQCDT 91
++ I S KE T T S
Sbjct: 416 DGLRAILNSY--DKELTETSVSGQLM 439
>gnl|CDD|223436 COG0359, RplI, Ribosomal protein L9 [Translation, ribosomal
structure and biogenesis].
Length = 148
Score = 27.6 bits (62), Expect = 6.4
Identities = 15/72 (20%), Positives = 30/72 (41%), Gaps = 5/72 (6%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQ---- 66
K + + E + + EK+ EE + K+K + K + K ++ +
Sbjct: 33 KGLAVPATKGNLKLLEARRAKLEKKAAEELAEAEALKEKLEGKTVEIAVKAGEDGKLFGS 92
Query: 67 -SVKEIEESLKE 77
+ K+I E+LK
Sbjct: 93 VTSKDIAEALKA 104
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 28.4 bits (64), Expect = 6.5
Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 3/69 (4%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
R KE+ K+ QE K EE ++ ++ + K + KK + E + E E+ L+
Sbjct: 207 DRAKEKLKKL-LQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTE--IAEAEKKLE 263
Query: 77 ERKEKTLTE 85
+ + T E
Sbjct: 264 QCRGFTFKE 272
>gnl|CDD|190741 pfam03763, Remorin_C, Remorin, C-terminal region. Remorins are
plant-specific plasma membrane-associated proteins. In
tobacco remorin co-purifies with lipid rafts. Most
remorins have a variable, proline-rich C-half and a
more conserved N-half that is predicted to form coiled
coils. Consistent with this, circular dichroism studies
have demonstrated that much of the protein is
alpha-helical. Remorins exist in plasma membrane
preparations as oligomeric structures and form
filaments in vitro. The proteins can bind polyanions
including the extracellular matrix component
oligogalacturonic acid (OGA). In vitro, remorin in
plasma membrane preparations is phosphorylated
(principally on threonine residues) in the presence of
OGA and thus co-purifies with a protein kinases(s). The
biological functions of remorins are unknown but roles
as components of the membrane/cytoskeleton are
possible.
Length = 112
Score = 26.9 bits (60), Expect = 6.7
Identities = 17/55 (30%), Positives = 28/55 (50%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKE 64
+K +I + + EE + E K+K E E K ++K +KKK + +K K K
Sbjct: 16 AEKAKINNKYQREEAKIQAWENKKKAKAEAELKKIERKLEKKKAEAAEKLKNKLA 70
>gnl|CDD|212164 cd11650, AT4G37440_like, Uncharacterized protein domain conserved
in plants. This domain contains an extensive protein
sequence fragment that appears conserved in a number of
plant proteins, including the gene product of
Arabidopsis thaliana locus AT4G37440, which has been
identified in transcriptional profiling as expressed at
different levels in white cabbage cultivars.
Length = 253
Score = 28.2 bits (63), Expect = 6.7
Identities = 13/50 (26%), Positives = 27/50 (54%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
++ E+E+ Q E K + E K+ ++ ++K K++K+K+ EE
Sbjct: 114 DRELALIEQEKELQLEGTKAEGENSKSAPFSGERHERKVMKRRKRKRVEE 163
>gnl|CDD|218871 pfam06046, Sec6, Exocyst complex component Sec6. Sec6 is a
component of the multiprotein exocyst complex. Sec6
interacts with Sec8, Sec10 and Exo70.These exocyst
proteins localise to regions of active exocytosis-at
the growing ends of interphase cells and in the medial
region of cells undergoing cytokinesis-in an
F-actin-dependent and exocytosis- independent manner.
Length = 557
Score = 28.5 bits (64), Expect = 6.8
Identities = 16/59 (27%), Positives = 30/59 (50%), Gaps = 3/59 (5%)
Query: 28 EQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK--KKKEEQSVKE-IEESLKERKEKTL 83
E+EEKE E E + K + + + + + K K +++E +EE +E +E+ L
Sbjct: 41 EREEKEDEKAEALQEATKDENNDDAGRFRGRPRGYKDKFLDALEESVEERFEECRERFL 99
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 27.2 bits (61), Expect = 6.9
Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 2/34 (5%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK 49
+R ++EEK+ EE+ K EE K ++++KKK
Sbjct: 95 AQRARDEEKK--LDEEEAKRQHEEAKEREREKKK 126
Score = 26.9 bits (60), Expect = 9.7
Identities = 10/37 (27%), Positives = 27/37 (72%)
Query: 27 NEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
+E+ E ++ EE+K +++ K++ ++ K+++++KKK
Sbjct: 90 DERAEAQRARDEEKKLDEEEAKRQHEEAKEREREKKK 126
>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
Length = 132
Score = 27.3 bits (61), Expect = 7.0
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 37 VEEEKNKKKKKKKKKKKKKKKKKK 60
+E KN++KK + K+K K KKK
Sbjct: 109 RKELKNRRKKVRGKEKTKVSGKKK 132
Score = 26.9 bits (60), Expect = 8.7
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKK 63
K K ++KK + K+K K KKK
Sbjct: 110 KELKNRRKKVRGKEKTKVSGKKK 132
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 28.1 bits (63), Expect = 7.0
Identities = 12/37 (32%), Positives = 25/37 (67%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK 60
KE +QEEK+++ E++ + + KK +K+ +K++ K
Sbjct: 155 KEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
Score = 28.1 bits (63), Expect = 7.1
Identities = 12/40 (30%), Positives = 29/40 (72%)
Query: 42 NKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEK 81
K K+ +K+++K+++ +KK + +S K+IE+ +++R+ K
Sbjct: 152 RKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 145
Score = 27.4 bits (61), Expect = 7.0
Identities = 7/37 (18%), Positives = 16/37 (43%)
Query: 17 KRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKK 53
++N E+ E + + KK+++ +KK
Sbjct: 104 RKNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKK 140
>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase. This domain family is found in
eukaryotes, and is approximately 140 amino acids in
length. The family is found in association with
pfam00069. Polo-like kinase 1 (Plx1) is essential during
mitosis for the activation of Cdc25C, for spindle
assembly, and for cyclin B degradation. This family is
Polo kinase kinase (PKK) which phosphorylates Polo
kinase and Polo-like kinase to activate them. PKK is a
serine/threonine kinase.
Length = 142
Score = 27.3 bits (61), Expect = 7.0
Identities = 18/81 (22%), Positives = 44/81 (54%), Gaps = 1/81 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
E+++ E R ++ EQ+ + K E K +KK+ K++ +K + ++++KK ++ K
Sbjct: 43 EQRQTQELRR-LPKRIRAEQKTRLKMFKESLKIEKKELKQEVEKLPRFQEQEKKRMKAEK 101
Query: 70 EIEESLKERKEKTLTEDSQCD 90
E +E +++E+ + +
Sbjct: 102 EEQEQKHQKQEREFLAKQEEN 122
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 28.7 bits (64), Expect = 7.1
Identities = 16/48 (33%), Positives = 29/48 (60%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKK 63
E + K+EEK+E + EE + E+V+EE +++K + +K + KK
Sbjct: 278 ELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKK 325
>gnl|CDD|113815 pfam05058, ActA, ActA Protein. The ActA family is found in
Listeria and is associated with motility. ActA protein
acts as a scaffold to assemble and activate host cell
actin cytoskeletal factors at the bacterial surface,
resulting in directional actin polymerisation and
propulsion of the bacterium through the cytoplasm of the
host cell.
Length = 601
Score = 28.3 bits (62), Expect = 7.1
Identities = 17/57 (29%), Positives = 31/57 (54%), Gaps = 3/57 (5%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK---KKKKKKKKKEE 65
+K +R+KEE K + E++ E+ + + N KK+ ++ K K + +K KEE
Sbjct: 513 QKEATERDKEEMKPQTEEKMVEESEPANDANGKKRSAGIEEGKLIAKSAEDEKAKEE 569
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 27.2 bits (60), Expect = 7.1
Identities = 9/17 (52%), Positives = 14/17 (82%)
Query: 20 KEEEKEENEQEEKEKED 36
E +KEE ++EE+E+ED
Sbjct: 88 AEAKKEEKKEEEEEEED 104
>gnl|CDD|148820 pfam07432, Hc1, Histone H1-like protein Hc1. This family
consists of several bacterial histone H1-like Hc1
proteins. In Chlamydia, Hc1 is expressed in the late
stages of the life cycle, concomitant with the
reorganisation of chlamydial reticulate bodies into
elementary bodies. This suggests that Hc1 protein plays
a role in the condensation of chromatin during
intracellular differentiation.
Length = 123
Score = 27.0 bits (59), Expect = 7.2
Identities = 14/37 (37%), Positives = 19/37 (51%)
Query: 34 KEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
KE ++ EK+ KKK KK K K KK + K+
Sbjct: 50 KESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKK 86
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 28.3 bits (64), Expect = 7.2
Identities = 11/37 (29%), Positives = 23/37 (62%), Gaps = 5/37 (13%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKE-----DVEEEK 41
E+++R+E+ E +EE E++E++K + E+ K
Sbjct: 65 EQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEKGK 101
>gnl|CDD|222559 pfam14126, DUF4293, Domain of unknown function (DUF4293). This
family of integral membrane proteins is functionally
uncharacterized. This family of proteins is found in
bacteria. Proteins in this family are typically between
136 and 154 amino acids in length.
Length = 111
Score = 27.1 bits (61), Expect = 7.3
Identities = 5/31 (16%), Positives = 13/31 (41%)
Query: 126 YSHRQRQKSLIIVNLVDMVLKAVLTVAWISY 156
+ +R+ Q L +N++ + L +
Sbjct: 36 FKNRKLQIRLCRLNILLNLGLLGLFAYFFLN 66
>gnl|CDD|217930 pfam04152, Mre11_DNA_bind, Mre11 DNA-binding presumed domain. The
Mre11 complex is a multi-subunit nuclease that is
composed of Mre11, Rad50 and Nbs1/Xrs2, and is involved
in checkpoint signalling and DNA replication. Mre11 has
an intrinsic DNA-binding activity that is stimulated by
Rad50 on its own or in combination with Nbs1.
Length = 166
Score = 27.5 bits (62), Expect = 7.3
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 47 KKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTED 86
KKKK+K K KK ++ ++ E L K + L ++
Sbjct: 105 YKKKKRKTKTKKTEEDDPDEEELLPPEELDTLKVEDLVKE 144
>gnl|CDD|224811 COG1899, DYS1, Deoxyhypusine synthase [Posttranslational
modification, protein turnover, chaperones].
Length = 318
Score = 28.0 bits (63), Expect = 7.4
Identities = 12/41 (29%), Positives = 24/41 (58%), Gaps = 7/41 (17%)
Query: 105 IYCPSLTNSFREPSLEMSYQRYSHRQRQKSLIIVNLV-DMV 144
I+CP++T+S S+ + HR+ Q S + +++V D+
Sbjct: 187 IFCPAITDS----SIGD--MLWFHREEQGSDLKIDIVEDVH 221
>gnl|CDD|223624 COG0550, TopA, Topoisomerase IA [DNA replication, recombination,
and repair].
Length = 570
Score = 28.3 bits (64), Expect = 7.4
Identities = 17/52 (32%), Positives = 24/52 (46%), Gaps = 1/52 (1%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
R E E ++ E K+K++ EE NK K K K +KK KK+
Sbjct: 205 GGGEFSARLTEIEGKK-EGRLKDKDEAEEIVNKLKGKPAKVVSVEKKPKKRS 255
Score = 28.0 bits (63), Expect = 9.0
Identities = 11/42 (26%), Positives = 21/42 (50%)
Query: 38 EEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
E + KK+ + K K + ++ K K + V +E+ K+R
Sbjct: 214 TEIEGKKEGRLKDKDEAEEIVNKLKGKPAKVVSVEKKPKKRS 255
>gnl|CDD|232988 TIGR00468, pheS, phenylalanyl-tRNA synthetase, alpha subunit.
Most phenylalanyl-tRNA synthetases are heterodimeric,
with 2 alpha (pheS) and 2 beta (pheT) subunits. This
model describes the alpha subunit, which shows some
similarity to class II aminoacyl-tRNA ligases.
Mitochondrial phenylalanyl-tRNA synthetase is a single
polypeptide chain, active as a monomer, and similar to
this chain rather than to the beta chain, but excluded
from this model. An interesting feature of the
alignment of all sequences captured by this model is a
deep split between non-spirochete bacterial examples
and all other examples; supporting this split is a
relative deletion of about 50 residues in the former
set between two motifs well conserved throughout the
alignment [Protein synthesis, tRNA aminoacylation].
Length = 293
Score = 28.0 bits (63), Expect = 7.5
Identities = 13/53 (24%), Positives = 23/53 (43%), Gaps = 1/53 (1%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKE-KEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
+ K++ K KEE K E K ++++E K K + + K K +
Sbjct: 4 DLLKQLGKLTKEETKPALGALINEVKIELQDELTKLKPELESAGLWSKLKFET 56
>gnl|CDD|187810 cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Multidomain protein with permuted HD nuclease domain,
palm domain and Zn-ribbon; MTH326-like has inactivated
polymerase catalytic domain; alr1562 and slr7011 -
predicted only on the basis of size, presence of HD
domain, and location with RAMPs in one operon; signature
gene for type III; also known as Crm2 family.
Length = 475
Score = 28.2 bits (63), Expect = 7.5
Identities = 12/57 (21%), Positives = 25/57 (43%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKK 58
K + ++ + KRN+EE + + E +E ++ E++ + K KK
Sbjct: 270 LKEKLVLIDEVIKRSKRNREEPRLKYENFREELRELLEKEPEIVYLFIKAKKNPGPY 326
>gnl|CDD|153342 cd07658, F-BAR_NOSTRIN, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase
TRaffic INducer (NOSTRIN). F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Nitric Oxide Synthase TRaffic
INducer (NOSTRIN) is expressed in endothelial and
epithelial cells and is involved in the regulation,
trafficking and targeting of endothelial NOS (eNOS).
NOSTRIN facilitates the endocytosis of eNOS by
coordinating the functions of dynamin and the
Wiskott-Aldrich syndrome protein (WASP). Increased
expression of NOSTRIN may be correlated to preeclampsia.
NOSTRIN contains an N-terminal F-BAR domain and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
The F-BAR domain of NOSTRIN is necessary and sufficient
for its membrane association and is responsible for its
subcellular localization.
Length = 239
Score = 27.7 bits (62), Expect = 7.5
Identities = 17/51 (33%), Positives = 28/51 (54%)
Query: 26 ENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLK 76
ENE+ + + ED ++ K+K K KK + + K+ +K E K+ EES
Sbjct: 138 ENEKLQDQVEDNKQSCTKQKMLNKLKKSAEVQDKEDEKLEAKRKKGEESRL 188
>gnl|CDD|100810 PRK01194, PRK01194, V-type ATP synthase subunit E; Provisional.
Length = 185
Score = 27.6 bits (61), Expect = 7.5
Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 3/77 (3%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKK---KKKKKKKKKKKKKKKKKKKEEQSVK 69
K IEK +E++KE N++ K E +E+E + K K+ +KK + + + KK
Sbjct: 8 KDIEKSREEKKKEINDEYSKRIEKLEKECDSKIQSIKEYYEKKMRAEISRLKKSIIDKAN 67
Query: 70 EIEESLKERKEKTLTED 86
S+K K + + +D
Sbjct: 68 IEARSIKREKRREILKD 84
>gnl|CDD|217636 pfam03606, DcuC, C4-dicarboxylate anaerobic carrier.
Length = 456
Score = 28.0 bits (63), Expect = 7.6
Identities = 7/34 (20%), Positives = 17/34 (50%)
Query: 13 KRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKK 46
++I+ EK+E+++ +K+ E + K
Sbjct: 216 QKIKDDPVYSEKDESDEHFMDKQSEVTEIFSRPK 249
>gnl|CDD|216599 pfam01603, B56, Protein phosphatase 2A regulatory B subunit (B56
family). Protein phosphatase 2A (PP2A) is a major
intracellular protein phosphatase that regulates
multiple aspects of cell growth and metabolism. The
ability of this widely distributed heterotrimeric enzyme
to act on a diverse array of substrates is largely
controlled by the nature of its regulatory B subunit.
There are multiple families of B subunits (See also
pfam01240), this family is called the B56 family.
Length = 405
Score = 28.0 bits (63), Expect = 7.7
Identities = 13/28 (46%), Positives = 20/28 (71%)
Query: 35 EDVEEEKNKKKKKKKKKKKKKKKKKKKK 62
D+ +E K K+K+ K+KKK+KK+KK
Sbjct: 369 PDLFDECGNKYKEKEAKEKKKEKKRKKT 396
>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein. A number of members of
this family are annotated as being transmembrane
proteins induced by tumour necrosis factor alpha, but
no literature was found to support this.
Length = 330
Score = 28.2 bits (63), Expect = 7.7
Identities = 18/58 (31%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
Query: 24 KEENEQEEKEKEDVEEEKNKKKKKKKKKKK--KKKKKKKKKKEEQSVKEIEESLKERK 79
K++ E+ EK +E +++K+ K+ KK KK ++ + V+++EE +KERK
Sbjct: 24 KQKLEEVEKLQEQCTSAIARQRKRLKQLIVSLKKLKKSLTPEDSELVEQLEEQIKERK 81
>gnl|CDD|119001 pfam10481, Cenp-F_N, Cenp-F N-terminal domain. Mitosin or
centromere-associated protein-F (Cenp-F) is found bound
across the centromere as one of the proteins of the
outer layer of the kinetochore. Most of the
kinetochore/centromere functions appear to depend upon
binding of the C-terminal par to f the molecule,
whereas the N-terminal part, here, may be a cytoplasmic
player in controlling the function of microtubules and
dynein.
Length = 288
Score = 27.8 bits (61), Expect = 7.7
Identities = 17/61 (27%), Positives = 36/61 (59%), Gaps = 3/61 (4%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
++ K+E +Q + + + +E +K+K+K + +K + K+E Q + EI E L++ K+
Sbjct: 28 DKLKKEKQQRQFQLDSLEA---ALQKQKQKVENEKTEGANLKRENQRLMEICEHLEKTKQ 84
Query: 81 K 81
K
Sbjct: 85 K 85
>gnl|CDD|235751 PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated.
Length = 871
Score = 28.3 bits (64), Expect = 7.8
Identities = 9/37 (24%), Positives = 18/37 (48%)
Query: 23 EKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKK 59
E+ E EKE+E + + ++K K+ K+ +
Sbjct: 625 EQGRQEALEKEEELLSRLQQLPDGEQKAKETKRMISR 661
>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive type)
[DNA replication, recombination, and repair].
Length = 1444
Score = 28.4 bits (64), Expect = 7.8
Identities = 11/35 (31%), Positives = 17/35 (48%)
Query: 44 KKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
K K ++ K+K K K+K +V EI + R
Sbjct: 1321 KAKMEEINKRKGNKASPKEKNLLTVLEIVLEMLAR 1355
Score = 28.1 bits (63), Expect = 9.0
Identities = 14/64 (21%), Positives = 28/64 (43%), Gaps = 2/64 (3%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKK--KKKKKKKKKEEQSVK 69
+ E EE E ++ ++ E+K K + K +K K +K +K K + +K
Sbjct: 171 SEEQEFEKFEEAINEEVEKAAQEALEAEKKLKAESPKVEKPKPLFDGQKGRKIKSTEEIK 230
Query: 70 EIEE 73
+ +
Sbjct: 231 PLIK 234
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 28.1 bits (63), Expect = 7.9
Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 1/54 (1%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
K E EE + K ++K KK+KK K+ E+ K +E +K++ K E
Sbjct: 161 KTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEA-IKKKDIKNPLE 213
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil
motif at the N terminus region. Yeast cells with mutant
Atg14 are defective not only in autophagy but also in
sorting of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically
required for the function of Apg6/Vps30p through the
autophagic pathway. There are 17 auto-phagosomal
component proteins which are categorized into six
functional units, one of which is the AS-PI3K complex
(Vps30/Atg6 and Atg14). The AS-PI3K complex and the
Atg2-Atg18 complex are essential for nucleation, and
the specific function of the AS-PI3K apparently is to
produce phosphatidylinositol 3-phosphate (PtdIns(3)P)
at the pre-autophagosomal structure (PAS). The
localisation of this complex at the PAS is controlled
by Atg14. Autophagy mediates the cellular response to
nutrient deprivation, protein aggregation, and pathogen
invasion in humans, and malfunction of autophagy has
been implicated in multiple human diseases including
cancer. This effect seems to be mediated through direct
interaction of the human Atg14 with Beclin 1 in the
human phosphatidylinositol 3-kinase class III complex.
Length = 307
Score = 28.1 bits (63), Expect = 8.0
Identities = 15/74 (20%), Positives = 34/74 (45%), Gaps = 4/74 (5%)
Query: 12 KKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKK----KKKEEQS 67
+ R++ E EE +Q+ +E + ++ K K + +KK++ + + Q
Sbjct: 24 ELRLDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQL 83
Query: 68 VKEIEESLKERKEK 81
+EIE+ + +E
Sbjct: 84 KEEIEQKRERIEEL 97
>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex. Med21 has been
known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
Drosophila. The heterodimer of the two subunits Med7 and
Med21 appears to act as a hinge between the middle and
the tail regions of Mediator.
Length = 132
Score = 27.2 bits (61), Expect = 8.0
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 25 EENEQEEKEKEDVEEEKNKKKKK 47
EE + E E+E+ +EK K KK
Sbjct: 99 EELREVEAEREEAVKEKEKLLKK 121
>gnl|CDD|177283 PHA00451, PHA00451, protein kinase.
Length = 362
Score = 28.2 bits (63), Expect = 8.1
Identities = 11/49 (22%), Positives = 22/49 (44%)
Query: 32 KEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKE 80
K K D +K K++K+ +K + + +E + +E + R E
Sbjct: 253 KAKRDPNGTFQVARKAAMKRRKRNRKLRARNAKELAAMRMEANQIRRNE 301
>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 199 to 238 amino acids in length. This domain
is found associated with pfam06512, pfam00520. This
domain has a conserved ADD sequence motif.
Length = 222
Score = 27.8 bits (62), Expect = 8.2
Identities = 6/33 (18%), Positives = 15/33 (45%)
Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKER 78
+ K+++ + K KK++ S + E +
Sbjct: 9 ESAKERRNRNDKNKKEEHSIGSEEGDSEKEPKS 41
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 28.2 bits (63), Expect = 8.2
Identities = 14/72 (19%), Positives = 32/72 (44%), Gaps = 4/72 (5%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK----KKEEQSVKEIEESLK 76
EE E + + + V EE +++ ++ +++ K KK+ E +V +++ +
Sbjct: 287 EEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENNVAKLQALVV 346
Query: 77 ERKEKTLTEDSQ 88
E+ L Q
Sbjct: 347 ASSERLLELAQQ 358
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 28.1 bits (62), Expect = 8.3
Identities = 22/84 (26%), Positives = 42/84 (50%), Gaps = 2/84 (2%)
Query: 2 RKRRGGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKK 61
R R AG + + +++ EK E E+++ E E K +K ++ K+++K + +
Sbjct: 559 RSRADSAGARNE-VDRLLDRLEKAEQERDDTEMEAGRLAKELEKAQRHLTKQQEKTEATR 617
Query: 62 KKEEQSVKEIEESLKERKEKTLTE 85
+ E+ E+ E ER EK+ E
Sbjct: 618 IEFERKSAELLEEA-ERLEKSEAE 640
>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
repair].
Length = 480
Score = 28.2 bits (63), Expect = 8.3
Identities = 18/73 (24%), Positives = 31/73 (42%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
K K+EE+ + EKE E E+ K K ++ +K + +K ++ S E E
Sbjct: 276 YKLQKKEEQTLRTRIEKELEKAEKSLEKLKGREFNCEKDARIAAEKILKDYSSVEFLEVD 335
Query: 76 KERKEKTLTEDSQ 88
+ K+K
Sbjct: 336 FKSKKKREEAKRG 348
>gnl|CDD|227481 COG5152, COG5152, Uncharacterized conserved protein, contains RING
and CCCH-type Zn-fingers [General function prediction
only].
Length = 259
Score = 27.7 bits (61), Expect = 8.4
Identities = 22/95 (23%), Positives = 44/95 (46%), Gaps = 8/95 (8%)
Query: 2 RKRRGGAGEK-KKRIEKRNKEEEKEENEQEEKEKED---VEEEKNKKKKKKKKKKKKKKK 57
+ + EK +K+ +K N EEK EEK D + + N + + + + K +
Sbjct: 6 LVNKSSSDEKNQKKRQKINFSEEKLVASDEEKGSSDLMSLAKSGNSRTLQLSHENEGKLQ 65
Query: 58 KKKKKKEEQSVKEIEESLKE----RKEKTLTEDSQ 88
KK + ++ ++ ++S KE + K L E ++
Sbjct: 66 KKGEDLDKYTLTVNDDSTKEDLLNFERKELAEKAK 100
>gnl|CDD|215581 PLN03109, PLN03109, ETHYLENE-INSENSITIVE3-like3 protein;
Provisional.
Length = 599
Score = 28.3 bits (63), Expect = 8.5
Identities = 16/50 (32%), Positives = 29/50 (58%)
Query: 30 EEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERK 79
EE E ED+E K + K K+ K+++KK ++ E+ K+I + + +K
Sbjct: 35 EEIEAEDLERRMWKDRIKLKRIKERQKKLQQAALEKSKPKKISDQARRKK 84
>gnl|CDD|240416 PTZ00432, PTZ00432, falcilysin; Provisional.
Length = 1119
Score = 28.2 bits (63), Expect = 8.5
Identities = 16/69 (23%), Positives = 33/69 (47%), Gaps = 5/69 (7%)
Query: 21 EEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES-----L 75
E K E E + K++++E + K++ + +K +K KK++E E +S L
Sbjct: 575 ESSKYEKEFNKLVKDELKERLSHLTKEQVDEMEKAYEKFKKEREADDDPEHLDSFPILSL 634
Query: 76 KERKEKTLT 84
+ ++T
Sbjct: 635 SDLNKETEE 643
>gnl|CDD|218585 pfam05424, Duffy_binding, Duffy binding domain. This domain is
found in Plasmodium Duffy binding proteins. Plasmodium
vivax and Plasmodium knowlesi merozoites invade human
erythrocytes that express Duffy blood group surface
determinants. The Duffy receptor family is localised in
micronemes, an organelle found in all organisms of the
phylum Apicomplexa.
Length = 276
Score = 27.7 bits (62), Expect = 8.6
Identities = 22/71 (30%), Positives = 40/71 (56%), Gaps = 7/71 (9%)
Query: 10 EKKKRIEK-----RNKEEEKEENEQEEKEKEDVEEEKNKKKKKKK--KKKKKKKKKKKKK 62
E+KK++EK + K+ K +++ K K ++ K KKK K+KKK KK KK+
Sbjct: 200 ERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKKR 259
Query: 63 KEEQSVKEIEE 73
+++ +I++
Sbjct: 260 NNKKNYSDIKD 270
>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ. Members of
this family are the FliJ protein found, in nearly every
case, in the midst of other flagellar biosynthesis genes
in bacgterial genomes. Typically the fliJ gene is found
adjacent to the gene for the flagellum-specific ATPase
FliI. Sequence scoring in the gray zone between trusted
and noise cutoffs include both probable FliJ proteins
and components of bacterial type III secretion systems.
Length = 141
Score = 27.3 bits (61), Expect = 8.7
Identities = 17/66 (25%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 IEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEES 74
I + ++ ++++ E ++E VE ++ + + +++ K +K K+KK+KE ++ EE+
Sbjct: 70 IRQLDQRIQQQQQELALLQQE-VEAKRERLLEARRELKALEKLKEKKQKEYRA----EEA 124
Query: 75 LKERKE 80
+E+KE
Sbjct: 125 KREQKE 130
>gnl|CDD|218140 pfam04547, Anoctamin, Calcium-activated chloride channel. The
family carries eight putative transmembrane domains,
and, although it has no similarity to other known
channel proteins, it is clearly a calcium-activated
ionic channel. It is expressed in various secretory
epithelia, the retina and sensory neurons, and mediates
receptor-activated chloride currents in diverse
physiological processes.
Length = 449
Score = 27.9 bits (63), Expect = 8.7
Identities = 16/30 (53%), Positives = 23/30 (76%)
Query: 44 KKKKKKKKKKKKKKKKKKKKEEQSVKEIEE 73
KKK +KKK KKKK+K+KK +E + ++ IE
Sbjct: 270 KKKIRKKKIKKKKEKEKKPEENEILERIEN 299
>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1. This model
describes peptide chain release factor 1 (PrfA, RF-1),
and excludes the related peptide chain release factor 2
(PrfB, RF-2). RF-1 helps recognize and terminate
translation at UAA and UAG stop codons. The
mitochondrial release factors are prfA-like, although
not included above the trusted cutoff for this model.
RF-1 does not have a translational frameshift [Protein
synthesis, Translation factors].
Length = 360
Score = 28.1 bits (63), Expect = 8.8
Identities = 14/80 (17%), Positives = 39/80 (48%), Gaps = 2/80 (2%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKE 70
K +EK E+ E + +V +++K +K K+ + ++ ++ +Q+ ++
Sbjct: 2 KPSLLEKLESLLERYEELEALLSDPEVISDQDKLRKLSKEYSQLEEIVDCYREYQQAQED 61
Query: 71 IEES--LKERKEKTLTEDSQ 88
I+E+ + E + + E ++
Sbjct: 62 IKEAKEILEESDPEMREMAK 81
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 27.9 bits (62), Expect = 8.9
Identities = 15/71 (21%), Positives = 32/71 (45%)
Query: 16 EKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESL 75
+ N+E + E + +D E + K+K +++ ++ K K + E + +E
Sbjct: 212 DGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYE 271
Query: 76 KERKEKTLTED 86
+E E+ ED
Sbjct: 272 EEAGEEQENED 282
>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein. This family
consists of the p50 and variable adherence-associated
antigen (Vaa) adhesins from Mycoplasma hominis. M.
hominis is a mycoplasma associated with human urogenital
diseases, pneumonia, and septic arthritis. An adhesin is
a cell surface molecule that mediates adhesion to other
cells or to the surrounding surface or substrate. The
Vaa antigen is a 50-kDa surface lipoprotein that has
four tandem repetitive DNA sequences encoding a periodic
peptide structure, and is highly immunogenic in the
human host. p50 is also a 50-kDa lipoprotein, having
three repeats A,B and C, that may be a tetramer of
191-kDa in its native environment.
Length = 353
Score = 27.7 bits (61), Expect = 9.1
Identities = 22/91 (24%), Positives = 41/91 (45%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A +KK I+ +N + KE E+ K + ++EE K + + K K + +K+ +
Sbjct: 102 AKNEKKAIDDKNAQIAKELAERNAKIQSNIEELKKINNEAFELSKTVNKTIAEVEKKFKI 161
Query: 68 VKEIEESLKERKEKTLTEDSQCDTMIPVFKK 98
K+ +E L+ + L + Q D V
Sbjct: 162 PKDFKEQLENFADDLLDKSRQIDEFTTVTST 192
>gnl|CDD|233294 TIGR01159, DRP1, density-regulated protein DRP1. This protein
family shows weak but suggestive similarity to
translation initiation factor SUI1 and its prokaryotic
homologs.
Length = 173
Score = 27.5 bits (61), Expect = 9.1
Identities = 12/57 (21%), Positives = 26/57 (45%), Gaps = 5/57 (8%)
Query: 20 KEEEKEENEQE-----EKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEI 71
K + E E ++ + EK+ +K ++ ++ KKKKK ++ ++K
Sbjct: 40 KLYGTDSPTAEAEAVTIGEAQEEKGEKDLLGIQKAQEAREGGKKKKKLPQKVTIKRE 96
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 28.0 bits (63), Expect = 9.2
Identities = 10/64 (15%), Positives = 30/64 (46%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQSVK 69
+ I + E +E + ++ + EK K+ ++++ + + + K++ ++ +
Sbjct: 4 DALDEILGDLESYEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLRE 63
Query: 70 EIEE 73
EIE
Sbjct: 64 EIER 67
>gnl|CDD|240247 PTZ00055, PTZ00055, glutathione synthetase; Provisional.
Length = 619
Score = 28.2 bits (63), Expect = 9.2
Identities = 8/40 (20%), Positives = 16/40 (40%), Gaps = 1/40 (2%)
Query: 246 NKEYLIWYILFIVFVTYAMLPLPL-KWCFIGGCTTSVLHV 284
+ L + V++ + PLP + C ++L V
Sbjct: 64 KAKQLEANDGILKMVSFVLFPLPFPRKLLEDCCLCTLLLV 103
>gnl|CDD|153366 cd07682, F-BAR_srGAP2, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Protein 2. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs. srGAP2 is expressed in zones
of neuronal differentiation. It plays a role in the
regeneration of neurons and axons. srGAP2 contains an
N-terminal F-BAR domain, a Rho GAP domain, and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
Length = 263
Score = 27.7 bits (61), Expect = 9.4
Identities = 18/74 (24%), Positives = 35/74 (47%)
Query: 8 AGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEEQS 67
A K K EK+ +++ QE+++ + + ++K ++ KK +K KE++
Sbjct: 152 AQSKLKEAEKQEEKQMSRSVRQEDRQTPRSPDSTTNIRIEEKHVRRSSVKKIEKMKEKRQ 211
Query: 68 VKEIEESLKERKEK 81
K E LK K +
Sbjct: 212 AKYTENKLKAIKAR 225
>gnl|CDD|235860 PRK06778, PRK06778, hypothetical protein; Validated.
Length = 289
Score = 27.6 bits (61), Expect = 9.5
Identities = 10/48 (20%), Positives = 19/48 (39%)
Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDSQCDTMI 93
K K ++ E+ +++ E K+K+ TE + T I
Sbjct: 74 KISPSHPPKPATVAVPEETEKKARDVNEKTALLKKKSATELGELATSI 121
>gnl|CDD|176058 cd08676, C2A_Munc13-like, C2 domain first repeat in Munc13
(mammalian uncoordinated)-like proteins. C2-like
domains are thought to be involved in phospholipid
binding in a Ca2+ independent manner in both Unc13 and
Munc13. Caenorabditis elegans Unc13 has a central domain
with sequence similarity to PKC, which includes C1 and
C2-related domains. Unc13 binds phorbol esters and DAG
with high affinity in a phospholipid manner. Mutations
in Unc13 results in abnormal neuronal connections and
impairment in cholinergic neurotransmission in the
nematode. Munc13 is the mammalian homolog which are
expressed in the brain. There are 3 isoforms (Munc13-1,
-2, -3) and are thought to play a role in
neurotransmitter release and are hypothesized to be
high-affinity receptors for phorbol esters. Unc13 and
Munc13 contain both C1 and C2 domains. There are two C2
related domains present, one central and one at the
carboxyl end. Munc13-1 contains a third C2-like domain.
Munc13 interacts with syntaxin, synaptobrevin, and
synaptotagmin suggesting a role for these as scaffolding
proteins. C2 domains fold into an 8-standed
beta-sandwich that can adopt 2 structural arrangements:
Type I and Type II, distinguished by a circular
permutation involving their N- and C-terminal beta
strands. Many C2 domains are Ca2+-dependent
membrane-targeting modules that bind a wide variety of
substances including bind phospholipids, inositol
polyphosphates, and intracellular proteins. Most C2
domain proteins are either signal transduction enzymes
that contain a single C2 domain, such as protein kinase
C, or membrane trafficking proteins which contain at
least two C2 domains, such as synaptotagmin 1. However,
there are a few exceptions to this including RIM
isoforms and some splice variants of piccolo/aczonin and
intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. This cd contains the second C2
repeat, C2B, and has a type-II topology.
Length = 153
Score = 27.0 bits (60), Expect = 9.6
Identities = 13/46 (28%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 39 EEKNKKKKKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLT 84
++++ +K KK+K +KK K+ K I+ + E K +TL
Sbjct: 58 VPASRERNSEKSKKRKSHRKKAVLKDTVPAKSIKVT--EVKPQTLN 101
>gnl|CDD|221525 pfam12314, IMCp, Inner membrane complex protein. This domain is
found in bacteria and eukaryotes. This domain is about
120 amino acids in length. This family is the inner
membrane complex of parasitic organisms. This is a
cytoskeletal structure associated with the pellicle of
these parasites.
Length = 98
Score = 26.5 bits (59), Expect = 9.8
Identities = 17/76 (22%), Positives = 28/76 (36%), Gaps = 13/76 (17%)
Query: 11 KKKRIEKRNKEEEKEENEQEEKEKE-----------DVEE--EKNKKKKKKKKKKKKKKK 57
K I+++ KE K + EK E +V E K K K + K++
Sbjct: 1 PKPVIQEKIKEVPKYIVKYVEKIVEVPEVKYVDKIVEVPEYVYKEKIIYVPKVEIKERII 60
Query: 58 KKKKKKEEQSVKEIEE 73
K ++ K +E
Sbjct: 61 PVPKIVTKEIEKYVEV 76
>gnl|CDD|150240 pfam09495, DUF2462, Protein of unknown function (DUF2462). This
protein is highly conserved, but its function is
unknown. It can be isolated from HeLa cell nucleoli and
is found to be homologous with Leydig cell tumour
protein whose function is unknown.
Length = 82
Score = 26.2 bits (58), Expect = 9.8
Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 3/48 (6%)
Query: 41 KNKKKKKKKKKKKK---KKKKKKKKKEEQSVKEIEESLKERKEKTLTE 85
KKK+K KK K KK + + ++ K + L + EK +
Sbjct: 18 AGKKKQKGPKKAGPRVIKPKKAQAVQAQKLKKVLTSGLNKSTEKLIAS 65
>gnl|CDD|219555 pfam07753, DUF1609, Protein of unknown function (DUF1609). This
region is found in a number of hypothetical proteins
thought to be expressed by the eukaryote
Encephalitozoon cuniculi, an obligate intracellular
microsporidial parasite. It is approximately 200
residues long.
Length = 230
Score = 27.4 bits (61), Expect = 9.8
Identities = 19/71 (26%), Positives = 36/71 (50%)
Query: 6 GGAGEKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
GGA KKK +K++K + + EK K++ K ++K K + ++ KE+
Sbjct: 18 GGARAKKKGGKKKSKGGRHCYKIHKRVLRWRKSPEKIKEELDKGSEEKWKGRSIEEIKEQ 77
Query: 66 QSVKEIEESLK 76
+ + +I E L+
Sbjct: 78 KVLHDITEVLE 88
>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor. The ribosome recycling
factor (RRF / ribosome release factor) dissociates the
ribosome from the mRNA after termination of translation,
and is essential bacterial growth. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis.
Length = 165
Score = 27.1 bits (61), Expect = 10.0
Identities = 16/48 (33%), Positives = 29/48 (60%), Gaps = 7/48 (14%)
Query: 42 NKKKKKKKKKKKKKKKKKKKKKEE------QSVKEIEESLKERKEKTL 83
N K KK +K K+ + + K+ ++E + +K+I+E LK+ KEK +
Sbjct: 118 NDKLKKLEKDKEISEDEVKRAEKEIQKLTDKYIKKIDELLKK-KEKEI 164
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 26.6 bits (59), Expect = 10.0
Identities = 9/27 (33%), Positives = 13/27 (48%)
Query: 41 KNKKKKKKKKKKKKKKKKKKKKKEEQS 67
K K+KK + + KK KK+ S
Sbjct: 79 KKKRKKSRHVSSRSAKKISAKKRRRSS 105
>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
Length = 479
Score = 27.8 bits (62), Expect = 10.0
Identities = 14/42 (33%), Positives = 20/42 (47%)
Query: 46 KKKKKKKKKKKKKKKKKKEEQSVKEIEESLKERKEKTLTEDS 87
KKKKKKKKK K+ + +L RK++ E +
Sbjct: 9 KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNEEGT 50
>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS). This family
consists of several mammalian selenoprotein S (SelS)
sequences. SelS is a plasma membrane protein and is
present in a variety of tissues and cell types. The
function of this family is unknown.
Length = 190
Score = 27.2 bits (60), Expect = 10.0
Identities = 13/56 (23%), Positives = 31/56 (55%)
Query: 10 EKKKRIEKRNKEEEKEENEQEEKEKEDVEEEKNKKKKKKKKKKKKKKKKKKKKKEE 65
+K I + ++ E E + ++E+ + K K K+K+K+ +++K+++K E
Sbjct: 65 SSEKEITVFDPDKVVRRQEALEAARLRMQEDLDAKAAKFKEKQKQLEEEKRRQKIE 120
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.132 0.400
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,104,481
Number of extensions: 1786278
Number of successful extensions: 29912
Number of sequences better than 10.0: 1
Number of HSP's gapped: 17711
Number of HSP's successfully gapped: 2924
Length of query: 362
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 264
Effective length of database: 6,590,910
Effective search space: 1740000240
Effective search space used: 1740000240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (26.7 bits)