RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7145
(191 letters)
>gnl|CDD|215741 pfam00135, COesterase, Carboxylesterase family.
Length = 510
Score = 99.3 bits (248), Expect = 1e-24
Identities = 37/110 (33%), Positives = 46/110 (41%), Gaps = 20/110 (18%)
Query: 23 VARTKYGPLRGILIQ--NPPVEAYLGVPYATPPLGSLRYMPPVTPSTWRAPRFADTYSAV 80
V T YG +RG+ + PV A+LG+PYA PP+G LR+ P P W A Y
Sbjct: 2 VVETSYGKVRGLRVTVDGGPVYAFLGIPYAKPPVGELRFKKPQPPEPWTGVLDATKYGPA 61
Query: 81 CPQRLPDIGNRTEALLQLPRGRLVFLEKLLPLLSNQSEDCLYLNLYVPRP 130
CPQ SEDCLYLN+Y P+
Sbjct: 62 CPQNNDLGSEMWNKNT------------------GMSEDCLYLNVYTPKL 93
>gnl|CDD|238191 cd00312, Esterase_lipase, Esterases and lipases (includes fungal
lipases, cholinesterases, etc.) These enzymes act on
carboxylic esters (EC: 3.1.1.-). The catalytic apparatus
involves three residues (catalytic triad): a serine, a
glutamate or aspartate and a histidine.These catalytic
residues are responsible for the nucleophilic attack on
the carbonyl carbon atom of the ester bond. In contrast
with other alpha/beta hydrolase fold family members,
p-nitrobenzyl esterase and acetylcholine esterase have a
Glu instead of Asp at the active site carboxylate.
Length = 493
Score = 88.5 bits (220), Expect = 8e-21
Identities = 34/109 (31%), Positives = 46/109 (42%), Gaps = 20/109 (18%)
Query: 23 VARTKYGPLRGILIQNPPVEAYLGVPYATPPLGSLRYMPPVTPSTWRAPRFADTYSAVCP 82
+ T G +RG+ V ++LG+PYA PP+G LR+ P W A +Y C
Sbjct: 1 LVVTPNGKVRGVD--EGGVYSFLGIPYAEPPVGDLRFKEPQPYEPWSDVLDATSYPPSCM 58
Query: 83 QRLPDIGNRTEALLQLPRGRLVFLEKLLPLLSNQSEDCLYLNLYVPRPV 131
Q G L +L SEDCLYLN+Y P+
Sbjct: 59 QWDQLGG-------GLWNAKLPG-----------SEDCLYLNVYTPKNT 89
>gnl|CDD|225181 COG2272, PnbA, Carboxylesterase type B [Lipid metabolism].
Length = 491
Score = 86.7 bits (215), Expect = 3e-20
Identities = 36/109 (33%), Positives = 48/109 (44%), Gaps = 22/109 (20%)
Query: 20 STRVARTKYGPLRGILIQNPPVEAYLGVPYATPPLGSLRYMPPVTPSTWRAPRFADTYSA 79
VA T G + GI + V ++LG+PYA PP+G LR+ PV P W R A +
Sbjct: 1 RAPVAETTTGKVEGITVNG--VHSWLGIPYAAPPVGELRFRRPVPPEPWSGVRDATQFGP 58
Query: 80 VCPQRLPDIGNRTEALLQLPRGRLVFLEKLLPLLSNQSEDCLYLNLYVP 128
CPQ ++ G SEDCLYLN++ P
Sbjct: 59 ACPQ----------PFNRMGSGEDF----------TGSEDCLYLNIWAP 87
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 54.6 bits (132), Expect = 4e-09
Identities = 19/56 (33%), Positives = 29/56 (51%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
RP +K+ K K K+ +KK+K+K+K K KK+ + TK PSG +
Sbjct: 377 RPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTS 432
Score = 48.4 bits (116), Expect = 5e-07
Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 1/55 (1%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKK-KKKKKKKKKTKKFLPPSDPS 181
P+ A +K K K K+ +KK+K+K+K K KK+ + K K PS S
Sbjct: 378 PKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTS 432
Score = 45.3 bits (108), Expect = 6e-06
Identities = 13/41 (31%), Positives = 21/41 (51%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
L + K + K KK K+ +KK+K+K+K K K +
Sbjct: 376 LRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416
Score = 44.9 bits (107), Expect = 7e-06
Identities = 18/52 (34%), Positives = 27/52 (51%)
Query: 122 YLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
Y+ + V E K K ++ K K KK K+ +KK+K+K+K K KK
Sbjct: 363 YIEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKK 414
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 45.0 bits (107), Expect = 7e-06
Identities = 18/35 (51%), Positives = 26/35 (74%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
K+K KKK+++KK +K+KKKKK+KKK K +K
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKK 410
Score = 43.8 bits (104), Expect = 2e-05
Identities = 21/38 (55%), Positives = 28/38 (73%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E KEK K K+++KK +K+KKKKK+KKK KK+ KK
Sbjct: 374 EIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKK 411
Score = 43.0 bits (102), Expect = 3e-05
Identities = 19/31 (61%), Positives = 25/31 (80%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
KKK + K +K+KKKKK+KKK KK+KKK +K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 41.9 bits (99), Expect = 7e-05
Identities = 17/31 (54%), Positives = 26/31 (83%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
KK++++ K + +KKKKK+KKK KK+KKK +K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 41.9 bits (99), Expect = 8e-05
Identities = 21/38 (55%), Positives = 28/38 (73%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
EK K K + +KK +K+KKKKK+KKK KK+KKK +K
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 40.3 bits (95), Expect = 3e-04
Identities = 19/38 (50%), Positives = 28/38 (73%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++ E+ K K KKK+++KK +K+KKKKK+KKK KK
Sbjct: 370 KRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKK 407
Score = 39.6 bits (93), Expect = 5e-04
Identities = 18/36 (50%), Positives = 27/36 (75%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+E K+ K KKK+++KK +K+KKKKK+KKK K+
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKR 408
Score = 34.6 bits (80), Expect = 0.022
Identities = 15/28 (53%), Positives = 20/28 (71%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKK 161
EKK +K+ K K +KKK KK+KKK +K
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 34.2 bits (79), Expect = 0.035
Identities = 17/35 (48%), Positives = 25/35 (71%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKK 162
P+P + E+K+ +K K K K+KKK KK+KKK +K
Sbjct: 380 PKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 31.9 bits (73), Expect = 0.19
Identities = 16/49 (32%), Positives = 28/49 (57%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
G++ +++ NK + K+K K KKK+++KK +K+K KK K
Sbjct: 361 GDELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRK 409
Score = 29.2 bits (66), Expect = 1.3
Identities = 12/37 (32%), Positives = 18/37 (48%)
Query: 153 KKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
K+K K KKK+++KK + +K GK K
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 43.6 bits (103), Expect = 2e-05
Identities = 21/37 (56%), Positives = 29/37 (78%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
KK+KK++KN KKK +KKKKKKK+KK+ K + +TK
Sbjct: 55 KKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91
Score = 41.2 bits (97), Expect = 1e-04
Identities = 21/47 (44%), Positives = 27/47 (57%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
E +KK +KK K K +KK+ K + + K K KK KKTKK P P
Sbjct: 65 ESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111
Score = 36.2 bits (84), Expect = 0.006
Identities = 17/36 (47%), Positives = 26/36 (72%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+K K ++KN + KKK +KKKKKKK+KK+ K + +
Sbjct: 53 STKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGE 88
Score = 35.8 bits (83), Expect = 0.009
Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 4/54 (7%)
Query: 124 NLYVPRPVALEGEKKEKKKNKNKNKKKKKKKK----KKKKKKKKKKKKKKKTKK 173
L + + + ++E K KK KK+ K KKK +KKKKKKK+KK K
Sbjct: 31 VLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPK 84
Score = 35.1 bits (81), Expect = 0.015
Identities = 17/37 (45%), Positives = 22/37 (59%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+KKEKK+ K++ + K K KK KK KKK K K
Sbjct: 76 KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPN 112
Score = 34.7 bits (80), Expect = 0.022
Identities = 14/38 (36%), Positives = 18/38 (47%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+KK++KK + K K KK KK KKK K
Sbjct: 75 KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPN 112
Score = 33.9 bits (78), Expect = 0.040
Identities = 13/41 (31%), Positives = 21/41 (51%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
+ ++K++ K++ + K K KK KK KKK K K
Sbjct: 76 KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVD 116
Score = 32.8 bits (75), Expect = 0.093
Identities = 15/38 (39%), Positives = 16/38 (42%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
+ EKKE K K KK KK KKK K K
Sbjct: 75 KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPN 112
Score = 32.0 bits (73), Expect = 0.15
Identities = 14/54 (25%), Positives = 22/54 (40%), Gaps = 12/54 (22%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK------------KKKKKKTKKFL 175
+ E K + + K K KK KK KKK K K + +K+ ++
Sbjct: 80 KKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVYI 133
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 43.1 bits (102), Expect = 2e-05
Identities = 20/62 (32%), Positives = 30/62 (48%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
P + + E+ EK+ + + KKKK+ K K K+ KKKKKK P + + KS
Sbjct: 66 PESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSE 125
Query: 190 EQ 191
Sbjct: 126 RI 127
Score = 34.3 bits (79), Expect = 0.025
Identities = 19/58 (32%), Positives = 26/58 (44%), Gaps = 6/58 (10%)
Query: 130 PVALEGEKKEKKKNKNKNK--KKKKKKKKKKK----KKKKKKKKKKKTKKFLPPSDPS 181
L+ E++ KKK + K K K+ KKKKKK K K + K K P+
Sbjct: 75 EKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPT 132
Score = 27.7 bits (62), Expect = 3.1
Identities = 12/34 (35%), Positives = 23/34 (67%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
+ KE + K ++ ++KK + K +K+K+KKK+K
Sbjct: 148 QNKEATHERLKEREIRRKKIQAKARKRKEKKKEK 181
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 41.9 bits (99), Expect = 6e-05
Identities = 19/72 (26%), Positives = 26/72 (36%)
Query: 102 RLVFLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKK 161
L L ++ E + +KK K+K + KKKK K+ K K
Sbjct: 211 LLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVK 270
Query: 162 KKKKKKKKKTKK 173
KK K KK
Sbjct: 271 ALKKVVAKGMKK 282
Score = 37.3 bits (87), Expect = 0.002
Identities = 22/75 (29%), Positives = 38/75 (50%), Gaps = 7/75 (9%)
Query: 109 LLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNK-------NKNKKKKKKKKKKKKKK 161
L P L + L + + P L+ KK+K++ + ++ +KK+K K++ KKKK
Sbjct: 203 LPPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKK 262
Query: 162 KKKKKKKKKTKKFLP 176
K+ K K KK +
Sbjct: 263 PKESKGVKALKKVVA 277
Score = 26.9 bits (60), Expect = 6.4
Identities = 15/53 (28%), Positives = 30/53 (56%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
E K K ++KKKK++ ++ + + + +KK+K+K+ + P GVK+
Sbjct: 219 EFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKA 271
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 41.6 bits (98), Expect = 7e-05
Identities = 18/56 (32%), Positives = 31/56 (55%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
+++EK K K K KK K K ++K+K K++K++K + L P + K R +
Sbjct: 45 KEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100
Score = 38.9 bits (91), Expect = 7e-04
Identities = 15/40 (37%), Positives = 25/40 (62%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E +EK++ K K K K KK K K ++K+K K++ ++
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEE 78
Score = 31.2 bits (71), Expect = 0.23
Identities = 11/33 (33%), Positives = 12/33 (36%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
EK K + K KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDD 224
Score = 31.2 bits (71), Expect = 0.27
Identities = 11/33 (33%), Positives = 12/33 (36%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
K K +K K KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDD 224
Score = 30.8 bits (70), Expect = 0.35
Identities = 10/37 (27%), Positives = 12/37 (32%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
K K + K KKKK K K K +
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDGY 230
Score = 27.7 bits (62), Expect = 3.2
Identities = 9/40 (22%), Positives = 19/40 (47%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
A K + KK ++K+K K++K++K ++
Sbjct: 45 KEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84
Score = 26.9 bits (60), Expect = 7.1
Identities = 13/34 (38%), Positives = 16/34 (47%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
KK + N +K K +K K KKKK K K
Sbjct: 181 KKVSSTLNVLINEKLKAEKAAKGGKKKKGKAKAK 214
Score = 26.5 bits (59), Expect = 7.9
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
E E + K + ++ + ++K+++K K K K KK K K
Sbjct: 27 EDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAK 65
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 41.2 bits (97), Expect = 8e-05
Identities = 24/40 (60%), Positives = 28/40 (70%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E EKKEKKK K K+KK+KK KK+K + K KKKK KK
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKK 190
Score = 39.3 bits (92), Expect = 4e-04
Identities = 20/47 (42%), Positives = 28/47 (59%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E++EKK+ K K + KK+KK+KK KK+K + K K KK
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187
Score = 38.5 bits (90), Expect = 8e-04
Identities = 20/64 (31%), Positives = 35/64 (54%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
E E K + + + ++++KK+KKKKK+ KK+KK+KK K+ + S K
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188
Query: 187 KSRE 190
K ++
Sbjct: 189 KKKK 192
Score = 32.0 bits (73), Expect = 0.13
Identities = 16/44 (36%), Positives = 27/44 (61%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
P L E + +K +K+ + ++++KK+KKKKK+ KK KK
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKK 168
Score = 29.7 bits (67), Expect = 0.77
Identities = 17/46 (36%), Positives = 29/46 (63%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
P + E E EK+ K+ + ++++KK+KKKKK+ KK+K +K
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170
Score = 27.4 bits (61), Expect = 3.9
Identities = 13/44 (29%), Positives = 23/44 (52%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
P E + + K K +K+ + ++++KK+KKKKK K
Sbjct: 121 PDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVK 164
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 42.1 bits (100), Expect = 9e-05
Identities = 18/78 (23%), Positives = 28/78 (35%), Gaps = 16/78 (20%)
Query: 97 QLPRG---RLVFLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKK 153
LP+G R +E+ L LL+ +P L K+ K K
Sbjct: 795 TLPKGRETRAPTVEEALELLAE-------------KPEKLRYLADAPAKDPAGKKAAVKF 841
Query: 154 KKKKKKKKKKKKKKKKKT 171
+K K++ +K K T
Sbjct: 842 SRKTKQQYVASEKDGKAT 859
Score = 41.7 bits (99), Expect = 1e-04
Identities = 11/43 (25%), Positives = 17/43 (39%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ L EK EK + K KK K +K K++ +
Sbjct: 811 LELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASE 853
Score = 34.4 bits (80), Expect = 0.028
Identities = 13/42 (30%), Positives = 17/42 (40%), Gaps = 3/42 (7%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKK---KTKKFLPPSDPSGKA 184
K +K + K KK K KTK+ S+ GKA
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 40.0 bits (94), Expect = 1e-04
Identities = 20/49 (40%), Positives = 30/49 (61%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
K E+K K + K++KKK+KKKKKKK KK KK++ + + S +
Sbjct: 77 RKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEE 125
Score = 37.7 bits (88), Expect = 0.001
Identities = 19/37 (51%), Positives = 27/37 (72%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
E+K++K + KK+ K++KKK+KKKKKKK KK K
Sbjct: 73 EEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKK 109
Score = 37.3 bits (87), Expect = 0.001
Identities = 21/50 (42%), Positives = 32/50 (64%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
+K+E+KK K++ K KK+ K++KKK+KKKKKKK K + S +
Sbjct: 70 QKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSE 119
Score = 35.4 bits (82), Expect = 0.005
Identities = 18/37 (48%), Positives = 31/37 (83%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++++++ K K+++K KK+ K++KKK+KKKKKKK KK
Sbjct: 69 QQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105
Score = 34.6 bits (80), Expect = 0.011
Identities = 17/38 (44%), Positives = 27/38 (71%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++K ++K + +K KK+ K++KKK+KKKKKKK K
Sbjct: 69 QQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKG 106
Score = 33.1 bits (76), Expect = 0.034
Identities = 17/40 (42%), Positives = 29/40 (72%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E +E ++ + + K+K ++K KK+ K++KKK+KKK KK
Sbjct: 62 ETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKK 101
Score = 31.2 bits (71), Expect = 0.15
Identities = 18/56 (32%), Positives = 24/56 (42%), Gaps = 3/56 (5%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
K++KKK K KKKKKK KK KK++K+ K + K
Sbjct: 88 AKRQKKKQK---KKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIM 140
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 39.4 bits (93), Expect = 2e-04
Identities = 15/34 (44%), Positives = 21/34 (61%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
KKE K + KK +K K K +KKK KK++ K +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 38.3 bits (90), Expect = 7e-04
Identities = 16/42 (38%), Positives = 25/42 (59%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+LE +KK+ K K +KK +K K K +KKK KK++ +
Sbjct: 57 SLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 36.0 bits (84), Expect = 0.003
Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 3/43 (6%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
AL +K+ K K +KK +K K K +KKK KK++ K + F
Sbjct: 61 ALLDKKELKAWEK---AEKKAEKAKAKAEKKKAKKEEPKPRLF 100
Score = 35.6 bits (83), Expect = 0.006
Identities = 12/33 (36%), Positives = 17/33 (51%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
KK+ K +K +KK +K K K +K K P
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 26.3 bits (59), Expect = 6.9
Identities = 9/33 (27%), Positives = 14/33 (42%)
Query: 152 KKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
KK+ K +K +KK +K K + K
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 39.7 bits (93), Expect = 2e-04
Identities = 21/40 (52%), Positives = 25/40 (62%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E+K+K K K K KKKK K K KK KK K +KK +K
Sbjct: 77 EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116
Score = 38.9 bits (91), Expect = 5e-04
Identities = 19/39 (48%), Positives = 24/39 (61%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
EK++ K K K+KKKK K K KK KK K +KK K+
Sbjct: 79 EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKE 117
Score = 37.4 bits (87), Expect = 0.002
Identities = 21/52 (40%), Positives = 26/52 (50%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
++ E +EK+K K K KK KKKK K K KK KK K + K D
Sbjct: 74 VKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDL 125
Score = 36.6 bits (85), Expect = 0.003
Identities = 28/73 (38%), Positives = 32/73 (43%), Gaps = 10/73 (13%)
Query: 125 LYVPRPVALEGEKKEKKKNKNKNKK------KKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
+Y + +KKE + K KK K K KKKK KKKK K K KK KK
Sbjct: 52 IYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKK----D 107
Query: 179 DPSGKAGVKSREQ 191
D S K K E
Sbjct: 108 DKSEKKDEKEAED 120
Score = 35.8 bits (83), Expect = 0.005
Identities = 16/52 (30%), Positives = 24/52 (46%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
KK+K K K K KK KK K +KK +K+ + + L S + +
Sbjct: 86 WKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLS 137
Score = 35.4 bits (82), Expect = 0.007
Identities = 15/39 (38%), Positives = 24/39 (61%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+ KK+K K+K+K KK K +KK +K+ + K + TK
Sbjct: 89 KKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTK 127
Score = 35.4 bits (82), Expect = 0.007
Identities = 15/40 (37%), Positives = 26/40 (65%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ +K +KKK+K+K+KK KK K +KK +K+ + K +
Sbjct: 87 KKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLT 126
Score = 32.4 bits (74), Expect = 0.077
Identities = 13/40 (32%), Positives = 23/40 (57%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+KK+K K+K +KK K +KK +K+ + K + K+
Sbjct: 91 SKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130
Score = 31.2 bits (71), Expect = 0.18
Identities = 11/39 (28%), Positives = 22/39 (56%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+ + K+KK +K +K +KK +K+ + K + K +T
Sbjct: 95 KDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETL 133
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 39.3 bits (92), Expect = 2e-04
Identities = 24/41 (58%), Positives = 29/41 (70%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+E +K K+ K K K+KK+KKKKKKKKKKK KK K KK
Sbjct: 95 VEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 33.1 bits (76), Expect = 0.034
Identities = 18/34 (52%), Positives = 22/34 (64%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKK 162
+ V E K++K+K K K KKKKK KK KKKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 31.6 bits (72), Expect = 0.10
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKK 163
L+ + K+KK+ K K KKKKKK KK KKKK
Sbjct: 104 VLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 39.5 bits (93), Expect = 5e-04
Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 2/59 (3%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
+ ++K +KK + + +K KKK + KK K +TKK P DP G+ ++ +
Sbjct: 414 LRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP--DPLGEKLARTED 470
Score = 39.1 bits (92), Expect = 7e-04
Identities = 13/37 (35%), Positives = 22/37 (59%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E+KK + K +K +KK +K++ +K KKK + K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAK 444
Score = 34.1 bits (79), Expect = 0.031
Identities = 17/54 (31%), Positives = 35/54 (64%), Gaps = 2/54 (3%)
Query: 121 LYLNLYVPRPVALEGEKKEKKKNK-NKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+YL L+ +P+ EGE++E + + ++KK +KK++K +KK +K++ +K
Sbjct: 383 IYLKLH-DKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAA 435
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 39.4 bits (92), Expect = 6e-04
Identities = 20/53 (37%), Positives = 25/53 (47%), Gaps = 2/53 (3%)
Query: 136 EKKEKKKNKNKNKKKKKKK--KKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
EKK K K K+ + K K K KKK K K KK KK+ + P + G
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPALAAMIGA 800
Score = 34.8 bits (80), Expect = 0.023
Identities = 19/51 (37%), Positives = 21/51 (41%), Gaps = 4/51 (7%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK----KTKKFLPPSDPS 181
G K + K K KKK K K K KKK K K K+ P PS
Sbjct: 741 GTPKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPS 791
Score = 26.7 bits (59), Expect = 9.0
Identities = 15/47 (31%), Positives = 16/47 (34%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
K K +KK K KKK K K K S K K R
Sbjct: 737 TKYAGTPKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 39.5 bits (93), Expect = 6e-04
Identities = 19/58 (32%), Positives = 30/58 (51%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
L G KK KK K +K +KK++++KK+KKKK KK ++ + + E
Sbjct: 402 LTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459
Score = 39.1 bits (92), Expect = 7e-04
Identities = 17/60 (28%), Positives = 35/60 (58%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
+ EK EKK+ + K +KKKK KKK+++++++K+KK ++ + + + + E
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Score = 39.1 bits (92), Expect = 8e-04
Identities = 15/61 (24%), Positives = 37/61 (60%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
V +K+E++K + K K KKK+++++++K+KK+++K ++ + + K ++
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Query: 191 Q 191
Q
Sbjct: 476 Q 476
Score = 38.7 bits (91), Expect = 0.001
Identities = 14/57 (24%), Positives = 31/57 (54%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
+ KK +K + K +++KK+KKKK KKK++++++ K+ + + +
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
Score = 38.0 bits (89), Expect = 0.002
Identities = 17/55 (30%), Positives = 32/55 (58%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
EK + K + +KK+KKKK KKK+++++++K KK + +A + E+
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469
Score = 38.0 bits (89), Expect = 0.002
Identities = 16/56 (28%), Positives = 31/56 (55%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
KK KK + KK++++KK+KKKK KKK+++ ++ + + + E+
Sbjct: 409 TKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Score = 36.1 bits (84), Expect = 0.009
Identities = 14/56 (25%), Positives = 33/56 (58%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
+K +K K + K++++KK+KKKK KKK++++ ++ + + ++ E+
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Score = 35.7 bits (83), Expect = 0.009
Identities = 14/60 (23%), Positives = 35/60 (58%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
+E +K++++ K + KKK KKK+++++++K+KK++ K+ K + ++
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 35.3 bits (82), Expect = 0.015
Identities = 12/53 (22%), Positives = 30/53 (56%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
++ ++ +K + + KK+KKKK KKK+++++++K+ K+ +
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Score = 34.5 bits (80), Expect = 0.025
Identities = 15/46 (32%), Positives = 25/46 (54%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
E E + +K KK KK +K +KK++++KK+KKK +
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKE 441
Score = 32.2 bits (74), Expect = 0.13
Identities = 14/53 (26%), Positives = 28/53 (52%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
+E+ + +KK KK KK +K +KK++++KK KK + + + +
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEK 448
Score = 31.4 bits (72), Expect = 0.28
Identities = 10/38 (26%), Positives = 32/38 (84%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
+ E++E++K K + +K++++++ +++K+++++KKKK+
Sbjct: 440 KEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477
Score = 30.3 bits (69), Expect = 0.70
Identities = 10/39 (25%), Positives = 29/39 (74%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
E E++EK+K + + ++++++ +++K+++++KKKK
Sbjct: 437 GKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 38.0 bits (89), Expect = 0.001
Identities = 14/45 (31%), Positives = 26/45 (57%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
E++ KK+ K+K ++++K K K +KK+K + +K P P
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160
Score = 38.0 bits (89), Expect = 0.001
Identities = 10/41 (24%), Positives = 23/41 (56%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
+ E++ K + +K+K ++++K K K +KK+ + P
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154
Score = 37.2 bits (87), Expect = 0.002
Identities = 15/46 (32%), Positives = 26/46 (56%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
+K+E K K ++++K K K +KK+K + +K + P SD S
Sbjct: 120 KKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDIS 165
Score = 36.4 bits (85), Expect = 0.004
Identities = 11/49 (22%), Positives = 24/49 (48%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
E K K + + ++ KK++ +K+K ++++K K K P +
Sbjct: 105 EAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQK 153
Score = 34.9 bits (81), Expect = 0.011
Identities = 10/45 (22%), Positives = 24/45 (53%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
++ +++ K + +K+K ++++K K K +KK K +P
Sbjct: 112 AQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156
Score = 34.5 bits (80), Expect = 0.020
Identities = 10/44 (22%), Positives = 23/44 (52%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
+++ K + +K+K ++++K K K +KK+K + P
Sbjct: 115 AEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQH 158
Score = 33.4 bits (77), Expect = 0.043
Identities = 10/45 (22%), Positives = 23/45 (51%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
E + K+++ K K ++++K K K +KK+K + + +
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160
Score = 31.8 bits (73), Expect = 0.13
Identities = 9/39 (23%), Positives = 20/39 (51%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
E +++K + + K K K +KK+K + +K + +
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPV 161
Score = 31.0 bits (71), Expect = 0.29
Identities = 13/56 (23%), Positives = 27/56 (48%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
A + E +K+ + ++K K K +KK+K + +K + + S+ + VK
Sbjct: 119 AKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVK 174
Score = 30.7 bits (70), Expect = 0.38
Identities = 9/50 (18%), Positives = 23/50 (46%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
K K + ++ +++ KK++ +K+K +++ K K + E
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156
Score = 30.3 bits (69), Expect = 0.40
Identities = 12/45 (26%), Positives = 22/45 (48%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
R A E EK +++ K K K +KK+K + +K + + +
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 38.3 bits (90), Expect = 0.001
Identities = 13/37 (35%), Positives = 22/37 (59%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
L+ +KK+ K +K +KKK+K++ K K K K +
Sbjct: 55 ELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 36.7 bits (86), Expect = 0.004
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K + K K +KKK+K++ K K K+K
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKP 90
Score = 36.0 bits (84), Expect = 0.007
Identities = 13/37 (35%), Positives = 20/37 (54%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
K +K + K K +KKK+K++ K K K K + F
Sbjct: 57 KAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRLF 93
Score = 34.8 bits (81), Expect = 0.017
Identities = 13/35 (37%), Positives = 17/35 (48%), Gaps = 6/35 (17%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
KKE K K +KKK+K++ K K K K
Sbjct: 63 KKELKA------WHKAQKKKEKQEAKAAKAKSKPR 91
Score = 34.8 bits (81), Expect = 0.020
Identities = 13/36 (36%), Positives = 19/36 (52%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
KE K+ KK+ K K +KKK+K++ K K
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAK 85
Score = 34.0 bits (79), Expect = 0.027
Identities = 15/38 (39%), Positives = 22/38 (57%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E KE+ K +KK+ K K +KKK+K++ K K K
Sbjct: 51 EMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKS 88
Score = 33.7 bits (78), Expect = 0.042
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 149 KKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
KK+ K K +KKK+K++ K K P
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKP 90
Score = 29.4 bits (67), Expect = 1.2
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++ K+ K + K KK+ K K +KKK+K+ K
Sbjct: 47 EQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAK 82
Score = 28.3 bits (64), Expect = 2.2
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 153 KKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
KK+ K K +KKK+K++ K S P
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKP 90
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 38.5 bits (90), Expect = 0.001
Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 10/74 (13%)
Query: 110 LPLLSNQSEDC----LYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
LP L + ED L + E E+KE K ++K+ +K KK+++KKKK
Sbjct: 528 LPNLGIRIEDKPDGPSVWKLDDKEELQREKEEKEALK------EQKRLRKLKKQEEKKKK 581
Query: 166 KKKKKTKKFLPPSD 179
+ +K K +PP++
Sbjct: 582 ELEKLEKAKIPPAE 595
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 38.1 bits (89), Expect = 0.002
Identities = 17/72 (23%), Positives = 22/72 (30%)
Query: 114 SNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ ++L+ V KKK K K K KKK K + KK KK
Sbjct: 50 EQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKK 109
Query: 174 FLPPSDPSGKAG 185
D
Sbjct: 110 NALDKDDDLNYV 121
Score = 34.6 bits (80), Expect = 0.022
Identities = 15/61 (24%), Positives = 20/61 (32%), Gaps = 1/61 (1%)
Query: 131 VALEGEKKEKKKNKNKNKKK-KKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
V + K + KKK K K K KKK K L S + K +
Sbjct: 55 VLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDK 114
Query: 190 E 190
+
Sbjct: 115 D 115
Score = 33.8 bits (78), Expect = 0.038
Identities = 11/52 (21%), Positives = 15/52 (28%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
+ + K KKK K + KK +KK K D
Sbjct: 72 SDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKD 123
Score = 32.7 bits (75), Expect = 0.099
Identities = 14/64 (21%), Positives = 19/64 (29%)
Query: 124 NLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
+ + K+ + + KKK K K K KK D S K
Sbjct: 46 KKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKK 105
Query: 184 AGVK 187
A K
Sbjct: 106 AEKK 109
Score = 31.5 bits (72), Expect = 0.21
Identities = 12/42 (28%), Positives = 18/42 (42%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
K + + + KKK KK K K K K++ K+ L
Sbjct: 4 ASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESK 45
Score = 31.5 bits (72), Expect = 0.23
Identities = 14/46 (30%), Positives = 21/46 (45%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A E KK+ KK K+K K K++ K+ + KKK +
Sbjct: 8 AELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQID 53
Score = 30.3 bits (69), Expect = 0.52
Identities = 9/45 (20%), Positives = 13/45 (28%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
K K K K++ KK +KK K +
Sbjct: 88 AKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADD 132
Score = 30.3 bits (69), Expect = 0.52
Identities = 16/53 (30%), Positives = 24/53 (45%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
E++ KKK K K K K K++ K+ + KKKT + + VK
Sbjct: 13 EEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKD 65
Score = 30.0 bits (68), Expect = 0.70
Identities = 10/37 (27%), Positives = 18/37 (48%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
+ K + +++ KKK KK K K+K F+ +
Sbjct: 1 MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEE 37
Score = 30.0 bits (68), Expect = 0.88
Identities = 10/52 (19%), Positives = 15/52 (28%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
K K K K + KK +KK K K + + +
Sbjct: 86 AAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDD 137
Score = 29.6 bits (67), Expect = 0.92
Identities = 11/35 (31%), Positives = 16/35 (45%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K + +++ KKK KK K K K TK+
Sbjct: 2 TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKE 36
Score = 29.6 bits (67), Expect = 0.95
Identities = 13/55 (23%), Positives = 22/55 (40%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
+KK KK K K++ K+ + KKK ++ + L K + E
Sbjct: 17 KKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATE 71
Score = 28.0 bits (63), Expect = 3.3
Identities = 13/55 (23%), Positives = 22/55 (40%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
E +++ K K KK K K K K++ K+ ++KK P +
Sbjct: 9 ELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMV 63
Score = 28.0 bits (63), Expect = 3.7
Identities = 9/44 (20%), Positives = 12/44 (27%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
+KK + KK +KK K K D
Sbjct: 90 APAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDD 133
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 37.3 bits (87), Expect = 0.002
Identities = 19/40 (47%), Positives = 22/40 (55%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E KK N +++KKKKKKK KKKK KK K
Sbjct: 165 EAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204
Score = 36.1 bits (84), Expect = 0.005
Identities = 18/38 (47%), Positives = 23/38 (60%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++KKK+ N ++KKKKKKK KKKK KK K
Sbjct: 168 AARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 35.4 bits (82), Expect = 0.008
Identities = 18/39 (46%), Positives = 20/39 (51%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
A K+K N + KKKKKKK KKKK KK K
Sbjct: 166 AEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204
Score = 31.9 bits (73), Expect = 0.14
Identities = 17/35 (48%), Positives = 23/35 (65%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K+ ++KKK +++KKKKKKK KKKK KK
Sbjct: 163 IKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKK 197
Score = 31.9 bits (73), Expect = 0.15
Identities = 17/42 (40%), Positives = 26/42 (61%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
L+ ++ E ++K K+ +++KKKKKKK KKKK KK
Sbjct: 160 ELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAV 201
Score = 28.8 bits (65), Expect = 1.5
Identities = 12/37 (32%), Positives = 15/37 (40%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
+ E++K K K K KKKK KK K
Sbjct: 169 ARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 27.7 bits (62), Expect = 3.1
Identities = 17/47 (36%), Positives = 23/47 (48%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
KK++ K + KKK +++KKKKKKK KK L G
Sbjct: 157 KKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGM 203
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 37.9 bits (88), Expect = 0.002
Identities = 15/48 (31%), Positives = 17/48 (35%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
L K K K + KKK K KKKKK K + P
Sbjct: 110 LAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKP 157
Score = 36.3 bits (84), Expect = 0.006
Identities = 14/45 (31%), Positives = 18/45 (40%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
++ K K KKK K KKKKK K + K+ S
Sbjct: 115 MRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKS 159
Score = 34.4 bits (79), Expect = 0.028
Identities = 17/56 (30%), Positives = 23/56 (41%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
+ K+ KK K K KKK KKK K +K+F P S K + +
Sbjct: 68 DLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTE 123
Score = 33.7 bits (77), Expect = 0.045
Identities = 16/37 (43%), Positives = 19/37 (51%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
L + +K K NK K KK K K+KKK KKK
Sbjct: 50 KKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKK 86
Score = 33.7 bits (77), Expect = 0.048
Identities = 18/34 (52%), Positives = 20/34 (58%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
+K KKNK NK K KK K K+KKK KKK
Sbjct: 53 HEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKK 86
Score = 31.0 bits (70), Expect = 0.39
Identities = 18/38 (47%), Positives = 20/38 (52%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
KK +K KNKK K K KK K K+KKK KK
Sbjct: 50 KKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKL 87
Score = 27.9 bits (62), Expect = 3.8
Identities = 14/42 (33%), Positives = 17/42 (40%), Gaps = 2/42 (4%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKK--KKTKKFLPPSDPS 181
+ +K K K +K KKK K KK KK L D
Sbjct: 108 GPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDEL 149
Score = 27.1 bits (60), Expect = 6.8
Identities = 15/36 (41%), Positives = 18/36 (50%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
E KK+ NK+ NKKKKK K + K K K
Sbjct: 123 EKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPK 158
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 36.1 bits (84), Expect = 0.003
Identities = 13/34 (38%), Positives = 27/34 (79%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
K++KK ++K KK+++K++K++ K++KKK K +
Sbjct: 97 RNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 33.4 bits (77), Expect = 0.023
Identities = 16/41 (39%), Positives = 30/41 (73%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
AL+ E + K+ K K K+KK+++K++K++ K++KKK K +
Sbjct: 90 ALKLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 32.6 bits (75), Expect = 0.040
Identities = 15/37 (40%), Positives = 29/37 (78%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K E ++NK + KK+ K+KK+++K++K++ K++KK K
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAK 128
Score = 31.8 bits (73), Expect = 0.072
Identities = 12/35 (34%), Positives = 26/35 (74%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E+ K + K + K+KK+++K++K++ K++KKK +
Sbjct: 96 ERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 27.6 bits (62), Expect = 2.4
Identities = 14/45 (31%), Positives = 25/45 (55%), Gaps = 6/45 (13%)
Query: 137 KKEKKKNKNKN------KKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
KE KK K + ++ K++KKK+ K+KK+++K +K
Sbjct: 75 AKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEKERKRQ 119
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 36.3 bits (84), Expect = 0.004
Identities = 20/42 (47%), Positives = 26/42 (61%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
+ + K KK KKKK + K++KKKKK+KKK KK P P
Sbjct: 133 DSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHP 174
Score = 36.0 bits (83), Expect = 0.005
Identities = 18/35 (51%), Positives = 23/35 (65%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
K KKKK + K++KKKKK+KKKK K+ P G
Sbjct: 143 KHKKKKHEDDKERKKKKKEKKKKKKRHSPEHPGVG 177
Score = 35.6 bits (82), Expect = 0.006
Identities = 18/29 (62%), Positives = 23/29 (79%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
EKK KKK +K++KKKKK+KKKKKK+
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKKRH 169
Score = 35.6 bits (82), Expect = 0.006
Identities = 18/50 (36%), Positives = 30/50 (60%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
+P + E + + K+K KK + K++KKKKK+KKKKKK+ + +
Sbjct: 126 LPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHPG 175
Score = 34.4 bits (79), Expect = 0.019
Identities = 18/44 (40%), Positives = 24/44 (54%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
+ K K KKK + K++KKKKK+KKKKKK+ P
Sbjct: 135 EGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHPGVGF 178
Score = 32.9 bits (75), Expect = 0.058
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
P ++K K K K++KKKKK+KKKKKK+
Sbjct: 129 ETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 37.0 bits (86), Expect = 0.004
Identities = 22/72 (30%), Positives = 32/72 (44%), Gaps = 14/72 (19%)
Query: 134 EGEKKEKKKNKNK--------------NKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
+G KK+ KK+K N++ + K+ KKKKKKKKKKKKK + +
Sbjct: 20 KGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAY 79
Query: 180 PSGKAGVKSREQ 191
+ Q
Sbjct: 80 DLPVVWSSAAFQ 91
Score = 35.8 bits (83), Expect = 0.008
Identities = 17/43 (39%), Positives = 21/43 (48%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
L E ++ +NK KKKKKKKKKKKKK +
Sbjct: 39 FLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDL 81
Score = 34.7 bits (80), Expect = 0.022
Identities = 26/83 (31%), Positives = 31/83 (37%), Gaps = 27/83 (32%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKK---------------------------KKKKKK 164
E + E K+ K +NK+K KKK KK KKKKK
Sbjct: 2 MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKK 61
Query: 165 KKKKKKTKKFLPPSDPSGKAGVK 187
KKKKKK K D + V
Sbjct: 62 KKKKKKKKNLGEAYDLAYDLPVV 84
Score = 33.1 bits (76), Expect = 0.073
Identities = 18/53 (33%), Positives = 26/53 (49%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
+ + + +NK+ KKKKKKKKKKKKK + + P S A +
Sbjct: 41 AELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDN 93
Score = 31.2 bits (71), Expect = 0.31
Identities = 12/32 (37%), Positives = 16/32 (50%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
E E + + K + K+K KKK KK KK
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKD 32
Score = 30.8 bits (70), Expect = 0.45
Identities = 13/31 (41%), Positives = 22/31 (70%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
N+N + + K++K++ K+K KKK KK+KK
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKK 31
Score = 30.5 bits (69), Expect = 0.58
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
+N+ K++K++ K+K KKK KK KK
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKK 31
Score = 29.7 bits (67), Expect = 1.1
Identities = 15/58 (25%), Positives = 21/58 (36%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
E E K+ K K K KKKKKKK + + + K G ++
Sbjct: 49 EAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQE 106
Score = 28.9 bits (65), Expect = 1.8
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
N+N + K++K++ K+K KKK KK K
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKD 32
Score = 28.9 bits (65), Expect = 1.8
Identities = 19/68 (27%), Positives = 25/68 (36%), Gaps = 2/68 (2%)
Query: 114 SNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
S + D L ++ E + K+ NK K KKKKKKKKK +
Sbjct: 29 SKKDVDDDDAFL--AELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWS 86
Query: 174 FLPPSDPS 181
D S
Sbjct: 87 SAAFQDNS 94
Score = 28.1 bits (63), Expect = 2.7
Identities = 10/32 (31%), Positives = 17/32 (53%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ + K++K++ K+K KKK KK K
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKD 32
Score = 27.4 bits (61), Expect = 6.2
Identities = 10/32 (31%), Positives = 16/32 (50%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
+ ++K++ K+K KKK KK KK
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKD 32
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 36.9 bits (86), Expect = 0.005
Identities = 18/48 (37%), Positives = 20/48 (41%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
+ K K K K K KKK+KKKKK K KK D K
Sbjct: 1159 KSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEK 1206
Score = 36.6 bits (85), Expect = 0.006
Identities = 22/58 (37%), Positives = 32/58 (55%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
ALE +++ ++K K ++ K K K K K +K K KKK+ KK +D S KA V
Sbjct: 1140 ALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGN 1197
Score = 36.2 bits (84), Expect = 0.008
Identities = 20/58 (34%), Positives = 23/58 (39%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
L+ + K K K K KKK+KKKKK K KK SD K K
Sbjct: 1158 LKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDN 1215
Score = 35.8 bits (83), Expect = 0.011
Identities = 18/55 (32%), Positives = 22/55 (40%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
K K K K +K K KKK+KKKKK K KK K ++
Sbjct: 1158 LKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDK 1212
Score = 35.4 bits (82), Expect = 0.016
Identities = 15/58 (25%), Positives = 25/58 (43%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
+ ++ + K +K +K K KKK+KKKKK K K + S + +
Sbjct: 1153 AKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLD 1210
Score = 35.0 bits (81), Expect = 0.019
Identities = 21/55 (38%), Positives = 27/55 (49%), Gaps = 1/55 (1%)
Query: 136 EKKEKKKNK-NKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
E +EK+ K + K K K K K +K K KKK+KKK K S + G R
Sbjct: 1146 EVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKR 1200
Score = 33.5 bits (77), Expect = 0.059
Identities = 18/45 (40%), Positives = 23/45 (51%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E++ K K K K K +K K KKK+KKKKK K K
Sbjct: 1148 EEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKA 1192
Score = 33.5 bits (77), Expect = 0.067
Identities = 22/58 (37%), Positives = 30/58 (51%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
E E ++++ K+K K K K +K K KKK+KKKKK K S V S E+
Sbjct: 1149 EKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEK 1206
Score = 32.7 bits (75), Expect = 0.10
Identities = 12/45 (26%), Positives = 22/45 (48%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
AL+ +KK +KK K K K + K+ + + ++ +K K
Sbjct: 1318 LAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSD 1362
Score = 30.8 bits (70), Expect = 0.54
Identities = 14/61 (22%), Positives = 21/61 (34%), Gaps = 4/61 (6%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKK----KKKKKKKTKKFLPPSDPSGKAGVKSR 189
KK K + KKKKK +K KKK K + K+ + +
Sbjct: 1300 SKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKK 1359
Query: 190 E 190
+
Sbjct: 1360 K 1360
Score = 29.6 bits (67), Expect = 1.2
Identities = 13/47 (27%), Positives = 22/47 (46%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
L KK+KK K +KKK K + K+ + + ++ +K S
Sbjct: 1318 LAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSS 1364
Score = 28.5 bits (64), Expect = 2.9
Identities = 10/47 (21%), Positives = 20/47 (42%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
K+KKK++ K +KKK K + K+ + + + +
Sbjct: 1317 SLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDS 1363
Score = 27.7 bits (62), Expect = 4.7
Identities = 13/49 (26%), Positives = 18/49 (36%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
E + KKK KK+ + KKKKK++K S
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTR 1340
Score = 27.3 bits (61), Expect = 5.8
Identities = 15/46 (32%), Positives = 23/46 (50%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
P + KK + + KKKKK +KK +KKK K + K+ +
Sbjct: 1302 PSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347
Score = 27.3 bits (61), Expect = 7.2
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E+++ + + K+++ K K K K K +K K KK
Sbjct: 1139 EALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKK 1176
Score = 26.9 bits (60), Expect = 9.1
Identities = 14/40 (35%), Positives = 19/40 (47%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
+GE K + KKK KK+ + KKKKK +K
Sbjct: 1291 RPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKK 1330
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 36.0 bits (83), Expect = 0.005
Identities = 15/33 (45%), Positives = 19/33 (57%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+K +KKKK K KK K K KK + KKK +
Sbjct: 82 RKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 34.4 bits (79), Expect = 0.017
Identities = 16/32 (50%), Positives = 20/32 (62%), Gaps = 1/32 (3%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKK 159
P A+E +KK K K KNK K KK + KKK +
Sbjct: 84 PATTAVEKKKKGKSK-KNKLKGKKDEDKKKAR 114
Score = 31.0 bits (70), Expect = 0.29
Identities = 13/33 (39%), Positives = 17/33 (51%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++ +KKKK K KK K K KK + K K
Sbjct: 81 RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKA 113
Score = 30.6 bits (69), Expect = 0.35
Identities = 13/24 (54%), Positives = 15/24 (62%)
Query: 150 KKKKKKKKKKKKKKKKKKKKKTKK 173
KKKK K KK K K KK + KK +
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAR 114
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 34.9 bits (80), Expect = 0.006
Identities = 15/23 (65%), Positives = 15/23 (65%)
Query: 145 NKNKKKKKKKKKKKKKKKKKKKK 167
K KKKKKK KK KK KK KK
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKKD 120
Score = 34.9 bits (80), Expect = 0.007
Identities = 14/27 (51%), Positives = 15/27 (55%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKK 165
+ K K KKKK KK KK KK KK
Sbjct: 94 PEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 33.7 bits (77), Expect = 0.017
Identities = 15/23 (65%), Positives = 15/23 (65%)
Query: 147 NKKKKKKKKKKKKKKKKKKKKKK 169
K KKKKKK KK KK KK KK
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKKD 120
Score = 32.9 bits (75), Expect = 0.035
Identities = 15/22 (68%), Positives = 15/22 (68%)
Query: 149 KKKKKKKKKKKKKKKKKKKKKK 170
K KKKKKK KK KK KK KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 31.8 bits (72), Expect = 0.071
Identities = 15/21 (71%), Positives = 16/21 (76%)
Query: 153 KKKKKKKKKKKKKKKKKKTKK 173
K KKKKKK KK KK KK +KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKK 119
Score = 31.0 bits (70), Expect = 0.13
Identities = 14/22 (63%), Positives = 14/22 (63%)
Query: 152 KKKKKKKKKKKKKKKKKKKTKK 173
K KKKKKK KK KK KK K
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 30.2 bits (68), Expect = 0.25
Identities = 15/32 (46%), Positives = 15/32 (46%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKK 161
P E EK K K K KK KK KK KK
Sbjct: 89 PDDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region covers
both the Brf homology II and III regions. This region is
involved in binding TATA binding protein.
Length = 95
Score = 34.2 bits (79), Expect = 0.009
Identities = 14/40 (35%), Positives = 21/40 (52%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
++ + K K+K + KKKKK+K KKK + P S
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTA 69
Score = 31.5 bits (72), Expect = 0.078
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
E+KE K+ ++ KKKKK+K KKK+ +
Sbjct: 33 EEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64
Score = 31.1 bits (71), Expect = 0.089
Identities = 11/38 (28%), Positives = 19/38 (50%)
Query: 144 KNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
+ + K+ K+K + KKKKK+K K K+ +
Sbjct: 31 EQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPAST 68
Score = 29.5 bits (67), Expect = 0.31
Identities = 14/36 (38%), Positives = 22/36 (61%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
E++E+K+ K K + KKKKK+K KKK+ +
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGP 65
Score = 28.0 bits (63), Expect = 1.2
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
LE +++++ K K KKKKK+K KKK+ +
Sbjct: 29 LEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64
Score = 26.4 bits (59), Expect = 4.1
Identities = 10/31 (32%), Positives = 20/31 (64%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
NK+ +++++K+ K+K + KKKK +K
Sbjct: 25 NKDYLEEQEEKELKQKADEGNNSGKKKKKRK 55
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 36.3 bits (83), Expect = 0.009
Identities = 13/42 (30%), Positives = 23/42 (54%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A E K E + ++ + ++K + +KKK++ KKK KK
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385
Score = 35.9 bits (82), Expect = 0.011
Identities = 7/38 (18%), Positives = 18/38 (47%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+K E+ K + K + + + + ++K + + KK
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373
Score = 34.7 bits (79), Expect = 0.028
Identities = 13/55 (23%), Positives = 26/55 (47%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
KK+ ++ K K KKK ++ KK + K + + ++ +A K +E+
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375
Score = 34.3 bits (78), Expect = 0.038
Identities = 10/40 (25%), Positives = 21/40 (52%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E KK + K + + + + ++K + +KKK++ KK
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKK 1378
Score = 34.0 bits (77), Expect = 0.049
Identities = 9/42 (21%), Positives = 19/42 (45%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A E +K + + + ++K + +KKK++ KK
Sbjct: 1338 AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Score = 33.6 bits (76), Expect = 0.054
Identities = 8/42 (19%), Positives = 18/42 (42%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A +KK ++ K K + + + + ++K + KK
Sbjct: 1331 ADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKK 1372
Score = 33.6 bits (76), Expect = 0.065
Identities = 8/40 (20%), Positives = 18/40 (45%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E KK+ K K ++ KK + K + + + + ++
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEE 1364
Score = 33.2 bits (75), Expect = 0.071
Identities = 8/42 (19%), Positives = 20/42 (47%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A + + +E KK K + + + + ++K + +K K+
Sbjct: 1333 AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKE 1374
Score = 33.2 bits (75), Expect = 0.087
Identities = 7/38 (18%), Positives = 16/38 (42%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+K + K K + KK + K + + + + + K
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKA 1366
Score = 32.8 bits (74), Expect = 0.098
Identities = 7/39 (17%), Positives = 19/39 (48%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+ E+ +K K + + + + ++K + +KKK +
Sbjct: 1337 KAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375
Score = 32.4 bits (73), Expect = 0.16
Identities = 13/40 (32%), Positives = 20/40 (50%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E + + K + +KKK++ KKK KKK +K
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEK 1390
Score = 32.0 bits (72), Expect = 0.18
Identities = 14/40 (35%), Positives = 22/40 (55%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E + + K + +KKK++ KKK KKK ++ KK
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392
Score = 32.0 bits (72), Expect = 0.18
Identities = 9/38 (23%), Positives = 18/38 (47%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ + + ++K + +KKK++ KKK KK
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386
Score = 32.0 bits (72), Expect = 0.20
Identities = 11/43 (25%), Positives = 21/43 (48%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A + + + +K + +KKK++ KKK KKK ++
Sbjct: 1347 AAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389
Score = 31.3 bits (70), Expect = 0.35
Identities = 15/42 (35%), Positives = 25/42 (59%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A E +KK ++ K + KKK ++ KK + KKK ++ KK +
Sbjct: 1446 ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADE 1487
Score = 31.3 bits (70), Expect = 0.37
Identities = 18/53 (33%), Positives = 31/53 (58%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
A E +KK ++K K KKK ++ KK + KKK ++ KK ++ ++ + KA
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKA 1472
Score = 30.9 bits (69), Expect = 0.41
Identities = 18/53 (33%), Positives = 28/53 (52%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
A E +K K K KKK ++KKK + KKK ++ KK + ++ + KA
Sbjct: 1407 ADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKA 1459
Score = 30.9 bits (69), Expect = 0.47
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E E+K + KK++ KKK KKK ++KKK K
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAK 1397
Score = 30.9 bits (69), Expect = 0.53
Identities = 8/39 (20%), Positives = 18/39 (46%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+ E+ +KK + K K ++ KK + K + + +
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361
Score = 30.5 bits (68), Expect = 0.54
Identities = 20/53 (37%), Positives = 30/53 (56%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
A E +K+E KK + KKK ++KKK + KKK ++ KKK + + KA
Sbjct: 1368 AAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA 1420
Score = 30.5 bits (68), Expect = 0.58
Identities = 19/53 (35%), Positives = 30/53 (56%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
A E +K E+ K K + KK + KKK ++ KK + KKK ++ +D + KA
Sbjct: 1453 AEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505
Score = 30.5 bits (68), Expect = 0.59
Identities = 19/53 (35%), Positives = 31/53 (58%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
A E +K E+KK ++ KKK ++ KK + KKK ++ KKK ++ + KA
Sbjct: 1292 ADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344
Score = 30.1 bits (67), Expect = 0.82
Identities = 17/53 (32%), Positives = 30/53 (56%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
A E +KK ++ K KKK ++ KK ++ KKK ++ KK + ++ + KA
Sbjct: 1433 ADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKA 1485
Score = 29.7 bits (66), Expect = 0.98
Identities = 13/41 (31%), Positives = 23/41 (56%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E +++ + + K ++ KKK KKK ++KKK + KK
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
Score = 29.7 bits (66), Expect = 1.1
Identities = 17/53 (32%), Positives = 27/53 (50%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
A E +KK ++ K KKK ++ KKK + KK + KK ++ + KA
Sbjct: 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKA 1524
Score = 29.7 bits (66), Expect = 1.1
Identities = 16/44 (36%), Positives = 24/44 (54%), Gaps = 4/44 (9%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKK----KKKKKKKKKKKKTKK 173
E +KK ++ K KKK ++ KKK KKK ++ KK + K
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349
Score = 29.7 bits (66), Expect = 1.2
Identities = 16/49 (32%), Positives = 28/49 (57%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
+K E+ K ++ KKK ++ KK + KKK ++ KKK + ++ KA
Sbjct: 1464 KKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA 1512
Score = 29.7 bits (66), Expect = 1.2
Identities = 19/53 (35%), Positives = 29/53 (54%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
A E + + +K K + KKK KKK ++KKK + KKK ++ +D KA
Sbjct: 1361 AAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKA 1413
Score = 29.3 bits (65), Expect = 1.6
Identities = 17/49 (34%), Positives = 27/49 (55%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
+K E+KK ++ KKK ++ KKK + KK KKK + ++ KA
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKA 1433
Score = 29.3 bits (65), Expect = 1.6
Identities = 17/49 (34%), Positives = 26/49 (53%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
+K E+ K K KK KKK + KKK ++KKK + ++ + KA
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKA 1446
Score = 29.0 bits (64), Expect = 1.8
Identities = 18/53 (33%), Positives = 29/53 (54%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
A E +K ++ K K + KK + KKK ++ KKK + KK + +D + KA
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKA 1518
Score = 29.0 bits (64), Expect = 2.0
Identities = 15/40 (37%), Positives = 24/40 (60%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E+ +KK ++ K KKK + KKK ++KKK + KK
Sbjct: 1399 KAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438
Score = 28.6 bits (63), Expect = 2.9
Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 4/46 (8%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKK----KKKKKKKKKKKKTKK 173
A +KK ++K K KKK ++ KKK KK KKK + KK
Sbjct: 1380 ADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKK 1425
Score = 28.2 bits (62), Expect = 3.4
Identities = 15/63 (23%), Positives = 34/63 (53%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
+ E KKE +++K K ++ KK +++KKK KK+++KK ++ + + +
Sbjct: 1728 NKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDE 1787
Query: 189 REQ 191
++
Sbjct: 1788 EDE 1790
Score = 28.2 bits (62), Expect = 3.4
Identities = 12/41 (29%), Positives = 27/41 (65%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ ++++KK KK+ ++ KK ++ KKK+ ++KKK ++
Sbjct: 1680 AKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEE 1720
Score = 28.2 bits (62), Expect = 3.9
Identities = 17/59 (28%), Positives = 30/59 (50%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
A E +K ++ K K + KKK + KK KKK + KK + +D + K ++++
Sbjct: 1387 AEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKK 1445
Score = 27.8 bits (61), Expect = 4.1
Identities = 16/42 (38%), Positives = 26/42 (61%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A E +K E+ K ++ KK ++ KK + KK ++KKK + KK
Sbjct: 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553
Score = 27.8 bits (61), Expect = 4.5
Identities = 12/43 (27%), Positives = 26/43 (60%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E K+ +++K K ++ KK ++ +KK + KK+ ++ KK
Sbjct: 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703
Score = 27.8 bits (61), Expect = 5.3
Identities = 17/60 (28%), Positives = 29/60 (48%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
A E +K ++ K K + KK + KKK ++ KKK KK + + + KA ++
Sbjct: 1298 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357
Score = 27.4 bits (60), Expect = 5.3
Identities = 12/40 (30%), Positives = 23/40 (57%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E K+K + K ++ +KK + KK+ ++ KK ++ KK
Sbjct: 1670 AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709
Score = 27.0 bits (59), Expect = 7.2
Identities = 15/40 (37%), Positives = 26/40 (65%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E KK +++NK K ++ KK ++ KKK ++ KK ++ KK
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689
Score = 27.0 bits (59), Expect = 7.4
Identities = 13/40 (32%), Positives = 26/40 (65%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E ++K+ + K ++ KK ++ KKK+ ++KKK ++ KK
Sbjct: 1684 EEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723
Score = 27.0 bits (59), Expect = 8.0
Identities = 15/36 (41%), Positives = 22/36 (61%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+EKKK K ++KKK + KKK ++ KK + KK
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKK 1322
Score = 27.0 bits (59), Expect = 8.2
Identities = 14/42 (33%), Positives = 24/42 (57%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A E +KK ++ K ++ KK + KKK + KK ++ KK +
Sbjct: 1485 ADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526
Score = 27.0 bits (59), Expect = 8.7
Identities = 17/42 (40%), Positives = 26/42 (61%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A E +K E+KK ++ KK ++KKK + KKK ++ KK K
Sbjct: 1280 ADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAK 1321
Score = 27.0 bits (59), Expect = 9.1
Identities = 17/50 (34%), Positives = 28/50 (56%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
E +KK + K + KKK + KKK ++ KK + KKK ++ +D + K
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKK 1336
Score = 27.0 bits (59), Expect = 9.1
Identities = 13/38 (34%), Positives = 22/38 (57%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E KK ++ KK ++ KK + KK ++ KK + KK
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541
Score = 27.0 bits (59), Expect = 9.2
Identities = 14/41 (34%), Positives = 25/41 (60%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
A E +K E+ + K KK+ ++ KK ++ KKK+ ++K K
Sbjct: 1677 AEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 35.5 bits (82), Expect = 0.011
Identities = 13/43 (30%), Positives = 17/43 (39%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A EG + KK + +K K K KK KK +K
Sbjct: 172 KAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214
Score = 28.2 bits (63), Expect = 2.9
Identities = 13/47 (27%), Positives = 26/47 (55%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
E K+ KK + ++K K+ K+K++KK ++ + + P S P+
Sbjct: 116 ESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNS 162
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 35.6 bits (82), Expect = 0.011
Identities = 21/57 (36%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
KE++K K + K++KKKKK+K K++ K +K K++ K+ PP K K E
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP-KEKEKEKEKKVE 155
Score = 35.2 bits (81), Expect = 0.016
Identities = 15/57 (26%), Positives = 32/57 (56%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
E + + K+ + + ++ K++KKKKK+K K++ K +K ++ P K K ++
Sbjct: 97 EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153
Score = 34.9 bits (80), Expect = 0.018
Identities = 19/63 (30%), Positives = 32/63 (50%), Gaps = 3/63 (4%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
P A KE K N++ K+++K+K++ K++KKKKK+K K + K
Sbjct: 86 KGPAAKTKPAKEPK---NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142
Query: 188 SRE 190
+E
Sbjct: 143 PKE 145
Score = 34.5 bits (79), Expect = 0.023
Identities = 18/46 (39%), Positives = 30/46 (65%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
+ EKK+KK+ + K +K K++ K+K+ K+K+K+K KK P D
Sbjct: 114 KEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159
Score = 33.7 bits (77), Expect = 0.054
Identities = 13/56 (23%), Positives = 30/56 (53%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
EK+ K+ + + +KK ++ + ++++KK+++ + K K P P K E+
Sbjct: 139 EKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEE 194
Score = 33.3 bits (76), Expect = 0.061
Identities = 15/58 (25%), Positives = 29/58 (50%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
+G + K K + K+++K+K++ K++KKKKK+ K P + + R
Sbjct: 86 KGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143
Score = 32.9 bits (75), Expect = 0.086
Identities = 17/64 (26%), Positives = 33/64 (51%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
RP + ++KEKK + ++++++KK+++ + K + KK KKK K
Sbjct: 140 KRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQA 199
Query: 188 SREQ 191
+RE
Sbjct: 200 AREA 203
Score = 32.9 bits (75), Expect = 0.088
Identities = 14/57 (24%), Positives = 29/57 (50%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
G K K+ K + K+++K+K++ K++KK KK P +P + + ++
Sbjct: 83 GGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKE 139
Score = 31.8 bits (72), Expect = 0.20
Identities = 14/46 (30%), Positives = 28/46 (60%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
E K+E K K K + K+K+ K+K+K+K+KK ++ + ++ +
Sbjct: 122 EKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRE 167
Score = 31.0 bits (70), Expect = 0.32
Identities = 14/45 (31%), Positives = 26/45 (57%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ V G K K K + K + K+++K+K++ K++KKK K+
Sbjct: 78 KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKE 122
Score = 29.1 bits (65), Expect = 1.6
Identities = 14/54 (25%), Positives = 26/54 (48%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
E + K K K+ K + K+++K+K++ K++K KK P + K
Sbjct: 81 EKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPK 134
Score = 28.7 bits (64), Expect = 1.9
Identities = 13/45 (28%), Positives = 26/45 (57%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
K+E+ + K++ KK KKK KKK+ +++K+++ + P
Sbjct: 165 KRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPE 209
Score = 28.7 bits (64), Expect = 2.1
Identities = 11/55 (20%), Positives = 25/55 (45%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
PR E +++ + K KKK KKK+ +++K+++ ++ +
Sbjct: 157 PRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEP 211
Score = 28.3 bits (63), Expect = 2.7
Identities = 14/56 (25%), Positives = 27/56 (48%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
E +E++K + + + K + KK KKK KKK+ + +K + + K +
Sbjct: 156 EPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEP 211
Score = 27.5 bits (61), Expect = 5.1
Identities = 15/47 (31%), Positives = 23/47 (48%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
V P E EKK ++ KK KKK KKK+ +++K++
Sbjct: 154 VEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAA 200
Score = 27.2 bits (60), Expect = 6.9
Identities = 11/56 (19%), Positives = 23/56 (41%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
K KK K KKK+ +++K+++ ++ K K P + + +
Sbjct: 171 AKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKD 226
Score = 26.8 bits (59), Expect = 8.2
Identities = 13/54 (24%), Positives = 24/54 (44%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
R + E+ K K KKK KKK+ +++K+++ + P +P
Sbjct: 159 DREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPD 212
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 35.5 bits (82), Expect = 0.012
Identities = 19/68 (27%), Positives = 27/68 (39%), Gaps = 13/68 (19%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKK----------KKKKKKKKKKKKKKKKTKKFLPPSDPS 181
A E KK K +++ KK +KK K K KK +K KK PS +
Sbjct: 738 AREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKK---PSAKT 794
Query: 182 GKAGVKSR 189
K ++
Sbjct: 795 QKIAAATK 802
Score = 35.1 bits (81), Expect = 0.015
Identities = 17/42 (40%), Positives = 21/42 (50%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A GEKK K K K+ +K KK K +K K K+ KK
Sbjct: 768 AFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKK 809
Score = 33.2 bits (76), Expect = 0.084
Identities = 13/40 (32%), Positives = 17/40 (42%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E + K K K+ K KK K +K K K+ K K
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811
Score = 32.4 bits (74), Expect = 0.13
Identities = 10/33 (30%), Positives = 14/33 (42%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
+K +K K K +K K K+ KKK
Sbjct: 780 AKKDARKAKKPSAKTQKIAAATKAKRAAKKKVA 812
Score = 32.0 bits (73), Expect = 0.21
Identities = 14/44 (31%), Positives = 20/44 (45%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+ V E + + + K K K KK +K KK K +K TK
Sbjct: 759 KKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATK 802
Score = 30.5 bits (69), Expect = 0.62
Identities = 11/35 (31%), Positives = 17/35 (48%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
+ KK+ +K K + K +K K K+ KKK
Sbjct: 778 KAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVA 812
Score = 30.1 bits (68), Expect = 0.87
Identities = 11/36 (30%), Positives = 17/36 (47%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
P A + + ++ KK K +K K K+ KKK
Sbjct: 777 PKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVA 812
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 35.0 bits (81), Expect = 0.015
Identities = 16/41 (39%), Positives = 26/41 (63%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E K+K K+K ++ +K + K+KKK+KKK++K KK
Sbjct: 303 FEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK 343
Score = 33.1 bits (76), Expect = 0.069
Identities = 16/40 (40%), Positives = 29/40 (72%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ ++K K K + N+K + K+KKK+KKK++KKKK+ ++
Sbjct: 308 DLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIER 347
Score = 29.6 bits (67), Expect = 0.90
Identities = 14/37 (37%), Positives = 25/37 (67%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
L+ E KEKKK K K +KKKK+ ++ +++ +K + +
Sbjct: 324 LDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQAT 360
Score = 29.6 bits (67), Expect = 1.0
Identities = 13/37 (35%), Positives = 27/37 (72%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K + +++ K + K+KKK+KKK++KKKK+ ++ ++
Sbjct: 314 KSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEE 350
Score = 29.6 bits (67), Expect = 1.1
Identities = 15/64 (23%), Positives = 28/64 (43%), Gaps = 4/64 (6%)
Query: 132 ALEGEKKEKKKN----KNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
AL+ + K KK + + K+K K K ++ +K + K+ KK + K +
Sbjct: 288 ALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIER 347
Query: 188 SREQ 191
E+
Sbjct: 348 LEER 351
Score = 28.1 bits (63), Expect = 2.9
Identities = 13/37 (35%), Positives = 29/37 (78%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+++ +K + K+KKK+KKK++KKKK+ ++ +++ +K
Sbjct: 318 ERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEK 354
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 35.1 bits (81), Expect = 0.018
Identities = 17/44 (38%), Positives = 26/44 (59%)
Query: 147 NKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
++ K KK KKKKKKKKK++K + P ++ G + SR+
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRK 313
Score = 34.3 bits (79), Expect = 0.034
Identities = 15/41 (36%), Positives = 22/41 (53%)
Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
++ K KK KKKKKKKKK++K + + S +G
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSG 310
Score = 34.0 bits (78), Expect = 0.038
Identities = 18/59 (30%), Positives = 28/59 (47%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
V+L+ + E + + + K KK KKKKKKKKK++K + + G S
Sbjct: 252 VSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSG 310
Score = 32.4 bits (74), Expect = 0.14
Identities = 12/42 (28%), Positives = 19/42 (45%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ + + + + K KK KKKKKKKKK++
Sbjct: 251 NVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDL 292
Score = 31.7 bits (72), Expect = 0.25
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
K KK KKKKKKKKK++K + + + + S SG
Sbjct: 274 KFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRK 313
Score = 30.1 bits (68), Expect = 0.75
Identities = 9/42 (21%), Positives = 25/42 (59%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A E +K++ + + K+K+++ ++K K ++K+++ K
Sbjct: 36 AAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGG 77
Score = 27.4 bits (61), Expect = 5.2
Identities = 12/45 (26%), Positives = 26/45 (57%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
A E KK +++ + K K+++ ++K K ++K+++ K K L
Sbjct: 37 AAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTL 81
Score = 27.0 bits (60), Expect = 6.8
Identities = 10/27 (37%), Positives = 19/27 (70%)
Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKKKT 171
K+KK++KKK+ ++KK +K+K+
Sbjct: 96 KKSKKRQKKKEAERKKALLLDEKEKER 122
Score = 27.0 bits (60), Expect = 7.5
Identities = 8/41 (19%), Positives = 23/41 (56%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
+ E K + + + K+K+++ ++K K ++K+++ K
Sbjct: 35 DAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKL 75
Score = 26.6 bits (59), Expect = 8.5
Identities = 11/40 (27%), Positives = 17/40 (42%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
KK+K N + +KK+++ KKK K L
Sbjct: 215 GKKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSL 254
Score = 26.6 bits (59), Expect = 9.7
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTK 172
K KK++KKK+ ++KK +K+K +
Sbjct: 96 KKSKKRQKKKEAERKKALLLDEKEKER 122
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 35.0 bits (81), Expect = 0.018
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ +EK + +KK+++ +K K K K KK KK
Sbjct: 105 KKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKK 143
Score = 33.8 bits (78), Expect = 0.049
Identities = 11/42 (26%), Positives = 25/42 (59%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
E E + ++ K K ++K ++ + + +KK+++ +K KFL
Sbjct: 92 EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFL 133
Score = 32.3 bits (74), Expect = 0.13
Identities = 11/42 (26%), Positives = 19/42 (45%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
LE + ++ + K +++ +K K K K KK KK
Sbjct: 107 LEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNL 148
Score = 32.3 bits (74), Expect = 0.15
Identities = 9/37 (24%), Positives = 21/37 (56%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
E K++ K + ++ + + +KK+++ +K K K K
Sbjct: 99 ELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135
Score = 31.9 bits (73), Expect = 0.21
Identities = 10/38 (26%), Positives = 23/38 (60%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E+ +K+ K + K ++ + + +KK+++ +K K K K
Sbjct: 98 EELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135
Score = 31.1 bits (71), Expect = 0.31
Identities = 11/46 (23%), Positives = 26/46 (56%), Gaps = 1/46 (2%)
Query: 129 RPVALEGEK-KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ V GE+ E + + KK+ KK ++K ++ + + +KK++ +
Sbjct: 81 KGVFTLGEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELE 126
Score = 30.7 bits (70), Expect = 0.45
Identities = 10/42 (23%), Positives = 22/42 (52%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
L+ E K+ ++ + + + +KK+++ +K K K K K
Sbjct: 100 LKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLA 141
Score = 29.6 bits (67), Expect = 1.2
Identities = 10/40 (25%), Positives = 17/40 (42%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
+ + + + K + +K K K K KK KK + L
Sbjct: 113 QLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEAL 152
Score = 28.8 bits (65), Expect = 1.9
Identities = 9/39 (23%), Positives = 22/39 (56%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
++ KK+ K +K ++ + + +KK+++ +K K K
Sbjct: 98 EELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKA 136
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 34.6 bits (80), Expect = 0.018
Identities = 19/52 (36%), Positives = 26/52 (50%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
E E + + + + KKKKKKKKKK KK + T P++ S AG
Sbjct: 168 EVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAG 219
Score = 34.2 bits (79), Expect = 0.026
Identities = 16/54 (29%), Positives = 26/54 (48%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
E+ E K + + ++ KKKKKKKKKK KK + + + P+ +
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAG 219
Score = 32.3 bits (74), Expect = 0.098
Identities = 13/56 (23%), Positives = 16/56 (28%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
E++ KKK K K KK KK + K K R
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSI 233
Score = 31.2 bits (71), Expect = 0.24
Identities = 16/57 (28%), Positives = 25/57 (43%), Gaps = 2/57 (3%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP--PSDPSGKAGVKSREQ 191
K + + + KKKKKKKKK KK + +P S +G+ G + +
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLK 228
Score = 27.3 bits (61), Expect = 4.8
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
N + + + K + ++++ KKKKKKKKK K
Sbjct: 163 NPTEEEVELLKARLEEERAKKKKKKKKKKTK 193
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 34.0 bits (78), Expect = 0.020
Identities = 16/40 (40%), Positives = 27/40 (67%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E +KE+K+ + ++K+ K KK+KK+KK+KK +K K
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAK 132
Score = 32.0 bits (73), Expect = 0.095
Identities = 22/60 (36%), Positives = 30/60 (50%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
+ V+ E E+KE K K + K+KK+KK +K KKK K K T K + K K
Sbjct: 99 QKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEGK 158
Score = 32.0 bits (73), Expect = 0.10
Identities = 14/38 (36%), Positives = 27/38 (71%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++ +K+ K +K+ ++K+ K KK+KK+KK+KK +K
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129
Score = 31.7 bits (72), Expect = 0.12
Identities = 18/40 (45%), Positives = 28/40 (70%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E EK++K+ +K +K+ K KK+KK+KK+KK +K KK
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKK 133
Score = 31.3 bits (71), Expect = 0.15
Identities = 21/62 (33%), Positives = 32/62 (51%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
P E E+KE K + + K KK+KK+KK+KK +K K K + + KA K+
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151
Query: 190 EQ 191
+
Sbjct: 152 TK 153
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 34.7 bits (80), Expect = 0.021
Identities = 15/40 (37%), Positives = 29/40 (72%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E+KE+KK + + K+++ ++++K++KKKK KK K+
Sbjct: 40 EEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79
Score = 33.2 bits (76), Expect = 0.066
Identities = 14/38 (36%), Positives = 30/38 (78%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E++E+K+ K + ++K K+++ ++++K++KKKKTKK
Sbjct: 39 EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76
Score = 32.4 bits (74), Expect = 0.12
Identities = 15/40 (37%), Positives = 28/40 (70%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E EK+EKK+ + K K+++ ++++K++KKKK KK +
Sbjct: 41 EEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80
Score = 32.4 bits (74), Expect = 0.13
Identities = 12/40 (30%), Positives = 28/40 (70%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E+++++K + + K K+++ ++++K++KKKK K K
Sbjct: 39 EEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVK 78
Score = 32.4 bits (74), Expect = 0.14
Identities = 14/40 (35%), Positives = 30/40 (75%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E+K++++ K +K+++ ++++K++KKKK KK K+T
Sbjct: 43 EKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTT 82
Score = 30.5 bits (69), Expect = 0.47
Identities = 13/47 (27%), Positives = 29/47 (61%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
VP E ++++K++ + K+++ ++++K++KKKK KK K
Sbjct: 35 VPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETT 81
Score = 27.0 bits (60), Expect = 6.7
Identities = 13/37 (35%), Positives = 23/37 (62%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
KE++ ++ + K++KKKK KK K+ + + KTK
Sbjct: 55 TDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTK 91
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 34.8 bits (81), Expect = 0.024
Identities = 21/77 (27%), Positives = 37/77 (48%)
Query: 105 FLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
+K LP L Q E + L + + E+ EK + + + + +K K+ + KK +
Sbjct: 139 LKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLE 198
Query: 165 KKKKKKTKKFLPPSDPS 181
+KKK++K F P P
Sbjct: 199 AEKKKQSKNFDPKEGPV 215
Score = 29.4 bits (67), Expect = 1.4
Identities = 11/47 (23%), Positives = 21/47 (44%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
E E ++ K + KK + +KKK+ K K+ + K + +
Sbjct: 180 EEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEI 226
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 33.1 bits (76), Expect = 0.027
Identities = 11/42 (26%), Positives = 23/42 (54%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
V V L E+K + + + ++++ +K +K+ KK KK +
Sbjct: 84 VNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125
Score = 28.1 bits (63), Expect = 1.7
Identities = 8/43 (18%), Positives = 20/43 (46%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
+ + K + +K+ K ++++ +K +K+ K K P
Sbjct: 86 HTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQ 128
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 34.5 bits (79), Expect = 0.027
Identities = 10/41 (24%), Positives = 20/41 (48%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
A E ++ + ++ KK ++ KK KK KK +++
Sbjct: 1178 AREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESE 1218
Score = 32.5 bits (74), Expect = 0.12
Identities = 11/41 (26%), Positives = 19/41 (46%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E+ +K + + + KK ++ KK KK TKK
Sbjct: 1173 AKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKK 1213
Score = 31.4 bits (71), Expect = 0.29
Identities = 11/39 (28%), Positives = 16/39 (41%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
A E +K ++ KK ++ KK KK KK
Sbjct: 1175 AEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKK 1213
Score = 31.4 bits (71), Expect = 0.32
Identities = 11/42 (26%), Positives = 16/42 (38%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A E +EK + + KK ++ KK KK K
Sbjct: 1173 AKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKA 1214
Score = 27.1 bits (60), Expect = 8.1
Identities = 8/42 (19%), Positives = 19/42 (45%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
L+ E + ++ + K ++ + + KK ++ KK K
Sbjct: 1167 KLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPK 1208
>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
function prediction only].
Length = 386
Score = 34.4 bits (79), Expect = 0.028
Identities = 12/40 (30%), Positives = 17/40 (42%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
KK + K+K +K K KK KK + +T F
Sbjct: 347 VDAYPKKAEAARKRKGDRKGVSHKAKKGGKKNQAETAGFE 386
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 34.1 bits (78), Expect = 0.028
Identities = 13/49 (26%), Positives = 16/49 (32%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
E K+ K K + K K K KK KKK + P
Sbjct: 143 EAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDW 191
Score = 32.9 bits (75), Expect = 0.078
Identities = 18/64 (28%), Positives = 25/64 (39%), Gaps = 5/64 (7%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
+ E K+ KK ++ K K KK KKK + K P DP K
Sbjct: 137 ISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKM-----PGIDPRSKPDW 191
Query: 187 KSRE 190
KS++
Sbjct: 192 KSQD 195
>gnl|CDD|115185 pfam06513, DUF1103, Repeat of unknown function (DUF1103). This
family consists of several repeats of around 30 residues
in length which are found specifically in
mature-parasite-infected erythrocyte surface antigen
proteins from Plasmodium falciparum. This family often
found in conjunction with pfam00226.
Length = 215
Score = 33.7 bits (76), Expect = 0.031
Identities = 22/55 (40%), Positives = 33/55 (60%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
EK + KK+K +K +K+ K+K +K+ K+K KKK KK + +D G VK E
Sbjct: 13 EKNDGKKDKVIGSEKTQKEIKEKVEKRVKEKCKKKVKKGIKENDTEGNDKVKGPE 67
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 34.1 bits (79), Expect = 0.032
Identities = 19/44 (43%), Positives = 25/44 (56%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
R LE E K+ K ++K+KKK+KKK KKKK K + K
Sbjct: 377 RVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGK 420
Score = 29.8 bits (68), Expect = 0.93
Identities = 15/38 (39%), Positives = 24/38 (63%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E+ E++ + K + ++K+KKK+KKK KKKK K
Sbjct: 379 EQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKV 416
Score = 29.5 bits (67), Expect = 1.2
Identities = 15/37 (40%), Positives = 24/37 (64%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++E K+ K + ++K+KKK+KKK KKKK K +
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPR 418
Score = 29.5 bits (67), Expect = 1.3
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E++ K+ ++K+KKK+KKK KKKK K +
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRG 419
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 34.2 bits (79), Expect = 0.034
Identities = 13/63 (20%), Positives = 20/63 (31%), Gaps = 2/63 (3%)
Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
P EK++K+K + K K +K K + T +DP A
Sbjct: 756 CCNNPKCNYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDL--TAADPDAVAA 813
Query: 186 VKS 188
Sbjct: 814 KVD 816
Score = 27.7 bits (62), Expect = 4.1
Identities = 8/48 (16%), Positives = 16/48 (33%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
N + +KK+K+ K + + K K + K +
Sbjct: 763 NYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAADPDA 810
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 34.3 bits (78), Expect = 0.036
Identities = 19/52 (36%), Positives = 29/52 (55%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
EKK K K K+KK+K+K++ K KKK+ + K L S S + ++ E
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADE 247
Score = 33.1 bits (75), Expect = 0.067
Identities = 16/37 (43%), Positives = 25/37 (67%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
EK + + K+KK KKK+KK+K+K++ K KKK+
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEG 224
Score = 33.1 bits (75), Expect = 0.078
Identities = 18/48 (37%), Positives = 27/48 (56%)
Query: 144 KNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
K+K KKK+KK+K+K++ K KKK+ + K L D S + E
Sbjct: 198 KSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEA 245
Score = 32.0 bits (72), Expect = 0.19
Identities = 15/33 (45%), Positives = 24/33 (72%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
+ E+ K K+KKKK++K+K++KKKKKK +
Sbjct: 270 EAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302
Score = 31.6 bits (71), Expect = 0.23
Identities = 17/46 (36%), Positives = 24/46 (52%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
EKK KK K + K+K+K++ K KKK+ + K P S S
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAAS 241
Score = 31.6 bits (71), Expect = 0.26
Identities = 17/42 (40%), Positives = 26/42 (61%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E K +K + +KK KK KKK+KK+K+K++ K K K+
Sbjct: 180 NAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKE 221
Score = 31.2 bits (70), Expect = 0.29
Identities = 17/46 (36%), Positives = 27/46 (58%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
E ++ + + + K K KKKK++K+K++KKKKKK SD
Sbjct: 262 EPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSD 307
Score = 31.2 bits (70), Expect = 0.29
Identities = 16/35 (45%), Positives = 22/35 (62%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
EK KK KK KKK+KK+K+K++ K KK +
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEV 222
Score = 30.4 bits (68), Expect = 0.51
Identities = 9/34 (26%), Positives = 24/34 (70%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
V LE E++ +++ + ++KKK++K+K+ +++
Sbjct: 82 VKLEEERRHRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 30.4 bits (68), Expect = 0.53
Identities = 10/40 (25%), Positives = 27/40 (67%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
K + + +++++ +K K++KKK++K+K+ +++ SD
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESD 122
Score = 30.4 bits (68), Expect = 0.53
Identities = 8/31 (25%), Positives = 25/31 (80%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
+E+++++ + +K K++KKK++K+K+ +++
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 30.0 bits (67), Expect = 0.77
Identities = 15/37 (40%), Positives = 21/37 (56%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
P+ E KK K K K +K+K++KKKKKK +
Sbjct: 266 PKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302
Score = 30.0 bits (67), Expect = 0.84
Identities = 19/60 (31%), Positives = 26/60 (43%)
Query: 118 EDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
D L V + K + KK K KKKK++K+K++KKKKKK
Sbjct: 245 ADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCH 304
Score = 28.9 bits (64), Expect = 1.7
Identities = 18/71 (25%), Positives = 32/71 (45%)
Query: 103 LVFLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKK 162
L+ P + + +L + ++ K+ + KK K KKKK++K
Sbjct: 228 LLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRK 287
Query: 163 KKKKKKKKTKK 173
+K++KKKK K
Sbjct: 288 EKEEKKKKKKH 298
Score = 28.5 bits (63), Expect = 2.3
Identities = 13/35 (37%), Positives = 21/35 (60%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
V +K +K K K K +K+K++ K KKK+ + K
Sbjct: 192 VPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 27.7 bits (61), Expect = 4.3
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKK 163
P + KK KKK K + +K++ K KKK+ + K
Sbjct: 193 PAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 27.3 bits (60), Expect = 6.1
Identities = 15/49 (30%), Positives = 23/49 (46%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
P E E+ +K K K++K+K++KKKKKK + P
Sbjct: 265 EPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQP 313
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 33.5 bits (77), Expect = 0.037
Identities = 13/38 (34%), Positives = 19/38 (50%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E KK+ + + K+ K KKK KKK+K K+
Sbjct: 1 ELLLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38
Score = 26.9 bits (60), Expect = 6.4
Identities = 9/29 (31%), Positives = 16/29 (55%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
K +K +++ + K+ K KKK K K+
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKKRK 33
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 32.0 bits (73), Expect = 0.038
Identities = 16/33 (48%), Positives = 22/33 (66%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K K KKKKKKKKKK K K++ +K++ +K
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEK 45
Score = 31.6 bits (72), Expect = 0.054
Identities = 17/40 (42%), Positives = 23/40 (57%)
Query: 149 KKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
K K KK KKKKKKKKKK K ++ + + K+ +S
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAES 50
Score = 31.6 bits (72), Expect = 0.058
Identities = 19/34 (55%), Positives = 22/34 (64%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K K K KK KKKKKKKKKK K K++ T+K
Sbjct: 7 GGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEK 40
Score = 31.2 bits (71), Expect = 0.080
Identities = 17/46 (36%), Positives = 21/46 (45%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
K K KK KKKKKKKKKK K K++ + S +
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGE 56
Score = 30.9 bits (70), Expect = 0.10
Identities = 14/37 (37%), Positives = 24/37 (64%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K +K K K KKKKKK K K++ +K++++K + +
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAE 49
Score = 29.3 bits (66), Expect = 0.35
Identities = 15/59 (25%), Positives = 30/59 (50%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
L+G+K + KK K K KKK K K++ +K++++K ++ D + + +
Sbjct: 12 LKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDG 70
Score = 29.3 bits (66), Expect = 0.37
Identities = 15/32 (46%), Positives = 20/32 (62%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K K K KKKKKKKKKK K K++ + ++
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEE 42
Score = 29.3 bits (66), Expect = 0.40
Identities = 16/36 (44%), Positives = 23/36 (63%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K K K + KKKKKKKKK K K++ +K+++ K
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKS 46
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 33.7 bits (77), Expect = 0.042
Identities = 19/59 (32%), Positives = 22/59 (37%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
R +A G KK K K KK K K KK KKK K + + K K
Sbjct: 189 RVLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAK 247
Score = 32.2 bits (73), Expect = 0.14
Identities = 15/54 (27%), Positives = 20/54 (37%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
+K K K+ KK KKK K KK K K + + K +K
Sbjct: 209 KSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKK 262
Score = 31.0 bits (70), Expect = 0.36
Identities = 16/59 (27%), Positives = 20/59 (33%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
+ A +K +K K K K KK KK K K K K + K K
Sbjct: 243 KTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGK 301
Score = 30.2 bits (68), Expect = 0.51
Identities = 15/60 (25%), Positives = 19/60 (31%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
+ A K K + K KK KKK K KK K + + K K
Sbjct: 199 AKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKK 258
Score = 30.2 bits (68), Expect = 0.60
Identities = 21/63 (33%), Positives = 23/63 (36%), Gaps = 2/63 (3%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK--KKKTKKFLPPSDPSGKA 184
V A + K KK K KK KK K KK KK K K K + KA
Sbjct: 236 VSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKA 295
Query: 185 GVK 187
K
Sbjct: 296 KKK 298
Score = 29.9 bits (67), Expect = 0.74
Identities = 17/42 (40%), Positives = 19/42 (45%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A+ +K K K K KK KK KK K KK KK K
Sbjct: 235 AVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAK 276
Score = 29.9 bits (67), Expect = 0.75
Identities = 14/45 (31%), Positives = 16/45 (35%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ + K K K KK KK KK KK K KK
Sbjct: 225 KVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKK 269
Score = 29.5 bits (66), Expect = 0.94
Identities = 14/47 (29%), Positives = 16/47 (34%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ A +K K K KK K K KK KK KK
Sbjct: 212 AKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKK 258
Score = 29.1 bits (65), Expect = 1.5
Identities = 13/51 (25%), Positives = 17/51 (33%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
K+ K K KK K K K K K K K + + K+
Sbjct: 261 KKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKA 311
Score = 28.7 bits (64), Expect = 1.9
Identities = 16/53 (30%), Positives = 19/53 (35%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
K KK K K KK K K K K K K + AG K++
Sbjct: 258 KALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAK 310
Score = 28.7 bits (64), Expect = 1.9
Identities = 20/53 (37%), Positives = 21/53 (39%), Gaps = 2/53 (3%)
Query: 137 KKEKKKNKNKNKKKKK--KKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
KK KK K KK K KK KK K K K K + KAG K
Sbjct: 250 KKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKK 302
Score = 28.3 bits (63), Expect = 2.3
Identities = 17/56 (30%), Positives = 23/56 (41%), Gaps = 1/56 (1%)
Query: 137 KKEKKKNKNKNKKKKKK-KKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
K K K K KK KK K KK KK K K + + K K++++
Sbjct: 243 KTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKK 298
Score = 28.3 bits (63), Expect = 2.6
Identities = 16/48 (33%), Positives = 18/48 (37%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
KK KK K K K K K K KKK K + + KA
Sbjct: 268 KKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKA 315
Score = 27.9 bits (62), Expect = 3.5
Identities = 15/55 (27%), Positives = 19/55 (34%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
+ K K K KK KK KK KK K K + + KA +
Sbjct: 232 AKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAA 286
Score = 27.2 bits (60), Expect = 6.2
Identities = 21/61 (34%), Positives = 25/61 (40%), Gaps = 2/61 (3%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKK--KKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
P A G KK K K+ KK KKK K KK K K KK + + K K
Sbjct: 206 PAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAK 265
Query: 188 S 188
+
Sbjct: 266 A 266
Score = 26.4 bits (58), Expect = 9.7
Identities = 15/53 (28%), Positives = 18/53 (33%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
K+ K K KK K K KK KK K + KA K+
Sbjct: 223 AKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAA 275
>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
Length = 160
Score = 32.7 bits (75), Expect = 0.049
Identities = 10/43 (23%), Positives = 18/43 (41%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
+K ++ K+ K + KKK + TK+ P+G
Sbjct: 97 RKSRCREDFLKRVAANDAIKAEAKKKGELPSTKRQPAQPRPAG 139
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 32.2 bits (74), Expect = 0.054
Identities = 15/51 (29%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 127 VPRPVALEGEKKEKKKNKNKNK---KKKKKKKKKKKKKKKKKKKKKKTKKF 174
V + + E EKK+K+K K + + ++++ K++ +K+K+ +K + KKF
Sbjct: 71 VEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKF 121
Score = 31.5 bits (72), Expect = 0.11
Identities = 10/34 (29%), Positives = 26/34 (76%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
+ +++ K + +K+K+ +K ++KK K+++K+K+K
Sbjct: 97 RLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 30.7 bits (70), Expect = 0.22
Identities = 10/33 (30%), Positives = 25/33 (75%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
++E+ K + + +K+ +K ++KK K+++K+K+K
Sbjct: 98 LRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 29.2 bits (66), Expect = 0.59
Identities = 8/33 (24%), Positives = 23/33 (69%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
+ +++ + +K+K+ +K ++KK K+++K+K
Sbjct: 96 LRLRRERTKERAEKEKRTRKNREKKFKRRQKEK 128
Score = 28.8 bits (65), Expect = 1.0
Identities = 8/33 (24%), Positives = 23/33 (69%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
++ K++ + + + +K ++KK K+++K+K+K
Sbjct: 98 LRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 28.0 bits (63), Expect = 1.6
Identities = 10/38 (26%), Positives = 24/38 (63%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
KEKKK + + KK++ + ++++ K++ +K+K +
Sbjct: 79 KEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKN 116
Score = 27.2 bits (61), Expect = 3.2
Identities = 11/36 (30%), Positives = 25/36 (69%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K +K+ + +KKKK+K+ KK++ + ++++TK+
Sbjct: 70 KVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKE 105
Score = 26.5 bits (59), Expect = 6.7
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKK 163
+ E K+ +K K K ++KK K+++K+K+K
Sbjct: 98 LRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 33.1 bits (76), Expect = 0.059
Identities = 12/32 (37%), Positives = 17/32 (53%)
Query: 147 NKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
+ +K+K KK KK KK+K K+ PP
Sbjct: 221 SSPSRKRKAPKKVAKKVAAAKKRKQKRSPPPP 252
Score = 29.3 bits (66), Expect = 1.1
Identities = 10/29 (34%), Positives = 15/29 (51%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
+ K+K KK KK KK+K+K+
Sbjct: 221 SSPSRKRKAPKKVAKKVAAAKKRKQKRSP 249
Score = 29.3 bits (66), Expect = 1.2
Identities = 10/29 (34%), Positives = 19/29 (65%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
++ + ++K+K KK KK KK+K+K+
Sbjct: 219 QSSSPSRKRKAPKKVAKKVAAAKKRKQKR 247
Score = 29.3 bits (66), Expect = 1.2
Identities = 11/33 (33%), Positives = 17/33 (51%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
++ +K+K KK KK KK+ +K PP
Sbjct: 219 QSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPPP 251
Score = 28.5 bits (64), Expect = 1.9
Identities = 11/35 (31%), Positives = 16/35 (45%)
Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKK 160
VP P + +K K K K KK+K+K+
Sbjct: 214 VVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248
Score = 28.1 bits (63), Expect = 2.8
Identities = 10/28 (35%), Positives = 13/28 (46%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
K K KK KK KK+K+K+
Sbjct: 221 SSPSRKRKAPKKVAKKVAAAKKRKQKRS 248
Score = 27.0 bits (60), Expect = 6.2
Identities = 10/27 (37%), Positives = 13/27 (48%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKK 166
K K KK KK KK+K+K+
Sbjct: 222 SPSRKRKAPKKVAKKVAAAKKRKQKRS 248
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 32.6 bits (75), Expect = 0.060
Identities = 11/23 (47%), Positives = 18/23 (78%)
Query: 147 NKKKKKKKKKKKKKKKKKKKKKK 169
+KKK KK KK+K+K++K+ +K
Sbjct: 13 DKKKAKKAKKEKRKQRKQARKGA 35
Score = 31.8 bits (73), Expect = 0.13
Identities = 11/23 (47%), Positives = 17/23 (73%)
Query: 149 KKKKKKKKKKKKKKKKKKKKKKT 171
KKK KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGAD 36
Score = 31.0 bits (71), Expect = 0.21
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 151 KKKKKKKKKKKKKKKKKKKKTK 172
KKK KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35
Score = 30.3 bits (69), Expect = 0.36
Identities = 13/39 (33%), Positives = 23/39 (58%)
Query: 153 KKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
KKK KK KK+K+K++K+ K D +A +++ +
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAE 52
Score = 29.5 bits (67), Expect = 0.74
Identities = 10/23 (43%), Positives = 17/23 (73%)
Query: 145 NKNKKKKKKKKKKKKKKKKKKKK 167
+K K KK KK+K+K++K+ +K
Sbjct: 13 DKKKAKKAKKEKRKQRKQARKGA 35
Score = 29.1 bits (66), Expect = 0.90
Identities = 10/23 (43%), Positives = 17/23 (73%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKK 165
+K K KK KK+K+K++K+ +K
Sbjct: 13 DKKKAKKAKKEKRKQRKQARKGA 35
Score = 28.0 bits (63), Expect = 2.6
Identities = 12/46 (26%), Positives = 25/46 (54%), Gaps = 8/46 (17%)
Query: 136 EKKEKKKNKNKNKKKKKKKKK--------KKKKKKKKKKKKKKTKK 173
+KK KK K K K++K+ +K K+ ++ K +K ++ ++
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRE 59
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 32.2 bits (74), Expect = 0.062
Identities = 13/40 (32%), Positives = 25/40 (62%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E+++ +K +K K+K K +KK+++KK + +K K
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAK 40
Score = 31.5 bits (72), Expect = 0.11
Identities = 10/43 (23%), Positives = 23/43 (53%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
A+ EK + +K + + K + +K K ++K + + +K K+
Sbjct: 16 AIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKEL 58
Score = 31.5 bits (72), Expect = 0.11
Identities = 7/42 (16%), Positives = 23/42 (54%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+K++++ K++ +K K ++K + + +K +K+ +
Sbjct: 20 EKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEEL 61
Score = 31.5 bits (72), Expect = 0.11
Identities = 7/41 (17%), Positives = 23/41 (56%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E +K E +K + ++K + + +K +K+ ++ + + ++
Sbjct: 28 QEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARR 68
Score = 29.1 bits (66), Expect = 0.69
Identities = 12/40 (30%), Positives = 24/40 (60%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E +K+ +K K+K K +KK+++KK + +K K + +
Sbjct: 5 EKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAE 44
Score = 27.2 bits (61), Expect = 3.0
Identities = 11/38 (28%), Positives = 20/38 (52%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
K K K K +KK+++KK + +K K ++K +
Sbjct: 14 DKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYEL 51
Score = 26.1 bits (58), Expect = 7.0
Identities = 8/40 (20%), Positives = 20/40 (50%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E + + +KK+++KK + +K K ++K +
Sbjct: 10 DKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEY 49
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 33.3 bits (76), Expect = 0.065
Identities = 15/73 (20%), Positives = 24/73 (32%), Gaps = 6/73 (8%)
Query: 107 EKLLPLLSNQSEDCL--YLNLYVPRP----VALEGEKKEKKKNKNKNKKKKKKKKKKKKK 160
E ++ L N + +L P + E K KK K +
Sbjct: 638 ELIIKLNKNNYKVNFQNFLEKIFGGPKHIFAISKKLINEAKIYWKAINNTKKSVVIKPLE 697
Query: 161 KKKKKKKKKKTKK 173
K KK +K +K+
Sbjct: 698 KPKKYEKTSVSKE 710
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 33.3 bits (76), Expect = 0.067
Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 18/75 (24%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKK---------------KKKKKKKKKKKTKKFLPPS 178
E N +K KKKK K K KK KK +KK + K PS
Sbjct: 406 SNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPS 465
Query: 179 DP---SGKAGVKSRE 190
D K VKS+E
Sbjct: 466 DSKAGGKKESVKSQE 480
Score = 28.2 bits (63), Expect = 2.8
Identities = 13/46 (28%), Positives = 19/46 (41%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
E K+ KKN+ K + K K K KK+ K + P +
Sbjct: 443 EDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKSQEDNNNIPPE 488
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 33.0 bits (76), Expect = 0.069
Identities = 15/48 (31%), Positives = 31/48 (64%), Gaps = 6/48 (12%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKK------KKKKKKKKKKKTKK 173
A E E++E+ + K + KKK++++ K K +K ++K++KK+ +K
Sbjct: 275 AAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 32.6 bits (75), Expect = 0.087
Identities = 11/37 (29%), Positives = 28/37 (75%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
E++E+K K ++++++ ++KK++KKK++++ K K
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302
Score = 29.1 bits (66), Expect = 1.1
Identities = 9/45 (20%), Positives = 31/45 (68%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
++ ++E+++ K +++++++ ++KK++KKK++++ L P
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSP 305
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 33.2 bits (76), Expect = 0.072
Identities = 13/36 (36%), Positives = 27/36 (75%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
E + KE+K + +N+KK+K++ + K+K++K ++KK
Sbjct: 399 EAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 32.8 bits (75), Expect = 0.099
Identities = 11/37 (29%), Positives = 25/37 (67%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
K +++K K + +KK+K++ + K+K++K ++KK
Sbjct: 398 AEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 32.1 bits (73), Expect = 0.17
Identities = 10/33 (30%), Positives = 24/33 (72%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
+ EK ++++N+ K K++ + K+K++K ++KK
Sbjct: 402 AKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 31.3 bits (71), Expect = 0.24
Identities = 12/43 (27%), Positives = 27/43 (62%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
A E + + K+ K K ++ +KK+K++ + K+K++K ++ K
Sbjct: 392 TDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 31.3 bits (71), Expect = 0.30
Identities = 9/38 (23%), Positives = 26/38 (68%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++ + K K + K+++ +KK+K++ + K+K++K ++
Sbjct: 395 SEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432
Score = 30.5 bits (69), Expect = 0.51
Identities = 11/40 (27%), Positives = 25/40 (62%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E + K K + +++ +KK+K++ + K+K++K + KK
Sbjct: 395 SEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 30.1 bits (68), Expect = 0.59
Identities = 9/38 (23%), Positives = 25/38 (65%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E ++ + K K++K K+++ +KK+K++ + K+ ++
Sbjct: 391 ETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQ 428
Score = 30.1 bits (68), Expect = 0.64
Identities = 13/42 (30%), Positives = 27/42 (64%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
V E + E+ + K K +K K+++ +KK+K++ + K+K+ K
Sbjct: 388 VKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQK 429
Score = 29.8 bits (67), Expect = 0.84
Identities = 12/40 (30%), Positives = 26/40 (65%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E+ E K + K K+++ +KK+K++ + K+K++K +K
Sbjct: 394 ASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433
Score = 27.5 bits (61), Expect = 4.7
Identities = 11/40 (27%), Positives = 23/40 (57%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
L K E ++ K K++K K+++ +KK+K++ + K
Sbjct: 385 LGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
MW N terminal. This domain is found in eukaryotes. This
domain is about 90 amino acids in length. This domain is
found associated with pfam00076. This domain is part of
U1 snRNP, which is the pre-mRNA binding protein of the
penta-snRNP spliceosome complex. It extends over a
distance of 180 A from its RNA binding domain, wraps
around the core domain of U1 snRNP consisting of the
seven Sm proteins and finally contacts U1-C, which is
crucial for 5'-splice-site recognition.
Length = 94
Score = 31.5 bits (72), Expect = 0.073
Identities = 13/49 (26%), Positives = 29/49 (59%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
+ K E + +K++++K++KK+K +KK +++ K++ P DP
Sbjct: 46 EFKDYKDEPPPEPTETWLEKREREKREKKEKLEKKLEEELKEWDPNEDP 94
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain. This
presumed domain is found at the N-terminus of Ribosomal
L30 proteins and has been termed RL30NT or NUC018.
Length = 71
Score = 30.7 bits (70), Expect = 0.079
Identities = 15/38 (39%), Positives = 24/38 (63%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
KK K+ K + K+ KK+ KK +KKK+K K+ +K+
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKY 41
Score = 26.4 bits (59), Expect = 2.6
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
K + +N+K + K+ KK+ KK +KKK+K
Sbjct: 2 LLKKRKRNEKLRAKRAKKRAAKKAARKKKRK 32
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 32.8 bits (75), Expect = 0.081
Identities = 14/39 (35%), Positives = 27/39 (69%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
E ++KN N N ++K+KKK++ K K++K++ + K+F
Sbjct: 341 ENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRF 379
Score = 27.4 bits (61), Expect = 5.3
Identities = 8/28 (28%), Positives = 19/28 (67%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
++K K K + K K++K++ + +K+ K+
Sbjct: 354 QQKEKKKEESKAKEEKQEDIEFEKRFKE 381
Score = 26.7 bits (59), Expect = 8.9
Identities = 9/36 (25%), Positives = 20/36 (55%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
E + K K K++ K K++K++ + +K+ K+
Sbjct: 346 EKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381
>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 594
Score = 32.7 bits (75), Expect = 0.091
Identities = 9/37 (24%), Positives = 14/37 (37%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K K N ++K K +K KK K+ +
Sbjct: 458 KPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFET 494
Score = 30.4 bits (69), Expect = 0.53
Identities = 9/38 (23%), Positives = 14/38 (36%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K N ++ K K +K KK K+ + K
Sbjct: 461 IKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALK 498
Score = 30.4 bits (69), Expect = 0.61
Identities = 11/36 (30%), Positives = 15/36 (41%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
+K KNK K ++K K +K KK F
Sbjct: 453 DKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDF 488
Score = 30.0 bits (68), Expect = 0.74
Identities = 10/43 (23%), Positives = 15/43 (34%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
K K K ++K K +K KK K+F +
Sbjct: 455 DKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDAL 497
Score = 26.9 bits (60), Expect = 7.8
Identities = 7/35 (20%), Positives = 14/35 (40%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
++K K +K KK K+ + K ++
Sbjct: 468 NNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQEI 502
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 32.2 bits (74), Expect = 0.096
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKK 162
E+KEKKK +K +KK K++K+K ++
Sbjct: 96 EEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 28.8 bits (65), Expect = 1.4
Identities = 10/28 (35%), Positives = 21/28 (75%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
++EK+K K +K++KK K++K+K ++
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 28.4 bits (64), Expect = 1.8
Identities = 10/29 (34%), Positives = 20/29 (68%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
+ +K+KKK K++KK K++K+K + +
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEPY 123
Score = 27.6 bits (62), Expect = 3.5
Identities = 10/28 (35%), Positives = 19/28 (67%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
E++K K K K++KK K++K+K ++
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 33.0 bits (75), Expect = 0.099
Identities = 15/48 (31%), Positives = 24/48 (50%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
KN+ ++ K KK KKKK KKK+ K + ++ + + SR
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQIPPEQQMEEKPSADAYEPLSR 162
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 31.6 bits (72), Expect = 0.11
Identities = 15/43 (34%), Positives = 20/43 (46%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
E K + K K K + K K +K+K K + KK K K K
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 31.3 bits (71), Expect = 0.18
Identities = 12/39 (30%), Positives = 24/39 (61%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
E E ++ + ++ + K + K+KKK++ K K +K+K K
Sbjct: 102 EEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140
Score = 30.9 bits (70), Expect = 0.19
Identities = 14/40 (35%), Positives = 26/40 (65%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E+++ + K ++K+KKK++ K K +K+K K + K K
Sbjct: 108 ETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPK 147
Score = 30.1 bits (68), Expect = 0.37
Identities = 11/40 (27%), Positives = 23/40 (57%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E +E+ ++ + + + K + K+KKK++ K K K+
Sbjct: 98 EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKE 137
Score = 29.3 bits (66), Expect = 0.64
Identities = 11/39 (28%), Positives = 23/39 (58%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
E E++ + + ++ + K + K+KKK++ K K +K K
Sbjct: 100 ETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEK 138
Score = 29.3 bits (66), Expect = 0.82
Identities = 7/40 (17%), Positives = 25/40 (62%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E++ ++++ ++ +++ + K + K+KKK++ K +
Sbjct: 96 DEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTE 135
Score = 28.2 bits (63), Expect = 1.7
Identities = 16/42 (38%), Positives = 21/42 (50%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
P + E KEKKK + K +K+K K + KK K K K
Sbjct: 111 QEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 26.3 bits (58), Expect = 9.3
Identities = 7/40 (17%), Positives = 24/40 (60%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+++E+ + ++ ++ +++ + K + K+KKK++ K
Sbjct: 94 SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPK 133
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 32.3 bits (74), Expect = 0.11
Identities = 12/36 (33%), Positives = 25/36 (69%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
E EK+E+K+ ++ KK ++ + KK +K+ +K++ K
Sbjct: 156 EAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
Score = 27.7 bits (62), Expect = 3.5
Identities = 10/36 (27%), Positives = 23/36 (63%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+ EK++ K + +KK ++ + KK +K+ +K++ K
Sbjct: 156 EAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
Score = 26.9 bits (60), Expect = 5.8
Identities = 11/40 (27%), Positives = 24/40 (60%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E + KE +K + K ++ +KK ++ + KK +K+ +K +
Sbjct: 151 ERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQA 190
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 32.4 bits (74), Expect = 0.11
Identities = 10/35 (28%), Positives = 21/35 (60%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
K K ++KK++ + ++KKK+KK K +++
Sbjct: 358 PKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRG 392
Score = 32.4 bits (74), Expect = 0.12
Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 1/48 (2%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
EK+ +K + K K K ++ KK++ + ++KK +K S+ G
Sbjct: 347 EKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKK-AKSERRGL 393
Score = 31.6 bits (72), Expect = 0.24
Identities = 12/40 (30%), Positives = 23/40 (57%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E K K K + KK++ + ++KKK+KK K +++ +
Sbjct: 355 EKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394
Score = 30.8 bits (70), Expect = 0.43
Identities = 9/38 (23%), Positives = 19/38 (50%), Gaps = 1/38 (2%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
P A K+++ + +KKK+KK K +++ +
Sbjct: 359 KPPTKAKPERDKKERPGR-YRRKKKEKKAKSERRGLQN 395
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 31.4 bits (72), Expect = 0.11
Identities = 11/35 (31%), Positives = 23/35 (65%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
+ +K+ ++ K ++++K K+KKKKKKK+ +
Sbjct: 56 TRGGRKRKAGASRNKAAEERRKLKEKKKKKKKELE 90
Score = 28.3 bits (64), Expect = 1.2
Identities = 13/32 (40%), Positives = 24/32 (75%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
G K++ ++NK ++++K K+KKKKKKK+ +
Sbjct: 59 GRKRKAGASRNKAAEERRKLKEKKKKKKKELE 90
Score = 27.6 bits (62), Expect = 2.4
Identities = 10/40 (25%), Positives = 21/40 (52%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+G + + + + K ++++K K+KKKKKK +
Sbjct: 50 DGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKKEL 89
Score = 27.6 bits (62), Expect = 2.7
Identities = 11/35 (31%), Positives = 21/35 (60%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
+ + + K ++++K K+KKKKKKK+ + F
Sbjct: 58 GGRKRKAGASRNKAAEERRKLKEKKKKKKKELENF 92
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 32.3 bits (74), Expect = 0.11
Identities = 14/46 (30%), Positives = 27/46 (58%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
P V + +KK+K +K KK K++K + K KK +++++ +K
Sbjct: 252 PSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQK 297
Score = 31.6 bits (72), Expect = 0.18
Identities = 15/44 (34%), Positives = 27/44 (61%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
P ++ +KK+KKK +K K KK K++K + K KK +++ +
Sbjct: 251 NPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294
Score = 31.2 bits (71), Expect = 0.28
Identities = 18/33 (54%), Positives = 21/33 (63%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ N +K KKKKKKKKK KK K KK K +K
Sbjct: 249 RANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRK 281
Score = 30.0 bits (68), Expect = 0.57
Identities = 20/42 (47%), Positives = 22/42 (52%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
P +LE K+ N K KKKKKKKKK KK K KK
Sbjct: 234 PDSLEDMYKKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKL 275
Score = 29.7 bits (67), Expect = 0.81
Identities = 15/40 (37%), Positives = 20/40 (50%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
K +KKK K K KK K KK K++K + K K +
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRR 292
Score = 28.1 bits (63), Expect = 2.7
Identities = 18/37 (48%), Positives = 20/37 (54%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K N +K KKKKKKKKK KK K KK K+
Sbjct: 243 KAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQ 279
Score = 27.3 bits (61), Expect = 4.9
Identities = 16/37 (43%), Positives = 19/37 (51%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ K K K KKKKK KK K KK K++K K
Sbjct: 249 RANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVK 285
Score = 26.6 bits (59), Expect = 9.1
Identities = 11/33 (33%), Positives = 20/33 (60%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
KK K K ++K + K KK +++++ +KK K
Sbjct: 268 KKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 31.6 bits (72), Expect = 0.11
Identities = 15/45 (33%), Positives = 25/45 (55%), Gaps = 3/45 (6%)
Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
RP EG KK K+K + K KK++ +K+K+K+++ K
Sbjct: 58 ESKRP---EGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALA 99
Score = 26.6 bits (59), Expect = 6.0
Identities = 11/29 (37%), Positives = 21/29 (72%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
K K+K ++ K K KK++ +K+K+K ++F
Sbjct: 66 KKAKEKLRRDKLKAKKEEAEKEKEKEERF 94
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 32.8 bits (75), Expect = 0.12
Identities = 14/35 (40%), Positives = 25/35 (71%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
K + K + ++K KKKKKK+KKK+++ K+++K
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEK 764
Score = 31.6 bits (72), Expect = 0.29
Identities = 15/36 (41%), Positives = 26/36 (72%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
G+ E K + + +K KKKKKK+KKK+++ K+++K
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765
Score = 30.1 bits (68), Expect = 0.88
Identities = 14/35 (40%), Positives = 24/35 (68%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
+ E K + K KKKKKK+KKK+++ K+++K +
Sbjct: 732 DAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 28.9 bits (65), Expect = 1.7
Identities = 8/31 (25%), Positives = 15/31 (48%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
+EK ++ + KK+K KK+ K +
Sbjct: 1551 SNQEKNIEEDYAESDIKKRKNKKQYKSNTEA 1581
Score = 28.9 bits (65), Expect = 2.2
Identities = 8/33 (24%), Positives = 18/33 (54%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
L ++K +++ ++ KK+K KK+ K +
Sbjct: 1549 VLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEA 1581
Score = 28.5 bits (64), Expect = 2.3
Identities = 13/34 (38%), Positives = 23/34 (67%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ K + ++K KKKKKK+KKK+++ K++ K
Sbjct: 732 DAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765
Score = 28.1 bits (63), Expect = 3.4
Identities = 12/36 (33%), Positives = 26/36 (72%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K+ + + + ++K KKKKKK+KKK+++ K+++ +
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 28.1 bits (63), Expect = 3.6
Identities = 8/29 (27%), Positives = 16/29 (55%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
++KN ++ + KK+K KK+ K +
Sbjct: 1553 QEKNIEEDYAESDIKKRKNKKQYKSNTEA 1581
Score = 28.1 bits (63), Expect = 3.9
Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 10/51 (19%)
Query: 135 GEKKEKKKN-----KNKNKKKKK-----KKKKKKKKKKKKKKKKKKTKKFL 175
KEKK N N+ K ++ KK+K KK+ K + + FL
Sbjct: 1537 ESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAELDFFL 1587
Score = 27.0 bits (60), Expect = 8.0
Identities = 13/35 (37%), Positives = 22/35 (62%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K + ++K KKKKKK+KKK+++ K+ +K
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEK 764
Score = 27.0 bits (60), Expect = 9.3
Identities = 16/68 (23%), Positives = 28/68 (41%), Gaps = 7/68 (10%)
Query: 121 LYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
L LNL E KK N+ K +KK + + + +++K+ + SD
Sbjct: 1488 LLLNLN-------GNENVNKKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDK 1540
Query: 181 SGKAGVKS 188
K ++S
Sbjct: 1541 EKKGNLES 1548
Score = 26.6 bits (59), Expect = 9.6
Identities = 12/48 (25%), Positives = 21/48 (43%), Gaps = 1/48 (2%)
Query: 128 PRPVALEGEKK-EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
V EK E+ ++ KK+K KK+ K + + K+ +F
Sbjct: 1546 LESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAELDFFLKRYLRF 1593
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region. This
domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 30.6 bits (70), Expect = 0.12
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ +KKK+KK+ K + K+ KK T +
Sbjct: 57 ADPSRKKKEKKEVKAESKRYNAKKLTLE 84
Score = 28.7 bits (65), Expect = 0.71
Identities = 9/24 (37%), Positives = 16/24 (66%)
Query: 144 KNKNKKKKKKKKKKKKKKKKKKKK 167
+ ++KKK+KK+ K + K+ KK
Sbjct: 57 ADPSRKKKEKKEVKAESKRYNAKK 80
Score = 27.9 bits (63), Expect = 1.0
Identities = 10/26 (38%), Positives = 16/26 (61%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKK 173
KKK+KK+ K + K+ KK + +K
Sbjct: 62 KKKEKKEVKAESKRYNAKKLTLEQRK 87
Score = 27.1 bits (61), Expect = 2.1
Identities = 11/32 (34%), Positives = 21/32 (65%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
+KKEKK+ K ++K+ KK +++K + +K
Sbjct: 62 KKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93
Score = 26.7 bits (60), Expect = 3.0
Identities = 9/34 (26%), Positives = 20/34 (58%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
+K+K+K + K + K+ KK +++K + +K
Sbjct: 60 SRKKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93
Score = 26.0 bits (58), Expect = 5.4
Identities = 9/34 (26%), Positives = 17/34 (50%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+KK + K K + K+ KK +++K + K
Sbjct: 60 SRKKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93
Score = 26.0 bits (58), Expect = 6.4
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKKKK 165
+ ++ KK+KK+ K + K+ KK
Sbjct: 57 ADPSRKKKEKKEVKAESKRYNAKK 80
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 32.3 bits (74), Expect = 0.13
Identities = 18/56 (32%), Positives = 23/56 (41%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
+ K+K +K K K K KKKKKK+K L + S K V E
Sbjct: 511 DSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEED 566
Score = 31.6 bits (72), Expect = 0.27
Identities = 15/40 (37%), Positives = 20/40 (50%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++ KKK+ +K K K K K KKKKKK+K
Sbjct: 507 AKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDL 546
Score = 31.6 bits (72), Expect = 0.28
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
KK+ K + +K++ K + KK+K+K KK K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLK 657
Score = 31.2 bits (71), Expect = 0.38
Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 5/47 (10%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK-----KKKTKKFL 175
+G KK KKK K K + K + KK+K+K KK K +K+ KK
Sbjct: 624 DGIKKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKNVIINEKRNKKAA 670
Score = 30.8 bits (70), Expect = 0.48
Identities = 14/35 (40%), Positives = 22/35 (62%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
G+ +K+K K K K++ K + KK+K+K KK K
Sbjct: 623 GDGIKKRKKKRKRKRRFLTKIEGVKKEKRKDKKLK 657
Score = 28.9 bits (65), Expect = 2.1
Identities = 15/38 (39%), Positives = 16/38 (42%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K KK +K K K K K K K KKK KK
Sbjct: 503 VGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKK 540
Score = 28.5 bits (64), Expect = 2.7
Identities = 12/49 (24%), Positives = 18/49 (36%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
K K +K K KKKKKK+K ++ + K +
Sbjct: 522 KAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDE 570
Score = 27.3 bits (61), Expect = 6.7
Identities = 14/43 (32%), Positives = 17/43 (39%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K KK++ K K K K K K KKKK K+
Sbjct: 499 TTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKE 541
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 32.2 bits (74), Expect = 0.13
Identities = 16/47 (34%), Positives = 28/47 (59%)
Query: 124 NLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
++V L E E + ++ ++ K+KKKKKKKKKKK++ ++
Sbjct: 59 EIFVQEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREW 105
Score = 27.6 bits (62), Expect = 4.1
Identities = 13/37 (35%), Positives = 25/37 (67%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
+EK+K +++ + ++ + ++ K+KKKKKKK KK
Sbjct: 63 QEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKK 99
Score = 27.6 bits (62), Expect = 4.4
Identities = 14/37 (37%), Positives = 25/37 (67%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+KEK +++ ++ + ++ K+KKKKKKKKK K+
Sbjct: 64 EKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKE 100
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 32.2 bits (73), Expect = 0.15
Identities = 15/47 (31%), Positives = 25/47 (53%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
E E+K+KK+ K K K+ KK K +K+ K K + ++ +P
Sbjct: 17 ELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSE 63
Score = 32.2 bits (73), Expect = 0.17
Identities = 19/58 (32%), Positives = 29/58 (50%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
E EKK + + + KKKK++K K+K+ KK K +K+ K G K E+
Sbjct: 7 EAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64
Score = 31.8 bits (72), Expect = 0.25
Identities = 16/64 (25%), Positives = 33/64 (51%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
++E ++ K K +K K+K+ KK K +K+ K K + ++ ++ K+ K
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65
Query: 188 SREQ 191
SR++
Sbjct: 66 SRKR 69
Score = 31.0 bits (70), Expect = 0.44
Identities = 15/41 (36%), Positives = 26/41 (63%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
E E ++K + + ++KKKK++K K+K+ KK K +K K
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKA 45
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 31.5 bits (72), Expect = 0.15
Identities = 18/39 (46%), Positives = 26/39 (66%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
+E +K E++K + +N KK+K KK KKKKK KKK +
Sbjct: 168 KVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206
Score = 29.6 bits (67), Expect = 0.80
Identities = 20/40 (50%), Positives = 27/40 (67%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
EK EKKK + + K+++ KK+K KK KKKKK KK + L
Sbjct: 167 EKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206
Score = 29.6 bits (67), Expect = 0.87
Identities = 12/53 (22%), Positives = 30/53 (56%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
+++++ + + K +KK+K+K+ KKK+ +K + + K A +++ E
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAE 61
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 30.7 bits (70), Expect = 0.18
Identities = 15/41 (36%), Positives = 22/41 (53%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
+G KK + +K+ K +K+KKKKKKK+ K F
Sbjct: 50 DGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKELKNF 90
Score = 26.5 bits (59), Expect = 5.6
Identities = 11/32 (34%), Positives = 17/32 (53%)
Query: 144 KNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
K + K +K+ K +K+KKKKK K+
Sbjct: 57 KKGRRPKTARKESVAAKAAEKEKKKKKKKELK 88
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 31.9 bits (72), Expect = 0.19
Identities = 14/45 (31%), Positives = 21/45 (46%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
AL+G+ + K K + + + KKK K+ KK K KK
Sbjct: 258 ALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEKK 302
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 31.7 bits (72), Expect = 0.20
Identities = 13/42 (30%), Positives = 24/42 (57%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
R E E K+K +++N + K++K++K+KK KK+
Sbjct: 122 RFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163
Score = 30.6 bits (69), Expect = 0.50
Identities = 15/45 (33%), Positives = 28/45 (62%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
ALE ++ K + + K K K+ + + K++K++K+KK K+ LP
Sbjct: 117 ALELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLP 161
Score = 27.9 bits (62), Expect = 3.9
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
EK+ KKK +++ + K++K++K+KK KK+
Sbjct: 127 EKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 31.7 bits (72), Expect = 0.20
Identities = 10/40 (25%), Positives = 15/40 (37%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E +KK + K K + K K + K +K K
Sbjct: 206 EAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAK 245
Score = 30.5 bits (69), Expect = 0.49
Identities = 11/38 (28%), Positives = 15/38 (39%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E + KKK + KKK + K K + K K
Sbjct: 204 EAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKA 241
Score = 30.5 bits (69), Expect = 0.54
Identities = 10/40 (25%), Positives = 15/40 (37%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E K+K + K K + K K + K +K
Sbjct: 204 EAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAA 243
Score = 30.2 bits (68), Expect = 0.64
Identities = 9/38 (23%), Positives = 14/38 (36%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ KKK + K K + K +K K +K
Sbjct: 212 AAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEK 249
Score = 29.4 bits (66), Expect = 1.1
Identities = 11/43 (25%), Positives = 17/43 (39%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A E +KK + + K K + KKK + K K +
Sbjct: 195 AAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAA 237
Score = 28.6 bits (64), Expect = 1.9
Identities = 10/42 (23%), Positives = 15/42 (35%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A E +KK + K K + K +K K +K
Sbjct: 212 AAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAA 253
Score = 28.2 bits (63), Expect = 2.4
Identities = 8/42 (19%), Positives = 13/42 (30%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A +K + K + K +K K +K K
Sbjct: 213 AEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAK 254
Score = 27.1 bits (60), Expect = 6.6
Identities = 8/41 (19%), Positives = 12/41 (29%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
A KK+ K + K +K K +K
Sbjct: 211 AAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAA 251
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 31.3 bits (71), Expect = 0.22
Identities = 18/62 (29%), Positives = 22/62 (35%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
+ EKK KK K K K K K + K KK K K P+ P+
Sbjct: 82 PKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPAAPNQPARPPSA 141
Query: 187 KS 188
S
Sbjct: 142 AS 143
Score = 30.9 bits (70), Expect = 0.27
Identities = 23/62 (37%), Positives = 26/62 (41%), Gaps = 1/62 (1%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP-PSDPSGKAGV 186
P+P E K K K K +KK KK K K K K K K K P P P K
Sbjct: 67 PKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAA 126
Query: 187 KS 188
K+
Sbjct: 127 KA 128
Score = 29.0 bits (65), Expect = 1.5
Identities = 22/62 (35%), Positives = 26/62 (41%), Gaps = 1/62 (1%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
P K K+K K + K KK K K K K K K K K + K PPS + KA
Sbjct: 72 EPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPK-KPPSKTAAKAPA 130
Query: 187 KS 188
Sbjct: 131 AP 132
Score = 26.3 bits (58), Expect = 9.4
Identities = 16/62 (25%), Positives = 23/62 (37%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
+P + + K K K K K K + K KK K K + + + SG A
Sbjct: 91 EKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPAAPNQPARPPSAASASGAATG 150
Query: 187 KS 188
S
Sbjct: 151 PS 152
>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
Length = 479
Score = 31.7 bits (72), Expect = 0.24
Identities = 13/37 (35%), Positives = 16/37 (43%)
Query: 149 KKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
KKKKKKKKK K+ + K F+ K
Sbjct: 9 KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRH 45
Score = 28.6 bits (64), Expect = 2.3
Identities = 13/24 (54%), Positives = 15/24 (62%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKT 171
KKKKKKKKK K+ + KKT
Sbjct: 9 KKKKKKKKKASKESSNGSQNSKKT 32
Score = 27.0 bits (60), Expect = 6.3
Identities = 11/39 (28%), Positives = 17/39 (43%), Gaps = 5/39 (12%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKK-----KKKKKKKTKK 173
KKK K K K K+ + KK ++KK+ +
Sbjct: 9 KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRKKRHNE 47
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 31.3 bits (72), Expect = 0.26
Identities = 13/43 (30%), Positives = 24/43 (55%), Gaps = 2/43 (4%)
Query: 134 EGEKKEKKKNKNKNKKKKKKK--KKKKKKKKKKKKKKKKTKKF 174
+ +K K + +K+ K +KK+KKKKK+K+K ++
Sbjct: 595 QLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEEL 637
Score = 30.6 bits (70), Expect = 0.55
Identities = 15/33 (45%), Positives = 23/33 (69%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
E E +K NK +KK+KKKKK+K+K+++ K
Sbjct: 606 AHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
Score = 28.3 bits (64), Expect = 3.1
Identities = 13/23 (56%), Positives = 16/23 (69%)
Query: 149 KKKKKKKKKKKKKKKKKKKKKKT 171
+K +K KKKKKKK K K K +T
Sbjct: 680 EKIQKPKKKKKKKPKTVKPKPRT 702
Score = 27.9 bits (63), Expect = 3.7
Identities = 16/55 (29%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK---KTKKFLPPSDPSGKAGVKSRE 190
E+KK K + ++ K ++ +K+ ++ K+ KK + K L G A VK+ E
Sbjct: 554 EEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHE 608
Score = 27.9 bits (63), Expect = 4.4
Identities = 13/33 (39%), Positives = 22/33 (66%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
+ + K NK +KK+KKKKK+K+K+++ K
Sbjct: 606 AHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 31.2 bits (71), Expect = 0.26
Identities = 15/55 (27%), Positives = 25/55 (45%), Gaps = 1/55 (1%)
Query: 127 VPRPVALE-GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
PV + E K K K K K K KK +++ K++ K + + + P+ P
Sbjct: 86 KEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPARP 140
Score = 27.0 bits (60), Expect = 5.8
Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 1/54 (1%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
P P+ E K+ K K K K K K KK +++ K+ K + P S
Sbjct: 78 PEPIP-EPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPAS 130
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains separated
by a hinge in the middle. The eukaryotic SMC proteins
form two kind of heterodimers: the SMC1/SMC3 and the
SMC2/SMC4 types. These heterodimers constitute an
essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 31.5 bits (71), Expect = 0.27
Identities = 14/50 (28%), Positives = 32/50 (64%)
Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
V E E+KE++ NK++ KK++ +++KK+ ++ ++ ++ K+FL
Sbjct: 975 NVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFL 1024
Score = 27.2 bits (60), Expect = 7.3
Identities = 8/39 (20%), Positives = 26/39 (66%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
L E+++ +K K + +K++K K ++++ + +++ K++
Sbjct: 776 ELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEE 814
Score = 26.9 bits (59), Expect = 8.4
Identities = 9/40 (22%), Positives = 26/40 (65%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
++E+K K K +++K++K K ++++ + +++ K + L
Sbjct: 778 AEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAEL 817
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 31.2 bits (71), Expect = 0.27
Identities = 16/49 (32%), Positives = 26/49 (53%), Gaps = 8/49 (16%)
Query: 135 GEKKEKKKNKNK---NKKKKKKK-----KKKKKKKKKKKKKKKKTKKFL 175
G +K KK + K NK+K +KK KK+ + K+K+ + +K L
Sbjct: 258 GSRKGKKDKEGKSTTNKEKARKKNFMMTLHKKRVRGKQKRSLRDKQKVL 306
Score = 29.6 bits (67), Expect = 0.84
Identities = 11/43 (25%), Positives = 20/43 (46%), Gaps = 10/43 (23%)
Query: 138 KEKKKNKN----------KNKKKKKKKKKKKKKKKKKKKKKKK 170
KEK + KN + K+K+ + K+K + ++KK
Sbjct: 274 KEKARKKNFMMTLHKKRVRGKQKRSLRDKQKVLRAHILRQKKG 316
Score = 28.9 bits (65), Expect = 1.7
Identities = 10/45 (22%), Positives = 20/45 (44%), Gaps = 10/45 (22%)
Query: 135 GEKKEKKKN----------KNKNKKKKKKKKKKKKKKKKKKKKKK 169
++K +KKN + K K+ + K+K + ++KK
Sbjct: 273 NKEKARKKNFMMTLHKKRVRGKQKRSLRDKQKVLRAHILRQKKGG 317
Score = 26.9 bits (60), Expect = 7.3
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
KE ++++ K +K KK K+ K K+K +KK
Sbjct: 248 KEGREDREKFGSRKGKKDKEGKSTTNKEKARKK 280
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 29.6 bits (67), Expect = 0.27
Identities = 14/34 (41%), Positives = 15/34 (44%)
Query: 150 KKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
KKKKKKKKKK + K KK G
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGA 55
Score = 29.6 bits (67), Expect = 0.28
Identities = 13/25 (52%), Positives = 14/25 (56%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTK 172
KKKKKKKKKK + K KK
Sbjct: 23 VKKKKKKKKKKAEDTAATAKAKKAT 47
Score = 29.3 bits (66), Expect = 0.38
Identities = 12/26 (46%), Positives = 13/26 (50%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKK 173
KKKKKKKKKK + K K
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKAT 47
Score = 29.3 bits (66), Expect = 0.40
Identities = 14/30 (46%), Positives = 17/30 (56%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
KKKKKKKKKK + K KK T + +
Sbjct: 24 KKKKKKKKKKAEDTAATAKAKKATAEDVSE 53
Score = 27.7 bits (62), Expect = 1.4
Identities = 15/44 (34%), Positives = 18/44 (40%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
K + KKKKKKKKKK + T K + G E
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEG 60
Score = 27.3 bits (61), Expect = 1.8
Identities = 12/40 (30%), Positives = 17/40 (42%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
KK+KKK K K + K KK + + + FL
Sbjct: 23 VKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEGFL 62
Score = 26.2 bits (58), Expect = 3.7
Identities = 15/36 (41%), Positives = 16/36 (44%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
L G K E K K KKKKKK + K KK
Sbjct: 12 ELLGPKTEADLVKKKKKKKKKKAEDTAATAKAKKAT 47
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 31.4 bits (72), Expect = 0.27
Identities = 11/23 (47%), Positives = 17/23 (73%)
Query: 149 KKKKKKKKKKKKKKKKKKKKKKT 171
+K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755
Score = 30.6 bits (70), Expect = 0.46
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 153 KKKKKKKKKKKKKKKKKKTKK 173
+K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 30.6 bits (70), Expect = 0.50
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 148 KKKKKKKKKKKKKKKKKKKKK 168
+K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 30.2 bits (69), Expect = 0.71
Identities = 10/21 (47%), Positives = 16/21 (76%)
Query: 146 KNKKKKKKKKKKKKKKKKKKK 166
+ ++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 30.2 bits (69), Expect = 0.74
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 152 KKKKKKKKKKKKKKKKKKKTK 172
+K++ KKKKK+KK KK + K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 29.0 bits (66), Expect = 1.5
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 125 LYVPRPVALEGEKKEKKKNKNKNKKKKKKKKK 156
+++ P+ L+ EK+ KK K + K KK + +K
Sbjct: 722 IFIAAPLWLDLEKRRLKKKKKRKKVKKWEVEK 753
Score = 28.7 bits (65), Expect = 2.0
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 144 KNKNKKKKKKKKKKKKKKKKK 164
+ + KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 27.5 bits (62), Expect = 5.7
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 139 EKKKNKNKNKKKKKKKKKKKK 159
EK++ K K K+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 31.1 bits (71), Expect = 0.29
Identities = 11/38 (28%), Positives = 21/38 (55%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
++E K+ + + + + +KK +KK KK KK K +
Sbjct: 75 EEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEAL 112
Score = 29.9 bits (68), Expect = 0.79
Identities = 14/40 (35%), Positives = 21/40 (52%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
E KE + K + +KK +KK KK KK K ++ + L
Sbjct: 77 ELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEEL 116
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 30.4 bits (69), Expect = 0.29
Identities = 13/64 (20%), Positives = 29/64 (45%), Gaps = 9/64 (14%)
Query: 136 EKKEKKKNKNKN---------KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGV 186
EKK+ + N + +++KK +K K+K ++++ K L + + V
Sbjct: 32 EKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTV 91
Query: 187 KSRE 190
K+ +
Sbjct: 92 KATK 95
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 31.0 bits (71), Expect = 0.29
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKK 162
+KEKKK +K +KK K++K K ++
Sbjct: 97 AEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 27.9 bits (63), Expect = 3.0
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
E +K K K K++KK K++K K ++
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 27.9 bits (63), Expect = 3.1
Identities = 10/28 (35%), Positives = 19/28 (67%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
+ EK+K K +K++KK K++K K ++
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 26.8 bits (60), Expect = 5.9
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKF 174
+K+KKK K++KK K++K K + +
Sbjct: 98 EKEKKKAMSKEEKKAIKEEKDKLEEPY 124
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 30.5 bits (69), Expect = 0.30
Identities = 16/31 (51%), Positives = 18/31 (58%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
K KKKK +KKK KK KKKK +K P
Sbjct: 49 KTSKKKKTTPRKKKTTKKTKKKKKEKEEVPE 79
Score = 30.2 bits (68), Expect = 0.51
Identities = 17/31 (54%), Positives = 20/31 (64%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
K KKKK +KKK KK KKKKK K+ +P
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVP 78
Score = 29.0 bits (65), Expect = 1.1
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
KK KK K +KKK KK KKKKK+K++ +
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPEL 80
Score = 29.0 bits (65), Expect = 1.2
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
KK KKKK +KKK KK KKK K+ +
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPE 79
Score = 29.0 bits (65), Expect = 1.3
Identities = 16/45 (35%), Positives = 22/45 (48%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
+ KK K K +KKK KK KKKKK+K++ + S
Sbjct: 44 EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDS 88
Score = 28.6 bits (64), Expect = 1.7
Identities = 18/47 (38%), Positives = 23/47 (48%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
E KK +K K +KKK KK KKKKK+K++ SD
Sbjct: 44 EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEE 90
>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
peptidyl-prolyl isomerase. This family is only found at
the amino terminus of pfam00254. This domain is of
unknown function.
Length = 124
Score = 30.2 bits (69), Expect = 0.31
Identities = 13/38 (34%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 150 KKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
KK + K++ K +KK ++ K + FL ++ + K GVK
Sbjct: 79 KKLQAKQQAKAEKKAEENKAAGEAFL--AENAKKEGVK 114
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 30.1 bits (68), Expect = 0.31
Identities = 12/39 (30%), Positives = 23/39 (58%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++KE++K+ +KKK+KK + + KK KK+ +
Sbjct: 66 RKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRS 104
Score = 29.7 bits (67), Expect = 0.47
Identities = 12/37 (32%), Positives = 20/37 (54%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ K+K + K+ + KKK+KK + + KK KK
Sbjct: 64 PQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKK 100
Score = 26.6 bits (59), Expect = 5.4
Identities = 13/45 (28%), Positives = 22/45 (48%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
+K K++ K+ KKK+KK + + KK K ++ S S
Sbjct: 66 RKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSS 110
Score = 25.8 bits (57), Expect = 8.6
Identities = 12/49 (24%), Positives = 25/49 (51%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
L+ ++++K+ + ++ KKK+KK + + KK K+ S S
Sbjct: 61 LQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSS 109
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 31.3 bits (71), Expect = 0.32
Identities = 13/56 (23%), Positives = 21/56 (37%), Gaps = 1/56 (1%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
E+ EK K K K+ + + KK K + P+ P K+ +R
Sbjct: 42 EQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPR-KSAESTRSS 96
>gnl|CDD|180505 PRK06278, PRK06278, cobyrinic acid a,c-diamide synthase; Validated.
Length = 476
Score = 31.2 bits (71), Expect = 0.32
Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 3/48 (6%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF---LPPSDPSGK 183
EK++ KNK K+K K + KK+++ K L SGK
Sbjct: 205 EKEEIFKKNKILKEKLKSRSGISTNNKKEERNKPKGIILLATGSESGK 252
>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
Length = 801
Score = 31.3 bits (71), Expect = 0.32
Identities = 12/45 (26%), Positives = 17/45 (37%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
R E + +K + K K KKK+ KK K K +
Sbjct: 591 RAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAE 635
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 31.1 bits (71), Expect = 0.33
Identities = 14/69 (20%), Positives = 29/69 (42%), Gaps = 1/69 (1%)
Query: 105 FLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
++E+L + +++ L + + ++ + K K K K K + ++ K
Sbjct: 326 YIERLKKAKTKKTQTRL-DSFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAK 384
Query: 165 KKKKKKTKK 173
KKK KK
Sbjct: 385 SSGKKKVKK 393
Score = 30.0 bits (68), Expect = 0.72
Identities = 16/43 (37%), Positives = 21/43 (48%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
KK KK K K KKKKKK +P S+ S +A ++
Sbjct: 347 TATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKK 389
Score = 26.9 bits (60), Expect = 6.8
Identities = 13/47 (27%), Positives = 16/47 (34%)
Query: 144 KNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
K KK K K KKKKKK K + K ++
Sbjct: 347 TATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393
>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
Length = 290
Score = 30.9 bits (70), Expect = 0.34
Identities = 13/46 (28%), Positives = 18/46 (39%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
E K K K K K K+ K +K +K + K+F P
Sbjct: 242 EAKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKK 287
Score = 28.5 bits (64), Expect = 2.2
Identities = 8/40 (20%), Positives = 14/40 (35%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
+P ++ K K K + K+ +KKK
Sbjct: 249 AKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288
>gnl|CDD|191868 pfam07839, CaM_binding, Plant calmodulin-binding domain. The
sequences featured in this family are found repeated in
a number of plant calmodulin-binding proteins, and are
thought to constitute the calmodulin-binding domains.
Binding of the proteins to calmodulin depends on the
presence of calcium ions. These proteins are thought to
be involved in various processes, such as plant defence
responses and stolonisation or tuberization.
Length = 115
Score = 29.7 bits (67), Expect = 0.36
Identities = 15/68 (22%), Positives = 24/68 (35%)
Query: 124 NLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
R + + +++K N KK K+ K +K +K + FLP
Sbjct: 4 KKEERRSKKEDPKPEKEKPKSWSNLKKVILLKRFVKSLEKVRKFNPRGPNFLPLEPDPEA 63
Query: 184 AGVKSREQ 191
V R Q
Sbjct: 64 EKVDLRHQ 71
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many hypothetical
proteins.
Length = 98
Score = 29.4 bits (66), Expect = 0.38
Identities = 15/43 (34%), Positives = 18/43 (41%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
KK K K K+ K + K KK +KK KKK P
Sbjct: 19 SNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESK 61
Score = 29.0 bits (65), Expect = 0.52
Identities = 15/42 (35%), Positives = 21/42 (50%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
KK+ K K + K + K KK +KK KKK + +F S
Sbjct: 22 KKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRR 63
Score = 28.6 bits (64), Expect = 0.85
Identities = 13/39 (33%), Positives = 19/39 (48%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K++KK K + K + K KK +KK KKK +
Sbjct: 19 SNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFP 57
Score = 28.2 bits (63), Expect = 1.1
Identities = 12/40 (30%), Positives = 21/40 (52%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
K++KKK + + K KK +KK KKK + + ++
Sbjct: 23 KRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKR 62
Score = 27.8 bits (62), Expect = 1.4
Identities = 15/42 (35%), Positives = 20/42 (47%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
KK KKK K K + K KK +KK KKK + + +
Sbjct: 22 KKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRR 63
Score = 27.8 bits (62), Expect = 1.6
Identities = 11/41 (26%), Positives = 22/41 (53%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
+ + K K +KK KKK + + ++ K+++T+ LP
Sbjct: 30 RTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRTEDGLP 70
Score = 27.0 bits (60), Expect = 2.5
Identities = 17/42 (40%), Positives = 20/42 (47%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
N K KKKKK+ K + K KK +KK KK P P
Sbjct: 19 SNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60
Score = 25.9 bits (57), Expect = 7.5
Identities = 18/54 (33%), Positives = 25/54 (46%), Gaps = 1/54 (1%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
KE+KKN+ +K+ KK+KKKKK+ KT + K K E
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATK-KGQKKDKKKDEF 53
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 30.3 bits (69), Expect = 0.39
Identities = 10/42 (23%), Positives = 27/42 (64%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
V +E +E++ ++ K+ K K++K+++ ++K+K+ K +
Sbjct: 109 VEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQ 150
Score = 29.9 bits (68), Expect = 0.43
Identities = 9/42 (21%), Positives = 29/42 (69%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
E E+ ++++ ++ +K+ K K++K+++ ++K+K+ K+ +
Sbjct: 110 EVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKEQM 151
Score = 28.4 bits (64), Expect = 1.3
Identities = 10/37 (27%), Positives = 24/37 (64%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
E E+ ++ K K K++K+++ ++K+K+ K++ K
Sbjct: 116 EEEQIDELLEKELAKLKREKRRENERKQKEILKEQMK 152
Score = 28.4 bits (64), Expect = 1.5
Identities = 9/37 (24%), Positives = 24/37 (64%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
E E +E + + ++ +K+ K K++K+++ ++K+K
Sbjct: 108 EVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144
Score = 28.4 bits (64), Expect = 1.7
Identities = 8/39 (20%), Positives = 22/39 (56%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
E E+++ +K+ K K++K+++ ++K+K+
Sbjct: 107 EEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145
Score = 27.6 bits (62), Expect = 3.1
Identities = 8/40 (20%), Positives = 25/40 (62%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E + ++ ++ + + +K+ K K++K+++ ++K K+
Sbjct: 106 EEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145
Score = 26.9 bits (60), Expect = 5.2
Identities = 8/40 (20%), Positives = 24/40 (60%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E+ E ++ + + + +K+ K K++K+++ ++ +K
Sbjct: 105 EEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144
Score = 26.5 bits (59), Expect = 6.8
Identities = 7/33 (21%), Positives = 20/33 (60%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
E + + K++K+++ ++K+K+ K++ K
Sbjct: 122 ELLEKELAKLKREKRRENERKQKEILKEQMKML 154
Score = 26.5 bits (59), Expect = 7.2
Identities = 8/27 (29%), Positives = 19/27 (70%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTK 172
K K K++K+++ ++K+K+ K++ K
Sbjct: 126 KELAKLKREKRRENERKQKEILKEQMK 152
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 30.4 bits (69), Expect = 0.40
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 153 KKKKKKKKKKKKKKKKKKTKKFLPP 177
KK K K+ +K + KKK KK P
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKKDDNP 109
Score = 28.4 bits (64), Expect = 1.5
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 147 NKKKKKKKKKKKKKKKKKKKK 167
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 28.0 bits (63), Expect = 2.2
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 148 KKKKKKKKKKKKKKKKKKKKK 168
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 28.0 bits (63), Expect = 2.2
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 149 KKKKKKKKKKKKKKKKKKKKK 169
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 28.0 bits (63), Expect = 2.2
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 150 KKKKKKKKKKKKKKKKKKKKK 170
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 27.3 bits (61), Expect = 3.7
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 145 NKNKKKKKKKKKKKKKKKKKK 165
K K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 26.9 bits (60), Expect = 5.4
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 152 KKKKKKKKKKKKKKKKKKKTKKFLP 176
KK K K+ +K + KKKK K P
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKKDDNP 109
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 30.8 bits (70), Expect = 0.41
Identities = 9/40 (22%), Positives = 23/40 (57%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E + K+K K ++ K+K++K+ +++ K+ K+
Sbjct: 262 FESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQ 301
Score = 30.4 bits (69), Expect = 0.51
Identities = 12/40 (30%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E E K + K K K ++ K+K++K+ +++ K++K K
Sbjct: 265 EYEPINKPV-RPKRKTKAQRNKEKRRKELEREAKEEKQLK 303
Score = 29.3 bits (66), Expect = 1.2
Identities = 11/36 (30%), Positives = 24/36 (66%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
+ K+K K + K+K++K+ +++ K++K+ KKK
Sbjct: 270 NKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKK 305
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 31.0 bits (70), Expect = 0.42
Identities = 14/22 (63%), Positives = 18/22 (81%)
Query: 152 KKKKKKKKKKKKKKKKKKKTKK 173
+KKKKKKK K+KKKKK ++ K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASK 705
Score = 30.6 bits (69), Expect = 0.54
Identities = 14/23 (60%), Positives = 20/23 (86%)
Query: 151 KKKKKKKKKKKKKKKKKKKKTKK 173
+KKKKKKK K+KKKKK ++ +K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 29.8 bits (67), Expect = 0.87
Identities = 14/23 (60%), Positives = 19/23 (82%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKK 170
+KKKKKKK K+KKKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 28.7 bits (64), Expect = 2.0
Identities = 13/22 (59%), Positives = 17/22 (77%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKK 167
K KKKKK K+KKKKK ++ K+
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQ 706
Score = 27.9 bits (62), Expect = 4.5
Identities = 12/23 (52%), Positives = 17/23 (73%)
Query: 144 KNKNKKKKKKKKKKKKKKKKKKK 166
+ K KKKK K+KKKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 27.5 bits (61), Expect = 5.2
Identities = 13/23 (56%), Positives = 18/23 (78%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKK 168
+ KKKKKK K+KKKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 27.1 bits (60), Expect = 6.4
Identities = 11/23 (47%), Positives = 16/23 (69%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKK 162
+KK K K K+KKKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 26.8 bits (59), Expect = 9.6
Identities = 13/25 (52%), Positives = 19/25 (76%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKK 159
G +K+KKK K+K KKKKK ++ K+
Sbjct: 682 GSRKKKKKKKSKEKKKKKNREASKQ 706
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 29.3 bits (66), Expect = 0.43
Identities = 12/47 (25%), Positives = 19/47 (40%), Gaps = 7/47 (14%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKK-------KKKKKKKKKKKKKKKTKK 173
+ ++ + K + K KK + +K K KK KK KK
Sbjct: 19 DKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKK 65
Score = 27.4 bits (61), Expect = 2.0
Identities = 11/37 (29%), Positives = 14/37 (37%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
K +K + +K K KK KK KK K
Sbjct: 32 GKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCK 68
Score = 26.2 bits (58), Expect = 4.6
Identities = 13/39 (33%), Positives = 17/39 (43%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+ K K + K K K KK+ K K K+ K KK
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAP 40
Score = 26.2 bits (58), Expect = 5.2
Identities = 13/36 (36%), Positives = 18/36 (50%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+K K K + K+K K KK+ K K K+ K K
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKN 37
Score = 26.2 bits (58), Expect = 5.5
Identities = 16/52 (30%), Positives = 22/52 (42%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
KK K K + K K K KK+ K K K+ K KK + + + K
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEK 53
Score = 25.9 bits (57), Expect = 7.3
Identities = 11/37 (29%), Positives = 13/37 (35%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
KK + +K K KK KK KK K
Sbjct: 32 GKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCK 68
>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
Length = 381
Score = 30.4 bits (69), Expect = 0.46
Identities = 12/44 (27%), Positives = 19/44 (43%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
P A + K + K +K K ++ +KK KK + K K
Sbjct: 330 PAAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAK 373
Score = 30.0 bits (68), Expect = 0.70
Identities = 15/51 (29%), Positives = 21/51 (41%)
Query: 123 LNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
LN + L +EK K K K ++ +KK KK + K K K
Sbjct: 328 LNPAAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKALYKS 378
Score = 28.1 bits (63), Expect = 3.3
Identities = 10/35 (28%), Positives = 16/35 (45%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
+ K K +K K ++ +KK KK + K K
Sbjct: 341 LAQEKATAKGAQKVKNRRARKKAKKARLAKVAKAL 375
Score = 27.7 bits (62), Expect = 4.3
Identities = 12/42 (28%), Positives = 17/42 (40%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
L EK K + ++ +KK KK + K K K K
Sbjct: 340 VLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIKA 381
Score = 26.6 bits (59), Expect = 9.3
Identities = 12/49 (24%), Positives = 21/49 (42%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKS 188
KK K ++K K +K K ++ +KK K L + +K+
Sbjct: 333 KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIKA 381
>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
topoisomerase IA [General function prediction only].
Length = 298
Score = 30.4 bits (69), Expect = 0.47
Identities = 15/38 (39%), Positives = 15/38 (39%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
KK K K KK KK KK K KK KK
Sbjct: 259 GGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296
Score = 28.5 bits (64), Expect = 2.2
Identities = 12/40 (30%), Positives = 14/40 (35%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
K KK KK K KK KK K + + K
Sbjct: 258 KGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKKA 297
Score = 27.7 bits (62), Expect = 3.2
Identities = 14/39 (35%), Positives = 16/39 (41%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+G +K K K KK KK KK K KK K
Sbjct: 258 KGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296
Score = 27.7 bits (62), Expect = 3.4
Identities = 15/41 (36%), Positives = 17/41 (41%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
L + + K KK K KK KK KK KTKK
Sbjct: 252 LAERRAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKK 292
Score = 27.4 bits (61), Expect = 4.5
Identities = 14/40 (35%), Positives = 15/40 (37%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ KK K K KK KK KK K KK K
Sbjct: 256 RAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAK 295
>gnl|CDD|197876 smart00792, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP) is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 124
Score = 29.5 bits (66), Expect = 0.47
Identities = 16/44 (36%), Positives = 24/44 (54%)
Query: 147 NKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
NKK KK ++ +KK +KK+KK L P S + V+ R+
Sbjct: 51 NKKPKKISAEEAEKKLLQKKEKKALTNVLRPEPRSPRRCVRLRD 94
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 30.7 bits (70), Expect = 0.47
Identities = 12/45 (26%), Positives = 20/45 (44%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
KN KK+ K K+ +++K K++ K S K+ E
Sbjct: 10 KNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADDE 54
Score = 30.0 bits (68), Expect = 0.90
Identities = 8/34 (23%), Positives = 15/34 (44%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
KK K + + +K K++ K K + +K
Sbjct: 14 KKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRK 47
Score = 28.4 bits (64), Expect = 2.8
Identities = 10/40 (25%), Positives = 19/40 (47%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
KK+ K K +++K K++ K K +++K D
Sbjct: 14 KKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADD 53
Score = 26.9 bits (60), Expect = 8.8
Identities = 9/36 (25%), Positives = 16/36 (44%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
AL+ K K+ + K K++ K K + +K
Sbjct: 12 ALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRK 47
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 29.5 bits (67), Expect = 0.49
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
+ +K K ++ + K K+K+KKKKKKK+ +
Sbjct: 58 RGGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89
Score = 29.5 bits (67), Expect = 0.51
Identities = 11/34 (32%), Positives = 21/34 (61%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
+ + +++ + K K+K+KKKKKKK+ + F
Sbjct: 58 RGGRKGKTAREEAVEAKAKEKEKKKKKKKELEDF 91
Score = 29.1 bits (66), Expect = 0.68
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
K K + + K K+K+KKKKKKK+ +
Sbjct: 59 GGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89
Score = 28.4 bits (64), Expect = 1.2
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
+ K K +++ + K K+K+KKKKKKK+
Sbjct: 58 RGGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89
Score = 26.4 bits (59), Expect = 4.7
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
+ +K + + + K K+K+KKKKKKK+ +
Sbjct: 58 RGGRKGKTAREEAVEAKAKEKEKKKKKKKELE 89
>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
Length = 145
Score = 29.7 bits (67), Expect = 0.55
Identities = 15/31 (48%), Positives = 17/31 (54%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
K K KKK KK K KK K KK+K +T
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRT 31
Score = 29.3 bits (66), Expect = 0.82
Identities = 15/38 (39%), Positives = 19/38 (50%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
K K KK KK K KK K KK+K + + +F P
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRTSVRFFRP 38
Score = 27.0 bits (60), Expect = 3.8
Identities = 14/31 (45%), Positives = 16/31 (51%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
K KK K K KK K KK K KK+K +
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRT 31
Score = 27.0 bits (60), Expect = 5.0
Identities = 14/38 (36%), Positives = 18/38 (47%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
+ K K K KK K KK K KK+K + + F P
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRTSVRFFRP 38
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association with
pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 28.8 bits (65), Expect = 0.55
Identities = 14/36 (38%), Positives = 24/36 (66%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
+ + + ++ +N++KKKKKKK KKK K+ +KK
Sbjct: 26 QIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61
Score = 28.4 bits (64), Expect = 0.74
Identities = 15/37 (40%), Positives = 22/37 (59%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
E K + ++ ++KKKKKKK KKK K+ +KK
Sbjct: 25 EQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61
Score = 28.0 bits (63), Expect = 0.86
Identities = 16/44 (36%), Positives = 28/44 (63%), Gaps = 3/44 (6%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKK---KKKKKKKKKKTKK 173
++ K + +N+ KKKKKKK KKK K +KK++ +++ K+
Sbjct: 27 IDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKE 70
Score = 28.0 bits (63), Expect = 0.96
Identities = 15/40 (37%), Positives = 25/40 (62%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
GE+ E+ + K ++ ++ ++KKKKKKK KKK+K
Sbjct: 15 GLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54
Score = 27.3 bits (61), Expect = 1.9
Identities = 16/42 (38%), Positives = 29/42 (69%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
P + EGE+ +++ + +K ++ ++ ++KKKKKKK KKK K
Sbjct: 13 PNGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54
Score = 26.1 bits (58), Expect = 4.4
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
P L E +E + + +K ++ ++ ++KKKKKKK KKK
Sbjct: 13 PNGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKS 53
>gnl|CDD|234874 PRK00955, PRK00955, hypothetical protein; Provisional.
Length = 620
Score = 30.3 bits (69), Expect = 0.59
Identities = 15/34 (44%), Positives = 18/34 (52%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
L KK KKK+ KN KK+KKK+KK
Sbjct: 587 LIPPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620
Score = 29.1 bits (66), Expect = 1.4
Identities = 13/31 (41%), Positives = 16/31 (51%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
KK K K+ KK KK+KKK+KK
Sbjct: 590 PKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620
Score = 29.1 bits (66), Expect = 1.6
Identities = 11/32 (34%), Positives = 15/32 (46%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
+K K K+ K KK+KKK+KK
Sbjct: 589 PPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620
Score = 27.6 bits (62), Expect = 4.7
Identities = 13/33 (39%), Positives = 15/33 (45%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
KK K NKK KK+KKK+KK
Sbjct: 588 IPPKKPKKKSNKKNNNNNYSSNNSKKRKKKRKK 620
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 30.3 bits (68), Expect = 0.59
Identities = 14/40 (35%), Positives = 23/40 (57%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E KN+ + KKK+K++ KK K KK++K +K
Sbjct: 551 KTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEK 590
Score = 28.4 bits (63), Expect = 2.7
Identities = 13/38 (34%), Positives = 20/38 (52%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
KK++K+ K K KK++K K+K KK + F
Sbjct: 565 KKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSAF 602
Score = 27.3 bits (60), Expect = 5.9
Identities = 12/36 (33%), Positives = 23/36 (63%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
K +K+K + + K ++++KKK+K++ KK TK
Sbjct: 544 MKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTK 579
Score = 26.9 bits (59), Expect = 7.5
Identities = 14/55 (25%), Positives = 27/55 (49%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
+K K + +K ++++KKK+K++ KK K K ++ + G A S
Sbjct: 547 IEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSA 601
Score = 26.9 bits (59), Expect = 7.8
Identities = 12/37 (32%), Positives = 22/37 (59%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
KK K K+K + + K ++++KKK+K++ KK
Sbjct: 542 KKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVT 578
Score = 26.9 bits (59), Expect = 8.8
Identities = 15/61 (24%), Positives = 28/61 (45%)
Query: 125 LYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
V L E+ KK + K + + K ++++KKK+K++ KK + + K
Sbjct: 527 TAVSDAALLAPEEIYKKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKI 586
Query: 185 G 185
G
Sbjct: 587 G 587
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 29.7 bits (67), Expect = 0.61
Identities = 14/38 (36%), Positives = 20/38 (52%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E+KE K N K+K K KK KK + +KK + +
Sbjct: 108 EEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAE 145
Score = 28.2 bits (63), Expect = 2.2
Identities = 13/37 (35%), Positives = 18/37 (48%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K+EK+ N K K K KK KK + +KK +
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEAR 143
Score = 27.0 bits (60), Expect = 4.5
Identities = 14/39 (35%), Positives = 17/39 (43%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
A + E K K +K+ K KK K K KK KK
Sbjct: 93 AFTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKA 131
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the parasitophorous
vacuole membrane. Members have an initial hydrophobic,
Phe/Tyr-rich, stretch long enough to span the membrane,
a highly charged region rich in Lys, a second putative
transmembrane region and a second highly charged, low
complexity sequence region. Some members have up to 100
residues of additional C-terminal sequence. These genes
have been shown to be found in the sub-telomeric regions
of both Plasmodium falciparum and P. yoelii chromosomes.
Length = 82
Score = 28.4 bits (64), Expect = 0.64
Identities = 15/29 (51%), Positives = 16/29 (55%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
N KKK KK KK +KKKK KKK
Sbjct: 25 NNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 28.4 bits (64), Expect = 0.67
Identities = 14/29 (48%), Positives = 15/29 (51%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
K KK KK KK +KKKK KKK
Sbjct: 25 NNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 27.2 bits (61), Expect = 1.6
Identities = 16/29 (55%), Positives = 17/29 (58%)
Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
N N KKKK KK KK +KKKK KK
Sbjct: 24 NNNVVAKKKKLKKLKKIDDDLEKKKKNKK 52
Score = 27.2 bits (61), Expect = 1.9
Identities = 15/29 (51%), Positives = 16/29 (55%)
Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
N KKKK KK KK +KKKK KK
Sbjct: 25 NNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 26.8 bits (60), Expect = 2.8
Identities = 16/30 (53%), Positives = 17/30 (56%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
N N KKKK KK KK +KKKK K K
Sbjct: 24 NNNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 25.7 bits (57), Expect = 5.9
Identities = 13/26 (50%), Positives = 14/26 (53%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKK 164
KK K K KK +KKKK KKK
Sbjct: 28 VAKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 25.7 bits (57), Expect = 7.2
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKK 159
+ +KK+ KK K + +KKKK KKK
Sbjct: 27 VVAKKKKLKKLKKIDDDLEKKKKNKKK 53
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 30.2 bits (68), Expect = 0.65
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 147 NKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K KK K KKK KK KKK+ K+ + K
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDK 169
Score = 29.4 bits (66), Expect = 1.1
Identities = 13/37 (35%), Positives = 22/37 (59%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
G + + KK K K K KK KKK+ K+ K +++ ++
Sbjct: 141 GIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESES 177
Score = 29.4 bits (66), Expect = 1.3
Identities = 14/29 (48%), Positives = 17/29 (58%)
Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ K KK K KKK KK KKK+ K+ K
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKD 170
Score = 28.3 bits (63), Expect = 2.4
Identities = 14/44 (31%), Positives = 21/44 (47%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
E K K K KKK KK KKK+ K+ K +++++
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEES 186
Score = 27.5 bits (61), Expect = 5.1
Identities = 14/38 (36%), Positives = 20/38 (52%)
Query: 154 KKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
+ K KK K KKK KK K K+ SD + +S ++
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDE 180
Score = 26.7 bits (59), Expect = 8.2
Identities = 15/52 (28%), Positives = 22/52 (42%), Gaps = 11/52 (21%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKK-----------KKKKKKKKKTKKF 174
E + + ++ K+ +KKK KK KKK K+KK KF
Sbjct: 365 AAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKANAKF 416
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 30.4 bits (68), Expect = 0.68
Identities = 28/94 (29%), Positives = 46/94 (48%), Gaps = 7/94 (7%)
Query: 87 DIGNRTEALLQLPRGRLVFLE--KLLPLLSNQSEDCLY-LNLYVP----RPVALEGEKKE 139
D+ T L L ++ +LE +L L+ SE Y + L V + V E E+
Sbjct: 164 DMKRGTRITLHLKEDQMEYLEPRRLKELIKKHSEFIGYDIELMVEKTTEKEVTDEDEEDT 223
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
KK +++ + K ++ K+ + KKKK KK K+ K
Sbjct: 224 KKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTK 257
>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
Provisional.
Length = 231
Score = 30.0 bits (67), Expect = 0.68
Identities = 15/39 (38%), Positives = 19/39 (48%)
Query: 149 KKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
K K++ KKK K K +K K +K PP D K K
Sbjct: 84 KTPKRRTKKKAKADKPEKSPKAVEKLCPPDDRDDKNEEK 122
>gnl|CDD|146145 pfam03357, Snf7, Snf7. This family of proteins are involved in
protein sorting and transport from the endosome to the
vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
play an important role in the degradation of both lipids
and cellular proteins. In order to perform this
degradative function, vacuoles/lysosomes contain
numerous hydrolases which have been transported in the
form of inactive precursors via the biosynthetic pathway
and are proteolytically activated upon delivery to the
vacuole/lysosome. The delivery of transmembrane
proteins, such as activated cell surface receptors to
the lumen of the vacuole/lysosome, either for
degradation/downregulation, or in the case of
hydrolases, for proper localisation, requires the
formation of multivesicular bodies (MVBs). These late
endosomal structures are formed by invaginating and
budding of the limiting membrane into the lumen of the
compartment. During this process, a subset of the
endosomal membrane proteins is sorted into the forming
vesicles. Mature MVBs fuse with the vacuole/lysosome,
thereby releasing cargo containing vesicles into its
hydrolytic lumen for degradation. Endosomal proteins
that are not sorted into the intralumenal MVB vesicles
are either recycled back to the plasma membrane or Golgi
complex, or remain in the limiting membrane of the MVB
and are thereby transported to the limiting membrane of
the vacuole/lysosome as a consequence of fusion.
Therefore, the MVB sorting pathway plays a critical role
in the decision between recycling and degradation of
membrane proteins. A few archaeal sequences are also
present within this family.
Length = 169
Score = 29.5 bits (67), Expect = 0.69
Identities = 14/40 (35%), Positives = 20/40 (50%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E EKK K + + KK KK K K+KK+ +K
Sbjct: 16 KQESLEKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEK 55
Score = 26.4 bits (59), Expect = 7.2
Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 4/40 (10%)
Query: 136 EKKEKK-KNKNKNKKKKKKKKKKK---KKKKKKKKKKKKT 171
EKK KK + + K KK K K+KK+ +K+ +
Sbjct: 21 EKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEKQLDQL 60
>gnl|CDD|236172 PRK08173, PRK08173, DNA topoisomerase III; Validated.
Length = 862
Score = 30.4 bits (69), Expect = 0.70
Identities = 14/46 (30%), Positives = 16/46 (34%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
PR KK K K +K KK KK KK +K
Sbjct: 815 PREPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARK 860
Score = 29.2 bits (66), Expect = 1.3
Identities = 11/44 (25%), Positives = 16/44 (36%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
P+ A + + +K KK KK KK +KT
Sbjct: 818 PKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARKT 861
>gnl|CDD|218585 pfam05424, Duffy_binding, Duffy binding domain. This domain is
found in Plasmodium Duffy binding proteins. Plasmodium
vivax and Plasmodium knowlesi merozoites invade human
erythrocytes that express Duffy blood group surface
determinants. The Duffy receptor family is localised in
micronemes, an organelle found in all organisms of the
phylum Apicomplexa.
Length = 276
Score = 30.0 bits (68), Expect = 0.74
Identities = 16/37 (43%), Positives = 20/37 (54%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
KKK +K KKK KK KK+ KK K K K++
Sbjct: 240 DNKKKEWDKQKKKYKKYKKRNNKKNYSDIKDKYAKEY 276
>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020). This
family of fungal proteins is conserved towards the
C-terminus of HMG domains. The function is not known.
Length = 49
Score = 27.6 bits (61), Expect = 0.83
Identities = 15/41 (36%), Positives = 22/41 (53%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
E +KK ++ N +NK + + KKK KKK K+ K K
Sbjct: 3 ERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDA 43
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 29.9 bits (67), Expect = 0.85
Identities = 14/36 (38%), Positives = 20/36 (55%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
+ E+ E++KN+ + KK KK KK K KK K
Sbjct: 319 DSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354
Score = 29.2 bits (65), Expect = 1.4
Identities = 14/47 (29%), Positives = 24/47 (51%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+P +E ++ ++ + KN+++ KK KK KK K KK K
Sbjct: 308 IPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354
Score = 28.4 bits (63), Expect = 2.4
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
E +K + ++ K+KKKKKK K KKK KK K+
Sbjct: 228 ESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263
Score = 27.2 bits (60), Expect = 6.7
Identities = 15/41 (36%), Positives = 19/41 (46%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
+ + K+K KKKK K KKK KK K+ SD
Sbjct: 235 DGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSD 275
Score = 26.8 bits (59), Expect = 8.1
Identities = 15/40 (37%), Positives = 21/40 (52%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+G++ +K +K KKKKKK K KKK KK +
Sbjct: 224 DDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 29.6 bits (67), Expect = 0.85
Identities = 12/34 (35%), Positives = 17/34 (50%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
++E K KKK K K K K KK++K+
Sbjct: 206 LQRELMKKGKGKKKKIVKDKDGKVVYKWKKERKR 239
Score = 28.9 bits (65), Expect = 1.5
Identities = 14/38 (36%), Positives = 20/38 (52%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++ E ++ K K KKKK K K K K KK+ K+
Sbjct: 202 QRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKKERKR 239
Score = 26.6 bits (59), Expect = 9.3
Identities = 15/38 (39%), Positives = 19/38 (50%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+K E++ + KK K KKKK K K K K KK
Sbjct: 198 KKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKK 235
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 30.0 bits (67), Expect = 0.86
Identities = 14/38 (36%), Positives = 21/38 (55%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+ KNK KNK+K K KK + K+ K+ +K+K
Sbjct: 214 IKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSK 251
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 29.3 bits (66), Expect = 0.87
Identities = 15/43 (34%), Positives = 23/43 (53%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
P + + K+ KKK +K KK KKK K+K +++KK
Sbjct: 40 PEDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKK 82
Score = 28.2 bits (63), Expect = 1.9
Identities = 12/32 (37%), Positives = 17/32 (53%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
K+ KKK+ +K KK KKK K+ L +
Sbjct: 49 KQPKKKRPTTPRKPATTKKSKKKDKEKLTEEE 80
Score = 26.6 bits (59), Expect = 7.3
Identities = 12/27 (44%), Positives = 14/27 (51%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTK 172
K KKK+ +K KK KKK K K
Sbjct: 49 KQPKKKRPTTPRKPATTKKSKKKDKEK 75
Score = 26.3 bits (58), Expect = 8.5
Identities = 13/40 (32%), Positives = 18/40 (45%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
K+ K K +K KK KKK K+K + +K D
Sbjct: 48 PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDD 87
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 29.8 bits (67), Expect = 0.90
Identities = 13/38 (34%), Positives = 22/38 (57%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E + +KK K + KK+ +++ K K + KKK + KK
Sbjct: 137 EAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKK 174
Score = 28.6 bits (64), Expect = 1.9
Identities = 12/38 (31%), Positives = 23/38 (60%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ +EK+K + K K+ + K K + + +KK K++ KK
Sbjct: 113 QAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150
Score = 28.6 bits (64), Expect = 2.1
Identities = 14/53 (26%), Positives = 28/53 (52%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
A E + K + + + K K++ KK+ +++ K K + KKK + ++ KA
Sbjct: 129 AAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKA 181
Score = 28.3 bits (63), Expect = 2.9
Identities = 13/36 (36%), Positives = 18/36 (50%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
E E K K + K K + KKK + + K K + K K
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188
Score = 27.9 bits (62), Expect = 3.4
Identities = 13/38 (34%), Positives = 20/38 (52%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E++ K K + KKK + KKK + + K K + K K
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190
Score = 27.5 bits (61), Expect = 4.1
Identities = 13/45 (28%), Positives = 23/45 (51%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E +K+ +++ K K + KKK + KKK + + K K K
Sbjct: 142 KKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAK 186
Score = 27.1 bits (60), Expect = 5.5
Identities = 11/37 (29%), Positives = 23/37 (62%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
E ++K+ ++ K K + K K + + +KK K++ KK+
Sbjct: 115 EEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQ 151
Score = 27.1 bits (60), Expect = 6.2
Identities = 12/41 (29%), Positives = 25/41 (60%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
A + E+K+K+ + K K+ + K K + + +KK K++ K +
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQ 151
Score = 27.1 bits (60), Expect = 6.7
Identities = 12/38 (31%), Positives = 18/38 (47%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
E + K + K K + KKK + + K K + K K K
Sbjct: 153 EEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190
Score = 27.1 bits (60), Expect = 6.9
Identities = 12/42 (28%), Positives = 21/42 (50%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ + +E+ K K + KKK + KKK + + K K + K
Sbjct: 147 EAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188
Score = 26.7 bits (59), Expect = 7.9
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
E K K+ + K K + + +KK K++ KK+ +++ K K
Sbjct: 123 EAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAK 159
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 29.2 bits (66), Expect = 0.90
Identities = 10/29 (34%), Positives = 19/29 (65%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
+KE+KK +K +KK K +K+K +++
Sbjct: 96 AQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124
Score = 26.5 bits (59), Expect = 8.9
Identities = 10/30 (33%), Positives = 20/30 (66%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
K +K++KK K++KK K +K+K +++
Sbjct: 95 KAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 29.4 bits (66), Expect = 0.93
Identities = 12/39 (30%), Positives = 17/39 (43%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
ALE EK + K+K + K + K +K K K
Sbjct: 13 VRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIK 51
Score = 27.8 bits (62), Expect = 2.7
Identities = 14/40 (35%), Positives = 23/40 (57%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+ +K K K + ++ KK ++KK++KKK KKKK
Sbjct: 40 DKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVP 79
Score = 27.5 bits (61), Expect = 4.6
Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 7/48 (14%)
Query: 134 EGEKKEKKKNKNKNKKKKKKK-------KKKKKKKKKKKKKKKKTKKF 174
E K ++K K K K KK ++KK++KKK K KK
Sbjct: 31 EIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKV 78
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 29.3 bits (66), Expect = 0.94
Identities = 10/35 (28%), Positives = 30/35 (85%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
++ E KK K + +K++++++K++KK+++++++K+K
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219
Score = 27.4 bits (61), Expect = 5.1
Identities = 9/35 (25%), Positives = 29/35 (82%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+E + K K +++K++++++K++KK+++++++K K
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219
>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
homology (PH) domain. Fission yeast Scd1 is an exchange
factor for Cdc42 and an effector of Ras1, the homolog of
the human H-Ras. Scd2/Bem1 mediates Cdc42 activation by
binding to Scd1/Cdc24 and to Cdc42. Ras1 regulates
Scd1/Cdc24/Ral1, which is a putative guanine nucleotide
exchange factor for Cdc42, a member of the Rho family of
Ras-like proteins. Cdc42 then activates the Shk1/Orb2
protein kinase. Scd1 interacts with Klp5 and Klp6
kinesins to mediate cytokinesis. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 148
Score = 28.8 bits (65), Expect = 0.94
Identities = 14/26 (53%), Positives = 15/26 (57%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKK 163
KE KK K K+ K K KKKKKK
Sbjct: 61 KEVKKKKKKSSLKSKSSSSSKKKKKK 86
Score = 26.8 bits (60), Expect = 4.7
Identities = 14/25 (56%), Positives = 14/25 (56%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTK 172
KKKKKK K K KKKKKK
Sbjct: 64 KKKKKKSSLKSKSSSSSKKKKKKGP 88
>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein. The PcfJ-like protein family
includes the E. faecalis PcfJ protein, which is
functionally uncharacterized. It is found in bacteria
and viruses, and is typically between 159 and 170 amino
acids in length. There is a conserved HCV sequence
motif.
Length = 169
Score = 29.2 bits (66), Expect = 0.98
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
K + KK K K +K +K+ + +K ++ K K
Sbjct: 36 AKYQAKKEKEKAEKMRKRYPELEKIFRELKSK 67
Score = 27.3 bits (61), Expect = 3.5
Identities = 9/45 (20%), Positives = 21/45 (46%)
Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
P+ + ++ K K K+K +K +K+ + +K ++ K
Sbjct: 22 VCPKNLKAAHDRAVAKYQAKKEKEKAEKMRKRYPELEKIFRELKS 66
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 29.8 bits (67), Expect = 0.98
Identities = 10/41 (24%), Positives = 20/41 (48%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
G ++ ++ K++ +KK+K KKK +K+K
Sbjct: 330 EFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370
Score = 27.8 bits (62), Expect = 3.8
Identities = 11/40 (27%), Positives = 23/40 (57%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
+ G E ++++ + K++ +KK+K KKK +K+K
Sbjct: 331 FLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 28.8 bits (64), Expect = 0.98
Identities = 20/49 (40%), Positives = 31/49 (63%), Gaps = 1/49 (2%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
AL ++KE K +++ KKKK+K +KK K KKK++K K +P +P
Sbjct: 9 ALAKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGK-IPEHEP 56
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 29.5 bits (66), Expect = 1.0
Identities = 16/48 (33%), Positives = 18/48 (37%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
KE K K K K KK K KK K K TKK + +
Sbjct: 26 SKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESV 73
Score = 28.0 bits (62), Expect = 2.7
Identities = 14/51 (27%), Positives = 18/51 (35%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSRE 190
KK + K KK K+ K KK K TK + VK+
Sbjct: 10 KKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60
Score = 27.2 bits (60), Expect = 5.0
Identities = 14/38 (36%), Positives = 15/38 (39%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+K K K KK K K KK K KK K K
Sbjct: 21 KKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKT 58
Score = 26.8 bits (59), Expect = 7.0
Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 1/48 (2%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
V ++ EK E K ++ KK KK+ + + K K T K LP
Sbjct: 63 VTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSK-LPKK 109
Score = 26.8 bits (59), Expect = 7.9
Identities = 15/38 (39%), Positives = 17/38 (44%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
EKK K +K+ K KK K K KK K KK
Sbjct: 15 EKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKK 52
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex which
functions in messenger ribonucleoprotein metabolism and
plays a role in preventing the transcription-associated
genetic instability. Tho2, along with four other
subunits forms THO.
Length = 296
Score = 29.6 bits (67), Expect = 1.0
Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 124 NLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++YVP+ E KE ++ + + K+ KKKK+KK+ K KK
Sbjct: 21 DIYVPK----ELYDKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKK 66
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 28.9 bits (65), Expect = 1.1
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
+KK KN++ K +K+ K+K +KKK
Sbjct: 1 VRKKKKNRSSNYKVNRKRLKRKDRKKKINI 30
Score = 27.8 bits (62), Expect = 2.5
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
++KKKN++ N K +K+ K+K +KKK
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINI 30
Score = 27.8 bits (62), Expect = 2.9
Identities = 10/29 (34%), Positives = 15/29 (51%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
K KK + K +K+ K+K +KKK
Sbjct: 3 KKKKNRSSNYKVNRKRLKRKDRKKKINII 31
Score = 27.4 bits (61), Expect = 3.2
Identities = 11/33 (33%), Positives = 18/33 (54%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
+KKKK + K +K+ K+K + KK +P
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGNP 34
Score = 27.4 bits (61), Expect = 4.1
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
+ K KN+ K +K+ K+K +KKK
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINI 30
Score = 26.6 bits (59), Expect = 5.8
Identities = 10/30 (33%), Positives = 17/30 (56%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
+K+KK + K +K+ K+K +KKK
Sbjct: 1 VRKKKKNRSSNYKVNRKRLKRKDRKKKINI 30
Score = 26.2 bits (58), Expect = 9.7
Identities = 9/29 (31%), Positives = 15/29 (51%)
Query: 144 KNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+ K K + K +K+ K+K +KKK
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINI 30
>gnl|CDD|188624 cd09904, H3TH_XPG, H3TH domain of Xeroderma pigmentosum
complementation group G (XPG) nuclease, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease. The Xeroderma pigmentosum complementation
group G (XPG) nuclease plays a central role in
nucleotide excision repair (NER) in cleaving DNA bubble
structures or loops. XPG is a member of the
structure-specific, 5' nuclease family that catalyzes
hydrolysis of DNA duplex-containing nucleic acid
structures during DNA replication, repair, and
recombination. Members of this subgroup include the
H3TH (helix-3-turn-helix) domains of XPG and other
similar eukaryotic 5' nucleases. These nucleases contain
a PIN (PilT N terminus) domain with a helical arch/clamp
region/I domain (not included here) and inserted within
the PIN domain is an atypical helix-hairpin-helix-2
(HhH2)-like region. This atypical HhH2 region, the H3TH
domain, has an extended loop with at least three turns
between the first two helices, and only three of the
four helices appear to be conserved. Both the H3TH
domain and the helical arch/clamp region are involved in
DNA binding. Studies suggest that a glycine-rich loop
in the H3TH domain contacts the phosphate backbone of
the template strand in the downstream DNA duplex. These
nucleases have a carboxylate rich active site that is
involved in binding essential divalent metal ion
cofactors (Mg2+ or Mn2+) required for nuclease activity.
The first metal binding site is composed entirely of
Asp/Glu residues from the PIN domain, whereas, the
second metal binding site is composed generally of two
Asp residues from the PIN domain and one Asp residue
from the H3TH domain. Together with the helical arch and
network of amino acids interacting with metal binding
ions, the H3TH region defines a positively charged
active-site DNA-binding groove in structure-specific 5'
nucleases.
Length = 97
Score = 28.0 bits (63), Expect = 1.1
Identities = 10/37 (27%), Positives = 16/37 (43%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
+ +K + K++ K+K K LPP PS
Sbjct: 50 QPEKSEDSDNDKQEFKRKHKNYLKNLILPPGFPSPAV 86
>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
Length = 158
Score = 28.9 bits (65), Expect = 1.1
Identities = 10/44 (22%), Positives = 17/44 (38%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E+ N+ K ++ + K+ K K K KK K
Sbjct: 113 EDTIPNHLNEEVSNETKLTQESSSDESPKEVKLATKNKTKKHDK 156
>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease. The Slx4 protein is a
heteromeric structure-specific endonuclease found in
fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
substrates, particularly simple-Y, 5'-flap, or
replication fork structures by cleaving the strand
bearing the 5' non-homologous arm at the branch junction
and thus generating ligatable nicked products from
5'-flap or replication fork substrates.
Length = 627
Score = 29.6 bits (66), Expect = 1.2
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
K KKKK KK K +K+ K+K KK K F
Sbjct: 68 GKRVPKKKKIKKPKLRKRTKRKNKKIKSLTAF 99
Score = 28.4 bits (63), Expect = 2.8
Identities = 14/30 (46%), Positives = 19/30 (63%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
N + + KKKK KK K +K+ K+K KK K
Sbjct: 65 NVSGKRVPKKKKIKKPKLRKRTKRKNKKIK 94
Score = 26.9 bits (59), Expect = 9.7
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKKKKKK 167
K K KK KK K +K+ K+K KK K
Sbjct: 69 KRVPKKKKIKKPKLRKRTKRKNKKIK 94
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 28.9 bits (65), Expect = 1.2
Identities = 13/55 (23%), Positives = 24/55 (43%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
P A+ GE +K + +K+K + K++K K + T+ + S S
Sbjct: 86 SPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDPRGGTQDVVDKSQAS 140
Score = 27.8 bits (62), Expect = 2.6
Identities = 9/48 (18%), Positives = 19/48 (39%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
++KE ++ K+K + K++K K + + K D
Sbjct: 97 SADRKEAEQQGAAQKRKSCRDKERKSAKDPRGGTQDVVDKSQASLDYG 144
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 29.5 bits (66), Expect = 1.2
Identities = 9/36 (25%), Positives = 19/36 (52%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K+K K K N K+ K + + + ++ K++ +K
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEK 833
>gnl|CDD|235516 PRK05582, PRK05582, DNA topoisomerase I; Validated.
Length = 650
Score = 29.4 bits (67), Expect = 1.2
Identities = 16/42 (38%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKK-KKKKKKKKTKKFLPP 177
+ KK + KN++ K+ + KKK K K KKK +K PP
Sbjct: 210 GYKGKKIELKNEEDVKEILAELKKKDFKVSKVKKKERKRNPP 251
Score = 27.5 bits (62), Expect = 4.6
Identities = 17/43 (39%), Positives = 24/43 (55%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
G K +K + KN+ K+ + KKK K K KKK+ K+ PP
Sbjct: 210 GYKGKKIELKNEEDVKEILAELKKKDFKVSKVKKKERKRNPPP 252
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 29.3 bits (66), Expect = 1.2
Identities = 17/62 (27%), Positives = 30/62 (48%), Gaps = 1/62 (1%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
P + +E ++E+ N K K++ KK + K K+ K+ K P++ S K +K
Sbjct: 515 PDKIVIEMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILK-EEPTENSSKNILK 573
Query: 188 SR 189
R
Sbjct: 574 LR 575
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 28.0 bits (63), Expect = 1.2
Identities = 11/36 (30%), Positives = 26/36 (72%), Gaps = 1/36 (2%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
KE+K+ + K +++++ K+ K K+++ ++K +K KK
Sbjct: 1 KERKE-EEKAQREEELKRLKNLKREEIEEKLEKIKK 35
Score = 28.0 bits (63), Expect = 1.3
Identities = 10/33 (30%), Positives = 21/33 (63%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
K+E+K + + K+ K K+++ ++K +K KK
Sbjct: 3 RKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35
Score = 27.6 bits (62), Expect = 1.8
Identities = 11/34 (32%), Positives = 24/34 (70%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
E+KE++K + + + K+ K K+++ ++K +K KK
Sbjct: 2 ERKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35
Score = 27.2 bits (61), Expect = 2.4
Identities = 9/31 (29%), Positives = 20/31 (64%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
E EK ++++ + K K+++ ++K +K KK
Sbjct: 5 EEEKAQREEELKRLKNLKREEIEEKLEKIKK 35
Score = 25.7 bits (57), Expect = 7.6
Identities = 9/34 (26%), Positives = 23/34 (67%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
K+ K++ K + +++ K+ K K+++ ++K +K K
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEKLEKIK 34
>gnl|CDD|218071 pfam04409, DUF530, Protein of unknown function (DUF530). Family of
hypothetical archaeal proteins.
Length = 512
Score = 29.5 bits (66), Expect = 1.2
Identities = 22/80 (27%), Positives = 32/80 (40%), Gaps = 5/80 (6%)
Query: 96 LQLPRGRLVFLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKK 155
L + RL L +S+ E + L + E +K ++ KK
Sbjct: 433 LHIKINRLALGGVALYGISDWDEVEVAEILGID-----IEELLKKMAKIEPSEILLFKKV 487
Query: 156 KKKKKKKKKKKKKKKTKKFL 175
+KKK +K K K K KKFL
Sbjct: 488 EKKKFEKFGKPKSDKAKKFL 507
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 29.2 bits (66), Expect = 1.3
Identities = 11/32 (34%), Positives = 15/32 (46%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
+ K KKK +K + K K+K K KF
Sbjct: 192 DAAKGGSKKKGRKGGAARGKPNAKRKAKDAKF 223
Score = 28.5 bits (64), Expect = 2.3
Identities = 11/51 (21%), Positives = 18/51 (35%), Gaps = 8/51 (15%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKK--------KKKKKKKKTKK 173
++ K K K + + K K+K K KK+ K T +
Sbjct: 188 FLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDAKFGFGGKKRGSKSNTAE 238
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 28.8 bits (65), Expect = 1.3
Identities = 15/38 (39%), Positives = 20/38 (52%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
K E KK K K + KK K+ K +KK+KK K+
Sbjct: 130 KSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAGKEK 167
>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160. This
family consists of several special lobe-specific silk
protein SSP160 sequences which appear to be specific to
Chironomus (Midge) species.
Length = 758
Score = 29.4 bits (65), Expect = 1.3
Identities = 18/57 (31%), Positives = 30/57 (52%), Gaps = 5/57 (8%)
Query: 132 ALEGEKKEKKKNK-NKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
A GEKK K+K + K + K ++ + K KK+ + + KKF + K+GV+
Sbjct: 378 AKSGEKKVKRKARLEKMRAKCRRAVGNRMKGSMKKRVRSRAKKF----GEAAKSGVR 430
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 29.0 bits (65), Expect = 1.3
Identities = 16/38 (42%), Positives = 22/38 (57%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
R + KKE + + N++ K+KKKKK KKKK K
Sbjct: 67 ERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPK 104
Score = 28.3 bits (63), Expect = 1.9
Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 5/56 (8%)
Query: 132 ALEGEKKEKKKNK-----NKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
+G + KK+ + N+ K+KKKKK KKKK K T + P D
Sbjct: 68 RKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPKSLLLNFVTVEESKPDDEVH 123
>gnl|CDD|233294 TIGR01159, DRP1, density-regulated protein DRP1. This protein
family shows weak but suggestive similarity to
translation initiation factor SUI1 and its prokaryotic
homologs.
Length = 173
Score = 28.7 bits (64), Expect = 1.3
Identities = 17/64 (26%), Positives = 27/64 (42%), Gaps = 1/64 (1%)
Query: 121 LYLNLYVPRPVALEGEK-KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
LY LY E E + + K +K +K ++ ++ KKKKK +K +
Sbjct: 37 LYAKLYGTDSPTAEAEAVTIGEAQEEKGEKDLLGIQKAQEAREGGKKKKKLPQKVTIKRE 96
Query: 180 PSGK 183
P K
Sbjct: 97 PRTK 100
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 29.5 bits (66), Expect = 1.4
Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 14/58 (24%)
Query: 140 KKKNKNKNKKKKKKKKK--------------KKKKKKKKKKKKKKTKKFLPPSDPSGK 183
K +K K K +KK+ K+ +KK K K+ K K F P + S K
Sbjct: 429 KVVKVSKYKGNKSEKKRDINVLDTIFASPVSKELRKKVGKSKQTKLKNFKPVPNKSKK 486
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 29.0 bits (65), Expect = 1.4
Identities = 11/49 (22%), Positives = 25/49 (51%), Gaps = 5/49 (10%)
Query: 131 VALEGEKKEKKKNKN-----KNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
L ++ + K + K +KK ++ +++ KK +K ++ KK+ F
Sbjct: 10 SKLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKKSGIF 58
>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function. This domain
family is found in eukaryotes, and is typically between
96 and 116 amino acids in length.
Length = 106
Score = 28.0 bits (63), Expect = 1.4
Identities = 16/41 (39%), Positives = 18/41 (43%), Gaps = 1/41 (2%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKK-KKKKKKKKTKKFLPPS 178
EKK + K KK KKK KK KK F+ PS
Sbjct: 13 EKKIFDKIDYIDKLKKDPNIDKKKFKKIIFKKYGLIFILPS 53
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 28.9 bits (65), Expect = 1.6
Identities = 11/37 (29%), Positives = 23/37 (62%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K E +K + + + ++ KKK++ + KKKK+ ++ K
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKA 109
Score = 28.2 bits (63), Expect = 2.3
Identities = 10/40 (25%), Positives = 20/40 (50%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
P + + + KK++ + KKKK+ ++ K +KK
Sbjct: 74 PEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113
Score = 27.8 bits (62), Expect = 3.2
Identities = 16/70 (22%), Positives = 29/70 (41%), Gaps = 2/70 (2%)
Query: 124 NLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK--KKKKKKTKKFLPPSDPS 181
E EK+ + + KKK++ + KKKK+ + K +KK +++ S
Sbjct: 65 VKDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLS 124
Query: 182 GKAGVKSREQ 191
K E+
Sbjct: 125 SSLQTKLTEE 134
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 29.2 bits (65), Expect = 1.7
Identities = 14/53 (26%), Positives = 33/53 (62%), Gaps = 1/53 (1%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAG 185
+G+++ + K K +KK+++K+K+ +K+ + K ++KK T P +P+ +
Sbjct: 57 RKGQQQRESKPK-ISKKERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELP 108
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 28.9 bits (64), Expect = 1.7
Identities = 9/42 (21%), Positives = 26/42 (61%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ E+KE+ ++ + K+++ KK+K+++++ +K K+
Sbjct: 1026 IKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKE 1067
Score = 27.8 bits (61), Expect = 4.2
Identities = 10/41 (24%), Positives = 24/41 (58%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+E ++ K++ K +K+++++ +K K+ KK+ KK
Sbjct: 1035 MESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKK 1075
Score = 27.8 bits (61), Expect = 4.3
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
E++ KK K + ++ +K K+ KK+ KKK+
Sbjct: 1041 AKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 28.7 bits (64), Expect = 1.7
Identities = 9/35 (25%), Positives = 23/35 (65%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
++ K+ K K K++ + +K K+ ++KK+++K+
Sbjct: 156 DDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKR 190
Score = 27.5 bits (61), Expect = 4.1
Identities = 11/44 (25%), Positives = 26/44 (59%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
R ++ + E +K K K+K++ + +K K+ ++KK+++K
Sbjct: 146 ECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEK 189
Score = 27.5 bits (61), Expect = 4.2
Identities = 10/35 (28%), Positives = 26/35 (74%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
G++KEK++ ++ K K+ ++KK+++K++ + K+
Sbjct: 164 GKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRP 198
Score = 26.8 bits (59), Expect = 7.4
Identities = 12/42 (28%), Positives = 27/42 (64%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
AL +K + + K K+K+K++ + +K K+ ++KK++ K+
Sbjct: 149 ALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKR 190
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 27.2 bits (61), Expect = 1.7
Identities = 11/32 (34%), Positives = 16/32 (50%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
+ K + K +K +K+ KK K KKKK
Sbjct: 2 LRYKIEKKVAEHHRKLRKEAKKNPTWKSKKKK 33
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 28.4 bits (64), Expect = 1.8
Identities = 8/38 (21%), Positives = 24/38 (63%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E++E++K + +++ + K + ++ + K+K K+ +K
Sbjct: 123 EREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQK 160
Score = 27.3 bits (61), Expect = 4.4
Identities = 9/36 (25%), Positives = 22/36 (61%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
LE E++EK+ + + + + K + ++ + K+K K+
Sbjct: 122 LEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKE 157
>gnl|CDD|221550 pfam12366, Casc1, Cancer susceptibility candidate 1. This domain
family is found in eukaryotes, and is typically between
216 and 263 amino acids in length. Casc1 has many SNPs
associated with cancer susceptibility.
Length = 227
Score = 28.5 bits (64), Expect = 1.8
Identities = 9/40 (22%), Positives = 14/40 (35%), Gaps = 2/40 (5%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
K + + + K+ K K K + K P DP
Sbjct: 69 SKIAEEAEGARGYVTDIPKEYKAKPVKYLEIPK--PIKDP 106
Score = 27.7 bits (62), Expect = 3.5
Identities = 9/39 (23%), Positives = 13/39 (33%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
E+ + K+ K K K + K K LP
Sbjct: 73 EEAEGARGYVTDIPKEYKAKPVKYLEIPKPIKDPPELPE 111
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 28.9 bits (65), Expect = 2.0
Identities = 14/45 (31%), Positives = 23/45 (51%), Gaps = 6/45 (13%)
Query: 136 EKKEKKKNKNKNKK------KKKKKKKKKKKKKKKKKKKKKTKKF 174
E + K + KK ++ ++ K KKKK+KKK+ +KF
Sbjct: 397 ESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEKF 441
Score = 27.0 bits (60), Expect = 8.2
Identities = 11/34 (32%), Positives = 18/34 (52%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKK 161
A+E ++E + KKKK+KKK+ +K
Sbjct: 407 EGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
>gnl|CDD|216213 pfam00956, NAP, Nucleosome assembly protein (NAP). NAP proteins
are involved in moving histones into the nucleus,
nucleosome assembly and chromatin fluidity. They affect
the transcription of many genes.
Length = 236
Score = 28.5 bits (64), Expect = 2.0
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
K+ KN K KKK++ KK + + K
Sbjct: 152 KEGKNLTVKTVKKKQRNKKTGQTRTITK 179
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 27.7 bits (62), Expect = 2.0
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 144 KNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++K K KKK K +K +KK K++ + K+
Sbjct: 26 RSKKLLKAKKKLKSEKLEKKAKRQLRAEKR 55
Score = 27.3 bits (61), Expect = 3.0
Identities = 11/33 (33%), Positives = 20/33 (60%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
KK KK K +K +KK K++ + +K++ +K
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK 60
Score = 26.9 bits (60), Expect = 3.9
Identities = 15/51 (29%), Positives = 28/51 (54%)
Query: 130 PVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDP 180
P+ +K K K K K++K +KK K++ + +K++ +K + K LP
Sbjct: 22 PILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDLE 72
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 28.8 bits (65), Expect = 2.0
Identities = 16/62 (25%), Positives = 33/62 (53%), Gaps = 7/62 (11%)
Query: 111 PLLSNQSEDCLYLNLYVPRPVALE-------GEKKEKKKNKNKNKKKKKKKKKKKKKKKK 163
PL ++ E L+ + + + ++ +KKE K +K KK++KK+K K+++
Sbjct: 617 PLFLDRIEKTLFFSFDISELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREE 676
Query: 164 KK 165
K+
Sbjct: 677 KE 678
>gnl|CDD|176148 cd08459, PBP2_DntR_NahR_LinR_like, The C-terminal substrate
binding domain of LysR-type transcriptional regulators
that are involved in the catabolism of dinitrotoluene,
naphthalene and gamma-hexachlorohexane; contains the
type 2 periplasmic binding fold. This CD includes
LysR-like bacterial transcriptional regulators, DntR,
NahR, and LinR, which are involved in the degradation
of aromatic compounds. The transcription of the genes
encoding enzymes involved in such degradation is
regulated and expression of these enzymes is enhanced
by inducers, which are either an intermediate in the
metabolic pathway or compounds to be degraded. DntR
from Burkholderia species controls genes encoding
enzymes for oxidative degradation of the nitro-aromatic
compound 2,4-dinitrotoluene. The active form of DntR is
homotetrameric, consisting of a dimer of dimers. NahR
is a salicylate-dependent transcription activator of
the nah and sal operons for naphthalene degradation.
Salicylic acid is an intermediate of the oxidative
degradation of the aromatic ring in soil bacteria.
LinR positively regulates expression of the genes (linD
and linE) encoding enzymes for
gamma-hexachlorocyclohexane (a haloorganic insecticide)
degradation. Expression of linD and linE are induced by
their substrates, 2,5-dichlorohydroquinone (2,5-DCHQ)
and chlorohydroquinone (CHQ). The structural topology
of this substrate-binding domain is most similar to
that of the type 2 periplasmic binding proteins (PBP2),
which are responsible for the uptake of a variety of
substrates such as phosphate, sulfate, polysaccharides,
lysine/arginine/ornithine, and histidine. The PBP2 bind
their ligand in the cleft between these domains in a
manner resembling a Venus flytrap. After binding their
specific ligand with high affinity, they can interact
with a cognate membrane transport complex comprised of
two integral membrane domains and two cytoplasmically
located ATPase domains. This interaction triggers the
ligand translocation across the cytoplasmic membrane
energized by ATP hydrolysis.
Length = 201
Score = 28.3 bits (64), Expect = 2.3
Identities = 9/43 (20%), Positives = 17/43 (39%), Gaps = 2/43 (4%)
Query: 59 YMPPVTPSTWRAPRFADTYSAVCPQRLPDIGNR--TEALLQLP 99
Y+P + ++ F + Y + + P IG+ E L
Sbjct: 56 YLPDLGAGFFQQRLFRERYVCLVRKDHPRIGSTLTLEQFLAAR 98
>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
protein; Provisional.
Length = 438
Score = 28.6 bits (64), Expect = 2.3
Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 6/47 (12%)
Query: 143 NKNKNKKKKKKKKKKKK------KKKKKKKKKKKTKKFLPPSDPSGK 183
N K+K KKK+KKKK+ K+KKK+K LP P+ K
Sbjct: 9 NNQKDKNKKKEKKKKESPPPPHEIKRKKKRKGPDAASKLPKVTPNTK 55
>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 137 to 196
amino acids in length.
Length = 149
Score = 27.7 bits (62), Expect = 2.3
Identities = 11/39 (28%), Positives = 16/39 (41%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
P + +K+ K K K K K+KK K K+
Sbjct: 7 RLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAAS 45
Score = 26.5 bits (59), Expect = 5.7
Identities = 11/36 (30%), Positives = 18/36 (50%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
K+ +KK +K K K K+KK K K+ ++
Sbjct: 13 KRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASESG 48
Score = 26.5 bits (59), Expect = 6.5
Identities = 12/48 (25%), Positives = 16/48 (33%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKA 184
+K K+ +KK K +K K K K KK S
Sbjct: 1 MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASESG 48
Score = 26.1 bits (58), Expect = 7.6
Identities = 12/35 (34%), Positives = 16/35 (45%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
E K +K K +K K K K+KK K K+
Sbjct: 11 EPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAAS 45
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 28.3 bits (63), Expect = 2.3
Identities = 14/25 (56%), Positives = 21/25 (84%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKK 160
E+KEKKKN NK +K+ KKK++ K++
Sbjct: 95 EEKEKKKNLNKEEKEAKKKERAKRE 119
Score = 26.4 bits (58), Expect = 9.0
Identities = 12/25 (48%), Positives = 21/25 (84%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKK 162
+EK+K KN NK++K+ KKK++ K++
Sbjct: 95 EEKEKKKNLNKEEKEAKKKERAKRE 119
>gnl|CDD|219401 pfam07404, TEBP_beta, Telomere-binding protein beta subunit (TEBP
beta). This family consists of several telomere-binding
protein beta subunits which appear to be specific to the
family Oxytrichidae. Telomeres are specialised
protein-DNA complexes that compose the ends of
eukaryotic chromosomes. Telomeres protect chromosome
termini from degradation and recombination and act
together with telomerase to ensure complete genome
replication. TEBP beta forms a complex with TEBP alpha
and this complex is able to recognise and bind ssDNA to
form a sequence-specific, telomeric nucleoprotein
complex that caps the very 3' ends of chromosomes.
Length = 375
Score = 28.5 bits (63), Expect = 2.3
Identities = 17/52 (32%), Positives = 26/52 (50%), Gaps = 1/52 (1%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS-DPSGKAGVKSREQ 191
K NK +K+ +K + K KK +KK +P + PSGK K+ +Q
Sbjct: 283 KYTPNKPSSRKETPQKSQAGKSSAKKTTTGSKKAVPANPSPSGKKSTKTTDQ 334
>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain. This family
contains a C-terminal presumed domain in Sas10 which
hash been identified as a regulator of chromatin
silencing.
Length = 76
Score = 26.8 bits (60), Expect = 2.3
Identities = 11/34 (32%), Positives = 21/34 (61%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
+K KK +N KK+KK +K K+ K +++ ++
Sbjct: 21 RKRKKDRRNPRVKKRKKYEKAVKRLKSQRRVVRE 54
Score = 26.1 bits (58), Expect = 4.8
Identities = 10/34 (29%), Positives = 21/34 (61%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K+K +N + KK+KK +K K+ K +++ ++
Sbjct: 22 KRKKDRRNPRVKKRKKYEKAVKRLKSQRRVVREE 55
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 27.9 bits (63), Expect = 2.3
Identities = 11/53 (20%), Positives = 28/53 (52%), Gaps = 6/53 (11%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT------KKFLPPSD 179
LE ++E ++ K++ + + + +K+ ++++++ KK K LP D
Sbjct: 20 LEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAKKYAIEKFAKDLLPVLD 72
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 28.1 bits (63), Expect = 2.4
Identities = 13/38 (34%), Positives = 19/38 (50%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E+K K++ KKK K +K K+ +KK K K
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIV 193
Score = 27.7 bits (62), Expect = 3.6
Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 3/45 (6%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKK---KKKKKKKKKTKKF 174
LE ++KE+ K K K +K K+ +KK K K KKF
Sbjct: 155 LEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKKF 199
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 28.1 bits (63), Expect = 2.4
Identities = 7/35 (20%), Positives = 18/35 (51%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
+ K ++ + + K+++ + +KK+ KKK
Sbjct: 161 DAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195
Score = 26.5 bits (59), Expect = 6.7
Identities = 8/30 (26%), Positives = 16/30 (53%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKK 163
E +E + +++ + +KK+ KKKK
Sbjct: 167 EFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 28.5 bits (63), Expect = 2.4
Identities = 11/43 (25%), Positives = 22/43 (51%), Gaps = 7/43 (16%)
Query: 134 EGEKKEKKKNKNKNKKK-------KKKKKKKKKKKKKKKKKKK 169
+K+ KK + +K +++K +KK +K++K KK
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMD 147
Score = 27.7 bits (61), Expect = 5.1
Identities = 9/58 (15%), Positives = 21/58 (36%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVKSR 189
A++ E E K + K+ KK + + ++ ++ + K K+
Sbjct: 57 AVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTP 114
Score = 27.3 bits (60), Expect = 6.4
Identities = 10/40 (25%), Positives = 22/40 (55%), Gaps = 9/40 (22%)
Query: 143 NKNKNKKKKKKKKKK---------KKKKKKKKKKKKKTKK 173
++KK ++ ++K ++K +KK +K++K KK
Sbjct: 106 LDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKK 145
>gnl|CDD|202558 pfam03159, XRN_N, XRN 5'-3' exonuclease N-terminus. This family
aligns residues towards the N-terminus of several
proteins with multiple functions. The members of this
family all appear to possess 5'-3' exonuclease activity
EC:3.1.11.-. Thus, the aligned region may be necessary
for 5' to 3' exonuclease function. The family also
contains several Xrn1 and Xrn2 proteins. The 5'-3'
exoribonucleases Xrn1p and Xrn2p/Rat1p function in the
degradation and processing of several classes of RNA in
Saccharomyces cerevisiae. Xrn1p is the main enzyme
catalyzing cytoplasmic mRNA degradation in multiple
decay pathways, whereas Xrn2p/Rat1p functions in the
processing of rRNAs and small nucleolar RNAs (snoRNAs)
in the nucleus.
Length = 237
Score = 28.1 bits (63), Expect = 2.4
Identities = 8/33 (24%), Positives = 20/33 (60%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
A + ++KE + +N+ + + + K +K +K+K
Sbjct: 104 AKDAKEKEAEAEENREELETEGIKLPEKVEKEK 136
>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family. Members of this family are
short proteins that are rich in aspartate, glutamate,
lysine and arginine. Although the function of these
proteins is unknown, they are found to be ubiquitously
expressed.
Length = 38
Score = 25.6 bits (57), Expect = 2.5
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
G ++E+ + +N K KKKKK K + K K
Sbjct: 4 GNQREQARERNAKKAAKKKKKGAKSQLKAAAKAL 37
Score = 24.1 bits (53), Expect = 8.6
Identities = 10/30 (33%), Positives = 17/30 (56%)
Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
N++++ +++ KK KKKKK K K
Sbjct: 3 RGNQREQARERNAKKAAKKKKKGAKSQLKA 32
>gnl|CDD|240132 cd04862, PaeLigD_Pol_like, PaeLigD_Pol_like: Polymerase (Pol)
domain of bacterial LigD proteins similar to Pseudomonas
aeruginosa (Pae) LigD. The LigD Pol domain belongs to
the archaeal/eukaryal primase (AEP) superfamily. In
prokaryotes, LigD along with Ku is required for
non-homologous end joining (NHEJ)-mediated repair of DNA
double-strand breaks (DSB). NHEJ-mediated DNA DSB repair
is error-prone. PaeLigD is monomeric, containing an
N-terminal phosphoesterase module, a central polymerase
(Pol) domain, and a C-terminal ATP-dependent ligase
domain. It has been suggested that LigD Pol contributes
to NHEJ-mediated DNA DSB repair in vivo, by filling in
short 5'-overhangs with ribonucleotides; the filled in
termini would then be sealed by the associated LigD
ligase domain, resulting in short stretches of RNA
incorporated into the genomic DNA. The PaeLigD Pol
domain in vitro, in a manganese-dependent fashion,
catalyzes templated extensions of 5'-overhang duplex
DNA, and nontemplated single-nucleotide additions to
blunt-end duplex DNA; it preferentially adds single
ribonucleotides at blunt DNA ends. PaeLigD Pol adds a
correctly paired rNTP to the DNA primer termini more
rapidly than it does a correctly paired dNTP; it has
higher infidelity as an RNA polymerase than it does as a
DNA polymerase, which is in keeping with the mutagenic
property of NHEJ-mediated DNA DSB repair.
Length = 227
Score = 28.0 bits (63), Expect = 2.5
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 74 ADTYSAVCPQRLPDIGNRTEALLQLPRGR 102
A Y+AV P LP + R +L++ P G
Sbjct: 5 ARYYAAVAPWMLPHLAGRPLSLVRCPDGI 33
>gnl|CDD|133326 cd04126, Rab20, Rab GTPase family 20 (Rab20). Rab20 is one of
several Rab proteins that appear to be restricted in
expression to the apical domain of murine polarized
epithelial cells. It is expressed on the apical side of
polarized kidney tubule and intestinal epithelial cells,
and in non-polarized cells. It also localizes to
vesico-tubular structures below the apical brush border
of renal proximal tubule cells and in the apical region
of duodenal epithelial cells. Rab20 has also been shown
to colocalize with vacuolar H+-ATPases (V-ATPases) in
mouse kidney cells, suggesting a role in the regulation
of V-ATPase traffic in specific portions of the nephron.
It was also shown to be one of several proteins whose
expression is upregulated in human myelodysplastic
syndrome (MDS) patients. GTPase activating proteins
(GAPs) interact with GTP-bound Rab and accelerate the
hydrolysis of GTP to GDP. Guanine nucleotide exchange
factors (GEFs) interact with GDP-bound Rabs to promote
the formation of the GTP-bound state. Rabs are further
regulated by guanine nucleotide dissociation inhibitors
(GDIs), which facilitate Rab recycling by masking
C-terminal lipid binding and promoting cytosolic
localization. Most Rab GTPases contain a lipid
modification site at the C-terminus, with sequence
motifs CC, CXC, or CCX. Lipid binding is essential for
membrane attachment, a key feature of most Rab proteins.
Length = 220
Score = 27.9 bits (62), Expect = 2.6
Identities = 18/89 (20%), Positives = 33/89 (37%), Gaps = 11/89 (12%)
Query: 107 EKLLPLLSNQSEDCLY------LNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKK 160
++ L L +EDCL+ L+L +A + EK + +++++ + K
Sbjct: 88 DRFLGLTDTANEDCLFAVVGNKLDLTEEGALAGQ-EKDAGDRVSPEDQRQVTLEDAKAFY 146
Query: 161 KKKKKKKKKKTKKFLPPSDP----SGKAG 185
K+ K K S K G
Sbjct: 147 KRINKYKMLDEDLSPAAEKMCFETSAKTG 175
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
binding. This is the middle region of a family of TATA
element modulatory factor 1 proteins conserved in
eukaryotes that contains at its N-terminal section a
number of leucine zippers that could potentially form
coiled coil structures. The whole proteins bind to the
TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells.
Length = 74
Score = 26.5 bits (59), Expect = 2.6
Identities = 17/40 (42%), Positives = 24/40 (60%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
EGEK KK+ K+ N KK + K K+ +K+ + KKK K
Sbjct: 20 EGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKL 59
>gnl|CDD|223624 COG0550, TopA, Topoisomerase IA [DNA replication, recombination,
and repair].
Length = 570
Score = 28.3 bits (64), Expect = 2.6
Identities = 14/45 (31%), Positives = 24/45 (53%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
+EG+K+ + K+K++ ++ K K K K +KK KK P
Sbjct: 215 EIEGKKEGRLKDKDEAEEIVNKLKGKPAKVVSVEKKPKKRSPPPP 259
>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
Length = 132
Score = 27.3 bits (61), Expect = 2.6
Identities = 13/33 (39%), Positives = 19/33 (57%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
KK K K +K+ K ++KK + K+K K KK
Sbjct: 100 KKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132
Score = 26.9 bits (60), Expect = 3.8
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKK 163
+K +K+ KN+ KK + K+K K KKK
Sbjct: 106 RKARKELKNRRKKVRGKEKTKVSGKKK 132
Score = 26.5 bits (59), Expect = 6.1
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
KKK K +K+ K ++KK + K+K K KK
Sbjct: 101 KKKAGRKARKELKNRRKKVRGKEKTKVSGKK 131
>gnl|CDD|222658 pfam14296, O-ag_pol_Wzy, O-antigen polysaccharide polymerase Wzy.
This family includes O-antigen polysaccharide
polymerases. These enzymes link O-units via a glycosidic
linkage to form a long O-antigen. These enzymes vary in
specificity and sequence.
Length = 440
Score = 28.4 bits (64), Expect = 2.7
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
+ KK+KK+KKK + + ++ K +K L
Sbjct: 103 RKKKRKKEKKKNQLEYERLLKNLRKLSLIL 132
Score = 27.3 bits (61), Expect = 6.5
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKK 160
G KK++KK K KN+ + ++ K +K
Sbjct: 102 GRKKKRKKEKKKNQLEYERLLKNLRK 127
>gnl|CDD|163426 TIGR03714, secA2, accessory Sec system translocase SecA2. Members
of this protein family are homologous to SecA and part
of the accessory Sec system. This system, including both
five core proteins for export and a variable number of
proteins for glycosylation, operates in certain
Gram-positive pathogens for the maturation and delivery
of serine-rich glycoproteins such as the cell surface
glycoprotein GspB in Streptococcus gordonii [Protein
fate, Protein and peptide secretion and trafficking].
Length = 762
Score = 28.5 bits (64), Expect = 2.7
Identities = 15/53 (28%), Positives = 19/53 (35%), Gaps = 2/53 (3%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKK--KKKKKTKKFLPPSDPSGKAGVKSREQ 191
K+ KK KK K K K K++ K S G +REQ
Sbjct: 542 KRWSPSWLKKYYKKYSVKDSKLKPSALFKRRFRKIVEKAQRASEDKGESAREQ 594
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 28.2 bits (63), Expect = 2.7
Identities = 9/37 (24%), Positives = 12/37 (32%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
KK N ++ K K K + KKF
Sbjct: 42 DSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKF 78
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 27.5 bits (62), Expect = 2.8
Identities = 12/34 (35%), Positives = 22/34 (64%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
+K+KKK K K + ++KKKK +++K+ +K
Sbjct: 83 RKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 27.1 bits (61), Expect = 3.5
Identities = 12/33 (36%), Positives = 21/33 (63%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
+KK+KKK K ++KKKK +++K+ +K
Sbjct: 84 KKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 27.1 bits (61), Expect = 3.8
Identities = 12/35 (34%), Positives = 23/35 (65%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
K++KKK K K + + ++KKKK +++K+ +K
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 27.1 bits (61), Expect = 4.0
Identities = 14/26 (53%), Positives = 19/26 (73%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKK 173
K+KKKKKKKKK+ + ++KKKK
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVS 107
Score = 26.0 bits (58), Expect = 8.8
Identities = 13/34 (38%), Positives = 23/34 (67%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K+K K K KKK+ + ++KKKK +++K+ +K
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQK 115
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 28.0 bits (63), Expect = 2.8
Identities = 23/52 (44%), Positives = 26/52 (50%), Gaps = 5/52 (9%)
Query: 134 EGEK-KEKKKNKNKNKKKKKKKKKK----KKKKKKKKKKKKKTKKFLPPSDP 180
E EK K+ KK K KKKKKK+KK KK K K + K K DP
Sbjct: 146 EREKAKQIKKLYKKAKKKKKKRKKTYVVAKKGNKGKAGRPKGVKGKYKVVDP 197
>gnl|CDD|220121 pfam09135, Alb1, Alb1. Alb1 is a nuclear shuttling factor involved
in ribosome biogenesis.
Length = 112
Score = 27.1 bits (60), Expect = 2.9
Identities = 13/31 (41%), Positives = 23/31 (74%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
KN+ +KKK++KKK K+ +K++K++KK
Sbjct: 40 ASKNEGVSKKKQRKKKPKRLTRKQRKRQKKG 70
Score = 25.9 bits (57), Expect = 6.9
Identities = 12/32 (37%), Positives = 22/32 (68%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+KN+ KKK++KKK K+ +K++K++K
Sbjct: 39 AASKNEGVSKKKQRKKKPKRLTRKQRKRQKKG 70
>gnl|CDD|177060 CHL00138, rps5, ribosomal protein S5; Validated.
Length = 143
Score = 27.6 bits (62), Expect = 2.9
Identities = 15/60 (25%), Positives = 21/60 (35%), Gaps = 12/60 (20%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKK------------KKKKKKKKKKTKKFLPPSDPSGKAGV 186
K NKKK +K K+ K K K KK + + + +G GV
Sbjct: 2 LFLLKKMYNKKKNRKSNIKENKWEERVIQIKRVSKVVKGGKKLSFRAIVVIGNENGWVGV 61
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 27.7 bits (62), Expect = 2.9
Identities = 8/33 (24%), Positives = 13/33 (39%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
K K K ++KK + +K +KK
Sbjct: 97 AISALKQKIKRDSAERKKAASSGSRAIEKLRKK 129
>gnl|CDD|237803 PRK14724, PRK14724, DNA topoisomerase III; Provisional.
Length = 987
Score = 28.4 bits (63), Expect = 3.0
Identities = 14/43 (32%), Positives = 15/43 (34%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
K K K KK K K +K KKK L PS
Sbjct: 878 KAAKPAKKAAAKKVAAKTAAAKTPRKAAKKKAAPPAAGLKPSA 920
>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 158 and 187 amino
acids in length. This family includes the PcfB protein.
Length = 137
Score = 27.2 bits (61), Expect = 3.1
Identities = 17/38 (44%), Positives = 25/38 (65%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
L+ E+ EK + K KK K+++ KKK++K K K KKK
Sbjct: 100 LKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 26.1 bits (58), Expect = 3.1
Identities = 10/41 (24%), Positives = 27/41 (65%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
L + K K K K K +++K++++++ +K++++K++ K
Sbjct: 13 MLNADLLAKLKAKKKELKAEEEKREEEEEARKREERKEREK 53
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 27.3 bits (61), Expect = 3.2
Identities = 11/44 (25%), Positives = 28/44 (63%), Gaps = 5/44 (11%)
Query: 136 EKKEKKKNKNKNKKKKKK-----KKKKKKKKKKKKKKKKKTKKF 174
EK+ +K+ K + ++K+K+ +K++++K+ ++ +K K K
Sbjct: 20 EKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPL 63
Score = 27.3 bits (61), Expect = 3.7
Identities = 9/36 (25%), Positives = 27/36 (75%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
++K ++K+++K+++K++K++K++K+K+ K
Sbjct: 5 RDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGK 40
>gnl|CDD|219755 pfam08231, SYF2, SYF2 splicing factor. Proteins in this family are
involved in cell cycle progression and pre-mRNA
splicing.
Length = 157
Score = 27.3 bits (61), Expect = 3.3
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
+G E+K+ + + ++ +K KKKKKKK K
Sbjct: 3 QGIDFERKRALDWSAEESEKWDKKKKKKKNKDD 35
>gnl|CDD|218991 pfam06326, Vesiculo_matrix, Vesiculovirus matrix protein. This
family consists of several Vesiculovirus matrix
proteins. The matrix (M) protein of vesicular stomatitis
virus (VSV) expressed in the absence of other viral
components causes many of the cytopathic effects of VSV,
including an inhibition of host gene expression and the
induction of cell rounding. It has been shown that M
protein also induces apoptosis in the absence of other
viral components. It is thought that the activation of
apoptotic pathways causes the inhibition of host gene
expression and cell rounding by M protein.
Length = 229
Score = 27.8 bits (62), Expect = 3.3
Identities = 10/33 (30%), Positives = 12/33 (36%)
Query: 150 KKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSG 182
K K K K+ KK K KK + P
Sbjct: 4 FKLLKLKGKEVASKKSKGSKKLNMSMDWDAPPS 36
Score = 26.7 bits (59), Expect = 8.7
Identities = 11/33 (33%), Positives = 13/33 (39%)
Query: 149 KKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
K K K K+ KK K KK + PS
Sbjct: 4 FKLLKLKGKEVASKKSKGSKKLNMSMDWDAPPS 36
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 26.8 bits (60), Expect = 3.3
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTK 172
K++++K KKK KK + ++ KKK
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKVF 93
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 28.0 bits (62), Expect = 3.3
Identities = 12/43 (27%), Positives = 17/43 (39%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ EKK + + K +KKK K K K K +K
Sbjct: 213 AKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERK 255
Score = 27.2 bits (60), Expect = 5.8
Identities = 11/38 (28%), Positives = 17/38 (44%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
E + K K + K + ++K +KKK K K K
Sbjct: 210 EAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAA 247
Score = 26.8 bits (59), Expect = 8.5
Identities = 13/42 (30%), Positives = 18/42 (42%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
A E K E + KK + ++K +KKK K K K
Sbjct: 203 AAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADK 244
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap is
involved in recombination. Rap (recombination adept with
plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 27.7 bits (62), Expect = 3.4
Identities = 9/34 (26%), Positives = 21/34 (61%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K + +KK+K + + ++++ K +K+K KT+
Sbjct: 33 ALKREKAQEKKRKAEAQAERRELKARKEKLKTRS 66
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 26.8 bits (59), Expect = 3.5
Identities = 15/36 (41%), Positives = 18/36 (50%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K +K KK +K K K KKK KKKKK +
Sbjct: 1 MAPKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEA 36
Score = 26.0 bits (57), Expect = 7.1
Identities = 15/31 (48%), Positives = 18/31 (58%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
KEKK K K K KKK KKKKK++ +
Sbjct: 7 LSKEKKAEKAKAGTAKDKKKWSKKKKKEEAR 37
>gnl|CDD|217011 pfam02388, FemAB, FemAB family. The femAB operon codes for two
nearly identical approximately 50-kDa proteins involved
in the formation of the Staphylococcal pentaglycine
interpeptide bridge in peptidoglycan. These proteins are
also considered as a factor influencing the level of
methicillin resistance.
Length = 408
Score = 28.0 bits (63), Expect = 3.5
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
K E+K KN N KKK K + +++ +K+ + K+ +
Sbjct: 259 AKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELI 298
>gnl|CDD|226659 COG4196, COG4196, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 808
Score = 28.0 bits (62), Expect = 3.7
Identities = 28/96 (29%), Positives = 40/96 (41%), Gaps = 18/96 (18%)
Query: 53 PLGSLRYMPP-----VTPSTWRAPRFADTYSAVCPQRLPDIGNRTEALLQLPRGRL--VF 105
PL S+ ++PP PS R P A+ ++A P RT AL+ PRG +F
Sbjct: 260 PLDSISWVPPRASFDADPSAPRGPLPAEPHTAAAVVENPGTAPRT-ALVAEPRGGRLCIF 318
Query: 106 ------LEKLLPLLSNQSEDCLYLNLYVPRPVALEG 135
LE L L++ + PV +EG
Sbjct: 319 LPPTEALEHFLELVAAVEAAATTIGC----PVVIEG 350
>gnl|CDD|184516 PRK14115, gpmA, phosphoglyceromutase; Provisional.
Length = 247
Score = 27.5 bits (62), Expect = 3.8
Identities = 12/47 (25%), Positives = 17/47 (36%), Gaps = 11/47 (23%)
Query: 49 YATPPLGSLRYMPPVTPSTWRAPRFADTYSAVCPQRLPDIGNRTEAL 95
Y PP P + R P Y+ + + LP TE+L
Sbjct: 116 YDVPP-------PALEKDDERYPGHDPRYAKLPEEELP----LTESL 151
>gnl|CDD|217754 pfam03836, RasGAP_C, RasGAP C-terminus.
Length = 139
Score = 27.2 bits (61), Expect = 3.8
Identities = 13/31 (41%), Positives = 16/31 (51%)
Query: 146 KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
+KKKK K+K KK +KK K KF
Sbjct: 86 STLQKKKKFSKQKFSVKKLQKKHGKKPKFGS 116
Score = 26.4 bits (59), Expect = 6.8
Identities = 13/29 (44%), Positives = 16/29 (55%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
K+KK +K K KK +KK KK K K
Sbjct: 90 KKKKFSKQKFSVKKLQKKHGKKPKFGSYK 118
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 27.5 bits (62), Expect = 3.8
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKF 174
K+ ++ KK +K +K KKKKK+ K F
Sbjct: 185 KQTEELIKKLLEKPEKPKKKKKRRKGF 211
Score = 27.1 bits (61), Expect = 5.7
Identities = 14/36 (38%), Positives = 21/36 (58%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
EK N + ++ KK +K +K KKKKK++K K
Sbjct: 178 IEKGLNVKQTEELIKKLLEKPEKPKKKKKRRKGFSK 213
Score = 26.3 bits (59), Expect = 8.9
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
KK +K +K KKKKK++K K + +
Sbjct: 192 KKLLEKPEKPKKKKKRRKGFSKDVRIAV 219
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 27.7 bits (61), Expect = 4.0
Identities = 18/39 (46%), Positives = 26/39 (66%), Gaps = 1/39 (2%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKK-KKKKKKKTKK 173
+KK KK K N KK+++ + KKKKK+ K+KK +KK
Sbjct: 553 QKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKKLDSKK 591
Score = 27.0 bits (59), Expect = 7.7
Identities = 14/39 (35%), Positives = 24/39 (61%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
E + +K NK+KNKK+K +++++KK K K+ K
Sbjct: 517 ETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKK 555
Score = 27.0 bits (59), Expect = 8.9
Identities = 13/43 (30%), Positives = 23/43 (53%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
LE E + K ++ K K K KK+K +++++KK K +
Sbjct: 506 LELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIM 548
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 26.8 bits (60), Expect = 4.0
Identities = 18/47 (38%), Positives = 28/47 (59%), Gaps = 4/47 (8%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKK----KKKKKKKKKKKTKK 173
V + G K +K N K++KKKKKKK + +KK+KK+ + +K
Sbjct: 21 VDISGVKVPEKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRK 67
Score = 26.0 bits (58), Expect = 6.0
Identities = 13/35 (37%), Positives = 24/35 (68%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
K+EKKK K K++ + +KK+KK+ +++K +K
Sbjct: 38 KREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72
>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
Length = 112
Score = 26.9 bits (58), Expect = 4.1
Identities = 16/45 (35%), Positives = 27/45 (60%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
EKK K +N+ +KK +K + + +KK +K + + +K SD S K
Sbjct: 66 EKKPEKPENEDEKKPEKPENEDEKKPEKPENEDEKEFEDSDASDK 110
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 27.7 bits (62), Expect = 4.1
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 152 KKKKKKKKKKKKKKKKKKKT 171
K+K++ KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLKL 375
Score = 27.7 bits (62), Expect = 4.1
Identities = 12/28 (42%), Positives = 16/28 (57%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKK 154
RP+AL ++K + K K KK KK K
Sbjct: 347 FERPLALSPKRKREGDKKQKKKKSKKLK 374
Score = 27.7 bits (62), Expect = 4.2
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 151 KKKKKKKKKKKKKKKKKKKKT 171
K+K++ KK+KKKK KK K T
Sbjct: 356 KRKREGDKKQKKKKSKKLKLT 376
Score = 27.3 bits (61), Expect = 4.7
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
++ + K+K++ KK+KKKK KK K
Sbjct: 343 PPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374
Score = 27.3 bits (61), Expect = 5.2
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 150 KKKKKKKKKKKKKKKKKKK 168
K+K++ KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374
Score = 26.6 bits (59), Expect = 8.8
Identities = 10/17 (58%), Positives = 15/17 (88%)
Query: 157 KKKKKKKKKKKKKKTKK 173
K+K++ KK+KKKK+KK
Sbjct: 356 KRKREGDKKQKKKKSKK 372
>gnl|CDD|240272 PTZ00109, PTZ00109, DNA gyrase subunit b; Provisional.
Length = 903
Score = 27.9 bits (62), Expect = 4.1
Identities = 11/37 (29%), Positives = 18/37 (48%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
K + KN K+ KK K K K+ K+K+ + +
Sbjct: 208 SDKSEDKNDTKSSKKGKSSHVKGPKEAKEKESSQMYE 244
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 27.7 bits (61), Expect = 4.2
Identities = 10/38 (26%), Positives = 20/38 (52%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K+ KKK + ++ K+ KKK +++ ++ KK
Sbjct: 644 IKRIKKKAYRRIRRDKRLKKKMPEEENTQENHLGSEKK 681
Score = 27.7 bits (61), Expect = 4.4
Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 7/60 (11%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKK----KKKKKKKTKKFLPPSDPSGKAGVKSREQ 191
+ +KK N K K K+K +K KK KKFL P ++REQ
Sbjct: 771 KVIKKKSNTVKKVVGGIKYSKRKDFNLSHVIINEKVNKKDKKFLADEVP---FPFETREQ 827
>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII. Eukaryotic DNA topoisomerase
II, GyrB, ParE.
Length = 594
Score = 27.9 bits (63), Expect = 4.2
Identities = 15/59 (25%), Positives = 22/59 (37%), Gaps = 4/59 (6%)
Query: 136 EKKEKKKNKNK--NKKKKKKKKKKKKKKKKKKKKKKKTKKF-LPPS-DPSGKAGVKSRE 190
+E +K K K + KK ++ +KKK LP + AG K E
Sbjct: 325 FLEENPVEASKIVEKVLLAAKARAAAKKARELTRKKKLSSISLPGKLADASSAGPKKCE 383
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 27.6 bits (62), Expect = 4.2
Identities = 12/50 (24%), Positives = 25/50 (50%), Gaps = 1/50 (2%)
Query: 135 GEKKEKKKNKNKNKKKK-KKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
G+ K K K++ K+ +K + + +++ ++ K +F PP GK
Sbjct: 270 GKAAASKAKKAKSRIKRLEKLEARLAEERPVEEGKPLAFRFPPPGKRLGK 319
>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain.
Length = 372
Score = 27.6 bits (62), Expect = 4.4
Identities = 16/30 (53%), Positives = 17/30 (56%)
Query: 148 KKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
K KKKK KK +KK KKKKK L P
Sbjct: 51 SAKAKKKKSKKSEKKSSSKKKKKEISVLDP 80
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 27.3 bits (61), Expect = 4.4
Identities = 10/27 (37%), Positives = 12/27 (44%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKK 165
K + K KNKK K+ K K K
Sbjct: 205 PKWEGKTKNKKSLKEYKDLIKLLDSGK 231
>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
Members of the NSCC2 family have been sequenced from
various yeast, fungal and animals species including
Saccharomyces cerevisiae, Drosophila melanogaster and
Homo sapiens. These proteins are the Sec62 proteins,
believed to be associated with the Sec61 and Sec63
constituents of the general protein secretary systems of
yeast microsomes. They are also the non-selective cation
(NS) channels of the mammalian cytoplasmic membrane. The
yeast Sec62 protein has been shown to be essential for
cell growth. The mammalian NS channel proteins has been
implicated in platelet derived growth factor(PGDF)
dependent single channel current in fibroblasts. These
channels are essentially closed in serum deprived
tissue-culture cells and are specifically opened by
exposure to PDGF. These channels are reported to exhibit
equal selectivity for Na+, K+ and Cs+ with low
permeability to Ca2+, and no permeability to anions
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 232
Score = 27.6 bits (61), Expect = 4.4
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 142 KNKNKNKKKKKKKKKKKKKKKK 163
K+K KKK K KK KKK+ K+
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKR 229
Score = 26.8 bits (59), Expect = 6.4
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 143 NKNKNKKKKKKKKKKKKKKKKKK 165
K+K KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229
Score = 26.8 bits (59), Expect = 7.1
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 145 NKNKKKKKKKKKKKKKKKKKKKK 167
K+K KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229
Score = 26.4 bits (58), Expect = 8.4
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 147 NKKKKKKKKKKKKKKKKKKKKKK 169
K K KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229
>gnl|CDD|221856 pfam12922, Cnd1_N, non-SMC mitotic condensation complex subunit 1,
N-term. The three non-SMC (structural maintenance of
chromosomes) subunits of the mitotic condensation
complex are Cnd1-3. The whole complex is essential for
viability and the condensing of chromosomes in mitosis.
This is the conserved N-terminus of the subunit 1.
Length = 169
Score = 27.2 bits (61), Expect = 4.7
Identities = 13/36 (36%), Positives = 16/36 (44%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKK 166
ALE +K + K KKK+KK KK K
Sbjct: 56 EALEKAEKSAAAAPARAGKGKKKRKKAKKAVVASWK 91
>gnl|CDD|234298 TIGR03659, IsdE, heme ABC transporter, heme-binding protein isdE.
This family of ABC substrate-binding proteins is
observed primarily in close proximity with proteins
localized to the cell wall and bearing the NEAT (NEAr
Transporter, pfam05031) heme-binding domain. IsdE has
been shown to bind heme and is involved in the process
of scavenging heme for the purpose of obtaining iron.
Length = 289
Score = 27.2 bits (61), Expect = 4.8
Identities = 14/35 (40%), Positives = 21/35 (60%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
GEK +++ K K+ +K+ + KKK K KKK K
Sbjct: 135 GEKYGREEQAEKLVKEINEKEAEVKKKVKGKKKPK 169
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 26.7 bits (59), Expect = 4.9
Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 8/52 (15%)
Query: 133 LEGEKKEKKKNKNKNKKK--------KKKKKKKKKKKKKKKKKKKKTKKFLP 176
+E + K +NK + + +++ KK KKK++KKKKKKK K L
Sbjct: 73 MEVDNKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREKKKKKKKGAKNLA 124
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 27.0 bits (60), Expect = 4.9
Identities = 9/28 (32%), Positives = 14/28 (50%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKK 163
+E+K+ KK +K + KKKK
Sbjct: 169 AAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 26.2 bits (58), Expect = 9.4
Identities = 9/27 (33%), Positives = 12/27 (44%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKK 160
E+KE K +K + KKKK
Sbjct: 170 AEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|233309 TIGR01200, GLPGLI, GLPGLI family protein. This protein family was
first noted as a paralogous set in Porphyromonas
gingivalis, but it is more widely distributed among the
Bacteroidetes. The protein family is now renamed GLPGLI
after its best-conserved motif.
Length = 227
Score = 27.3 bits (61), Expect = 5.0
Identities = 11/34 (32%), Positives = 15/34 (44%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
K + K KK KK ++K KK K+K
Sbjct: 193 IKIELNPTPIIIIPKPKKGKKISREKFKKIKEKY 226
Score = 26.6 bits (59), Expect = 7.3
Identities = 11/32 (34%), Positives = 14/32 (43%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
E K KK KK ++K KK K+K
Sbjct: 196 ELNPTPIIIIPKPKKGKKISREKFKKIKEKYY 227
Score = 26.6 bits (59), Expect = 8.4
Identities = 12/35 (34%), Positives = 15/35 (42%)
Query: 140 KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
K N K KK KK ++K KK K K +
Sbjct: 193 IKIELNPTPIIIIPKPKKGKKISREKFKKIKEKYY 227
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 27.4 bits (62), Expect = 5.1
Identities = 7/38 (18%), Positives = 28/38 (73%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
EK+E++ K + + ++K+++ +KK+++ ++ +++ ++
Sbjct: 106 EKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
Score = 26.7 bits (60), Expect = 8.4
Identities = 8/39 (20%), Positives = 27/39 (69%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+ E EK++ + + K+K+ ++K+++ +KK+++ ++ +
Sbjct: 101 KLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 27.6 bits (62), Expect = 5.1
Identities = 10/39 (25%), Positives = 19/39 (48%), Gaps = 5/39 (12%)
Query: 142 KNKNKNKKKKKKKKKK-----KKKKKKKKKKKKKTKKFL 175
+ K K KK +K+ + KK+K+ + +K + L
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRHVL 122
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 27.3 bits (61), Expect = 5.2
Identities = 14/33 (42%), Positives = 20/33 (60%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
KK+K K + +NKK KKK + K + +KK K
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155
Score = 26.5 bits (59), Expect = 7.1
Identities = 13/34 (38%), Positives = 19/34 (55%)
Query: 135 GEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
G+KK+ KK + K KKK + K + +KK K
Sbjct: 122 GKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 27.0 bits (60), Expect = 5.3
Identities = 6/40 (15%), Positives = 30/40 (75%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
E+K+ ++ + + ++++++++KK ++K++ ++K+++ +
Sbjct: 12 EEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEE 51
>gnl|CDD|236108 PRK07824, PRK07824, O-succinylbenzoic acid--CoA ligase;
Provisional.
Length = 358
Score = 27.3 bits (61), Expect = 5.4
Identities = 11/32 (34%), Positives = 13/32 (40%)
Query: 41 VEAYLGVPYATPPLGSLRYMPPVTPSTWRAPR 72
V A +G P L +LR T APR
Sbjct: 299 VAAVVGDGGPAPTLEALRAHVARTLDRTAAPR 330
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 27.4 bits (61), Expect = 5.4
Identities = 13/40 (32%), Positives = 16/40 (40%)
Query: 126 YVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKK 165
YV E ++ K N N KK KKK K+ K
Sbjct: 846 YVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGK 885
>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
[Function unknown].
Length = 827
Score = 27.4 bits (61), Expect = 5.5
Identities = 12/43 (27%), Positives = 19/43 (44%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPS 178
++ K+++K K +K + K KK K K L PS
Sbjct: 251 QELYKERDKILKKLEKALNELLNKLLKKSHLKTNKKSGKLTPS 293
>gnl|CDD|188606 TIGR04091, LTA_dltB, D-alanyl-lipoteichoic acid biosynthesis
protein DltB. Members of this protein family are DltB,
part of a four-gene operon for D-alanyl-lipoteichoic
acid biosynthesis that is present in the vast majority
of low-GC Gram-positive organisms. This protein may be
involved in transport of D-alanine across the plasma
membrane [Cell envelope, Biosynthesis and degradation of
murein sacculus and peptidoglycan].
Length = 380
Score = 27.1 bits (61), Expect = 5.5
Identities = 10/16 (62%), Positives = 11/16 (68%)
Query: 157 KKKKKKKKKKKKKKTK 172
+K KK KK KK K TK
Sbjct: 343 RKSKKYKKNKKNKWTK 358
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 27.4 bits (60), Expect = 5.5
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+E+++N+ + + KK K K KK+ KK KK
Sbjct: 442 RERQENRKERQLKKTKDDSDVDLKDKKESINKKNKK 477
Score = 27.1 bits (59), Expect = 7.3
Identities = 14/48 (29%), Positives = 21/48 (43%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPS 181
E++E +K + K K K KK+ KK KK K + + S
Sbjct: 441 LRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERTAAS 488
>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
Length = 763
Score = 27.5 bits (61), Expect = 5.8
Identities = 13/34 (38%), Positives = 16/34 (47%)
Query: 131 VALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
LEG + +K + K KK KK KKKK
Sbjct: 225 AVLEGGELIEKPQAEAENEAGKSDKKDKKSKKKK 258
>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
Length = 746
Score = 27.5 bits (61), Expect = 6.0
Identities = 7/34 (20%), Positives = 19/34 (55%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
E+ +K K+ NK+ + +++ ++K+K+
Sbjct: 703 EEAREKALKDANKRAPIVAATQAREEVREKRKQT 736
Score = 27.1 bits (60), Expect = 6.4
Identities = 8/38 (21%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 139 EKKKNKN-KNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
E+ + K K+ K+ + +++ ++K+K+T F+
Sbjct: 703 EEAREKALKDANKRAPIVAATQAREEVREKRKQTSGFM 740
>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein.
Length = 105
Score = 26.2 bits (58), Expect = 6.0
Identities = 13/32 (40%), Positives = 14/32 (43%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
KK + K K K KKKK K K K K
Sbjct: 4 KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35
>gnl|CDD|215343 PLN02638, PLN02638, cellulose synthase A (UDP-forming), catalytic
subunit.
Length = 1079
Score = 27.2 bits (60), Expect = 6.1
Identities = 15/43 (34%), Positives = 19/43 (44%), Gaps = 10/43 (23%)
Query: 148 KKKKKK----------KKKKKKKKKKKKKKKKKTKKFLPPSDP 180
K K KK +KK K KK KKK+ K + P+ P
Sbjct: 646 KPKHKKPGFLSSLCGGSRKKSSKSSKKGSDKKKSGKHVDPTVP 688
>gnl|CDD|184210 PRK13651, PRK13651, cobalt transporter ATP-binding subunit;
Provisional.
Length = 305
Score = 27.0 bits (60), Expect = 6.1
Identities = 14/31 (45%), Positives = 21/31 (67%), Gaps = 3/31 (9%)
Query: 146 KNKKKKKKKKKKKKKKKK---KKKKKKKTKK 173
K++K KKK K+K+K +K +K + KK KK
Sbjct: 69 KDEKNKKKTKEKEKVLEKLVIQKTRFKKIKK 99
Score = 26.6 bits (59), Expect = 9.7
Identities = 15/37 (40%), Positives = 22/37 (59%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
K EK K K K K+K +K +K + KK KK K+ ++
Sbjct: 69 KDEKNKKKTKEKEKVLEKLVIQKTRFKKIKKIKEIRR 105
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 27.1 bits (60), Expect = 6.1
Identities = 14/48 (29%), Positives = 22/48 (45%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGK 183
+ K+N K KKK K+KK+KK+ ++ + SGK
Sbjct: 203 KAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGK 250
Score = 26.7 bits (59), Expect = 7.7
Identities = 13/41 (31%), Positives = 23/41 (56%)
Query: 139 EKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSD 179
KK N + KKK K+KK+KK+ + K+++++ D
Sbjct: 212 PKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRD 252
>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
PREFRAN. This family is the long protein of a novel
CRISPR subtype, PREFRAN, which is most common in
Prevotella and Francisella, although widely distributed.
The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
helicase Cas3 and endonuclease Cas3-HD.
Length = 1287
Score = 27.1 bits (60), Expect = 6.4
Identities = 12/34 (35%), Positives = 20/34 (58%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKK 169
E+ KK+ + + K KK+K ++K KK KK+
Sbjct: 378 EEYIKKQIEPRKGKSKKEKYEEKVKKVVKKQDYL 411
Score = 27.1 bits (60), Expect = 6.9
Identities = 13/35 (37%), Positives = 20/35 (57%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
KK K K KK+K ++K KK KK+ + +F+
Sbjct: 382 KKQIEPRKGKSKKEKYEEKVKKVVKKQDYLSIQFI 416
>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive
type) [DNA replication, recombination, and repair].
Length = 1444
Score = 27.3 bits (61), Expect = 6.5
Identities = 14/70 (20%), Positives = 30/70 (42%), Gaps = 1/70 (1%)
Query: 108 KLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKK 167
+ +++ SE+ + +E +E + + K K + K +K K +K +
Sbjct: 162 LIEFEVNDISEEQEFEKFEEAINEEVEKAAQEALEAEKKLKAESPKVEKPKPLFDGQKGR 221
Query: 168 K-KKTKKFLP 176
K K T++ P
Sbjct: 222 KIKSTEEIKP 231
>gnl|CDD|217935 pfam04158, Sof1, Sof1-like domain. Sof1 is essential for cell
growth and is a component of the nucleolar rRNA
processing machinery.
Length = 88
Score = 25.7 bits (57), Expect = 6.7
Identities = 11/45 (24%), Positives = 21/45 (46%)
Query: 127 VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
VP+ + + K + K K K++ ++K K K ++KK
Sbjct: 43 VPKAIKKAQKIKREMKEAKKRKEENRRKHSKPGSVPPKPERKKHV 87
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 26.4 bits (59), Expect = 6.7
Identities = 12/45 (26%), Positives = 28/45 (62%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
A + ++EKK K K ++ K+ K + ++ K + +K ++K +++ P
Sbjct: 65 AEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEKLEEYQP 109
>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2). Surfeit locus
protein 2 is part of a group of at least six sequence
unrelated genes (Surf-1 to Surf-6). The six Surfeit
genes have been classified as housekeeping genes, being
expressed in all tissue types tested and not containing
a TATA box in their promoter region. The exact function
of SURF2 is unknown.
Length = 244
Score = 26.8 bits (59), Expect = 6.8
Identities = 11/40 (27%), Positives = 20/40 (50%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ ++ E + + + K+ KK+ KK KK KKK +
Sbjct: 201 DEDEMEVESPELQQKRSKKQSGSLTKKFKKNHKKKGPFRF 240
>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
Length = 212
Score = 26.6 bits (59), Expect = 7.0
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
K+E+ N + ++ K K KK+KKK+ K+K K
Sbjct: 177 KEERAFNAYRTLRQAKLNAKFVGKKEKKKQAKEKKK 212
>gnl|CDD|182933 PRK11057, PRK11057, ATP-dependent DNA helicase RecQ; Provisional.
Length = 607
Score = 27.0 bits (60), Expect = 7.1
Identities = 17/53 (32%), Positives = 25/53 (47%), Gaps = 4/53 (7%)
Query: 107 EKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKK 159
E P+L + L L VPR VAL+ +K N ++K K +K +K
Sbjct: 496 EAARPVLRGEVS----LQLAVPRIVALKPRAMQKSFGGNYDRKLFAKLRKLRK 544
>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
modification].
Length = 429
Score = 26.9 bits (59), Expect = 7.3
Identities = 7/34 (20%), Positives = 14/34 (41%)
Query: 138 KEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKT 171
+ KN + K + ++ ++KKK T
Sbjct: 12 NNQNKNASVVNNKAEIAAMIDARRLEQKKKGGVT 45
>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
N-terminus. This is an N-terminal domain of DNA
polymerase III polC subunit A that is found only in
Firmicutes. DNA polymerase polC-type III enzyme
functions as the 'replicase' in low G + C Gram-positive
bacteria. Purine asymmetry is a characteristic of
organisms with a heterodimeric DNA polymerase III
alpha-subunit constituted by polC which probably plays a
direct role in the maintenance of strand-biased gene
distribution; since, among prokaryotic genomes, the
distribution of genes on the leading and lagging strands
of the replication fork is known to be biased. The
domain is associated with DNA_pol3_alpha pfam07733.
Length = 180
Score = 26.5 bits (59), Expect = 7.3
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKK 162
E++E K + + KKK+ +KKKK+K
Sbjct: 153 EEEEAKLAEEALEALKKKEAEKKKKEK 179
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 27.0 bits (60), Expect = 7.3
Identities = 10/39 (25%), Positives = 18/39 (46%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
+ + K K+K ++K K K K++ + K K F
Sbjct: 919 PENGTELVEKLAKEKSLREKNKDDWKAKEEVEAKLKAFF 957
>gnl|CDD|143303 cd05895, Ig_Pro_neuregulin-1, Immunoglobulin (Ig)-like domain found
in neuregulin (NRG)-1. Ig_Pro_neuregulin-1:
immunoglobulin (Ig)-like domain found in neuregulin
(NRG)-1. There are many NRG-1 isoforms which arise from
the alternative splicing of mRNA. NRG-1 belongs to the
neuregulin gene family, which is comprised of four
genes. This group represents NRG-1. NRGs are signaling
molecules, which participate in cell-cell interactions
in the nervous system, breast, and heart, and other
organ systems, and are implicated in the pathology of
diseases including schizophrenia, multiple sclerosis,
and breast cancer. The NRG-1 protein binds to and
activates the tyrosine kinases receptors ErbB3 and
ErbB4, initiating signaling cascades. NRG-1 has multiple
functions; for example, in the brain it regulates
various processes such as radial glia formation and
neuronal migration, dendritic development, and
expression of neurotransmitters receptors; in the
peripheral nervous system NRG-1 regulates processes such
as target cell differentiation, and Schwann cell
survival.
Length = 76
Score = 25.3 bits (55), Expect = 7.4
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 133 LEGEKKEKKKNKNKNKKKKKKKKKK 157
+ K+ KNK NK K +KKKK
Sbjct: 19 FKNGKEIGAKNKPDNKIKIRKKKKS 43
>gnl|CDD|130109 TIGR01037, pyrD_sub1_fam, dihydroorotate dehydrogenase (subfamily
1) family protein. This family includes subfamily 1
dihydroorotate dehydrogenases while excluding the
closely related subfamily 2 (TIGR01036). This family
also includes a number of uncharacterized proteins and
a domain of dihydropyrimidine dehydrogenase. The
uncharacterized proteins might all be dihydroorotate
dehydrogenase.
Length = 300
Score = 26.6 bits (59), Expect = 7.5
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 22 RVARTKYGPLRGILIQNPPVEAYL 45
+ T G L I +QNP VEA+L
Sbjct: 56 TIVETPCGMLNAIGLQNPGVEAFL 79
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 25.2 bits (56), Expect = 7.5
Identities = 12/42 (28%), Positives = 24/42 (57%)
Query: 129 RPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
R V G +E ++ + K +K+K+KK +K+++KK +
Sbjct: 21 RKVEKAGILRELRRREFYEKPSEKRKRKKAAARKRRRKKLAR 62
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 27.0 bits (60), Expect = 7.7
Identities = 10/42 (23%), Positives = 26/42 (61%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
LE E KE ++ K + ++ +K+ + + K+K ++K ++ ++
Sbjct: 225 KLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEE 266
Score = 26.6 bits (59), Expect = 9.4
Identities = 14/45 (31%), Positives = 28/45 (62%)
Query: 132 ALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
L E +E +K + + KK +K K++ ++++K KK+ +K +K L
Sbjct: 677 GLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALE 721
>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1. TT virus (TTV), isolated
initially from a Japanese patient with hepatitis of
unknown aetiology, has since been found to infect both
healthy and diseased individuals and numerous prevalence
studies have raised questions about its role in
unexplained hepatitis. ORF1 is a large 750 residue
protein. The N-terminal half of this protein corresponds
to the capsid protein.
Length = 525
Score = 26.8 bits (60), Expect = 7.8
Identities = 9/27 (33%), Positives = 10/27 (37%)
Query: 152 KKKKKKKKKKKKKKKKKKKTKKFLPPS 178
K K K + K KK K PP
Sbjct: 189 KHKILIPSLKTRPKGKKYVKIKIKPPK 215
>gnl|CDD|216016 pfam00602, Flu_PB1, Influenza RNA-dependent RNA polymerase subunit
PB1. Two GTP binding sites exist in this protein.
Length = 740
Score = 27.1 bits (60), Expect = 7.9
Identities = 13/37 (35%), Positives = 22/37 (59%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTK 172
+K E K K+KN K++ K+ K K+K+ K+K +
Sbjct: 176 DKPEVKFTKHKNNKRRYDDKETGKLLKRKQPSKEKDR 212
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 26.8 bits (59), Expect = 8.0
Identities = 16/42 (38%), Positives = 24/42 (57%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
E + E KK + + K + KKKKK KK+K + K KK++
Sbjct: 48 EKREFELKKQQFEQLKAELSKKKKKFKKEKVDVRVKVVKKWI 89
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 26.5 bits (58), Expect = 8.1
Identities = 11/31 (35%), Positives = 15/31 (48%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKK 164
E E E + NK K +K+K KK +K
Sbjct: 253 ESESDEDEDEDNKGKIRKRKTDDAKKSRKPH 283
>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
Length = 194
Score = 26.3 bits (58), Expect = 8.1
Identities = 7/34 (20%), Positives = 19/34 (55%)
Query: 137 KKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKK 170
K+E K+ +++ + KK+ K+ K++ ++
Sbjct: 3 KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQE 36
>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A. This protein is encoded
by an open reading frame in plasmid borne DNA repeats of
Borrelia species. This protein is known as ORF-A. The
function of this putative protein is unknown.
Length = 285
Score = 26.6 bits (59), Expect = 8.4
Identities = 15/39 (38%), Positives = 19/39 (48%)
Query: 136 EKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
+++ KKKN +K K KK KK K KK K K
Sbjct: 144 KEESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLL 182
Score = 26.6 bits (59), Expect = 9.9
Identities = 16/34 (47%), Positives = 19/34 (55%)
Query: 141 KKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
KN NKK K++ KKK K +K K KK KK
Sbjct: 134 NKNNVINKKIKEESKKKNSKNIEKLKIKKYFKKC 167
>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
Length = 405
Score = 26.7 bits (59), Expect = 8.7
Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 7/56 (12%)
Query: 121 LYLNLY--VPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
L LN Y R +AL E K KK+K KK+K+ K++ K K +
Sbjct: 331 LKLNPYAKTARRMALLAEAARVKA-----KKEKLAKKRKQVSKEEAAAIKAAGKAW 381
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 26.9 bits (59), Expect = 8.8
Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 7/64 (10%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFLPPSDPSGKAGVK 187
P+P KE K+ K++ +K + + KK ++ K K K F K K
Sbjct: 261 PKPADTSS-PKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAF------DLKQESK 313
Query: 188 SREQ 191
+ E+
Sbjct: 314 ASEK 317
>gnl|CDD|223363 COG0286, HsdM, Type I restriction-modification system
methyltransferase subunit [Defense mechanisms].
Length = 489
Score = 26.6 bits (59), Expect = 8.8
Identities = 9/46 (19%), Positives = 19/46 (41%)
Query: 123 LNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKK 168
NL R VA E E+ + + + + +K+ ++ K+
Sbjct: 438 YNLNPGRYVAEEEEEDIDLDELLQELSEIDAELAELEKELEEILKE 483
>gnl|CDD|236078 PRK07726, PRK07726, DNA topoisomerase III; Provisional.
Length = 658
Score = 26.7 bits (60), Expect = 8.9
Identities = 11/59 (18%), Positives = 15/59 (25%), Gaps = 13/59 (22%)
Query: 128 PRPVALEGEKKEKKKNKNKNKKKKKKK-------------KKKKKKKKKKKKKKKKTKK 173
+ K + K K K K K+K KK + KK
Sbjct: 600 SEKYKFDNLTGPKCPDCGKPMLKVKGKNGKMLVCQDRECGKRKNVSKKTNARCPNCKKK 658
>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
Length = 631
Score = 26.6 bits (60), Expect = 8.9
Identities = 12/41 (29%), Positives = 19/41 (46%), Gaps = 6/41 (14%)
Query: 143 NKNKN------KKKKKKKKKKKKKKKKKKKKKKKTKKFLPP 177
N+N +K K + + + KK K+KKK + LP
Sbjct: 363 NQNPELAEKLAEKAIKAAQARLRAAKKVKRKKKTSGPALPG 403
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 26.7 bits (58), Expect = 8.9
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 145 NKNKKKKKKKKKKKKKKKKKKKKKK 169
+ KKKKKKKKK K K++ K
Sbjct: 104 DTPSKKKKKKKKKGWFWAKSKQESK 128
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 26.7 bits (59), Expect = 9.0
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 134 EGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKK 173
+ + KK N+N K++ KK++ +K+ KK +K K
Sbjct: 413 KTKIFYHKKKINQNNKEQDIKKEELLEKEFVKKSEKIPKN 452
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 26.6 bits (59), Expect = 9.0
Identities = 24/97 (24%), Positives = 37/97 (38%), Gaps = 22/97 (22%)
Query: 93 EALLQLPRGRLVFLEKLLPLLSNQSEDCLYLNLYVPRPVALEGEKKEKKKNKNKNKKKKK 152
E LL+ + L L+KL+ L D L + + + N NKN K+++
Sbjct: 189 EGLLER-KNELPHLKKLIIL------DTLIKSKEININKEEKNNGSNVNNNGNKNNKEEQ 241
Query: 153 KKKK---------------KKKKKKKKKKKKKKTKKF 174
K K+K +K K K+K KK
Sbjct: 242 KGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKL 278
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 26.5 bits (59), Expect = 9.1
Identities = 7/25 (28%), Positives = 15/25 (60%)
Query: 147 NKKKKKKKKKKKKKKKKKKKKKKKT 171
+KKK++KK+++K+ K+
Sbjct: 175 PGHSSDSEKKKQRKKQRRKRSKELR 199
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 26.2 bits (58), Expect = 9.6
Identities = 11/55 (20%), Positives = 24/55 (43%)
Query: 121 LYLNLYVPRPVALEGEKKEKKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKFL 175
++ + V+ + E + KK+ K K +K+ K K+ K + K + +
Sbjct: 35 IFQESPQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKA 89
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 26.5 bits (59), Expect = 9.7
Identities = 14/50 (28%), Positives = 17/50 (34%), Gaps = 5/50 (10%)
Query: 130 PVALEGEKKE-----KKKNKNKNKKKKKKKKKKKKKKKKKKKKKKKTKKF 174
PV E K E K K K + K KK K + K +K
Sbjct: 564 PVPAEEAKVEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEAKKRKL 613
>gnl|CDD|214368 CHL00117, rpoC2, RNA polymerase beta'' subunit; Reviewed.
Length = 1364
Score = 26.8 bits (60), Expect = 9.8
Identities = 8/36 (22%), Positives = 15/36 (41%), Gaps = 6/36 (16%)
Query: 149 KKKKKKKKKKKK------KKKKKKKKKKTKKFLPPS 178
+KKKK+ + K + K + + +PP
Sbjct: 707 EKKKKRIELKIFSGDIHFPGETDKISRHSGILIPPG 742
>gnl|CDD|153294 cd07610, FCH_F-BAR, The Extended FES-CIP4 Homology (FCH) or F-BAR
(FCH and Bin/Amphiphysin/Rvs) domain, a dimerization
module that binds and bends membranes. F-BAR domains
are dimerization modules that bind and bend membranes
and are found in proteins involved in membrane dynamics
and actin reorganization. F-BAR domain containing
proteins, also known as Pombe Cdc15 homology (PCH)
family proteins, include Fes and Fer tyrosine kinases,
PACSINs/Syndapins, FCHO, PSTPIP, CIP4-like proteins and
srGAPs. Many members also contain an SH3 domain and play
roles in endocytosis. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
These tubules have diameters larger than those observed
with N-BARs. The F-BAR domains of some members such as
NOSTRIN and Rgd1 are important for the subcellular
localization of the protein.
Length = 191
Score = 26.1 bits (58), Expect = 9.8
Identities = 11/43 (25%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 136 EKKEKKKNKNKNKKKKKKKKK--KKKKKKKKKKKKKKTKKFLP 176
+++ +KK + +K KKK ++ K KK ++ +++ +K P
Sbjct: 95 KEQARKKELAEGEKLKKKLQELWAKLAKKADEEYREQVEKLNP 137
>gnl|CDD|215059 PLN00111, PLN00111, accumulation of photosystem one; Provisional.
Length = 399
Score = 26.5 bits (59), Expect = 9.9
Identities = 9/40 (22%), Positives = 20/40 (50%), Gaps = 6/40 (15%)
Query: 146 KNKKKKKKKK------KKKKKKKKKKKKKKKTKKFLPPSD 179
K++KK +++ ++KKK +K + ++ L P
Sbjct: 46 PYSKREKKPYPRPMKLLRREAREKKKLRKLQPERPLDPPK 85
>gnl|CDD|227818 COG5531, COG5531, SWIB-domain-containing proteins implicated in
chromatin remodeling [Chromatin structure and dynamics].
Length = 237
Score = 26.3 bits (58), Expect = 9.9
Identities = 10/44 (22%), Positives = 16/44 (36%), Gaps = 1/44 (2%)
Query: 134 EGEKKEKKKNKN-KNKKKKKKKKKKKKKKKKKKKKKKKTKKFLP 176
+ E +K +N + K+ KKKK K +K
Sbjct: 84 KEEDSSLPSSKETENGDTEGKETDKKKKSSTISKNSPSGEKVKL 127
>gnl|CDD|216850 pfam02020, W2, eIF4-gamma/eIF5/eIF2-epsilon. This domain of
unknown function is found at the C-terminus of several
translation initiation factors.
Length = 75
Score = 24.9 bits (55), Expect = 10.0
Identities = 8/23 (34%), Positives = 13/23 (56%)
Query: 153 KKKKKKKKKKKKKKKKKKTKKFL 175
+K KK + KK +K+ K F+
Sbjct: 47 YEKSKKVDGEGSKKVRKQAKPFV 69
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.134 0.399
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,965,771
Number of extensions: 958378
Number of successful extensions: 19548
Number of sequences better than 10.0: 1
Number of HSP's gapped: 11567
Number of HSP's successfully gapped: 2205
Length of query: 191
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 100
Effective length of database: 6,901,388
Effective search space: 690138800
Effective search space used: 690138800
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 56 (25.5 bits)