RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1374
(263 letters)
>gnl|CDD|214980 smart01024, BCS1_N, This domain is found at the N terminal of the
mitochondrial ATPase BSC1. It encodes the import and
intramitochondrial sorting for the protein.
Length = 170
Score = 202 bits (517), Expect = 3e-66
Identities = 80/150 (53%), Positives = 109/150 (72%), Gaps = 2/150 (1%)
Query: 82 RRHYMITLEIPCRDKSYHWLLHWITVRGA-KKTQHLSVETSFEKFDTGYVKTKYDFIPSI 140
RR +++TLE+P +D+SY WLL W++ + ++++HLSVETS+ + D G T + F+P
Sbjct: 22 RRRFLVTLEVPSKDESYAWLLEWLSQQPFRQRSRHLSVETSYTQHDNGKSSTSFSFVPGP 81
Query: 141 GTHLFSYQGNWIRVERSRESTTLDIQQGIPWETVTLTAFGRDRSLFFNILEEARTLALKQ 200
GTH F Y+G WI V R RE T D++ G P+ET+TLT GRDR +F +LEEAR LALK+
Sbjct: 82 GTHWFWYKGRWIWVTREREKTMADMRTGSPFETLTLTTLGRDRDVFKELLEEARELALKR 141
Query: 201 YEGKTVMYTALGSEWRPFGHPKRRRPLNSV 230
EGKTV+YTA G EWR F P+R+RPL+SV
Sbjct: 142 TEGKTVIYTADGPEWRRF-APRRKRPLSSV 170
>gnl|CDD|219997 pfam08740, BCS1_N, BCS1 N terminal. This domain is found at the N
terminal of the mitochondrial ATPase BSC1. It encodes
the import and intramitochondrial sorting for the
protein.
Length = 179
Score = 156 bits (397), Expect = 6e-48
Identities = 67/161 (41%), Positives = 96/161 (59%), Gaps = 15/161 (9%)
Query: 82 RRHYMITLEIPCRDKSYHWLLHWITVRG-AKKTQHLSVET----------SFEKFDTGYV 130
RR++ TLE+ D+SY+W+L W++ + A++++ L T S E
Sbjct: 22 RRYFTSTLEVRSDDESYNWVLEWLSQQPFAQRSRRLVASTSLNSRRDSHLSVETSSAQEK 81
Query: 131 KTKYDFIPSIGTHLFSYQGNWIRVERSRESTTLDIQQGIPWETVTLTAFGRDRSLFFNIL 190
K + F+P GTH F Y+G I ER RE+ + D+ P+ET+TLT GR R + +L
Sbjct: 82 KAPFRFVPGFGTHWFRYKGRLIWFERERETKSYDLS--TPFETLTLTTLGRSRQILKELL 139
Query: 191 EEARTLALKQYEGKTVMYTALGSE-WRPFGHPKRRRPLNSV 230
EEAR LALK+ EGKTV+Y A GSE WR G P+ +RPL++V
Sbjct: 140 EEARALALKKDEGKTVIYRADGSEGWRR-GAPRPKRPLSTV 179
>gnl|CDD|223541 COG0465, HflB, ATP-dependent Zn proteases [Posttranslational
modification, protein turnover, chaperones].
Length = 596
Score = 47.7 bits (114), Expect = 3e-06
Identities = 20/147 (13%), Positives = 36/147 (24%), Gaps = 12/147 (8%)
Query: 90 EIPCRDKSYHWLLHWITVRGAKKTQHLSVE--------TSFEKFDTGYVKTKYDFIPSIG 141
+P S + GA + + L + ++ D + G
Sbjct: 208 GVPFFSISGSDFVEMFVGVGASRVRDLFEQAKKNAPCIIFIDEIDAVGRQRG--AGLGGG 265
Query: 142 THLFSYQGNWIRVERSR--ESTTLDIQQGIPWETVTLTAFGRDRSLFFNILEEARTLALK 199
N + VE + + + V A R IL E + +
Sbjct: 266 NDEREQTLNQLLVEMDGFGGNEGVIVIAATNRPDVLDPALLRPGRFDRQILVELPDIKGR 325
Query: 200 QYEGKTVMYTALGSEWRPFGHPKRRRP 226
+ K +E R P
Sbjct: 326 EQILKVHAKNKPLAEDVDLKKIARGTP 352
Score = 30.0 bits (68), Expect = 1.5
Identities = 10/15 (66%), Positives = 12/15 (80%)
Query: 6 LTILDPALVRPGRVD 20
+LDPAL+RPGR D
Sbjct: 298 PDVLDPALLRPGRFD 312
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 38.8 bits (91), Expect = 0.002
Identities = 14/35 (40%), Positives = 23/35 (65%)
Query: 49 KFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
K+ KKK+ ++ +K+K KKK+KKK K+K +
Sbjct: 378 KYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412
Score = 36.9 bits (86), Expect = 0.007
Identities = 14/29 (48%), Positives = 21/29 (72%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
KK++ + +K+KK KK+KKK K +KK R
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 32.2 bits (74), Expect = 0.26
Identities = 14/28 (50%), Positives = 20/28 (71%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
+ +KK R+KKKK+ KK KK+KK +K
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 29.6 bits (67), Expect = 2.0
Identities = 13/23 (56%), Positives = 16/23 (69%)
Query: 52 DQAKKKKNFRRKKKKKTKKKKKK 74
Q +KKK R+KK KK KKK +K
Sbjct: 392 PQKRKKKKKRKKKGKKRKKKGRK 414
Score = 29.6 bits (67), Expect = 2.0
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 56 KKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
K+K + KKK+ +KK +K+K KKK ++
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKKRKK 403
Score = 29.2 bits (66), Expect = 2.4
Identities = 9/24 (37%), Positives = 18/24 (75%)
Query: 61 RRKKKKKTKKKKKKKKTKKKNRRH 84
++K+++K +K+KKKK +KK +
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKK 407
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 36.1 bits (84), Expect = 0.017
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 54 AKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
+KK R +KK+K K+K K KK + +
Sbjct: 392 SKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421
Score = 35.7 bits (83), Expect = 0.019
Identities = 10/33 (30%), Positives = 19/33 (57%)
Query: 51 ADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
+++ K + + K+ KK+K+K+K K K R
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416
Score = 35.3 bits (82), Expect = 0.023
Identities = 11/36 (30%), Positives = 18/36 (50%)
Query: 48 SKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
SK + +K + K+K K KK+ + K K R+
Sbjct: 392 SKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRK 427
Score = 35.3 bits (82), Expect = 0.027
Identities = 9/31 (29%), Positives = 19/31 (61%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNRRHY 85
KK +R +KK+ +K+K K K + ++ ++
Sbjct: 392 SKKVLAKRAEKKEKEKEKPKVKKRHRDTKNI 422
Score = 34.5 bits (80), Expect = 0.052
Identities = 14/56 (25%), Positives = 28/56 (50%)
Query: 29 SRHQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
R+ E + R ++ +K + K K ++ K+ +KK+K+K+ K +RH
Sbjct: 361 ERYIEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416
Score = 34.1 bits (79), Expect = 0.057
Identities = 12/36 (33%), Positives = 19/36 (52%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
A K K K+ +K +K+K+K K KK++R
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417
Score = 31.5 bits (72), Expect = 0.45
Identities = 8/35 (22%), Positives = 18/35 (51%)
Query: 46 NASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
K + + K+ +++K+K+ K KK+ + K
Sbjct: 387 KTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKN 421
Score = 31.5 bits (72), Expect = 0.50
Identities = 7/35 (20%), Positives = 16/35 (45%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
A + + K+K+ + KK+ + K K++
Sbjct: 397 AKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 30.3 bits (69), Expect = 1.00
Identities = 9/35 (25%), Positives = 17/35 (48%)
Query: 48 SKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
+K A++ +K+K + KK+ K K+ K
Sbjct: 397 AKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 28.4 bits (64), Expect = 4.3
Identities = 7/29 (24%), Positives = 16/29 (55%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
+ +K+K +K+ + TK K++K +
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPSGTS 432
Score = 27.6 bits (62), Expect = 7.3
Identities = 4/29 (13%), Positives = 12/29 (41%)
Query: 54 AKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
K+K +++ + K++K +
Sbjct: 406 EKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 35.3 bits (82), Expect = 0.022
Identities = 14/22 (63%), Positives = 16/22 (72%)
Query: 1 MSTQALTILDPALVRPGRVDLK 22
M+T ILDPAL+RPGR D K
Sbjct: 295 MATNRPDILDPALLRPGRFDRK 316
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 35.0 bits (81), Expect = 0.022
Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 3/47 (6%)
Query: 34 EQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
E++ K+ ++D A + KKK N ++KKK KKK KKK KK
Sbjct: 154 ERLKKK---ELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKK 197
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 34.5 bits (80), Expect = 0.029
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
A + QAKK++ K+K +++K K K +K
Sbjct: 112 AQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKK 146
Score = 32.6 bits (75), Expect = 0.12
Identities = 8/36 (22%), Positives = 20/36 (55%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
++ A+Q KK+ +K+K ++++K K + +
Sbjct: 111 QAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKK 146
Score = 31.0 bits (71), Expect = 0.38
Identities = 12/34 (35%), Positives = 20/34 (58%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
A K +K+K RR++K K K +KK+K + +
Sbjct: 119 AKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152
Score = 30.3 bits (69), Expect = 0.68
Identities = 7/37 (18%), Positives = 20/37 (54%)
Query: 48 SKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
+K QA++ + +K++ +K+K ++ +K +
Sbjct: 106 AKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKA 142
Score = 27.6 bits (62), Expect = 5.5
Identities = 10/43 (23%), Positives = 19/43 (44%)
Query: 38 KRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
+R Q ++ K + R+ K K +KK+K + K +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155
Score = 26.8 bits (60), Expect = 9.5
Identities = 5/38 (13%), Positives = 17/38 (44%)
Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
+ A + ++++ + K +K +K + +K +
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160
>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
protein; Provisional.
Length = 438
Score = 35.1 bits (81), Expect = 0.032
Identities = 13/22 (59%), Positives = 17/22 (77%)
Query: 1 MSTQALTILDPALVRPGRVDLK 22
M+T + LDPAL+RPGR+D K
Sbjct: 327 MATNRIESLDPALIRPGRIDRK 348
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 34.7 bits (80), Expect = 0.043
Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 8/68 (11%)
Query: 18 RVDL--KEYIGFCSRHQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKK 75
R+D E G R +LE+ ++ K K K R KK++ + ++KKK
Sbjct: 329 RIDAFSGEPDGISLREELEKRIEKL------KEKPPKPPTKAKPERDKKERPGRYRRKKK 382
Query: 76 KTKKKNRR 83
+ K K+ R
Sbjct: 383 EKKAKSER 390
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 33.6 bits (77), Expect = 0.057
Identities = 17/49 (34%), Positives = 22/49 (44%), Gaps = 6/49 (12%)
Query: 41 YQDVDNASKFADQA------KKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
QD +D KK K + + K+ KKKKK+KK KKK
Sbjct: 122 TQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHS 170
Score = 30.6 bits (69), Expect = 0.60
Identities = 16/55 (29%), Positives = 26/55 (47%)
Query: 30 RHQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
+H+ + K ++ KKK K++K KKK+KKKK K+ + H
Sbjct: 119 KHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEH 173
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 32.7 bits (75), Expect = 0.078
Identities = 17/32 (53%), Positives = 22/32 (68%)
Query: 49 KFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
K Q K+K ++K+KKK KKKKKKK +KK
Sbjct: 99 KIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKA 130
Score = 32.4 bits (74), Expect = 0.090
Identities = 16/31 (51%), Positives = 21/31 (67%)
Query: 50 FADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
++AK+KK ++KKKKK KK KK KKK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 29.7 bits (67), Expect = 0.87
Identities = 16/33 (48%), Positives = 20/33 (60%)
Query: 49 KFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
K + K K+ ++KKKKK KKKK KK KK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 26.6 bits (59), Expect = 9.6
Identities = 14/30 (46%), Positives = 19/30 (63%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKK 76
K + +KKK ++KKKK +KK KKKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 31.9 bits (73), Expect = 0.15
Identities = 19/42 (45%), Positives = 25/42 (59%), Gaps = 2/42 (4%)
Query: 51 ADQAK--KKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITLE 90
D+ K KK+ R+KKK+K KKKKK KK KK + + E
Sbjct: 78 KDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSE 119
Score = 27.7 bits (62), Expect = 4.2
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
++ K+K + KK+ ++KKK+KK KKK
Sbjct: 73 EEKKRKDEEKTAKKRAKRQKKKQKKKKKKKA 103
>gnl|CDD|240423 PTZ00454, PTZ00454, 26S protease regulatory subunit 6B-like
protein; Provisional.
Length = 398
Score = 32.8 bits (75), Expect = 0.17
Identities = 13/22 (59%), Positives = 16/22 (72%)
Query: 1 MSTQALTILDPALVRPGRVDLK 22
M+T LDPAL+RPGR+D K
Sbjct: 289 MATNRADTLDPALLRPGRLDRK 310
>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
structure and biogenesis].
Length = 67
Score = 30.4 bits (69), Expect = 0.17
Identities = 14/55 (25%), Positives = 31/55 (56%), Gaps = 4/55 (7%)
Query: 33 LEQMYKRFYQDVDNASKFAD----QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
L++ +RF + V+ + + +K + +RK+KK +K+K K+ +K+ +R
Sbjct: 12 LDKALRRFKRKVEKEGILREMKEREFYEKPSEKRKRKKAAARKRKFKRLRKEQQR 66
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 32.5 bits (75), Expect = 0.17
Identities = 10/13 (76%), Positives = 12/13 (92%)
Query: 8 ILDPALVRPGRVD 20
ILDPA++RPGR D
Sbjct: 282 ILDPAILRPGRFD 294
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 30.9 bits (70), Expect = 0.18
Identities = 13/27 (48%), Positives = 15/27 (55%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKN 81
K KK +KKKKK KKK K K+
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTE 39
Score = 27.0 bits (60), Expect = 4.2
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 63 KKKKKTKKKKKKKKTKKKNRRHYMIT 88
K KK KKKKKKK KK + ++T
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVT 38
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 32.3 bits (74), Expect = 0.20
Identities = 19/58 (32%), Positives = 28/58 (48%), Gaps = 5/58 (8%)
Query: 33 LEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITLE 90
LE MYK+ + + A+ +K KK ++KKK KK K KK T K+ +
Sbjct: 237 LEDMYKKAHAAIR-----ANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKA 289
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 31.3 bits (72), Expect = 0.23
Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 1/51 (1%)
Query: 33 LEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
L + YK + + A+ + K KK +K K K +KKK KK+ +
Sbjct: 47 LNEEYKDLKESL-EAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 29.4 bits (67), Expect = 1.2
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
K +K + K K KKK KK++ K +
Sbjct: 71 WEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 30.0 bits (68), Expect = 0.24
Identities = 14/31 (45%), Positives = 16/31 (51%), Gaps = 3/31 (9%)
Query: 51 ADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
AD KKKK +KKKKK + K KK
Sbjct: 20 ADLVKKKK---KKKKKKAEDTAATAKAKKAT 47
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region
covers both the Brf homology II and III regions. This
region is involved in binding TATA binding protein.
Length = 95
Score = 29.9 bits (68), Expect = 0.32
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
+++K ++K + KKKKK K K +R
Sbjct: 32 QEEKELKQKADEGNNSGKKKKKRKAKKKR 60
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 31.8 bits (72), Expect = 0.35
Identities = 13/30 (43%), Positives = 13/30 (43%)
Query: 54 AKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
K KK K K K KK KKK NR
Sbjct: 148 KKPKKGQSPKVPKAPKPKKPKKKGSVSNRS 177
>gnl|CDD|182683 PRK10733, hflB, ATP-dependent metalloprotease; Reviewed.
Length = 644
Score = 31.9 bits (72), Expect = 0.38
Identities = 15/37 (40%), Positives = 21/37 (56%)
Query: 2 STQALTILDPALVRPGRVDLKEYIGFCSRHQLEQMYK 38
+T +LDPAL+RPGR D + +G EQ+ K
Sbjct: 296 ATNRPDVLDPALLRPGRFDRQVVVGLPDVRGREQILK 332
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 31.6 bits (71), Expect = 0.39
Identities = 14/34 (41%), Positives = 23/34 (67%)
Query: 51 ADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
A++ KK ++KK++K K++KKKKK +R H
Sbjct: 271 AEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCH 304
Score = 30.0 bits (67), Expect = 1.3
Identities = 14/37 (37%), Positives = 23/37 (62%), Gaps = 3/37 (8%)
Query: 51 ADQAKKKKNFRRKKK---KKTKKKKKKKKTKKKNRRH 84
+DQ K + RR ++ K ++KKK++K K+ RRH
Sbjct: 78 SDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRH 114
Score = 28.9 bits (64), Expect = 3.4
Identities = 14/32 (43%), Positives = 23/32 (71%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
+A++ K + KKKK +K+K++KK KKK+ H
Sbjct: 270 EAEETKKSPKHKKKKQRKEKEEKKKKKKHHHH 301
Score = 28.5 bits (63), Expect = 4.6
Identities = 7/40 (17%), Positives = 22/40 (55%)
Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
D K ++ + ++ + K++K K++K+K+ ++ +
Sbjct: 79 DQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLG 118
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 30.3 bits (69), Expect = 0.40
Identities = 8/30 (26%), Positives = 20/30 (66%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
+ K++ + K+ +K ++KK K++ K+K +
Sbjct: 101 ERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 29.5 bits (67), Expect = 0.87
Identities = 10/28 (35%), Positives = 21/28 (75%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
KK++ R ++++TK++ +K+K +KNR
Sbjct: 90 KKRRIALRLRRERTKERAEKEKRTRKNR 117
Score = 26.9 bits (60), Expect = 7.4
Identities = 9/31 (29%), Positives = 18/31 (58%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
A + + R K++ + +K+ +K + KK RR
Sbjct: 94 IALRLRRERTKERAEKEKRTRKNREKKFKRR 124
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 31.3 bits (72), Expect = 0.41
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
K+ K + + +KKK K++ K K K K R
Sbjct: 64 KELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 31.0 bits (71), Expect = 0.56
Identities = 8/27 (29%), Positives = 14/27 (51%)
Query: 57 KKNFRRKKKKKTKKKKKKKKTKKKNRR 83
KK + K + KK+K++ K K +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 30.2 bits (69), Expect = 1.2
Identities = 12/44 (27%), Positives = 23/44 (52%), Gaps = 1/44 (2%)
Query: 37 YKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
YK +++ A+ + K + +KKK+K + K K K+K +
Sbjct: 49 YKEMKEEL-KAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 31.7 bits (72), Expect = 0.45
Identities = 16/61 (26%), Positives = 20/61 (32%), Gaps = 16/61 (26%)
Query: 40 FYQDVDNASKFADQAKKKKNFRR----------------KKKKKTKKKKKKKKTKKKNRR 83
D + + K K N + K KK KKKKK KKK R+
Sbjct: 231 STDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRK 290
Query: 84 H 84
Sbjct: 291 D 291
Score = 30.9 bits (70), Expect = 0.64
Identities = 13/30 (43%), Positives = 19/30 (63%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKK 76
AS + D ++ K + KKKKK KKK++K
Sbjct: 263 ASDYYDVSEMVKFKKPKKKKKKKKKRRKDL 292
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 30.8 bits (70), Expect = 0.46
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 49 KFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
K + KK+K ++ KK+K + K KK KKK ++
Sbjct: 158 KKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 30.8 bits (70), Expect = 0.53
Identities = 13/35 (37%), Positives = 20/35 (57%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
A + +++ + KKKKK KK+KK+K KK
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKE 175
Score = 29.7 bits (67), Expect = 1.2
Identities = 15/41 (36%), Positives = 20/41 (48%), Gaps = 5/41 (12%)
Query: 45 DNASKFADQAKKKKNFRRKKKKK-----TKKKKKKKKTKKK 80
+ + + + KK + KK KK K KKKKK KKK
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 27.8 bits (62), Expect = 4.3
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
+KKK KK+KK KK KK+K + K +
Sbjct: 156 EKKKKKEVKKEKKEKKDKKEKMVEPKGSK 184
Score = 27.8 bits (62), Expect = 4.4
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
++KK ++KK+ K +KK+KK K +K
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEPK 181
Score = 27.8 bits (62), Expect = 4.4
Identities = 12/34 (35%), Positives = 20/34 (58%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
A ++ K+KK + KK+K +KK KK+K +
Sbjct: 147 AEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEP 180
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 28.7 bits (65), Expect = 0.49
Identities = 11/54 (20%), Positives = 31/54 (57%), Gaps = 6/54 (11%)
Query: 33 LEQMYKRFYQDVDNASKFADQAKKKKNF-----RRKKKKKTKKKKKKKKTKKKN 81
+++ +RF + V+ A + ++++ + +RK+KK +K+++KK ++
Sbjct: 12 IDKALRRFKRKVEKAGILR-ELRRREFYEKPSEKRKRKKAAARKRRRKKLAREE 64
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 31.3 bits (72), Expect = 0.49
Identities = 9/43 (20%), Positives = 17/43 (39%), Gaps = 1/43 (2%)
Query: 44 VDNASKFADQ-AKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHY 85
V+ A + + +K + K KK K +K ++ Y
Sbjct: 807 VEEALELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQY 849
Score = 30.1 bits (69), Expect = 1.2
Identities = 7/35 (20%), Positives = 15/35 (42%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
AD K ++ K ++K K++ +K+
Sbjct: 821 LRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKD 855
Score = 27.8 bits (63), Expect = 7.4
Identities = 8/28 (28%), Positives = 12/28 (42%)
Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTKK 79
D A KK + +K K + +K K
Sbjct: 831 DPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 27.4 bits (62), Expect = 9.3
Identities = 7/34 (20%), Positives = 12/34 (35%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
A A KK + +K ++ +K K
Sbjct: 825 ADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 31.6 bits (71), Expect = 0.49
Identities = 8/36 (22%), Positives = 17/36 (47%)
Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
D A K A++AKK + + + + + + K +
Sbjct: 1332 DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
Score = 29.0 bits (64), Expect = 3.7
Identities = 10/36 (27%), Positives = 17/36 (47%)
Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
+ A K AD AKKK +K + K + + + +
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE 1360
Score = 29.0 bits (64), Expect = 3.8
Identities = 13/39 (33%), Positives = 22/39 (56%)
Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
D A K A++AKKK + +KK ++ KK + K + +
Sbjct: 1318 DEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAA 1356
Score = 28.6 bits (63), Expect = 4.8
Identities = 8/33 (24%), Positives = 16/33 (48%)
Query: 48 SKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
+K A +A K + + + ++K + KKK
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373
Score = 28.2 bits (62), Expect = 5.9
Identities = 17/37 (45%), Positives = 23/37 (62%), Gaps = 3/37 (8%)
Query: 47 ASKFADQAKKKKNFRRKK---KKKTKKKKKKKKTKKK 80
A K AD+AKKK ++K KKK ++ KK + KKK
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKK 1452
Score = 28.2 bits (62), Expect = 6.0
Identities = 15/37 (40%), Positives = 21/37 (56%), Gaps = 4/37 (10%)
Query: 48 SKFADQAKKKKNFRRKK----KKKTKKKKKKKKTKKK 80
+K AD+AKKK +KK KKK ++ KK + K
Sbjct: 1314 AKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350
Score = 27.8 bits (61), Expect = 6.9
Identities = 12/38 (31%), Positives = 23/38 (60%)
Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
D A K A++AKK + ++K ++ K + KKK ++ +
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484
Score = 27.4 bits (60), Expect = 9.3
Identities = 9/35 (25%), Positives = 18/35 (51%)
Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKK 79
A+K +A + ++K + +KKK++ KK
Sbjct: 1345 AEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
>gnl|CDD|233327 TIGR01241, FtsH_fam, ATP-dependent metalloprotease FtsH.
HflB(FtsH) is a pleiotropic protein required for correct
cell division in bacteria. It has ATP-dependent zinc
metalloprotease activity. It was formerly designated
cell division protein FtsH [Cellular processes, Cell
division, Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 495
Score = 31.5 bits (72), Expect = 0.50
Identities = 10/13 (76%), Positives = 12/13 (92%)
Query: 8 ILDPALVRPGRVD 20
+LDPAL+RPGR D
Sbjct: 205 VLDPALLRPGRFD 217
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 31.3 bits (71), Expect = 0.55
Identities = 14/43 (32%), Positives = 16/43 (37%), Gaps = 6/43 (13%)
Query: 45 DNASKFADQAKKKKNF------RRKKKKKTKKKKKKKKTKKKN 81
Q KK NF R +KK K K KK + KK
Sbjct: 748 GKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKP 790
Score = 28.6 bits (64), Expect = 4.5
Identities = 11/29 (37%), Positives = 15/29 (51%)
Query: 51 ADQAKKKKNFRRKKKKKTKKKKKKKKTKK 79
A + +KK + KK K KK KT+K
Sbjct: 768 AFRGEKKAKPKAAKKDARKAKKPSAKTQK 796
Score = 27.8 bits (62), Expect = 8.2
Identities = 12/34 (35%), Positives = 15/34 (44%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
K A KK+ R+ KK K +K TK K
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAK 804
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 30.8 bits (70), Expect = 0.57
Identities = 8/38 (21%), Positives = 15/38 (39%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
++ + R++K K KK K+K +R
Sbjct: 211 SASVVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248
>gnl|CDD|130309 TIGR01242, 26Sp45, 26S proteasome subunit P45 family. Many
proteins may score above the trusted cutoff because an
internal.
Length = 364
Score = 30.9 bits (70), Expect = 0.66
Identities = 12/19 (63%), Positives = 14/19 (73%)
Query: 2 STQALTILDPALVRPGRVD 20
+T ILDPAL+RPGR D
Sbjct: 267 ATNRPDILDPALLRPGRFD 285
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 29.5 bits (66), Expect = 0.71
Identities = 13/20 (65%), Positives = 17/20 (85%)
Query: 62 RKKKKKTKKKKKKKKTKKKN 81
+KKKKK+KK KK KK+ KK+
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120
Score = 28.7 bits (64), Expect = 1.3
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 61 RRKKKKKTKKKKKKKKTKKK 80
++KKKK K KK KK +KK
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120
Score = 26.8 bits (59), Expect = 7.1
Identities = 13/32 (40%), Positives = 18/32 (56%), Gaps = 1/32 (3%)
Query: 44 VDNASKFADQAKKKKNFRRKKKKKTKKKKKKK 75
D+ + ++ KKKK + KK KK KK KK
Sbjct: 90 DDHPPEPTEKPKKKK-KKSKKTKKPKKSSKKD 120
Score = 26.4 bits (58), Expect = 8.2
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 61 RRKKKKKTKKKKKKKKTKKKNRR 83
K KKK KK KK KK KK +++
Sbjct: 97 TEKPKKKKKKSKKTKKPKKSSKK 119
>gnl|CDD|197876 smart00792, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP) is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 124
Score = 29.5 bits (66), Expect = 0.75
Identities = 9/27 (33%), Positives = 13/27 (48%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKK 79
K KK + +KK +KK+KK
Sbjct: 51 NKKPKKISAEEAEKKLLQKKEKKALTN 77
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 30.4 bits (69), Expect = 0.77
Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 2/40 (5%)
Query: 42 QDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
+DV+ A A++ +K K KKKK K+ K K KK
Sbjct: 239 EDVEAAESRAEKKRKSKE--EIKKKKPKESKGVKALKKVV 276
Score = 27.7 bits (62), Expect = 6.6
Identities = 10/25 (40%), Positives = 11/25 (44%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKK 79
KKK + K K KK K KK
Sbjct: 258 IKKKKPKESKGVKALKKVVAKGMKK 282
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 30.4 bits (69), Expect = 0.80
Identities = 12/36 (33%), Positives = 17/36 (47%)
Query: 46 NASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
N ++ + K + + KKK KKKKKK K
Sbjct: 163 NPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNAT 198
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 30.0 bits (68), Expect = 0.86
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
Q K+++N +++K K KKKKK K K +
Sbjct: 177 QKKREENLKKRKDDKKNKKKKKAKKKGR 204
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 30.7 bits (70), Expect = 0.88
Identities = 11/49 (22%), Positives = 18/49 (36%)
Query: 32 QLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
Q++Q+ V + + KK + K K KKK K +
Sbjct: 51 QIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDE 99
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 30.0 bits (68), Expect = 0.89
Identities = 13/38 (34%), Positives = 18/38 (47%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITLE 90
KK+K + + K+ K KKK KK R+ Y E
Sbjct: 3 LLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAE 40
>gnl|CDD|220816 pfam10592, AIPR, AIPR protein. This family of proteins was
identified in as an abortive infection phage resistance
protein often found in restriction modification system
operons.
Length = 300
Score = 30.4 bits (69), Expect = 0.95
Identities = 14/58 (24%), Positives = 21/58 (36%), Gaps = 2/58 (3%)
Query: 32 QLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITL 89
L+ +YK + D + K + + +K KKKKK N Y I
Sbjct: 193 DLKGIYKIIFNDHLEEDYYKSIVAKYRLIFKALEKIIKKKKKYF--GGANIVAYTILA 248
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 30.1 bits (68), Expect = 0.96
Identities = 8/21 (38%), Positives = 10/21 (47%)
Query: 63 KKKKKTKKKKKKKKTKKKNRR 83
+K K K KK KK R+
Sbjct: 194 GDEKSPKSKAAPKKAGKKMRK 214
>gnl|CDD|219553 pfam07751, Abi_2, Abi-like protein. This family, found in various
bacterial species, contains sequences that are similar
to the Abi group of proteins, which are involved in
bacteriophage resistance mediated by abortive infection
in Lactococcus species. The proteins are thought to have
helix-turn-helix motifs, found in many DNA-binding
proteins, allowing them to perform their function.
Length = 190
Score = 29.6 bits (67), Expect = 1.0
Identities = 17/63 (26%), Positives = 27/63 (42%), Gaps = 3/63 (4%)
Query: 31 HQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMIT-L 89
+ L + Y F + S + D+ K+F +K K K+ K K + K+ R T L
Sbjct: 67 YHLSEKYGPFAYL--DKSLYKDRFNYSKDFIQKLSKIIKRSKDKLEDFIKHYRKNYDTEL 124
Query: 90 EIP 92
P
Sbjct: 125 LFP 127
>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease. The Slx4 protein is a
heteromeric structure-specific endonuclease found in
fungi. Slx4 with Slx1 acts as a nuclease on branched
DNA substrates, particularly simple-Y, 5'-flap, or
replication fork structures by cleaving the strand
bearing the 5' non-homologous arm at the branch
junction and thus generating ligatable nicked products
from 5'-flap or replication fork substrates.
Length = 627
Score = 30.3 bits (68), Expect = 1.1
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 7/61 (11%)
Query: 30 RHQLEQMYKRF--YQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMI 87
R QL Q+ RF ++ D A+ K+ KKKK KK K +K+TK+KN++ +
Sbjct: 42 RLQLRQVISRFTSFESDDQANSGNVSGKRVP-----KKKKIKKPKLRKRTKRKNKKIKSL 96
Query: 88 T 88
T
Sbjct: 97 T 97
>gnl|CDD|200312 TIGR03689, pup_AAA, proteasome ATPase. In the Actinobacteria, as
shown for Mycobacterium tuberculosis, some proteins are
modified by ligation between an epsilon-amino group of a
lysine side chain and the C-terminal carboxylate of the
ubiquitin-like protein Pup. This modification leads to
protein degradation by the archaeal-like proteasome
found in the Actinobacteria. Members of this protein
family belong to the AAA family of ATPases and tend to
be clustered with the genes for Pup, the Pup ligase
PafA, and structural components of the proteasome. This
protein forms hexameric rings with ATPase activity
[Protein fate, Degradation of proteins, peptides, and
glycopeptides].
Length = 512
Score = 30.4 bits (69), Expect = 1.1
Identities = 10/19 (52%), Positives = 16/19 (84%)
Query: 8 ILDPALVRPGRVDLKEYIG 26
++DPA++RPGR+D+K I
Sbjct: 345 MIDPAILRPGRLDVKIRIE 363
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 30.2 bits (68), Expect = 1.2
Identities = 12/26 (46%), Positives = 12/26 (46%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKK 80
K KK K K K KKK K K K
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSK 777
Score = 29.0 bits (65), Expect = 3.2
Identities = 14/31 (45%), Positives = 15/31 (48%), Gaps = 2/31 (6%)
Query: 54 AKKKKNFRRKKKKKTKKKKKK--KKTKKKNR 82
AKKK + K KT KKK K K K R
Sbjct: 753 AKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783
Score = 28.6 bits (64), Expect = 4.2
Identities = 14/44 (31%), Positives = 19/44 (43%), Gaps = 5/44 (11%)
Query: 48 SKFADQAK---KKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMIT 88
+K+A K +KK + KKK K K KT KK +
Sbjct: 737 TKYAGTPKTPYEKKT--KAKKKSASTKGKAAKTVKKKSKAKSKK 778
Score = 27.5 bits (61), Expect = 8.5
Identities = 11/30 (36%), Positives = 14/30 (46%)
Query: 54 AKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
KK + + K K KKK K K K +R
Sbjct: 754 KKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
This presumed domain is found at the N-terminus of
Ribosomal L30 proteins and has been termed RL30NT or
NUC018.
Length = 71
Score = 28.0 bits (63), Expect = 1.2
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
KK+K + + K+ KK+ KK +KK R
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAARKKKR 31
Score = 26.0 bits (58), Expect = 5.5
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 56 KKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
KK+ K + K KK+ KK +K +R
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAARKKKR 31
Score = 26.0 bits (58), Expect = 6.4
Identities = 9/34 (26%), Positives = 15/34 (44%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYM 86
K + +K+ K +KKK+K K Y+
Sbjct: 9 NEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYV 42
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 28.5 bits (64), Expect = 1.4
Identities = 11/52 (21%), Positives = 22/52 (42%)
Query: 32 QLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
QL + ++ Q + ++ K R+K + + + KK KK+ R
Sbjct: 53 QLIHLSQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRS 104
Score = 27.0 bits (60), Expect = 5.6
Identities = 10/50 (20%), Positives = 19/50 (38%)
Query: 31 HQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
H +Q+ + + + KK+K R + KK KK+ +
Sbjct: 56 HLSQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSS 105
>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
Length = 631
Score = 30.1 bits (69), Expect = 1.5
Identities = 13/37 (35%), Positives = 19/37 (51%)
Query: 40 FYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKK 76
Q+ + A K A++A K R + KK K+KKK
Sbjct: 362 LNQNPELAEKLAEKAIKAAQARLRAAKKVKRKKKTSG 398
>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
topoisomerase IA [General function prediction only].
Length = 298
Score = 29.7 bits (67), Expect = 1.5
Identities = 10/33 (30%), Positives = 15/33 (45%)
Query: 51 ADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
A++ K ++ KK T K KK KK +
Sbjct: 253 AERRAKGGPGKKPAKKATAAKAKKTTAKKAAAK 285
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 29.0 bits (65), Expect = 1.6
Identities = 9/28 (32%), Positives = 19/28 (67%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
+ +K+ + ++K+ K KK+KK+ K+K
Sbjct: 97 KEQKEVSKETEEKEAIKAKKEKKEKKEK 124
Score = 27.8 bits (62), Expect = 4.2
Identities = 9/32 (28%), Positives = 20/32 (62%)
Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
+ K++K ++ ++K K KK+K +KK ++
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
>gnl|CDD|233478 TIGR01577, oligosac_amyl, oligosaccharide amylase. The name of
this type of amylase is based on the characterization of
an glucoamylase family enzyme from Thermoactinomyces
vulgaris. The T. vulgaris enzyme was expressed in E.
coli and, like other glucoamylases, it releases
beta-D-glucose from starch. However, unlike previously
characterized glucoamylases, this T. vulgaris amylase
hydrolyzes maltooligosaccharides (maltotetraose,
maltose) more efficiently than starch (PMID: 11549021),
indicating this enzyme belongs to a class of
glucoamylase-type enzymes with
oligosaccharide-metabolizing activity.
Length = 616
Score = 29.9 bits (67), Expect = 1.6
Identities = 16/81 (19%), Positives = 26/81 (32%), Gaps = 11/81 (13%)
Query: 155 ERSRESTTLDIQQGIPWET----------VTLTAFGRDRSLFFNILEEARTLALKQYEGK 204
++E + WE AFG+ LEE RTL ++ +
Sbjct: 169 NCNKEDIGMHTDGACLWELGLCPPGSKCFTIFFAFGKTVEKALEELEELRTLGFERIRSQ 228
Query: 205 TVMY-TALGSEWRPFGHPKRR 224
T+ A + +P P
Sbjct: 229 TLADWEAYFGQAKPVQTPCLE 249
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association
with pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 27.6 bits (62), Expect = 1.7
Identities = 15/37 (40%), Positives = 23/37 (62%), Gaps = 5/37 (13%)
Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTK-----KKNRR 83
D +K ++ +R ++KK KKKK KKK+K KK +
Sbjct: 28 DASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQM 64
>gnl|CDD|233328 TIGR01243, CDC48, AAA family ATPase, CDC48 subfamily. This
subfamily of the AAA family ATPases includes two members
each from three archaeal species. It also includes yeast
CDC48 (cell division control protein 48) and the human
ortholog, transitional endoplasmic reticulum ATPase
(valosin-containing protein). These proteins in
eukaryotes are involved in the budding and transfer of
membrane from the transitional endoplasmic reticulum to
the Golgi apparatus.
Length = 733
Score = 29.9 bits (67), Expect = 1.7
Identities = 11/13 (84%), Positives = 12/13 (92%)
Query: 8 ILDPALVRPGRVD 20
ILDPAL+RPGR D
Sbjct: 602 ILDPALLRPGRFD 614
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
This domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 27.9 bits (63), Expect = 1.7
Identities = 15/49 (30%), Positives = 23/49 (46%), Gaps = 11/49 (22%)
Query: 32 QLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
+E+MYK+ A A + R+KK+KK K + K+ KK
Sbjct: 43 DIEEMYKK-----------AHAAIRADPSRKKKEKKEVKAESKRYNAKK 80
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 29.7 bits (66), Expect = 1.7
Identities = 12/53 (22%), Positives = 27/53 (50%), Gaps = 2/53 (3%)
Query: 34 EQMYKRFYQDVDNASKFADQAKK--KKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
E K +D + DQ +K ++ +R K+++ KK+K+++ + + H
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIH 1062
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 29.8 bits (67), Expect = 1.7
Identities = 12/25 (48%), Positives = 20/25 (80%)
Query: 61 RRKKKKKTKKKKKKKKTKKKNRRHY 85
R+KKKKK K+KKKKK ++ +++ +
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH 708
Score = 27.9 bits (62), Expect = 7.4
Identities = 13/31 (41%), Positives = 17/31 (54%)
Query: 60 FRRKKKKKTKKKKKKKKTKKKNRRHYMITLE 90
+KKKK KK K+KKK K + + LE
Sbjct: 681 CGSRKKKKKKKSKEKKKKKNREASKQIHALE 711
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 29.2 bits (66), Expect = 1.8
Identities = 9/26 (34%), Positives = 10/26 (38%)
Query: 56 KKKNFRRKKKKKTKKKKKKKKTKKKN 81
K K + K K KK K K K
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGG 219
Score = 27.7 bits (62), Expect = 5.1
Identities = 11/36 (30%), Positives = 16/36 (44%)
Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
D + +K K + K KK K K ++K K K
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAK 73
Score = 27.7 bits (62), Expect = 5.9
Identities = 12/41 (29%), Positives = 21/41 (51%)
Query: 43 DVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
D + + A A K K + K K +K+K K++ ++K R
Sbjct: 42 DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLR 82
Score = 27.7 bits (62), Expect = 6.1
Identities = 14/44 (31%), Positives = 19/44 (43%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITLE 90
K AK K K K + K+K K++K +K R T E
Sbjct: 47 EEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPE 90
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 29.5 bits (67), Expect = 1.8
Identities = 11/43 (25%), Positives = 23/43 (53%)
Query: 42 QDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
++ +N + + KK + +RK +KK +K++ +K KK
Sbjct: 399 EEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEA 441
>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain. This region has been
identified as the binding site of the SRP72 protein to
SRP RNA.
Length = 57
Score = 26.6 bits (59), Expect = 2.2
Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 7/52 (13%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITLEIPCRDKSY 98
K + KKKK+ ++K+K K K T R +P RD+SY
Sbjct: 10 KKKPGTEQKKKKSAKKKRKPKLPKNFNPVVTPDPERW-------LPMRDRSY 54
Score = 26.3 bits (58), Expect = 3.1
Identities = 14/33 (42%), Positives = 19/33 (57%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKK 79
+K + KKK +KKKK+ KKK+K K K
Sbjct: 1 EAKKPKKIAKKKPGTEQKKKKSAKKKRKPKLPK 33
>gnl|CDD|201461 pfam00829, Ribosomal_L21p, Ribosomal prokaryotic L21 protein.
Length = 96
Score = 27.5 bits (62), Expect = 2.3
Identities = 10/28 (35%), Positives = 14/28 (50%), Gaps = 2/28 (7%)
Query: 62 RKKKKKTKKKKKKKKTKKKN--RRHYMI 87
R KK K K++K +KK R+ Y
Sbjct: 68 RGKKVIVFKYKRRKNYRKKQGHRQPYTR 95
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 28.5 bits (64), Expect = 2.3
Identities = 13/40 (32%), Positives = 23/40 (57%)
Query: 42 QDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
++++ K ++ +K K ++K KKK K K KK KK +
Sbjct: 69 EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD 108
Score = 28.5 bits (64), Expect = 2.4
Identities = 11/29 (37%), Positives = 14/29 (48%)
Query: 54 AKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
KKK + K KK KK K +K +K
Sbjct: 90 KSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118
Score = 27.3 bits (61), Expect = 6.2
Identities = 12/30 (40%), Positives = 13/30 (43%)
Query: 51 ADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
Q K K + KKKK K KK K K
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109
Score = 27.3 bits (61), Expect = 6.5
Identities = 13/30 (43%), Positives = 15/30 (50%)
Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
+ K KK +KKK K K KK KK K
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKKDDKSE 111
>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain. This family
contains a C-terminal presumed domain in Sas10 which
hash been identified as a regulator of chromatin
silencing.
Length = 76
Score = 27.2 bits (61), Expect = 2.4
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYM 86
K ++N R KK+KK +K K+ K++++ R
Sbjct: 25 KDRRNPRVKKRKKYEKAVKRLKSQRRVVREED 56
>gnl|CDD|222959 PHA02999, PHA02999, Hypothetical protein; Provisional.
Length = 382
Score = 29.0 bits (65), Expect = 2.5
Identities = 17/61 (27%), Positives = 24/61 (39%), Gaps = 4/61 (6%)
Query: 21 LKEYIGFCSRHQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
L EYI + F + Q ++N + KKK K KKK KK ++
Sbjct: 63 LYEYIAYKQNAS----NPNFQNSDSVVNLTKKQISSEENTMKMDKKKYKLKKKIKKVREI 118
Query: 81 N 81
N
Sbjct: 119 N 119
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many
hypothetical proteins.
Length = 98
Score = 27.4 bits (61), Expect = 2.8
Identities = 13/47 (27%), Positives = 18/47 (38%), Gaps = 6/47 (12%)
Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKK------TKKKNRRHY 85
N K + K+ R K K +KK KKK ++ RR
Sbjct: 19 SNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRT 65
Score = 26.7 bits (59), Expect = 5.7
Identities = 11/39 (28%), Positives = 16/39 (41%)
Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
+ + +KKK R K + K KK +K KK
Sbjct: 14 IDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDE 52
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 29.1 bits (66), Expect = 2.9
Identities = 9/34 (26%), Positives = 19/34 (55%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
KK++ +++KKKK KKK++ +++
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEE 446
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 28.9 bits (65), Expect = 3.0
Identities = 17/40 (42%), Positives = 22/40 (55%)
Query: 42 QDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
+DVD+ F + + K+ KKKKKKKK KKKN
Sbjct: 31 KDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKN 70
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 28.0 bits (63), Expect = 3.0
Identities = 10/30 (33%), Positives = 22/30 (73%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
++KK ++KK++ K++K++ K +KK +H
Sbjct: 100 QEKKKRSKEKKEEEKERKRQLKQQKKKAKH 129
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 28.9 bits (65), Expect = 3.1
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRH 84
D KK+K R++K++ K + KK K+K+++
Sbjct: 624 DGIKKRKKKRKRKRRFLTKIEGVKKEKRKDKKL 656
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 29.0 bits (65), Expect = 3.1
Identities = 16/49 (32%), Positives = 21/49 (42%), Gaps = 5/49 (10%)
Query: 37 YKRFYQDVDNASKFA-----DQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
Y F+ +N+ +FA +K K K KKK K K KKK
Sbjct: 93 YDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKK 141
Score = 27.9 bits (62), Expect = 6.6
Identities = 13/52 (25%), Positives = 17/52 (32%)
Query: 46 NASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYMITLEIPCRDKS 97
+ K K + KKK T K KK KK + + KS
Sbjct: 108 GPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKS 159
>gnl|CDD|178450 PLN02859, PLN02859, glutamine-tRNA ligase.
Length = 788
Score = 29.0 bits (65), Expect = 3.4
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 81 NRRHYMITLEIPCRDKSYHWLLH 103
N H + TLE R SY+WLL
Sbjct: 456 NITHSLCTLEFETRRASYYWLLD 478
>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
Length = 801
Score = 29.0 bits (65), Expect = 3.4
Identities = 14/36 (38%), Positives = 16/36 (44%)
Query: 44 VDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKK 79
A K A Q KKK KKK+ KK K+K
Sbjct: 597 AAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKA 632
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 28.8 bits (64), Expect = 3.5
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
D+ K + R+KK+K+ KK K KK+
Sbjct: 555 DRTDKNRERRKKKRKRRAAKKAVTKAKKE 583
Score = 28.4 bits (63), Expect = 4.1
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 58 KNFRRKKKKKTKKKKKKKKTKKKNRR 83
KN R+KKK+ ++ KK TK K R
Sbjct: 559 KNRERRKKKRKRRAAKKAVTKAKKER 584
>gnl|CDD|225186 COG2303, BetA, Choline dehydrogenase and related flavoproteins
[Amino acid transport and metabolism].
Length = 542
Score = 28.6 bits (64), Expect = 3.6
Identities = 13/46 (28%), Positives = 18/46 (39%), Gaps = 1/46 (2%)
Query: 213 SEWRPFGHPKRRRPLNSVVLDSGVADRILADIRQFVADPAWYLDRG 258
S R + P +RP N +L RIL + + V D G
Sbjct: 203 SAARAYLKPALKRP-NLTLLTGARVRRILLEGDRAVGVEVEIGDGG 247
>gnl|CDD|187679 cd09621, CBM9_like_5, DOMON-like type 9 carbohydrate binding
module. Family 9 carbohydrate-binding modules (CBM9)
play a role in the microbial degradation of cellulose
and hemicellulose (materials found in plants). The
domain has previously been called cellulose-binding
domain. The polysaccharide binding sites of CBMs with
available 3D structure have been found to be either flat
surfaces with interactions formed by predominantly
aromatic residues (tryptophan and tyrosine), or extended
shallow grooves. CBM9 domains found in this
uncharacterized heterogeneous subfamily are often
located at the C-terminus of longer proteins and may
co-occur with various other functional domains such as
glycosyl hydrolases. The CBM9 module in these
architectures may be involved in binding to
carbohydrates.
Length = 188
Score = 28.1 bits (63), Expect = 3.7
Identities = 14/40 (35%), Positives = 19/40 (47%), Gaps = 1/40 (2%)
Query: 152 IRVERSR-ESTTLDIQQGIPWETVTLTAFGRDRSLFFNIL 190
IR+E SR E+ + IPW + LT FN+L
Sbjct: 116 IRLETSRDEAKVTVYEAAIPWAELGLTGPAAGDGFRFNLL 155
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 28.3 bits (63), Expect = 3.9
Identities = 14/35 (40%), Positives = 18/35 (51%)
Query: 47 ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
A K A +AKKK K K + K K K ++ K K
Sbjct: 164 AKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKA 198
Score = 27.5 bits (61), Expect = 6.8
Identities = 13/36 (36%), Positives = 19/36 (52%)
Query: 45 DNASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
+ +K A +AKKK +KK + K K + K K K
Sbjct: 155 EAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 28.5 bits (63), Expect = 4.0
Identities = 12/47 (25%), Positives = 21/47 (44%), Gaps = 10/47 (21%)
Query: 44 VDNASKFADQAKKKKNFRRKK----------KKKTKKKKKKKKTKKK 80
V+ + ++KK R+K +K KK +K++K KK
Sbjct: 100 VNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKM 146
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 28.5 bits (64), Expect = 4.2
Identities = 18/66 (27%), Positives = 24/66 (36%), Gaps = 18/66 (27%)
Query: 34 EQMYKRFYQDV-----DNASKFADQAKKKKNFRRKKKK-------------KTKKKKKKK 75
Q Y +V + S F+++ K KK K K KKKKK+K
Sbjct: 21 LQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEK 80
Query: 76 KTKKKN 81
K K
Sbjct: 81 KEPKSE 86
Score = 28.1 bits (63), Expect = 5.4
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
+ KKK+ ++KKKKK KK+ K + K
Sbjct: 63 NNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93
>gnl|CDD|222928 PHA02778, PHA02778, major capsid L1 protein; Provisional.
Length = 503
Score = 28.2 bits (64), Expect = 4.4
Identities = 18/68 (26%), Positives = 28/68 (41%), Gaps = 18/68 (26%)
Query: 19 VDLKEYIGFCSRHQLEQMYKRFYQDVDNAS---KFADQAKKKKNFRRKKKKKTKKKKKKK 75
VDL E RF D+D KF QA ++ RR+K+K T +
Sbjct: 451 VDLTE---------------RFSLDLDQFPLGRKFLFQAGLRRRTRRRKRKSTSSSSSRS 495
Query: 76 KTKKKNRR 83
+++ +R
Sbjct: 496 TKRRRRKR 503
>gnl|CDD|237625 PRK14147, PRK14147, heat shock protein GrpE; Provisional.
Length = 172
Score = 27.6 bits (61), Expect = 4.4
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 30 RHQLEQMYKRFYQDVDNASKFADQ 53
R LE KR +DV+ A KFA++
Sbjct: 45 RADLENQRKRIARDVEQARKFANE 68
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 27.9 bits (62), Expect = 4.8
Identities = 14/32 (43%), Positives = 17/32 (53%), Gaps = 7/32 (21%)
Query: 59 NFRRK-------KKKKTKKKKKKKKTKKKNRR 83
++RRK KKKK K KK K K KK +
Sbjct: 79 SWRRKPATTAVEKKKKGKSKKNKLKGKKDEDK 110
Score = 27.5 bits (61), Expect = 6.9
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTK 78
KKKK +K K K KK + KKK +
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAR 114
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 27.7 bits (62), Expect = 5.2
Identities = 10/44 (22%), Positives = 21/44 (47%), Gaps = 4/44 (9%)
Query: 29 SRHQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKK 72
+ + ++ + F + A A K++ +KK++KKKK
Sbjct: 157 AEWKDAKLLEEFAAE----EAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|188874 cd09768, Luminal_EIF2AK3, The Luminal domain, a dimerization
domain, of the Serine/Threonine protein kinase,
eukaryotic translation Initiation Factor 2-Alpha Kinase
3. The Luminal domain is a dimerization domain present
in eukaryotic translation Initiation Factor 2-Alpha
Kinase 3 (EIF2AK3), also called PKR-like Endoplasmic
Reticulum Kinase (PERK). EIF2AK3 is a serine/threonine
protein kinase (STK) and a type I transmembrane protein
that is localized in the endoplasmic reticulum (ER). As
a EIF2AK, it phosphorylates the alpha subunit of eIF-2,
resulting in the downregulation of protein synthesis.
eIF-2 phosphorylation is induced in response to cellular
stresses including virus infection, heat shock, nutrient
deficiency, and the accummulation of unfolded proteins,
among others. There are four distinct kinases that
phosphorylate eIF-2 and control protein synthesis:
General Control Non-derepressible-2 (GCN2), protein
kinase regulated by RNA (PKR), heme-regulated inhibitor
kinase (HRI), and PERK. PERK contains a luminal domain
bound with the chaperone BiP under unstressed conditions
and a cytoplasmic catalytic kinase domain. In response
to the accumulation of misfolded or unfolded proteins in
the ER, PERK is activated through the release of BiP,
allowing it to dimerize through its luminal domain and
autophosphorylate. It functions as the central regulator
of translational control during the Unfolded Protein
Response (UPR) pathway. In addition to the eIF-2 alpha
subunit, PERK also phosphorylates Nrf2, a leucine zipper
transcription factor which regulates cellular redox
status and promotes cell survival during the UPR.
Length = 301
Score = 28.1 bits (63), Expect = 5.2
Identities = 8/35 (22%), Positives = 13/35 (37%)
Query: 118 VETSFEKFDTGYVKTKYDFIPSIGTHLFSYQGNWI 152
V +S + IPS+ L+ + G I
Sbjct: 35 VSSSLSTLELINNGKSVRLIPSLDGSLYQFDGESI 69
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The
function is unclear.
Length = 151
Score = 27.3 bits (60), Expect = 5.4
Identities = 11/29 (37%), Positives = 21/29 (72%)
Query: 54 AKKKKNFRRKKKKKTKKKKKKKKTKKKNR 82
AK++K + + +K+ KKKK+K ++KK +
Sbjct: 11 AKQRKESGKTQSQKSDKKKKEKVSEKKGK 39
Score = 26.9 bits (59), Expect = 7.5
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 56 KKKNFRRKKKKKTKKKKKKKKTKKK 80
K ++ + KKKK K +KK K+KKK
Sbjct: 19 KTQSQKSDKKKKEKVSEKKGKSKKK 43
>gnl|CDD|130083 TIGR01010, BexC_CtrB_KpsE, polysaccharide export inner-membrane
protein, BexC/CtrB/KpsE family. This family contains
gamma proteobacterial proteins involved in capsule
polysaccharide export [Transport and binding proteins,
Carbohydrates, organic alcohols, and acids].
Length = 362
Score = 28.1 bits (63), Expect = 5.5
Identities = 16/58 (27%), Positives = 25/58 (43%), Gaps = 4/58 (6%)
Query: 141 GTHLFSYQGNWIRVERSRESTTLDIQQGIPWETVTLTAFGRDRSLFFN--ILEEARTL 196
H Q + R + R S +D GI T+ +TAF + + N +L+E L
Sbjct: 104 RLHKNGTQEAFYRYYKKRLSVDVDSVSGI--LTLNVTAFDAEEAQKINQRLLKEGERL 159
>gnl|CDD|153356 cd07672, F-BAR_PSTPIP2, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
Phosphatase-Interacting Protein 2. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Proline-Serine-Threonine
Phosphatase-Interacting Protein 2 (PSTPIP2), also known
as Macrophage Actin-associated tYrosine Phosphorylated
protein (MAYP), is mostly expressed in hematopoietic
cells but is also expressed in the brain. It is involved
in regulating cell adhesion and motility. Mutations in
the gene encoding murine PSTPIP2 can cause
autoinflammatory disorders such as chronic multifocal
osteomyelitis and macrophage autoinflammatory disease.
PSTPIP2 contains an N-terminal F-BAR domain and lacks
the PEST motifs and SH3 domain that are found in
PSTPIP1. F-BAR domains form banana-shaped dimers with a
positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 240
Score = 27.6 bits (61), Expect = 5.7
Identities = 22/72 (30%), Positives = 31/72 (43%), Gaps = 15/72 (20%)
Query: 40 FYQDVDN--------ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR-HYMITLE 90
F Q +DN A D+AKK ++FR ++K KK + K R + T+E
Sbjct: 66 FKQQIDNVGQSHIQLAQTLRDEAKKMEDFRERQKLARKKIELIMDAIHKQRAMQFKKTME 125
Query: 91 IP------CRDK 96
CRDK
Sbjct: 126 SKKNYEQKCRDK 137
>gnl|CDD|177142 MTH00070, ND2, NADH dehydrogenase subunit 2; Provisional.
Length = 346
Score = 28.0 bits (63), Expect = 5.9
Identities = 9/20 (45%), Positives = 10/20 (50%), Gaps = 6/20 (30%)
Query: 86 MITLEIPCRDKSYHWLLHWI 105
MITL S HW L W+
Sbjct: 18 MITL------SSNHWFLAWM 31
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 27.7 bits (62), Expect = 6.2
Identities = 13/29 (44%), Positives = 16/29 (55%)
Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTKKK 80
KKK+ + K+ TKKKKKK T K
Sbjct: 83 RLKKKKRVKTKAYKEPTKKKKKKDPTAAK 111
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap
is involved in recombination. Rap (recombination adept
with plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 27.3 bits (61), Expect = 6.9
Identities = 7/34 (20%), Positives = 19/34 (55%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRRHYM 86
+ K ++ R+ + + +++ K +K K K R ++
Sbjct: 36 REKAQEKKRKAEAQAERRELKARKEKLKTRSDWL 69
>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
Length = 177
Score = 27.2 bits (61), Expect = 6.9
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 62 RKKKKKTKKKKKKKKTKKKNR 82
+KKK K+ K K+KKK R
Sbjct: 1 KKKKSSPKRSKGMAKSKKKTR 21
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 27.9 bits (63), Expect = 7.2
Identities = 12/23 (52%), Positives = 13/23 (56%)
Query: 62 RKKKKKTKKKKKKKKTKKKNRRH 84
K +K KKKKKK KT K R
Sbjct: 680 EKIQKPKKKKKKKPKTVKPKPRT 702
>gnl|CDD|214619 smart00325, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange
factor for Rho/Rac/Cdc42-like GTPases Also called
Dbl-homologous (DH) domain. It appears that PH domains
invariably occur C-terminal to RhoGEF/DH domains.
Improved coverage.
Length = 180
Score = 27.3 bits (61), Expect = 7.3
Identities = 12/60 (20%), Positives = 25/60 (41%), Gaps = 2/60 (3%)
Query: 20 DLKEYIGFCSRHQLE--QMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKKKKKKT 77
D E IG E ++Y + + +A + + KK F++ K+ + ++ T
Sbjct: 69 DSVERIGDVFLKLEEFFKIYSEYCSNHPDALELLKKLKKNPRFQKFLKEIESSPQCRRLT 128
>gnl|CDD|214386 CHL00176, ftsH, cell division protein; Validated.
Length = 638
Score = 27.7 bits (62), Expect = 7.3
Identities = 10/13 (76%), Positives = 11/13 (84%)
Query: 8 ILDPALVRPGRVD 20
ILD AL+RPGR D
Sbjct: 333 ILDAALLRPGRFD 345
>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 158 and 187 amino
acids in length. This family includes the PcfB protein.
Length = 137
Score = 26.8 bits (60), Expect = 7.6
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 55 KKKKNFRRKKKKKTKKKKKKKKTKKK 80
KK K F+ ++ K K++K K K KKK
Sbjct: 112 KKLKKFKEEQAAKKKERKDKVKNKKK 137
>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
factor dissociates the posttermination complex, composed
of the ribosome, deacylated tRNA, and mRNA, after
termination of translation. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis. RRF is believed to bind the ribosome at the
A-site in a manner that mimics tRNA, but the specific
mechanisms remain unclear. RRF is essential for
bacterial growth. It is not necessary for cell growth
in archaea or eukaryotes, but is found in mitochondria
or chloroplasts of some eukaryotic species.
Length = 179
Score = 26.8 bits (60), Expect = 8.4
Identities = 14/37 (37%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
Query: 44 VDNASKFADQAKKK-KNFRRKKKKKTKKKKKKKKTKK 79
V +A K A++AK +N RR K KK +K+K+ +
Sbjct: 110 VKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISE 146
>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease. The potyviridae
family positive stand RNA viruses with genome encoding a
polyprotein. members include zucchini yellow mosaic
virus, and turnip mosaic viruses which cause
considerable losses of crops worldwide. This family
consists of a C terminus region from various plant
potyvirus P1 proteins (found at the N terminus of the
polyprotein). The C terminus of P1 is a serine-type
protease responsible for autocatalytic cleavage between
P1 and the helper component protease pfam00851. The
entire P1 protein may be involved in virus-host
interactions.
Length = 245
Score = 27.3 bits (61), Expect = 8.6
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 60 FRRKKKKKTKKKKKKKKTKKKNRR 83
FR K+T KK KKKK KKK
Sbjct: 82 FRTPYYKRTTKKMKKKKKKKKVVM 105
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 27.0 bits (60), Expect = 8.7
Identities = 8/24 (33%), Positives = 13/24 (54%)
Query: 61 RRKKKKKTKKKKKKKKTKKKNRRH 84
R+KKK ++ K +K K+ R
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRK 25
>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
Members of the NSCC2 family have been sequenced from
various yeast, fungal and animals species including
Saccharomyces cerevisiae, Drosophila melanogaster and
Homo sapiens. These proteins are the Sec62 proteins,
believed to be associated with the Sec61 and Sec63
constituents of the general protein secretary systems of
yeast microsomes. They are also the non-selective cation
(NS) channels of the mammalian cytoplasmic membrane. The
yeast Sec62 protein has been shown to be essential for
cell growth. The mammalian NS channel proteins has been
implicated in platelet derived growth factor(PGDF)
dependent single channel current in fibroblasts. These
channels are essentially closed in serum deprived
tissue-culture cells and are specifically opened by
exposure to PDGF. These channels are reported to exhibit
equal selectivity for Na+, K+ and Cs+ with low
permeability to Ca2+, and no permeability to anions
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 232
Score = 27.2 bits (60), Expect = 8.8
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 57 KKNFRRKKKKKTKKKKKKKKTKKKN 81
K + KKK K+KK KKK+ ++ N
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKREAN 232
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 27.7 bits (62), Expect = 9.0
Identities = 13/56 (23%), Positives = 24/56 (42%), Gaps = 8/56 (14%)
Query: 34 EQMYKR-------FYQDVDN-ASKFADQAKKKKNFRRKKKKKTKKKKKKKKTKKKN 81
+ M+ ++ + K + ++ K+ + K K +K K KKK KKK
Sbjct: 1127 KDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKK 1182
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 27.5 bits (61), Expect = 9.4
Identities = 9/31 (29%), Positives = 21/31 (67%)
Query: 53 QAKKKKNFRRKKKKKTKKKKKKKKTKKKNRR 83
+ K ++ + ++K+K + K++KKK K+K +
Sbjct: 96 KEPKNESGKEEEKEKEQVKEEKKKKKEKPKE 126
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 26.5 bits (59), Expect = 9.5
Identities = 9/28 (32%), Positives = 20/28 (71%)
Query: 52 DQAKKKKNFRRKKKKKTKKKKKKKKTKK 79
D +K++ R+K++K +K++K+K+ K
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKEWGK 40
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 27.1 bits (60), Expect = 9.6
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 61 RRKKKKKTKKKKKKKKTKKKNRRH 84
K KK K KKK KK+KKK +
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKE 165
>gnl|CDD|220628 pfam10198, Ada3, Histone acetyltransferases subunit 3. Ada3 is a
family of proteins conserved from yeasts to humans. It
is an essential component of the Ada transcriptional
coactivator (alteration/deficiency in activation)
complex. Ada3 plays a key role in linking histone
acetyltransferase-containing complexes to p53 (tumour
suppressor protein) thereby regulating p53 acetylation,
stability and transcriptional activation following DNA
damage.
Length = 127
Score = 26.5 bits (59), Expect = 9.8
Identities = 17/52 (32%), Positives = 22/52 (42%), Gaps = 3/52 (5%)
Query: 21 LKEYIGFCSRHQLEQMYKRFYQDVDNASKFADQAKKKKNFRRKKKKKTKKKK 72
K + Q YK D+D K DQA K+ R KKKK +K+
Sbjct: 59 KKRLLPLAKEEMARQEYKTILDDLD---KQVDQAYLKRIRARGKKKKKEKRP 107
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 27.5 bits (62), Expect = 9.8
Identities = 11/21 (52%), Positives = 13/21 (61%)
Query: 58 KNFRRKKKKKTKKKKKKKKTK 78
+ R KKKKK KK KK + K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 27.1 bits (60), Expect = 9.9
Identities = 14/46 (30%), Positives = 21/46 (45%), Gaps = 3/46 (6%)
Query: 61 RRKKKKKTKKKKKKKKTKKKNRRHYMITLEIPCRDKSYHWLLHWIT 106
R + ++K K+KK T ++NR +IT P R H I
Sbjct: 291 RAQYREKQKEKKAFLWTLRRNRLRMIIT---PWRAPELHAENAEIK 333
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.135 0.409
Gapped
Lambda K H
0.267 0.0771 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,602,352
Number of extensions: 1288958
Number of successful extensions: 6991
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5501
Number of HSP's successfully gapped: 524
Length of query: 263
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 168
Effective length of database: 6,723,972
Effective search space: 1129627296
Effective search space used: 1129627296
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (26.0 bits)