RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7704
(124 letters)
>gnl|CDD|215689 pfam00067, p450, Cytochrome P450. Cytochrome P450s are
haem-thiolate proteins involved in the oxidative
degradation of various compounds. They are particularly
well known for their role in the degradation of
environmental toxins and mutagens. They can be divided
into 4 classes, according to the method by which
electrons from NAD(P)H are delivered to the catalytic
site. Sequence conservation is relatively low within the
family - there are only 3 absolutely conserved residues
- but their general topography and structural fold are
highly conserved. The conserved core is composed of a
coil termed the 'meander', a four-helix bundle, helices
J and K, and two sets of beta-sheets. These constitute
the haem-binding loop (with an absolutely conserved
cysteine that serves as the 5th ligand for the haem
iron), the proton-transfer groove and the absolutely
conserved EXXR motif in helix K. While prokaryotic P450s
are soluble proteins, most eukaryotic P450s are
associated with microsomal membranes. their general
enzymatic function is to catalyze regiospecific and
stereospecific oxidation of non-activated hydrocarbons
at physiological temperatures.
Length = 461
Score = 76.2 bits (188), Expect = 2e-17
Identities = 32/74 (43%), Positives = 47/74 (63%), Gaps = 3/74 (4%)
Query: 2 LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPI-IGRHI 60
LA HP++Q+++ +EI +V+GD PTYD +Q + L VIKETLRL+P P+ + R +
Sbjct: 288 LAKHPEVQEKLREEIDEVIGDK--RSPTYDDLQNMPYLDAVIKETLRLHPVVPLLLPREV 345
Query: 61 TSEIQADKYTIPIG 74
T + Y IP G
Sbjct: 346 TKDTVIPGYLIPKG 359
>gnl|CDD|215164 PLN02290, PLN02290, cytokinin trans-hydroxylase.
Length = 516
Score = 60.6 bits (147), Expect = 6e-12
Identities = 27/74 (36%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
Query: 1 MLALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHI 60
+LA +P Q +V E+ +V G + P+ D + +L LL VI E+LRLYP A ++ R
Sbjct: 342 LLASNPTWQDKVRAEVAEVCGGET---PSVDHLSKLTLLNMVINESLRLYPPATLLPRMA 398
Query: 61 TSEIQADKYTIPIG 74
+I+ IP G
Sbjct: 399 FEDIKLGDLHIPKG 412
>gnl|CDD|225035 COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis,
transport, and catabolism].
Length = 411
Score = 59.4 bits (144), Expect = 1e-11
Identities = 21/75 (28%), Positives = 30/75 (40%), Gaps = 18/75 (24%)
Query: 2 LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHIT 61
L HP ++ E + LL V++ETLRLYP P+ R T
Sbjct: 263 LLRHPDQLAKLRAE------------------PDRPLLEAVVEETLRLYPPVPLARRVAT 304
Query: 62 SEIQADKYTIPIGRK 76
+++ Y IP G
Sbjct: 305 EDVELGGYRIPAGTV 319
>gnl|CDD|215354 PLN02655, PLN02655, ent-kaurene oxidase.
Length = 466
Score = 53.6 bits (129), Expect = 2e-09
Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 4/74 (5%)
Query: 2 LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPII-GRHI 60
LA +P Q+ +Y EI +V GD E V T + + L L V ETLR Y P++ R +
Sbjct: 289 LAKNPDKQERLYREIREVCGD--ERV-TEEDLPNLPYLNAVFHETLRKYSPVPLLPPRFV 345
Query: 61 TSEIQADKYTIPIG 74
+ Y IP G
Sbjct: 346 HEDTTLGGYDIPAG 359
>gnl|CDD|215393 PLN02738, PLN02738, carotene beta-ring hydroxylase.
Length = 633
Score = 53.4 bits (128), Expect = 2e-09
Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 1 MLALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHI 60
+L+ P + ++ +E+ VLGD PT + +++L TRVI E+LRLYP P++ R
Sbjct: 417 LLSKEPSVVAKLQEEVDSVLGD---RFPTIEDMKKLKYTTRVINESLRLYPQPPVLIRRS 473
Query: 61 TSEIQADKYTIPIGRKEN 78
Y PI R E+
Sbjct: 474 LENDMLGGY--PIKRGED 489
>gnl|CDD|215235 PLN02426, PLN02426, cytochrome P450, family 94, subfamily C
protein.
Length = 502
Score = 53.2 bits (128), Expect = 2e-09
Identities = 14/55 (25%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 1 MLALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPI 55
+L+ HP++ + +E +V+G + + ++++++E+H L + E++RL+P P+
Sbjct: 319 LLSKHPEVASAIREEADRVMGPN-QEAASFEEMKEMHYLHAALYESMRLFP--PV 370
>gnl|CDD|215221 PLN02394, PLN02394, trans-cinnamate 4-monooxygenase.
Length = 503
Score = 51.3 bits (123), Expect = 1e-08
Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 5/74 (6%)
Query: 5 HPKIQQEVYDEIVQVLGDD-PETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHIT-S 62
HP+IQ+++ DE+ VLG T P +L L V+KETLRL+ A P++ H+
Sbjct: 323 HPEIQKKLRDELDTVLGPGNQVTEPD---THKLPYLQAVVKETLRLHMAIPLLVPHMNLE 379
Query: 63 EIQADKYTIPIGRK 76
+ + Y IP K
Sbjct: 380 DAKLGGYDIPAESK 393
>gnl|CDD|178524 PLN02936, PLN02936, epsilon-ring hydroxylase.
Length = 489
Score = 50.2 bits (120), Expect = 3e-08
Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 4/76 (5%)
Query: 1 MLALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPI-IGRH 59
+L+ +P+ ++ +E+ +VL P PTY+ I+EL LTR I E++RLYP P+ I R
Sbjct: 304 LLSKNPEALRKAQEELDRVLQGRP---PTYEDIKELKYLTRCINESMRLYPHPPVLIRRA 360
Query: 60 ITSEIQADKYTIPIGR 75
++ Y + G+
Sbjct: 361 QVEDVLPGGYKVNAGQ 376
>gnl|CDD|215371 PLN02687, PLN02687, flavonoid 3'-monooxygenase.
Length = 517
Score = 45.6 bits (108), Expect = 1e-06
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 2 LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHIT 61
L HP I ++ +E+ V+G D V D + +L L VIKET RL+P+ P+ +
Sbjct: 324 LIRHPDILKKAQEELDAVVGRD-RLVSESD-LPQLTYLQAVIKETFRLHPSTPLSLPRMA 381
Query: 62 SE-IQADKYTIPIG 74
+E + + Y IP G
Sbjct: 382 AEECEINGYHIPKG 395
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 45.7 bits (109), Expect = 1e-06
Identities = 14/50 (28%), Positives = 27/50 (54%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
R + K +KK +K K+ +KK+K K++ K KK+ + + K++
Sbjct: 377 RPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRR 426
Score = 44.9 bits (107), Expect = 2e-06
Identities = 15/48 (31%), Positives = 27/48 (56%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
E K K KK K+ ++K+K+++K K+KK+ + K K++K
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 43.4 bits (103), Expect = 6e-06
Identities = 15/49 (30%), Positives = 25/49 (51%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K +KK K K K+ +KK+K+++ K KK + K K++K
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPS 429
Score = 42.2 bits (100), Expect = 1e-05
Identities = 11/46 (23%), Positives = 24/46 (52%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KK + K+ ++ +KK+K+K++ K++ K K++K
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 39.9 bits (94), Expect = 9e-05
Identities = 12/53 (22%), Positives = 29/53 (54%)
Query: 72 PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
P + ++KK K + K+ +KK+++K+K K +K+ + K ++++
Sbjct: 378 PKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSG 430
Score = 38.0 bits (89), Expect = 4e-04
Identities = 12/49 (24%), Positives = 25/49 (51%)
Query: 72 PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
P +K K K+ + ++K+++K+K K +K + K K++K
Sbjct: 383 PSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 38.0 bits (89), Expect = 5e-04
Identities = 12/48 (25%), Positives = 27/48 (56%)
Query: 66 ADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
++K T +K K+ +KK++ K++ K KK+ + + + K++K
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431
>gnl|CDD|215171 PLN02302, PLN02302, ent-kaurenoic acid oxidase.
Length = 490
Score = 44.3 bits (105), Expect = 3e-06
Identities = 21/74 (28%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
Query: 5 HPKIQQEVYDEIVQVLGDDPET--VPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHITS 62
HP++ Q+ E ++ P T ++++ L++VI ETLRL + + R +
Sbjct: 317 HPEVLQKAKAEQEEIAKKRPPGQKGLTLKDVRKMEYLSQVIDETLRLINISLTVFREAKT 376
Query: 63 EIQADKYTIPIGRK 76
+++ + YTIP G K
Sbjct: 377 DVEVNGYTIPKGWK 390
>gnl|CDD|215086 PLN00168, PLN00168, Cytochrome P450; Provisional.
Length = 519
Score = 43.8 bits (103), Expect = 4e-06
Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
Query: 2 LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHIT 61
L +P IQ +++DEI GDD E V D + ++ L V+ E LR +P A + H
Sbjct: 333 LVKNPSIQSKLHDEIKAKTGDDQEEVSEED-VHKMPYLKAVVLEGLRKHPPAHFVLPHKA 391
Query: 62 SE-IQADKYTIPIG 74
+E ++ Y IP G
Sbjct: 392 AEDMEVGGYLIPKG 405
>gnl|CDD|215627 PLN03195, PLN03195, fatty acid omega-hydroxylase; Provisional.
Length = 516
Score = 42.1 bits (99), Expect = 2e-05
Identities = 22/78 (28%), Positives = 37/78 (47%), Gaps = 18/78 (23%)
Query: 1 MLALHPKIQQEVYDEI--------VQVLGDDPETVP----------TYDQIQELHLLTRV 42
M+ ++P + +++Y E+ + +D ++ TYD + +L L V
Sbjct: 318 MIMMNPHVAEKLYSELKALEKERAKEEDPEDSQSFNQRVTQFAGLLTYDSLGKLQYLHAV 377
Query: 43 IKETLRLYPAAPIIGRHI 60
I ETLRLYPA P + I
Sbjct: 378 ITETLRLYPAVPQDPKGI 395
>gnl|CDD|173595 PTZ00404, PTZ00404, cytochrome P450; Provisional.
Length = 482
Score = 40.5 bits (95), Expect = 7e-05
Identities = 22/74 (29%), Positives = 32/74 (43%), Gaps = 4/74 (5%)
Query: 1 MLALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHI 60
ML +P+IQ++ Y+EI + + Q +IKETLR P +P
Sbjct: 309 MLCNYPEIQEKAYNEIKSTVNGRNKV--LLSDRQSTPYTVAIIKETLRYKPVSPFGLPRS 366
Query: 61 TSE--IQADKYTIP 72
TS I + IP
Sbjct: 367 TSNDIIIGGGHFIP 380
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 40.7 bits (95), Expect = 7e-05
Identities = 16/50 (32%), Positives = 30/50 (60%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
RK+ K++K K+K+ KK K +K+ + K + ++ KK +KK +K+
Sbjct: 20 RKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKR 69
Score = 33.7 bits (77), Expect = 0.014
Identities = 15/45 (33%), Positives = 30/45 (66%)
Query: 80 KKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ +KK T+++ ++KKKK++K K K+ KK K +K+ + K + +
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQ 50
Score = 28.3 bits (63), Expect = 1.1
Identities = 15/46 (32%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KK +++ +K+KK++K K+K ++KK K +K+ K K + Q+
Sbjct: 10 KKILTEEELERKKKKEEKAKEK--ELKKLKAAQKEAKAKLQAQQAS 53
>gnl|CDD|215583 PLN03112, PLN03112, cytochrome P450 family protein; Provisional.
Length = 514
Score = 39.0 bits (91), Expect = 2e-04
Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 3/71 (4%)
Query: 5 HPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAP-IIGRHITSE 63
+P++ +++ +E+ V+G + + L+ L V++ET R++PA P +I
Sbjct: 326 NPRVLRKIQEELDSVVG--RNRMVQESDLVHLNYLRCVVRETFRMHPAGPFLIPHESLRA 383
Query: 64 IQADKYTIPIG 74
+ Y IP
Sbjct: 384 TTINGYYIPAK 394
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 38.8 bits (91), Expect = 3e-04
Identities = 17/48 (35%), Positives = 28/48 (58%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++KK +KKQ K +KK +K++ +K KK+ + KK K + KK
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455
Score = 34.1 bits (79), Expect = 0.011
Identities = 15/49 (30%), Positives = 25/49 (51%), Gaps = 3/49 (6%)
Query: 76 KENKK--KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
E KK KKQ+K + KK +K++ +K +K + K K + K+
Sbjct: 409 AERKKLRKKQRKAE-KKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456
Score = 30.7 bits (70), Expect = 0.18
Identities = 11/42 (26%), Positives = 22/42 (52%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKK 116
RK+ +K ++K ++ + +K KKK + K + + KK
Sbjct: 415 RKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 38.4 bits (90), Expect = 3e-04
Identities = 21/46 (45%), Positives = 35/46 (76%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
NK+ ++ K++ K K+K++++K QK KK+KK KKK KK++KK +K
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 36.9 bits (86), Expect = 0.001
Identities = 17/42 (40%), Positives = 31/42 (73%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
+E K+K K + K+++K+ +K++KK+K KK+ K +KKK +K
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 35.7 bits (83), Expect = 0.003
Identities = 19/49 (38%), Positives = 34/49 (69%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KE K+ ++ + K K KKK+++K+ K++KK K+KKK K++K+K +
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 34.6 bits (80), Expect = 0.007
Identities = 18/40 (45%), Positives = 29/40 (72%)
Query: 73 IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKK 112
I K K K+K+++ K QK++KKKK+KK+ K++KK +K
Sbjct: 375 IKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 32.6 bits (75), Expect = 0.033
Identities = 17/36 (47%), Positives = 26/36 (72%)
Query: 68 KYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQK 103
KY P +K +KK QK+K+ KK+KK+ KK++KK +
Sbjct: 378 KYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 28.8 bits (65), Expect = 0.69
Identities = 14/32 (43%), Positives = 25/32 (78%)
Query: 72 PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQK 103
P ++E KK +++KK+ K++KK KK+K+K +K
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 37.7 bits (88), Expect = 6e-04
Identities = 13/43 (30%), Positives = 30/43 (69%)
Query: 82 KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K K+ +++K++K+ K+QK ++ K +++KKKK+ ++ +K
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 34.2 bits (79), Expect = 0.010
Identities = 14/41 (34%), Positives = 32/41 (78%)
Query: 82 KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
K++ ++ K++K+ K++++ +K+KKQ++ KKK+ +K +K K
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 33.1 bits (76), Expect = 0.027
Identities = 13/40 (32%), Positives = 30/40 (75%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
+E +++K++K+ K+QK+ +K K++++K KK+ + +K K
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 30.8 bits (70), Expect = 0.17
Identities = 11/37 (29%), Positives = 29/37 (78%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTK 111
++E ++K+ K+Q + +K +K++++KK++++K +K K
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 29.2 bits (66), Expect = 0.53
Identities = 13/44 (29%), Positives = 29/44 (65%)
Query: 65 QADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQK 108
DK + ++E + K++K+ K +K+++KKK++ +K++K K
Sbjct: 547 LDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 36.8 bits (86), Expect = 0.001
Identities = 15/47 (31%), Positives = 33/47 (70%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KK KK + +K +KK++++K++ KK+ KKK+++++++K+K
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450
Score = 35.7 bits (83), Expect = 0.003
Identities = 13/46 (28%), Positives = 37/46 (80%)
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
+ KKK+++++ +K+KK+++K++++++ +++K+ +++KKKKQ
Sbjct: 432 KKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477
Score = 34.9 bits (81), Expect = 0.006
Identities = 15/53 (28%), Positives = 38/53 (71%), Gaps = 1/53 (1%)
Query: 73 IGRKENKKKKQKKKQTKKQKKQKK-KKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
G K+ KK +K + ++K++++ K++KK+ +KK ++++++K+KK+++K
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Score = 34.1 bits (79), Expect = 0.011
Identities = 13/49 (26%), Positives = 38/49 (77%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K+ KK +K ++ ++++K++KKK+ KK+++ +++K+KK+++++++
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458
Score = 32.2 bits (74), Expect = 0.048
Identities = 13/49 (26%), Positives = 39/49 (79%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K+ +K +KK++ +K++K+KK K+K +++++ K+KK++++++++++
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Score = 31.8 bits (73), Expect = 0.069
Identities = 16/52 (30%), Positives = 40/52 (76%)
Query: 73 IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
I K KK++++KK+ KK+ KKK+++++ +K+KK ++K++++++ +++K
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
Score = 30.3 bits (69), Expect = 0.23
Identities = 9/50 (18%), Positives = 40/50 (80%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K+ +++K++KK+ K+K+++++++K KK+++ ++++++ +++++++
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469
Score = 29.9 bits (68), Expect = 0.26
Identities = 12/50 (24%), Positives = 42/50 (84%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
R+E KK+K+KK K+K+++++++K++K +++++ +++ +++++++++K
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 36.7 bits (86), Expect = 0.001
Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
KE K++ + KK+ K K QKK++ K++ K K K K
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKE-KQEAKAAKAKSKP 90
Score = 36.0 bits (84), Expect = 0.002
Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
++ K+ K++ K KK+ K K QK KK+K+ K K K K
Sbjct: 47 EQYKEMKEELKAALLDKKELKAWHKAQK-KKEKQEAKAAKAKSKP 90
Score = 34.8 bits (81), Expect = 0.006
Identities = 14/41 (34%), Positives = 22/41 (53%), Gaps = 2/41 (4%)
Query: 86 KQTKKQKKQKK--KKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K+ K++ K KK+ K K QKK +K++ K K + K
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKP 90
Score = 31.3 bits (72), Expect = 0.080
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 81 KKQKKKQTKKQKKQKKKKQKKQKMKKQKK 109
KK+ K K QKK++K++ K K K + +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 30.6 bits (70), Expect = 0.16
Identities = 14/46 (30%), Positives = 26/46 (56%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++KK +K + + ++ K+ K+++K KK+ K K QKKK
Sbjct: 31 QRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKK 76
Score = 29.0 bits (66), Expect = 0.54
Identities = 9/28 (32%), Positives = 15/28 (53%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKK 106
K+ K K KK++KQ+ K K + +
Sbjct: 64 KELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 26.3 bits (59), Expect = 4.1
Identities = 9/24 (37%), Positives = 14/24 (58%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKK 98
+ +K +K+K+KQ K K K K
Sbjct: 67 KAWHKAQKKKEKQEAKAAKAKSKP 90
Score = 25.6 bits (57), Expect = 8.7
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 75 RKENKKKKQKKKQTKKQKKQK 95
+ + KK+KQ+ K K + K +
Sbjct: 71 KAQKKKEKQEAKAAKAKSKPR 91
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 36.3 bits (84), Expect = 0.002
Identities = 14/43 (32%), Positives = 20/43 (46%)
Query: 82 KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ +KK K K+ +K KK K QK K K+ K+K
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVA 812
Score = 35.9 bits (83), Expect = 0.003
Identities = 15/42 (35%), Positives = 20/42 (47%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
+KK + K K +K KK K QK+ K K+ KKK
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVA 812
Score = 35.5 bits (82), Expect = 0.004
Identities = 15/42 (35%), Positives = 19/42 (45%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKK 116
++ K K KK +K KK K QK K K+ KKK
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVA 812
Score = 33.9 bits (78), Expect = 0.014
Identities = 12/42 (28%), Positives = 21/42 (50%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
+ +K+ K + K+ +K KK + K TK K+ K+K
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811
Score = 32.4 bits (74), Expect = 0.039
Identities = 16/57 (28%), Positives = 26/57 (45%)
Query: 67 DKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+K K+ K++Q K+ + + +KK K K KK +K KK K +K
Sbjct: 740 EKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQK 796
Score = 31.3 bits (71), Expect = 0.094
Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 8/58 (13%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQK--------MKKQKKTKKKKKKKQKKQKKK 124
R K ++K K+ KK K++Q +K + +KK K K KK ++ KK
Sbjct: 732 RNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKK 789
Score = 30.5 bits (69), Expect = 0.19
Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 11/62 (17%)
Query: 74 GRKENKKKKQKKKQT----------KKQKKQKKKKQKKQKMKKQKKTKKKKKK-KQKKQK 122
G+K K+++ KK KK K + KK ++ K KT+K K K+
Sbjct: 748 GKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAA 807
Query: 123 KK 124
KK
Sbjct: 808 KK 809
Score = 29.3 bits (66), Expect = 0.53
Identities = 14/58 (24%), Positives = 25/58 (43%), Gaps = 9/58 (15%)
Query: 76 KENKKKKQKKKQTKKQKKQKKK---------KQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+E KK K+ K+++ KK + +K+ K K +K KK + +K
Sbjct: 739 REKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQK 796
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 35.2 bits (82), Expect = 0.002
Identities = 16/47 (34%), Positives = 26/47 (55%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
K+ K+ + KK+ K +K +KK + K K KKK KK++ K +
Sbjct: 52 KDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 32.9 bits (76), Expect = 0.017
Identities = 12/33 (36%), Positives = 21/33 (63%)
Query: 92 KKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KK+ K +K +K ++ K K +KKK +K++ K
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 27.1 bits (61), Expect = 1.7
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQK 100
+ E K +K K K KK+ K+++ K +
Sbjct: 73 KAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 27.1 bits (61), Expect = 1.7
Identities = 8/29 (27%), Positives = 17/29 (58%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQK 103
+ K +K+ +K K +K+K KK++ +
Sbjct: 69 KAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 34.7 bits (80), Expect = 0.006
Identities = 18/57 (31%), Positives = 32/57 (56%), Gaps = 9/57 (15%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQ---------KMKKQKKTKKKKKKKQKKQKKK 124
EN+ K+QK++ +K K+K KK KK ++ + + + K+ +KK+KKK
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKK 63
Score = 28.5 bits (64), Expect = 0.79
Identities = 25/69 (36%), Positives = 34/69 (49%), Gaps = 20/69 (28%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQ--------------------KKTKKKKK 115
N+ + + KQ K+Q KQK K+K +K KK K+ KKKK
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60
Query: 116 KKQKKQKKK 124
KK+KK+KKK
Sbjct: 61 KKKKKKKKK 69
Score = 26.6 bits (59), Expect = 4.2
Identities = 11/47 (23%), Positives = 18/47 (38%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ + ++ KQ +KKKK+KK+K KK
Sbjct: 41 AELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSS 87
Score = 25.8 bits (57), Expect = 7.0
Identities = 11/45 (24%), Positives = 19/45 (42%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
++ + K+ KK+KK+KKKK+KK +
Sbjct: 44 ISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSA 88
Score = 25.8 bits (57), Expect = 7.1
Identities = 11/45 (24%), Positives = 22/45 (48%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
++ ++ + KQ K+KK+KKKK+KK+ + +
Sbjct: 44 ISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSA 88
>gnl|CDD|165828 PLN02183, PLN02183, ferulate 5-hydroxylase.
Length = 516
Score = 34.8 bits (80), Expect = 0.006
Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 13/84 (15%)
Query: 6 PKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHITSEIQ 65
P+ + V E+ V+G + V D +++L L +KETLRL+P P++ + +
Sbjct: 335 PEDLKRVQQELADVVGLN-RRVEESD-LEKLTYLKCTLKETLRLHPPIPLLLHETAEDAE 392
Query: 66 ADKYTIP-----------IGRKEN 78
Y IP IGR +N
Sbjct: 393 VAGYFIPKRSRVMINAWAIGRDKN 416
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 34.8 bits (81), Expect = 0.007
Identities = 12/47 (25%), Positives = 20/47 (42%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
E +K +K + K KK +K +KTK++ +K K
Sbjct: 812 ELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 33.2 bits (77), Expect = 0.020
Identities = 9/43 (20%), Positives = 15/43 (34%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
K K + K +K + +K K+Q +K K
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 30.9 bits (71), Expect = 0.15
Identities = 9/45 (20%), Positives = 15/45 (33%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
K +K + K KK + +K K+ +K K
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 30.5 bits (70), Expect = 0.21
Identities = 8/36 (22%), Positives = 13/36 (36%)
Query: 89 KKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+K + K KK +K K+Q +K
Sbjct: 819 EKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854
Score = 28.2 bits (64), Expect = 1.2
Identities = 11/48 (22%), Positives = 18/48 (37%)
Query: 63 EIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKT 110
E+ A+K + K K+ + +K K+Q K K T
Sbjct: 812 ELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKAT 859
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 34.3 bits (79), Expect = 0.008
Identities = 20/48 (41%), Positives = 31/48 (64%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
KE+K + KKK KK+KK+K+KK+ K + + + K KK K+ K+K
Sbjct: 59 KEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKP 106
Score = 34.3 bits (79), Expect = 0.010
Identities = 16/50 (32%), Positives = 26/50 (52%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E + K K +K+ K + KK+ KK+KK K+KK+ K + + K
Sbjct: 42 FSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91
Score = 33.9 bits (78), Expect = 0.012
Identities = 21/51 (41%), Positives = 29/51 (56%)
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E+KKK +KKK+ KK+KK+ K + + + K K KK KKK K K
Sbjct: 62 KNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPN 112
Score = 32.0 bits (73), Expect = 0.051
Identities = 16/49 (32%), Positives = 28/49 (57%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+++ K T +K +K+ K + K K +KK KKKK+KK+ K + +
Sbjct: 40 STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGE 88
Score = 32.0 bits (73), Expect = 0.059
Identities = 16/49 (32%), Positives = 25/49 (51%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K +K K+ K + KKK +KK+K KK+KK K + + + K
Sbjct: 48 KVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTP 96
Score = 30.8 bits (70), Expect = 0.13
Identities = 16/45 (35%), Positives = 23/45 (51%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
E KKKK+K+K+ K + + K K K K+ K K K K +
Sbjct: 71 EKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDV 115
Score = 29.7 bits (67), Expect = 0.33
Identities = 15/44 (34%), Positives = 25/44 (56%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
++ KKKK++KK+ K + + K + +K KK KK K K +
Sbjct: 71 EKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114
Score = 27.4 bits (61), Expect = 2.1
Identities = 15/49 (30%), Positives = 21/49 (42%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KE K + + K K KK KK K+K K K + K + +K
Sbjct: 81 KEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKS 129
Score = 26.6 bits (59), Expect = 4.0
Identities = 12/38 (31%), Positives = 19/38 (50%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKK 112
+KE K+ K + + K KK K+ K+K K K +
Sbjct: 77 KKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 33.9 bits (78), Expect = 0.011
Identities = 13/39 (33%), Positives = 20/39 (51%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
E K KQK++ +++K+ K+ QK KKKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 31.2 bits (71), Expect = 0.085
Identities = 12/40 (30%), Positives = 21/40 (52%), Gaps = 2/40 (5%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKK 114
RK+ K+K++ + +K+ K+ K + TKKKK
Sbjct: 159 RKDAKQKEEFAAEERKEALAAAAKKSATPQKVE--TKKKK 196
Score = 25.4 bits (56), Expect = 8.3
Identities = 9/38 (23%), Positives = 22/38 (57%)
Query: 85 KKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
+K K++++ +++K+ KK+ +K + KK+K
Sbjct: 159 RKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 33.8 bits (78), Expect = 0.011
Identities = 9/39 (23%), Positives = 20/39 (51%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
E K K ++ ++ + ++++ + +KK KKKK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 30.4 bits (69), Expect = 0.15
Identities = 8/38 (21%), Positives = 18/38 (47%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKK 112
K+ K ++ + + K+++ + KK+ K KK
Sbjct: 159 WKDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 28.8 bits (65), Expect = 0.51
Identities = 6/35 (17%), Positives = 16/35 (45%)
Query: 90 KQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K K ++ ++ +++ + +KK+ KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKK 194
Score = 26.5 bits (59), Expect = 3.3
Identities = 7/36 (19%), Positives = 18/36 (50%)
Query: 89 KKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K K ++ ++ + K ++ + ++K+ KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195
Score = 25.4 bits (56), Expect = 8.0
Identities = 6/35 (17%), Positives = 18/35 (51%)
Query: 89 KKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
K ++ ++ + +++ + +KK+ K+KK
Sbjct: 162 AKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|178773 PLN03234, PLN03234, cytochrome P450 83B1; Provisional.
Length = 499
Score = 33.9 bits (77), Expect = 0.012
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 2 LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPI-IGRHI 60
L +P+ ++ DE+ V+GD + + + I L L VIKE+LRL P PI + R
Sbjct: 315 LIKYPEAMKKAQDEVRNVIGD--KGYVSEEDIPNLPYLKAVIKESLRLEPVIPILLHRET 372
Query: 61 TSEIQADKYTIP 72
++ + Y IP
Sbjct: 373 IADAKIGGYDIP 384
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 34.0 bits (78), Expect = 0.013
Identities = 24/75 (32%), Positives = 30/75 (40%), Gaps = 12/75 (16%)
Query: 50 YPAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKK 109
YPA +IG I + I KY K KK K +KK K K K K
Sbjct: 722 YPACSVIGNSIDAVIT--KYA----------GTPKTPYEKKTKAKKKSASTKGKAAKTVK 769
Query: 110 TKKKKKKKQKKQKKK 124
K K K K+ +K+
Sbjct: 770 KKSKAKSKKTTKKRA 784
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 33.8 bits (78), Expect = 0.014
Identities = 14/50 (28%), Positives = 35/50 (70%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K ++++Q++ Q KK++K+K++++ K ++ +K ++K++KKQ +K
Sbjct: 273 LKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
>gnl|CDD|177725 PLN00110, PLN00110, flavonoid 3',5'-hydroxylase (F3'5'H);
Provisional.
Length = 504
Score = 33.3 bits (76), Expect = 0.018
Identities = 19/68 (27%), Positives = 36/68 (52%), Gaps = 3/68 (4%)
Query: 6 PKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPI-IGRHITSEI 64
P I + ++E+ QV+G + V + + +L L + KE+ R +P+ P+ + R T
Sbjct: 320 PSILKRAHEEMDQVIGRNRRLVES--DLPKLPYLQAICKESFRKHPSTPLNLPRVSTQAC 377
Query: 65 QADKYTIP 72
+ + Y IP
Sbjct: 378 EVNGYYIP 385
>gnl|CDD|177847 PLN02196, PLN02196, abscisic acid 8'-hydroxylase.
Length = 463
Score = 33.0 bits (75), Expect = 0.025
Identities = 22/76 (28%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 2 LALHPKIQQEVYDEIVQVLGDDPE-TVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHI 60
LA +P + + V +E + + D E T++ +++ L +RVI+ETLR+ R
Sbjct: 291 LAENPSVLEAVTEEQMAIRKDKEEGESLTWEDTKKMPLTSRVIQETLRVASILSFTFREA 350
Query: 61 TSEIQADKYTIPIGRK 76
+++ + Y IP G K
Sbjct: 351 VEDVEYEGYLIPKGWK 366
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 32.5 bits (75), Expect = 0.037
Identities = 12/49 (24%), Positives = 28/49 (57%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++K++ +K + K+++ +K K+ + MKK + KKK+ K ++
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214
Score = 31.7 bits (73), Expect = 0.073
Identities = 13/48 (27%), Positives = 28/48 (58%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E K++ ++ + QK+++ +K K+ ++ KK + +KKK+ K K
Sbjct: 164 EIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPK 211
Score = 30.2 bits (69), Expect = 0.29
Identities = 14/69 (20%), Positives = 31/69 (44%)
Query: 54 PIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
I+ + ++ +KE + +K K+ + KK + +K+K+ K K+ +
Sbjct: 157 GILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQ 216
Query: 114 KKKKQKKQK 122
KK K++
Sbjct: 217 IGKKIDKEE 225
Score = 26.3 bits (59), Expect = 4.5
Identities = 13/50 (26%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Query: 76 KENKKKKQKKKQTKKQKKQ--KKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+ K + QK+++ +K K+ + K+ + + KKQ K K+ + KK
Sbjct: 171 ELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKK 220
>gnl|CDD|178550 PLN02966, PLN02966, cytochrome P450 83A1.
Length = 502
Score = 32.4 bits (73), Expect = 0.037
Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 1/78 (1%)
Query: 2 LALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAP-IIGRHI 60
L +P++ ++ E+ + + + T T D ++ L ++KETLR+ P P +I R
Sbjct: 316 LMKYPQVLKKAQAEVREYMKEKGSTFVTEDDVKNLPYFRALVKETLRIEPVIPLLIPRAC 375
Query: 61 TSEIQADKYTIPIGRKEN 78
+ + Y IP G N
Sbjct: 376 IQDTKIAGYDIPAGTTVN 393
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 32.0 bits (73), Expect = 0.059
Identities = 14/48 (29%), Positives = 29/48 (60%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E + +K K+K K K K ++ KK++ + ++ KK+KK K +++ +
Sbjct: 347 EKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394
Score = 30.1 bits (68), Expect = 0.23
Identities = 14/50 (28%), Positives = 30/50 (60%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
R+E +K+ +K K+ + K K ++ +K + + +KKK+KK K +++
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRG 392
Score = 30.1 bits (68), Expect = 0.26
Identities = 14/64 (21%), Positives = 31/64 (48%)
Query: 59 HITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQ 118
+ E + ++ K K++ K K K ++ KK++ + +++KK KK K +++
Sbjct: 332 AFSGEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERR 391
Query: 119 KKQK 122
Q
Sbjct: 392 GLQN 395
Score = 27.4 bits (61), Expect = 2.1
Identities = 14/57 (24%), Positives = 31/57 (54%)
Query: 68 KYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
I +E +K+ +K + K K K K ++ K ++ + ++KKK+K+ K +++
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERR 391
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 31.7 bits (72), Expect = 0.064
Identities = 13/50 (26%), Positives = 32/50 (64%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E K + ++ + + K++K KQ++ + K++++ + K+K+QK ++KK
Sbjct: 385 LGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 25.9 bits (57), Expect = 8.0
Identities = 9/46 (19%), Positives = 29/46 (63%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KK + K +T ++ + K ++++ +++ + K+K++ + K+K++
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQ 428
>gnl|CDD|235047 PRK02509, PRK02509, hypothetical protein; Provisional.
Length = 973
Score = 31.5 bits (72), Expect = 0.078
Identities = 25/107 (23%), Positives = 40/107 (37%), Gaps = 35/107 (32%)
Query: 6 PKIQQ------EVYD-EIVQVLGDDPETVPTYDQIQELHLLTRVIK--------ET---- 46
P IQ+ +D + ++V DP+ T + L R I+ ET
Sbjct: 400 PYIQRSIALTRAAFDLDKIEVRTFDPQGNLTAADLAANRLTIRNIRLWDTRPLLETNRQL 459
Query: 47 --LRLY---PAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQT 88
+RLY P A I D+YT+ +N+ +K+Q
Sbjct: 460 QQIRLYYRFPDADI-----------DRYTLKTENDDNRSISTEKQQV 495
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 31.4 bits (72), Expect = 0.082
Identities = 10/45 (22%), Positives = 28/45 (62%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
++ +Q+ K+ + +++K ++++K K + KK+K + QK + +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQ 157
Score = 31.0 bits (71), Expect = 0.095
Identities = 12/49 (24%), Positives = 29/49 (59%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K + ++ ++Q KK++ +K++ ++ +K K +KK+K + QK +
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155
Score = 30.7 bits (70), Expect = 0.11
Identities = 12/49 (24%), Positives = 31/49 (63%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+E K K Q ++ ++ KK++ +K++ ++++K K K +K++K + +
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152
Score = 30.7 bits (70), Expect = 0.12
Identities = 11/48 (22%), Positives = 27/48 (56%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
R E + KK++ K++ ++++K K + +K++K + +K + Q
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPV 161
Score = 27.6 bits (62), Expect = 1.5
Identities = 8/49 (16%), Positives = 27/49 (55%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ + ++Q KK+ +K+K +++++ K + K+K + ++ + +
Sbjct: 111 QAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159
Score = 27.6 bits (62), Expect = 1.7
Identities = 11/49 (22%), Positives = 23/49 (46%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E +K +++++ K + +KK+K + QK + Q + Q K
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVK 174
Score = 26.4 bits (59), Expect = 3.6
Identities = 9/48 (18%), Positives = 30/48 (62%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E+ +K+ ++ + K Q ++ +++ KK++ +K+ ++++K K + +
Sbjct: 97 EHARKQLEEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPR 144
Score = 26.0 bits (58), Expect = 4.7
Identities = 7/50 (14%), Positives = 23/50 (46%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
R+ +K++ ++ +K K + +K++K + +K + + +
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQ 171
Score = 25.7 bits (57), Expect = 6.8
Identities = 8/48 (16%), Positives = 21/48 (43%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+K+K +++ K + K +KK+K + K + + + +
Sbjct: 125 AGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQA 172
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 31.3 bits (72), Expect = 0.085
Identities = 9/56 (16%), Positives = 32/56 (57%), Gaps = 4/56 (7%)
Query: 73 IGRKENKKKKQKKK----QTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
I E +++ ++K + ++ +K K++ ++K +K ++ + K ++ +K+ ++
Sbjct: 522 IASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQ 577
Score = 28.3 bits (64), Expect = 1.0
Identities = 13/46 (28%), Positives = 29/46 (63%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K+ +Q +K K + + ++++ K + K+KKKKKQK+++++
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636
Score = 27.5 bits (62), Expect = 2.3
Identities = 13/50 (26%), Positives = 29/50 (58%)
Query: 73 IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
I ++ + +K K + + +K+ + +K++K KKK+K+KQ++ K
Sbjct: 589 IIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
Score = 26.7 bits (60), Expect = 4.0
Identities = 13/42 (30%), Positives = 24/42 (57%)
Query: 83 QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
QK + + + +K+ +KK KKKKK+K+K+++ K
Sbjct: 597 QKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 31.2 bits (71), Expect = 0.086
Identities = 24/42 (57%), Positives = 32/42 (76%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
+E KK+K+KKK+ KK+KK+KK K++K K K KKKKKKK
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 26.2 bits (58), Expect = 4.9
Identities = 24/75 (32%), Positives = 45/75 (60%)
Query: 50 YPAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKK 109
Y A + SE + + ++ + ++++K+ KK+KK+ KK++K++K KK+K
Sbjct: 118 YGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177
Query: 110 TKKKKKKKQKKQKKK 124
+ K KK+KK+KKK
Sbjct: 178 VEPKGSKKKKKKKKK 192
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 31.6 bits (72), Expect = 0.088
Identities = 15/50 (30%), Positives = 24/50 (48%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E K+ ++++ K K + K +K + KK+KK KK K KK
Sbjct: 1146 EVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVV 1195
Score = 30.4 bits (69), Expect = 0.20
Identities = 19/48 (39%), Positives = 27/48 (56%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E ++K+ K+ + + K K K K +K K +KK KKKKK K KK
Sbjct: 1145 EEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKA 1192
Score = 29.6 bits (67), Expect = 0.40
Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 75 RKENKKKKQKKKQTKKQ-KKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++ K+++ K K K K +K K +KK+K KK+ K KK K+
Sbjct: 1150 KEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKR 1200
Score = 29.2 bits (66), Expect = 0.48
Identities = 14/49 (28%), Positives = 23/49 (46%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
++ K+Q+ K K K K +K K +K +K+KK K K+
Sbjct: 1148 EEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVG 1196
Score = 29.2 bits (66), Expect = 0.58
Identities = 17/48 (35%), Positives = 30/48 (62%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E +++ ++K+ K+Q+ + K K K K++K K KK+KKKK+ K
Sbjct: 1142 EEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKS 1189
Score = 28.1 bits (63), Expect = 1.3
Identities = 12/62 (19%), Positives = 25/62 (40%)
Query: 60 ITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
+ + KKK +K+ + +KKKK +K+ +K+K + K+
Sbjct: 1288 PSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASAS 1347
Query: 120 KQ 121
+
Sbjct: 1348 QS 1349
Score = 25.4 bits (56), Expect = 9.9
Identities = 15/47 (31%), Positives = 22/47 (46%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+K+ + K KKK KK+ KKKKK ++K +KK
Sbjct: 1289 SKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKK 1335
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 30.6 bits (69), Expect = 0.15
Identities = 16/92 (17%), Positives = 35/92 (38%), Gaps = 8/92 (8%)
Query: 41 RVIKETLRLYPAAPIIGRHITSEIQADKYTIPIGRKENKKK--------KQKKKQTKKQK 92
+ TLR II E+ A+ I R K + QK++ +++
Sbjct: 302 KAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREI 361
Query: 93 KQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++ ++++ M + + + K K+K
Sbjct: 362 NREARQERAAAMARARARRAAVKAKKKGLIDA 393
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 30.1 bits (68), Expect = 0.22
Identities = 14/49 (28%), Positives = 23/49 (46%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ + + K K++ K ++K +K K + K K K K K K Q K
Sbjct: 69 PPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKP 117
Score = 27.1 bits (60), Expect = 2.7
Identities = 12/49 (24%), Positives = 22/49 (44%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ ++Q K T+ + + K ++ K +K KK K K K + K
Sbjct: 59 EPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKP 107
Score = 26.7 bits (59), Expect = 3.7
Identities = 16/49 (32%), Positives = 23/49 (46%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E + K +K K +KK K+ K K K + K K K K + K +K
Sbjct: 73 PETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPP 121
Score = 25.5 bits (56), Expect = 8.5
Identities = 19/53 (35%), Positives = 24/53 (45%)
Query: 70 TIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
P K +K K +KK K + K K K + K K+K Q K KK K K
Sbjct: 77 PEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAP 129
Score = 25.5 bits (56), Expect = 9.4
Identities = 14/50 (28%), Positives = 21/50 (42%)
Query: 71 IPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
P + + +KK +K K K K + K K K Q K+ +K K
Sbjct: 83 KPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPAAP 132
Score = 25.5 bits (56), Expect = 9.6
Identities = 15/46 (32%), Positives = 21/46 (45%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K K++ K + K +K + K K K + K K K KK K K
Sbjct: 83 KPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKA 128
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 30.1 bits (69), Expect = 0.27
Identities = 10/50 (20%), Positives = 35/50 (70%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++EN +K + + ++++ +KK+K+ +QK ++ +K +++ ++ ++Q ++
Sbjct: 94 KEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
Score = 27.0 bits (61), Expect = 2.5
Identities = 10/57 (17%), Positives = 35/57 (61%), Gaps = 6/57 (10%)
Query: 74 GRKENKKKKQ------KKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++E K + ++++ + QK +K+ QK++ + ++ + +K++++ +K++K+
Sbjct: 62 AKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKE 118
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 29.6 bits (67), Expect = 0.28
Identities = 16/45 (35%), Positives = 28/45 (62%)
Query: 80 KKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++ +KK+ K+ + + K+K +++KK KKKK K+KK KK
Sbjct: 154 ERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKV 198
Score = 27.3 bits (61), Expect = 1.8
Identities = 16/42 (38%), Positives = 23/42 (54%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
KE + + KKK ++++KKKK+K K KK KK K
Sbjct: 164 KEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
>gnl|CDD|216872 pfam02096, 60KD_IMP, 60Kd inner membrane protein.
Length = 193
Score = 29.8 bits (68), Expect = 0.28
Identities = 10/38 (26%), Positives = 21/38 (55%), Gaps = 2/38 (5%)
Query: 86 KQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
KQ K K ++ + + +++ Q+K K +K Q++ K
Sbjct: 24 KQYKSMAKMQELQPEIKEI--QEKYKDDPQKLQQEMMK 59
>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
Length = 381
Score = 30.0 bits (68), Expect = 0.28
Identities = 11/38 (28%), Positives = 19/38 (50%)
Query: 81 KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQ 118
KK +K Q+K K +K K ++ +K KK + +
Sbjct: 333 KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAK 370
Score = 28.1 bits (63), Expect = 1.3
Identities = 12/47 (25%), Positives = 20/47 (42%), Gaps = 1/47 (2%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKK-QKKQKMKKQKKTKKKKKKKQKK 120
++ Q+K K +K K ++ +KK K + K K K K
Sbjct: 335 LRKLAVLAQEKATAKGAQKVKNRRARKKAKKARLAKVAKALYKSIKA 381
Score = 27.7 bits (62), Expect = 1.6
Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 15/61 (24%)
Query: 79 KKKKQKKKQTKK---------------QKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
KKK K++ KK KK +K Q+ K +K K ++ +K+ K
Sbjct: 305 KKKPAKRRVQKKNPLKNRRVRLRLNPAAKKLRKLAVLAQEKATAKGAQKVKNRRARKKAK 364
Query: 124 K 124
K
Sbjct: 365 K 365
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 29.9 bits (68), Expect = 0.28
Identities = 11/54 (20%), Positives = 29/54 (53%), Gaps = 5/54 (9%)
Query: 72 PIGRKENKKKKQKKKQTKKQKKQK-----KKKQKKQKMKKQKKTKKKKKKKQKK 120
P+ E + +++ +++ K+ + +K +KK ++K KK K K ++ ++
Sbjct: 62 PLEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEE 115
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 30.0 bits (68), Expect = 0.28
Identities = 16/54 (29%), Positives = 31/54 (57%)
Query: 71 IPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
P+ + + KK+K++ + + + + +KK+K K++ K KK K+ K K KK
Sbjct: 221 KPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKK 274
Score = 28.8 bits (65), Expect = 0.54
Identities = 19/71 (26%), Positives = 33/71 (46%), Gaps = 1/71 (1%)
Query: 55 IIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKK-QKKQKKKKQKKQKMKKQKKTKKK 113
+ I DKY +K+ + ++ + + +KK+K K++ K+K K+ K K
Sbjct: 212 LKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKA 271
Query: 114 KKKKQKKQKKK 124
KK K KK
Sbjct: 272 LKKVVAKGMKK 282
Score = 25.8 bits (57), Expect = 8.2
Identities = 11/53 (20%), Positives = 24/53 (45%)
Query: 54 PIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKK 106
+ T E + ++++K++ +KKK + + + KK + MKK
Sbjct: 230 SKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKK 282
>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 137 to 196
amino acids in length.
Length = 149
Score = 29.2 bits (66), Expect = 0.34
Identities = 10/47 (21%), Positives = 20/47 (42%), Gaps = 2/47 (4%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+++ K+ + KK K + K+K K+KK + K+
Sbjct: 1 MASSRKRLPFEPKRSR--KKGVKALRKAAVAKSKDKQKKPKSKRAAS 45
Score = 26.1 bits (58), Expect = 3.7
Identities = 10/39 (25%), Positives = 17/39 (43%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
R + K+ +KK K +K K K ++ K + K
Sbjct: 7 RLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAAS 45
Score = 26.1 bits (58), Expect = 4.8
Identities = 9/49 (18%), Positives = 19/49 (38%), Gaps = 1/49 (2%)
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
K+ + K+ ++K K ++ K K+ K K K+ +
Sbjct: 1 MASSRKRLPFEPKR-SRKKGVKALRKAAVAKSKDKQKKPKSKRAASESG 48
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 29.8 bits (67), Expect = 0.37
Identities = 7/46 (15%), Positives = 21/46 (45%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+K+ + ++ +K ++ + + K ++ KK K+ +K
Sbjct: 1169 DKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKA 1214
Score = 27.5 bits (61), Expect = 2.0
Identities = 11/38 (28%), Positives = 17/38 (44%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKK 114
E ++K Q+ + KK ++ K KKT KK
Sbjct: 1177 EAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKA 1214
Score = 27.1 bits (60), Expect = 2.5
Identities = 11/53 (20%), Positives = 21/53 (39%)
Query: 63 EIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
E + DK + E ++K ++ + + KK ++ K K KK
Sbjct: 1162 EKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKA 1214
Score = 26.7 bits (59), Expect = 3.7
Identities = 8/35 (22%), Positives = 16/35 (45%)
Query: 90 KQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K ++ ++K Q+ + KK ++ KK K
Sbjct: 1174 KAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPK 1208
Score = 26.7 bits (59), Expect = 4.1
Identities = 9/45 (20%), Positives = 21/45 (46%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
+++ K ++ ++K + + + KK + KK KK K+
Sbjct: 1170 KEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKA 1214
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 29.4 bits (66), Expect = 0.38
Identities = 14/46 (30%), Positives = 25/46 (54%), Gaps = 1/46 (2%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQ-KKQKMKKQKKTKKKKKKKQKKQKK 123
K++K+ K KK K K KK KK ++ +KK + + + ++K
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKA 152
Score = 28.6 bits (64), Expect = 0.75
Identities = 14/58 (24%), Positives = 24/58 (41%)
Query: 60 ITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
T K+ KE K +K K +K +K KK + +K + + + +KK
Sbjct: 94 FTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKK 151
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 29.7 bits (66), Expect = 0.42
Identities = 11/59 (18%), Positives = 38/59 (64%)
Query: 66 ADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ + +P+ + +K + K + K++ K +++K++ + +++ K+ + KK+K+++Q+ +
Sbjct: 1000 SRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIR 1058
Score = 28.6 bits (63), Expect = 0.84
Identities = 13/47 (27%), Positives = 27/47 (57%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
KE + Q+ K+ + KK+K+++Q+ +K + KK+ +KK+
Sbjct: 1031 EKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077
Score = 28.6 bits (63), Expect = 1.0
Identities = 11/51 (21%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 76 KENKKKKQKKKQTKKQKKQ--KKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K+ ++K++ + + ++++ KK+K+++Q+++K K+ K++ +KK+
Sbjct: 1027 KDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077
Score = 28.2 bits (62), Expect = 1.3
Identities = 12/57 (21%), Positives = 35/57 (61%)
Query: 68 KYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K I R +++++K++ + ++ K+++ K++K++ ++ +KT K+ K++ K
Sbjct: 1018 KDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLK 1074
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 29.6 bits (66), Expect = 0.47
Identities = 24/92 (26%), Positives = 45/92 (48%), Gaps = 10/92 (10%)
Query: 29 TYDQIQELHLLTRVIKET-LRLYPAAPIIGRHIT------SEIQADKYTIPIGRKENKKK 81
+D ++ H + + L I RHI+ SE+ D+Y PI + N+ +
Sbjct: 63 VFDDVETKHSRNQSTMASHLNDTQDVGIHARHISSVSTVDSELN-DEYGNPIWK--NRVE 119
Query: 82 KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
K K+ KK+K KKK+ K ++ +++ ++K
Sbjct: 120 SWKDKKNKKKKSAKKKEAHKAQIPPEQQMEEK 151
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 29.1 bits (65), Expect = 0.48
Identities = 18/53 (33%), Positives = 24/53 (45%)
Query: 72 PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+K +KKK K + K+ + K K+ K KK KK K KK KK
Sbjct: 221 SAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKK 273
Score = 28.3 bits (63), Expect = 1.0
Identities = 19/49 (38%), Positives = 21/49 (42%), Gaps = 1/49 (2%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+K K KK K K KK +K K K KK K KK KK K
Sbjct: 229 KKAAKTAVSAKKAAKTAAKAAKKAKKTAK-KALKKAAKAVKKAAKKAAK 276
Score = 27.9 bits (62), Expect = 1.4
Identities = 18/50 (36%), Positives = 23/50 (46%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+K + K K K+ K+K K K+ K K KK KK KK KK
Sbjct: 213 KKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKK 262
Score = 27.9 bits (62), Expect = 1.6
Identities = 20/59 (33%), Positives = 22/59 (37%)
Query: 66 ADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
A K + K KK KK KK K KK KK K K K + K K K
Sbjct: 238 AKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAK 296
Score = 26.4 bits (58), Expect = 4.4
Identities = 19/59 (32%), Positives = 23/59 (38%)
Query: 66 ADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
A K + K K+ K KK K+ K K K TK K K K+K KK
Sbjct: 245 AAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKA 303
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 28.9 bits (65), Expect = 0.48
Identities = 24/39 (61%), Positives = 27/39 (69%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
E K KQ K+ KQKKQKKKK+KK+K K KK KKKK
Sbjct: 97 EQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 27.0 bits (60), Expect = 2.2
Identities = 21/34 (61%), Positives = 25/34 (73%)
Query: 90 KQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
KQ ++K KQKKQK KK+KK KKK KK K+KK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 25.8 bits (57), Expect = 4.6
Identities = 18/34 (52%), Positives = 23/34 (67%)
Query: 91 QKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
QK K+ ++K K KKQKK KKKKKKK+ +K
Sbjct: 98 QKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAA 131
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 29.3 bits (65), Expect = 0.51
Identities = 8/50 (16%), Positives = 26/50 (52%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ + KKK ++ + + + + + + + ++ + +KK+++ KKK
Sbjct: 1330 KADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Score = 28.6 bits (63), Expect = 1.0
Identities = 12/50 (24%), Positives = 25/50 (50%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ E KK + + + + + + ++K + +K K++ KKK KKK
Sbjct: 1337 KAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386
Score = 28.2 bits (62), Expect = 1.4
Identities = 11/49 (22%), Positives = 24/49 (48%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+E KKK K+ ++ K+ + K + + + ++K + +KKK
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373
Score = 27.8 bits (61), Expect = 1.7
Identities = 9/46 (19%), Positives = 23/46 (50%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KK ++ KK+ KK+ ++ +K + K + + + ++K +
Sbjct: 1322 KKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
Score = 27.8 bits (61), Expect = 1.8
Identities = 13/50 (26%), Positives = 25/50 (50%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ + + +K + +KKK++ +K K K ++KKK + KKK
Sbjct: 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
Score = 27.4 bits (60), Expect = 2.3
Identities = 11/50 (22%), Positives = 25/50 (50%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ K + + + + ++K + +K K++ K K KK+ ++KKK
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392
Score = 27.4 bits (60), Expect = 2.3
Identities = 15/50 (30%), Positives = 29/50 (58%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E + ++K + +K+K++ KKK K K ++K K + KK+ ++ KK
Sbjct: 1356 ADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405
Score = 27.4 bits (60), Expect = 2.7
Identities = 9/50 (18%), Positives = 24/50 (48%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ + KKK ++ + K +KK ++ K+ + K + + + ++K
Sbjct: 1316 KADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365
Score = 27.4 bits (60), Expect = 2.7
Identities = 11/52 (21%), Positives = 27/52 (51%)
Query: 73 IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ E + + + +++ + +KK+++ K K KK ++KK+ + KK
Sbjct: 1347 AAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKK 1398
Score = 26.6 bits (58), Expect = 4.1
Identities = 10/48 (20%), Positives = 23/48 (47%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E KKK ++ K+ K+K ++ KK + + + + + ++K
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKA 1366
Score = 26.6 bits (58), Expect = 4.3
Identities = 10/50 (20%), Positives = 26/50 (52%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+K ++ KK+ ++ KK KKK ++ +K + K + + + + ++
Sbjct: 1315 KKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEE 1364
Score = 26.3 bits (57), Expect = 5.8
Identities = 11/50 (22%), Positives = 26/50 (52%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+K + K + + + + ++K + + KK++ KK K+K ++KK
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK 1391
Score = 26.3 bits (57), Expect = 6.5
Identities = 13/46 (28%), Positives = 25/46 (54%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KK ++ KK + +KK ++ K+K KK+ + KK + K + +
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEA 1354
Score = 26.3 bits (57), Expect = 6.7
Identities = 14/48 (29%), Positives = 27/48 (56%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E KKK ++ K+ + KK+ ++ +KK K+K + KK + K + +
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353
Score = 25.9 bits (56), Expect = 7.3
Identities = 11/60 (18%), Positives = 26/60 (43%)
Query: 65 QADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+A K +K + KK + + + + + ++ + + KK++ KK+ KK
Sbjct: 1326 EAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385
Score = 25.9 bits (56), Expect = 9.2
Identities = 8/47 (17%), Positives = 23/47 (48%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
K+ ++ +KK K+K ++ KK + + + + + ++K +
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
Score = 25.5 bits (55), Expect = 9.8
Identities = 11/49 (22%), Positives = 24/49 (48%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+E KK + KK+ ++ KK+ +KK + K+ K + + + +
Sbjct: 1312 EEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE 1360
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 29.1 bits (66), Expect = 0.51
Identities = 11/48 (22%), Positives = 25/48 (52%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
K + +++ K+QK+ K+++ ++ K KK K + K+ +K
Sbjct: 242 KAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEK 289
Score = 29.1 bits (66), Expect = 0.52
Identities = 11/47 (23%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKK-QKMKKQKKTKKKKKKKQKKQKKK 124
++K ++ +Q ++++K+ K Q+ ++ K K KK K + K+
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKR 286
Score = 26.1 bits (58), Expect = 6.7
Identities = 12/42 (28%), Positives = 23/42 (54%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKK 116
++ +KQ+K+ K+Q+ ++ K K KK K K+ +K
Sbjct: 248 QEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEK 289
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 28.7 bits (64), Expect = 0.52
Identities = 12/20 (60%), Positives = 14/20 (70%)
Query: 101 KQKMKKQKKTKKKKKKKQKK 120
K+K KK KKTKK KK +K
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120
Score = 28.3 bits (63), Expect = 0.58
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKK 98
++ KKKK+K K+TKK KK KK
Sbjct: 97 TEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 27.5 bits (61), Expect = 1.2
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 80 KKKQKKKQTKKQKKQKKKKQKK 101
K K+KKK++KK KK KK +K
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 27.5 bits (61), Expect = 1.2
Identities = 12/19 (63%), Positives = 13/19 (68%)
Query: 106 KQKKTKKKKKKKQKKQKKK 124
K+KK K KK KK KK KK
Sbjct: 101 KKKKKKSKKTKKPKKSSKK 119
Score = 27.2 bits (60), Expect = 1.8
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 96 KKKQKKQKMKKQKKTKKKKKKK 117
K K+KK+K KK KK KK KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 25.6 bits (56), Expect = 6.4
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 87 QTKKQKKQKKKKQKKQKMKKQKK 109
+ K+KK+K KK KK K +K
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKKD 120
Score = 25.2 bits (55), Expect = 7.4
Identities = 13/22 (59%), Positives = 16/22 (72%)
Query: 93 KQKKKKQKKQKMKKQKKTKKKK 114
K KKKK+K +K KK KK+ KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 24.8 bits (54), Expect = 9.3
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 102 QKMKKQKKTKKKKKKKQKKQKKK 124
+K KK+KK KK KK +K KK
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKKD 120
>gnl|CDD|234272 TIGR03592, yidC_oxa1_cterm, membrane protein insertase, YidC/Oxa1
family, C-terminal domain. This model describes
full-length from some species, and the C-terminal region
only from other species, of the YidC/Oxa1 family of
proteins. This domain appears to be univeral among
bacteria (although absent from Archaea). The
well-characterized YidC protein from Escherichia coli
and its close homologs contain a large N-terminal
periplasmic domain in addition to the region modeled
here [Protein fate, Protein and peptide secretion and
trafficking].
Length = 181
Score = 28.6 bits (65), Expect = 0.57
Identities = 11/39 (28%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 85 KKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
KQ K +K ++ + K +++ Q+K K +K Q++ K
Sbjct: 22 LKQYKSMRKMQELQPKLKEI--QEKYKDDPQKLQQEMMK 58
>gnl|CDD|166628 PLN02987, PLN02987, Cytochrome P450, family 90, subfamily A.
Length = 472
Score = 29.2 bits (65), Expect = 0.57
Identities = 20/70 (28%), Positives = 33/70 (47%), Gaps = 8/70 (11%)
Query: 10 QEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIG---RHITSEIQA 66
+E +++I + D + + + V+ ETLR+ A IIG R ++I+
Sbjct: 305 KEEHEKIRAMKSD--SYSLEWSDYKSMPFTQCVVNETLRV---ANIIGGIFRRAMTDIEV 359
Query: 67 DKYTIPIGRK 76
YTIP G K
Sbjct: 360 KGYTIPKGWK 369
>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
Length = 815
Score = 29.0 bits (65), Expect = 0.62
Identities = 14/43 (32%), Positives = 22/43 (51%)
Query: 82 KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++ K T+KQK + K+ + +Q KK KK KK+ K
Sbjct: 15 EEGKGGTRKQKAENKEHEGEQSPKKAKKEKKQDDSGNGNGKSA 57
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 29.2 bits (65), Expect = 0.64
Identities = 18/51 (35%), Positives = 23/51 (45%)
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+K+ KK KK K+ KK KK+ KK K +K KK KK
Sbjct: 992 MKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKK 1042
Score = 28.8 bits (64), Expect = 0.74
Identities = 17/52 (32%), Positives = 23/52 (44%)
Query: 73 IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ +K KK KK K+ KKK KK KK + KK +K +K
Sbjct: 996 VAKKAPAKKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKA 1047
Score = 26.5 bits (58), Expect = 5.2
Identities = 13/45 (28%), Positives = 20/45 (44%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
+K KK KK+ K+ KK +K K KK +K ++
Sbjct: 1008 KKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKAAGRKA 1052
Score = 26.1 bits (57), Expect = 5.8
Identities = 14/45 (31%), Positives = 20/45 (44%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
KK KK KK+ +K +K + KK KK +K +K
Sbjct: 1008 KKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKAAGRKA 1052
Score = 26.1 bits (57), Expect = 7.3
Identities = 16/50 (32%), Positives = 21/50 (42%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+K KK KK K+K KK KK K K KK ++ +K
Sbjct: 1003 KKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAKKPARKAAGRKA 1052
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 28.6 bits (64), Expect = 0.65
Identities = 12/47 (25%), Positives = 25/47 (53%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
+ +++ + + K++KK++ K K +K+K + KK K K K
Sbjct: 106 TDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 27.8 bits (62), Expect = 1.2
Identities = 11/43 (25%), Positives = 25/43 (58%)
Query: 82 KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++ +T+++ + K + K+K K++ K +K+K K + KK
Sbjct: 103 EESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145
>gnl|CDD|213371 cd12837, EcCorA-like_u1, uncharacterized subgroup of the
Escherichia coli Mg2+ transporter CorA_like subfamily.
A uncharacterized subgroup of the Escherichia coli
CorA-Salmonella typhimurium ZntB_like family
(EcCorA_ZntB-like) family of the MIT superfamily of
essential membrane proteins involved in transporting
divalent cations (uptake or efflux) across membranes.
The EcCorA_ZntB-like family includes the Mg2+
transporters Escherichia coli and Salmonella
typhimurium CorAs, which can also transport Co2+, and
Ni2+. Structures of the intracellular domain of
EcCorA_ZntB-like family members, Vibrio
parahaemolyticus and Salmonella typhimurium ZntB, form
funnel-shaped homopentamers, the tip of the funnel is
formed from two C-terminal transmembrane (TM) helices
from each monomer, and the large opening of the funnel
from the N-terminal cytoplasmic domains. The GMN
signature motif of the MIT superfamily occurs just
after TM1, mutation within this motif is known to
abolish Mg2+ transport through Salmonella typhimurium
CorA. Natural variants such as GVN and GIN, such as
occur in some ZntB family proteins, may be associated
with the transport of different divalent cations, such
as zinc and cadmium. The functional diversity of MIT
transporters may also be due to minor structural
differences regulating gating, substrate selection, and
transport.
Length = 298
Score = 28.7 bits (65), Expect = 0.66
Identities = 16/55 (29%), Positives = 30/55 (54%), Gaps = 5/55 (9%)
Query: 4 LHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGR 58
L+P ++E E +LG +PT +++ E+ +R+ +E LY AP++ R
Sbjct: 6 LNPTPEEEAAVE--ALLG---IAIPTREEMSEIEASSRLYREGGALYMTAPLLSR 55
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 28.8 bits (64), Expect = 0.74
Identities = 15/51 (29%), Positives = 29/51 (56%)
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K +K K + +T K ++++KKK+K++ KK KK++K K++
Sbjct: 543 KMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDG 593
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 28.9 bits (65), Expect = 0.76
Identities = 10/47 (21%), Positives = 21/47 (44%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+N K + Q K ++K +++ ++ K+ K+ Q K K
Sbjct: 896 QNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHKG 942
Score = 26.6 bits (59), Expect = 4.5
Identities = 12/60 (20%), Positives = 25/60 (41%), Gaps = 8/60 (13%)
Query: 73 IGRKENKKKKQ--------KKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
I +EN K Q K + + K ++K +++M ++ K+ ++ Q K
Sbjct: 881 IRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKH 940
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 28.5 bits (64), Expect = 0.78
Identities = 12/46 (26%), Positives = 31/46 (67%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
E + +++++++ K K K KK K K+++++K K++K++K ++
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84
Score = 27.7 bits (62), Expect = 1.4
Identities = 12/44 (27%), Positives = 27/44 (61%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
++ +K+++K K K K +K K K ++ +K K+ K++K ++
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84
Score = 27.7 bits (62), Expect = 1.5
Identities = 12/44 (27%), Positives = 27/44 (61%)
Query: 81 KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ ++K++ K + K K +K K K ++K K K++K++K ++
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84
>gnl|CDD|129295 TIGR00191, thrB, homoserine kinase. Homoserine kinase is part of
the threonine biosynthetic pathway.Homoserine kinase is
a member of the GHMP kinases (Galactokinase, Homoserine
kinase, Mevalonate kinase, Phosphomevalonate kinase)
and shares with them an amino-terminal domain probably
related to ATP binding.P.aeruginosa homoserine kinase
seems not to be homologous (see PROSITE:PDOC0054)
[Amino acid biosynthesis, Aspartate family].
Length = 302
Score = 28.5 bits (64), Expect = 0.80
Identities = 17/70 (24%), Positives = 28/70 (40%), Gaps = 9/70 (12%)
Query: 6 PKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHITSEIQ 65
+ QE D ++ G+ E +PT +L+ +V K L + +T E
Sbjct: 33 DVVAQESDDTEIEAEGEGVEKIPTEPTD---NLIYQVAKRFLDQLGIR-MPPVKVTLEKN 88
Query: 66 ADKYTIPIGR 75
IP+GR
Sbjct: 89 -----IPLGR 93
>gnl|CDD|166612 PLN02971, PLN02971, tryptophan N-hydroxylase.
Length = 543
Score = 28.8 bits (64), Expect = 0.82
Identities = 21/72 (29%), Positives = 35/72 (48%), Gaps = 3/72 (4%)
Query: 6 PKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHIT-SEI 64
P+I + +EI +V+G E I +L+ + +I+E RL+P A H+ S+
Sbjct: 358 PEILHKAMEEIDRVVGK--ERFVQESDIPKLNYVKAIIREAFRLHPVAAFNLPHVALSDT 415
Query: 65 QADKYTIPIGRK 76
Y IP G +
Sbjct: 416 TVAGYHIPKGSQ 427
>gnl|CDD|169556 PRK08703, PRK08703, short chain dehydrogenase; Provisional.
Length = 239
Score = 28.4 bits (63), Expect = 0.86
Identities = 9/28 (32%), Positives = 16/28 (57%)
Query: 1 MLALHPKIQQEVYDEIVQVLGDDPETVP 28
++A H K ++VYD IV+ +P +
Sbjct: 35 LVARHQKKLEKVYDAIVEAGHPEPFAIR 62
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 28.8 bits (64), Expect = 0.88
Identities = 5/40 (12%), Positives = 22/40 (55%)
Query: 85 KKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K + ++K K+K +++K + + + + ++ ++++
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQRE 831
Score = 28.4 bits (63), Expect = 1.1
Identities = 9/44 (20%), Positives = 21/44 (47%), Gaps = 2/44 (4%)
Query: 80 KKKQKKKQTKKQK--KQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
K + KQ K+K K+ K + ++ + + K++++K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGI 835
Score = 27.6 bits (61), Expect = 1.8
Identities = 10/63 (15%), Positives = 25/63 (39%)
Query: 62 SEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
+++ +++ P +E K +K T K + ++ K+K K+ K +
Sbjct: 758 EDLEEEEFEYPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPS 817
Query: 122 KKK 124
+
Sbjct: 818 VET 820
Score = 26.5 bits (58), Expect = 4.8
Identities = 10/43 (23%), Positives = 20/43 (46%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
+K+ K + + + ++ KQ+++K KKKK K
Sbjct: 808 DKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850
Score = 26.5 bits (58), Expect = 5.4
Identities = 8/40 (20%), Positives = 18/40 (45%)
Query: 84 KKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
K KQK ++K K+ + + + ++ KQ++
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQRE 831
Score = 25.7 bits (56), Expect = 8.3
Identities = 13/87 (14%), Positives = 27/87 (31%), Gaps = 5/87 (5%)
Query: 43 IKETLRLYPAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQ 102
++E YP + + + K++ K++ K+ K + +
Sbjct: 760 LEEEEFEYPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVE 819
Query: 103 KMK-----KQKKTKKKKKKKQKKQKKK 124
KQ++ K KKK
Sbjct: 820 TEGERCTIKQREEKGIDAPAILNVKKK 846
>gnl|CDD|179310 PRK01622, PRK01622, OxaA-like protein precursor; Validated.
Length = 256
Score = 28.6 bits (64), Expect = 0.88
Identities = 13/40 (32%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Query: 83 QKKKQTKKQKKQKKKKQKKQKMK-KQKKTKKKKKKKQKKQ 121
Q K Q Q+K K + K++ K K TK +K+K+ ++
Sbjct: 82 QYKSQRGMQEKMAVMKPELDKIQAKLKVTKDLEKQKEYQK 121
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 28.5 bits (63), Expect = 0.90
Identities = 12/67 (17%), Positives = 42/67 (62%)
Query: 58 RHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
R S + D Y P+ + KK+++ + K++ ++K ++++++ K+++K +++++++
Sbjct: 557 RGYNSCARTDLYFTPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKERERERER 616
Query: 118 QKKQKKK 124
+ ++ K
Sbjct: 617 EAERAAK 623
>gnl|CDD|99959 cd01635, Glycosyltransferase_GTB_type, Glycosyltransferases
catalyze the transfer of sugar moieties from activated
donor molecules to specific acceptor molecules, forming
glycosidic bonds. The acceptor molecule can be a lipid,
a protein, a heterocyclic compound, or another
carbohydrate residue. The structures of the formed
glycoconjugates are extremely diverse, reflecting a wide
range of biological functions. The members of this
family share a common GTB topology, one of the two
protein topologies observed for
nucleotide-sugar-dependent glycosyltransferases. GTB
proteins have distinct N- and C- terminal domains each
containing a typical Rossmann fold. The two domains have
high structural homology despite minimal sequence
homology. The large cleft that separates the two domains
includes the catalytic center and permits a high degree
of flexibility.
Length = 229
Score = 28.4 bits (63), Expect = 0.93
Identities = 11/56 (19%), Positives = 21/56 (37%)
Query: 1 MLALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPII 56
++ +++ D + + GD PE + + L LL RVI +
Sbjct: 122 LIEAFALLKERGPDLKLVIAGDGPEREYLEELLAALLLLDRVIFLGGLDPEELLAL 177
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 28.4 bits (64), Expect = 0.94
Identities = 9/41 (21%), Positives = 18/41 (43%)
Query: 82 KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
K K+ K K+ +++K K++ K K + +K
Sbjct: 10 KNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAA 50
Score = 26.5 bits (59), Expect = 5.1
Identities = 10/39 (25%), Positives = 18/39 (46%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
K KK+ K KQ +++K K++ + K +K
Sbjct: 9 SKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRK 47
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 28.1 bits (63), Expect = 0.95
Identities = 12/50 (24%), Positives = 20/50 (40%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ KKKK+KKK+TKK + + + + +K K
Sbjct: 179 ERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLK 228
Score = 25.8 bits (57), Expect = 7.6
Identities = 12/49 (24%), Positives = 20/49 (40%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E K + ++++ KK+KK+KKKK KK +
Sbjct: 169 VELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSG 217
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 28.5 bits (63), Expect = 0.99
Identities = 17/85 (20%), Positives = 30/85 (35%)
Query: 39 LTRVIKETLRLYPAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKK 98
L R+++ + P + K+ G + K K K K + + +
Sbjct: 218 LKRLVQIAKKRDPRIKSFKEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAE 277
Query: 99 QKKQKMKKQKKTKKKKKKKQKKQKK 123
+KK K+ KK K +KK
Sbjct: 278 ALASATAVKKKAKEVMKKALKMEKK 302
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 28.1 bits (63), Expect = 1.1
Identities = 12/41 (29%), Positives = 27/41 (65%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
E K K+ +K++ K+++ +KK ++ + KK +K +K++ K
Sbjct: 151 ERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
Score = 27.3 bits (61), Expect = 2.3
Identities = 10/40 (25%), Positives = 28/40 (70%)
Query: 81 KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
+K K+ + +++K+++ ++K ++ + KK +K+ +K+Q K
Sbjct: 152 RKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 28.1 bits (63), Expect = 1.1
Identities = 17/48 (35%), Positives = 29/48 (60%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KKK+KKK+ KK K KK ++ K + K KK +++++ ++K K
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300
Score = 27.7 bits (62), Expect = 1.7
Identities = 16/43 (37%), Positives = 24/43 (55%)
Query: 81 KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+ +K +KK+KKKK+ K K KK K++K + K KK
Sbjct: 246 AAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKK 288
Score = 27.3 bits (61), Expect = 1.8
Identities = 18/49 (36%), Positives = 30/49 (61%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ N K +KKK+ KK+ KK K KK K++K K KK +++++ +K
Sbjct: 249 RANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQK 297
Score = 26.2 bits (58), Expect = 6.0
Identities = 18/48 (37%), Positives = 25/48 (52%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E+ KK K KKKK+KK+K+ +K KK KQ+K + K
Sbjct: 238 EDMYKKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVK 285
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 28.3 bits (63), Expect = 1.1
Identities = 11/23 (47%), Positives = 20/23 (86%)
Query: 80 KKKQKKKQTKKQKKQKKKKQKKQ 102
+KK+KKK++K++KK+K ++ KQ
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 27.1 bits (60), Expect = 2.9
Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 3/53 (5%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMK---KQKKTKKKKKKKQKKQKKK 124
RK+ KKKK K+K+ KK ++ K+ + ++ + +K + Q K +KK
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736
Score = 26.0 bits (57), Expect = 6.4
Identities = 10/20 (50%), Positives = 16/20 (80%)
Query: 104 MKKQKKTKKKKKKKQKKQKK 123
+K+KK KK K+KK+KK ++
Sbjct: 683 SRKKKKKKKSKEKKKKKNRE 702
>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
Length = 635
Score = 28.4 bits (64), Expect = 1.1
Identities = 10/42 (23%), Positives = 21/42 (50%)
Query: 82 KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+Q++ Q K+ KK+++ K + K+ KK K ++
Sbjct: 527 RQQQAQYLALKQPAVKKKEEAAAPKAETVKRSSKKLSYKLQR 568
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 28.3 bits (63), Expect = 1.2
Identities = 20/51 (39%), Positives = 40/51 (78%)
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
G++E K+K+Q K++ KK+K++ K++ K +K K++ K K+ K+K+K+++KK
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153
Score = 26.8 bits (59), Expect = 3.6
Identities = 14/51 (27%), Positives = 36/51 (70%)
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+++K+K++ +++KK+K+K +++ K +K K+ K+K+ ++K+K+K
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEK 150
Score = 26.0 bits (57), Expect = 6.1
Identities = 15/50 (30%), Positives = 35/50 (70%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KE K+++KKK+ K +++ K +K K++ +K+ +K+K+K++K ++ +
Sbjct: 109 EKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPR 158
>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
Length = 290
Score = 28.2 bits (63), Expect = 1.2
Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 1/48 (2%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E K K + K +T K+ K +K + K + K+ +KKK
Sbjct: 242 EAKPKAKAKPKTAGIKRPVVKMEKTAE-KGGRPASNGKRFTSPGRKKK 288
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 28.1 bits (63), Expect = 1.2
Identities = 16/50 (32%), Positives = 27/50 (54%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K KK + K++ K K ++ K + K+KKK+K+K++KKK
Sbjct: 293 LKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKK 342
Score = 27.3 bits (61), Expect = 2.1
Identities = 13/48 (27%), Positives = 31/48 (64%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+ K+K + K + +K + K+KK++ KK++K KK+ ++ +++ +K
Sbjct: 307 SDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEK 354
Score = 26.5 bits (59), Expect = 3.8
Identities = 13/40 (32%), Positives = 29/40 (72%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
+ +KKK+KKK+ KK+K+ ++ +++ +K++ Q K++ K
Sbjct: 327 EVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366
Score = 25.8 bits (57), Expect = 7.7
Identities = 16/53 (30%), Positives = 26/53 (49%), Gaps = 4/53 (7%)
Query: 76 KENKKKKQKKKQTKKQKKQ----KKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K +K K K Q K+ KK + K+K+K + + +K + K+KKK
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKK 333
Score = 25.8 bits (57), Expect = 8.0
Identities = 13/48 (27%), Positives = 29/48 (60%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+ K+K K ++ +K + K+K K++KK +KKKK+ ++ +++
Sbjct: 304 EMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEER 351
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 28.1 bits (62), Expect = 1.2
Identities = 13/39 (33%), Positives = 27/39 (69%)
Query: 67 DKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMK 105
+K +P K++KK K+K+K+ K++++ K KK++ + K
Sbjct: 188 EKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 27.3 bits (60), Expect = 2.4
Identities = 16/59 (27%), Positives = 33/59 (55%)
Query: 66 ADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
AD +P+ + N + + ++ +KK K+ K+K KK+K+ ++ K KK++ + K
Sbjct: 168 ADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 26.6 bits (58), Expect = 4.2
Identities = 15/42 (35%), Positives = 25/42 (59%)
Query: 83 QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ + K + ++ KK K K KKQ+K K++KKKK+K +
Sbjct: 261 SEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302
Score = 26.6 bits (58), Expect = 4.4
Identities = 14/52 (26%), Positives = 26/52 (50%)
Query: 70 TIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
+ + + K + ++ +K K KK+K +K+K+ KKKKKK +
Sbjct: 251 ANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHR 302
>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
Reviewed.
Length = 830
Score = 28.2 bits (63), Expect = 1.3
Identities = 12/53 (22%), Positives = 25/53 (47%)
Query: 72 PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
PIGRK + K+K ++++ + + ++ +K K K +K + K
Sbjct: 8 PIGRKGKPSRPVKQKVSRRRYRDDDDYDDYDDYEDEEPMPRKGKGKGRKPRGK 60
>gnl|CDD|180505 PRK06278, PRK06278, cobyrinic acid a,c-diamide synthase; Validated.
Length = 476
Score = 28.1 bits (63), Expect = 1.4
Identities = 15/48 (31%), Positives = 26/48 (54%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
EN K K K+ +K++ KK K K+K+K + KK+++ + K
Sbjct: 192 VENFLKYLKIKEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNKPK 239
>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106. This
archaebacterial protein family has no known function.
Members are predicted to be integral membrane proteins.
Length = 168
Score = 27.6 bits (62), Expect = 1.4
Identities = 17/74 (22%), Positives = 34/74 (45%), Gaps = 15/74 (20%)
Query: 56 IGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQ---------KKQKKKKQKKQK--M 104
I R + SE+ I ++ +K ++++K+ +K+ K KK +K+Q+ M
Sbjct: 24 ILRGLLSEL-LQWLLID---RKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELM 79
Query: 105 KKQKKTKKKKKKKQ 118
+ QK+ K
Sbjct: 80 EDQKEMMMDMMKPN 93
Score = 25.3 bits (56), Expect = 9.8
Identities = 11/36 (30%), Positives = 24/36 (66%), Gaps = 2/36 (5%)
Query: 89 KKQKKQKKKKQKKQKMKKQKKT--KKKKKKKQKKQK 122
K +K QK++K+ +++ ++ +K K KK +K+Q+
Sbjct: 41 KMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQE 76
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 27.8 bits (62), Expect = 1.4
Identities = 15/50 (30%), Positives = 27/50 (54%)
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
G E K + + ++ K K Q +++KK +KK++KK+ K+KK
Sbjct: 28 GNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77
Score = 26.7 bits (59), Expect = 3.2
Identities = 13/46 (28%), Positives = 22/46 (47%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
K +K K K ++ +K Q+K++ KK+ K KK + Q
Sbjct: 41 KAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKVPLQVNPAQ 86
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 28.0 bits (63), Expect = 1.5
Identities = 13/64 (20%), Positives = 21/64 (32%)
Query: 61 TSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
+I + K+ + KK K K K KK+ K + KK +K
Sbjct: 49 PEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEK 108
Query: 121 QKKK 124
+
Sbjct: 109 KNAL 112
Score = 27.3 bits (61), Expect = 2.6
Identities = 11/48 (22%), Positives = 23/48 (47%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ + +++ + K +K K K K K++ K + KKK +Q +
Sbjct: 7 KAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQ 54
Score = 26.9 bits (60), Expect = 3.0
Identities = 14/84 (16%), Positives = 28/84 (33%)
Query: 41 RVIKETLRLYPAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQK 100
+ KE ++ + ++ + + + KK+TK K K
Sbjct: 32 FITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAP 91
Query: 101 KQKMKKQKKTKKKKKKKQKKQKKK 124
+K K + KK +K+ K
Sbjct: 92 AKKKLKDELDSSKKAEKKNALDKD 115
Score = 26.9 bits (60), Expect = 3.3
Identities = 10/49 (20%), Positives = 18/49 (36%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+KK +T + K KK+ + +KK +KK +
Sbjct: 69 ATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDD 117
Score = 26.9 bits (60), Expect = 3.4
Identities = 9/48 (18%), Positives = 22/48 (45%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
K +++ K+ K+ K K + ++ K+ + KKK ++ +
Sbjct: 7 KAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQ 54
>gnl|CDD|233383 TIGR01373, soxB, sarcosine oxidase, beta subunit family,
heterotetrameric form. This model describes the beta
subunit of a family of known and putative
heterotetrameric sarcosine oxidases. Five operons of
such oxidases are found in Mesorhizobium loti and three
in Agrobacterium tumefaciens, a high enough copy number
to suggest that not all members are share the same
function. The model is designated as subfamily rather
than equivalog for this reason. Sarcosine oxidase
catalyzes the oxidative demethylation of sarcosine to
glycine. The reaction converts tetrahydrofolate to
5,10-methylene-tetrahydrofolate. The enzyme is known in
monomeric and heterotetrameric (alpha,beta,gamma,delta)
forms [Energy metabolism, Amino acids and amines].
Length = 407
Score = 27.9 bits (62), Expect = 1.6
Identities = 13/50 (26%), Positives = 22/50 (44%), Gaps = 2/50 (4%)
Query: 3 ALHPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPA 52
A+H + Q E+V +G + +Y Q L L V+ L ++P
Sbjct: 278 AVHFYVSQSDKGELV--IGGGIDGYNSYAQRGNLPTLEHVLAAILEMFPI 325
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 27.7 bits (62), Expect = 1.7
Identities = 14/50 (28%), Positives = 27/50 (54%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++E + KK+K+ +TK K+ KKK+KK + + K+K ++
Sbjct: 79 QREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERIS 128
Score = 25.8 bits (57), Expect = 5.8
Identities = 14/50 (28%), Positives = 22/50 (44%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ KKK+ K K K+ K+KKKK + + KKK ++
Sbjct: 82 ERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAP 131
>gnl|CDD|234988 PRK01889, PRK01889, GTPase RsgA; Reviewed.
Length = 356
Score = 27.6 bits (62), Expect = 1.7
Identities = 14/52 (26%), Positives = 27/52 (51%), Gaps = 7/52 (13%)
Query: 70 TIPIGRKENKKKKQKK------KQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
+ R ++ +K Q++ K + + ++KK KK K+ KK K+KK+
Sbjct: 306 ELDERRLQSYRKLQREQAYNAAKLAARLARAERKKWKK-ISKQVKKMKRKKR 356
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 27.5 bits (61), Expect = 1.7
Identities = 9/46 (19%), Positives = 18/46 (39%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
+ ++KQ + K + + + KM+ K K+K K
Sbjct: 17 KNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSK 62
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 27.3 bits (61), Expect = 1.7
Identities = 12/51 (23%), Positives = 38/51 (74%)
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
GR + ++K+++K+ +K++K++K++++K+ K + ++++K+ ++ +K K
Sbjct: 9 GRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAK 59
Score = 25.3 bits (56), Expect = 6.8
Identities = 12/46 (26%), Positives = 33/46 (71%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
++E K++++++K+ K++K+++ K QK +++K+ ++ +K K K
Sbjct: 17 KREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKP 62
Score = 25.0 bits (55), Expect = 9.6
Identities = 13/51 (25%), Positives = 37/51 (72%)
Query: 73 IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
I +E +++K+++K+ K++K++K+K+ K ++K+++ K+ ++ ++ K K
Sbjct: 12 IDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKP 62
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 27.3 bits (61), Expect = 1.7
Identities = 19/49 (38%), Positives = 30/49 (61%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E +K +K+ + K++ K KKKK KK+K K + K KK K +K+ +K
Sbjct: 68 AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116
Score = 27.3 bits (61), Expect = 1.9
Identities = 18/49 (36%), Positives = 30/49 (61%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K+ ++KQK K KK+ K+KK K K +K K+ +KK +K+ + K +
Sbjct: 75 KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123
Score = 27.0 bits (60), Expect = 2.7
Identities = 17/50 (34%), Positives = 33/50 (66%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+KE ++K++ K + KK KK+K K + K+ KK K++KK +K+ + + +
Sbjct: 75 KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 26.6 bits (59), Expect = 3.6
Identities = 20/50 (40%), Positives = 31/50 (62%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+KE ++ +K K+ ++K++ K K+KK K KK K KK KK K +KK
Sbjct: 64 KKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113
Score = 25.8 bits (57), Expect = 6.3
Identities = 20/50 (40%), Positives = 30/50 (60%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K K+ ++K+K K+KK KKKK K + K KK K +KK +K+ + K
Sbjct: 72 EKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDK 121
Score = 25.4 bits (56), Expect = 8.4
Identities = 16/48 (33%), Positives = 29/48 (60%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E +K K KKK++KK+K + K K+ +K K +K +K+ + + + K
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTK 127
>gnl|CDD|225123 COG2213, MtlA, Phosphotransferase system, mannitol-specific IIBC
component [Carbohydrate transport and metabolism].
Length = 472
Score = 27.7 bits (62), Expect = 1.8
Identities = 9/42 (21%), Positives = 13/42 (30%), Gaps = 1/42 (2%)
Query: 76 KENKKKKQKKKQTK-KQKKQKKKKQKKQKMKKQKKTKKKKKK 116
K + + K K+ K + K Q K KK
Sbjct: 339 KTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKK 380
Score = 26.9 bits (60), Expect = 3.3
Identities = 6/42 (14%), Positives = 12/42 (28%)
Query: 80 KKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
K + + K K+ K + + K K+
Sbjct: 339 KTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKK 380
Score = 26.5 bits (59), Expect = 4.2
Identities = 10/45 (22%), Positives = 14/45 (31%), Gaps = 1/45 (2%)
Query: 81 KKQKKKQTKK-QKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K K + + K K+ K + K Q K KK
Sbjct: 336 KTSKTESEDDLEAATAKMKEMKGESKGQSSALAADVKNDLSNVKK 380
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 27.1 bits (61), Expect = 1.8
Identities = 11/35 (31%), Positives = 25/35 (71%)
Query: 89 KKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
K++KK+KKKK++ + +++KK +++++ QK
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 27.7 bits (62), Expect = 1.9
Identities = 16/47 (34%), Positives = 20/47 (42%)
Query: 68 KYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKK 114
T +K N K K KKK+ K K + Q+ K K K KK
Sbjct: 347 TATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393
Score = 26.5 bits (59), Expect = 4.0
Identities = 16/47 (34%), Positives = 23/47 (48%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KK KK K K +KKKK+ + +T ++ K KK+ KK
Sbjct: 347 TATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393
Score = 26.1 bits (58), Expect = 5.9
Identities = 12/46 (26%), Positives = 20/46 (43%)
Query: 72 PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
+ K + K + KK+K K + ++ K + KKK KK
Sbjct: 348 ATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393
>gnl|CDD|234438 TIGR03999, thiol_BshA, N-acetyl-alpha-D-glucosaminyl L-malate
synthase BshA. Members of this protein family are BshA,
a glycosyltransferase required for bacillithiol
biosynthesis. This enzyme combines UDP-GlcNAc and
L-malate to form N-acetyl-alpha-D-glucosaminyl L-malate
synthase. Bacillithiol is a low-molecular-weight thiol,
an analog of glutathione and mycothiol, and is found
largely in the Firmicutes [Biosynthesis of cofactors,
prosthetic groups, and carriers, Glutathione and
analogs].
Length = 374
Score = 27.6 bits (62), Expect = 1.9
Identities = 15/36 (41%), Positives = 22/36 (61%), Gaps = 1/36 (2%)
Query: 7 KIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRV 42
++QQEV +++ V GD PE P ++EL L RV
Sbjct: 223 RVQQEVPAKLLLV-GDGPERSPAEQLVRELGLTDRV 257
>gnl|CDD|222911 PHA02616, PHA02616, VP2/VP3; Provisional.
Length = 259
Score = 27.3 bits (60), Expect = 2.0
Identities = 14/53 (26%), Positives = 21/53 (39%), Gaps = 1/53 (1%)
Query: 63 EIQADKYTIPI-GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKK 114
E+ D Y IP K + + TKK KK K K + K ++ +
Sbjct: 194 ELNKDIYKIPTQAVKRKQDELHPVSPTKKAALSKKSKWTGTKSSQSSKKRRGR 246
>gnl|CDD|178592 PLN03018, PLN03018, homomethionine N-hydroxylase.
Length = 534
Score = 27.7 bits (61), Expect = 2.0
Identities = 21/71 (29%), Positives = 36/71 (50%), Gaps = 3/71 (4%)
Query: 5 HPKIQQEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHIT-SE 63
+P+I ++ E+ +V+G D V D I L+ L +ET R++P+A + H+ +
Sbjct: 344 NPEILRKALKELDEVVGKD-RLVQESD-IPNLNYLKACCRETFRIHPSAHYVPPHVARQD 401
Query: 64 IQADKYTIPIG 74
Y IP G
Sbjct: 402 TTLGGYFIPKG 412
>gnl|CDD|215494 PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic
subunit.
Length = 1044
Score = 27.6 bits (61), Expect = 2.1
Identities = 15/45 (33%), Positives = 23/45 (51%), Gaps = 5/45 (11%)
Query: 74 GRKENKKKKQKKKQTKKQ-----KKQKKKKQKKQKMKKQKKTKKK 113
GR+ KK +K K+ ++ KK+KKK M +K +KK
Sbjct: 606 GRRGKSKKSKKGKKGRRSLLGGLKKRKKKGGGGGSMMGKKYGRKK 650
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 27.5 bits (61), Expect = 2.1
Identities = 13/74 (17%), Positives = 18/74 (24%), Gaps = 7/74 (9%)
Query: 51 PAAPIIGRHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKT 110
H + KK Q K K K +K K KK +
Sbjct: 122 NPYGTREMHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPK-------KKGSVS 174
Query: 111 KKKKKKKQKKQKKK 124
+ K + K
Sbjct: 175 NRSVKMPGIDPRSK 188
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 27.5 bits (62), Expect = 2.1
Identities = 14/43 (32%), Positives = 26/43 (60%)
Query: 82 KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
Q+ +Q +++ KQ K + ++++K K+KKK K+KK K
Sbjct: 375 LQRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVP 417
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 26.8 bits (60), Expect = 2.1
Identities = 20/54 (37%), Positives = 35/54 (64%), Gaps = 4/54 (7%)
Query: 75 RKENKKKKQKKKQTKKQ----KKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
R+ K+ K+ TK Q + ++ KQ+K+K K+KK ++K++K+Q KQ+KK
Sbjct: 72 RQAAKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKK 125
Score = 25.3 bits (56), Expect = 7.9
Identities = 14/39 (35%), Positives = 31/39 (79%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
+ E+++ KQ+KK+ K+KK+++K++K+Q +++KK K +
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
>gnl|CDD|234961 PRK01581, speE, spermidine synthase; Validated.
Length = 374
Score = 27.2 bits (60), Expect = 2.1
Identities = 13/55 (23%), Positives = 25/55 (45%)
Query: 67 DKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
Y I +K+ + + K +Q+ + K+ K+ K KQ + + + KQ Q
Sbjct: 11 KIYRITSYKKDKRSELDSNKFELEQQLVENKQDKQGKQYKQDRGIQYAETKQDNQ 65
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 27.5 bits (61), Expect = 2.2
Identities = 13/46 (28%), Positives = 17/46 (36%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K KQ ++ K K +K KK K KK+KK
Sbjct: 97 FDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142
Score = 26.3 bits (58), Expect = 5.3
Identities = 11/43 (25%), Positives = 18/43 (41%)
Query: 67 DKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKK 109
+ + + K +K KKK T + KKKK+ + K
Sbjct: 109 PLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIK 151
Score = 26.3 bits (58), Expect = 5.7
Identities = 14/65 (21%), Positives = 22/65 (33%)
Query: 60 ITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQK 119
I ++ D +N K+ ++ K K +K K K KKKK+
Sbjct: 83 IKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142
Query: 120 KQKKK 124
K
Sbjct: 143 LSSKD 147
>gnl|CDD|223016 PHA03233, PHA03233, DNA packaging protein UL32; Provisional.
Length = 518
Score = 27.4 bits (61), Expect = 2.2
Identities = 7/31 (22%), Positives = 16/31 (51%)
Query: 82 KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKK 112
K ++ KK+K +++ + +KK + K
Sbjct: 219 KIRRFLQKKKKPAAAEEELLRLLKKHTRPSK 249
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 27.1 bits (60), Expect = 2.2
Identities = 15/35 (42%), Positives = 25/35 (71%)
Query: 87 QTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
+ K ++K KK+K + K++KK KK+KKKK+K+
Sbjct: 135 EGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 27.3 bits (61), Expect = 2.2
Identities = 14/44 (31%), Positives = 20/44 (45%)
Query: 59 HITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQ 102
T+ A P ++K+K KK KK KK+KQK+
Sbjct: 205 DKTAASSASVVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248
Score = 25.4 bits (56), Expect = 9.8
Identities = 6/30 (20%), Positives = 16/30 (53%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKK 106
++ +K+K KK K+ +K+++ +
Sbjct: 219 QSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 27.3 bits (60), Expect = 2.2
Identities = 17/66 (25%), Positives = 33/66 (50%)
Query: 58 RHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
+ + E Q KY+ ++ K + KK+ +++++KK + KQKK KK K
Sbjct: 504 KELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYS 563
Query: 118 QKKQKK 123
K+++
Sbjct: 564 NAKKEE 569
Score = 25.8 bits (56), Expect = 7.9
Identities = 20/48 (41%), Positives = 27/48 (56%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+E KK K K++K KK K K ++Q + KKKKK+ KQKK
Sbjct: 539 EEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKK 586
Score = 25.4 bits (55), Expect = 10.0
Identities = 19/49 (38%), Positives = 24/49 (48%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K+ K KQ K KK K KK++ + K KKK+ KQKK K
Sbjct: 542 KKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKKLDSK 590
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 27.3 bits (61), Expect = 2.3
Identities = 12/39 (30%), Positives = 23/39 (58%)
Query: 66 ADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKM 104
A Y I +G+K+ KK+++ K+ KK+ + K + +K
Sbjct: 114 AKHYGIDLGKKKKYKKEEENKKAKKKSESLVAKLEVRKK 152
>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
Toc86/159, G and M domains. The long precursor of the
86K protein originally described is proposed to have
three domains. The N-terminal A-domain is acidic,
repetitive, weakly conserved, readily removed by
proteolysis during chloroplast isolation, and not
required for protein translocation. The other domains
are designated G (GTPase) and M (membrane anchor); this
family includes most of the G domain and all of M
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 763
Score = 27.2 bits (60), Expect = 2.4
Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 3/48 (6%)
Query: 80 KKKQKKKQTKKQKKQ---KKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K Q K +K+Q+K + + K KKQ + + K+ K KK K+
Sbjct: 417 TKAQMAKLSKEQRKAYLEEYDYRVKLLQKKQWREELKRMKMMKKFGKE 464
>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
function prediction only].
Length = 386
Score = 27.1 bits (60), Expect = 2.8
Identities = 9/25 (36%), Positives = 13/25 (52%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQ 102
K+K +K + K KK KK Q +
Sbjct: 358 RKRKGDRKGVSHKAKKGGKKNQAET 382
Score = 26.3 bits (58), Expect = 5.3
Identities = 9/31 (29%), Positives = 15/31 (48%)
Query: 88 TKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQ 118
K + +K+K +K K KK KK + +
Sbjct: 352 KKAEAARKRKGDRKGVSHKAKKGGKKNQAET 382
Score = 25.6 bits (56), Expect = 8.6
Identities = 9/32 (28%), Positives = 15/32 (46%)
Query: 90 KQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
+K + +K+K + K KK KK Q +
Sbjct: 351 PKKAEAARKRKGDRKGVSHKAKKGGKKNQAET 382
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 27.1 bits (60), Expect = 2.8
Identities = 13/46 (28%), Positives = 17/46 (36%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
K K K + T QKK + K++ Q K K KK
Sbjct: 217 KSKKAKTDRDVSTSTAASQKKSSDLESKLEAQSKELWSLKDDLKKH 262
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 26.7 bits (59), Expect = 2.9
Identities = 14/34 (41%), Positives = 21/34 (61%)
Query: 83 QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKK 116
++K T +K+KK K KK K+K +K KKK +
Sbjct: 81 RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114
>gnl|CDD|197876 smart00792, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP) is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 124
Score = 26.4 bits (58), Expect = 2.9
Identities = 14/49 (28%), Positives = 25/49 (51%)
Query: 58 RHITSEIQADKYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKK 106
R +S QAD ++ I K KK ++ +K+ QKK+K+ + +
Sbjct: 32 RSNSSLNQADFPSVSIVGLNKKPKKISAEEAEKKLLQKKEKKALTNVLR 80
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex which
functions in messenger ribonucleoprotein metabolism and
plays a role in preventing the transcription-associated
genetic instability. Tho2, along with four other
subunits forms THO.
Length = 296
Score = 26.9 bits (60), Expect = 2.9
Identities = 13/44 (29%), Positives = 24/44 (54%)
Query: 80 KKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+K+ K+ + KKKK+KK+ KK +++ KK + +K
Sbjct: 37 EKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEK 80
Score = 26.5 bits (59), Expect = 4.5
Identities = 12/40 (30%), Positives = 21/40 (52%)
Query: 85 KKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K+ K+ ++ +K+ K+ KKKK+KK+ K K
Sbjct: 26 KELYDKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIK 65
Score = 26.1 bits (58), Expect = 6.1
Identities = 11/52 (21%), Positives = 28/52 (53%)
Query: 73 IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
I ++ KK+ +K++ + K+ ++++KK + +K KK+ ++K
Sbjct: 40 IKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEKSS 91
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 26.9 bits (60), Expect = 2.9
Identities = 13/45 (28%), Positives = 28/45 (62%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+K K+ +K K+ +K+ ++ Q+ MKK KK ++K+ + Q++
Sbjct: 72 EKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQMEMMDDQRE 116
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 27.1 bits (60), Expect = 3.0
Identities = 8/40 (20%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKM-KKQKKTKKK 113
+ ++K ++ ++ + K++KK ++ +++ +K +K ++ KK
Sbjct: 16 QAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKKS 55
Score = 26.7 bits (59), Expect = 3.4
Identities = 9/42 (21%), Positives = 28/42 (66%)
Query: 83 QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+K+ ++K Q+ ++ + ++++K ++ ++ KK +K ++ KK
Sbjct: 14 EKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEAKKS 55
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 26.9 bits (60), Expect = 3.0
Identities = 18/49 (36%), Positives = 22/49 (44%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+EN K K KKQ KK+ K+ K K K KK+KKK
Sbjct: 492 EENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKK 540
Score = 26.9 bits (60), Expect = 3.7
Identities = 17/53 (32%), Positives = 23/53 (43%)
Query: 72 PIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
P + + K KKQ K+K K + K+ K KKKKKK+K
Sbjct: 495 PWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLD 547
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 26.7 bits (59), Expect = 3.1
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 80 KKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
++ ++KK + KK+ + + KQKK KK KKKK K
Sbjct: 64 RQAERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPK 104
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 26.9 bits (60), Expect = 3.1
Identities = 13/33 (39%), Positives = 21/33 (63%)
Query: 92 KKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
KK+K +++ ++ KQ K KKK KK++K K
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKKRKIYFK 37
Score = 25.4 bits (56), Expect = 8.0
Identities = 11/34 (32%), Positives = 19/34 (55%)
Query: 89 KKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
KK+K +++ + K K KK KKK+K ++
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKR 38
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 26.5 bits (59), Expect = 3.1
Identities = 23/46 (50%), Positives = 35/46 (76%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
++E KK+K ++K KK+ K++KKKQKK+K KK KK KK++K+ K
Sbjct: 71 KREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116
Score = 25.4 bits (56), Expect = 6.8
Identities = 18/49 (36%), Positives = 37/49 (75%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++ KK+ + ++ +K++++K+K ++K K+ K+ KKK+KKK+KK+ KK
Sbjct: 57 EKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 26.9 bits (60), Expect = 3.2
Identities = 8/48 (16%), Positives = 26/48 (54%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E + + + + K++ +K +++ +Q + +K +++ +K + K K
Sbjct: 88 EENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135
Score = 26.9 bits (60), Expect = 3.2
Identities = 8/47 (17%), Positives = 27/47 (57%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ + ++ + + +K ++K +Q + +++K+++ +K K K + K
Sbjct: 92 EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWK 138
Score = 26.1 bits (58), Expect = 5.5
Identities = 13/47 (27%), Positives = 26/47 (55%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ KK+ KK +K ++ + + +KK++ ++ K K K +K KK
Sbjct: 98 EELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKY 144
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 27.1 bits (60), Expect = 3.2
Identities = 13/61 (21%), Positives = 34/61 (55%), Gaps = 4/61 (6%)
Query: 65 QADKYTIPIGRKENKKKKQKKKQTKKQKK----QKKKKQKKQKMKKQKKTKKKKKKKQKK 120
++ + I + +K+ ++K +K +K KK+ +Q+M++ K+ ++ KK + +K
Sbjct: 509 FKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEK 568
Query: 121 Q 121
+
Sbjct: 569 E 569
>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 594
Score = 26.9 bits (60), Expect = 3.3
Identities = 13/51 (25%), Positives = 18/51 (35%)
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
G E K+ +K K K + K Q+K K +K KK
Sbjct: 438 GTIELPFKEVCEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDF 488
Score = 26.9 bits (60), Expect = 3.3
Identities = 11/47 (23%), Positives = 17/47 (36%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
E K K K K Q+K +K +K KK ++ +
Sbjct: 448 CEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFET 494
Score = 26.6 bits (59), Expect = 4.8
Identities = 8/46 (17%), Positives = 16/46 (34%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K K K Q+K K + K+ K+ + + ++
Sbjct: 456 KNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQE 501
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 25.9 bits (57), Expect = 3.3
Identities = 18/51 (35%), Positives = 25/51 (49%)
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
G K +KK+ K + K+ K +K Q + T+K K KK KK KK
Sbjct: 15 GCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKK 65
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 26.4 bits (59), Expect = 3.3
Identities = 14/57 (24%), Positives = 32/57 (56%), Gaps = 11/57 (19%)
Query: 79 KKKKQKKKQTKKQKKQKKKKQ-----------KKQKMKKQKKTKKKKKKKQKKQKKK 124
KK K+ KK+ +KQ+KQ +K ++ K +K ++ ++ +++Q + ++K
Sbjct: 15 KKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQK 71
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 26.9 bits (59), Expect = 3.4
Identities = 10/47 (21%), Positives = 26/47 (55%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
+E+ K+ Q+ K+ +K+ K +++ Q K++ + ++KQ+
Sbjct: 209 QEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQ 255
Score = 25.7 bits (56), Expect = 7.5
Identities = 13/50 (26%), Positives = 24/50 (48%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
R + K++ KKQ K Q+K + KQ+ ++K+++ K K
Sbjct: 214 RAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKP 263
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 26.5 bits (59), Expect = 3.6
Identities = 8/24 (33%), Positives = 16/24 (66%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQK 100
+ +KKKQ KKQ++++ K+ +
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELR 199
Score = 25.4 bits (56), Expect = 9.2
Identities = 10/26 (38%), Positives = 19/26 (73%)
Query: 71 IPIGRKENKKKKQKKKQTKKQKKQKK 96
+P +++KKKQ+KKQ +K+ K+ +
Sbjct: 174 VPGHSSDSEKKKQRKKQRRKRSKELR 199
>gnl|CDD|215276 PLN02500, PLN02500, cytochrome P450 90B1.
Length = 490
Score = 26.7 bits (59), Expect = 3.7
Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 3/74 (4%)
Query: 6 PKIQQEVYDE---IVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHITS 62
PK QE+ +E I + E+ ++ +++ VI ETLRL + R
Sbjct: 310 PKAVQELREEHLEIARAKKQSGESELNWEDYKKMEFTQCVINETLRLGNVVRFLHRKALK 369
Query: 63 EIQADKYTIPIGRK 76
+++ Y IP G K
Sbjct: 370 DVRYKGYDIPSGWK 383
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 26.6 bits (59), Expect = 3.8
Identities = 8/32 (25%), Positives = 20/32 (62%)
Query: 82 KQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
+ + +KK++ + +KK++M++ K +KK
Sbjct: 82 ASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113
Score = 25.9 bits (57), Expect = 7.0
Identities = 11/55 (20%), Positives = 29/55 (52%)
Query: 69 YTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKK 123
+ ++ K+ + ++ KK+++ + KK+K ++ K +KK + K+K+
Sbjct: 68 FLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 26.8 bits (59), Expect = 3.9
Identities = 9/38 (23%), Positives = 22/38 (57%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKK 115
N+++ + Q+K +K+ +K +M+ ++K KK+
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKE 42
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 26.6 bits (59), Expect = 4.0
Identities = 14/60 (23%), Positives = 29/60 (48%), Gaps = 12/60 (20%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMK----KQKK--------TKKKKKKKQKKQKKK 124
E + + Q++K+ Q + + K+KK ++ Q+K + KK+K +K+ K
Sbjct: 1519 EIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSN 1578
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 26.5 bits (58), Expect = 4.3
Identities = 19/48 (39%), Positives = 21/48 (43%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E KK KK T K+ + KK K K KK K KK K K K
Sbjct: 14 EEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTK 61
>gnl|CDD|223778 COG0706, YidC, Preprotein translocase subunit YidC [Intracellular
trafficking and secretion].
Length = 314
Score = 26.3 bits (58), Expect = 4.3
Identities = 9/40 (22%), Positives = 24/40 (60%)
Query: 83 QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
+K T+ K ++ + K ++++++ K K+K++Q+ K
Sbjct: 128 LSQKSTRSMAKMQELQPKIKEIQEKYKGTDKQKQQQEMMK 167
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 26.1 bits (58), Expect = 4.5
Identities = 15/50 (30%), Positives = 38/50 (76%), Gaps = 1/50 (2%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKK-KKKKKQKKQKKK 124
KE KKK+++ + ++ + ++++ K++ +K+K+T+K ++KK +++QK+K
Sbjct: 79 KEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEK 128
>gnl|CDD|221659 pfam12597, DUF3767, Protein of unknown function (DUF3767). This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 112 and 199 amino
acids in length.
Length = 119
Score = 25.7 bits (57), Expect = 4.7
Identities = 8/31 (25%), Positives = 20/31 (64%)
Query: 94 QKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
+ ++++ Q MKK + +KK K+ ++K++
Sbjct: 89 RYRRRKSFQGMKKAVEAMAEKKAKKLREKEE 119
>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein. PACS-1 is a
cytosolic sorting protein that directs the localisation
of membrane proteins in the trans-Golgi network
(TGN)/endosomal system. PACS-1 connects the clathrin
adaptor AP-1 to acidic cluster sorting motifs contained
in the cytoplasmic domain of cargo proteins such as
furin, the cation-independent mannose-6-phosphate
receptor and in viral proteins such as human
immunodeficiency virus type 1 Nef.
Length = 413
Score = 26.3 bits (58), Expect = 4.7
Identities = 8/19 (42%), Positives = 11/19 (57%)
Query: 95 KKKKQKKQKMKKQKKTKKK 113
K+K +K M KK K+K
Sbjct: 332 KEKNKKVPVMFLGKKPKEK 350
Score = 25.9 bits (57), Expect = 7.8
Identities = 9/27 (33%), Positives = 14/27 (51%)
Query: 92 KKQKKKKQKKQKMKKQKKTKKKKKKKQ 118
K+K KK + K+ K K+ + K Q
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESKSQ 357
>gnl|CDD|222806 PHA00645, PHA00645, hypothetical protein.
Length = 125
Score = 25.8 bits (56), Expect = 4.9
Identities = 10/36 (27%), Positives = 18/36 (50%)
Query: 83 QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQ 118
+K K Q KQ+ ++ KK++ + K K K+
Sbjct: 87 RKGINVGKGVYQNSNKQQVKREKKRQSIRAKAKNKR 122
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 26.3 bits (58), Expect = 5.0
Identities = 14/48 (29%), Positives = 33/48 (68%)
Query: 71 IPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQ 118
+P +E +K+++K+++ K K+++ ++++K +K+KKTKK K+
Sbjct: 35 VPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTT 82
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 26.1 bits (58), Expect = 5.0
Identities = 9/41 (21%), Positives = 20/41 (48%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKK 116
+E + K+ K++ K+ ++KK + +K +KK
Sbjct: 89 EELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKK 129
Score = 25.8 bits (57), Expect = 6.7
Identities = 9/44 (20%), Positives = 20/44 (45%)
Query: 81 KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
K ++ K+ K++ K+ ++KK + +K +KK
Sbjct: 86 KSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKK 129
>gnl|CDD|177060 CHL00138, rps5, ribosomal protein S5; Validated.
Length = 143
Score = 25.7 bits (57), Expect = 5.2
Identities = 13/42 (30%), Positives = 20/42 (47%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
KK KKK K K+ K +++ ++K+ K K KK
Sbjct: 3 FLLKKMYNKKKNRKSNIKENKWEERVIQIKRVSKVVKGGKKL 44
>gnl|CDD|232969 TIGR00422, valS, valyl-tRNA synthetase. The valyl-tRNA synthetase
(ValS) is a class I amino acyl-tRNA ligase and is
particularly closely related to the isoleucyl tRNA
synthetase [Protein synthesis, tRNA aminoacylation].
Length = 861
Score = 26.2 bits (58), Expect = 5.5
Identities = 20/111 (18%), Positives = 44/111 (39%), Gaps = 11/111 (9%)
Query: 14 DEIVQVLGDDPETVPTYDQIQELHLLTRVIKETLRLYPAAPIIGRHITSEIQADKYTIPI 73
E + L + + E+ ++ + T + P I P+
Sbjct: 760 AETAERLKLNAVDIKGAINFSEVEVVIEKPEVTEAVVELVPGFEIII-----------PV 808
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
NK K+ + Q + K++K+ + + K++ + KK K+ +K+K+K
Sbjct: 809 KGLINKAKELARLQKQLDKEKKEVIRIEGKLENEGFVKKAPKEVIEKEKEK 859
>gnl|CDD|205286 pfam13105, DUF3959, Protein of unknown function (DUF3959). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 260 amino acids in length.
Length = 242
Score = 26.0 bits (57), Expect = 5.9
Identities = 18/59 (30%), Positives = 26/59 (44%), Gaps = 7/59 (11%)
Query: 73 IGRKENKKKKQKKKQTKKQKKQKKKKQK-------KQKMKKQKKTKKKKKKKQKKQKKK 124
I RKE K+ K++KK+KK K K KMK K +K K ++ +
Sbjct: 150 IHRKEKDDNKENINLFKQKKKRKKLSFKIRLPRLPKLKMKLFKFGRKPSNSKDPRKYEH 208
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 26.0 bits (57), Expect = 6.2
Identities = 16/39 (41%), Positives = 26/39 (66%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKK 113
++E + KKQ+ +Q K + +KKKK KK+K+ + K KK
Sbjct: 49 KREFELKKQQFEQLKAELSKKKKKFKKEKVDVRVKVVKK 87
>gnl|CDD|181988 PRK09609, PRK09609, hypothetical protein; Provisional.
Length = 312
Score = 25.8 bits (57), Expect = 6.3
Identities = 17/52 (32%), Positives = 27/52 (51%)
Query: 73 IGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
I R +NK KQK++ + +K +K K K KKKKK ++ ++K
Sbjct: 114 IKRYDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTNEEKS 165
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 25.9 bits (57), Expect = 6.4
Identities = 11/43 (25%), Positives = 16/43 (37%)
Query: 78 NKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
K+ ++ K K +K K K KK KK +K
Sbjct: 172 KAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 26.1 bits (58), Expect = 6.4
Identities = 7/57 (12%), Positives = 15/57 (26%)
Query: 68 KYTIPIGRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
YT +++ K + + + K +K K + K
Sbjct: 763 NYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAADPDAVAAKVDGVS 819
>gnl|CDD|235325 PRK04974, PRK04974, glycerol-3-phosphate acyltransferase;
Validated.
Length = 818
Score = 26.0 bits (58), Expect = 6.5
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 9/48 (18%)
Query: 11 EVYDEIVQVL--------GDDPETVPTYDQIQELHLLTRVIKETLRLY 50
EV D+I+ L DD + + +EL LL ++ETL+ Y
Sbjct: 668 EVIDQIIDELVRQGLITLDDDELHINP-PRSRELQLLAAGVRETLQRY 714
>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
protein; Provisional.
Length = 438
Score = 25.9 bits (57), Expect = 6.5
Identities = 14/39 (35%), Positives = 21/39 (53%)
Query: 83 QKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
+ Q QK + KKK+KK+K + K+KKK+K
Sbjct: 3 NAQGQGNNQKDKNKKKEKKKKESPPPPHEIKRKKKRKGP 41
>gnl|CDD|188606 TIGR04091, LTA_dltB, D-alanyl-lipoteichoic acid biosynthesis
protein DltB. Members of this protein family are DltB,
part of a four-gene operon for D-alanyl-lipoteichoic
acid biosynthesis that is present in the vast majority
of low-GC Gram-positive organisms. This protein may be
involved in transport of D-alanine across the plasma
membrane [Cell envelope, Biosynthesis and degradation of
murein sacculus and peptidoglycan].
Length = 380
Score = 26.0 bits (58), Expect = 6.6
Identities = 8/14 (57%), Positives = 11/14 (78%)
Query: 107 QKKTKKKKKKKQKK 120
Q+K+KK KK K+ K
Sbjct: 342 QRKSKKYKKNKKNK 355
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 25.8 bits (57), Expect = 6.7
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKT 110
++ + K+K++ KKQKK+K KK K T
Sbjct: 342 SPPAAFERPLALSPKRKREGDKKQKKKKSKKLKLT 376
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 26.2 bits (57), Expect = 6.9
Identities = 10/41 (24%), Positives = 22/41 (53%)
Query: 81 KKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQ 121
K+ KK + ++K ++ K +KK +K++K K+ +
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDE 148
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 25.7 bits (57), Expect = 6.9
Identities = 10/31 (32%), Positives = 22/31 (70%)
Query: 90 KQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
Q +++K++KK K++KK K++K+K ++
Sbjct: 91 YQYFEEEKEKKKAMSKEEKKAIKEEKEKLEE 121
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 25.9 bits (57), Expect = 7.3
Identities = 16/47 (34%), Positives = 22/47 (46%)
Query: 76 KENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQK 122
K +KKKK K TK + ++ KK KK +KK + K K
Sbjct: 418 KGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVP 464
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 25.6 bits (56), Expect = 7.7
Identities = 10/51 (19%), Positives = 25/51 (49%)
Query: 74 GRKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
R +K+K++ + ++K K+ ++KKQ++++ + K Q
Sbjct: 161 KRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGL 211
>gnl|CDD|215581 PLN03109, PLN03109, ETHYLENE-INSENSITIVE3-like3 protein;
Provisional.
Length = 599
Score = 25.9 bits (57), Expect = 8.0
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 80 KKKQKKKQTKKQKKQKKKKQKKQKMKKQ 107
K++QKK Q +K K KK Q +K+
Sbjct: 57 KERQKKLQQAALEKSKPKKISDQARRKK 84
>gnl|CDD|236172 PRK08173, PRK08173, DNA topoisomerase III; Validated.
Length = 862
Score = 25.8 bits (57), Expect = 8.1
Identities = 10/48 (20%), Positives = 16/48 (33%)
Query: 77 ENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
E + ++ K KK + + K K KK KK +
Sbjct: 812 EFEPREPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAAR 859
Score = 25.8 bits (57), Expect = 9.2
Identities = 10/46 (21%), Positives = 15/46 (32%)
Query: 75 RKENKKKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKK 120
R+ +K K ++ K KKT KK +K
Sbjct: 816 REPKAAAAKKTAAKATAAAATKAEKAAAKKAPAKKTAAKKTAARKT 861
>gnl|CDD|198457 cd10314, FAM20_C, C-terminal putative kinase domain of FAM20
(family with sequence similarity 20) proteins. This
family contains the C-terminal domain of FAM20A, -B, -C
and related proteins. FAM20A may participate in enamel
development and gingival homeostasis, FAM20B in
proteoglycan production, and FAM20C in bone development.
FAM20B is a xylose kinase that may regulate the number
of glycosaminoglycan chains by phosphorylating the
xylose residue in the glycosaminoglycan-protein linkage
region of proteoglycans. FAM20C, also called Dentin
Matrix Protein 4, is abundant in the dentin matrix, and
may participate in the differentiation of mesenchymal
precursor cells into functional odontoblast-like cells.
Mutations in FAM20C are associated with lethal
Osteosclerotic Bone Dysplasia (Raine Syndrome), and
mutations in FAM20A with Amelogenesis imperfecta (AI)
and Gingival Hyperplasia Syndrome. The C-terminal
domains of members of this family are putative kinase
domains, based on mutagenesis of the C-terminal domain
of Drosophila Four-Jointed, a related Golgi kinase. This
domain family is also known as DUF1193.
Length = 209
Score = 25.3 bits (56), Expect = 8.3
Identities = 8/32 (25%), Positives = 19/32 (59%), Gaps = 4/32 (12%)
Query: 29 TYDQIQELH----LLTRVIKETLRLYPAAPII 56
T+ ++Q L L+ +++E+L P +P++
Sbjct: 151 TWLRLQLLAKGGGSLSDLLRESLSHDPLSPVL 182
>gnl|CDD|234941 PRK01315, PRK01315, putative inner membrane protein translocase
component YidC; Provisional.
Length = 329
Score = 25.5 bits (56), Expect = 8.4
Identities = 7/31 (22%), Positives = 20/31 (64%)
Query: 87 QTKKQKKQKKKKQKKQKMKKQKKTKKKKKKK 117
T+ +++ K++ ++Q+ K+Q + K++K
Sbjct: 297 TTEVREQTKRQTVQRQQPKRQSRAKRQKGGA 327
>gnl|CDD|203260 pfam05496, RuvB_N, Holliday junction DNA helicase ruvB N-terminus.
The RuvB protein makes up part of the RuvABC revolvasome
which catalyzes the resolution of Holliday junctions
that arise during genetic recombination and DNA repair.
Branch migration is catalyzed by the RuvB protein that
is targeted to the Holliday junction by the structure
specific RuvA protein. This family contains the
N-terminal region of the protein.
Length = 231
Score = 25.5 bits (57), Expect = 8.9
Identities = 10/20 (50%), Positives = 13/20 (65%), Gaps = 2/20 (10%)
Query: 33 IQELHLLTRVIKETLRLYPA 52
I E+H L R ++E LYPA
Sbjct: 107 IDEIHRLNRAVEEI--LYPA 124
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 25.4 bits (56), Expect = 9.0
Identities = 9/45 (20%), Positives = 27/45 (60%)
Query: 80 KKKQKKKQTKKQKKQKKKKQKKQKMKKQKKTKKKKKKKQKKQKKK 124
++K+K+ + +Q +++K+ K+++ +K + +K+ +K K
Sbjct: 327 REKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLKSN 371
>gnl|CDD|216690 pfam01767, Birna_VP3, Birnavirus VP3 protein. VP3 is a minor
structural component of the virus. The large RNA segment
of birnaviruses codes for a polyprotein
(N-VP2-VP4-VP3-C).
Length = 229
Score = 25.3 bits (55), Expect = 9.4
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 10 QEVYDEIVQVLGDDPETVPTYDQIQELHLLTRVIK 44
YDE+ V ++ P +Q+++L LL R +K
Sbjct: 193 DAFYDEVAAVYAENQGRGPNQEQMRDLRLLARRMK 227
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 25.7 bits (57), Expect = 9.8
Identities = 10/37 (27%), Positives = 20/37 (54%), Gaps = 2/37 (5%)
Query: 86 KQTKKQKKQKKKKQKKQKMKKQKKTKK--KKKKKQKK 120
++ KK KK +K+ + ++ KK K+ +KQ+
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.308 0.126 0.331
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,070,789
Number of extensions: 566546
Number of successful extensions: 15173
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8627
Number of HSP's successfully gapped: 2651
Length of query: 124
Length of database: 10,937,602
Length adjustment: 85
Effective length of query: 39
Effective length of database: 7,167,512
Effective search space: 279532968
Effective search space used: 279532968
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 53 (24.0 bits)