RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7478
(180 letters)
>gnl|CDD|215772 pfam00179, UQ_con, Ubiquitin-conjugating enzyme. Proteins destined
for proteasome-mediated degradation may be
ubiquitinated. Ubiquitination follows conjugation of
ubiquitin to a conserved cysteine residue of UBC
homologues. TSG101 is one of several UBC homologues that
lacks this active site cysteine.
Length = 139
Score = 72.6 bits (179), Expect = 7e-17
Identities = 26/97 (26%), Positives = 47/97 (48%), Gaps = 17/97 (17%)
Query: 91 TLSVQIPLHRVHEQGVPPVV------YPPSIYNHQSTLYSGTVCLNVINQE-WTPLYDLS 143
L ++ P + PP V Y P++ SG +CL+++ E W+P +
Sbjct: 47 KLDIEFPEDYPFK---PPKVKFTTKIYHPNVDP------SGEICLDILKDENWSPALTIE 97
Query: 144 NIFETFLPQLLTYPNPTDPLNGDAAAMYLDKPDKYKE 180
+ + LL+ PNP DPLN +AA +Y +++++
Sbjct: 98 QVL-LSIQSLLSEPNPEDPLNAEAAKLYRKNREEFEK 133
>gnl|CDD|238117 cd00195, UBCc, Ubiquitin-conjugating enzyme E2, catalytic (UBCc)
domain. This is part of the ubiquitin-mediated protein
degradation pathway in which a thiol-ester linkage forms
between a conserved cysteine and the C-terminus of
ubiquitin and complexes with ubiquitin protein ligase
enzymes, E3. This pathway regulates many fundamental
cellular processes. There are also other E2s which form
thiol-ester linkages without the use of E3s as well as
several UBC homologs (TSG101, Mms2, Croc-1 and similar
proteins) which lack the active site cysteine essential
for ubiquitination and appear to function in DNA repair
pathways which were omitted from the scope of this CD.
Length = 141
Score = 69.2 bits (170), Expect = 2e-15
Identities = 25/81 (30%), Positives = 43/81 (53%), Gaps = 14/81 (17%)
Query: 107 PPVV------YPPSIYNHQSTLYSGTVCLNVINQE-WTPLYDLSNIFETFLPQLLTYPNP 159
PP V Y P++ +G +CL+++ W+P Y L + + LL PNP
Sbjct: 62 PPKVRFVTKIYHPNVDE------NGKICLSILKTHGWSPAYTLRTVLLSLQS-LLNEPNP 114
Query: 160 TDPLNGDAAAMYLDKPDKYKE 180
+DPLN +AA +Y + +++K+
Sbjct: 115 SDPLNAEAAKLYKENREEFKK 135
>gnl|CDD|227410 COG5078, COG5078, Ubiquitin-protein ligase [Posttranslational
modification, protein turnover, chaperones].
Length = 153
Score = 68.1 bits (167), Expect = 6e-15
Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 6/74 (8%)
Query: 112 PPSIY-----NHQSTLYSGTVCLNVINQEWTPLYDLSNIFETFLPQLLTYPNPTDPLNGD 166
PP + H + SG VCL+++ W+P+Y L I + LL+ PNP PLN +
Sbjct: 69 PPKVRFTTKIFHPNVDPSGNVCLDILKDRWSPVYTLETILLSLQSLLLS-PNPDSPLNTE 127
Query: 167 AAAMYLDKPDKYKE 180
AA +Y + ++Y++
Sbjct: 128 AATLYREDKEEYEK 141
>gnl|CDD|214562 smart00212, UBCc, Ubiquitin-conjugating enzyme E2, catalytic domain
homologues. Proteins destined for proteasome-mediated
degradation may be ubiquitinated. Ubiquitination follows
conjugation of ubiquitin to a conserved cysteine residue
of UBC homologues. This pathway functions in regulating
many fundamental processes required for cell
viability.TSG101 is one of several UBC homologues that
lacks this active site cysteine.
Length = 145
Score = 60.4 bits (147), Expect = 3e-12
Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 14/81 (17%)
Query: 107 PPVV------YPPSIYNHQSTLYSGTVCLNVINQE-WTPLYDLSNIFETFLPQLLTYPNP 159
PP V Y P++ + SG +CL+++ QE W+P L + + L LL+ PNP
Sbjct: 61 PPKVKFITKIYHPNVDS------SGEICLDILKQEKWSPALTLETVLLS-LQSLLSEPNP 113
Query: 160 TDPLNGDAAAMYLDKPDKYKE 180
PLN DAA +Y +++K+
Sbjct: 114 DSPLNADAAELYKKNREEFKK 134
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 44.5 bits (106), Expect = 8e-06
Identities = 17/56 (30%), Positives = 38/56 (67%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
+ K KK++EE+ E+ +++EE+++EEK+K++++KEEE+++ E++
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Score = 43.4 bits (103), Expect = 3e-05
Identities = 20/63 (31%), Positives = 37/63 (58%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
L S K +K +++ +EE +++ +KK+EE+++EK+KKEEEK++E E+
Sbjct: 402 LTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Query: 61 RRT 63
Sbjct: 462 AEE 464
Score = 43.0 bits (102), Expect = 3e-05
Identities = 23/50 (46%), Positives = 38/50 (76%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ + +K+KK++ EEEEEE+EKK+EEK++E+++ EEEK++E EK
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
Score = 37.6 bits (88), Expect = 0.002
Identities = 19/45 (42%), Positives = 35/45 (77%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
+ +KK++EEE+E+ EE+EEEEE+ +EEK++E++KK+++
Sbjct: 433 KAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477
Score = 28.3 bits (64), Expect = 2.3
Identities = 13/34 (38%), Positives = 19/34 (55%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
EEE E K+ KK KK E+ +KK E+++
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKE 429
Score = 26.4 bits (59), Expect = 9.6
Identities = 18/58 (31%), Positives = 25/58 (43%), Gaps = 4/58 (6%)
Query: 32 EEEEEE----KKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRPTRCVEGE 85
EEE E KK KK +K ++ EKK+E EK+ GK + + E
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452
>gnl|CDD|177768 PLN00172, PLN00172, ubiquitin conjugating enzyme; Provisional.
Length = 147
Score = 41.7 bits (97), Expect = 3e-05
Identities = 22/73 (30%), Positives = 42/73 (57%), Gaps = 2/73 (2%)
Query: 107 PPVVYPPSIYNHQSTLYSGTVCLNVINQEWTPLYDLSNIFETFLPQLLTYPNPTDPLNGD 166
P V + IY H + +G++CL+++ +W+P +S + + + LLT PNP DPL +
Sbjct: 65 PKVQFTTKIY-HPNINSNGSICLDILRDQWSPALTVSKVLLS-ISSLLTDPNPDDPLVPE 122
Query: 167 AAAMYLDKPDKYK 179
A ++ + +Y+
Sbjct: 123 IARVFKENRSRYE 135
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 42.1 bits (99), Expect = 7e-05
Identities = 18/54 (33%), Positives = 36/54 (66%)
Query: 6 TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+V + ++K+E+++EE ++EEE +E++++E+KK+K KK +E E E
Sbjct: 32 EKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85
Score = 35.1 bits (81), Expect = 0.014
Identities = 18/44 (40%), Positives = 33/44 (75%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
KE E E EEEE+EE+K++EEK +K+++ +E++++ EK++
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKK 72
Score = 34.4 bits (79), Expect = 0.023
Identities = 21/55 (38%), Positives = 34/55 (61%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
V K ++ EEE+EE +EEEE+ K+EE +E++K+E++KK + K T
Sbjct: 27 VEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETT 81
Score = 32.8 bits (75), Expect = 0.088
Identities = 17/47 (36%), Positives = 30/47 (63%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
K ++++++++ D+E EEEE+EE+KKK +K KE + E K
Sbjct: 43 EKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNK 89
Score = 29.0 bits (65), Expect = 1.3
Identities = 18/51 (35%), Positives = 35/51 (68%), Gaps = 1/51 (1%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
++ +KE DEE E+EE++EE++K K+E+ +EEEK+++ +K + +
Sbjct: 29 KEVEKEVPDEEEEE-EKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVK 78
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 41.5 bits (97), Expect = 1e-04
Identities = 18/48 (37%), Positives = 36/48 (75%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
+K + GG+ EEEEEE+++EE+++E++++EEE+++E E+ +L
Sbjct: 849 EKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSL 896
Score = 39.6 bits (92), Expect = 5e-04
Identities = 18/47 (38%), Positives = 34/47 (72%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
K+ +K + GG EEEEEEE+++EE+++E++++EEE+++ E
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 35.7 bits (82), Expect = 0.011
Identities = 16/44 (36%), Positives = 35/44 (79%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
K+ E+ + GGG + + EEE+++EE+++E++++EEE+++E E+
Sbjct: 846 KQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 34.6 bits (79), Expect = 0.026
Identities = 19/44 (43%), Positives = 32/44 (72%), Gaps = 4/44 (9%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+EEE+EE EEEEEEEE+++EE+++E++ +E + E R+
Sbjct: 864 EEEEEEE----EEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQ 903
Score = 31.9 bits (72), Expect = 0.18
Identities = 16/42 (38%), Positives = 30/42 (71%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
++++EEE+EE EEEEEEEE+++E ++ + E ++K+
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 30.7 bits (69), Expect = 0.42
Identities = 16/47 (34%), Positives = 26/47 (55%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+R + + E EE GG E+E E E K E + + + E + ++EGE
Sbjct: 650 ERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGE 696
Score = 27.7 bits (61), Expect = 4.7
Identities = 12/47 (25%), Positives = 22/47 (46%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
R ++ E E G EE E ++E + + K + E E + E++
Sbjct: 644 RTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERK 690
Score = 27.3 bits (60), Expect = 5.2
Identities = 17/45 (37%), Positives = 24/45 (53%), Gaps = 5/45 (11%)
Query: 20 EEEDEEGGGGEEEEEEEE-----KKKEEKKKEKKKKEEEKKKEGE 59
E E E G GEE E+E E K + E + ++K+ E E + E E
Sbjct: 729 EGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAE 773
Score = 27.3 bits (60), Expect = 6.5
Identities = 15/47 (31%), Positives = 22/47 (46%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
K K E E EE E E E + + E + ++ +E E + EGE
Sbjct: 702 KEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGE 748
Score = 26.9 bits (59), Expect = 7.2
Identities = 14/42 (33%), Positives = 24/42 (57%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+ EE ++EG G E + E E + + K+ E + + E + KE E
Sbjct: 737 EGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDE 778
Score = 26.9 bits (59), Expect = 8.7
Identities = 16/46 (34%), Positives = 26/46 (56%), Gaps = 3/46 (6%)
Query: 17 KKKEEEDEEGG---GGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+ +E EDE G G E E E ++K+ E + E + + +E + EGE
Sbjct: 737 EGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGE 782
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 38.1 bits (89), Expect = 0.002
Identities = 15/35 (42%), Positives = 25/35 (71%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
E+EE+E K++++ ++ KK+EE+KKKE EK
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 32.3 bits (74), Expect = 0.11
Identities = 15/42 (35%), Positives = 28/42 (66%), Gaps = 7/42 (16%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+++KEE+ E +E++ +K +K++EKKKKE EK ++
Sbjct: 554 QREKEEK-------EALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 30.4 bits (69), Expect = 0.57
Identities = 15/37 (40%), Positives = 25/37 (67%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKK 49
+R K++KE E+ + +++EE+KKKE +K EK K
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 28.5 bits (64), Expect = 2.6
Identities = 7/33 (21%), Positives = 21/33 (63%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
+++EE ++E+++KE K+++ +K ++
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKK 580
Score = 27.7 bits (62), Expect = 3.7
Identities = 15/48 (31%), Positives = 30/48 (62%), Gaps = 4/48 (8%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
K + +++K+E+E + E++ + KK+EEKKK++ +K E+ K
Sbjct: 547 LDDKEELQREKEEKEALK----EQKRLRKLKKQEEKKKKELEKLEKAK 590
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 37.0 bits (86), Expect = 0.002
Identities = 23/50 (46%), Positives = 35/50 (70%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+Y KKKK+E EE ++E EE++K K +KKK KKKK+++K K+ +K
Sbjct: 56 EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDK 105
Score = 34.3 bits (79), Expect = 0.018
Identities = 8/45 (17%), Positives = 32/45 (71%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+++K K +++ + ++++++++KK ++ +K+ +K+ E+K ++
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 33.9 bits (78), Expect = 0.023
Identities = 12/47 (25%), Positives = 32/47 (68%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+Y+ ++K K ++ + ++++++++ KK++K ++K +KE E K E
Sbjct: 77 EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123
Score = 32.4 bits (74), Expect = 0.071
Identities = 13/67 (19%), Positives = 36/67 (53%)
Query: 7 TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
+ + ++ K KK++ ++ ++++++++ K EKK EK+ +++ + TL+
Sbjct: 76 KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLST 135
Query: 67 IRIGKPA 73
+ KP
Sbjct: 136 LSELKPR 142
Score = 32.0 bits (73), Expect = 0.10
Identities = 13/47 (27%), Positives = 32/47 (68%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
K KKE E+++ ++++ +++K K++ KK+ KK ++ +KK+ ++
Sbjct: 72 EKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118
Score = 31.6 bits (72), Expect = 0.13
Identities = 11/45 (24%), Positives = 28/45 (62%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
KK+ EE+ + ++ +++++K K++K +K K E+K ++ +
Sbjct: 75 KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119
Score = 29.3 bits (66), Expect = 0.91
Identities = 11/50 (22%), Positives = 29/50 (58%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
+ +K +++ E + ++++ KKK++K K+KK +++ K E + +
Sbjct: 69 EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118
Score = 28.1 bits (63), Expect = 2.2
Identities = 11/45 (24%), Positives = 28/45 (62%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+K ++E EE + ++++ +KKK++ K +K K+++K ++
Sbjct: 68 AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEK 112
Score = 26.6 bits (59), Expect = 6.4
Identities = 11/56 (19%), Positives = 25/56 (44%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
K++ K K+++D++ E+++E+E + + + K E K R
Sbjct: 90 KSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRKYA 145
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 37.9 bits (88), Expect = 0.002
Identities = 14/48 (29%), Positives = 30/48 (62%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
+V ++ E + +E +EE E+++K++ ++ KEK++K+E KK
Sbjct: 387 EVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 37.5 bits (87), Expect = 0.002
Identities = 15/50 (30%), Positives = 31/50 (62%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K+ + K+E D + +EE+ K++E +KK+K++ +E+K+K + R
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDER 432
Score = 32.8 bits (75), Expect = 0.075
Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 3/51 (5%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEE---EEEEKKKEEKKKEKKKKEEEKKKEG 58
K K + + +E +EE+ EE EKK++E+ E K+K ++ +++
Sbjct: 384 KLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 28.6 bits (64), Expect = 1.8
Identities = 13/48 (27%), Positives = 27/48 (56%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKK 49
+ T S+ K K+E+ +E +++E+ +E K++ +K E+KK
Sbjct: 387 EVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 28.6 bits (64), Expect = 2.1
Identities = 15/48 (31%), Positives = 30/48 (62%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
K KK ++ ++ G ++E + E+ + + K+EK K+EE +KK+ E+
Sbjct: 372 KEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQ 419
Score = 27.5 bits (61), Expect = 4.2
Identities = 8/48 (16%), Positives = 25/48 (52%)
Query: 3 NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
+ + + K +EE + E++++E+ + +EK+++ ++K
Sbjct: 386 GEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 36.5 bits (85), Expect = 0.002
Identities = 20/59 (33%), Positives = 35/59 (59%), Gaps = 2/59 (3%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEK--KKEEKKKEKKKKEEEKKKEGEKRRTL 64
+V K KK++EE+EE EE +EEE+ + EK+ K K+E+ ++ E +++ L
Sbjct: 89 KVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEIL 147
Score = 34.6 bits (80), Expect = 0.010
Identities = 16/51 (31%), Positives = 32/51 (62%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
K +++++EEE E EEE+ +E +KE K +++K+ E ++K+ E +
Sbjct: 99 KEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILKE 149
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 37.4 bits (87), Expect = 0.002
Identities = 12/48 (25%), Positives = 21/48 (43%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKK 48
LQN T+ + +++ E G + EE ++ KE + K K
Sbjct: 895 LQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHKG 942
Score = 30.5 bits (69), Expect = 0.50
Identities = 8/59 (13%), Positives = 23/59 (38%), Gaps = 2/59 (3%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKK--EEKKKEKKKKEEEKKKE 57
++ T +Y + + E EE+ +++ + +E K+ +E + +
Sbjct: 881 IRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDK 939
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 37.0 bits (86), Expect = 0.002
Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 6/49 (12%)
Query: 20 EEEDEEGGGGEEEEEEE------EKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ D++ + E+E EK K+E+ +EK+++EEEK E EK R
Sbjct: 129 DSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAR 177
Score = 26.2 bits (58), Expect = 9.0
Identities = 11/25 (44%), Positives = 19/25 (76%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEK 54
+E EE+E+++EEK E++K EE+
Sbjct: 156 KERAEEKEREEEEKAAEEEKAREEE 180
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 36.2 bits (84), Expect = 0.004
Identities = 13/37 (35%), Positives = 18/37 (48%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
++ EE E KEE+ E +KKE +KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 35.8 bits (83), Expect = 0.006
Identities = 13/46 (28%), Positives = 21/46 (45%), Gaps = 5/46 (10%)
Query: 12 YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
Y K K E+ EE E +E++ + +K+E KKK+
Sbjct: 156 YAEWKDAKLLEE-----FAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 28.5 bits (64), Expect = 1.8
Identities = 9/32 (28%), Positives = 15/32 (46%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
++ + EE EE + KEEE + +K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKE 191
Score = 27.3 bits (61), Expect = 3.4
Identities = 11/31 (35%), Positives = 16/31 (51%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
EE EE + KE++ E EKK+ +K
Sbjct: 165 LEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 37.0 bits (85), Expect = 0.004
Identities = 21/54 (38%), Positives = 32/54 (59%), Gaps = 4/54 (7%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEE----KKKEKKKKEEEKKKEGEKRR 62
+ KK EED++ ++ EE+EKK E + +E KK EE KKKE E+++
Sbjct: 1663 AAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKK 1716
Score = 34.0 bits (77), Expect = 0.036
Identities = 19/50 (38%), Positives = 32/50 (64%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K+ ++ K++E EE EE ++ EE+ K + +E KK EE+KKK E ++
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682
Score = 34.0 bits (77), Expect = 0.038
Identities = 24/56 (42%), Positives = 32/56 (57%), Gaps = 4/56 (7%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE----KKKEKKKKEEEKKKEGEKRR 62
K K + KK EEDE+ ++E EE KK E K+ E+KKK EE KK E+ +
Sbjct: 1674 KKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729
Score = 34.0 bits (77), Expect = 0.040
Identities = 18/52 (34%), Positives = 28/52 (53%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K KKK EE ++ ++ E ++K E KK E+ KK +E KK E ++
Sbjct: 1484 KADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535
Score = 33.6 bits (76), Expect = 0.050
Identities = 20/55 (36%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
K + +KKKEE ++ +++ EE++K E KKK +KKK +E KK K++
Sbjct: 1365 KAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419
Score = 33.6 bits (76), Expect = 0.053
Identities = 22/60 (36%), Positives = 34/60 (56%), Gaps = 3/60 (5%)
Query: 6 TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
+ K KKK EE ++ +++ EE++KK +E KK KKK +E KKK EK++
Sbjct: 1373 KEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432
Score = 33.6 bits (76), Expect = 0.060
Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK--EKKKKEEEKKKEGEKRR 62
+ K KKK EE+ ++ ++ ++K E KKK EKKK +E KKK E ++
Sbjct: 1389 EKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKK 1445
Score = 33.2 bits (75), Expect = 0.062
Identities = 21/71 (29%), Positives = 36/71 (50%), Gaps = 3/71 (4%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKE 57
L+ + K + KK +EE + ++ EE++KK EE KK ++KK E KKE
Sbjct: 1638 LKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKE 1697
Query: 58 GEKRRTLTFIR 68
E+ + ++
Sbjct: 1698 AEEAKKAEELK 1708
Score = 32.8 bits (74), Expect = 0.083
Identities = 16/53 (30%), Positives = 26/53 (49%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+K K KK+ E+ + + E E E + E+K + EKKKE K++
Sbjct: 1327 AKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Score = 32.4 bits (73), Expect = 0.12
Identities = 20/56 (35%), Positives = 34/56 (60%), Gaps = 4/56 (7%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK----EKKKKEEEKKKEGEKRR 62
K ++ KKK+ EE ++ ++ EEE + K E+ K +KKK EE KK E ++++
Sbjct: 1634 KVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689
Score = 32.4 bits (73), Expect = 0.13
Identities = 11/53 (20%), Positives = 24/53 (45%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
K K + KK+ + + E ++ E K E + + + E+K E +++
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373
Score = 32.4 bits (73), Expect = 0.14
Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE---KKKEKKKKEEEKKKEGEKRR 62
+ E EE E+++EE KKK + KK E+KKK +E KK+ E+ +
Sbjct: 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDK 1404
Score = 32.4 bits (73), Expect = 0.14
Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK--EKKKKEEEKKKEGEKRR 62
+ K KKK +E ++ +++ +E +KK EEKKK E KKK EE KK E ++
Sbjct: 1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKK 1451
Score = 32.0 bits (72), Expect = 0.18
Identities = 20/55 (36%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
K K + KK +E ++ ++ +E +KK EE KK E KK E KKK E ++
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKK 1517
Score = 32.0 bits (72), Expect = 0.20
Identities = 22/64 (34%), Positives = 34/64 (53%), Gaps = 11/64 (17%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE-----------KKKEKKKKEEEKKKEG 58
+K K + KK EE+++ +++E EEKKK E K E+ KK EE KK+
Sbjct: 1618 AKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKA 1677
Query: 59 EKRR 62
E+ +
Sbjct: 1678 EEAK 1681
Score = 31.6 bits (71), Expect = 0.21
Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK--EKKKKEEEKKKEGEKRR 62
K K + KK +E ++ ++ EE +KK EE KK E KKK EE KK E ++
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKK 1490
Score = 31.6 bits (71), Expect = 0.22
Identities = 18/49 (36%), Positives = 25/49 (51%), Gaps = 3/49 (6%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
KKK EE ++ +++ EE +KK + KK E KK E K E E
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAA 1356
Score = 31.6 bits (71), Expect = 0.23
Identities = 15/52 (28%), Positives = 24/52 (46%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K KKK EE ++ +++ EE +K E K E + +E + EK
Sbjct: 1316 KADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
Score = 31.6 bits (71), Expect = 0.24
Identities = 12/56 (21%), Positives = 25/56 (44%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
+ K KKK + ++ ++ E K + E ++ + EEK + EK++
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKE 1374
Score = 31.6 bits (71), Expect = 0.26
Identities = 23/53 (43%), Positives = 35/53 (66%), Gaps = 1/53 (1%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKK-EEKKKEKKKKEEEKKKEGE 59
+ +K K + KKE E+++ E +++EEEKKK KKE++KK EE +KE E
Sbjct: 1726 EENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKE 1778
Score = 31.3 bits (70), Expect = 0.27
Identities = 19/53 (35%), Positives = 30/53 (56%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+K K + KK E ++ ++ EE +K E KK E+ KK +E KK EK++
Sbjct: 1495 AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKK 1547
Score = 31.3 bits (70), Expect = 0.28
Identities = 21/55 (38%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
++ K + +KKK EE ++ + + EE KK+ E ++KKK EE KK E EK++
Sbjct: 1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE--EDKKKAEEAKKDEEEKKK 1758
Score = 30.9 bits (69), Expect = 0.36
Identities = 21/61 (34%), Positives = 32/61 (52%), Gaps = 9/61 (14%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---------EKKKKEEEKKKEGEKR 61
K K +KKK +E ++ ++ +E +KK EE KK E KK +E KKK E +
Sbjct: 1424 KKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAK 1483
Query: 62 R 62
+
Sbjct: 1484 K 1484
Score = 30.9 bits (69), Expect = 0.39
Identities = 16/52 (30%), Positives = 27/52 (51%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K + K E + E EE+ E +++K++ KKK + KKK EK++
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392
Score = 30.9 bits (69), Expect = 0.41
Identities = 17/55 (30%), Positives = 31/55 (56%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ K KKK +E ++ +++ +E +K +E KK ++ KK EE KK E ++
Sbjct: 1487 EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541
Score = 30.9 bits (69), Expect = 0.46
Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEE--KKKEKKKKEEEKKKEGEKRR 62
+ +K + ++ EE +E ++ EEKKK + KK E+ KK EEKKK E ++
Sbjct: 1520 EAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKK 1571
Score = 30.5 bits (68), Expect = 0.55
Identities = 21/74 (28%), Positives = 40/74 (54%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
L+ + K + KK +EE + ++E EE++KK EE KK++++K++ + E+
Sbjct: 1707 LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEE 1766
Query: 61 RRTLTFIRIGKPAL 74
+ IR K A+
Sbjct: 1767 EKKAEEIRKEKEAV 1780
Score = 30.5 bits (68), Expect = 0.58
Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 1/56 (1%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK-EKKKKEEEKKKEGEKRR 62
+ K KK +E ++ +++ +E +K E KKK ++ KK EE KK E ++
Sbjct: 1474 EAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKK 1529
Score = 30.1 bits (67), Expect = 0.66
Identities = 19/49 (38%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEE--KKKEKKKKEEEKKKEGEKR 61
+K ++ ++ +E EE+++ +E KK E KK E+KKK EE KK E +
Sbjct: 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDK 1576
Score = 30.1 bits (67), Expect = 0.70
Identities = 14/56 (25%), Positives = 24/56 (42%), Gaps = 3/56 (5%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE---KKKKEEEKKKEGEK 60
+ K KK +E ++ +++ + +KK EE KK K + E E E
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361
Score = 29.7 bits (66), Expect = 0.87
Identities = 20/51 (39%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK-EKKKKEEEKKKEGEKRR 62
K+ + K++ DE E ++ +E KK EE KK ++ KK EEKKK E ++
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553
Score = 29.7 bits (66), Expect = 0.95
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 3/56 (5%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
+K K + KK+ E+++ +++ EE KK +E KK E KK EE KKK E ++
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471
Score = 29.7 bits (66), Expect = 0.98
Identities = 18/59 (30%), Positives = 32/59 (54%), Gaps = 4/59 (6%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK----KKKEEEKKKEGEKRR 62
++ K + K K E+ + ++++ EE KK EE +K+ KK+ EE KK E ++
Sbjct: 1651 ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKK 1709
Score = 29.3 bits (65), Expect = 1.2
Identities = 19/47 (40%), Positives = 30/47 (63%), Gaps = 1/47 (2%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKK-EKKKKEEEKKKEGEKRR 62
KKK EE ++ +++ EE +K E KKK E+ KK +E KK+ E+ +
Sbjct: 1450 KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAK 1496
Score = 29.3 bits (65), Expect = 1.5
Identities = 19/73 (26%), Positives = 34/73 (46%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIG 70
K KKK EE + ++ EE ++K E KK + KK+ ++ K+ E+ + +
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKA 1530
Query: 71 KPALKPRPTRCVE 83
+ A K + E
Sbjct: 1531 EEAKKADEAKKAE 1543
Score = 29.0 bits (64), Expect = 1.6
Identities = 17/53 (32%), Positives = 30/53 (56%), Gaps = 3/53 (5%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEE---KKKEKKKKEEEKKKEGEKRR 62
K+ + K+ + E +++ EEKKK + KK E+ KK +E KK+ E+ +
Sbjct: 1405 KKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1457
Score = 29.0 bits (64), Expect = 1.8
Identities = 19/53 (35%), Positives = 32/53 (60%), Gaps = 3/53 (5%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEE---KKKEKKKKEEEKKKEGEKRR 62
+ RK + ++ EE +E ++ EEKKK + KK E+ KK +E KK+ E+ +
Sbjct: 1276 EARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328
Score = 28.6 bits (63), Expect = 2.2
Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 2/67 (2%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIG 70
K + ++ K+ E+ + E ++ EEEKKK E+ KKK+ EEKKK E ++ +I
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQL--KKKEAEEKKKAEELKKAEEENKIK 1662
Query: 71 KPALKPR 77
+
Sbjct: 1663 AAEEAKK 1669
Score = 28.6 bits (63), Expect = 2.4
Identities = 20/50 (40%), Positives = 30/50 (60%), Gaps = 2/50 (4%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK--EKKKKEEEKKKEGEKRR 62
+ ++ + DE E+++ +E KK EEKKK E KKK EE KK E ++
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKK 1322
Score = 28.6 bits (63), Expect = 2.5
Identities = 19/54 (35%), Positives = 32/54 (59%), Gaps = 4/54 (7%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK----KKKEEEKKKEGEKRR 62
K+ ++KK+ ++ + E ++ +E KKK E+ K+K KKK EE KK E +
Sbjct: 1296 KKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349
Score = 28.2 bits (62), Expect = 2.7
Identities = 15/50 (30%), Positives = 29/50 (58%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
++ + ++ +E+ E ++ EE KK EEKKK ++ K+ E+ K R+
Sbjct: 1533 AKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK 1582
Score = 28.2 bits (62), Expect = 2.8
Identities = 17/55 (30%), Positives = 33/55 (60%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+++ + RR+ + E+ ++ EE++K E KK E+KKK +E KK+ E+ +
Sbjct: 1261 RMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAK 1315
Score = 27.8 bits (61), Expect = 4.0
Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 4/54 (7%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE----KKKKEEEKKKEGEKRR 62
K +KKK +E ++ ++ +E +KK EE KK+ KKK EE KK +
Sbjct: 1297 KAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350
Score = 27.4 bits (60), Expect = 5.1
Identities = 16/65 (24%), Positives = 36/65 (55%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKP 72
+++K + ++ EE EE+++ EE KK E+ K ++ E+ K+ E+ R +++ +
Sbjct: 1544 EKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEE 1603
Query: 73 ALKPR 77
K +
Sbjct: 1604 EKKMK 1608
Score = 27.4 bits (60), Expect = 5.4
Identities = 20/55 (36%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
K K + KK EE ++ +++ EE +K E KKK KKK + KKK E ++
Sbjct: 1289 KKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343
Score = 27.4 bits (60), Expect = 6.2
Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 4/55 (7%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEE--KKKEEKKK--EKKKKEEEKKKEGEK 60
+ + +++ E E E +++ + KK EEKKK E KKK EE KK+ ++
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409
Score = 27.4 bits (60), Expect = 6.3
Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 1/68 (1%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEE-KKKEKKKKEEEKKKEGEKRRTLTFIRIGKP 72
R+ + + +EE E + E+ KK E KK E+ KK+ E+ K+ E+ R IR +
Sbjct: 1200 RKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEE 1259
Query: 73 ALKPRPTR 80
A R
Sbjct: 1260 ARMAHFAR 1267
Score = 27.0 bits (59), Expect = 8.4
Identities = 16/52 (30%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK--EKKKKEEEKKKEGEK 60
K K +KKK +E ++ ++++ +E KK KK ++ KK+ E+KK+ ++
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 36.7 bits (85), Expect = 0.004
Identities = 17/66 (25%), Positives = 36/66 (54%), Gaps = 4/66 (6%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKK----KEKKKKEEEKKK 56
Q + ++RKKK++++ EE + E+E K+ E+++ ++KK+ EE K+
Sbjct: 67 QQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ 126
Query: 57 EGEKRR 62
K++
Sbjct: 127 AALKQK 132
Score = 29.8 bits (67), Expect = 0.87
Identities = 12/43 (27%), Positives = 20/43 (46%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
K K E + + ++ E KK+ + + KK E KK+ E
Sbjct: 146 KAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAE 188
Score = 29.4 bits (66), Expect = 0.94
Identities = 10/49 (20%), Positives = 22/49 (44%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+ +++KK+ E+ ++++ EE + K K E E K+
Sbjct: 110 RLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAA 158
Score = 28.6 bits (64), Expect = 1.8
Identities = 15/41 (36%), Positives = 19/41 (46%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
K K E E + ++ E KKK E + KK E KKK
Sbjct: 146 KAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKK 186
Score = 27.5 bits (61), Expect = 4.1
Identities = 16/46 (34%), Positives = 21/46 (45%), Gaps = 1/46 (2%)
Query: 16 KKKKEEEDEEGGGGEEEEE-EEEKKKEEKKKEKKKKEEEKKKEGEK 60
KKK E E E +++ E E KK+ + KKK E K K
Sbjct: 184 KKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAK 229
Score = 27.1 bits (60), Expect = 5.8
Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 4/49 (8%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
K KKK E + ++ E +KK E + K E +KK E E
Sbjct: 162 KAAAEAKKKAEAEAA----KKAAAEAKKKAEAEAAAKAAAEAKKKAEAE 206
Score = 27.1 bits (60), Expect = 5.9
Identities = 11/59 (18%), Positives = 26/59 (44%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
LQ + + K+ +K++ E+ EE ++ K+++ ++ K K + E
Sbjct: 92 LQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAE 150
Score = 27.1 bits (60), Expect = 6.2
Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 4/45 (8%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
KKK E E ++ + E ++K E K K E K EK
Sbjct: 200 KKKAEAEAKK----KAAAEAKKKAAAEAKAAAAKAAAEAKAAAEK 240
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 36.2 bits (84), Expect = 0.005
Identities = 24/114 (21%), Positives = 45/114 (39%), Gaps = 6/114 (5%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
Q + K ++ KK ++D+ E +E + K+ KK+KKKK+++KKK +
Sbjct: 15 QQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEA 74
Query: 62 RTLTFIRIGKPALKPR----PTRCVEGEGAPTRTLSVQ--IPLHRVHEQGVPPV 109
L + + R + + IP+ + + G PV
Sbjct: 75 YDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWKQTQPPTIPVSKQFKDGEYPV 128
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 36.6 bits (85), Expect = 0.005
Identities = 17/30 (56%), Positives = 23/30 (76%), Gaps = 2/30 (6%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ EEK K++KKKEKKK+EE K++ EK R
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKRE--EKAR 766
Score = 32.0 bits (73), Expect = 0.17
Identities = 13/27 (48%), Positives = 23/27 (85%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKE 57
EE+ +++KKKE+KK+E+ K+EE+ + E
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKARIE 768
Score = 30.8 bits (70), Expect = 0.46
Identities = 20/67 (29%), Positives = 27/67 (40%), Gaps = 18/67 (26%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEE------------EKKKEE------KKKEKKKKEE 52
K + + +EE E G GE E ++E EK EE KK K KK+
Sbjct: 1516 KSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQY 1575
Query: 53 EKKKEGE 59
+ E E
Sbjct: 1576 KSNTEAE 1582
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 36.0 bits (83), Expect = 0.006
Identities = 21/60 (35%), Positives = 39/60 (65%)
Query: 3 NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
QV + K++KK+K +E+ + +EE +E+ KE++K+++KK EE + +E EK+R
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166
Score = 34.5 bits (79), Expect = 0.024
Identities = 20/76 (26%), Positives = 43/76 (56%)
Query: 5 STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
+ T+ +K + + KEEE E+ EE+++++EK KEE K K K+E ++K+ +++
Sbjct: 90 AKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKE 149
Query: 65 TFIRIGKPALKPRPTR 80
++ +P + +
Sbjct: 150 KEKKVEEPRDREEEKK 165
Score = 34.1 bits (78), Expect = 0.033
Identities = 22/58 (37%), Positives = 34/58 (58%)
Query: 5 STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
S +K K K+ K E +E +E+ +EE+KKK+EK KE+ K + K++ EKR
Sbjct: 85 SKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142
Score = 32.6 bits (74), Expect = 0.095
Identities = 20/67 (29%), Positives = 31/67 (46%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKP 72
KR +K + +E + E K+EEK+KE+ K+E++KKKE K +
Sbjct: 78 KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEA 137
Query: 73 ALKPRPT 79
K P
Sbjct: 138 KEKRPPK 144
Score = 32.2 bits (73), Expect = 0.12
Identities = 14/57 (24%), Positives = 34/57 (59%)
Query: 6 TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ K KE ++E G E+E+E+ +++K++KK++ K++ +++K + E +
Sbjct: 83 GGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKE 139
Score = 31.8 bits (72), Expect = 0.19
Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 3/55 (5%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEE---EEEKKKEEKKKEKKKKEEEKKKEGEKR 61
K ++K+KE+E + + EEE E + K KK KKK KKKE +
Sbjct: 139 EKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEE 193
Score = 27.9 bits (62), Expect = 3.6
Identities = 14/56 (25%), Positives = 33/56 (58%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
+V K + + + + E +EEE++K++ K+++KKKKE+ K++ +++
Sbjct: 79 RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPK 134
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 36.1 bits (84), Expect = 0.007
Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 18 KKEEEDEEGGGGEEEEEEEEK--KKEEKKKEKKKKEEEKKKEGEKRR 62
E E+EEG G E +K KK+ K ++K +KEE +K +K+
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKA 439
Score = 36.1 bits (84), Expect = 0.007
Identities = 16/47 (34%), Positives = 23/47 (48%), Gaps = 5/47 (10%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
K+RK +K+ E EE E+ +KK E K+ K + E KK
Sbjct: 417 KQRKAEKKAEKEE-----AEKAAAKKKAEAAAKKAKGPDGETKKVDP 458
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 35.7 bits (83), Expect = 0.007
Identities = 18/66 (27%), Positives = 39/66 (59%), Gaps = 11/66 (16%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKK-----------KKEEEKKK 56
+ K + ++++EE+ + E +EE +EKK+E+KK+E++ +K EEK++
Sbjct: 257 VLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKER 316
Query: 57 EGEKRR 62
+ + R+
Sbjct: 317 KKQARK 322
Score = 31.1 bits (71), Expect = 0.24
Identities = 15/39 (38%), Positives = 26/39 (66%), Gaps = 3/39 (7%)
Query: 30 EEEEEEEEKKKEEKKKE---KKKKEEEKKKEGEKRRTLT 65
EEEEE+ K EE+++E +KK+E++K++ K L+
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLS 304
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 33.8 bits (78), Expect = 0.013
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 29 GEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
EEKK+EE+++E+K++ EE+ G
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEEAAAG 100
Score = 31.8 bits (73), Expect = 0.055
Identities = 10/26 (38%), Positives = 18/26 (69%)
Query: 28 GGEEEEEEEEKKKEEKKKEKKKKEEE 53
EE+K++EE+++EK++ EEE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 29.2 bits (66), Expect = 0.52
Identities = 6/26 (23%), Positives = 18/26 (69%)
Query: 27 GGGEEEEEEEEKKKEEKKKEKKKKEE 52
EE+++++EE++++++ +EE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 28.8 bits (65), Expect = 0.76
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGE 59
EEKK+E++++EE+++ E E
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 28.0 bits (63), Expect = 1.3
Identities = 8/25 (32%), Positives = 16/25 (64%)
Query: 22 EDEEGGGGEEEEEEEEKKKEEKKKE 46
E++EEEEE++++E+ +E
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEE 95
Score = 27.2 bits (61), Expect = 2.1
Identities = 12/25 (48%), Positives = 16/25 (64%), Gaps = 4/25 (16%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEE 42
EE +E EEEEEEE+++ EE
Sbjct: 75 AAAEEKKE----EEEEEEEKEESEE 95
Score = 26.8 bits (60), Expect = 3.4
Identities = 10/26 (38%), Positives = 16/26 (61%)
Query: 21 EEDEEGGGGEEEEEEEEKKKEEKKKE 46
++EEEEEE++KEE ++E
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 26.1 bits (58), Expect = 6.5
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEE 42
EE EEEEEE+E+ +EE
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEE 96
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 34.7 bits (80), Expect = 0.013
Identities = 21/62 (33%), Positives = 38/62 (61%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFI 67
++ KR K ++++ + EEEE EE KK EEK++ ++K+EEE ++E EK++
Sbjct: 2 KIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEER 61
Query: 68 RI 69
+
Sbjct: 62 KE 63
Score = 34.3 bits (79), Expect = 0.018
Identities = 22/48 (45%), Positives = 37/48 (77%)
Query: 12 YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+R ++KK EE EG EEEE EEE++K+++++E+K++EE+ +KE E
Sbjct: 26 EEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQE 73
Score = 29.7 bits (67), Expect = 0.59
Identities = 22/52 (42%), Positives = 37/52 (71%), Gaps = 2/52 (3%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEK--KKEKKKKEEEKKKEGEKRR 62
+++++ +EEE EE EE+ E E K++EE ++EKKK+EEE+K+ E+ R
Sbjct: 18 RQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQAR 69
Score = 29.3 bits (66), Expect = 0.82
Identities = 16/43 (37%), Positives = 31/43 (72%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
R+ +++EE+E E+++EEEE+K+ E++ K+++E EK K
Sbjct: 38 REGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKS 80
Score = 28.1 bits (63), Expect = 2.3
Identities = 14/51 (27%), Positives = 29/51 (56%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+ K ++ +EE +++ E +E EE+ +KE+++ EK K ++EG
Sbjct: 39 EGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGT 89
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 33.6 bits (77), Expect = 0.015
Identities = 14/29 (48%), Positives = 23/29 (79%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
EE+ EEK++E+KK+E+K++EEE+ G
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEEALAG 101
Score = 31.3 bits (71), Expect = 0.083
Identities = 14/24 (58%), Positives = 20/24 (83%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEE 53
EE+ E+K+EEKKKE++K+EEE
Sbjct: 72 AAAEEKAEEKEEEKKKEEEKEEEE 95
Score = 30.5 bits (69), Expect = 0.16
Identities = 13/26 (50%), Positives = 20/26 (76%)
Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGE 59
++K E+K+E+KKKEEEK++E E
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEE 96
Score = 30.2 bits (68), Expect = 0.24
Identities = 13/27 (48%), Positives = 21/27 (77%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEGE 59
E+K EEK++EKKK+EE++++E E
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEE 97
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 34.7 bits (79), Expect = 0.018
Identities = 24/89 (26%), Positives = 44/89 (49%), Gaps = 2/89 (2%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
L +S V K++ + K E + E++ ++ KK+EKK+++K+++++KKKE E
Sbjct: 167 LADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKK--PKKKEKKEKEKERDKDKKKEVEG 224
Query: 61 RRTLTFIRIGKPALKPRPTRCVEGEGAPT 89
++L PA E A T
Sbjct: 225 FKSLLLALDDSPASAASVAEADEASLANT 253
Score = 31.6 bits (71), Expect = 0.21
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
EEE ++ E+ K+EKKK+E+EK+
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 30.0 bits (67), Expect = 0.78
Identities = 12/33 (36%), Positives = 21/33 (63%)
Query: 32 EEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
EEE +++ EK K +KKK E++K+ + +L
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSL 117
Score = 27.7 bits (61), Expect = 4.2
Identities = 14/46 (30%), Positives = 25/46 (54%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ + + E EE ++ K +KKK++K+KEE+KKK+
Sbjct: 256 GTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHH 301
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 34.7 bits (80), Expect = 0.019
Identities = 22/81 (27%), Positives = 36/81 (44%)
Query: 3 NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
S +S + + K + E++ E +KK E+KKK+KK+K+E K + K
Sbjct: 33 VLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92
Query: 63 TLTFIRIGKPALKPRPTRCVE 83
T + K KP + E
Sbjct: 93 FKTPKKSKKTKKKPPKPKPNE 113
Score = 32.8 bits (75), Expect = 0.070
Identities = 18/79 (22%), Positives = 35/79 (44%)
Query: 3 NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
ST + K +++ +E E +++ E+KKK++K+K++ K E E K + +
Sbjct: 38 ILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPK 97
Query: 63 TLTFIRIGKPALKPRPTRC 81
+ P KP
Sbjct: 98 KSKKTKKKPPKPKPNEDVD 116
Score = 28.5 bits (64), Expect = 1.9
Identities = 15/54 (27%), Positives = 26/54 (48%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
K +++KKKK+E+ E GE + + KK +K K+K K + +
Sbjct: 68 KKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNK 121
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 34.6 bits (80), Expect = 0.020
Identities = 16/37 (43%), Positives = 28/37 (75%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
EEEDEE ++E++EEE+++E++KK+KK E + +
Sbjct: 354 EEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
Score = 33.8 bits (78), Expect = 0.042
Identities = 13/38 (34%), Positives = 27/38 (71%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
+EE++E+ +E++EEEE++++EKKK+K + +
Sbjct: 354 EEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
Score = 31.1 bits (71), Expect = 0.30
Identities = 15/38 (39%), Positives = 28/38 (73%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+EE++E ++E++EE+++EEK+K+KKK E + E
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
Score = 30.0 bits (68), Expect = 0.79
Identities = 13/47 (27%), Positives = 29/47 (61%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
E+ D +EE+E+ + + +E+++E++K++++KK R L F
Sbjct: 347 EDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELPF 393
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 33.4 bits (77), Expect = 0.021
Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 20/65 (30%)
Query: 13 KRRKKKKEEEDEEG------------GGGEEEEEEEEKKKEEKKKEKKKKE--------E 52
+ K+ + E DE+G EE E + K+KE+KKK+KK+ E E
Sbjct: 39 EEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQIRE 98
Query: 53 EKKKE 57
+KK+E
Sbjct: 99 KKKEE 103
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 33.9 bits (78), Expect = 0.022
Identities = 13/55 (23%), Positives = 24/55 (43%)
Query: 5 STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+ + EE EE E++E+K EK+ E+ ++E E++ E
Sbjct: 48 EQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEES 102
Score = 30.1 bits (68), Expect = 0.45
Identities = 20/64 (31%), Positives = 32/64 (50%), Gaps = 6/64 (9%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALK 75
+ KE++ + EE EEE E++ EE E +K+ EEK + ++ I P+ K
Sbjct: 75 SEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEK------EITNPSWK 128
Query: 76 PRPT 79
P T
Sbjct: 129 PVGT 132
Score = 30.1 bits (68), Expect = 0.51
Identities = 15/47 (31%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 16 KKKKEEEDEEGGGGEEEEEE--EEKKKEEKKKEKKKKEEEKKKEGEK 60
++ +EE++E E++E++ EK+ EE ++E ++++EE E EK
Sbjct: 62 IEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEK 108
Score = 29.7 bits (67), Expect = 0.61
Identities = 14/56 (25%), Positives = 27/56 (48%)
Query: 5 STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ + K+ ++ E EE E+E E K+++ EK+ +E E++ E E
Sbjct: 44 AAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99
Score = 29.3 bits (66), Expect = 0.75
Identities = 11/55 (20%), Positives = 30/55 (54%)
Query: 3 NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+ + S + + +E ++EE E++E++ E++ +E +++ EE+ +E
Sbjct: 47 DEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101
Score = 28.5 bits (64), Expect = 1.7
Identities = 11/59 (18%), Positives = 30/59 (50%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ + K ++ + EE +E E++E K + +K++++ +EE ++++ E
Sbjct: 44 AAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEES 102
Score = 28.5 bits (64), Expect = 1.7
Identities = 13/45 (28%), Positives = 24/45 (53%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ +E E ++E E E+ KEE+K+ +++E K + EK
Sbjct: 42 DQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEK 86
Score = 28.2 bits (63), Expect = 2.3
Identities = 16/59 (27%), Positives = 33/59 (55%), Gaps = 4/59 (6%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
Q ++ Q +K ++ E E+ + EEE+E + +E K + +K++EE ++E E+
Sbjct: 43 QAAADEQEAKKSDDQETAEIEEVK----EEEKEAANSEDKEDKGDAEKEDEESEEENEE 97
Score = 27.4 bits (61), Expect = 3.3
Identities = 16/54 (29%), Positives = 25/54 (46%)
Query: 3 NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
+S + K K+ +E E+E EE +E EK+ EEK + +KE
Sbjct: 73 ANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 33.9 bits (78), Expect = 0.023
Identities = 22/52 (42%), Positives = 32/52 (61%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
S+ + E+E E E EEEEKK+++KKKE KK+++EKK + EK
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177
Score = 30.8 bits (70), Expect = 0.22
Identities = 17/61 (27%), Positives = 39/61 (63%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
+ S +K ++ + +EEE +E +E ++E+++KK++K+K + K +KKK+ +K+
Sbjct: 133 ETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Query: 62 R 62
+
Sbjct: 193 K 193
Score = 30.1 bits (68), Expect = 0.39
Identities = 23/53 (43%), Positives = 35/53 (66%), Gaps = 4/53 (7%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
K K E+E E EEEE++E+KKK+E KKEKK+K+++K+K E + +
Sbjct: 137 KETTAKVEKEAEV----EEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185
Score = 29.7 bits (67), Expect = 0.53
Identities = 17/61 (27%), Positives = 38/61 (62%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
S T++ + +K+ E E+EE ++++E +++KKE+K K++K E + K+ +K+
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189
Query: 62 R 62
+
Sbjct: 190 K 190
Score = 29.3 bits (66), Expect = 0.77
Identities = 17/48 (35%), Positives = 29/48 (60%), Gaps = 3/48 (6%)
Query: 18 KKEEEDEEGGGGEE---EEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ + E E E+E E ++EEKK++KKKKE +K+K+ +K +
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDK 173
Score = 28.1 bits (63), Expect = 1.8
Identities = 13/40 (32%), Positives = 20/40 (50%)
Query: 21 EEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
D E E EK+ K +++ + EEE+KKE +K
Sbjct: 120 APDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKK 159
Score = 27.0 bits (60), Expect = 5.3
Identities = 11/41 (26%), Positives = 23/41 (56%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ G E E E+E + +K+ + ++EE+K+K+ +K
Sbjct: 121 PDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKK 161
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 32.3 bits (74), Expect = 0.033
Identities = 9/33 (27%), Positives = 20/33 (60%)
Query: 26 GGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
EEE+++EE+++E++++ EE+ G
Sbjct: 67 AAAAAAAAAEEEEEEEEEEEEEEEESEEEAMAG 99
Score = 32.3 bits (74), Expect = 0.042
Identities = 7/28 (25%), Positives = 17/28 (60%)
Query: 26 GGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
EE+++EE+++E++++E E
Sbjct: 66 AAAAAAAAAAEEEEEEEEEEEEEEEESE 93
Score = 28.9 bits (65), Expect = 0.55
Identities = 6/33 (18%), Positives = 19/33 (57%)
Query: 27 GGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
E+++EE+++E++++EE +++
Sbjct: 66 AAAAAAAAAAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 26.2 bits (58), Expect = 6.2
Identities = 13/25 (52%), Positives = 17/25 (68%), Gaps = 4/25 (16%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEE 42
EE+EE EEEEEEEE+++ E
Sbjct: 73 AAAEEEEE----EEEEEEEEEEESE 93
Score = 25.8 bits (57), Expect = 8.5
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEE 42
EE EEEEEEEE++ EE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEE 94
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 32.8 bits (75), Expect = 0.043
Identities = 14/43 (32%), Positives = 22/43 (51%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
R ++E+EE +E E+E E K + K+KK+ E K
Sbjct: 89 RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK 131
Score = 32.4 bits (74), Expect = 0.056
Identities = 11/43 (25%), Positives = 23/43 (53%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
R+ +++EE E +E E++ E K E K+K++ + +
Sbjct: 89 RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK 131
Score = 32.0 bits (73), Expect = 0.075
Identities = 18/66 (27%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 12 YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGK 71
++ +EE+E +E E+E+ + + + ++KKK E K + EK + T + K
Sbjct: 88 FRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPK 147
Query: 72 PALKPR 77
P+ KP+
Sbjct: 148 PS-KPK 152
Score = 28.6 bits (64), Expect = 1.2
Identities = 11/53 (20%), Positives = 28/53 (52%)
Query: 28 GGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRPTR 80
++EEEE E++ ++ +++ E + + + +K+R + + K K P +
Sbjct: 93 SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 33.5 bits (76), Expect = 0.044
Identities = 19/59 (32%), Positives = 34/59 (57%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
L+ + ++ +R +K+EEEDE+ EE EE E ++E E +K++EK+ + E
Sbjct: 90 LEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAE 148
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 32.7 bits (75), Expect = 0.045
Identities = 17/38 (44%), Positives = 26/38 (68%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
K K++KEE+++E G G ++EE EK+ EE +K K K
Sbjct: 25 KEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKP 62
Score = 30.4 bits (69), Expect = 0.25
Identities = 18/46 (39%), Positives = 31/46 (67%), Gaps = 4/46 (8%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
+K++E+E E+ EE+E +EEK+KE K +K+E EK+ E ++
Sbjct: 16 EKREEKEREK----EEKERKEEKEKEWGKGLVQKEEREKRLEELEK 57
Score = 30.4 bits (69), Expect = 0.30
Identities = 16/52 (30%), Positives = 32/52 (61%), Gaps = 4/52 (7%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ + + K++++EE E +EE+E+E K +K+E++K+ EE +K K
Sbjct: 14 IEEKREEKEREKEEKER----KEEKEKEWGKGLVQKEEREKRLEELEKAKNK 61
Score = 27.7 bits (62), Expect = 2.2
Identities = 13/40 (32%), Positives = 30/40 (75%)
Query: 23 DEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
D+ G + EE+ EEK++E+++KE+K+++E++ +G ++
Sbjct: 6 DKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQK 45
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 32.3 bits (74), Expect = 0.048
Identities = 24/68 (35%), Positives = 35/68 (51%), Gaps = 23/68 (33%)
Query: 13 KRRKKKKEEEDEEG-------------GGGEEEEEEEEKKKEEKKKEKKKKE-------- 51
+ RK+ + E DE+G G E + EE+KKE++KK+KKKKE
Sbjct: 39 RERKRLRNEPDEDGFVTVVRGGRKGPAAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQ 98
Query: 52 --EEKKKE 57
E+KK+E
Sbjct: 99 LREKKKEE 106
Score = 25.7 bits (57), Expect = 9.6
Identities = 20/67 (29%), Positives = 35/67 (52%), Gaps = 10/67 (14%)
Query: 6 TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKK-------KEKKKKEEE---KK 55
T V ++ EE + ++E+E+++KKK+E + +EKKK+E+ KK
Sbjct: 54 VTVVRGGRKGPAAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQADLLKK 113
Query: 56 KEGEKRR 62
E +K R
Sbjct: 114 FEEDKER 120
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 32.5 bits (75), Expect = 0.051
Identities = 18/53 (33%), Positives = 30/53 (56%), Gaps = 6/53 (11%)
Query: 11 KYKRRKKKKEEE--DEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
+YK K+ E D++ E + E+ +KK EK K K +K++ KK+E + R
Sbjct: 50 EYKDLKESLEAALLDKK----ELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 27.5 bits (62), Expect = 2.5
Identities = 10/28 (35%), Positives = 16/28 (57%), Gaps = 2/28 (7%)
Query: 35 EEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
++E K EK +KK E+ K K +K+
Sbjct: 65 KKELKAWEK--AEKKAEKAKAKAEKKKA 90
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 33.3 bits (77), Expect = 0.054
Identities = 12/54 (22%), Positives = 26/54 (48%), Gaps = 9/54 (16%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
+YK K++ + +++E K K ++KK+K+E K + + + L
Sbjct: 48 QYKEMKEELKAA---------LLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRL 92
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 33.2 bits (76), Expect = 0.055
Identities = 15/45 (33%), Positives = 25/45 (55%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
K+ E EG GE+++EE +E+ K+K KK+ ++K E
Sbjct: 151 DSDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPE 195
Score = 28.2 bits (63), Expect = 2.6
Identities = 13/50 (26%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
KE + G + E+++EE E+ KKK +++++++ E TL
Sbjct: 152 SDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPES--TLE 199
Score = 27.0 bits (60), Expect = 6.0
Identities = 12/38 (31%), Positives = 23/38 (60%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+ ED++ G + EE+ +KK ++K++ K + EK E
Sbjct: 166 DGEDDDEEGSDGEEDGAKKKAKKKRQRKPESTLEKNFE 203
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 32.3 bits (74), Expect = 0.057
Identities = 17/49 (34%), Positives = 31/49 (63%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
RKK KE+ + ++EE E+EK+KEE+ + + E+++ E EK++
Sbjct: 65 RKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKA 113
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation
factor 3 (eIF3). In yeast it is called Hcr1. The
Saccharomyces cerevisiae protein eIF3j (HCR1) has been
shown to be required for processing of 20S pre-rRNA and
binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 32.7 bits (75), Expect = 0.062
Identities = 14/43 (32%), Positives = 25/43 (58%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
EEEDEE + + + K K+ K + ++KE+ K+++ EK
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGL 81
Score = 30.4 bits (69), Expect = 0.41
Identities = 11/43 (25%), Positives = 26/43 (60%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+EE++E E+ + + K ++ K K +++E+ K+E E++
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKG 80
Score = 30.0 bits (68), Expect = 0.50
Identities = 12/48 (25%), Positives = 26/48 (54%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFI 67
EE++E+ + + + KK K K ++K++ +++KE + R L
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEED 87
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 33.2 bits (76), Expect = 0.065
Identities = 14/47 (29%), Positives = 19/47 (40%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
K EE EE G E+++ + KK KKK E + L
Sbjct: 842 KMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVL 888
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 33.0 bits (76), Expect = 0.076
Identities = 13/28 (46%), Positives = 16/28 (57%)
Query: 36 EEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
EKK++EK+K K KK K KRR
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNIGKRRK 427
Score = 32.2 bits (74), Expect = 0.12
Identities = 10/28 (35%), Positives = 20/28 (71%)
Query: 35 EEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
E+K+KE++K + KK+ + K G++R+
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNIGKRRK 427
Score = 30.7 bits (70), Expect = 0.35
Identities = 15/35 (42%), Positives = 20/35 (57%), Gaps = 2/35 (5%)
Query: 38 KKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKP 72
KK K+ EKK+KE+EK K ++ R IGK
Sbjct: 393 KKVLAKRAEKKEKEKEKPKVKKRHRDTK--NIGKR 425
Score = 29.5 bits (67), Expect = 0.92
Identities = 11/44 (25%), Positives = 22/44 (50%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
+K K + E++E+E+EK K +K+ K +++K
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRK 427
Score = 28.4 bits (64), Expect = 2.2
Identities = 10/42 (23%), Positives = 23/42 (54%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
KK + + E++E+E++K + KK+ + K K+++
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRK 427
Score = 28.0 bits (63), Expect = 3.2
Identities = 15/45 (33%), Positives = 21/45 (46%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPR 77
E+K + KK K+ E+K+KE EK + R K K R
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRR 426
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion
and trafficking].
Length = 150
Score = 32.0 bits (73), Expect = 0.088
Identities = 15/48 (31%), Positives = 25/48 (52%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
KK E + + E E ++E EE+KK +K +EK ++ E + L
Sbjct: 33 KKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQL 80
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 32.0 bits (73), Expect = 0.099
Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 5/46 (10%)
Query: 12 YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
Y RK K++E+ EE +E KK +K E KKK+
Sbjct: 156 YAERKDAKQKEEFA-----AEERKEALAAAAKKSATPQKVETKKKK 196
Score = 27.3 bits (61), Expect = 3.8
Identities = 10/31 (32%), Positives = 16/31 (51%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
++EE E++KE KK +K E +K
Sbjct: 164 QKEEFAAEERKEALAAAAKKSATPQKVETKK 194
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 32.5 bits (74), Expect = 0.10
Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 1/60 (1%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKP 72
K K+KKEE+DEE + EE +E+ +EE+K + E+ E K+ I G+P
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKKFIKEVIA-GRP 335
Score = 29.0 bits (65), Expect = 1.6
Identities = 17/54 (31%), Positives = 29/54 (53%), Gaps = 3/54 (5%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE---KKKEKKKKEEEKKKEGE 59
+ + K +K++K+EE E EE +EE E++++ + EK E KK E
Sbjct: 276 LKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKKFIKE 329
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 31.6 bits (72), Expect = 0.10
Identities = 11/75 (14%), Positives = 27/75 (36%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPA 73
R+++ +EE E GE + + + +K+ E++ +++ R
Sbjct: 66 LRERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKD 125
Query: 74 LKPRPTRCVEGEGAP 88
+ V+ A
Sbjct: 126 PRGGTQDVVDKSQAS 140
Score = 26.2 bits (58), Expect = 7.4
Identities = 9/54 (16%), Positives = 24/54 (44%), Gaps = 4/54 (7%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEE----EEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+R K++ E +EEG + E ++ +++ ++ K ++R+
Sbjct: 68 ERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERK 121
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 32.8 bits (75), Expect = 0.10
Identities = 19/58 (32%), Positives = 33/58 (56%), Gaps = 4/58 (6%)
Query: 9 VSKYKRRKKKKEEE----DEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
VSKYK RK++ E + +E E+ EE EK+ E+ +++ +K E ++ + E R
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRE 224
Score = 28.9 bits (65), Expect = 1.6
Identities = 21/55 (38%), Positives = 34/55 (61%), Gaps = 3/55 (5%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE---KKKKEEEKKKEGEK 60
++K K +K+ EE +EE EEE EE +++ EE +KE K + EE ++E E+
Sbjct: 231 LAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEE 285
Score = 28.1 bits (63), Expect = 2.8
Identities = 11/50 (22%), Positives = 22/50 (44%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ + K++ E + E E EE ++ + E+ K+E E+K
Sbjct: 324 RLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEE 373
Score = 27.0 bits (60), Expect = 7.1
Identities = 14/54 (25%), Positives = 25/54 (46%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
K + K++ EE + E+ E E+ KEE +++ EE ++ E R
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALRE 383
Score = 27.0 bits (60), Expect = 7.5
Identities = 12/61 (19%), Positives = 26/61 (42%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
L+ +R ++ KE+ + EE E E+ ++ + ++ KEE ++K
Sbjct: 311 LEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSAL 370
Query: 61 R 61
Sbjct: 371 L 371
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 31.7 bits (73), Expect = 0.11
Identities = 13/38 (34%), Positives = 25/38 (65%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKK 48
Y +RKKKK+++ +EG EE++++ +E K+ +K
Sbjct: 79 AYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 29.4 bits (67), Expect = 0.67
Identities = 10/35 (28%), Positives = 23/35 (65%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
K+KK+++ ++ G EE++++ E++KE +K
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 32.6 bits (74), Expect = 0.11
Identities = 17/32 (53%), Positives = 23/32 (71%), Gaps = 1/32 (3%)
Query: 31 EEEEEEEKKKEEKKKEKK-KKEEEKKKEGEKR 61
EEE E +KKKEEK KEK+ KK + +KE + +
Sbjct: 15 EEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46
Score = 32.6 bits (74), Expect = 0.12
Identities = 15/31 (48%), Positives = 22/31 (70%)
Query: 32 EEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
E E E+K E++ E+KKK+EEK KE E ++
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKK 35
Score = 30.3 bits (68), Expect = 0.68
Identities = 13/27 (48%), Positives = 20/27 (74%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKK 56
E++ EE+ + +KKKE+K KE+E KK
Sbjct: 9 EKKILTEEELERKKKKEEKAKEKELKK 35
Score = 29.1 bits (65), Expect = 1.7
Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
EEE E KKK+E+K K+KE +K K +K
Sbjct: 13 LTEEELERKKKKEEK--AKEKELKKLKAAQKEA 43
Score = 28.3 bits (63), Expect = 2.5
Identities = 14/35 (40%), Positives = 24/35 (68%)
Query: 26 GGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
G E E E++ +EE +++KKK+E+ K+KE +K
Sbjct: 1 GSRTESEAEKKILTEEELERKKKKEEKAKEKELKK 35
Score = 28.3 bits (63), Expect = 2.9
Identities = 14/35 (40%), Positives = 22/35 (62%)
Query: 27 GGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
G E E E++ EE+ + KKKKEE+ K++ K+
Sbjct: 1 GSRTESEAEKKILTEEELERKKKKEEKAKEKELKK 35
Score = 27.9 bits (62), Expect = 3.3
Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 9/44 (20%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
RKKKKE E+ +E+E KK + +KE K K + ++
Sbjct: 20 RKKKKE---------EKAKEKELKKLKAAQKEAKAKLQAQQASD 54
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 32.0 bits (73), Expect = 0.12
Identities = 17/41 (41%), Positives = 24/41 (58%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ EDE+ E E+ KKKEE + +KKK+ EE K +K
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113
Score = 29.7 bits (67), Expect = 0.63
Identities = 12/41 (29%), Positives = 24/41 (58%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
K E+E E E E+ ++K++ E KK+K+ +E + ++
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113
Score = 27.4 bits (61), Expect = 4.3
Identities = 10/39 (25%), Positives = 22/39 (56%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
+ E+E+E + ++ KKKEE + K+ ++ L ++
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQ 111
Score = 27.0 bits (60), Expect = 5.0
Identities = 11/41 (26%), Positives = 24/41 (58%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
K + E+E E+ +++E+ + +KKKE ++ + +KK
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 32.2 bits (73), Expect = 0.13
Identities = 19/62 (30%), Positives = 30/62 (48%), Gaps = 2/62 (3%)
Query: 3 NSSTTQVSKYKRRKKKKEEEDEEG--GGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+SS + +R K E + E+ EE EE+K EE+ KK ++ KK +G+K
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKK 348
Query: 61 RR 62
Sbjct: 349 NG 350
Score = 31.5 bits (71), Expect = 0.20
Identities = 12/49 (24%), Positives = 25/49 (51%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
R+ K E E++E+ EE ++E+ ++E ++ KK + K +
Sbjct: 300 REDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKK 348
Score = 30.7 bits (69), Expect = 0.36
Identities = 18/52 (34%), Positives = 30/52 (57%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ K K K E D+E G E ++ E+ +E+ KK+KKK + KKK + ++
Sbjct: 209 RGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKK 260
Score = 30.3 bits (68), Expect = 0.55
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
++ EEE E GG ++ ++ KK + KK K + +
Sbjct: 321 EESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSD 357
Score = 29.9 bits (67), Expect = 0.69
Identities = 16/54 (29%), Positives = 33/54 (61%), Gaps = 3/54 (5%)
Query: 10 SKYKRRKKKKEEEDEEG---GGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
K + K + +++++G G E+ +EE+ KK++KK K KK+ + K+G++
Sbjct: 210 GKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263
Score = 29.5 bits (66), Expect = 1.1
Identities = 12/44 (27%), Positives = 21/44 (47%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
++ ++ E+ E EEE +K K+ KK + KK +K
Sbjct: 315 EQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDS 358
Score = 28.8 bits (64), Expect = 2.0
Identities = 15/44 (34%), Positives = 24/44 (54%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
K + E+DE+ EEE+ EEE +K K+ KK + +K +
Sbjct: 311 KPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354
Score = 27.6 bits (61), Expect = 3.9
Identities = 14/48 (29%), Positives = 29/48 (60%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ K E +DE+ G ++ E+ +++ KKK+KK + +KK + +K+
Sbjct: 214 KIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKG 261
Score = 26.8 bits (59), Expect = 6.6
Identities = 12/44 (27%), Positives = 20/44 (45%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+ + EE EE EE ++ KK +K K KK ++ +
Sbjct: 316 QDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSG 359
Score = 26.8 bits (59), Expect = 7.6
Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 4/47 (8%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
++ + +E E+E+ EEE KK ++ KK K KK K + +
Sbjct: 315 EQDEDSEESEEEK----NEEEGGLSKKGKKLKKLKGKKNGLDKDDSD 357
Score = 26.5 bits (58), Expect = 8.8
Identities = 11/45 (24%), Positives = 24/45 (53%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
EE E+ E + E ++ E+ ++ +++K EE+ +K + L
Sbjct: 297 PEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKL 341
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 32.2 bits (74), Expect = 0.13
Identities = 12/29 (41%), Positives = 24/29 (82%)
Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ EEKK +++KK+KK+K++ KK++ + R+
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 26.9 bits (60), Expect = 6.6
Identities = 14/43 (32%), Positives = 27/43 (62%), Gaps = 9/43 (20%)
Query: 29 GEEEEEEEEKKKEE---------KKKEKKKKEEEKKKEGEKRR 62
G+E +EE K+ EE KKK ++KK +++KK+ ++++
Sbjct: 361 GDELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKK 403
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 31.3 bits (71), Expect = 0.13
Identities = 14/32 (43%), Positives = 24/32 (75%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
E E+E++E KE ++KE K ++EKK++ EK+
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 30.1 bits (68), Expect = 0.41
Identities = 12/30 (40%), Positives = 21/30 (70%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
EE E+E+K+ K+ E+K+ + KK++ EK
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEK 121
Score = 29.7 bits (67), Expect = 0.49
Identities = 10/31 (32%), Positives = 22/31 (70%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
EE E+E+K+ ++ +EK+ + +K+K+ +K
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKK 122
Score = 28.6 bits (64), Expect = 1.4
Identities = 25/67 (37%), Positives = 37/67 (55%), Gaps = 2/67 (2%)
Query: 21 EEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK--KKEGEKRRTLTFIRIGKPALKPRP 78
E+ E E +E EEK+ + KKEKK+K+E+K +K +K+ T T K A K
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151
Query: 79 TRCVEGE 85
T+ EG+
Sbjct: 152 TKKEEGK 158
Score = 27.4 bits (61), Expect = 2.9
Identities = 18/60 (30%), Positives = 31/60 (51%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRP 78
++E+ E EE+E + KK++++KKEKK E+ KK+ K T + K +
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKE 155
>gnl|CDD|218010 pfam04293, SpoVR, SpoVR like protein. Bacillus subtilis stage V
sporulation protein R is involved in spore cortex
formation. Little is known about cortex biosynthesis,
except that it depends on several sigma E controlled
genes, including spoVR.
Length = 427
Score = 32.2 bits (74), Expect = 0.14
Identities = 14/55 (25%), Positives = 27/55 (49%), Gaps = 2/55 (3%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
V +YKR K EE++ EE EE + + + + + E+ ++E +R+
Sbjct: 165 HVDRYKRPIKLSLEEEKA--RQEEREEYLQSQVNDLWRTIPPQREKLEEERARRK 217
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 31.5 bits (72), Expect = 0.16
Identities = 14/28 (50%), Positives = 21/28 (75%), Gaps = 3/28 (10%)
Query: 32 EEEEEEKK---KEEKKKEKKKKEEEKKK 56
EEE+E+KK KEEKK K++KE+ ++
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 31.5 bits (72), Expect = 0.19
Identities = 16/28 (57%), Positives = 20/28 (71%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
EEE+EKKK K+EKK +EEK+K E
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 26.5 bits (59), Expect = 7.7
Identities = 12/28 (42%), Positives = 21/28 (75%), Gaps = 3/28 (10%)
Query: 37 EKKKEEKK---KEKKKKEEEKKKEGEKR 61
E++KE+KK KE+KK +E+K++ E+
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 31.1 bits (71), Expect = 0.16
Identities = 16/37 (43%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 31 EEEEEEEKKKEEKKK-EKKKKEEEKKKEGEKRRTLTF 66
E+EE E +K+ EK+ E+ KKEEE ++ ++R L F
Sbjct: 82 EKEERELRKRAEKEALEQAKKEEELREAKRQQRKLNF 118
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 31.7 bits (72), Expect = 0.17
Identities = 17/47 (36%), Positives = 24/47 (51%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
K K E E E+ E +++ EE+ K + E KKK E KK+ E
Sbjct: 130 AEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAE 176
Score = 31.3 bits (71), Expect = 0.20
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 6/52 (11%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K K+K+ EE + ++ E K K E + EKK KEE KK+ E+ +
Sbjct: 112 KQAEEKQKQAEE------AKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAK 157
Score = 31.0 bits (70), Expect = 0.29
Identities = 16/52 (30%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K K+ + K + + E E++ +EE KK+ E++ + K E KKK E ++
Sbjct: 125 KAKQAAEAKAKAEAEA---EKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKK 173
Score = 30.6 bits (69), Expect = 0.35
Identities = 11/45 (24%), Positives = 24/45 (53%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
K+ E+ + ++++ EE K K+ + + K + E +KK E+
Sbjct: 103 AAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEE 147
Score = 30.2 bits (68), Expect = 0.57
Identities = 14/47 (29%), Positives = 26/47 (55%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
KK+ EEE + E +++ E KK+ + + K K E + K + E+ +
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAK 195
Score = 29.8 bits (67), Expect = 0.77
Identities = 12/47 (25%), Positives = 26/47 (55%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K+ E+ ++ +++ EE + K+ + K K + E EKK + E ++
Sbjct: 104 AKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150
Score = 27.9 bits (62), Expect = 3.3
Identities = 11/56 (19%), Positives = 30/56 (53%), Gaps = 4/56 (7%)
Query: 7 TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
Q + + R+K+ E+ E+ ++ E+ ++ ++++K+ EE K K+ + +
Sbjct: 82 KQRAAEQARQKELEQRAAA----EKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAK 133
Score = 26.7 bits (59), Expect = 6.5
Identities = 15/51 (29%), Positives = 28/51 (54%), Gaps = 3/51 (5%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKK---KEKKKKEEEKKKEGEKRR 62
R+KK E++ EE E+ +K+ E++ K K+ E+ K+ EK++
Sbjct: 69 RQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQK 119
Score = 26.7 bits (59), Expect = 7.0
Identities = 13/41 (31%), Positives = 20/41 (48%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
KKK E ++ E + + E K K + ++ K K E K K
Sbjct: 164 AKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204
Score = 26.7 bits (59), Expect = 8.1
Identities = 11/53 (20%), Positives = 23/53 (43%), Gaps = 3/53 (5%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKK---EKKKKEEEKKKEGEKRR 62
+++ E+ ++ E+ K+ E+ K EK+K+ EE K +
Sbjct: 80 AEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEA 132
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 31.6 bits (72), Expect = 0.19
Identities = 20/61 (32%), Positives = 34/61 (55%), Gaps = 2/61 (3%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG--EKRRTLTFIRIGKPALKPR 77
+ ED+E +EEE E+E ++EE+ K+KK+ + + KE +K++ A PR
Sbjct: 60 DSEDDEPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPR 119
Query: 78 P 78
P
Sbjct: 120 P 120
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 31.6 bits (71), Expect = 0.21
Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 4/50 (8%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
K RR ++K + EEE+ EKK +++K KK E+ + + E
Sbjct: 112 KTPRRTRRKAA----AASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSES 157
Score = 27.3 bits (60), Expect = 5.3
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 2/55 (3%)
Query: 7 TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK--KKEGE 59
T+ KRRK KK +ED E G E E + E+ + E + +EE K +GE
Sbjct: 133 TEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEELDLEKDDGE 187
Score = 26.9 bits (59), Expect = 6.4
Identities = 18/71 (25%), Positives = 25/71 (35%), Gaps = 12/71 (16%)
Query: 4 SSTTQVSKYKRRKKKKEEEDEEGGGGEEE---EEEEEKKKEEKKKEKKKK---------E 51
S KR KK E E E E+ KE KK ++ +
Sbjct: 69 SKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDV 128
Query: 52 EEKKKEGEKRR 62
EE+K E + R+
Sbjct: 129 EEEKTEKKVRK 139
Score = 26.9 bits (59), Expect = 7.7
Identities = 13/69 (18%), Positives = 20/69 (28%), Gaps = 3/69 (4%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT---LTFIRIGK 71
+KK E +E +KK K KK + E + + +
Sbjct: 48 SRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALD 107
Query: 72 PALKPRPTR 80
K P R
Sbjct: 108 KESKKTPRR 116
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 30.1 bits (68), Expect = 0.22
Identities = 8/25 (32%), Positives = 17/25 (68%)
Query: 29 GEEEEEEEEKKKEEKKKEKKKKEEE 53
G E E ++ +EE+K+E+ ++E +
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEESD 100
Score = 30.1 bits (68), Expect = 0.23
Identities = 8/26 (30%), Positives = 18/26 (69%)
Query: 28 GGEEEEEEEEKKKEEKKKEKKKKEEE 53
G E E +E ++EEK++E +++ ++
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEESDD 101
Score = 28.5 bits (64), Expect = 0.90
Identities = 9/26 (34%), Positives = 12/26 (46%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEE 42
E EEEE+EE+ +EE
Sbjct: 73 AAAGAEAAAEADEAEEEEKEEEAEEE 98
Score = 28.1 bits (63), Expect = 1.0
Identities = 9/32 (28%), Positives = 15/32 (46%)
Query: 26 GGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
G E E + E+++KEEE ++E
Sbjct: 67 AGAAAAAAAGAEAAAEADEAEEEEKEEEAEEE 98
Score = 27.0 bits (60), Expect = 2.8
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 24 EEGGGGEEEEEEEEKKKEEKKKE 46
E +E EEE+K+EE ++E
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEE 98
Score = 26.6 bits (59), Expect = 3.7
Identities = 10/27 (37%), Positives = 13/27 (48%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEE 42
E E EEEE+EEE ++E
Sbjct: 73 AAAGAEAAAEADEAEEEEKEEEAEEES 99
Score = 26.6 bits (59), Expect = 4.3
Identities = 6/34 (17%), Positives = 17/34 (50%)
Query: 26 GGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
G E + ++ +E++K+EE +++ +
Sbjct: 68 GAAAAAAAGAEAAAEADEAEEEEKEEEAEEESDD 101
Score = 26.2 bits (58), Expect = 6.3
Identities = 8/23 (34%), Positives = 14/23 (60%)
Query: 24 EEGGGGEEEEEEEEKKKEEKKKE 46
E +E EEEEK++E +++
Sbjct: 77 AEAAAEADEAEEEEKEEEAEEES 99
Score = 25.8 bits (57), Expect = 7.5
Identities = 8/27 (29%), Positives = 12/27 (44%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEE 42
+ E EEEE+E++ EE
Sbjct: 71 AAAAAGAEAAAEADEAEEEEKEEEAEE 97
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 31.5 bits (72), Expect = 0.22
Identities = 12/50 (24%), Positives = 28/50 (56%), Gaps = 3/50 (6%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEE---KKKEKKKKEEEKKKEGEKRRTLTF 66
+EEDEE E++E++++ ++ ++ ++ +++ EKR L F
Sbjct: 134 DEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEF 183
Score = 30.3 bits (69), Expect = 0.47
Identities = 9/42 (21%), Positives = 27/42 (64%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
K E+DE+ +++++ +++ +++ ++++ EEK+ E E
Sbjct: 142 KSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEF 183
Score = 27.6 bits (62), Expect = 3.6
Identities = 9/48 (18%), Positives = 24/48 (50%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
R +++++D+E E+EE + + E+ + + ++ +RR
Sbjct: 121 RDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERR 168
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 31.4 bits (71), Expect = 0.23
Identities = 9/33 (27%), Positives = 18/33 (54%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK 47
R +E D + E+E+E E + +E++K+
Sbjct: 538 RNSTEERIDLDADDWTEDEDENEMETDEERKKP 570
Score = 28.7 bits (64), Expect = 2.0
Identities = 14/58 (24%), Positives = 22/58 (37%), Gaps = 13/58 (22%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKK-------------EEKKKEKKKKEEEK 54
S +R ++ E+ E E +EE KK ++ K K +EEE
Sbjct: 540 STEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEA 597
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 31.6 bits (72), Expect = 0.23
Identities = 18/51 (35%), Positives = 25/51 (49%), Gaps = 3/51 (5%)
Query: 12 YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
Y K E+ E G + EE E+ EE + KKK+ +KK+ EK R
Sbjct: 395 YYESAKTALEKAE---GKKAIEEIREELIEEGLLKSKKKKRKKKEWFEKFR 442
Score = 28.9 bits (65), Expect = 1.5
Identities = 14/47 (29%), Positives = 25/47 (53%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
+ Y+ K E+ + + E EE E+ + KK+K+KK+E +K
Sbjct: 394 AYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
Score = 26.6 bits (59), Expect = 9.1
Identities = 19/82 (23%), Positives = 29/82 (35%), Gaps = 7/82 (8%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK-----EEEKKKEGEKRRTL 64
KYK+ K K D + +E E K +K + KK EE +EG +
Sbjct: 370 KKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKK 429
Query: 65 TFIRIGKPALKPRPTRCVEGEG 86
+ K + V +G
Sbjct: 430 KKRK--KKEWFEKFRWFVSSDG 449
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 31.2 bits (71), Expect = 0.24
Identities = 23/55 (41%), Positives = 37/55 (67%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
+S KR+ K K E D E E +E+++EKKKEEKKK++ ++ EE+ ++ E + T
Sbjct: 306 ISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQAT 360
Score = 28.5 bits (64), Expect = 2.3
Identities = 15/53 (28%), Positives = 28/53 (52%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
KR KK + + + + E+ E+ E K+K++EKKKE +K++ +
Sbjct: 294 KRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIE 346
Score = 26.9 bits (60), Expect = 6.7
Identities = 13/45 (28%), Positives = 26/45 (57%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ + + E + E+ + ++KK++KKKEE+KKK+ E+
Sbjct: 303 FEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIER 347
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 31.0 bits (70), Expect = 0.24
Identities = 12/46 (26%), Positives = 31/46 (67%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+KK +++ G+++E+ + ++ K++++KK+EEK++ E +R
Sbjct: 152 RKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKR 197
Score = 29.5 bits (66), Expect = 0.90
Identities = 15/54 (27%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE--KKKKEEEKKKEGEKR 61
K R +K+ +DE +++E+ + +K KE +KK+EE+++ + +KR
Sbjct: 144 EKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKR 197
Score = 29.1 bits (65), Expect = 1.2
Identities = 15/60 (25%), Positives = 27/60 (45%), Gaps = 7/60 (11%)
Query: 10 SKYKRRKKKKEEE---DEEGGGGEEEEEEE----EKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ R+KK+EE+ DE+ G K+E KEK++K + ++ E +
Sbjct: 177 KHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPPKEKRQKHHDPERRLEPQS 236
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 31.5 bits (71), Expect = 0.25
Identities = 22/107 (20%), Positives = 38/107 (35%), Gaps = 12/107 (11%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK---------KKEGEKR 61
K R K+K +E +G +E + E + E ++ K+ EEK K+ +
Sbjct: 792 KAAARAKQKPKE--KGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPY 849
Query: 62 RTLTFIRIGKPALKPRPTRCVEGEGAPTRTLSVQIPLHRVHEQGVPP 108
+ RI P + + P R + + R E P
Sbjct: 850 KVDNMTRI-LPQQSRYISFIKDDRFVPVRKFKGGVVVLRDREPKEPV 895
Score = 30.7 bits (69), Expect = 0.46
Identities = 11/52 (21%), Positives = 25/52 (48%), Gaps = 5/52 (9%)
Query: 6 TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+T + R K+K +E+ G ++E + E E + E+ ++ ++K
Sbjct: 788 STTIKAAARAKQKPKEK-----GPNDKEIKIESPSVETEGERCTIKQREEKG 834
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 31.4 bits (72), Expect = 0.26
Identities = 18/50 (36%), Positives = 25/50 (50%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+YK+ K E GGG E E E +K+E +K E EK +E E+
Sbjct: 828 EYKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEE 877
Score = 29.1 bits (66), Expect = 1.5
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 7/56 (12%)
Query: 9 VSKYKRRKKKKEE----EDEEGGGGEEEEEEEEKKKEEKKKEKKK---KEEEKKKE 57
V +YK KK K + E GGG E E E EK++ EK K EEK++E
Sbjct: 823 VERYKEYKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEE 878
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 30.9 bits (70), Expect = 0.26
Identities = 16/46 (34%), Positives = 25/46 (54%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
K+ E EE G EE+ E EE +KEE ++ ++ E K +E +
Sbjct: 17 CCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDE 62
Score = 30.5 bits (69), Expect = 0.31
Identities = 16/45 (35%), Positives = 26/45 (57%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
K K E EE E E +EE+K KEE + ++ ++EE ++ E+
Sbjct: 6 KDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEE 50
Score = 29.7 bits (67), Expect = 0.58
Identities = 17/39 (43%), Positives = 21/39 (53%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
+ EE E+ E+ EE E K EE K E K +EE KK
Sbjct: 34 LEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKK 72
Score = 29.7 bits (67), Expect = 0.64
Identities = 17/40 (42%), Positives = 22/40 (55%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
EE ++E + EE E K +E K + K KEE KK E E
Sbjct: 37 EEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKLENE 76
Score = 29.0 bits (65), Expect = 1.2
Identities = 15/50 (30%), Positives = 25/50 (50%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
KE E++E G+EE+ E E+ ++E+ E ++ E K E K
Sbjct: 17 CCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKL 66
Score = 28.6 bits (64), Expect = 1.5
Identities = 13/50 (26%), Positives = 27/50 (54%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ + + K E +E+ E +EE++ K+E+ + E+ +KEE + E
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEES 51
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 30.4 bits (69), Expect = 0.28
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKK 49
+EEDEE EE+++E+E ++EE +K K
Sbjct: 117 DEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
Score = 29.6 bits (67), Expect = 0.46
Identities = 12/31 (38%), Positives = 22/31 (70%), Gaps = 3/31 (9%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
E+E+EE +EE+++E++ +EE+ KK K
Sbjct: 119 EDEEEED---DEEDDDEDESEEEESPVKKVK 146
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 30.9 bits (70), Expect = 0.29
Identities = 18/55 (32%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Query: 7 TQVSKYKRRK--KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+SK+KR+ +EEED + E++ +E + E+ +E +K EEE+++E E
Sbjct: 130 LALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEE 184
Score = 27.0 bits (60), Expect = 5.0
Identities = 13/38 (34%), Positives = 26/38 (68%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
KK KE E E+ +E++EEEE+++EE+ ++ +++
Sbjct: 158 KKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDD 195
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97
family. This model family represents the major capsid
protein component of the heads (capsids) of
bacteriophage HK97, phi-105, P27, and related phage.
This model represents one of several analogous families
lacking detectable sequence similarity. The gene
encoding this component is typically located in an
operon encoding the small and large terminase subunits,
the portal protein and the prohead or maturation
protease [Mobile and extrachromosomal element
functions, Prophage functions].
Length = 384
Score = 31.2 bits (71), Expect = 0.29
Identities = 10/70 (14%), Positives = 22/70 (31%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIG 70
+ + +E + + E EEEK + + KE+ K + + E+ +
Sbjct: 4 LREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPA 63
Query: 71 KPALKPRPTR 80
Sbjct: 64 ASGEGGGGEE 73
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 30.8 bits (70), Expect = 0.32
Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 8/49 (16%)
Query: 20 EEEDEEGGGGEEEEEE--------EEKKKEEKKKEKKKKEEEKKKEGEK 60
E +DE G E E E + K K ++ KEK++KE E++ + EK
Sbjct: 252 ESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEK 300
Score = 29.3 bits (66), Expect = 1.2
Identities = 9/44 (20%), Positives = 21/44 (47%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
EE D++G ++E E + E + K ++K + ++ +
Sbjct: 242 LEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNK 285
Score = 28.5 bits (64), Expect = 1.8
Identities = 10/43 (23%), Positives = 21/43 (48%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+E EG E E + + + K K ++ KE+ +K+ + +
Sbjct: 255 DESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAK 297
Score = 26.6 bits (59), Expect = 9.5
Identities = 11/43 (25%), Positives = 20/43 (46%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
EEE ++ E E E E + + ++K K + K++ K
Sbjct: 250 EEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKEL 292
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 30.6 bits (70), Expect = 0.32
Identities = 12/28 (42%), Positives = 20/28 (71%), Gaps = 3/28 (10%)
Query: 32 EEEEEEKK---KEEKKKEKKKKEEEKKK 56
E E+E+KK KEEKK K++K++ ++
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 30.6 bits (70), Expect = 0.35
Identities = 15/28 (53%), Positives = 18/28 (64%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
E E+EKKK K+EKK +EEK K E
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 30.9 bits (71), Expect = 0.33
Identities = 10/47 (21%), Positives = 23/47 (48%), Gaps = 7/47 (14%)
Query: 32 EEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF-------IRIGK 71
+ EEE +E+ + KE E+K++ ++ +T +R+ +
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPEITIDDFDKVELRVAE 557
Score = 29.4 bits (67), Expect = 1.1
Identities = 9/31 (29%), Positives = 17/31 (54%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ EEE K++ + K+ EE++KK +
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541
Score = 28.2 bits (64), Expect = 3.0
Identities = 7/30 (23%), Positives = 18/30 (60%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
EEE +++ E ++ +++E+K ++ E
Sbjct: 513 EEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 30.9 bits (71), Expect = 0.34
Identities = 10/33 (30%), Positives = 15/33 (45%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEE 52
E +E +EEEEE+E E + E+
Sbjct: 187 SELEELDDDEDEEEEEDENDDSLAADESELPEK 219
Score = 27.5 bits (62), Expect = 4.4
Identities = 6/38 (15%), Positives = 16/38 (42%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+++E+EE++++E E E ++
Sbjct: 182 PAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEK 219
Score = 27.1 bits (61), Expect = 5.4
Identities = 8/51 (15%), Positives = 23/51 (45%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
+E+ G EE +++E++++EE + + +E + + +
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALA 228
>gnl|CDD|220223 pfam09405, Btz, CASC3/Barentsz eIF4AIII binding. This domain is
found on CASC3 (cancer susceptibility candidate gene 3
protein) which is also known as Barentsz (Btz). CASC3
is a component of the EJC (exon junction complex) which
is a complex that is involved in post-transcriptional
regulation of mRNA in metazoa. The complex is formed by
the association of four proteins (eIF4AIII, Barentsz,
Mago, and Y14), mRNA, and ATP. This domain wraps around
eIF4AIII and stacks against the 5' nucleotide.
Length = 116
Score = 29.7 bits (67), Expect = 0.34
Identities = 12/40 (30%), Positives = 21/40 (52%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
K E E + G E E ++ KE K++E +K ++E +
Sbjct: 1 KVESERQSGRTPSAEPTEPKEDKERKRREHEKYDDEDDEN 40
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 30.9 bits (70), Expect = 0.34
Identities = 13/51 (25%), Positives = 28/51 (54%)
Query: 5 STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
+T S + +E+ D +E+++EE K KEEK+++ + ++ K+
Sbjct: 331 ATRFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381
Score = 29.0 bits (65), Expect = 1.5
Identities = 13/50 (26%), Positives = 30/50 (60%)
Query: 7 TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
T+ S ++ EE+++ ++E+++EE K +E+K+E + E+ K+
Sbjct: 332 TRFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKE 381
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 30.8 bits (70), Expect = 0.34
Identities = 17/54 (31%), Positives = 32/54 (59%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPA 73
E+E+E+ ++ +E+ +++ E+ +E+ +EEE + E EK L RI PA
Sbjct: 133 EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPA 186
Score = 28.5 bits (64), Expect = 2.1
Identities = 12/41 (29%), Positives = 23/41 (56%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+E + EEE++E KK +E E+ +E+E++ E+
Sbjct: 125 KEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEE 165
Score = 28.5 bits (64), Expect = 2.2
Identities = 8/42 (19%), Positives = 25/42 (59%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
E++EE ++ +E+ ++ ++ E++ EEE+ + +++
Sbjct: 132 SEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173
Score = 28.5 bits (64), Expect = 2.2
Identities = 12/36 (33%), Positives = 22/36 (61%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKK 48
K KK +ED + EE+EEE +++E + +++K
Sbjct: 138 KDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173
Score = 27.7 bits (62), Expect = 3.1
Identities = 12/37 (32%), Positives = 22/37 (59%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEE 52
+ K+ +++ EE+EEE ++EE + EK+K E
Sbjct: 140 EAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASE 176
Score = 27.7 bits (62), Expect = 3.5
Identities = 13/40 (32%), Positives = 22/40 (55%)
Query: 21 EEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
E D+E + E+EEE+ + +K KE +E ++ E E
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEA 161
Score = 27.3 bits (61), Expect = 4.5
Identities = 13/44 (29%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
KE E + E+EEE++E K+ K+ ++ EE ++E +
Sbjct: 125 KEIESSDS---EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEE 165
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 30.8 bits (70), Expect = 0.35
Identities = 18/44 (40%), Positives = 27/44 (61%), Gaps = 3/44 (6%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
EEE+EE EE EEE E+ ++ K+ EE K++E +RR+
Sbjct: 151 EEEEEEP---EEMEEELEEDAADRDARKRAAEEAKEQEELRRRS 191
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 30.4 bits (69), Expect = 0.36
Identities = 11/44 (25%), Positives = 27/44 (61%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
+ ++K++EE+ E + E+ + ++K KE +K+E+E+ +
Sbjct: 124 REEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMR 167
Score = 30.0 bits (68), Expect = 0.50
Identities = 20/61 (32%), Positives = 33/61 (54%), Gaps = 10/61 (16%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEE------KKKEEKKKEKKKKEEEKKKEGEKRR 62
V K R ++ E E+EE + +EEE E K + E+ + K+K +E +K+E E+ R
Sbjct: 112 VRKQLRFLEQLEREEEE----KRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMR 167
Query: 63 T 63
Sbjct: 168 H 168
Score = 26.5 bits (59), Expect = 7.6
Identities = 12/50 (24%), Positives = 25/50 (50%), Gaps = 4/50 (8%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
RK+ + E E EEEE+ ++++ E+ K E+ + +K + +
Sbjct: 113 RKQLRFLEQLE----REEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEM 158
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 30.9 bits (70), Expect = 0.39
Identities = 18/60 (30%), Positives = 30/60 (50%), Gaps = 2/60 (3%)
Query: 4 SSTTQVSKYKRRKKKKEEEDEEGGGGEEE--EEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
SS + K KKKK + GG E ++EE+ K+ KK +KK +++ K + +
Sbjct: 409 SSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSK 468
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 30.5 bits (70), Expect = 0.39
Identities = 18/52 (34%), Positives = 26/52 (50%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
V K K KE E+E EE ++E E K+E E K++ + + E EK
Sbjct: 24 VRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEK 75
Score = 28.6 bits (65), Expect = 2.1
Identities = 12/47 (25%), Positives = 29/47 (61%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
+ + +K+EEE E+ E++++E +KKEE+ +E +++ ++
Sbjct: 97 NLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
Score = 27.8 bits (63), Expect = 3.4
Identities = 14/64 (21%), Positives = 31/64 (48%), Gaps = 4/64 (6%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ + ++ + + K+ EE ++ E E ++E E K++ K + E +K+ E+
Sbjct: 25 RKKIAEAKIKEAEEEAKRILEEAKK----EAEAIKKEALLEAKEEIHKLRNEFEKELRER 80
Query: 61 RRTL 64
R L
Sbjct: 81 RNEL 84
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 29.5 bits (67), Expect = 0.39
Identities = 14/51 (27%), Positives = 27/51 (52%)
Query: 7 TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
++K K + +KK+EE + + EEK + E +K +K+ EE + +
Sbjct: 15 KAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAEL 65
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 30.6 bits (69), Expect = 0.42
Identities = 12/53 (22%), Positives = 31/53 (58%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
+S + ++RK+++ E E+E++ +E + + +K+ K+++EE+K
Sbjct: 345 SDSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDYEKERKRRREEDK 397
Score = 27.5 bits (61), Expect = 4.9
Identities = 12/52 (23%), Positives = 28/52 (53%)
Query: 4 SSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
S S R++K++ +E+E++ ++ + + +KE+K++ EE K
Sbjct: 346 DSEASDSGPTRKRKRRRVPPLPEYSSDEDEDDSDEDEVDYEKERKRRREEDK 397
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 29.7 bits (67), Expect = 0.45
Identities = 12/28 (42%), Positives = 21/28 (75%)
Query: 38 KKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
K+K ++KK+KKKK+++KKK+ K+
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKK 133
Score = 29.3 bits (66), Expect = 0.62
Identities = 12/25 (48%), Positives = 21/25 (84%)
Query: 32 EEEEEEKKKEEKKKEKKKKEEEKKK 56
+E+ ++KK+++KKK+KKKK+ KK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKA 130
Score = 28.9 bits (65), Expect = 0.77
Identities = 12/30 (40%), Positives = 24/30 (80%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+E+ +++K+K++KKK+KKKK +K + +K
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 28.1 bits (63), Expect = 1.6
Identities = 13/29 (44%), Positives = 24/29 (82%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
+E+ ++KK++KKK+KKKK++ KK +K+
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 27.0 bits (60), Expect = 4.0
Identities = 11/28 (39%), Positives = 21/28 (75%)
Query: 36 EEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
+EK K++K+K+KKKK+++KK + +
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKK 133
Score = 26.2 bits (58), Expect = 8.0
Identities = 12/38 (31%), Positives = 26/38 (68%), Gaps = 4/38 (10%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
K+ K+K ++ ++ ++++++++KKK KK KKKK
Sbjct: 102 KQVLKEKAKQKKQ----KKKKKKKKKKKTSKKAAKKKK 135
Score = 25.8 bits (57), Expect = 8.9
Identities = 11/27 (40%), Positives = 22/27 (81%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKK 56
+++++++KKK++KKK+ KK +KKK
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 29.9 bits (68), Expect = 0.46
Identities = 12/54 (22%), Positives = 31/54 (57%), Gaps = 4/54 (7%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK----KKKEEEKKKE 57
+ K KR+++K+ + + G E ++ EE K E+ ++++ +++ E ++K
Sbjct: 19 KAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKA 72
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 30.5 bits (68), Expect = 0.47
Identities = 16/50 (32%), Positives = 28/50 (56%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K R K +EE++ +EEE KK++E+++ +K + KE K+R
Sbjct: 1023 KERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKR 1072
Score = 28.6 bits (63), Expect = 2.4
Identities = 14/46 (30%), Positives = 25/46 (54%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALK 75
+EE+E E + K++E KKE+E+++ K + + K LK
Sbjct: 1029 QEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLK 1074
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 29.8 bits (67), Expect = 0.50
Identities = 18/52 (34%), Positives = 32/52 (61%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K++ + EE + G + E++ +KKK E KE+KKK++EKKK+ ++
Sbjct: 119 KHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHS 170
Score = 29.0 bits (65), Expect = 0.89
Identities = 15/56 (26%), Positives = 35/56 (62%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
++ + + + ++ D EG G E++ +++K +++K+++KKKKE++KKK
Sbjct: 111 KKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166
Score = 26.7 bits (59), Expect = 6.5
Identities = 19/46 (41%), Positives = 26/46 (56%), Gaps = 10/46 (21%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
K+ KKKK E+D+E +KKK+EKKK+KK+ E G
Sbjct: 142 KKHKKKKHEDDKE----------RKKKKKEKKKKKKRHSPEHPGVG 177
Score = 26.3 bits (58), Expect = 7.7
Identities = 17/56 (30%), Positives = 30/56 (53%)
Query: 6 TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
Q K K + K K+ ++ E + E K EKK +KKK E++K+++ +K+
Sbjct: 105 HIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKK 160
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 30.3 bits (68), Expect = 0.52
Identities = 18/44 (40%), Positives = 33/44 (75%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+E E+EE EEEEEEEE+++E++ +E++ ++EE+++E E
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVE 477
Score = 26.8 bits (59), Expect = 8.2
Identities = 18/42 (42%), Positives = 29/42 (69%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
++EE EE EEEEEEEE++ EE++ E +++EEE + +
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNG 481
Score = 26.4 bits (58), Expect = 9.6
Identities = 15/43 (34%), Positives = 29/43 (67%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
++++EEE+EE +E EEEE + +EE+++ + E++ EG
Sbjct: 446 EEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEG 488
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 30.0 bits (68), Expect = 0.53
Identities = 13/35 (37%), Positives = 24/35 (68%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEE 52
K EEE+EE E E+E+EE + EE++++ + ++
Sbjct: 285 KAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 28.9 bits (65), Expect = 1.3
Identities = 12/35 (34%), Positives = 25/35 (71%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
K ++EEE+++ E+E+EE+E ++EE+ ++ K
Sbjct: 285 KAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 26.9 bits (60), Expect = 6.1
Identities = 9/35 (25%), Positives = 25/35 (71%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK 47
K ++++E++ E +EE+E+EE+++++ + +K
Sbjct: 285 KAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 30.0 bits (68), Expect = 0.53
Identities = 13/30 (43%), Positives = 21/30 (70%), Gaps = 3/30 (10%)
Query: 32 EEEEEEKK---KEEKKKEKKKKEEEKKKEG 58
+ ++EEKK KEEKK K +KE+ +++ G
Sbjct: 95 KAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124
Score = 29.2 bits (66), Expect = 0.83
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ ++E+KK K+EKK + EK+K E+
Sbjct: 95 KAQKEEKKAMSKEEKKAIKAEKEKLEEE 122
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 30.4 bits (68), Expect = 0.54
Identities = 12/45 (26%), Positives = 31/45 (68%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
R +++DE+ +++EE++E+++EE+++ K +E+++ EG
Sbjct: 148 RDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEG 192
Score = 28.9 bits (64), Expect = 2.0
Identities = 15/53 (28%), Positives = 29/53 (54%), Gaps = 5/53 (9%)
Query: 13 KRRKKKKEEEDEEGGG-----GEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+RR + EED ++E+E+E+ EE +E++++EEE K ++
Sbjct: 134 RRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDE 186
Score = 28.5 bits (63), Expect = 2.6
Identities = 20/54 (37%), Positives = 29/54 (53%), Gaps = 3/54 (5%)
Query: 20 EEEDEEGGGGEEEEEEEEKK---KEEKKKEKKKKEEEKKKEGEKRRTLTFIRIG 70
E++D+E EEEEEEEE K E+++ E + +K E +K FI G
Sbjct: 162 EDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTDCFKFIEAG 215
Score = 27.7 bits (61), Expect = 4.2
Identities = 11/45 (24%), Positives = 30/45 (66%)
Query: 22 EDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
+D++ E++++EE+ ++EE+++E+ K +++ +E E T+
Sbjct: 154 DDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTY 198
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 30.2 bits (69), Expect = 0.58
Identities = 16/53 (30%), Positives = 28/53 (52%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKP 72
+E+ E ++EEE+E+ KE + KK + E+KK+ ++IGK
Sbjct: 168 SKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKK 220
Score = 28.3 bits (64), Expect = 2.7
Identities = 14/43 (32%), Positives = 24/43 (55%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ ++E ++EEE+EK +E + KK E EKKK+ +
Sbjct: 165 IDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKN 207
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 28.7 bits (65), Expect = 0.59
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 35 EEEKKKEEKKKEKKKKEEE 53
EE+KKE++++EEE
Sbjct: 61 AAAAAAEEEKKEEEEEEEE 79
Score = 28.4 bits (64), Expect = 0.66
Identities = 7/20 (35%), Positives = 14/20 (70%)
Query: 34 EEEEKKKEEKKKEKKKKEEE 53
+EEKK+E++++EE+
Sbjct: 61 AAAAAAEEEKKEEEEEEEED 80
Score = 28.0 bits (63), Expect = 0.89
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 26 GGGGEEEEEEEEKKKEEKKKE 46
EEEKK+EE+++E
Sbjct: 58 AAAAAAAAAEEEKKEEEEEEE 78
Score = 26.4 bits (59), Expect = 3.3
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 39 KKEEKKKEKKKKEEEKKKEGE 59
E++KKEEE+++E +
Sbjct: 60 AAAAAAAEEEKKEEEEEEEED 80
Score = 26.0 bits (58), Expect = 4.7
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 26 GGGGEEEEEEEEKKKEEKKKE 46
EEE+K++EE+++E
Sbjct: 59 AAAAAAAAEEEKKEEEEEEEE 79
Score = 26.0 bits (58), Expect = 5.0
Identities = 4/26 (15%), Positives = 15/26 (57%)
Query: 28 GGEEEEEEEEKKKEEKKKEKKKKEEE 53
++E+K++E++++E++
Sbjct: 56 AAAAAAAAAAAEEEKKEEEEEEEEDD 81
Score = 25.3 bits (56), Expect = 8.7
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 37 EKKKEEKKKEKKKKEEEKKKE 57
+E+KK+EEE+++E
Sbjct: 59 AAAAAAAAEEEKKEEEEEEEE 79
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A
peptide from the central, syntaxin binding domain of
synaphin competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal
exocytosis by promoting interaction between the
complementary syntaxin and synaptobrevin transmembrane
regions that reside in opposing membranes prior to
fusion.
Length = 139
Score = 29.5 bits (66), Expect = 0.60
Identities = 15/53 (28%), Positives = 30/53 (56%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
K +++EG + EEE+EE ++ ++ E+++K + +K E E+ IR
Sbjct: 17 KGMLGGKEDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIR 69
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 30.0 bits (68), Expect = 0.62
Identities = 13/28 (46%), Positives = 16/28 (57%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
EEE KKK++KKK+K KK E
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEA 204
Score = 28.9 bits (65), Expect = 1.2
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 29 GEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
EEE E K + E+++ KKKK+++KKK
Sbjct: 164 PTEEEVELLKARLEEERAKKKKKKKKKKT 192
Score = 27.3 bits (61), Expect = 4.1
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
EEE ++KKK++KKK KK E
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEAT 205
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 29.8 bits (67), Expect = 0.62
Identities = 13/40 (32%), Positives = 22/40 (55%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKK 49
S+ K +K K + ++ +EKK+E+KK +KKK
Sbjct: 38 SQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77
Score = 26.7 bits (59), Expect = 7.2
Identities = 11/51 (21%), Positives = 21/51 (41%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
K K ++ + + + + KK ++K+EEKKK +K
Sbjct: 26 DKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKK 76
Score = 26.3 bits (58), Expect = 9.4
Identities = 7/41 (17%), Positives = 15/41 (36%)
Query: 22 EDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
E E+ +++ E K + + +K + K R
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAER 57
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more
internally with limited reactivity in the C-terminal
domains, while P2 proteins seem to be more externally
located and are more likely to interact with other
cellular components. In lower eukaryotes, P1 and P2 are
further subdivided into P1A, P1B, P2A, and P2B, which
form P1A/P2B and P1B/P2A heterodimers. Some plant
species have a third P-protein, called P3, which is not
homologous to P1 and P2. In humans, P1 and P2 are
strongly autoimmunogenic. They play a significant role
in the etiology and pathogenesis of systemic lupus
erythema (SLE). In addition, the ribosome-inactivating
protein trichosanthin (TCS) interacts with human P0,
P1, and P2, with its primary binding site located in
the C-terminal region of P2. TCS inactivates the
ribosome by depurinating a specific adenine in the
sarcin-ricin loop of 28S rRNA.
Length = 103
Score = 28.8 bits (65), Expect = 0.63
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 26 GGGGEEEEEEEEKKKEEKKKE 46
E ++E+KKEE+++E
Sbjct: 73 AAAAAAAEAKKEEKKEEEEEE 93
Score = 28.4 bits (64), Expect = 0.89
Identities = 8/27 (29%), Positives = 15/27 (55%)
Query: 27 GGGEEEEEEEEKKKEEKKKEKKKKEEE 53
E KKEEKK+E++++ ++
Sbjct: 70 AAAAAAAAAAEAKKEEKKEEEEEESDD 96
Score = 27.7 bits (62), Expect = 1.6
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 26 GGGGEEEEEEEEKKKEEKKKE 46
E ++EEKK+EE+++
Sbjct: 74 AAAAAAEAKKEEKKEEEEEES 94
Score = 25.4 bits (56), Expect = 9.6
Identities = 9/19 (47%), Positives = 14/19 (73%)
Query: 39 KKEEKKKEKKKKEEEKKKE 57
E KKE+KK+EEE++ +
Sbjct: 77 AAAEAKKEEKKEEEEEESD 95
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex
together with SecB and SecD on the surface of
Salmonella. All these proteins are secreted by the type
III secretion system. Many mucosal pathogens use type
III secretion systems for the injection of effector
proteins into target cells. SecB, SseC and SecD are
inserted into the target cell membrane. where they form
a small pore or translocon. In addition to SseC, this
family includes the bacterial secreted proteins PopB,
PepB, YopB and EspD which are thought to be directly
involved in pore formation, and type III secretion
system translocon.
Length = 303
Score = 29.7 bits (67), Expect = 0.64
Identities = 11/30 (36%), Positives = 20/30 (66%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
E ++++KK E+ +E+ KK EK +E +K
Sbjct: 25 ERARDKQEKKAEEYQEQIKKAIEKAEEAKK 54
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 30.0 bits (68), Expect = 0.67
Identities = 11/47 (23%), Positives = 26/47 (55%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
K++++ +E+ + E++ + +E+ K++K KE K + KK
Sbjct: 229 ESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKV 275
Score = 27.7 bits (62), Expect = 3.1
Identities = 13/32 (40%), Positives = 19/32 (59%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
E EE+ E + +KK K K+E +KKK E +
Sbjct: 236 ETEEDVEAAESRAEKKRKSKEEIKKKKPKESK 267
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 29.8 bits (68), Expect = 0.67
Identities = 9/24 (37%), Positives = 18/24 (75%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEE 53
+ EEE+++EE+++E++ EEE
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 27.5 bits (62), Expect = 4.4
Identities = 11/42 (26%), Positives = 23/42 (54%), Gaps = 5/42 (11%)
Query: 23 DEEGGGGEEEEE-----EEEKKKEEKKKEKKKKEEEKKKEGE 59
++ EE +E + EE+++E++++EEE+ E E
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 26.4 bits (59), Expect = 8.6
Identities = 9/38 (23%), Positives = 23/38 (60%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
K +EE +E + + E+++EE+++E++++ E
Sbjct: 281 KDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSE 318
Score = 26.4 bits (59), Expect = 9.3
Identities = 8/38 (21%), Positives = 21/38 (55%)
Query: 23 DEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
D++ E +E + + +E++++EEE+++E
Sbjct: 280 DKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPS 317
>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein. All proteins in this family
for which functions are known are used for the
recognition of DNA damage as part of nucleotide excision
repair. This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 172
Score = 29.4 bits (66), Expect = 0.67
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 28 GGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
G EE +EE++++EE K+E K+K+ EKK + +R
Sbjct: 91 GSEEALDEEKERREESKEEMKEKKFEKKLKELRRAV 126
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 29.8 bits (67), Expect = 0.69
Identities = 16/52 (30%), Positives = 32/52 (61%)
Query: 3 NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
+ S K +K EE+D++ G EE+E +E ++EE+ +E++++E+E
Sbjct: 233 QDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDED 284
Score = 28.3 bits (63), Expect = 2.4
Identities = 15/43 (34%), Positives = 30/43 (69%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
K K+ +DEE + + EE++ KE++++E+ ++EEE++ E E
Sbjct: 243 KNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 30.0 bits (68), Expect = 0.72
Identities = 17/65 (26%), Positives = 34/65 (52%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIG 70
+Y+ +++ E E G E EE EK++EE KK +K +EE ++ + ++ L +
Sbjct: 660 EYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKA 719
Query: 71 KPALK 75
++
Sbjct: 720 LERVE 724
Score = 27.3 bits (61), Expect = 5.0
Identities = 16/50 (32%), Positives = 29/50 (58%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ R K+ EE++E +++ +E EK+ EE ++ + EE K K+ E R
Sbjct: 327 EERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELER 376
Score = 27.3 bits (61), Expect = 5.9
Identities = 21/56 (37%), Positives = 32/56 (57%), Gaps = 3/56 (5%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKK---EEKKKEKKKKEEEKKKEGEK 60
++ K ++ K+ EE EE E+E E E K EEK +E +++ EE KKE E+
Sbjct: 222 ELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEE 277
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 30.0 bits (68), Expect = 0.74
Identities = 16/59 (27%), Positives = 32/59 (54%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
S + K+KKE ++EE EEE + E+ + K+ +K ++EE+++E ++
Sbjct: 434 AESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE 492
Score = 28.9 bits (65), Expect = 1.6
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 10/55 (18%)
Query: 16 KKKKEEEDEEGGG------GEEEEEEEEKKKEEKKKEKKKK----EEEKKKEGEK 60
+KKEE D E GEEE +EEE ++ KK ++K EK+ E +K
Sbjct: 384 ARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKK 438
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 29.7 bits (67), Expect = 0.76
Identities = 14/57 (24%), Positives = 32/57 (56%), Gaps = 4/57 (7%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE----KKKEGEKRRT 63
K+ +++K+ + ++E E+E ++ E +++ K+K EE K ++ K+RT
Sbjct: 94 SAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRT 150
Score = 28.5 bits (64), Expect = 1.7
Identities = 13/44 (29%), Positives = 29/44 (65%), Gaps = 5/44 (11%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
KK K+++ + EEE +++KK+++++E+K+K EE ++
Sbjct: 188 ELKKLKQQQQKR-----EEERRKQRKKQQEEEERKQKAEEAWQK 226
Score = 28.2 bits (63), Expect = 2.7
Identities = 12/59 (20%), Positives = 38/59 (64%), Gaps = 4/59 (6%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
VS+ + +K+ +E E ++ ++++++ +++ K+++K+++EEE+K++ E+
Sbjct: 168 SAKPERNVSQEEAKKRLQEWELKK----LKQQQQKREEERRKQRKKQQEEEERKQKAEE 222
Score = 26.2 bits (58), Expect = 9.1
Identities = 12/56 (21%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE---KKKKEEEKKKEGEKRRT 63
+ K +K +E++ +E EE E ++ +EK +E +K ++ K++ + ++
Sbjct: 101 QKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKE 156
>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
family 4 of glycosyl hydrolases [Carbohydrate transport
and metabolism].
Length = 442
Score = 29.9 bits (68), Expect = 0.77
Identities = 17/52 (32%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 12 YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
Y ++KK +E ++ G E + E KE + KK + +EK +E EKR
Sbjct: 265 YDMKEKKPDEIEKFGTRANEVMKRE---KELFELYKKPELKEKPEELEKRIG 313
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 29.8 bits (67), Expect = 0.77
Identities = 19/48 (39%), Positives = 25/48 (52%), Gaps = 3/48 (6%)
Query: 13 KRR---KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
KRR K KEE + E E+ EEEK E + + K E EK++E
Sbjct: 254 KRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301
Score = 28.3 bits (63), Expect = 2.9
Identities = 11/31 (35%), Positives = 22/31 (70%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
EE+ E EK +E+ + +++ EE++++E EK
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEK 282
>gnl|CDD|227474 COG5145, RAD14, DNA excision repair protein [DNA replication,
recombination, and repair].
Length = 292
Score = 29.6 bits (66), Expect = 0.77
Identities = 14/38 (36%), Positives = 27/38 (71%)
Query: 28 GGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
G EE + E++++E+ K ++K+K+ EKK + +R+T T
Sbjct: 209 GSLEELDREKQRREKMKDDRKEKKLEKKIKELRRKTRT 246
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 28.8 bits (65), Expect = 0.78
Identities = 13/30 (43%), Positives = 23/30 (76%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
E E K+E+KK+ K+KKEEEK+++ + ++
Sbjct: 93 LEHERNKQEKKKRSKEKKEEEKERKRQLKQ 122
>gnl|CDD|234032 TIGR02855, spore_yabG, sporulation peptidase YabG. Members of this
family are the protein YabG, demonstrated for Bacillus
subtilis to be an endopeptidase able to release
N-terminal peptides from a number of sporulation
proteins, including CotT, CotF, and SpoIVA. It appears
to be expressed under control of sigma-K [Cellular
processes, Sporulation and germination].
Length = 283
Score = 29.6 bits (67), Expect = 0.83
Identities = 16/63 (25%), Positives = 27/63 (42%), Gaps = 8/63 (12%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
V + +R+K+KK + E E + + K K+K+E + G K L F
Sbjct: 52 VDESERKKRKKRFK--------ERMETSYRLFRQDYKLMKEKKEYRATGGYKNNQLYFGM 103
Query: 69 IGK 71
G+
Sbjct: 104 PGR 106
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 29.1 bits (66), Expect = 0.84
Identities = 15/55 (27%), Positives = 30/55 (54%), Gaps = 1/55 (1%)
Query: 6 TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
++ + K+R + + + E+ G EE EE + E+ K+ +K+ ++ K E EK
Sbjct: 75 KKELEELKQRIAELQAQIEKLKKGREETEERT-ELLEELKQLEKELKKLKAELEK 128
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 29.6 bits (67), Expect = 0.86
Identities = 11/43 (25%), Positives = 26/43 (60%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+E+D+E ++ ++E+E+KKE K+ EK +++ + +
Sbjct: 151 DDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDD 193
Score = 26.5 bits (59), Expect = 9.7
Identities = 9/43 (20%), Positives = 24/43 (55%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
E++DE+ + ++E+EEKK+ ++ ++ ++ E +
Sbjct: 151 DDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDD 193
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains separated
by a hinge in the middle. The eukaryotic SMC proteins
form two kind of heterodimers: the SMC1/SMC3 and the
SMC2/SMC4 types. These heterodimers constitute an
essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 29.9 bits (67), Expect = 0.87
Identities = 17/57 (29%), Positives = 27/57 (47%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRI 69
KR KEE E EE+E++ + + +K++ EEEKK+ + T R
Sbjct: 964 KRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRF 1020
Score = 29.6 bits (66), Expect = 1.00
Identities = 14/57 (24%), Positives = 28/57 (49%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+ + + + +R +EE E EE++++ K + KK++ EE+KKE
Sbjct: 951 KEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKE 1007
Score = 28.8 bits (64), Expect = 1.9
Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 2/63 (3%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKE--EEKKKEGEKRRTLTFIR 68
K +R KK EE + + EE + ++ + K++ KK E + K+K + L ++
Sbjct: 172 KKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLD 231
Query: 69 IGK 71
K
Sbjct: 232 YLK 234
Score = 28.8 bits (64), Expect = 2.1
Identities = 10/47 (21%), Positives = 24/47 (51%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+++ E E+E G E+ +++E KK ++ E + +E + +
Sbjct: 154 ERRLEIEEEAAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQE 200
Score = 28.4 bits (63), Expect = 2.3
Identities = 20/62 (32%), Positives = 30/62 (48%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRI 69
K+ +K+EE + +EEE+E+K +EE+ K K+EEE K E K
Sbjct: 255 ESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDE 314
Query: 70 GK 71
K
Sbjct: 315 EK 316
Score = 28.0 bits (62), Expect = 3.1
Identities = 18/54 (33%), Positives = 30/54 (55%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRI 69
++K E +E EEEE+ E+ K EE+K+EK K +EE+ + E+ +
Sbjct: 765 EEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELL 818
Score = 27.6 bits (61), Expect = 5.2
Identities = 13/60 (21%), Positives = 30/60 (50%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALK 75
K+++ EEE+E++ +++E K K+++E + + +RR + K + K
Sbjct: 263 KEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEK 322
>gnl|CDD|216095 pfam00748, Calpain_inhib, Calpain inhibitor. This region is found
multiple times in calpain inhibitor proteins.
Length = 131
Score = 28.6 bits (64), Expect = 0.92
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 29 GEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
G+ E + EEKK E K ++K KEE+ +K GE+ T+
Sbjct: 72 GQREPDPEEKKPVEDKVKEKAKEEKLEKLGEREDTI 107
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 29.4 bits (66), Expect = 0.92
Identities = 9/74 (12%), Positives = 29/74 (39%), Gaps = 2/74 (2%)
Query: 3 NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ + +K + E++E ++E ++++ E++ ++ ++E R
Sbjct: 321 RAPELHAENAEIKKTRTAEKNEAKA--RKKEIAQKRRAAEREINREARQERAAAMARARA 378
Query: 63 TLTFIRIGKPALKP 76
++ K L
Sbjct: 379 RRAAVKAKKKGLID 392
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 29.8 bits (67), Expect = 0.92
Identities = 12/56 (21%), Positives = 25/56 (44%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
+ ++ + ++ E EE + E+++EK++ EE +R FIR
Sbjct: 782 YLMQAHQKGQLKDTIDVAEPEEPGDPLTAEEQEEKEQLLEEGFSTWSRRDFNAFIR 837
Score = 29.4 bits (66), Expect = 1.3
Identities = 11/43 (25%), Positives = 27/43 (62%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
++++ +E+EE++E+ + K E K+E+ + KE +K++
Sbjct: 30 DDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLKELKKQK 72
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 28.8 bits (65), Expect = 0.96
Identities = 13/48 (27%), Positives = 24/48 (50%), Gaps = 8/48 (16%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
KRR + E +E +K++ +K ++KK + ++KE EK
Sbjct: 91 KRRIALRL--------RRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 29.6 bits (66), Expect = 0.96
Identities = 16/38 (42%), Positives = 21/38 (55%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
E EE EEE+EEEE K+ K+ KKK+ + K
Sbjct: 768 AESSEEDESSEEEKEEEENKEVSAKRAKKKQRKNMLKS 805
Score = 27.3 bits (60), Expect = 5.5
Identities = 12/43 (27%), Positives = 25/43 (58%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ E + E EE+E +EEK++E+ K+ K+ + ++R+
Sbjct: 758 DGEQDNSDSFAESSEEDESSEEEKEEEENKEVSAKRAKKKQRK 800
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.
Trichoplein or mitostatin, was first defined as a
meiosis-specific nuclear structural protein. It has
since been linked with mitochondrial movement. It is
associated with the mitochondrial outer membrane, and
over-expression leads to reduction in mitochondrial
motility whereas lack of it enhances mitochondrial
movement. The activity appears to be mediated through
binding the mitochondria to the actin intermediate
filaments (IFs).
Length = 349
Score = 29.5 bits (67), Expect = 0.97
Identities = 16/43 (37%), Positives = 28/43 (65%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
K+R K +E+E+E EEE + EE+++E+K+KEE ++
Sbjct: 29 KKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERRE 71
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 29.6 bits (67), Expect = 0.98
Identities = 9/53 (16%), Positives = 17/53 (32%), Gaps = 4/53 (7%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
Y ++KK++E E + K EK K+ + +
Sbjct: 763 NYTEKQKKEKES----KSELEALKGVGAKTAEKLKDAGVETVTDLTAADPDAV 811
>gnl|CDD|239285 cd02987, Phd_like_Phd, Phosducin (Phd)-like family, Phd
subfamily; Phd is a cytosolic regulator of G protein
functions. It specifically binds G protein betagamma
(Gbg)-subunits with high affinity, resulting in the
solubilization of Gbg from the plasma membrane. This
impedes the formation of a functional G protein trimer
(G protein alphabetagamma), thereby inhibiting G
protein-mediated signal transduction. Phd also inhibits
the GTPase activity of G protein alpha. Phd can be
phosphorylated by protein kinase A and G
protein-coupled receptor kinase 2, leading to its
inactivation. Phd was originally isolated from the
retina, where it is highly expressed and has been
implicated to play an important role in light
adaptation. It is also found in the pineal gland,
liver, spleen, striated muscle and the brain. The
C-terminal domain of Phd adopts a thioredoxin fold, but
it does not contain a CXXC motif. Phd interacts with G
protein beta mostly through the N-terminal helical
domain.
Length = 175
Score = 28.8 bits (65), Expect = 1.0
Identities = 11/54 (20%), Positives = 31/54 (57%), Gaps = 6/54 (11%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
++ +++ K+ KE E E+++++E+K++ ++ +++ +E K RR
Sbjct: 13 INDWRKFKQLKESEQ------EDDDDDEDKEEFLQQYREQRMQEMHAKLPFGRR 60
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 29.6 bits (67), Expect = 1.0
Identities = 15/68 (22%), Positives = 30/68 (44%), Gaps = 10/68 (14%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEE----------EEKKKEEKKKEKKKKE 51
+ T++ + + + E E G ++E+E EEK+ +K ++ K E
Sbjct: 225 KLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLE 284
Query: 52 EEKKKEGE 59
K+K+ E
Sbjct: 285 ILKEKKDE 292
Score = 29.2 bits (66), Expect = 1.5
Identities = 10/29 (34%), Positives = 20/29 (68%)
Query: 29 GEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+++E + EEK K+++KKE+ K++E
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREE 676
Score = 28.0 bits (63), Expect = 2.8
Identities = 13/37 (35%), Positives = 23/37 (62%)
Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIG 70
EE+ KKEEKK++ K++E+E+ + E T+ +
Sbjct: 659 TEEKTKKEEKKEKNKREEKERIEIAETWDTIPLAQAI 695
Score = 28.0 bits (63), Expect = 3.4
Identities = 13/31 (41%), Positives = 21/31 (67%)
Query: 24 EEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
E+ + + EEK K+E+KKEK K+EE++
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
Score = 26.9 bits (60), Expect = 6.6
Identities = 10/32 (31%), Positives = 23/32 (71%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
E+++E + E+K +K++K+E+ K+E ++R
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKER 679
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 29.6 bits (66), Expect = 1.1
Identities = 9/39 (23%), Positives = 19/39 (48%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
E+ +E+G +E EE + K E+ ++ E + +
Sbjct: 4060 EKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQA 4098
Score = 26.5 bits (58), Expect = 9.7
Identities = 8/38 (21%), Positives = 19/38 (50%)
Query: 23 DEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+++ E+ EE ++ EE ++ K +EE ++
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVP 4094
>gnl|CDD|216799 pfam01940, DUF92, Integral membrane protein DUF92. Members of
this family have several predicted transmembrane
helices. The function of these prokaryotic proteins is
unknown.
Length = 224
Score = 29.0 bits (66), Expect = 1.1
Identities = 5/20 (25%), Positives = 8/20 (40%)
Query: 9 VSKYKRRKKKKEEEDEEGGG 28
+K ++K E GG
Sbjct: 55 ATKLGYKRKAALGIAEARGG 74
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 27.8 bits (62), Expect = 1.1
Identities = 7/35 (20%), Positives = 21/35 (60%)
Query: 25 EGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
EG E+++++E+ ++ K +K +++ ++ E
Sbjct: 46 EGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80
Score = 27.4 bits (61), Expect = 1.2
Identities = 7/32 (21%), Positives = 19/32 (59%)
Query: 29 GEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
GEE E++++ + ++ +K K+++ + E
Sbjct: 47 GEEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78
Score = 25.1 bits (55), Expect = 9.7
Identities = 6/34 (17%), Positives = 19/34 (55%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
E E+ E +E++++++ K ++ + E++
Sbjct: 46 EGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDD 79
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 29.3 bits (66), Expect = 1.1
Identities = 11/54 (20%), Positives = 25/54 (46%)
Query: 7 TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+++ K++ + ++ E ++ ++E+ K E K EE K+ E K
Sbjct: 361 KELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGK 414
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 28.7 bits (65), Expect = 1.1
Identities = 8/32 (25%), Positives = 16/32 (50%), Gaps = 2/32 (6%)
Query: 33 EEEEEKKKEEKKKEKKKKE--EEKKKEGEKRR 62
+ + + K+EK + + + + E K KE E
Sbjct: 73 QAKIARYKKEKARYRSEAKELEAKAKEAEAES 104
Score = 27.2 bits (61), Expect = 3.5
Identities = 8/36 (22%), Positives = 19/36 (52%), Gaps = 3/36 (8%)
Query: 7 TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE 42
++++YK+ K + E +E E + +E E + +
Sbjct: 74 AKIARYKKEKARYRSEAKE---LEAKAKEAEAESDH 106
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 29.2 bits (65), Expect = 1.2
Identities = 12/45 (26%), Positives = 25/45 (55%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
R+K++E ++ +E+++ E +K+E +K + E KK E
Sbjct: 251 RQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDE 295
Score = 28.4 bits (63), Expect = 2.5
Identities = 15/57 (26%), Positives = 30/57 (52%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+ + ++ K + KK +EE + + + ++E K EK+ E K+ E +KK+E
Sbjct: 275 VAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKRE 331
Score = 27.7 bits (61), Expect = 3.8
Identities = 13/55 (23%), Positives = 27/55 (49%), Gaps = 10/55 (18%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKK----------KEKKKKEEEKKKEGEK 60
K +++ + + ++ +E +K++E K KE K+ E +K+E EK
Sbjct: 231 KAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEK 285
Score = 27.3 bits (60), Expect = 5.1
Identities = 12/42 (28%), Positives = 21/42 (50%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
K+ K+ + + E+ + E +K EE K K K + K+E
Sbjct: 270 KEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQE 311
Score = 26.5 bits (58), Expect = 9.2
Identities = 11/44 (25%), Positives = 23/44 (52%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
KE E +E ++ +EE KK+ + ++K + + +K+R
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQR 247
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 29.2 bits (66), Expect = 1.2
Identities = 13/42 (30%), Positives = 25/42 (59%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGK 71
++E ++ E+K E+ + E +KKEEE +K K + ++ K
Sbjct: 101 KKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAK 142
Score = 28.4 bits (64), Expect = 2.5
Identities = 17/60 (28%), Positives = 28/60 (46%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKP 76
KK EE+ E+ E++EEE +K + K +K K+ KK + L + K +
Sbjct: 105 KKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNFKEK 164
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 28.7 bits (65), Expect = 1.2
Identities = 11/55 (20%), Positives = 31/55 (56%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
+K K + ++ E++ ++ E+E+ ++++ K K +KK + + ++ E + T
Sbjct: 105 EAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 29.2 bits (66), Expect = 1.2
Identities = 7/41 (17%), Positives = 23/41 (56%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE 42
+ S+ + + +++K ED++ ++ E+E+++ E+
Sbjct: 1346 ASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDED 1386
Score = 26.5 bits (59), Expect = 9.1
Identities = 16/59 (27%), Positives = 29/59 (49%)
Query: 4 SSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+TT + K EE EE EE+E +E++ + K K K K + K + ++++
Sbjct: 1122 KNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKK 1180
>gnl|CDD|151656 pfam11214, Med2, Mediator complex subunit 2. This family of
mediator complex subunit 2 proteins is conserved in
fungi. Cyclin-dependent kinase CDK8 or Srb10 interacts
with and phosphorylates Med2. Post-translational
modifications of Mediator subunits are important for
regulation of gene expression.
Length = 99
Score = 27.8 bits (62), Expect = 1.3
Identities = 12/20 (60%), Positives = 19/20 (95%)
Query: 37 EKKKEEKKKEKKKKEEEKKK 56
E KK+++++E+KKKEEE+KK
Sbjct: 80 ENKKKQEEEERKKKEEEEKK 99
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 29.3 bits (65), Expect = 1.3
Identities = 17/43 (39%), Positives = 29/43 (67%), Gaps = 1/43 (2%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
KK EE E E E++ E+++ EK+KEK++ E E+++E E+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKER-EREREREAER 620
Score = 28.5 bits (63), Expect = 2.3
Identities = 13/31 (41%), Positives = 23/31 (74%)
Query: 32 EEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
EE E+ K+E ++K ++++E EK+KE E+ R
Sbjct: 582 EEAVEKAKREAEQKAREEREREKEKEKERER 612
>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the heterodimeric topo I from
Leishmania donvanni. Topo I enzymes are divided into:
topo type IA (bacterial) and type IB (eukaryotic). Topo
I relaxes superhelical tension in duplex DNA by creating
a single-strand nick, the broken strand can then rotate
around the unbroken strand to remove DNA supercoils and,
the nick is religated, liberating topo I. These enzymes
regulate the topological changes that accompany DNA
replication, transcription and other nuclear processes.
Human topo I is the target of a diverse set of
anticancer drugs including camptothecins (CPTs). CPTs
bind to the topo I-DNA complex and inhibit re-ligation
of the single-strand nick, resulting in the accumulation
of topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topo I play putative roles in
organizing the kinetoplast DNA network unique to these
parasites. This family may represent more than one
structural domain.
Length = 212
Score = 28.7 bits (64), Expect = 1.3
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
E+EKKK K+EKK +EEK KE E
Sbjct: 93 REKEKKKSRTKEEKKALKEEKDKEAEPYM 121
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 29.3 bits (66), Expect = 1.3
Identities = 14/34 (41%), Positives = 19/34 (55%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE 42
+ +K +EEE E+ EEEEE EE + EE
Sbjct: 339 CTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEE 372
Score = 27.7 bits (62), Expect = 4.2
Identities = 13/28 (46%), Positives = 20/28 (71%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEK 43
++K EEE++E EEEEEE E+ + E+
Sbjct: 345 EQKDEEEEQEDEEEEEEEEEPEEPEPEE 372
Score = 27.3 bits (61), Expect = 5.5
Identities = 17/39 (43%), Positives = 28/39 (71%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
++ EEE+EE GE +EE+E +++EE++ E+ KEEE
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGT 206
Score = 26.6 bits (59), Expect = 8.3
Identities = 14/28 (50%), Positives = 21/28 (75%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEK 47
E++DEE +EEEEEEE++ EE + E+
Sbjct: 345 EQKDEEEEQEDEEEEEEEEEPEEPEPEE 372
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 29.3 bits (65), Expect = 1.3
Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 5/52 (9%)
Query: 13 KRRKKKKEEEDEEGGG-----GEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
K +K+ +EDEE GEE + EE K+ +E KK+K KK +E KE E
Sbjct: 209 KTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYE 260
Score = 27.3 bits (60), Expect = 5.8
Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 4/55 (7%)
Query: 30 EEEEEEEEKKKEE----KKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRPTR 80
+E+EE+ +K E+ K +E K+ +E KKK+ +K + +T + KP TR
Sbjct: 217 DEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHKPLWTR 271
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 29.3 bits (66), Expect = 1.3
Identities = 13/60 (21%), Positives = 25/60 (41%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ S K +E+++G E E+ E KEK +K ++ K++ +K
Sbjct: 218 INKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKK 277
Score = 28.9 bits (65), Expect = 1.7
Identities = 11/55 (20%), Positives = 22/55 (40%), Gaps = 3/55 (5%)
Query: 9 VSKYKRRKKKKEEEDE---EGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ K K KEE++ G + +EE+K + + + + + EK
Sbjct: 210 LIKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEK 264
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 29.0 bits (65), Expect = 1.4
Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 2/49 (4%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K E GG +E++E+E +E K++ +KK++ +KK K
Sbjct: 324 DEKSLSEFLGYMGG--IDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 27.8 bits (62), Expect = 1.4
Identities = 10/44 (22%), Positives = 19/44 (43%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
K+ + + + E +K + KK+KK ++ K EG
Sbjct: 30 KEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73
Score = 27.0 bits (60), Expect = 2.4
Identities = 11/40 (27%), Positives = 17/40 (42%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKK 55
KK + G E E+ + KK++K +K K E
Sbjct: 34 AKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73
Score = 27.0 bits (60), Expect = 2.5
Identities = 9/44 (20%), Positives = 18/44 (40%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
K+ K +++ E +K + KK +K K+ K +
Sbjct: 30 KEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73
Score = 27.0 bits (60), Expect = 2.6
Identities = 15/46 (32%), Positives = 19/46 (41%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
K K K KK G E E+ K K++KK KK + E
Sbjct: 28 KPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73
Score = 25.5 bits (56), Expect = 7.9
Identities = 9/42 (21%), Positives = 18/42 (42%), Gaps = 3/42 (7%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
K K+ + + G + E +K + KK +++ KK
Sbjct: 28 KPKEGKAKKNGAPVTQTAGTE---TTTEKCKGKKDKKDCKKG 66
>gnl|CDD|215038 PLN00040, PLN00040, Protein MAK16 homolog; Provisional.
Length = 233
Score = 28.6 bits (64), Expect = 1.4
Identities = 15/45 (33%), Positives = 20/45 (44%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K E EE EEE + +K KKK + E E + E E +
Sbjct: 186 NKVLEMEEVEEAEEELPKSDKNPNSKKKSRVHVEIEYEDEIEYKS 230
>gnl|CDD|234659 PRK00137, rplI, 50S ribosomal protein L9; Reviewed.
Length = 147
Score = 28.1 bits (64), Expect = 1.4
Identities = 8/34 (23%), Positives = 14/34 (41%)
Query: 35 EEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
E + + E K ++ E E E + T+T
Sbjct: 48 EARRAELEAKAAEELAEAEALAEKLEGLTVTIKA 81
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 28.9 bits (65), Expect = 1.4
Identities = 11/44 (25%), Positives = 20/44 (45%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
+EE+E+ EEEE E+ ++E + + E G +
Sbjct: 375 DEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSES 418
Score = 26.6 bits (59), Expect = 8.9
Identities = 7/44 (15%), Positives = 19/44 (43%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+++E ED E G + E+ + + + + K +++
Sbjct: 385 HEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDS 428
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 28.8 bits (64), Expect = 1.5
Identities = 18/63 (28%), Positives = 38/63 (60%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
S + K ++++KKKEE+ E ++ E+E K+ EK++ K ++++++ +E EK+
Sbjct: 67 IQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQ 126
Query: 62 RTL 64
L
Sbjct: 127 AQL 129
Score = 28.0 bits (62), Expect = 2.8
Identities = 16/47 (34%), Positives = 20/47 (42%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K K E ++ + EE K K E KKK E E K EK +
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAK 208
Score = 26.8 bits (59), Expect = 6.6
Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 1/56 (1%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEE-EEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ K + ++K++EE E++ + E K K + K K E KK+ E+
Sbjct: 122 EAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAA 177
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 29.0 bits (66), Expect = 1.5
Identities = 10/29 (34%), Positives = 18/29 (62%)
Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
EEEE +KEE K+E +++ + + +E
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAE 67
Score = 27.1 bits (61), Expect = 5.7
Identities = 14/42 (33%), Positives = 19/42 (45%)
Query: 21 EEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
EE+E + E EEE K E ++ + EEE K E
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAA 80
Score = 27.1 bits (61), Expect = 6.6
Identities = 15/53 (28%), Positives = 27/53 (50%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
K+ K+K + G GG+ +++ KKK+K+K E+ +E T+T
Sbjct: 131 KKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPETIT 183
Score = 26.7 bits (60), Expect = 8.0
Identities = 12/44 (27%), Positives = 19/44 (43%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
EEE+ + E EEE K + E+ + +EE K +
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPA 82
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 29.0 bits (65), Expect = 1.5
Identities = 22/61 (36%), Positives = 30/61 (49%), Gaps = 9/61 (14%)
Query: 11 KYKRRKKKKEEEDE---------EGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
K KR++ KK EDE EEEE EE + KKE++K+E +G +R
Sbjct: 292 KEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRR 351
Query: 62 R 62
R
Sbjct: 352 R 352
Score = 27.9 bits (62), Expect = 3.5
Identities = 17/47 (36%), Positives = 29/47 (61%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
R E+E E G +E+E+E+E K ++ + +++ EEK+KE KR
Sbjct: 251 RDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKR 297
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 28.0 bits (62), Expect = 1.5
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 34 EEEEKKKEEKKKEKKKKEEEKKKE 57
K E KK+EKK++EEE++ +
Sbjct: 82 ATAGAKAEAKKEEKKEEEEEEEDD 105
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 28.1 bits (63), Expect = 1.6
Identities = 20/47 (42%), Positives = 33/47 (70%), Gaps = 4/47 (8%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
K KKE EDEE +++ EE+K+K+E+K KK+ + +KKK+ +K+
Sbjct: 57 EKWKKETEDEEF----QQKREEKKRKDEEKTAKKRAKRQKKKQKKKK 99
Score = 27.3 bits (61), Expect = 3.0
Identities = 13/46 (28%), Positives = 28/46 (60%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
K K++K KK + EE G + EE ++++E ++ ++++ E +K
Sbjct: 97 KKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 28.5 bits (64), Expect = 1.6
Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 4/47 (8%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
K+KEEE+ E E + E+ + K+K +EE+K E E+ R
Sbjct: 121 KQKEEEERR----VERRRELGLEDPEQLRLKQKAKEEQKAESEETRH 163
Score = 27.0 bits (60), Expect = 5.9
Identities = 8/33 (24%), Positives = 22/33 (66%), Gaps = 2/33 (6%)
Query: 30 EEEEEEEEKKKEEKKKEK--KKKEEEKKKEGEK 60
++++EEE+++ E+++E + E+ + K+ K
Sbjct: 119 AQKQKEEEERRVERRRELGLEDPEQLRLKQKAK 151
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 28.6 bits (64), Expect = 1.6
Identities = 8/35 (22%), Positives = 17/35 (48%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
KE+ED++ G EE + ++ ++ + E
Sbjct: 231 KEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGE 265
>gnl|CDD|225340 COG2719, SpoVR, Uncharacterized conserved protein [Function
unknown].
Length = 495
Score = 28.9 bits (65), Expect = 1.7
Identities = 18/74 (24%), Positives = 30/74 (40%), Gaps = 5/74 (6%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEK---KKEKKKKEEEKKKEGEK--RR 62
V +YKR KK EE+ EE + + + KK + ++ ++ +
Sbjct: 172 GVDRYKRPKKISSEEERARQEEREEYLQSQVNDLWRTLPKKPGEAAVKDARRFPSEPEED 231
Query: 63 TLTFIRIGKPALKP 76
L FI P L+P
Sbjct: 232 LLYFIEKNSPLLEP 245
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 28.7 bits (65), Expect = 1.7
Identities = 14/53 (26%), Positives = 23/53 (43%), Gaps = 3/53 (5%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE---KKKKEEEKKKEGE 59
+K K K+++ G ++E E K E+ K+K+ E K E E
Sbjct: 482 AKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAE 534
Score = 28.7 bits (65), Expect = 1.9
Identities = 10/45 (22%), Positives = 22/45 (48%), Gaps = 3/45 (6%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K+++ G ++E E K+ E+ E++K+KE + +
Sbjct: 489 KEQKITITASSGLSDDEIERMVKDA---EEYAAEDKKRKERIEAK 530
>gnl|CDD|240397 PTZ00390, PTZ00390, ubiquitin-conjugating enzyme; Provisional.
Length = 152
Score = 28.2 bits (63), Expect = 1.7
Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 2/62 (3%)
Query: 107 PPVVYPPSIYNHQSTLYSGTVCLNVINQEWTPLYDLSNIFETFLPQLLTYPNPTDPLNGD 166
P V + IY H + G +CL+++ +W+P + + + + LL+ P P DPL+
Sbjct: 66 PKVRFLTKIY-HPNIDKLGRICLDILKDKWSPALQIRTVLLS-IQALLSAPEPDDPLDTS 123
Query: 167 AA 168
A
Sbjct: 124 VA 125
>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA. This model
describes flagellar biosynthesis protein FlhA, one of a
large number of genes associated with the biosynthesis
of functional bacterial flagella. Homologs of many such
proteins, including FlhA, function in type III protein
secretion systems. A separate model describes InvA
(Salmonella enterica), LcrD (Yersinia enterocolitica),
HrcV (Xanthomonas), etc., all of which score below the
noise cutoff for this model [Cellular processes,
Chemotaxis and motility].
Length = 678
Score = 28.8 bits (65), Expect = 1.7
Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 4/36 (11%)
Query: 12 YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK 47
Y RR+ K+EEE E + +EE +EE++
Sbjct: 308 YLRRRSKQEEEAAA----EAAKAQEEAAEEEEESIN 339
Score = 28.4 bits (64), Expect = 2.5
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 36 EEKKKEEKKKEKKKKEEEKKKEGE 59
K++EE E K +EE +E E
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEE 335
>gnl|CDD|133433 cd05297, GH4_alpha_glucosidase_galactosidase, Glycoside Hydrolases
Family 4; Alpha-glucosidases and alpha-galactosidases.
Glucosidases cleave glycosidic bonds to release glucose
from oligosaccharides. Alpha-glucosidases and
alpha-galactosidases release alpha-D-glucose and
alpha-D-galactose, respectively, via the hydrolysis of
alpha-glycopyranoside bonds. Some bacteria
simultaneously translocate and phosphorylate
disaccharides via the phosphoenolpyruvate-dependent
phosphotransferase system (PEP-PTS). After
translocation, these phospho-disaccharides may be
hydrolyzed by the GH4 glycoside hydrolases such as the
alpha-glucosidases. Other organsisms (such as archaea
and Thermotoga maritima) lack the PEP-PTS system, but
have several enzymes normally associated with the
PEP-PTS operon. Alpha-glucosidases and
alpha-galactosidases are part of the NAD(P)-binding
Rossmann fold superfamily, which includes a wide variety
of protein families including the NAD(P)-binding domains
of alcohol dehydrogenases, tyrosine-dependent
oxidoreductases, glyceraldehyde-3-phosphate
dehydrogenases, formate/glycerate dehydrogenases,
siroheme synthases, 6-phosphogluconate dehydrogenases,
aminoacid dehydrogenases, repressor rex, and NAD-binding
potassium channel domains, among others.
Length = 423
Score = 28.7 bits (65), Expect = 1.7
Identities = 14/53 (26%), Positives = 21/53 (39%), Gaps = 5/53 (9%)
Query: 12 YKRRKKKKEEEDEEGGGG-----EEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
K KK E E G + E EE+ K + K++ + K+ GE
Sbjct: 259 RKETKKIWYGEFNEDEYGGRDEEQGWEWYEERLKLILAEIDKEELDPVKRSGE 311
>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078). This
family is found from fungi to humans, but its exact
function is not known.
Length = 88
Score = 27.2 bits (61), Expect = 1.7
Identities = 19/50 (38%), Positives = 35/50 (70%), Gaps = 4/50 (8%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ RK++ EE ++ EE E E K++EE+K+++K+ EE++K+ E+RR
Sbjct: 43 EERKEQMEELEKA----REETERERKEREERKEKRKRAIEERRKKIEERR 88
>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
Length = 110
Score = 27.5 bits (62), Expect = 1.8
Identities = 15/61 (24%), Positives = 30/61 (49%), Gaps = 2/61 (3%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPA 73
RR+K +E + + + +EE+EE + + + +K+ + E R L I++ KP
Sbjct: 7 RRRKLEELQKQAQEQQQAQEEQEEAQAQA--EAQKQAILRQILTPEARERLNRIKLVKPE 64
Query: 74 L 74
Sbjct: 65 F 65
>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
Length = 530
Score = 28.5 bits (64), Expect = 1.9
Identities = 9/35 (25%), Positives = 16/35 (45%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
EE E+ K+ KK K E+ +++ K +
Sbjct: 6 VEESGEKISKKAAKKAAAKAEKLRREATAKAAAAS 40
>gnl|CDD|218574 pfam05389, MecA, Negative regulator of genetic competence (MecA).
This family contains several bacterial MecA proteins.
The development of competence in Bacillus subtilis is
regulated by growth conditions and several regulatory
genes. In complex media competence development is poor,
and there is little or no expression of late competence
genes. Mec mutations permit competence development and
late competence gene expression in complex media,
bypassing the requirements for many of the competence
regulatory genes. The mecA gene product acts negatively
in the development of competence. Null mutations in mecA
allow expression of a late competence gene comG, under
conditions where it is not normally expressed, including
in complex media and in cells mutant for several
competence regulatory genes. Overexpression of MecA
inhibits comG transcription.
Length = 212
Score = 28.3 bits (64), Expect = 1.9
Identities = 11/49 (22%), Positives = 23/49 (46%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFI 67
K + D+ E+E+E+ E+K ++K E E + K + ++
Sbjct: 77 KSDSDDGLNLEEDEDEDISDLLEQKFRDKGLDEMEDTLDDSKEESTEYV 125
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein
is found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 28.5 bits (64), Expect = 1.9
Identities = 16/51 (31%), Positives = 28/51 (54%), Gaps = 5/51 (9%)
Query: 13 KRRKKKKE-----EEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
K++KKKK+ +D+E +E E+E++ KE + ++ KKK G
Sbjct: 1 KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIG 51
>gnl|CDD|225651 COG3109, ProQ, Activator of osmoprotectant transporter ProP [Signal
transduction mechanisms].
Length = 208
Score = 28.3 bits (63), Expect = 2.0
Identities = 16/67 (23%), Positives = 23/67 (34%), Gaps = 2/67 (2%)
Query: 30 EEEEEEEEKKKEEKKKE--KKKKEEEKKKEGEKRRTLTFIRIGKPALKPRPTRCVEGEGA 87
EE E K+ E K ++ E++ KK E +PA RP E
Sbjct: 93 EEHVEHARKQLAEAKARVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPREEV 152
Query: 88 PTRTLSV 94
T +
Sbjct: 153 HTPVSDI 159
>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain. This alpha
helical domain is found in a set of bacterial plasmid
replication proteins. The domain is found to the
C-terminus of the primase/polymerase domain. Mutants of
this domain are defective in template binding,
dinucleotide formation and conformation change prior to
DNA extension.
Length = 135
Score = 27.9 bits (62), Expect = 2.1
Identities = 13/35 (37%), Positives = 20/35 (57%)
Query: 28 GGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
G++EEE+ E+ E+ K+E K + K K E R
Sbjct: 1 EGKKEEEDSEEDFEKLKEEMAKYDRFKGKTVEAIR 35
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 28.5 bits (63), Expect = 2.2
Identities = 17/51 (33%), Positives = 33/51 (64%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
K KR +++K E+EE +EE + + +++EEK++ K++ E + + EKR
Sbjct: 220 KKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKR 270
Score = 27.3 bits (60), Expect = 4.5
Identities = 15/48 (31%), Positives = 24/48 (50%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
+ +EEE EE EE EE E + ++K + EE +KE ++
Sbjct: 107 ENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEP 154
Score = 27.3 bits (60), Expect = 4.6
Identities = 13/55 (23%), Positives = 34/55 (61%)
Query: 3 NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+ T + + + ++++EE +E G + E++ + + EE +KE+K+ E E++++
Sbjct: 107 ENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEK 161
Score = 27.3 bits (60), Expect = 5.2
Identities = 16/47 (34%), Positives = 25/47 (53%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
++EE++E EE EE E K E+K + + EE +K+E E
Sbjct: 110 TVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEP 156
Score = 26.9 bits (59), Expect = 6.6
Identities = 17/73 (23%), Positives = 35/73 (47%)
Query: 3 NSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
N + + K + R++++E E+ EG E++ + +E +K+EK+ + EE++K
Sbjct: 108 NETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSL 167
Query: 63 TLTFIRIGKPALK 75
LK
Sbjct: 168 EENNGEFMTHKLK 180
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 28.4 bits (64), Expect = 2.3
Identities = 15/45 (33%), Positives = 28/45 (62%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+K +E+ +EE E EEEE++++E++K++K +K K E
Sbjct: 13 VEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITE 57
Score = 27.2 bits (61), Expect = 5.2
Identities = 14/38 (36%), Positives = 26/38 (68%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFI 67
EEEEEEE + EE+++E+ ++E+++K + +T I
Sbjct: 21 EEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEI 58
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 27.9 bits (62), Expect = 2.3
Identities = 20/48 (41%), Positives = 33/48 (68%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
K+KR++KK+ E+ E E E+E++ +K ++KEKKK+ EE K+E
Sbjct: 148 LKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEE 195
>gnl|CDD|223014 PHA03231, PHA03231, glycoprotein BALF4; Provisional.
Length = 829
Score = 28.4 bits (64), Expect = 2.4
Identities = 10/70 (14%), Positives = 28/70 (40%), Gaps = 7/70 (10%)
Query: 6 TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKK-------KEEKKKEKKKKEEEKKKEG 58
TT+ + + ++ E +E + EE ++ +++++KK +++
Sbjct: 742 TTKALPSEATGEYGNGDESEESDFDEYKLEEAREMLLAMHLLSAEERQEKKAKKKNSGPA 801
Query: 59 EKRRTLTFIR 68
L +R
Sbjct: 802 LLSSALDRLR 811
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association
with pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 26.9 bits (60), Expect = 2.4
Identities = 12/42 (28%), Positives = 22/42 (52%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
EE +E+ + + ++ +KK+KKKK ++K KE
Sbjct: 14 NGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKE 55
>gnl|CDD|225826 COG3288, PntA, NAD/NADP transhydrogenase alpha subunit [Energy
production and conversion].
Length = 356
Score = 28.1 bits (63), Expect = 2.5
Identities = 13/37 (35%), Positives = 18/37 (48%), Gaps = 1/37 (2%)
Query: 16 KKKKEEEDEEGGGG-EEEEEEEEKKKEEKKKEKKKKE 51
K EDEE GG +E EE K+ + ++ KE
Sbjct: 208 AKFLAVEDEESAGGYAKEMSEEFIAKQAELVAEQAKE 244
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of
bone sialoprotein (BSP) is normally restricted to
mineralised connective tissues of bones and teeth where
it has been associated with mineral crystal formation.
However, it has been found that ectopic expression of
BSP occurs in various lesions, including oral and
extraoral carcinomas, in which it has been associated
with the formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 28.1 bits (62), Expect = 2.5
Identities = 14/49 (28%), Positives = 22/49 (44%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
++ + E+ G EEE EEE EE+ E E+++ E E
Sbjct: 41 RFSVQSGSDSSEENGDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAE 89
>gnl|CDD|217451 pfam03249, TSA, Type specific antigen. There are several antigenic
variants in Rickettsia tsutsugamushi, and a
type-specific antigen (TSA) of 56-kilodaltons located on
the rickettsial surface is responsible for the
variation. TSA proteins are probably integral membrane
proteins.
Length = 502
Score = 28.3 bits (63), Expect = 2.5
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 32 EEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
++EE+ + +KK+ EK KEG K + F
Sbjct: 369 QQEEDAGNQGGGDCKKKQGASEKSKEGGKGKETEF 403
Score = 26.8 bits (59), Expect = 7.8
Identities = 11/45 (24%), Positives = 20/45 (44%), Gaps = 2/45 (4%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+K ++ ++ + + KKK+ +K KE K KE E
Sbjct: 360 KKAMEKLAAQQEEDAGNQGGGDCKKKQ--GASEKSKEGGKGKETE 402
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 27.8 bits (62), Expect = 2.5
Identities = 11/59 (18%), Positives = 27/59 (45%)
Query: 6 TTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
TT +K KK+++++ ++ E +++K +E + +E +E +L
Sbjct: 57 TTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSL 115
>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator. This
family of trans-activating transcriptional regulator
(TATR), also known as intermediate early protein 1, are
common to the Nucleopolyhedroviruses.
Length = 571
Score = 28.3 bits (63), Expect = 2.5
Identities = 11/59 (18%), Positives = 25/59 (42%), Gaps = 4/59 (6%)
Query: 4 SSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
S T QV+ KR+ + + + + + + ++ + KK++K KK +
Sbjct: 117 SETKQVTVSKRKGSELDSDSDSS----DSSKGKKLVNKPKKRQKYKKATIQSPTSLTEE 171
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 28.1 bits (62), Expect = 2.6
Identities = 12/53 (22%), Positives = 30/53 (56%)
Query: 5 STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+VS+ + ++++E E+ E E + + ++ EEK+++ + +E E+ E
Sbjct: 285 ENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTE 337
>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
Length = 111
Score = 27.2 bits (60), Expect = 2.6
Identities = 14/44 (31%), Positives = 22/44 (50%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
K E+E+E + EE+ +K +EE E +KE EK+
Sbjct: 30 KNEQENEIKEAKSKYEEKAKKTEEESLNMYNAALMEARKEAEKK 73
>gnl|CDD|224667 COG1753, COG1753, Predicted antotoxin, copG family [Signal
transduction mechanisms].
Length = 74
Score = 26.3 bits (58), Expect = 2.7
Identities = 14/39 (35%), Positives = 20/39 (51%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKK 49
+ +K+ E G EEE E+ KKEEK+ EK +
Sbjct: 32 ELIEKKRGNLEVLMRAFGTLSEEEVEKIKKEEKEAEKMR 70
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 28.0 bits (63), Expect = 2.7
Identities = 19/47 (40%), Positives = 29/47 (61%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ K+K +E E + EE K+K++KKK+KKKKEE ++ EK
Sbjct: 63 QEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWMCEK 109
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 27.6 bits (62), Expect = 2.7
Identities = 18/53 (33%), Positives = 30/53 (56%), Gaps = 7/53 (13%)
Query: 10 SKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
S+ ++ KK EEE +EE E+ + E K + +K+EEE+++ EKR
Sbjct: 123 SELEQEIKKLEEE-------KEELEKRVAELEAKLEAIEKREEEERQIEEKRH 168
Score = 26.4 bits (59), Expect = 8.2
Identities = 18/47 (38%), Positives = 28/47 (59%), Gaps = 5/47 (10%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKE--KKKKEEEKKKEGEKRR 62
+K + E+G + E E+E KK EE+K+E K+ E E K E ++R
Sbjct: 113 RKALQAEQG---KSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKR 156
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 27.9 bits (62), Expect = 2.8
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGE 59
E+++K K +K K K KK+E+KKK E
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKARE 115
>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
represents the eukaryotic large ribosomal protein P2.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P2 is located in the L12 stalk, with proteins
P1, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers). Bacteria may have four or six
copies of L7/L12 (two or three homodimers) depending on
the species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more
internally with limited reactivity in the C-terminal
domains, while P2 proteins seem to be more externally
located and are more likely to interact with other
cellular components. In lower eukaryotes, P1 and P2 are
further subdivided into P1A, P1B, P2A, and P2B, which
form P1A/P2B and P1B/P2A heterodimers. Some plants have
a third P-protein, called P3, which is not homologous
to P1 and P2. In humans, P1 and P2 are strongly
autoimmunogenic. They play a significant role in the
etiology and pathogenesis of systemic lupus erythema
(SLE). In addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and
P2, with its primary binding site in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin
loop of 28S rRNA.
Length = 109
Score = 26.8 bits (60), Expect = 2.8
Identities = 6/21 (28%), Positives = 11/21 (52%)
Query: 26 GGGGEEEEEEEEKKKEEKKKE 46
++E+KKEE ++E
Sbjct: 78 AAAAAAAAAKKEEKKEESEEE 98
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).
This family consists of several Plasmodium falciparum
SPAM (secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the
alanine heptad-repeat domain. Heptad repeats in which
the a and d position contain hydrophobic residues
generate amphipathic alpha-helices which give rise to
helical bundles or coiled-coil structures in proteins.
SPAM is an example of a P. falciparum antigen in which
a repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 27.5 bits (61), Expect = 2.9
Identities = 11/50 (22%), Positives = 26/50 (52%)
Query: 2 QNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKE 51
Q + + + ++ E+ ++E E+EEEEEE +++ + +K+
Sbjct: 48 QEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKK 97
Score = 26.7 bits (59), Expect = 5.6
Identities = 15/51 (29%), Positives = 33/51 (64%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
+ +K++++E+EE EE EE E+ + EE+ E +++EEE +++ + +
Sbjct: 44 KDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDI 94
Score = 26.4 bits (58), Expect = 6.5
Identities = 13/47 (27%), Positives = 31/47 (65%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
+++EEEDEE E+ E+EE+ E++++E++ +E+ + +++
Sbjct: 52 EEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKN 98
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 27.9 bits (62), Expect = 2.9
Identities = 8/60 (13%), Positives = 26/60 (43%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
L+ + + + + KE+ E + ++ + + E + E + +EE +++ +
Sbjct: 222 LEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESEDEYEEEAGEEQENE 281
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 28.0 bits (62), Expect = 2.9
Identities = 12/37 (32%), Positives = 25/37 (67%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
+++EEE E GE+ + E ++ ++K+ E+++ EEE
Sbjct: 210 EEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEE 246
Score = 26.9 bits (59), Expect = 6.7
Identities = 14/45 (31%), Positives = 23/45 (51%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
++EEEDEE G E E +EE+++ + + E + E R
Sbjct: 188 REEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGER 232
>gnl|CDD|219951 pfam08644, SPT16, FACT complex subunit (SPT16/CDC68). Proteins in
this family are subunits the FACT complex. The FACT
complex plays a role in transcription initiation and
promotes binding of TATA-binding protein (TBP) to a TATA
box in chromatin.
Length = 152
Score = 27.6 bits (62), Expect = 2.9
Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 7/45 (15%)
Query: 39 KKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPA----LKPRPT 79
+K KK+E ++KE+ E EK L R +P + RPT
Sbjct: 91 QKRVKKRETERKEKADVVEQEK---LILNRNKRPPRLKDVYIRPT 132
>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
This family consists of several eukaryotic ribosome
biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
protein that is essential for the maturation of 25 S
rRNA and the 60 S ribosomal subunit assembly in
Saccharomyces cerevisiae.
Length = 164
Score = 27.6 bits (62), Expect = 3.0
Identities = 13/55 (23%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 9 VSKYKRRKKKKEEEDE---EGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
V ++ ++ K+ E + E ++ +E+ KK +KKE+ K E+++ + +K
Sbjct: 110 VPRWGYKRANKDLEKDWIIEVKPNDDPKEDPFAKKRREKKERVAKNEKRELKNKK 164
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this
family (Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 26.8 bits (60), Expect = 3.0
Identities = 11/24 (45%), Positives = 17/24 (70%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEE 53
E +EEE+ +++EE K+ K K EE
Sbjct: 2 ERKEEEKAQREEELKRLKNLKREE 25
>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein. This family consists of mammalian
MRVI1 proteins which are related to the
lymphoid-restricted membrane protein (JAW1) and the IP3
receptor associated cGMP kinase substrates A and B
(IRAGA and IRAGB). The function of MRVI1 is unknown
although mutations in the Mrvi1 gene induces myeloid
leukaemia by altering the expression of a gene important
for myeloid cell growth and/or differentiation so it has
been speculated that Mrvi1 is a tumour suppressor gene.
IRAG is very similar in sequence to MRVI1 and is an
essential NO/cGKI-dependent regulator of IP3-induced
calcium release. Activation of cGKI decreases
IP3-stimulated elevations in intracellular calcium,
induces smooth muscle relaxation and contributes to the
antiproliferative and pro-apoptotic effects of NO/cGMP.
Jaw1 is a member of a class of proteins with
COOH-terminal hydrophobic membrane anchors and is
structurally similar to proteins involved in vesicle
targeting and fusion. This suggests that the function
and/or the structure of the ER in lymphocytes may be
modified by lymphoid-restricted resident ER proteins.
Length = 538
Score = 28.1 bits (62), Expect = 3.2
Identities = 11/53 (20%), Positives = 26/53 (49%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
+S Y ++E++ ++ E E EE ++ K ++ E ++ + E+R
Sbjct: 406 ISTYSWADAEEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDKEQR 458
>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly
related to archaeal Holliday junction resolvase
[Nucleotide transport and metabolism].
Length = 175
Score = 27.4 bits (61), Expect = 3.2
Identities = 15/50 (30%), Positives = 23/50 (46%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
+ + + K E EE E E + + K++E K KE +KK E R
Sbjct: 28 QGKVESKARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARE 77
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 28.2 bits (63), Expect = 3.2
Identities = 16/44 (36%), Positives = 24/44 (54%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
E ED +EEEE+EE+++EE + E KE +E E+
Sbjct: 380 EIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDV 423
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 27.5 bits (61), Expect = 3.2
Identities = 15/60 (25%), Positives = 22/60 (36%), Gaps = 13/60 (21%)
Query: 11 KYKRRKKKKEEED----EEGGGGEEEEEEEEKKKEEKKK---------EKKKKEEEKKKE 57
K + K E + + E +K +EE K E+ +KE KKKE
Sbjct: 133 KKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESSKKKE 192
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 27.6 bits (62), Expect = 3.3
Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 4/47 (8%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKK----KKEEEKKKEGE 59
+ E+ EE +EEE+ + E+ + K+ K EK K GE
Sbjct: 37 ENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEMKALAEKSKFGE 83
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 28.1 bits (62), Expect = 3.3
Identities = 19/46 (41%), Positives = 26/46 (56%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTL 64
K E+EE EE EE EE KKE +K + +EEK K E+ + +
Sbjct: 239 KVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFI 284
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 26.4 bits (59), Expect = 3.4
Identities = 10/24 (41%), Positives = 18/24 (75%)
Query: 34 EEEEKKKEEKKKEKKKKEEEKKKE 57
++EE++K++KK KK + E KK+
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKK 91
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and
possibly also poly(A)+ RNA nuclear export and cell
cycle progression). This domain is specific to the N
terminus of the RNA splicing factor encoded by prp1. It
is involved in mRNA splicing and possibly also
poly(A)and RNA nuclear export and cell cycle
progression.
Length = 131
Score = 26.9 bits (60), Expect = 3.4
Identities = 13/47 (27%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
K ++EDEE E +E+ E +KK +++KE+E+ ++ +
Sbjct: 53 DGKYDDEDEEAD--RIYESIDERMDERRKKRREQKEKEEIEKYREEN 97
>gnl|CDD|216057 pfam00680, RdRP_1, RNA dependent RNA polymerase.
Length = 479
Score = 28.0 bits (63), Expect = 3.4
Identities = 15/62 (24%), Positives = 20/62 (32%), Gaps = 15/62 (24%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKK------------EKKKKEEEKKKEGEKRRT 63
KK+ DE G G + ++ E K E + E K E K R
Sbjct: 120 KKRDLFNDETGDGKLAKVLYSCLREYEGKYGPALVWITCLKDELRPLE---KVEAGKTRL 176
Query: 64 LT 65
T
Sbjct: 177 FT 178
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 27.7 bits (62), Expect = 3.4
Identities = 12/33 (36%), Positives = 22/33 (66%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
EE E +++EEK++ KK+ +E K++E E +
Sbjct: 179 EEAERRRREEKERRKKQDKERKQREKETAEKIA 211
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 28.0 bits (63), Expect = 3.5
Identities = 16/55 (29%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK-EGEKRR 62
+ +++K + +E EEE E+ EK+ +E ++E + E E K+ E E R
Sbjct: 71 PLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIER 125
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 27.9 bits (62), Expect = 3.5
Identities = 10/32 (31%), Positives = 23/32 (71%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
E+ E E+++++++ K K ++E + K+E K+R
Sbjct: 10 EKLEREQRERKQRAKLKLERERKAKEEAAKQR 41
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap
is involved in recombination. Rap (recombination adept
with plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 27.3 bits (61), Expect = 3.6
Identities = 13/41 (31%), Positives = 19/41 (46%), Gaps = 7/41 (17%)
Query: 38 KKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRP 78
K EK +EKK+K E + + E ++ K LK R
Sbjct: 33 ALKREKAQEKKRKAEAQAERRE-------LKARKEKLKTRS 66
>gnl|CDD|216792 pfam01927, DUF82, Protein of unknown function DUF82. This
prokaryotic protein family has no known function. The
protein contains four conserved cysteines that may be
involved in metal binding or disulphide bridges.
Length = 146
Score = 27.2 bits (61), Expect = 3.7
Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 11/61 (18%)
Query: 71 KPALKPRPTRCVEGEGAPTRTLSVQIPLHRVHEQGVPPVVYPPSIYNHQSTLYSGTVCLN 130
K +LKP +RC++ G PL V ++ V V PP +Y + C
Sbjct: 83 KLSLKPEFSRCLKCNG----------PLEPVSKEEVAGRV-PPYVYERYDEFWRCPKCGR 131
Query: 131 V 131
V
Sbjct: 132 V 132
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 27.9 bits (62), Expect = 3.7
Identities = 17/48 (35%), Positives = 26/48 (54%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
K+ KKK+ +E + EE E E+E K EE ++ EEE+ + E
Sbjct: 156 KKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSED 203
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 27.7 bits (62), Expect = 3.8
Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
Query: 5 STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
TT+ Y+ + +K + G +E E + EE+++ KKK EK +E EKR
Sbjct: 277 ETTENLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKF-TEKIREKEKR 332
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 27.7 bits (62), Expect = 3.8
Identities = 15/41 (36%), Positives = 26/41 (63%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
+RR++ E EEEEE E+++EE+++E++ EEE
Sbjct: 293 RRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 27.7 bits (61), Expect = 3.9
Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 3/42 (7%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
EE+EE E EE+EE +KE K +K EEE+++E +
Sbjct: 139 VLEEEEEV---EMEEDEEYYEKEPGKVVDEKSEEEEEEELKT 177
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 27.1 bits (60), Expect = 3.9
Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 1/51 (1%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKK-KEEKKKEKKKKEEEKKKEGEKRR 62
++ +KEEE E + +EEE+K EE +KEK+ E E++ E++
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQE 83
Score = 27.1 bits (60), Expect = 4.2
Identities = 16/56 (28%), Positives = 34/56 (60%)
Query: 7 TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
Q +Y+ +K+KE + E E++E+EE +EE+K+E+ + ++++ KR+
Sbjct: 56 EQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRK 111
>gnl|CDD|216381 pfam01237, Oxysterol_BP, Oxysterol-binding protein.
Length = 335
Score = 27.6 bits (62), Expect = 3.9
Identities = 16/34 (47%), Positives = 26/34 (76%), Gaps = 1/34 (2%)
Query: 29 GEEEEEEEEKKK-EEKKKEKKKKEEEKKKEGEKR 61
G+ +E EEEK + EEK++E++K+ EEK +E + R
Sbjct: 278 GDYDEAEEEKLRLEEKQRERRKEREEKGEEWKPR 311
>gnl|CDD|218660 pfam05620, DUF788, Protein of unknown function (DUF788). This
family consists of several eukaryotic proteins of
unknown function.
Length = 166
Score = 27.3 bits (61), Expect = 4.0
Identities = 6/25 (24%), Positives = 13/25 (52%)
Query: 26 GGGGEEEEEEEEKKKEEKKKEKKKK 50
E E + K++EK +++ +K
Sbjct: 142 PSSQGAETNETKSKRQEKLEKRGEK 166
Score = 26.9 bits (60), Expect = 4.5
Identities = 7/29 (24%), Positives = 13/29 (44%)
Query: 26 GGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
+ E + + K++EK +K EK
Sbjct: 138 FASLPSSQGAETNETKSKRQEKLEKRGEK 166
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 26.9 bits (60), Expect = 4.0
Identities = 8/28 (28%), Positives = 20/28 (71%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+EE++ ++E K++ ++ KE E++K+
Sbjct: 98 ARDEEKKLDEEEAKRQHEEAKEREREKK 125
Score = 26.1 bits (58), Expect = 6.3
Identities = 9/29 (31%), Positives = 20/29 (68%)
Query: 32 EEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+EE+K EE+ K + ++ +E+++E +K
Sbjct: 98 ARDEEKKLDEEEAKRQHEEAKEREREKKK 126
>gnl|CDD|199812 cd10488, MH1_R-SMAD, N-terminal Mad Homology 1 (MH1) domain of
receptor regulated SMADs. The MH1 is a small
DNA-binding domain present in SMAD (small mothers
against decapentaplegic) family of proteins, which are
signal transducers and transcriptional modulators that
mediate multiple signaling pathways. It binds to the
major groove in an unusual manner via a beta hairpin
structure. It negatively regulates the functions of the
MH2 domain, the C-terminal domain of SMAD. This MH1
domain is found in all receptor regulated SMADs
(R-SMADs) including SMAD1, SMAD2, SMAD3, SMAD5 and
SMAD9. SMAD1 plays an essential role in bone development
and postnatal bone formation through activation by bone
morphogenetic protein (BMP) type 1 receptor kinase.
SMAD2 regulates multiple cellular processes, such as
cell proliferation, apoptosis and differentiation, while
SMAD3 modulates signals of activin and TGF-beta. SMAD4,
a common mediator SMAD (co-SMAD) binds R-SMADs, forming
an oligomeric complex that binds to DNA and serves as a
transcription factor. SMAD5 is involved in bone
morphogenetic proteins (BMP) signal modulation, possibly
playing a role in the pathway involving inhibition of
hematopoietic progenitor cells by TGF-beta. SMAD9 (also
known as SMAD8) can mediate the differentiation of
mesenchymal stem cells (MSCs) into tendon-like cells by
inhibiting the osteogenic pathway.
Length = 123
Score = 26.8 bits (59), Expect = 4.1
Identities = 23/82 (28%), Positives = 36/82 (43%), Gaps = 14/82 (17%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRPTRCVEGEGAPT 89
E+ EEEK E+ K KK ++K + E + ++ + TRCV T
Sbjct: 12 GEQNGEEEKWAEKAVKSLVKKLKKKGQLEELEKAIS--------TQNVNTRCV------T 57
Query: 90 RTLSVQIPLHRVHEQGVPPVVY 111
S+ L H +G+P V+Y
Sbjct: 58 IPRSLDGRLQVSHRKGLPHVIY 79
>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
(CLPTM1). This family consists of several eukaryotic
cleft lip and palate transmembrane protein 1 sequences.
Cleft lip with or without cleft palate is a common birth
defect that is genetically complex. The nonsyndromic
forms have been studied genetically using linkage and
candidate-gene association studies with only partial
success in defining the loci responsible for orofacial
clefting. CLPTM1 encodes a transmembrane protein and has
strong homology to two Caenorhabditis elegans genes,
suggesting that CLPTM1 may belong to a new gene family.
This family also contains the human cisplatin resistance
related protein CRR9p which is associated with
CDDP-induced apoptosis.
Length = 437
Score = 27.6 bits (62), Expect = 4.2
Identities = 12/44 (27%), Positives = 19/44 (43%), Gaps = 7/44 (15%)
Query: 4 SSTTQVSKYKRRKKKKEE---EDEEGGGGEEEEEEEEKKKEEKK 44
++ Y +KK K++ + E+EE EEEK K
Sbjct: 139 HFVFPLTTYLPKKKVKKKNLLGGKS----EKEEPEEEKTPAPDK 178
>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain. This family
consists of several ATP synthase E chain sequences
which are components of the CF(0) subunit.
Length = 83
Score = 26.3 bits (58), Expect = 4.2
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKK 56
++EE+ E + +EK EK K KKK
Sbjct: 34 KKEEKIREYEAQEKLIEKAKAAYAKKK 60
Score = 25.5 bits (56), Expect = 6.0
Identities = 12/41 (29%), Positives = 20/41 (48%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKE 51
+R KK+EE +E+ E+ K KKK+ K++
Sbjct: 26 AKHQRSLKKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEA 66
Score = 25.1 bits (55), Expect = 10.0
Identities = 9/35 (25%), Positives = 20/35 (57%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEK 47
K++++K E + + E+ + KKK+ K++ K
Sbjct: 33 KKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAK 67
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 26.9 bits (60), Expect = 4.2
Identities = 12/48 (25%), Positives = 21/48 (43%)
Query: 12 YKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
K RK+ + +EE E+ E E+ + E+ + E + E E
Sbjct: 69 LKERKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEE 116
>gnl|CDD|240400 PTZ00396, PTZ00396, Casein kinase II subunit beta; Provisional.
Length = 251
Score = 27.3 bits (61), Expect = 4.2
Identities = 12/49 (24%), Positives = 22/49 (44%), Gaps = 4/49 (8%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
V K K K+ K + E G E+ KK K + +++E++ +
Sbjct: 206 VHKKKSSKEYKLQNGEFGKSAIEQFL----KKSPKGNIELQEQEQQAIQ 250
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 27.6 bits (62), Expect = 4.2
Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 5/61 (8%)
Query: 9 VSKYKRRKKKKEE----EDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE-KKKEGEKRRT 63
+ + +RR ++EE + E EE E++EK+ K+K +KEEE ++ E+R
Sbjct: 78 LQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREE 137
Query: 64 L 64
L
Sbjct: 138 L 138
>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK). This
family consists of several plant specific
phytosulfokine precursor proteins. Phytosulfokines, are
active as either a pentapeptide or a C-terminally
truncated tetrapeptide. These compounds were first
isolated because of their ability to stimulate cell
division in somatic embryo cultures of Asparagus
officinalis.
Length = 74
Score = 25.8 bits (57), Expect = 4.3
Identities = 11/38 (28%), Positives = 17/38 (44%), Gaps = 4/38 (10%)
Query: 32 EEEEEEKKKEEKKKEKKKKEEEKKKEGE----KRRTLT 65
E + + E K ++ + E + EGE RRTL
Sbjct: 24 HESSKIEAAESSKVDEDEVVSECEGEGEEECLMRRTLA 61
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 27.6 bits (62), Expect = 4.4
Identities = 8/33 (24%), Positives = 17/33 (51%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
E +E EK++++ KE+E + G+ +
Sbjct: 243 AERLRQEAAAYEKQQKELAKEQEWIRRGKAAAS 275
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 27.0 bits (60), Expect = 4.4
Identities = 15/60 (25%), Positives = 30/60 (50%), Gaps = 4/60 (6%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKK----KEEKKKEKKKKEEEKKK 56
L++ T ++ + K +K+ E++ + E E E+K + + K+KKE E K+
Sbjct: 101 LKDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKE 160
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 27.5 bits (61), Expect = 4.4
Identities = 15/49 (30%), Positives = 31/49 (63%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+ K K + +++EE EE E+EEEE+ + +K++E+++ + K K+
Sbjct: 133 IQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQ 181
Score = 27.5 bits (61), Expect = 4.7
Identities = 12/53 (22%), Positives = 34/53 (64%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
++ Y++ K ++++E E+EE EE + E++++E+++ +K++E ++
Sbjct: 121 KIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQ 173
>gnl|CDD|216828 pfam01984, dsDNA_bind, Double-stranded DNA-binding domain. This
domain is believed to bind double-stranded DNA of 20
bases length.
Length = 105
Score = 26.4 bits (59), Expect = 4.4
Identities = 11/33 (33%), Positives = 25/33 (75%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKE 46
RR + E + ++GG E++++EE++++EE +K+
Sbjct: 2 RRARLAELQQQQGGLEEQQQQEEQQQQEEAQKQ 34
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 27.4 bits (61), Expect = 4.6
Identities = 15/48 (31%), Positives = 23/48 (47%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
R+KKK E E ++ + +K E+ KKEEE K+ E+
Sbjct: 851 RQKKKLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFAEEE 898
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 27.1 bits (61), Expect = 4.6
Identities = 11/34 (32%), Positives = 22/34 (64%)
Query: 19 KEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEE 52
++ E EEEE EEE+++EE+++ ++ + E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 26.3 bits (59), Expect = 8.8
Identities = 12/34 (35%), Positives = 21/34 (61%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKE 51
++ E EEE EEEE+++EE++ E+ + E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
>gnl|CDD|193562 cd08014, M20_Acy1_like4, M20 Peptidase Aminoacylase 1 subfamily.
Peptidase M20 family, Uncharacterized subfamily of
uncharacterized bacterial proteins predicted as putative
amidohydrolases. These are a class of zinc binding
homodimeric enzymes involved in hydrolysis of
N-acetylated proteins. N-terminal acetylation of
proteins is a widespread and highly conserved process
that is involved in protection and stability of
proteins. Several types of aminoacylases can be
distinguished on the basis of substrate specificity.
Aminoacylase 1 (ACY1) breaks down cytosolic aliphatic
N-acyl-alpha-amino acids (except L-aspartate),
especially N-acetyl-methionine and acetyl-glutamate into
L-amino acids and an acyl group. However, ACY1 can also
catalyze the reverse reaction, the synthesis of
acetylated amino acids. ACY1 may also play a role in
xenobiotic bioactivation as well as the inter-organ
processing of amino acid-conjugated xenobiotic
derivatives (S-substituted-N-acetyl-L-cysteine).
Length = 372
Score = 27.2 bits (61), Expect = 4.6
Identities = 10/39 (25%), Positives = 17/39 (43%), Gaps = 5/39 (12%)
Query: 72 PALKPRPTRCVEGEGAPTRTLSVQIPLHRVHEQGVPPVV 110
+ EG AP ++ + +++GVPPVV
Sbjct: 255 ETAEELVREIAEGVAAPYG---ARVEV--DYQRGVPPVV 288
>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
Length = 613
Score = 27.4 bits (62), Expect = 4.7
Identities = 13/32 (40%), Positives = 23/32 (71%)
Query: 29 GEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
G+E+EEE+E+K+E +++ K E K+ G+K
Sbjct: 477 GKEDEEEKEEKEEAEEEFKPLLERLKEALGDK 508
>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
peptidyl-prolyl isomerase. This family is only found at
the amino terminus of pfam00254. This domain is of
unknown function.
Length = 124
Score = 26.4 bits (59), Expect = 4.9
Identities = 8/25 (32%), Positives = 16/25 (64%)
Query: 36 EEKKKEEKKKEKKKKEEEKKKEGEK 60
++K + +++ + +KK EE K GE
Sbjct: 78 QKKLQAKQQAKAEKKAEENKAAGEA 102
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 27.1 bits (60), Expect = 4.9
Identities = 15/56 (26%), Positives = 25/56 (44%)
Query: 5 STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
S K K +KK+ + +E+EE E EE ++ +E +KK + E
Sbjct: 47 SKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYEL 102
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
This presumed domain is found at the N-terminus of
Ribosomal L30 proteins and has been termed RL30NT or
NUC018.
Length = 71
Score = 25.7 bits (57), Expect = 5.0
Identities = 11/31 (35%), Positives = 18/31 (58%)
Query: 38 KKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
K+ E+ + ++ KK KK +K+R L F R
Sbjct: 7 KRNEKLRAKRAKKRAAKKAARKKKRKLIFKR 37
>gnl|CDD|217206 pfam02731, SKIP_SNW, SKIP/SNW domain. This domain is found in
chromatin proteins.
Length = 158
Score = 26.9 bits (60), Expect = 5.0
Identities = 10/33 (30%), Positives = 22/33 (66%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
EE + E +++ +KEK++KEE+ ++ ++ R
Sbjct: 123 EEVRQRAELQRQLAEKEKQEKEEKLRELAQRAR 155
>gnl|CDD|150747 pfam10107, Endonuc_Holl, Endonuclease related to archaeal
Holliday junction resolvase. This domain is found in
various predicted bacterial endonucleases which are
distantly related to archaeal Holliday junction
resolvases.
Length = 153
Score = 26.9 bits (60), Expect = 5.0
Identities = 13/47 (27%), Positives = 26/47 (55%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
++ K + + +K+ E E E E + ++ E K +E +KKEE++
Sbjct: 12 KIIKLQGKVEKRARELFEQWRNAEREAQAMREAEAKFEEWQKKEEKE 58
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 27.3 bits (60), Expect = 5.0
Identities = 4/42 (9%), Positives = 11/42 (26%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+ + EE + ++ + E +K
Sbjct: 382 QAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPI 423
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 26.9 bits (59), Expect = 5.1
Identities = 15/49 (30%), Positives = 32/49 (65%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
+ K K K+E+ + +E + EEK+ +E+++E++ +EE K++EG+
Sbjct: 191 QVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQ 239
Score = 26.5 bits (58), Expect = 7.5
Identities = 12/45 (26%), Positives = 28/45 (62%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
+ K + +E + +E +E K EEK+ +++++EEE ++E ++
Sbjct: 191 QVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQ 235
>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
topoisomerase IV), A subunit [DNA replication,
recombination, and repair].
Length = 804
Score = 27.2 bits (61), Expect = 5.2
Identities = 19/50 (38%), Positives = 26/50 (52%), Gaps = 9/50 (18%)
Query: 23 DEEGGGGEEEEEEEEKK-------KEEKKKEKKKKE--EEKKKEGEKRRT 63
+EE E +E E+E EE+ + KKE E KKK G++RRT
Sbjct: 435 EEEKIEKELKELEKEIADLEKILASEERLLDIIKKELLEIKKKFGDERRT 484
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 27.5 bits (62), Expect = 5.2
Identities = 16/62 (25%), Positives = 34/62 (54%), Gaps = 7/62 (11%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKK-------KEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
K+ ++ + + E E +K K+EKKK+K+K+++E+ K G++ + L+ +
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQ 650
Query: 69 IG 70
G
Sbjct: 651 KG 652
Score = 27.5 bits (62), Expect = 5.2
Identities = 15/42 (35%), Positives = 27/42 (64%), Gaps = 3/42 (7%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+++ E++ EE E +E EK KEE +++K+K +EE+ K
Sbjct: 529 ERELEQKAEEA---EALLKEAEKLKEELEEKKEKLQEEEDKL 567
Score = 27.1 bits (61), Expect = 6.1
Identities = 19/64 (29%), Positives = 36/64 (56%), Gaps = 6/64 (9%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK--RRTLTFIRIGKPA 73
K K+E E+++ E+ +EEE+K EE +KE ++ +E KKE ++ + + G +
Sbjct: 548 KLKEELEEKK----EKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYAS 603
Query: 74 LKPR 77
+K
Sbjct: 604 VKAH 607
>gnl|CDD|233573 TIGR01781, Trep_dent_lipo, Treponema denticola clustered
lipoprotein. This model represents a family of six
predicted lipoproteins from a region of about 20
tandemly arranged genes in the Treponema denticola
genome. Two other neighboring genes share the
lipoprotein signal peptide region but do not show more
extensive homology. The function of this locus is
unknown.
Length = 412
Score = 27.2 bits (60), Expect = 5.3
Identities = 10/50 (20%), Positives = 16/50 (32%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEG 58
K +K+ E + + E+ K+ K K E K G
Sbjct: 36 EKFSKMEEKENEFLSKPEYNTHVKSPEQIKELSNKISGVSGKGESDKFVG 85
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 27.1 bits (60), Expect = 5.4
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKE 41
EEE+ E G +EEEE EE+ +E
Sbjct: 80 EEEEVEEGEEDEEEEGEEESEE 101
>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
Length = 470
Score = 27.1 bits (61), Expect = 5.5
Identities = 11/39 (28%), Positives = 21/39 (53%)
Query: 21 EEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
EE E G +EE EE +++ ++ E + +E+K +
Sbjct: 389 EELLEIEGFDEETVEELRERAKEALETEALAQEEKLADD 427
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 26.9 bits (60), Expect = 5.5
Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 4/52 (7%)
Query: 14 RRKKKKEEEDEEGGGGEEEEE----EEEKKKEEKKKEKKKKEEEKKKEGEKR 61
R KKKE + +E ++++ EEEKKK++KK KKKK ++ G K
Sbjct: 154 ERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 27.2 bits (60), Expect = 5.5
Identities = 20/57 (35%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKK-KEEEKKKEGEKRRTLTFIR 68
K+ + +E + E + EEE ++K EE+K+E K EEE KKE E ++ T +
Sbjct: 44 KKDRITIQEREREAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLE 100
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 27.0 bits (59), Expect = 5.5
Identities = 15/49 (30%), Positives = 28/49 (57%)
Query: 14 RRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
++KK E+ +E EE E++ E +++E+K E E ++E E+ R
Sbjct: 116 KKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQERER 164
Score = 26.2 bits (57), Expect = 9.1
Identities = 15/61 (24%), Positives = 36/61 (59%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
L++ + K K ++E+ ++ EE EE E ++ E+K ++++ +EE+++E E+
Sbjct: 107 LRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREE 166
Query: 61 R 61
+
Sbjct: 167 Q 167
>gnl|CDD|236601 PRK09637, PRK09637, RNA polymerase sigma factor SigZ;
Provisional.
Length = 181
Score = 26.9 bits (60), Expect = 5.5
Identities = 13/27 (48%), Positives = 16/27 (59%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKE 41
RKK + EE + E+EE EE KKE
Sbjct: 69 RKKNRSEELPDDLLFEDEEREENAKKE 95
>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Exonuclease-1 (EXO1) is involved
in multiple, eukaryotic DNA metabolic pathways,
including DNA replication processes (5' flap DNA
endonuclease activity and double stranded DNA
5'-exonuclease activity), DNA repair processes (DNA
mismatch repair (MMR) and post-replication repair
(PRR)), recombination, and telomere integrity. EXO1
functions in the MMS2 error-free branch of the PRR
pathway in the maintenance and repair of stalled
replication forks. Studies also suggest that EXO1 plays
both structural and catalytic roles during MMR-mediated
mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
family of structure-specific, 5' nucleases. These
nucleases contain a PIN (PilT N terminus) domain with a
helical arch/clamp region (I domain) of variable length
(approximately 43 residues in EXO1 PIN domains) and a
H3TH (helix-3-turn-helix) domain, an atypical
helix-hairpin-helix-2-like region. Both the H3TH domain
(not included here) and the helical arch/clamp region
are involved in DNA binding. Nucleases within this group
also have a carboxylate-rich active site that is
involved in binding essential divalent metal ion
cofactors (Mg2+/Mn2+). EXO1 nucleases also have
C-terminal Mlh1- and Msh2-binding domains which allow
interaction with MMR and PRR proteins, respectively.
Length = 210
Score = 26.7 bits (60), Expect = 5.8
Identities = 7/28 (25%), Positives = 17/28 (60%)
Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
+ E+++ E+++E +K E +EG +
Sbjct: 86 KGTEEERRERREENLEKALELLREGNRS 113
>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS). This family
consists of several mammalian selenoprotein S (SelS)
sequences. SelS is a plasma membrane protein and is
present in a variety of tissues and cell types. The
function of this family is unknown.
Length = 190
Score = 26.8 bits (59), Expect = 5.8
Identities = 8/25 (32%), Positives = 19/25 (76%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKE 57
+E+ + K + K+++K+ EEEK+++
Sbjct: 93 QEDLDAKAAKFKEKQKQLEEEKRRQ 117
>gnl|CDD|237200 PRK12778, PRK12778, putative bifunctional 2-polyprenylphenol
hydroxylase/glutamate synthase subunit beta;
Provisional.
Length = 752
Score = 27.4 bits (61), Expect = 5.8
Identities = 12/58 (20%), Positives = 19/58 (32%), Gaps = 4/58 (6%)
Query: 13 KRRKKKKEEEDEE----GGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
KR K E EE E K + K++ ++ + + E R F
Sbjct: 255 KRMGAYKTIEGEELLKLEERTAAWRAELRKSMKPKERTAIERVPMPELDPEYRAHNRF 312
>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
N-terminus. This is an N-terminal domain of DNA
polymerase III polC subunit A that is found only in
Firmicutes. DNA polymerase polC-type III enzyme
functions as the 'replicase' in low G + C Gram-positive
bacteria. Purine asymmetry is a characteristic of
organisms with a heterodimeric DNA polymerase III
alpha-subunit constituted by polC which probably plays a
direct role in the maintenance of strand-biased gene
distribution; since, among prokaryotic genomes, the
distribution of genes on the leading and lagging strands
of the replication fork is known to be biased. The
domain is associated with DNA_pol3_alpha pfam07733.
Length = 180
Score = 26.5 bits (59), Expect = 5.8
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKR 61
EE++EEEE K E+ E KK+E +KK+ EK
Sbjct: 149 EEQKEEEEAKLAEEALEALKKKEAEKKKKEKE 180
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 27.1 bits (61), Expect = 5.9
Identities = 11/32 (34%), Positives = 23/32 (71%)
Query: 29 GEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
GE++++++E ++ +K EK K +E K++ EK
Sbjct: 243 GEDDDDKDEVEELREKIEKLKLPKEAKEKAEK 274
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria. Proteins
in this family are approximately 90 amino acids in
length. There are two completely conserved L residues
that may be functionally important.
Length = 70
Score = 25.4 bits (56), Expect = 5.9
Identities = 11/26 (42%), Positives = 20/26 (76%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKK 56
+ EEE+ +++EE +K +++KE EK K
Sbjct: 30 KAEEEKREEEEEARKREERKEREKNK 55
Score = 25.4 bits (56), Expect = 6.0
Identities = 10/25 (40%), Positives = 20/25 (80%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEK 54
EEE+ EE+++ K++E+K++E+ K
Sbjct: 31 AEEEKREEEEEARKREERKEREKNK 55
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 27.1 bits (61), Expect = 5.9
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKK 56
EE+ E E+ E +E+++K+EEKK
Sbjct: 64 EEQRERLEELAPELLEEEEEKKEEKKG 90
Score = 27.1 bits (61), Expect = 6.7
Identities = 14/28 (50%), Positives = 15/28 (53%)
Query: 29 GEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
E EEEE+KKEEKK EK K
Sbjct: 74 APELLEEEEEKKEEKKGLPPLPNAEKGK 101
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 27.1 bits (60), Expect = 5.9
Identities = 13/33 (39%), Positives = 22/33 (66%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
+EE+ EE+K+E KK K+KK+ + +K +T
Sbjct: 191 KEEKAEERKQESKKGAKRKKDASGDDKSKKAKT 223
>gnl|CDD|237214 PRK12813, flgD, flagellar basal body rod modification protein;
Reviewed.
Length = 223
Score = 26.6 bits (59), Expect = 6.1
Identities = 12/34 (35%), Positives = 18/34 (52%), Gaps = 3/34 (8%)
Query: 142 LSNIFETFLPQL---LTYPNPTDPLNGDAAAMYL 172
L++ FETFL L L +P +P++ A L
Sbjct: 28 LTSDFETFLRMLTTQLQNQDPLNPMDSSDYAAQL 61
>gnl|CDD|233194 TIGR00930, 2a30, K-Cl cotransporter. [Transport and binding
proteins, Other].
Length = 953
Score = 27.0 bits (60), Expect = 6.2
Identities = 7/40 (17%), Positives = 18/40 (45%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRI 69
+ E++ E K + K K + + + R++ +R+
Sbjct: 854 HKTEKDREAKDPKMTWTKPWKITDAELQSNVRKSYRQVRL 893
>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
Length = 763
Score = 27.2 bits (60), Expect = 6.3
Identities = 10/25 (40%), Positives = 14/25 (56%)
Query: 25 EGGGGEEEEEEEEKKKEEKKKEKKK 49
E E E E + K++KK +KKK
Sbjct: 234 EKPQAEAENEAGKSDKKDKKSKKKK 258
Score = 26.8 bits (59), Expect = 7.4
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 21 EEDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
E E + E E E K ++K K+ KKK
Sbjct: 228 EGGELIEKPQAEAENEAGKSDKKDKKSKKK 257
>gnl|CDD|163660 cd07417, MPP_PP5_C, PP5, C-terminal metallophosphatase domain.
Serine/threonine protein phosphatase-5 (PP5) is a member
of the PPP gene family of protein phosphatases that is
highly conserved among eukaryotes and widely expressed
in mammalian tissues. PP5 has a C-terminal phosphatase
domain and an extended N-terminal TPR (tetratricopeptide
repeat) domain containing three TPR motifs. The PPP
(phosphoprotein phosphatase) family, to which PP5
belongs, is one of two known protein phosphatase
families specific for serine and threonine. The PPP
family also includes: PP1, PP2A, PP2B (calcineurin),
PP4, PP6, PP7, Bsu1, RdgC, PrpE, PrpA/PrpB, and ApA4
hydrolase. The PPP catalytic domain is defined by three
conserved motifs (-GDXHG-, -GDXVDRG- and -GNHE-). The
PPP enzyme family is ancient with members found in all
eukaryotes, and in most bacterial and archeal genomes.
Dephosphorylation of phosphoserines and
phosphothreonines on target proteins plays a central
role in the regulation of many cellular processes. PPPs
belong to the metallophosphatase (MPP) superfamily.
MPPs are functionally diverse, but all share a conserved
domain with an active site consisting of two metal ions
(usually manganese, iron, or zinc) coordinated with
octahedral geometry by a cage of histidine, aspartate,
and asparagine residues. The MPP superfamily includes:
Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
debranching enzymes, YfcE-like phosphodiesterases,
purple acid phosphatases (PAPs), YbbF-like
UDP-2,3-diacylglucosamine hydrolases, and acid
sphingomyelinases (ASMases). The conserved domain is a
double beta-sheet sandwich with a di-metal active site
made up of residues located at the C-terminal side of
the sheets. This domain is thought to allow for
productive metal coordination.
Length = 316
Score = 26.8 bits (60), Expect = 6.3
Identities = 18/43 (41%), Positives = 21/43 (48%), Gaps = 11/43 (25%)
Query: 126 TVCLNVINQEWTPLYDLSNIFETFLPQLLTYPNPTDPL--NGD 166
TVC + Q YDL NIFE L P+ T+P NGD
Sbjct: 63 TVCGDTHGQ----FYDLLNIFE-----LNGLPSETNPYLFNGD 96
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 24.6 bits (54), Expect = 6.3
Identities = 12/29 (41%), Positives = 19/29 (65%), Gaps = 4/29 (13%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKK 48
EEE E +EEE E+E++ EE+ + K+
Sbjct: 22 EEEKRE----DEEENEDEEEGEEQSEVKR 46
Score = 24.6 bits (54), Expect = 6.4
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 18 KKEEEDEEGGGGEEEEEEEEKKK 40
+K E++EE EE EE+ E K+
Sbjct: 24 EKREDEEENEDEEEGEEQSEVKR 46
>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
peptidase. Corresponds to Merops family C37.
Norwalk-like viruses (NLVs), including the Southampton
virus, cause acute non-bacterial gastroenteritis in
humans. The NLV genome encodes three open reading frames
(ORFs). ORF1 encodes a polyprotein, which is processed
by the viral protease into six proteins.
Length = 535
Score = 27.1 bits (60), Expect = 6.5
Identities = 16/51 (31%), Positives = 20/51 (39%), Gaps = 5/51 (9%)
Query: 20 EEEDEEGGGGEEEEEEE-----EKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
E G EE++E + E KK K KK KK RR L+
Sbjct: 204 XEVTYYDGCESEEDDEGWYIEPQDATPEGKKGKNKKGRGKKHNAFSRRGLS 254
>gnl|CDD|219627 pfam07897, DUF1675, Protein of unknown function (DUF1675). The
members of this family are sequences derived from
hypothetical plant proteins of unknown function. One
member of this family is annotated as a putative
RNA-binding protein, but no evidence was found to
support this.
Length = 283
Score = 26.8 bits (59), Expect = 6.8
Identities = 9/30 (30%), Positives = 17/30 (56%)
Query: 34 EEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
E EE+ ++ K+ + ++ E K+K EK
Sbjct: 67 ETEEEWRKRKEMQSLRRLEAKRKRSEKEYN 96
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 26.8 bits (59), Expect = 6.9
Identities = 9/38 (23%), Positives = 17/38 (44%), Gaps = 2/38 (5%)
Query: 22 EDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
EE +EE E+E + E +E + E + + +
Sbjct: 343 VTEEST--DEESEDEVEIDESVIEEVAEMELLEVQVDD 378
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved
in cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 27.0 bits (60), Expect = 7.0
Identities = 13/44 (29%), Positives = 22/44 (50%)
Query: 17 KKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEK 60
K+ + + +EE E K+K E+ +EK K EK++ K
Sbjct: 31 KESRDAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSK 74
>gnl|CDD|215064 PLN00122, PLN00122, serine/threonine protein phosphatase 2A;
Provisional.
Length = 170
Score = 26.6 bits (59), Expect = 7.0
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEE 53
EE+E + K+ EEK++ K+ EE
Sbjct: 139 EEDEAKAKEVEEKREATWKRLEE 161
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 26.5 bits (59), Expect = 7.2
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKK 48
KKKK +++EE +++ E K E +KK K
Sbjct: 123 KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHK 155
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 26.2 bits (58), Expect = 7.2
Identities = 11/44 (25%), Positives = 20/44 (45%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGE 59
E EE +E + ++EE +K +K+ ++ KK E
Sbjct: 84 VNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPV 127
Score = 25.8 bits (57), Expect = 9.6
Identities = 12/50 (24%), Positives = 26/50 (52%)
Query: 8 QVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
+V+ + +EE+ E + ++EE +K +K+ +K KK E + +
Sbjct: 81 RVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQGQ 130
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease
M16C (pfam05193).
Length = 248
Score = 26.4 bits (59), Expect = 7.2
Identities = 13/31 (41%), Positives = 20/31 (64%), Gaps = 3/31 (9%)
Query: 29 GEEEEEEEEKKKEEKKKEKKKK---EEEKKK 56
+E EE+ ++E +K EKKK EE+K+K
Sbjct: 3 PDETLEEKLNEEEREKLEKKKSSLSEEDKEK 33
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 26.0 bits (57), Expect = 7.3
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKE 57
E E+ KKK++K K+ KK ++ KK+
Sbjct: 94 PEPTEKPKKKKKKSKKTKKPKKSSKKD 120
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 26.5 bits (59), Expect = 7.4
Identities = 10/34 (29%), Positives = 19/34 (55%)
Query: 30 EEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRT 63
E + E + E++ +K+KEEE+ E ++R
Sbjct: 208 AERAKAEAAEAEQELLREKQKEEEQMMEAQERSY 241
Score = 26.5 bits (59), Expect = 7.6
Identities = 14/50 (28%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKK--EEEKKKEGEK 60
K + + E E E+E E++K+EE+ E +++ +E K+ EK
Sbjct: 202 KEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEK 251
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 26.7 bits (59), Expect = 7.4
Identities = 10/34 (29%), Positives = 17/34 (50%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
ED E G E E +E ++ E + + + +E E
Sbjct: 243 NNEDSEAGREESEGSDESEEDEAEATDGEGEEGE 276
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 27.0 bits (60), Expect = 7.4
Identities = 16/44 (36%), Positives = 25/44 (56%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
+ K++ E +EE EE EK+ EE ++E ++ EEE KK
Sbjct: 392 QELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKK 435
>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
Members of this family of small ribosomal proteins
possess one of three conserved blocks of sequence found
in proteins that stimulate the dissociation of guanine
nucleotides from G-proteins, leaving open the
possibility that MRP-S27 might be a functional partner
of GTP-binding ribosomal proteins.
Length = 417
Score = 26.7 bits (59), Expect = 7.4
Identities = 10/51 (19%), Positives = 24/51 (47%)
Query: 15 RKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
+++ ++E E+ ++ + K ++KKK E+ E ++R T
Sbjct: 364 VEERLPTIEKEDLELYEQRQQLWFFENRKLWQRKKKLREQADEEYQQRHAT 414
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 26.7 bits (59), Expect = 7.6
Identities = 16/59 (27%), Positives = 27/59 (45%), Gaps = 3/59 (5%)
Query: 7 TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLT 65
T + + + K+K E EEEEEEEE EE++ + + E+ G + +
Sbjct: 381 TTIDELREFLKEKGHPVVERWAEEEEEEEEE---EEEEAAEAEAPMEEPVPGFEVPEMP 436
>gnl|CDD|201249 pfam00472, RF-1, RF-1 domain. This domain is found in peptide
chain release factors such as RF-1 and RF-2, and a
number of smaller proteins of unknown function. This
domain contains the peptidyl-tRNA hydrolase activity.
The domain contains a highly conserved motif GGQ, where
the glutamine is thought to coordinate the water that
mediates the hydrolysis.
Length = 114
Score = 25.6 bits (57), Expect = 7.8
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 36 EEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIR 68
E +KK EK K + + + EK RT F +
Sbjct: 75 ELQKKREKTKPTRASQVRRGDRKEKIRTYNFPQ 107
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 26.9 bits (60), Expect = 7.9
Identities = 10/42 (23%), Positives = 21/42 (50%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEE 52
K + K+ + +E +++E+ K++KK +K EE
Sbjct: 883 KLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTDEE 924
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 26.9 bits (59), Expect = 7.9
Identities = 13/43 (30%), Positives = 23/43 (53%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
E E +E E++EEKK EE++ ++K+ E+ +K
Sbjct: 144 SESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFD 186
>gnl|CDD|235202 PRK04023, PRK04023, DNA polymerase II large subunit; Validated.
Length = 1121
Score = 26.8 bits (60), Expect = 7.9
Identities = 15/41 (36%), Positives = 24/41 (58%), Gaps = 4/41 (9%)
Query: 11 KYKRRKKKKEEED----EEGGGGEEEEEEEEKKKEEKKKEK 47
K + K + + EE G++E+EEEE+KK+ K K+K
Sbjct: 255 KILKYVDKLKIDGWEWLEELISGKKEKEEEEEKKKIKPKDK 295
>gnl|CDD|149180 pfam07960, CBP4, CBP4. The CBP4 in S. cerevisiae is essential for
the expression and activity of ubiquinol-cytochrome c
reductase. This family appears to be fungal specific.
Length = 128
Score = 25.8 bits (57), Expect = 7.9
Identities = 12/41 (29%), Positives = 25/41 (60%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
KK+ E ++ E ++EE E+ +EE ++ + + EE +K+
Sbjct: 88 KKRNVREFQKTKAEEAQKEELERIREELEEARAQSEEMRKE 128
>gnl|CDD|227199 COG4862, MecA, Negative regulator of genetic competence,
sporulation and motility [Posttranslational
modification, protein turnover, chaperones / Signal
transduction mechanisms / Cell motility and secretion].
Length = 224
Score = 26.6 bits (59), Expect = 7.9
Identities = 15/59 (25%), Positives = 31/59 (52%)
Query: 9 VSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFI 67
V+K K + ++ DE +E+ EE ++ E+K + +KEE+ + ++ L +I
Sbjct: 75 VTKSKIEGEGPDDLDEFLDMPDEDGAEEGLEQLEEKFDDVEKEEQGDEAEKEPDYLYYI 133
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 25.9 bits (57), Expect = 8.0
Identities = 15/47 (31%), Positives = 22/47 (46%), Gaps = 10/47 (21%)
Query: 13 KRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKK--KEEEKKKE 57
K KK+EE +EEE E K+EE E + K++ K+
Sbjct: 92 KLTSKKEEE--------KEEEIPEPTKEEELLGEIRDLLKQQNSSKD 130
>gnl|CDD|181433 PRK08455, fliL, flagellar basal body-associated protein FliL;
Reviewed.
Length = 182
Score = 26.2 bits (58), Expect = 8.0
Identities = 10/36 (27%), Positives = 17/36 (47%)
Query: 27 GGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
G EEE+E +K +E + ++ KEG +
Sbjct: 43 GSKEEEKENAQKPTQEVQANPPANSAQEAKEGSNIQ 78
>gnl|CDD|129262 TIGR00158, L9, ribosomal protein L9. Ribosomal protein L9
appears to be universal in, but restricted to,
eubacteria and chloroplast [Protein synthesis,
Ribosomal proteins: synthesis and modification].
Length = 148
Score = 26.2 bits (58), Expect = 8.1
Identities = 11/32 (34%), Positives = 15/32 (46%)
Query: 35 EEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
E +KK E+K K + KE + TLT
Sbjct: 48 EARRKKLEEKLAANKAAAARLKEVLELGTLTI 79
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 26.5 bits (59), Expect = 8.3
Identities = 23/56 (41%), Positives = 36/56 (64%), Gaps = 2/56 (3%)
Query: 11 KYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEE--KKKEKKKKEEEKKKEGEKRRTL 64
K K++KK K+E E E+++ E +KK+EE KK++ KK ++KKK +K R L
Sbjct: 151 KEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206
Score = 26.1 bits (58), Expect = 9.7
Identities = 9/32 (28%), Positives = 21/32 (65%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
E E+ ++K E++K +KK++ ++ K+ E +
Sbjct: 5 EALLEQRRRKREQRKARKKQKRKEAKKKEDAQ 36
>gnl|CDD|236987 PRK11798, PRK11798, ClpXP protease specificity-enhancing factor;
Provisional.
Length = 138
Score = 25.9 bits (58), Expect = 8.3
Identities = 4/29 (13%), Positives = 11/29 (37%)
Query: 22 EDEEGGGGEEEEEEEEKKKEEKKKEKKKK 50
E E +E +++ + E + +
Sbjct: 103 EPEAAYDEDEHDDDTDPDDEPPPPPRGGR 131
>gnl|CDD|132187 TIGR03143, AhpF_homolog, putative alkyl hydroperoxide reductase F
subunit. This family of thioredoxin reductase homologs
is found adjacent to alkylhydroperoxide reductase C
subunit predominantly in cases where there is only one C
subunit in the genome and that genome is lacking the F
subunit partner (also a thioredcxin reductase homolog)
that is usually found (TIGR03140).
Length = 555
Score = 26.7 bits (59), Expect = 8.4
Identities = 15/51 (29%), Positives = 20/51 (39%), Gaps = 6/51 (11%)
Query: 7 TQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKE 57
T +Y + K E+ G EE EEEE K+ E + KK
Sbjct: 302 TSAERYVKELK------EKLGIAEEYEEEEAKEASEASAAETTPAATTKKG 346
>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and
L12p. Ribosomal proteins P1 and P2 are the eukaryotic
proteins that are functionally equivalent to bacterial
L7/L12. L12p is the archaeal homolog. Unlike other
ribosomal proteins, the archaeal L12p and eukaryotic P1
and P2 do not share sequence similarity with their
bacterial counterparts. They are part of the ribosomal
stalk (called the L7/L12 stalk in bacteria), along with
28S rRNA and the proteins L11 and P0 in eukaryotes (23S
rRNA, L11, and L10e in archaea). In bacterial
ribosomes, L7/L12 homodimers bind the extended
C-terminal helix of L10 to anchor the L7/L12 molecules
to the ribosome. Eukaryotic P1/P2 heterodimers and
archaeal L12p homodimers are believed to bind the L10
equivalent proteins, eukaryotic P0 and archaeal L10e,
in a similar fashion. P1 and P2 (L12p, L7/L12) are the
only proteins in the ribosome to occur as multimers,
always appearing as sets of dimers. Recent data
indicate that most archaeal species contain six copies
of L12p (three homodimers), while eukaryotes have two
copies each of P1 and P2 (two heterodimers). Bacteria
may have four or six copies (two or three homodimers),
depending on the species. As in bacteria, the stalk is
crucial for binding of initiation, elongation, and
release factors in eukaryotes and archaea.
Length = 105
Score = 25.7 bits (56), Expect = 8.6
Identities = 8/25 (32%), Positives = 15/25 (60%)
Query: 29 GEEEEEEEEKKKEEKKKEKKKKEEE 53
E +K EE K+E++++E+E
Sbjct: 74 TAAATAEPAEKAEEAKEEEEEEEDE 98
>gnl|CDD|227513 COG5186, PAP1, Poly(A) polymerase [RNA processing and
modification].
Length = 552
Score = 26.8 bits (59), Expect = 8.6
Identities = 16/47 (34%), Positives = 19/47 (40%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
EE E+ G G E E EK EEE ++G K TF
Sbjct: 417 EESIEKIGSGVTLEVAYESTDHEKLANDTVPEEESMEDGMKVYCTTF 463
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 26.5 bits (59), Expect = 8.9
Identities = 8/28 (28%), Positives = 16/28 (57%)
Query: 35 EEEKKKEEKKKEKKKKEEEKKKEGEKRR 62
EE K +E+ K + + E++++ K R
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRRKPR 610
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 26.6 bits (59), Expect = 9.1
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKKEG 58
EE+ + KE+ KK E+K KE
Sbjct: 525 EEQGTNFGKRNSKERYKKNEDKIKEF 550
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 26.6 bits (59), Expect = 9.2
Identities = 14/53 (26%), Positives = 22/53 (41%)
Query: 1 LQNSSTTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEE 53
L+ + V + ++ E + E +EE KKE KK KK E +
Sbjct: 452 LEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVD 504
Score = 26.6 bits (59), Expect = 9.4
Identities = 9/39 (23%), Positives = 18/39 (46%)
Query: 16 KKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEK 54
EE E+ E+ E +++ +++ +K KK E
Sbjct: 465 LSILEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEV 503
>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
and repair].
Length = 1187
Score = 26.6 bits (59), Expect = 9.4
Identities = 12/61 (19%), Positives = 28/61 (45%)
Query: 20 EEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKPRPT 79
EE E + + ++ ++ +++ E +++ +KRR L + + KPR +
Sbjct: 232 SEEVIESAYELIKLRRKLYGEKRAERVREELREVEREREKKRRKLGILVVSSATGKPRGS 291
Query: 80 R 80
R
Sbjct: 292 R 292
>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118). This
is the highly conserved family of the major tail
subunit protein.
Length = 121
Score = 25.8 bits (56), Expect = 9.4
Identities = 10/26 (38%), Positives = 18/26 (69%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKK 56
E +EE +K + E++K K+ +EE+ K
Sbjct: 47 EMKEERDKMETEREKRDKESKEERDK 72
>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 145
Score = 25.9 bits (57), Expect = 9.4
Identities = 8/24 (33%), Positives = 12/24 (50%)
Query: 33 EEEEEKKKEEKKKEKKKKEEEKKK 56
E+ E + KK+EE +KK
Sbjct: 117 SSNEQNSTEALQLRAKKREEHRKK 140
>gnl|CDD|153327 cd07643, I-BAR_IMD_MIM, Inverse (I)-BAR, also known as the
IRSp53/MIM homology Domain (IMD), of Missing In
Metastasis. The IMD domain, also called
Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a
dimerization and lipid-binding module that bends
membranes and induces membrane protrusions. Members of
this subfamily include missing in metastasis (MIM) or
metastasis suppressor 1 (MTSS1), metastasis suppressor
1-like (MTSSL) or ABBA (Actin-Bundling protein with
BAIAP2 homology), and similar proteins. They contain an
N-terminal IMD and a WASP homology 2 (WH2) actin-binding
motif at the C-terminus. MIM was originally identified
as a missing transcript from metastatic bladder and
prostate cancer cells. It is a scaffold protein that
functions in a signaling pathway between the PDGF
receptor, Src kinases, and actin assembly. It may also
function as a cofactor of the Sonic hedgehog (Shh)
transcriptional pathway and may participate in tumor
development and progression via this pathway. ABBA
regulates actin and plasma membrane dynamics to promote
the extension of radial glia, which is important in
neuronal migration, axon guidance and neurogenesis. The
IMD domain of MIM binds and bundles actin filaments,
binds membranes, and interacts with the small GTPase
Rac.
Length = 231
Score = 26.3 bits (58), Expect = 9.5
Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 6/52 (11%)
Query: 31 EEEEEEEKK------KEEKKKEKKKKEEEKKKEGEKRRTLTFIRIGKPALKP 76
+E+ EE KK K+ K+ KK ++E KKK + R R GK L+P
Sbjct: 103 QEKIEEWKKVANQLDKDHAKEYKKARQEIKKKSSDTIRLQKKARKGKGDLQP 154
>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved
region at the N-terminal end of a family of proteins
referred to as CIRs (CBF1-interacting co-repressors).
CBF1 (centromere-binding factor 1) acts as a
transcription factor that causes repression by binding
specifically to GTGGGAA motifs in responsive promoters,
and it requires CIR as a co-repressor. CIR binds to
histone deacetylase and to SAP30 and serves as a linker
between CBF1 and the histone deacetylase complex.
Length = 37
Score = 24.1 bits (53), Expect = 9.8
Identities = 10/22 (45%), Positives = 16/22 (72%)
Query: 39 KKEEKKKEKKKKEEEKKKEGEK 60
+ E+K E++KK EE +KE E+
Sbjct: 15 EAEQKALEEQKKIEELRKEIEE 36
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 26.0 bits (57), Expect = 9.8
Identities = 9/31 (29%), Positives = 20/31 (64%)
Query: 36 EEKKKEEKKKEKKKKEEEKKKEGEKRRTLTF 66
EE+K+++K K++KE +KK+ ++ +
Sbjct: 94 EEEKEKKKNLNKEEKEAKKKERAKREYPFNY 124
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 26.6 bits (59), Expect = 9.8
Identities = 10/52 (19%), Positives = 21/52 (40%)
Query: 5 STTQVSKYKRRKKKKEEEDEEGGGGEEEEEEEEKKKEEKKKEKKKKEEEKKK 56
KRR+ K+ E E++ + K ++ ++ KK + +K
Sbjct: 745 GDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQK 796
>gnl|CDD|178450 PLN02859, PLN02859, glutamine-tRNA ligase.
Length = 788
Score = 26.6 bits (59), Expect = 9.9
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 31 EEEEEEEKKKEEKKKEKKKKEEEKKKE 57
E+ + +K +KKKEK K EEKK
Sbjct: 181 EKTAADNEKPVKKKKEKPAKVEEKKVA 207
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.307 0.130 0.369
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,430,812
Number of extensions: 910517
Number of successful extensions: 13647
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8373
Number of HSP's successfully gapped: 2076
Length of query: 180
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 89
Effective length of database: 6,901,388
Effective search space: 614223532
Effective search space used: 614223532
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 56 (25.2 bits)