RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9531
(409 letters)
>gnl|CDD|192352 pfam09732, CactinC_cactus, Cactus-binding C-terminus of cactin
protein. CactinC_cactus is the C-terminal 200 residues
of the cactin protein which are necessary for the
association of cactin with IkappaB-cactus as one of the
intracellular members of the Rel complex. The Rel
(NF-kappaB) pathway is conserved in invertebrates and
vertebrates. In mammals, it controls the activities of
the immune and inflammatory response genes as well as
viral genes, and is critical for cell growth and
survival. In Drosophila, the Rel pathway functions in
the innate cellular and humoral immune response, in
muscle development, and in the establishment of
dorsal-ventral polarity in the early embryo. Most
members of the family also have a Cactin_mid domain
pfam10312 further upstream.
Length = 125
Score = 226 bits (578), Expect = 5e-74
Identities = 82/105 (78%), Positives = 89/105 (84%)
Query: 285 WSDKYRPRKPRYFNRVHTGFEWNKYNQTHYDMDNPPPKIVQGYKFNIFYPDLIDKNSTPQ 344
WSDKYRPRKPRYFNRV TGF+WNKYNQTHYD DNPPPK VQGYKFNIFYPDLIDK P+
Sbjct: 1 WSDKYRPRKPRYFNRVKTGFDWNKYNQTHYDSDNPPPKTVQGYKFNIFYPDLIDKGKAPR 60
Query: 345 YFLNTCADNQDFAILRFHAGPPYEDIAFKIVNREWEYSYKRGFRC 389
Y + C D D +LRFHAGPPYEDIAF+IVN+EW+YS KRGFR
Sbjct: 61 YRIEPCEDPDDTCLLRFHAGPPYEDIAFRIVNKEWDYSRKRGFRS 105
>gnl|CDD|214996 smart01050, CactinC_cactus, Cactus-binding C-terminus of cactin
protein. CactinC_cactus is the C-terminal 200 residues
of the cactin protein which are necessary for the
association of cactin with IkappaB-cactus as one of the
intracellular members of the Rel complex. The Rel
(NF-kappaB) pathway is conserved in invertebrates and
vertebrates. In mammals, it controls the activities of
the immune and inflammatory response genes as well as
viral genes, and is critical for cell growth and
survival. In Drosophila, the Rel pathway functions in
the innate cellular and humoral immune response, in
muscle development, and in the establishment of
dorsal-ventral polarity in the early embryo. Most
members of the family also have a Cactin_mid domain
further upstream.
Length = 129
Score = 192 bits (491), Expect = 6e-61
Identities = 76/110 (69%), Positives = 87/110 (79%), Gaps = 4/110 (3%)
Query: 285 WSDKYRPRKPRYFNRVHTGFEWNKYNQTHYDMDNPPPKIVQGYKFNIFYPDLIDKNSTPQ 344
W DKYR RKPRYFNRV TG++WNKYNQTHYD DNPPPKIVQGYKFNIFYPDLIDK P+
Sbjct: 1 WEDKYRLRKPRYFNRVKTGYDWNKYNQTHYDNDNPPPKIVQGYKFNIFYPDLIDKGKAPR 60
Query: 345 YFLNTC----ADNQDFAILRFHAGPPYEDIAFKIVNREWEYSYKRGFRCH 390
Y + + + D +LRF AGPPYEDIAFKIVNREW+YS++ GF+
Sbjct: 61 YRIESSFEIEKPDDDTCLLRFSAGPPYEDIAFKIVNREWDYSHRGGFKSS 110
>gnl|CDD|220686 pfam10312, Cactin_mid, Conserved mid region of cactin. This is the
conserved middle region of a family of proteins referred
to as cactins. The region contains two of three
predicted coiled-coil domains. Most members of this
family have a CactinC_cactus pfam09732 domain at the
C-terminal end. Upstream of Mid_cactin in Drosophila
members are a serine-rich region, some non-typical RD
motifs and three predicted bipartite nuclear
localisation signals, none of which are well-conserved.
Cactin associates with IkappaB-cactus as one of the
intracellular members of the Rel (NF-kappaB) pathway
which is conserved in invertebrates and vertebrates. In
mammals, this pathway controls the activities of the
immune and inflammatory response genes as well as viral
genes, and is critical for cell growth and survival. In
Drosophila, the Rel pathway functions in the innate
cellular and humoral immune response, in muscle
development, and in the establishment of dorsal-ventral
polarity in the early embryo.
Length = 190
Score = 173 bits (441), Expect = 1e-52
Identities = 83/170 (48%), Positives = 112/170 (65%), Gaps = 8/170 (4%)
Query: 83 QRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYI---GSEAEVDAVE 139
+R +E F+EW+ +ED+FHL+QA+ R+ IRI+EGRAKPIDLLA+ + E +V+ E
Sbjct: 1 RRLREDEAFKEWEAKEDEFHLKQAKKRAEIRIKEGRAKPIDLLARNLRFIDPEKDVE-FE 59
Query: 140 MHEPYTYLTGLAIKDLEDLLEDIKVYMELEKG-ENEAYWNDISIIVEDELHGLRKLEKQG 198
+HEPYT L GL +LE+L EDIKVY+ELEK +N YWN + II EDE L KLEK+
Sbjct: 60 LHEPYTVLEGLTEDELEELEEDIKVYLELEKNPKNREYWNALLIICEDE---LEKLEKKE 116
Query: 199 HSDLTRREGIHESVAQDVASVFKGKTTAQLEALQTQIELKISSKTENIDN 248
R G+H SVA DV + GK+ +LE L+ QI+ K+ S +D
Sbjct: 117 AEGRADRSGLHSSVAADVKKILSGKSYEELEELEGQIKAKLRSGEAKVDT 166
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 46.1 bits (110), Expect = 2e-05
Identities = 15/34 (44%), Positives = 24/34 (70%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQ 66
KV A++ +KK+K+K+K K KK+ N+ KRR+
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRK 427
Score = 33.0 bits (76), Expect = 0.27
Identities = 11/32 (34%), Positives = 17/32 (53%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQER 68
K KK K+ +KK+K+ VKKR ++
Sbjct: 388 TGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDT 419
Score = 30.7 bits (70), Expect = 1.6
Identities = 8/36 (22%), Positives = 16/36 (44%)
Query: 21 VYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKN 56
V A+ + K + + KK+ + K K++K
Sbjct: 395 VLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSG 430
Score = 28.4 bits (64), Expect = 8.4
Identities = 9/35 (25%), Positives = 15/35 (42%)
Query: 38 KKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
K +KK K KK K K++ + + +R
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKR 415
Score = 28.4 bits (64), Expect = 8.5
Identities = 5/24 (20%), Positives = 12/24 (50%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKN 56
K + +K+ + K K++K +
Sbjct: 409 KPKVKKRHRDTKNIGKRRKPSGTS 432
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 38.7 bits (91), Expect = 0.002
Identities = 23/62 (37%), Positives = 39/62 (62%), Gaps = 1/62 (1%)
Query: 29 GTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEA 88
G K +A+K KK+K+K++K+ +K + +K+ +E +AE+AER+ E+ QR EA
Sbjct: 10 GLVDKKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNR-QRQAEA 68
Query: 89 AQ 90
Q
Sbjct: 69 EQ 70
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 36.9 bits (86), Expect = 0.005
Identities = 19/50 (38%), Positives = 31/50 (62%), Gaps = 3/50 (6%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMAL 81
+KVE ++K+KKKK+K+ KK+ + + R+ER ERAE+E+
Sbjct: 69 AKVEKELLREKEKKKKRKRPGKKRRI---ALRLRRERTKERAEKEKRTRK 115
Score = 29.5 bits (67), Expect = 1.4
Identities = 10/33 (30%), Positives = 24/33 (72%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQERE 69
R ++++ K++ +K+K+ +KN K+R++E+E
Sbjct: 97 RLRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129
Score = 28.4 bits (64), Expect = 2.9
Identities = 8/22 (36%), Positives = 16/22 (72%)
Query: 35 EARKKKKKKKKKKKKKKKKKKN 56
E K+ +K ++KK K+++K+K
Sbjct: 108 EKEKRTRKNREKKFKRRQKEKE 129
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 34.3 bits (79), Expect = 0.013
Identities = 21/48 (43%), Positives = 23/48 (47%), Gaps = 2/48 (4%)
Query: 31 GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
G K EA KKKKKKKKKK + A KK E +E A E
Sbjct: 15 GPKTEADLVKKKKKKKKKKAEDTAAT--AKAKKATAEDVSEGAMFMEG 60
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 35.4 bits (82), Expect = 0.013
Identities = 17/38 (44%), Positives = 24/38 (63%)
Query: 18 QRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKK 55
Q++V ++ + K + +KKKKKKKK KK KKKK
Sbjct: 98 QKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 32.7 bits (75), Expect = 0.13
Identities = 16/24 (66%), Positives = 19/24 (79%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKN 56
+V K K+KK+KKKKKKKKKKK
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKT 126
Score = 32.4 bits (74), Expect = 0.15
Identities = 17/30 (56%), Positives = 23/30 (76%)
Query: 34 VEARKKKKKKKKKKKKKKKKKKNLCANVKK 63
++ + K+KK+KKKKKKKKKKK + A KK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 31.2 bits (71), Expect = 0.35
Identities = 15/31 (48%), Positives = 22/31 (70%)
Query: 36 ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQ 66
++K K+KK+KKKKKKKKKK K+++
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 29.3 bits (66), Expect = 1.6
Identities = 15/31 (48%), Positives = 20/31 (64%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKK 63
K+ + K+K K+KK+KKKKKKK KK
Sbjct: 99 KIVKQVLKEKAKQKKQKKKKKKKKKKKTSKK 129
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 36.8 bits (86), Expect = 0.014
Identities = 11/43 (25%), Positives = 27/43 (62%), Gaps = 1/43 (2%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAER 75
+ E ++KK++KK K++++ K L +++ +E+E ++ R
Sbjct: 280 RQEEAQEKKEEKK-KEEREAKLAKLSPEEQRKLEEKERKKQAR 321
Score = 29.1 bits (66), Expect = 3.7
Identities = 12/58 (20%), Positives = 28/58 (48%), Gaps = 4/58 (6%)
Query: 35 EARKK----KKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEA 88
E +K ++++++K K ++++ A KK +++E A+ + QR E
Sbjct: 256 EVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEE 313
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 33.5 bits (77), Expect = 0.025
Identities = 24/73 (32%), Positives = 32/73 (43%), Gaps = 6/73 (8%)
Query: 31 GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAERE----EEMALTQRGK 86
G K++ +KKKKKKKKK K K++ K + E E E E
Sbjct: 14 GKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMNL 73
Query: 87 EAAQ--FEEWQRQ 97
A+ FEE QR+
Sbjct: 74 TEAERAFEEAQRK 86
Score = 29.7 bits (67), Expect = 0.62
Identities = 25/68 (36%), Positives = 35/68 (51%), Gaps = 4/68 (5%)
Query: 39 KKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQE 98
K KK KKKKKKKKKK N K E E E+ + + G+E E+ +++E
Sbjct: 13 KGKKIDVKKKKKKKKKK----NKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEE 68
Query: 99 DQFHLEQA 106
D +L +A
Sbjct: 69 DGMNLTEA 76
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 34.8 bits (81), Expect = 0.032
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 36 ARKKKKKKKKKKKKKKKKKKNLCANVKKR 64
A +K +KK +K K K +KKK K R
Sbjct: 70 AWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 32.5 bits (75), Expect = 0.16
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKN 56
K + +KK +K K K +KKK K
Sbjct: 69 KAWEKAEKKAEKAKAKAEKKKAKK 92
Score = 32.5 bits (75), Expect = 0.17
Identities = 9/33 (27%), Positives = 16/33 (48%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKR 64
K E + +K +KK +K K K +K + +
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 32.1 bits (74), Expect = 0.21
Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 2/33 (6%)
Query: 35 EARKK--KKKKKKKKKKKKKKKKNLCANVKKRR 65
+ K K +KK +K K K +KK K R
Sbjct: 66 KELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 27.9 bits (63), Expect = 6.6
Identities = 11/22 (50%), Positives = 14/22 (63%)
Query: 36 ARKKKKKKKKKKKKKKKKKKNL 57
A K K K +KKK KK++ K L
Sbjct: 78 AEKAKAKAEKKKAKKEEPKPRL 99
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 35.9 bits (82), Expect = 0.050
Identities = 24/84 (28%), Positives = 44/84 (52%), Gaps = 3/84 (3%)
Query: 30 TGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAA 89
TG EARK ++ KKK + +K ++ + R+ EA +AE + + + ++ ++A
Sbjct: 1108 TGKAEEARKAEEAKKKAEDARKAEEARKAEDA---RKAEEARKAEDAKRVEIARKAEDAR 1164
Query: 90 QFEEWQRQEDQFHLEQARLRSTIR 113
+ EE ++ ED E AR +R
Sbjct: 1165 KAEEARKAEDAKKAEAARKAEEVR 1188
Score = 33.6 bits (76), Expect = 0.26
Identities = 20/88 (22%), Positives = 46/88 (52%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
K E KK + KKK ++ KKK + +++ EA++AE ++ ++ +EA + +
Sbjct: 1478 KAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKAD 1537
Query: 93 EWQRQEDQFHLEQARLRSTIRIQEGRAK 120
E ++ E++ ++ + ++ E + K
Sbjct: 1538 EAKKAEEKKKADELKKAEELKKAEEKKK 1565
Score = 32.8 bits (74), Expect = 0.37
Identities = 10/68 (14%), Positives = 31/68 (45%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
K +A + KK + K + + A +K + + +++ A ++ +E + +
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAD 1394
Query: 93 EWQRQEDQ 100
E +++ ++
Sbjct: 1395 EAKKKAEE 1402
Score = 32.0 bits (72), Expect = 0.78
Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 2/77 (2%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE--MALTQRGKEAAQ 90
K E KK + KK ++KKK + +KK ++++AE A++ EE ++ +EA +
Sbjct: 1529 KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588
Query: 91 FEEWQRQEDQFHLEQAR 107
EE + +E E+ +
Sbjct: 1589 AEEARIEEVMKLYEEEK 1605
Score = 31.3 bits (70), Expect = 1.1
Identities = 19/93 (20%), Positives = 47/93 (50%), Gaps = 3/93 (3%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
K + KK ++ KK + KK ++ KK ++++A+ ++ EE+ ++ +E + E
Sbjct: 1511 KADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEEL---KKAEEKKKAE 1567
Query: 93 EWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLL 125
E ++ E+ ++ + + +E R + + L
Sbjct: 1568 EAKKAEEDKNMALRKAEEAKKAEEARIEEVMKL 1600
Score = 31.3 bits (70), Expect = 1.2
Identities = 12/65 (18%), Positives = 31/65 (47%), Gaps = 1/65 (1%)
Query: 14 EDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERA 73
E+ ++ A+ + + EA K + + + + ++K A KK +++A+ A
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE-AAEKKKEEAKKKADAA 1383
Query: 74 EREEE 78
+++ E
Sbjct: 1384 KKKAE 1388
Score = 31.3 bits (70), Expect = 1.3
Identities = 20/87 (22%), Positives = 47/87 (54%), Gaps = 3/87 (3%)
Query: 35 EARKKKKKKKKK---KKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQF 91
EA+K + KKK KK ++ KK + ++ ++ EA++AE +++ ++ +E +
Sbjct: 1501 EAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKA 1560
Query: 92 EEWQRQEDQFHLEQARLRSTIRIQEGR 118
EE ++ E+ E+ + + + +E +
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAK 1587
Score = 31.3 bits (70), Expect = 1.3
Identities = 20/75 (26%), Positives = 37/75 (49%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
K E KK + KKK ++ KKK ++ ++ EA +AE E + +E A+
Sbjct: 1310 KAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAA 1369
Query: 93 EWQRQEDQFHLEQAR 107
E +++E + + A+
Sbjct: 1370 EKKKEEAKKKADAAK 1384
Score = 31.3 bits (70), Expect = 1.4
Identities = 16/62 (25%), Positives = 30/62 (48%)
Query: 39 KKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQE 98
KKK KKK ++ KK A + EAE AE + E A ++ + + + +++
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKA 1387
Query: 99 DQ 100
++
Sbjct: 1388 EE 1389
Score = 30.9 bits (69), Expect = 1.8
Identities = 14/47 (29%), Positives = 29/47 (61%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
+ EA ++K + +KKK++ KK A KK ++++A+ A+++ E
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE 1401
Score = 30.5 bits (68), Expect = 2.0
Identities = 21/86 (24%), Positives = 44/86 (51%), Gaps = 4/86 (4%)
Query: 23 ARMQVQGTGSKVEARKKKKKKKKKKKK-KKKKKKNLCANVKKRRQEREAERAEREEEMAL 81
A+ + + K + KK + KKK + KK ++ KK + ++A+ A++ EE
Sbjct: 1488 AKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEK-- 1545
Query: 82 TQRGKEAAQFEEWQRQEDQFHLEQAR 107
++ E + EE ++ E++ E+A+
Sbjct: 1546 -KKADELKKAEELKKAEEKKKAEEAK 1570
Score = 30.5 bits (68), Expect = 2.2
Identities = 21/91 (23%), Positives = 47/91 (51%), Gaps = 2/91 (2%)
Query: 10 VYSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQERE 69
+ E++ +++ + + K E KK +++ K K ++ KK + KK + ++
Sbjct: 1625 LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE--EDKKKAEEAKK 1682
Query: 70 AERAEREEEMALTQRGKEAAQFEEWQRQEDQ 100
AE E++ AL + +EA + EE +++E +
Sbjct: 1683 AEEDEKKAAEALKKEAEEAKKAEELKKKEAE 1713
Score = 30.5 bits (68), Expect = 2.3
Identities = 28/109 (25%), Positives = 55/109 (50%), Gaps = 6/109 (5%)
Query: 13 EEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKK---KKKKKKKKNLCANVKKRRQER- 68
E+D ++ A+ + EA KK+ ++ KK KKK+ ++K +KK +E
Sbjct: 1670 AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729
Query: 69 -EAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQE 116
+AE A++E E ++ +EA + EE +++ E+ + IR ++
Sbjct: 1730 IKAEEAKKEAEED-KKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEK 1777
Score = 30.1 bits (67), Expect = 2.5
Identities = 17/73 (23%), Positives = 40/73 (54%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
EA+K + KKK ++ KK + A K++ + + AE +++ ++ +EA + +E
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527
Query: 95 QRQEDQFHLEQAR 107
++ E+ ++A+
Sbjct: 1528 KKAEEAKKADEAK 1540
Score = 30.1 bits (67), Expect = 2.6
Identities = 26/132 (19%), Positives = 60/132 (45%), Gaps = 4/132 (3%)
Query: 35 EARKKKKKKKKKKK--KKKKKKKNLCANVKKRRQE--REAERAEREEEMALTQRGKEAAQ 90
EA+K ++ +KK + KK+ ++ +KK+ E ++AE ++ EE + + +
Sbjct: 1679 EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKE 1738
Query: 91 FEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVEMHEPYTYLTGL 150
EE +++ ++ ++ + +++ K + + K + E + E E
Sbjct: 1739 AEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDK 1798
Query: 151 AIKDLEDLLEDI 162
IKD+ D +I
Sbjct: 1799 KIKDIFDNFANI 1810
Score = 29.7 bits (66), Expect = 3.6
Identities = 27/136 (19%), Positives = 57/136 (41%)
Query: 39 KKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQE 98
KKK + KK + KKK + ++ ++ EA++AE ++ ++ +E + +E ++ E
Sbjct: 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAE 1555
Query: 99 DQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAIKDLEDL 158
+ E+ + + E A+ E E+ + Y + ++ +
Sbjct: 1556 ELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKA 1615
Query: 159 LEDIKVYMELEKGENE 174
E EL+K E E
Sbjct: 1616 EEAKIKAEELKKAEEE 1631
Score = 29.7 bits (66), Expect = 3.7
Identities = 17/71 (23%), Positives = 40/71 (56%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
+A + K K ++ KK +++KKK K+ ++++AE ++ EE + +EA + EE
Sbjct: 1614 KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673
Query: 95 QRQEDQFHLEQ 105
+++ ++ +
Sbjct: 1674 KKKAEEAKKAE 1684
Score = 28.6 bits (63), Expect = 7.5
Identities = 18/70 (25%), Positives = 36/70 (51%), Gaps = 2/70 (2%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMA--LTQRGKEAAQ 90
K E KKK + KK KKK +++++ EA++ E + A ++ +EA +
Sbjct: 1399 KAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1458
Query: 91 FEEWQRQEDQ 100
EE +++ ++
Sbjct: 1459 AEEAKKKAEE 1468
Score = 28.6 bits (63), Expect = 8.9
Identities = 18/84 (21%), Positives = 41/84 (48%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQR 96
+K ++ KK + KKK ++ KK+ +E + + E ++ ++ EA + EE ++
Sbjct: 1464 KKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523
Query: 97 QEDQFHLEQARLRSTIRIQEGRAK 120
++ E+A+ + E + K
Sbjct: 1524 ADEAKKAEEAKKADEAKKAEEKKK 1547
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 34.3 bits (79), Expect = 0.069
Identities = 15/38 (39%), Positives = 27/38 (71%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQERE 69
K E +KKK+ KK+KK+KK KK+K + K++++++
Sbjct: 153 EKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKK 190
Score = 31.6 bits (72), Expect = 0.47
Identities = 14/36 (38%), Positives = 24/36 (66%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
KKKKK+ KK+KK+KK KK K +++++ ++
Sbjct: 156 EKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191
Score = 30.1 bits (68), Expect = 1.7
Identities = 13/33 (39%), Positives = 21/33 (63%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRR 65
K E +K+KK+KK KK+K + K + KK++
Sbjct: 160 KKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 28.5 bits (64), Expect = 4.8
Identities = 14/46 (30%), Positives = 25/46 (54%), Gaps = 2/46 (4%)
Query: 30 TGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAER 75
T + + ++++KK+KKKKK+ KK KK ++E+ E
Sbjct: 140 TAKVEKEAEVEEEEKKEKKKKKEVKKEK--KEKKDKKEKMVEPKGS 183
Score = 28.5 bits (64), Expect = 5.0
Identities = 15/25 (60%), Positives = 16/25 (64%), Gaps = 6/25 (24%)
Query: 37 RKKKKKKK------KKKKKKKKKKK 55
KK KK+K KKKKKKKKKK
Sbjct: 169 EKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 27.8 bits (62), Expect = 8.1
Identities = 12/37 (32%), Positives = 26/37 (70%)
Query: 36 ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
+KKKK+ KK+KK+KK KK+ + ++++++ ++
Sbjct: 156 EKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 35.1 bits (81), Expect = 0.075
Identities = 14/42 (33%), Positives = 22/42 (52%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
R +KK K K KK +K K A +K +A+RA +++
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811
Score = 30.1 bits (68), Expect = 2.6
Identities = 12/43 (27%), Positives = 18/43 (41%)
Query: 31 GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERA 73
G K K KK +K KK K + + A K +R ++
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 34.1 bits (78), Expect = 0.081
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 28 QGTGSKVEARKKKKKKKKKKKKKKKKKKNLCAN-VKKRRQEREAERAEREEE 78
+ T ++ K+KKKKK KKKK K+L N V + + E E++ E
Sbjct: 77 KETEDWFSPNQENKQKKKKKDKKKKSPKSLLLNFVTVEESKPDDEVHEQKSE 128
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 34.2 bits (79), Expect = 0.091
Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 4/65 (6%)
Query: 26 QVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRG 85
+ + +KV A+ K KK K K ++K+K K K+ + RE E E+E+A R
Sbjct: 44 EKEEEKAKVAAKAKAKKALKAKIEEKEKAKRE----KEEKGLRELEEDTPEDELAEKLRL 99
Query: 86 KEAAQ 90
++ +
Sbjct: 100 RKLQE 104
Score = 33.1 bits (76), Expect = 0.17
Identities = 15/51 (29%), Positives = 19/51 (37%), Gaps = 5/51 (9%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALT 82
+ E K K K K KK K K +K + +RE E E T
Sbjct: 43 EEKEEEKAKVAAKAKAKKALKAKIE-----EKEKAKREKEEKGLRELEEDT 88
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 34.6 bits (80), Expect = 0.092
Identities = 12/36 (33%), Positives = 26/36 (72%), Gaps = 6/36 (16%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
+K+++KK +K+KKKKK+KK K ++++++ +
Sbjct: 385 KKREEKKPQKRKKKKKRKK------KGKKRKKKGRK 414
Score = 34.2 bits (79), Expect = 0.11
Identities = 14/23 (60%), Positives = 19/23 (82%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKK 55
K E +K +K+KKKKK+KKK KK+
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKR 408
Score = 31.5 bits (72), Expect = 0.85
Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 5/36 (13%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQER 68
K + +KK +K+KKKKK+KKK K +K++ +
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKK-----RKKKGRK 414
Score = 31.5 bits (72), Expect = 0.92
Identities = 14/35 (40%), Positives = 24/35 (68%), Gaps = 5/35 (14%)
Query: 38 KKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
KKK+++KK +K+KKKKK KK+ ++R+ +
Sbjct: 383 PKKKREEKKPQKRKKKKKR-----KKKGKKRKKKG 412
Score = 28.4 bits (64), Expect = 6.9
Identities = 11/40 (27%), Positives = 25/40 (62%), Gaps = 5/40 (12%)
Query: 36 ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAER 75
+ KKK+++KK +K+KKKK +K++ ++ ++ +
Sbjct: 380 PKPPKKKREEKKPQKRKKKKK-----RKKKGKKRKKKGRK 414
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 34.3 bits (79), Expect = 0.093
Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRR-QEREAERAER 75
KKKKKKKKK KK K K L +K R + ++A+R ER
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 34.6 bits (80), Expect = 0.10
Identities = 24/120 (20%), Positives = 50/120 (41%), Gaps = 2/120 (1%)
Query: 11 YSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKK--RRQER 68
E ++ + R++ + G + RK K KKK+KKKKK K+ A+V +R +
Sbjct: 1144 QEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDS 1203
Query: 69 EAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKY 128
+ +R ++ + Q ++ +++ RL+S ++ D +
Sbjct: 1204 DEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSD 1263
Score = 28.5 bits (64), Expect = 9.2
Identities = 19/64 (29%), Positives = 31/64 (48%)
Query: 14 EDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERA 73
E+ + + Q SK + + K +K K KKK+KKKKK+ KK ++R
Sbjct: 1142 EEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRV 1201
Query: 74 EREE 77
+ +E
Sbjct: 1202 DSDE 1205
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 34.5 bits (79), Expect = 0.11
Identities = 18/72 (25%), Positives = 40/72 (55%), Gaps = 6/72 (8%)
Query: 28 QGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKE 87
G + E + K++KKKKK+K K++ K+ +K ++E + +R +E+E ++ E
Sbjct: 102 SGKEEEKEKEQVKEEKKKKKEKPKEEPKD-----RKPKEEAKEKRPPKEKEKE-KEKKVE 155
Query: 88 AAQFEEWQRQED 99
+ E +++ +
Sbjct: 156 EPRDREEEKKRE 167
Score = 31.4 bits (71), Expect = 0.91
Identities = 21/70 (30%), Positives = 39/70 (55%), Gaps = 2/70 (2%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQ-EREAERAEREEEMALTQRGKEAAQ 90
K EA++K+ K+K+K+K+KK ++ +K+R+ R R ++ + + KE +
Sbjct: 133 PKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPE 192
Query: 91 FEEWQRQEDQ 100
EE QRQ +
Sbjct: 193 -EEKQRQAAR 201
Score = 28.7 bits (64), Expect = 7.0
Identities = 15/79 (18%), Positives = 42/79 (53%), Gaps = 9/79 (11%)
Query: 26 QVQGTGSKVEA----RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMAL 81
+V+ GSK A K+ K + K+++K+K++ K+ +++++ + E ++
Sbjct: 79 RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQV-----KEEKKKKKEKPKEEPKDRKP 133
Query: 82 TQRGKEAAQFEEWQRQEDQ 100
+ KE +E ++++++
Sbjct: 134 KEEAKEKRPPKEKEKEKEK 152
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 33.9 bits (78), Expect = 0.12
Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 8/82 (9%)
Query: 11 YSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA 70
YSEED + + T +VE K + ++++ KKKKKKKKK N
Sbjct: 154 YSEED--------VVPLNPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEAT 205
Query: 71 ERAEREEEMALTQRGKEAAQFE 92
+ E++ A+
Sbjct: 206 VSSAVPTELSSGAGQVGEAKKL 227
>gnl|CDD|177283 PHA00451, PHA00451, protein kinase.
Length = 362
Score = 34.0 bits (78), Expect = 0.14
Identities = 16/90 (17%), Positives = 34/90 (37%), Gaps = 3/90 (3%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQ- 95
+K K++K+ +K + +N R + + R E M + + A
Sbjct: 264 VARKAAMKRRKRNRKLRARNAKELAAMRMEANQIRRNEPRARMLMGDKHWHNAWLHNNNL 323
Query: 96 --RQEDQFHLEQARLRSTIRIQEGRAKPID 123
+ ++ ++ + IQ G+ PID
Sbjct: 324 DFAKLEERAALAWQMADGLAIQWGKPLPID 353
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 32.7 bits (75), Expect = 0.14
Identities = 21/66 (31%), Positives = 37/66 (56%), Gaps = 6/66 (9%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQER-EAERAEREEEMALTQRGKEAAQFEE 93
E RKK K+K ++ K K KK++ A +K ++ER AE E+E A + K+ A+ +
Sbjct: 63 EGRKKAKEKLRRDKLKAKKEE---AEKEKEKEERFMKALAEAEKERAELE--KKKAEAKL 117
Query: 94 WQRQED 99
+ ++
Sbjct: 118 MKEEKK 123
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 33.1 bits (76), Expect = 0.17
Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKK-RRQEREAERAEREEE 78
+ A++ K KKK KKK+K K + VK+ R+ ERE R +R +
Sbjct: 14 AVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAK 61
Score = 28.1 bits (63), Expect = 7.5
Identities = 15/48 (31%), Positives = 20/48 (41%), Gaps = 5/48 (10%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQER---EAERAERE 76
K + K+ K KKK KK + +R E E +AERE
Sbjct: 7 RKARQELAVQVAKQAKAKKKANKKK--RKIYFKRAESFVKEYRKAERE 52
Score = 28.1 bits (63), Expect = 8.0
Identities = 15/82 (18%), Positives = 38/82 (46%), Gaps = 2/82 (2%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
E KK+K +++ + K+ K++++ +RAE + +E + +
Sbjct: 1 ELLLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAE-REIIRLKRL 59
Query: 95 QRQEDQFHLE-QARLRSTIRIQ 115
++ +F++ + +L IRI+
Sbjct: 60 AKKPGKFYVPAEHKLVFVIRIK 81
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 31.8 bits (72), Expect = 0.19
Identities = 14/29 (48%), Positives = 14/29 (48%)
Query: 27 VQGTGSKVEARKKKKKKKKKKKKKKKKKK 55
E KKKKKK KK KK KK K
Sbjct: 90 DDHPPEPTEKPKKKKKKSKKTKKPKKSSK 118
Score = 31.0 bits (70), Expect = 0.36
Identities = 13/20 (65%), Positives = 15/20 (75%)
Query: 37 RKKKKKKKKKKKKKKKKKKN 56
+KKKKK KK KK KK KK+
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120
Score = 30.2 bits (68), Expect = 0.65
Identities = 14/22 (63%), Positives = 16/22 (72%)
Query: 35 EARKKKKKKKKKKKKKKKKKKN 56
E +K KKKKKK KK KK KK+
Sbjct: 95 EPTEKPKKKKKKSKKTKKPKKS 116
Score = 30.2 bits (68), Expect = 0.68
Identities = 12/19 (63%), Positives = 12/19 (63%)
Query: 39 KKKKKKKKKKKKKKKKKNL 57
KKKKKK KK KK KK
Sbjct: 101 KKKKKKSKKTKKPKKSSKK 119
Score = 29.8 bits (67), Expect = 1.0
Identities = 12/22 (54%), Positives = 14/22 (63%)
Query: 33 KVEARKKKKKKKKKKKKKKKKK 54
K + +KKK KK KK KK KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 26.8 bits (59), Expect = 9.6
Identities = 12/26 (46%), Positives = 13/26 (50%)
Query: 30 TGSKVEARKKKKKKKKKKKKKKKKKK 55
+K KKKKKK KK KK K
Sbjct: 89 PDDHPPEPTEKPKKKKKKSKKTKKPK 114
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The
family functions as a highly conserved exonuclease that
is required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing
and export.
Length = 134
Score = 31.9 bits (73), Expect = 0.19
Identities = 16/61 (26%), Positives = 35/61 (57%), Gaps = 4/61 (6%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAE----RAEREEEMALTQRGKEAAQFE 92
RK++++KK +++ K+K+++ K+ R+ER+ E ER+E + L + + +
Sbjct: 30 RKQQRRKKAQEEAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEEDA 89
Query: 93 E 93
E
Sbjct: 90 E 90
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 33.4 bits (77), Expect = 0.21
Identities = 18/52 (34%), Positives = 29/52 (55%)
Query: 27 VQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
++ K E +++++KK+KKKK KKK +K ++E E E E E E
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Score = 33.0 bits (76), Expect = 0.26
Identities = 21/70 (30%), Positives = 39/70 (55%), Gaps = 6/70 (8%)
Query: 30 TGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAA 89
TGSK +K KK +K +KK++++KK +K+++ ++ E EEE ++ +E
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKK------EKKKKAFAGKKKEEEEEEEKEKKEEEKE 456
Query: 90 QFEEWQRQED 99
+ EE +E
Sbjct: 457 EEEEEAEEEK 466
Score = 30.7 bits (70), Expect = 1.4
Identities = 21/56 (37%), Positives = 32/56 (57%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEA 88
K E KK+KKKK KKK++++ K+ +E E E AE E+E ++ K+A
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477
Score = 30.7 bits (70), Expect = 1.5
Identities = 16/70 (22%), Positives = 35/70 (50%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQF 91
K+ + +KK++++KK+KKKK ++ ++ + E + EEE + +E +
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
Query: 92 EEWQRQEDQF 101
++ Q F
Sbjct: 473 KKKQATLFDF 482
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 32.7 bits (75), Expect = 0.21
Identities = 16/85 (18%), Positives = 40/85 (47%), Gaps = 12/85 (14%)
Query: 33 KVEARKKKKKKKKKKKK------------KKKKKKNLCANVKKRRQEREAERAEREEEMA 80
K+ A+K+ K ++K+ ++ +KK ++ K+ + E ++EEE
Sbjct: 2 KIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEER 61
Query: 81 LTQRGKEAAQFEEWQRQEDQFHLEQ 105
+ + + EE+++ + F +E+
Sbjct: 62 KEREEQARKEQEEYEKLKSSFVVEE 86
Score = 30.4 bits (69), Expect = 1.2
Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 7/69 (10%)
Query: 42 KKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREE-EMALTQRGKEAAQFEEWQRQEDQ 100
KK KK+ K ++K + RRQ+REAE EREE + +R E + EE + + ++
Sbjct: 1 KKIGAKKRAKLEEK------QARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREK 54
Query: 101 FHLEQARLR 109
E+ R
Sbjct: 55 KKEEEERKE 63
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 32.4 bits (74), Expect = 0.21
Identities = 22/59 (37%), Positives = 27/59 (45%), Gaps = 1/59 (1%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEE 93
E + KK K K KK KK L A KK + R AE++ A +EAA EE
Sbjct: 111 EGKVNAKKDKLSKAKKAAKKAALEAE-KKVNEARAEAVAEKKAAEAAAVAAEEAAAAEE 168
Score = 30.5 bits (69), Expect = 1.2
Identities = 18/62 (29%), Positives = 23/62 (37%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQF 91
KV A+K K K KK KK + N + E + AE A E +
Sbjct: 112 GKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEA 171
Query: 92 EE 93
EE
Sbjct: 172 EE 173
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 32.5 bits (74), Expect = 0.22
Identities = 16/40 (40%), Positives = 25/40 (62%), Gaps = 5/40 (12%)
Query: 31 GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA 70
G K +K KKKK + K++KKKKK KK++++R +
Sbjct: 136 GLKGHEKKHKKKKHEDDKERKKKKKE-----KKKKKKRHS 170
Score = 29.8 bits (67), Expect = 1.7
Identities = 18/40 (45%), Positives = 23/40 (57%), Gaps = 9/40 (22%)
Query: 35 EARKKKKKK---KKKKKKKKKKKKNLCANVKKRRQEREAE 71
E + KKKK K++KKKKK+KKK KK+R E
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKK------KKKRHSPEHP 174
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 32.2 bits (74), Expect = 0.26
Identities = 16/61 (26%), Positives = 32/61 (52%), Gaps = 4/61 (6%)
Query: 24 RMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNL---CANVKKRRQE-REAERAEREEEM 79
R +Q K E ++ KK +++K++ +K+ L ++KR +E R+ E +E+
Sbjct: 113 RKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEI 172
Query: 80 A 80
A
Sbjct: 173 A 173
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 31.6 bits (72), Expect = 0.28
Identities = 17/43 (39%), Positives = 24/43 (55%)
Query: 36 ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
A+++KKK+KKKKKKK KK + E ++ E EE
Sbjct: 88 AKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEE 130
Score = 30.8 bits (70), Expect = 0.62
Identities = 18/46 (39%), Positives = 27/46 (58%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
K ++ K++KKK+KKKKKKK K +K + E ++ EEE
Sbjct: 82 KTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEE 127
Score = 28.5 bits (64), Expect = 3.4
Identities = 18/44 (40%), Positives = 24/44 (54%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
E KK+ K++KKK+KKKKKK KK +E E +E
Sbjct: 81 EKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDE 124
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 32.9 bits (75), Expect = 0.29
Identities = 23/118 (19%), Positives = 52/118 (44%), Gaps = 8/118 (6%)
Query: 22 YARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMAL 81
Y R Q Q +K ++KKK++++ ++ ++K+ +K+ +ER A + ++++
Sbjct: 64 YNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEA 123
Query: 82 TQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVE 139
++ + E + A+ ++ E AK AK +EA+ A
Sbjct: 124 AKQAALKQKQAEEAAAKA---AAAAKAKA-----EAEAKRAAAAAKKAAAEAKKKAEA 173
Score = 28.6 bits (64), Expect = 5.9
Identities = 18/69 (26%), Positives = 26/69 (37%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
K A KKK + + KKK + K A K + A A+ E A + E A
Sbjct: 194 KAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAA 253
Query: 93 EWQRQEDQF 101
+ + D
Sbjct: 254 KAAAEVDDL 262
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 32.8 bits (75), Expect = 0.32
Identities = 19/93 (20%), Positives = 43/93 (46%), Gaps = 12/93 (12%)
Query: 28 QGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKE 87
+G S+ E K + K+K K ++ K+ K+R++ + E++ + ++ +
Sbjct: 260 EGFESEYEPINKPVRPKRKTKAQRNKE-------KRRKELEREAKEEKQLK----KKLAQ 308
Query: 88 AAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAK 120
A+ +E ++ Q +AR + R + G K
Sbjct: 309 LARLKEIAKEVAQKEKARARKKE-QRKERGEKK 340
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 31.8 bits (73), Expect = 0.40
Identities = 9/40 (22%), Positives = 22/40 (55%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
KV+A++ +++ KK++ +K+K K + R+ +
Sbjct: 109 KVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRK 148
Score = 31.8 bits (73), Expect = 0.42
Identities = 11/65 (16%), Positives = 30/65 (46%)
Query: 23 ARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALT 82
A++Q Q + + R+ +K+K ++++K K +K R ++ + +++
Sbjct: 108 AKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISEL 167
Query: 83 QRGKE 87
G+
Sbjct: 168 TVGQA 172
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 31.9 bits (73), Expect = 0.44
Identities = 14/35 (40%), Positives = 18/35 (51%)
Query: 21 VYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKK 55
A + + + E +KKKKKK KKKK KK
Sbjct: 166 AEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAA 200
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 32.6 bits (74), Expect = 0.45
Identities = 25/123 (20%), Positives = 50/123 (40%), Gaps = 4/123 (3%)
Query: 13 EEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
EE+ L+ + K++ +K+ KK +K+ KK+K++ K + +
Sbjct: 296 EEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAE 355
Query: 73 AEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSE 132
E EE++ Q E + + + LE RL S +++E + + K
Sbjct: 356 EEEEEQLEKLQEKLEQLE----EELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLL 411
Query: 133 AEV 135
E+
Sbjct: 412 LEL 414
Score = 31.5 bits (71), Expect = 0.90
Identities = 40/246 (16%), Positives = 94/246 (38%), Gaps = 26/246 (10%)
Query: 31 GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQ 90
GS+ + +KK++ KK ++ + + + K ++ + E+A++ E + E
Sbjct: 165 GSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELE- 223
Query: 91 FEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVEMHEPYTYLTGL 150
EE D L + R+ + + I+ + + E E+
Sbjct: 224 -EENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEI--------------- 267
Query: 151 AIKDLEDLLEDIKVYMELEKGENEAYWNDISIIVEDELHGLRKLEKQGHSDLTRREGIHE 210
L +L++ K + +K + E ++ ++ ++E +L K + E + E
Sbjct: 268 ----LAQVLKENKEEEKEKKLQEE----ELKLLAKEEEELKSELLKLERRKVDDEEKLKE 319
Query: 211 SVAQDVASVFKGKTTAQLEALQTQIELKISSKTENIDNVITAEEKALQREARKGMGGDEA 270
S +++ + K + E + + ELK + + + LQ + + A
Sbjct: 320 SE-KELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLA 378
Query: 271 EFSVES 276
+ +ES
Sbjct: 379 KKKLES 384
Score = 30.3 bits (68), Expect = 2.4
Identities = 35/228 (15%), Positives = 76/228 (33%), Gaps = 9/228 (3%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
+ KK + + K+K + + + E +E + Q E+++ E
Sbjct: 201 LKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQE 260
Query: 93 EWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAI 152
+ +E A++ + +E K + K + E E E+ L +
Sbjct: 261 LEKEEE-----ILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSEL----LKLERRKV 311
Query: 153 KDLEDLLEDIKVYMELEKGENEAYWNDISIIVEDELHGLRKLEKQGHSDLTRREGIHESV 212
D E L E K +LEK + + E + +++ ++ + +
Sbjct: 312 DDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQ 371
Query: 213 AQDVASVFKGKTTAQLEALQTQIELKISSKTENIDNVITAEEKALQRE 260
++ K + +L + E ++ K E E + Q E
Sbjct: 372 LEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEE 419
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 32.4 bits (73), Expect = 0.54
Identities = 13/41 (31%), Positives = 29/41 (70%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAER 75
EA +K K++ ++K +++++++ ++R +EREAERA +
Sbjct: 583 EAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623
Score = 30.8 bits (69), Expect = 1.6
Identities = 18/50 (36%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 31 GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMA 80
SK+ A+K+++ +K K++ ++K + K++ +ERE ER ERE E A
Sbjct: 574 SSKL-AKKREEAVEKAKREAEQKAREEREREKEKEKERERER-EREAERA 621
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 32.3 bits (74), Expect = 0.57
Identities = 35/193 (18%), Positives = 77/193 (39%), Gaps = 22/193 (11%)
Query: 12 SEEDDNQRLVYARMQVQGTGSKVEARKKKKKK-KKKKKKKKKKKKNLCANVKKRRQEREA 70
S + + L A ++ ++++ + ++ +++ ++++K++ L + ++E E
Sbjct: 309 SIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELED 368
Query: 71 ERAEREEEMALTQRGKEA-----AQFEEWQRQEDQFHLEQARL-----RSTIRIQEGRAK 120
RAE EE ++ + E+ +R+ ++ E RL R + + + A
Sbjct: 369 LRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAA 428
Query: 121 PIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAIKDLEDLLEDIKVYMELEKGENEAYWNDI 180
+ AK E E + L IK E LE + + + E +
Sbjct: 429 IAGIEAKINELEEEKEDK----------ALEIKKQEWKLEQLAADLSKYEQELYDLKEEY 478
Query: 181 SIIVEDELHGLRK 193
VE EL L++
Sbjct: 479 D-RVEKELSKLQR 490
Score = 29.7 bits (67), Expect = 4.1
Identities = 16/99 (16%), Positives = 41/99 (41%), Gaps = 5/99 (5%)
Query: 41 KKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQ 100
KK++ +++ ++ + +++ R + + ER E E ++ +R E E Q ++ +
Sbjct: 860 NGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEEL---EAQIEKKR 916
Query: 101 FHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVE 139
L + L++ + E I+ E +
Sbjct: 917 KRLSE--LKAKLEALEEELSEIEDPKGEDEEIPEEELSL 953
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 30.7 bits (70), Expect = 0.58
Identities = 13/71 (18%), Positives = 41/71 (57%), Gaps = 2/71 (2%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQERE-AERAEREEEMALTQRGKEAAQ 90
E R++K+++K++K++K++K+K + ++ + + E E+ + L R +
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPL-ARYADDED 71
Query: 91 FEEWQRQEDQF 101
++E ++++++
Sbjct: 72 YDEELKEQERW 82
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 32.1 bits (74), Expect = 0.58
Identities = 19/104 (18%), Positives = 40/104 (38%)
Query: 18 QRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREE 77
++L ++V+ S VE + +K++ K++ +++ K + A + +E +AE A
Sbjct: 22 EKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAP 81
Query: 78 EMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKP 121
+ A E E A R + + K
Sbjct: 82 AEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKKG 125
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
This presumed domain is found at the N-terminus of
Ribosomal L30 proteins and has been termed RL30NT or
NUC018.
Length = 71
Score = 29.1 bits (66), Expect = 0.64
Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 5/45 (11%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQ--EREAERAEREE 77
+ R K+ KK+ KK +KKK+ L + KR + +E AER+E
Sbjct: 11 KLRAKRAKKRAAKKAARKKKRKL---IFKRAEKYVKEYRAAERDE 52
Score = 27.6 bits (62), Expect = 2.4
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 2/46 (4%)
Query: 33 KVEARKKKKKKKKKKKKKKKK-KKNLCANVKK-RRQEREAERAERE 76
+ + KK+ KK +KKK+K K VK+ R ER+ R +RE
Sbjct: 13 RAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKRE 58
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 31.7 bits (73), Expect = 0.64
Identities = 12/43 (27%), Positives = 23/43 (53%)
Query: 22 YARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKR 64
Y M+ + + ++ ++ K K +KKK+K++ K A K R
Sbjct: 49 YKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 29.4 bits (67), Expect = 3.4
Identities = 9/20 (45%), Positives = 13/20 (65%)
Query: 38 KKKKKKKKKKKKKKKKKKNL 57
+KKK+K++ K K K K L
Sbjct: 73 QKKKEKQEAKAAKAKSKPRL 92
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 31.8 bits (71), Expect = 0.64
Identities = 21/84 (25%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
Query: 11 YSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA 70
Y EE R +Y + V+ G+ ++ + K K +KKK++ L + +R + E
Sbjct: 109 YREEKKIDRHIYRELYVKAKGNVFRNKRNLMEHIHKVKNEKKKERQLAEQLAAKRLKDEQ 168
Query: 71 ERAE-REEEMALTQRGKEAAQFEE 93
R + R++E+ ++ +E A+ E+
Sbjct: 169 HRHKARKQELRKREKDRERARRED 192
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 32.0 bits (73), Expect = 0.81
Identities = 35/190 (18%), Positives = 73/190 (38%), Gaps = 12/190 (6%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
++ + K + ++ + + +K+ + EAE E E E+ + E + E+
Sbjct: 323 AQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELE-EQL 381
Query: 95 QRQEDQFH---LEQARLRSTIRIQEGRAKPID-----LLAKYIGSEAEVDAVEMHEPYTY 146
+ + L+ A L + I E R + ++ L + +++ E+ E
Sbjct: 382 ETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAE 441
Query: 147 LTGL--AIKDLEDLLEDIKVYMELEKGENEAYWNDISIIVEDELHGLRKLEKQGHSDLTR 204
L L +++L++ LE ++ +E + E E + E EL L+
Sbjct: 442 LEELEEELEELQEELERLEEALEELREELEEAEQALD-AAERELAQLQARLDSLERLQEN 500
Query: 205 REGIHESVAQ 214
EG E V
Sbjct: 501 LEGFSEGVKA 510
Score = 31.6 bits (72), Expect = 1.0
Identities = 53/271 (19%), Positives = 109/271 (40%), Gaps = 31/271 (11%)
Query: 2 SRLEPGTLVYSEEDDNQRLVYAR-MQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCAN 60
+L PG + + + D LV + G+ + +++++ ++ ++K ++ + A
Sbjct: 639 KKLRPGYRIVTLDGD---LVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAE 695
Query: 61 VKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAK 120
++K E E E EEE+ ++ E + Q + L E +
Sbjct: 696 LEKALAELRKELEELEEELEQLRK--------ELEELSRQISALRKDLARLEAEVEQLEE 747
Query: 121 PIDLLAKYIGSEAEVDAVEMHEPYTYLTGL---AIKDLEDLLEDIKVYMEL------EKG 171
I L+K + +E E + E+ E A ++E+L I+ E
Sbjct: 748 RIAQLSKEL-TELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALD 806
Query: 172 ENEAYWNDISIIVEDELHGLRKLEKQGHS---DLTRREGIHESVAQDVASVFKGKTTAQL 228
E A ++ + L LE++ + L E E +++D+ S+ A++
Sbjct: 807 ELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESL-----AAEI 861
Query: 229 EALQTQIELKISSKTENIDNVITAEEKALQR 259
E L+ IE ++ S+ E + N + E+AL
Sbjct: 862 EELEELIE-ELESELEALLNERASLEEALAL 891
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 30.8 bits (70), Expect = 0.82
Identities = 19/87 (21%), Positives = 40/87 (45%), Gaps = 1/87 (1%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
+ E ++K K KKKK KKKK K K+ + K + E++ + E E+++ +
Sbjct: 77 EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK-DEKEAEDKLEDLTKSYSETLST 135
Query: 93 EWQRQEDQFHLEQARLRSTIRIQEGRA 119
+ + ++ L + +S + +
Sbjct: 136 LSELKPRKYALHKDIYQSRLDRKRRAE 162
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
This domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 29.4 bits (67), Expect = 0.82
Identities = 12/30 (40%), Positives = 18/30 (60%)
Query: 36 ARKKKKKKKKKKKKKKKKKKNLCANVKKRR 65
+RKKK+KK+ K + K+ K L +K R
Sbjct: 60 SRKKKEKKEVKAESKRYNAKKLTLEQRKAR 89
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 31.4 bits (72), Expect = 0.83
Identities = 13/43 (30%), Positives = 24/43 (55%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEM 79
RKK +KK++K +KK +K++ A KK+ + + + E
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGET 453
Score = 30.7 bits (70), Expect = 1.4
Identities = 13/41 (31%), Positives = 24/41 (58%), Gaps = 4/41 (9%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAER 75
A +KK +KK++K +KK +K +K +++AE A +
Sbjct: 408 PAERKKLRKKQRKAEKKAEK----EEAEKAAAKKKAEAAAK 444
Score = 29.9 bits (68), Expect = 3.0
Identities = 13/63 (20%), Positives = 24/63 (38%), Gaps = 12/63 (19%)
Query: 38 KKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQ 97
++KK +KK++K +KK E+E ++ A K E ++
Sbjct: 408 PAERKKLRKKQRKAEKKA-----------EKEEAEKAAAKKKAEAAAKKAKGPDGE-TKK 455
Query: 98 EDQ 100
D
Sbjct: 456 VDP 458
Score = 29.5 bits (67), Expect = 3.3
Identities = 11/42 (26%), Positives = 19/42 (45%)
Query: 36 ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREE 77
RK +KK +K++ +K KK A KK + + +
Sbjct: 418 QRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 31.8 bits (72), Expect = 0.91
Identities = 17/65 (26%), Positives = 30/65 (46%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
E +KKKK++K K+K+ KK K K + Q ++A + + + K + E
Sbjct: 17 ELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDENP 76
Query: 95 QRQED 99
+ D
Sbjct: 77 EDFID 81
>gnl|CDD|227515 COG5188, PRP9, Splicing factor 3a, subunit 3 [RNA processing and
modification].
Length = 470
Score = 31.5 bits (71), Expect = 0.92
Identities = 29/186 (15%), Positives = 69/186 (37%), Gaps = 5/186 (2%)
Query: 1 MSRLEPGTLVYSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCAN 60
M+ LE + E + + + R+Q + E R +++ + + ++ + L +
Sbjct: 1 MNLLETRRSLLEEMEIIENAIAERIQRNPKLPRDELRLERQIRIFENMERISNQIWLVEH 60
Query: 61 -VKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRA 119
+++ +R R + L K+ + + + ++ R +A
Sbjct: 61 ERPTGLKDQMMKRINRSIDRDLYGFYKKLGALNVEGKLDGEIEVKGLRDLGYYESSAPKA 120
Query: 120 KPIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAIKDLEDLLEDIKVYMEL----EKGENEA 175
+ +D+ A + GSE D +E+ + + + LE +K E +N
Sbjct: 121 RDLDVEAAFKGSELYGDGMELERIFRKYANVHLCSDCSYLEFLKKLERFDLTTEPSKNFR 180
Query: 176 YWNDIS 181
Y +S
Sbjct: 181 YLEYLS 186
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 31.2 bits (71), Expect = 1.0
Identities = 14/99 (14%), Positives = 45/99 (45%), Gaps = 3/99 (3%)
Query: 14 EDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERA 73
+D + A+ ++ K+ K +++++ ++ K+K+++ L ++ E++ A
Sbjct: 159 DDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEA 218
Query: 74 EREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTI 112
E+++ L ++ +++ E++ E R
Sbjct: 219 ALEKQLRLEFEREKEELRKKY---EEKLRQELERQAEAH 254
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 30.7 bits (70), Expect = 1.0
Identities = 16/45 (35%), Positives = 30/45 (66%), Gaps = 4/45 (8%)
Query: 34 VEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
+E R++K++++K +KK+K+K+ A K+ Q+ EAE + EE
Sbjct: 8 LEQRRRKREQRKARKKQKRKE----AKKKEDAQKSEAEEVKNEEN 48
Score = 29.2 bits (66), Expect = 2.6
Identities = 16/38 (42%), Positives = 26/38 (68%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAE 74
+K K+K+K+KKK KK+ K V+K++ ER+ +R E
Sbjct: 145 KKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREE 182
Score = 28.0 bits (63), Expect = 7.6
Identities = 12/19 (63%), Positives = 13/19 (68%)
Query: 36 ARKKKKKKKKKKKKKKKKK 54
RK KK KKKKK KKK +
Sbjct: 186 KRKDDKKNKKKKKAKKKGR 204
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 31.1 bits (70), Expect = 1.0
Identities = 33/156 (21%), Positives = 59/156 (37%), Gaps = 13/156 (8%)
Query: 13 EEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKK----KKKKKKNLCANVKKR--RQ 66
+ D + Q + E R+K+++ K K K+ K + N K+ +
Sbjct: 227 IDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKA 286
Query: 67 EREAERAEREEEMA-------LTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRA 119
+ E ++ + E A L Q K + + E + E Q E + A
Sbjct: 287 QIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEA 346
Query: 120 KPIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAIKDL 155
+P L I S V +++ +P T L+ L + DL
Sbjct: 347 QPTSLNEDAIDSSNPVYGLKVVDPITNLSELVLIDL 382
Score = 29.2 bits (65), Expect = 5.1
Identities = 18/67 (26%), Positives = 31/67 (46%), Gaps = 4/67 (5%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAERE-EEMALTQRGKEAAQF 91
K E KK+ K ++K + N K+R + R+ ++ + + A T KE Q
Sbjct: 219 KEELDKKQIDADKAQQKADFAQDNA---DKQRDEVRQKQQEAKNLPKPADTSSPKEDKQV 275
Query: 92 EEWQRQE 98
E Q++E
Sbjct: 276 AENQKRE 282
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 30.8 bits (70), Expect = 1.1
Identities = 12/37 (32%), Positives = 14/37 (37%)
Query: 20 LVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKN 56
V A K + KKKK K+ K K KK
Sbjct: 240 DVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVV 276
Score = 30.0 bits (68), Expect = 2.0
Identities = 13/27 (48%), Positives = 16/27 (59%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKK 63
RK K++ KKKK K+ K K L V K
Sbjct: 252 RKSKEEIKKKKPKESKGVKALKKVVAK 278
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 30.4 bits (69), Expect = 1.1
Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 1/46 (2%)
Query: 7 GTLVYSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKK 52
L Y+E D ++ + + A KK +K + KKKK
Sbjct: 152 AGLWYAERKDAKQKE-EFAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 30.9 bits (70), Expect = 1.1
Identities = 16/64 (25%), Positives = 35/64 (54%), Gaps = 3/64 (4%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAE---RAEREEEMALTQRGKEAAQFEE 93
++ K++++KK+ +++KKKK A K+R E E +E R +E + +E
Sbjct: 285 KEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQEERKEQLAKLRKEEKEREKE 344
Query: 94 WQRQ 97
++++
Sbjct: 345 YEQE 348
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 29.9 bits (68), Expect = 1.1
Identities = 14/59 (23%), Positives = 30/59 (50%), Gaps = 8/59 (13%)
Query: 18 QRLVYARMQVQGTGSKV--------EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQER 68
QR ++ G +K E K++KKK+ K+KK+++K++ +K++ +
Sbjct: 71 QRQAAKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKH 129
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 31.4 bits (71), Expect = 1.2
Identities = 20/61 (32%), Positives = 36/61 (59%), Gaps = 5/61 (8%)
Query: 36 ARKKKKKKKKKKKKKKKKK---KNLCA--NVKKRRQEREAERAEREEEMALTQRGKEAAQ 90
+RKKKKKKK K+KKKKK + K + A N+++ + E++ ++ L ++ ++
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKKFGQSPV 742
Query: 91 F 91
F
Sbjct: 743 F 743
Score = 29.5 bits (66), Expect = 3.8
Identities = 20/56 (35%), Positives = 28/56 (50%), Gaps = 5/56 (8%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
+KKKKKKK K+KKKKK N + +Q E E E + ++ E Q +
Sbjct: 682 GSRKKKKKKKSKEKKKKK-----NREASKQIHALENIEEGIEGSNNEKSSETPQLK 732
>gnl|CDD|237700 PRK14397, PRK14397, membrane protein; Provisional.
Length = 222
Score = 30.6 bits (69), Expect = 1.2
Identities = 11/41 (26%), Positives = 16/41 (39%)
Query: 241 SKTENIDNVITAEEKALQREARKGMGGDEAEFSVESLLDPQ 281
S ENI + EEK Q++ G A + + D
Sbjct: 175 SHRENIGRLARGEEKPWQKKHHDAAQGTAAGAAPTANADAA 215
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 31.3 bits (71), Expect = 1.2
Identities = 14/24 (58%), Positives = 16/24 (66%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKK 55
V K KK KKKKKKKKK++K
Sbjct: 267 YDVSEMVKFKKPKKKKKKKKKRRK 290
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 31.1 bits (70), Expect = 1.2
Identities = 21/107 (19%), Positives = 45/107 (42%), Gaps = 9/107 (8%)
Query: 31 GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQE--REAERAEREEEMALTQRGKEA 88
+ E R++ + + + + +K+ + +K + + + + E+E + T+ A
Sbjct: 154 KTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRA--NA 211
Query: 89 AQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEV 135
AQ R E+ A ++ IQ+ R I A+ I + AE
Sbjct: 212 AQ----ARTEELARRAAAAQQTAQAIQQ-RDAQISQKAQQIAARAEQ 253
Score = 28.8 bits (64), Expect = 6.6
Identities = 32/199 (16%), Positives = 72/199 (36%), Gaps = 28/199 (14%)
Query: 61 VKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAK 120
+ ++ E ER E+ ++ +EA + E +++ +Q R T + Q+ + +
Sbjct: 93 AQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQTR 152
Query: 121 PIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAIKDLEDLLEDIKVYMELEKGENEAYWNDI 180
L + EA+ +++ + + +K + + + + + E E
Sbjct: 153 LKTLAEQRRQLEAQAQSLQASQKQLQASATQLK-----SQVLDLKLRSAQIEQE------ 201
Query: 181 SIIVEDELHGLRKLEKQGHSDLTRREGIHESVAQDVASVFKGKTTAQLEALQTQIELKIS 240
L + +L RR + AQ + Q +A +Q +I+
Sbjct: 202 ----AQNLATRANAAQARTEELARRAAAAQQTAQAI---------QQRDAQISQKAQQIA 248
Query: 241 SKTENIDNVITAEEKALQR 259
++ I E+ LQR
Sbjct: 249 AR----AEQIRERERQLQR 263
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 31.0 bits (70), Expect = 1.2
Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 9/64 (14%)
Query: 23 ARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALT 82
+ + K +K + K +KK+ + K+R ERE R R+E A
Sbjct: 323 PELHAENAEIKKTRTAEKNEAKARKKEIAQ---------KRRAAEREINREARQERAAAM 373
Query: 83 QRGK 86
R +
Sbjct: 374 ARAR 377
Score = 29.4 bits (66), Expect = 3.8
Identities = 20/114 (17%), Positives = 43/114 (37%), Gaps = 11/114 (9%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANV-------KKRRQEREAERAEREEEMALTQRGKEAA 89
+ ++K+K+KK ++N + + + E ++ E+ R KE A
Sbjct: 292 AQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEIA 351
Query: 90 Q-FEEWQRQEDQFHLE---QARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVE 139
Q +R+ ++ + A R+ R +AK L+ + + E
Sbjct: 352 QKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEE 405
>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain. This region has been
identified as the binding site of the SRP72 protein to
SRP RNA.
Length = 57
Score = 28.2 bits (63), Expect = 1.2
Identities = 12/32 (37%), Positives = 17/32 (53%)
Query: 26 QVQGTGSKVEARKKKKKKKKKKKKKKKKKKNL 57
+ + K ++KKKK KKK+K K KN
Sbjct: 4 KPKKIAKKKPGTEQKKKKSAKKKRKPKLPKNF 35
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 30.8 bits (70), Expect = 1.3
Identities = 14/67 (20%), Positives = 30/67 (44%), Gaps = 8/67 (11%)
Query: 11 YSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA 70
+S E D L R +++ K++ + K K K ++ KK++ + RR+++E
Sbjct: 333 FSGEPDGISL---REELEKRIEKLKEKPPKPPTKAKPERDKKERPG-----RYRRKKKEK 384
Query: 71 ERAEREE 77
+
Sbjct: 385 KAKSERR 391
>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 137 to
196 amino acids in length.
Length = 149
Score = 30.0 bits (68), Expect = 1.3
Identities = 9/47 (19%), Positives = 21/47 (44%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMAL 81
+RK+ + K+ +KK K A K + ++++ + E +
Sbjct: 3 SSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASESGI 49
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 31.1 bits (71), Expect = 1.3
Identities = 11/45 (24%), Positives = 27/45 (60%)
Query: 34 VEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
+E + ++ KK+ KK ++K + L A ++K+ +E E + + ++
Sbjct: 91 IEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 30.0 bits (68), Expect = 1.4
Identities = 10/22 (45%), Positives = 15/22 (68%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLC 58
+K K K+ +K + KKKKKK+
Sbjct: 87 KKAKGKRSRKNQTKKKKKKDDN 108
Score = 27.7 bits (62), Expect = 8.0
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 35 EARKKKKKKKKKKKKKKKKKK 55
E KK K K+ +K + KKKKK
Sbjct: 84 EVSKKAKGKRSRKNQTKKKKK 104
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 30.5 bits (69), Expect = 1.4
Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 8/73 (10%)
Query: 45 KKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLE 104
K+ +++KK +K++QERE ER E E L + ++EEW RQ+ Q +
Sbjct: 93 LSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKE-----KYEEWCRQKAQ---Q 144
Query: 105 QARLRSTIRIQEG 117
A+ R+ +E
Sbjct: 145 AAKQRTPKHKKEA 157
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 30.4 bits (69), Expect = 1.5
Identities = 18/78 (23%), Positives = 39/78 (50%), Gaps = 7/78 (8%)
Query: 23 ARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALT 82
+R R K+++ ++KK + K +K+ K++++E+E + ER E T
Sbjct: 142 SRSSTVQNKEATHERLKEREIRRKKIQAKARKR------KEKKKEKELTQEERLAEAKET 195
Query: 83 QRGKEAAQFEEWQRQEDQ 100
+R + E ++ QE++
Sbjct: 196 ERINLKS-LERYEEQEEE 212
>gnl|CDD|225651 COG3109, ProQ, Activator of osmoprotectant transporter ProP [Signal
transduction mechanisms].
Length = 208
Score = 30.2 bits (68), Expect = 1.6
Identities = 10/76 (13%), Positives = 34/76 (44%), Gaps = 4/76 (5%)
Query: 16 DNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKK----KKKKKNLCANVKKRRQEREAE 71
+ + + +AR Q+ ++V+A++ +++ KK+++ K + + + + E
Sbjct: 92 EEEHVEHARKQLAEAKARVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPREE 151
Query: 72 RAEREEEMALTQRGKE 87
+++ G+
Sbjct: 152 VHTPVSDISALTVGQA 167
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 30.5 bits (70), Expect = 1.7
Identities = 6/26 (23%), Positives = 11/26 (42%)
Query: 30 TGSKVEARKKKKKKKKKKKKKKKKKK 55
G K + +K K++ +K K
Sbjct: 833 AGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 29.0 bits (66), Expect = 6.0
Identities = 7/21 (33%), Positives = 9/21 (42%)
Query: 35 EARKKKKKKKKKKKKKKKKKK 55
+A K KK K +K K
Sbjct: 826 DAPAKDPAGKKAAVKFSRKTK 846
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 30.2 bits (68), Expect = 1.7
Identities = 15/71 (21%), Positives = 30/71 (42%), Gaps = 6/71 (8%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQER------EAERAEREEEMALTQRGKEA 88
EA+ K+ + K K + + +KK K+ +E EA++ E + K
Sbjct: 123 EAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAK 182
Query: 89 AQFEEWQRQED 99
A+ + + E+
Sbjct: 183 AEAKAKAKAEE 193
Score = 30.2 bits (68), Expect = 2.3
Identities = 20/71 (28%), Positives = 27/71 (38%), Gaps = 1/71 (1%)
Query: 23 ARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALT 82
A+ + K EA K K + K K K ++ K A K E A +AE E A
Sbjct: 164 AKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAE-AAAKAEAEAAAAAA 222
Query: 83 QRGKEAAQFEE 93
+ A E
Sbjct: 223 AEAERKADEAE 233
Score = 29.0 bits (65), Expect = 4.0
Identities = 10/71 (14%), Positives = 34/71 (47%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
+ ++ ++ +K++ ++ ++K L + ++AE+A ++ E Q + A+
Sbjct: 72 KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAE 131
Query: 95 QRQEDQFHLEQ 105
+ + + E+
Sbjct: 132 AKAKAEAEAEK 142
Score = 28.6 bits (64), Expect = 5.4
Identities = 19/78 (24%), Positives = 33/78 (42%), Gaps = 4/78 (5%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA-ERAEREEEMALTQRGKEAAQF 91
K EA+K+ +++ K K + KKK A K++ E EA +AE + + + +A
Sbjct: 145 KEEAKKQAEEEAKAKAAAEAKKK---AAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAA 201
Query: 92 EEWQRQEDQFHLEQARLR 109
+ E E
Sbjct: 202 KAKAAAEAAAKAEAEAAA 219
Score = 28.6 bits (64), Expect = 5.6
Identities = 12/65 (18%), Positives = 31/65 (47%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
++E R +K K+ ++ K+ + ++ + ++ AE + E A + +EA +
Sbjct: 93 ELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQA 152
Query: 93 EWQRQ 97
E + +
Sbjct: 153 EEEAK 157
Score = 28.3 bits (63), Expect = 7.6
Identities = 18/81 (22%), Positives = 24/81 (29%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
K EA+ K K ++ K K + K K K E A A E A +
Sbjct: 182 KAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGLA 241
Query: 93 EWQRQEDQFHLEQARLRSTIR 113
E Q S +
Sbjct: 242 SGSNAEKQGGARGNAAGSEVD 262
>gnl|CDD|236311 PRK08609, PRK08609, hypothetical protein; Provisional.
Length = 570
Score = 30.7 bits (70), Expect = 1.7
Identities = 26/87 (29%), Positives = 42/87 (48%), Gaps = 11/87 (12%)
Query: 153 KDLEDLLEDIKVYMELEKGEN----EAYWNDISIIVEDELHGLRKLEKQGHSDLTRREGI 208
KD+ LLE I YMEL KGEN A+ + DE R L + D T+ +GI
Sbjct: 4 KDVIKLLETIATYMEL-KGENPFKISAFRKAAQALELDE----RSLSEI--DDFTKLKGI 56
Query: 209 HESVAQDVASVFKGKTTAQLEALQTQI 235
+ A+ + + ++ L+ L+ ++
Sbjct: 57 GKGTAEVIQEYRETGESSVLQELKKEV 83
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 30.6 bits (69), Expect = 1.8
Identities = 14/45 (31%), Positives = 23/45 (51%)
Query: 34 VEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
+E + KK K KKK KK KKK+ ++ + + ++ EE
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEES 186
>gnl|CDD|215732 pfam00121, TIM, Triosephosphate isomerase.
Length = 242
Score = 29.9 bits (68), Expect = 1.8
Identities = 19/75 (25%), Positives = 31/75 (41%), Gaps = 20/75 (26%)
Query: 198 GHSDLTRREGIHES---VAQDVASVFK---------------GKTTAQLEALQTQIELKI 239
GHS+ RR+ E+ VA+ V + K + LE L+ Q++ +
Sbjct: 91 GHSE--RRQYFGETDEIVAKKVKAALKNGLTPVLCVGETLEEREAGKTLEVLKRQLKAIL 148
Query: 240 SSKTENIDNVITAEE 254
+ E N+I A E
Sbjct: 149 AGLDEEWKNLIIAYE 163
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 30.4 bits (68), Expect = 1.8
Identities = 14/50 (28%), Positives = 29/50 (58%), Gaps = 5/50 (10%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQR 84
E R+ +++ +K K++KKK++K K+ R+ + E +E++A Q
Sbjct: 86 EERRHRQRLEKDKREKKKREKE-----KRGRRRHHSLGTESDEDIAPAQM 130
Score = 28.5 bits (63), Expect = 7.0
Identities = 18/69 (26%), Positives = 33/69 (47%)
Query: 10 VYSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQERE 69
V D++ + + T + +KKK++K+K++KKKKKK + + E+
Sbjct: 254 VSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQP 313
Query: 70 AERAEREEE 78
+ EEE
Sbjct: 314 VQNGAVEEE 322
>gnl|CDD|220121 pfam09135, Alb1, Alb1. Alb1 is a nuclear shuttling factor
involved in ribosome biogenesis.
Length = 112
Score = 29.0 bits (65), Expect = 1.9
Identities = 19/57 (33%), Positives = 30/57 (52%), Gaps = 8/57 (14%)
Query: 30 TGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGK 86
SK E KKK++KKK K+ +K +++RQ++ ERAE + +R K
Sbjct: 39 AASKNEGVSKKKQRKKKPKRLTRK--------QRKRQKKGLERAEIVTDTLSKKRAK 87
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 30.5 bits (69), Expect = 2.0
Identities = 39/231 (16%), Positives = 96/231 (41%), Gaps = 13/231 (5%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
+ ++ +++ ++ +++ ++ K A +++ ++ ++ E EEE+ + E Q
Sbjct: 702 DLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLE 761
Query: 95 QRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAIKD 154
+ +E+ LE+A + I+E K L + E E++ E +
Sbjct: 762 ELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAE----------RRLDA 811
Query: 155 LEDLLEDIKVYMELEKGENEAYWNDISIIVEDELHGLRKLEKQGHSDLTRREGIHESVAQ 214
LE LE ++ E + E E +I +E++L L + ++ +L + E +
Sbjct: 812 LERELESLEQRRERLEQEIEELEEEI-EELEEKLDELEEELEELEKELEELKEELEELEA 870
Query: 215 DVASVFK--GKTTAQLEALQTQIELKISSKTENIDNVITAEEKALQREARK 263
+ + + + E L+ ++ S E + + E+ + EA+
Sbjct: 871 EKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKL 921
Score = 29.7 bits (67), Expect = 3.8
Identities = 19/128 (14%), Positives = 54/128 (42%), Gaps = 1/128 (0%)
Query: 13 EEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
E+ +RL + +++ ++E R+ ++ ++ + ++ K E E
Sbjct: 318 LEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEEL 377
Query: 73 -AEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGS 131
EE+A + + E + + + LE+ R + R+++ + + +L A+
Sbjct: 378 FEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEEL 437
Query: 132 EAEVDAVE 139
+ E++ +
Sbjct: 438 QTELEELN 445
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 29.8 bits (67), Expect = 2.0
Identities = 13/23 (56%), Positives = 15/23 (65%)
Query: 29 GTGSKVEARKKKKKKKKKKKKKK 51
G K A+K KK KKKKKKK+
Sbjct: 195 YKGDKSSAKKDKKSKKKKKKKRS 217
Score = 27.8 bits (62), Expect = 9.9
Identities = 12/18 (66%), Positives = 12/18 (66%)
Query: 39 KKKKKKKKKKKKKKKKKN 56
K KK KK KKKKKKK
Sbjct: 199 KSSAKKDKKSKKKKKKKR 216
>gnl|CDD|217611 pfam03552, Cellulose_synt, Cellulose synthase. Cellulose, an
aggregate of unbranched polymers of beta-1,4-linked
glucose residues, is the major component of wood and
thus paper, and is synthesised by plants, most algae,
some bacteria and fungi, and even some animals. The
genes that synthesise cellulose in higher plants differ
greatly from the well-characterized genes found in
Acetobacter and Agrobacterium sp. More correctly
designated as 'cellulose synthase catalytic subunits',
plant cellulose synthase (CesA) proteins are integral
membrane proteins, approximately 1,000 amino acids in
length. There are a number of highly conserved residues,
including several motifs shown to be necessary for
processive glycosyltransferase activity.
Length = 716
Score = 30.5 bits (69), Expect = 2.0
Identities = 14/47 (29%), Positives = 20/47 (42%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQ 83
R+KK K K KKK K+K A + E + E+ M+
Sbjct: 312 RRKKPKSASKAKKKDSKRKESEAPIFNLEDIDEGDEDEKSSLMSQLS 358
>gnl|CDD|180007 PRK05309, PRK05309, 30S ribosomal protein S11; Validated.
Length = 128
Score = 29.0 bits (66), Expect = 2.1
Identities = 10/14 (71%), Positives = 10/14 (71%)
Query: 43 KKKKKKKKKKKKKN 56
KKKK KKK KKN
Sbjct: 1 AKKKKTVKKKVKKN 14
>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain. This family
contains a C-terminal presumed domain in Sas10 which
hash been identified as a regulator of chromatin
silencing.
Length = 76
Score = 28.0 bits (63), Expect = 2.2
Identities = 11/40 (27%), Positives = 19/40 (47%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
K + R + KK+KK +K K+ K+ V++ E
Sbjct: 24 KKDRRNPRVKKRKKYEKAVKRLKSQRRVVREEDGGYGGET 63
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 30.2 bits (68), Expect = 2.2
Identities = 17/88 (19%), Positives = 41/88 (46%), Gaps = 8/88 (9%)
Query: 7 GTLVYSEEDDNQRLVYARMQVQGTGSKVEAR----KKKKKKKKKKKKKKKKKKNLCANVK 62
G VY D V +++G + ++K+++ +K K++ +++ +
Sbjct: 221 GQNVYDNNDHKDFCV----EIEGGMDEHSFEDFLLEEKRRELEKLAKEEAERERQAEEQR 276
Query: 63 KRRQEREAERAEREEEMALTQRGKEAAQ 90
+R +E+ A A+R + A ++ +E Q
Sbjct: 277 RREEEKAAMEADRAQAKAEVEKRREKLQ 304
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 30.4 bits (69), Expect = 2.3
Identities = 43/224 (19%), Positives = 86/224 (38%), Gaps = 41/224 (18%)
Query: 13 EEDDNQRLVYARMQVQGTGSKVE-ARKKKKKKKKKKKKKKKKKKNL---CANVKKRRQER 68
+E+ RL R+ Q + E R+ +++ ++ +++ L ++ ++R
Sbjct: 323 DEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDR 382
Query: 69 EAERAEREEEM---------ALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIR-IQEGR 118
E E EEE+ A G EE + + D+ +A L +T+R +E
Sbjct: 383 REEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERV 442
Query: 119 AKPIDLLA--------KYIGSEAEVDAVEMHEPYTYLTGLAIKDLEDLLEDIKVYMELEK 170
+ LL + + V+ +E +++LE LED++ E+E+
Sbjct: 443 EEAEALLEAGKCPECGQPVEGSPHVETIEEDRE-------RVEELEAELEDLEE--EVEE 493
Query: 171 GENEAYWNDISIIVEDELHGLRKLEKQGHSDLTRREGIHESVAQ 214
E + L + E + RRE + E +A+
Sbjct: 494 VEERL----------ERAEDLVEAEDRIERLEERREDLEELIAE 527
>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein.
Length = 346
Score = 30.1 bits (68), Expect = 2.3
Identities = 8/37 (21%), Positives = 14/37 (37%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQER 68
++K KK + KK + + K R +R
Sbjct: 201 KLGFGKQKSSSKKPSRVTKKSAAEAAKKQLNKPRWKR 237
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 29.9 bits (67), Expect = 2.3
Identities = 13/63 (20%), Positives = 23/63 (36%), Gaps = 6/63 (9%)
Query: 27 VQGTGSKVEARKKKK------KKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMA 80
E +K KK K K KK KKK +VK + ++ + +++
Sbjct: 138 STAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKSQDLV 197
Query: 81 LTQ 83
+
Sbjct: 198 ING 200
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association
with pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 28.0 bits (63), Expect = 2.3
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 25 MQVQGTGSKVEARKKKKKKKKKKKKKKKKKKN 56
+ + R ++ ++KKKKKKK KKK+
Sbjct: 22 EDDEQIDASKVRRISQRNRRKKKKKKKLKKKS 53
Score = 27.3 bits (61), Expect = 3.8
Identities = 12/42 (28%), Positives = 27/42 (64%)
Query: 13 EEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKK 54
E+D++++ ++++ ++ + +KKKK KKK K+ +KK
Sbjct: 20 GEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61
>gnl|CDD|236643 PRK10044, PRK10044, ferrichrome outer membrane transporter;
Provisional.
Length = 727
Score = 30.1 bits (68), Expect = 2.4
Identities = 16/53 (30%), Positives = 22/53 (41%), Gaps = 5/53 (9%)
Query: 107 RLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVEMHEPYTYL-----TGLAIKD 154
R+R+ I G A + LL Y + D PY L TGL ++D
Sbjct: 399 RMRNDINAWFGYADSVPLLNLYGPVNTDFDFNANSGPYQILNKQKQTGLYVQD 451
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 29.4 bits (66), Expect = 2.6
Identities = 22/81 (27%), Positives = 38/81 (46%), Gaps = 1/81 (1%)
Query: 42 KKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEA-AQFEEWQRQEDQ 100
KK +K+++K+ + K+ Q+ E E AE+E+E +R A Q +E QE++
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEE 92
Query: 101 FHLEQARLRSTIRIQEGRAKP 121
E ++ R KP
Sbjct: 93 KEEEAEDVKQQEVFSFKRKKP 113
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 29.7 bits (68), Expect = 2.7
Identities = 14/50 (28%), Positives = 25/50 (50%), Gaps = 6/50 (12%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQR 84
EA+K+ + KK+ + K++ + K R E E E ER E+ ++
Sbjct: 46 EAKKEAEAIKKEALLEAKEE------IHKLRNEFEKELRERRNELQKLEK 89
Score = 29.0 bits (66), Expect = 5.0
Identities = 18/85 (21%), Positives = 47/85 (55%), Gaps = 3/85 (3%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEA-AQF 91
+EA+++ K + + +K+ ++++N ++KR ++E E +R+ E L +R +E +
Sbjct: 59 LLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKE-ENLDRKLE-LLEKREEELEKKE 116
Query: 92 EEWQRQEDQFHLEQARLRSTIRIQE 116
+E ++++ + ++ L I Q
Sbjct: 117 KELEQKQQELEKKEEELEELIEEQL 141
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is
a small protein module with a triple-stranded
beta-sheet fold. This is a family of WW domain binding
proteins.
Length = 78
Score = 27.6 bits (62), Expect = 2.8
Identities = 15/57 (26%), Positives = 33/57 (57%), Gaps = 3/57 (5%)
Query: 34 VEARKKKKKKKKKKKKKKKKKKNLCANVKKR---RQEREAERAEREEEMALTQRGKE 87
+A +K++KKK+ KK K +++ A + K+ R +RE ++ + +EE + ++
Sbjct: 7 TDAYRKEQKKKELKKNKAERQARREAKLAKKNPDRLQRELDKLKAQEESGKLRPVEK 63
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 29.8 bits (68), Expect = 2.9
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 35 EARKKKKKKKKKKKKKKKKKKN 56
E R+ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754
>gnl|CDD|234032 TIGR02855, spore_yabG, sporulation peptidase YabG. Members of this
family are the protein YabG, demonstrated for Bacillus
subtilis to be an endopeptidase able to release
N-terminal peptides from a number of sporulation
proteins, including CotT, CotF, and SpoIVA. It appears
to be expressed under control of sigma-K [Cellular
processes, Sporulation and germination].
Length = 283
Score = 29.6 bits (67), Expect = 2.9
Identities = 10/57 (17%), Positives = 26/57 (45%), Gaps = 2/57 (3%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLC--ANVKKRRQEREAERAEREEEMALTQRGK 86
V+ ++KK+KK+ K++ + + + K ++E A + ++ G+
Sbjct: 50 ELVDESERKKRKKRFKERMETSYRLFRQDYKLMKEKKEYRATGGYKNNQLYFGMPGR 106
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 29.7 bits (67), Expect = 3.2
Identities = 26/128 (20%), Positives = 48/128 (37%), Gaps = 8/128 (6%)
Query: 11 YSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA 70
+ NQ+ AR + + + E +K+++ + +K +N +K RE
Sbjct: 60 LLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSAREL 119
Query: 71 ERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIG 130
E E E++ L A Q ++ L A L E +A+ + + +
Sbjct: 120 ELEELEKQ--LDNELYRVAGLTPEQARKLLLKLLDAELE------EEKAQRVKKIEEEAD 171
Query: 131 SEAEVDAV 138
EAE A
Sbjct: 172 LEAERKAQ 179
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 29.0 bits (65), Expect = 3.2
Identities = 12/56 (21%), Positives = 29/56 (51%)
Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEA 88
+ ++K+ K KK+KK+KK+K + + K++ + + ++ T + +E
Sbjct: 102 VSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEG 157
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 29.3 bits (66), Expect = 3.4
Identities = 11/31 (35%), Positives = 14/31 (45%)
Query: 26 QVQGTGSKVEARKKKKKKKKKKKKKKKKKKN 56
V S +RK+K KK KK KK+
Sbjct: 214 VVPTPQSSSPSRKRKAPKKVAKKVAAAKKRK 244
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 29.7 bits (67), Expect = 3.6
Identities = 18/49 (36%), Positives = 26/49 (53%)
Query: 12 SEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCAN 60
+DD+ L + Q +K +KKKKKKKKKKKK + +L +
Sbjct: 32 DVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYD 80
Score = 28.9 bits (65), Expect = 5.6
Identities = 21/63 (33%), Positives = 29/63 (46%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
EA K+ KKKKKKKKKKKKKNL + ++ + ++ + E
Sbjct: 49 EAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWK 108
Query: 95 QRQ 97
Q Q
Sbjct: 109 QTQ 111
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 28.3 bits (63), Expect = 3.7
Identities = 15/22 (68%), Positives = 17/22 (77%)
Query: 36 ARKKKKKKKKKKKKKKKKKKNL 57
KK KKK++KKKKKKK KNL
Sbjct: 102 QAKKLKKKREKKKKKKKGAKNL 123
Score = 27.1 bits (60), Expect = 8.9
Identities = 13/20 (65%), Positives = 16/20 (80%)
Query: 36 ARKKKKKKKKKKKKKKKKKK 55
R+ KK KKK++KKKKKKK
Sbjct: 100 QRQAKKLKKKREKKKKKKKG 119
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 29.8 bits (67), Expect = 3.7
Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 4/49 (8%)
Query: 64 RRQEREAERAEREEEMALTQRGK---EAAQFEEWQRQEDQ-FHLEQARL 108
RR++ A+RAE A +R + EA Q D+ H E ARL
Sbjct: 445 RRRDDVADRAEATHAAARARRDELDEEAEQAAARAELADEAVHREGARL 493
>gnl|CDD|219589 pfam07808, RED_N, RED-like protein N-terminal region. This
family contains sequences that are similar to the
N-terminal region of Red protein. This and related
proteins contain a RED repeat which consists of a
number of RE and RD sequence elements. The region in
question has several conserved NLS sequences and a
putative trimeric coiled-coil region, suggesting that
these proteins are expressed in the nucleus. The
function of Red protein is unknown, but efficient
sequestration to nuclear bodies suggests that its
expression may be tightly regulated of that the protein
self-aggregates extremely efficiently.
Length = 238
Score = 29.0 bits (65), Expect = 3.9
Identities = 12/62 (19%), Positives = 25/62 (40%)
Query: 38 KKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQ 97
KKKKK +K+++ +K + + R +ER + + +L A +
Sbjct: 1 KKKKKYAYLRKQEENAEKEINPKYRDRARERRKGINKDYDPSSLAAYRAVAPDAKSDIDA 60
Query: 98 ED 99
+
Sbjct: 61 AE 62
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 28.1 bits (63), Expect = 4.0
Identities = 16/60 (26%), Positives = 32/60 (53%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQF 91
++A K++KK+ K ++K+ +K ++ + + R+ ER EE +R KE +F
Sbjct: 25 DSIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREKEGDEF 84
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 28.3 bits (64), Expect = 4.2
Identities = 15/58 (25%), Positives = 28/58 (48%), Gaps = 9/58 (15%)
Query: 26 QVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQ 83
+++ + E + K + KK+K + + + K L E +A+ AE E + AL Q
Sbjct: 62 ELEAAAPRAELQAKIARYKKEKARYRSEAKEL---------EAKAKEAEAESDHALHQ 110
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast)
and GNL3L (human) are putative GTPases which are
required for growth and play a role in processing of
nucleolar pre-rRNA. This family contains a potential
nuclear localisation signal.
Length = 80
Score = 27.2 bits (61), Expect = 4.2
Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 12/54 (22%)
Query: 37 RKKKKKKKKKKKKKKKKKK-----NLC-------ANVKKRRQEREAERAEREEE 78
RK +K+ KK K KKKK N ++++++++E E+ R+E
Sbjct: 15 RKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRKEA 68
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the
parasitophorous vacuole membrane. Members have an
initial hydrophobic, Phe/Tyr-rich, stretch long enough
to span the membrane, a highly charged region rich in
Lys, a second putative transmembrane region and a
second highly charged, low complexity sequence region.
Some members have up to 100 residues of additional
C-terminal sequence. These genes have been shown to be
found in the sub-telomeric regions of both Plasmodium
falciparum and P. yoelii chromosomes.
Length = 82
Score = 27.2 bits (61), Expect = 4.5
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 34 VEARKKKKKKKKKKKKKKKKKKN 56
V +KK KK KK +KKKKN
Sbjct: 28 VAKKKKLKKLKKIDDDLEKKKKN 50
Score = 26.8 bits (60), Expect = 5.1
Identities = 16/35 (45%), Positives = 20/35 (57%), Gaps = 4/35 (11%)
Query: 27 VQGTGSKVEARKKKKKKKKKK----KKKKKKKKNL 57
G + V A+KKK KK KK +KKKK KK +
Sbjct: 20 APGLNNNVVAKKKKLKKLKKIDDDLEKKKKNKKKI 54
>gnl|CDD|218611 pfam05499, DMAP1, DNA methyltransferase 1-associated protein 1
(DMAP1). DNA methylation can contribute to
transcriptional silencing through several
transcriptionally repressive complexes, which include
methyl-CpG binding domain proteins (MBDs) and histone
deacetylases (HDACs). The chief enzyme that maintains
mammalian DNA methylation, DNMT1, can also establish a
repressive transcription complex. The non-catalytic
amino terminus of DNMT1 binds to HDAC2 and DMAP1 (for
DNMT1 associated protein), and can mediate
transcriptional repression. DMAP1 has intrinsic
transcription repressive activity, and binds to the
transcriptional co-repressor TSG101. DMAP1 is targeted
to replication foci through interaction with the far N
terminus of DNMT1 throughout S phase, whereas HDAC2
joins DNMT1 and DMAP1 only during late S phase,
providing a platform for how histones may become
deacetylated in heterochromatin following replication.
Length = 175
Score = 28.7 bits (64), Expect = 4.5
Identities = 16/38 (42%), Positives = 21/38 (55%)
Query: 16 DNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKK 53
D Q+L+ A Q S EARK++KK KKK +K
Sbjct: 25 DLQKLITAADQQAAGFSTAEARKREKKLPKKKIPQKPG 62
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 28.5 bits (63), Expect = 4.5
Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 3/78 (3%)
Query: 30 TGSKVEARKKKKKKKKKKKKKKKKKKNLC--ANVKKRRQEREAERAEREEEMALTQRGKE 87
TG + R + + + KK KKKK + ++K RQERE E ER E ++ E
Sbjct: 95 TGVPLAIRIRDQLRSVKKDDIKKKKSLIIRQEQIEKARQERE-ELEERMEWERREEKIDE 153
Query: 88 AAQFEEWQRQEDQFHLEQ 105
EE +R+ ++ +E+
Sbjct: 154 REDQEEQEREREEQTIEE 171
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 29.2 bits (66), Expect = 4.5
Identities = 21/118 (17%), Positives = 41/118 (34%), Gaps = 18/118 (15%)
Query: 19 RLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREE- 77
R A + G + + +++ K + K ++++ ++ N ++ R ER R R
Sbjct: 571 RFFGALKALFSGGEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRR 630
Query: 78 -----------------EMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGR 118
+ R + A+ E R +D+ R R R E R
Sbjct: 631 EGRENREENRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKR 688
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number
of Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 29.2 bits (66), Expect = 4.8
Identities = 15/42 (35%), Positives = 21/42 (50%)
Query: 210 ESVAQDVASVFKGKTTAQLEALQTQIELKISSKTENIDNVIT 251
E V + + K+ AQLEAL T++ IS DN+ T
Sbjct: 84 EKVIRMLDLDLGAKSPAQLEALITKLRKNISISLAGRDNLFT 125
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 28.9 bits (65), Expect = 4.8
Identities = 10/52 (19%), Positives = 25/52 (48%)
Query: 27 VQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
V ++ + +++ K KK KK N K++ E++ ++ + ++E
Sbjct: 31 VLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKE 82
>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease. The potyviridae
family positive stand RNA viruses with genome encoding
a polyprotein. members include zucchini yellow mosaic
virus, and turnip mosaic viruses which cause
considerable losses of crops worldwide. This family
consists of a C terminus region from various plant
potyvirus P1 proteins (found at the N terminus of the
polyprotein). The C terminus of P1 is a serine-type
protease responsible for autocatalytic cleavage between
P1 and the helper component protease pfam00851. The
entire P1 protein may be involved in virus-host
interactions.
Length = 245
Score = 28.8 bits (65), Expect = 4.9
Identities = 14/56 (25%), Positives = 25/56 (44%)
Query: 28 QGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQ 83
+KV R +K+ K K++KK + + + ++RE + E EE Q
Sbjct: 1 ADLEAKVAERLLRKEMSKIKQEKKGRIILRKLSPAQVAKKREKLKREEREERQFLQ 56
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 28.9 bits (65), Expect = 5.1
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 35 EARKKKKKKKKKKKKKKKKK 54
RK++ KK+KKKK KK K
Sbjct: 355 PKRKREGDKKQKKKKSKKLK 374
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 29.2 bits (65), Expect = 5.3
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKK 55
++VE+ K KK KKKK KKK+ K
Sbjct: 116 NRVESWKDKKNKKKKSAKKKEAHK 139
>gnl|CDD|197876 smart00792, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP) is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 124
Score = 27.9 bits (62), Expect = 5.3
Identities = 8/30 (26%), Positives = 15/30 (50%)
Query: 25 MQVQGTGSKVEARKKKKKKKKKKKKKKKKK 54
+ + K+ A + +KK +KK+KK
Sbjct: 48 VGLNKKPKKISAEEAEKKLLQKKEKKALTN 77
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 28.1 bits (62), Expect = 5.3
Identities = 21/104 (20%), Positives = 60/104 (57%), Gaps = 1/104 (0%)
Query: 13 EEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
E+++ RL ++ + ++ ++ +++++++ ++K++K A ++++++ E ER
Sbjct: 48 EQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQER 107
Query: 73 AEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQE 116
++++E A R +E A+ +R++ +EQ RL R++E
Sbjct: 108 IQKQKEEAEA-RAREEAERMRLEREKHFQQIEQERLERKKRLEE 150
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many
hypothetical proteins.
Length = 98
Score = 27.4 bits (61), Expect = 5.5
Identities = 15/70 (21%), Positives = 28/70 (40%)
Query: 12 SEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAE 71
+ D + + + + + + K KK +KK KKK + KRR+ +
Sbjct: 10 DKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRTEDGL 69
Query: 72 RAEREEEMAL 81
EEE+ +
Sbjct: 70 PIYTEEELKI 79
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein
7 (Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal
RNA recognition motif (RRM), also termed RBD (RNA
binding domain) or RNP (ribonucleoprotein domain), and
a C-terminal RRP7 domain.
Length = 118
Score = 27.6 bits (62), Expect = 5.7
Identities = 12/22 (54%), Positives = 18/22 (81%)
Query: 35 EARKKKKKKKKKKKKKKKKKKN 56
EA + K K+K+KKKKKKK+ ++
Sbjct: 69 EAVEAKAKEKEKKKKKKKELED 90
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 28.9 bits (65), Expect = 5.8
Identities = 18/55 (32%), Positives = 24/55 (43%), Gaps = 7/55 (12%)
Query: 229 EALQTQIELK-ISSKTENID---NVITAEEKALQREARK---GMGGDEAEFSVES 276
+TQ E K I + ID N I E++ LQ EA + M D E + E
Sbjct: 174 SEERTQDETKMIEEYRDKIDALKNAIEVEKEGLQEEAIQEIADMSADILEHAAEE 228
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 28.5 bits (64), Expect = 5.9
Identities = 9/21 (42%), Positives = 12/21 (57%)
Query: 38 KKKKKKKKKKKKKKKKKKNLC 58
KK + KKK K KK + +L
Sbjct: 156 KKPRSKKKSSKNSKKLRIDLV 176
Score = 28.5 bits (64), Expect = 6.7
Identities = 10/22 (45%), Positives = 13/22 (59%)
Query: 34 VEARKKKKKKKKKKKKKKKKKK 55
+ A KKK + KKK K KK +
Sbjct: 151 ILAEKKKPRSKKKSSKNSKKLR 172
>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII. Eukaryotic DNA topoisomerase
II, GyrB, ParE.
Length = 594
Score = 28.7 bits (65), Expect = 6.0
Identities = 8/24 (33%), Positives = 13/24 (54%)
Query: 34 VEARKKKKKKKKKKKKKKKKKKNL 57
+AR KK ++ +KKK +L
Sbjct: 344 AKARAAAKKARELTRKKKLSSISL 367
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 28.7 bits (63), Expect = 6.1
Identities = 22/103 (21%), Positives = 56/103 (54%), Gaps = 1/103 (0%)
Query: 14 EDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERA 73
E + Q+ +++ + K E K+K +++K+K +++K+K + AN E+E ++
Sbjct: 159 EQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKT 218
Query: 74 EREEEMALTQRGKEAAQFEE-WQRQEDQFHLEQARLRSTIRIQ 115
E E++ + ++ + E+ Q +QF +++ +++ I I+
Sbjct: 219 ENEKQDLIKEQKDFIKEAEQNCQENHNQFFIKKLGIKAGIAIE 261
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 27.6 bits (62), Expect = 6.2
Identities = 18/46 (39%), Positives = 29/46 (63%), Gaps = 5/46 (10%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREE 77
S+ +A ++++K K+KKKKKKK+ +N R Q RE ++ E E
Sbjct: 67 SRNKAAEERRKLKEKKKKKKKELENF-----YRFQIREKKKEELAE 107
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 28.8 bits (65), Expect = 6.3
Identities = 7/31 (22%), Positives = 18/31 (58%)
Query: 36 ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQ 66
K+ K++K+ +K KK+++ ++K +
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986). This
family of proteins has no known function.
Length = 44
Score = 25.7 bits (57), Expect = 6.3
Identities = 18/42 (42%), Positives = 22/42 (52%), Gaps = 2/42 (4%)
Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
RKKK +K KKK KK K K + +AERA+ E E
Sbjct: 3 RKKKINQKLKKKAKKANAKL--HPSNKPKYISKAERAKLEAE 42
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 28.5 bits (64), Expect = 6.3
Identities = 26/144 (18%), Positives = 52/144 (36%), Gaps = 10/144 (6%)
Query: 1 MSRLEPGTLVYSEED--DNQRL-VYARMQVQG----TGSKVEARKKKKKKKKKKKKKKKK 53
R P L+ S ED + RL +Y + K+ + + ++ +
Sbjct: 127 SGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAE 186
Query: 54 KKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQ--FEEWQRQEDQFHLEQARLRST 111
L + + ++ + ER++ +A A Q EE + E + E A +
Sbjct: 187 LTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAA 246
Query: 112 I-RIQEGRAKPIDLLAKYIGSEAE 134
+ +E A A+ +EA+
Sbjct: 247 AAKAREAAAAAEAAAARARAAEAK 270
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 29.1 bits (65), Expect = 6.4
Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 2/46 (4%)
Query: 35 EARKKKKKKKKKKKKK--KKKKKNLCANVKKRRQEREAERAEREEE 78
EAR+K ++ + + KK + ++ ++A +E EE
Sbjct: 1177 EAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEE 1222
>gnl|CDD|148523 pfam06952, PsiA, PsiA protein. This family consists of several
Enterobacterial PsiA proteins. The function of PsiA is
unknown although it is thought that it may affect the
generation of an SOS signal in Escherichia coli.
Length = 238
Score = 28.2 bits (63), Expect = 6.4
Identities = 8/34 (23%), Positives = 19/34 (55%)
Query: 50 KKKKKKNLCANVKKRRQEREAERAEREEEMALTQ 83
++++K+++ K R++ RE E+ + L Q
Sbjct: 117 RRRQKQDIAFTRKMRQEAREVEQRWLLYQNLLAQ 150
>gnl|CDD|235825 PRK06547, PRK06547, hypothetical protein; Provisional.
Length = 172
Score = 28.2 bits (63), Expect = 6.5
Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 4/33 (12%)
Query: 69 EAERAEREEEMALTQRGKEAAQFEEWQRQEDQF 101
EA R ER AL + A +E W QE++
Sbjct: 128 EALRKER----ALARDPDYAPHWEMWAAQEERH 156
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 28.8 bits (64), Expect = 6.6
Identities = 21/79 (26%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 22 YARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMAL 81
Y R+Q Q + A+K ++++KKK+++ ++ K A ++R ++ E ER + +E+
Sbjct: 64 YGRIQSQQS----SAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQ 119
Query: 82 TQRGKEAAQFEEWQRQEDQ 100
+ ++ AQ E+ Q++E
Sbjct: 120 AEEAEKQAQLEQKQQEEQA 138
>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
Provisional.
Length = 1109
Score = 28.9 bits (65), Expect = 6.6
Identities = 13/32 (40%), Positives = 18/32 (56%)
Query: 62 KKRRQEREAERAEREEEMALTQRGKEAAQFEE 93
K+RR E A+RA EEE + + A + EE
Sbjct: 737 KQRRAEILAQRARGEEEAHHSSSPEGAIEVEE 768
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 28.3 bits (63), Expect = 6.6
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 34 VEARKKKKKKKKKKKKKKKKKKN 56
VE +KK K KK K K KK + K
Sbjct: 89 VEKKKKGKSKKNKLKGKKDEDKK 111
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 28.8 bits (65), Expect = 7.0
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 38 KKKKKKKKKKKKKKKKKKNLC 58
K + ++ K KKK KKK+ C
Sbjct: 785 KLRSSQRDKLKKKLLKKKDFC 805
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 28.8 bits (64), Expect = 7.2
Identities = 19/110 (17%), Positives = 39/110 (35%), Gaps = 9/110 (8%)
Query: 22 YARMQVQGTGSKVEAR---------KKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
Y RM + +G V K + K+K K+K K + E E
Sbjct: 767 YPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCT 826
Query: 73 AEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPI 122
++ EE + + ++ + ++ + + R I++ R P+
Sbjct: 827 IKQREEKGIDAPAILNVKKKKPYKVDNMTRILPQQSRYISFIKDDRFVPV 876
>gnl|CDD|236413 PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated.
Length = 367
Score = 28.4 bits (64), Expect = 7.4
Identities = 14/43 (32%), Positives = 18/43 (41%)
Query: 40 KKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALT 82
KK KKK+ ++ K L KKR E AE+ L
Sbjct: 1 KKTKKKETTTLEQVVKELIEKGKKRGTLTYDEIAEKLIPFELD 43
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 28.6 bits (64), Expect = 7.5
Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 8/93 (8%)
Query: 15 DDNQRLVYA----RMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA 70
DDN + A +V+ + K+++ K+ KK + K L + EA
Sbjct: 340 DDNDLTLLALTKLYEEVKSNT-DLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEA 398
Query: 71 ERAEREEEMALTQ---RGKEAAQFEEWQRQEDQ 100
E +EE++ + + KE A ++ +RQ+D+
Sbjct: 399 EAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4;
Provisional.
Length = 191
Score = 27.9 bits (62), Expect = 7.6
Identities = 15/39 (38%), Positives = 18/39 (46%)
Query: 40 KKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
KK KKKK +KKK KK+ +E E A E
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELS 86
Score = 27.9 bits (62), Expect = 9.1
Identities = 16/45 (35%), Positives = 20/45 (44%)
Query: 34 VEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
VE KK KKKK +KKK KK ++E AE +
Sbjct: 43 VEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSD 87
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 28.5 bits (62), Expect = 7.7
Identities = 29/100 (29%), Positives = 51/100 (51%), Gaps = 7/100 (7%)
Query: 15 DDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKK------RRQER 68
+D ++ +A +++ +VE KK +KKK +K+ K + N KK RR E
Sbjct: 48 NDQRKEAFASLELTEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEA 107
Query: 69 EA-ERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQAR 107
EA +RA +EEE + +E Q E+ +++ D L+ +
Sbjct: 108 EAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEK 147
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 28.4 bits (64), Expect = 7.7
Identities = 20/78 (25%), Positives = 31/78 (39%), Gaps = 5/78 (6%)
Query: 19 RLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCA---NVKKRRQEREAERAER 75
+ + + E K+ ++ KK+ + KK+ L V K R E E E ER
Sbjct: 15 GYLLRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKER 74
Query: 76 EEEMALTQRGKEAAQFEE 93
E L + + Q EE
Sbjct: 75 RNE--LQRLERRLLQREE 90
>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
Length = 405
Score = 28.2 bits (63), Expect = 7.9
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKR 64
EA + K KK+K KK+K+ K A +K
Sbjct: 348 EAARVKAKKEKLAKKRKQVSKEEAAAIKAA 377
>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 215
Score = 28.0 bits (62), Expect = 8.3
Identities = 17/60 (28%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 41 KKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQ 100
KK+ K ++ KK + + + +E +AE+ ER+++++ Q K+AA +E++ Q Q
Sbjct: 58 KKRAKVQRLAKKSRVDAREAYRIAAEENKAEQLERDKQLSEQQ--KQAALAKEYKAQVKQ 115
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region
covers both the Brf homology II and III regions. This
region is involved in binding TATA binding protein.
Length = 95
Score = 26.8 bits (60), Expect = 8.4
Identities = 15/54 (27%), Positives = 22/54 (40%), Gaps = 6/54 (11%)
Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRG 85
K +K + KKKKK+K K KKR + A A + L ++
Sbjct: 34 EKELKQKADEGNNSGKKKKKRKAK------KKRDEAGPASTAAEAVKKMLQKKK 81
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 28.0 bits (63), Expect = 8.5
Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 10 VYSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCAN 60
++ +E + + + E K+KKKKKKKKKKK++ ++ +C
Sbjct: 60 IFVQEKEKLNDEHGE-LYERADRAEELLKEKKKKKKKKKKKEELREWMCEK 109
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 28.1 bits (63), Expect = 8.6
Identities = 15/86 (17%), Positives = 37/86 (43%), Gaps = 1/86 (1%)
Query: 9 LVYSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKK-KKKKKKKKNLCANVKKRRQE 67
++ +++D + L + ++ + A + + + + +K +K L A + +
Sbjct: 149 ILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEAS 208
Query: 68 REAERAEREEEMALTQRGKEAAQFEE 93
E+A EE+ AL + A +E
Sbjct: 209 ALGEKAALEEQKALAEAAAAEAAKQE 234
>gnl|CDD|222790 PHA00430, PHA00430, tail fiber protein.
Length = 568
Score = 28.3 bits (63), Expect = 8.8
Identities = 12/46 (26%), Positives = 21/46 (45%)
Query: 65 RQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRS 110
R EA+RA + E + G A ++W+ + D + E R +
Sbjct: 170 RSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSEANRFKG 215
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 27.6 bits (62), Expect = 9.0
Identities = 14/38 (36%), Positives = 27/38 (71%)
Query: 36 ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERA 73
+KKK KK+++ KK KKK ++L A ++ R++ + ++A
Sbjct: 122 GKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKA 159
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 28.2 bits (63), Expect = 9.7
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 28 QGTGSKVEARKKKKKKKKKKKKKKKKKKN 56
+ +K +A K KKK K K KK KK+
Sbjct: 756 KSASTKGKAAKTVKKKSKAKSKKTTKKRA 784
>gnl|CDD|218413 pfam05069, Phage_tail_S, Phage virion morphogenesis family.
Protein S of phage P2 is thought to be involved in tail
completion and stable head joining.
Length = 145
Score = 27.3 bits (61), Expect = 9.8
Identities = 8/30 (26%), Positives = 12/30 (40%)
Query: 24 RMQVQGTGSKVEARKKKKKKKKKKKKKKKK 53
Q GS RK + KK + ++ K
Sbjct: 42 AAQQAPDGSPWAPRKPTLRAKKGRIRRGGK 71
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.132 0.378
Gapped
Lambda K H
0.267 0.0708 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,182,744
Number of extensions: 2125889
Number of successful extensions: 10595
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7197
Number of HSP's successfully gapped: 795
Length of query: 409
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 310
Effective length of database: 6,546,556
Effective search space: 2029432360
Effective search space used: 2029432360
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 60 (26.9 bits)