RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3368
(169 letters)
>gnl|CDD|215741 pfam00135, COesterase, Carboxylesterase family.
Length = 510
Score = 117 bits (296), Expect = 1e-31
Identities = 43/111 (38%), Positives = 60/111 (54%), Gaps = 21/111 (18%)
Query: 29 IIELKQGRLRGVVRSPVHNGNLNNIHMYLGIPYAAPPVGQLRFMAPQSPQPWPGLMIADS 88
++E G++RG+ V G ++ +LGIPYA PPVG+LRF PQ P+PW G++ A
Sbjct: 2 VVETSYGKVRGLRV-TVDGGP---VYAFLGIPYAKPPVGELRFKKPQPPEPWTGVLDATK 57
Query: 89 FSPVCPQKLPNLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTP 139
+ P CPQ ++D +M SEDCLYLN+YTP
Sbjct: 58 YGPACPQ------NNDLGSEMWNKNT-----------GMSEDCLYLNVYTP 91
>gnl|CDD|238191 cd00312, Esterase_lipase, Esterases and lipases (includes fungal
lipases, cholinesterases, etc.) These enzymes act on
carboxylic esters (EC: 3.1.1.-). The catalytic apparatus
involves three residues (catalytic triad): a serine, a
glutamate or aspartate and a histidine.These catalytic
residues are responsible for the nucleophilic attack on
the carbonyl carbon atom of the ester bond. In contrast
with other alpha/beta hydrolase fold family members,
p-nitrobenzyl esterase and acetylcholine esterase have a
Glu instead of Asp at the active site carboxylate.
Length = 493
Score = 98.6 bits (246), Expect = 1e-24
Identities = 40/111 (36%), Positives = 53/111 (47%), Gaps = 25/111 (22%)
Query: 29 IIELKQGRLRGVVRSPVHNGNLNNIHMYLGIPYAAPPVGQLRFMAPQSPQPWPGLMIADS 88
++ G++RGV V+ +LGIPYA PPVG LRF PQ +PW ++ A S
Sbjct: 1 LVVTPNGKVRGVDEGGVY--------SFLGIPYAEPPVGDLRFKEPQPYEPWSDVLDATS 52
Query: 89 FSPVCPQKLPNLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTP 139
+ P C Q D + +L SEDCLYLN+YTP
Sbjct: 53 YPPSCMQ------WDQLGGGLWNAKLP-----------GSEDCLYLNVYTP 86
>gnl|CDD|225181 COG2272, PnbA, Carboxylesterase type B [Lipid metabolism].
Length = 491
Score = 97.1 bits (242), Expect = 4e-24
Identities = 45/120 (37%), Positives = 56/120 (46%), Gaps = 29/120 (24%)
Query: 29 IIELKQGRLRGVVRSPVHNGNLNNIHMYLGIPYAAPPVGQLRFMAPQSPQPWPGLMIADS 88
+ E G++ G+ N +H +LGIPYAAPPVG+LRF P P+PW G+ A
Sbjct: 4 VAETTTGKVEGITV--------NGVHSWLGIPYAAPPVGELRFRRPVPPEPWSGVRDATQ 55
Query: 89 FSPVCPQKLPNLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKK 148
F P CPQ +M G SEDCLYLNI+ P E EK
Sbjct: 56 FGPACPQ---------PFNRMGSGEDF----------TGSEDCLYLNIWAP--EVPAEKL 94
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 36.7 bits (85), Expect = 0.001
Identities = 15/28 (53%), Positives = 20/28 (71%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEE 166
EE+ +E+ EEKKKEE+K+EE EE
Sbjct: 70 AAAAAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 30.9 bits (70), Expect = 0.10
Identities = 12/23 (52%), Positives = 17/23 (73%)
Query: 145 EEKKEEREEKKKEEKKKEEGEEK 167
EE+ E+K+EEKKKEE +E+
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEE 93
Score = 27.1 bits (60), Expect = 2.1
Identities = 12/28 (42%), Positives = 18/28 (64%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEE 166
P +E+ EEK++E+KK+EE EE
Sbjct: 66 PAAAAAAAAEEKAEEKEEEKKKEEEKEE 93
Score = 26.3 bits (58), Expect = 4.7
Identities = 13/23 (56%), Positives = 19/23 (82%), Gaps = 1/23 (4%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEG 164
E+EEEKK+E EEK++EE++ G
Sbjct: 80 EKEEEKKKE-EEKEEEEEEALAG 101
Score = 25.5 bits (56), Expect = 7.7
Identities = 8/24 (33%), Positives = 20/24 (83%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGE 165
+ EE+++E+++E++KEE+++E
Sbjct: 77 KAEEKEEEKKKEEEKEEEEEEALA 100
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 36.1 bits (83), Expect = 0.005
Identities = 15/26 (57%), Positives = 24/26 (92%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEE 166
+EEEEE++EE EE+++EE+++EE EE
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 35.7 bits (82), Expect = 0.009
Identities = 15/29 (51%), Positives = 26/29 (89%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EEEEE++EE EE+++EE+++EE EE ++
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 32.3 bits (73), Expect = 0.12
Identities = 13/28 (46%), Positives = 25/28 (89%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ EEE++EE EE+++EE+++EE EE+++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 28.4 bits (63), Expect = 2.2
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EEEEE++EE EE++ EE E E ++
Sbjct: 875 EEEEEEEEEEEEEEENEEPLSLEWPETRQ 903
Score = 27.3 bits (60), Expect = 4.7
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EEEEE++EE EE+ +E E E ++K
Sbjct: 876 EEEEEEEEEEEEEENEEPLSLEWPETRQK 904
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 35.9 bits (82), Expect = 0.007
Identities = 15/30 (50%), Positives = 20/30 (66%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ EE KK E ++KK EE KK E +EKK
Sbjct: 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690
Score = 35.1 bits (80), Expect = 0.016
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ EE KK E +EKK E K+E EE KK
Sbjct: 1675 KKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703
Score = 33.6 bits (76), Expect = 0.050
Identities = 14/26 (53%), Positives = 20/26 (76%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKK 168
EE +K+ E ++KK EE KK+E E+KK
Sbjct: 1733 EEAKKEAEEDKKKAEEAKKDEEEKKK 1758
Score = 32.8 bits (74), Expect = 0.092
Identities = 17/29 (58%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ E KKE E KK EE KK+E EEKKK
Sbjct: 1689 KAAEALKKEAEEAKKAEELKKKEAEEKKK 1717
Score = 30.5 bits (68), Expect = 0.46
Identities = 17/26 (65%), Positives = 20/26 (76%)
Query: 144 EEEKKEEREEKKKEEKKKEEGEEKKK 169
EE KK E E+KK E+ KK+E EEKKK
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEEKKK 1648
Score = 30.1 bits (67), Expect = 0.58
Identities = 14/29 (48%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ EEE K + EE KK E+ K++ EE KK
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682
Score = 29.7 bits (66), Expect = 0.79
Identities = 14/30 (46%), Positives = 19/30 (63%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L++ KK+ E KKK E+KK+ E KKK
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439
Score = 29.7 bits (66), Expect = 0.93
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ EE+KK+ E KK KK+ E KKK
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKK 1426
Score = 29.3 bits (65), Expect = 1.0
Identities = 16/44 (36%), Positives = 23/44 (52%)
Query: 125 KNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
+ + E+ + L + EE KK E + K EE KK E E+KK
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634
Score = 29.3 bits (65), Expect = 1.1
Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 4/34 (11%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEE----GEEKKK 169
++ E KK+ E KK EE KK + EE KK
Sbjct: 1502 AKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535
Score = 29.3 bits (65), Expect = 1.1
Identities = 15/30 (50%), Positives = 23/30 (76%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L++ EEEKK+ + KKKE ++K++ EE KK
Sbjct: 1625 LKKAEEEKKKVEQLKKKEAEEKKKAEELKK 1654
Score = 29.3 bits (65), Expect = 1.2
Identities = 15/30 (50%), Positives = 22/30 (73%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L++E EE K+ E KKKE ++K++ EE KK
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723
Score = 29.0 bits (64), Expect = 1.3
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
++EE +KK + +KK EEKKK + +KK
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
Score = 29.0 bits (64), Expect = 1.4
Identities = 15/27 (55%), Positives = 18/27 (66%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE +K EE + K EE KKE E+KKK
Sbjct: 1719 EELKKAEEENKIKAEEAKKEAEEDKKK 1745
Score = 29.0 bits (64), Expect = 1.4
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ +E KK+ E KKK + K++ EE KK
Sbjct: 1315 KKADEAKKKAEEAKKKADAAKKKAEEAKK 1343
Score = 29.0 bits (64), Expect = 1.5
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE +KKE E+KK EE KK E E K K
Sbjct: 1705 EELKKKEAEEKKKAEELKKAEEENKIK 1731
Score = 28.6 bits (63), Expect = 2.1
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
E E+K E EKKKEE KK+ KKK
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKK 1386
Score = 28.6 bits (63), Expect = 2.3
Identities = 12/29 (41%), Positives = 22/29 (75%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ +E K++ EEKKK ++ K++ EE KK
Sbjct: 1417 KKKADEAKKKAEEKKKADEAKKKAEEAKK 1445
Score = 28.6 bits (63), Expect = 2.3
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ EEKK+ E KKK E+ K+ E KKK
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKK 1452
Score = 28.6 bits (63), Expect = 2.3
Identities = 14/27 (51%), Positives = 18/27 (66%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
E EEK E E+KK+E KKK + +KK
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKK 1386
Score = 28.2 bits (62), Expect = 2.4
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE +K EE + K E+ K+ E+KKK
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKK 1676
Score = 28.2 bits (62), Expect = 2.4
Identities = 13/30 (43%), Positives = 19/30 (63%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ EE K+ E KKK E+ K+ E KKK
Sbjct: 1462 AKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491
Score = 28.2 bits (62), Expect = 2.6
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ +E KK+ E KKK ++ K+ E KKK
Sbjct: 1483 KKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
Score = 28.2 bits (62), Expect = 2.7
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKK 168
+E +KK E ++KK +E KK +KK
Sbjct: 1394 DEAKKKAEEDKKKADELKKAAAAKKK 1419
Score = 28.2 bits (62), Expect = 2.8
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ +E KK+ E+KKK ++ K+ KKK
Sbjct: 1391 KKADEAKKKAEEDKKKADELKKAAAAKKK 1419
Score = 28.2 bits (62), Expect = 2.8
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 145 EEKKEEREEKKKEEKKKEEGEEKKK 169
+E K++ EE KK ++ K++ EE KK
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEEAKK 1458
Score = 28.2 bits (62), Expect = 3.0
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ EEKK+ E KKK E+ K++ +E KK
Sbjct: 1383 AKKKAEEKKKADEAKKKAEEDKKKADELKK 1412
Score = 28.2 bits (62), Expect = 3.1
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ EE K+ E KKK E+ K++ +E KK
Sbjct: 1475 AKKKAEEAKKADEAKKKAEEAKKKADEAKK 1504
Score = 27.8 bits (61), Expect = 3.4
Identities = 11/29 (37%), Positives = 20/29 (68%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ +E K+ KKK ++ K++ EEKKK
Sbjct: 1404 KKKADELKKAAAAKKKADEAKKKAEEKKK 1432
Score = 27.8 bits (61), Expect = 3.5
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ EEE K + E KK+ E+ K++ EE KK
Sbjct: 1723 KAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751
Score = 27.8 bits (61), Expect = 3.6
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ EE KK + +KK EE KK+ KKK
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKK 1337
Score = 27.8 bits (61), Expect = 3.6
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ +E KK+ E KK +E KK+ E KKK
Sbjct: 1470 KKADEAKKKAEEAKKADEAKKKAEEAKKK 1498
Score = 27.8 bits (61), Expect = 3.6
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ EE K+ E KKK E+ K++ + KK
Sbjct: 1307 AKKKAEEAKKADEAKKKAEEAKKKADAAKK 1336
Score = 27.8 bits (61), Expect = 3.6
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ E + E E+K E +K++ E KKK
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Score = 27.8 bits (61), Expect = 3.8
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 144 EEEKKEEREEKKKEEKKKEEGEEKKK 169
+E KK+ E KK +E KK+ E KKK
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKK 1330
Score = 27.8 bits (61), Expect = 4.0
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
++ EE KK + +KK EE KK E +KK
Sbjct: 1438 KKAEEAKKADEAKKKAEEAKKAEEAKKK 1465
Score = 27.8 bits (61), Expect = 4.1
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ EE KK+ KKK E+ K+ E K
Sbjct: 1322 KKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350
Score = 27.4 bits (60), Expect = 4.4
Identities = 13/30 (43%), Positives = 19/30 (63%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ EE K+ E KKK E+ K+ E KKK
Sbjct: 1449 AKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478
Score = 27.4 bits (60), Expect = 4.5
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
++ EE+KK + +KK EE KK + +KK
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKK 1323
Score = 27.4 bits (60), Expect = 4.5
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ + KK+ E+KK +E KK+ E+KKK
Sbjct: 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKK 1406
Score = 27.4 bits (60), Expect = 4.6
Identities = 12/25 (48%), Positives = 18/25 (72%)
Query: 145 EEKKEEREEKKKEEKKKEEGEEKKK 169
EE K++ EE KK ++ K++ EE KK
Sbjct: 1460 EEAKKKAEEAKKADEAKKKAEEAKK 1484
Score = 27.4 bits (60), Expect = 4.8
Identities = 13/28 (46%), Positives = 17/28 (60%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
++ EE KK+ E KK E KK+ E KK
Sbjct: 1490 KKAEEAKKKADEAKKAAEAKKKADEAKK 1517
Score = 27.4 bits (60), Expect = 4.9
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ EE KK + +KK EE KK+ E KK
Sbjct: 1477 KKAEEAKKADEAKKKAEEAKKKADEAKKA 1505
Score = 27.4 bits (60), Expect = 5.2
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ EE+KK + +KK EE KK+ E KK
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKA 1413
Score = 27.0 bits (59), Expect = 5.9
Identities = 13/30 (43%), Positives = 19/30 (63%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ EE K+ E KKK E+ K+ E KKK
Sbjct: 1436 AKKKAEEAKKADEAKKKAEEAKKAEEAKKK 1465
Score = 27.0 bits (59), Expect = 6.0
Identities = 9/29 (31%), Positives = 13/29 (44%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ + KK+ E KK E K E E
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357
Score = 27.0 bits (59), Expect = 6.0
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E +KK E +KK + KK+ E KK
Sbjct: 1318 DEAKKKAEEAKKKADAAKKKAEEAKKA 1344
Score = 27.0 bits (59), Expect = 6.4
Identities = 11/27 (40%), Positives = 15/27 (55%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE +KK + +KK EE KK K +
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAE 1351
Score = 26.6 bits (58), Expect = 7.5
Identities = 8/29 (27%), Positives = 14/29 (48%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ EE KK K + E +E E ++
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEE 1364
Score = 26.6 bits (58), Expect = 8.1
Identities = 10/34 (29%), Positives = 16/34 (47%), Gaps = 4/34 (11%)
Query: 140 LQEEEEEKKEEREEKKKEEKKK----EEGEEKKK 169
+++ EE K+ E K E + E EEK +
Sbjct: 1334 AKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
Score = 26.6 bits (58), Expect = 8.5
Identities = 15/34 (44%), Positives = 23/34 (67%), Gaps = 4/34 (11%)
Query: 140 LQEEEEEKKEER----EEKKKEEKKKEEGEEKKK 169
L++ EE+KK + EEKKK ++ K++ EE KK
Sbjct: 1283 LKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKK 1316
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 34.2 bits (79), Expect = 0.008
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEE 166
++EE EE+++EE+++E EE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
Score = 31.9 bits (73), Expect = 0.042
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGE 165
E++EE EE+++EE+++ E E
Sbjct: 69 AAAAAAAEEEEEEEEEEEEEEEESEEE 95
Score = 31.2 bits (71), Expect = 0.086
Identities = 11/23 (47%), Positives = 18/23 (78%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEG 164
EEE++EE EE+++EE+ +EE
Sbjct: 74 AAEEEEEEEEEEEEEEEESEEEA 96
Score = 30.0 bits (68), Expect = 0.21
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGE 165
EEEE++EE EE+++EE ++E
Sbjct: 74 AAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 28.5 bits (64), Expect = 0.81
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEG 164
+EEEEE++EE EE+++ E++ G
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEAMAG 99
Score = 26.9 bits (60), Expect = 2.6
Identities = 8/20 (40%), Positives = 15/20 (75%)
Query: 149 EEREEKKKEEKKKEEGEEKK 168
EE+++EE+++EE EE+
Sbjct: 73 AAAEEEEEEEEEEEEEEEES 92
Score = 26.5 bits (59), Expect = 4.2
Identities = 7/21 (33%), Positives = 15/21 (71%)
Query: 149 EEREEKKKEEKKKEEGEEKKK 169
E+++EE+++EE EE++
Sbjct: 72 AAAAEEEEEEEEEEEEEEEES 92
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 35.7 bits (83), Expect = 0.008
Identities = 13/29 (44%), Positives = 23/29 (79%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ EE+K+E+++K KKKEE EE++K
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEK 448
Score = 32.2 bits (74), Expect = 0.12
Identities = 10/30 (33%), Positives = 24/30 (80%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EEE+++K+++ K+++++EE E++KK
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Score = 31.8 bits (73), Expect = 0.14
Identities = 12/30 (40%), Positives = 23/30 (76%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ E+ +++REE+KKE+KKK +KK+
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKE 441
Score = 31.4 bits (72), Expect = 0.20
Identities = 13/30 (43%), Positives = 24/30 (80%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EE++EKK++ KK+E+++EE +EKK+
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452
Score = 31.4 bits (72), Expect = 0.20
Identities = 9/30 (30%), Positives = 22/30 (73%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ E++ ++E++E+KKK K++ EE+++
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEE 446
Score = 31.0 bits (71), Expect = 0.24
Identities = 13/29 (44%), Positives = 23/29 (79%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ EEEKKE++++ +KK+EE EE+K+
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKE 449
Score = 30.7 bits (70), Expect = 0.34
Identities = 14/28 (50%), Positives = 25/28 (89%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EEEEEK+++ EEK++EE++ EE +E+++
Sbjct: 443 EEEEEKEKKEEEKEEEEEEAEEEKEEEE 470
Score = 30.7 bits (70), Expect = 0.36
Identities = 10/28 (35%), Positives = 20/28 (71%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E+ +K+ EEKK+++KK G++K++
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEE 442
Score = 30.7 bits (70), Expect = 0.36
Identities = 15/27 (55%), Positives = 24/27 (88%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
E++EE+KEE EE+ +EEK++EE ++KK
Sbjct: 449 EKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 30.7 bits (70), Expect = 0.41
Identities = 13/28 (46%), Positives = 23/28 (82%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E++E+KK+ KKKEE+++EE E+K++
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEE 453
Score = 30.3 bits (69), Expect = 0.51
Identities = 19/32 (59%), Positives = 26/32 (81%), Gaps = 3/32 (9%)
Query: 141 QEEEEEKKEER---EEKKKEEKKKEEGEEKKK 169
+EEE+EKKEE EE++ EE+K+EE E+KKK
Sbjct: 444 EEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 29.9 bits (68), Expect = 0.55
Identities = 12/28 (42%), Positives = 22/28 (78%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+KKEE EE++KE+K++E+ EE+++
Sbjct: 434 AFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Score = 29.9 bits (68), Expect = 0.56
Identities = 8/29 (27%), Positives = 24/29 (82%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E++++ ++++++EE++KE+ EE+K+
Sbjct: 428 KEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456
Score = 29.9 bits (68), Expect = 0.67
Identities = 9/28 (32%), Positives = 19/28 (67%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E+ EKK E E+K+K++K +++++
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEE 443
Score = 29.5 bits (67), Expect = 0.74
Identities = 14/28 (50%), Positives = 26/28 (92%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EEEEE+++E++E++KEE+++E EEK++
Sbjct: 441 EEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Score = 29.5 bits (67), Expect = 0.77
Identities = 14/28 (50%), Positives = 26/28 (92%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EEEEE+KE++EE+K+EE+++ E E++++
Sbjct: 442 EEEEEEKEKKEEEKEEEEEEAEEEKEEE 469
Score = 28.7 bits (65), Expect = 1.4
Identities = 22/74 (29%), Positives = 32/74 (43%), Gaps = 17/74 (22%)
Query: 109 MSKGRLQYYQALMPYLKNQSED----CLYLNIYTPLQEEEEE---------KKEEREEKK 155
K R + L K+ E +L L EEE E KK ++ +K
Sbjct: 363 KRKVRREVLPFLSIIFKHNPELAARLAAFLE----LTEEEIEFLTGSKKATKKIKKIVEK 418
Query: 156 KEEKKKEEGEEKKK 169
E+K++EE +EKKK
Sbjct: 419 AEKKREEEKKEKKK 432
Score = 28.7 bits (65), Expect = 1.4
Identities = 12/28 (42%), Positives = 23/28 (82%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++++EE EEK+K+E++KEE EE+ +
Sbjct: 436 AGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Score = 28.7 bits (65), Expect = 1.5
Identities = 8/28 (28%), Positives = 23/28 (82%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++E++KK +KK+EE+++E+ +++++
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Score = 28.7 bits (65), Expect = 1.7
Identities = 13/28 (46%), Positives = 25/28 (89%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++EE++EE +EKK+EEK++EE E +++
Sbjct: 438 KKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Score = 27.2 bits (61), Expect = 4.5
Identities = 9/28 (32%), Positives = 22/28 (78%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ K++ EE+++E++KKEE +E+++
Sbjct: 432 KKAFAGKKKEEEEEEEKEKKEEEKEEEE 459
Score = 27.2 bits (61), Expect = 5.3
Identities = 8/28 (28%), Positives = 23/28 (82%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ +++EE+++EEK+K+E E++++
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEE 458
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 33.8 bits (78), Expect = 0.011
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEE 166
EEKKEE EE++++E+ +EE
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 33.0 bits (76), Expect = 0.021
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEE 166
EE++E+++EE++KEE EE
Sbjct: 68 AAAAAAAAAAEEKKEEEEEEEEKEESEE 95
Score = 29.2 bits (66), Expect = 0.45
Identities = 10/23 (43%), Positives = 18/23 (78%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEG 164
EE++E++EE EEK++ E++ G
Sbjct: 78 EEKKEEEEEEEEKEESEEEAAAG 100
Score = 26.8 bits (60), Expect = 2.5
Identities = 8/20 (40%), Positives = 14/20 (70%)
Query: 149 EEREEKKKEEKKKEEGEEKK 168
E+KKEE+++EE +E+
Sbjct: 74 AAAAEEKKEEEEEEEEKEES 93
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 34.5 bits (80), Expect = 0.013
Identities = 14/40 (35%), Positives = 24/40 (60%)
Query: 129 EDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
C + IY +EE+E+KK +E+KK K+++E E+
Sbjct: 83 SKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 30.7 bits (70), Expect = 0.26
Identities = 13/33 (39%), Positives = 23/33 (69%)
Query: 137 YTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+TP+ + EE+KE+++ KEEKK + E++K
Sbjct: 87 FTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKL 119
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 33.7 bits (78), Expect = 0.025
Identities = 13/40 (32%), Positives = 24/40 (60%)
Query: 129 EDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
E C + IY + E+E+KK +E+KK K++++ E+
Sbjct: 84 EKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 33.8 bits (78), Expect = 0.029
Identities = 13/28 (46%), Positives = 20/28 (71%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EEE++EE +EKK+E+KK+E + K
Sbjct: 275 AAEEERQEEAQEKKEEKKKEEREAKLAK 302
Score = 33.4 bits (77), Expect = 0.038
Identities = 14/33 (42%), Positives = 24/33 (72%), Gaps = 4/33 (12%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEG----EEKKK 169
+E +EE +E++EEKKKEE++ + EE++K
Sbjct: 278 EERQEEAQEKKEEKKKEEREAKLAKLSPEEQRK 310
Score = 29.9 bits (68), Expect = 0.51
Identities = 16/36 (44%), Positives = 23/36 (63%), Gaps = 6/36 (16%)
Query: 140 LQEEEEEKKEEREEKK------KEEKKKEEGEEKKK 169
QE++EEKK+E E K +E++K EE E KK+
Sbjct: 284 AQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQ 319
Score = 28.4 bits (64), Expect = 1.7
Identities = 14/30 (46%), Positives = 23/30 (76%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EEEEEK + E++++E+ +E+ EEKKK
Sbjct: 264 TREEEEEKILKAAEEERQEEAQEKKEEKKK 293
Score = 28.4 bits (64), Expect = 2.0
Identities = 13/37 (35%), Positives = 23/37 (62%), Gaps = 9/37 (24%)
Query: 141 QEEEEEKKEEREEK---------KKEEKKKEEGEEKK 168
+++EE+KKEERE K +K E+K+ + + +K
Sbjct: 286 EKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 28.4 bits (64), Expect = 2.1
Identities = 18/35 (51%), Positives = 23/35 (65%), Gaps = 6/35 (17%)
Query: 141 QEEEE---EKKEEREEK---KKEEKKKEEGEEKKK 169
+EEE+ +EER+E+ KKEEKKKEE E K
Sbjct: 267 EEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLA 301
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 33.1 bits (76), Expect = 0.034
Identities = 16/30 (53%), Positives = 24/30 (80%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L+EE E+KKEE E K++EE+ ++E EE +K
Sbjct: 48 LEEEREKKKEEEERKEREEQARKEQEEYEK 77
Score = 30.1 bits (68), Expect = 0.37
Identities = 16/28 (57%), Positives = 24/28 (85%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EEEE +EERE+KK+EE++KE E+ +K
Sbjct: 43 KEEEELEEEREKKKEEEERKEREEQARK 70
Score = 28.9 bits (65), Expect = 0.86
Identities = 14/30 (46%), Positives = 23/30 (76%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EE ++ +E+RE ++KEE++ EE EKKK
Sbjct: 27 EREERKKLEEKREGERKEEEELEEEREKKK 56
Score = 27.4 bits (61), Expect = 2.8
Identities = 14/29 (48%), Positives = 22/29 (75%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ E E+KEE E +++ EKKKEE E K++
Sbjct: 36 EKREGERKEEEELEEEREKKKEEEERKER 64
Score = 27.0 bits (60), Expect = 5.0
Identities = 10/44 (22%), Positives = 24/44 (54%)
Query: 125 KNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
+ + +E E++K+EE ++++E+ +KE+ E +K
Sbjct: 34 LEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEK 77
Score = 26.6 bits (59), Expect = 6.4
Identities = 13/30 (43%), Positives = 24/30 (80%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L+E+ E +++E EE ++E +KK+E EE+K+
Sbjct: 34 LEEKREGERKEEEELEEEREKKKEEEERKE 63
Score = 26.6 bits (59), Expect = 6.5
Identities = 12/30 (40%), Positives = 24/30 (80%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EEE E++++ EEK++ E+K+EE E+++
Sbjct: 23 AEEEEREERKKLEEKREGERKEEEELEEER 52
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 32.8 bits (75), Expect = 0.041
Identities = 12/31 (38%), Positives = 22/31 (70%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++E++E +E EEK+ + KKE+ E+K+K
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKKEK 124
Score = 27.8 bits (62), Expect = 2.0
Identities = 9/29 (31%), Positives = 21/29 (72%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E++E KE E++ + KK+++ +++KK
Sbjct: 97 KEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 27.4 bits (61), Expect = 2.9
Identities = 10/29 (34%), Positives = 21/29 (72%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E EE++ + +++KKE+K+K+ E+ K
Sbjct: 104 KETEEKEAIKAKKEKKEKKEKKVAEKLAK 132
Score = 26.3 bits (58), Expect = 6.2
Identities = 10/27 (37%), Positives = 19/27 (70%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
+E E+KE + KK++++KKE+ +K
Sbjct: 103 SKETEEKEAIKAKKEKKEKKEKKVAEK 129
Score = 25.9 bits (57), Expect = 8.0
Identities = 11/30 (36%), Positives = 21/30 (70%)
Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEK 167
+ +E EEK+ + +K+K+EKK+++ EK
Sbjct: 100 KEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 31.2 bits (71), Expect = 0.044
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEE 163
L+ EEE+++EE E +K+EE+K+ E
Sbjct: 29 LKAEEEKREEEEEARKREERKERE 52
Score = 30.0 bits (68), Expect = 0.14
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEK 167
+ EEEK+EE EE +K E++KE + K
Sbjct: 30 KAEEEKREEEEEARKREERKEREKNK 55
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 33.1 bits (76), Expect = 0.050
Identities = 14/29 (48%), Positives = 22/29 (75%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E +E+K R+ KK+EEKKK+E E+ +K
Sbjct: 560 KEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 31.2 bits (71), Expect = 0.28
Identities = 12/26 (46%), Positives = 20/26 (76%)
Query: 144 EEEKKEEREEKKKEEKKKEEGEEKKK 169
E+E+KE +E+K+ K K++ E+KKK
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKK 581
Score = 30.8 bits (70), Expect = 0.30
Identities = 9/29 (31%), Positives = 20/29 (68%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++EE + E+EEK+ +++K + KK+
Sbjct: 547 LDDKEELQREKEEKEALKEQKRLRKLKKQ 575
Score = 30.8 bits (70), Expect = 0.32
Identities = 13/31 (41%), Positives = 21/31 (67%), Gaps = 2/31 (6%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
LQ E+EEK+ +++K +K ++ EEKKK
Sbjct: 552 ELQREKEEKEA--LKEQKRLRKLKKQEEKKK 580
Score = 29.2 bits (66), Expect = 1.2
Identities = 8/27 (29%), Positives = 19/27 (70%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEK 167
++E ++++ + KK+E+KK++ EK
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 27.7 bits (62), Expect = 4.0
Identities = 9/24 (37%), Positives = 17/24 (70%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEE 163
+++ K +++EEKKK+E +K E
Sbjct: 564 KEQKRLRKLKKQEEKKKKELEKLE 587
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 32.5 bits (75), Expect = 0.053
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E+ EKK E+ + K E+KK ++ E K +
Sbjct: 70 AWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 29.0 bits (66), Expect = 0.86
Identities = 11/27 (40%), Positives = 16/27 (59%), Gaps = 1/27 (3%)
Query: 143 EEEEKKEEREEKKKE-EKKKEEGEEKK 168
++E K E+ EKK E K K E ++ K
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAK 91
Score = 28.6 bits (65), Expect = 0.90
Identities = 10/26 (38%), Positives = 16/26 (61%)
Query: 144 EEEKKEEREEKKKEEKKKEEGEEKKK 169
++E K + +KK EK K + E+KK
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKA 90
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 33.3 bits (76), Expect = 0.056
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
E++ +EE E KKK+E+K +E E KK
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKK 35
Score = 27.9 bits (62), Expect = 3.6
Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 2/33 (6%)
Query: 139 PLQEEEEEKKEEREEK--KKEEKKKEEGEEKKK 169
L EEE E+K+++EEK +KE KK + +++ K
Sbjct: 12 ILTEEELERKKKKEEKAKEKELKKLKAAQKEAK 44
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 32.9 bits (75), Expect = 0.065
Identities = 21/80 (26%), Positives = 34/80 (42%), Gaps = 16/80 (20%)
Query: 94 PQKLPNLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLN----IYTPLQEEEEEKKE 149
P N D++ Q + A++P N E+ ++ IY LQEE + E
Sbjct: 400 PGNARNYDNNGR---------QDFHAIVP--NNIPEELYWVEEEHQIYRKLQEERRLR-E 447
Query: 150 EREEKKKEEKKKEEGEEKKK 169
E K E+ + + E K+K
Sbjct: 448 EAIRAKAEKTARMKAEMKEK 467
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 31.9 bits (73), Expect = 0.072
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EE+K +E E K++ E+ KE EKKK
Sbjct: 98 ARDEEKKLDEEEAKRQHEEAKEREREKKK 126
Score = 25.7 bits (57), Expect = 8.8
Identities = 8/29 (27%), Positives = 18/29 (62%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
Q +E+K+ EE+ K + ++ + E++K
Sbjct: 96 QRARDEEKKLDEEEAKRQHEEAKEREREK 124
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 31.2 bits (71), Expect = 0.081
Identities = 9/25 (36%), Positives = 14/25 (56%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEE 166
E E +E ++EEK++E EE
Sbjct: 74 AAGAEAAAEADEAEEEEKEEEAEEE 98
Score = 28.5 bits (64), Expect = 0.72
Identities = 9/26 (34%), Positives = 15/26 (57%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEE 166
E E E EE+K+EE ++E ++
Sbjct: 77 AEAAAEADEAEEEEKEEEAEEESDDD 102
Score = 26.6 bits (59), Expect = 3.8
Identities = 7/23 (30%), Positives = 15/23 (65%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEG 164
E +E +EE +E++ EE+ ++
Sbjct: 81 AEADEAEEEEKEEEAEEESDDDM 103
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 32.1 bits (73), Expect = 0.12
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 135 NIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ E+ K+ E+ K+ EEK+K+ E K K
Sbjct: 93 ELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAK 127
Score = 29.8 bits (67), Expect = 0.55
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
Q+E E++ + K+ E+ ++ EEK+K
Sbjct: 91 QKELEQRAAAEKAAKQAEQAAKQAEEKQK 119
Score = 29.4 bits (66), Expect = 0.98
Identities = 16/32 (50%), Positives = 17/32 (53%), Gaps = 3/32 (9%)
Query: 141 QEEEEEKKEEREEKKK---EEKKKEEGEEKKK 169
Q EEE K + E KK E KKK E E K K
Sbjct: 151 QAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAK 182
Score = 28.3 bits (63), Expect = 2.2
Identities = 8/29 (27%), Positives = 16/29 (55%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
+E E+ E+ K+ E+ K+ E++K
Sbjct: 91 QKELEQRAAAEKAAKQAEQAAKQAEEKQK 119
Score = 28.3 bits (63), Expect = 2.4
Identities = 13/32 (40%), Positives = 19/32 (59%), Gaps = 3/32 (9%)
Query: 141 QEEEEEKKEEREEKKK---EEKKKEEGEEKKK 169
++ EE++K+ E K K E K K E E +KK
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKK 143
Score = 27.9 bits (62), Expect = 2.5
Identities = 10/29 (34%), Positives = 21/29 (72%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
Q+++ K+E+E +KK E++ EE E+++
Sbjct: 57 QQKKPAAKKEQERQKKLEQQAEEAEKQRA 85
Score = 27.9 bits (62), Expect = 2.8
Identities = 7/29 (24%), Positives = 17/29 (58%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ E+ K+ E++K+ E+ K + + K
Sbjct: 105 KQAEQAAKQAEEKQKQAEEAKAKQAAEAK 133
Score = 27.5 bits (61), Expect = 3.6
Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 3/32 (9%)
Query: 141 QEEEEEKKEEREEKKK---EEKKKEEGEEKKK 169
+ E KK+ E KKK E K K E + K K
Sbjct: 159 KAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190
Score = 27.5 bits (61), Expect = 4.3
Identities = 10/29 (34%), Positives = 14/29 (48%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E E + K E + K K E+ K + E K
Sbjct: 175 AEAEAKAKAEAKAKAKAEEAKAKAEAAKA 203
Score = 27.1 bits (60), Expect = 4.7
Identities = 14/31 (45%), Positives = 17/31 (54%), Gaps = 2/31 (6%)
Query: 141 QEEEEEKKEEREEKKK--EEKKKEEGEEKKK 169
E E EKK + E KK+ EE K + E KK
Sbjct: 136 AEAEAEKKAKEEAKKQAEEEAKAKAAAEAKK 166
Score = 27.1 bits (60), Expect = 4.9
Identities = 5/31 (16%), Positives = 19/31 (61%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ E++ +E +++ E+ +++E E++
Sbjct: 69 RQKKLEQQAEEAEKQRAAEQARQKELEQRAA 99
Score = 27.1 bits (60), Expect = 5.6
Identities = 7/29 (24%), Positives = 17/29 (58%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ E+ +++E E++ EK ++ E+ K
Sbjct: 84 RAAEQARQKELEQRAAAEKAAKQAEQAAK 112
Score = 27.1 bits (60), Expect = 5.7
Identities = 14/31 (45%), Positives = 17/31 (54%), Gaps = 2/31 (6%)
Query: 141 QEEEEEKKEE--REEKKKEEKKKEEGEEKKK 169
Q E + K E E+K KEE KK+ EE K
Sbjct: 128 QAAEAKAKAEAEAEKKAKEEAKKQAEEEAKA 158
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 32.0 bits (73), Expect = 0.14
Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 6/37 (16%)
Query: 139 PLQEEEEEKKEEREEKKKEEKK------KEEGEEKKK 169
P +EEEEEK+E++EE++K K +EE EEKKK
Sbjct: 36 PDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKK 72
Score = 31.3 bits (71), Expect = 0.25
Identities = 14/28 (50%), Positives = 22/28 (78%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EEEEK ++EE+ EE++KEE ++K K
Sbjct: 48 KEEEEKTTDKEEEVDEEEEKEEKKKKTK 75
Score = 30.5 bits (69), Expect = 0.41
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEE 166
T +EE +E++E+ E+KKK +K KE E
Sbjct: 55 TDKEEEVDEEEEKEEKKKKTKKVKETTTE 83
Score = 27.8 bits (62), Expect = 3.2
Identities = 12/29 (41%), Positives = 22/29 (75%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EEEE+ ++ EE +EE+K+E+ ++ KK
Sbjct: 48 KEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76
Score = 27.8 bits (62), Expect = 3.6
Identities = 14/29 (48%), Positives = 21/29 (72%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E E+E +E EE++KEEKK+EE + K
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDK 57
Score = 27.0 bits (60), Expect = 6.0
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EEEEK+E++++ KK ++ E E K
Sbjct: 62 DEEEEKEEKKKKTKKVKETTTEWELLNK 89
>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the heterodimeric topo I from
Leishmania donvanni. Topo I enzymes are divided into:
topo type IA (bacterial) and type IB (eukaryotic). Topo
I relaxes superhelical tension in duplex DNA by creating
a single-strand nick, the broken strand can then rotate
around the unbroken strand to remove DNA supercoils and,
the nick is religated, liberating topo I. These enzymes
regulate the topological changes that accompany DNA
replication, transcription and other nuclear processes.
Human topo I is the target of a diverse set of
anticancer drugs including camptothecins (CPTs). CPTs
bind to the topo I-DNA complex and inhibit re-ligation
of the single-strand nick, resulting in the accumulation
of topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topo I play putative roles in
organizing the kinetoplast DNA network unique to these
parasites. This family may represent more than one
structural domain.
Length = 212
Score = 31.4 bits (71), Expect = 0.14
Identities = 13/40 (32%), Positives = 23/40 (57%)
Query: 129 EDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
E C + IY E+E+KK +E+KK K++++ E +
Sbjct: 80 ELCDFTPIYEWHLREKEKKKSRTKEEKKALKEEKDKEAEP 119
Score = 27.1 bits (60), Expect = 4.7
Identities = 16/44 (36%), Positives = 25/44 (56%), Gaps = 5/44 (11%)
Query: 131 CLYLNIYTPLQE---EEEEKKEER--EEKKKEEKKKEEGEEKKK 169
L L +TP+ E E+EKK+ R EEKK +++K++ E
Sbjct: 78 KLELCDFTPIYEWHLREKEKKKSRTKEEKKALKEEKDKEAEPYM 121
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 30.7 bits (70), Expect = 0.15
Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 10/43 (23%)
Query: 137 YTPLQEEEEEKKEEREEKKKEEKKKEEGE----------EKKK 169
P EE K E E+K+KE+KKK++ E EKKK
Sbjct: 62 KGPAAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKK 104
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 31.2 bits (71), Expect = 0.16
Identities = 15/30 (50%), Positives = 22/30 (73%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L+E E+ KKE EEK++EE++K EEK +
Sbjct: 148 LRELEKIKKERAEEKEREEEEKAAEEEKAR 177
Score = 30.8 bits (70), Expect = 0.26
Identities = 13/32 (40%), Positives = 21/32 (65%), Gaps = 2/32 (6%)
Query: 140 LQEEEEEKKEEREEKK--KEEKKKEEGEEKKK 169
L E E+ K+ER E+K +EE+K E E+ ++
Sbjct: 147 LLRELEKIKKERAEEKEREEEEKAAEEEKARE 178
Score = 30.1 bits (68), Expect = 0.45
Identities = 10/21 (47%), Positives = 16/21 (76%)
Query: 143 EEEEKKEEREEKKKEEKKKEE 163
EE+++EE E+ +EEK +EE
Sbjct: 159 AEEKEREEEEKAAEEEKAREE 179
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 30.7 bits (70), Expect = 0.19
Identities = 12/23 (52%), Positives = 16/23 (69%), Gaps = 2/23 (8%)
Query: 147 KKEEREEKKKEEKKKEEGEEKKK 169
+KEERE +K+ E KE E+ KK
Sbjct: 82 EKEERELRKRAE--KEALEQAKK 102
Score = 26.8 bits (60), Expect = 3.3
Identities = 11/32 (34%), Positives = 19/32 (59%), Gaps = 3/32 (9%)
Query: 141 QEEEEEKKEEREE---KKKEEKKKEEGEEKKK 169
+E E K+ E+E KKEE+ +E +++K
Sbjct: 84 EERELRKRAEKEALEQAKKEEELREAKRQQRK 115
>gnl|CDD|220163 pfam09284, RhgB_N, Rhamnogalacturonase B, N-terminal. Members of
this family are found in prokaryotic Rhamnogalacturonase
B, and adopt a structure consisting of a beta
supersandwich, with eighteen strands in two beta-sheets.
The exact function of the domain is unknown, but a
putative role includes carbohydrate-binding.
Length = 249
Score = 30.9 bits (70), Expect = 0.24
Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 2/29 (6%)
Query: 45 VHNGNLNNIHMYLGIPYAAPPVGQLRFMA 73
+G+ N I+M I A P +G+LRF+A
Sbjct: 81 ARSGD-NTIYMATYIT-AEPSIGELRFIA 107
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 31.2 bits (70), Expect = 0.25
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
EEE + +R EK K EKKK E E++ +
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGR 111
Score = 27.0 bits (59), Expect = 6.3
Identities = 15/28 (53%), Positives = 21/28 (75%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E EE KK + +KKK+ K+KEE ++KKK
Sbjct: 270 EAEETKKSPKHKKKKQRKEKEEKKKKKK 297
>gnl|CDD|217011 pfam02388, FemAB, FemAB family. The femAB operon codes for two
nearly identical approximately 50-kDa proteins involved
in the formation of the Staphylococcal pentaglycine
interpeptide bridge in peptidoglycan. These proteins are
also considered as a factor influencing the level of
methicillin resistance.
Length = 408
Score = 31.1 bits (71), Expect = 0.27
Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 114 LQYYQALMPYLKNQSEDCL-YLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L YY+ L+ +++E L YLN L E +E+ ++ ++ K E+K E+ KK
Sbjct: 217 LDYYEKLLDTYGDKAEFMLAYLNFKEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKK 273
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 31.1 bits (71), Expect = 0.27
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 145 EEKKEEREEKKKEEKKKEEGEEKKK 169
EKK E+EE +K KK+ KK
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKK 445
Score = 29.9 bits (68), Expect = 0.62
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 144 EEEKKEEREEKKKEEKKKEEGEEKK 168
E+K+ R++++K EKK E+ E +K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEK 432
Score = 29.1 bits (66), Expect = 1.2
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
Q + E+K E+ E +K KKK E KK
Sbjct: 418 QRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446
Score = 27.2 bits (61), Expect = 4.5
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
E++ ++E EK +KK E +K K
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAK 447
Score = 27.2 bits (61), Expect = 4.8
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
E KK ++++K E+K ++E EK
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAA 434
Score = 26.8 bits (60), Expect = 6.0
Identities = 7/30 (23%), Positives = 16/30 (53%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++EE EK +++ + KK + + + K
Sbjct: 425 AEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 30.8 bits (70), Expect = 0.27
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 137 YTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
+T + + +KEE++ KEEKK + E++K
Sbjct: 87 FTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEK 118
Score = 30.8 bits (70), Expect = 0.28
Identities = 11/40 (27%), Positives = 23/40 (57%)
Query: 129 EDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
C + ++ + ++EEKK +E+KK K ++E E++
Sbjct: 83 SKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEE 122
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 29.1 bits (66), Expect = 0.33
Identities = 12/28 (42%), Positives = 21/28 (75%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE++ K+EE +E++KE +K E E +K+
Sbjct: 52 EEKKRKQEEEKERRKEARKAERAEARKR 79
Score = 25.7 bits (57), Expect = 5.2
Identities = 7/25 (28%), Positives = 15/25 (60%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGE 165
+++EEEK+ +E +K E + +
Sbjct: 56 RKQEEEKERRKEARKAERAEARKRG 80
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 30.6 bits (70), Expect = 0.34
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEG 164
EE++EE EE+++EE +EE
Sbjct: 298 AAAAEEEEEEEEEEEEEEPSEEEA 321
Score = 29.8 bits (68), Expect = 0.61
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEE 166
Q + +EE EE+++EE+++ EE
Sbjct: 295 QAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 29.1 bits (66), Expect = 1.0
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEE 166
L + + E EE+++EE+++EE E
Sbjct: 292 LSAQAQAAAAEEEEEEEEEEEEEEPSE 318
Score = 28.7 bits (65), Expect = 1.5
Identities = 8/28 (28%), Positives = 17/28 (60%)
Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGE 165
+ + ++EE EE+++EE++ E E
Sbjct: 293 SAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 27.1 bits (61), Expect = 4.2
Identities = 8/30 (26%), Positives = 18/30 (60%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L+E + + +++EE+++EE EE+
Sbjct: 288 LKEVLSAQAQAAAAEEEEEEEEEEEEEEPS 317
Score = 26.8 bits (60), Expect = 7.3
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L EE +E + + E+++EE EE+++
Sbjct: 284 LDEELKEVLSAQAQAAAAEEEEEEEEEEEE 313
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 30.4 bits (69), Expect = 0.36
Identities = 9/27 (33%), Positives = 13/27 (48%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE+KE K+ ++ E KKK
Sbjct: 169 AAEERKEALAAAAKKSATPQKVETKKK 195
Score = 27.0 bits (60), Expect = 4.6
Identities = 9/28 (32%), Positives = 14/28 (50%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
+++EE EER+E KK +K
Sbjct: 163 KQKEEFAAEERKEALAAAAKKSATPQKV 190
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 30.2 bits (69), Expect = 0.43
Identities = 17/64 (26%), Positives = 28/64 (43%), Gaps = 7/64 (10%)
Query: 107 KKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKE-EKKKEEGE 165
KK KG L+ L K E+ + L +++E K + +KKKE ++ K
Sbjct: 33 KKSQKGELEV-TNLNEQYKEMKEE-----LKAALLDKKELKAWHKAQKKKEKQEAKAAKA 86
Query: 166 EKKK 169
+ K
Sbjct: 87 KSKP 90
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 27.7 bits (62), Expect = 0.45
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 145 EEKKEEREEKKKEEKKKEEGEEKKK 169
EE+K E EE+ ++E++ EE E K+
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVKR 46
Score = 27.3 bits (61), Expect = 0.72
Identities = 12/25 (48%), Positives = 19/25 (76%)
Query: 144 EEEKKEEREEKKKEEKKKEEGEEKK 168
EEEK+E+ EE + EE+ +E+ E K+
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVKR 46
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 30.3 bits (69), Expect = 0.48
Identities = 11/31 (35%), Positives = 24/31 (77%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
P ++ EE+K ++R++KKK +KK ++ ++K +
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 28.0 bits (63), Expect = 2.3
Identities = 8/28 (28%), Positives = 22/28 (78%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
+EE++ +K ++++K+K++ KK + + +K
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 27.6 bits (62), Expect = 3.9
Identities = 13/31 (41%), Positives = 22/31 (70%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
P +++ EEKK ++ +KKK+ KKK + +KK
Sbjct: 382 PPKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 29.6 bits (67), Expect = 0.49
Identities = 8/32 (25%), Positives = 17/32 (53%)
Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E EE +EE+ ++ E++ + EK++
Sbjct: 106 EEEVEVEELDEEEQIDELLEKELAKLKREKRR 137
Score = 29.2 bits (66), Expect = 0.59
Identities = 6/28 (21%), Positives = 20/28 (71%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
++ +E ++E + K+E++++ E ++K+
Sbjct: 118 EQIDELLEKELAKLKREKRRENERKQKE 145
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 29.7 bits (67), Expect = 0.50
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 144 EEEKKEEREEKKKEEKKKEEGEEKKK 169
+E KKE++E+K K+EK E KKK
Sbjct: 161 KEVKKEKKEKKDKKEKMVEPKGSKKK 186
Score = 29.3 bits (66), Expect = 0.74
Identities = 15/30 (50%), Positives = 24/30 (80%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E E E++E++E+KKK+E KKE+ E+K K
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKKDK 173
Score = 28.5 bits (64), Expect = 1.4
Identities = 13/32 (40%), Positives = 25/32 (78%)
Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++E E ++EE++EKKK+++ K+E +EKK
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKD 172
Score = 27.4 bits (61), Expect = 3.4
Identities = 13/28 (46%), Positives = 25/28 (89%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E EEE+K+E+++KK+ +K+K+E ++KK+
Sbjct: 148 EVEEEEKKEKKKKKEVKKEKKEKKDKKE 175
Score = 26.6 bits (59), Expect = 5.7
Identities = 13/32 (40%), Positives = 24/32 (75%)
Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
T + E+E + EE E+K+K++KK+ + E+K+K
Sbjct: 139 TTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 30.3 bits (68), Expect = 0.53
Identities = 15/40 (37%), Positives = 21/40 (52%)
Query: 129 EDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
ED L I + E++E EE EE+K EE+ + KK
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340
Score = 27.6 bits (61), Expect = 3.2
Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 5/36 (13%)
Query: 139 PLQEEEEEKK-----EEREEKKKEEKKKEEGEEKKK 169
L E K E+ EE ++E+ ++E G KK
Sbjct: 303 KLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKG 338
Score = 26.5 bits (58), Expect = 8.0
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE EE+K E E ++ KK + + KK
Sbjct: 321 EESEEEKNEEEGGLSKKGKKLKKLKGKK 348
Score = 26.5 bits (58), Expect = 8.4
Identities = 9/28 (32%), Positives = 15/28 (53%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE E+K K E ++ E+ EE ++
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEESEE 325
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 30.0 bits (67), Expect = 0.58
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ E++ + EE +KEEK+ E EE+K
Sbjct: 134 KSEQKNDWRDAEECQKEEKEPEPEEEEKP 162
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 29.6 bits (67), Expect = 0.59
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EEE++++E E+ K K K + K K
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAK 65
Score = 27.7 bits (62), Expect = 2.7
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE+EEK+EE+ + + K K+ + K +
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIE 67
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 29.8 bits (67), Expect = 0.62
Identities = 9/30 (30%), Positives = 16/30 (53%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L +E++K+ E K+ K+K+ E K
Sbjct: 111 LAAQEQKKQAEEAAKQAALKQKQAEEAAAK 140
Score = 29.4 bits (66), Expect = 0.98
Identities = 9/30 (30%), Positives = 20/30 (66%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L++ E+E+ +E+KK+ E+ ++ K+K
Sbjct: 103 LKQLEKERLAAQEQKKQAEEAAKQAALKQK 132
Score = 27.5 bits (61), Expect = 3.7
Identities = 8/29 (27%), Positives = 13/29 (44%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
Q EE K+ ++K+ EE + K
Sbjct: 119 QAEEAAKQAALKQKQAEEAAAKAAAAAKA 147
Score = 27.5 bits (61), Expect = 4.2
Identities = 9/29 (31%), Positives = 20/29 (68%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E ++++ E+E K+ EK++ +E+KK
Sbjct: 90 EELQQKQAAEQERLKQLEKERLAAQEQKK 118
Score = 27.1 bits (60), Expect = 5.6
Identities = 12/33 (36%), Positives = 14/33 (42%), Gaps = 4/33 (12%)
Query: 141 QEEEEEKKEEREEKKK----EEKKKEEGEEKKK 169
+ E KK+ E K E KKK E E K
Sbjct: 162 KAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAK 194
Score = 26.7 bits (59), Expect = 7.1
Identities = 15/32 (46%), Positives = 16/32 (50%), Gaps = 3/32 (9%)
Query: 141 QEEEEEKKEEREEKKK---EEKKKEEGEEKKK 169
E +KK E E K E KKK E E KKK
Sbjct: 179 AAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKK 210
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 29.5 bits (66), Expect = 0.68
Identities = 12/38 (31%), Positives = 23/38 (60%)
Query: 132 LYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
Y + L+E ++EK+E +E ++ E + EE +E+ K
Sbjct: 136 DYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLK 173
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 28.4 bits (64), Expect = 0.75
Identities = 16/41 (39%), Positives = 25/41 (60%)
Query: 128 SEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
S D Y Y + ++ +EKKEEREE K+E ++ + EE+
Sbjct: 36 STDLYYRTDYLEVTDKVKEKKEEREEDKEELIERIKKEEET 76
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 29.7 bits (66), Expect = 0.76
Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 11/93 (11%)
Query: 82 GLMIADSFSPVCPQKLPNLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQ 141
G++ AD +P P+ ++ + ++ L++ P K + E + N +
Sbjct: 187 GIVGADEEAPPAPKNTLTPLEELYETNVNLFALKH-----PLEKLEEEIDILKNDGDKVA 241
Query: 142 EEEEEKKEEREEKKKEEKKKE------EGEEKK 168
EEEE + +E E+ +E+KK+ EG+E+K
Sbjct: 242 EEEEFELDEEHEEAEEDKKEALEKIGAEGDEEK 274
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 28.0 bits (63), Expect = 0.80
Identities = 8/21 (38%), Positives = 14/21 (66%)
Query: 146 EKKEEREEKKKEEKKKEEGEE 166
EE+KKEE+++EE ++
Sbjct: 61 AAAAAAEEEKKEEEEEEEEDD 81
Score = 25.3 bits (56), Expect = 8.0
Identities = 9/21 (42%), Positives = 13/21 (61%), Gaps = 2/21 (9%)
Query: 143 EEEEKKEEREEKKKEEKKKEE 163
EE EKK+EE+++EE
Sbjct: 61 AAAAAAEE--EKKEEEEEEEE 79
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 29.3 bits (66), Expect = 0.83
Identities = 9/30 (30%), Positives = 21/30 (70%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++++ KK++ ++K K++ KK++ EKK
Sbjct: 85 KWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114
Score = 28.9 bits (65), Expect = 0.98
Identities = 12/30 (40%), Positives = 24/30 (80%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++E EEK++ + +KKK +KKK++ ++KK
Sbjct: 74 VKKEYEEKQKWKWKKKKSKKKKDKDKDKKD 103
Score = 28.1 bits (63), Expect = 1.5
Identities = 9/30 (30%), Positives = 23/30 (76%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E+++ K ++++ KKK++K K++ ++KK
Sbjct: 78 YEEKQKWKWKKKKSKKKKDKDKDKKDDKKD 107
Score = 27.7 bits (62), Expect = 2.5
Identities = 8/28 (28%), Positives = 23/28 (82%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++++KK+++++ K E+K ++E E+K +
Sbjct: 96 DKDKDKKDDKKDDKSEKKDEKEAEDKLE 123
Score = 27.3 bits (61), Expect = 2.8
Identities = 4/29 (13%), Positives = 21/29 (72%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++++K +++++K +++ +KE ++ +
Sbjct: 96 DKDKDKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 26.6 bits (59), Expect = 5.1
Identities = 8/28 (28%), Positives = 22/28 (78%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++++KK+++ EKK E++ +++ E+ K
Sbjct: 100 DKKDDKKDDKSEKKDEKEAEDKLEDLTK 127
Score = 26.2 bits (58), Expect = 7.4
Identities = 8/28 (28%), Positives = 23/28 (82%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++ +KK+++++ KK++KK ++ E+K +
Sbjct: 88 KKKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Score = 26.2 bits (58), Expect = 8.2
Identities = 12/34 (35%), Positives = 21/34 (61%)
Query: 136 IYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
IY E ++KK+E E+ ++ KK+ E ++K K
Sbjct: 52 IYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWK 85
Score = 26.2 bits (58), Expect = 8.3
Identities = 14/31 (45%), Positives = 22/31 (70%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L EE E+ K+E EEK+K + KK++ ++KK
Sbjct: 66 ELAEEIEKVKKEYEEKQKWKWKKKKSKKKKD 96
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 29.1 bits (65), Expect = 0.84
Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 4/38 (10%)
Query: 129 EDCLYLNIYTPLQEEEEEK----KEEREEKKKEEKKKE 162
D + I L+EE+E+K KEE+E KKKE K+E
Sbjct: 82 SDADFSLIKNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119
Score = 26.8 bits (59), Expect = 5.3
Identities = 10/33 (30%), Positives = 22/33 (66%)
Query: 137 YTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ ++ EE+KE+++ KEEK+ ++ E K+
Sbjct: 86 FSLIKNHLEEEKEKKKNLNKEEKEAKKKERAKR 118
>gnl|CDD|199907 cd10320, RGL4_N, N-terminal catalytic domain of rhamnogalacturonan
lyase, a family 4 polysaccharide lyase. The
rhamnogalacturonan lyase of the polysaccharide lyase
family 4 (RGL4) is involved in the degradation of RG
(rhamnogalacturonan) type-I, an important pectic plant
cell wall polysaccharide, by cleaving the alpha-1,4
glycoside bond between L-rhamnose and D-galacturonic
acids in the backbone of RG type-I through a
beta-elimination reaction. RGL4 consists of three
domains, an N-terminal catalytic domain, a middle domain
with a FNIII type fold and a C-terminal domain with a
jelly roll fold; the middle and C-terminal domains are
both putative carbohydrate binding modules. There are
two types of RG lyases, which both cleave the alpha-1,4
bonds of the RG-I main chain (RG chain) through the
beta-elimination reaction, but belong to two
structurally unrelated polysaccharide lyase (PL)
families, 4 and 11.
Length = 265
Score = 29.3 bits (66), Expect = 0.87
Identities = 20/78 (25%), Positives = 34/78 (43%), Gaps = 12/78 (15%)
Query: 17 LQVTMSSKEYTNIIELK-----QGRLRGVVRSPVHNGNLNNIHMY--LGIPYAAPPVGQL 69
L T+S+ + + I++ + L VVR I+MY P P +G+L
Sbjct: 73 LGATVSATQSGDYIQISFSRTFETELHYVVRK-----GEPGIYMYTVATHPAPEPSLGEL 127
Query: 70 RFMAPQSPQPWPGLMIAD 87
R + +P +P I+D
Sbjct: 128 RTVFRLNPDLFPNGAISD 145
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.4 bits (66), Expect = 0.88
Identities = 11/33 (33%), Positives = 24/33 (72%), Gaps = 3/33 (9%)
Query: 140 LQEEE---EEKKEEREEKKKEEKKKEEGEEKKK 169
+EE+ EE +++++E+ E+K+K + +E+KK
Sbjct: 402 AKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 28.6 bits (64), Expect = 1.9
Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 9/71 (12%)
Query: 99 NLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEE 158
+L D + + + Y + L Y K E EE E K + E+ K+EE
Sbjct: 361 DLSGDKRQELLKE----YNKKLQDYTKKLGE-----VKDETDASEEAEAKAKEEKLKQEE 411
Query: 159 KKKEEGEEKKK 169
+K++ E+ +
Sbjct: 412 NEKKQKEQADE 422
>gnl|CDD|185466 PTZ00130, PTZ00130, heat shock protein 90; Provisional.
Length = 814
Score = 29.6 bits (66), Expect = 0.90
Identities = 28/111 (25%), Positives = 49/111 (44%), Gaps = 7/111 (6%)
Query: 59 IPYAAPPVGQLRFMAPQSPQPW-PGLMIADSFSPVCPQKLPNL----DDDDFSKKMSKGR 113
IP AP + F S + + +++AD F P+ + + D DD +S+ +
Sbjct: 375 IPSRAPSINDHLFTKQNSIKLYVRRVLVADEFVEFLPRYMSFVKGVVDSDDLPLNVSREQ 434
Query: 114 LQYYQALMPYLKNQSEDCL--YLNIYTPLQEEEEEKKEEREEKKKEEKKKE 162
LQ + L K L + +Y ++ +E + E ++ EEKKKE
Sbjct: 435 LQQNKILKAVSKRIVRKILDTFRTLYKEGKKNKETLRAELAKETDEEKKKE 485
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 28.8 bits (65), Expect = 0.93
Identities = 12/29 (41%), Positives = 23/29 (79%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EE+E +KEE+E K+++EK+ +G +K+
Sbjct: 18 REEKEREKEEKERKEEKEKEWGKGLVQKE 46
Score = 27.7 bits (62), Expect = 2.0
Identities = 8/20 (40%), Positives = 18/20 (90%)
Query: 144 EEEKKEEREEKKKEEKKKEE 163
EE+++E+ EK+++E+K+E+
Sbjct: 15 EEKREEKEREKEEKERKEEK 34
Score = 27.3 bits (61), Expect = 2.5
Identities = 12/23 (52%), Positives = 20/23 (86%)
Query: 147 KKEEREEKKKEEKKKEEGEEKKK 169
+E+REEK++E+++KE EEK+K
Sbjct: 14 IEEKREEKEREKEEKERKEEKEK 36
Score = 25.7 bits (57), Expect = 9.3
Identities = 8/22 (36%), Positives = 19/22 (86%)
Query: 140 LQEEEEEKKEEREEKKKEEKKK 161
+ EE+E+++E +E+K+E++K+
Sbjct: 16 EKREEKEREKEEKERKEEKEKE 37
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 29.5 bits (66), Expect = 0.94
Identities = 10/28 (35%), Positives = 21/28 (75%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E + + +EE+K++E+ KEE ++KK+
Sbjct: 95 AKEPKNESGKEEEKEKEQVKEEKKKKKE 122
Score = 28.3 bits (63), Expect = 2.1
Identities = 10/29 (34%), Positives = 21/29 (72%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ + E KEE +EK++ +++K++ +EK K
Sbjct: 97 EPKNESGKEEEKEKEQVKEEKKKKKEKPK 125
Score = 27.9 bits (62), Expect = 2.6
Identities = 11/31 (35%), Positives = 21/31 (67%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
P +E ++ K +E ++K+ K+KE+ +EKK
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKV 154
Score = 27.9 bits (62), Expect = 2.9
Identities = 13/29 (44%), Positives = 24/29 (82%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EEE+EK++ +EEKKK+++K +E + +K
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132
Score = 27.2 bits (60), Expect = 5.6
Identities = 11/28 (39%), Positives = 21/28 (75%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E+ +E+K++++EK KEE K + +E+ K
Sbjct: 111 EQVKEEKKKKKEKPKEEPKDRKPKEEAK 138
Score = 26.4 bits (58), Expect = 9.1
Identities = 12/28 (42%), Positives = 21/28 (75%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E+E+E+ +E ++KKKE+ K+E + K K
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKPK 134
>gnl|CDD|181941 PRK09546, zntB, zinc transporter; Reviewed.
Length = 324
Score = 29.2 bits (66), Expect = 0.95
Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 21/52 (40%)
Query: 64 PPVGQL-----------RFMAPQSPQPWPGLMIADSFSPVCPQKLPNLDDDD 104
PP G+L R+MAPQ D F+ + ++LP + DDD
Sbjct: 179 PPRGELALLRKQLIVMRRYMAPQR----------DVFARLASERLPWMSDDD 220
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 28.8 bits (65), Expect = 0.99
Identities = 10/30 (33%), Positives = 17/30 (56%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L+ EEEEK++E E ++ K E+ +
Sbjct: 122 LEREEEEKRDEEERERLLRAAKSRSEQSRL 151
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 28.9 bits (65), Expect = 1.2
Identities = 11/31 (35%), Positives = 19/31 (61%), Gaps = 1/31 (3%)
Query: 140 LQEEEEEKKEERE-EKKKEEKKKEEGEEKKK 169
Q+EEEE++ ER E E+ ++ ++K K
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAK 151
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 29.3 bits (65), Expect = 1.2
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE+ +K +E E+ K E+ KE E KKK
Sbjct: 220 EEDTKKADEDGEEPKVEEVKEGDEGKKK 247
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 27.4 bits (61), Expect = 1.2
Identities = 4/28 (14%), Positives = 15/28 (53%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE E+ ++ ++ ++K ++ ++
Sbjct: 48 EEPEDDDDDEDDDDDDDKDDKDDDDDDD 75
Score = 25.5 bits (56), Expect = 6.3
Identities = 4/25 (16%), Positives = 17/25 (68%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEE 166
+E+++ +++++K ++ +E +E
Sbjct: 56 DEDDDDDDDKDDKDDDDDDDDEDDE 80
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 29.3 bits (66), Expect = 1.3
Identities = 12/25 (48%), Positives = 20/25 (80%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEE 163
EE+ +KK+++E+KK+EE K+EE
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREE 763
Score = 27.8 bits (62), Expect = 3.4
Identities = 8/28 (28%), Positives = 22/28 (78%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ EEK +++++K+K+++++ + EEK +
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 26.6 bits (59), Expect = 9.4
Identities = 10/28 (35%), Positives = 20/28 (71%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
++ E + + EEK K++KKKE+ +E++
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEE 758
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 28.6 bits (64), Expect = 1.3
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEE 166
E++ K+ E E+ +E++K EE EE
Sbjct: 156 LEKKLKELEAEDVDEEDEKDEEEEE 180
>gnl|CDD|235123 PRK03381, PRK03381, PII uridylyl-transferase; Provisional.
Length = 774
Score = 28.8 bits (65), Expect = 1.4
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 58 GIPYAAPPVGQLRFMAPQSPQPWP 81
G+P AA + +L AP P PWP
Sbjct: 344 GLPIAAATLSRLAASAPPLPTPWP 367
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 28.8 bits (65), Expect = 1.5
Identities = 12/20 (60%), Positives = 19/20 (95%)
Query: 150 EREEKKKEEKKKEEGEEKKK 169
EREEK+KE+K K+EG+E+++
Sbjct: 392 EREEKEKEKKAKKEGKEERR 411
>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
family 4 of glycosyl hydrolases [Carbohydrate transport
and metabolism].
Length = 442
Score = 28.8 bits (65), Expect = 1.5
Identities = 17/57 (29%), Positives = 24/57 (42%), Gaps = 3/57 (5%)
Query: 113 RLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEE-KKKEEGEEKK 168
+YY PY + + T E + +KE E KK E K+K E EK+
Sbjct: 257 YHRYY--YYPYDMKEKKPDEIEKFGTRANEVMKREKELFELYKKPELKEKPEELEKR 311
>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain.
Length = 225
Score = 28.6 bits (64), Expect = 1.5
Identities = 10/34 (29%), Positives = 19/34 (55%), Gaps = 1/34 (2%)
Query: 137 YTPLQEEEEEK-KEEREEKKKEEKKKEEGEEKKK 169
++ EE E ++ RE + ++++E EKKK
Sbjct: 103 ADRVEGEEPESIRKWRERRDLRIEERDEASEKKK 136
>gnl|CDD|224667 COG1753, COG1753, Predicted antotoxin, copG family [Signal
transduction mechanisms].
Length = 74
Score = 27.1 bits (60), Expect = 1.5
Identities = 12/54 (22%), Positives = 26/54 (48%)
Query: 107 KKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKK 160
KM +G+ + + ++ + + L EEE +K ++EEK+ E+ +
Sbjct: 17 VKMKRGKESFSDVIRELIEKKRGNLEVLMRAFGTLSEEEVEKIKKEEKEAEKMR 70
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 29.0 bits (65), Expect = 1.6
Identities = 15/30 (50%), Positives = 23/30 (76%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L+E++EEK EE+ E+ K E+ EE EE++K
Sbjct: 279 LKEKKEEKDEEKSEEVKTEEVDEEFEEEEK 308
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 27.6 bits (61), Expect = 1.6
Identities = 11/22 (50%), Positives = 14/22 (63%)
Query: 146 EKKEEREEKKKEEKKKEEGEEK 167
+ E KKEEKK+EE EE+
Sbjct: 82 ATAGAKAEAKKEEKKEEEEEEE 103
Score = 26.4 bits (58), Expect = 5.1
Identities = 9/22 (40%), Positives = 15/22 (68%)
Query: 142 EEEEEKKEEREEKKKEEKKKEE 163
K E ++E+KKEE+++EE
Sbjct: 82 ATAGAKAEAKKEEKKEEEEEEE 103
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 28.7 bits (64), Expect = 1.7
Identities = 9/29 (31%), Positives = 16/29 (55%), Gaps = 1/29 (3%)
Query: 141 QEEEEEKKEEREEKKKEE-KKKEEGEEKK 168
E E KK EK + + +KKE ++++
Sbjct: 327 AENAEIKKTRTAEKNEAKARKKEIAQKRR 355
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 27.3 bits (61), Expect = 1.8
Identities = 11/15 (73%), Positives = 12/15 (80%)
Query: 152 EEKKKEEKKKEEGEE 166
E KKEEKK+EE EE
Sbjct: 79 AEAKKEEKKEEEEEE 93
Score = 25.7 bits (57), Expect = 7.6
Identities = 7/16 (43%), Positives = 13/16 (81%)
Query: 148 KEEREEKKKEEKKKEE 163
E ++E+KKEE+++E
Sbjct: 79 AEAKKEEKKEEEEEES 94
Score = 25.4 bits (56), Expect = 8.3
Identities = 8/16 (50%), Positives = 12/16 (75%)
Query: 152 EEKKKEEKKKEEGEEK 167
+ K+E+KKEE EE+
Sbjct: 78 AAEAKKEEKKEEEEEE 93
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 28.4 bits (63), Expect = 1.8
Identities = 7/29 (24%), Positives = 18/29 (62%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ + K+E + +K+ E K+ E ++K++
Sbjct: 303 HKAFDLKQESKASEKEAEDKELEAQKKRE 331
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 28.2 bits (63), Expect = 1.9
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E EE KEE +E E K+++G+ +K+
Sbjct: 59 TAEIEEVKEEEKEAANSEDKEDKGDAEKE 87
Score = 28.2 bits (63), Expect = 2.2
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E+E E++E+K EK+ EE EE+ +
Sbjct: 68 EEKEAANSEDKEDKGDAEKEDEESEEENE 96
Score = 27.8 bits (62), Expect = 2.2
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E EE K+EE+E E+K+ + EK+
Sbjct: 60 AEIEEVKEEEKEAANSEDKEDKGDAEKED 88
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 27.9 bits (63), Expect = 1.9
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEK 167
++ EE+KEE E++ E + K E EK
Sbjct: 130 KKLEEEKEELEKRVAELEAKLEAIEK 155
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 27.6 bits (62), Expect = 1.9
Identities = 10/29 (34%), Positives = 18/29 (62%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
QEE++ + E+ + EEK + E E+ +K
Sbjct: 28 QEEKKSEAEKLAKMSAEEKAEYELEKLEK 56
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 27.9 bits (62), Expect = 2.2
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
EEE + EE EE+ +E+ + EKK
Sbjct: 71 EEEIVEDEEEEEEDEEDNVDLKDIEKK 97
Score = 27.9 bits (62), Expect = 2.3
Identities = 10/20 (50%), Positives = 14/20 (70%)
Query: 141 QEEEEEKKEEREEKKKEEKK 160
+EEEEE +E+ + K EKK
Sbjct: 78 EEEEEEDEEDNVDLKDIEKK 97
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 28.4 bits (64), Expect = 2.2
Identities = 7/28 (25%), Positives = 14/28 (50%), Gaps = 2/28 (7%)
Query: 141 QEEEEEKKEEREEKKKE--EKKKEEGEE 166
Q E +K RE + K+ E + + ++
Sbjct: 459 QARLEREKAAREARHKKAAEARAAKDKD 486
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 28.3 bits (63), Expect = 2.3
Identities = 13/97 (13%), Positives = 28/97 (28%), Gaps = 6/97 (6%)
Query: 72 MAPQSPQPWPGLMIADSFSPVCPQKLPNLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDC 131
S P ++ + ++ P+ S S +
Sbjct: 307 AMYSSDYQAPKPEPIYTWEELLRERFPSDLFAISSLPDSD------SEASDSGPTRKRKR 360
Query: 132 LYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
+ +E+E + +E E+++K EE K
Sbjct: 361 RRVPPLPEYSSDEDEDDSDEDEVDYEKERKRRREEDK 397
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 28.2 bits (63), Expect = 2.3
Identities = 7/24 (29%), Positives = 23/24 (95%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEE 163
L++++++++EER +++K+++++EE
Sbjct: 192 LKQQQQKREEERRKQRKKQQEEEE 215
Score = 27.4 bits (61), Expect = 3.6
Identities = 11/29 (37%), Positives = 24/29 (82%), Gaps = 2/29 (6%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+++++K+EE E++K+ KK++E EE+K+
Sbjct: 192 LKQQQQKREE--ERRKQRKKQQEEEERKQ 218
Score = 27.0 bits (60), Expect = 4.4
Identities = 8/30 (26%), Positives = 26/30 (86%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++ ++++++REE++++++KK++ EE++K
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEEEERK 217
Score = 27.0 bits (60), Expect = 4.4
Identities = 10/27 (37%), Positives = 24/27 (88%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEK 167
Q+ EEE++++R+++++EE++K++ EE
Sbjct: 197 QKREEERRKQRKKQQEEEERKQKAEEA 223
>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
Length = 103
Score = 27.1 bits (60), Expect = 2.3
Identities = 17/28 (60%), Positives = 20/28 (71%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EEE KK E E +K EKKK+EGEE+ K
Sbjct: 41 EEEIIKKAEEEAQKLIEKKKKEGEEEAK 68
Score = 25.5 bits (56), Expect = 7.2
Identities = 15/30 (50%), Positives = 20/30 (66%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++EE +K EE +K E+KKKE EE KK
Sbjct: 40 IEEEIIKKAEEEAQKLIEKKKKEGEEEAKK 69
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 28.1 bits (62), Expect = 2.4
Identities = 11/25 (44%), Positives = 20/25 (80%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGE 165
EE++ ++E+ EE+ +EE K+EEG+
Sbjct: 215 TEEKQPQEEQEEEEVEEEAKQEEGQ 239
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 26.8 bits (60), Expect = 2.4
Identities = 11/27 (40%), Positives = 18/27 (66%), Gaps = 2/27 (7%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEK 167
+EE+ +++EE + K K+EE EEK
Sbjct: 5 EEEKAQREEELKRLKNL--KREEIEEK 29
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 27.8 bits (61), Expect = 2.5
Identities = 13/28 (46%), Positives = 20/28 (71%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EEE ++EE ++KEEK K + EE++K
Sbjct: 72 EEEARRQEEERAREKEEKAKRKAEEEEK 99
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 28.1 bits (63), Expect = 2.6
Identities = 9/28 (32%), Positives = 18/28 (64%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE EE +R+ +K+ ++ +E EE ++
Sbjct: 162 EELEEDAADRDARKRAAEEAKEQEELRR 189
>gnl|CDD|197363 cd09846, DUF1312, N-Utilization Substance G (NusG) N terminal
(NGN) insert and Lin0431 are part of DUF1312. Domains
of Unknown Function 1312 (DUF1312) are represented in
at least 71 bacterial species with no functional
annotation. Included in this family are N-Utilization
Substance G (NusG) N terminal (NGN) insert and Lin0431,
having similar structure and surface features that
appear to be conserved across these domain families,
suggesting similar function. NusG contains NGN at the
N-terminus and Kyrpides Ouzounis and Woese (KOW)
repeats at the C-terminus in bacteria and archaea, and
this insert (often known as Domain II) is found in
several bacteria. Lin0431 is similar to NGN-insert but
does ot contain the disulphite bridge.
Length = 81
Score = 26.4 bits (59), Expect = 2.7
Identities = 11/21 (52%), Positives = 15/21 (71%), Gaps = 1/21 (4%)
Query: 24 KEYTNIIELKQGRLRGVVRSP 44
+ YTNIIE++ GR+R V S
Sbjct: 30 EGYTNIIEVEDGRVR-VKESD 49
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 27.7 bits (62), Expect = 2.7
Identities = 10/31 (32%), Positives = 16/31 (51%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E E + E++ + K+E KKK+ E K
Sbjct: 239 EDVEAAESRAEKKRKSKEEIKKKKPKESKGV 269
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 27.7 bits (62), Expect = 2.7
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
Q + +K++ +E KKKE+ +K E EE K
Sbjct: 17 QRKARKKQKRKEAKKKEDAQKSEAEEVK 44
Score = 27.3 bits (61), Expect = 3.8
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E E++ + E++K KK++ E KKK
Sbjct: 5 EALLEQRRRKREQRKARKKQKRKEAKKK 32
Score = 26.9 bits (60), Expect = 4.3
Identities = 9/26 (34%), Positives = 20/26 (76%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEE 166
+ + E++K +++K+KE KKKE+ ++
Sbjct: 12 RRKREQRKARKKQKRKEAKKKEDAQK 37
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 27.7 bits (62), Expect = 2.8
Identities = 17/43 (39%), Positives = 24/43 (55%), Gaps = 2/43 (4%)
Query: 123 YLKNQSEDCLYLNIYTPLQEEEEEKKEEREE--KKKEEKKKEE 163
KN S D +L ++E E ++E REE KK+E K+EE
Sbjct: 50 IGKNPSVDTSFLPDKAREEKEAELREELREEFLKKQEAVKEEE 92
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 27.9 bits (63), Expect = 2.9
Identities = 12/28 (42%), Positives = 16/28 (57%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EEEE +KEE + + +EE K E E
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAA 66
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 27.3 bits (60), Expect = 3.1
Identities = 10/22 (45%), Positives = 19/22 (86%)
Query: 142 EEEEEKKEEREEKKKEEKKKEE 163
E EEK +ERE+++++E+++EE
Sbjct: 145 ERREEKIDEREDQEEQEREREE 166
Score = 26.6 bits (58), Expect = 6.1
Identities = 12/30 (40%), Positives = 23/30 (76%), Gaps = 1/30 (3%)
Query: 141 QEEEEEKKE-EREEKKKEEKKKEEGEEKKK 169
+EE EE+ E ER E+K +E++ +E +E+++
Sbjct: 135 REELEERMEWERREEKIDEREDQEEQERER 164
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 27.5 bits (61), Expect = 3.3
Identities = 14/71 (19%), Positives = 32/71 (45%), Gaps = 6/71 (8%)
Query: 99 NLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKK------EERE 152
N+D ++ KK+ + + + + + + L ++EEEE++ EE E
Sbjct: 112 NIDLENTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEE 171
Query: 153 EKKKEEKKKEE 163
++ + K K+
Sbjct: 172 QQMNKRKNKQA 182
>gnl|CDD|236082 PRK07737, fliD, flagellar capping protein; Validated.
Length = 501
Score = 27.8 bits (62), Expect = 3.3
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 137 YTPLQEEEEEKKEEREEKKKEEKKK 161
Y PL +E++E E++ + EEK K
Sbjct: 303 YPPLTDEQKEAMTEKQIELWEEKAK 327
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 27.0 bits (60), Expect = 3.3
Identities = 12/30 (40%), Positives = 24/30 (80%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L+E+ ++KK+++++KKK++KK + KKK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
>gnl|CDD|152814 pfam12379, DUF3655, Protein of unknown function (DUF3655). This
domain family is found in viruses, and is approximately
70 amino acids in length. The family is found in
association with pfam08716, pfam01661, pfam05409,
pfam06471, pfam08717, pfam06478, pfam09401, pfam06460,
pfam08715, pfam08710.
Length = 70
Score = 26.2 bits (57), Expect = 3.5
Identities = 13/35 (37%), Positives = 21/35 (60%)
Query: 132 LYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEE 166
+Y + Y P +EE+ E+ E EE+ +E + E G E
Sbjct: 1 MYCSFYPPDEEEDCEEYECEEEEIEETCEHEYGTE 35
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 27.7 bits (61), Expect = 3.5
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEE 166
+ EEEK E++ K+++ KK +E E
Sbjct: 127 DVEEEKTEKKVRKRRKVKKMDEDVE 151
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 27.7 bits (62), Expect = 3.6
Identities = 13/45 (28%), Positives = 22/45 (48%), Gaps = 8/45 (17%)
Query: 124 LKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
LK SE T +EEE + + E + K K+++EG ++
Sbjct: 226 LKETSE--------TEEREEETDVEIETTSETKGTKQEQEGSTEE 262
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 27.2 bits (61), Expect = 3.7
Identities = 9/29 (31%), Positives = 15/29 (51%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
+ ++EK R E K+ E K +E E +
Sbjct: 76 IARYKKEKARYRSEAKELEAKAKEAEAES 104
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 27.7 bits (62), Expect = 3.7
Identities = 14/45 (31%), Positives = 25/45 (55%), Gaps = 2/45 (4%)
Query: 122 PYLKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEE 166
++ Q C ++N EEEE++ EE EE+++E ++ E E
Sbjct: 331 QHILPQGR-CTWVNP-EQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 26.6 bits (59), Expect = 3.7
Identities = 16/74 (21%), Positives = 34/74 (45%), Gaps = 9/74 (12%)
Query: 101 DDDDFSKKM-----SKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKKEEREEKK 155
++DD+ + +KGR +Y + +D + +EE + R E +
Sbjct: 32 EEDDYDRDEIWAMFNKGRKRYAY----RDDDDDDDDMEAGGADIQEEERRSARLARLEDR 87
Query: 156 KEEKKKEEGEEKKK 169
+EE+ ++E E +K+
Sbjct: 88 EEERLEKEEEREKR 101
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 27.0 bits (60), Expect = 3.8
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE EE+ + E++ + K E +EKKK
Sbjct: 99 EETEEESTDETEQEDPPETKTESKEKKK 126
Score = 27.0 bits (60), Expect = 3.9
Identities = 8/32 (25%), Positives = 15/32 (46%)
Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E E++ + + +EKKK E + K
Sbjct: 103 EESTDETEQEDPPETKTESKEKKKREVPKPKT 134
Score = 26.3 bits (58), Expect = 7.1
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E EEE +E E++ E K E E+KK+
Sbjct: 99 EETEEESTDETEQEDPPETKTESKEKKKR 127
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 27.2 bits (61), Expect = 4.2
Identities = 16/29 (55%), Positives = 22/29 (75%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
L+EE +E EER E+K+EEK++E EE K
Sbjct: 126 LREEIDEFNEERIERKEEEKEREREEELK 154
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 26.3 bits (57), Expect = 4.2
Identities = 7/24 (29%), Positives = 15/24 (62%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEE 166
+EEE ++ +E+ +E E+ +E
Sbjct: 17 DEEEDDDDEDEEDDDEDDDEDDDE 40
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 26.9 bits (60), Expect = 4.3
Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 2/32 (6%)
Query: 140 LQEE--EEEKKEEREEKKKEEKKKEEGEEKKK 169
L EE EE + K++E + E+ E KKK
Sbjct: 164 LLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 27.4 bits (61), Expect = 4.8
Identities = 9/32 (28%), Positives = 18/32 (56%)
Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
++++++KKE++E K + E K KK
Sbjct: 68 KKSEKKKKKKKEKKEPKSEGETKLGFKTPKKS 99
Score = 27.0 bits (60), Expect = 5.9
Identities = 15/29 (51%), Positives = 20/29 (68%), Gaps = 2/29 (6%)
Query: 143 EEEEKKEEREEKKKEEKK--KEEGEEKKK 169
E +KK E+++KKK+EKK K EGE K
Sbjct: 64 NESKKKSEKKKKKKKEKKEPKSEGETKLG 92
>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved region
at the N-terminal end of a family of proteins referred
to as CIRs (CBF1-interacting co-repressors). CBF1
(centromere-binding factor 1) acts as a transcription
factor that causes repression by binding specifically to
GTGGGAA motifs in responsive promoters, and it requires
CIR as a co-repressor. CIR binds to histone deacetylase
and to SAP30 and serves as a linker between CBF1 and the
histone deacetylase complex.
Length = 37
Score = 24.5 bits (54), Expect = 5.1
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 144 EEEKKEEREEKKKEEKKKEEGEE 166
E E+K E+KK EE +KE EE
Sbjct: 15 EAEQKALEEQKKIEELRKEIEEE 37
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 27.2 bits (60), Expect = 5.2
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
++ KK E++ KKKEE+ EE + K+
Sbjct: 69 SQQSSAKKGEQQRKKKEEQVAEELKPKQ 96
Score = 26.8 bits (59), Expect = 6.9
Identities = 12/29 (41%), Positives = 21/29 (72%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
Q EE EK+ + E+K++EE+ ++ E+KK
Sbjct: 119 QAEEAEKQAQLEQKQQEEQARKAAAEQKK 147
Score = 26.5 bits (58), Expect = 7.5
Identities = 8/29 (27%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E+E K+ +E+ K ++++++ EE +K
Sbjct: 97 AAEQERLKQLEKERLKAQEQQKQAEEAEK 125
Score = 26.5 bits (58), Expect = 8.2
Identities = 12/29 (41%), Positives = 14/29 (48%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ +KK E E K EK K E E K K
Sbjct: 188 EAAAAKKKAEAEAKAAAEKAKAEAEAKAK 216
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 27.1 bits (61), Expect = 5.2
Identities = 21/64 (32%), Positives = 32/64 (50%), Gaps = 13/64 (20%)
Query: 107 KKMSKGRLQYYQALMPYLKN-QSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGE 165
++M K + +YY L LK Q E L E++++K E E ++K EK K E
Sbjct: 220 EQMEKSQREYY--LREQLKAIQKE----------LGEDDDDKDEVEELREKIEKLKLPKE 267
Query: 166 EKKK 169
K+K
Sbjct: 268 AKEK 271
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease M16C
(pfam05193).
Length = 248
Score = 26.8 bits (60), Expect = 5.3
Identities = 13/28 (46%), Positives = 16/28 (57%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E EEK E E +K E+KK EE K+
Sbjct: 5 ETLEEKLNEEEREKLEKKKSSLSEEDKE 32
>gnl|CDD|188469 TIGR03954, integ_memb_HG, integral membrane protein. This model
describes a strictly bacterial integral membrane domain
of about 85 residues in length. It occurs in proteins
that on rare occasions are fused to transporter domains
such as the major facilitator superfamily domain. Of
three invariant residues, two occur as a His-Gly
dipeptide in the middle of three predicted
transmembrane helices [Unknown function, General].
Length = 85
Score = 25.9 bits (58), Expect = 5.3
Identities = 9/16 (56%), Positives = 10/16 (62%)
Query: 6 ALFTGLLLLLLLQVTM 21
A G+ LLLLL V M
Sbjct: 5 AFAEGVSLLLLLFVAM 20
>gnl|CDD|224401 COG1484, DnaC, DNA replication protein [DNA replication,
recombination, and repair].
Length = 254
Score = 27.0 bits (60), Expect = 5.6
Identities = 12/41 (29%), Positives = 17/41 (41%)
Query: 129 EDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
DC L+ EE+K RE +K E + + KK
Sbjct: 34 ADCKEWGYAEFLEYLLEEEKLAREARKIERRLRSASFPAKK 74
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 26.0 bits (57), Expect = 5.8
Identities = 9/28 (32%), Positives = 17/28 (60%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
Q +E+K E+ + ++KKK ++KK
Sbjct: 6 QLSKEKKAEKAKAGTAKDKKKWSKKKKK 33
Score = 25.6 bits (56), Expect = 7.7
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
P + +EKK E+ + + KK+ ++KKK
Sbjct: 3 PKSQLSKEKKAEKAKAGTAKDKKKWSKKKKK 33
>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein. This family includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 116
Score = 26.0 bits (57), Expect = 6.1
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 140 LQEEEEEKKEEREEKKKEEKKK 161
L++E E +EE EEK+K++KK
Sbjct: 94 LEDERELAREEEEEKRKKKKKN 115
Score = 26.0 bits (57), Expect = 6.7
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+EE + R E ++E ++EE E++KK
Sbjct: 82 QKEERRSARMARLEDERELAREEEEEKRKK 111
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 26.2 bits (58), Expect = 6.2
Identities = 13/31 (41%), Positives = 23/31 (74%), Gaps = 2/31 (6%)
Query: 141 QEEEEEKKEEREEKKKE--EKKKEEGEEKKK 169
+EE++ K EE+ KK+ +KKK++ ++KKK
Sbjct: 72 REEKKRKDEEKTAKKRAKRQKKKQKKKKKKK 102
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 26.8 bits (59), Expect = 6.3
Identities = 8/26 (30%), Positives = 18/26 (69%)
Query: 144 EEEKKEEREEKKKEEKKKEEGEEKKK 169
E+ +K + + +KK+E+K+ E+K+
Sbjct: 173 EDSQKHKEDRRKKQEEKRRNDEDKRP 198
>gnl|CDD|222117 pfam13422, DUF4110, Domain of unknown function (DUF4110). This is
a family that is found predominantly at the C-terminus
of Kelch-containing proteins. However, the exact
function of this region is not known.
Length = 99
Score = 25.8 bits (57), Expect = 6.3
Identities = 9/27 (33%), Positives = 13/27 (48%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
EEEE+ E +EK GE ++
Sbjct: 72 EEEEQAGIEEVVSLAKEKDWGGGEGRR 98
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 27.0 bits (60), Expect = 6.4
Identities = 8/23 (34%), Positives = 14/23 (60%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKE 162
+ EE ++E EE ++E K K+
Sbjct: 303 VNEELTTVRQENEELEEEYKIKK 325
>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 486
Score = 26.7 bits (59), Expect = 6.4
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 5/46 (10%)
Query: 118 QALMPYLKNQSEDCLYLNI-----YTPLQEEEEEKKEEREEKKKEE 158
+ ++ ED L L I Y P E EE KKE+ +EK++ E
Sbjct: 417 GKITIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPE 462
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 26.5 bits (59), Expect = 6.4
Identities = 11/28 (39%), Positives = 20/28 (71%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EEE E + R E+++ +KKK++ ++K K
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTK 193
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 26.6 bits (59), Expect = 6.5
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EEE EK+ +REE+ K++K+ + K+
Sbjct: 70 DEEEGEKELQREERLKKKKRVKTKAYKEP 98
>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved region
at the N-terminal end of a family of proteins referred
to as CIRs (CBF1-interacting co-repressors). CBF1
(centromere-binding factor 1) acts as a transcription
factor that causes repression by binding specifically to
GTGGGAA motifs in responsive promoters, and it requires
CIR as a co-repressor. CIR binds to histone deacetylase
and to SAP30 and serves as a linker between CBF1 and the
histone deacetylase complex.
Length = 37
Score = 24.4 bits (54), Expect = 6.6
Identities = 8/18 (44%), Positives = 13/18 (72%)
Query: 146 EKKEEREEKKKEEKKKEE 163
++ + E+K +EEKKK E
Sbjct: 11 KRVWKAEQKAEEEKKKIE 28
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 26.7 bits (60), Expect = 6.8
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
EE+ E+ EE + EE+++++ E+K
Sbjct: 64 EEQRERLEELAPELLEEEEEKKEEKKG 90
Score = 26.7 bits (60), Expect = 6.8
Identities = 10/32 (31%), Positives = 16/32 (50%)
Query: 134 LNIYTPLQEEEEEKKEEREEKKKEEKKKEEGE 165
L P EEEE+K+E ++ E+G+
Sbjct: 70 LEELAPELLEEEEEKKEEKKGLPPLPNAEKGK 101
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 25.7 bits (56), Expect = 6.9
Identities = 12/25 (48%), Positives = 18/25 (72%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEE 166
EEE ++ +E EE + EE++ EE EE
Sbjct: 54 EEEVDEDDEEEEGEGEEEEGEEEEE 78
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family. This
family includes the HABP4 family of hyaluronan-binding
proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
HABP4 has been observed to bind hyaluronan (a
glucosaminoglycan), but it is not known whether this is
its primary role in vivo. It has also been observed to
bind RNA, but with a lower affinity than that for
hyaluronan. PAI-1 mRNA-binding protein specifically
binds the mRNA of type-1 plasminogen activator inhibitor
(PAI-1), and is thought to be involved in regulation of
mRNA stability. However, in both cases, the sequence
motifs predicted to be important for ligand binding are
not conserved throughout the family, so it is not known
whether members of this family share a common function.
Length = 106
Score = 25.8 bits (57), Expect = 7.0
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ E EEK+ EE KE +EE E+++
Sbjct: 47 KNEVEEKQAVEEEANKEGVVEEEEVEEEE 75
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 26.9 bits (59), Expect = 7.1
Identities = 9/45 (20%), Positives = 16/45 (35%)
Query: 125 KNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ L L+ +E+ KK + KE+ + EE
Sbjct: 186 LAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYL 230
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 26.3 bits (58), Expect = 7.3
Identities = 12/27 (44%), Positives = 22/27 (81%)
Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
E++ KK++ E+ K+ +KKK+E ++KKK
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKK 167
>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
production and conversion].
Length = 194
Score = 26.2 bits (58), Expect = 7.5
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+ E EE+ EE E+ +EE +K + E K++
Sbjct: 11 ILREAEEEAEEILEEAREEAEKIKEEAKRE 40
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 26.7 bits (60), Expect = 7.5
Identities = 18/30 (60%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
Query: 140 LQEEEEEKKEEREEKKKE-EKKKEEGEEKK 168
L+ E E+K EE E KE EK KEE EEKK
Sbjct: 528 LERELEQKAEEAEALLKEAEKLKEELEEKK 557
>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein. All proteins in this family
for which functions are known are used for the
recognition of DNA damage as part of nucleotide excision
repair. This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 172
Score = 26.3 bits (58), Expect = 7.6
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 141 QEEEEEKKEEREEKKKEEKKK 161
+E +E+KE REE K+E K+K
Sbjct: 93 EEALDEEKERREESKEEMKEK 113
Score = 26.3 bits (58), Expect = 8.1
Identities = 13/21 (61%), Positives = 15/21 (71%)
Query: 142 EEEEEKKEEREEKKKEEKKKE 162
E EE KEE +EKK E+K KE
Sbjct: 101 ERREESKEEMKEKKFEKKLKE 121
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 26.8 bits (59), Expect = 7.7
Identities = 7/29 (24%), Positives = 17/29 (58%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
L ++ + E+ E + +E++K+ G +K
Sbjct: 547 LDADDWTEDEDENEMETDEERKKPGCCRK 575
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 26.7 bits (60), Expect = 7.8
Identities = 13/31 (41%), Positives = 21/31 (67%), Gaps = 2/31 (6%)
Query: 141 QEEEEEKKEEREEKKKE--EKKKEEGEEKKK 169
E+ +KEE ++K E EK++EE E+K+K
Sbjct: 87 LEKRLLQKEENLDRKLELLEKREEELEKKEK 117
>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078). This
family is found from fungi to humans, but its exact
function is not known.
Length = 88
Score = 25.3 bits (56), Expect = 8.0
Identities = 13/29 (44%), Positives = 23/29 (79%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
+E+ EE ++ REE ++E K++EE +EK+K
Sbjct: 46 KEQMEELEKAREETERERKEREERKEKRK 74
Score = 24.9 bits (55), Expect = 9.9
Identities = 9/21 (42%), Positives = 16/21 (76%)
Query: 141 QEEEEEKKEEREEKKKEEKKK 161
++E EE+KE+R+ +E +KK
Sbjct: 63 RKEREERKEKRKRAIEERRKK 83
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 26.3 bits (58), Expect = 8.3
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
E+ EE E + E+ K+E K + E KK
Sbjct: 46 EDSEESNEVKIEELKDENNKLKEENKK 72
>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit. This
family includes the vacuolar ATP synthase E subunit, as
well as the archaebacterial ATP synthase E subunit.
Length = 195
Score = 26.1 bits (58), Expect = 8.4
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
E EEE + E+ E +E +KK E +KK
Sbjct: 14 RAEAEEEFEIEKAEAVEEAEKKIEEIYEKK 43
>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118). This is
the highly conserved family of the major tail subunit
protein.
Length = 121
Score = 25.8 bits (56), Expect = 8.9
Identities = 8/24 (33%), Positives = 18/24 (75%)
Query: 140 LQEEEEEKKEEREEKKKEEKKKEE 163
++EE ++ + ERE++ KE K++ +
Sbjct: 48 MKEERDKMETEREKRDKESKEERD 71
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 26.0 bits (57), Expect = 9.3
Identities = 9/30 (30%), Positives = 19/30 (63%)
Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
L E++E K+ + E++++E K + E +K
Sbjct: 73 KLDEKKEIAKQRKREQREKELAKRQKELEK 102
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 26.3 bits (58), Expect = 9.3
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEE 166
+EEEEE++EE EE + E EE
Sbjct: 403 EEEEEEEEEEEEEAAEAEAPMEEPVP 428
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 26.2 bits (58), Expect = 9.5
Identities = 11/32 (34%), Positives = 20/32 (62%), Gaps = 1/32 (3%)
Query: 136 IYTPLQE-EEEEKKEEREEKKKEEKKKEEGEE 166
+ L+E E E+K+EE EK ++ ++ GE+
Sbjct: 89 LQRKLKELEREQKEEEVREKHNKKIIEKFGED 120
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 26.2 bits (58), Expect = 9.5
Identities = 9/28 (32%), Positives = 16/28 (57%)
Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
EE E+ E K ++KK+++ E +K
Sbjct: 413 EEIREELIEEGLLKSKKKKRKKKEWFEK 440
>gnl|CDD|212464 cd01717, Sm_B, Sm protein B. The eukaryotic Sm proteins (B/B', D1,
D2, D3, E, F and G) assemble into a hetero-heptameric
ring around the Sm site of the 2,2,7-trimethyl guanosine
(m3G) capped U1, U2, U4 and U5 snRNAs (Sm snRNAs)
forming the core of the snRNP particle. The snRNP
particle, in turn, assembles with other components onto
the pre-mRNA to form the spliceosome which is
responsible for the excision of introns and the ligation
of exons. Members of this family share a highly
conserved Sm fold, containing an N-terminal helix
followed by a strongly bent five-stranded antiparallel
beta-sheet.
Length = 80
Score = 24.8 bits (55), Expect = 9.7
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 149 EEREEKKKEEKKKEEGEEK 167
E R+ K K++KK EE EEK
Sbjct: 43 EFRKIKPKKKKKGEEREEK 61
>gnl|CDD|183534 PRK12451, PRK12451, arginyl-tRNA synthetase; Reviewed.
Length = 562
Score = 26.3 bits (58), Expect = 9.7
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 133 YLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEK 167
+Y EE ++ +E EE + KK EEG+E+
Sbjct: 190 LFKLYVQFHEEVKDDEELEEEGRAWFKKLEEGDEE 224
>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
repair].
Length = 480
Score = 26.2 bits (58), Expect = 9.7
Identities = 11/39 (28%), Positives = 23/39 (58%), Gaps = 1/39 (2%)
Query: 132 LYLNIYTP-LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
L++ + + LQ++EE+ R EK+ E+ +K + K +
Sbjct: 269 LWVFLLSYKLQKKEEQTLRTRIEKELEKAEKSLEKLKGR 307
>gnl|CDD|149159 pfam07925, RdRP_5, Reovirus RNA-dependent RNA polymerase lambda 3.
The sequences in this family are similar to the reoviral
minor core protein lambda 3, which functions as a
RNA-dependent RNA polymerase within the protein capsid.
It is organised into 3 domains. N- and C-terminal
domains create a 'cage' that encloses a conserved
central catalytic domain within a hollow centre; this
catalytic domain is arranged to form 'fingers', 'palm'
and 'thumb' subdomains. Unlike other RNA polymerases,
like HIV reverse transcriptase and T7 RNA polymerase,
lambda 3 protein binds template and substrate with only
localised rearrangements, and catalytic activity can
occur with little structural change. However, the
structure of the catalytic complex is similar to that of
other polymerase catalytic complexes with known
structure.
Length = 1271
Score = 26.4 bits (58), Expect = 9.7
Identities = 8/21 (38%), Positives = 13/21 (61%), Gaps = 1/21 (4%)
Query: 64 PPVGQLRFMAPQSPQPWPGLM 84
PPVG+ + P + + WP L+
Sbjct: 802 PPVGK-EYANPSTEEIWPSLL 821
>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
CCCH-type Zn-finger protein [General function prediction
only].
Length = 299
Score = 26.2 bits (57), Expect = 9.8
Identities = 11/45 (24%), Positives = 25/45 (55%)
Query: 124 LKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
LKN++ I ++ +++ E++EK + E+K+ E E++
Sbjct: 32 LKNKNRSTKVQAIIKQIETLNLKEQLEKKEKMRMEEKRREPEKQV 76
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 26.4 bits (58), Expect = 9.8
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEK 167
QE EEE+ E EE+++EE+++EE E +
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESE 463
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 25.4 bits (56), Expect = 10.0
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 144 EEEKKEEREEKKKEEKKKEEGEEKKK 169
EE +K E EEK++EE ++E K K
Sbjct: 100 EENRKLEEEEKEREELEEENDVTKGK 125
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.134 0.388
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,723,369
Number of extensions: 825144
Number of successful extensions: 6520
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5111
Number of HSP's successfully gapped: 929
Length of query: 169
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 79
Effective length of database: 6,945,742
Effective search space: 548713618
Effective search space used: 548713618
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 55 (24.8 bits)