RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16677
(227 letters)
>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
This family contains, amongst other G-protein-coupled
receptors (GCPRs), members of the opsin family, which
have been considered to be typical members of the
rhodopsin superfamily. They share several motifs, mainly
the seven transmembrane helices, GCPRs of the rhodopsin
superfamily. All opsins bind a chromophore, such as
11-cis-retinal. The function of most opsins other than
the photoisomerases is split into two steps: light
absorption and G-protein activation. Photoisomerases, on
the other hand, are not coupled to G-proteins - they are
thought to generate and supply the chromophore that is
used by visual opsins.
Length = 251
Score = 127 bits (322), Expect = 3e-36
Identities = 47/137 (34%), Positives = 68/137 (49%), Gaps = 5/137 (3%)
Query: 56 VSVMRHRKLRIITNYFVVSLAFADMLVAMVAMTFNASVQVTGKWNFSFFMCDVWNSLDVY 115
+ ++R +KLR TN F+++LA AD+L + + V G W F +C + L V
Sbjct: 1 LVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWPFGDALCKLVGFLFVV 60
Query: 116 FSTASILHLCCISVDRYYAIVKPLKYPINMTKRVVAFMLLNVWVSPAVISFVPIMCGWYT 175
ASIL L IS+DRY AIV PL+Y T R ++L VWV ++S P++ W
Sbjct: 61 NGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILVVWVLALLLSLPPLLFSWLR 120
Query: 176 TSEHLKYRYQHPNICDF 192
T E + C
Sbjct: 121 TVE-----EGNVTTCLI 132
>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
protein; Provisional.
Length = 335
Score = 60.6 bits (147), Expect = 6e-11
Identities = 43/185 (23%), Positives = 79/185 (42%), Gaps = 9/185 (4%)
Query: 1 MTDSSNLFDASMPGYNVTDSTSPDENVSSIIWMTLRASIMIFIIVASIGGNLLVIVSVMR 60
T Y D + S+I+ + IF + GN++VI V+
Sbjct: 13 NTTDYYYDTYYDEDYADCDLNIGYDTNSTIL---IVVYSTIFFF--GLVGNIIVIY-VLT 66
Query: 61 HRKLRIITNYFVVSLAFADMLVAMVAMTFNASVQVTGKWNFSFFMCDVWNSLDVYFSTAS 120
K++ + ++++LA +D+L M + F + +W+F F C + + L S
Sbjct: 67 KTKIKTPMDIYLLNLAVSDLLFVMT-LPFQIYYYILFQWSFGEFACKIVSGLYYIGFYNS 125
Query: 121 ILHLCCISVDRYYAIVKPLKYPINMTKRVVAFMLLNVWVSPAVISFVPIMCGWYTTSEH- 179
+ + +SVDRY AIV P+K T + + L +W+ ++I PI+ + T +H
Sbjct: 126 MNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSLVIWII-SIIETTPILFVYTTKKDHE 184
Query: 180 LKYRY 184
Sbjct: 185 TLICC 189
>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
Provisional.
Length = 417
Score = 43.8 bits (103), Expect = 3e-05
Identities = 38/139 (27%), Positives = 69/139 (49%), Gaps = 16/139 (11%)
Query: 39 IMIFIIVASIGGNL--LVIVSVMRHRKLRIITNYFVVSLAFADML--VAMVAMTFNASVQ 94
I IF I+ I G I+ ++ +K++ IT+ ++ +LA +D++ + + +N Q
Sbjct: 101 IKIFYIIIFILGLFGNAAIIMILFCKKIKTITDIYIFNLAISDLIFVIDFPFIIYNEFDQ 160
Query: 95 VTGKWNFSFFMCDVWNSLDVYFSTASILHLCCI-SVDRYYAIVKPLKYP------INMTK 147
W F FMC V S Y S + L + S+DRY+AI+ P+ + I +
Sbjct: 161 ----WIFGDFMCKVI-SASYYIGFFSNMFLITLMSIDRYFAILYPISFQKYRTFNIGIIL 215
Query: 148 RVVAFMLLNVWVSPAVISF 166
+++++L + SPA F
Sbjct: 216 CIISWILSLIITSPAYFIF 234
>gnl|CDD|220695 pfam10328, 7TM_GPCR_Srx, Serpentine type 7TM GPCR chemoreceptor
Srx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srx is part of the Srg
superfamily of chemoreceptors. Chemoperception is one of
the central senses of soil nematodes like C. elegans
which are otherwise 'blind' and 'deaf'.
Length = 275
Score = 41.4 bits (98), Expect = 1e-04
Identities = 24/156 (15%), Positives = 58/156 (37%), Gaps = 7/156 (4%)
Query: 44 IVASIGG---NLLVIVSVMRHRKLRIITNYFVVSLAFADMLVAMVAMTFNASVQVTGKWN 100
+ S+ G N L+ ++ L+ + A ++ ++ + + + + +
Sbjct: 1 FLISLFGIILNWLIFYIFLKLPSLKNSFGILCANKAISNAIICTIFLFYVVPMTLLDLSF 60
Query: 101 FSFFMCDVWNSLDVYFS-TASILHLCCISVDRYYAIVKPLKYPINMTKRVVAFMLLNVWV 159
+ L + S L IS++R+ A+ P KY + + ++ +W+
Sbjct: 61 LPELLNSHIGGLILLGLYEISPLTHLLISLNRFCAVFFPFKYKKIFSIKNTKIIITFIWI 120
Query: 160 SPAVISFVPIM---CGWYTTSEHLKYRYQHPNICDF 192
+IS + C +Y + E L + + C
Sbjct: 121 IAIIISTLFYFPLGCHFYYSPESLTWSFDEDPPCAE 156
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 39.2 bits (92), Expect = 0.001
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 178 EHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
E +K +Y P KKK EEKK K+K KKK ++K KK + +G
Sbjct: 373 EEIKEKYPKP-----PKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 29.6 bits (67), Expect = 1.3
Identities = 13/33 (39%), Positives = 19/33 (57%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
+ K+ EE K++ K KKK EEKK + R +
Sbjct: 367 ELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKK 399
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 35.3 bits (82), Expect = 0.003
Identities = 14/41 (34%), Positives = 22/41 (53%), Gaps = 10/41 (24%)
Query: 187 PNICDFK----------KKKEEEKKEAKKKTKKKEEEKKKK 217
PN FK K+K+EE+KE +K+ +K E + +K
Sbjct: 38 PNSFPFKEEILEEIEEKKRKQEEEKERRKEARKAERAEARK 78
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 36.7 bits (85), Expect = 0.004
Identities = 13/29 (44%), Positives = 22/29 (75%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
KK K+++ ++ K++ KKK+E+KKKK R
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKRHS 170
Score = 36.0 bits (83), Expect = 0.007
Identities = 18/49 (36%), Positives = 27/49 (55%), Gaps = 10/49 (20%)
Query: 179 HLKYRYQHP----------NICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
H K+R Q P + +KK +++K E K+ KKK++EKKKK
Sbjct: 117 HKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKK 165
>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
Srsx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srsx is a solo family
amongst the superfamilies of chemoreceptors.
Chemoperception is one of the central senses of soil
nematodes like C. elegans which are otherwise 'blind'
and 'deaf'.
Length = 257
Score = 36.0 bits (84), Expect = 0.009
Identities = 24/103 (23%), Positives = 42/103 (40%), Gaps = 12/103 (11%)
Query: 44 IVASIGGNLLVIVSVMRHRKLRIITNYFVVSLAFADML-----VAMVAMTFNASVQVTGK 98
V I GN+++I+ + +KLR +Y + AD+L + V + F
Sbjct: 1 SVIGIFGNVIMIILTFKKKKLRSKCSYLICVQCLADLLCLSGEIIFVVLLFT---GTQLT 57
Query: 99 WNFSFFMCDVWNSLDVYFSTASILHLCCISVDRYYAIVKPLKY 141
N F + ++ TA + I +DR A+ P+ Y
Sbjct: 58 RNECFLI----IIPYIFGQTAQSPLMLMIGIDRLIAVKFPIFY 96
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 34.5 bits (80), Expect = 0.010
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+KK+EE+K EA+K K EEK +
Sbjct: 25 EKKQEEKKSEAEKLAKMSAEEKAEY 49
Score = 30.3 bits (69), Expect = 0.34
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
+ K +EK + +KK ++K+ E +K +M
Sbjct: 12 EVDKAIAKEKAKWEKKQEEKKSEAEKLAKM 41
Score = 29.5 bits (67), Expect = 0.56
Identities = 9/31 (29%), Positives = 14/31 (45%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
K EEK E + + +KE E+ + R
Sbjct: 37 KLAKMSAEEKAEYELEKLEKELEELEAELAR 67
Score = 28.8 bits (65), Expect = 1.0
Identities = 8/27 (29%), Positives = 16/27 (59%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKK 217
+ K ++E +A K K K E+K+++
Sbjct: 4 EEKTFTDKEVDKAIAKEKAKWEKKQEE 30
Score = 26.5 bits (59), Expect = 6.7
Identities = 10/28 (35%), Positives = 17/28 (60%), Gaps = 3/28 (10%)
Query: 193 KKKKEEEKKEAK---KKTKKKEEEKKKK 217
++KK E +K AK ++ + E EK +K
Sbjct: 29 EEKKSEAEKLAKMSAEEKAEYELEKLEK 56
Score = 26.5 bits (59), Expect = 7.8
Identities = 6/28 (21%), Positives = 16/28 (57%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
++K E E ++ +K+ ++ E E ++
Sbjct: 44 EEKAEYELEKLEKELEELEAELARRELK 71
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 35.9 bits (82), Expect = 0.014
Identities = 18/51 (35%), Positives = 25/51 (49%)
Query: 177 SEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
+E K + + D KKK EE K+A + KK EE KKK + E +
Sbjct: 1459 AEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAK 1509
Score = 35.5 bits (81), Expect = 0.019
Identities = 15/51 (29%), Positives = 23/51 (45%)
Query: 177 SEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
++ K + + D KKK EE K+ + KK E KKK + E +
Sbjct: 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522
Score = 35.5 bits (81), Expect = 0.024
Identities = 12/34 (35%), Positives = 20/34 (58%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
KK EE++K+A + KK+ EE KK ++ +
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714
Score = 34.0 bits (77), Expect = 0.061
Identities = 15/40 (37%), Positives = 19/40 (47%)
Query: 178 EHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
+ K + + D KKK EE K+ KKK EE KK
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344
Score = 34.0 bits (77), Expect = 0.061
Identities = 13/28 (46%), Positives = 20/28 (71%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
KK+ EE K+A++ KK+ EEKKK ++
Sbjct: 1695 KKEAEEAKKAEELKKKEAEEKKKAEELK 1722
Score = 33.2 bits (75), Expect = 0.11
Identities = 13/40 (32%), Positives = 19/40 (47%)
Query: 177 SEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKK 216
++ K + + D KKK EE K+ + KK KKK
Sbjct: 1380 ADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419
Score = 33.2 bits (75), Expect = 0.11
Identities = 15/28 (53%), Positives = 20/28 (71%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
KKK EEKK+A + KK EE+KKK ++
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELK 1411
Score = 33.2 bits (75), Expect = 0.13
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
KK EE+KK+A++ K +E+EKK ++
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695
Score = 32.8 bits (74), Expect = 0.18
Identities = 14/28 (50%), Positives = 20/28 (71%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
KK EEEKK+ ++ KK+ EEKKK ++
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEELK 1653
Score = 32.4 bits (73), Expect = 0.18
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 177 SEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKK 215
++ K + + D KKK EE K+A++ KK EE KK
Sbjct: 1433 ADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471
Score = 32.4 bits (73), Expect = 0.24
Identities = 14/23 (60%), Positives = 17/23 (73%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKK 215
+KKKEE KK+A KK EE+KK
Sbjct: 1370 EKKKEEAKKKADAAKKKAEEKKK 1392
Score = 32.0 bits (72), Expect = 0.29
Identities = 15/39 (38%), Positives = 21/39 (53%)
Query: 177 SEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKK 215
++ K + + D KKK EE K+A + KK EE KK
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1458
Score = 31.6 bits (71), Expect = 0.33
Identities = 9/26 (34%), Positives = 13/26 (50%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTR 219
KKK EE K+A + K + E +
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAE 1360
Score = 31.3 bits (70), Expect = 0.45
Identities = 14/34 (41%), Positives = 22/34 (64%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
KK+ EE K+ ++ KK EEEKKK ++ E++
Sbjct: 1736 KKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKK 1769
Score = 31.3 bits (70), Expect = 0.47
Identities = 21/46 (45%), Positives = 24/46 (52%), Gaps = 5/46 (10%)
Query: 177 SEHLKYRYQHPNICDFKKKKEEEKK---EAKKKT--KKKEEEKKKK 217
++ LK D KKK EEKK EAKKK KK +E KKK
Sbjct: 1407 ADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKK 1452
Score = 31.3 bits (70), Expect = 0.54
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
KK EEE K + KK EE KKK
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKK 1676
Score = 30.9 bits (69), Expect = 0.61
Identities = 18/30 (60%), Positives = 20/30 (66%), Gaps = 5/30 (16%)
Query: 193 KKKKEEEKK--EAKKKT---KKKEEEKKKK 217
KKK EE+KK EAKKK KKK +E KK
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELKKA 1413
Score = 30.9 bits (69), Expect = 0.66
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
KK EEE K ++ KK+ EE KKK
Sbjct: 1722 KKAEEENKIKAEEAKKEAEEDKKK 1745
Score = 30.9 bits (69), Expect = 0.67
Identities = 8/25 (32%), Positives = 12/25 (48%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
KK E K EA+ + E ++K
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKA 1366
Score = 30.9 bits (69), Expect = 0.67
Identities = 9/26 (34%), Positives = 13/26 (50%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K + E EA+ +K E +KKK
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKE 1374
Score = 30.9 bits (69), Expect = 0.74
Identities = 6/26 (23%), Positives = 12/26 (46%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K + E + + ++K E +KK
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKK 1373
Score = 30.5 bits (68), Expect = 0.84
Identities = 12/26 (46%), Positives = 13/26 (50%), Gaps = 2/26 (7%)
Query: 193 KKKKEEEKKEAKKKT--KKKEEEKKK 216
E E E K + KKKEE KKK
Sbjct: 1354 AAADEAEAAEEKAEAAEKKKEEAKKK 1379
Score = 30.5 bits (68), Expect = 0.87
Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 5/46 (10%)
Query: 177 SEHLKYRYQHPNICDFKKKKEEEKK---EAKKKT--KKKEEEKKKK 217
++ K + + + KKK EE K EAKKK KK +E KKK
Sbjct: 1446 ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491
Score = 30.1 bits (67), Expect = 1.1
Identities = 17/31 (54%), Positives = 20/31 (64%), Gaps = 4/31 (12%)
Query: 191 DFKKKKEEEKK--EAKKKT--KKKEEEKKKK 217
KKK+E +KK AKKK KKK +E KKK
Sbjct: 1369 AEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
Score = 30.1 bits (67), Expect = 1.1
Identities = 15/35 (42%), Positives = 17/35 (48%), Gaps = 6/35 (17%)
Query: 191 DFKKKKEEEKK---EAKKKT---KKKEEEKKKKTR 219
+ KKK EE KK AKKK KK E K +
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353
Score = 30.1 bits (67), Expect = 1.2
Identities = 15/30 (50%), Positives = 19/30 (63%), Gaps = 5/30 (16%)
Query: 193 KKKKEEEKKEAKKK-----TKKKEEEKKKK 217
KKK + KK+A++K KKK EE KKK
Sbjct: 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406
Score = 30.1 bits (67), Expect = 1.2
Identities = 12/32 (37%), Positives = 14/32 (43%), Gaps = 5/32 (15%)
Query: 191 DFKKKKEEEKK---EAKKK--TKKKEEEKKKK 217
+ KKK + KK EAKK K E E
Sbjct: 1326 EAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357
Score = 30.1 bits (67), Expect = 1.3
Identities = 13/33 (39%), Positives = 19/33 (57%)
Query: 195 KKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
KK EE K ++ KK EEEKKK +++ +
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEE 1645
Score = 29.7 bits (66), Expect = 1.4
Identities = 9/25 (36%), Positives = 14/25 (56%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+ E +E + +KK+EE KKK
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKK 1379
Score = 29.7 bits (66), Expect = 1.4
Identities = 16/38 (42%), Positives = 18/38 (47%), Gaps = 6/38 (15%)
Query: 194 KKKEEEKK---EAKKKT---KKKEEEKKKKTRMRGGIE 225
KK KK EAKKK KK +E KKK + E
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADE 1448
Score = 29.7 bits (66), Expect = 1.4
Identities = 18/36 (50%), Positives = 22/36 (61%), Gaps = 5/36 (13%)
Query: 191 DFKKKKEEEKK--EAKKKT---KKKEEEKKKKTRMR 221
+ KKK EE+KK EAKKK KK +E KKK +
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1457
Score = 29.3 bits (65), Expect = 1.8
Identities = 13/31 (41%), Positives = 20/31 (64%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
+ KK +EE K +A ++ KK EE+KKK +
Sbjct: 1651 ELKKAEEENKIKAAEEAKKAEEDKKKAEEAK 1681
Score = 29.3 bits (65), Expect = 1.9
Identities = 13/34 (38%), Positives = 17/34 (50%), Gaps = 2/34 (5%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
KK E KK+A + KK EE KK + E +
Sbjct: 1503 KKAAEAKKKADE--AKKAEEAKKADEAKKAEEAK 1534
Score = 29.3 bits (65), Expect = 2.0
Identities = 15/30 (50%), Positives = 19/30 (63%), Gaps = 5/30 (16%)
Query: 193 KKKKEEEKKEAKKK-----TKKKEEEKKKK 217
KKK +E KK+A++ KKK EE KKK
Sbjct: 1301 KKKADEAKKKAEEAKKADEAKKKAEEAKKK 1330
Score = 29.3 bits (65), Expect = 2.0
Identities = 13/32 (40%), Positives = 17/32 (53%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
KKK +E K+A KK +E KKK + E
Sbjct: 1404 KKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435
Score = 29.3 bits (65), Expect = 2.2
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
+ KK +EE K +A++ K+ EE+KKK +
Sbjct: 1720 ELKKAEEENKIKAEEAKKEAEEDKKKAEEAK 1750
Score = 29.3 bits (65), Expect = 2.2
Identities = 12/24 (50%), Positives = 16/24 (66%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
KKK EE K+A++ KK E KK+
Sbjct: 1674 KKKAEEAKKAEEDEKKAAEALKKE 1697
Score = 29.0 bits (64), Expect = 2.4
Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 5/40 (12%)
Query: 191 DFKKKKEEEKK---EAKKKT--KKKEEEKKKKTRMRGGIE 225
+ KKK EE KK EAKK KKK +E KK + E
Sbjct: 1487 EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526
Score = 29.0 bits (64), Expect = 3.0
Identities = 15/27 (55%), Positives = 20/27 (74%), Gaps = 1/27 (3%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKK 217
+ KKK+ EEKK+A++ KK EEE K K
Sbjct: 1706 ELKKKEAEEKKKAEE-LKKAEEENKIK 1731
Score = 29.0 bits (64), Expect = 3.0
Identities = 16/32 (50%), Positives = 19/32 (59%), Gaps = 5/32 (15%)
Query: 191 DFKKKKEEEKKEAKKKTK-----KKEEEKKKK 217
D KK E KKEA++ K KKE E+KKK
Sbjct: 1686 DEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717
Score = 29.0 bits (64), Expect = 3.0
Identities = 17/28 (60%), Positives = 19/28 (67%), Gaps = 4/28 (14%)
Query: 194 KKKEEEKK--EAKKKT--KKKEEEKKKK 217
KK EE+KK EAKKK KK +E KKK
Sbjct: 1296 KKAEEKKKADEAKKKAEEAKKADEAKKK 1323
Score = 28.2 bits (62), Expect = 4.6
Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 2/39 (5%)
Query: 191 DFKKKKEEEKKEAKKKT--KKKEEEKKKKTRMRGGIERQ 227
+ KK +E +K E KKK KK EE K + ++
Sbjct: 1550 ELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588
Score = 27.8 bits (61), Expect = 5.6
Identities = 16/38 (42%), Positives = 20/38 (52%), Gaps = 3/38 (7%)
Query: 193 KKKKEEEKK-EAKKKTK--KKEEEKKKKTRMRGGIERQ 227
KKK +E KK E KK KK EE KK + E++
Sbjct: 1509 KKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKK 1546
Score = 27.8 bits (61), Expect = 5.9
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
KKK EE KK ++ K EE KK
Sbjct: 1646 KKKAEELKKAEEENKIKAAEEAKK 1669
Score = 27.8 bits (61), Expect = 6.7
Identities = 15/31 (48%), Positives = 17/31 (54%), Gaps = 6/31 (19%)
Query: 193 KKKKEEEKK---EAKKKT---KKKEEEKKKK 217
K K EE K E KKK KK EE++KK
Sbjct: 1660 KIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690
Score = 27.8 bits (61), Expect = 7.1
Identities = 6/25 (24%), Positives = 12/25 (48%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+ KK E +A+ + E E ++
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEE 1364
Score = 27.4 bits (60), Expect = 7.9
Identities = 9/33 (27%), Positives = 17/33 (51%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
++K E +K+ ++ KK + KKK + E
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADE 1395
Score = 27.0 bits (59), Expect = 9.9
Identities = 16/39 (41%), Positives = 19/39 (48%), Gaps = 4/39 (10%)
Query: 193 KKKKEEEKKEAKKKTK----KKEEEKKKKTRMRGGIERQ 227
K K EE KK + K K KK EE+KKK E +
Sbjct: 1606 KMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 34.3 bits (79), Expect = 0.023
Identities = 12/25 (48%), Positives = 19/25 (76%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+KK ++EK K +KKK+++KKKK
Sbjct: 169 EKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 33.9 bits (78), Expect = 0.037
Identities = 14/26 (53%), Positives = 21/26 (80%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTR 219
+ +EEEKKE KKK + K+E+K+KK +
Sbjct: 148 EVEEEEKKEKKKKKEVKKEKKEKKDK 173
Score = 32.0 bits (73), Expect = 0.16
Identities = 13/31 (41%), Positives = 23/31 (74%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
++KKE++KK+ KK KK++++KK+K G
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEPKG 182
Score = 32.0 bits (73), Expect = 0.16
Identities = 11/26 (42%), Positives = 20/26 (76%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K+KK++++K + K KK+++KKKK
Sbjct: 168 KEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 31.6 bits (72), Expect = 0.18
Identities = 14/28 (50%), Positives = 20/28 (71%), Gaps = 1/28 (3%)
Query: 193 KKKKEEEKKEAK-KKTKKKEEEKKKKTR 219
KKKK+E KKE K KK KK++ + K ++
Sbjct: 157 KKKKKEVKKEKKEKKDKKEKMVEPKGSK 184
Score = 31.6 bits (72), Expect = 0.21
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
K +KE E +E +KK KKK++E KK+ +
Sbjct: 140 TAKVEKEAEVEEEEKKEKKKKKEVKKEKK 168
Score = 31.2 bits (71), Expect = 0.31
Identities = 13/27 (48%), Positives = 21/27 (77%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+K+ E E++E K+K KKKE +K+KK +
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEK 170
Score = 30.1 bits (68), Expect = 0.59
Identities = 11/25 (44%), Positives = 20/25 (80%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+ ++EE+K++ KKK KKE+++KK
Sbjct: 148 EVEEEEKKEKKKKKEVKKEKKEKKD 172
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 33.5 bits (77), Expect = 0.027
Identities = 12/26 (46%), Positives = 19/26 (73%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K+KK+++KK+ KKK K ++ KKK
Sbjct: 110 KQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 31.2 bits (71), Expect = 0.19
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKT 218
K+K +++K++ KKK KKK++ KK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAA 131
Score = 31.2 bits (71), Expect = 0.20
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
K K++++KK+ KKK KKK +K K +
Sbjct: 108 KAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 28.9 bits (65), Expect = 1.0
Identities = 13/25 (52%), Positives = 20/25 (80%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKT 218
K+ +EK + KK+ KKK+++KKKKT
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKT 126
Score = 27.7 bits (62), Expect = 2.6
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+ KE+ K++ +KK KKK+++KK
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTS 127
Score = 27.7 bits (62), Expect = 2.6
Identities = 11/25 (44%), Positives = 19/25 (76%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K+ +E+ K+ K+K KKK+++KKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKT 126
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 32.3 bits (74), Expect = 0.028
Identities = 10/22 (45%), Positives = 18/22 (81%)
Query: 195 KKEEEKKEAKKKTKKKEEEKKK 216
K EEEK+E +++ +K+EE K++
Sbjct: 30 KAEEEKREEEEEARKREERKER 51
Score = 30.4 bits (69), Expect = 0.15
Identities = 8/26 (30%), Positives = 19/26 (73%), Gaps = 2/26 (7%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTR 219
K +EE+++E ++ +K EE+K++ +
Sbjct: 30 KAEEEKREEEEE--ARKREERKEREK 53
Score = 28.8 bits (65), Expect = 0.48
Identities = 7/21 (33%), Positives = 16/21 (76%)
Query: 194 KKKEEEKKEAKKKTKKKEEEK 214
K++EEE+ +++ K++E+ K
Sbjct: 35 KREEEEEARKREERKEREKNK 55
Score = 27.7 bits (62), Expect = 1.1
Identities = 13/38 (34%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 191 DFKKKKEEEKKEAKKKT-KKKEEEKKKKTRMRGGIERQ 227
D K + +KKE K + K++EEE+ +K R E+
Sbjct: 17 DLLAKLKAKKKELKAEEEKREEEEEARKREERKEREKN 54
Score = 26.9 bits (60), Expect = 2.3
Identities = 8/21 (38%), Positives = 16/21 (76%)
Query: 195 KKEEEKKEAKKKTKKKEEEKK 215
K+EEE++ K++ +K+ E+ K
Sbjct: 35 KREEEEEARKREERKEREKNK 55
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 33.5 bits (77), Expect = 0.029
Identities = 12/37 (32%), Positives = 27/37 (72%), Gaps = 4/37 (10%)
Query: 191 DFKKKKEEEKKEAKKKTKKK----EEEKKKKTRMRGG 223
+F++K+EE+K++ ++KT KK +++K+KK + +
Sbjct: 67 EFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKA 103
Score = 30.8 bits (70), Expect = 0.30
Identities = 12/36 (33%), Positives = 23/36 (63%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIER 226
+ KK+K+EEK K+ ++K+++KKKK + +
Sbjct: 73 EEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNK 108
Score = 28.9 bits (65), Expect = 1.2
Identities = 12/25 (48%), Positives = 18/25 (72%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
KK+ + +KK+ KKK KKK ++ KK
Sbjct: 85 KKRAKRQKKKQKKKKKKKAKKGNKK 109
Score = 28.5 bits (64), Expect = 1.9
Identities = 13/26 (50%), Positives = 21/26 (80%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
KK+K+++KK+AKK KK+E+E K +
Sbjct: 93 KKQKKKKKKKAKKGNKKEEKEGSKSS 118
Score = 27.3 bits (61), Expect = 4.3
Identities = 13/28 (46%), Positives = 18/28 (64%), Gaps = 3/28 (10%)
Query: 195 KKEEEKKEAKKK---TKKKEEEKKKKTR 219
KKE E +E ++K K+K+EEK K R
Sbjct: 60 KKETEDEEFQQKREEKKRKDEEKTAKKR 87
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 34.8 bits (80), Expect = 0.029
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
KKK E KK+A+ + K K E K K
Sbjct: 165 KKKAAEAKKKAEAEAKAKAEAKAK 188
Score = 34.4 bits (79), Expect = 0.032
Identities = 14/26 (53%), Positives = 17/26 (65%), Gaps = 1/26 (3%)
Query: 193 KKKKEEE-KKEAKKKTKKKEEEKKKK 217
KK+ EEE K +A + KKK E KKK
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKK 174
Score = 34.4 bits (79), Expect = 0.036
Identities = 11/25 (44%), Positives = 13/25 (52%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K E KK+A + KK E E K K
Sbjct: 158 AKAAAEAKKKAAEAKKKAEAEAKAK 182
Score = 33.3 bits (76), Expect = 0.075
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K + E EKK ++ K+ EEE K K
Sbjct: 135 KAEAEAEKKAKEEAKKQAEEEAKAK 159
Score = 32.9 bits (75), Expect = 0.12
Identities = 9/25 (36%), Positives = 16/25 (64%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K+ EE++K+A++ K+ E K K
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAK 135
Score = 32.5 bits (74), Expect = 0.17
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
KK + E K +A+ K K K EE K K
Sbjct: 173 KKAEAEAKAKAEAKAKAKAEEAKAK 197
Score = 31.7 bits (72), Expect = 0.30
Identities = 7/25 (28%), Positives = 16/25 (64%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
++K+ E++ A+K K+ E+ K+
Sbjct: 90 RQKELEQRAAAEKAAKQAEQAAKQA 114
Score = 31.3 bits (71), Expect = 0.31
Identities = 11/27 (40%), Positives = 14/27 (51%), Gaps = 2/27 (7%)
Query: 193 KKKKEEEKKEAKKKT--KKKEEEKKKK 217
K K E + E K K KK+ EE+ K
Sbjct: 132 AKAKAEAEAEKKAKEEAKKQAEEEAKA 158
Score = 30.6 bits (69), Expect = 0.70
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
K+ E+ K+A++K K+ EE K K
Sbjct: 104 AKQAEQAAKQAEEKQKQAEEAKAK 127
Score = 30.2 bits (68), Expect = 0.74
Identities = 14/26 (53%), Positives = 18/26 (69%), Gaps = 1/26 (3%)
Query: 193 KKKKEEEKKEAKKKTKKKE-EEKKKK 217
KK KEE KK+A+++ K K E KKK
Sbjct: 142 KKAKEEAKKQAEEEAKAKAAAEAKKK 167
Score = 30.2 bits (68), Expect = 0.75
Identities = 7/25 (28%), Positives = 14/25 (56%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K+ E+ + ++ +K+ EE K K
Sbjct: 103 AAKQAEQAAKQAEEKQKQAEEAKAK 127
Score = 29.8 bits (67), Expect = 1.0
Identities = 8/25 (32%), Positives = 15/25 (60%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K+ +E++K+ + K K+ E K K
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKA 136
Score = 29.4 bits (66), Expect = 1.3
Identities = 11/27 (40%), Positives = 15/27 (55%), Gaps = 2/27 (7%)
Query: 193 KKKKEEEKKE--AKKKTKKKEEEKKKK 217
K+K+ EE K A + K E E +KK
Sbjct: 117 KQKQAEEAKAKQAAEAKAKAEAEAEKK 143
Score = 29.4 bits (66), Expect = 1.7
Identities = 7/23 (30%), Positives = 15/23 (65%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKK 215
+ E+ K+A++ K+ EE++K
Sbjct: 97 RAAAEKAAKQAEQAAKQAEEKQK 119
Score = 28.3 bits (63), Expect = 3.1
Identities = 9/25 (36%), Positives = 12/25 (48%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K K E + K ++ K K E K K
Sbjct: 180 KAKAEAKAKAKAEEAKAKAEAAKAK 204
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 34.5 bits (80), Expect = 0.035
Identities = 13/44 (29%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 178 EHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
+ L+ + + P+ + K K +K AK+ KK++E++K K + R
Sbjct: 374 DELRPKTKAPS--EKKTGKPSKKVLAKRAEKKEKEKEKPKVKKR 415
Score = 33.8 bits (78), Expect = 0.061
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+KKE+EK++ K K + ++ + K R
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNIGKRR 426
Score = 33.4 bits (77), Expect = 0.082
Identities = 8/25 (32%), Positives = 15/25 (60%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K+ E+K++ K+K K K+ + K
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTK 420
Score = 32.2 bits (74), Expect = 0.21
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
K+ +++EK++ K K KK+ + K + R
Sbjct: 398 KRAEKKEKEKEKPKVKKRHRDTKNIGKRR 426
Score = 31.5 bits (72), Expect = 0.32
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
KK+KE+EK + KK+ + + K++
Sbjct: 402 KKEKEKEKPKVKKRHRDTKNIGKRRK 427
Score = 29.5 bits (67), Expect = 1.4
Identities = 10/35 (28%), Positives = 16/35 (45%)
Query: 187 PNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
P+ K+ E+++KE +K KK K R
Sbjct: 391 PSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 33.1 bits (76), Expect = 0.043
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
KKK + EK E K K + + E+++ + R
Sbjct: 34 KKKLKSEKLEKKAKRQLRAEKRQALEKGR 62
Score = 28.1 bits (63), Expect = 2.2
Identities = 8/29 (27%), Positives = 20/29 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
K K E+ +K+AK++ + ++ + +K R++
Sbjct: 36 KLKSEKLEKKAKRQLRAEKRQALEKGRVK 64
Score = 26.6 bits (59), Expect = 7.9
Identities = 8/30 (26%), Positives = 19/30 (63%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
K KK+ + ++ +KK K++ +K++ +G
Sbjct: 32 KAKKKLKSEKLEKKAKRQLRAEKRQALEKG 61
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 32.7 bits (75), Expect = 0.060
Identities = 11/30 (36%), Positives = 23/30 (76%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
F K+K++ +K+A+++ K+KE E++ + R R
Sbjct: 27 FHKRKQQRRKKAQEEAKEKEREERIEERKR 56
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 32.1 bits (73), Expect = 0.073
Identities = 12/23 (52%), Positives = 17/23 (73%)
Query: 196 KEEEKKEAKKKTKKKEEEKKKKT 218
EEK E K++ KKKEEEK+++
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEE 95
Score = 30.5 bits (69), Expect = 0.22
Identities = 10/21 (47%), Positives = 18/21 (85%)
Query: 193 KKKKEEEKKEAKKKTKKKEEE 213
++K EE+++E KK+ +K+EEE
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEE 95
Score = 28.2 bits (63), Expect = 1.3
Identities = 12/21 (57%), Positives = 17/21 (80%)
Query: 193 KKKKEEEKKEAKKKTKKKEEE 213
++K EEK+E KKK ++KEEE
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEE 94
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 32.7 bits (75), Expect = 0.074
Identities = 10/32 (31%), Positives = 16/32 (50%)
Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
KKKK+++K + K K E+K +K
Sbjct: 87 KKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118
Score = 29.7 bits (67), Expect = 1.0
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K KK+++K + +K +KE E K +
Sbjct: 99 KDKKDDKKDDKSEKKDEKEAEDKLED 124
Score = 29.3 bits (66), Expect = 1.1
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K KK++ KK+ K KK+++K K+
Sbjct: 85 KWKKKKSKKKKDKDKDKKDDKKDDKS 110
Score = 29.3 bits (66), Expect = 1.1
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
+K K ++KK KKK K K+++ KK
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKKD 107
Score = 29.3 bits (66), Expect = 1.4
Identities = 9/28 (32%), Positives = 17/28 (60%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKT 218
D K K+++KK+ K + K ++E + K
Sbjct: 96 DKDKDKKDDKKDDKSEKKDEKEAEDKLE 123
Score = 28.9 bits (65), Expect = 1.4
Identities = 12/30 (40%), Positives = 18/30 (60%)
Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
I KK+ EE++K KK K K+++ K K
Sbjct: 70 EIEKVKKEYEEKQKWKWKKKKSKKKKDKDK 99
Score = 28.9 bits (65), Expect = 1.6
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+K+K + KK+ KK K K+++KK
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDD 104
Score = 28.5 bits (64), Expect = 1.9
Identities = 7/27 (25%), Positives = 17/27 (62%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
K K +++K + K +K++EK+ + +
Sbjct: 95 KDKDKDKKDDKKDDKSEKKDEKEAEDK 121
Score = 28.5 bits (64), Expect = 2.4
Identities = 8/27 (29%), Positives = 16/27 (59%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
KK +++ K KK K+ E++ + T+
Sbjct: 101 KKDDKKDDKSEKKDEKEAEDKLEDLTK 127
Score = 27.3 bits (61), Expect = 5.5
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K+K + +KK++KKK K +++K K
Sbjct: 81 KQKWKWKKKKSKKKKDKDKDKKDDK 105
Score = 27.3 bits (61), Expect = 5.5
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K K+++ K+ K K K K+++KK
Sbjct: 84 WKWKKKKSKKKKDKDKDKKDDKKDDK 109
Score = 27.0 bits (60), Expect = 5.9
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
KKKK+E +E +K K+ EE++K K +
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQKWKWKK 88
Score = 26.6 bits (59), Expect = 9.2
Identities = 9/27 (33%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Query: 193 KKKKEEEKKE-AKKKTKKKEEEKKKKT 218
++ E+ KKE +K+ K +++K KK
Sbjct: 68 AEEIEKVKKEYEEKQKWKWKKKKSKKK 94
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 33.4 bits (77), Expect = 0.087
Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 2/32 (6%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
D K +EE KE ++ KK E E+ RMRG
Sbjct: 255 DRTKTEEELAKEEAERLKKLEAERL--RRMRG 284
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 32.1 bits (74), Expect = 0.098
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K +K+ EK +AK + KK ++E+ K
Sbjct: 73 KAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 30.2 bits (69), Expect = 0.47
Identities = 9/28 (32%), Positives = 16/28 (57%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
K ++ EKK K K K ++++ KK+
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKKKAKKEEP 95
Score = 29.0 bits (66), Expect = 1.2
Identities = 10/31 (32%), Positives = 17/31 (54%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
K E+ +K+A+K K E++K KK +
Sbjct: 66 KELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 28.6 bits (65), Expect = 1.5
Identities = 9/31 (29%), Positives = 15/31 (48%)
Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
++E K +K KK E+ K K + + E
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95
Score = 28.3 bits (64), Expect = 2.2
Identities = 16/41 (39%), Positives = 21/41 (51%), Gaps = 2/41 (4%)
Query: 178 EHLKYRYQHPNICDFKKKKEEEKKEAKK-KTKKKEEEKKKK 217
+ LK + + D K+ K EK E K K K K E+KK K
Sbjct: 52 KDLKESLEA-ALLDKKELKAWEKAEKKAEKAKAKAEKKKAK 91
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 32.9 bits (75), Expect = 0.12
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
KK+ EE K+A K K+ EE K
Sbjct: 117 KKQAEEAAKQAALKQKQAEEAAAK 140
Score = 32.1 bits (73), Expect = 0.24
Identities = 12/25 (48%), Positives = 12/25 (48%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K K E E K A KK E KKK
Sbjct: 146 KAKAEAEAKRAAAAAKKAAAEAKKK 170
Score = 28.6 bits (64), Expect = 2.5
Identities = 10/25 (40%), Positives = 14/25 (56%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
KK E KK+A+ + KK + KK
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKK 185
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 32.8 bits (75), Expect = 0.13
Identities = 13/33 (39%), Positives = 20/33 (60%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
K K E +KKE + ++K++EKK K+ RG
Sbjct: 363 KAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 31.5 bits (72), Expect = 0.14
Identities = 7/28 (25%), Positives = 18/28 (64%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
KK+ + +++TK++ E++K+ + R
Sbjct: 90 KKRRIALRLRRERTKERAEKEKRTRKNR 117
Score = 30.7 bits (70), Expect = 0.29
Identities = 8/27 (29%), Positives = 16/27 (59%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+++ +++ K+K +K EKK K R
Sbjct: 98 LRRERTKERAEKEKRTRKNREKKFKRR 124
Score = 30.3 bits (69), Expect = 0.32
Identities = 8/31 (25%), Positives = 16/31 (51%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
+ + E KE +K K+ + ++KK + R
Sbjct: 96 LRLRRERTKERAEKEKRTRKNREKKFKRRQK 126
Score = 27.6 bits (62), Expect = 3.0
Identities = 9/30 (30%), Positives = 18/30 (60%), Gaps = 6/30 (20%)
Query: 193 KKKKEEE------KKEAKKKTKKKEEEKKK 216
++ KE +K +KK K++++EK+K
Sbjct: 101 ERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 32.2 bits (74), Expect = 0.15
Identities = 15/38 (39%), Positives = 19/38 (50%), Gaps = 4/38 (10%)
Query: 190 CDFKKKKE--EEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
CDF + EE+KE KK K EEKK + +E
Sbjct: 85 CDFTPIYQYFEEEKEKKKAMSK--EEKKAIKEEKEKLE 120
>gnl|CDD|215825 pfam00262, Calreticulin, Calreticulin family.
Length = 359
Score = 32.3 bits (74), Expect = 0.16
Identities = 22/104 (21%), Positives = 36/104 (34%), Gaps = 27/104 (25%)
Query: 131 RYYAIVKPLKYPINMTKR--VVAFML-------------------LNVWV----SPAVIS 165
++YAI P + + VV + + + +P I
Sbjct: 54 KFYAISAKFPKPFSNKGKTLVVQYEVKLQQGIDCGGAYIKLLSKDFDQKDFSGETPYTIM 113
Query: 166 FVPIMCGWYTTSEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKK 209
F P +CG T H +R H I +K++K KK + K
Sbjct: 114 FGPDICGSDTKKVHFIFR--HKPITGYKEEKHLIKKPISRCDKL 155
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 30.7 bits (70), Expect = 0.18
Identities = 8/25 (32%), Positives = 19/25 (76%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K++KEEEK + +++ K+ + K+++
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREE 25
Score = 30.3 bits (69), Expect = 0.23
Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 2/34 (5%)
Query: 193 KKKKEEEKKEAKKKTKKKE-EEKKKKTRMRGGIE 225
K ++EEE K K K++E EEK +K + G+
Sbjct: 8 KAQREEELKR-LKNLKREEIEEKLEKIKKVAGLR 40
Score = 26.1 bits (58), Expect = 7.3
Identities = 6/24 (25%), Positives = 15/24 (62%)
Query: 198 EEKKEAKKKTKKKEEEKKKKTRMR 221
+E+KE +K +++E ++ K +
Sbjct: 1 KERKEEEKAQREEELKRLKNLKRE 24
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 31.8 bits (73), Expect = 0.21
Identities = 13/32 (40%), Positives = 19/32 (59%), Gaps = 3/32 (9%)
Query: 192 FKKKKEEEKKEA---KKKTKKKEEEKKKKTRM 220
K EEE++E KK+ KKKEE + K ++
Sbjct: 272 ILKAAEEERQEEAQEKKEEKKKEEREAKLAKL 303
Score = 30.3 bits (69), Expect = 0.72
Identities = 13/37 (35%), Positives = 22/37 (59%), Gaps = 8/37 (21%)
Query: 193 KKKKEEEKKEAKKKTK--------KKEEEKKKKTRMR 221
++KKEE+KKE ++ +K EEK++K + R
Sbjct: 285 QEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQAR 321
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 32.1 bits (74), Expect = 0.22
Identities = 6/26 (23%), Positives = 14/26 (53%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKK 217
K + +KK+ K++ K + + K +
Sbjct: 66 LKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 27.9 bits (63), Expect = 4.1
Identities = 6/26 (23%), Positives = 12/26 (46%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K + +K+ +K+ K + K K
Sbjct: 66 LKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 27.5 bits (62), Expect = 5.8
Identities = 12/44 (27%), Positives = 23/44 (52%), Gaps = 2/44 (4%)
Query: 178 EHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
+ +K + + D K+ K K + KKK K++ + K K++ R
Sbjct: 50 KEMKEELKA-ALLDKKELKAWHKAQ-KKKEKQEAKAAKAKSKPR 91
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 31.5 bits (72), Expect = 0.23
Identities = 7/27 (25%), Positives = 19/27 (70%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+++K E++K KK+ +K+ ++K+ +
Sbjct: 11 RRRKREQRKARKKQKRKEAKKKEDAQK 37
Score = 29.6 bits (67), Expect = 1.0
Identities = 12/28 (42%), Positives = 19/28 (67%), Gaps = 3/28 (10%)
Query: 194 KKKEE---EKKEAKKKTKKKEEEKKKKT 218
KK+EE ++K+ KK KKK+ +KK +
Sbjct: 178 KKREENLKKRKDDKKNKKKKKAKKKGRI 205
Score = 28.0 bits (63), Expect = 3.5
Identities = 12/28 (42%), Positives = 23/28 (82%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
K+KE++KK++KK+ K+++E+ +KK R
Sbjct: 149 KRKEKQKKKSKKEWKERKEKVEKKKAER 176
Score = 27.3 bits (61), Expect = 5.1
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKK 215
K +K++++KEAKKK ++ E +
Sbjct: 19 KARKKQKRKEAKKKEDAQKSEAE 41
Score = 27.3 bits (61), Expect = 5.7
Identities = 13/25 (52%), Positives = 20/25 (80%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
KKK ++E KE K+K +KK+ E++KK
Sbjct: 155 KKKSKKEWKERKEKVEKKKAERQKK 179
Score = 26.5 bits (59), Expect = 9.4
Identities = 11/29 (37%), Positives = 16/29 (55%), Gaps = 1/29 (3%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
+KKK EE E K + +E+EK K +
Sbjct: 104 RKKKLEELDE-DKAAEIEEKEKWTKALAK 131
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 32.2 bits (74), Expect = 0.23
Identities = 10/25 (40%), Positives = 14/25 (56%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K +KEE +K A KK + +K K
Sbjct: 424 KAEKEEAEKAAAKKKAEAAAKKAKG 448
Score = 30.7 bits (70), Expect = 0.68
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
KK ++EE ++A K K + KK K
Sbjct: 423 KKAEKEEAEKAAAKKKAEAAAKKAK 447
Score = 30.7 bits (70), Expect = 0.68
Identities = 11/25 (44%), Positives = 17/25 (68%), Gaps = 1/25 (4%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
KK+++ EKK A+K+ +K KKK
Sbjct: 416 KKQRKAEKK-AEKEEAEKAAAKKKA 439
Score = 29.1 bits (66), Expect = 2.2
Identities = 9/25 (36%), Positives = 18/25 (72%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+KK +++++A+KK +K+E EK
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAA 435
Score = 28.0 bits (63), Expect = 4.6
Identities = 8/25 (32%), Positives = 13/25 (52%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+K ++K EA K K + + KK
Sbjct: 431 EKAAAKKKAEAAAKKAKGPDGETKK 455
Score = 27.6 bits (62), Expect = 5.8
Identities = 13/36 (36%), Positives = 18/36 (50%), Gaps = 3/36 (8%)
Query: 193 KKKKEE---EKKEAKKKTKKKEEEKKKKTRMRGGIE 225
K++K E EK+EA+K KK+ E K E
Sbjct: 417 KQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE 452
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 32.2 bits (74), Expect = 0.23
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
K KK EK E K++ +KKE++KK +
Sbjct: 411 KIKKIVEKAEKKREEEKKEKKKKAFAGKK 439
Score = 31.4 bits (72), Expect = 0.36
Identities = 10/34 (29%), Positives = 21/34 (61%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIER 226
KK++EE+K++ KK K++E++++ E
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Score = 31.0 bits (71), Expect = 0.53
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+K E++++E KK+ KKK KKK
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKK 440
Score = 31.0 bits (71), Expect = 0.55
Identities = 13/35 (37%), Positives = 23/35 (65%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
KK+K+++ KKK +++EEEK+KK + E +
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Score = 30.7 bits (70), Expect = 0.60
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+K +++ ++E K+K KK KKK+
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEE 443
Score = 29.9 bits (68), Expect = 1.2
Identities = 11/25 (44%), Positives = 20/25 (80%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+KK+EEEKKE KKK +++++++
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEE 444
Score = 29.5 bits (67), Expect = 1.4
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
+KK EE+K+ KKK ++K+++
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEE 444
Score = 28.7 bits (65), Expect = 2.5
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
+K E+K+E +KK KKK+ KK
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKK 440
Score = 28.0 bits (63), Expect = 5.0
Identities = 10/30 (33%), Positives = 20/30 (66%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
KK ++ ++A+KK +++++EKKKK
Sbjct: 409 TKKIKKIVEKAEKKREEEKKEKKKKAFAGK 438
Score = 28.0 bits (63), Expect = 5.1
Identities = 9/33 (27%), Positives = 19/33 (57%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
KK ++ E +K +++E+++KKK G +
Sbjct: 408 ATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKK 440
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 31.4 bits (72), Expect = 0.26
Identities = 15/50 (30%), Positives = 20/50 (40%), Gaps = 9/50 (18%)
Query: 183 RYQHPNI-----CDFKKKKE--EEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
+ I CDF E E +KE KK +EEKK + +E
Sbjct: 74 KEGGAIIKDLEKCDFTPIYEYFEAEKEKKKA--MSKEEKKAIKEEKDKLE 121
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 30.7 bits (70), Expect = 0.26
Identities = 13/34 (38%), Positives = 26/34 (76%)
Query: 182 YRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKK 215
YR + I D ++K+EE+++E ++K +K+E+EK+
Sbjct: 4 YRDKSGRIIDIEEKREEKEREKEEKERKEEKEKE 37
Score = 27.7 bits (62), Expect = 3.3
Identities = 7/28 (25%), Positives = 19/28 (67%)
Query: 196 KEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
EE+++E +++ ++KE +++K+ G
Sbjct: 14 IEEKREEKEREKEEKERKEEKEKEWGKG 41
Score = 26.9 bits (60), Expect = 5.5
Identities = 10/33 (30%), Positives = 20/33 (60%), Gaps = 4/33 (12%)
Query: 193 KKKKEEEKKEAKKKTKK----KEEEKKKKTRMR 221
+K+++E K+E +K+ K KEE +K+ +
Sbjct: 24 EKEEKERKEEKEKEWGKGLVQKEEREKRLEELE 56
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 30.1 bits (68), Expect = 0.27
Identities = 12/35 (34%), Positives = 18/35 (51%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
KKKK++ K + + T+K+EEEK E
Sbjct: 24 KKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEED 58
Score = 29.3 bits (66), Expect = 0.42
Identities = 12/28 (42%), Positives = 17/28 (60%), Gaps = 2/28 (7%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
K KK + KK KKK KKK+ + K++
Sbjct: 12 LKGKKIDVKK--KKKKKKKKNKSKEEVV 37
Score = 28.2 bits (63), Expect = 1.2
Identities = 10/26 (38%), Positives = 19/26 (73%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
KKKK+++KK K+ E+E+++K+
Sbjct: 21 KKKKKKKKKNKSKEEVVTEKEEEEKS 46
Score = 26.2 bits (58), Expect = 5.9
Identities = 11/25 (44%), Positives = 15/25 (60%), Gaps = 3/25 (12%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
KKKK KK+ KK K+E +K+
Sbjct: 20 KKKK---KKKKKKNKSKEEVVTEKE 41
>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
Length = 323
Score = 31.4 bits (71), Expect = 0.29
Identities = 39/157 (24%), Positives = 77/157 (49%), Gaps = 18/157 (11%)
Query: 21 TSPDENVSSIIWMTLRASIMIFIIVASIGGNLLVIVSVMRHRKLRIITNYFVVSLAFADM 80
+ D N I++ L + IF ++ GN+LVI +V+ ++ + + ++ ++A +D+
Sbjct: 23 VNSDVNYFVIVFYIL---LFIFGLI----GNVLVI-AVLIVKRFMFVVDVYLFNIAMSDL 74
Query: 81 LVAMVAMTFNASVQV-TGKWNFSFFMCDVWNSLDVYFST--ASILHLCCISVDRYYAIVK 137
M+ +F + +W F FMC + L VYF +++ + IS+DRY +V
Sbjct: 75 ---MLVFSFPFIIHNDLNEWIFGEFMCKL--VLGVYFVGFFSNMFFVTLISIDRYILVVN 129
Query: 138 PLKYPINMTKRVVAFMLLNVWVSPAVISFVPIMCGWY 174
K N + + + + WV ++S +P M +Y
Sbjct: 130 ATKIK-NKSISLSVLLSVAAWVCSVILS-MPAMVLYY 164
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 31.2 bits (71), Expect = 0.30
Identities = 13/34 (38%), Positives = 19/34 (55%), Gaps = 6/34 (17%)
Query: 190 CDFK------KKKEEEKKEAKKKTKKKEEEKKKK 217
CDF K ++EEKK K+ KK + +K+K
Sbjct: 85 CDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEK 118
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 31.8 bits (72), Expect = 0.31
Identities = 9/35 (25%), Positives = 19/35 (54%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
K K +E K K ++KE+E+ K+ + + + +
Sbjct: 91 KTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125
Score = 30.6 bits (69), Expect = 0.59
Identities = 14/27 (51%), Positives = 18/27 (66%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
KKKK+E+ KE K K KEE K+K+
Sbjct: 117 KKKKKEKPKEEPKDRKPKEEAKEKRPP 143
Score = 30.6 bits (69), Expect = 0.67
Identities = 12/26 (46%), Positives = 19/26 (73%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K+EE++KE K+ KKK++EK K+
Sbjct: 102 SGKEEEKEKEQVKEEKKKKKEKPKEE 127
Score = 30.6 bits (69), Expect = 0.74
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
++K++E+ KE KKK K+K +E+ K +
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRK 132
Score = 29.9 bits (67), Expect = 1.1
Identities = 11/29 (37%), Positives = 23/29 (79%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
K++E+EK++ K++ KKK+E+ K++ + R
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDR 131
Score = 29.9 bits (67), Expect = 1.3
Identities = 8/35 (22%), Positives = 21/35 (60%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
K +E K E+ K+ +K++E+ K++ + + ++
Sbjct: 92 TKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKE 126
Score = 29.5 bits (66), Expect = 1.5
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
+K +EE KE + +K++E++KK
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVE 155
Score = 29.1 bits (65), Expect = 2.0
Identities = 10/25 (40%), Positives = 20/25 (80%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K++ +EEKK+ K+K K++ +++K K
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPK 134
Score = 29.1 bits (65), Expect = 2.0
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+K+KE+ K+E KKK +K +EE K +
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKP 133
Score = 29.1 bits (65), Expect = 2.0
Identities = 11/26 (42%), Positives = 19/26 (73%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K +K +E+ + K+ K+KE+EK+KK
Sbjct: 129 KDRKPKEEAKEKRPPKEKEKEKEKKV 154
Score = 29.1 bits (65), Expect = 2.3
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
KK +KK KK + EEEK+++
Sbjct: 175 PKKPPKKKPPNKKKEPPEEEKQRQAARE 202
Score = 28.7 bits (64), Expect = 2.4
Identities = 11/29 (37%), Positives = 20/29 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
K++E++KE K + + EE+KK+ R+R
Sbjct: 142 PPKEKEKEKEKKVEEPRDREEEKKRERVR 170
Score = 28.3 bits (63), Expect = 3.6
Identities = 8/35 (22%), Positives = 20/35 (57%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
KKK++ +++ +K K++ +EK+ E++
Sbjct: 119 KKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153
Score = 28.3 bits (63), Expect = 4.1
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+KKK++EK + + K +K +EE K+K
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRP 142
Score = 27.9 bits (62), Expect = 4.5
Identities = 10/27 (37%), Positives = 13/27 (48%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
KK++ K KK KK+ KKK
Sbjct: 164 KKRERVRAKSRPKKPPKKKPPNKKKEP 190
Score = 27.9 bits (62), Expect = 4.7
Identities = 7/29 (24%), Positives = 22/29 (75%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+ K + +E+++ K++ K+++++KK+K +
Sbjct: 97 EPKNESGKEEEKEKEQVKEEKKKKKEKPK 125
Score = 27.9 bits (62), Expect = 5.0
Identities = 9/30 (30%), Positives = 15/30 (50%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
+ K KK KKK K++E ++ + R
Sbjct: 169 VRAKSRPKKPPKKKPPNKKKEPPEEEKQRQ 198
Score = 27.5 bits (61), Expect = 6.2
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
KK ++K KKK +EE++++ R
Sbjct: 175 PKKPPKKKPPNKKKEPPEEEKQRQAAR 201
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 31.2 bits (71), Expect = 0.33
Identities = 10/29 (34%), Positives = 14/29 (48%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
KKKK+++ AK KKK R+
Sbjct: 99 TKKKKKKDPTAAKSPKAAAPRPKKKSERI 127
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 30.3 bits (69), Expect = 0.34
Identities = 11/26 (42%), Positives = 19/26 (73%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKK 217
F KK E+E++E +K+ +K+ E+ KK
Sbjct: 77 FWKKNEKEERELRKRAEKEALEQAKK 102
>gnl|CDD|235898 PRK06964, PRK06964, DNA polymerase III subunit delta'; Validated.
Length = 342
Score = 31.3 bits (71), Expect = 0.35
Identities = 15/51 (29%), Positives = 19/51 (37%), Gaps = 6/51 (11%)
Query: 171 CGWYTTSEHLKYRYQHPNICD------FKKKKEEEKKEAKKKTKKKEEEKK 215
C W+ H YR P + KE + E KKTK +E K
Sbjct: 63 CNWFAQGNHPDYRIVRPEALAAEAPGAADEAKEADADEGGKKTKAPSKEIK 113
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 31.2 bits (71), Expect = 0.36
Identities = 11/25 (44%), Positives = 23/25 (92%), Gaps = 1/25 (4%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
++K+EEE+++ +KK +++EEE+K+K
Sbjct: 196 QQKREEERRKQRKK-QQEEEERKQK 219
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 30.8 bits (70), Expect = 0.44
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKK 217
D +K++E+ K AK K KK + K ++
Sbjct: 42 DEEKEEEKAKVAAKAKAKKALKAKIEE 68
Score = 30.4 bits (69), Expect = 0.63
Identities = 13/27 (48%), Positives = 16/27 (59%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+K K K +AKK K K EEK+K R
Sbjct: 48 EKAKVAAKAKAKKALKAKIEEKEKAKR 74
Score = 30.4 bits (69), Expect = 0.69
Identities = 11/31 (35%), Positives = 13/31 (41%), Gaps = 4/31 (12%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
K K E+ K KKK K K K + G
Sbjct: 194 KLKAEKAAKGGKKKKGK----AKAKLNVGGA 220
Score = 30.0 bits (68), Expect = 0.91
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
K K+ K + ++K K K E+++K R
Sbjct: 56 AKAKKALKAKIEEKEKAKREKEEKGLR 82
Score = 29.6 bits (67), Expect = 0.98
Identities = 10/30 (33%), Positives = 15/30 (50%), Gaps = 3/30 (10%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
+K + +K AK KKK K K ++ G
Sbjct: 192 NEKLKAEKAAKGGKKKK---GKAKAKLNVG 218
Score = 29.6 bits (67), Expect = 1.1
Identities = 9/29 (31%), Positives = 17/29 (58%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
++K+EE+ K A K KK + K + + +
Sbjct: 43 EEKEEEKAKVAAKAKAKKALKAKIEEKEK 71
Score = 28.9 bits (65), Expect = 1.9
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
KK + K E K+K K+++EEK +
Sbjct: 58 AKKALKAKIEEKEKAKREKEEKGLREL 84
Score = 28.5 bits (64), Expect = 2.3
Identities = 9/28 (32%), Positives = 16/28 (57%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
K KK + K +K+ K+E+E+K +
Sbjct: 57 KAKKALKAKIEEKEKAKREKEEKGLREL 84
Score = 28.5 bits (64), Expect = 2.8
Identities = 9/31 (29%), Positives = 17/31 (54%)
Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
EEE +E +++ K + K K ++ IE +
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEK 69
Score = 28.1 bits (63), Expect = 3.3
Identities = 10/34 (29%), Positives = 21/34 (61%), Gaps = 6/34 (17%)
Query: 194 KKKEEEKKEAKKKTKKK------EEEKKKKTRMR 221
++KE+ K+E ++K ++ E+E +K R+R
Sbjct: 67 EEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100
Score = 26.9 bits (60), Expect = 8.2
Identities = 6/28 (21%), Positives = 15/28 (53%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
K K + +++ K K +K+E+ ++
Sbjct: 60 KALKAKIEEKEKAKREKEEKGLRELEED 87
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 30.5 bits (69), Expect = 0.45
Identities = 13/25 (52%), Positives = 19/25 (76%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+ KE E+KEA K K+K+E+K+KK
Sbjct: 101 EVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 30.1 bits (68), Expect = 0.49
Identities = 8/27 (29%), Positives = 18/27 (66%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+K+ +E +E + KKE+++KK+ +
Sbjct: 99 QKEVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 29.0 bits (65), Expect = 1.3
Identities = 8/28 (28%), Positives = 19/28 (67%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKT 218
+ +K+++E KE ++K K +++KK+
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEK 121
Score = 28.2 bits (63), Expect = 2.6
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
F ++ E+E+KE K+T++KE K KK +
Sbjct: 91 FPEESEKEQKEVSKETEEKEAIKAKKEKK 119
Score = 27.0 bits (60), Expect = 5.2
Identities = 9/25 (36%), Positives = 18/25 (72%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
++K+ ++ E K+ K K+E+K+KK
Sbjct: 98 EQKEVSKETEEKEAIKAKKEKKEKK 122
Score = 26.6 bits (59), Expect = 7.8
Identities = 12/28 (42%), Positives = 17/28 (60%), Gaps = 2/28 (7%)
Query: 193 KKKKEEEKKEAK--KKTKKKEEEKKKKT 218
KK+++EKKE K +K KK+ K K
Sbjct: 114 AKKEKKEKKEKKVAEKLAKKKSTKTTKN 141
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 30.3 bits (68), Expect = 0.45
Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 3/33 (9%)
Query: 185 QHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
Q KKK+E+ E K K+KKKEE+ K
Sbjct: 21 QSQK---SDKKKKEKVSEKKGKSKKKEEKPNGK 50
Score = 28.8 bits (64), Expect = 1.3
Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 3/32 (9%)
Query: 193 KKKKEEEKKEAKKKTKKKEE---EKKKKTRMR 221
K++KE K +++K KKK+E EKK K++ +
Sbjct: 12 KQRKESGKTQSQKSDKKKKEKVSEKKGKSKKK 43
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 29.9 bits (68), Expect = 0.49
Identities = 14/30 (46%), Positives = 23/30 (76%), Gaps = 1/30 (3%)
Query: 193 KKKKEEEKKEAKKKTK-KKEEEKKKKTRMR 221
K + E K+E KK++K KKEEEK++K +++
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEKERKRQLK 121
Score = 28.8 bits (65), Expect = 1.1
Identities = 14/30 (46%), Positives = 24/30 (80%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
KKK+ +EKKE +K+ K++ +++KKK + RG
Sbjct: 102 KKKRSKEKKEEEKERKRQLKQQKKKAKHRG 131
>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
This family consists of several eukaryotic ribosome
biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
protein that is essential for the maturation of 25 S
rRNA and the 60 S ribosomal subunit assembly in
Saccharomyces cerevisiae.
Length = 164
Score = 30.3 bits (69), Expect = 0.54
Identities = 12/26 (46%), Positives = 15/26 (57%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKK 216
D KK EKKE K +K+E + KK
Sbjct: 139 DPFAKKRREKKERVAKNEKRELKNKK 164
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 30.9 bits (70), Expect = 0.60
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKK 216
K+K EE +E K ++K E K
Sbjct: 52 KRKREELREKIAKAREKRERNSK 74
Score = 27.4 bits (61), Expect = 8.2
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K KK ++KKEA++K +EK+K+
Sbjct: 97 KSKKRQKKKEAERKKALLLDEKEKER 122
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 30.5 bits (69), Expect = 0.61
Identities = 12/26 (46%), Positives = 18/26 (69%), Gaps = 1/26 (3%)
Query: 193 KKKKE-EEKKEAKKKTKKKEEEKKKK 217
K KKE E+KE +++ K EEEK ++
Sbjct: 153 KIKKERAEEKEREEEEKAAEEEKARE 178
>gnl|CDD|227008 COG4662, TupA, ABC-type tungstate transport system, periplasmic
component [Coenzyme metabolism].
Length = 227
Score = 30.5 bits (69), Expect = 0.62
Identities = 18/83 (21%), Positives = 34/83 (40%)
Query: 21 TSPDENVSSIIWMTLRASIMIFIIVASIGGNLLVIVSVMRHRKLRIITNYFVVSLAFADM 80
S D + I+ TL S++ + A +G L ++++ R R I L+ +
Sbjct: 15 FSLDPELIGIVATTLYVSLISIFLAALVGVPLAFVLALREFRGKRFIKMIINTLLSMPTV 74
Query: 81 LVAMVAMTFNASVQVTGKWNFSF 103
LV ++ + G +N F
Sbjct: 75 LVGLLLYLLLSRSGPLGWFNLLF 97
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 30.9 bits (70), Expect = 0.62
Identities = 14/31 (45%), Positives = 22/31 (70%), Gaps = 4/31 (12%)
Query: 193 KKKKEEEKKEAKK----KTKKKEEEKKKKTR 219
++KKEEE+K K ++++EEKKKKT+
Sbjct: 45 EEKKEEEEKTTDKEEEVDEEEEKEEKKKKTK 75
Score = 29.4 bits (66), Expect = 1.6
Identities = 12/19 (63%), Positives = 14/19 (73%)
Query: 197 EEEKKEAKKKTKKKEEEKK 215
EEEK+E KKKTKK +E
Sbjct: 64 EEEKEEKKKKTKKVKETTT 82
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 30.6 bits (69), Expect = 0.67
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K KK + KK+ KK KK+ +E K
Sbjct: 145 KAKKGKAKKKTKKSKKKEAKESSDK 169
Score = 28.3 bits (63), Expect = 3.2
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
K K+ + K+ KK+KKKE ++
Sbjct: 144 TKAKKGKAKKKTKKSKKKEAKESSDKD 170
Score = 27.1 bits (60), Expect = 8.3
Identities = 8/27 (29%), Positives = 14/27 (51%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKT 218
+ K ++ K + K K KK+E K+
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSD 168
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 30.8 bits (69), Expect = 0.69
Identities = 12/25 (48%), Positives = 21/25 (84%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
KK K+ +KKE K+K K+++++KKK+
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKE 221
Score = 30.4 bits (68), Expect = 0.79
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
+KK ++ K+ +KK K+KE +K KK + G
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGF 225
Score = 30.4 bits (68), Expect = 0.83
Identities = 19/83 (22%), Positives = 31/83 (37%), Gaps = 2/83 (2%)
Query: 147 KRVVAFMLLNVWV--SPAVISFVPIMCGWYTTSEHLKYRYQHPNICDFKKKKEEEKKEAK 204
K V F L + + SPA + V + + EE KK K
Sbjct: 220 KEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPK 279
Query: 205 KKTKKKEEEKKKKTRMRGGIERQ 227
K KK+ +EK++K + + +
Sbjct: 280 HKKKKQRKEKEEKKKKKKHHHHR 302
Score = 29.3 bits (65), Expect = 2.2
Identities = 10/32 (31%), Positives = 20/32 (62%)
Query: 196 KEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
K EE++ +++ +K + EKKK+ + + G R
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRRH 114
Score = 28.9 bits (64), Expect = 2.4
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 195 KKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
K EEE++ ++ K K E+KK++ RG
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRG 110
Score = 28.5 bits (63), Expect = 3.1
Identities = 8/27 (29%), Positives = 18/27 (66%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
++++ ++ E K+ KKK E++K+ R
Sbjct: 86 EERRHRQRLEKDKREKKKREKEKRGRR 112
>gnl|CDD|147641 pfam05582, Peptidase_U57, YabG peptidase U57. YabG is a protease
involved in the proteolysis and maturation of SpoIVA and
YrbA proteins, conserved with the cortex and/or coat
assembly by Bacillus subtilis.
Length = 287
Score = 30.2 bits (69), Expect = 0.72
Identities = 10/37 (27%), Positives = 16/37 (43%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
D +K E KK+ ++K E K R + R+
Sbjct: 49 DLEKVDASEVTRYKKRFEEKINESLKLIRQDRKLMRE 85
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 30.8 bits (70), Expect = 0.72
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMR 221
EE+ K+ KKK KKKEEE K++ + R
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 28.9 bits (65), Expect = 2.9
Identities = 11/24 (45%), Positives = 18/24 (75%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
+K K+++KKE KK+ + K EEK +
Sbjct: 743 EKTKKKKKKEKKKEEEYKREEKAR 766
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 30.4 bits (69), Expect = 0.76
Identities = 15/37 (40%), Positives = 19/37 (51%), Gaps = 6/37 (16%)
Query: 190 CDFKKKK------EEEKKEAKKKTKKKEEEKKKKTRM 220
KKK+ E + A+KK K KEE KKKK +
Sbjct: 229 ESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKE 265
Score = 27.7 bits (62), Expect = 4.5
Identities = 11/31 (35%), Positives = 16/31 (51%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGI 224
K KEE KK+ K++K + KK + I
Sbjct: 253 KSKEEIKKKKPKESKGVKALKKVVAKGMKKI 283
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 29.2 bits (66), Expect = 0.77
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 10/37 (27%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEE----------EKKKK 217
+ K EEEKKE +KK KKK+E EKKK+
Sbjct: 69 EAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKE 105
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain. This
presumed domain is found at the N-terminus of Ribosomal
L30 proteins and has been termed RL30NT or NUC018.
Length = 71
Score = 28.3 bits (64), Expect = 0.78
Identities = 7/27 (25%), Positives = 16/27 (59%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
K+K+ E+ + + K + ++ +KK R
Sbjct: 5 KRKRNEKLRAKRAKKRAAKKAARKKKR 31
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 27.7 bits (62), Expect = 0.82
Identities = 7/27 (25%), Positives = 21/27 (77%)
Query: 189 ICDFKKKKEEEKKEAKKKTKKKEEEKK 215
+C+ +K+++EE+ E +++ +++ E K+
Sbjct: 20 LCEEEKREDEEENEDEEEGEEQSEVKR 46
>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7. TMA7 plays
a role in protein translation. Deletions of the TMA7
gene results in altered protein synthesis rates.
Length = 63
Score = 27.9 bits (62), Expect = 0.83
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K+KKE ++++ K K+KEE K K
Sbjct: 16 KQKKELDEEDLAFKQKQKEEAKALK 40
>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
Length = 763
Score = 30.2 bits (68), Expect = 0.86
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
+K + E + EA K KK ++ KKKK +G
Sbjct: 233 IEKPQAEAENEAGKSDKKDKKSKKKKVLGKG 263
>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain. FH proteins control
rearrangements of the actin cytoskeleton, especially in
the context of cytokinesis and cell polarisation.
Members of this family have been found to interact with
Rho-GTPases, profilin and other actin-assoziated
proteins. These interactions are mediated by the
proline-rich FH1 domain, usually located in front of FH2
(but not listed in SMART). Despite this cytosolic
function, vertebrate formins have been assigned
functions within the nucleus. A set of Formin-Binding
Proteins (FBPs) has been shown to bind FH1 with their WW
domain.
Length = 392
Score = 30.0 bits (68), Expect = 0.95
Identities = 9/36 (25%), Positives = 19/36 (52%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
F K EE K+ +++ +++++ K+ T RQ
Sbjct: 330 FSKAAEENIKKEEEEEERRKKLVKETTEYEQSSSRQ 365
>gnl|CDD|216941 pfam02233, PNTB, NAD(P) transhydrogenase beta subunit. This family
corresponds to the beta subunit of NADP transhydrogenase
in prokaryotes, and either the protein N- or C terminal
in eukaryotes. The domain is often found in conjunction
with pfam01262. Pyridine nucleotide transhydrogenase
catalyzes the reduction of NAD+ to NADPH. A complete
loss of activity occurs upon mutation of Gly314 in E.
coli.
Length = 464
Score = 30.0 bits (68), Expect = 0.96
Identities = 12/41 (29%), Positives = 28/41 (68%), Gaps = 1/41 (2%)
Query: 20 STSPDENVSSIIWMTLRASIMIFIIVASIGG-NLLVIVSVM 59
S SP + +++ MT + ++ + +VA+IGG ++ V++S++
Sbjct: 181 SESPGTGMLALLLMTAISLVLGWHLVAAIGGADMPVVISML 221
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 28.8 bits (65), Expect = 1.0
Identities = 12/24 (50%), Positives = 16/24 (66%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
KK EEE K ++ K++E EKKK
Sbjct: 103 KKLDEEEAKRQHEEAKEREREKKK 126
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 29.2 bits (66), Expect = 1.0
Identities = 6/29 (20%), Positives = 18/29 (62%), Gaps = 1/29 (3%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKT-RMR 221
+ E+E + K++ +++ E K+K+ + +
Sbjct: 122 ELLEKELAKLKREKRRENERKQKEILKEQ 150
Score = 27.3 bits (61), Expect = 4.5
Identities = 8/26 (30%), Positives = 16/26 (61%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K K E+++E ++K K+ +E+ K
Sbjct: 129 AKLKREKRRENERKQKEILKEQMKML 154
Score = 26.5 bits (59), Expect = 8.8
Identities = 8/34 (23%), Positives = 19/34 (55%), Gaps = 3/34 (8%)
Query: 191 DFKKKKEEEKKE---AKKKTKKKEEEKKKKTRMR 221
+ ++ EEE+ + K+ K K E++++ R +
Sbjct: 110 EVEELDEEEQIDELLEKELAKLKREKRRENERKQ 143
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 30.0 bits (68), Expect = 1.0
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
K++KE K++ + + KK+EEKKKK +
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 28.8 bits (65), Expect = 2.5
Identities = 8/29 (27%), Positives = 20/29 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
+K+ +E+K +K K++E++KK+ ++
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKKELEKLE 587
Score = 28.8 bits (65), Expect = 2.6
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K++K K + +++ KKKE EK +K
Sbjct: 564 KEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 27.7 bits (62), Expect = 5.6
Identities = 11/26 (42%), Positives = 19/26 (73%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
+K+++E KE K+ K K++E+KKK
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKK 581
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 29.7 bits (67), Expect = 1.1
Identities = 10/40 (25%), Positives = 24/40 (60%)
Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
++ F + + +EK+ +++ KK+ E++KKK + +Q
Sbjct: 273 SLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQ 312
Score = 27.4 bits (61), Expect = 5.7
Identities = 10/26 (38%), Positives = 16/26 (61%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKK 217
+ K EEE + +KK ++ E +KK K
Sbjct: 25 RQSKLEEELNKLEKKEEEAECKKKFK 50
>gnl|CDD|234582 PRK00028, infC, translation initiation factor IF-3; Reviewed.
Length = 177
Score = 29.4 bits (67), Expect = 1.1
Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 13/45 (28%)
Query: 181 KYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
K++Y+ +KKK+ KEAKKK K E K+ + R I+
Sbjct: 70 KFKYE-------QKKKQ---KEAKKKQKVIE---VKEIKFRPKID 101
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 29.1 bits (65), Expect = 1.1
Identities = 10/36 (27%), Positives = 22/36 (61%), Gaps = 5/36 (13%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK-----TRMRGGI 224
++++EE +EA ++ +++ + K +K MR GI
Sbjct: 33 EEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGI 68
>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal. This domain
corresponds to the alpha helical C terminal domain of
phospholipase C beta.
Length = 181
Score = 29.4 bits (66), Expect = 1.2
Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 5/40 (12%)
Query: 192 FKKKKE---EEKKEAKKKTKKKEEEK--KKKTRMRGGIER 226
KK KE +EKKE KKK +K E+ + KT+ + ER
Sbjct: 54 LKKLKEISEKEKKELKKKLDRKRLERIQEAKTKEKHAQER 93
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 28.0 bits (63), Expect = 1.2
Identities = 6/20 (30%), Positives = 11/20 (55%)
Query: 194 KKKEEEKKEAKKKTKKKEEE 213
+E KK+ +++EEE
Sbjct: 60 AAAAAAAEEEKKEEEEEEEE 79
>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
Length = 103
Score = 28.2 bits (63), Expect = 1.3
Identities = 15/33 (45%), Positives = 19/33 (57%), Gaps = 3/33 (9%)
Query: 193 KKKKEEEKK---EAKKKTKKKEEEKKKKTRMRG 222
+K KEE KK +AK++ KK EEE KK
Sbjct: 20 EKAKEEAKKIIRKAKEEAKKIEEEIIKKAEEEA 52
Score = 26.3 bits (58), Expect = 5.9
Identities = 14/32 (43%), Positives = 19/32 (59%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
KK EEE ++ +K KK+ EE+ KK G E
Sbjct: 46 KKAEEEAQKLIEKKKKEGEEEAKKILEEGEKE 77
>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein. This is a novel repeat
in Pneumocystis carinii Major surface glycoprotein (MSG)
some members of the alignment have up to nine repeats of
this family, the repeats containing several conserved
cysteines. The MSG of P. carinii is an important protein
in host-pathogen interactions. Surface glycoprotein A
from Pneumocystis carinii is a main target for the host
immune system, this protein is implicated in the
attachment of Pneumocystis carinii to the host alveolar
epithelial cells, alveolar macrophages, host surfactant
and possibly accounts in part for the hypoxia seen in
Pneumocystis carinii pneumonia (PCP).
Length = 81
Score = 27.8 bits (62), Expect = 1.3
Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 4/34 (11%)
Query: 188 NICDFKKKKEEEK----KEAKKKTKKKEEEKKKK 217
C KK + + KE K + KKK++ KK
Sbjct: 35 EKCYKLKKDLKLEELLLKELKGELKKKKKCKKAL 68
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region. This
domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 27.9 bits (63), Expect = 1.4
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
KK+K+E K E+K+ KK +++K R
Sbjct: 63 KKEKKEVKAESKRYNAKKLTLEQRKAR 89
>gnl|CDD|184861 PRK14859, tatA, twin arginine translocase protein A; Provisional.
Length = 63
Score = 27.4 bits (61), Expect = 1.4
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
KK EK+E + K KKE++KKKK
Sbjct: 40 KKATSEKEEIEIKPTKKEDKKKKK 63
Score = 26.2 bits (58), Expect = 3.1
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 189 ICDFKKKKEEEKKEAKKKTKKKEEEKKK 216
I +FKK E+++ K TKK++++KKK
Sbjct: 36 IKNFKKATSEKEEIEIKPTKKEDKKKKK 63
>gnl|CDD|220624 pfam10187, Nefa_Nip30_N, N-terminal domain of NEFA-interacting
nuclear protein NIP30. This is a the N-terminal 100
amino acids of a family of proteins conserved from
plants to humans. The full-length protein has putatively
been called NEFA-interacting nuclear protein NIP30,
however no reference could be found to confirm this.
Length = 99
Score = 28.1 bits (63), Expect = 1.5
Identities = 10/29 (34%), Positives = 20/29 (68%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+F + EE ++ A+KK K++E E+ ++ R
Sbjct: 71 EFLDEVEESRRAAEKKRKREEAEELEEFR 99
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 29.1 bits (65), Expect = 1.5
Identities = 9/28 (32%), Positives = 16/28 (57%)
Query: 196 KEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
K++EK+ + K KE+ +KK+ R
Sbjct: 165 KQKEKRRVEDSQKHKEDRRKKQEEKRRN 192
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 29.5 bits (66), Expect = 1.5
Identities = 9/24 (37%), Positives = 16/24 (66%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
K+++E+ +K A ++ KK E K K
Sbjct: 132 KQQEEQARKAAAEQKKKAEAAKAK 155
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 28.0 bits (63), Expect = 1.6
Identities = 7/20 (35%), Positives = 14/20 (70%)
Query: 194 KKKEEEKKEAKKKTKKKEEE 213
EE+K+E +++ +K+E E
Sbjct: 75 AAAEEKKEEEEEEEEKEESE 94
Score = 28.0 bits (63), Expect = 1.9
Identities = 6/22 (27%), Positives = 14/22 (63%)
Query: 197 EEEKKEAKKKTKKKEEEKKKKT 218
+KK +++EEE+K+++
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEES 93
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many hypothetical
proteins.
Length = 98
Score = 27.8 bits (62), Expect = 1.6
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
KKKK+ K A+ K KK ++K KK
Sbjct: 25 KKKKKRTAKTARPKATKKGQKKDKKKD 51
>gnl|CDD|218270 pfam04795, PAPA-1, PAPA-1-like conserved region. Family of
proteins with a conserved region found in PAPA-1, a
PAP-1 binding protein.
Length = 89
Score = 27.7 bits (62), Expect = 1.6
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIER 226
K+ EEEK E K KK+ +KK+ +
Sbjct: 19 KRAEEEKMETINKLLKKQARRKKREDKIKKGDE 51
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 28.7 bits (64), Expect = 1.7
Identities = 15/36 (41%), Positives = 22/36 (61%), Gaps = 2/36 (5%)
Query: 193 KKKKEEEKKEAKKKTK--KKEEEKKKKTRMRGGIER 226
KKKK +EKKE K+ K ++E+E K+ + IE
Sbjct: 70 KKKKLDEKKEIAKQRKREQREKELAKRQKELEKIEL 105
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 27.3 bits (61), Expect = 1.9
Identities = 11/28 (39%), Positives = 13/28 (46%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
KKKK+ E A K KK E + M
Sbjct: 29 KKKKKAEDTAATAKAKKATAEDVSEGAM 56
Score = 26.9 bits (60), Expect = 2.4
Identities = 10/24 (41%), Positives = 13/24 (54%)
Query: 195 KKEEEKKEAKKKTKKKEEEKKKKT 218
K E + + KKK KKK+ E T
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAAT 40
Score = 26.6 bits (59), Expect = 4.2
Identities = 11/28 (39%), Positives = 13/28 (46%), Gaps = 3/28 (10%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKT 218
D KKK KK+ KKK + K K
Sbjct: 21 DLVKKK---KKKKKKKAEDTAATAKAKK 45
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 29.3 bits (66), Expect = 1.9
Identities = 7/21 (33%), Positives = 14/21 (66%)
Query: 193 KKKKEEEKKEAKKKTKKKEEE 213
K++KE E++ ++K KK+
Sbjct: 287 KRRKELEREAKEEKQLKKKLA 307
Score = 28.5 bits (64), Expect = 3.2
Identities = 8/30 (26%), Positives = 18/30 (60%), Gaps = 3/30 (10%)
Query: 193 KKKKEEEK---KEAKKKTKKKEEEKKKKTR 219
K+K + ++ K K+ ++ +EEK+ K +
Sbjct: 276 KRKTKAQRNKEKRRKELEREAKEEKQLKKK 305
Score = 28.5 bits (64), Expect = 3.3
Identities = 7/29 (24%), Positives = 19/29 (65%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
+K K + KE ++K ++E +++K+ + +
Sbjct: 277 RKTKAQRNKEKRRKELEREAKEEKQLKKK 305
Score = 28.1 bits (63), Expect = 4.1
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
K+ ++EK A+KK ++KE +KKK + R
Sbjct: 317 KEVAQKEKARARKKEQRKERGEKKKLKRR 345
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 29.2 bits (65), Expect = 1.9
Identities = 15/45 (33%), Positives = 28/45 (62%), Gaps = 3/45 (6%)
Query: 178 EHLKYRYQHPN--ICDFKKKKEEEKKEAKKKT-KKKEEEKKKKTR 219
E LK + Q + + KKK+EE +K +++ ++K+EE +K+R
Sbjct: 203 EKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSR 247
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 29.0 bits (65), Expect = 2.1
Identities = 14/33 (42%), Positives = 19/33 (57%), Gaps = 3/33 (9%)
Query: 192 FKKKKEEEKK---EAKKKTKKKEEEKKKKTRMR 221
F KKK +EKK EA + T K+E E++ R
Sbjct: 221 FFKKKTKEKKEKKEASESTVKEESEEESGKRDV 253
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 28.5 bits (64), Expect = 2.2
Identities = 12/39 (30%), Positives = 24/39 (61%), Gaps = 4/39 (10%)
Query: 193 KKKKEEEKKEAKKKTKKKE----EEKKKKTRMRGGIERQ 227
KK+ + E+K+A+++ ++ E EE+KK R G ++
Sbjct: 6 KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKE 44
Score = 27.7 bits (62), Expect = 3.6
Identities = 10/25 (40%), Positives = 20/25 (80%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+KK EE+++ +K+ ++ EEE++KK
Sbjct: 31 RKKLEEKREGERKEEEELEEEREKK 55
Score = 27.7 bits (62), Expect = 4.4
Identities = 10/24 (41%), Positives = 20/24 (83%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
KKKEEE+++ +++ +KE+E+ +K
Sbjct: 54 KKKEEEERKEREEQARKEQEEYEK 77
>gnl|CDD|107372 cd06377, PBP1_iGluR_NMDA_NR3, N-terminal
leucine/isoleucine/valine-binding protein (LIVBP)-like
domain of the NR3 subunit of NMDA receptor family.
N-terminal leucine/isoleucine/valine-binding protein
(LIVBP)-like domain of the NR3 subunit of NMDA receptor
family. The ionotropic N-methyl-d-asparate (NMDA)
subtype of glutamate receptor serves critical functions
in neuronal development, functioning, and degeneration
in the mammalian central nervous system. The functional
NMDA receptor is a heterotetramer composed of two NR1
and two NR2 (A, B, C, and D) or of NR3 (A and B)
subunits. The receptor controls a cation channel that is
highly permeable to monovalent ions and calcium and
exhibits voltage-dependent inhibition by magnesium. Dual
agonists, glutamate and glycine, are required for
efficient activation of the NMDA receptor. Among NMDA
receptor subtypes, the NR2B subunit containing receptors
appear particularly important for pain perception; thus
NR2B-selective antagonists may be useful in the
treatment of chronic pain.
Length = 382
Score = 29.0 bits (65), Expect = 2.3
Identities = 12/56 (21%), Positives = 27/56 (48%), Gaps = 1/56 (1%)
Query: 55 IVSVMRHRKLRIITNYFVVSLAFADMLVAMVAMTFNASVQVTGKWNFSFFMCDVWN 110
+VS++R R N F + +++A L ++ + + +Q G + S +C +
Sbjct: 109 VVSIVRREFPRGSQNPFHLQMSWASPLSTLLDVLL-SVLQRNGWEDVSLVLCRERD 163
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 28.9 bits (65), Expect = 2.3
Identities = 8/24 (33%), Positives = 16/24 (66%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
KK+++++K ++ K E KK+K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEK 650
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 27.6 bits (61), Expect = 2.3
Identities = 15/35 (42%), Positives = 18/35 (51%)
Query: 187 PNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
P K+KK E+ K K KKK +KKKK R
Sbjct: 3 PKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEAR 37
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 28.9 bits (65), Expect = 2.3
Identities = 13/25 (52%), Positives = 22/25 (88%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
K KKE++ E+KKK++KK+++KK+K
Sbjct: 56 KDKKEDKNNESKKKSEKKKKKKKEK 80
Score = 28.5 bits (64), Expect = 3.1
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
KKKK++EKKE K + + K K K
Sbjct: 73 KKKKKKEKKEPKSEGETKLGFKTPKK 98
Score = 27.8 bits (62), Expect = 5.9
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K E +KK KKK KKKE+++ K
Sbjct: 61 DKNNESKKKSEKKKKKKKEKKEPKSE 86
>gnl|CDD|153288 cd07604, BAR_ASAPs, The Bin/Amphiphysin/Rvs (BAR) domain of ArfGAP
with SH3 domain, ANK repeat and PH domain containing
proteins. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions. This subfamily is
composed of ASAPs (ArfGAP with SH3 domain, ANK repeat
and PH domain containing proteins), which are Arf GTPase
activating proteins (GAPs) with similarity to ACAPs
(ArfGAP with Coiled-coil, ANK repeat and PH domain
containing proteins) in that they contain an N-terminal
BAR domain, followed by a Pleckstrin homology (PH)
domain, an Arf GAP domain, and ankyrin (ANK) repeats.
However, ASAPs contain an additional C-terminal SH3
domain. ASAPs function in regulating cell growth,
migration, and invasion. Vertebrates contain at least
three members, ASAP1, ASAP2, and ASAP3. ASAP1 and ASAP2
shows GTPase activating protein (GAP) activity towards
Arf1 and Arf5. They do not show GAP activity towards
Arf6, but is able to mediate Arf6 signaling by binding
stably to GTP-Arf6. ASAP3 is an Arf6-specific GAP. BAR
domains form dimers that bind to membranes, induce
membrane bending and curvature, and may also be involved
in protein-protein interactions. The BAR domain of ASAP1
mediates membrane bending, is essential for function,
and autoinhibits GAP activity by interacting with the PH
and/or Arf GAP domains.
Length = 215
Score = 28.5 bits (64), Expect = 2.3
Identities = 12/34 (35%), Positives = 19/34 (55%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGI 224
D KK ++ K+ + K K E+EKK+ + G I
Sbjct: 108 DLKKPFDKAWKDYETKASKIEKEKKQLAKEAGMI 141
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 27.4 bits (61), Expect = 2.4
Identities = 8/31 (25%), Positives = 15/31 (48%)
Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKKT 218
C + KK+++ K + K+ KK +T
Sbjct: 14 KGCKWDKKEDDGKCKPKEGKAKKNGAPVTQT 44
Score = 27.4 bits (61), Expect = 2.5
Identities = 13/38 (34%), Positives = 16/38 (42%), Gaps = 9/38 (23%)
Query: 190 CDFKKKKEEE---------KKEAKKKTKKKEEEKKKKT 218
C+ K K + E KKE K K KE + KK
Sbjct: 1 CNKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNG 38
Score = 26.2 bits (58), Expect = 6.4
Identities = 8/40 (20%), Positives = 15/40 (37%), Gaps = 11/40 (27%)
Query: 190 CDFKKKKEEEK-----------KEAKKKTKKKEEEKKKKT 218
C K+ K ++ +K KK+++ KK
Sbjct: 27 CKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKG 66
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 28.0 bits (63), Expect = 2.4
Identities = 12/41 (29%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 188 NICDFKKKKEEEK-KEAKKKTKKKEEEKKKKTRMRGGIERQ 227
+ K+++ E+K +E KK K+KEEE K+ ++ +
Sbjct: 56 KENEAKRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAE 96
Score = 26.8 bits (60), Expect = 5.3
Identities = 8/32 (25%), Positives = 20/32 (62%)
Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
F K+ E +++ A+KK +++++ +K+K
Sbjct: 50 KFDKFLKENEAKRRRAEKKAEEEKKLRKEKEE 81
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 28.3 bits (63), Expect = 2.5
Identities = 12/36 (33%), Positives = 18/36 (50%)
Query: 190 CDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
DF K ++E +KK +EEK+ K + R E
Sbjct: 84 ADFSLIKNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 27.3 bits (61), Expect = 2.6
Identities = 7/20 (35%), Positives = 17/20 (85%)
Query: 194 KKKEEEKKEAKKKTKKKEEE 213
+++EEE++E +++ ++ EEE
Sbjct: 76 EEEEEEEEEEEEEEEESEEE 95
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 27.6 bits (61), Expect = 2.6
Identities = 10/19 (52%), Positives = 15/19 (78%)
Query: 198 EEKKEAKKKTKKKEEEKKK 216
K EAKK+ KK+EEE+++
Sbjct: 85 GAKAEAKKEEKKEEEEEEE 103
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 27.6 bits (62), Expect = 2.7
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKK 215
+ K EE+++ K+K KKK++E +
Sbjct: 68 RNKAAEERRKLKEKKKKKKKELE 90
Score = 27.6 bits (62), Expect = 3.0
Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 11/40 (27%)
Query: 193 KKKKEEE------KKEAKKKT-----KKKEEEKKKKTRMR 221
KKKKE E +E KK+ KK EE+KK+ +++
Sbjct: 84 KKKKELENFYRFQIREKKKEELAELRKKFEEDKKRIEQLK 123
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 28.6 bits (64), Expect = 2.7
Identities = 8/23 (34%), Positives = 17/23 (73%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKK 215
+KK++E+ + K+ ++K+E KK
Sbjct: 412 NEKKQKEQADEDKEKRQKDERKK 434
Score = 28.2 bits (63), Expect = 3.7
Identities = 6/24 (25%), Positives = 19/24 (79%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
+ +K+++++ + K K++++E+KK
Sbjct: 411 ENEKKQKEQADEDKEKRQKDERKK 434
Score = 27.8 bits (62), Expect = 5.0
Identities = 6/24 (25%), Positives = 17/24 (70%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
++ E+++KE + K+K ++ ++K
Sbjct: 410 EENEKKQKEQADEDKEKRQKDERK 433
>gnl|CDD|164575 CHL00199, infC, translation initiation factor 3; Provisional.
Length = 182
Score = 28.3 bits (63), Expect = 2.7
Identities = 14/37 (37%), Positives = 21/37 (56%)
Query: 189 ICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
I D+ K K ++K AK+ KK+ K+ +MR IE
Sbjct: 70 IIDYGKYKFTQEKRAKEAKKKQHNSSIKEVKMRYKIE 106
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 28.6 bits (64), Expect = 2.8
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+KKE+EKKE K KK+ +K
Sbjct: 161 LRKKEKEKKEQLKIQKKQSLLEKYG 185
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 27.4 bits (61), Expect = 2.9
Identities = 8/30 (26%), Positives = 17/30 (56%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
+ K +KE E++ + K++ K++K R
Sbjct: 49 ELKDEKEAERQRRIQAIKERRAAKEEKERY 78
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 28.5 bits (64), Expect = 3.0
Identities = 8/30 (26%), Positives = 17/30 (56%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
D ++ EE +E + ++ E EK+K + +
Sbjct: 148 DSDEELSEEDEEEAAEEEEAEAEKEKASEL 177
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 27.6 bits (62), Expect = 3.0
Identities = 9/25 (36%), Positives = 15/25 (60%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
++ K K+ K +KE++KKKK
Sbjct: 60 RRPKTARKESVAAKAAEKEKKKKKK 84
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 28.9 bits (65), Expect = 3.0
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
+K E+E +E +K ++ EEE K T G ++
Sbjct: 386 PEKLEKELEELEKAKEEIEEEISKITARIGELK 418
Score = 28.1 bits (63), Expect = 4.9
Identities = 10/23 (43%), Positives = 17/23 (73%)
Query: 195 KKEEEKKEAKKKTKKKEEEKKKK 217
K+ EEK+E ++ KKK +E +K+
Sbjct: 331 KELEEKEERLEELKKKLKELEKR 353
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
MW N terminal. This domain is found in eukaryotes. This
domain is about 90 amino acids in length. This domain is
found associated with pfam00076. This domain is part of
U1 snRNP, which is the pre-mRNA binding protein of the
penta-snRNP spliceosome complex. It extends over a
distance of 180 A from its RNA binding domain, wraps
around the core domain of U1 snRNP consisting of the
seven Sm proteins and finally contacts U1-C, which is
crucial for 5'-splice-site recognition.
Length = 94
Score = 26.8 bits (60), Expect = 3.2
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 195 KKEEEKKEAKKKTKKKEEEKKKK 217
K+E EK+E K+K +KK EE+ K+
Sbjct: 65 KREREKREKKEKLEKKLEEELKE 87
>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
Length = 111
Score = 27.2 bits (60), Expect = 3.3
Identities = 15/30 (50%), Positives = 18/30 (60%), Gaps = 4/30 (13%)
Query: 189 ICDFKKKKEEEKKEAKKKTKKKEEEKKKKT 218
I + K ++E E KEA K K EEK KKT
Sbjct: 26 IANIKNEQENEIKEA----KSKYEEKAKKT 51
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 28.5 bits (64), Expect = 3.4
Identities = 9/31 (29%), Positives = 20/31 (64%)
Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
+E K+EA++K ++ EE ++ + +E+Q
Sbjct: 171 KERKEEAERKLERTEENLERLEDLLEELEKQ 201
Score = 27.4 bits (61), Expect = 9.3
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
+K EEE K K + + E+ ++ R +ERQ
Sbjct: 683 LEKLEEELKSLKNELRSLEDLLEELRRQLEELERQ 717
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 27.8 bits (62), Expect = 3.4
Identities = 9/26 (34%), Positives = 15/26 (57%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
K+KK+ E + K + +K + E KK
Sbjct: 122 KEKKKREVPKPKTEKEKPKTEPKKPK 147
Score = 27.0 bits (60), Expect = 5.0
Identities = 8/27 (29%), Positives = 14/27 (51%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+++ E K K+ KK+E K K +
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEK 136
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 28.6 bits (63), Expect = 3.4
Identities = 7/27 (25%), Positives = 19/27 (70%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
K++ E ++ +++ KKE+E++++ R
Sbjct: 1032 KERMESLQRAKEEEIGKKEKEREQRIR 1058
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 28.3 bits (63), Expect = 3.6
Identities = 11/34 (32%), Positives = 17/34 (50%), Gaps = 1/34 (2%)
Query: 186 HPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
H + KK + EK EA K KK+ +K++
Sbjct: 326 HAENAEIKKTRTAEKNEA-KARKKEIAQKRRAAE 358
>gnl|CDD|223866 COG0795, COG0795, Predicted permeases [General function prediction
only].
Length = 364
Score = 28.1 bits (63), Expect = 3.8
Identities = 11/53 (20%), Positives = 17/53 (32%)
Query: 27 VSSIIWMTLRASIMIFIIVASIGGNLLVIVSVMRHRKLRIITNYFVVSLAFAD 79
V W +IFI GN L V + R ++ + A +
Sbjct: 147 VEGGFWAKDGNDSLIFIESIDRDGNKLQGVFIKRFDDNGRLSVVYAAEGALFE 199
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 27.6 bits (62), Expect = 3.9
Identities = 14/32 (43%), Positives = 18/32 (56%)
Query: 190 CDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
D KKK+ +K+E KK KKK E K +R
Sbjct: 119 IDLGKKKKYKKEEENKKAKKKSESLVAKLEVR 150
>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 158 and 187 amino
acids in length. This family includes the PcfB protein.
Length = 137
Score = 27.2 bits (61), Expect = 4.0
Identities = 8/27 (29%), Positives = 18/27 (66%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+KK ++ K+E K K+++++ K K +
Sbjct: 111 RKKLKKFKEEQAAKKKERKDKVKNKKK 137
>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
Endonuclease that cleaves the 5'-overhanging flap
structure that is generated by displacement synthesis
when DNA polymerase encounters the 5'-end of a
downstream Okazaki fragment. Has 5'-endo-/exonuclease
and 5'-pseudo-Y-endonuclease activities. Cleaves the
junction between single and double-stranded regions of
flap DNA.
Length = 338
Score = 28.0 bits (63), Expect = 4.0
Identities = 8/27 (29%), Positives = 18/27 (66%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKK 217
+ K + EE++E +++ ++K EE +K
Sbjct: 85 ELKAETLEERREIREEAEEKWEEALEK 111
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 27.8 bits (62), Expect = 4.0
Identities = 11/27 (40%), Positives = 14/27 (51%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
K+K + EKK K K K K + K K
Sbjct: 85 KEKPKPEKKPKKPKPKPKPKPKPKPKV 111
>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
Length = 112
Score = 27.3 bits (59), Expect = 4.1
Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 6/43 (13%)
Query: 175 TTSEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
TT EH+K + +KK E+ + E +KK +K E E +KK
Sbjct: 54 TTHEHIKKENED------EKKPEKPENEDEKKPEKPENEDEKK 90
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 26.8 bits (60), Expect = 4.1
Identities = 12/25 (48%), Positives = 14/25 (56%), Gaps = 1/25 (4%)
Query: 196 KEEEKKEAKKKTKKK-EEEKKKKTR 219
KEE +K+ KK KK E KKK
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKVF 93
>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020). This
family of fungal proteins is conserved towards the
C-terminus of HMG domains. The function is not known.
Length = 49
Score = 25.6 bits (56), Expect = 4.2
Identities = 10/30 (33%), Positives = 12/30 (40%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
KKK E E K + KKK + G
Sbjct: 5 KKKWREANSERNKDNDLRSRVKKKAKKKFG 34
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for centrosome
separation and maintenance of spindle bi-polarity. TPX2
is a microtubule-associated protein that mediates the
binding of the C-terminal domain of Xklp2 to
microtubules. It is phosphorylated during mitosis in a
microtubule-dependent way.
Length = 57
Score = 25.8 bits (57), Expect = 4.2
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTR 219
K E EK+EA+ + K++EEE K+ R
Sbjct: 22 KALEAEKEEAEARQKEEEEEAIKQLR 47
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 28.0 bits (63), Expect = 4.3
Identities = 8/26 (30%), Positives = 18/26 (69%), Gaps = 1/26 (3%)
Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMRG 222
EEE+ +A + +++E ++K++ R G
Sbjct: 48 EEERLKALAEEEERERKRKEE-RREG 72
Score = 27.2 bits (61), Expect = 7.5
Identities = 7/28 (25%), Positives = 22/28 (78%)
Query: 189 ICDFKKKKEEEKKEAKKKTKKKEEEKKK 216
I +++++K E ++E + + ++++EEK++
Sbjct: 155 ILEYQREKAEREEEREAERRERKEEKER 182
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is a
small protein module with a triple-stranded beta-sheet
fold. This is a family of WW domain binding proteins.
Length = 78
Score = 26.1 bits (58), Expect = 4.7
Identities = 8/23 (34%), Positives = 14/23 (60%)
Query: 197 EEEKKEAKKKTKKKEEEKKKKTR 219
+ +KE KKK KK + +++ R
Sbjct: 8 DAYRKEQKKKELKKNKAERQARR 30
>gnl|CDD|237582 PRK14011, PRK14011, prefoldin subunit alpha; Provisional.
Length = 144
Score = 27.0 bits (60), Expect = 4.8
Identities = 17/39 (43%), Positives = 24/39 (61%)
Query: 189 ICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
I DFKK EE K K+ KK EE K+ T++R +E++
Sbjct: 90 IEDFKKSVEELDKTKKEGNKKIEELNKEITKLRKELEKR 128
>gnl|CDD|223367 COG0290, InfC, Translation initiation factor 3 (IF-3) [Translation,
ribosomal structure and biogenesis].
Length = 176
Score = 27.2 bits (61), Expect = 4.8
Identities = 15/45 (33%), Positives = 25/45 (55%), Gaps = 13/45 (28%)
Query: 181 KYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
K++Y+ +KKK KEAKKK K+ + K+ ++R I+
Sbjct: 69 KFKYE-------QKKKA---KEAKKKQKQIQ---VKEIKLRPKID 100
>gnl|CDD|204055 pfam08764, Coagulase, Staphylococcus aureus coagulase.
Staphylococcus aureus secretes a cofactor called
coagulase. Coagulase is an extracellular protein that
forms a complex with human prothrombin, and activates it
without the usual proteolytic cleavages. The resulting
complex directly initiates blood clotting.
Length = 282
Score = 27.8 bits (62), Expect = 4.8
Identities = 12/38 (31%), Positives = 18/38 (47%), Gaps = 6/38 (15%)
Query: 187 PNICDFKKKKEEEK------KEAKKKTKKKEEEKKKKT 218
P D+K K E ++ KE ++ K +E K KT
Sbjct: 240 PTKHDYKNKSENKENFDALVKETREAVAKADESWKTKT 277
>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated. This family of
proteins is associated with U3 snoRNA. U3 snoRNA is
required for nucleolar processing of pre-18S ribosomal
RNA.
Length = 87
Score = 26.2 bits (58), Expect = 4.9
Identities = 10/26 (38%), Positives = 15/26 (57%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKK 217
+ EEE K+ KKK K + +K +K
Sbjct: 20 PPESYEEELKKQKKKNKLRFLKKLEK 45
>gnl|CDD|234900 PRK01109, PRK01109, ATP-dependent DNA ligase; Provisional.
Length = 590
Score = 28.0 bits (63), Expect = 5.0
Identities = 12/35 (34%), Positives = 19/35 (54%), Gaps = 1/35 (2%)
Query: 130 DRYYAIVKPLKYPINMTKRVVAFMLLNVWVSPAVI 164
D ++KP K + RVV+ M +VWV P ++
Sbjct: 490 DELPKMLKPYKID-HKHPRVVSKMEPDVWVEPKLV 523
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 27.6 bits (62), Expect = 5.1
Identities = 5/27 (18%), Positives = 14/27 (51%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
KK+E ++ K ++++ + K +
Sbjct: 119 AKKREAAGEKEKAPRRERKPKPKAPRK 145
Score = 26.8 bits (60), Expect = 8.6
Identities = 5/27 (18%), Positives = 15/27 (55%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
+ +++ +K+EA + +K ++K
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKP 140
>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777). This is
a family of eukaryotic proteins of unknown function.
Some of the proteins in this family are putative nucleic
acid binding proteins.
Length = 158
Score = 27.1 bits (60), Expect = 5.4
Identities = 8/47 (17%), Positives = 25/47 (53%)
Query: 181 KYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
++R + + + +K E K+A++ K++ ++ K+ + G + +
Sbjct: 58 RHRSRSRSPSRRRDRKRERDKDAREPKKRERQKLIKEEDLEGKSDEE 104
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 27.9 bits (62), Expect = 5.7
Identities = 13/34 (38%), Positives = 18/34 (52%), Gaps = 3/34 (8%)
Query: 193 KKKKEE---EKKEAKKKTKKKEEEKKKKTRMRGG 223
KKKKEE EK+ K K +KE + K + +
Sbjct: 21 KKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASD 54
Score = 27.2 bits (60), Expect = 9.6
Identities = 11/29 (37%), Positives = 20/29 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
+KKK+EEK + K+ K K +K+ K +++
Sbjct: 20 RKKKKEEKAKEKELKKLKAAQKEAKAKLQ 48
>gnl|CDD|216137 pfam00822, PMP22_Claudin, PMP-22/EMP/MP20/Claudin family.
Length = 161
Score = 26.9 bits (60), Expect = 5.7
Identities = 16/94 (17%), Positives = 32/94 (34%), Gaps = 10/94 (10%)
Query: 36 RASIMIFIIVASIGGNLLVIVSVMRHRKLRIITNYFVVSLAFADMLVAMVAMTFNASVQV 95
+++ IV+ I +L+ V+ I + + VS A + +
Sbjct: 1 MLVLLLGFIVSHIAWVILLFVAT--------IPDQWKVSRYVGAAASAGLWRNC-TTQSC 51
Query: 96 TGKWNFSFFMCDVWNSLDVYFSTASILHLCCISV 129
TG+ + + SI+ L IS+
Sbjct: 52 TGQISCKVLELNDALQAVQALMILSII-LGIISL 84
>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
Length = 292
Score = 27.5 bits (62), Expect = 5.8
Identities = 8/27 (29%), Positives = 20/27 (74%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKK 217
+ K ++ EE++E +++ ++K EE K++
Sbjct: 38 ELKAEEIEERREVREEAEEKYEEAKEE 64
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 27.1 bits (60), Expect = 5.8
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 196 KEEEKKEAKKKTKKKEEEKKKKTR 219
K+++ K K K K K++E KKK R
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 26.7 bits (59), Expect = 8.5
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 187 PNICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
P +KKK+ + K+ K K KK E++KK +
Sbjct: 84 PATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 26.4 bits (59), Expect = 5.9
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
+K K ++ + K K+KE++KKKK
Sbjct: 61 RKGKTAREEAVEAKAKEKEKKKKKK 85
Score = 26.1 bits (58), Expect = 9.6
Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 10/35 (28%)
Query: 193 KKKKEEEKKEAKKKTKKKEE----------EKKKK 217
++ E + KE +KK KKK+E EKKK+
Sbjct: 68 EEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKE 102
>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated 2).
NSA2 appears to be a protein required for the
maturation of 27S pre-rRNA in yeast; it has been
characterized in mammalian cells as a nucleolar protein
that might play a role in the regulation of the cell
cycle and in cell proliferation.
Length = 257
Score = 27.2 bits (61), Expect = 6.0
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKT 218
+ KK+ +EK + KK K EE K+
Sbjct: 47 YNKKRYKEKIQMKKTIKMHEERNVKQK 73
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 27.3 bits (61), Expect = 6.1
Identities = 9/24 (37%), Positives = 14/24 (58%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
+ K + K + K K KK EE+ K+
Sbjct: 96 EPKPKPKPKPKPKPVKKVEEQPKR 119
>gnl|CDD|220397 pfam09781, NDUF_B5, NADH:ubiquinone oxidoreductase, NDUFB5/SGDH
subunit. Members of this family mediate the transfer of
electrons from NADH to the respiratory chain. The
immediate electron acceptor for the enzyme is believed
to be ubiquinone, the reaction that occurs being: NADH +
ubiquinone = NAD(+) + ubiquinol.
Length = 182
Score = 27.1 bits (60), Expect = 6.1
Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 21/89 (23%)
Query: 149 VVAFMLL-NVWVSPAVISFVPIMCGWYTTSEHLKYRYQHP-------NICDFKKKKEEEK 200
V AF+ NV++ PA ++ +P G+ EH +Y Y+HP + D +KE EK
Sbjct: 75 VAAFITYVNVFIGPAELAEIPE--GY--EPEHWEY-YKHPITRWIARYVYD-SPQKEYEK 128
Query: 201 KEA-------KKKTKKKEEEKKKKTRMRG 222
A K + + E E ++ R RG
Sbjct: 129 MLALIQEENEKAEIRLLELEVRRLMRERG 157
>gnl|CDD|227474 COG5145, RAD14, DNA excision repair protein [DNA replication,
recombination, and repair].
Length = 292
Score = 27.3 bits (60), Expect = 6.2
Identities = 12/28 (42%), Positives = 21/28 (75%), Gaps = 1/28 (3%)
Query: 193 KKKKEEEKKEAK-KKTKKKEEEKKKKTR 219
K+++E+ K + K KK +KK +E ++KTR
Sbjct: 218 KQRREKMKDDRKEKKLEKKIKELRRKTR 245
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 27.0 bits (60), Expect = 6.2
Identities = 11/30 (36%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
Query: 193 KKKKEEEKKE-AKKKTKKKEEEKKKKTRMR 221
KK KE+ +++ K K ++ E+EK+K+ R
Sbjct: 66 KKAKEKLRRDKLKAKKEEAEKEKEKEERFM 95
>gnl|CDD|226640 COG4171, SapC, ABC-type antimicrobial peptide transport system,
permease component [Defense mechanisms].
Length = 296
Score = 27.4 bits (61), Expect = 6.2
Identities = 8/56 (14%), Positives = 25/56 (44%)
Query: 34 TLRASIMIFIIVASIGGNLLVIVSVMRHRKLRIITNYFVVSLAFADMLVAMVAMTF 89
T+ ++++ + GG L + + ++ + L+ +L+A++ + F
Sbjct: 98 TVGGALLVTLAATICGGVLGIFAGATHGLRSAVLNHILDTLLSIPSLLLAIIVVAF 153
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 25.6 bits (57), Expect = 6.3
Identities = 7/31 (22%), Positives = 18/31 (58%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
+ ++++ EK K+K KK K+++ ++
Sbjct: 31 ELRRREFYEKPSEKRKRKKAAARKRRRKKLA 61
>gnl|CDD|211392 cd11380, Ribosomal_S8e_like, Eukaryotic/archaeal ribosomal protein
S8e and similar proteins. This family contains the
eukaryotic/archaeal ribosomal protein S8, a component of
the small ribosomal subunits, as well as the NSA2 gene
product.
Length = 138
Score = 26.7 bits (60), Expect = 6.3
Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 12/46 (26%)
Query: 193 KKKKEEEKKEAKKKTKKKE------------EEKKKKTRMRGGIER 226
K+K K + +K +K E E +K R+RGG +
Sbjct: 6 KRKATGGKFKVVRKKRKYELGRKPANTKLGGERFTRKVRVRGGNIK 51
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 27.6 bits (62), Expect = 6.3
Identities = 13/37 (35%), Positives = 21/37 (56%), Gaps = 1/37 (2%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKK-TRMRGGIER 226
K+ EE KKEA+ K+ E K++ ++R +ER
Sbjct: 33 LAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELER 69
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 27.5 bits (62), Expect = 6.4
Identities = 9/26 (34%), Positives = 14/26 (53%)
Query: 196 KEEEKKEAKKKTKKKEEEKKKKTRMR 221
K+ E+ K K ++ +KKKK R
Sbjct: 185 KQTEELIKKLLEKPEKPKKKKKRRKG 210
>gnl|CDD|227822 COG5535, RAD4, DNA repair protein RAD4 [DNA replication,
recombination, and repair].
Length = 650
Score = 27.5 bits (61), Expect = 6.4
Identities = 13/53 (24%), Positives = 20/53 (37%), Gaps = 3/53 (5%)
Query: 115 YFSTASILHLCCISVDRYYAIVKPLKYPINMTKRVVAFMLLNVWVSPAVISFV 167
FS S L + + + L+YP + V F V+V P + V
Sbjct: 278 VFSNNSDLD---VLSEGLLEYIDSLEYPGFWGEVVDKFEKKWVFVDPVRLYIV 327
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association with
pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 25.7 bits (57), Expect = 6.7
Identities = 13/41 (31%), Positives = 20/41 (48%), Gaps = 7/41 (17%)
Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKK-------KKTRMR 221
D K + ++ +KK KKK+ +KK KK +MR
Sbjct: 25 EQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMR 65
>gnl|CDD|219160 pfam06749, DUF1218, Protein of unknown function (DUF1218). This
family contains hypothetical plant proteins of unknown
function. Family members contain a number of conserved
cysteine residues.
Length = 97
Score = 26.1 bits (58), Expect = 6.9
Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 2/55 (3%)
Query: 142 PINMTKRVVAFMLLNV-WVSPAVISFVPIMCGWYTTSEHLKYRYQHPNICDFKKK 195
P + KR +A + + W++ VI+F ++ G + H +Y C KK
Sbjct: 28 PKSGRKRALAVVCFVLSWIA-FVIAFALLLAGAARNAYHTRYNVHFRPSCYTLKK 81
>gnl|CDD|224977 COG2066, GlsA, Glutaminase [Amino acid transport and metabolism].
Length = 309
Score = 27.2 bits (61), Expect = 6.9
Identities = 22/98 (22%), Positives = 30/98 (30%), Gaps = 35/98 (35%)
Query: 100 NFSFFMCDVWNSLDVYFSTASILHLCCI--SVDRYYAIVKPL----KYPIN--------M 145
+F DV +LDVYF H C I + + L P+
Sbjct: 178 SFGNLEHDVEEALDVYF------HQCAIEMNCVDLARLGLFLANGGVNPLTGERVVPADQ 231
Query: 146 TKRVVAFMLLNVWVSPAVISFVPIMCGWYTTSEHLKYR 183
+++ A ML CG Y S YR
Sbjct: 232 ARQINALML---------------TCGMYDASGEFAYR 254
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 27.5 bits (61), Expect = 7.1
Identities = 10/26 (38%), Positives = 12/26 (46%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
KKK K KT KK+ + K K
Sbjct: 753 AKKKSASTKGKAAKTVKKKSKAKSKK 778
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease M16C
(pfam05193).
Length = 248
Score = 27.2 bits (61), Expect = 7.3
Identities = 12/30 (40%), Positives = 18/30 (60%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
+K EEE+++ +KK EE K+K RG
Sbjct: 9 EKLNEEEREKLEKKKSSLSEEDKEKIIERG 38
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 27.3 bits (61), Expect = 7.4
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 187 PNICDFKKKKEEEKKEAKKKTKKK 210
P K+K+E +KK+ KKK+KK
Sbjct: 350 PLALSPKRKREGDKKQKKKKSKKL 373
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 27.5 bits (61), Expect = 7.6
Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 1/43 (2%)
Query: 177 SEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
SE L Y + D + EE K+E +KK K K++ +K+K
Sbjct: 329 SEFLGYM-GGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 27.3 bits (61), Expect = 7.7
Identities = 14/34 (41%), Positives = 23/34 (67%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIER 226
+K + E KE KK+ KK+E++KK+ R+ IE+
Sbjct: 321 NEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEK 354
>gnl|CDD|178415 PLN02820, PLN02820, 3-methylcrotonyl-CoA carboxylase, beta chain.
Length = 569
Score = 27.5 bits (61), Expect = 7.8
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
E E K+ + KEEE+ K + ER+
Sbjct: 492 ERENKKRQGIQWSKEEEEAFKAKTVEAYERE 522
>gnl|CDD|218140 pfam04547, Anoctamin, Calcium-activated chloride channel. The
family carries eight putative transmembrane domains,
and, although it has no similarity to other known
channel proteins, it is clearly a calcium-activated
ionic channel. It is expressed in various secretory
epithelia, the retina and sensory neurons, and mediates
receptor-activated chloride currents in diverse
physiological processes.
Length = 449
Score = 27.2 bits (61), Expect = 8.1
Identities = 13/79 (16%), Positives = 29/79 (36%), Gaps = 8/79 (10%)
Query: 59 MRHRKLRIITNYFVVSLAFADMLVAMVAMTFNASVQVTGKWNFSFFMCDVWNSLDVYFST 118
R LR + + VV L ++ ++ + + + + SL+ +
Sbjct: 114 PWKRLLRQLLSIPVVLLFIGLVIGIIIGI-----FILRIFLSEVYSGPSFKQSLESF--L 166
Query: 119 ASILHLCCISV-DRYYAIV 136
+IL+ I + + Y V
Sbjct: 167 PAILNSVIILILNFIYEKV 185
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 26.2 bits (58), Expect = 8.1
Identities = 8/30 (26%), Positives = 20/30 (66%)
Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
KK+ EE ++ +++ K++EE +++ +G
Sbjct: 96 KKQLEENRKLEEEEKEREELEEENDVTKGK 125
>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
Length = 530
Score = 27.4 bits (61), Expect = 8.4
Identities = 8/25 (32%), Positives = 12/25 (48%)
Query: 200 KKEAKKKTKKKEEEKKKKTRMRGGI 224
KK AKK K E+ +++ T
Sbjct: 15 KKAAKKAAAKAEKLRREATAKAAAA 39
>gnl|CDD|201966 pfam01777, Ribosomal_L27e, Ribosomal L27e protein family. The
N-terminal region of the eukaryotic ribosomal L27 has
the KOW motif. C-terminal region is represented by this
family.
Length = 85
Score = 25.7 bits (57), Expect = 8.6
Identities = 11/22 (50%), Positives = 16/22 (72%)
Query: 200 KKEAKKKTKKKEEEKKKKTRMR 221
+KEA+K+ KKK EE+ K + R
Sbjct: 56 RKEARKEVKKKFEERYKTGKNR 77
>gnl|CDD|180505 PRK06278, PRK06278, cobyrinic acid a,c-diamide synthase; Validated.
Length = 476
Score = 27.3 bits (61), Expect = 8.6
Identities = 12/30 (40%), Positives = 18/30 (60%)
Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
N + K KE+EK+E KK K +E+ K +
Sbjct: 194 NFLKYLKIKEDEKEEIFKKNKILKEKLKSR 223
>gnl|CDD|217935 pfam04158, Sof1, Sof1-like domain. Sof1 is essential for cell
growth and is a component of the nucleolar rRNA
processing machinery.
Length = 88
Score = 25.7 bits (57), Expect = 8.8
Identities = 10/27 (37%), Positives = 16/27 (59%), Gaps = 1/27 (3%)
Query: 193 KKKKEE-EKKEAKKKTKKKEEEKKKKT 218
KK+KEE +K +K + + E+KK
Sbjct: 61 KKRKEENRRKHSKPGSVPPKPERKKHV 87
Score = 25.7 bits (57), Expect = 9.0
Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 10/49 (20%)
Query: 178 EHLKYRYQH-PNICDFK---------KKKEEEKKEAKKKTKKKEEEKKK 216
E LK +Y+H P I KK ++ K+E K+ K+KEE ++K
Sbjct: 22 EALKEKYKHMPEIKRIARHRHVPKAIKKAQKIKREMKEAKKRKEENRRK 70
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 26.4 bits (59), Expect = 8.9
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
KKKKE E E KKK EE+K+
Sbjct: 89 KKKKEGEFFEEKKKKYVVSEERKED 113
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region covers
both the Brf homology II and III regions. This region is
involved in binding TATA binding protein.
Length = 95
Score = 25.7 bits (57), Expect = 9.0
Identities = 7/29 (24%), Positives = 18/29 (62%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
+++KE ++K + K+++K+K + R
Sbjct: 32 QEEKELKQKADEGNNSGKKKKKRKAKKKR 60
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 27.1 bits (60), Expect = 9.1
Identities = 14/37 (37%), Positives = 22/37 (59%), Gaps = 4/37 (10%)
Query: 193 KKKKEEEKKEAKKKTKKK-EEEKKKK---TRMRGGIE 225
++K E E++E K++ K K E E+K K + R IE
Sbjct: 9 REKLEREQRERKQRAKLKLERERKAKEEAAKQREAIE 45
>gnl|CDD|153326 cd07642, BAR_ASAP2, The Bin/Amphiphysin/Rvs (BAR) domain of ArfGAP
with SH3 domain, ANK repeat and PH domain containing
protein 2. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions. ASAP2 (ArfGAP with SH3
domain, ANK repeat and PH domain containing protein 2)
is also known as DDEF2 (Development and Differentiation
Enhancing Factor 2), AMAP2, centaurin beta-3, or PAG3.
ASAP2 mediates the functions of Arf GTPases vial dual
mechanisms: it exhibits GTPase activating protein (GAP)
activity towards class I (Arf1) and II (Arf5) Arfs; and
binds class III Arfs (GTP-Arf6) stably without GAP
activity. It binds paxillin and is implicated in Fcgamma
receptor-mediated phagocytosis in macrophages and in
cell migration. ASAP2 contains an N-terminal BAR domain,
followed by a Pleckstrin homology (PH) domain, an Arf
GAP domain, ankyrin (ANK) repeats, and a C-terminal SH3
domain. BAR domains form dimers that bind to membranes,
induce membrane bending and curvature, and may also be
involved in protein-protein interactions. The BAR domain
of the related protein ASAP1 mediates membrane bending,
is essential for function, and autoinhibits GAP activity
by interacting with the PH and/or Arf GAP domains.
Length = 215
Score = 26.9 bits (59), Expect = 9.1
Identities = 13/37 (35%), Positives = 21/37 (56%)
Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
D KK ++ K+ + K K E+EKK+ +M G I +
Sbjct: 108 DLKKPFDKAWKDYETKVTKIEKEKKEHAKMHGMIRTE 144
>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit. This
family includes the vacuolar ATP synthase E subunit, as
well as the archaebacterial ATP synthase E subunit.
Length = 195
Score = 26.5 bits (59), Expect = 9.2
Identities = 12/35 (34%), Positives = 21/35 (60%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIER 226
F+ +K E +EA+KK ++ E+K+K+ M I
Sbjct: 21 FEIEKAEAVEEAEKKIEEIYEKKEKQAEMEKQIII 55
>gnl|CDD|167649 PRK03963, PRK03963, V-type ATP synthase subunit E; Provisional.
Length = 198
Score = 26.6 bits (59), Expect = 9.7
Identities = 11/35 (31%), Positives = 23/35 (65%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
+K+ E+ K+EA+K+ + K E +K + + +E+Q
Sbjct: 27 QKEAEKIKEEARKRAESKAEWILRKAKTQAELEKQ 61
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains separated
by a hinge in the middle. The eukaryotic SMC proteins
form two kind of heterodimers: the SMC1/SMC3 and the
SMC2/SMC4 types. These heterodimers constitute an
essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 27.2 bits (60), Expect = 9.7
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
K++ EEEKKE ++ ++ ++ K+
Sbjct: 998 KERLEEEKKELLREIIEETCQRFKEFL 1024
Score = 27.2 bits (60), Expect = 9.8
Identities = 7/27 (25%), Positives = 14/27 (51%)
Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
K KE+ KK + K++ E +++
Sbjct: 201 LKLKEQAKKALEYYQLKEKLELEEENL 227
>gnl|CDD|240396 PTZ00388, PTZ00388, 40S ribosomal protein S8-like; Provisional.
Length = 223
Score = 26.6 bits (59), Expect = 9.8
Identities = 12/27 (44%), Positives = 15/27 (55%)
Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKT 218
F KK+ EK E KK K EE+ K+
Sbjct: 13 FNKKRYAEKAEMKKTIKAHEEKDVKEK 39
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 27.0 bits (60), Expect = 9.8
Identities = 11/34 (32%), Positives = 21/34 (61%)
Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
I + K+K++E +KEA + +K++E K +R
Sbjct: 21 EIEELKEKRDELRKEASELAEKRDELNAKVRELR 54
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 26.6 bits (59), Expect = 10.0
Identities = 15/45 (33%), Positives = 21/45 (46%)
Query: 175 TTSEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
TT E L R P I +K ++K+ KK KKK + K+
Sbjct: 145 TTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKE 189
>gnl|CDD|147685 pfam05663, DUF809, Protein of unknown function (DUF809). This
family consists of several proteins of unknown function
Raphanus sativus (Radish) and Brassica napus (Rape).
Length = 138
Score = 26.3 bits (57), Expect = 10.0
Identities = 13/31 (41%), Positives = 21/31 (67%)
Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
+EEKKE K + + KEE+K+ K + G E++
Sbjct: 100 KEEKKEGKGEIEGKEEKKEGKGEIEGKEEKK 130
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.324 0.134 0.406
Gapped
Lambda K H
0.267 0.0694 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,406,173
Number of extensions: 1078761
Number of successful extensions: 6879
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5969
Number of HSP's successfully gapped: 576
Length of query: 227
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 134
Effective length of database: 6,812,680
Effective search space: 912899120
Effective search space used: 912899120
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 57 (25.8 bits)