RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7791
(551 letters)
>gnl|CDD|217253 pfam02854, MIF4G, MIF4G domain. MIF4G is named after Middle domain
of eukaryotic initiation factor 4G (eIF4G). Also occurs
in NMD2p and CBP80. The domain is rich in alpha-helices
and may contain multiple alpha-helical repeats. In
eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.
Length = 198
Score = 102 bits (257), Expect = 3e-25
Identities = 54/205 (26%), Positives = 92/205 (44%), Gaps = 18/205 (8%)
Query: 59 KSIHGSVNKVNTGNIGIIARKLFQENII--RGRGLLTRTILQAQAASPTFTNVYAALVDI 116
K + G +NK++ N I +L + N+ + I + P F YA L
Sbjct: 2 KKVKGLLNKLSPSNFEKIIDELLKLNMSDKNLLKYILELIFEKAVEEPNFIPAYARLCAG 61
Query: 117 INSKFPSIGELLLNRCIQQFKR---SFKRNDKALCINSVTFIAHLVNQQVAHEIIVLEIL 173
+NSK P GELLLNR ++F++ +R ++ + V F+ L N +V E I+LE L
Sbjct: 62 LNSKNPDFGELLLNRLQEEFEKGVEEEERGNRKRRLGLVRFLGELFNFKVLTEKIILECL 121
Query: 174 TLLVETPTN---DSDFSLEPYSKHKLHHLRSLMCMPLWWISSIQSFQALESFFS--TDVY 228
L+E+ T +FSLE L L + + + ++
Sbjct: 122 KELLESLTEEDPRDEFSLEIL-------LLLLTTCG-KLLDREKLKKLMDEILERIQKYL 173
Query: 229 NSSKGKLEKRVQYMIEVLAQVRKDN 253
S+ +L R+++M+E L ++RK+
Sbjct: 174 LSADPELSSRLRFMLEDLIELRKNK 198
>gnl|CDD|214713 smart00543, MIF4G, Middle domain of eukaryotic initiation factor 4G
(eIF4G). Also occurs in NMD2p and CBP80. The domain is
rich in alpha-helices and may contain multiple
alpha-helical repeats. In eIF4G, this domain binds
eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G
domain homologues (in press).
Length = 200
Score = 95.9 bits (239), Expect = 8e-23
Identities = 55/211 (26%), Positives = 93/211 (44%), Gaps = 28/211 (13%)
Query: 59 KSIHGSVNKVNTGNIGIIARKLFQENIIRG--RGLLTRTILQAQAASPTFTNVYAALVDI 116
K + G +NK++ N I ++L + N R + I + P F YA L +
Sbjct: 2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCAL 61
Query: 117 INSKFPSIGELLLNRCIQQFKR---SFKRNDKALCINSVTFIAHLVNQQVAHEIIVLEIL 173
+N+K P G LLL R ++F++ S + +DK + V F+ L N QV I+LE+L
Sbjct: 62 LNAKNPDFGSLLLERLQEEFEKGLESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELL 121
Query: 174 TLLVETPTN----DSDFSLEPYSKHKLHHLRSLMCMPLWWI----SSIQSFQALESFFS- 224
L+ T SDFS+E + L +S + L+
Sbjct: 122 KELLNDLTKLDPPRSDFSVE------------CLLSLLPTCGKDLEREKSPKLLDEILER 169
Query: 225 --TDVYNSSKGKLEKRVQYMIEVLAQVRKDN 253
+ K +L R+++M+E+L ++RK+
Sbjct: 170 LQDYLLKKDKTELSSRLRFMLELLIELRKNK 200
>gnl|CDD|214714 smart00544, MA3, Domain in DAP-5, eIF4G, MA-3 and other proteins.
Highly alpha-helical. May contain repeats and/or regions
similar to MIF4G domains Ponting (TIBS) "Novel eIF4G
domain homologues" in press.
Length = 113
Score = 60.7 bits (148), Expect = 2e-11
Identities = 25/101 (24%), Positives = 45/101 (44%), Gaps = 10/101 (9%)
Query: 358 LRRTIYLTIHSSL---DFEECAHKLLRMQLKPGQESELCHIVHIDTSLWGLFGDQLRTYE 414
L++ I+L I L D +E H LL ++L P Q E+ ++ ++ RTY
Sbjct: 1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKL-PEQHHEVVKVLLTCAL------EEKRTYR 53
Query: 415 KFFGLLAQRFCQINKMYVAPLEQIFRESYSTVHRLDINKLR 455
+ + +L R CQ N + E+ F + L+++
Sbjct: 54 EMYSVLLSRLCQANVISTKQFEKGFWRLLEDIEDLELDIPN 94
>gnl|CDD|111714 pfam02847, MA3, MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other
proteins. Highly alpha-helical. May contain repeats
and/or regions similar to MIF4G domains.
Length = 113
Score = 57.3 bits (139), Expect = 3e-10
Identities = 23/101 (22%), Positives = 47/101 (46%), Gaps = 10/101 (9%)
Query: 358 LRRTIYLTIHSSL---DFEECAHKLLRMQLKPGQESELCHIVHIDTSLWGLFGDQLRTYE 414
L++ I+L + L D++E A LL + L P Q E+ ++ + ++ +TY
Sbjct: 1 LKKKIFLILEEYLSSGDYDEAARCLLELGL-PFQHHEVVKVLLVCAL------EEKKTYR 53
Query: 415 KFFGLLAQRFCQINKMYVAPLEQIFRESYSTVHRLDINKLR 455
+ +G+L R CQ + E+ F ++ L+++
Sbjct: 54 EMYGVLLSRLCQFGLISTKQFEKGFSRLLESLEDLELDIPD 94
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 43.8 bits (103), Expect = 2e-04
Identities = 22/52 (42%), Positives = 37/52 (71%), Gaps = 4/52 (7%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETI-IDNTETNLVALRRTIYL 364
GD +E++E+ ++EE+E+EEE E EE E+EN+E + ++ ET ++ IYL
Sbjct: 861 GDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETR---QKQAIYL 909
Score = 41.9 bits (98), Expect = 0.001
Identities = 15/32 (46%), Positives = 24/32 (75%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
G + E++E+ ++EE+E+EEE E EE E+E E
Sbjct: 860 GGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 40.7 bits (95), Expect = 0.002
Identities = 25/76 (32%), Positives = 37/76 (48%), Gaps = 8/76 (10%)
Query: 278 TLDGVKDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDE--------DGDDEEDE 329
T G KD + + Q D + +E L E G+ +D++ DG D E+E
Sbjct: 807 TEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEE 866
Query: 330 DEEESEAEEGEKENKE 345
+EEE E EE E+E +E
Sbjct: 867 EEEEEEEEEEEEEEEE 882
Score = 37.7 bits (87), Expect = 0.017
Identities = 15/32 (46%), Positives = 23/32 (71%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
G + D E+ ++EE+E+EEE E EE E+E +E
Sbjct: 857 GSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEE 888
Score = 36.5 bits (84), Expect = 0.043
Identities = 14/32 (43%), Positives = 23/32 (71%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
D + +E+ ++EE+E+EEE E EE E+E +E
Sbjct: 858 SDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 32.7 bits (74), Expect = 0.66
Identities = 21/60 (35%), Positives = 26/60 (43%), Gaps = 8/60 (13%)
Query: 302 EEKYTTLRREILGDEDEDD----EDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVA 357
EE E G EDE + E+G++ EDE E E+E K ET D ET
Sbjct: 713 EEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEG----KHEVETEGDRKETEHEG 768
Score = 29.6 bits (66), Expect = 6.1
Identities = 13/41 (31%), Positives = 19/41 (46%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
E K + E +E E + + + E EDE E E E +E
Sbjct: 700 EAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEE 740
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 40.4 bits (95), Expect = 4e-04
Identities = 13/35 (37%), Positives = 23/35 (65%)
Query: 312 ILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
DE +DDE+ ++EED++E++ E E E+E+
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVK 143
Score = 38.5 bits (90), Expect = 0.002
Identities = 11/31 (35%), Positives = 22/31 (70%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
DE++++E+ D+E+D+++E E E K+ K
Sbjct: 116 DDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
Score = 37.7 bits (88), Expect = 0.003
Identities = 14/36 (38%), Positives = 26/36 (72%), Gaps = 1/36 (2%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
E D+DE+DE+ +++++ED++E E+EE E K+
Sbjct: 111 EDESDDDEEDEE-EEDDEEDDDEDESEEEESPVKKV 145
Score = 35.4 bits (82), Expect = 0.023
Identities = 9/37 (24%), Positives = 27/37 (72%)
Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
+ ++ E+++ +D +++E+E+++E + +E E E +E+
Sbjct: 104 QHLVASEEDESDDDEEDEEEEDDEEDDDEDESEEEES 140
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 40.4 bits (94), Expect = 0.001
Identities = 19/38 (50%), Positives = 27/38 (71%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
EDE DED ++EE+E+EEE+E EE E+ T ++TE
Sbjct: 132 EDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEV 169
Score = 38.1 bits (88), Expect = 0.007
Identities = 17/34 (50%), Positives = 21/34 (61%)
Query: 321 EDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
ED DE++E+EEE E EE E E E + T TN
Sbjct: 132 EDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTN 165
Score = 37.4 bits (86), Expect = 0.015
Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 5/60 (8%)
Query: 291 VFQYDPEYLMNEEK--YTTLRREIL---GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
VF+Y P Y + + Y L+R + D E++ DGD E+E EEE+ EE E+ +
Sbjct: 20 VFKYRPRYYLYKHAYFYPPLKRFSVQSGSDSSEENGDGDSSEEEGEEETSNEEENNEDSD 79
Score = 28.9 bits (64), Expect = 7.1
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 326 EEDEDEEESEAEEGEKENKETIIDN 350
+EDE +E+ E EE E+E + + +N
Sbjct: 131 KEDESDEDEEEEEEEEEEEAEVEEN 155
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 40.1 bits (94), Expect = 0.002
Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 10/62 (16%)
Query: 284 DTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
T + + D E+ + EE+ +E+E D D DD ED++ E + EEGEKE
Sbjct: 29 WTYLLFEEEEDDEEFEIEEEE----------EEEEVDSDFDDSEDDEPESDDEEEGEKEL 78
Query: 344 KE 345
+
Sbjct: 79 QR 80
Score = 31.6 bits (72), Expect = 0.79
Identities = 16/35 (45%), Positives = 21/35 (60%), Gaps = 4/35 (11%)
Query: 313 LGDEDEDDEDGDD----EEDEDEEESEAEEGEKEN 343
L +E E+DE EE+ED+EE E EE E+E
Sbjct: 18 LEEELEEDEFFWTYLLFEEEEDDEEFEIEEEEEEE 52
Score = 31.6 bits (72), Expect = 0.80
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
D+ EDDE D+E+E E+E + EE K+ K
Sbjct: 59 DDSEDDEPESDDEEEGEKELQREERLKKKK 88
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 38.5 bits (90), Expect = 0.002
Identities = 16/45 (35%), Positives = 26/45 (57%)
Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
R+E L +E+++D +D E ED E+ E +E E + T+ D E
Sbjct: 72 RKEALKLLEEENDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEE 116
Score = 29.2 bits (66), Expect = 2.4
Identities = 9/36 (25%), Positives = 19/36 (52%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIID 349
DE++ + + ++ ++DE E E + +E ID
Sbjct: 84 DDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYID 119
Score = 27.7 bits (62), Expect = 8.0
Identities = 9/38 (23%), Positives = 17/38 (44%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
++DE+D + +D ED +++E E E
Sbjct: 83 NDDEEDAETEDTEDVEDDEWEGFPEPTVTDYEEEYIDE 120
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 39.6 bits (93), Expect = 0.003
Identities = 13/53 (24%), Positives = 31/53 (58%)
Query: 293 QYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
+ +PE + +E+ E +E+E+D+ + E++++E+E E EE + + +
Sbjct: 266 KGEPEVVGGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGD 318
Score = 35.0 bits (81), Expect = 0.091
Identities = 11/27 (40%), Positives = 21/27 (77%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEK 341
D+ + ED D+E++++EEE + +EG+K
Sbjct: 293 DDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 33.9 bits (78), Expect = 0.17
Identities = 17/49 (34%), Positives = 31/49 (63%), Gaps = 5/49 (10%)
Query: 311 EILGDEDEDDED-----GDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
E++G ++ED E+ ++EE++D ESE E+ E E++E D+ E +
Sbjct: 270 EVVGGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGD 318
Score = 28.5 bits (64), Expect = 9.7
Identities = 11/36 (30%), Positives = 22/36 (61%)
Query: 303 EKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
EK E E ED+++ D++E+E+E++ E ++
Sbjct: 284 EKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 38.8 bits (91), Expect = 0.007
Identities = 18/34 (52%), Positives = 27/34 (79%)
Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
+EDE+DED DDE+DE+EEE E E+ +K++ E+
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAES 386
Score = 37.7 bits (88), Expect = 0.015
Identities = 14/32 (43%), Positives = 23/32 (71%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
D +++ED +DE+ +DE++ E EE EKE K+
Sbjct: 350 DLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKK 381
Score = 36.9 bits (86), Expect = 0.033
Identities = 15/36 (41%), Positives = 27/36 (75%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
++ ++E+DE+ +D +DED+EE E EE EK+ K++
Sbjct: 348 DVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKS 383
Score = 36.9 bits (86), Expect = 0.034
Identities = 16/38 (42%), Positives = 26/38 (68%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
DE+ED+ED D ++++DEEE E E+ +K+ K +E
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
Score = 36.5 bits (85), Expect = 0.037
Identities = 12/33 (36%), Positives = 24/33 (72%)
Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
G +DED+ED DD+ +E+EE+ + + E++ ++
Sbjct: 328 DGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEED 360
Score = 35.4 bits (82), Expect = 0.089
Identities = 12/39 (30%), Positives = 25/39 (64%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
D D DE+ D+E+++ ++E + EE E+E ++ + E+
Sbjct: 348 DVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAES 386
Score = 35.0 bits (81), Expect = 0.13
Identities = 12/40 (30%), Positives = 23/40 (57%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
+ D+E+ +++ED D+E+ E EE E++ K+ T
Sbjct: 349 VDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTR 388
Score = 34.6 bits (80), Expect = 0.15
Identities = 10/36 (27%), Positives = 25/36 (69%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
++ +E++++++ D+ED++EEE E +E +K+
Sbjct: 350 DLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAE 385
Score = 33.8 bits (78), Expect = 0.28
Identities = 20/76 (26%), Positives = 42/76 (55%), Gaps = 3/76 (3%)
Query: 280 DGVKDTQDILNVFQYDPE---YLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEA 336
D + D+ + F+ D + L E+ + + DEDE+D+D D EE+E++ +
Sbjct: 294 DSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSD 353
Query: 337 EEGEKENKETIIDNTE 352
EE ++E++++ ++ E
Sbjct: 354 EEEDEEDEDSDDEDDE 369
Score = 33.4 bits (77), Expect = 0.38
Identities = 12/40 (30%), Positives = 25/40 (62%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
+E+E+D D DEE+++E+E +E ++E +E + +
Sbjct: 343 EEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKK 382
Score = 33.0 bits (76), Expect = 0.41
Identities = 9/49 (18%), Positives = 25/49 (51%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIY 363
+E++ D ++E++EDE+ + ++ E+E +E ++ +
Sbjct: 345 EEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELPF 393
Score = 33.0 bits (76), Expect = 0.50
Identities = 10/35 (28%), Positives = 23/35 (65%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
E D+ E++E+ D DE+E+E + + +++++E
Sbjct: 336 EDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEE 370
Score = 32.3 bits (74), Expect = 0.82
Identities = 11/31 (35%), Positives = 21/31 (67%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
D +E++ED D ++E++EE E + E + +E
Sbjct: 341 DLEEEEEDVDLSDEEEDEEDEDSDDEDDEEE 371
Score = 31.5 bits (72), Expect = 1.5
Identities = 12/46 (26%), Positives = 30/46 (65%)
Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNL 355
E + DE++++ D++ D++++E E EE +++ K+ ++T + L
Sbjct: 346 EEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
Score = 30.7 bits (70), Expect = 2.3
Identities = 13/36 (36%), Positives = 24/36 (66%), Gaps = 1/36 (2%)
Query: 308 LRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
LRR + G+E++D+E+ D +E D+ + E E + +N
Sbjct: 279 LRR-MRGEEEDDEEEEDSKESADDLDDEFEPDDDDN 313
Score = 30.0 bits (68), Expect = 4.1
Identities = 12/43 (27%), Positives = 24/43 (55%)
Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNL 355
DE+E+++ DDE++ED+++ EE E + ++ E
Sbjct: 320 EEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDED 362
Score = 29.6 bits (67), Expect = 5.1
Identities = 12/48 (25%), Positives = 23/48 (47%), Gaps = 10/48 (20%)
Query: 313 LGDEDEDDEDGDDEEDE----------DEEESEAEEGEKENKETIIDN 350
D+ +DD+ GD D+ DEE+ E + K++K+ ++
Sbjct: 134 DDDDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMKE 181
Score = 29.2 bits (66), Expect = 6.5
Identities = 15/49 (30%), Positives = 24/49 (48%), Gaps = 1/49 (2%)
Query: 302 EEKYTTLRREILGDEDEDDEDGD-DEEDEDEEESEAEEGEKENKETIID 349
EE+ + + L DE E D+D + +E+E E E+G + E D
Sbjct: 291 EEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDD 339
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 37.8 bits (88), Expect = 0.008
Identities = 25/65 (38%), Positives = 33/65 (50%), Gaps = 4/65 (6%)
Query: 294 YDP-EYLMNEEKYTTLR--REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDN 350
YDP EY E+Y L R+ L + EDD D D EDEDEE +E + + ++
Sbjct: 14 YDPSEYKEVVEEYEKLEELRKKLKAKKEDDSDDTDYEDEDEE-LYLDESDMGDSDSKTRT 72
Query: 351 TETNL 355
T NL
Sbjct: 73 TVRNL 77
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 37.6 bits (88), Expect = 0.008
Identities = 11/26 (42%), Positives = 21/26 (80%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGE 340
+ED +D+D DDE++++++E EA+ G
Sbjct: 50 EEDLEDDDDDDEDEDEDDEEEADLGP 75
Score = 37.6 bits (88), Expect = 0.008
Identities = 10/31 (32%), Positives = 20/31 (64%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
++ED ED DD++++++E+ E E + E
Sbjct: 48 LDEEDLEDDDDDDEDEDEDDEEEADLGPDPE 78
Score = 36.4 bits (85), Expect = 0.019
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE 342
+ DE+D + DD++DEDE+E + EE +
Sbjct: 47 ELDEEDLEDDDDDDEDEDEDDEEEADLG 74
Score = 36.4 bits (85), Expect = 0.021
Identities = 11/29 (37%), Positives = 21/29 (72%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
+ + D+ED +D++D+DE+E E +E E +
Sbjct: 45 ESELDEEDLEDDDDDDEDEDEDDEEEADL 73
Score = 31.8 bits (73), Expect = 0.63
Identities = 9/34 (26%), Positives = 20/34 (58%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKETIID 349
E + D +D ED+D+++ + +E ++E + D
Sbjct: 43 AIESELDEEDLEDDDDDDEDEDEDDEEEADLGPD 76
Score = 30.2 bits (69), Expect = 1.9
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEE 338
E L D+D+DDED D++++E+ + E
Sbjct: 51 EDLEDDDDDDEDEDEDDEEEADLGPDPE 78
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 34.9 bits (81), Expect = 0.009
Identities = 10/24 (41%), Positives = 19/24 (79%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
+E+E+ ED + E++EDEEE + ++
Sbjct: 49 EEEEEGEDLESEDEEDEEEDDDDD 72
Score = 33.0 bits (76), Expect = 0.050
Identities = 12/29 (41%), Positives = 20/29 (68%), Gaps = 1/29 (3%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
E E++E+G+D E EDEE+ E E+ + +
Sbjct: 45 DAEWEEEEEGEDLESEDEEDEE-EDDDDD 72
Score = 31.8 bits (73), Expect = 0.12
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 311 EILGDEDEDDEDGDDEEDED 330
E L EDE+DE+ DD++D D
Sbjct: 55 EDLESEDEEDEEEDDDDDMD 74
Score = 31.4 bits (72), Expect = 0.16
Identities = 7/23 (30%), Positives = 17/23 (73%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAE 337
+E ED E D+E++E++++ + +
Sbjct: 52 EEGEDLESEDEEDEEEDDDDDMD 74
Score = 28.3 bits (64), Expect = 2.0
Identities = 7/20 (35%), Positives = 16/20 (80%)
Query: 314 GDEDEDDEDGDDEEDEDEEE 333
G++ E +++ D+EED+D++
Sbjct: 54 GEDLESEDEEDEEEDDDDDM 73
Score = 26.8 bits (60), Expect = 6.7
Identities = 6/20 (30%), Positives = 15/20 (75%)
Query: 314 GDEDEDDEDGDDEEDEDEEE 333
D + +DE+ ++E+D+D+ +
Sbjct: 55 EDLESEDEEDEEEDDDDDMD 74
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 37.4 bits (87), Expect = 0.010
Identities = 14/44 (31%), Positives = 30/44 (68%)
Query: 300 MNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
M E+K L E + +EDE DE+ ++EE+E++E+ + ++ + ++
Sbjct: 155 MLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198
Score = 36.3 bits (84), Expect = 0.024
Identities = 14/31 (45%), Positives = 22/31 (70%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
ED D+ED DEE+E+EEE E E+ + ++ +
Sbjct: 165 AEDVDEEDEKDEEEEEEEEEEDEDFDDDDDD 195
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 38.1 bits (89), Expect = 0.010
Identities = 16/58 (27%), Positives = 37/58 (63%)
Query: 282 VKDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEG 339
V++ D L+ + + ++++ + + T + E +E+E +++ ++EE+E+ EE E EEG
Sbjct: 316 VRELADSLDNWVHHVQHILPQGRCTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373
Score = 33.5 bits (77), Expect = 0.27
Identities = 13/28 (46%), Positives = 20/28 (71%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE 342
E +D+E+ ++E+E+EEE E EE E E
Sbjct: 344 PEQKDEEEEQEDEEEEEEEEEPEEPEPE 371
Score = 30.0 bits (68), Expect = 4.1
Identities = 13/28 (46%), Positives = 20/28 (71%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE 342
+DE++E D+EE+E+EEE E E E+
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPEPEEG 373
Score = 29.6 bits (67), Expect = 4.2
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 318 EDDEDGDDEEDEDEEESEAEEGEKENKE 345
++ ++EE EDEEE E EE +E +
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEP 370
Score = 29.6 bits (67), Expect = 5.2
Identities = 8/24 (33%), Positives = 14/24 (58%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAE 337
EDE++E+ ++E +E E E
Sbjct: 352 EQEDEEEEEEEEEPEEPEPEEGPP 375
Score = 29.3 bits (66), Expect = 5.9
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKE 342
E ++ +++EDE+EEE E E E E
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPE 369
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 37.7 bits (87), Expect = 0.011
Identities = 24/103 (23%), Positives = 40/103 (38%), Gaps = 11/103 (10%)
Query: 252 DNFKDFPDVIEDLDLVPEEDKFTHLMTLDGVKDTQDILNVFQYDPEYLMNEEKYTTLRRE 311
++ + P D+D EE DG ++ +D E N E E
Sbjct: 162 EDNDEAPPAQPDVDNEEEERLEES----DGREEEEDEEVGSDSYGE--GNREL-----NE 210
Query: 312 ILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
+E E +DG+D D + E + ++GE+E E + N
Sbjct: 211 EEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEI 253
Score = 35.4 bits (81), Expect = 0.054
Identities = 13/41 (31%), Positives = 19/41 (46%), Gaps = 3/41 (7%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNL 355
+E+ED+E G D E E EE E+ D+ E +
Sbjct: 189 EEEEDEEVGSDSYGEGNRELNEEEEEEAEGS---DDGEDVV 226
Score = 30.3 bits (68), Expect = 2.2
Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 7/59 (11%)
Query: 303 EKYTTLRREILGDEDEDDE-------DGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
EK R I+ D DED++ D D+EE+E EES+ E E++ + E N
Sbjct: 147 EKPRVTRFNIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGN 205
Score = 30.3 bits (68), Expect = 2.7
Identities = 14/41 (34%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 314 GDEDEDDED-GDDEEDEDEEESEAEEGEKENKETIIDNTET 353
G+E+E +E+ + E E EEES +EE + N+E+ +
Sbjct: 238 GEEEEMEEEVINLFEIEWEEESPSEEVPRNNEESPAKKQKV 278
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 37.1 bits (86), Expect = 0.015
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
R++L D +E GD+ E D E++E E+ KE+++
Sbjct: 200 RDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDED 235
Score = 36.7 bits (85), Expect = 0.024
Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 7/66 (10%)
Query: 285 TQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEE----SEAEEGE 340
+D+L+ E ++ + E DED+ ED DD++ E+EE S +E+ +
Sbjct: 199 VRDMLSSMDMAEEL---GDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSD 255
Query: 341 KENKET 346
++E
Sbjct: 256 ASSEEM 261
Score = 31.7 bits (72), Expect = 0.87
Identities = 11/35 (31%), Positives = 18/35 (51%), Gaps = 2/35 (5%)
Query: 314 GDEDEDDEDGDDEEDEDE--EESEAEEGEKENKET 346
DED+D + ++ D E+S+A E E+ E
Sbjct: 232 EDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEM 266
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 37.3 bits (87), Expect = 0.016
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
G E +DD+D ++E + +E+E +EGE + E
Sbjct: 91 GLESDDDDDEEEEWEVEEDEDSDDEGEWIDVE 122
Score = 34.2 bits (79), Expect = 0.16
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
D++ + D +DEE++DE +A+E E
Sbjct: 124 DKEIESSDSEDEEEKDEAAKKAKEDSDEELS 154
Score = 32.3 bits (74), Expect = 0.60
Identities = 13/40 (32%), Positives = 21/40 (52%)
Query: 303 EKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
EK+ R+ E + D DD+E+E+ E E E+ + E
Sbjct: 76 EKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDE 115
Score = 32.3 bits (74), Expect = 0.61
Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 12/51 (23%)
Query: 314 GDEDEDDEDG------------DDEEDEDEEESEAEEGEKENKETIIDNTE 352
DED DDE D EDE+E++ A++ ++++ E + + E
Sbjct: 108 EDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDE 158
Score = 31.5 bits (72), Expect = 0.92
Identities = 16/38 (42%), Positives = 22/38 (57%), Gaps = 7/38 (18%)
Query: 315 DEDEDDE-------DGDDEEDEDEEESEAEEGEKENKE 345
DE+E DE D D+E E++EE AEE E E ++
Sbjct: 134 DEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEK 171
Score = 30.0 bits (68), Expect = 3.2
Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 9/90 (10%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIYLT-IHSS 369
EI + ED+E+ D+ + +E+S+ E E++ +E + + T I +
Sbjct: 126 EIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTP 185
Query: 370 LDF--------EECAHKLLRMQLKPGQESE 391
DF E+ K L +LK +
Sbjct: 186 ADFAKIQELRLEKGVDKALGGKLKRRDKDA 215
Score = 29.2 bits (66), Expect = 4.8
Identities = 8/29 (27%), Positives = 15/29 (51%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
+E+ + E+ +D +DE E + E E
Sbjct: 100 EEEEWEVEEDEDSDDEGEWIDVESDKEIE 128
Score = 28.9 bits (65), Expect = 6.6
Identities = 9/39 (23%), Positives = 23/39 (58%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
+ D++ E D E++E+++E+ + E ++E ++ E
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEE 160
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 37.3 bits (86), Expect = 0.018
Identities = 16/41 (39%), Positives = 29/41 (70%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNT 351
E + +ED+D+ED DD++DEDE++ + ++ E + E D+T
Sbjct: 48 EAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDST 88
Score = 36.1 bits (83), Expect = 0.041
Identities = 16/52 (30%), Positives = 33/52 (63%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
E + + E +ED+DD+D +DE+D+D+++ E +E E ++ T+ D++
Sbjct: 44 EAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSA 95
Score = 34.6 bits (79), Expect = 0.13
Identities = 31/104 (29%), Positives = 49/104 (47%), Gaps = 11/104 (10%)
Query: 254 FKDF-PDVIEDLDLVPEEDKFTHLMTL-DGVKDTQDILNVFQYDPEYLMNEEKYTTLRRE 311
F +F +ED D V + +TL D +K I + + E M EE +
Sbjct: 2 FHEFASSELEDDDWVRGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDD 61
Query: 312 ILGDEDEDDEDGDDEEDEDEEE---------SEAEEGEKENKET 346
DEDEDD+D DD+ED+++E+ S A++G + + E
Sbjct: 62 DDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEA 105
Score = 33.0 bits (75), Expect = 0.42
Identities = 16/53 (30%), Positives = 27/53 (50%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
EE + D+D+DDED DD++D+D+E+ E E+ + + N
Sbjct: 47 EEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGN 99
Score = 32.7 bits (74), Expect = 0.49
Identities = 14/39 (35%), Positives = 26/39 (66%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
D+DED++D DD++DED+E+ + ++ + + D ET
Sbjct: 63 DDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNET 101
Score = 30.3 bits (68), Expect = 3.0
Identities = 9/41 (21%), Positives = 27/41 (65%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLV 356
+E+D+D +D++D+D+E+ + ++ + + + D+ ++ L
Sbjct: 50 MEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLH 90
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 37.3 bits (86), Expect = 0.019
Identities = 23/68 (33%), Positives = 31/68 (45%), Gaps = 2/68 (2%)
Query: 293 QYDPEYLMNEEKYTTLRREILGDEDEDDEDGDD--EEDEDEEESEAEEGEKENKETIIDN 350
Q E+L+N+E L + + ED DG D ED ++E E EE E E I D
Sbjct: 76 QKYVEFLINKEHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEEEEDESLEDEMIDDE 135
Query: 351 TETNLVAL 358
E +L
Sbjct: 136 DEADLFNE 143
Score = 31.1 bits (70), Expect = 1.7
Identities = 13/38 (34%), Positives = 20/38 (52%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
E ++ DDE ++DEE+ EE E KE++ T
Sbjct: 144 SESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATR 181
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 37.3 bits (87), Expect = 0.020
Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 5/50 (10%)
Query: 299 LMNEEKYTTLRREILG-DEDEDDEDGDDEEDEDEEESEAEEGEKENKETI 347
EE LRR+I G E+DED D + +E++E+ + ++ + EN +
Sbjct: 309 RQGEE----LRRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWML 354
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 34.9 bits (80), Expect = 0.021
Identities = 18/40 (45%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEG-EKENKETIIDNTET 353
DED+++E+G+ EE+E EEE E E K E D+ ET
Sbjct: 58 DEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAET 97
Score = 31.9 bits (72), Expect = 0.25
Identities = 12/44 (27%), Positives = 22/44 (50%), Gaps = 7/44 (15%)
Query: 314 GDEDEDDEDGDDEEDEDEEES-------EAEEGEKENKETIIDN 350
+E+E + + ++ E+E+E E E EE + E K+ D
Sbjct: 61 DEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKTDE 104
Score = 29.9 bits (67), Expect = 0.95
Identities = 14/31 (45%), Positives = 24/31 (77%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
DE E++E+ D++++E+E E E EEGE+E +
Sbjct: 49 DEMEEEEEVDEDDEEEEGEGEEEEGEEEEET 79
Score = 29.2 bits (65), Expect = 1.7
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
+E+E+ ++ D+EE+ + EE E EE E+ T
Sbjct: 52 EEEEEVDEDDEEEEGEGEEEEGEEEEETEGAT 83
Score = 28.8 bits (64), Expect = 2.4
Identities = 13/32 (40%), Positives = 22/32 (68%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
G ++ DDE ++EE ++++E E EGE+E E
Sbjct: 43 GAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGE 74
Score = 28.4 bits (63), Expect = 3.1
Identities = 12/40 (30%), Positives = 21/40 (52%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
E E++E ++EE E A E E+++ ET T+ +
Sbjct: 66 GEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKTDED 105
Score = 27.2 bits (60), Expect = 8.5
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
+E+ ++GDDE +E+EE E +E E+ E
Sbjct: 39 NEENGAQEGDDEMEEEEEVDEDDEEEEGEGE 69
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 34.3 bits (79), Expect = 0.025
Identities = 12/35 (34%), Positives = 20/35 (57%)
Query: 319 DDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
D E D+E+E+EEE E + + +++ ID E
Sbjct: 3 DTEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEA 37
Score = 30.1 bits (68), Expect = 0.76
Identities = 11/29 (37%), Positives = 20/29 (68%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
+E+E++E+ DD ED +E+ +E E E+
Sbjct: 11 EEEEEEEEEDDLEDLSDEDEFIDEAEAED 39
Score = 28.9 bits (65), Expect = 1.5
Identities = 9/32 (28%), Positives = 21/32 (65%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
+ D+++E+ ++EED+ E+ S+ +E E +
Sbjct: 7 EVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAE 38
Score = 28.2 bits (63), Expect = 3.1
Identities = 7/25 (28%), Positives = 17/25 (68%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEE 338
DE+E++E+ +D+ ++ +E E +
Sbjct: 9 DDEEEEEEEEEDDLEDLSDEDEFID 33
Score = 27.0 bits (60), Expect = 8.5
Identities = 10/34 (29%), Positives = 19/34 (55%), Gaps = 2/34 (5%)
Query: 315 DEDE--DDEDGDDEEDEDEEESEAEEGEKENKET 346
E E D+E+ ++EE++D E+ E+ + E
Sbjct: 4 TEAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEA 37
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 36.6 bits (84), Expect = 0.028
Identities = 19/95 (20%), Positives = 37/95 (38%), Gaps = 3/95 (3%)
Query: 252 DNFKDFPDVIEDLDLVPEEDKFTHLMTLDGVKDTQ-DILNVFQYDPEYLMNEEKYTTLRR 310
+NFK+ P + +E + +G+ + D+ ++E +
Sbjct: 242 ENFKEEPGEPLSRPALKKEKQGAEEEGEEGMSEEDLDVGAAEI--ENKEVSEGDKEQQQE 299
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
E+ E +E D DE EE E ++ +EN+
Sbjct: 300 EVENAEAHKEEVQSDRPDEIGEEKEEDDENEENER 334
Score = 31.6 bits (71), Expect = 1.2
Identities = 12/47 (25%), Positives = 20/47 (42%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRR 360
+ D D EE E+++E+E E E ++ E + RR
Sbjct: 309 EEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELEKGIEEKRR 355
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 36.8 bits (85), Expect = 0.031
Identities = 15/30 (50%), Positives = 24/30 (80%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
E++E+ ++EE+E+E+ESE EEGE E +E
Sbjct: 444 SVEEEEEEEEEEEEEEQESEEEEGEDEEEE 473
Score = 33.3 bits (76), Expect = 0.35
Identities = 14/32 (43%), Positives = 25/32 (78%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
+E+ +E+ ++EE+E+EEE E+EE E E++E
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEE 471
Score = 32.2 bits (73), Expect = 0.78
Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 2/44 (4%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVA 357
E E++E ++EE+E+EEE E E+ +E + D E V
Sbjct: 436 SQESEEEESVEEEEEEEEEEEEEEQESEEEEGE--DEEEEEEVE 477
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 36.9 bits (85), Expect = 0.031
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKENKET 346
E + E+DED EESE E+ E+E +
Sbjct: 306 PEIPAKPEIEQDEDSEESEEEKNEEEGGLS 335
Score = 33.4 bits (76), Expect = 0.35
Identities = 12/38 (31%), Positives = 18/38 (47%), Gaps = 1/38 (2%)
Query: 309 RREILGDEDED-DEDGDDEEDEDEEESEAEEGEKENKE 345
R + L E E DE+ E+ EE + EE +K+
Sbjct: 300 REDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKK 337
Score = 32.6 bits (74), Expect = 0.51
Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 8/39 (20%)
Query: 314 GDEDEDD---EDGDDEEDE-----DEEESEAEEGEKENK 344
GD+D D+ +DGDDE E D S + E+E+K
Sbjct: 265 GDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDK 303
Score = 30.3 bits (68), Expect = 3.1
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENK 344
E +D++DGD+ + E+ E + +K+ K
Sbjct: 219 EGDDEDDGDESDKGGEDGDEEKSKKKKKK 247
Score = 29.9 bits (67), Expect = 3.6
Identities = 11/43 (25%), Positives = 21/43 (48%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
EE+ L EI + + ++ +E +E++ E E +K K
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340
Score = 29.9 bits (67), Expect = 3.8
Identities = 8/31 (25%), Positives = 19/31 (61%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
+ ++D++ + EE+++EEE + K+ K
Sbjct: 312 PEIEQDEDSEESEEEKNEEEGGLSKKGKKLK 342
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 36.9 bits (85), Expect = 0.032
Identities = 16/39 (41%), Positives = 27/39 (69%)
Query: 300 MNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
+ EE + ++ D+DED+++ DD+E++DEEE E EE
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEE 178
Score = 35.8 bits (82), Expect = 0.082
Identities = 14/28 (50%), Positives = 23/28 (82%)
Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKENK 344
D+DDED D+++D++E++ E EE E+E K
Sbjct: 154 DDDDEDEDEDDDDEEDDEEEEEEEEEIK 181
Score = 35.4 bits (81), Expect = 0.090
Identities = 17/39 (43%), Positives = 27/39 (69%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGE 340
E+ + R + D+D++DED DD+++ED+EE E EE E
Sbjct: 141 AEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179
Score = 33.9 bits (77), Expect = 0.26
Identities = 11/38 (28%), Positives = 23/38 (60%)
Query: 308 LRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
L E + D D DDE+++++++ E ++ E+E +E
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEE 177
Score = 32.7 bits (74), Expect = 0.59
Identities = 16/31 (51%), Positives = 25/31 (80%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
DEDEDD+D +D+E+E+EEE E + + E++E
Sbjct: 159 DEDEDDDDEEDDEEEEEEEEEIKGFDDEDEE 189
Score = 31.6 bits (71), Expect = 1.6
Identities = 12/31 (38%), Positives = 22/31 (70%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
D D+D +DE+++D++E + EE E+E +E
Sbjct: 149 DNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179
Score = 30.0 bits (67), Expect = 4.7
Identities = 17/32 (53%), Positives = 22/32 (68%), Gaps = 5/32 (15%)
Query: 314 GDEDEDDEDGDDEEDEDEEE-----SEAEEGE 340
DED+DDE+ D+EE+E+EEE E EE E
Sbjct: 160 EDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191
Score = 29.2 bits (65), Expect = 8.3
Identities = 11/37 (29%), Positives = 23/37 (62%), Gaps = 3/37 (8%)
Query: 313 LGDEDEDDEDG---DDEEDEDEEESEAEEGEKENKET 346
L +ED D DD++++++E+ + EE ++E +E
Sbjct: 140 LAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEE 176
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 34.9 bits (80), Expect = 0.033
Identities = 9/39 (23%), Positives = 19/39 (48%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
+E E + +D ++ E ++ E E+E + +D T
Sbjct: 73 AEEAEAADADEDADEAAEADAADEADEEEETDEAVDETA 111
Score = 33.4 bits (76), Expect = 0.10
Identities = 8/22 (36%), Positives = 16/22 (72%)
Query: 315 DEDEDDEDGDDEEDEDEEESEA 336
DE D + ++ ++E++EE+EA
Sbjct: 110 TADEADAEAEEADEEEDEEAEA 131
Score = 32.6 bits (74), Expect = 0.16
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
DE D+ D + EE ++EE+ EAE
Sbjct: 108 DETADEADAEAEEADEEEDEEAEA 131
Score = 31.8 bits (72), Expect = 0.33
Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 1/53 (1%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEE-ESEAEEGEKENKETIIDNTET 353
EK + E+ E D +ED DE E++A + E +ET ET
Sbjct: 58 AEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDET 110
Score = 30.3 bits (68), Expect = 1.1
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE 342
+E+E DE D+ DE + E+E + E++
Sbjct: 99 EEEETDEAVDETADEADAEAEEADEEED 126
Score = 30.3 bits (68), Expect = 1.2
Identities = 11/32 (34%), Positives = 21/32 (65%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
DE +++E+ D+ DE +E++AE E + +E
Sbjct: 95 DEADEEEETDEAVDETADEADAEAEEADEEED 126
Score = 29.1 bits (65), Expect = 2.7
Identities = 13/42 (30%), Positives = 20/42 (47%), Gaps = 3/42 (7%)
Query: 315 DEDEDD---EDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
DED D+ D DE DE+EE EA + + + + +
Sbjct: 82 DEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADE 123
Score = 28.8 bits (64), Expect = 3.7
Identities = 11/25 (44%), Positives = 15/25 (60%), Gaps = 1/25 (4%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGE 340
DE ++ D E +E +EE E EE E
Sbjct: 107 VDETADEADAEAEEADEE-EDEEAE 130
Score = 28.8 bits (64), Expect = 3.9
Identities = 12/43 (27%), Positives = 20/43 (46%), Gaps = 8/43 (18%)
Query: 311 EILGDEDEDDEDGDDE--------EDEDEEESEAEEGEKENKE 345
E + ++ D+E E DE ++EAEE ++E E
Sbjct: 85 ADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDE 127
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 36.5 bits (85), Expect = 0.035
Identities = 19/45 (42%), Positives = 30/45 (66%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALR 359
++D++D+D DD +DEDEE+ EA+E EK + + E + ALR
Sbjct: 154 EDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALR 198
Score = 35.0 bits (81), Expect = 0.095
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
D+++DDED DD++ +DE+E + E E E
Sbjct: 151 DDDEDDDEDDDDDDVDDEDEEKKEAKELEK 180
Score = 33.4 bits (77), Expect = 0.35
Identities = 17/51 (33%), Positives = 28/51 (54%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIYLT 365
DED+DD+D DDE++E +E E E+ ++ ++ L R+ LT
Sbjct: 157 DEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLT 207
Score = 31.9 bits (73), Expect = 0.86
Identities = 11/38 (28%), Positives = 23/38 (60%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
D+ +DD+ DD++DED++E + ++ + E + E
Sbjct: 140 DDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKE 177
Score = 31.1 bits (71), Expect = 1.7
Identities = 16/107 (14%), Positives = 40/107 (37%), Gaps = 6/107 (5%)
Query: 247 AQVRKDNFKDFPDVIEDLDLVPEEDKFTHLMTLDGVKDTQDILNVFQYDPEYLMNEEKYT 306
+ + D + ++D+D++ + D D + D + D + +++
Sbjct: 107 EKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDE--DDDDDD 164
Query: 307 TLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
+ E ++ E D++D +E ++E + K D T
Sbjct: 165 VDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARK----DAKLT 207
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 36.4 bits (84), Expect = 0.036
Identities = 15/58 (25%), Positives = 25/58 (43%), Gaps = 1/58 (1%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALR 359
E+ + + + D E+ DEE EDE E E +E E + + + +A R
Sbjct: 326 EDIFEVVEVVEKQEGDVVTEESTDEESEDEVE-IDESVIEEVAEMELLEVQVDDLAER 382
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 36.1 bits (83), Expect = 0.036
Identities = 9/30 (30%), Positives = 17/30 (56%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
D+DE D +E+ED ++ + E+ + N
Sbjct: 273 SDDDEGSSSNDYDEEEDGDDDDNEDNDDTN 302
Score = 33.8 bits (77), Expect = 0.21
Identities = 10/29 (34%), Positives = 20/29 (68%), Gaps = 1/29 (3%)
Query: 315 DEDEDDE-DGDDEEDEDEEESEAEEGEKE 342
D D+E DGDD+++ED +++ G+++
Sbjct: 281 SNDYDEEEDGDDDDNEDNDDTNTNHGDED 309
Score = 32.2 bits (73), Expect = 0.67
Identities = 10/38 (26%), Positives = 21/38 (55%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
+ DD++G D DEEE ++ ++N +T ++ +
Sbjct: 271 YESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDE 308
Score = 30.7 bits (69), Expect = 1.9
Identities = 11/61 (18%), Positives = 26/61 (42%), Gaps = 6/61 (9%)
Query: 301 NEEKYTTLRREILGDEDED------DEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
+++ Y ++ + ++ D + DD+E + + EE ++ D+T TN
Sbjct: 245 DDQGYGRYKQNLSSNKYGDFKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTN 304
Query: 355 L 355
Sbjct: 305 H 305
>gnl|CDD|217059 pfam02480, Herpes_gE, Alphaherpesvirus glycoprotein E.
Glycoprotein E (gE) of Alphaherpesvirus forms a complex
with glycoprotein I (gI) (pfam01688), functioning as an
immunoglobulin G (IgG) Fc binding protein. gE is
involved in virus spread but is not essential for
propagation.
Length = 437
Score = 36.2 bits (84), Expect = 0.042
Identities = 12/50 (24%), Positives = 19/50 (38%)
Query: 293 QYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
Y YT+L L ++ D + DDEE + + + E K
Sbjct: 383 PYAVILNPFSPVYTSLPTNELFEDVSSDSESDDEEFDSDSDREPSGPSKS 432
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 35.8 bits (83), Expect = 0.047
Identities = 12/49 (24%), Positives = 17/49 (34%)
Query: 298 YLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
M E E+ DD+ ++ +DE E E E NK
Sbjct: 225 EKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPV 273
Score = 32.4 bits (74), Expect = 0.54
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
+D+ +E+ DDE + ESE E K +
Sbjct: 245 SDDDGEEESDDESAWEGFESEYEPINKPVR 274
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 36.1 bits (83), Expect = 0.051
Identities = 16/51 (31%), Positives = 27/51 (52%), Gaps = 9/51 (17%)
Query: 301 NEEKYTTLRREILGDEDEDDEDGDDEEDEDEEE---------SEAEEGEKE 342
+++ + + RE+L + +E GDD E EDEE+ + E+GE E
Sbjct: 182 DQQAFARVVREMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGE 232
Score = 30.3 bits (68), Expect = 3.1
Identities = 14/44 (31%), Positives = 22/44 (50%), Gaps = 1/44 (2%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAE-EGEKENK 344
+E T R G+E+ D DD DE +++SE EG + +
Sbjct: 242 QESEATDRESESGEEEMVQSDQDDLPDESDDDSETPGEGARPAR 285
Score = 28.8 bits (64), Expect = 9.1
Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 9/42 (21%)
Query: 314 GDEDEDDEDGDDEEDEDE---------EESEAEEGEKENKET 346
GD+D+ E+ +E+ E E +ESEA + E E+ E
Sbjct: 215 GDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEE 256
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 35.2 bits (81), Expect = 0.062
Identities = 18/43 (41%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKE-TIIDNTETNLVA 357
E EDG++EE E+ EE E EEGE+E++E + +T L
Sbjct: 72 EVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTE 114
Score = 31.7 bits (72), Expect = 0.87
Identities = 11/39 (28%), Positives = 22/39 (56%), Gaps = 1/39 (2%)
Query: 301 NEEKYTTLR-REILGDEDEDDEDGDDEEDEDEEESEAEE 338
++E +R E +E+ ++ + D+EE+ +EE E E
Sbjct: 66 DDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEP 104
Score = 30.6 bits (69), Expect = 2.2
Identities = 13/34 (38%), Positives = 22/34 (64%)
Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
R E+ ED ++E+ ++ E+++EEE E E E E
Sbjct: 70 RIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFE 103
Score = 29.8 bits (67), Expect = 3.7
Identities = 14/61 (22%), Positives = 29/61 (47%)
Query: 280 DGVKDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEG 339
D ++ + + Y E+ + I D +E++ + +E++E+E E E+EE
Sbjct: 43 DAIELAEPETSDDPYGNPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEF 102
Query: 340 E 340
E
Sbjct: 103 E 103
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 34.8 bits (80), Expect = 0.063
Identities = 10/47 (21%), Positives = 20/47 (42%), Gaps = 3/47 (6%)
Query: 313 LGDEDEDDEDGDDEEDEDEEE---SEAEEGEKENKETIIDNTETNLV 356
L E+ D + ++E D+ + + N E +ID ++ L
Sbjct: 80 LAAEELSDSEENEENDKKVDYELPKVQNTAAEVNHEDVIDLSDLKLA 126
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 34.7 bits (80), Expect = 0.065
Identities = 14/56 (25%), Positives = 27/56 (48%)
Query: 297 EYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
E + K+ + + + + +ED EED + EE E EE ++++E+ E
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIE 57
Score = 30.5 bits (69), Expect = 1.9
Identities = 14/63 (22%), Positives = 29/63 (46%)
Query: 283 KDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
K+ +D + + NE K +E + +E +++ E+ E+ E + EE + E
Sbjct: 3 KECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDE 62
Query: 343 NKE 345
N +
Sbjct: 63 NNK 65
Score = 28.6 bits (64), Expect = 7.7
Identities = 10/59 (16%), Positives = 28/59 (47%)
Query: 300 MNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVAL 358
M +E + D +++E+ ++++ ++E+ E ++E E ++ E + L
Sbjct: 1 MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEEL 59
>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2. This
domain family is found in eukaryotes, and is typically
between 138 and 153 amino acids in length. The family is
found in association with pfam00493. Mini-chromosome
maintenance (MCM) proteins are essential for DNA
replication. These proteins use ATPase activity to
perform this function.
Length = 145
Score = 33.8 bits (78), Expect = 0.084
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGE 340
+ DE++E DD +D DEE E E+GE
Sbjct: 3 EPDEEEELEDDIDDLDEEAEEEEDGE 28
Score = 28.1 bits (63), Expect = 7.6
Identities = 12/37 (32%), Positives = 15/37 (40%)
Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
RR DED D +D D + E+E E K
Sbjct: 109 RRRRQYDEDRDADDEDIDPLEEELSLEDLSDVKAASI 145
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved in
conjugation with cellular fusion (mating) and invasive
growth. A high throughput localisation study has
localised RXT2 to the nucleus.
Length = 141
Score = 33.5 bits (77), Expect = 0.084
Identities = 15/26 (57%), Positives = 21/26 (80%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGE 340
DED D D DDE+DED+EE++AE+ +
Sbjct: 48 DEDGGDIDDDDEDDEDDEEADAEDDD 73
Score = 30.4 bits (69), Expect = 1.2
Identities = 10/40 (25%), Positives = 21/40 (52%)
Query: 299 LMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
L + I D + D+D +D+ED++E ++E ++
Sbjct: 34 LSRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDD 73
Score = 30.1 bits (68), Expect = 1.5
Identities = 10/27 (37%), Positives = 19/27 (70%)
Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKEN 343
DED D DD++++DE++ EA+ + +
Sbjct: 48 DEDGGDIDDDDEDDEDDEEADAEDDDE 74
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 35.2 bits (81), Expect = 0.084
Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 4/41 (9%)
Query: 316 EDEDDEDGDD---EEDEDEEE-SEAEEGEKENKETIIDNTE 352
ED D+EDGDD + +ED E E EG E++E + T+
Sbjct: 229 EDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATD 269
Score = 33.7 bits (77), Expect = 0.28
Identities = 13/37 (35%), Positives = 23/37 (62%)
Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
R++LG D +E GDD +ED +E + ++ + +N E
Sbjct: 210 RDMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNED 246
Score = 30.6 bits (69), Expect = 2.9
Identities = 12/33 (36%), Positives = 17/33 (51%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
GD+D+ D + D E +E E E E E + T
Sbjct: 236 GDDDQPDNNEDSEAGREESEGSDESEEDEAEAT 268
Score = 29.4 bits (66), Expect = 5.4
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
+ED + + E ++ EE EAE + E +E +D E
Sbjct: 243 NNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAE 281
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 33.4 bits (77), Expect = 0.091
Identities = 15/25 (60%), Positives = 21/25 (84%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESE 335
+IL D+D D+ED D+E+DEDEE+ E
Sbjct: 105 DILPDDDFDEEDLDEEDDEDEEDEE 129
Score = 31.5 bits (72), Expect = 0.41
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
GD+D D+D ++D DEE+ + E+ E E E
Sbjct: 97 GDDDIIDDDILPDDDFDEEDLDEEDDEDEEDE 128
Score = 28.8 bits (65), Expect = 3.8
Identities = 9/28 (32%), Positives = 17/28 (60%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEE 338
I D DD+ +++ DE+++E E +E
Sbjct: 101 IIDDDILPDDDFDEEDLDEEDDEDEEDE 128
Score = 27.7 bits (62), Expect = 8.7
Identities = 11/37 (29%), Positives = 22/37 (59%)
Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
+++ D D+D D D D+D +E + +E + E++E
Sbjct: 90 KKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEE 126
>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
Length = 112
Score = 33.0 bits (74), Expect = 0.094
Identities = 19/60 (31%), Positives = 33/60 (55%), Gaps = 1/60 (1%)
Query: 293 QYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
+YD ++ NE++ T I E+ED++ + E+EDE++ E E E E K +N +
Sbjct: 40 KYDKIFISNEDEMKTTHEHI-KKENEDEKKPEKPENEDEKKPEKPENEDEKKPEKPENED 98
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 34.3 bits (79), Expect = 0.10
Identities = 14/31 (45%), Positives = 18/31 (58%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
D D D+D DD++ EDE + E EK KE
Sbjct: 127 DSDSSDDDSDDDDSEDETAALLRELEKIKKE 157
Score = 32.4 bits (74), Expect = 0.43
Identities = 15/44 (34%), Positives = 19/44 (43%), Gaps = 9/44 (20%)
Query: 311 EILGDEDEDDEDGDDEEDEDEE---------ESEAEEGEKENKE 345
D +DD D DD EDE + AEE E+E +E
Sbjct: 125 SSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEE 168
Score = 32.4 bits (74), Expect = 0.47
Identities = 8/31 (25%), Positives = 16/31 (51%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
+ D DD D + D +++S+ ++ E E
Sbjct: 116 NSDADDSDSSSDSDSSDDDSDDDDSEDETAA 146
Score = 30.8 bits (70), Expect = 1.6
Identities = 9/44 (20%), Positives = 20/44 (45%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
EE +R + D +D D D D + ++++ + E++
Sbjct: 101 EEDEIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDET 144
Score = 28.2 bits (63), Expect = 9.5
Identities = 6/32 (18%), Positives = 14/32 (43%)
Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGE 340
R D+ + D D +D+ +++ +E
Sbjct: 114 ARNSDADDSDSSSDSDSSDDDSDDDDSEDETA 145
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 32.1 bits (73), Expect = 0.11
Identities = 10/34 (29%), Positives = 27/34 (79%)
Query: 312 ILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
I G+E EDD+D +D++D+D+++ + ++ + ++++
Sbjct: 45 IEGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78
Score = 31.7 bits (72), Expect = 0.15
Identities = 9/31 (29%), Positives = 23/31 (74%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
ED+DD++ DD++D+ +++ + ++ + E+ E
Sbjct: 50 PEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80
Score = 27.1 bits (60), Expect = 6.3
Identities = 8/24 (33%), Positives = 18/24 (75%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAE 337
D+D+DD+ D ++D+D+++ + E
Sbjct: 57 EDDDDDDDKDDKDDDDDDDDEDDE 80
Score = 26.7 bits (59), Expect = 9.6
Identities = 8/29 (27%), Positives = 22/29 (75%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
D+D++D+D DD++D+ +++ + ++ + E
Sbjct: 52 DDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80
>gnl|CDD|233222 TIGR00986, 3a0801s05tom22, mitochondrial import receptor subunit
Tom22. The mitochondrial protein translocase (MPT)
family, which brings nuclearly encoded preproteins into
mitochondria, is very complex with 19 currently
identified protein constituents.These proteins include
several chaperone proteins, four proteins of the outer
membrane translocase (Tom) import receptor, five
proteins of the Tom channel complex, five proteins of
the inner membrane translocase (Tim) and three "motor"
proteins. This family is specific for the Tom22 proteins
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 145
Score = 33.4 bits (76), Expect = 0.12
Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 7/50 (14%)
Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALR 359
++ + + +EDDED D + ED +S+ E E +ET D + AL+
Sbjct: 17 QQTVVEPEEDDEDFTDVDSEDSVDSDFESLE---EETFTD----RIYALK 59
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 33.7 bits (77), Expect = 0.13
Identities = 20/74 (27%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 280 DGVKDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEG 339
D +K+ +D+ + Q D E EE+ E D ++++E +DEE+E+E+E + +
Sbjct: 35 DIIKENEDVKDEKQEDDE--EEEEEDEEEIEEP-EDIEDEEEIVEDEEEEEEDEEDNVDL 91
Query: 340 EKENKETIIDNTET 353
+ K+ I D +
Sbjct: 92 KDIEKKNINDIFNS 105
Score = 33.3 bits (76), Expect = 0.13
Identities = 18/51 (35%), Positives = 28/51 (54%)
Query: 297 EYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETI 347
E L+ K T+ +E + E+ED +D E+DE+EEE + EE E+
Sbjct: 20 ENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIED 70
Score = 29.8 bits (67), Expect = 2.0
Identities = 20/72 (27%), Positives = 40/72 (55%), Gaps = 8/72 (11%)
Query: 289 LNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDD-----EEDEDEEE-SEAEEGEKE 342
+ + +D E ++ E + + E D++E++E+ ++ E+ EDEEE E EE E+E
Sbjct: 26 VKITSWDKEDIIKENEDV--KDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEE 83
Query: 343 NKETIIDNTETN 354
++E +D +
Sbjct: 84 DEEDNVDLKDIE 95
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 34.1 bits (78), Expect = 0.18
Identities = 20/68 (29%), Positives = 29/68 (42%), Gaps = 14/68 (20%)
Query: 314 GDEDEDDEDGDDEEDEDEEES------EAEEGEKENKETIIDNTETNLVALRRTIYLTIH 367
G + E +ED ++EE+E+EEE E K +E+ V RR H
Sbjct: 223 GSDSESEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKA---STSESITVPSRRH-----H 274
Query: 368 SSLDFEEC 375
S L + C
Sbjct: 275 SPLVLKRC 282
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 34.3 bits (79), Expect = 0.18
Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 23/77 (29%)
Query: 301 NEEKYTTL-------------------RREILGDEDEDDEDGDDEEDEDE----EESEAE 337
N++K+T DE+E+DE+ ++EEDEDE E S+ E
Sbjct: 357 NKQKWTKAKETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDE 416
Query: 338 EGEKENKETIIDNTETN 354
E E+++ E+ ++++ N
Sbjct: 417 EFEEDDVESKYEDSDGN 433
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 34.0 bits (78), Expect = 0.19
Identities = 18/61 (29%), Positives = 28/61 (45%), Gaps = 3/61 (4%)
Query: 296 PEYLMNE---EKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
P YL E EK E E++ +E+ + E+E + E E+ EK+ K + T
Sbjct: 22 PIYLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETT 81
Query: 353 T 353
T
Sbjct: 82 T 82
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or more
acidic residues. The function of members of this family
is unknown.
Length = 104
Score = 31.9 bits (73), Expect = 0.20
Identities = 9/30 (30%), Positives = 22/30 (73%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGE 340
+++ D+D+DD+D DD D D+++ + ++ +
Sbjct: 65 DVVLDDDDDDDDDDDLPDLDDDDVDLDDDD 94
Score = 30.0 bits (68), Expect = 0.90
Identities = 6/29 (20%), Positives = 21/29 (72%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
DE+++D+ D++D+D+++ + + + ++
Sbjct: 59 DEEDEDDVVLDDDDDDDDDDDLPDLDDDD 87
Score = 30.0 bits (68), Expect = 1.0
Identities = 9/23 (39%), Positives = 15/23 (65%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEE 333
L D+D D +D DD+ ED+++
Sbjct: 81 PDLDDDDVDLDDDDDDFLEDDDD 103
Score = 29.2 bits (66), Expect = 1.8
Identities = 7/37 (18%), Positives = 17/37 (45%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
E + ++D +D DD++D+D+ ++
Sbjct: 50 AEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDD 86
Score = 29.2 bits (66), Expect = 1.9
Identities = 9/42 (21%), Positives = 24/42 (57%)
Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVAL 358
D +D DE++EDE++ ++ + ++ + + + + + V L
Sbjct: 49 DAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDL 90
Score = 28.8 bits (65), Expect = 2.4
Identities = 8/37 (21%), Positives = 18/37 (48%)
Query: 306 TTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
R D ++ + +DEEDED+ + ++ + +
Sbjct: 40 VAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDD 76
Score = 27.7 bits (62), Expect = 5.3
Identities = 8/38 (21%), Positives = 21/38 (55%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
D + DED +DE+D ++ + ++ + + + D+ +
Sbjct: 52 DAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVD 89
Score = 27.7 bits (62), Expect = 6.0
Identities = 7/24 (29%), Positives = 15/24 (62%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
D DD+D D ++D+D+ + ++
Sbjct: 80 LPDLDDDDVDLDDDDDDFLEDDDD 103
Score = 27.3 bits (61), Expect = 9.4
Identities = 8/26 (30%), Positives = 16/26 (61%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGE 340
D + D+D D +D+D++ E ++ E
Sbjct: 79 DLPDLDDDDVDLDDDDDDFLEDDDDE 104
Score = 26.9 bits (60), Expect = 9.9
Identities = 7/41 (17%), Positives = 24/41 (58%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNL 355
+++ED++D ++D+D+++ + ++ + D+ + L
Sbjct: 58 EDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDDFL 98
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 33.9 bits (78), Expect = 0.22
Identities = 9/45 (20%), Positives = 20/45 (44%)
Query: 301 NEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
R + + E+ D+++ ++EE +E E E + E +
Sbjct: 353 KMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEG 397
Score = 31.6 bits (72), Expect = 1.2
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE 342
DEDE++E DE +E+E E EEG +
Sbjct: 373 DEDEEEEQRSDEHEEEEGEDSEEEGSQS 400
Score = 30.1 bits (68), Expect = 3.3
Identities = 12/38 (31%), Positives = 18/38 (47%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
DEDED+E+ ++ +EEE E E E +
Sbjct: 371 DEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSES 408
Score = 29.7 bits (67), Expect = 4.6
Identities = 9/33 (27%), Positives = 21/33 (63%)
Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
+ DED+++ +++ ++ EE E E+ E+E +
Sbjct: 367 FEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQ 399
Score = 29.3 bits (66), Expect = 5.1
Identities = 9/32 (28%), Positives = 18/32 (56%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
D + +E +DE++E+E+ S+ E E+
Sbjct: 363 DPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSE 394
Score = 28.9 bits (65), Expect = 8.2
Identities = 12/52 (23%), Positives = 21/52 (40%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
EE+ + E ED ++E ED E S + E+K +++
Sbjct: 377 EEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDS 428
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 32.7 bits (75), Expect = 0.24
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
DE+ +ED DDE ++EE ++ + E +
Sbjct: 13 DEELPEEDEDDESSDEEEVDLPDDEQDEESD 43
Score = 32.0 bits (73), Expect = 0.40
Identities = 14/39 (35%), Positives = 24/39 (61%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIID 349
E L +EDEDDE D+EE + ++ + EE + E ++ +
Sbjct: 14 EELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVT 52
Score = 31.6 bits (72), Expect = 0.59
Identities = 8/39 (20%), Positives = 24/39 (61%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
G+EDE+ + D++++ +EE +++++E+ + +
Sbjct: 10 GEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQ 48
Score = 30.8 bits (70), Expect = 0.99
Identities = 11/39 (28%), Positives = 20/39 (51%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
DE++ D D+E ++E +SE E+ +E +D
Sbjct: 24 ESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEA 62
Score = 30.8 bits (70), Expect = 1.1
Identities = 9/24 (37%), Positives = 16/24 (66%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
D+ E+DE+ +E+++DE E E
Sbjct: 8 DDGEEDEELPEEDEDDESSDEEEV 31
Score = 30.8 bits (70), Expect = 1.2
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
+D ++++ EEDED+E S+ EE + + E
Sbjct: 6 ESDDGEEDEELPEEDEDDESSDEEEVDLPDDE 37
Score = 28.9 bits (65), Expect = 4.2
Identities = 7/32 (21%), Positives = 19/32 (59%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
D +ED+E +++ED++ + E + + ++
Sbjct: 9 DGEEDEELPEEDEDDESSDEEEVDLPDDEQDE 40
Score = 28.5 bits (64), Expect = 6.7
Identities = 10/40 (25%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
E +D E+ D+E E++E+ E+ + E+ + + E+
Sbjct: 4 ESESDDGEE-DEELPEEDEDDESSDEEEVDLPDDEQDEES 42
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 33.1 bits (76), Expect = 0.25
Identities = 13/30 (43%), Positives = 20/30 (66%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
DEDEDD+ D ++E++EE E E+ + K
Sbjct: 26 DEDEDDDVKDSWDEEEDEEKEEEKAKVAAK 55
Score = 31.6 bits (72), Expect = 0.85
Identities = 11/30 (36%), Positives = 18/30 (60%), Gaps = 1/30 (3%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
+DED++D + ++EE+ E EE EK
Sbjct: 25 DDEDEDDDVKDSWDEEEDEEKEE-EKAKVA 53
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 32.8 bits (75), Expect = 0.25
Identities = 29/102 (28%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 235 LEKRVQYMIEVLAQVRKDNFKD-FPDVIEDLDLVPEEDKFTHLMTLDGVKDTQDILNVFQ 293
+++ +Q M E KD D F + ED + D K+ ++ V
Sbjct: 87 IDEEIQAMTEKKDIKAKDKEVDAFEEGDEDE------------LDYDEDKEEEEDDEVDS 134
Query: 294 YDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESE 335
D + +E++ EIL ++DE DED DD+EDE++EE +
Sbjct: 135 LD-DENDDEDEDDDEIVEILIEDDEVDEDEDDDEDEEDEEDK 175
Score = 29.3 bits (66), Expect = 3.7
Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 1/69 (1%)
Query: 285 TQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDE-EDEDEEESEAEEGEKEN 343
+DI + + +E + +ED++ + DDE +DEDE++ E E E+
Sbjct: 97 KKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEILIED 156
Query: 344 KETIIDNTE 352
E D +
Sbjct: 157 DEVDEDEDD 165
>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
Provisional.
Length = 291
Score = 33.2 bits (75), Expect = 0.27
Identities = 19/59 (32%), Positives = 32/59 (54%), Gaps = 4/59 (6%)
Query: 315 DEDEDDEDGDDE----EDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIYLTIHSS 369
D D+D E+GDD+ EDE+EE+ + G++ +TE L+ + L+I S+
Sbjct: 80 DVDDDSEEGDDDDGGAEDEEEEKVRGQSGQEGTVALGSGSTEKKLIGSEKQTELSISSA 138
>gnl|CDD|220223 pfam09405, Btz, CASC3/Barentsz eIF4AIII binding. This domain is
found on CASC3 (cancer susceptibility candidate gene 3
protein) which is also known as Barentsz (Btz). CASC3 is
a component of the EJC (exon junction complex) which is
a complex that is involved in post-transcriptional
regulation of mRNA in metazoa. The complex is formed by
the association of four proteins (eIF4AIII, Barentsz,
Mago, and Y14), mRNA, and ATP. This domain wraps around
eIF4AIII and stacks against the 5' nucleotide.
Length = 116
Score = 31.6 bits (72), Expect = 0.29
Identities = 7/32 (21%), Positives = 15/32 (46%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
E + +ED++ + E E+ + E+ E
Sbjct: 8 SGRTPSAEPTEPKEDKERKRREHEKYDDEDDE 39
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 32.8 bits (75), Expect = 0.29
Identities = 18/52 (34%), Positives = 26/52 (50%)
Query: 301 NEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
+E+K E +E E++ + +DEE DE E E EE + N E I N
Sbjct: 75 SEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126
Score = 32.0 bits (73), Expect = 0.49
Identities = 17/41 (41%), Positives = 25/41 (60%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
ED+ D + +DEE E+E E E EE EN++ + TE+N
Sbjct: 77 DKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESN 117
Score = 31.6 bits (72), Expect = 0.83
Identities = 10/39 (25%), Positives = 22/39 (56%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
++E+ E + E+ ED+ ++E E+ E E + D ++
Sbjct: 66 KEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSD 104
Score = 30.9 bits (70), Expect = 1.2
Identities = 13/40 (32%), Positives = 28/40 (70%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
+E+++ + +D+ED+ + E E EE E+EN+E ++++ N
Sbjct: 67 EEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDEN 106
Score = 28.9 bits (65), Expect = 4.9
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
++ ED D + E++E EEE+E E+ E ++ +T
Sbjct: 76 EDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTE 115
Score = 28.5 bits (64), Expect = 6.7
Identities = 16/39 (41%), Positives = 21/39 (53%), Gaps = 1/39 (2%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
ED ED D E EDE ESE E E++ + + + ET
Sbjct: 74 NSEDKEDKGDAEKEDE-ESEEENEEEDEESSDENEKETE 111
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 33.4 bits (77), Expect = 0.29
Identities = 11/28 (39%), Positives = 21/28 (75%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKEN 343
+ E+D+D D+E DE++EES E ++++
Sbjct: 122 DLEEDDDDDEESDEEDEESSKSEDDEDD 149
Score = 32.3 bits (74), Expect = 0.77
Identities = 7/40 (17%), Positives = 21/40 (52%)
Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETII 348
E D+DE+ ++ D+E + E++ + ++ + ++
Sbjct: 122 DLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATR 161
Score = 32.3 bits (74), Expect = 0.77
Identities = 12/31 (38%), Positives = 22/31 (70%)
Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGE 340
+ D +EDD+D ++ ++EDEE S++E+ E
Sbjct: 117 DDAYRDLEEDDDDDEESDEEDEESSKSEDDE 147
Score = 31.5 bits (72), Expect = 1.3
Identities = 11/40 (27%), Positives = 25/40 (62%)
Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
L ++D+DDE+ D+E++E + + E+ + ++ + I E
Sbjct: 123 LEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRE 162
Score = 29.6 bits (67), Expect = 5.5
Identities = 11/50 (22%), Positives = 26/50 (52%), Gaps = 3/50 (6%)
Query: 302 EEKYTTLRREILGDEDEDDED---GDDEEDEDEEESEAEEGEKENKETII 348
++ Y L + DE+ D+ED E+DED+++ + ++ + ++
Sbjct: 117 DDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLE 166
Score = 29.6 bits (67), Expect = 5.7
Identities = 15/63 (23%), Positives = 29/63 (46%), Gaps = 14/63 (22%)
Query: 312 ILGDEDEDDE--------------DGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVA 357
I D D ++E + DD++DE+ +E + E + E+ E D+ + + +A
Sbjct: 100 IFDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIA 159
Query: 358 LRR 360
R
Sbjct: 160 TRE 162
>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727). This
domain family is found in bacteria and eukaryotes, and
is approximately 100 amino acids in length.
Length = 100
Score = 31.4 bits (72), Expect = 0.30
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGE 340
E E+DE + EE++D++E E EE E
Sbjct: 19 PEPEEDEILELEEEDDDDEDEEEEYE 44
Score = 27.6 bits (62), Expect = 7.0
Identities = 9/24 (37%), Positives = 17/24 (70%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
+EDE E ++++D+++EE E E
Sbjct: 22 EEDEILELEEEDDDDEDEEEEYEL 45
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 33.2 bits (76), Expect = 0.31
Identities = 21/53 (39%), Positives = 25/53 (47%), Gaps = 1/53 (1%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIYLTIH 367
D ED+E DD DEDEE+ EA E EN+ T + E V TI
Sbjct: 33 DLTEDEEAEDDVVDEDEED-EAVVEEDENELTEEEEDEEGEVKASPDADTTIL 84
Score = 29.7 bits (67), Expect = 3.2
Identities = 11/36 (30%), Positives = 18/36 (50%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
E D+ D+++ D+ E++E EE E E E
Sbjct: 38 EEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEV 73
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 33.3 bits (77), Expect = 0.33
Identities = 25/102 (24%), Positives = 40/102 (39%), Gaps = 28/102 (27%)
Query: 260 VIEDLDLVPEEDKFTHLMTL------------------DGVKD-TQDILNVFQYDPEYLM 300
V +DL+ EE+ + GVK+ D++ + +P
Sbjct: 281 VFNKIDLLDEEEAEERAKAIVEALGWEGPVYLISAASGLGVKELCWDLMTFIEENPREEA 340
Query: 301 NEEKYT--------TLRREIL-GDEDEDDEDGDDEEDEDEEE 333
E + RE L E+EDD+D DD+ DED++E
Sbjct: 341 EEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDE 382
Score = 29.1 bits (66), Expect = 5.9
Identities = 10/38 (26%), Positives = 20/38 (52%)
Query: 295 DPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEE 332
E++ ++ L D+D+ D+D D+++DE E
Sbjct: 348 KVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEGVE 385
>gnl|CDD|151714 pfam11273, DUF3073, Protein of unknown function (DUF3073). This
family of proteins with unknown function appears to be
restricted to Actinobacteria.
Length = 64
Score = 30.3 bits (69), Expect = 0.36
Identities = 10/38 (26%), Positives = 18/38 (47%), Gaps = 2/38 (5%)
Query: 305 YTTLRREILG--DEDEDDEDGDDEEDEDEEESEAEEGE 340
L+RE+ G D+D E+D D+ + A+ +
Sbjct: 25 LDALQRELSGGSSRSSADDDDQYEDDYDDYDKYADYDD 62
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 31.3 bits (70), Expect = 0.36
Identities = 11/28 (39%), Positives = 21/28 (75%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE 342
D D D+E+ DD+EDE++++ + +E + E
Sbjct: 13 DSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
Score = 31.3 bits (70), Expect = 0.37
Identities = 11/34 (32%), Positives = 24/34 (70%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETI 347
G+ED D + ++E+D+DE+E + +E + E+ + +
Sbjct: 8 GEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDEV 41
Score = 30.1 bits (67), Expect = 0.90
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 298 YLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEE 333
+L EE + E D+DED+ED D+++DED++E
Sbjct: 5 FLEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDE 40
Score = 30.1 bits (67), Expect = 0.95
Identities = 10/30 (33%), Positives = 22/30 (73%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
E E+D D D +E+ED+++ + E+ ++++ E
Sbjct: 7 EGEEDSDSDSDEEEDDDDEDEEDDDEDDDE 36
Score = 29.0 bits (64), Expect = 2.1
Identities = 11/30 (36%), Positives = 21/30 (70%)
Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
L E++ D D D+EED+D+E+ E ++ + +
Sbjct: 6 LEGEEDSDSDSDEEEDDDDEDEEDDDEDDD 35
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 31.2 bits (71), Expect = 0.41
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGE 340
E++E+ ++EE+E+EEESE E
Sbjct: 74 AAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 29.2 bits (66), Expect = 1.7
Identities = 11/24 (45%), Positives = 19/24 (79%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEG 339
E+E++E+ ++EE+E+E E EA G
Sbjct: 76 EEEEEEEEEEEEEEEESEEEAMAG 99
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 33.4 bits (76), Expect = 0.42
Identities = 27/115 (23%), Positives = 51/115 (44%), Gaps = 15/115 (13%)
Query: 244 EVLAQVRKDNFKDFPDVIEDLDLVPEEDKF-THLMTLDGVKDTQDI-LNVFQYDPEYLMN 301
E+ ++++N + P+ EDLDL PE+ K + D +D+ + + E
Sbjct: 3968 EIQPDIQENNSQPPPE-NEDLDL-PEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADA 4025
Query: 302 E-----------EKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
E E+ TL +I D+ D + D++ +ED E +E E+ ++
Sbjct: 4026 EKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTED 4080
Score = 32.7 bits (74), Expect = 0.70
Identities = 24/97 (24%), Positives = 45/97 (46%), Gaps = 4/97 (4%)
Query: 259 DVIEDLDLVPEEDKFTHLMTLDGVKDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDE 318
D+ E+++ +PE+ + L + +D+L Q E + + +E D+++
Sbjct: 3887 DLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKE---DDNK 3943
Query: 319 DDEDGDDEEDEDEEESEAEEG-EKENKETIIDNTETN 354
ED D +E EDEEE + G + E + I +N
Sbjct: 3944 ALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQP 3980
Score = 29.6 bits (66), Expect = 6.9
Identities = 17/41 (41%), Positives = 23/41 (56%), Gaps = 4/41 (9%)
Query: 315 DEDEDDEDGDDE-EDEDEEESEAEEGEKENKETIIDNTETN 354
+D D ED D E DE++EE++AE+ E E D E N
Sbjct: 4003 SKDSDLEDMDMEAADENKEEADAEKDEPMQDE---DPLEEN 4040
>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
Length = 325
Score = 32.6 bits (75), Expect = 0.43
Identities = 13/56 (23%), Positives = 26/56 (46%), Gaps = 9/56 (16%)
Query: 306 TTLRREILGDEDEDDEDG---------DDEEDEDEEESEAEEGEKENKETIIDNTE 352
TL+ L ++ + DE+G +EE D + +E E + + ++D T+
Sbjct: 1 MTLKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQ 56
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 31.8 bits (72), Expect = 0.53
Identities = 15/28 (53%), Positives = 17/28 (60%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEE 338
E +E ED ED DEEDE+ EE E E
Sbjct: 32 EEDEEEMEDWEDSLDEEDEEAEEVEEET 59
Score = 28.0 bits (62), Expect = 9.4
Identities = 14/37 (37%), Positives = 22/37 (59%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
EE + + E+ DE+E ++ D ++EDEE E EE
Sbjct: 21 EEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEE 57
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 32.5 bits (75), Expect = 0.58
Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 8/61 (13%)
Query: 286 QDILNVFQYDPEYLMNEEKYTTLRREIL-GDEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
IL YD L N E RE++ G D + E+ + EE + +E E+E +
Sbjct: 150 DAILE--WYDR--LENGE---RRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEE 202
Query: 345 E 345
+
Sbjct: 203 D 203
Score = 32.1 bits (74), Expect = 0.89
Identities = 19/79 (24%), Positives = 33/79 (41%), Gaps = 4/79 (5%)
Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETII---DNTETNLVALRRTIYLTI 366
+ + +E D+D D+EE+EDE + E E E ++ LR+ +
Sbjct: 183 AHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKV 242
Query: 367 HSSLDFEECAHKLLRMQLK 385
L + + LR +LK
Sbjct: 243 EGRLAQHK-KYAKLREKLK 260
Score = 29.4 bits (67), Expect = 6.3
Identities = 7/38 (18%), Positives = 15/38 (39%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
+D + + + DE+ E + E+ I T+
Sbjct: 66 ADDLLLAENEADAQTDEDAEEEAAAALSSVESEIGRTD 103
>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3).
This family contains EBNA-3A, -3B, and -3C which are
latent infection nuclear proteins important for
Epstein-Barr virus (EBV)-induced B-cell immortalisation
and the immune response to EBV infection.
Length = 254
Score = 32.0 bits (73), Expect = 0.61
Identities = 13/46 (28%), Positives = 19/46 (41%), Gaps = 1/46 (2%)
Query: 308 LRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
L L E D+D E + E+SE+ E + E I+ E
Sbjct: 199 LGLSALRTPGEPDDDATVETSSESEDSES-ESDDEELPYIVPRMEP 243
>gnl|CDD|152814 pfam12379, DUF3655, Protein of unknown function (DUF3655). This
domain family is found in viruses, and is approximately
70 amino acids in length. The family is found in
association with pfam08716, pfam01661, pfam05409,
pfam06471, pfam08717, pfam06478, pfam09401, pfam06460,
pfam08715, pfam08710.
Length = 70
Score = 29.6 bits (66), Expect = 0.62
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
DE+ED E+ + EE+E EE E E G +++
Sbjct: 9 DEEEDCEEYECEEEEIEETCEHEYGTEDD 37
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 32.7 bits (74), Expect = 0.64
Identities = 7/30 (23%), Positives = 16/30 (53%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
++ D+EDG+D ++ + + + K K
Sbjct: 972 NDSSDEEDGEDWDELESKAAYDSRPGKRRK 1001
Score = 30.0 bits (67), Expect = 4.2
Identities = 8/37 (21%), Positives = 19/37 (51%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDN 350
DE +D D+ E++ +E + E+ ++ + D+
Sbjct: 958 SDESSEDLSEDESENDSSDEEDGEDWDELESKAAYDS 994
Score = 29.6 bits (66), Expect = 5.4
Identities = 10/48 (20%), Positives = 25/48 (52%)
Query: 295 DPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
D +E++ + E L +++ +++ D+E+ ED +E E++
Sbjct: 947 DESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELESKAAYDS 994
Score = 29.2 bits (65), Expect = 8.2
Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 1/32 (3%)
Query: 315 DEDEDDEDGDD-EEDEDEEESEAEEGEKENKE 345
DE ++DE+ D+ ED E+ESE + ++E+ E
Sbjct: 950 DETDEDEESDESSEDLSEDESENDSSDEEDGE 981
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 32.7 bits (74), Expect = 0.64
Identities = 9/30 (30%), Positives = 19/30 (63%)
Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKENKET 346
D E+ DD+ +E++ +S + + K +K+T
Sbjct: 113 KTDSEEDDDDSEEEDNKSTSSKDGKGSKKT 142
>gnl|CDD|234064 TIGR02936, fdxN_nitrog, ferredoxin III, nif-specific. Members of
this family are homodimeric ferredoxins from nitrogen
fixation regions of many nitrogen-fixing bacteria. As
characterized in Rhodobacter capsulatus, these proteins
are homodimeric, with two 4Fe-4S clusters bound per
monomer. Although nif-specific, this protein family is
not usiveral, as other nitrogenase systems may
substitute flavodoxins, or different types of ferredoxin
[Central intermediary metabolism, Nitrogen fixation].
Length = 91
Score = 30.0 bits (68), Expect = 0.68
Identities = 11/51 (21%), Positives = 25/51 (49%), Gaps = 9/51 (17%)
Query: 294 YDPEYLM--NEEK-------YTTLRREILGDEDEDDEDGDDEEDEDEEESE 335
+ P+++ ++EK Y R++L + ++E D+D++E E
Sbjct: 10 WTPQFVTSIDQEKCIGCGRCYKVCGRDVLTLKGINEEGELVASDDDDDEIE 60
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 31.0 bits (70), Expect = 0.71
Identities = 16/31 (51%), Positives = 20/31 (64%), Gaps = 2/31 (6%)
Query: 314 GDEDEDDEDGDDEEDEDEEES--EAEEGEKE 342
G EDE DE +EEDE+ +E+ EAEE K
Sbjct: 22 GKEDEGDESDAEEEDEEIQEALREAEEERKA 52
Score = 29.5 bits (66), Expect = 2.3
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 318 EDDEDGDDEEDEDEEESEAEEGEKENKE 345
ED DE D +EE+ E +E +E +E
Sbjct: 21 GGKEDEGDESDAEEEDEEIQEALREAEE 48
Score = 28.3 bits (63), Expect = 6.2
Identities = 10/33 (30%), Positives = 21/33 (63%)
Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
LG ++++ ++ D EE+++E + E E+E K
Sbjct: 20 LGGKEDEGDESDAEEEDEEIQEALREAEEERKA 52
>gnl|CDD|239201 cd02803, OYE_like_FMN_family, Old yellow enzyme (OYE)-like FMN
binding domain. OYE was the first flavin-dependent
enzyme identified, however its true physiological role
remains elusive to this day. Each monomer of OYE
contains FMN as a non-covalently bound cofactor, uses
NADPH as a reducing agent with oxygens, quinones, and
alpha,beta-unsaturated aldehydes and ketones, and can
act as electron acceptors in the catalytic reaction.
Members of OYE family include trimethylamine
dehydrogenase, 2,4-dienoyl-CoA reductase, enoate
reductase, pentaerythriol tetranitrate reductase,
xenobiotic reductase, and morphinone reductase.
Length = 327
Score = 32.2 bits (74), Expect = 0.74
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 9/46 (19%)
Query: 219 LESFFS------TDVYNSSKGKLEKRVQYMIEVLAQVRKDNFKDFP 258
L F S TD Y G LE R ++++E++A VR+ DFP
Sbjct: 168 LSQFLSPYTNKRTDEYG---GSLENRARFLLEIVAAVREAVGPDFP 210
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 32.0 bits (73), Expect = 0.76
Identities = 9/31 (29%), Positives = 16/31 (51%)
Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
DED + D DD +D+ +E + + +N
Sbjct: 259 DDDEDAIESDLDDSDDDVSDEDGEDLFDTDN 289
Score = 30.9 bits (70), Expect = 1.5
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 310 REILGDEDEDDEDGDDEEDEDEEE 333
I D++ D D DD+ED E +
Sbjct: 245 DGIDSDDEGDGSDDDDDEDAIESD 268
Score = 30.1 bits (68), Expect = 3.0
Identities = 11/34 (32%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESE 335
++ T + + + +DE D DD++DED ES+
Sbjct: 236 SKRRTIAQIDGIDSDDEGDGS-DDDDDEDAIESD 268
Score = 29.7 bits (67), Expect = 4.0
Identities = 10/31 (32%), Positives = 13/31 (41%), Gaps = 6/31 (19%)
Query: 314 GDEDEDDE------DGDDEEDEDEEESEAEE 338
D+DED DD DED E+ +
Sbjct: 258 DDDDEDAIESDLDDSDDDVSDEDGEDLFDTD 288
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 32.5 bits (74), Expect = 0.78
Identities = 13/41 (31%), Positives = 25/41 (60%)
Query: 301 NEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEK 341
+EE+ + R D D+D+ +DE++++E++ EAE K
Sbjct: 14 DEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAK 54
Score = 29.8 bits (67), Expect = 5.0
Identities = 15/59 (25%), Positives = 28/59 (47%)
Query: 284 DTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
+ Q + +++ + YLM + L+ I E E+ D E+++E+E EEG
Sbjct: 768 NVQRLTELYEKEVRYLMQAHQKGQLKDTIDVAEPEEPGDPLTAEEQEEKEQLLEEGFST 826
>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
terminus. The C terminus of the plasma membrane Nha1
antiporter plays an important role in the immediate cell
response to hypo-osmotic shock which prevents an
execessive loss of ions and water. This domain is found
with pfam00999.
Length = 430
Score = 32.1 bits (73), Expect = 0.80
Identities = 11/31 (35%), Positives = 18/31 (58%)
Query: 307 TLRREILGDEDEDDEDGDDEEDEDEEESEAE 337
R D +++D + DD + +DEEE+ AE
Sbjct: 379 PSRPRSRRDSEDEDTERDDSDSDDEEETPAE 409
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 32.1 bits (73), Expect = 0.85
Identities = 24/68 (35%), Positives = 35/68 (51%), Gaps = 12/68 (17%)
Query: 283 KDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDED-GDDEEDEDEEESEAEEGEK 341
K +D + ++ DPE EE +EDEDD+D G D+EDEDE+ A E
Sbjct: 122 KQFEDDITRYREDPESEDEEE-----------EEDEDDDDDGSDDEDEDEDGVGATEEVA 170
Query: 342 ENKETIID 349
+ E+ +D
Sbjct: 171 ASSESGVD 178
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 32.2 bits (73), Expect = 0.89
Identities = 22/79 (27%), Positives = 37/79 (46%), Gaps = 13/79 (16%)
Query: 273 FTHL---MTLDGVKDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDD-EDGDDEED 328
FT+ +T +G++ D+L +E K + +EDEDD E+ DD+ED
Sbjct: 613 FTYFIEDVTNEGLQLILDVLK---------ADENKSRHQQLFEGEEEDEDDLEETDDDED 663
Query: 329 EDEEESEAEEGEKENKETI 347
E E ++E + + E
Sbjct: 664 ECEAIEDSESESESDGEDG 682
Score = 31.0 bits (70), Expect = 2.2
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEG 339
+ + + + D EEDE E+++EA EG
Sbjct: 672 ESESESDGEDGEEDEQEDDAEANEG 696
>gnl|CDD|236876 PRK11191, PRK11191, RNase E inhibitor protein; Provisional.
Length = 138
Score = 30.7 bits (70), Expect = 0.90
Identities = 10/20 (50%), Positives = 14/20 (70%)
Query: 316 EDEDDEDGDDEEDEDEEESE 335
ED + EDGDDE+ DE++
Sbjct: 116 EDPNAEDGDDEDFVDEDDDG 135
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 31.9 bits (72), Expect = 1.0
Identities = 6/30 (20%), Positives = 11/30 (36%)
Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKENKET 346
DE E + + + E E++ T
Sbjct: 209 DEPGEPQSPTPQQAPSPNTQQAVEHEDEPT 238
Score = 31.1 bits (70), Expect = 1.8
Identities = 5/31 (16%), Positives = 7/31 (22%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
+ + E EE E E
Sbjct: 154 SHNPSPNQQPSSFLQPSHEDSPEEPEPPTSE 184
Score = 30.7 bits (69), Expect = 2.2
Identities = 7/33 (21%), Positives = 10/33 (30%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
ED +E + E E + ET
Sbjct: 165 SFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSET 197
>gnl|CDD|191365 pfam05756, S-antigen, S-antigen protein. S-antigens are heat
stable proteins that are found in the blood of
individuals infected with malaria.
Length = 94
Score = 29.5 bits (66), Expect = 1.0
Identities = 16/40 (40%), Positives = 26/40 (65%), Gaps = 1/40 (2%)
Query: 303 EKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
+KY L+ E G E++D+ED + EE ++EE+ EG+ E
Sbjct: 55 QKYEDLQEEGEG-ENDDEEDSNSEESNNDEENGLIEGQGE 93
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
kDa subunit. This family contains several DNA-directed
RNA polymerase 19 kDa polypeptides. The Poxvirus
DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
DNA-template-directed extension of the 3'-end of an RNA
strand by one nucleotide at a time.
Length = 167
Score = 30.8 bits (70), Expect = 1.0
Identities = 11/41 (26%), Positives = 20/41 (48%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
D + + D DD E+ +EEE + E+ E + ++N
Sbjct: 5 DDIIDYESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQSN 45
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 31.3 bits (71), Expect = 1.1
Identities = 16/48 (33%), Positives = 29/48 (60%), Gaps = 2/48 (4%)
Query: 301 NEEKYTTLRREILGDEDEDDEDGDDEEDE--DEEESEAEEGEKENKET 346
+ K + ++ + DE++DD+DGD E++ E+E E E E+E +E
Sbjct: 235 DGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEED 282
Score = 30.6 bits (69), Expect = 1.8
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
+E E ED ++EE E+EEE E E+
Sbjct: 261 EEKELKEDEEEEETEEEEEEEDED 284
Score = 30.6 bits (69), Expect = 1.8
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAE 337
+ E ++DE+ ++ E+E+EEE E E
Sbjct: 259 YVEEKELKEDEEEEETEEEEEEEDEDE 285
Score = 29.0 bits (65), Expect = 6.8
Identities = 14/38 (36%), Positives = 21/38 (55%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
D++EDD+ D +E E + + EE E E +E D E
Sbjct: 248 DDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285
Score = 28.6 bits (64), Expect = 7.9
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
++ DDE+ DD++ + EE E +E E+E + + E
Sbjct: 244 NKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEED 282
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 32.0 bits (72), Expect = 1.1
Identities = 10/26 (38%), Positives = 19/26 (73%)
Query: 312 ILGDEDEDDEDGDDEEDEDEEESEAE 337
I+ ++D++ E+ + E D DE++S AE
Sbjct: 354 IIVEDDDESEEIESECDPDEDKSGAE 379
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 30.8 bits (70), Expect = 1.2
Identities = 19/99 (19%), Positives = 36/99 (36%), Gaps = 29/99 (29%)
Query: 294 YDPEYLMNEEKYTTLRREI-------------------LGDEDEDDEDGDDEEDEDEEES 334
YD EY ++K L EI + + D+D D++ +D+ E
Sbjct: 53 YDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEK 112
Query: 335 EAEEGEKENKETI----------IDNTETNLVALRRTIY 363
+ E+ ++ E + + + AL + IY
Sbjct: 113 KDEKEAEDKLEDLTKSYSETLSTLSELKPRKYALHKDIY 151
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 29.5 bits (67), Expect = 1.3
Identities = 8/26 (30%), Positives = 16/26 (61%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGE 340
+++ ++EE+E++EESE E
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 28.8 bits (65), Expect = 2.9
Identities = 9/24 (37%), Positives = 16/24 (66%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEG 339
+E E+ ++EE+++E E EA G
Sbjct: 77 AEEKKEEEEEEEEKEESEEEAAAG 100
>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22. The
mitochondrial protein translocase family, which is
responsible for movement of nuclear encoded pre-proteins
into mitochondria, is very complex with at least 19
components. These proteins include several chaperone
proteins, four proteins of the outer membrane
translocase (Tom) import receptor, five proteins of the
Tom channel complex, five proteins of the inner membrane
translocase (Tim) and three "motor" proteins. This
family represents the Tom22 proteins. The N terminal
region of Tom22 has been shown to have chaperone-like
activity, and the C terminal region faces the
intermembrane face.
Length = 136
Score = 30.3 bits (69), Expect = 1.3
Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALR 359
+EK + + D+DDED D + + ++S+ E ET+ D +VAL+
Sbjct: 13 QEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFEN------ETLYD----RIVALK 60
>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 508
Score = 31.5 bits (71), Expect = 1.4
Identities = 11/43 (25%), Positives = 20/43 (46%)
Query: 301 NEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
NEE L+ ++ D +D + +EDE E E + ++
Sbjct: 441 NEEVQERLQTDLGSISDSEDINMGSAGEEDESEDEDFQMVSDS 483
>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
Length = 130
Score = 29.8 bits (67), Expect = 1.4
Identities = 14/51 (27%), Positives = 20/51 (39%)
Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIY 363
+G EDE E +E E++ ES + K + D E V R
Sbjct: 7 MGYEDEPSEPEIEEGAEEDSESNNNDEVKGSIGENEDKEEQEPVQRPRKTT 57
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 31.6 bits (71), Expect = 1.5
Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 13/100 (13%)
Query: 252 DNFKDFPDVIEDLDLVPEEDKFTHLMTLDGVKDTQDILNVFQYDPEYLMNEEKYTTLRRE 311
+ D D +D +L ED F+ D D + L+ D E +E + + E
Sbjct: 696 KSRPDVEDDSDDSELDFAEDDFS-----DSTSDDEPKLD--AIDDEDAKSEGSQESDQEE 748
Query: 312 ILG------DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
L D ++D+ D E E++E SE E+ E+ENKE
Sbjct: 749 GLDEIFYSFDGEQDNSDSFAESSEEDESSEEEKEEEENKE 788
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 31.3 bits (71), Expect = 1.5
Identities = 8/31 (25%), Positives = 15/31 (48%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
+ E D E+D++E E+G K+ +
Sbjct: 157 APETGREGDDGEDDDEEGSDGEEDGAKKKAK 187
Score = 30.1 bits (68), Expect = 3.3
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
+ E EDD D D E+ D ++E E EKE
Sbjct: 344 INAEVEDDPDEDAEDGGDGGDAEEFEAEKE 373
Score = 30.1 bits (68), Expect = 4.1
Identities = 14/47 (29%), Positives = 23/47 (48%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRR 360
G E +D ED D+E + EE+ ++ +K+ + E N AL
Sbjct: 161 GREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPESTLEKNFEALNV 207
Score = 29.7 bits (67), Expect = 4.8
Identities = 10/36 (27%), Positives = 17/36 (47%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDN 350
D + + ED DE+ ED + E + KE ++
Sbjct: 343 DINAEVEDDPDEDAEDGGDGGDAEEFEAEKEPMLKV 378
Score = 29.7 bits (67), Expect = 4.9
Identities = 13/31 (41%), Positives = 18/31 (58%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENK 344
+ E +GDD ED+DEE S+ EE + K
Sbjct: 155 KEAPETGREGDDGEDDDEEGSDGEEDGAKKK 185
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 30.3 bits (69), Expect = 1.5
Identities = 14/41 (34%), Positives = 25/41 (60%), Gaps = 4/41 (9%)
Query: 310 REILG----DEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
R++LG +++E++E+ + E+ DEEE E EKE +
Sbjct: 91 RKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKL 131
Score = 28.0 bits (63), Expect = 7.3
Identities = 8/32 (25%), Positives = 19/32 (59%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
+E+E + + DEE++ +E E E + + ++
Sbjct: 104 EEEEEVEVEELDEEEQIDELLEKELAKLKREK 135
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 30.4 bits (68), Expect = 1.6
Identities = 18/53 (33%), Positives = 28/53 (52%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETN 354
EE+ RRE DE ED E+ + E +E E ++++ E E E TE++
Sbjct: 139 EERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETESD 191
>gnl|CDD|225618 COG3076, COG3076, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 135
Score = 29.8 bits (67), Expect = 1.7
Identities = 10/20 (50%), Positives = 14/20 (70%)
Query: 316 EDEDDEDGDDEEDEDEEESE 335
ED + ED DD+ED DE++
Sbjct: 113 EDPNAEDDDDDEDVDEDDDG 132
>gnl|CDD|239240 cd02930, DCR_FMN, 2,4-dienoyl-CoA reductase (DCR) FMN-binding
domain. DCR in E. coli is an iron-sulfur flavoenzyme
which contains FMN, FAD, and a 4Fe-4S cluster. It is
also a monomer, unlike that of its eukaryotic
counterparts which form homotetramers and lack the
flavin and iron-sulfur cofactors. Metabolism of
unsaturated fatty acids requires auxiliary enzymes in
addition to those used in b-oxidation. After a given
number of cycles through the b-oxidation pathway, those
unsaturated fatty acyl-CoAs with double bonds at
even-numbered carbon positions contain 2-trans, 4-cis
double bonds that can not be modified by enoyl-CoA
hydratase. DCR utilizes NADPH to remove the C4-C5 double
bond. DCR can catalyze the reduction of both natural
fatty acids with cis double bonds, as well as substrates
containing trans double bonds. The reaction is initiated
by hybrid transfer from NADPH to FAD, which in turn
transfers electrons, one at a time, to FMN via the
4Fe-4S cluster. The fully reduced FMN provides a hydrid
ion to the C5 atom of substrate, and Tyr and His are
proposed to form a catalytic dyad that protonates the C4
atom of the substrate and completes the reaction.
Length = 353
Score = 30.7 bits (70), Expect = 1.8
Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 5/59 (8%)
Query: 213 IQSFQALESFFSTDVYNSSKGKLEKRVQYMIEVLAQVRKDNFKDFPDV--IEDLDLVPE 269
I F A + TD + G E R+++ +E++ VR +DF + + LDLV
Sbjct: 164 INQFLAPRTNKRTDEWG---GSFENRMRFPVEIVRAVRAAVGEDFIIIYRLSMLDLVEG 219
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 30.3 bits (69), Expect = 1.8
Identities = 13/41 (31%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 316 EDEDDEDGDDE-EDEDEEESEAEEGEKENKETIIDNTETNL 355
E + DED + E +E+E + ++ E ++K+ I D T L
Sbjct: 1 ESKSDEDDEFEVVPAEEDEKKVDKDEDDDKDDIDDLTAEAL 41
>gnl|CDD|204975 pfam12598, TBX, T-box transcription factor. This domain family is
found in eukaryotes, and is typically between 77 and 89
amino acids in length. The family is found in
association with pfam00907. There are two completely
conserved residues (S and P) that may be functionally
important. T-box genes encode transcription factors
involved in morphogenesis and organogenesis of
vertebrates and invertebrates.
Length = 91
Score = 28.6 bits (64), Expect = 2.1
Identities = 11/32 (34%), Positives = 17/32 (53%)
Query: 307 TLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
L+R +ED D E DD+ E + +SE +
Sbjct: 44 RLKRLCPSEEDSDVESRDDKIPEADSDSEISD 75
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 30.9 bits (70), Expect = 2.1
Identities = 24/132 (18%), Positives = 51/132 (38%), Gaps = 21/132 (15%)
Query: 225 TDVYNSSKGKLEKRVQYM-------IEVLAQVRKDNFKDFPDVIEDLDLVPEEDKFTHLM 277
T + N KL K VQY+ +E L +K+N + ++ + D L
Sbjct: 269 TTLENYDPEKLPKSVQYILAVSYVNLEDLTTTKKENILN--------NISLKSDDNYLLY 320
Query: 278 TLD-GVKDTQDILNVFQY--DPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEE- 333
+ G + ++ +N+ + D + + T L E+ + D + + E ++
Sbjct: 321 WIYSGRGEFKEAINIARNLDDNDLTL--LALTKLYEEVKSNTDLSGDKRQELLKEYNKKL 378
Query: 334 SEAEEGEKENKE 345
+ + E K+
Sbjct: 379 QDYTKKLGEVKD 390
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 30.9 bits (70), Expect = 2.1
Identities = 16/63 (25%), Positives = 23/63 (36%), Gaps = 1/63 (1%)
Query: 284 DTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDED-GDDEEDEDEEESEAEEGEKE 342
D + + Q + E + E L E +ED EDE E E E E +
Sbjct: 382 DAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQ 441
Query: 343 NKE 345
+E
Sbjct: 442 PEE 444
>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist. LIN-8
is a nuclear protein, present at the sites of
transcriptional repressor complexes, which interacts
with LIN-35 Rb.Lin35 Rb is a product of the class B
synMuv gene lin-35 which silences genes required for
vulval specification through chromatin modification and
remodelling. The biological role of the interaction has
not yet been determined however predictions have been
made. The interaction shows that class A synMuv genes
control vulval induction through the transcriptional
regulation of gene expression. LIN-8 normally functions
as part of a protein complex however when the complex is
absent, other family members can partially replace LIN-8
activity.
Length = 316
Score = 30.4 bits (69), Expect = 2.1
Identities = 15/60 (25%), Positives = 24/60 (40%), Gaps = 13/60 (21%)
Query: 307 TLRREILGDEDEDDEDGDD-----------EEDEDEEESEA--EEGEKENKETIIDNTET 353
LR+E ED++ +D DD E +D EE++ E +E + N
Sbjct: 125 ELRKEPWTGEDQEADDEDDIIYDGIFEDRTRESQDMEENDEVDEVEVEEVPDDYGANLAV 184
>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal. This
domain family is found in eukaryotes, and is typically
between 99 and 114 amino acids in length. The family is
found in association with pfam08697, pfam01585. There
are two completely conserved residues (G and F) that may
be functionally important. TIP is involved in enamel
assembly by interacting with one of the major proteins
responsible for biomineralisation of enamel - tuftelin.
Length = 106
Score = 28.9 bits (65), Expect = 2.2
Identities = 7/26 (26%), Positives = 18/26 (69%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEG 339
+ +EDD++ + E+D++ +E + E+
Sbjct: 81 KEPEEDDKEDESEDDDESDEDDDEDD 106
Score = 28.1 bits (63), Expect = 4.1
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESE 335
E D+ ED+ + DDE DED++E +
Sbjct: 82 EPEEDDKEDESEDDDESDEDDDEDD 106
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 30.0 bits (68), Expect = 2.2
Identities = 8/24 (33%), Positives = 13/24 (54%)
Query: 317 DEDDEDGDDEEDEDEEESEAEEGE 340
D+D D+ + DE+E E +E
Sbjct: 189 GAPDDDLDEYDYGDEDEEEDDEPP 212
Score = 29.6 bits (67), Expect = 3.6
Identities = 8/22 (36%), Positives = 15/22 (68%)
Query: 315 DEDEDDEDGDDEEDEDEEESEA 336
D+D D+ D DE++E+++E
Sbjct: 192 DDDLDEYDYGDEDEEEDDEPPW 213
Score = 28.5 bits (64), Expect = 7.3
Identities = 7/25 (28%), Positives = 15/25 (60%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEE 338
G D+D ++ D ++++EE+ E
Sbjct: 189 GAPDDDLDEYDYGDEDEEEDDEPPW 213
>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2. This model
describes a tightly conserved subfamily of the larger
family of sugar (and other) transporters described by
PFAM model pfam00083. Members of this subfamily include
closely related forms SV2A and SV2B of synaptic vesicle
protein from vertebrates and a more distantly related
homolog (below trusted cutoff) from Drosophila
melanogaster. Members are predicted to have two sets of
six transmembrane helices.
Length = 742
Score = 30.7 bits (69), Expect = 2.4
Identities = 16/55 (29%), Positives = 27/55 (49%), Gaps = 11/55 (20%)
Query: 302 EEKYTTLRREILGDEDEDDE-----------DGDDEEDEDEEESEAEEGEKENKE 345
+++Y+ DED+DD+ + D+E+E E S+A EG E+ E
Sbjct: 38 QDEYSQRSYSRFEDEDDDDDFPAPADGYSRGEAADDEEEGEASSDATEGHDEDDE 92
>gnl|CDD|214923 smart00937, PCRF, This domain is found in peptide chain release
factors.
Length = 116
Score = 29.0 bits (66), Expect = 2.4
Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 15/78 (19%)
Query: 313 LGDEDEDDEDGDDEEDE--DEEESEAEEGEKENKETIIDNTETNLVALRRTIYLTIHS-- 368
L +E+EDDE + E+E + E EE E+ K ++ + + L I +
Sbjct: 2 LLEEEEDDEMRELAEEELEELEAELEEELEELLKLLLLPKDPND----DKNAILEIRAGA 57
Query: 369 ----SLDFEECAHKLLRM 382
+ F A LLRM
Sbjct: 58 GGDEAALF---AGDLLRM 72
>gnl|CDD|222117 pfam13422, DUF4110, Domain of unknown function (DUF4110). This is
a family that is found predominantly at the C-terminus
of Kelch-containing proteins. However, the exact
function of this region is not known.
Length = 99
Score = 28.9 bits (65), Expect = 2.5
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 4/46 (8%)
Query: 300 MNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
+ EE+Y LR E+ EDE++E E E S A+E + E
Sbjct: 54 LAEERYWELREELRALEDEEEEQAGIE----EVVSLAKEKDWGGGE 95
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 30.6 bits (69), Expect = 2.5
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
DE+E +E +ED ESE++E + E E
Sbjct: 216 DEEEGEEAPSINYNEDTSESESDESDSEISE 246
Score = 29.4 bits (66), Expect = 5.0
Identities = 12/40 (30%), Positives = 17/40 (42%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEK 341
+E E EDE + E+D D+EE E + E
Sbjct: 164 KESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSED 203
Score = 28.6 bits (64), Expect = 8.6
Identities = 6/38 (15%), Positives = 15/38 (39%)
Query: 313 LGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDN 350
L D +++E + E++ E ++ + E
Sbjct: 211 LVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESR 248
Score = 28.6 bits (64), Expect = 9.3
Identities = 8/27 (29%), Positives = 13/27 (48%)
Query: 316 EDEDDEDGDDEEDEDEEESEAEEGEKE 342
+ +DE +E ED+ + E EE
Sbjct: 175 SESEDESKSEESAEDDSDDEEEEDSDS 201
>gnl|CDD|131353 TIGR02300, FYDLN_acid, TIGR02300 family protein. Members of this
family are bacterial proteins with a conserved motif
[KR]FYDLN, sometimes flanked by a pair of CXXC motifs,
followed by a long region of low complexity sequence in
which roughly half the residues are Asp and Glu,
including multiple runs of five or more acidic residues.
The function of members of this family is unknown.
Length = 129
Score = 29.2 bits (65), Expect = 2.5
Identities = 11/32 (34%), Positives = 18/32 (56%), Gaps = 5/32 (15%)
Query: 311 EILGDEDEDDEDGD-----DEEDEDEEESEAE 337
++ DED D+D D DEED+D ++ +
Sbjct: 93 DLAEDEDIADDDDDVTFLEDEEDDDIDDEDII 124
>gnl|CDD|235636 PRK05886, yajC, preprotein translocase subunit YajC; Validated.
Length = 109
Score = 29.0 bits (65), Expect = 2.6
Identities = 9/28 (32%), Positives = 18/28 (64%)
Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAE 337
R+ + +DEDDE+ + + D+D ++ E
Sbjct: 81 RDRILPDDEDDEELNADLDKDVDDVAGE 108
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 30.2 bits (68), Expect = 2.6
Identities = 16/62 (25%), Positives = 29/62 (46%)
Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIYLTIHS 368
RR L E E +E+ + + D+++E E E + E+ I + ++ R + S
Sbjct: 238 RRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPESYITSEYIERISEIRKMKDERLS 297
Query: 369 SL 370
SL
Sbjct: 298 SL 299
>gnl|CDD|219285 pfam07065, D123, D123. This family contains a number of eukaryotic
D123 proteins approximately 330 residues long. It has
been shown that mutated variants of D123 exhibit
temperature-dependent differences in their degradation
rate. D123 proteins are regulators of eIF2, the central
regulator of translational initiation.
Length = 295
Score = 30.4 bits (69), Expect = 2.7
Identities = 17/62 (27%), Positives = 28/62 (45%)
Query: 286 QDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
++ L D L +EE + +E +E D + DD+ED D E E ++ +E
Sbjct: 19 EEFLEYLLQDGILLPSEESSLPIYQESSDNEYSDWFEDDDDEDTDVTRPEFPELHQKIRE 78
Query: 346 TI 347
I
Sbjct: 79 AI 80
>gnl|CDD|217573 pfam03462, PCRF, PCRF domain. This domain is found in peptide
chain release factors.
Length = 115
Score = 28.7 bits (65), Expect = 3.0
Identities = 24/75 (32%), Positives = 31/75 (41%), Gaps = 14/75 (18%)
Query: 318 EDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVAL----RRTIYLTIHS----- 368
E E+ DD E +E E E EE EKE E + L+ L + L I +
Sbjct: 1 ELLEEEDDPELREEAEEELEELEKE-LEELEKELLNLLLPLDPYDDKNAILEIRAGAGGT 59
Query: 369 -SLDFEECAHKLLRM 382
+ F A LLRM
Sbjct: 60 EAQLF---AEDLLRM 71
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 30.4 bits (68), Expect = 3.1
Identities = 14/69 (20%), Positives = 24/69 (34%), Gaps = 18/69 (26%)
Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGE-----------KENKETI-------IDNT 351
E L + + + D + +DE++ EE + KE I I+
Sbjct: 92 EEFLSESESEASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKERIASICTKIIEEP 151
Query: 352 ETNLVALRR 360
E NL +
Sbjct: 152 EENLGMMEE 160
>gnl|CDD|220112 pfam09110, HAND, HAND. The HAND domain adopts a secondary
structure consisting of four alpha helices, three of
which (H2, H3, H4) form an L-like configuration. Helix
H2 runs antiparallel to helices H3 and H4, packing
closely against helix H4, whilst helix H1 reposes in the
concave surface formed by these three helices and runs
perpendicular to them. The domain confers DNA and
nucleosome binding properties to the protein.
Length = 109
Score = 28.4 bits (64), Expect = 3.3
Identities = 14/32 (43%), Positives = 21/32 (65%), Gaps = 3/32 (9%)
Query: 321 EDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
+D D ++EDEEE EA+ + E ++ IDN E
Sbjct: 64 DDVGDGDEEDEEEREAKR-KLEQEK--IDNAE 92
>gnl|CDD|236765 PRK10810, PRK10810, anti-sigma28 factor FlgM; Provisional.
Length = 98
Score = 28.1 bits (63), Expect = 3.4
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 8/52 (15%)
Query: 32 AKLKLMQQSISDKSSVEYQRISWETLKKSIHGSVNKVNTGNIGIIARKLFQE 83
A+ KLMQ D + +R+ E LK++I K++TG IA L +E
Sbjct: 47 AQAKLMQPGSQD---INVERV--EALKQAIRNGELKMDTGK---IADALIKE 90
>gnl|CDD|222918 PHA02687, PHA02687, ORF061 late transcription factor VLTF-4;
Provisional.
Length = 231
Score = 29.6 bits (66), Expect = 3.7
Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 4/52 (7%)
Query: 303 EKYTTLRREILGDEDEDDEDGDDEEDED----EEESEAEEGEKENKETIIDN 350
EK ++ +D DD++ + E E+ EE +AE G + DN
Sbjct: 100 EKSPKAVEKLCPPDDRDDKNEEKEPTEEAQRNEESGDAEGGASGRSPSDDDN 151
>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein.
Length = 283
Score = 29.5 bits (66), Expect = 3.8
Identities = 14/103 (13%), Positives = 31/103 (30%), Gaps = 18/103 (17%)
Query: 253 NFKDFPDVIEDLDLVPE--EDKFTHLMTLDGVKDTQDILNVFQYDPEYLMNEEKYTTLRR 310
N+ + +++ + E DK + + T P Y + EE +
Sbjct: 9 NYPRLDQTLSEIEEMEEQRADKSSTFQEDSVEEHTS---------PSYYLAEELSDS--- 56
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
+ + ED + + E + + I D+
Sbjct: 57 ----ETEPSIEDDQGLYTQLPPAEQVEGFIQGPLDDIADDDID 95
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 29.6 bits (67), Expect = 3.9
Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 10/80 (12%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKEN----KETIIDNTETNLVALRRTIYLTIHSSL 370
+D+++ED D+ EDE + E+ E ++ N K+ I N + T +L +
Sbjct: 14 GDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVD------TSFLPDKARE 67
Query: 371 DFEECAHKLLRMQLKPGQES 390
+ E + LR + QE+
Sbjct: 68 EKEAELREELREEFLKKQEA 87
>gnl|CDD|165509 PHA03250, PHA03250, UL35; Provisional.
Length = 564
Score = 30.1 bits (68), Expect = 3.9
Identities = 12/43 (27%), Positives = 18/43 (41%), Gaps = 2/43 (4%)
Query: 296 PEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEE 338
P YL + L R + D G D++ E +E +E E
Sbjct: 332 PGYLHFA-IFALLLR-LYNLRDATHCAGLDDDLEGQESAELEV 372
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 30.0 bits (68), Expect = 4.0
Identities = 14/64 (21%), Positives = 30/64 (46%), Gaps = 6/64 (9%)
Query: 307 TLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIYLTI 366
+LR EI + + D + +E+ED+EE + ++ E+ +T R + +
Sbjct: 53 SLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEEN------KDTVEEFPKARDEVLEQL 106
Query: 367 HSSL 370
+ +L
Sbjct: 107 NQAL 110
>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
Length = 852
Score = 30.1 bits (68), Expect = 4.2
Identities = 9/39 (23%), Positives = 20/39 (51%)
Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETI 347
R + G ED+ ++ ++D+D+ +S E+E +
Sbjct: 27 SRWVDGSEDDSEDPASLDDDDDDRDSGGGMLEEEEDGNM 65
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell cycle
progression and biogenesis of 60S ribosomal subunits.
The Schistosoma mansoni Mak16 has been shown to target
protein transport to the nucleolus.
Length = 97
Score = 27.9 bits (62), Expect = 4.3
Identities = 10/35 (28%), Positives = 21/35 (60%)
Query: 312 ILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
L E+ ++ D ++EE+E+EE+ E +++E
Sbjct: 54 ALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEEL 88
Score = 27.9 bits (62), Expect = 4.7
Identities = 11/38 (28%), Positives = 23/38 (60%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
+ +E+DE+ ++EE+E++E + E E I++ E
Sbjct: 59 ESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIEDLE 96
Score = 27.5 bits (61), Expect = 7.1
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 308 LRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
L E + DE++E+ ++EEDE E E +++ E E +
Sbjct: 55 LEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEI 92
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 29.9 bits (68), Expect = 4.5
Identities = 30/119 (25%), Positives = 52/119 (43%), Gaps = 12/119 (10%)
Query: 231 SKGKLEKRVQYMIEVLAQVRKDNFKDFPDVIEDLDLVPEEDKFTHLMTLDGVKDTQDILN 290
SK K+ + V + ++ K N + + L+L EE +F L G K +
Sbjct: 362 SKRKVRREVLPFLSIIF---KHNPELAARLAAFLELTEEEIEF-----LTGSKKATKKIK 413
Query: 291 VFQYDPEYLMNEE----KYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKE 345
E EE K + +E+E++++ +EE E+EEE EE E+E ++
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein. This family consists of
Ebola and Marburg virus nucleoproteins. These proteins
are responsible for encapsidation of genomic RNA. It has
been found that nucleoprotein DNA vaccines can offer
protection from the virus.
Length = 717
Score = 29.6 bits (66), Expect = 5.4
Identities = 20/71 (28%), Positives = 34/71 (47%), Gaps = 10/71 (14%)
Query: 280 DGVKDTQDILNVFQYDPEYLMNEEKYTTLRREILG---DEDEDDEDGDDE-------EDE 329
G + +QD+ N Q + N T+LR + L +EDE ++ DD+ E +
Sbjct: 508 PGNRASQDLNNNNQKQEDESTNPIGKTSLRYQELTPVQEEDEPEDQTDDDDSSLPPLESD 567
Query: 330 DEEESEAEEGE 340
D+ S+ E+G
Sbjct: 568 DDPGSDNEQGV 578
>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein. This family consists of several
eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
to suppress GCR2 and is highly expressed in the muscle
and heart. The function of this family is unknown
although it has been speculated that SGT1 may be
functionally analogous to the Gcr2p protein of
Saccharomyces cerevisiae which is known to be a
regulatory factor of glycolytic gene expression.
Length = 557
Score = 29.3 bits (66), Expect = 5.4
Identities = 12/30 (40%), Positives = 18/30 (60%)
Query: 314 GDEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
+D DD D DD+ED+DE+E + + E
Sbjct: 481 LPDDSDDADEDDDEDDDEDEDSSSDSTLEE 510
>gnl|CDD|219343 pfam07228, SpoIIE, Stage II sporulation protein E (SpoIIE). This
family contains a number of bacterial stage II
sporulation E proteins (EC:3.1.3.16). These are required
for formation of a normal polar septum during
sporulation. The N-terminal region is hydrophobic and is
expected to contain up to 12 membrane-spanning segments.
Length = 192
Score = 28.8 bits (65), Expect = 5.6
Identities = 10/49 (20%), Positives = 22/49 (44%), Gaps = 3/49 (6%)
Query: 414 EKFFGLLAQRFCQ-INKMYVAPLEQIFRESYSTVHRLDINKLRDDVCVI 461
+ FGL +R + + + E++ + RL +L DD+ ++
Sbjct: 141 GELFGL--ERLLALLAERHGLSPEELLDALLEDLLRLGGGELEDDITLL 187
>gnl|CDD|240400 PTZ00396, PTZ00396, Casein kinase II subunit beta; Provisional.
Length = 251
Score = 29.2 bits (66), Expect = 5.6
Identities = 11/27 (40%), Positives = 14/27 (51%), Gaps = 1/27 (3%)
Query: 312 ILGDEDEDDEDGDDEEDEDEEESEAEE 338
IL E DDED ++E E EA +
Sbjct: 67 ILDSEPPDDED-LEDEQFLEVYQEASD 92
>gnl|CDD|222800 PHA00458, PHA00458, single-stranded DNA-binding protein.
Length = 233
Score = 29.0 bits (65), Expect = 5.7
Identities = 11/34 (32%), Positives = 19/34 (55%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESE 335
E+ Y + +DE++ D DDEE+ +E+E
Sbjct: 198 EDGYVASEAKARKPQDEEEWDEDDEEESEEDEDG 231
Score = 28.6 bits (64), Expect = 7.4
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKEN 343
E + + D+EE ++++E E+EE E +
Sbjct: 204 SEAKARKPQDEEEWDEDDEEESEEDEDGD 232
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 29.2 bits (66), Expect = 5.8
Identities = 17/44 (38%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 310 REILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTET 353
+L DED+DDED + D+D+EE + E E TI + +E
Sbjct: 231 ERVLADEDDDDEDDMFDMDDDDEEE--SDPEVERTSTIKEVSEK 272
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 29.2 bits (65), Expect = 6.0
Identities = 13/50 (26%), Positives = 25/50 (50%), Gaps = 4/50 (8%)
Query: 296 PEYLMNEEKYTTLRRE---ILGDE-DEDDEDGDDEEDEDEEESEAEEGEK 341
P Y + +E ++ +E ILG+ D+ D + +EE+ +A + K
Sbjct: 136 PLYTLCKEDLESMDKEVDDILGEGSDDSDSEKKKPPGNEEEQEQAPQPRK 185
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 29.4 bits (66), Expect = 6.0
Identities = 13/22 (59%), Positives = 17/22 (77%)
Query: 316 EDEDDEDGDDEEDEDEEESEAE 337
+D+ DED D +EDEDE+E E E
Sbjct: 889 DDDADEDEDQDEDEDEDEDEDE 910
Score = 29.0 bits (65), Expect = 7.8
Identities = 14/24 (58%), Positives = 18/24 (75%), Gaps = 1/24 (4%)
Query: 313 LGDEDED-DEDGDDEEDEDEEESE 335
D+D D DED D++EDEDE+E E
Sbjct: 887 NEDDDADEDEDQDEDEDEDEDEDE 910
>gnl|CDD|185558 PTZ00329, PTZ00329, eukaryotic translation initiation factor 1A;
Provisional.
Length = 155
Score = 28.2 bits (63), Expect = 6.4
Identities = 10/37 (27%), Positives = 18/37 (48%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETI 347
+I DE +D + D+E ++E E K + + I
Sbjct: 119 DIFDDEGDDGIEFQDDESDEEATDEMLNLTKADIDDI 155
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 29.0 bits (65), Expect = 6.8
Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 1/53 (1%)
Query: 310 REILGDEDEDDEDGDDEE-DEDEEESEAEEGEKENKETIIDNTETNLVALRRT 361
++++ DEDED+E E +EEESE E K+ T R
Sbjct: 299 KKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRR 351
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 29.2 bits (65), Expect = 6.8
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIID 349
E+ +EDE+ + + + DE+ E EE E + + ID
Sbjct: 145 EVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFID 183
>gnl|CDD|215541 PLN03020, PLN03020, low-temperature-induced protein; Provisional.
Length = 556
Score = 28.9 bits (64), Expect = 7.5
Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Query: 315 DEDEDDEDGDD-EEDEDEEESEAEEGE 340
DE +D D EED+DE+E AE+ +
Sbjct: 58 DEHRVPDDHDLYEEDDDEDEEMAEDPQ 84
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 28.9 bits (64), Expect = 7.5
Identities = 23/75 (30%), Positives = 32/75 (42%), Gaps = 10/75 (13%)
Query: 290 NVFQYDP-EYLMNEEKYTTLRREILGD--EDEDDE-------DGDDEEDEDEEESEAEEG 339
+V YDP LM E+ E L + ED E D DDEE + ++E E E
Sbjct: 452 SVDSYDPRASLMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQ 511
Query: 340 EKENKETIIDNTETN 354
+ ET + + N
Sbjct: 512 GIKYSETSEADKDVN 526
>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking. Sec14-like
Golgi-trafficking domain The GOLD domain is always found
combined with lipid- or membrane-association domains.
Length = 136
Score = 27.7 bits (62), Expect = 8.0
Identities = 13/28 (46%), Positives = 17/28 (60%), Gaps = 2/28 (7%)
Query: 321 EDGDDEEDEDEEESEAEEG--EKENKET 346
E D+EE+E+ EE EAE G E +K
Sbjct: 53 ESSDEEEEEEAEEEEAETGDVEAGSKSQ 80
>gnl|CDD|218970 pfam06278, DUF1032, Protein of unknown function (DUF1032). This
family consists of several conserved eukaryotic proteins
of unknown function.
Length = 565
Score = 28.8 bits (64), Expect = 8.4
Identities = 26/95 (27%), Positives = 39/95 (41%), Gaps = 16/95 (16%)
Query: 302 EEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRT 361
E+K L + L D EDD D DD D ++ E + E E + E
Sbjct: 369 EQKLWPLEEDRLEDSVEDDGDADDFSDPEDYLEPPEGLDPE--EQAFQDLEAE------- 419
Query: 362 IYLTIHSSLDFEECAHKLLRMQL----KPGQESEL 392
+ SSL +EE + + + + K QE+EL
Sbjct: 420 ---AMPSSLSYEELVRRNVDLFIANSQKYAQETEL 451
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 28.8 bits (64), Expect = 8.4
Identities = 16/42 (38%), Positives = 24/42 (57%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTE 352
+IL E + E+ + EE E EEE E E E ++ + I D+ E
Sbjct: 366 DILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDE 407
>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6. The VP6 protein a
minor protein in the core of the virion is probably the
viral helicase.
Length = 322
Score = 28.5 bits (63), Expect = 8.4
Identities = 12/43 (27%), Positives = 23/43 (53%), Gaps = 2/43 (4%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKE--NKETIIDNTETNL 355
+++ DEDG ++ D E + ++G KE +KET + +
Sbjct: 35 EKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVDRRT 77
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 28.5 bits (64), Expect = 8.8
Identities = 16/77 (20%), Positives = 30/77 (38%), Gaps = 14/77 (18%)
Query: 283 KDTQDILNVFQYDPEYLMNEEKYTTLRREILGDEDEDDEDGDDEEDEDEEESEAEEGEKE 342
+ + NV + D E ++ E + +E E +E DDE ++ + +
Sbjct: 128 NEEDEEENVDEDDAEIIL----------EEVEEEVEIEEVDDDEGTQETKYKRGDTSLTP 177
Query: 343 NK----ETIIDNTETNL 355
E++ID E L
Sbjct: 178 QAKDVLESLIDAAEWKL 194
>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
Length = 470
Score = 28.7 bits (65), Expect = 9.1
Identities = 18/83 (21%), Positives = 35/83 (42%), Gaps = 7/83 (8%)
Query: 283 KDTQDILNVFQYDPE------YLMNEEKYTTLRREILGDEDEDDE-DGDDEEDEDEEESE 335
+ IL++F + L+ EE +++L +E E +G DEE +E
Sbjct: 349 AEFDAILDLFMEALDIDEEIAQLLVEEGFSSLEELAYVPVEELLEIEGFDEETVEELRER 408
Query: 336 AEEGEKENKETIIDNTETNLVAL 358
A+E + + +L++L
Sbjct: 409 AKEALETEALAQEEKLADDLLSL 431
>gnl|CDD|177868 PLN02222, PLN02222, phosphoinositide phospholipase C 2.
Length = 581
Score = 28.8 bits (64), Expect = 9.2
Identities = 8/22 (36%), Positives = 18/22 (81%)
Query: 316 EDEDDEDGDDEEDEDEEESEAE 337
D++D +GDD++D+D+ E +++
Sbjct: 288 VDKNDSNGDDDDDDDDGEDKSK 309
>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B. Escherichia coli
stringent starvation protein B (SspB), is thought to
enhance the specificity of degradation of tmRNA-tagged
proteins by the ClpXP protease. The tmRNA tag, also
known as ssrA, is an 11-aa peptide added to the C
terminus of proteins stalled during translation, targets
proteins for degradation by ClpXP and ClpAP. SspB a
cytoplasmic protein that specifically binds to residues
1-4 and 7 of the tag. Binding of SspB enhances
degradation of tagged proteins by ClpX, and masks
sequence elements important for ClpA interactions,
inhibiting degradation by ClpA. However, more recent
work has cast doubt on the importance of SspB in
wild-type cells. SspB is encoded in an operon whose
synthesis is stimulated by carbon, amino acid, and
phosphate starvation. SspB may play a special role
during nutrient stress, for example by ensuring rapid
degradation of the products of stalled translation,
without causing a global increase in degradation of all
ClpXP substrates.
Length = 153
Score = 27.9 bits (63), Expect = 9.2
Identities = 10/34 (29%), Positives = 18/34 (52%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEEGEKENKETII 348
+E+EDD+ D EDED E + + + ++
Sbjct: 119 EEEEDDDADDSPEDEDPEPKDPPKTKGRPSLKVV 152
>gnl|CDD|219912 pfam08574, DUF1762, Protein of unknown function (DUF1762). This is
a family of proteins of unknown function. Yeast IWR1 is
known to interact with RNA polymerase II and deletion of
this protein results in hypersensitivity to the K1
killer toxin.
Length = 77
Score = 26.6 bits (59), Expect = 9.3
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 311 EILGDEDEDDEDGDDEEDEDEEESEAE 337
E D D++D D+ D+E+S AE
Sbjct: 40 EYHEDLANDEDDDADQVLSDDEDSNAE 66
>gnl|CDD|215562 PLN03078, PLN03078, Putative tRNA pseudouridine synthase;
Provisional.
Length = 513
Score = 28.7 bits (64), Expect = 9.3
Identities = 9/47 (19%), Positives = 21/47 (44%)
Query: 317 DEDDEDGDDEEDEDEEESEAEEGEKENKETIIDNTETNLVALRRTIY 363
+E+ + +E++ED S G + E I+ + + R ++
Sbjct: 278 EENHGEISEEDEEDLSFSSIPSGSSDENEDILKFQSSQVQIRARWLH 324
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 28.6 bits (64), Expect = 9.3
Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 309 RREILGDEDEDDEDGDDEEDEDEEESEAEEGEKENKET 346
E LG DED +DE+DE E + E E+ +K K+
Sbjct: 328 LSEFLGYMGGIDEDDEDEDDE-ESKEEVEKKQKVKKKP 364
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 28.9 bits (65), Expect = 9.5
Identities = 12/24 (50%), Positives = 18/24 (75%)
Query: 315 DEDEDDEDGDDEEDEDEEESEAEE 338
ED+DD + DD EDED+E+ E ++
Sbjct: 1365 SEDDDDSEVDDSEDEDDEDDEDDD 1388
>gnl|CDD|219240 pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus. This
family represents the C-terminus (approximately 500
residues) of the eukaryotic coatomer alpha subunit.
Coatomer (COPI) is a large cytosolic protein complex
which forms a coat around vesicles budding from the
Golgi apparatus. Such coatomer-coated vesicles have been
proposed to play a role in many distinct steps of
intracellular transport. Note that many family members
also contain the pfam04053 domain.
Length = 421
Score = 28.7 bits (64), Expect = 9.6
Identities = 10/44 (22%), Positives = 13/44 (29%), Gaps = 11/44 (25%)
Query: 313 LGDEDEDDEDGDDEE-----DEDEE------ESEAEEGEKENKE 345
D D E+ DED + EEG + E
Sbjct: 30 AADFAVMGGDDWGEDADLGLDEDGYLEGEDGLLDDEEGPEGGDE 73
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.135 0.384
Gapped
Lambda K H
0.267 0.0857 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 27,937,354
Number of extensions: 2795072
Number of successful extensions: 7731
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5817
Number of HSP's successfully gapped: 533
Length of query: 551
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 449
Effective length of database: 6,413,494
Effective search space: 2879658806
Effective search space used: 2879658806
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.0 bits)