RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12224
(904 letters)
>gnl|CDD|214744 smart00611, SEC63, Domain of unknown function in Sec63p, Brr2p and
other proteins.
Length = 312
Score = 179 bits (456), Expect = 3e-50
Identities = 70/268 (26%), Positives = 118/268 (44%), Gaps = 32/268 (11%)
Query: 109 GMWWYKSIRYTGDKVLL-ETINFYYAFFQITPHMALKRVLMILGASLEFDRRFNSEIIER 167
G+W R + TI + + P M K +L IL S EFD+
Sbjct: 1 GIWPTDLGRIASYYYISYTTIRTF--NELLKPKMTTKDLLRILSMSSEFDQ--------- 49
Query: 168 PSDEIEVPQLIRQIPNLGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLEADRMYIV 227
E L+ ++ E P +KA L+ AHLS ++L S LE+D +Y++
Sbjct: 50 IPVRHEEDLLLEELAEKLPIRLENPSLDDPHVKANLLLQAHLSRLKLPSFALESDTVYVL 109
Query: 228 KKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFKNPLLQLP 287
+ LLQ MV + L+ A N + L MI+Q +W +PLLQLP
Sbjct: 110 QNAGRLLQAMVDIALERGWLSTA------------LNALNLSQMIIQALWPTDSPLLQLP 157
Query: 288 YVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNMPYVDLHV 347
++ ++ LK +K+ + SL+ ++++EER ++ + D + + KVL +P +++ +
Sbjct: 158 HLPEEILKR--LEKKKVLSLEDLLELEDEERGELLG-LLDAEGERVYKVLSRLPKLNIEI 214
Query: 348 QPEVIDDEATTEYTAGAIITVTCTLMRK 375
E I G +T+T L
Sbjct: 215 SLEPITRT-----VLGVEVTLTVDLTWD 237
Score = 83.5 bits (207), Expect = 2e-17
Identities = 27/73 (36%), Positives = 36/73 (49%), Gaps = 1/73 (1%)
Query: 556 PEDKQEFWWIYISDRKSRTLLTSPYH-ITELVDQEQVQLKFTAPRWPGVYTFSVCLRCDS 614
KQE WW+ I D LL + + E+V+L FTAP G Y +++ L DS
Sbjct: 240 IHGKQEGWWLVIGDSDGNELLHIERFSLNKKNVSEEVKLDFTAPATEGNYQYTLRLVSDS 299
Query: 615 YLGFDQMQDIKLD 627
YLG DQ + D
Sbjct: 300 YLGCDQEYPLSFD 312
>gnl|CDD|217270 pfam02889, Sec63, Sec63 Brl domain. This domain (also known as the
Brl domain) is required for assembly of functional
endoplasmic reticulum translocons.
Length = 309
Score = 143 bits (362), Expect = 1e-37
Identities = 61/255 (23%), Positives = 116/255 (45%), Gaps = 40/255 (15%)
Query: 105 PTAVGM---WWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASLEFDRRFN 161
PT +G +Y +L ET+ + + P+ LK +L IL ++ EF+
Sbjct: 1 PTELGRIASHYY---------ILYETMETF--NTSLKPNTTLKDLLEILSSASEFE---- 45
Query: 162 SEIIERPSDEIEVPQLIRQIPN-LGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLE 220
EI R ++ E+ +L+ ++P + + + P K L+ A+LS ++L +L
Sbjct: 46 -EIPVREEEKKELKKLLERVPIPVKSPSIDDP-----HAKVNLLLQAYLSRLKLPDFSLV 99
Query: 221 ADRMYIVKKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFK 280
+D YI++ LL+ + I L+ + N ++L MI Q +W
Sbjct: 100 SDLNYILQNAGRLLRALF-----EIALSKG-------WLSPALNALELSKMIEQRLWPSD 147
Query: 281 NPLLQLPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNM 340
+PL QLP++ + LK +K+ I SL+ + EE ++ N +Q D+ + +
Sbjct: 148 SPLRQLPHIPPEVLKK--LEKKGISSLEDLMDLDPEELGELLG-NNPKQGKDIAEFVNRF 204
Query: 341 PYVDLHVQPEVIDDE 355
P +++ + + I E
Sbjct: 205 PRLEIEAEVQPITRE 219
Score = 40.7 bits (96), Expect = 0.002
Identities = 23/79 (29%), Positives = 35/79 (44%), Gaps = 4/79 (5%)
Query: 550 VHCPRFPEDKQEFWWIYISDRKSRTLLTSPYHITELVDQEQVQLKFTAPRWPGVYTFSVC 609
RF K E +WI + D + LL T +++ +L+FT P G +
Sbjct: 233 EWDERFH-GKSEGFWIVVEDSDNNELLAIER-FTLNKKKDEHELEFTVPLSGGPLPPQLT 290
Query: 610 LR--CDSYLGFDQMQDIKL 626
+R DS+LG DQ + L
Sbjct: 291 IRLVSDSWLGCDQEVPVSL 309
>gnl|CDD|227694 COG5407, SEC63, Preprotein translocase subunit Sec63 [Intracellular
trafficking and secretion].
Length = 610
Score = 120 bits (301), Expect = 6e-28
Identities = 107/554 (19%), Positives = 190/554 (34%), Gaps = 96/554 (17%)
Query: 13 GASEASIKKAYRKQSLILHPDK---ETGD-----EKAFMKLTKAYQALTDEESRRNFEKY 64
SE IKK YR S+ HPDK + E+ + +TKAY LTD++ R N+ Y
Sbjct: 109 DTSERDIKKRYRMLSMKFHPDKAPPMVNELRSEYEEKYKTITKAYGLLTDKKRRENYLNY 168
Query: 65 GNPDGPGAMSFGIALPSYIVEKENSVWVLGLYALVFMVALPTAVGMWWYKSIRYTGDKVL 124
G PD P S GIALP IV E S++ +Y+L+ V LP V WW + YT V
Sbjct: 169 GTPDSPQHTSEGIALPKVIVRSERSMYAFVMYSLLLGVFLPYWVYRWWREIRDYTKVGVH 228
Query: 125 LETINFYYAFFQITPHMALKRVLMILGASLEFDRRFNSEIIERPSDEIEVPQLIRQIPNL 184
T+ +Y +I + L +L + +S E R P
Sbjct: 229 FVTMEMFYE--RIDGSLTLDELLSLFASSKELTRMN---------------------PKG 265
Query: 185 GEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLEADRMYIVKKCPYLLQEMVTCISQL 244
+ E+ L+ HL N ++E + I LL ++
Sbjct: 266 PKCTLEK------------LLGDHL-NRA---RSVEFNEYRIKSNVEGLLGALLR----- 304
Query: 245 ILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFKNPLLQLPYVTDDHLKHFICKKRYI 304
I +A + + +++ P+ D +LQLP ++ + R I
Sbjct: 305 IASNFAFPLKECGKENKGQYIVQAIPL------DHLFRILQLPRSDVEYAQRV--SLRLI 356
Query: 305 KSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNMPYVDLHVQPEVIDDEATTEYTAGA 364
+ ++ + + +V+K + + ++ V +P L P ++ ++ +
Sbjct: 357 EGMKAIGSLIAKRYGNVLKNLVVLELMEIQAVADGIPRYFLLQAPFLVFEDLFITEKSKE 416
Query: 365 IITVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTH 424
+++ L P +E S IE K + + +P T
Sbjct: 417 RVSLKGYLGAIPGPEHRTSALNVYNQVEISDFEASVIETGAIKNDSSDSPYSEAPDFATR 476
Query: 425 RPV--WMKNKK---------GGKKKKFTKNKHDKKSSGQ--------------------- 452
++ K GG +K D++ G
Sbjct: 477 NDSEWAVRVAKCEQTVYIIPGGSIATVSKVTLDRRIQGDVAPVDKTGGKQLIVHAPFMVG 536
Query: 453 ----KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKN 508
K I + K++ D E+ +V + S +++ V +E
Sbjct: 537 AFSVKWIVMLTVDNVIYGEQLKIQVVDDFEKKANVRFSIDVGELKSTCDDEVFVGDEEDL 596
Query: 509 EDSNNKSEESSDDD 522
K ++ ++
Sbjct: 597 SRYVEKDTDTKVEE 610
Score = 30.7 bits (69), Expect = 3.9
Identities = 14/161 (8%), Positives = 43/161 (26%), Gaps = 7/161 (4%)
Query: 509 EDSNNKSEESSDDDDDDWEKYQTGLNKRDK-VLEGRSKQSHSVHCPRFPEDKQEFWWIYI 567
E + + + D+E + ++ S S P F W + +
Sbjct: 431 EHRTSALNVYNQVEISDFEASV-----IETGAIKNDSSDSPYSEAPDFATRNDSEWAVRV 485
Query: 568 SDRKSRTLLTSPYHITEL-VDQEQVQLKFTAPRWPGVYTFSVCLRCDSYLGFDQMQDIKL 626
+ + + I + +++ + + +G ++ I +
Sbjct: 486 AKCEQTVYIIPGGSIATVSKVTLDRRIQGDVAPVDKTGGKQLIVHAPFMVGAFSVKWIVM 545
Query: 627 DVKEAPEVPTEHPQWEMSGEEDEGDEEMGGSDVSEFTTDED 667
+ + + E + + +T +D
Sbjct: 546 LTVDNVIYGEQLKIQVVDDFEKKANVRFSIDVGELKSTCDD 586
>gnl|CDD|214946 smart00973, Sec63, Sec63 Brl domain. This domain was named after
the yeast Sec63 (or NPL1) (also known as the Brl domain)
protein in which it was found. This protein is required
for assembly of functional endoplasmic reticulum
translocons. Other yeast proteins containing this domain
include pre-mRNA splicing helicase BRR2, HFM1 protein
and putative helicases.
Length = 314
Score = 85.5 bits (212), Expect = 6e-18
Identities = 59/256 (23%), Positives = 104/256 (40%), Gaps = 39/256 (15%)
Query: 105 PTAVGM---WWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASLEFDRRFN 161
PT +G ++Y I Y ETI + + P LK +L IL + EF
Sbjct: 1 PTELGRIASYYY--ISY-------ETIETFNQ--SLKPTTTLKDILEILSRASEFK---- 45
Query: 162 SEIIERPSDEIEVPQLIRQIP-NLGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLE 220
EI R +++ E+ +L +++P + E + P K L+ AHLS + L L
Sbjct: 46 -EIPVRHNEKKELNELNKRVPIPVKEGIIDSP-----HAKVNLLLQAHLSRLPLPDFDLV 99
Query: 221 ADRMYIVKKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFK 280
+D YI++ P +L+ +V L A N + L M+VQ +W+
Sbjct: 100 SDLKYILQNAPRILRALVDIALSKGWLRTAL------------NALDLSQMVVQRLWEDS 147
Query: 281 N-PLLQLPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGN 339
+ PL QLP+ + + + K +S + M E + + ++L
Sbjct: 148 DSPLKQLPHFLIEDVYDKLELKDGSRSFELLLDMNAAELGEFLNRLPPNGRLIY-ELLRR 206
Query: 340 MPYVDLHVQPEVIDDE 355
P +++ + I +
Sbjct: 207 FPKIEVEAEVLPITRD 222
Score = 50.4 bits (121), Expect = 2e-06
Identities = 34/150 (22%), Positives = 56/150 (37%), Gaps = 5/150 (3%)
Query: 481 KDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVL 540
++ S L D+ E+ + + + E + + RD L
Sbjct: 166 LELKDGSRSFELLLDMNAAELGEFLNRLPPNGRLIYELLRRFPKIEVEAEVLPITRDLTL 225
Query: 541 EGRSKQS--HSVHCPRFPEDKQEFWWIYISDRKSRTLLTSPY-HITELVDQEQVQLKFTA 597
+ + + PR + K E WW+ + D + LL + + +V+L FT
Sbjct: 226 RVELEITPVFAWDLPR-HKGKSESWWLVVGDSDTNELLAIKRVTLRKKKKSNEVKLDFTV 284
Query: 598 PR-WPGVYTFSVCLRCDSYLGFDQMQDIKL 626
P PG ++V L DSYLG DQ L
Sbjct: 285 PLSEPGPENYTVYLISDSYLGCDQEVSFSL 314
>gnl|CDD|215804 pfam00226, DnaJ, DnaJ domain. DnaJ domains (J-domains) are
associated with hsp70 heat-shock system and it is
thought that this domain mediates the interaction.
DnaJ-domain is therefore part of a chaperone (protein
folding) system. The T-antigens, although not in
Prosite are confirmed as DnaJ containing domains from
literature.
Length = 63
Score = 66.8 bits (164), Expect = 7e-14
Identities = 25/54 (46%), Positives = 34/54 (62%), Gaps = 3/54 (5%)
Query: 11 PSGASEASIKKAYRKQSLILHPDKETGD---EKAFMKLTKAYQALTDEESRRNF 61
P AS+ IKKAYRK +L HPDK GD E+ F ++ +AY+ L+D E R +
Sbjct: 9 PRDASDEEIKKAYRKLALKYHPDKNPGDPAAEEKFKEINEAYEVLSDPEKRAIY 62
>gnl|CDD|223560 COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn
finger domain [Posttranslational modification, protein
turnover, chaperones].
Length = 371
Score = 67.7 bits (166), Expect = 8e-12
Identities = 28/73 (38%), Positives = 41/73 (56%), Gaps = 7/73 (9%)
Query: 11 PSGASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYG-- 65
ASE IKKAYRK + HPD+ GD++A F ++ +AY+ L+D E R ++++G
Sbjct: 13 SKDASEEEIKKAYRKLAKKYHPDRNPGDKEAEEKFKEINEAYEVLSDPEKRAAYDQFGHA 72
Query: 66 --NPDGPGAMSFG 76
G G FG
Sbjct: 73 GFKAGGFGGFGFG 85
>gnl|CDD|99751 cd06257, DnaJ, DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock
protein 40) proteins are highly conserved and play
crucial roles in protein translation, folding,
unfolding, translocation, and degradation. They act
primarily by stimulating the ATPase activity of Hsp70s,
an important chaperonine family. Hsp40 proteins are
characterized by the presence of a J domain, which
mediates the interaction with Hsp70. They may contain
other domains as well, and the architectures provide a
means of classification.
Length = 55
Score = 59.9 bits (146), Expect = 1e-11
Identities = 22/47 (46%), Positives = 30/47 (63%), Gaps = 3/47 (6%)
Query: 11 PSGASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTD 54
P AS+ IKKAYRK +L HPDK D +A F ++ +AY+ L+D
Sbjct: 9 PPDASDEEIKKAYRKLALKYHPDKNPDDPEAEEKFKEINEAYEVLSD 55
>gnl|CDD|197617 smart00271, DnaJ, DnaJ molecular chaperone homology domain.
Length = 60
Score = 59.9 bits (146), Expect = 2e-11
Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 4/51 (7%)
Query: 11 PSGASEASIKKAYRKQSLILHPDKETGD----EKAFMKLTKAYQALTDEES 57
P AS IKKAYRK +L HPDK GD E+ F ++ +AY+ L+D E
Sbjct: 10 PRDASLDEIKKAYRKLALKYHPDKNPGDKEEAEEKFKEINEAYEVLSDPEK 60
>gnl|CDD|237658 PRK14284, PRK14284, chaperone protein DnaJ; Provisional.
Length = 391
Score = 64.5 bits (157), Expect = 8e-11
Identities = 32/69 (46%), Positives = 45/69 (65%), Gaps = 6/69 (8%)
Query: 13 GASEASIKKAYRKQSLILHPDKETGD---EKAFMKLTKAYQALTDEESRRNFEKYGNPDG 69
AS IKKAYRK ++ HPDK GD EK F ++++AY+ L+D + R ++++YG DG
Sbjct: 12 TASPEEIKKAYRKLAVKYHPDKNPGDAEAEKRFKEVSEAYEVLSDAQKRESYDRYGK-DG 70
Query: 70 P--GAMSFG 76
P GA FG
Sbjct: 71 PFAGAGGFG 79
>gnl|CDD|184611 PRK14297, PRK14297, chaperone protein DnaJ; Provisional.
Length = 380
Score = 64.0 bits (156), Expect = 1e-10
Identities = 27/67 (40%), Positives = 44/67 (65%), Gaps = 3/67 (4%)
Query: 13 GASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYGNPDG 69
GAS+ IKKA+RK ++ HPDK G+++A F ++ +AYQ L+D + + ++++G D
Sbjct: 15 GASDDEIKKAFRKLAIKYHPDKNKGNKEAEEKFKEINEAYQVLSDPQKKAQYDQFGTADF 74
Query: 70 PGAMSFG 76
GA FG
Sbjct: 75 NGAGGFG 81
>gnl|CDD|233829 TIGR02349, DnaJ_bact, chaperone protein DnaJ. This model
represents bacterial forms of DnaJ, part of the
DnaK-DnaJ-GrpE chaperone system. The three components
typically are encoded by consecutive genes. DnaJ
homologs occur in many genomes, typically not near DnaK
and GrpE-like genes; most such genes are not included
by this family. Eukaryotic (mitochondrial and
chloroplast) forms are not included in the scope of
this family.
Length = 354
Score = 61.8 bits (151), Expect = 5e-10
Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 11 PSGASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPD 68
ASE IKKAYRK + HPD K+ E+ F ++ +AY+ L+D E R ++++G+
Sbjct: 9 SKDASEEEIKKAYRKLAKKYHPDRNKDKEAEEKFKEINEAYEVLSDPEKRAQYDQFGH-A 67
Query: 69 GPGAMSFG 76
G G
Sbjct: 68 GFNGGGGG 75
>gnl|CDD|237653 PRK14276, PRK14276, chaperone protein DnaJ; Provisional.
Length = 380
Score = 62.0 bits (151), Expect = 6e-10
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 4/65 (6%)
Query: 14 ASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPDGPG 71
AS+ IKKAYRK S HPD KE G E+ + ++ +AY+ L+D + R +++YG G
Sbjct: 16 ASQDEIKKAYRKLSKKYHPDINKEPGAEEKYKEVQEAYETLSDPQKRAAYDQYGAAGANG 75
Query: 72 AMSFG 76
FG
Sbjct: 76 G--FG 78
>gnl|CDD|237662 PRK14292, PRK14292, chaperone protein DnaJ; Provisional.
Length = 371
Score = 61.8 bits (150), Expect = 6e-10
Identities = 29/71 (40%), Positives = 40/71 (56%), Gaps = 8/71 (11%)
Query: 3 GKSRTPLKPSGASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRN 60
G SRT AS IK AYRK +L HPD KE G + F ++ +AY L+D E R +
Sbjct: 9 GVSRT------ASADEIKSAYRKLALKYHPDRNKEKGAAEKFAQINEAYAVLSDAEKRAH 62
Query: 61 FEKYGNPDGPG 71
++++G G G
Sbjct: 63 YDRFGTAPGAG 73
>gnl|CDD|240236 PTZ00037, PTZ00037, DnaJ_C chaperone protein; Provisional.
Length = 421
Score = 59.4 bits (144), Expect = 4e-09
Identities = 22/52 (42%), Positives = 37/52 (71%), Gaps = 1/52 (1%)
Query: 14 ASEASIKKAYRKQSLILHPDKETGDEKAFMKLTKAYQALTDEESRRNFEKYG 65
+ + IKKAYRK ++ HPDK GD + F ++++AY+ L+D E R+ +++YG
Sbjct: 40 CTTSEIKKAYRKLAIKHHPDK-GGDPEKFKEISRAYEVLSDPEKRKIYDEYG 90
>gnl|CDD|236757 PRK10767, PRK10767, chaperone protein DnaJ; Provisional.
Length = 371
Score = 58.6 bits (143), Expect = 7e-09
Identities = 28/71 (39%), Positives = 42/71 (59%), Gaps = 8/71 (11%)
Query: 13 GASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYG---- 65
ASE IKKAYRK ++ HPD+ GD++A F ++ +AY+ L+D + R +++YG
Sbjct: 15 NASEDEIKKAYRKLAMKYHPDRNPGDKEAEEKFKEIKEAYEVLSDPQKRAAYDQYGHAAF 74
Query: 66 NPDGPGAMSFG 76
G G FG
Sbjct: 75 EQGGGGG-GFG 84
>gnl|CDD|237656 PRK14280, PRK14280, chaperone protein DnaJ; Provisional.
Length = 376
Score = 57.0 bits (138), Expect = 2e-08
Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 10/74 (13%)
Query: 13 GASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPD-- 68
AS+ IKKAYRK S HPD KE G ++ F ++++AY+ L+D++ R ++++G+
Sbjct: 15 SASKDEIKKAYRKLSKKYHPDINKEEGADEKFKEISEAYEVLSDDQKRAQYDQFGHAGPN 74
Query: 69 ------GPGAMSFG 76
G G FG
Sbjct: 75 QGFGGGGFGGGDFG 88
>gnl|CDD|184599 PRK14277, PRK14277, chaperone protein DnaJ; Provisional.
Length = 386
Score = 56.0 bits (135), Expect = 5e-08
Identities = 26/73 (35%), Positives = 42/73 (57%), Gaps = 7/73 (9%)
Query: 11 PSGASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYG-- 65
A+E IKKAYR+ + HPD GD++A F ++ +AY+ L+D + R ++++G
Sbjct: 14 DRNATEEEIKKAYRRLAKKYHPDLNPGDKEAEQKFKEINEAYEILSDPQKRAQYDQFGHA 73
Query: 66 --NPDGPGAMSFG 76
+P G G FG
Sbjct: 74 AFDPGGFGQGGFG 86
>gnl|CDD|237664 PRK14294, PRK14294, chaperone protein DnaJ; Provisional.
Length = 366
Score = 54.4 bits (131), Expect = 1e-07
Identities = 23/55 (41%), Positives = 37/55 (67%), Gaps = 3/55 (5%)
Query: 14 ASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYG 65
ASE IKK+YRK ++ HPD+ GD++A F + +AY+ L+D + R +++YG
Sbjct: 16 ASEEEIKKSYRKLAMKYHPDRNPGDKEAEELFKEAAEAYEVLSDPKKRGIYDQYG 70
>gnl|CDD|184612 PRK14298, PRK14298, chaperone protein DnaJ; Provisional.
Length = 377
Score = 54.1 bits (130), Expect = 2e-07
Identities = 23/58 (39%), Positives = 36/58 (62%), Gaps = 2/58 (3%)
Query: 11 PSGASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGN 66
AS IKKAYRK ++ HPD KE E+ F ++++AY L+D E R ++++G+
Sbjct: 14 SKDASVEDIKKAYRKLAMKYHPDKNKEPDAEEKFKEISEAYAVLSDAEKRAQYDRFGH 71
>gnl|CDD|184604 PRK14283, PRK14283, chaperone protein DnaJ; Provisional.
Length = 378
Score = 53.3 bits (128), Expect = 3e-07
Identities = 22/56 (39%), Positives = 38/56 (67%), Gaps = 2/56 (3%)
Query: 13 GASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGN 66
A + IKKAYRK + HPD +E G E+ F ++++AY L+D+E R+ ++++G+
Sbjct: 16 NADKKEIKKAYRKLARKYHPDVSEEEGAEEKFKEISEAYAVLSDDEKRQRYDQFGH 71
>gnl|CDD|237661 PRK14291, PRK14291, chaperone protein DnaJ; Provisional.
Length = 382
Score = 53.6 bits (129), Expect = 3e-07
Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
Query: 11 PSGASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYG 65
A++ IKKAYR+ + HPD K E+ F ++ +AYQ L+D E R+ ++++G
Sbjct: 12 SRNATQEEIKKAYRRLARKYHPDFNKNPEAEEKFKEINEAYQVLSDPEKRKLYDQFG 68
>gnl|CDD|237657 PRK14281, PRK14281, chaperone protein DnaJ; Provisional.
Length = 397
Score = 53.3 bits (128), Expect = 3e-07
Identities = 21/55 (38%), Positives = 38/55 (69%), Gaps = 3/55 (5%)
Query: 14 ASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYG 65
A + IKKAYRK +L HPDK +++A F ++ +AY+ L++++ RR ++++G
Sbjct: 15 ADKDEIKKAYRKLALKYHPDKNPDNKEAEEHFKEVNEAYEVLSNDDKRRRYDQFG 69
>gnl|CDD|172773 PRK14285, PRK14285, chaperone protein DnaJ; Provisional.
Length = 365
Score = 53.5 bits (128), Expect = 3e-07
Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 3/66 (4%)
Query: 13 GASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYGNPDG 69
GAS+ IKKAYRK ++ HPDK G+++A F + T+AY+ L D+ R ++++G+
Sbjct: 14 GASKDEIKKAYRKIAIKYHPDKNKGNKEAESIFKEATEAYEVLIDDNKRAQYDRFGHTAF 73
Query: 70 PGAMSF 75
G F
Sbjct: 74 EGGGGF 79
>gnl|CDD|237654 PRK14278, PRK14278, chaperone protein DnaJ; Provisional.
Length = 378
Score = 52.7 bits (127), Expect = 5e-07
Identities = 26/63 (41%), Positives = 37/63 (58%), Gaps = 4/63 (6%)
Query: 13 GASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYGNPDG 69
AS+A IK+AYRK + LHPD DE+A F +++ AY+ L+D E RR + G+P
Sbjct: 14 NASDAEIKRAYRKLARELHPDV-NPDEEAQEKFKEISVAYEVLSDPEKRRIVDLGGDPLE 72
Query: 70 PGA 72
Sbjct: 73 SAG 75
>gnl|CDD|237655 PRK14279, PRK14279, chaperone protein DnaJ; Provisional.
Length = 392
Score = 52.8 bits (127), Expect = 5e-07
Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 3/54 (5%)
Query: 12 SGASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFE 62
S AS IKKAYRK + LHPD GD A F +++A+ L+D R+ ++
Sbjct: 19 SDASAEEIKKAYRKLARELHPDANPGDPAAEERFKAVSEAHDVLSDPAKRKEYD 72
>gnl|CDD|237665 PRK14295, PRK14295, chaperone protein DnaJ; Provisional.
Length = 389
Score = 52.5 bits (126), Expect = 6e-07
Identities = 31/76 (40%), Positives = 44/76 (57%), Gaps = 10/76 (13%)
Query: 11 PSGASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEK---- 63
P A+EA IKKAYRK + HPD GD KA F ++++AY L+DE+ R+ +++
Sbjct: 18 PKDATEAEIKKAYRKLAREYHPDANKGDAKAEERFKEISEAYDVLSDEKKRKEYDEARSL 77
Query: 64 YGNP---DGPGAMSFG 76
+GN GPG G
Sbjct: 78 FGNGGFRPGPGGGGGG 93
>gnl|CDD|237667 PRK14299, PRK14299, chaperone protein DnaJ; Provisional.
Length = 291
Score = 51.5 bits (123), Expect = 9e-07
Identities = 24/61 (39%), Positives = 35/61 (57%), Gaps = 2/61 (3%)
Query: 11 PSGASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPD 68
P AS+ IKKA++K + HPD K G E+ F ++ +AY L+D E RR ++ YG
Sbjct: 13 PKNASQDEIKKAFKKLARKYHPDVNKSPGAEEKFKEINEAYTVLSDPEKRRIYDTYGTTA 72
Query: 69 G 69
Sbjct: 73 A 73
>gnl|CDD|172774 PRK14286, PRK14286, chaperone protein DnaJ; Provisional.
Length = 372
Score = 51.1 bits (122), Expect = 2e-06
Identities = 25/68 (36%), Positives = 40/68 (58%), Gaps = 4/68 (5%)
Query: 13 GASEASIKKAYRKQSLILHPDKETGD---EKAFMKLTKAYQALTDEESRRNFEKYGNPD- 68
A++ IK AYRK ++ HPDK G+ E+ F + T+AY+ L D + R+ ++++G
Sbjct: 15 SANDEEIKSAYRKLAIKYHPDKNKGNKESEEKFKEATEAYEILRDPKKRQAYDQFGKAGV 74
Query: 69 GPGAMSFG 76
GA FG
Sbjct: 75 NAGAGGFG 82
>gnl|CDD|237660 PRK14289, PRK14289, chaperone protein DnaJ; Provisional.
Length = 386
Score = 51.0 bits (122), Expect = 2e-06
Identities = 24/62 (38%), Positives = 39/62 (62%), Gaps = 3/62 (4%)
Query: 14 ASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYGNPDGP 70
A+ IKKAYRK+++ HPDK GD++A F + +AY L+D + R ++++G+
Sbjct: 17 ATVDEIKKAYRKKAIQYHPDKNPGDKEAEEKFKEAAEAYDVLSDPDKRSRYDQFGHAGVG 76
Query: 71 GA 72
GA
Sbjct: 77 GA 78
>gnl|CDD|237668 PRK14301, PRK14301, chaperone protein DnaJ; Provisional.
Length = 373
Score = 50.5 bits (121), Expect = 2e-06
Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 3/66 (4%)
Query: 14 ASEASIKKAYRKQSLILHPDKETGD---EKAFMKLTKAYQALTDEESRRNFEKYGNPDGP 70
ASE IKKAYRK +L HPD+ + E+ F + +AY+ L D E R ++++G+
Sbjct: 16 ASEDEIKKAYRKLALQYHPDRNPDNPEAEQKFKEAAEAYEVLRDAEKRARYDRFGHAGVN 75
Query: 71 GAMSFG 76
G FG
Sbjct: 76 GNGGFG 81
>gnl|CDD|237663 PRK14293, PRK14293, chaperone protein DnaJ; Provisional.
Length = 374
Score = 50.4 bits (121), Expect = 3e-06
Identities = 23/65 (35%), Positives = 38/65 (58%), Gaps = 2/65 (3%)
Query: 14 ASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPDGPG 71
A + +K+AYR+ + HPD KE G E F ++ +AY+ L+D E+R ++++G G
Sbjct: 15 ADKDELKRAYRRLARKYHPDVNKEPGAEDRFKEINRAYEVLSDPETRARYDQFGEAGVSG 74
Query: 72 AMSFG 76
A F
Sbjct: 75 AAGFP 79
>gnl|CDD|225124 COG2214, CbpA, DnaJ-class molecular chaperone [Posttranslational
modification, protein turnover, chaperones].
Length = 237
Score = 48.3 bits (114), Expect = 6e-06
Identities = 26/59 (44%), Positives = 35/59 (59%), Gaps = 4/59 (6%)
Query: 11 PSGASEASIKKAYRKQSLILHPDKETGDEKA----FMKLTKAYQALTDEESRRNFEKYG 65
P AS IKKAYRK +L HPD+ GD K F ++ +AY+ L+D E R ++K G
Sbjct: 15 PPNASLEEIKKAYRKLALKYHPDRNPGDPKVAEEKFKEINEAYEILSDPERRAEYDKIG 73
>gnl|CDD|172776 PRK14288, PRK14288, chaperone protein DnaJ; Provisional.
Length = 369
Score = 47.4 bits (112), Expect = 2e-05
Identities = 22/55 (40%), Positives = 38/55 (69%), Gaps = 3/55 (5%)
Query: 14 ASEASIKKAYRKQSLILHPDKETGDEKA---FMKLTKAYQALTDEESRRNFEKYG 65
+++ +IKK+YRK +L HPD+ GD++A F + +AY L+DE+ R +++YG
Sbjct: 15 SNQETIKKSYRKLALKYHPDRNAGDKEAEEKFKLINEAYGVLSDEKKRALYDRYG 69
>gnl|CDD|172778 PRK14290, PRK14290, chaperone protein DnaJ; Provisional.
Length = 365
Score = 45.3 bits (107), Expect = 9e-05
Identities = 24/67 (35%), Positives = 41/67 (61%), Gaps = 5/67 (7%)
Query: 14 ASEASIKKAYRKQSLILHPDKETGD----EKAFMKLTKAYQALTDEESRRNFEKYGNPD- 68
AS+ IKKA+R+ + HPD G+ E+ F ++++AY+ L+D + RR +++ G D
Sbjct: 15 ASQEDIKKAFRELAKKWHPDLHPGNKAEAEEKFKEISEAYEVLSDPQKRRQYDQTGTVDF 74
Query: 69 GPGAMSF 75
G G +F
Sbjct: 75 GAGGSNF 81
>gnl|CDD|172788 PRK14300, PRK14300, chaperone protein DnaJ; Provisional.
Length = 372
Score = 45.4 bits (107), Expect = 1e-04
Identities = 22/55 (40%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Query: 14 ASEASIKKAYRKQSLILHPDK-ETGD-EKAFMKLTKAYQALTDEESRRNFEKYGN 66
AS+A +KKAY K + HPD + D EK F ++ AY L DE+ R ++++G+
Sbjct: 15 ASQADLKKAYLKLAKQYHPDTTDAKDAEKKFKEINAAYDVLKDEQKRAAYDRFGH 69
>gnl|CDD|237666 PRK14296, PRK14296, chaperone protein DnaJ; Provisional.
Length = 372
Score = 44.6 bits (105), Expect = 2e-04
Identities = 19/65 (29%), Positives = 35/65 (53%), Gaps = 2/65 (3%)
Query: 14 ASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPDGPG 71
ASE I++AYRK + HPD K +++ +A L D++ R+ ++++G+ G
Sbjct: 16 ASEQEIRQAYRKLAKQYHPDLNKSPDAHDKMVEINEAADVLLDKDKRKQYDQFGHAAFDG 75
Query: 72 AMSFG 76
+ F
Sbjct: 76 SSGFS 80
>gnl|CDD|237659 PRK14287, PRK14287, chaperone protein DnaJ; Provisional.
Length = 371
Score = 44.6 bits (105), Expect = 2e-04
Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 8/71 (11%)
Query: 14 ASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNPD--- 68
AS +KKAYRK + HPD K E F ++ +AY L+D + + +++++G+ D
Sbjct: 16 ASVDEVKKAYRKLARKYHPDVNKAPDAEDKFKEVKEAYDTLSDPQKKAHYDQFGHTDPNQ 75
Query: 69 ---GPGAMSFG 76
G GA FG
Sbjct: 76 GFGGGGAGDFG 86
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 43.7 bits (103), Expect = 3e-04
Identities = 21/167 (12%), Positives = 62/167 (37%)
Query: 395 KEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKK 454
++ + E ++++EKP KE+ + RP K K+ KK + +++ ++K + +
Sbjct: 111 EQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVR 170
Query: 455 ITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNK 514
+ ++ K ++ E+ + + E+ + N+E +++ +D
Sbjct: 171 AKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETT 230
Query: 515 SEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQSHSVHCPRFPEDKQE 561
+ +D+ + + K + + +
Sbjct: 231 TSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETR 277
Score = 31.4 bits (71), Expect = 2.5
Identities = 25/144 (17%), Positives = 42/144 (29%), Gaps = 20/144 (13%)
Query: 387 KVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHD 446
K + E E E E+E++ +E + KK + + ++ K
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKV----------EEPRDREEE-----KKRERVRAKSRPKKP 178
Query: 447 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 506
K K E+ + + K E + EE D D
Sbjct: 179 PKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVN-----EEREKEEDDGKDRETTTSP 233
Query: 507 KNEDSNNKSEESSDDDDDDWEKYQ 530
ED + +S E S +K
Sbjct: 234 MEEDESRQSSEISRRSSSSLKKPD 257
Score = 31.0 bits (70), Expect = 3.5
Identities = 25/182 (13%), Positives = 59/182 (32%), Gaps = 15/182 (8%)
Query: 392 EPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSG 451
+ + +E + +G ++ ++ + K K+ K +K + +K+
Sbjct: 85 SKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP- 143
Query: 452 QKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVEND------------ 499
++K ++ E P D + +K++ + + + + +
Sbjct: 144 KEKEKEKEKKVEEPRDREEEKKRERVRAKS--RPKKPPKKKPPNKKKEPPEEEKQRQAAR 201
Query: 500 EVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQSHSVHCPRFPEDK 559
E V K + D N + E+ DD D + S++S S P
Sbjct: 202 EAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPS 261
Query: 560 QE 561
Sbjct: 262 MA 263
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 42.5 bits (99), Expect = 0.001
Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 4/66 (6%)
Query: 14 ASEASIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEKYGNP--DG 69
A E IKKA+RK + HPD K F ++ +A L++ + R N++KYG+ D
Sbjct: 14 ADEQEIKKAFRKLAKKYHPDRNKAPDAASIFAEINEANDVLSNPKKRANYDKYGHDGVDR 73
Query: 70 PGAMSF 75
F
Sbjct: 74 EDDFDF 79
>gnl|CDD|222986 PHA03102, PHA03102, Small T antigen; Reviewed.
Length = 153
Score = 40.0 bits (94), Expect = 0.001
Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 1/46 (2%)
Query: 15 SEASIKKAYRKQSLILHPDKETGDEKAFMKLTKAYQALTDEESRRN 60
+ ++KAY ++ L HPDK GDE+ +L Y+ +
Sbjct: 20 NLPLMRKAYLRKCLEFHPDK-GGDEEKMKELNTLYKKFRESVKSLR 64
>gnl|CDD|184603 PRK14282, PRK14282, chaperone protein DnaJ; Provisional.
Length = 369
Score = 41.7 bits (98), Expect = 0.001
Identities = 16/59 (27%), Positives = 34/59 (57%), Gaps = 4/59 (6%)
Query: 11 PSGASEASIKKAYRKQSLILHPDKETGD----EKAFMKLTKAYQALTDEESRRNFEKYG 65
A++ IK+AY++ HPD+ + E+ F ++ +AY+ L+D + R ++++G
Sbjct: 13 SRNATQEEIKRAYKRLVKEWHPDRHPENRKEAEQKFKEIQEAYEVLSDPQKRAMYDRFG 71
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 41.0 bits (96), Expect = 0.003
Identities = 27/105 (25%), Positives = 41/105 (39%), Gaps = 24/105 (22%)
Query: 437 KKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDV 496
K K++H + G+++ + +E+T+ +D E +D SESE ESD D
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDD--------DEDECEAIED--SESESESDGEDG 682
Query: 497 ENDEVVDKKEKNEDSNNKSEESS--------------DDDDDDWE 527
E DE D E NE + D +D E
Sbjct: 683 EEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEE 727
Score = 29.8 bits (67), Expect = 8.6
Identities = 18/59 (30%), Positives = 30/59 (50%), Gaps = 3/59 (5%)
Query: 467 DVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
DV K ++ + + E EEE + E D+ D+ E EDS ++SE +D ++D
Sbjct: 630 DVLKADENKSRHQQL---FEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEED 685
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 40.2 bits (94), Expect = 0.004
Identities = 24/116 (20%), Positives = 41/116 (35%), Gaps = 18/116 (15%)
Query: 431 NKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKK---------DTPEESK 481
K K K K K KK ++ E+ D +K E+ + +S+
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202
Query: 482 DV---------SSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEK 528
D SS+ EE + + +E + E +E + SE S D ++
Sbjct: 203 DYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSP 258
Score = 31.3 bits (71), Expect = 2.7
Identities = 23/110 (20%), Positives = 43/110 (39%), Gaps = 26/110 (23%)
Query: 435 GKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLS 494
G + K K K KK+ KK ++ + +D E+ ++ +ESK ES E+
Sbjct: 141 GIETKAKKGKAKKKTKKSKK-----KEAKESSDKDDEEESESEDESKS--EESAEDDSDD 193
Query: 495 DVE-------------------NDEVVDKKEKNEDSNNKSEESSDDDDDD 525
+ E ++E ++ + + SE SD+ D +
Sbjct: 194 EEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSE 243
Score = 31.0 bits (70), Expect = 3.1
Identities = 18/73 (24%), Positives = 27/73 (36%), Gaps = 12/73 (16%)
Query: 478 EESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNK------------SEESSDDDDDD 525
ES + SE E +SD E KK K + +++ SE+ DDD+D
Sbjct: 235 SESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDI 294
Query: 526 WEKYQTGLNKRDK 538
+ K
Sbjct: 295 DPDQVVKKPVKRK 307
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 40.4 bits (95), Expect = 0.004
Identities = 27/133 (20%), Positives = 47/133 (35%)
Query: 393 PSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQ 452
+KE E E ++K I V+ KKK T K +
Sbjct: 33 ITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPA 92
Query: 453 KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSN 512
KK + A+ KD + D D ++D+ +D + ++D +
Sbjct: 93 KKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDD 152
Query: 513 NKSEESSDDDDDD 525
++ ++ DDDDD
Sbjct: 153 DEDDDEDDDDDDV 165
Score = 39.6 bits (93), Expect = 0.007
Identities = 23/174 (13%), Positives = 45/174 (25%), Gaps = 18/174 (10%)
Query: 384 DTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKN 443
+ IK K E + E + K K K
Sbjct: 36 EEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKK 95
Query: 444 KHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSE----------------- 486
D+ S +K + + K + D +
Sbjct: 96 LKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDED 155
Query: 487 SEEESDLSDVENDEVVDKKEKNEDSN-NKSEESSDDDDDDWEKYQTGLNKRDKV 539
+E+ D DV++++ K+ K + + + D+DD + + K
Sbjct: 156 DDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLTAT 209
Score = 35.4 bits (82), Expect = 0.14
Identities = 20/125 (16%), Positives = 35/125 (28%), Gaps = 2/125 (1%)
Query: 404 EEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTE 463
+ K G IT K + + + K ++ +
Sbjct: 25 AKSKSKGFITKEEIK-EALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAA 83
Query: 464 APADVAKVEKKDTPEESKD-VSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 522
A KK +E +E + D D N D ++ ++ D D
Sbjct: 84 KAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLD 143
Query: 523 DDDWE 527
DDD +
Sbjct: 144 DDDID 148
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 39.6 bits (92), Expect = 0.008
Identities = 31/147 (21%), Positives = 51/147 (34%), Gaps = 15/147 (10%)
Query: 393 PSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQ 452
+ GE +EE+ NK+K + K K+GG + D G+
Sbjct: 227 DESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDK----KGKRGGDDDADEYDSDDGDDEGR 282
Query: 453 KKI-----------TNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEV 501
++ E +P AK E + + + ++EEE LS
Sbjct: 283 EEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLK 342
Query: 502 VDKKEKNEDSNNKSEESSDDDDDDWEK 528
K +KN + S+ D DD D +
Sbjct: 343 KLKGKKNGLDKDDSDSGDDSDDSDIDG 369
Score = 35.3 bits (81), Expect = 0.14
Identities = 23/119 (19%), Positives = 46/119 (38%), Gaps = 7/119 (5%)
Query: 432 KKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEES 491
+ KKK K +KK K + D + + D +E ++ S+ +
Sbjct: 237 DEEKSKKKKKKLAKNKKKLDDDK----KGKRGGDDDADEYDSDDGDDEGREEDYISDSSA 292
Query: 492 DLSDVENDEVV---DKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQS 547
+D E E + K E ++ E S+++ ++ E + K+ K L+G+
Sbjct: 293 SGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351
Score = 30.7 bits (69), Expect = 4.2
Identities = 24/129 (18%), Positives = 48/129 (37%), Gaps = 17/129 (13%)
Query: 401 IEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVE 460
EE EE+ ++K + M K K D+++ G+K
Sbjct: 161 AEEAEEEMK-------RRKKTANGFQLMMM--KAAKNGPAAFGDEDEETEGEK------G 205
Query: 461 QTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSD 520
D+ + + E+ D S + E+ D + ++ + K KN+ + ++
Sbjct: 206 GGGRGKDLKIKDLEGDDEDDGDESDKGGEDGD--EEKSKKKKKKLAKNKKKLDDDKKGKR 263
Query: 521 DDDDDWEKY 529
DDD ++Y
Sbjct: 264 GGDDDADEY 272
>gnl|CDD|215527 PLN02976, PLN02976, amine oxidase.
Length = 1713
Score = 39.5 bits (92), Expect = 0.009
Identities = 33/163 (20%), Positives = 56/163 (34%), Gaps = 22/163 (13%)
Query: 394 SKEGEGEIE----EEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKS 449
SK G I+ ++++P G++ + K K K K +
Sbjct: 10 SKRGLKVIQIGDDSDDDEPIGSLFKL-----------------KRPKNSKKVKVGLESTG 52
Query: 450 SGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNE 509
++K++ E +E D +K K V S S S + +EN E VD NE
Sbjct: 53 KREEKLSALDEDSEGMDDTLASFRKRLKGPKKGVGSVSARMSQGALLENQEEVDTV-LNE 111
Query: 510 DSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQSHSVHC 552
S + + +D + + R KQ + V
Sbjct: 112 GSKRSLDGNLEDSLSAFLREAQAGLSRKSCANSCGKQLNRVQG 154
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 38.2 bits (89), Expect = 0.009
Identities = 16/98 (16%), Positives = 41/98 (41%), Gaps = 9/98 (9%)
Query: 440 FTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVEND 499
F + +++ ++ + E E ++ EE K+ ++ ++E + D
Sbjct: 35 FFPSSPSDQAAADEQEAKKSDDQET------AEIEEVKEEEKEAANSEDKEDKGDAEKED 88
Query: 500 EVV---DKKEKNEDSNNKSEESSDDDDDDWEKYQTGLN 534
E +++E E S+ +E+ + + + EK T +
Sbjct: 89 EESEEENEEEDEESSDENEKETEEKTESNVEKEITNPS 126
Score = 34.3 bits (79), Expect = 0.19
Identities = 16/89 (17%), Positives = 32/89 (35%), Gaps = 1/89 (1%)
Query: 428 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSES 487
+ + + KKS Q+ + E +K+D + K+
Sbjct: 34 LFFPSSPSDQAAADEQE-AKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE 92
Query: 488 EEESDLSDVENDEVVDKKEKNEDSNNKSE 516
EE + + +DE + E+ +SN + E
Sbjct: 93 EENEEEDEESSDENEKETEEKTESNVEKE 121
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 39.2 bits (91), Expect = 0.014
Identities = 50/260 (19%), Positives = 101/260 (38%), Gaps = 41/260 (15%)
Query: 451 GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED 510
G + ITNT+ + + ++A E + D S E ESD++ V D VV + E N D
Sbjct: 3823 GAEDITNTLNEDDDLEELANEEDTAN-QSDLDESEARELESDMNGVTKDSVVSENE-NSD 3880
Query: 511 SNNKSEESSDDDDDDWEKYQTGLNKR-------DKVLEGRSKQSHSVHC----------- 552
S ++++ ++ +D E LN++ + +LE K +
Sbjct: 3881 SEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKED 3940
Query: 553 -PRFPEDKQEFWWIYISDRKSRTLLTSPYHITELVDQEQVQLKFTAPRWPGVYTFSVC-- 609
+ EDK +++ ++ I + + Q +Q + P P +
Sbjct: 3941 DNKALEDKDR------QEKEDEEEMSDDVGIDDEI-QPDIQENNSQPP-PENEDLDLPED 3992
Query: 610 LRCDSYLGFDQ----MQDIKLDVKEAPEVPTEHPQWEMSGEEDEG------DEEMGGSDV 659
L+ D G ++D+ ++ + + + + E +ED DE++ D
Sbjct: 3993 LKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDF 4052
Query: 660 SEFTTDEDVEDDLKLEPCAQ 679
S+ D++ ++ E Q
Sbjct: 4053 SDLAEDDEKMNEDGFEENVQ 4072
Score = 36.5 bits (84), Expect = 0.085
Identities = 24/129 (18%), Positives = 50/129 (38%), Gaps = 4/129 (3%)
Query: 402 EEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQ 461
EEE + + + + + +W + + + K+ ++ + + + +
Sbjct: 3882 EEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDD 3941
Query: 462 TEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEK---NEDSNNKSEES 518
+A D + EK+D E S DV + E + D+ + N + + E ED +E
Sbjct: 3942 NKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQE-NNSQPPPENEDLDLPEDLKLDEKEG 4000
Query: 519 SDDDDDDWE 527
D D E
Sbjct: 4001 DVSKDSDLE 4009
Score = 33.4 bits (76), Expect = 0.78
Identities = 32/160 (20%), Positives = 61/160 (38%), Gaps = 5/160 (3%)
Query: 391 LEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKH---DK 447
LE E E EEE G + ++ +P + K K
Sbjct: 3945 LEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSK 4004
Query: 448 KSSGQKKITNTVEQTEAPADVAKVE--KKDTPEESKDVSSESEEESDLSDVENDEVVDKK 505
S + ++ + AD K E + + P E + E ++ D SD+ D+ +
Sbjct: 4005 DSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNE 4064
Query: 506 EKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSK 545
+ E++ ++EES++D E+ + G D+ ++ K
Sbjct: 4065 DGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPK 4104
Score = 30.0 bits (67), Expect = 7.8
Identities = 15/68 (22%), Positives = 31/68 (45%), Gaps = 1/68 (1%)
Query: 460 EQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESS 519
Q + +D+A+ ++K E+ + + + EES V++DE +++ E ED +
Sbjct: 4047 IQQDDFSDLAEDDEKMN-EDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKM 4105
Query: 520 DDDDDDWE 527
D
Sbjct: 4106 DAKSTFAS 4113
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 37.1 bits (86), Expect = 0.016
Identities = 20/85 (23%), Positives = 45/85 (52%), Gaps = 4/85 (4%)
Query: 447 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 506
KK +KK N +E + + +K+D +E++DV E +E+ + + E++E +++ E
Sbjct: 11 KKPEEEKKDENLLEHVKITS----WDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPE 66
Query: 507 KNEDSNNKSEESSDDDDDDWEKYQT 531
ED E+ ++++D+ +
Sbjct: 67 DIEDEEEIVEDEEEEEEDEEDNVDL 91
Score = 34.1 bits (78), Expect = 0.15
Identities = 20/95 (21%), Positives = 46/95 (48%), Gaps = 5/95 (5%)
Query: 434 GGKKKKFTKNKHDKKSSGQKKIT-----NTVEQTEAPADVAKVEKKDTPEESKDVSSESE 488
G +KK + K D+ KIT + +++ E D + + ++ EE ++ E E
Sbjct: 7 GAPEKKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPE 66
Query: 489 EESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDD 523
+ D ++ DE +++++ ++ + K E + +D
Sbjct: 67 DIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNIND 101
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 38.3 bits (89), Expect = 0.016
Identities = 25/142 (17%), Positives = 51/142 (35%), Gaps = 1/142 (0%)
Query: 384 DTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKN 443
+ + S + + +E V + + P KKK TK
Sbjct: 170 SIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKK-TKE 228
Query: 444 KHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVD 503
K +KK + + + E+ DV ++ P + E E + +++E +
Sbjct: 229 KKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETE 288
Query: 504 KKEKNEDSNNKSEESSDDDDDD 525
+KEK + K +D+D++
Sbjct: 289 EKEKEKRKRLKKMMEDEDEDEE 310
>gnl|CDD|222912 PHA02624, PHA02624, large T antigen; Provisional.
Length = 647
Score = 38.4 bits (90), Expect = 0.018
Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 11/60 (18%)
Query: 19 IKKAYRKQSLILHPDKETGDEKAFMKLTKAYQAL----------TDEESRRNFEKYGNPD 68
++KAY ++ HPDK GDE+ +L Y+ L + YG P+
Sbjct: 30 MRKAYLRKCKEYHPDK-GGDEEKMKRLNSLYKKLQEGVKSARQSFGTQDSSEIPTYGTPE 88
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 38.5 bits (90), Expect = 0.018
Identities = 24/103 (23%), Positives = 44/103 (42%), Gaps = 1/103 (0%)
Query: 447 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 506
KK+ G++K + EA + K E K+ +E K+ E E E + +
Sbjct: 417 KKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLK 476
Query: 507 KNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQSHS 549
++E + + EE D+++ W K T + + K+S S
Sbjct: 477 RSEKAQKEEEEEELDEENPWLK-TTSSVGKSAKKQDSKKKSSS 518
Score = 37.3 bits (87), Expect = 0.035
Identities = 26/134 (19%), Positives = 47/134 (35%), Gaps = 1/134 (0%)
Query: 392 EPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSG 451
E E E E E + K+ R + G K+ + K K + K+
Sbjct: 389 ENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRK-FGPENGEKEAESKKLKKENKNEF 447
Query: 452 QKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDS 511
++K + E+ + AKVEK + ++ EEE + D EN + + +
Sbjct: 448 KEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSA 507
Query: 512 NNKSEESSDDDDDD 525
+ + D
Sbjct: 508 KKQDSKKKSSSKLD 521
Score = 33.5 bits (77), Expect = 0.62
Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 8/131 (6%)
Query: 395 KEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKK 454
KE E + ++E K KEK+++ +++++ K +K + QK+
Sbjct: 432 KEAESKKLKKENKNEF-------KEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKE 484
Query: 455 ITNTVEQTEAPADVAKVE-KKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNN 513
E P K ++ S S+ + + + V KK+K ++ +
Sbjct: 485 EEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSI 544
Query: 514 KSEESSDDDDD 524
++ D++D
Sbjct: 545 DLDDDLIDEED 555
Score = 32.3 bits (74), Expect = 1.5
Identities = 30/158 (18%), Positives = 53/158 (33%), Gaps = 26/158 (16%)
Query: 383 DDTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPV--WMKNKKGGKKKK- 439
+ ++ + K E +EEEE+ P + K K K K
Sbjct: 464 EAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKA 523
Query: 440 -FTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDV-- 496
+K K +KK +++ + D K D +E E E++ +L +
Sbjct: 524 ANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDE------EDEDDEELPFLFK 577
Query: 497 ---------ENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
D+VV + EK K E ++D +
Sbjct: 578 QKDLIKEAFAGDDVVAEFEK-----EKKEVIEEEDPKE 610
Score = 32.0 bits (73), Expect = 1.8
Identities = 17/56 (30%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
Query: 470 KVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
K+E K EE +D S+SEEE + D ++D+ + K +E DD+++
Sbjct: 317 KIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPW-MLRKKLGKLKEGEDDEENS 371
Score = 29.6 bits (67), Expect = 9.3
Identities = 31/177 (17%), Positives = 66/177 (37%), Gaps = 39/177 (22%)
Query: 392 EPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSG 451
E ++ + + EEE+E + N WM KK GK K+ ++ +
Sbjct: 325 EEDEDEDSDSEEEDEDDDEDDDDGENP---------WMLRKKLGKLKEGEDDEENSGLLS 375
Query: 452 QKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE----- 506
K + + +K++ E +++ E E E + + EN+E K
Sbjct: 376 MKFMQRAEAR----------KKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKF 425
Query: 507 ---------------KNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQSH 548
K + K ++ SD++++ ++ + + K L RS+++
Sbjct: 426 GPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQ 482
>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.
The proteins in this family are poorly characterized,
but an investigation has indicated that the immediate
early protein is required the down-regulation of MHC
class I expression in dendritic cells. Human herpesvirus
6 immediate early protein is also referred to as U90.
Length = 993
Score = 38.5 bits (89), Expect = 0.020
Identities = 37/128 (28%), Positives = 55/128 (42%), Gaps = 10/128 (7%)
Query: 430 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 489
K KKK + N+ KK + ++K TV + K + T +SSESE+
Sbjct: 643 TFYKRDKKKDISTNRKVKKRTAKRK---TVGYKTDKSKKIKSDSLPTDTNVIVISSESED 699
Query: 490 ESDLSDVENDEVVDKK--EKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQS 547
E D ++ K +K S KSE SS+ DD E + L+ DKV+ + +S
Sbjct: 700 EED-----GFNIIKKSQLKKKIKSELKSESSSESDDCTSEDNELHLSDYDKVINNGNCES 754
Query: 548 HSVHCPRF 555
P F
Sbjct: 755 KGFPSPVF 762
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or more
acidic residues. The function of members of this family
is unknown.
Length = 104
Score = 35.4 bits (82), Expect = 0.024
Identities = 18/64 (28%), Positives = 29/64 (45%), Gaps = 2/64 (3%)
Query: 462 TEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDD 521
E P +VAK E E EE+ D +++D+ D + D ++ ++ D
Sbjct: 34 EEVPPEVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDD--DDVDLD 91
Query: 522 DDDD 525
DDDD
Sbjct: 92 DDDD 95
Score = 27.7 bits (62), Expect = 9.9
Identities = 15/64 (23%), Positives = 28/64 (43%), Gaps = 12/64 (18%)
Query: 462 TEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDD 521
A +K + E+ DV + +++ D ++D++ D + + D DD
Sbjct: 45 APAADAEDAAKKDEDEEDEDDVVLDDDDD----DDDDDDLPDLDDDDVD--------LDD 92
Query: 522 DDDD 525
DDDD
Sbjct: 93 DDDD 96
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 38.0 bits (88), Expect = 0.032
Identities = 27/123 (21%), Positives = 36/123 (29%), Gaps = 7/123 (5%)
Query: 686 PAAAPHAGRLMKNNSTNSSDRLRDLCSVGR----RRVLSESFSTTQERPIL-PRPTPGAP 740
P+ A + + +R RD + GR RR + P PR A
Sbjct: 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRR-RAAR 2689
Query: 741 IMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPTP 800
V TS+ T P + P+PP P PAP P
Sbjct: 2690 PTVGSLTSLADPPPPPPTP-EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGP 2748
Query: 801 IAP 803
P
Sbjct: 2749 ATP 2751
Score = 35.7 bits (82), Expect = 0.14
Identities = 19/122 (15%), Positives = 28/122 (22%), Gaps = 7/122 (5%)
Query: 686 PAAAPHAGRLMKNNSTNSSDRLRDLCSVGRRRVLSESFSTTQERPILPRPTPGAPIMVAP 745
P + S S D + + P P P P + AP
Sbjct: 2780 PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP 2839
Query: 746 TTSVQFLNGSKATQFVFVPANSL-------QPVPIPPAKKRGYVKQKPPPILPAPAPTVP 798
S P + P P A R V++ P + +
Sbjct: 2840 PPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
Query: 799 TP 800
P
Sbjct: 2900 LP 2901
Score = 34.1 bits (78), Expect = 0.38
Identities = 17/91 (18%), Positives = 24/91 (26%), Gaps = 3/91 (3%)
Query: 713 VGRRRVLSESFSTTQERPILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVP 772
G R + P P P +V L+ S+ + + P
Sbjct: 2752 GGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES-LPSPWDPADPPAA 2810
Query: 773 IPPAKKRGYVKQKPPPILPAPAPTVPTPIAP 803
+ P P P PT P AP
Sbjct: 2811 VLAPAAALPPAASPAG--PLPPPTSAQPTAP 2839
Score = 33.8 bits (77), Expect = 0.65
Identities = 19/81 (23%), Positives = 29/81 (35%), Gaps = 7/81 (8%)
Query: 730 PILPRPTPGAPIMVA------PTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVK 783
+ RP +P P + S++T+ +P + + P P A
Sbjct: 2861 DVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPP-QP 2919
Query: 784 QKPPPILPAPAPTVPTPIAPE 804
Q PP P P P P P P+
Sbjct: 2920 QPQPPPPPQPQPPPPPPPRPQ 2940
Score = 30.7 bits (69), Expect = 4.7
Identities = 30/124 (24%), Positives = 36/124 (29%), Gaps = 13/124 (10%)
Query: 685 RPAAAPHAGRLMKNNSTNSSDRLRDLCSVG---RRRVLSESFS--TTQERPILPRPTPGA 739
RP P GR+ + R S RRR + T+ P P PTP
Sbjct: 2651 RPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP 2710
Query: 740 PIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPT 799
P V + +L P PPA G P PA PT
Sbjct: 2711 A----PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPAT----PGGPARPARPPT 2762
Query: 800 PIAP 803
P
Sbjct: 2763 TAGP 2766
Score = 29.9 bits (67), Expect = 7.7
Identities = 26/120 (21%), Positives = 42/120 (35%), Gaps = 24/120 (20%)
Query: 727 QERPILPRPTPGAP-IMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQK 785
+P P P P P +APTT PA + +P P G +
Sbjct: 2926 PPQPQPPPPPPPRPQPPLAPTTD---------------PAGAGEPSGAVPQPWLGALVPG 2970
Query: 786 PPPI----LPAPAPTVPTPIAPEFLWSRESKMSIANRIVSEATTMFLETMLQPDILSYIQ 841
+ +P PAP+ P + + +R+ S A+++ L P +S Q
Sbjct: 2971 RVAVPRFRVPQPAPSREAPASS----TPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQ 3026
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 37.3 bits (87), Expect = 0.037
Identities = 24/82 (29%), Positives = 39/82 (47%), Gaps = 3/82 (3%)
Query: 467 DVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDW 526
D +E++ E D + EE D D E+DE ++ K+ED ++ DDDDDD
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDE---DDDDDDDDDDI 158
Query: 527 EKYQTGLNKRDKVLEGRSKQSH 548
+ L +R + E K++
Sbjct: 159 ATRERSLERRRRRREWEEKRAE 180
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 36.9 bits (85), Expect = 0.037
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 5/54 (9%)
Query: 14 ASEASIKKAYRKQSLILHPDKET-----GDEKAFMKLTKAYQALTDEESRRNFE 62
A I KA++K+ HPDK G ++ F + KA + L D + R ++
Sbjct: 58 AIPPQILKAHKKKVYKYHPDKTAAGGNKGCDEFFKLIQKAREVLGDRKLRLQYD 111
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 36.6 bits (85), Expect = 0.044
Identities = 15/54 (27%), Positives = 32/54 (59%)
Query: 475 DTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEK 528
D+ +E + + ES++E + SD E++E D+ K ++ E S +D+++ E+
Sbjct: 111 DSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEE 164
Score = 29.2 bits (66), Expect = 8.9
Identities = 21/82 (25%), Positives = 39/82 (47%)
Query: 447 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 506
KK G++++ + + E + E+K E S + ++E + +VE DE D +
Sbjct: 57 KKGYGEEEVVDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEG 116
Query: 507 KNEDSNNKSEESSDDDDDDWEK 528
+ D + E S D +D+ EK
Sbjct: 117 EWIDVESDKEIESSDSEDEEEK 138
>gnl|CDD|182347 PRK10266, PRK10266, curved DNA-binding protein CbpA; Provisional.
Length = 306
Score = 36.3 bits (84), Expect = 0.052
Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 18 SIKKAYRKQSLILHPD--KETGDEKAFMKLTKAYQALTDEESRRNFEK 63
+IK AYR+ + HPD KE E F ++ +A++ L+DE+ R +++
Sbjct: 20 TIKTAYRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAEYDQ 67
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 36.9 bits (86), Expect = 0.055
Identities = 20/113 (17%), Positives = 42/113 (37%), Gaps = 2/113 (1%)
Query: 413 TPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVE 472
V + ++ G K TK K K + + ++ ++ A+ +
Sbjct: 1278 VSAVQYSPPPPSKRPDGESNGGSKPSSPTKKK--VKKRLEGSLAALKKKKKSEKKTARKK 1335
Query: 473 KKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
K T + S S ++D + + +E +++ E+ DD+DDD
Sbjct: 1336 KSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 36.7 bits (85), Expect = 0.059
Identities = 38/132 (28%), Positives = 63/132 (47%), Gaps = 13/132 (9%)
Query: 428 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKK-DTPEESKDV--- 483
W+K K +KKK + K ++K E T KV K + EE +DV
Sbjct: 94 WLKKSKKRQKKKEAERKKALLLDEKEK-ERAAEYTSEDLAGLKVGHKVEEFEEGEDVILT 152
Query: 484 -----SSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTG-LNKRD 537
E E+E D +EN E+V+K++ ++ K ++ D DDD + + L+K D
Sbjct: 153 LKDTGVLEDEDEGDE--LENVELVEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYD 210
Query: 538 KVLEGRSKQSHS 549
+ +EG+ K+S +
Sbjct: 211 EEIEGKKKKSDN 222
Score = 30.1 bits (68), Expect = 7.0
Identities = 19/87 (21%), Positives = 30/87 (34%), Gaps = 10/87 (11%)
Query: 454 KITNTVE-----QTEAPADVAKVEKKDTPEESKDVSSESEEE--SDLSDVENDEVVDKKE 506
I T E Q E + + + + E S + +EE DVE V + +E
Sbjct: 392 VIDETSEFVRSLQKEPLEEKPENKDESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEE 451
Query: 507 KNEDSNNKSEESSDDDDDDWEKYQTGL 533
K E+ E ++ GL
Sbjct: 452 KEEEDK---EAIPSTILEEEPTVGGGL 475
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 36.5 bits (85), Expect = 0.063
Identities = 24/83 (28%), Positives = 42/83 (50%), Gaps = 2/83 (2%)
Query: 472 EKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQT 531
++D EE V E EE+ D E +E VD +E+ + + E+S D+DD++ E+ +
Sbjct: 319 GEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVD--LSDEEEDEEDEDSDDEDDEEEEEEEK 376
Query: 532 GLNKRDKVLEGRSKQSHSVHCPR 554
K+ RS+ + CP+
Sbjct: 377 EKKKKKSAESTRSELPFTFPCPK 399
Score = 31.5 bits (72), Expect = 2.6
Identities = 13/31 (41%), Positives = 18/31 (58%)
Query: 641 WEMSGEEDEGDEEMGGSDVSEFTTDEDVEDD 671
GEEDE +EE G D E D+D+E++
Sbjct: 315 GLGQGEEDEEEEEDGVDDEDEEDDDDDLEEE 345
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 36.0 bits (83), Expect = 0.093
Identities = 13/76 (17%), Positives = 18/76 (23%), Gaps = 4/76 (5%)
Query: 728 ERPILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPP 787
P P P A A ++ + +L A+ G P
Sbjct: 397 PAPAAPPAAPAAAPAAAAAARAVAAAPARRSP----APEALAAARQASARGPGGAPAPAP 452
Query: 788 PILPAPAPTVPTPIAP 803
APA A
Sbjct: 453 APAAAPAAAARPAAAG 468
>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
terminus. The C terminus of the plasma membrane Nha1
antiporter plays an important role in the immediate cell
response to hypo-osmotic shock which prevents an
execessive loss of ions and water. This domain is found
with pfam00999.
Length = 430
Score = 35.6 bits (82), Expect = 0.12
Identities = 25/134 (18%), Positives = 41/134 (30%), Gaps = 19/134 (14%)
Query: 400 EIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTV 459
+I + PVV++ + G K K ++ K
Sbjct: 269 KINPHGKPERDRSNPVVSR----------ALSWVGLKGTKHD-FIKQIRNLDPKARREVD 317
Query: 460 EQTEAPADVAKVEKKDTPE----ESKDVSSESEEESDLSD----VENDEVVDKKEKNEDS 511
+T AP V + + D SE + E DL+ E E +E + S
Sbjct: 318 LETNAPERVKAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSLLGSEASERRKNREPSSSS 377
Query: 512 NNKSEESSDDDDDD 525
S D +D+
Sbjct: 378 PPSRPRSRRDSEDE 391
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 35.8 bits (82), Expect = 0.12
Identities = 23/86 (26%), Positives = 34/86 (39%), Gaps = 6/86 (6%)
Query: 442 KNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEV 501
K ++K E KV K+ ++ + + ES++SDVE E
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEF 167
Query: 502 VDKKEKNEDSNNKSEESSDDDDDDWE 527
V E N+SEE D + DD E
Sbjct: 168 VTSLE------NESEEELDLEKDDGE 187
Score = 32.3 bits (73), Expect = 1.5
Identities = 24/117 (20%), Positives = 38/117 (32%), Gaps = 9/117 (7%)
Query: 405 EEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEA 464
E P N K K K+ TK + S ++ + V +A
Sbjct: 53 ESALAVDEEPDENGAV--------SKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDA 104
Query: 465 PADVAKVEKKD-TPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSD 520
D + T ++ SS+ EEE V V K +++ + E SD
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSD 161
Score = 30.8 bits (69), Expect = 3.5
Identities = 30/116 (25%), Positives = 48/116 (41%), Gaps = 8/116 (6%)
Query: 431 NKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESK-------DV 483
NK G++ + +NK + +KKI + + E P + V KK K
Sbjct: 29 NKFRGERWR-LQNKFLLAAGSRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVE 87
Query: 484 SSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKV 539
SE EE V D +DK+ K + + ++ D + EK + + KR KV
Sbjct: 88 ISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKV 143
>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
and diverse group of secondary transporters that
includes uniporters, symporters, and antiporters. MFS
proteins facilitate the transport across cytoplasmic or
internal membranes of a variety of substrates including
ions, sugar phosphates, drugs, neurotransmitters,
nucleosides, amino acids, and peptides. They do so using
the electrochemical potential of the transported
substrates. Uniporters transport a single substrate,
while symporters and antiporters transport two
substrates in the same or in opposite directions,
respectively, across membranes. MFS proteins are
typically 400 to 600 amino acids in length, and the
majority contain 12 transmembrane alpha helices (TMs)
connected by hydrophilic loops. The N- and C-terminal
halves of these proteins display weak similarity and may
be the result of a gene duplication/fusion event. Based
on kinetic studies and the structures of a few bacterial
superfamily members, GlpT (glycerol-3-phosphate
transporter), LacY (lactose permease), and EmrD
(multidrug transporter), MFS proteins are thought to
function through a single substrate binding site,
alternating-access mechanism involving a rocker-switch
type of movement. Bacterial members function primarily
for nutrient uptake, and as drug-efflux pumps to confer
antibiotic resistance. Some MFS proteins have medical
significance in humans such as the glucose transporter
Glut4, which is impaired in type II diabetes, and
glucose-6-phosphate transporter (G6PT), which causes
glycogen storage disease when mutated.
Length = 352
Score = 35.4 bits (82), Expect = 0.14
Identities = 16/92 (17%), Positives = 32/92 (34%), Gaps = 14/92 (15%)
Query: 73 MSFGIALPSYIVEKENSV----WVLGLYALVFMVALPTAVGMWWYKSIRYTGDKVLLETI 128
ALP + S ++ ++L + + + + S R+ +VLL +
Sbjct: 16 GLLSPALPLLAEDLGLSASQAGLIVSAFSLGYALGSL--LAGYL--SDRFGRRRVLLLGL 71
Query: 129 ------NFYYAFFQITPHMALKRVLMILGASL 154
+ AF + + R L+ LG
Sbjct: 72 LLFALGSLLLAFASSLWLLLVGRFLLGLGGGA 103
>gnl|CDD|178135 PLN02520, PLN02520, bifunctional 3-dehydroquinate
dehydratase/shikimate dehydrogenase.
Length = 529
Score = 34.7 bits (80), Expect = 0.21
Identities = 24/87 (27%), Positives = 44/87 (50%), Gaps = 11/87 (12%)
Query: 570 RKSRTLLTSPYHITELVDQEQVQLKFTAPRWPGVYTFSVCLRCDSYLGFDQMQDIKLDVK 629
R++ TL+ P + + VD+ +++ + V +R D F+ +D+K +K
Sbjct: 19 RRNPTLICVPI-MADSVDKMLIEMAKAKELGADL----VEIRLDFLKNFNPREDLKTLIK 73
Query: 630 EAPEVP---TEHPQWEMSGEEDEGDEE 653
++P +P T P+WE G + EGDE
Sbjct: 74 QSP-LPTLVTYRPKWE--GGQYEGDEN 97
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 33.2 bits (76), Expect = 0.31
Identities = 12/71 (16%), Positives = 24/71 (33%)
Query: 453 KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSN 512
KK T + + K +K+ S+ E + + + + K +N +
Sbjct: 52 KKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAE 111
Query: 513 NKSEESSDDDD 523
E+ D D
Sbjct: 112 VNHEDVIDLSD 122
>gnl|CDD|234474 TIGR04113, cas_csx17, CRISPR-associated protein Csx17, subtype
Dpsyc. Members of this protein family are found
exclusively in CRISPR-associated (cas) type I system
gene clusters of the Dpsyc subtype. Markers for that
type include a variant form of cas3 (model TIGR02621)
and the GSU0054-like protein family (model TIGR02165).
This family occurs in less than half of known Dpsyc
clusters.
Length = 704
Score = 34.3 bits (79), Expect = 0.32
Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 6/52 (11%)
Query: 703 SSDRLRDLCSVGRRRVLSESFSTTQERPILPRPTPGAPIMVAPTTSVQFLNG 754
D R+L +V RR+L E+ +RP G P + + QFL G
Sbjct: 516 GGDLARNLAAVLERRLLEEARKDNDDRPQ------GGPCPASLSDIAQFLRG 561
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 34.1 bits (79), Expect = 0.33
Identities = 24/89 (26%), Positives = 39/89 (43%), Gaps = 13/89 (14%)
Query: 445 HDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDK 504
KK++ KKI VE+ E + K EKK K E EEE +K
Sbjct: 404 GSKKAT--KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE-----------KEK 450
Query: 505 KEKNEDSNNKSEESSDDDDDDWEKYQTGL 533
KE+ ++ + E +++++ +K Q L
Sbjct: 451 KEEEKEEEEEEAEEEKEEEEEKKKKQATL 479
Score = 32.6 bits (75), Expect = 1.1
Identities = 19/72 (26%), Positives = 34/72 (47%)
Query: 436 KKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSD 495
KK TK +KK ++ + A K ++++ EE + E EEE + ++
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 496 VENDEVVDKKEK 507
E +E +KK+K
Sbjct: 464 EEKEEEEEKKKK 475
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 33.1 bits (76), Expect = 0.34
Identities = 17/61 (27%), Positives = 32/61 (52%), Gaps = 4/61 (6%)
Query: 469 AKVEKKDTPEESKDVSSESEEE----SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 524
AK +KK+ EE + V E EE+ + + DK +K++ ++KSE+ + + +
Sbjct: 60 AKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119
Query: 525 D 525
D
Sbjct: 120 D 120
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 33.5 bits (77), Expect = 0.35
Identities = 24/142 (16%), Positives = 49/142 (34%), Gaps = 36/142 (25%)
Query: 423 THRPVWM----KNKKGGKKKK-------------FTKNKHDKKSSG------QKKITNTV 459
HRP + K ++GG + TK K+ + G ++ + +
Sbjct: 4 AHRPTFDPARGKEEQGGNRLSGPSSIYSSRDLPAHTKLKYRQPGQGTEDELRKRDLRAEL 63
Query: 460 EQTEAPADVAKVEKK-----DTPEESKDVSSESEEESD--------LSDVENDEVVDKKE 506
E+ E K K D + E++ + + + + D +
Sbjct: 64 EEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSD 123
Query: 507 KNEDSNNKSEESSDDDDDDWEK 528
+ DS++ ++S DDD +D
Sbjct: 124 SSSDSDSSDDDSDDDDSEDETA 145
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 34.1 bits (78), Expect = 0.38
Identities = 24/118 (20%), Positives = 48/118 (40%), Gaps = 2/118 (1%)
Query: 434 GGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDL 493
+K+ +N DK+ S + +V E ++ + + D S+ E E E ++
Sbjct: 21 ESSQKRIQQNNGDKEDSSTSTSSLSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEI 80
Query: 494 SDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE-KYQTGL-NKRDKVLEGRSKQSHS 549
+N V K + S + DDD + K + N R + +E ++++ S
Sbjct: 81 EVEQNQSDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENLRKREIEELAEENFS 138
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 34.0 bits (78), Expect = 0.40
Identities = 24/117 (20%), Positives = 32/117 (27%), Gaps = 10/117 (8%)
Query: 687 AAAPHAGRLMKNNSTNSSDRLRDLCSVGRRRVLSESFSTTQERPILPRPTPGAPIMVAPT 746
A A GR T + V + + ++ E P P M A T
Sbjct: 617 AQARTFGRA-----TVGEMIISGFPPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHT 671
Query: 747 TSVQFL---NGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPTP 800
+ + + T P P G PPP LPA A P
Sbjct: 672 ARPSRVARGDPVRPTAHHAAL--RAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCG 726
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 33.7 bits (77), Expect = 0.50
Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 2/99 (2%)
Query: 463 EAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 522
E+ + + E+++ EE ++ ESEEE + E +EV E+ SE D +
Sbjct: 438 ESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGE 497
Query: 523 DDDWEKYQTGLNKRDKVLEGRSKQSHSVHCPRFPEDKQE 561
+ + + + +Q PE QE
Sbjct: 498 EPEEDAERRNSEMAGISRMSEGQQPRGS--SVQPESPQE 534
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 33.6 bits (77), Expect = 0.51
Identities = 24/152 (15%), Positives = 56/152 (36%), Gaps = 7/152 (4%)
Query: 390 FLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKS 449
E +E E EI+ ++ + N + ++K + + + K K + D K+
Sbjct: 355 LSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDI--EGKSDTKNLEEGFETKDNKN 412
Query: 450 SGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNE 509
I T ++ +K T ++ E+ +E + + ND + + +
Sbjct: 413 KNSSFINKTENILTNSPLKDELLEKTTEI----INIENPQEFEFGQIGNDIISTEIAQL- 467
Query: 510 DSNNKSEESSDDDDDDWEKYQTGLNKRDKVLE 541
D N ++ + D ++ +K+ E
Sbjct: 468 DENQNLIDTGEFDLENNFSNSFNPENGNKIDE 499
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 32.8 bits (75), Expect = 0.51
Identities = 17/53 (32%), Positives = 28/53 (52%), Gaps = 1/53 (1%)
Query: 475 DTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE 527
T EE S E L ++E ++V ++ EK+E+ + EE D+D DD +
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEE-EEEEEEEEDEDFDDDD 193
Score = 30.5 bits (69), Expect = 3.0
Identities = 27/108 (25%), Positives = 48/108 (44%), Gaps = 11/108 (10%)
Query: 425 RPVWMKNKKGGKK---KKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESK 481
V NKK GKK KF K K + ++ I + E K+++ + + +
Sbjct: 117 YSVMGINKKAGKKLALSKF-KRKVGLFTEEEEDIDEKLSMLEK-----KLKELEAEDVDE 170
Query: 482 DVSSESEEESDLSDVENDEVVDKKEKNEDSNNKS--EESSDDDDDDWE 527
+ + EEE + + + D D + ++D N ++ + DDD DD E
Sbjct: 171 EDEKDEEEEEEEEEEDEDFDDDDDDDDDDYNAENYFDNGEDDDYDDDE 218
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 33.5 bits (76), Expect = 0.63
Identities = 31/128 (24%), Positives = 51/128 (39%), Gaps = 7/128 (5%)
Query: 463 EAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVV----DKKEKNEDSNNKSEES 518
E + A V K + ++K E EE+ +D E+ D KE + + EES
Sbjct: 373 EQTSKKAAVPKGTSFYQAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEES 432
Query: 519 SDDDDDDWEKYQTGLNKRDKVLEGRSKQSHSVHCPRFPEDKQEFWWI--YISDRKSRTLL 576
+ DD++ +E+ +R ++ E R + P E + I Y R R L
Sbjct: 433 AIDDNEGFEELSPEEEER-QLREFRDMEKEDREFPDEAELQPSESAIERYKEYRGLRNLY 491
Query: 577 TSPYHITE 584
T + E
Sbjct: 492 TCSWRSDE 499
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 33.5 bits (76), Expect = 0.64
Identities = 26/98 (26%), Positives = 45/98 (45%), Gaps = 2/98 (2%)
Query: 442 KNKHDKKSSGQ--KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVEND 499
+ KH +K Q KI ++ E+ A D ++E D S V+SE +EE D + ++
Sbjct: 5 QGKHQRKGKKQLENKILHSYEEESAGFDSEELEDNDEQGYSFGVNSEDDEEIDSDEAFDE 64
Query: 500 EVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRD 537
E + + +KS +S+ D + + LN D
Sbjct: 65 EDEKRFADWSFNASKSGKSNKDHKNLNNTKEISLNDSD 102
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 33.6 bits (76), Expect = 0.69
Identities = 28/140 (20%), Positives = 57/140 (40%), Gaps = 9/140 (6%)
Query: 384 DTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKN 443
D K + E K E + + EE K K+ ++ + + +
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKAD---AAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361
Query: 444 KHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVD 503
+K + +KK + +A A K E+K +E+K + E ++++D E +
Sbjct: 1362 AEEKAEAAEKKKEE--AKKKADAAKKKAEEKKKADEAKKKAEEDKKKAD----ELKKAAA 1415
Query: 504 KKEKNEDSNNKSEESSDDDD 523
K+K +++ K+EE D+
Sbjct: 1416 AKKKADEAKKKAEEKKKADE 1435
Score = 30.9 bits (69), Expect = 3.7
Identities = 33/143 (23%), Positives = 55/143 (38%), Gaps = 3/143 (2%)
Query: 384 DTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKN 443
D K + E K E E E + E + + K KKK K
Sbjct: 1332 DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK 1391
Query: 444 KHD---KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDE 500
K D KK+ KK + +++ A A KK E+ K ++ + E E +
Sbjct: 1392 KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKK 1451
Query: 501 VVDKKEKNEDSNNKSEESSDDDD 523
++ +K E++ K+EE+ D+
Sbjct: 1452 KAEEAKKAEEAKKKAEEAKKADE 1474
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 32.9 bits (75), Expect = 0.73
Identities = 20/60 (33%), Positives = 30/60 (50%)
Query: 466 ADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
AD E +D P+E +D EEES SD +++ E+ E ++ E+S DD D
Sbjct: 219 ADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPD 278
Score = 30.6 bits (69), Expect = 3.1
Identities = 16/85 (18%), Positives = 37/85 (43%), Gaps = 8/85 (9%)
Query: 446 DKKSSGQKKITNTVEQTEAPADVAK--VEKKDTPEESKDVSSESEEE------SDLSDVE 497
DK ++ V+ +A A V + + D EE D ++ E D +
Sbjct: 176 DKAGKDLDRLAACVDDQQAFARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDED 235
Query: 498 NDEVVDKKEKNEDSNNKSEESSDDD 522
+D+ +++ + DS ++ ++S ++
Sbjct: 236 DDQGEEEESGSSDSLSEDSDASSEE 260
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 32.4 bits (74), Expect = 0.74
Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 2/80 (2%)
Query: 447 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLS-DVENDEVVDKK 505
K +K+ T T + K +K+ EE K S+ ++ + D +N+E
Sbjct: 48 PKQPKKKRPT-TPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDS 106
Query: 506 EKNEDSNNKSEESSDDDDDD 525
+++ +N+ S+ ++DD DD
Sbjct: 107 QESASANSLSDIDNEDDMDD 126
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 32.9 bits (75), Expect = 0.81
Identities = 30/180 (16%), Positives = 67/180 (37%), Gaps = 17/180 (9%)
Query: 386 IKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKH 445
++ +L+ + E +E+ +G + K + + KK F
Sbjct: 29 LRSNYLQSTVEDIEGPSAVDERTSGVL-RDEGKHANILYNSILCNQKK---HASFLN--- 81
Query: 446 DKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKK 505
Q+K + + E + +D + S ++++S L + DE D +
Sbjct: 82 ------QRKSLDDDDDDEFDFLY--EDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQ 133
Query: 506 EKNEDSNNKSEESSDDDD-DDWEKYQTGLNKRDKVLEGRSKQSHSV-HCPRFPEDKQEFW 563
N+ +N+ ++ S + D K L+ +K++E + Q+ + F D +E
Sbjct: 134 ANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEAA 193
Score = 31.0 bits (70), Expect = 4.1
Identities = 13/71 (18%), Positives = 25/71 (35%), Gaps = 9/71 (12%)
Query: 471 VEKKDTPEESK----DVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDW 526
+E+K E+ + ++ EE + D K N+ + D D +
Sbjct: 169 IEEKKGQTENTFKFYNFGNDGEEAA-----AKDGGKSKSSDPGPLNDSDGQGDDGDPESA 223
Query: 527 EKYQTGLNKRD 537
E+ + N R
Sbjct: 224 EEDKAASNTRA 234
>gnl|CDD|226907 COG4530, COG4530, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 129
Score = 31.0 bits (70), Expect = 0.92
Identities = 15/42 (35%), Positives = 22/42 (52%)
Query: 630 EAPEVPTEHPQWEMSGEEDEGDEEMGGSDVSEFTTDEDVEDD 671
EV E+ + E+ ED D+ GG D+ + DEDV+ D
Sbjct: 57 VVKEVDAENEEVEVVSLEDADDDPKGGDDLPDLGDDEDVDLD 98
Score = 30.2 bits (68), Expect = 2.2
Identities = 18/63 (28%), Positives = 30/63 (47%), Gaps = 6/63 (9%)
Query: 469 AKVEKKDTPEESKDVSSESEEE------SDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 522
V++ D E +V S + + DL D+ +DE VD + ++D+ + EE DDD
Sbjct: 56 EVVKEVDAENEEVEVVSLEDADDDPKGGDDLPDLGDDEDVDLDDDDDDTFLEDEEDDDDD 115
Query: 523 DDD 525
Sbjct: 116 VSG 118
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that
may also include truncated forms and/or pseudogenes.
Members are predicted lipoproteins with a conserved
signal peptidase II processing and lipid attachment
site. Note that the name for certain characterized
members, p72, reflects an anomalous apparent molecular
weight, given a theoretical MW of about 61 kDa.
Length = 541
Score = 32.9 bits (75), Expect = 0.92
Identities = 17/96 (17%), Positives = 38/96 (39%), Gaps = 1/96 (1%)
Query: 430 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 489
KK K K KK ++ N ++ + PE+ D S+++E
Sbjct: 34 PEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDP-SKNKE 92
Query: 490 ESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
E + E + K + ++ +N +++ +++ D
Sbjct: 93 EIEKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDF 128
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 31.1 bits (70), Expect = 0.94
Identities = 10/70 (14%), Positives = 26/70 (37%)
Query: 458 TVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEE 517
A A+ ++ ++ + + E+ E + + +E D+ +E
Sbjct: 59 EKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEA 118
Query: 518 SSDDDDDDWE 527
D+++D E
Sbjct: 119 EEADEEEDEE 128
Score = 29.5 bits (66), Expect = 3.8
Identities = 15/72 (20%), Positives = 34/72 (47%), Gaps = 6/72 (8%)
Query: 460 EQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSD-VENDEVVDKKEKNEDSNNKSEES 518
A A+ A+ + +E D ++E++ + + E DE VD E ++ E+
Sbjct: 64 AAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVD-----ETADEADAEA 118
Query: 519 SDDDDDDWEKYQ 530
+ D+++ E+ +
Sbjct: 119 EEADEEEDEEAE 130
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 32.6 bits (74), Expect = 0.96
Identities = 29/138 (21%), Positives = 42/138 (30%), Gaps = 25/138 (18%)
Query: 671 DLKLEPCAQLMLYLRPAAAPHAG----RLMKNNSTNSSDRLRDLCSVGRRRVLSES--FS 724
L L+ Q +L R AA G R+ +TN+S L + L+
Sbjct: 120 TLTLDLARQPLLRARGAARAVVGLYVLRVWVEGATNASLFPLGLAAFPAEGTLAAPPLGE 179
Query: 725 TTQERPILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQ 784
+ + P AP + P VFVPA P P + +
Sbjct: 180 GSADGSCDPALPLSAP-RLGPAD-------------VFVPA-----TPRPTPRTTASPET 220
Query: 785 KPPPILPAPAPTVPTPIA 802
P P P+ P
Sbjct: 221 TPTPSTTTSPPSTTIPAP 238
>gnl|CDD|225677 COG3135, BenE, Uncharacterized protein involved in benzoate
metabolism [Secondary metabolites biosynthesis,
transport, and catabolism].
Length = 402
Score = 32.3 bits (74), Expect = 1.0
Identities = 13/46 (28%), Positives = 20/46 (43%), Gaps = 9/46 (19%)
Query: 787 PPILPAPAPTVPTPIAPEFLWSRESKMSIANRIVSEATTMFLETML 832
L A + PT + PEF ++ ++S A +FL TM
Sbjct: 200 HTALVALEISTPTWVTPEFSFA---------AMLSLALPLFLVTMA 236
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 32.7 bits (74), Expect = 1.0
Identities = 25/95 (26%), Positives = 36/95 (37%), Gaps = 9/95 (9%)
Query: 431 NKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEE 490
KK K NK S E E A V+ + E+ D S E
Sbjct: 663 TKKTADGKGKKSNKASFDSDD--------EMDENEIWSALVKSRPDVEDDSDDSELDFAE 714
Query: 491 SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
D SD +D+ K + +D + KSE S + D ++
Sbjct: 715 DDFSDSTSDD-EPKLDAIDDEDAKSEGSQESDQEE 748
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 32.1 bits (72), Expect = 1.1
Identities = 28/88 (31%), Positives = 34/88 (38%), Gaps = 11/88 (12%)
Query: 733 PRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQ---------PVPIPPAKKRGYVK 783
P P P P APT + + A P ++ Q P P P AK
Sbjct: 110 PAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPKPAPAAKPIFLHN 169
Query: 784 QKPPPILPAPA-PTVPT-PIAPEFLWSR 809
Q PPP PA + PT+ T P A L R
Sbjct: 170 QLPPPDYPAASCPTIETAPAASPVLEPR 197
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 32.7 bits (74), Expect = 1.2
Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 4/52 (7%)
Query: 480 SKDVSSESEEESD----LSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE 527
S D S ESEEE SD E+DE + +E +E S + SE+ S++D D E
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEE 978
Score = 31.9 bits (72), Expect = 1.7
Identities = 11/51 (21%), Positives = 25/51 (49%), Gaps = 2/51 (3%)
Query: 477 PEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE 527
E S+ +S +E + + E + + ++S N S + ++D +DW+
Sbjct: 936 EEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSD--EEDGEDWD 984
>gnl|CDD|225519 COG2972, COG2972, Predicted signal transduction protein with a
C-terminal ATPase domain [Signal transduction
mechanisms].
Length = 456
Score = 32.4 bits (74), Expect = 1.2
Identities = 31/138 (22%), Positives = 53/138 (38%), Gaps = 18/138 (13%)
Query: 130 FYYAFFQITPHMALKRVLMILGASLEFDRRFNSEIIERPSDEIEVPQLIRQIPNLGEKNR 189
+ L +L ++ + SE+ E S+ I ++ ++ L E+N
Sbjct: 191 VILLLSSFSSRSILLPILGLILLLSRIALGYLSELKEI-SETIN--EMEERLEYLIEENY 247
Query: 190 ERPLYHKYSIKARAL---IYAH-LSNMQLTSETLEADRMYIVKKCPYLLQEMVTCISQLI 245
+ RAL I H L N TLE RM + P ++V +S+L+
Sbjct: 248 SLEQEQLE-AELRALQSQINPHFLYN------TLETIRMLAEEDDPEEAAKVVKALSKLL 300
Query: 246 LLAYAQRVPRLIHIETLE 263
+ + L +I TLE
Sbjct: 301 RYSLS----NLDNIVTLE 314
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 32.7 bits (74), Expect = 1.2
Identities = 29/134 (21%), Positives = 63/134 (47%), Gaps = 4/134 (2%)
Query: 392 EPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSG 451
E EGE E E +E++ G I + E + K + G+ + K++H+ +S
Sbjct: 763 ETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAE-GKVEHEGETEAGEKDEHEGQSET 821
Query: 452 QKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDS 511
Q T ++T ++ + + ++ K V + SD D E +E +++E+ E+
Sbjct: 822 QADDTEVKDETGE-QELNAENQGEAKQDEKGV--DGGGGSDGGDSEEEEEEEEEEEEEEE 878
Query: 512 NNKSEESSDDDDDD 525
+ EE ++++++
Sbjct: 879 EEEEEEEEEEENEE 892
Score = 31.9 bits (72), Expect = 1.8
Identities = 30/131 (22%), Positives = 56/131 (42%), Gaps = 10/131 (7%)
Query: 396 EGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKI 455
E EGEI+ E+ K + + K++ G+ + + K +G++++
Sbjct: 778 EDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQEL 837
Query: 456 TNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKS 515
N Q EA D EK D +SEEE + E +E +++E+ E+ +
Sbjct: 838 -NAENQGEAKQD----EKGVDGGGGSD-GGDSEEEEE----EEEEEEEEEEEEEEEEEEE 887
Query: 516 EESSDDDDDDW 526
EE+ + +W
Sbjct: 888 EENEEPLSLEW 898
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 32.0 bits (73), Expect = 1.2
Identities = 17/75 (22%), Positives = 26/75 (34%), Gaps = 14/75 (18%)
Query: 451 GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED 510
Q K + A + ++ D + S D E ESDL D ++D
Sbjct: 227 KQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDDDV---------- 276
Query: 511 SNNKSEESSDDDDDD 525
S+E +D D
Sbjct: 277 ----SDEDGEDLFDT 287
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 32.0 bits (73), Expect = 1.2
Identities = 28/153 (18%), Positives = 45/153 (29%), Gaps = 15/153 (9%)
Query: 384 DTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKK---F 440
D K + KE E E E+ + H
Sbjct: 27 DFFKKLLKKKPKE-IDEQTEAEDNLLDKAVDDILSN----HYLKEGNGTTSVAIGDGMEK 81
Query: 441 TKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDE 500
T + K+S ITN P +A E +D SSES +
Sbjct: 82 TIHGDVKESDPMSFITNPSLTVSVP--IASGESDLENLNDEDTSSESSYGF--KEESKKG 137
Query: 501 VVDKKEKNEDSNNKSEESSDD---DDDDWEKYQ 530
+ + E ++ SE S+ D +++ E+
Sbjct: 138 SAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIA 170
>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 584
Score = 32.5 bits (74), Expect = 1.3
Identities = 12/46 (26%), Positives = 16/46 (34%)
Query: 764 PANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPTPIAPEFLWSR 809
P + P P K + P + P PA P A +WS
Sbjct: 396 PQAAPAPSAAAPEPKHQPAPEPRPVLAPTPASGEPNAAAVRSMWST 441
>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein. This family consists of several
eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
to suppress GCR2 and is highly expressed in the muscle
and heart. The function of this family is unknown
although it has been speculated that SGT1 may be
functionally analogous to the Gcr2p protein of
Saccharomyces cerevisiae which is known to be a
regulatory factor of glycolytic gene expression.
Length = 557
Score = 32.4 bits (74), Expect = 1.3
Identities = 22/116 (18%), Positives = 49/116 (42%), Gaps = 9/116 (7%)
Query: 432 KKGGKKKKFTKNKHDKKSSGQ-KKITNTVEQ-TEAPADVAKVEKKDTPEESKDVSSESEE 489
++ G KK NK D +++ +++E+ + E D +E D +SE+
Sbjct: 393 ERQGDKKDLKSNKEDANEVDDLEEVVSSMEEFLNKVSSFEGAEFADDEDEDDDEPDDSED 452
Query: 490 ESDLSDVEN-DEVVDKK------EKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDK 538
+ D + E + E + D + S+++ +DDD+D ++ + +
Sbjct: 453 KDVSFDEDEFFEFLKNMLGLKDDEIDNDLPDDSDDADEDDDEDDDEDEDSSSDSTL 508
Score = 32.0 bits (73), Expect = 1.6
Identities = 17/61 (27%), Positives = 26/61 (42%), Gaps = 2/61 (3%)
Query: 467 DVAKVEKKDTPEESKDVSSESEEESD--LSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 524
D A + + +E +D SS+S E D + E+ N + S S +DDD
Sbjct: 486 DDADEDDDEDDDEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSSGAEDDD 545
Query: 525 D 525
D
Sbjct: 546 D 546
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 32.4 bits (74), Expect = 1.3
Identities = 12/110 (10%), Positives = 39/110 (35%), Gaps = 8/110 (7%)
Query: 436 KKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSD 495
K + + + ++ I +T + E +K +++ + +SE++
Sbjct: 20 KLQPISYIYSNVLVLSKE-ILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKK 78
Query: 496 VENDEVVDKKEKNE-DSNNKSEESS------DDDDDDWEKYQTGLNKRDK 538
+ + + + K + KS+++ ++D + +K
Sbjct: 79 EKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEK 128
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 32.5 bits (74), Expect = 1.3
Identities = 28/149 (18%), Positives = 52/149 (34%), Gaps = 25/149 (16%)
Query: 402 EEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQ 461
+++++ ++ +K+ +P K K SGQK +T ++
Sbjct: 1314 DDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKP 1373
Query: 462 TEA-----PADVAKVEKKDTPEESKDV--------SSESEEESDLSDVENDEVVD----- 503
EA V K+ ++S V +ES E S + +D
Sbjct: 1374 AEAIGISPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEIDVSAKP 1433
Query: 504 -------KKEKNEDSNNKSEESSDDDDDD 525
K+ S+++SE + D D DD
Sbjct: 1434 RPQRANRKQTTYVLSDSESESADDSDFDD 1462
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 32.1 bits (74), Expect = 1.3
Identities = 15/72 (20%), Positives = 17/72 (23%), Gaps = 11/72 (15%)
Query: 729 RPILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPP 788
P P P P P A + + V VP PP P
Sbjct: 360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAA---VAPPQAPAVPPPP--------ASAPQ 408
Query: 789 ILPAPAPTVPTP 800
PA T
Sbjct: 409 QAPAVPLPETTS 420
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 30.6 bits (69), Expect = 1.4
Identities = 22/69 (31%), Positives = 30/69 (43%), Gaps = 5/69 (7%)
Query: 391 LEPSKEGEGEIEEEEEKPNGTI-TPVVNKE---KQQTHRPVWMKNKKGGKKKKFTKNKHD 446
E K+ E EE++EK V NK+ Q P WM ++ KK K + K
Sbjct: 56 KEKKKKEGTESEEDDEKMEVDNKAAVRNKKTLRDQHGQYPAWMNQRQ-AKKLKKKREKKK 114
Query: 447 KKSSGQKKI 455
KK G K +
Sbjct: 115 KKKKGAKNL 123
>gnl|CDD|178151 PLN02536, PLN02536, diaminopimelate epimerase.
Length = 267
Score = 31.3 bits (71), Expect = 1.9
Identities = 20/54 (37%), Positives = 23/54 (42%), Gaps = 1/54 (1%)
Query: 363 GAIITVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVV 416
G VTC M P V FG+ +KV L K G + E E P T T V
Sbjct: 135 GKTWLVTCVSMGNPHCVTFGEKELKVDDLPLEKIGP-KFEHHEMFPARTNTEFV 187
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 30.9 bits (70), Expect = 2.0
Identities = 15/42 (35%), Positives = 19/42 (45%)
Query: 486 ESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE 527
E +E L D +DE D E E E D+DDD+ E
Sbjct: 128 EDDEVDSLDDENDDEDEDDDEIVEILIEDDEVDEDEDDDEDE 169
>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
Length = 306
Score = 31.2 bits (70), Expect = 2.1
Identities = 25/78 (32%), Positives = 33/78 (42%), Gaps = 5/78 (6%)
Query: 730 PILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPI 789
P PTP ++ P VQ A Q + +P QP P+P + +V Q+ P
Sbjct: 20 PPAAAPTPQPNPVIQPQAPVQPGQ-PGAPQQLAIPTQ--QPQPVPTSAMTPHVVQQAPA- 75
Query: 790 LPAP-APTVPTPIAPEFL 806
PAP AP PE L
Sbjct: 76 QPAPAAPPAAGAALPEAL 93
>gnl|CDD|233222 TIGR00986, 3a0801s05tom22, mitochondrial import receptor subunit
Tom22. The mitochondrial protein translocase (MPT)
family, which brings nuclearly encoded preproteins into
mitochondria, is very complex with 19 currently
identified protein constituents.These proteins include
several chaperone proteins, four proteins of the outer
membrane translocase (Tom) import receptor, five
proteins of the Tom channel complex, five proteins of
the inner membrane translocase (Tim) and three "motor"
proteins. This family is specific for the Tom22 proteins
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 145
Score = 30.3 bits (68), Expect = 2.2
Identities = 13/52 (25%), Positives = 24/52 (46%), Gaps = 1/52 (1%)
Query: 454 KITNTVEQTEA-PADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDK 504
++T ++ P VE ++ E+ DV SE +SD +E + D+
Sbjct: 3 QLTEVEDEHFQQPEQQTVVEPEEDDEDFTDVDSEDSVDSDFESLEEETFTDR 54
>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
Provisional.
Length = 291
Score = 31.3 bits (70), Expect = 2.2
Identities = 22/71 (30%), Positives = 37/71 (52%), Gaps = 9/71 (12%)
Query: 469 AKVEKKDTPEESKDVSSESEEESDL----SDV-ENDEVVDKKEKNEDSNNKSEESSDDD- 522
A+ + +P ESK + + S+ DL + EN + +E +ED ++ SEE DDD
Sbjct: 34 AQQDGGKSPPESKGLETSSQGTQDLKGGAAGAKENSPPLPTEEDDEDVDDDSEEGDDDDG 93
Query: 523 ---DDDWEKYQ 530
D++ EK +
Sbjct: 94 GAEDEEEEKVR 104
>gnl|CDD|219928 pfam08602, Mgr1, Mgr1-like, i-AAA protease complex subunit. The S.
cerevisiae Mgr1 protein has been shown to be required
for mitochondrial viability in yeast lacking
mitochondrial DNA. It is a mitochondrial inner membrane
protein, which interacts with Yme1 and is a new subunit
of the i-AAA protease complex.
Length = 362
Score = 31.4 bits (71), Expect = 2.3
Identities = 18/68 (26%), Positives = 29/68 (42%), Gaps = 4/68 (5%)
Query: 95 LYALVFMVALPTAVGMWWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASL 154
L L +V L T +G++++K R G K+ I A F + M + +
Sbjct: 54 LPGLYTLVGLQTGIGLFFFKRFRTLGKKLRKRDI----ADFPSLNRFSTTHGDMHMAPVI 109
Query: 155 EFDRRFNS 162
+ RRF S
Sbjct: 110 PWQRRFVS 117
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 31.3 bits (71), Expect = 2.4
Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 2/60 (3%)
Query: 465 PADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 524
PA + P S +E EE D DV +++ D+ ED N +EE D++ +
Sbjct: 15 PAVLLTFGNGKGPFASAQDLTEDEEAED--DVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 31.2 bits (70), Expect = 2.5
Identities = 21/77 (27%), Positives = 35/77 (45%), Gaps = 4/77 (5%)
Query: 455 ITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEK----NED 510
I N E ++ + EE + E EE+ D+ ++E E V+ E+ +ED
Sbjct: 77 IANDKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDED 136
Query: 511 SNNKSEESSDDDDDDWE 527
S E+ SD + D+ E
Sbjct: 137 SEKDDEKESDAEGDENE 153
Score = 29.6 bits (66), Expect = 7.3
Identities = 20/95 (21%), Positives = 50/95 (52%), Gaps = 2/95 (2%)
Query: 439 KFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEE--SKDVSSESEEESDLSDV 496
KN+ ++ + +++ VE+ E +V VE++ +E KD ES+ E D +++
Sbjct: 95 IIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENEL 154
Query: 497 ENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQT 531
+ ++++ + + N++ S D+ +D + Y++
Sbjct: 155 AGEYIIEEVDDDVAILNENINSIDELEDPLDSYKS 189
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 31.3 bits (71), Expect = 2.5
Identities = 6/72 (8%), Positives = 27/72 (37%)
Query: 478 EESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRD 537
+ + + +S S+ + + + ++N + + D++D + + + +
Sbjct: 30 PQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSN 89
Query: 538 KVLEGRSKQSHS 549
+++ K
Sbjct: 90 NIIDFIYKNLPQ 101
Score = 30.9 bits (70), Expect = 3.5
Identities = 13/57 (22%), Positives = 25/57 (43%), Gaps = 5/57 (8%)
Query: 466 ADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 522
A +K T + + S +S ++ S + + DK D+NN S + ++D
Sbjct: 25 AYADDPQKDSTAKTTSHDSKKSNDDET-SKDTSSKDTDK----ADNNNTSNQDNNDK 76
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 31.2 bits (70), Expect = 2.6
Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 3/89 (3%)
Query: 461 QTEAPADVAKVEKKDTPEESKDVSSESEEE-SDLSDVENDEVVDKKEKNEDSNNKSEESS 519
Q E V + E+ DT E+ V+ +SE+E + D+E D+ + +S + S
Sbjct: 197 QDEKERYVEEEEESDTELEA--VTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSES 254
Query: 520 DDDDDDWEKYQTGLNKRDKVLEGRSKQSH 548
+ D+D+ E + + KR +S++ H
Sbjct: 255 ESDEDEDEDNKGKIRKRKTDDAKKSRKPH 283
>gnl|CDD|223035 PHA03294, PHA03294, envelope glycoprotein H; Provisional.
Length = 835
Score = 31.2 bits (71), Expect = 2.7
Identities = 20/103 (19%), Positives = 26/103 (25%), Gaps = 15/103 (14%)
Query: 725 TTQERPILPRPTPGAPIMVAPTTS-------VQFLNGSKATQFVFVPA-NSLQPVPIPPA 776
T + P PT P VAP + +L F VP SL +
Sbjct: 106 TFAKLPARA-PTGRLPPPVAPLSGLLGNPNLAPYLRTRHLVDFSVVPDPRSLTRWVFDRS 164
Query: 777 KKRGYVKQKPPPILPAPAPT------VPTPIAPEFLWSRESKM 813
LP P P I P + +
Sbjct: 165 DTAATKAHPSGVALPPPRAPPPRNTTDPATIKPNDHLNPATTF 207
>gnl|CDD|221866 pfam12935, Sec16_N, Vesicle coat trafficking protein Sec16
N-terminus. Sec16 is a multi-domain vesicle coat
protein. The overall function of Sec16 is in mediating
the movement of protein-cargo between the organelles of
the secretory pathway. Over-expression of truncated
mutants of only the N-terminus are lethal, and this
portion does not appear to be essential for function so
may act as a stabilising region.
Length = 246
Score = 30.6 bits (69), Expect = 2.8
Identities = 24/161 (14%), Positives = 54/161 (33%), Gaps = 12/161 (7%)
Query: 383 DDTIKVQFLEPSKEGEGEI-----EEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKK 437
D+ + + +E S I E+E+++ + + E + +P +K +
Sbjct: 47 DNGEQNEPVEESAPQTVAIDSVFVEDEDDEGSDFFNSLHEGEAVEEQQPPPHLTRKSTSQ 106
Query: 438 KKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVE 497
+ + S + + + + + S+SE S+
Sbjct: 107 VLDSLGLNPDSLSSPASAEPLDPTAQNEFSNVL---AASTDGNPESESQSEPSSEEELAA 163
Query: 498 NDEVVDKK-EKNEDSNNKSEESS---DDDDDDWEKYQTGLN 534
E+ D + E ++ +E D+DDD +T L
Sbjct: 164 RAELSDDESESTPTEDDLAERWQAFLDNDDDLLLDDETALA 204
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 31.0 bits (70), Expect = 2.9
Identities = 17/71 (23%), Positives = 29/71 (40%), Gaps = 5/71 (7%)
Query: 430 KNKKGGKKKKFTKNKHDKKSSG----QKKITNTVEQTEAPADVAKVEKKD-TPEESKDVS 484
K K K + K +S + + ++T + E+ + K+ + S S
Sbjct: 10 KFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSDVDS 69
Query: 485 SESEEESDLSD 495
S SEEE D S+
Sbjct: 70 SSSEEEEDGSE 80
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 28.9 bits (65), Expect = 2.9
Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 4/61 (6%)
Query: 430 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 489
K KK KK K K KK + K+ T E+ E A+ + K+ E+ + E EE
Sbjct: 13 KGKKIDVKK---KKKKKKKKNKSKEEVVT-EKEEEEKSSAESDLKEGEEDEDNEKIEQEE 68
Query: 490 E 490
+
Sbjct: 69 D 69
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 31.1 bits (70), Expect = 3.1
Identities = 25/107 (23%), Positives = 46/107 (42%), Gaps = 6/107 (5%)
Query: 428 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVE-QTEAPADVAKVEKKDTPEESKDVSSE 486
W+ K+K T N KK + +K+ E + D + + D +E +D +
Sbjct: 15 WV-RGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDD 73
Query: 487 SEEESDLSDVENDEVVDKKEKNEDSNNKSEES----SDDDDDDWEKY 529
++E D + ++D + +D N E+ SDD+ DD +Y
Sbjct: 74 DDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGFADSDDESDDGSEY 120
>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
Provisional.
Length = 337
Score = 30.9 bits (70), Expect = 3.2
Identities = 10/38 (26%), Positives = 23/38 (60%)
Query: 488 EEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
+E+SD S E+D+ D + ++ +++ S D +++D
Sbjct: 281 DEDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEED 318
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 30.9 bits (70), Expect = 3.3
Identities = 21/92 (22%), Positives = 35/92 (38%), Gaps = 10/92 (10%)
Query: 756 KATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPTPIAPEFLWSRESKMSI 815
A + V +P + P P P +PPP P P P P E +
Sbjct: 253 LAVELVLLPRATRLPEPEP----------QPPPPPPPPEPPEPEEEPDEPDQTDPDDGEE 302
Query: 816 ANRIVSEATTMFLETMLQPDILSYIQKGEGKK 847
++I E +E L +IL+ +Q + ++
Sbjct: 303 TDQIPEELMFDAVEADLPDNILATLQTVQRRR 334
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 30.9 bits (70), Expect = 3.3
Identities = 19/79 (24%), Positives = 30/79 (37%)
Query: 456 TNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKS 515
+T E A+++ D E +D E E+ SD + E E +++ + S
Sbjct: 347 PSTKESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSS 406
Query: 516 EESSDDDDDDWEKYQTGLN 534
E SSD D K
Sbjct: 407 ESSSDVGSDSESKADKESA 425
>gnl|CDD|147374 pfam05160, DSS1_SEM1, DSS1/SEM1 family. This family contains the
breast cancer tumour suppressor BRCA2-interacting
protein DSS1 and its homologue SEM1, both of which are
short acidic proteins. DSS1 has been shown to be a
conserved component of the Rae1 mediated mRNA export
pathway in Schizosaccharomyces pombe.
Length = 63
Score = 28.0 bits (63), Expect = 3.3
Identities = 12/38 (31%), Positives = 22/38 (57%)
Query: 488 EEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
EE+ + D ++ +++ + E+ NN EE DDDD +
Sbjct: 12 EEDDEFEDFPVEDWPEEETEKENGNNLWEEDWDDDDVE 49
>gnl|CDD|237304 PRK13206, ureC, urease subunit alpha; Reviewed.
Length = 573
Score = 30.8 bits (70), Expect = 3.3
Identities = 15/30 (50%), Positives = 16/30 (53%), Gaps = 2/30 (6%)
Query: 722 SFSTTQERPILPRPTPGAPIMVAPTTSVQF 751
S T Q P+LPRP GA A TSV F
Sbjct: 470 SIPTPQ--PVLPRPMFGAAPAAAAATSVHF 497
>gnl|CDD|236512 PRK09430, djlA, Dna-J like membrane chaperone protein; Provisional.
Length = 267
Score = 30.6 bits (70), Expect = 3.5
Identities = 10/23 (43%), Positives = 12/23 (52%)
Query: 12 SGASEASIKKAYRKQSLILHPDK 34
+ IK+AYRK HPDK
Sbjct: 210 ESDDDQEIKRAYRKLMSEHHPDK 232
>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl carrier
protein subunit; Validated.
Length = 130
Score = 29.4 bits (66), Expect = 3.5
Identities = 12/44 (27%), Positives = 17/44 (38%)
Query: 758 TQFVFVPANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPTPI 801
Q + PVP+P Q P P A A +P+P+
Sbjct: 25 AQAAAPAQPASTPVPVPTEASPQVEAQAPQPAAAAGADAMPSPM 68
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 31.0 bits (70), Expect = 3.5
Identities = 38/189 (20%), Positives = 61/189 (32%), Gaps = 20/189 (10%)
Query: 345 LHVQPEVIDDEATTEYTAGAIITVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEE 404
L EV +D+A++ A K + GD +K + K+ ++ E
Sbjct: 363 LESLKEVFEDQASSVKNKQAQSEENLNESPKTPIAVTGDPHLKDPTIIAGKKLMNKLTSE 422
Query: 405 EEKPNGTITPVVNKEKQQTH--------------RPVWMKNKKGGKKKKFTKNKHDKKSS 450
+ + V+K K PV + +K K K TK K+ K
Sbjct: 423 KINNPVKV-VKVSKYKGNKSEKKRDINVLDTIFASPVSKELRKKVGKSKQTKLKNFKPVP 481
Query: 451 GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED 510
+ K ++ KV K T ++ N E DK+
Sbjct: 482 NKSKKQLANNNSQNI-KSKKVVKAKTNNKANLQDVGECSSPPN----NKEKNDKQTSTSS 536
Query: 511 SNNKSEESS 519
S KS+ SS
Sbjct: 537 SVLKSDRSS 545
>gnl|CDD|132386 TIGR03343, biphenyl_bphD, 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate
hydrolase. Members of this family are
2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or
HOPD hydrolase, the BphD protein of biphenyl
degradation. BphD acts on the product of ring
meta-cleavage by BphC. Many species carrying bphC and
bphD are capable of degrading polychlorinated biphenyls
as well as biphenyl itself.
Length = 282
Score = 30.3 bits (68), Expect = 3.8
Identities = 21/83 (25%), Positives = 43/83 (51%), Gaps = 12/83 (14%)
Query: 272 IVQGMWDFKNPLLQLPYVTDDHLKHFI--CKKRYIKSLQQFAQMKNEERRSVVKFMNDEQ 329
++QG W + + P +HLK+F+ +K + + A++ + +++V + D++
Sbjct: 183 LLQGRW---ENIQRQP----EHLKNFLISSQKAPLSTWDVTARLGEIKAKTLVTWGRDDR 235
Query: 330 YSDM---LKVLGNMPYVDLHVQP 349
+ + LK+L NMP LHV
Sbjct: 236 FVPLDHGLKLLWNMPDAQLHVFS 258
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 30.3 bits (68), Expect = 3.9
Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 6/71 (8%)
Query: 467 DVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDW 526
DV E++ E E EE S E + ++++E+ E E SDD +D
Sbjct: 173 DVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEA------EGSDDGEDVV 226
Query: 527 EKYQTGLNKRD 537
+ ++K+
Sbjct: 227 DYEGERIDKKQ 237
Score = 29.2 bits (65), Expect = 9.3
Identities = 15/69 (21%), Positives = 26/69 (37%)
Query: 457 NTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSE 516
N V + D A + D E ++ ES+ + D E + E + + E
Sbjct: 155 NIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEE 214
Query: 517 ESSDDDDDD 525
E+ DD +
Sbjct: 215 EAEGSDDGE 223
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
Length = 193
Score = 30.1 bits (67), Expect = 4.4
Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 9/112 (8%)
Query: 429 MKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQT--EAPADVAKVEKKDTPEESKDVSSE 486
M N K K K + + + ++K+ VE+ + P D+ + E +D PEE + E
Sbjct: 1 MGNYCRKNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVE-ELEDQPEEPPEQEEE 59
Query: 487 SEEESDLSDVENDEVVDK--KEKN----EDSNNKSEESSDDDDDDWEKYQTG 532
+EE+ +++ +K EKN E SN+ S D+ +K +TG
Sbjct: 60 NEEQKPKEEIDYPIQENKSFDEKNLDDLERSNSDIYSESHKYDNASDKLETG 111
>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1). This family
consists of several mammalian dentin matrix protein 1
(DMP1) sequences. The dentin matrix acidic
phosphoprotein 1 (DMP1) gene has been mapped to human
chromosome 4q21. DMP1 is a bone and teeth specific
protein initially identified from mineralised dentin.
DMP1 is primarily localised in the nuclear compartment
of undifferentiated osteoblasts. In the nucleus, DMP1
acts as a transcriptional component for activation of
osteoblast-specific genes like osteocalcin. During the
early phase of osteoblast maturation, Ca(2+) surges into
the nucleus from the cytoplasm, triggering the
phosphorylation of DMP1 by a nuclear isoform of casein
kinase II. This phosphorylated DMP1 is then exported out
into the extracellular matrix, where it regulates
nucleation of hydroxyapatite. DMP1 is a unique molecule
that initiates osteoblast differentiation by
transcription in the nucleus and orchestrates
mineralised matrix formation extracellularly, at later
stages of osteoblast maturation. The DMP1 gene has been
found to be ectopically expressed in lung cancer
although the reason for this is unknown.
Length = 514
Score = 30.4 bits (68), Expect = 4.5
Identities = 19/61 (31%), Positives = 34/61 (55%), Gaps = 7/61 (11%)
Query: 472 EKKDTPEESKDV---SSESEEESDLSDVEND----EVVDKKEKNEDSNNKSEESSDDDDD 524
E+ + E+S++V SSES +E+DL EN E V + + ++ +N + S D +D
Sbjct: 317 EENQSQEDSQEVQDPSSESSQEADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQEDS 376
Query: 525 D 525
+
Sbjct: 377 E 377
Score = 29.6 bits (66), Expect = 8.1
Identities = 32/120 (26%), Positives = 53/120 (44%), Gaps = 9/120 (7%)
Query: 448 KSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVS-----SESEEESDLSDVENDEVV 502
+SS + + + +E+ +V + D P+ + S SES EE L + E
Sbjct: 334 ESSQEADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQ 393
Query: 503 DKKEK----NEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLEGRSKQSHSVHCPRFPED 558
+E+ + +S + SEES + +D+ Q GL E RS++S S R ED
Sbjct: 394 STEEQADSESNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEED 453
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 29.7 bits (67), Expect = 4.7
Identities = 14/69 (20%), Positives = 26/69 (37%)
Query: 478 EESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRD 537
E EE+ + +E K+E E + EE +D ++ E L +
Sbjct: 4 ECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDEN 63
Query: 538 KVLEGRSKQ 546
L+ +K+
Sbjct: 64 NKLKEENKK 72
Score = 29.7 bits (67), Expect = 5.6
Identities = 14/79 (17%), Positives = 28/79 (35%)
Query: 442 KNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEV 501
+ K K + ++ E E + + + E+ + + E + DE
Sbjct: 4 ECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDEN 63
Query: 502 VDKKEKNEDSNNKSEESSD 520
KE+N+ N+ E D
Sbjct: 64 NKLKEENKKLENELEALKD 82
Score = 29.3 bits (66), Expect = 7.0
Identities = 14/53 (26%), Positives = 25/53 (47%), Gaps = 1/53 (1%)
Query: 470 KVEKKDTPEESKDVSSESEEES-DLSDVENDEVVDKKEKNEDSNNKSEESSDD 521
K K + EE +E++EE + E ++K+E EDS +E ++
Sbjct: 6 KDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEE 58
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 30.1 bits (68), Expect = 4.8
Identities = 15/93 (16%), Positives = 26/93 (27%), Gaps = 4/93 (4%)
Query: 432 KKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEES 491
K + F ++K KK+ E K ++ T ES S+ +
Sbjct: 106 KSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRP 165
Query: 492 DLSDVENDEVV----DKKEKNEDSNNKSEESSD 520
+ + KK S S +
Sbjct: 166 STPHLLKAKEGPSRRAKKAAKLSSTASSGDEKS 198
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 30.5 bits (69), Expect = 4.9
Identities = 13/46 (28%), Positives = 26/46 (56%)
Query: 483 VSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWEK 528
+ E E E ++ D E +E ++K++ E+ EE D++++ EK
Sbjct: 25 LWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEK 70
>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter. The Monovalent
Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
CPA1 family is a large family of proteins derived from
Gram-positive and Gram-negative bacteria, blue green
bacteria, yeast, plants and animals. Transporters from
eukaryotes have been functionally characterized, and all
of these catalyze Na+:H+ exchange. Their primary
physiological functions may be in (1) cytoplasmic pH
regulation, extruding the H+ generated during
metabolism, and (2) salt tolerance (in plants), due to
Na+ uptake into vacuoles. This model is specific for the
fungal members of this family [Transport and binding
proteins, Cations and iron carrying compounds].
Length = 810
Score = 30.7 bits (69), Expect = 4.9
Identities = 22/98 (22%), Positives = 42/98 (42%), Gaps = 13/98 (13%)
Query: 429 MKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKD---VSS 485
++++ G +++ + H K +K+ + V E A +E E + + S
Sbjct: 723 IEDESGEVFRRYRISPHGGKRKIKKRNDSVVSVDEEKA----IEGPSRVPERGNHDLLHS 778
Query: 486 ESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDD 523
E E D ++ EN + EDS++ + ES D D
Sbjct: 779 EDEMADDEAESENM------DDYEDSDDNAYESKDHAD 810
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 30.3 bits (68), Expect = 5.0
Identities = 16/56 (28%), Positives = 23/56 (41%), Gaps = 2/56 (3%)
Query: 474 KDTPEESKDVSSESEEES--DLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE 527
K +S E+ + +D+ D E ED ++S DDDD DWE
Sbjct: 258 PSLMSPRKKTASFKEQVVTRTFPERTSDDDEDAIETEEDDVDESAIEDDDDDSDWE 313
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 30.5 bits (69), Expect = 5.4
Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 8/74 (10%)
Query: 475 DTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD---DWEKYQT 531
DT +SK+++ EE++ V + KN K + S++ +D +Y
Sbjct: 208 DTLIKSKEININKEEKN-----NGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDK 262
Query: 532 GLNKRDKVLEGRSK 545
++ K L+ ++K
Sbjct: 263 EKLEKIKDLKEKAK 276
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 30.4 bits (68), Expect = 5.5
Identities = 19/76 (25%), Positives = 28/76 (36%), Gaps = 5/76 (6%)
Query: 730 PILPRPTPGA---PIMVAPTTSVQFLNGSKATQFVFVPANSLQP--VPIPPAKKRGYVKQ 784
P P PTP A P +P + + + + A SL P +P P +
Sbjct: 191 PSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTAS 250
Query: 785 KPPPILPAPAPTVPTP 800
+ P PAP+ P
Sbjct: 251 QQSPQPPAPSSRHPQS 266
Score = 30.0 bits (67), Expect = 6.6
Identities = 22/106 (20%), Positives = 33/106 (31%), Gaps = 21/106 (19%)
Query: 720 SESFSTTQERPILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPAN----SLQPVPIPP 775
+ES S + + L +P I V P ++ VP + QP P P
Sbjct: 160 NESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQ 219
Query: 776 AKKRGYVKQKP---PPILPAP--------------APTVPTPIAPE 804
+ P P LP+P P P+ P+
Sbjct: 220 QPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQ 265
Score = 30.0 bits (67), Expect = 7.1
Identities = 16/71 (22%), Positives = 22/71 (30%), Gaps = 3/71 (4%)
Query: 733 PRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPILPA 792
+P P T Q L +T + P P + + P I P
Sbjct: 412 SQPLQSVPAQPPVLTQSQSLPPKASTH---PHSGLHSGPPQSPFAQHPFTSGGLPAIGPP 468
Query: 793 PAPTVPTPIAP 803
P+ TP AP
Sbjct: 469 PSLPTSTPAAP 479
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 30.0 bits (67), Expect = 5.6
Identities = 21/77 (27%), Positives = 30/77 (38%), Gaps = 3/77 (3%)
Query: 730 PILPR--PTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPP 787
P PR P PG P + + + K TQ V + QP PA++ + +
Sbjct: 174 PNPPREAPAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQQPQ-QQPQQQ 232
Query: 788 PILPAPAPTVPTPIAPE 804
P+ PA PT P
Sbjct: 233 PVQPAQQPTPQNPAQQP 249
>gnl|CDD|216161 pfam00865, Osteopontin, Osteopontin.
Length = 293
Score = 30.1 bits (67), Expect = 5.6
Identities = 24/95 (25%), Positives = 45/95 (47%), Gaps = 5/95 (5%)
Query: 434 GGKKKKFTKNKHDKKSSGQKKITNTVEQTE-APADVAKVEKKDTPEESK--DVSSESEEE 490
G ++K NKH + K + +Q AP + E+ D ++ S+ES E
Sbjct: 6 GSSEEKQLYNKHPDAVATWLKPDPSQKQNLLAPQNSVSSEETDDLKQETLPSKSNESHEH 65
Query: 491 SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
D D ++D+ D + + ++N+S+++ DD D
Sbjct: 66 MD--DDDDDDDDDHVDSQDSNSNESDDADHTDDSD 98
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 28.8 bits (65), Expect = 5.7
Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 4/56 (7%)
Query: 472 EKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDDWE 527
E+ EE D + + D +++D + D ED + E D+D++D E
Sbjct: 78 EEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLD----EEDDEDEEDEE 129
Score = 28.4 bits (64), Expect = 8.9
Identities = 8/58 (13%), Positives = 31/58 (53%)
Query: 467 DVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 524
V +++++ P E K + + D+ +D+++ + +E+ ++ ++ ++D++
Sbjct: 72 PVDEIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEEDEE 129
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 30.3 bits (68), Expect = 5.9
Identities = 18/100 (18%), Positives = 29/100 (29%), Gaps = 3/100 (3%)
Query: 707 LRDLCSVGRRRVLSESF---STTQERPILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFV 763
LR++ + + LP PG+ + +S + + F+
Sbjct: 166 LREMLVELQIGGRGGNIPGAIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFL 225
Query: 764 PANSLQPVPIPPAKKRGYVKQKPPPILPAPAPTVPTPIAP 803
P S PPA + P P PI P
Sbjct: 226 PGPSPAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQPIIP 265
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 30.1 bits (68), Expect = 6.9
Identities = 17/53 (32%), Positives = 22/53 (41%), Gaps = 3/53 (5%)
Query: 471 VEKKDTPEESKDVSSESEEESDLSDVEND---EVVDKKEKNEDSNNKSEESSD 520
+E +D E D E EEE + D + E D +E ED E SD
Sbjct: 379 LEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSD 431
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 29.5 bits (66), Expect = 7.0
Identities = 19/134 (14%), Positives = 44/134 (32%), Gaps = 1/134 (0%)
Query: 384 DTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKN 443
K + E KE + E+ + KE Q + +++
Sbjct: 95 RKTKQKVEEEVKEQLQSLLEKIV-VSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAG 153
Query: 444 KHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVD 503
K +++ + T + K +++ + + V +++ DV ++ +
Sbjct: 154 KVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDE 213
Query: 504 KKEKNEDSNNKSEE 517
+ EDS EE
Sbjct: 214 EDNPVEDSKAIKEE 227
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 30.0 bits (67), Expect = 7.5
Identities = 11/60 (18%), Positives = 23/60 (38%), Gaps = 3/60 (5%)
Query: 468 VAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE---KNEDSNNKSEESSDDDDD 524
+ + + E KD +S ++ +K E +N + + EE D +D+
Sbjct: 336 LNNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDE 395
>gnl|CDD|131353 TIGR02300, FYDLN_acid, TIGR02300 family protein. Members of this
family are bacterial proteins with a conserved motif
[KR]FYDLN, sometimes flanked by a pair of CXXC motifs,
followed by a long region of low complexity sequence in
which roughly half the residues are Asp and Glu,
including multiple runs of five or more acidic residues.
The function of members of this family is unknown.
Length = 129
Score = 28.4 bits (63), Expect = 8.1
Identities = 14/76 (18%), Positives = 28/76 (36%), Gaps = 4/76 (5%)
Query: 456 TNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEE----SDLSDVENDEVVDKKEKNEDS 511
+ + V +V+ + E++EE + +D D+ ++D
Sbjct: 46 RVRADDEKEAVAVKEVDPEIDEAAELVSLEEADEEVDPGAPTPAGGDDLAEDEDIADDDD 105
Query: 512 NNKSEESSDDDDDDWE 527
+ E +DDD D E
Sbjct: 106 DVTFLEDEEDDDIDDE 121
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 29.6 bits (66), Expect = 8.3
Identities = 23/114 (20%), Positives = 45/114 (39%), Gaps = 6/114 (5%)
Query: 430 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 489
+ + + F D K +KK + E + KK+ + ++ E +E
Sbjct: 247 DEEGNIEYEDF----FDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDE 302
Query: 490 ESDLSDVE--NDEVVDKKEKNEDSNNKSEESSDDDDDDWEKYQTGLNKRDKVLE 541
+ D D E + +DK + +E + S + +EK Q L ++ + LE
Sbjct: 303 QEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQQIEQLE 356
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 29.7 bits (67), Expect = 8.3
Identities = 18/76 (23%), Positives = 21/76 (27%), Gaps = 9/76 (11%)
Query: 730 PILPRPTPGAPIMVAPTTSVQFLNGSKATQFVFVPANSLQPVPIPPAKKRGYVKQKPPPI 789
P P A VA + A A + P PPA P
Sbjct: 381 PARPEAAAPAAAPVAQAAAAP---APAAAPAAAASAPAAPPAAAPPAPV-----AAPAAA 432
Query: 790 LPAPAPT-VPTPIAPE 804
PA AP P +A
Sbjct: 433 APAAAPAAAPAAVALA 448
>gnl|CDD|221403 pfam12064, DUF3544, Domain of unknown function (DUF3544). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 198 to 216 amino acids in length. This domain is
found associated with pfam00628, pfam01753, pfam00439,
pfam00855.
Length = 207
Score = 29.2 bits (65), Expect = 8.5
Identities = 24/79 (30%), Positives = 37/79 (46%), Gaps = 3/79 (3%)
Query: 441 TKNKHDKKSSGQKKITNTVEQTEAPADVAKV-EKKDTPEESKDVSSESEEESDLSDVEND 499
T K +S Q + T+E +A + ++ E T E D SE E+SD SD E
Sbjct: 131 TSPKRQIRSRFQLNLDKTIESCKAQLGINEISEDAYTGVEHSD--SEDSEKSDSSDSEYS 188
Query: 500 EVVDKKEKNEDSNNKSEES 518
++K KNE + +E+
Sbjct: 189 SDDEQKAKNEQEDASDKEA 207
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 27.4 bits (61), Expect = 9.0
Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 6/40 (15%)
Query: 486 ESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDD 525
E EE D D E+D+ D K+ +D + DDD+DD
Sbjct: 46 EGEEPEDDDDDEDDDDDDDKDDKDD------DDDDDDEDD 79
>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D. SpoVID, the
stage VI sporulation protein D, is restricted to
endospore-forming members of the bacteria, all of which
are found among the Firmicutes. It is widely distributed
but not quite universal in this group. Between
well-conserved N-terminal and C-terminal domains is a
poorly conserved, low-complexity region of variable
length, rich enough in glutamic acid to cause spurious
BLAST search results unless a filter is used. The seed
alignment for this model was trimmed, in effect, by
choosing member sequences in which these regions are
relatively short. SpoVID is involved in spore coat
assembly by the mother cell compartment late in the
process of sporulation [Cellular processes, Sporulation
and germination].
Length = 338
Score = 29.5 bits (66), Expect = 9.0
Identities = 14/75 (18%), Positives = 38/75 (50%), Gaps = 4/75 (5%)
Query: 455 ITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNK 514
I+ ++ A+ A+ ++++ S + ++EE+ +E D+ + +++ +
Sbjct: 133 ISGIQQENNLDAEPAREDEEEEESFSAEFEHPAQEETA----GEEERTDEPKVEHEAHEQ 188
Query: 515 SEESSDDDDDDWEKY 529
E+ +DDD D+W+
Sbjct: 189 HEQPADDDPDEWKIS 203
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 28.5 bits (64), Expect = 9.1
Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 3/58 (5%)
Query: 437 KKKFTKNKHDKKSSGQKKITNTV--EQTEAPADVAKVEKKDTPEESKDVSSESEEESD 492
+KK T KH +K Q+++ N + E V +K+ E + SS SE S+
Sbjct: 59 QKKITFQKHKEK-PEQEELKNQLFSENATENNTVKATKKQLFSSEYEQTSSSSESTSE 115
>gnl|CDD|113815 pfam05058, ActA, ActA Protein. The ActA family is found in
Listeria and is associated with motility. ActA protein
acts as a scaffold to assemble and activate host cell
actin cytoskeletal factors at the bacterial surface,
resulting in directional actin polymerisation and
propulsion of the bacterium through the cytoplasm of the
host cell.
Length = 601
Score = 29.4 bits (65), Expect = 9.5
Identities = 28/124 (22%), Positives = 56/124 (45%), Gaps = 13/124 (10%)
Query: 367 TVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEEKPNGTITPVVNKEKQQTHRP 426
T T T+++K V K+ L +K E I E + P + K+ + +P
Sbjct: 454 TTTKTVLKKITPV---KIAPKLAELPATKPQETAIGENK-------APFIEKQAETNKQP 503
Query: 427 VWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSE 486
+ M + +K+ ++K + K ++K+ VE++E D ++ EE K ++
Sbjct: 504 IDMPSLPVIQKEATERDKEEMKPQTEEKM---VEESEPANDANGKKRSAGIEEGKLIAKS 560
Query: 487 SEEE 490
+E+E
Sbjct: 561 AEDE 564
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 29.4 bits (66), Expect = 9.9
Identities = 21/78 (26%), Positives = 27/78 (34%), Gaps = 19/78 (24%)
Query: 733 PRPTPGAPIMVAPTTSVQFLN-------GSKATQFVFVPANSLQPVPIPPAKKRGYVKQK 785
PR G + APT + + L+ G AT A++ P PPA
Sbjct: 873 PRTEGGVRVGYAPTLA-EALDQVFGPGTGRVATAPGGDAASAPPPGAGPPA--------- 922
Query: 786 PPPILPAPAPTVPTPIAP 803
PP P P P A
Sbjct: 923 PPQ--AVPPPRTTQPPAA 938
>gnl|CDD|131515 TIGR02462, pyranose_ox, pyranose oxidase. Pyranose oxidase (also
called glucose 2-oxidase) converts D-glucose and
molecular oxygen to 2-dehydro-D-glucose and hydrogen
peroxide. Peroxide production is believed to be
important to the wood rot fungi in which this enzyme is
found for lignin degradation.
Length = 544
Score = 29.4 bits (66), Expect = 9.9
Identities = 24/102 (23%), Positives = 37/102 (36%), Gaps = 6/102 (5%)
Query: 712 SVGRRRVLSESFSTTQERPILPRPTPGAPIMVAPTT--SVQFLNGSKATQFVFVPANSLQ 769
+V ++L S RP P P P + T S+ F +T+ V + +
Sbjct: 277 AVHNPQILVNSGFGQLGRPDPTNPPPLLPSLGRYITEQSMTFCQIVLSTELVDSVRSDPR 336
Query: 770 PVPIPPAKKRGYVKQKPPPILPAP----APTVPTPIAPEFLW 807
+ K ++ + P LP P P V TP E W
Sbjct: 337 GLDWWKEKVANHMMKHPEDPLPIPFRDPEPQVTTPFTEEHPW 378
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.132 0.384
Gapped
Lambda K H
0.267 0.0705 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 45,948,009
Number of extensions: 4577789
Number of successful extensions: 7245
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6415
Number of HSP's successfully gapped: 351
Length of query: 904
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 798
Effective length of database: 6,236,078
Effective search space: 4976390244
Effective search space used: 4976390244
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.5 bits)
S2: 64 (28.5 bits)