RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17599
(180 letters)
>gnl|CDD|215804 pfam00226, DnaJ, DnaJ domain. DnaJ domains (J-domains) are
associated with hsp70 heat-shock system and it is
thought that this domain mediates the interaction.
DnaJ-domain is therefore part of a chaperone (protein
folding) system. The T-antigens, although not in
Prosite are confirmed as DnaJ containing domains from
literature.
Length = 63
Score = 79.5 bits (197), Expect = 2e-20
Identities = 32/63 (50%), Positives = 41/63 (65%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L + D S+++I+ AYRK ALK HPDKNP D A E F +++A EVL D RA
Sbjct: 1 DYYEILGVPRDASDEEIKKAYRKLALKYHPDKNPGDPAAEEKFKEINEAYEVLSDPEKRA 60
Query: 88 AYD 90
YD
Sbjct: 61 IYD 63
>gnl|CDD|236757 PRK10767, PRK10767, chaperone protein DnaJ; Provisional.
Length = 371
Score = 82.1 bits (204), Expect = 7e-19
Identities = 31/63 (49%), Positives = 43/63 (68%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L ++ + SE +I+ AYRK A+K HPD+NP D++A E F + +A EVL D RA
Sbjct: 5 DYYEVLGVSRNASEDEIKKAYRKLAMKYHPDRNPGDKEAEEKFKEIKEAYEVLSDPQKRA 64
Query: 88 AYD 90
AYD
Sbjct: 65 AYD 67
>gnl|CDD|99751 cd06257, DnaJ, DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock
protein 40) proteins are highly conserved and play
crucial roles in protein translation, folding,
unfolding, translocation, and degradation. They act
primarily by stimulating the ATPase activity of Hsp70s,
an important chaperonine family. Hsp40 proteins are
characterized by the presence of a J domain, which
mediates the interaction with Hsp70. They may contain
other domains as well, and the architectures provide a
means of classification.
Length = 55
Score = 71.8 bits (177), Expect = 2e-17
Identities = 29/55 (52%), Positives = 39/55 (70%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLD 82
D Y +L + D S+++I+ AYRK ALK HPDKNPDD +A E F +++A EVL D
Sbjct: 1 DYYDILGVPPDASDEEIKKAYRKLALKYHPDKNPDDPEAEEKFKEINEAYEVLSD 55
>gnl|CDD|223560 COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn
finger domain [Posttranslational modification, protein
turnover, chaperones].
Length = 371
Score = 76.1 bits (188), Expect = 9e-17
Identities = 32/63 (50%), Positives = 44/63 (69%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L ++ D SE++I+ AYRK A K HPD+NP D++A E F +++A EVL D RA
Sbjct: 5 DYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNPGDKEAEEKFKEINEAYEVLSDPEKRA 64
Query: 88 AYD 90
AYD
Sbjct: 65 AYD 67
>gnl|CDD|233829 TIGR02349, DnaJ_bact, chaperone protein DnaJ. This model
represents bacterial forms of DnaJ, part of the
DnaK-DnaJ-GrpE chaperone system. The three components
typically are encoded by consecutive genes. DnaJ
homologs occur in many genomes, typically not near DnaK
and GrpE-like genes; most such genes are not included
by this family. Eukaryotic (mitochondrial and
chloroplast) forms are not included in the scope of
this family.
Length = 354
Score = 69.5 bits (171), Expect = 2e-14
Identities = 31/63 (49%), Positives = 41/63 (65%), Gaps = 1/63 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L ++ D SE++I+ AYRK A K HPD+N D E A E F +++A EVL D RA
Sbjct: 1 DYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNKDKE-AEEKFKEINEAYEVLSDPEKRA 59
Query: 88 AYD 90
YD
Sbjct: 60 QYD 62
>gnl|CDD|237668 PRK14301, PRK14301, chaperone protein DnaJ; Provisional.
Length = 373
Score = 68.6 bits (168), Expect = 4e-14
Identities = 30/63 (47%), Positives = 42/63 (66%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L ++ D SE +I+ AYRK AL+ HPD+NPD+ +A + F ++A EVL D RA
Sbjct: 5 DYYEVLGVSRDASEDEIKKAYRKLALQYHPDRNPDNPEAEQKFKEAAEAYEVLRDAEKRA 64
Query: 88 AYD 90
YD
Sbjct: 65 RYD 67
>gnl|CDD|197617 smart00271, DnaJ, DnaJ molecular chaperone homology domain.
Length = 60
Score = 63.0 bits (154), Expect = 5e-14
Identities = 29/57 (50%), Positives = 37/57 (64%), Gaps = 1/57 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDD-EKAIETFHLLSKAIEVLLDK 83
D Y +L + D S +I+ AYRK ALK HPDKNP D E+A E F +++A EVL D
Sbjct: 2 DYYEILGVPRDASLDEIKKAYRKLALKYHPDKNPGDKEEAEEKFKEINEAYEVLSDP 58
>gnl|CDD|237664 PRK14294, PRK14294, chaperone protein DnaJ; Provisional.
Length = 366
Score = 67.1 bits (164), Expect = 1e-13
Identities = 30/63 (47%), Positives = 42/63 (66%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L +T D SE++I+ +YRK A+K HPD+NP D++A E F ++A EVL D R
Sbjct: 5 DYYEILGVTRDASEEEIKKSYRKLAMKYHPDRNPGDKEAEELFKEAAEAYEVLSDPKKRG 64
Query: 88 AYD 90
YD
Sbjct: 65 IYD 67
>gnl|CDD|237655 PRK14279, PRK14279, chaperone protein DnaJ; Provisional.
Length = 392
Score = 65.9 bits (161), Expect = 4e-13
Identities = 28/63 (44%), Positives = 38/63 (60%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y L ++ D S ++I+ AYRK A + HPD NP D A E F +S+A +VL D + R
Sbjct: 10 DFYKELGVSSDASAEEIKKAYRKLARELHPDANPGDPAAEERFKAVSEAHDVLSDPAKRK 69
Query: 88 AYD 90
YD
Sbjct: 70 EYD 72
>gnl|CDD|184611 PRK14297, PRK14297, chaperone protein DnaJ; Provisional.
Length = 380
Score = 66.0 bits (161), Expect = 4e-13
Identities = 26/63 (41%), Positives = 40/63 (63%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L L S+ +I+ A+RK A+K HPDKN +++A E F +++A +VL D +A
Sbjct: 5 DYYEVLGLEKGASDDEIKKAFRKLAIKYHPDKNKGNKEAEEKFKEINEAYQVLSDPQKKA 64
Query: 88 AYD 90
YD
Sbjct: 65 QYD 67
>gnl|CDD|237658 PRK14284, PRK14284, chaperone protein DnaJ; Provisional.
Length = 391
Score = 65.6 bits (160), Expect = 6e-13
Identities = 29/64 (45%), Positives = 42/64 (65%)
Query: 27 LDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSAR 86
+D Y +L ++ S ++I+ AYRK A+K HPDKNP D +A + F +S+A EVL D R
Sbjct: 1 MDYYTILGVSKTASPEEIKKAYRKLAVKYHPDKNPGDAEAEKRFKEVSEAYEVLSDAQKR 60
Query: 87 AAYD 90
+YD
Sbjct: 61 ESYD 64
>gnl|CDD|237654 PRK14278, PRK14278, chaperone protein DnaJ; Provisional.
Length = 378
Score = 65.5 bits (160), Expect = 6e-13
Identities = 30/63 (47%), Positives = 39/63 (61%), Gaps = 1/63 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D YGLL ++ + S+ +I+ AYRK A + HPD NP DE+A E F +S A EVL D R
Sbjct: 4 DYYGLLGVSRNASDAEIKRAYRKLARELHPDVNP-DEEAQEKFKEISVAYEVLSDPEKRR 62
Query: 88 AYD 90
D
Sbjct: 63 IVD 65
>gnl|CDD|237660 PRK14289, PRK14289, chaperone protein DnaJ; Provisional.
Length = 386
Score = 64.9 bits (158), Expect = 1e-12
Identities = 27/63 (42%), Positives = 41/63 (65%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L ++ + +I+ AYRKKA++ HPDKNP D++A E F ++A +VL D R+
Sbjct: 6 DYYEVLGVSKTATVDEIKKAYRKKAIQYHPDKNPGDKEAEEKFKEAAEAYDVLSDPDKRS 65
Query: 88 AYD 90
YD
Sbjct: 66 RYD 68
>gnl|CDD|237657 PRK14281, PRK14281, chaperone protein DnaJ; Provisional.
Length = 397
Score = 62.5 bits (152), Expect = 6e-12
Identities = 28/63 (44%), Positives = 41/63 (65%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L ++ + +I+ AYRK ALK HPDKNPD+++A E F +++A EVL + R
Sbjct: 4 DYYEVLGVSRSADKDEIKKAYRKLALKYHPDKNPDNKEAEEHFKEVNEAYEVLSNDDKRR 63
Query: 88 AYD 90
YD
Sbjct: 64 RYD 66
>gnl|CDD|237662 PRK14292, PRK14292, chaperone protein DnaJ; Provisional.
Length = 371
Score = 61.8 bits (150), Expect = 1e-11
Identities = 29/64 (45%), Positives = 40/64 (62%), Gaps = 1/64 (1%)
Query: 27 LDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSAR 86
+D Y LL ++ S +I+SAYRK ALK HPD+N ++ A E F +++A VL D R
Sbjct: 2 MDYYELLGVSRTASADEIKSAYRKLALKYHPDRNK-EKGAAEKFAQINEAYAVLSDAEKR 60
Query: 87 AAYD 90
A YD
Sbjct: 61 AHYD 64
>gnl|CDD|240236 PTZ00037, PTZ00037, DnaJ_C chaperone protein; Provisional.
Length = 421
Score = 61.0 bits (148), Expect = 2e-11
Identities = 30/62 (48%), Positives = 38/62 (61%), Gaps = 4/62 (6%)
Query: 29 LYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARAA 88
LY +L L+ DC+ +I+ AYRK A+K HPDK D EK F +S+A EVL D R
Sbjct: 30 LYEVLNLSKDCTTSEIKKAYRKLAIKHHPDKGGDPEK----FKEISRAYEVLSDPEKRKI 85
Query: 89 YD 90
YD
Sbjct: 86 YD 87
>gnl|CDD|184612 PRK14298, PRK14298, chaperone protein DnaJ; Provisional.
Length = 377
Score = 60.6 bits (147), Expect = 2e-11
Identities = 31/63 (49%), Positives = 39/63 (61%), Gaps = 1/63 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L L+ D S +DI+ AYRK A+K HPDKN + A E F +S+A VL D RA
Sbjct: 6 DYYEILGLSKDASVEDIKKAYRKLAMKYHPDKN-KEPDAEEKFKEISEAYAVLSDAEKRA 64
Query: 88 AYD 90
YD
Sbjct: 65 QYD 67
>gnl|CDD|237665 PRK14295, PRK14295, chaperone protein DnaJ; Provisional.
Length = 389
Score = 60.2 bits (146), Expect = 4e-11
Identities = 28/63 (44%), Positives = 38/63 (60%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L + D +E +I+ AYRK A + HPD N D KA E F +S+A +VL D+ R
Sbjct: 10 DYYKVLGVPKDATEAEIKKAYRKLAREYHPDANKGDAKAEERFKEISEAYDVLSDEKKRK 69
Query: 88 AYD 90
YD
Sbjct: 70 EYD 72
>gnl|CDD|184599 PRK14277, PRK14277, chaperone protein DnaJ; Provisional.
Length = 386
Score = 59.4 bits (144), Expect = 6e-11
Identities = 29/70 (41%), Positives = 44/70 (62%), Gaps = 2/70 (2%)
Query: 21 MADIKDLDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVL 80
MA KD Y +L + + +E++I+ AYR+ A K HPD NP D++A + F +++A E+L
Sbjct: 1 MAAKKDY--YEILGVDRNATEEEIKKAYRRLAKKYHPDLNPGDKEAEQKFKEINEAYEIL 58
Query: 81 LDKSARAAYD 90
D RA YD
Sbjct: 59 SDPQKRAQYD 68
>gnl|CDD|237653 PRK14276, PRK14276, chaperone protein DnaJ; Provisional.
Length = 380
Score = 58.6 bits (142), Expect = 1e-10
Identities = 26/65 (40%), Positives = 36/65 (55%), Gaps = 5/65 (7%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKN--PDDEKAIETFHLLSKAIEVLLDKSA 85
+ Y L ++ D S+ +I+ AYRK + K HPD N P E E + + +A E L D
Sbjct: 5 EYYDRLGVSKDASQDEIKKAYRKLSKKYHPDINKEPGAE---EKYKEVQEAYETLSDPQK 61
Query: 86 RAAYD 90
RAAYD
Sbjct: 62 RAAYD 66
>gnl|CDD|172774 PRK14286, PRK14286, chaperone protein DnaJ; Provisional.
Length = 372
Score = 58.1 bits (140), Expect = 2e-10
Identities = 25/67 (37%), Positives = 43/67 (64%)
Query: 24 IKDLDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDK 83
+ + Y +L ++ ++++I+SAYRK A+K HPDKN ++++ E F ++A E+L D
Sbjct: 1 MSERSYYDILGVSKSANDEEIKSAYRKLAIKYHPDKNKGNKESEEKFKEATEAYEILRDP 60
Query: 84 SARAAYD 90
R AYD
Sbjct: 61 KKRQAYD 67
>gnl|CDD|172773 PRK14285, PRK14285, chaperone protein DnaJ; Provisional.
Length = 365
Score = 57.7 bits (139), Expect = 2e-10
Identities = 28/63 (44%), Positives = 41/63 (65%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L L+ S+ +I+ AYRK A+K HPDKN +++A F ++A EVL+D + RA
Sbjct: 4 DYYEILGLSKGASKDEIKKAYRKIAIKYHPDKNKGNKEAESIFKEATEAYEVLIDDNKRA 63
Query: 88 AYD 90
YD
Sbjct: 64 QYD 66
>gnl|CDD|225124 COG2214, CbpA, DnaJ-class molecular chaperone [Posttranslational
modification, protein turnover, chaperones].
Length = 237
Score = 54.8 bits (131), Expect = 2e-09
Identities = 31/67 (46%), Positives = 42/67 (62%), Gaps = 1/67 (1%)
Query: 25 KDLDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEK-AIETFHLLSKAIEVLLDK 83
LD Y +L + + S ++I+ AYRK ALK HPD+NP D K A E F +++A E+L D
Sbjct: 4 DLLDYYEILGVPPNASLEEIKKAYRKLALKYHPDRNPGDPKVAEEKFKEINEAYEILSDP 63
Query: 84 SARAAYD 90
RA YD
Sbjct: 64 ERRAEYD 70
>gnl|CDD|237663 PRK14293, PRK14293, chaperone protein DnaJ; Provisional.
Length = 374
Score = 55.4 bits (134), Expect = 2e-09
Identities = 24/63 (38%), Positives = 37/63 (58%), Gaps = 1/63 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L ++ D + +++ AYR+ A K HPD N + A + F +++A EVL D RA
Sbjct: 4 DYYEILGVSRDADKDELKRAYRRLARKYHPDVN-KEPGAEDRFKEINRAYEVLSDPETRA 62
Query: 88 AYD 90
YD
Sbjct: 63 RYD 65
>gnl|CDD|237656 PRK14280, PRK14280, chaperone protein DnaJ; Provisional.
Length = 376
Score = 55.1 bits (133), Expect = 2e-09
Identities = 28/63 (44%), Positives = 38/63 (60%), Gaps = 1/63 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L ++ S+ +I+ AYRK + K HPD N +E A E F +S+A EVL D RA
Sbjct: 5 DYYEVLGVSKSASKDEIKKAYRKLSKKYHPDIN-KEEGADEKFKEISEAYEVLSDDQKRA 63
Query: 88 AYD 90
YD
Sbjct: 64 QYD 66
>gnl|CDD|172776 PRK14288, PRK14288, chaperone protein DnaJ; Provisional.
Length = 369
Score = 54.7 bits (131), Expect = 3e-09
Identities = 28/65 (43%), Positives = 43/65 (66%)
Query: 26 DLDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSA 85
+L Y +LE+ +++ I+ +YRK ALK HPD+N D++A E F L+++A VL D+
Sbjct: 2 ELSYYEILEVEKHSNQETIKKSYRKLALKYHPDRNAGDKEAEEKFKLINEAYGVLSDEKK 61
Query: 86 RAAYD 90
RA YD
Sbjct: 62 RALYD 66
>gnl|CDD|184604 PRK14283, PRK14283, chaperone protein DnaJ; Provisional.
Length = 378
Score = 53.3 bits (128), Expect = 8e-09
Identities = 25/63 (39%), Positives = 37/63 (58%), Gaps = 1/63 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L + + +++I+ AYRK A K HPD + ++E A E F +S+A VL D R
Sbjct: 6 DYYEVLGVDRNADKKEIKKAYRKLARKYHPDVS-EEEGAEEKFKEISEAYAVLSDDEKRQ 64
Query: 88 AYD 90
YD
Sbjct: 65 RYD 67
>gnl|CDD|237666 PRK14296, PRK14296, chaperone protein DnaJ; Provisional.
Length = 372
Score = 53.0 bits (127), Expect = 1e-08
Identities = 28/67 (41%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 24 IKDLDLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDK 83
+K D Y +L ++ SEQ+IR AYRK A + HPD N + A + +++A +VLLDK
Sbjct: 1 MKKKDYYEVLGVSKTASEQEIRQAYRKLAKQYHPDLNKSPD-AHDKMVEINEAADVLLDK 59
Query: 84 SARAAYD 90
R YD
Sbjct: 60 DKRKQYD 66
>gnl|CDD|172778 PRK14290, PRK14290, chaperone protein DnaJ; Provisional.
Length = 365
Score = 52.2 bits (125), Expect = 2e-08
Identities = 26/64 (40%), Positives = 40/64 (62%), Gaps = 1/64 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDE-KAIETFHLLSKAIEVLLDKSAR 86
D Y +L + + S++DI+ A+R+ A K HPD +P ++ +A E F +S+A EVL D R
Sbjct: 4 DYYKILGVDRNASQEDIKKAFRELAKKWHPDLHPGNKAEAEEKFKEISEAYEVLSDPQKR 63
Query: 87 AAYD 90
YD
Sbjct: 64 RQYD 67
>gnl|CDD|184603 PRK14282, PRK14282, chaperone protein DnaJ; Provisional.
Length = 369
Score = 51.3 bits (123), Expect = 4e-08
Identities = 22/64 (34%), Positives = 41/64 (64%), Gaps = 1/64 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEK-AIETFHLLSKAIEVLLDKSAR 86
D Y +L ++ + ++++I+ AY++ + HPD++P++ K A + F + +A EVL D R
Sbjct: 5 DYYEILGVSRNATQEEIKRAYKRLVKEWHPDRHPENRKEAEQKFKEIQEAYEVLSDPQKR 64
Query: 87 AAYD 90
A YD
Sbjct: 65 AMYD 68
>gnl|CDD|237661 PRK14291, PRK14291, chaperone protein DnaJ; Provisional.
Length = 382
Score = 48.6 bits (116), Expect = 4e-07
Identities = 24/63 (38%), Positives = 39/63 (61%), Gaps = 1/63 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L ++ + ++++I+ AYR+ A K HPD N + E A E F +++A +VL D R
Sbjct: 4 DYYEILGVSRNATQEEIKKAYRRLARKYHPDFNKNPE-AEEKFKEINEAYQVLSDPEKRK 62
Query: 88 AYD 90
YD
Sbjct: 63 LYD 65
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 47.9 bits (113), Expect = 9e-07
Identities = 25/63 (39%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L + D EQ+I+ A+RK A K HPD+N A F +++A +VL + RA
Sbjct: 3 DYYEVLGIDRDADEQEIKKAFRKLAKKYHPDRN-KAPDAASIFAEINEANDVLSNPKKRA 61
Query: 88 AYD 90
YD
Sbjct: 62 NYD 64
>gnl|CDD|182347 PRK10266, PRK10266, curved DNA-binding protein CbpA; Provisional.
Length = 306
Score = 45.2 bits (107), Expect = 5e-06
Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 5/65 (7%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPD--KNPDDEKAIETFHLLSKAIEVLLDKSA 85
D Y ++ + + I++AYR+ A K HPD K PD E F +++A EVL D+
Sbjct: 5 DYYAIMGVKPTDDLKTIKTAYRRLARKYHPDVSKEPDAE---ARFKEVAEAWEVLSDEQR 61
Query: 86 RAAYD 90
RA YD
Sbjct: 62 RAEYD 66
>gnl|CDD|172788 PRK14300, PRK14300, chaperone protein DnaJ; Provisional.
Length = 372
Score = 44.2 bits (104), Expect = 1e-05
Identities = 24/63 (38%), Positives = 37/63 (58%), Gaps = 1/63 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L ++ S+ D++ AY K A + HPD D + A + F ++ A +VL D+ RA
Sbjct: 4 DYYQILGVSKTASQADLKKAYLKLAKQYHPD-TTDAKDAEKKFKEINAAYDVLKDEQKRA 62
Query: 88 AYD 90
AYD
Sbjct: 63 AYD 65
>gnl|CDD|237667 PRK14299, PRK14299, chaperone protein DnaJ; Provisional.
Length = 291
Score = 44.2 bits (104), Expect = 1e-05
Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 1/64 (1%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDKSARA 87
D Y +L + + S+ +I+ A++K A K HPD N A E F +++A VL D R
Sbjct: 5 DYYAILGVPKNASQDEIKKAFKKLARKYHPDVNKSPG-AEEKFKEINEAYTVLSDPEKRR 63
Query: 88 AYDS 91
YD+
Sbjct: 64 IYDT 67
>gnl|CDD|227694 COG5407, SEC63, Preprotein translocase subunit Sec63 [Intracellular
trafficking and secretion].
Length = 610
Score = 43.8 bits (103), Expect = 2e-05
Identities = 27/83 (32%), Positives = 43/83 (51%), Gaps = 10/83 (12%)
Query: 17 LSSIMADIKDL-----DLYGLLELTIDCSEQDIRSAYRKKALKCHPDKNP---DDEKAI- 67
L S + +K D Y +L + D SE+DI+ YR ++K HPDK P ++ ++
Sbjct: 83 LISNIRTLKIEYRRGFDPYEILGIDQDTSERDIKKRYRMLSMKFHPDKAPPMVNELRSEY 142
Query: 68 -ETFHLLSKAIEVLLDKSARAAY 89
E + ++KA +L DK R Y
Sbjct: 143 EEKYKTITKAYGLLTDKKRRENY 165
>gnl|CDD|222986 PHA03102, PHA03102, Small T antigen; Reviewed.
Length = 153
Score = 40.4 bits (95), Expect = 8e-05
Identities = 18/47 (38%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 32 LLELTIDC--SEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKA 76
LL L + +R AY +K L+ HPDK D+EK E L K
Sbjct: 10 LLGLPRSAWGNLPLMRKAYLRKCLEFHPDKGGDEEKMKELNTLYKKF 56
>gnl|CDD|222912 PHA02624, PHA02624, large T antigen; Provisional.
Length = 647
Score = 40.4 bits (95), Expect = 2e-04
Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 2/40 (5%)
Query: 29 LYGLLELTIDC--SEQDIRSAYRKKALKCHPDKNPDDEKA 66
L LL L + + +R AY +K + HPDK D+EK
Sbjct: 13 LMDLLGLPMAAWGNLPLMRKAYLRKCKEYHPDKGGDEEKM 52
>gnl|CDD|237659 PRK14287, PRK14287, chaperone protein DnaJ; Provisional.
Length = 371
Score = 40.0 bits (93), Expect = 3e-04
Identities = 24/65 (36%), Positives = 35/65 (53%), Gaps = 5/65 (7%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPD--KNPDDEKAIETFHLLSKAIEVLLDKSA 85
D Y +L + + S +++ AYRK A K HPD K PD E + F + +A + L D
Sbjct: 5 DYYEVLGVDRNASVDEVKKAYRKLARKYHPDVNKAPDAE---DKFKEVKEAYDTLSDPQK 61
Query: 86 RAAYD 90
+A YD
Sbjct: 62 KAHYD 66
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 36.9 bits (85), Expect = 0.004
Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 5/75 (6%)
Query: 23 DIKDLDLYGLLELTI---DCSEQDIRSAYRKKALKCHPDKNPDDEK--AIETFHLLSKAI 77
+ K +DLY LL L+ I A++KK K HPDK E F L+ KA
Sbjct: 39 NWKKVDLYALLGLSKYRTKAIPPQILKAHKKKVYKYHPDKTAAGGNKGCDEFFKLIQKAR 98
Query: 78 EVLLDKSARAAYDSV 92
EVL D+ R YDS
Sbjct: 99 EVLGDRKLRLQYDSN 113
>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
protein; Provisional.
Length = 196
Score = 33.4 bits (76), Expect = 0.041
Identities = 29/85 (34%), Positives = 43/85 (50%), Gaps = 7/85 (8%)
Query: 57 PDKNPDD-EKAIETFHLLSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKE 115
PD N DD E+A+ T +K + VL++ YD V++ KEE K +K A K +
Sbjct: 111 PDLNTDDVEEAMHTIAGTAKNMGVLVE-----GYDDVVKAKEEAKAA-AKAAALAKAKEA 164
Query: 116 DLERREKEAENERYKGYKVNKSEEE 140
L+ E E + + K +VN EE
Sbjct: 165 SLKSAEAELKASKGKSIEVNVIGEE 189
>gnl|CDD|215770 pfam00176, SNF2_N, SNF2 family N-terminal domain. This domain is
found in proteins involved in a variety of processes
including transcription regulation (e.g., SNF2, STH1,
brahma, MOT1), DNA repair (e.g. ERCC6, RAD16, RAD5), DNA
recombination (e.g. RAD54), and chromatin unwinding
(e.g. ISWI) as well as a variety of other proteins with
little functional information (e.g. lodestar, ETL1).
Length = 301
Score = 32.3 bits (74), Expect = 0.098
Identities = 25/128 (19%), Positives = 50/128 (39%), Gaps = 7/128 (5%)
Query: 28 DLYGLLELTIDC----SEQDIRSAYRKKALKCHPDKNPDDEKAIETFHLLSKAIEVLLDK 83
+L+ LL + S + + +KN + EK E + L K ++ L +
Sbjct: 169 ELWALLNF-LRPGPFGSFKVFEEWFNIPIANTADNKNKNLEKGKEGINRLHKLLKPFLLR 227
Query: 84 SARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQ 143
+ + + K E + L ++K + L + + A + +G + N +L
Sbjct: 228 RTKDDVEKSLPPKTEH-VLYCNLSDEQRKLYKKLLTKRRLALSFAVEGGEKNVGIASLL- 285
Query: 144 QEIDRLRK 151
I +LRK
Sbjct: 286 NLIMQLRK 293
>gnl|CDD|236512 PRK09430, djlA, Dna-J like membrane chaperone protein; Provisional.
Length = 267
Score = 31.7 bits (73), Expect = 0.16
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 28 DLYGLLELTIDCSEQDIRSAYRKKALKCHPDK 59
D Y +L ++ +Q+I+ AYRK + HPDK
Sbjct: 201 DAYKVLGVSESDDDQEIKRAYRKLMSEHHPDK 232
>gnl|CDD|219083 pfam06548, Kinesin-related, Kinesin-related. This family
represents a region within kinesin-related proteins from
higher plants. Many family members also contain the
pfam00225 domain. Kinesins are ATP-driven microtubule
motor proteins that produce directed force. Some family
members are associated with the phragmoplast, a
structure composed mainly of microtubules that executes
cytokinesis in higher plants.
Length = 488
Score = 30.6 bits (69), Expect = 0.43
Identities = 16/62 (25%), Positives = 31/62 (50%)
Query: 98 EVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEGKYPL 157
E K+ S + + KE +E + + E E +K + + E+E+L +EI L+ + + L
Sbjct: 183 EHKLLKSDYENHPEVLKETIELKRIQEELESFKNFYFDMGEKEVLLEEIQDLKSQLQCML 242
Query: 158 KP 159
Sbjct: 243 SS 244
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 29.5 bits (67), Expect = 0.57
Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 4/68 (5%)
Query: 89 YDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKE--AENERYKGYKVNKSEEEI--LQQ 144
V+ K +L+ KK + +L+++EKE E ++ + SEE QQ
Sbjct: 23 VQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQ 82
Query: 145 EIDRLRKE 152
E+ + ++E
Sbjct: 83 ELQQKQQE 90
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 30.2 bits (68), Expect = 0.66
Identities = 21/94 (22%), Positives = 42/94 (44%), Gaps = 2/94 (2%)
Query: 59 KNPDDEKAIETFHLLSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLE 118
K+ + + + + E+L K R A V K E+K ++ + +++++++ +
Sbjct: 330 KDRGSHEDGDYYEVDINGPEILECK-YRVAVSEVGELKAELKALKARYNELQERYEDEKD 388
Query: 119 RREKEAENERYKGYKVNKSEEEILQQEIDRLRKE 152
R E +N K + KS E Q+ L KE
Sbjct: 389 RLRNEIQNLAEKLLSLEKSSHE-DQERSGDLEKE 421
>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
Provisional.
Length = 693
Score = 30.0 bits (69), Expect = 0.71
Identities = 18/56 (32%), Positives = 30/56 (53%), Gaps = 5/56 (8%)
Query: 73 LSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENER 128
L A++ L + + AY I K+E R + LDA +++ E L E+E E+E+
Sbjct: 241 LKAAVKELAEAKLKEAYQ--ITDKQE---REAALDAIKEEVLEALAAEEEEEEDEK 291
>gnl|CDD|150256 pfam09520, RE_MjaII, MjaII restriction endonuclease. This family
includes the MjaII (recognises GGNCC but the cleavage
site is unknown) restriction endonuclease.
Length = 252
Score = 29.3 bits (66), Expect = 0.88
Identities = 12/38 (31%), Positives = 19/38 (50%)
Query: 122 KEAENERYKGYKVNKSEEEILQQEIDRLRKEGKYPLKP 159
AE + K K++++ + + ID LRK P KP
Sbjct: 84 AVAERQYDKQGKIDQAAIAKIDEIIDDLRKGENLPKKP 121
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 29.4 bits (67), Expect = 1.2
Identities = 16/81 (19%), Positives = 39/81 (48%), Gaps = 19/81 (23%)
Query: 88 AYDSVIRRKE----EVKIRNSKLDATRKKFK----------EDLERREKEAENERYKGYK 133
A + + + + E++ R ++L K+ E LE+RE+E E + +
Sbjct: 62 AKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKE---KE 118
Query: 134 VNKSEEEI--LQQEIDRLRKE 152
+ + ++E+ ++E++ L +E
Sbjct: 119 LEQKQQELEKKEEELEELIEE 139
Score = 28.6 bits (65), Expect = 2.2
Identities = 17/71 (23%), Positives = 36/71 (50%), Gaps = 12/71 (16%)
Query: 94 RRKEEVKIRNSKLDATRKKF---KEDLERREKEAEN---------ERYKGYKVNKSEEEI 141
R+ E ++ R +L+ K+ +++LE++E+E E ER G +++E +
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEIL 159
Query: 142 LQQEIDRLRKE 152
L++ + R E
Sbjct: 160 LEKVEEEARHE 170
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 29.1 bits (66), Expect = 1.5
Identities = 13/54 (24%), Positives = 33/54 (61%), Gaps = 4/54 (7%)
Query: 99 VKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKE 152
+ ++L+ +K +++++R EK+ NE + V K+ EE++++E ++L +
Sbjct: 812 LAAELARLEKELEKLEKEIDRIEKKLSNEGF----VAKAPEEVVEKEKEKLAEY 861
>gnl|CDD|235012 PRK02201, PRK02201, putative inner membrane protein translocase
component YidC; Provisional.
Length = 357
Score = 28.8 bits (65), Expect = 1.6
Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 118 ERREKEAE-NERYKGYKVNKSEEEILQQEIDRL-RKEGKYPLKPVLYVF 164
E + K+A+ + +YK YK +K ++ QQEI L +K P P + +F
Sbjct: 164 ELQGKKAKIDAKYKDYKKDKQMKQRKQQEIQELYKKHNISPFSPFVQMF 212
>gnl|CDD|218791 pfam05879, RHD3, Root hair defective 3 GTP-binding protein (RHD3).
This family consists of several eukaryotic root hair
defective 3 like GTP-binding proteins. It has been
speculated that the RHD3 protein is a member of a novel
class of GTP-binding proteins that is widespread in
eukaryotes and required for regulated cell enlargement.
The family also contains the homologous yeast synthetic
enhancement of YOP1 (SEY1) protein which is involved in
membrane trafficking.
Length = 743
Score = 28.6 bits (64), Expect = 2.2
Identities = 21/72 (29%), Positives = 37/72 (51%), Gaps = 3/72 (4%)
Query: 73 LSKAIEVLLDKSARAAYDSVIRR-KEEVKIRNSKLDATRKKFKEDLERREKEAENERYKG 131
LS+ +E+LL+K + +D+V++ K EV+ S+ + F E +K +N + K
Sbjct: 438 LSEPVELLLNKLNKETWDTVLKLFKREVEDAVSRFTDRKSGFDLSEEENDKMLKNLKRKS 497
Query: 132 YKV--NKSEEEI 141
+ V K EE
Sbjct: 498 WGVLRTKIHEEA 509
>gnl|CDD|193187 pfam12711, Kinesin-relat_1, Kinesin motor. This family is closely
related to Kinesin-related, pfam06548.
Length = 86
Score = 26.8 bits (60), Expect = 2.4
Identities = 15/58 (25%), Positives = 29/58 (50%), Gaps = 5/58 (8%)
Query: 96 KEEVKIRNSKLDATRK--KFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRK 151
KEE+K+ +++ + +F + R +E R + + + E E+L +EI LR
Sbjct: 30 KEEIKLLREQVEHNPEVTRFALENIRLREE--LRRLQNFYE-QGEREMLLEEISELRD 84
>gnl|CDD|234271 TIGR03591, polynuc_phos, polyribonucleotide nucleotidyltransferase.
Members of this protein family are polyribonucleotide
nucleotidyltransferase, also called polynucleotide
phosphorylase. Some members have been shown also to have
additional functions as guanosine pentaphosphate
synthetase and as poly(A) polymerase (see model
TIGR02696 for an exception clade, within this family)
[Transcription, Degradation of RNA].
Length = 684
Score = 28.2 bits (64), Expect = 2.4
Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 8/78 (10%)
Query: 72 LLSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDL---ERREKEAENER 128
L +K E+ + + AY I K+E R + LDA +++ E L E E+EA+ E+
Sbjct: 233 LKAKVKELAYEAGLKEAYQ--ITEKQE---RYAALDAIKEEVLEALAEEEEEEEEADREK 287
Query: 129 YKGYKVNKSEEEILQQEI 146
E++I+++ I
Sbjct: 288 EIKEAFKDLEKKIVRERI 305
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 28.4 bits (64), Expect = 2.6
Identities = 13/42 (30%), Positives = 23/42 (54%), Gaps = 7/42 (16%)
Query: 110 RKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRK 151
+KK+K+D+E+R + Y+ Y K +E LQ ++ R
Sbjct: 86 KKKYKKDVEKRNR-----SYRLYLDKKRKE--LQALSEKQRH 120
>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
(trigger factor) [Posttranslational modification,
protein turnover, chaperones].
Length = 441
Score = 28.0 bits (63), Expect = 2.6
Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 8/58 (13%)
Query: 103 NSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEE--------ILQQEIDRLRKE 152
L+ ++K +++LER KEA E+ K ++ E +++ EID L K+
Sbjct: 259 EDTLEELKEKLRKNLERELKEATLEKRKEQLLDALVEANDFDLPESLVEAEIDNLLKQ 316
>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A.
Length = 427
Score = 27.9 bits (63), Expect = 3.2
Identities = 22/98 (22%), Positives = 38/98 (38%), Gaps = 24/98 (24%)
Query: 64 EKAIETFHLLSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKE 123
EK E H+L ++ L +++ +D KK K++L E
Sbjct: 330 EKLEERLHILEGLLKALNK------------IDFVIEVIRGSID--LKKAKKELIEELSE 375
Query: 124 AENE--------RYKGYKVNKSEEEI--LQQEIDRLRK 151
+ + R ++ K E+EI L++EI L K
Sbjct: 376 IQADYLLDMRLRRLTKEEIEKLEKEIEELEKEIAELEK 413
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 27.7 bits (62), Expect = 3.7
Identities = 13/67 (19%), Positives = 29/67 (43%)
Query: 86 RAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQE 145
+ + + SK + +K +++ ++KE + E K ++ + EE I + E
Sbjct: 297 KKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLE 356
Query: 146 IDRLRKE 152
+ KE
Sbjct: 357 VQATDKE 363
>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118). This is
the highly conserved family of the major tail subunit
protein.
Length = 121
Score = 27.0 bits (59), Expect = 3.7
Identities = 12/56 (21%), Positives = 31/56 (55%)
Query: 97 EEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKE 152
++ KI ++ R K + + E+R+KE++ ER K ++ ++ ++ D + ++
Sbjct: 39 DDSKIVRVEMKEERDKMETEREKRDKESKEERDKFISTMNEQQRLMDKQNDMMGQQ 94
>gnl|CDD|134335 PRK00625, PRK00625, shikimate kinase; Provisional.
Length = 173
Score = 27.0 bits (60), Expect = 4.3
Identities = 18/45 (40%), Positives = 21/45 (46%), Gaps = 5/45 (11%)
Query: 118 ERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEGKY--PLKPV 160
+R +K ER K S EEIL Q IDR+R Y L V
Sbjct: 111 QRLQKRGLPERLKH---APSLEEILSQRIDRMRSIADYIFSLDHV 152
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 27.7 bits (61), Expect = 4.5
Identities = 13/38 (34%), Positives = 21/38 (55%)
Query: 91 SVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENER 128
S + +K E + +K +A +K +E +EKE E ER
Sbjct: 575 SKLAKKREEAVEKAKREAEQKAREEREREKEKEKERER 612
>gnl|CDD|180530 PRK06328, PRK06328, type III secretion system protein; Validated.
Length = 223
Score = 27.1 bits (60), Expect = 4.8
Identities = 18/74 (24%), Positives = 41/74 (55%), Gaps = 1/74 (1%)
Query: 88 AYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENERYK-GYKVNKSEEEILQQEI 146
A+ +++ +E ++ +A ++ E+ E+ +EA+N+ +K G K + L++E
Sbjct: 24 AFSALLDAQELLEKTKEDSEAYTQETHEECEKLREEAKNQGFKEGSKAWSKQLAFLEEET 83
Query: 147 DRLRKEGKYPLKPV 160
+LR++ K L P+
Sbjct: 84 QKLREQVKEALVPL 97
>gnl|CDD|148788 pfam07374, DUF1492, Protein of unknown function (DUF1492). This
family consists of several hypothetical, highly
conserved Streptococcal and related phage proteins of
around 100 residues in length. The function of this
family is unknown. It appears to be distantly related to
pfam08281.
Length = 100
Score = 26.3 bits (58), Expect = 5.0
Identities = 14/66 (21%), Positives = 29/66 (43%), Gaps = 11/66 (16%)
Query: 114 KEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEGKYPLKPVLYVFRINLWLISY 173
KE +E+ EA + + L + ID+L + ++Y+ ++ +W +
Sbjct: 31 KEAIEQDTAEAIKRKLE-----------LGRLIDKLENPDSRTILRMVYIDKMTVWQVCD 79
Query: 174 QGRCSR 179
+ CSR
Sbjct: 80 KLNCSR 85
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 26.8 bits (60), Expect = 5.1
Identities = 12/59 (20%), Positives = 29/59 (49%)
Query: 96 KEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEGK 154
K E++ KL+ +++ ++ + E + E + + + EE+ EI L+K+ +
Sbjct: 122 KSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQ 180
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 26.8 bits (60), Expect = 5.2
Identities = 14/67 (20%), Positives = 30/67 (44%), Gaps = 4/67 (5%)
Query: 90 DSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAEN--ERYKGYKVNKSEEEI--LQQE 145
+++ K +L+ KK + +LE+ EKE + E+ + SE ++E
Sbjct: 7 QKILQESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKE 66
Query: 146 IDRLRKE 152
+ + +E
Sbjct: 67 LQKKVQE 73
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 27.2 bits (61), Expect = 5.2
Identities = 12/57 (21%), Positives = 29/57 (50%), Gaps = 2/57 (3%)
Query: 98 EVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEI--LQQEIDRLRKE 152
E+K R ++L ++ + E +++ E+ K + K E+E+ ++ +D +E
Sbjct: 70 ELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEE 126
>gnl|CDD|182904 PRK11020, PRK11020, hypothetical protein; Provisional.
Length = 118
Score = 26.1 bits (58), Expect = 5.7
Identities = 17/55 (30%), Positives = 25/55 (45%), Gaps = 7/55 (12%)
Query: 96 KEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLR 150
K E+K + +LDA R K R + E Y + E+ L+ EI RL+
Sbjct: 4 KNEIKRLSDRLDAIRHKLAAASLRGDAEK-------YAQFEKEKATLEAEIARLK 51
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 27.3 bits (61), Expect = 6.0
Identities = 20/109 (18%), Positives = 52/109 (47%), Gaps = 13/109 (11%)
Query: 56 HPDKNPDDEKAI-ETFHL---LSKAIEVLLDKSARAAYDSVIRRKEEVKIR--------N 103
K D+ + + ET ++ L KA+E+L + + I +K E K+
Sbjct: 170 LQLKEKDELQEVLETVNIEKRLKKALELLKKELELLKLQNKITKKVEEKMEKTQREYYLR 229
Query: 104 SKLDATRKKFKEDLERREKEAE-NERYKGYKVNKSEEEILQQEIDRLRK 151
+L A +K+ + + +++ + E+ + K+ + ++++++E+ +L
Sbjct: 230 EQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEVKKVIEKELTKLSL 278
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 26.9 bits (60), Expect = 6.2
Identities = 21/99 (21%), Positives = 42/99 (42%), Gaps = 6/99 (6%)
Query: 57 PDKNPDDEKAIETFHLLSKAIEVLLDKSARAAYDSVIRR-KEEVKIRNSKLDATRKKFKE 115
+P ++ + F + I+ + + + + +S+I KEE+ + KL + + +E
Sbjct: 132 TASDPVVQELVSIF---NDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEE 188
Query: 116 DLERREKEAENERYKGY--KVNKSEEEILQQEIDRLRKE 152
+LER KE E ++ E +LR E
Sbjct: 189 ELERALKEKREELLSKLEEELLARLESKEAALEKQLRLE 227
>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
Length = 900
Score = 27.1 bits (61), Expect = 6.7
Identities = 10/57 (17%), Positives = 25/57 (43%), Gaps = 10/57 (17%)
Query: 97 EEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEG 153
E + + +L T ++F E+ + ++K E E+ + +E+ + L +
Sbjct: 750 EILGVPPEQLPKTVERFFEEWKEQKK--EIEKLR--------KELAELLASELLSKA 796
>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
Length = 535
Score = 26.8 bits (60), Expect = 6.8
Identities = 18/67 (26%), Positives = 24/67 (35%), Gaps = 4/67 (5%)
Query: 75 KAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKV 134
K E + + AR + I KEE I KK +E E E+E E E
Sbjct: 473 KTFEKYIPEIARKL--AEILEKEEPDIEEVLAKLIGKKLEE--EEVEEEEEEEAVVVESA 528
Query: 135 NKSEEEI 141
E+
Sbjct: 529 KNYTEKD 535
>gnl|CDD|232945 TIGR00376, TIGR00376, DNA helicase, putative. The gene product may
represent a DNA helicase. Eukaryotic members of this
family have been characterized as binding certain
single-stranded G-rich DNA sequences (GGGGT and GGGCT).
A number of related proteins are characterized as
helicases [DNA metabolism, DNA replication,
recombination, and repair].
Length = 637
Score = 27.1 bits (60), Expect = 6.8
Identities = 16/79 (20%), Positives = 33/79 (41%), Gaps = 6/79 (7%)
Query: 80 LLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEE 139
LL + + + D +I + + R+K E +E R K+ + + S+
Sbjct: 237 LLKSNKQHSLDYLIENHPK----YQIVADIREKIDELIEERNKKLKP--SPQKRRGLSDI 290
Query: 140 EILQQEIDRLRKEGKYPLK 158
+IL++ + + G LK
Sbjct: 291 KILRKALKKREARGIESLK 309
>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 215
Score = 26.5 bits (58), Expect = 7.4
Identities = 14/55 (25%), Positives = 27/55 (49%), Gaps = 1/55 (1%)
Query: 114 KEDLERREKEAEN-ERYKGYKVNKSEEEILQQEIDRLRKEGKYPLKPVLYVFRIN 167
K ++ RE E K ++ + ++ QQ+ L KE K +K ++ + R+N
Sbjct: 69 KSRVDAREAYRIAAEENKAEQLERDKQLSEQQKQAALAKEYKAQVKQLIEMNRLN 123
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 26.7 bits (59), Expect = 8.3
Identities = 18/84 (21%), Positives = 34/84 (40%), Gaps = 6/84 (7%)
Query: 73 LSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGY 132
++ IE L + E++ KL ++ E+L+ RE+ + E K
Sbjct: 513 INVLIEKLSALEKE--LEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEA 570
Query: 133 --KVNKSEEEILQQEIDRLRKEGK 154
+ ++E+ + I R KE K
Sbjct: 571 QEALKALKKEV--ESIIRELKEKK 592
>gnl|CDD|216282 pfam01071, GARS_A, Phosphoribosylglycinamide synthetase, ATP-grasp
(A) domain. Phosphoribosylglycinamide synthetase
catalyzes the second step in the de novo biosynthesis of
purine. The reaction catalyzed by
Phosphoribosylglycinamide synthetase is the ATP-
dependent addition of 5-phosphoribosylamine to glycine
to form 5'phosphoribosylglycinamide. This domain is
related to the ATP-grasp domain of biotin
carboxylase/carbamoyl phosphate synthetase (see
pfam02786).
Length = 193
Score = 26.1 bits (58), Expect = 8.6
Identities = 13/25 (52%), Positives = 16/25 (64%), Gaps = 1/25 (4%)
Query: 139 EEILQQEIDRLRKEGKYPLKPVLYV 163
E I++ +D LRKEG P K VLY
Sbjct: 147 ETIVEPTVDGLRKEG-IPYKGVLYA 170
>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS). This family
consists of several mammalian selenoprotein S (SelS)
sequences. SelS is a plasma membrane protein and is
present in a variety of tissues and cell types. The
function of this family is unknown.
Length = 190
Score = 26.4 bits (58), Expect = 8.6
Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 10/61 (16%)
Query: 97 EEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEI---DRLRKEG 153
++V R L+A R + +EDL +A+ ++K K + EEE +Q+I DR+ +EG
Sbjct: 76 DKVVRRQEALEAARLRMQEDL-----DAKAAKFK-EKQKQLEEEKRRQKIEMWDRM-QEG 128
Query: 154 K 154
K
Sbjct: 129 K 129
>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component. Nup88 can be divided
into two structural domains; the N-terminal two-thirds
of the protein has no obvious structural motifs but is
the region for binding to Nup98, one of the components
of the nuclear pore. the C-terminal end is a predicted
coiled-coil domain. Nup88 is overexpressed in tumour
cells.
Length = 717
Score = 26.8 bits (59), Expect = 9.0
Identities = 19/88 (21%), Positives = 38/88 (43%), Gaps = 12/88 (13%)
Query: 68 ETFHLLSKAIEVLLDKSARAAYDSVIRR----KEEVKIRNSKLDATRKKFKEDLERREKE 123
E LLS+A +V R Y + + +EE + R L ++K ED++ +E
Sbjct: 536 ECLQLLSRATQVF-----REQY---LLKHDLAREEFQRRVKLLQLQKEKQLEDIQDCREE 587
Query: 124 AENERYKGYKVNKSEEEILQQEIDRLRK 151
++ + K+ + EE + + +
Sbjct: 588 RKSLSERAEKLAEKFEEAKYNQELLVNR 615
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 26.3 bits (59), Expect = 9.0
Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 15/69 (21%)
Query: 95 RKEEVKIRNSKLDATRKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEGK 154
R E ++ RNS+L+ ++ + L R+ EAENE+ ++E++RL+ E +
Sbjct: 2 RLEALEERNSELEEQIRQLELKL--RDLEAENEKL-------------ERELERLKSELE 46
Query: 155 YPLKPVLYV 163
P L V
Sbjct: 47 KLKSPPLIV 55
>gnl|CDD|236828 PRK11034, clpA, ATP-dependent Clp protease ATP-binding subunit;
Provisional.
Length = 758
Score = 26.7 bits (59), Expect = 9.2
Identities = 15/37 (40%), Positives = 21/37 (56%)
Query: 65 KAIETFHLLSKAIEVLLDKSARAAYDSVIRRKEEVKI 101
K I HL KAI+V+ + ARA V +RK+ V +
Sbjct: 387 KYINDRHLPDKAIDVIDEAGARARLMPVSKRKKTVNV 423
>gnl|CDD|234272 TIGR03592, yidC_oxa1_cterm, membrane protein insertase, YidC/Oxa1
family, C-terminal domain. This model describes
full-length from some species, and the C-terminal region
only from other species, of the YidC/Oxa1 family of
proteins. This domain appears to be univeral among
bacteria (although absent from Archaea). The
well-characterized YidC protein from Escherichia coli
and its close homologs contain a large N-terminal
periplasmic domain in addition to the region modeled
here [Protein fate, Protein and peptide secretion and
trafficking].
Length = 181
Score = 25.9 bits (58), Expect = 9.4
Identities = 10/31 (32%), Positives = 19/31 (61%), Gaps = 1/31 (3%)
Query: 105 KLDATRKKFKEDLERREKEAENERYKGYKVN 135
KL ++K+K+D ++ ++E + YK VN
Sbjct: 37 KLKEIQEKYKDDPQKLQQEM-MKLYKEEGVN 66
>gnl|CDD|235855 PRK06707, PRK06707, amidase; Provisional.
Length = 536
Score = 26.4 bits (58), Expect = 9.5
Identities = 19/91 (20%), Positives = 36/91 (39%), Gaps = 2/91 (2%)
Query: 63 DEKAIETFHLLSKAIEVLLDKSARAAYDSVIRRKEEVKIRNSKLDATRKKFKEDL-ERRE 121
+ ++ L + ++ + ++ EE+ N K R K+ + L E E
Sbjct: 376 NNGGVDNLQTLEYEFKHNVNDYFSQQKNVPVKSLEEIIAFNKKDSKRRIKYGQTLIEASE 435
Query: 122 KEAENERYKGYKVNKSEEEILQQEIDRLRKE 152
K A + V S+E ++E+DR E
Sbjct: 436 KSAITKDEFEKVVQTSQE-NAKKELDRYLVE 465
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for centrosome
separation and maintenance of spindle bi-polarity. TPX2
is a microtubule-associated protein that mediates the
binding of the C-terminal domain of Xklp2 to
microtubules. It is phosphorylated during mitosis in a
microtubule-dependent way.
Length = 57
Score = 24.7 bits (54), Expect = 9.7
Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 3/51 (5%)
Query: 110 RKKFKEDLERREKEAENERYKGYKVNKSEEEILQQEIDRLRKEGKYPLKPV 160
R +F + LE +EK E E+ + K EEE + I +LRKE P+
Sbjct: 10 RAEFDKKLEEKEKALEAEKEEAEARQKEEEE---EAIKQLRKELVPKANPM 57
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.135 0.386
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,322,233
Number of extensions: 881725
Number of successful extensions: 1958
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1833
Number of HSP's successfully gapped: 325
Length of query: 180
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 89
Effective length of database: 6,901,388
Effective search space: 614223532
Effective search space used: 614223532
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (25.5 bits)