RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16770
(554 letters)
>gnl|CDD|239261 cd02963, TRX_DnaJ, TRX domain, DnaJ domain containing protein
family; composed of uncharacterized proteins of about
500-800 amino acids, containing an N-terminal DnaJ
domain followed by one redox active TRX domain. DnaJ is
a member of the 40 kDa heat-shock protein (Hsp40) family
of molecular chaperones, which regulate the activity of
Hsp70s. TRX is involved in the redox regulation of many
protein substrates through the reduction of disulfide
bonds. TRX has been implicated to catalyse the reduction
of Hsp33, a chaperone holdase that binds to unfolded
protein intermediates. The presence of DnaJ and TRX
domains in members of this family suggests that they
could be involved in a redox-regulated chaperone
network.
Length = 111
Score = 130 bits (328), Expect = 4e-36
Identities = 45/112 (40%), Positives = 72/112 (64%), Gaps = 1/112 (0%)
Query: 124 ISLFHKLSTTHWNFEKNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGF 183
S +K S T +E +PKS+ P+LI SDWCF+C+ +EP++K+++ EL PLGVG
Sbjct: 1 DSFDYKYSLTFSQYENEIVPKSFKKPYLIKITSDWCFSCIHIEPVWKEVIQELEPLGVGI 60
Query: 184 FTVHVHNEQGLARRLGVGSQLPQIALLTDGRTSFFKEPSFSVQKMVEFFRLK 235
TV+ +E+ LAR+LG +P I + +G+ +F+ + SF+ Q +V+F R
Sbjct: 61 ATVNAGHERRLARKLGA-HSVPAIVGIINGQVTFYHDSSFTKQHVVDFVRKL 111
>gnl|CDD|233829 TIGR02349, DnaJ_bact, chaperone protein DnaJ. This model
represents bacterial forms of DnaJ, part of the
DnaK-DnaJ-GrpE chaperone system. The three components
typically are encoded by consecutive genes. DnaJ
homologs occur in many genomes, typically not near DnaK
and GrpE-like genes; most such genes are not included by
this family. Eukaryotic (mitochondrial and chloroplast)
forms are not included in the scope of this family.
Length = 354
Score = 120 bits (302), Expect = 8e-30
Identities = 42/103 (40%), Positives = 58/103 (56%), Gaps = 3/103 (2%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
D YE LGV + AS +EI+K Y++L ++HPD+N D A+EKF ++ EAY +LSD E+R Q
Sbjct: 1 DYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNKDKEAEEKFKEINEAYEVLSDPEKRAQ 60
Query: 80 YDLFGTTD---GFSGQDSASRNFHNHMYNPFDDVFSEGFNFPF 119
YD FG G G F + F D+F + F
Sbjct: 61 YDQFGHAGFNGGGGGGGGGFNGFDIGFFGDFGDIFGDFFGGGG 103
>gnl|CDD|223560 COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn
finger domain [Posttranslational modification, protein
turnover, chaperones].
Length = 371
Score = 120 bits (302), Expect = 1e-29
Identities = 41/97 (42%), Positives = 56/97 (57%), Gaps = 3/97 (3%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNN-DPTAQEKFLQLTEAYNILSDAERRK 78
D YE LGV + AS +EI+K Y++L ++HPD+N D A+EKF ++ EAY +LSD E+R
Sbjct: 5 DYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNPGDKEAEEKFKEINEAYEVLSDPEKRA 64
Query: 79 QYDLFGTTDGFSGQDSASRNFHNHMYNPFDDVFSEGF 115
YD FG GF F F D+F + F
Sbjct: 65 AYDQFG-HAGFKAGGFGGFGF-GGFGGDFGDIFEDFF 99
>gnl|CDD|237661 PRK14291, PRK14291, chaperone protein DnaJ; Provisional.
Length = 382
Score = 108 bits (272), Expect = 9e-26
Identities = 42/104 (40%), Positives = 58/104 (55%), Gaps = 7/104 (6%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
D YE LGV R A+ +EI+K Y+RL ++HPD N +P A+EKF ++ EAY +LSD E+RK
Sbjct: 4 DYYEILGVSRNATQEEIKKAYRRLARKYHPDFNKNPEAEEKFKEINEAYQVLSDPEKRKL 63
Query: 80 YDLFGTTDGFSGQDSASRNFHNHMYNPF---DDVFSE---GFNF 117
YD FG FSG + +D+ + F F
Sbjct: 64 YDQFG-HAAFSGSGQQQQGQEGFSDFGGGNIEDILEDVFDIFGF 106
>gnl|CDD|215804 pfam00226, DnaJ, DnaJ domain. DnaJ domains (J-domains) are
associated with hsp70 heat-shock system and it is
thought that this domain mediates the interaction.
DnaJ-domain is therefore part of a chaperone (protein
folding) system. The T-antigens, although not in
Prosite are confirmed as DnaJ containing domains from
literature.
Length = 63
Score = 99.1 bits (248), Expect = 2e-25
Identities = 36/63 (57%), Positives = 48/63 (76%), Gaps = 1/63 (1%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
D YE LGVPR AS +EI+K Y++L +++HPDKN DP A+EKF ++ EAY +LSD E+R
Sbjct: 1 DYYEILGVPRDASDEEIKKAYRKLALKYHPDKNPGDPAAEEKFKEINEAYEVLSDPEKRA 60
Query: 79 QYD 81
YD
Sbjct: 61 IYD 63
>gnl|CDD|184612 PRK14298, PRK14298, chaperone protein DnaJ; Provisional.
Length = 377
Score = 104 bits (262), Expect = 2e-24
Identities = 45/100 (45%), Positives = 66/100 (66%), Gaps = 1/100 (1%)
Query: 16 AVSLDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAE 75
A + D YE LG+ + AS+++I+K Y++L +++HPDKN +P A+EKF +++EAY +LSDAE
Sbjct: 2 ATTRDYYEILGLSKDASVEDIKKAYRKLAMKYHPDKNKEPDAEEKFKEISEAYAVLSDAE 61
Query: 76 RRKQYDLFGTTDGFSGQDSASRNFHNHMYNPFDDVFSEGF 115
+R QYD FG G Q SA F + F D+F F
Sbjct: 62 KRAQYDRFGHA-GIDNQYSAEDIFRGADFGGFGDIFEMFF 100
>gnl|CDD|184604 PRK14283, PRK14283, chaperone protein DnaJ; Provisional.
Length = 378
Score = 102 bits (257), Expect = 9e-24
Identities = 44/103 (42%), Positives = 65/103 (63%), Gaps = 10/103 (9%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
D YE LGV R A +EI+K Y++L ++HPD + + A+EKF +++EAY +LSD E+R++
Sbjct: 6 DYYEVLGVDRNADKKEIKKAYRKLARKYHPDVSEEEGAEEKFKEISEAYAVLSDDEKRQR 65
Query: 80 YDLFGTT--DGFSGQDSASRNFHNHMYNPFDDVFSEGFNFPFE 120
YD FG DGFS +D N++ F+D+F GF F
Sbjct: 66 YDQFGHAGMDGFSQED-----IFNNI--NFEDIFQ-GFGFGIG 100
>gnl|CDD|236757 PRK10767, PRK10767, chaperone protein DnaJ; Provisional.
Length = 371
Score = 101 bits (254), Expect = 3e-23
Identities = 41/105 (39%), Positives = 58/105 (55%), Gaps = 17/105 (16%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
D YE LGV R AS EI+K Y++L +++HPD+N D A+EKF ++ EAY +LSD ++R
Sbjct: 5 DYYEVLGVSRNASEDEIKKAYRKLAMKYHPDRNPGDKEAEEKFKEIKEAYEVLSDPQKRA 64
Query: 79 QYDLFG--------TTDGFSGQDSASRNFHNHMYNPFDDVFSEGF 115
YD +G GF G F + F D+F + F
Sbjct: 65 AYDQYGHAAFEQGGGGGGFGGGG----GFG----DIFGDIFGDIF 101
>gnl|CDD|237662 PRK14292, PRK14292, chaperone protein DnaJ; Provisional.
Length = 371
Score = 99.6 bits (248), Expect = 1e-22
Identities = 45/99 (45%), Positives = 61/99 (61%), Gaps = 7/99 (7%)
Query: 19 LDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRK 78
+D YE LGV RTAS EI+ Y++L +++HPD+N + A EKF Q+ EAY +LSDAE+R
Sbjct: 2 MDYYELLGVSRTASADEIKSAYRKLALKYHPDRNKEKGAAEKFAQINEAYAVLSDAEKRA 61
Query: 79 QYDLFGTTD--GFSGQDSASRNFHNHMYNPFDDVFSEGF 115
YD FGT G G D F ++P D+F + F
Sbjct: 62 HYDRFGTAPGAGMPGGDP----FGGMGFDPM-DIFEQLF 95
>gnl|CDD|237653 PRK14276, PRK14276, chaperone protein DnaJ; Provisional.
Length = 380
Score = 97.5 bits (243), Expect = 7e-22
Identities = 38/106 (35%), Positives = 58/106 (54%), Gaps = 16/106 (15%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
+ Y+ LGV + AS EI+K Y++L ++HPD N +P A+EK+ ++ EAY LSD ++R
Sbjct: 5 EYYDRLGVSKDASQDEIKKAYRKLSKKYHPDINKEPGAEEKYKEVQEAYETLSDPQKRAA 64
Query: 80 YDLFGTTD----------GFSGQDSASRNFHNHMYNPFDDVFSEGF 115
YD +G GF G D + + F+D+FS F
Sbjct: 65 YDQYGAAGANGGFGGGAGGFGGFDGSGG------FGGFEDIFSSFF 104
>gnl|CDD|237656 PRK14280, PRK14280, chaperone protein DnaJ; Provisional.
Length = 376
Score = 96.7 bits (241), Expect = 1e-21
Identities = 42/99 (42%), Positives = 59/99 (59%), Gaps = 5/99 (5%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
D YE LGV ++AS EI+K Y++L ++HPD N + A EKF +++EAY +LSD ++R Q
Sbjct: 5 DYYEVLGVSKSASKDEIKKAYRKLSKKYHPDINKEEGADEKFKEISEAYEVLSDDQKRAQ 64
Query: 80 YDLFGTTD---GFSGQDSASRNFHNHMYNPFDDVFSEGF 115
YD FG GF G +F F+D+FS F
Sbjct: 65 YDQFGHAGPNQGFGGGGFGGGDFGGGF--GFEDIFSSFF 101
>gnl|CDD|237668 PRK14301, PRK14301, chaperone protein DnaJ; Provisional.
Length = 373
Score = 96.0 bits (239), Expect = 2e-21
Identities = 42/105 (40%), Positives = 64/105 (60%), Gaps = 5/105 (4%)
Query: 16 AVSLDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDA 74
D YE LGV R AS EI+K Y++L +++HPD+N ++P A++KF + EAY +L DA
Sbjct: 1 MSQRDYYEVLGVSRDASEDEIKKAYRKLALQYHPDRNPDNPEAEQKFKEAAEAYEVLRDA 60
Query: 75 ERRKQYDLFGTTDGFSGQDSASRNFHNH--MYNPFDDVFSEGFNF 117
E+R +YD FG G +G F + +++ F D+F + F F
Sbjct: 61 EKRARYDRFG-HAGVNGNGGFG-GFSSAEDIFSHFSDIFGDLFGF 103
>gnl|CDD|237667 PRK14299, PRK14299, chaperone protein DnaJ; Provisional.
Length = 291
Score = 94.6 bits (235), Expect = 2e-21
Identities = 41/105 (39%), Positives = 57/105 (54%), Gaps = 6/105 (5%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
D Y LGVP+ AS EI+K +K+L ++HPD N P A+EKF ++ EAY +LSD E+R+
Sbjct: 5 DYYAILGVPKNASQDEIKKAFKKLARKYHPDVNKSPGAEEKFKEINEAYTVLSDPEKRRI 64
Query: 80 YDLFGTTDGFSGQDS------ASRNFHNHMYNPFDDVFSEGFNFP 118
YD +GTT +G +F F D F + F
Sbjct: 65 YDTYGTTAASAGWQGPPPGPPGGGDFSGFNVGDFSDFFQQLFGGR 109
>gnl|CDD|237664 PRK14294, PRK14294, chaperone protein DnaJ; Provisional.
Length = 366
Score = 94.4 bits (235), Expect = 5e-21
Identities = 42/100 (42%), Positives = 64/100 (64%), Gaps = 4/100 (4%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
D YE LGV R AS +EI+K+Y++L +++HPD+N D A+E F + EAY +LSD ++R
Sbjct: 5 DYYEILGVTRDASEEEIKKSYRKLAMKYHPDRNPGDKEAEELFKEAAEAYEVLSDPKKRG 64
Query: 79 QYDLFGTTDGFSGQD-SASRNFHNHMYNPFDDVFSEGFNF 117
YD +G +G SG S F + +++ F D+F + F F
Sbjct: 65 IYDQYG-HEGLSGTGFSGFSGFDD-IFSSFGDIFEDFFGF 102
>gnl|CDD|184603 PRK14282, PRK14282, chaperone protein DnaJ; Provisional.
Length = 369
Score = 94.5 bits (235), Expect = 5e-21
Identities = 40/102 (39%), Positives = 59/102 (57%), Gaps = 11/102 (10%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN--NDPTAQEKFLQLTEAYNILSDAERR 77
D YE LGV R A+ +EI++ YKRLV EWHPD++ N A++KF ++ EAY +LSD ++R
Sbjct: 5 DYYEILGVSRNATQEEIKRAYKRLVKEWHPDRHPENRKEAEQKFKEIQEAYEVLSDPQKR 64
Query: 78 KQYDLFG-TTDG-FSGQDSASRNFHNHMYNPFDDVFSEGFNF 117
YD FG + + + F F+D+F + N
Sbjct: 65 AMYDRFGYVGEQPPYQETESGGGF-------FEDIFKDFENI 99
>gnl|CDD|197617 smart00271, DnaJ, DnaJ molecular chaperone homology domain.
Length = 60
Score = 83.4 bits (207), Expect = 5e-20
Identities = 32/59 (54%), Positives = 44/59 (74%), Gaps = 2/59 (3%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN--NDPTAQEKFLQLTEAYNILSDAER 76
D YE LGVPR ASL EI+K Y++L +++HPDKN + A+EKF ++ EAY +LSD E+
Sbjct: 2 DYYEILGVPRDASLDEIKKAYRKLALKYHPDKNPGDKEEAEEKFKEINEAYEVLSDPEK 60
>gnl|CDD|237657 PRK14281, PRK14281, chaperone protein DnaJ; Provisional.
Length = 397
Score = 91.8 bits (228), Expect = 5e-20
Identities = 42/117 (35%), Positives = 68/117 (58%), Gaps = 16/117 (13%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
D YE LGV R+A EI+K Y++L +++HPDKN ++ A+E F ++ EAY +LS+ ++R+
Sbjct: 4 DYYEVLGVSRSADKDEIKKAYRKLALKYHPDKNPDNKEAEEHFKEVNEAYEVLSNDDKRR 63
Query: 79 QYDLFG--------TTDGFSGQDSASRNFHNHMYNPFDDVFSEGFN------FPFEE 121
+YD FG + G G +F N +++ F+D+F G F FE+
Sbjct: 64 RYDQFGHAGVGSSAASGGGPGYGGGGGDF-NDIFSAFNDMFGGGARRGGGSPFGFED 119
>gnl|CDD|99751 cd06257, DnaJ, DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock
protein 40) proteins are highly conserved and play
crucial roles in protein translation, folding,
unfolding, translocation, and degradation. They act
primarily by stimulating the ATPase activity of Hsp70s,
an important chaperonine family. Hsp40 proteins are
characterized by the presence of a J domain, which
mediates the interaction with Hsp70. They may contain
other domains as well, and the architectures provide a
means of classification.
Length = 55
Score = 83.0 bits (206), Expect = 6e-20
Identities = 30/55 (54%), Positives = 42/55 (76%), Gaps = 1/55 (1%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNND-PTAQEKFLQLTEAYNILSD 73
D Y+ LGVP AS +EI+K Y++L +++HPDKN D P A+EKF ++ EAY +LSD
Sbjct: 1 DYYDILGVPPDASDEEIKKAYRKLALKYHPDKNPDDPEAEEKFKEINEAYEVLSD 55
>gnl|CDD|184599 PRK14277, PRK14277, chaperone protein DnaJ; Provisional.
Length = 386
Score = 90.6 bits (225), Expect = 1e-19
Identities = 42/110 (38%), Positives = 58/110 (52%), Gaps = 11/110 (10%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
D YE LGV R A+ +EI+K Y+RL ++HPD N D A++KF ++ EAY ILSD ++R
Sbjct: 6 DYYEILGVDRNATEEEIKKAYRRLAKKYHPDLNPGDKEAEQKFKEINEAYEILSDPQKRA 65
Query: 79 QYDLFG---------TTDGFSGQDSASRNFHNHMYNPFDDVFSEGFNFPF 119
QYD FG GF + + F D+F + F F
Sbjct: 66 QYDQFGHAAFDPGGFGQGGFGQGG-FGGGGFDFDFGGFGDIFEDIFGDFF 114
>gnl|CDD|237665 PRK14295, PRK14295, chaperone protein DnaJ; Provisional.
Length = 389
Score = 89.5 bits (222), Expect = 3e-19
Identities = 34/70 (48%), Positives = 51/70 (72%), Gaps = 5/70 (7%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
D Y+ LGVP+ A+ EI+K Y++L E+HPD N D A+E+F +++EAY++LSD ++RK
Sbjct: 10 DYYKVLGVPKDATEAEIKKAYRKLAREYHPDANKGDAKAEERFKEISEAYDVLSDEKKRK 69
Query: 79 QYD----LFG 84
+YD LFG
Sbjct: 70 EYDEARSLFG 79
>gnl|CDD|184611 PRK14297, PRK14297, chaperone protein DnaJ; Provisional.
Length = 380
Score = 89.1 bits (221), Expect = 3e-19
Identities = 41/100 (41%), Positives = 59/100 (59%), Gaps = 4/100 (4%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
D YE LG+ + AS EI+K +++L +++HPDKN + A+EKF ++ EAY +LSD +++
Sbjct: 5 DYYEVLGLEKGASDDEIKKAFRKLAIKYHPDKNKGNKEAEEKFKEINEAYQVLSDPQKKA 64
Query: 79 QYDLFGTTD--GFSGQDSAS-RNFHNHMYNPFDDVFSEGF 115
QYD FGT D G G S F F D+F F
Sbjct: 65 QYDQFGTADFNGAGGFGSGGFGGFDFSDMGGFGDIFDSFF 104
>gnl|CDD|237666 PRK14296, PRK14296, chaperone protein DnaJ; Provisional.
Length = 372
Score = 88.9 bits (220), Expect = 5e-19
Identities = 44/105 (41%), Positives = 63/105 (60%), Gaps = 9/105 (8%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
D YE LGV +TAS QEIR+ Y++L ++HPD N P A +K +++ EA ++L D ++RKQ
Sbjct: 5 DYYEVLGVSKTASEQEIRQAYRKLAKQYHPDLNKSPDAHDKMVEINEAADVLLDKDKRKQ 64
Query: 80 YDLFGTT--DGFSGQDSASRNFHNHMYN-------PFDDVFSEGF 115
YD FG DG SG S +F + N F ++FS+ F
Sbjct: 65 YDQFGHAAFDGSSGFSSNFGDFEDLFSNMGSSGFSSFTNIFSDFF 109
>gnl|CDD|237658 PRK14284, PRK14284, chaperone protein DnaJ; Provisional.
Length = 391
Score = 87.2 bits (216), Expect = 2e-18
Identities = 35/75 (46%), Positives = 55/75 (73%), Gaps = 1/75 (1%)
Query: 19 LDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERR 77
+D Y LGV +TAS +EI+K Y++L V++HPDKN D A+++F +++EAY +LSDA++R
Sbjct: 1 MDYYTILGVSKTASPEEIKKAYRKLAVKYHPDKNPGDAEAEKRFKEVSEAYEVLSDAQKR 60
Query: 78 KQYDLFGTTDGFSGQ 92
+ YD +G F+G
Sbjct: 61 ESYDRYGKDGPFAGA 75
>gnl|CDD|172778 PRK14290, PRK14290, chaperone protein DnaJ; Provisional.
Length = 365
Score = 86.9 bits (215), Expect = 2e-18
Identities = 42/103 (40%), Positives = 66/103 (64%), Gaps = 7/103 (6%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN--NDPTAQEKFLQLTEAYNILSDAERR 77
D Y+ LGV R AS ++I+K ++ L +WHPD + N A+EKF +++EAY +LSD ++R
Sbjct: 4 DYYKILGVDRNASQEDIKKAFRELAKKWHPDLHPGNKAEAEEKFKEISEAYEVLSDPQKR 63
Query: 78 KQYDLFGTTD-GFSGQDSASRNFHNHMYNPFDDVFSEGF--NF 117
+QYD GT D G G + NF + ++ +D+F++ F NF
Sbjct: 64 RQYDQTGTVDFGAGGSNFNWDNFTH--FSDINDIFNQIFGGNF 104
>gnl|CDD|237659 PRK14287, PRK14287, chaperone protein DnaJ; Provisional.
Length = 371
Score = 84.7 bits (209), Expect = 1e-17
Identities = 40/99 (40%), Positives = 58/99 (58%), Gaps = 10/99 (10%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
D YE LGV R AS+ E++K Y++L ++HPD N P A++KF ++ EAY+ LSD +++
Sbjct: 5 DYYEVLGVDRNASVDEVKKAYRKLARKYHPDVNKAPDAEDKFKEVKEAYDTLSDPQKKAH 64
Query: 80 YDLFGTTD---GFSGQDSASRNFHNHMYNPFDDVFSEGF 115
YD FG TD GF G + + F D+F F
Sbjct: 65 YDQFGHTDPNQGFGGGGAGD-------FGGFSDIFDMFF 96
>gnl|CDD|172774 PRK14286, PRK14286, chaperone protein DnaJ; Provisional.
Length = 372
Score = 83.9 bits (207), Expect = 2e-17
Identities = 35/98 (35%), Positives = 59/98 (60%), Gaps = 4/98 (4%)
Query: 22 YETLGVPRTASLQEIRKNYKRLVVEWHPDKNN-DPTAQEKFLQLTEAYNILSDAERRKQY 80
Y+ LGV ++A+ +EI+ Y++L +++HPDKN + ++EKF + TEAY IL D ++R+ Y
Sbjct: 7 YDILGVSKSANDEEIKSAYRKLAIKYHPDKNKGNKESEEKFKEATEAYEILRDPKKRQAY 66
Query: 81 DLF---GTTDGFSGQDSASRNFHNHMYNPFDDVFSEGF 115
D F G G G + + ++ F D+F + F
Sbjct: 67 DQFGKAGVNAGAGGFGQGAYTDFSDIFGDFGDIFGDFF 104
>gnl|CDD|237660 PRK14289, PRK14289, chaperone protein DnaJ; Provisional.
Length = 386
Score = 83.7 bits (207), Expect = 3e-17
Identities = 39/94 (41%), Positives = 59/94 (62%), Gaps = 4/94 (4%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
D YE LGV +TA++ EI+K Y++ +++HPDKN D A+EKF + EAY++LSD ++R
Sbjct: 6 DYYEVLGVSKTATVDEIKKAYRKKAIQYHPDKNPGDKEAEEKFKEAAEAYDVLSDPDKRS 65
Query: 79 QYDLFGTTDGFSGQDSASRNFHNHMYNPFDDVFS 112
+YD FG G G + F + +D+FS
Sbjct: 66 RYDQFGHA-GVGGA-AGGGGFSGEGMS-MEDIFS 96
>gnl|CDD|237654 PRK14278, PRK14278, chaperone protein DnaJ; Provisional.
Length = 378
Score = 83.2 bits (206), Expect = 3e-17
Identities = 33/65 (50%), Positives = 43/65 (66%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
D Y LGV R AS EI++ Y++L E HPD N D AQEKF +++ AY +LSD E+R+
Sbjct: 4 DYYGLLGVSRNASDAEIKRAYRKLARELHPDVNPDEEAQEKFKEISVAYEVLSDPEKRRI 63
Query: 80 YDLFG 84
DL G
Sbjct: 64 VDLGG 68
>gnl|CDD|237663 PRK14293, PRK14293, chaperone protein DnaJ; Provisional.
Length = 374
Score = 82.3 bits (204), Expect = 5e-17
Identities = 33/72 (45%), Positives = 47/72 (65%), Gaps = 1/72 (1%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
D YE LGV R A E+++ Y+RL ++HPD N +P A+++F ++ AY +LSD E R +
Sbjct: 4 DYYEILGVSRDADKDELKRAYRRLARKYHPDVNKEPGAEDRFKEINRAYEVLSDPETRAR 63
Query: 80 YDLFGTTDGFSG 91
YD FG G SG
Sbjct: 64 YDQFGEA-GVSG 74
>gnl|CDD|225124 COG2214, CbpA, DnaJ-class molecular chaperone [Posttranslational
modification, protein turnover, chaperones].
Length = 237
Score = 79.9 bits (196), Expect = 7e-17
Identities = 43/117 (36%), Positives = 61/117 (52%), Gaps = 3/117 (2%)
Query: 18 SLDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDP--TAQEKFLQLTEAYNILSDAE 75
LD YE LGVP ASL+EI+K Y++L +++HPD+N A+EKF ++ EAY ILSD E
Sbjct: 5 LLDYYEILGVPPNASLEEIKKAYRKLALKYHPDRNPGDPKVAEEKFKEINEAYEILSDPE 64
Query: 76 RRKQYDLFGTTDGFSGQDS-ASRNFHNHMYNPFDDVFSEGFNFPFEEHDISLFHKLS 131
RR +YD G + F G+ S F ++ E + + L
Sbjct: 65 RRAEYDKIGADNYFPGKSYRESAAFPIEGSLQLTGLYEEWRGKRNKTERKASLADLL 121
>gnl|CDD|172773 PRK14285, PRK14285, chaperone protein DnaJ; Provisional.
Length = 365
Score = 78.9 bits (194), Expect = 8e-16
Identities = 39/102 (38%), Positives = 57/102 (55%), Gaps = 7/102 (6%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
D YE LG+ + AS EI+K Y+++ +++HPDKN + A+ F + TEAY +L D +R
Sbjct: 4 DYYEILGLSKGASKDEIKKAYRKIAIKYHPDKNKGNKEAESIFKEATEAYEVLIDDNKRA 63
Query: 79 QYDLFGTT-----DGFSGQDSASRNFHNHMYNPFDDVFSEGF 115
QYD FG T GF G F + ++ F D+F F
Sbjct: 64 QYDRFGHTAFEGGGGFEGFSGGFSGFSD-IFEDFGDIFDSFF 104
>gnl|CDD|237655 PRK14279, PRK14279, chaperone protein DnaJ; Provisional.
Length = 392
Score = 78.6 bits (194), Expect = 1e-15
Identities = 31/63 (49%), Positives = 45/63 (71%), Gaps = 1/63 (1%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
D Y+ LGV AS +EI+K Y++L E HPD N DP A+E+F ++EA+++LSD +RK
Sbjct: 10 DFYKELGVSSDASAEEIKKAYRKLARELHPDANPGDPAAEERFKAVSEAHDVLSDPAKRK 69
Query: 79 QYD 81
+YD
Sbjct: 70 EYD 72
>gnl|CDD|172788 PRK14300, PRK14300, chaperone protein DnaJ; Provisional.
Length = 372
Score = 75.8 bits (186), Expect = 1e-14
Identities = 37/102 (36%), Positives = 58/102 (56%), Gaps = 6/102 (5%)
Query: 17 VSLDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAER 76
+S D Y+ LGV +TAS +++K Y +L ++HPD + A++KF ++ AY++L D ++
Sbjct: 1 MSQDYYQILGVSKTASQADLKKAYLKLAKQYHPDTTDAKDAEKKFKEINAAYDVLKDEQK 60
Query: 77 RKQYDLFGTTDGFSGQDS-----ASRNFHNHMYNPFDDVFSE 113
R YD FG D F Q S FH + + F D FS+
Sbjct: 61 RAAYDRFG-HDAFQNQQSRGGGGNHGGFHPDINDIFGDFFSD 101
>gnl|CDD|240236 PTZ00037, PTZ00037, DnaJ_C chaperone protein; Provisional.
Length = 421
Score = 75.2 bits (185), Expect = 2e-14
Identities = 30/70 (42%), Positives = 45/70 (64%), Gaps = 4/70 (5%)
Query: 22 YETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQYD 81
YE L + + + EI+K Y++L ++ HPDK DP EKF +++ AY +LSD E+RK YD
Sbjct: 31 YEVLNLSKDCTTSEIKKAYRKLAIKHHPDKGGDP---EKFKEISRAYEVLSDPEKRKIYD 87
Query: 82 LFGTTDGFSG 91
+G +G G
Sbjct: 88 EYG-EEGLEG 96
>gnl|CDD|182347 PRK10266, PRK10266, curved DNA-binding protein CbpA; Provisional.
Length = 306
Score = 72.2 bits (177), Expect = 8e-14
Identities = 32/99 (32%), Positives = 52/99 (52%), Gaps = 4/99 (4%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
D Y +GV T L+ I+ Y+RL ++HPD + +P A+ +F ++ EA+ +LSD +RR +
Sbjct: 5 DYYAIMGVKPTDDLKTIKTAYRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAE 64
Query: 80 YD---LFGTTDGFSGQDSASRNFHNHMYNPFDDVFSEGF 115
YD F + + + FDD+FS F
Sbjct: 65 YDQLWQHRNDPQF-NRQFQHGDGQSFNAEDFDDIFSSIF 102
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 72.5 bits (177), Expect = 2e-13
Identities = 33/88 (37%), Positives = 53/88 (60%), Gaps = 3/88 (3%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
D YE LG+ R A QEI+K +++L ++HPD+N P A F ++ EA ++LS+ ++R
Sbjct: 3 DYYEVLGIDRDADEQEIKKAFRKLAKKYHPDRNKAPDAASIFAEINEANDVLSNPKKRAN 62
Query: 80 YDLFGTTDGFSGQDSASRNFHNHMYNPF 107
YD +G DG +D +F ++N F
Sbjct: 63 YDKYG-HDGVDREDDF--DFQADVFNSF 87
>gnl|CDD|172776 PRK14288, PRK14288, chaperone protein DnaJ; Provisional.
Length = 369
Score = 65.1 bits (158), Expect = 2e-11
Identities = 35/102 (34%), Positives = 60/102 (58%), Gaps = 3/102 (2%)
Query: 17 VSLDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAE 75
+ L YE L V + ++ + I+K+Y++L +++HPD+N D A+EKF + EAY +LSD +
Sbjct: 1 MELSYYEILEVEKHSNQETIKKSYRKLALKYHPDRNAGDKEAEEKFKLINEAYGVLSDEK 60
Query: 76 RRKQYDLFGTTDGFSGQDSASRNFHNHMYNPFDDVFSEGFNF 117
+R YD +G G + Q AS++ + + F + F F
Sbjct: 61 KRALYDRYG-KKGLN-QAGASQSDFSDFFEDLGSFFEDAFGF 100
>gnl|CDD|227694 COG5407, SEC63, Preprotein translocase subunit Sec63 [Intracellular
trafficking and secretion].
Length = 610
Score = 65.0 bits (158), Expect = 5e-11
Identities = 28/75 (37%), Positives = 46/75 (61%), Gaps = 6/75 (8%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDK--NNDPTA----QEKFLQLTEAYNILSD 73
DPYE LG+ + S ++I+K Y+ L +++HPDK +EK+ +T+AY +L+D
Sbjct: 99 DPYEILGIDQDTSERDIKKRYRMLSMKFHPDKAPPMVNELRSEYEEKYKTITKAYGLLTD 158
Query: 74 AERRKQYDLFGTTDG 88
+RR+ Y +GT D
Sbjct: 159 KKRRENYLNYGTPDS 173
>gnl|CDD|225660 COG3118, COG3118, Thioredoxin domain-containing protein
[Posttranslational modification, protein turnover,
chaperones].
Length = 304
Score = 51.2 bits (123), Expect = 6e-07
Identities = 26/83 (31%), Positives = 38/83 (45%), Gaps = 2/83 (2%)
Query: 132 TTHWNFEKNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNE 191
T NFE+ I S P L+ F++ WC C Q+ P +KL E V+ E
Sbjct: 28 VTEANFEQEVIQSSREVPVLVDFWAPWCGPCKQLTPTLEKLAAEYKG-KFKLAKVNCDAE 86
Query: 192 QGLARRLGVGSQLPQIALLTDGR 214
+A + GV S +P + DG+
Sbjct: 87 PMVAAQFGVQS-IPTVYAFKDGQ 108
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 50.9 bits (121), Expect = 2e-06
Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 22 YETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQYD 81
Y+ LGV A ++EI + Y +L ++P K + KF ++ EAY IL D +++K Y+
Sbjct: 576 YDILGVGVNADMKEISERYFKLAENYYPPKRSGNEGFHKFKKINEAYQILGDIDKKKMYN 635
Query: 82 LFGTTDGFSG 91
FG DG G
Sbjct: 636 KFG-YDGIKG 644
>gnl|CDD|239259 cd02961, PDI_a_family, Protein Disulfide Isomerase (PDIa) family,
redox active TRX domains; composed of eukaryotic
proteins involved in oxidative protein folding in the
endoplasmic reticulum (ER) by acting as catalysts and
folding assistants. Members of this family include PDI
and PDI-related proteins like ERp72, ERp57 (or ERp60),
ERp44, P5, PDIR, ERp46 and the transmembrane PDIs. PDI,
ERp57, ERp72, P5, PDIR and ERp46 are all oxidases,
catalyzing the formation of disulfide bonds of newly
synthesized polypeptides in the ER. They also exhibit
reductase activity in acting as isomerases to correct
any non-native disulfide bonds, as well as chaperone
activity to prevent protein aggregation and facilitate
the folding of newly synthesized proteins. These
proteins usually contain multiple copies of a redox
active TRX (a) domain containing a CXXC motif, and may
also contain one or more redox inactive TRX-like (b)
domains. Only one a domain is required for the oxidase
function but multiple copies are necessary for the
isomerase function. The different types of PDIs may show
different substrate specificities and tissue-specific
expression, or may be induced by stress. PDIs are in
their reduced form at steady state and are oxidized to
the active form by Ero1, which is localized in the ER
through ERp44. Some members of this family also contain
a DnaJ domain in addition to the redox active a domains;
examples are ERdj5 and Pfj2. Also included in the family
is the redox inactive N-terminal TRX-like domain of
ERp29.
Length = 101
Score = 43.8 bits (104), Expect = 1e-05
Identities = 28/98 (28%), Positives = 41/98 (41%), Gaps = 6/98 (6%)
Query: 136 NFEKNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSP-LGVGFFTVHVHNEQGL 194
NF++ + S L+ FY+ WC C + P ++KL EL V V L
Sbjct: 7 NFDE-LVKDS--KDVLVEFYAPWCGHCKALAPEYEKLAKELKGDGKVVVAKVDCTANNDL 63
Query: 195 ARRLGVGSQLPQIALLTDGRTSFFK-EPSFSVQKMVEF 231
GV P I L +G K E +++ +VEF
Sbjct: 64 CSEYGV-RGYPTIKLFPNGSKEPVKYEGPRTLESLVEF 100
>gnl|CDD|239245 cd02947, TRX_family, TRX family; composed of two groups: Group I,
which includes proteins that exclusively encode a TRX
domain; and Group II, which are composed of fusion
proteins of TRX and additional domains. Group I TRX is a
small ancient protein that alter the redox state of
target proteins via the reversible oxidation of an
active site dithiol, present in a CXXC motif, partially
exposed at the protein's surface. TRX reduces protein
disulfide bonds, resulting in a disulfide bond at its
active site. Oxidized TRX is converted to the active
form by TRX reductase, using reducing equivalents
derived from either NADPH or ferredoxins. By altering
their redox state, TRX regulates the functions of at
least 30 target proteins, some of which are enzymes and
transcription factors. It also plays an important role
in the defense against oxidative stress by directly
reducing hydrogen peroxide and certain radicals, and by
serving as a reductant for peroxiredoxins. At least two
major types of functional TRXs have been reported in
most organisms; in eukaryotes, they are located in the
cytoplasm and the mitochondria. Higher plants contain
more types (at least 20 TRX genes have been detected in
the genome of Arabidopsis thaliana), two of which (types
f amd m) are located in the same compartment, the
chloroplast. Also included in the alignment are TRX-like
domains which show sequence homology to TRX but do not
contain the redox active CXXC motif. Group II proteins,
in addition to either a redox active TRX or a TRX-like
domain, also contain additional domains, which may or
may not possess homology to known proteins.
Length = 93
Score = 43.3 bits (103), Expect = 2e-05
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 3/66 (4%)
Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQIA 208
P ++ F++ WC C + P+ ++L +E V F V V LA GV +P
Sbjct: 12 PVVVDFWAPWCGPCKAIAPVLEELAEEYP--KVKFVKVDVDENPELAEEYGV-RSIPTFL 68
Query: 209 LLTDGR 214
+G+
Sbjct: 69 FFKNGK 74
>gnl|CDD|239254 cd02956, ybbN, ybbN protein family; ybbN is a hypothetical protein
containing a redox-inactive TRX-like domain. Its gene
has been sequenced from several gammaproteobacteria and
actinobacteria.
Length = 96
Score = 41.9 bits (99), Expect = 5e-05
Identities = 16/79 (20%), Positives = 37/79 (46%), Gaps = 3/79 (3%)
Query: 136 NFEKNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLA 195
NF+ + +S P ++ F++ ++ P+ ++L +E V+ + +A
Sbjct: 2 NFQ-QVLQESTQVPVVVDFWAPRSPPSKELLPLLERLAEE-YQGQFVLAKVNCDAQPQIA 59
Query: 196 RRLGVGSQLPQIALLTDGR 214
++ GV + LP + L G+
Sbjct: 60 QQFGVQA-LPTVYLFAAGQ 77
>gnl|CDD|239292 cd02994, PDI_a_TMX, PDIa family, TMX subfamily; composed of
proteins similar to the TRX-related human transmembrane
protein, TMX. TMX is a type I integral membrane protein;
the N-terminal redox active TRX domain is present in the
endoplasmic reticulum (ER) lumen while the C-terminus is
oriented towards the cytoplasm. It is expressed in many
cell types and its active site motif (CPAC) is unique.
In vitro, TMX reduces interchain disulfides of insulin
and renatures inactive RNase containing incorrect
disulfide bonds. The C. elegans homolog, DPY-11, is
expressed only in the hypodermis and resides in the
cytoplasm. It is required for body and sensory organ
morphogeneis. Another uncharacterized TRX-related
transmembrane protein, human TMX4, is included in the
alignment. The active site sequence of TMX4 is CPSC.
Length = 101
Score = 41.2 bits (97), Expect = 1e-04
Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 1/63 (1%)
Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQIALL 210
+I FY+ WC AC Q++P +++ D LG+ V V E GL+ R V + LP I
Sbjct: 20 MIEFYAPWCPACQQLQPEWEEFADWSDDLGINVAKVDVTQEPGLSGRFFVTA-LPTIYHA 78
Query: 211 TDG 213
DG
Sbjct: 79 KDG 81
>gnl|CDD|200072 TIGR01068, thioredoxin, thioredoxin. Several proteins, such as
protein disulfide isomerase, have two or more copies of
a domain closely related to thioredoxin. This model is
designed to recognize authentic thioredoxin, a small
protein that should be hit exactly once by This model.
Any protein that hits once with a score greater than the
second (per domain) trusted cutoff may be taken as
thioredoxin [Energy metabolism, Electron transport].
Length = 101
Score = 40.7 bits (96), Expect = 2e-04
Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 2/65 (3%)
Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQIA 208
P L+ F++ WC C + PI ++L E V F ++V +A + G+ S +P +
Sbjct: 16 PVLVDFWAPWCGPCKMIAPILEELAKEYEG-KVKFVKLNVDENPDIAAKYGIRS-IPTLL 73
Query: 209 LLTDG 213
L +G
Sbjct: 74 LFKNG 78
>gnl|CDD|224002 COG1076, DjlA, DnaJ-domain-containing proteins 1 [Posttranslational
modification, protein turnover, chaperones].
Length = 174
Score = 41.2 bits (97), Expect = 3e-04
Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 5/62 (8%)
Query: 19 LDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFL-----QLTEAYNILSD 73
D + LGV A I+K Y++L+ E HPDK + +F+ +L E D
Sbjct: 113 EDALKVLGVEIKADQDAIKKAYRKLLSEQHPDKAAAKGLKLEFIEKLKEKLQEIQEAYED 172
Query: 74 AE 75
+
Sbjct: 173 IK 174
Score = 28.8 bits (65), Expect = 5.6
Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 8/69 (11%)
Query: 20 DPYETLGVPRTAS--LQEIRKNYKRLVVEWHPDKNNDPTAQEKFL------QLTEAYNIL 71
D + G+PR L ++ Y+ L +HPD+ + E+ ++ AY L
Sbjct: 2 DGFVLFGLPRAFQIDLDALKLQYRELQRAYHPDRFGKASEAEQRKALQQSAEVNPAYQTL 61
Query: 72 SDAERRKQY 80
D R +Y
Sbjct: 62 KDPLLRAEY 70
>gnl|CDD|236512 PRK09430, djlA, Dna-J like membrane chaperone protein; Provisional.
Length = 267
Score = 42.1 bits (100), Expect = 3e-04
Identities = 15/32 (46%), Positives = 22/32 (68%)
Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDK 51
D Y+ LGV + QEI++ Y++L+ E HPDK
Sbjct: 201 DAYKVLGVSESDDDQEIKRAYRKLMSEHHPDK 232
>gnl|CDD|215704 pfam00085, Thioredoxin, Thioredoxin. Thioredoxins are small
enzymes that participate in redox reactions, via the
reversible oxidation of an active centre disulfide bond.
Some members with only the active site are not separated
from the noise.
Length = 104
Score = 38.7 bits (91), Expect = 8e-04
Identities = 25/78 (32%), Positives = 34/78 (43%), Gaps = 4/78 (5%)
Query: 136 NFEKNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLA 195
NF++ + KS P L+ FY+ WC C + P ++KL E V F V LA
Sbjct: 9 NFDE-EVAKS-DKPVLVDFYAPWCGPCKALAPEYEKLAQEYKD-DVKFAKVDADENPDLA 65
Query: 196 RRLGVGSQLPQIALLTDG 213
GV P I +G
Sbjct: 66 SEYGVRG-FPTIKFFKNG 82
>gnl|CDD|185622 PTZ00443, PTZ00443, Thioredoxin domain-containing protein;
Provisional.
Length = 224
Score = 40.0 bits (93), Expect = 0.002
Identities = 34/148 (22%), Positives = 59/148 (39%), Gaps = 10/148 (6%)
Query: 128 HKLSTTHWNFEK--NYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFT 185
+ NFEK + T P + FY+ WC C ++ P +++L L V
Sbjct: 31 ALVLLNDKNFEKLTQASTGATTGPWFVKFYAPWCSHCRKMAPAWERLAKALKGQ-VNVAD 89
Query: 186 VHVHNEQGLARRLGVGSQLPQIALLTDGRTSFFKEPSFSVQKMVEF----FRLKLPYKLI 241
+ LA+R + P + L G+ ++ S +K+ F F+ L +
Sbjct: 90 LDATRALNLAKRFAIKG-YPTLLLFDKGKMYQYEGGDRSTEKLAAFALGDFKKALGAPVP 148
Query: 242 VPLS--ATNVDAFLDNWREDNKVHALLF 267
PLS A +D F+ E +++ F
Sbjct: 149 APLSFFALTIDFFVSGTNEALRIYDAAF 176
>gnl|CDD|167217 PRK01356, hscB, co-chaperone HscB; Provisional.
Length = 166
Score = 39.1 bits (91), Expect = 0.002
Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 6/67 (8%)
Query: 22 YETLGVPRTAS--LQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQ----LTEAYNILSDAE 75
++ LG+P+ + L+ + K Y + V++HPDK +E+ L L AY+ L DA
Sbjct: 5 FQLLGLPQEYNIDLKILEKQYFAMQVKYHPDKAKTLQEKEQNLIIASELNNAYSTLKDAL 64
Query: 76 RRKQYDL 82
+R +Y L
Sbjct: 65 KRAEYML 71
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 39.6 bits (92), Expect = 0.003
Identities = 21/69 (30%), Positives = 32/69 (46%), Gaps = 8/69 (11%)
Query: 16 AVSLDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTA---QEKFLQLTEAYNILS 72
+ L Y T +P +I K +K+ V ++HPDK E F + +A +L
Sbjct: 48 LLGLSKYRTKAIP-----PQILKAHKKKVYKYHPDKTAAGGNKGCDEFFKLIQKAREVLG 102
Query: 73 DAERRKQYD 81
D + R QYD
Sbjct: 103 DRKLRLQYD 111
>gnl|CDD|222986 PHA03102, PHA03102, Small T antigen; Reviewed.
Length = 153
Score = 37.3 bits (87), Expect = 0.006
Identities = 29/146 (19%), Positives = 53/146 (36%), Gaps = 15/146 (10%)
Query: 23 ETLGVPRTA--SLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQY 80
+ LG+PR+A +L +RK Y R +E+HPDK D EK +L Y ++ + +
Sbjct: 9 DLLGLPRSAWGNLPLMRKAYLRKCLEFHPDKGGDE---EKMKELNTLYKKFRESVKSLRD 65
Query: 81 DLFGTTDGFSGQDSASRNFHNHMYNPFDDVFSEGF---NFPFEEHDISLFHKLSTTHWNF 137
+D S + + ++ + + +F L H
Sbjct: 66 LDGEEDSSSEEEDVPSGYVGATFGDRVNALYCKDWDTCLKNLSSLCKCIFCLLRKQHRL- 124
Query: 138 EKNYIPKSYTTPHLILFYSDWCFACL 163
K + L+ +C+ C
Sbjct: 125 -KQKLRPEP----LVWGEC-YCYKCF 144
>gnl|CDD|223600 COG0526, TrxA, Thiol-disulfide isomerase and thioredoxins
[Posttranslational modification, protein turnover,
chaperones / Energy production and conversion].
Length = 127
Score = 36.8 bits (84), Expect = 0.007
Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 3/68 (4%)
Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNE-QGLARRLGV-GSQLPQ 206
P L+ F++ WC C P+ ++L +E V V+V +E LA GV +P
Sbjct: 34 PVLVDFWAPWCPPCRAEAPLLEELAEEY-GGDVEVVAVNVDDENPDLAAEFGVAVRSIPT 92
Query: 207 IALLTDGR 214
+ L DG+
Sbjct: 93 LLLFKDGK 100
>gnl|CDD|238829 cd01659, TRX_superfamily, Thioredoxin (TRX) superfamily; a large,
diverse group of proteins containing a TRX-fold. Many
members contain a classic TRX domain with a redox active
CXXC motif. They function as protein disulfide
oxidoreductases (PDOs), altering the redox state of
target proteins via the reversible oxidation of their
active site dithiol. The PDO members of this superfamily
include TRX, protein disulfide isomerase (PDI),
tlpA-like, glutaredoxin, NrdH redoxin, and the bacterial
Dsb (DsbA, DsbC, DsbG, DsbE, DsbDgamma) protein
families. Members of the superfamily that do not
function as PDOs but contain a TRX-fold domain include
phosducins, peroxiredoxins and glutathione (GSH)
peroxidases, SCO proteins, GSH transferases (GST,
N-terminal domain), arsenic reductases, TRX-like
ferredoxins and calsequestrin, among others.
Length = 69
Score = 34.2 bits (78), Expect = 0.014
Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 6/66 (9%)
Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHV---HNEQGLARRLGVGSQLPQI 207
L+LFY+ WC C + P+ +L GV F V V + +R GVG +P +
Sbjct: 1 LVLFYAPWCPFCQALRPVLAELALLN--KGVKFEAVDVDEDPALEKELKRYGVG-GVPTL 57
Query: 208 ALLTDG 213
+ G
Sbjct: 58 VVFGPG 63
>gnl|CDD|173347 PTZ00051, PTZ00051, thioredoxin; Provisional.
Length = 98
Score = 34.1 bits (78), Expect = 0.028
Identities = 14/64 (21%), Positives = 31/64 (48%), Gaps = 3/64 (4%)
Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQIALL 210
++ FY++WC C ++ P +++ E + + F V V +A + + S +P +
Sbjct: 22 IVDFYAEWCGPCKRIAPFYEECSKEYTKM--VFVKVDVDELSEVAEKENITS-MPTFKVF 78
Query: 211 TDGR 214
+G
Sbjct: 79 KNGS 82
>gnl|CDD|181812 PRK09381, trxA, thioredoxin; Provisional.
Length = 109
Score = 34.3 bits (78), Expect = 0.038
Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 8/89 (8%)
Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTV---HVHNEQGLARRLGVGSQLPQI 207
L+ F+++WC C + PI ++ DE G TV ++ G A + G+ +P +
Sbjct: 25 LVDFWAEWCGPCKMIAPILDEIADEYQ----GKLTVAKLNIDQNPGTAPKYGI-RGIPTL 79
Query: 208 ALLTDGRTSFFKEPSFSVQKMVEFFRLKL 236
L +G + K + S ++ EF L
Sbjct: 80 LLFKNGEVAATKVGALSKGQLKEFLDANL 108
>gnl|CDD|222912 PHA02624, PHA02624, large T antigen; Provisional.
Length = 647
Score = 36.1 bits (84), Expect = 0.044
Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 6/68 (8%)
Query: 23 ETLGVPRTA--SLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQY 80
+ LG+P A +L +RK Y R E+HPDK D EK +L Y L + +
Sbjct: 15 DLLGLPMAAWGNLPLMRKAYLRKCKEYHPDKGGDE---EKMKRLNSLYKKLQE-GVKSAR 70
Query: 81 DLFGTTDG 88
FGT D
Sbjct: 71 QSFGTQDS 78
>gnl|CDD|239293 cd02995, PDI_a_PDI_a'_C, PDIa family, C-terminal TRX domain (a')
subfamily; composed of the C-terminal redox active a'
domains of PDI, ERp72, ERp57 (or ERp60) and EFP1. PDI,
ERp72 and ERp57 are endoplasmic reticulum (ER)-resident
eukaryotic proteins involved in oxidative protein
folding. They are oxidases, catalyzing the formation of
disulfide bonds of newly synthesized polypeptides in the
ER. They also exhibit reductase activity in acting as
isomerases to correct any non-native disulfide bonds, as
well as chaperone activity to prevent protein
aggregation and facilitate the folding of newly
synthesized proteins. PDI and ERp57 have the abb'a'
domain structure (where a and a' are redox active TRX
domains while b and b' are redox inactive TRX-like
domains). PDI also contains an acidic region (c domain)
after the a' domain that is absent in ERp57. ERp72 has
an additional a domain at the N-terminus (a"abb'a'
domain structure). ERp57 interacts with the lectin
chaperones, calnexin and calreticulin, and specifically
promotes the oxidative folding of glycoproteins, while
PDI shows a wider substrate specificity. ERp72
associates with several ER chaperones and folding
factors to form complexes in the ER that bind nascent
proteins. EFP1 is a binding partner protein of thyroid
oxidase, which is responsible for the generation of
hydrogen peroxide, a crucial substrate of
thyroperoxidase, which functions to iodinate
thyroglobulin and synthesize thyroid hormones.
Length = 104
Score = 33.3 bits (77), Expect = 0.070
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 151 LILFYSDWCFACLQVEPIFKKLMDELS 177
L+ FY+ WC C + PI+++L ++L
Sbjct: 22 LVEFYAPWCGHCKALAPIYEELAEKLK 48
>gnl|CDD|200074 TIGR01126, pdi_dom, protein disulfide-isomerase domain. This model
describes a domain of eukaryotic protein disulfide
isomerases, generally found in two copies. The high
cutoff for total score reflects the expectation of
finding both copies. The domain is similar to
thioredoxin but the redox-active disulfide region motif
is APWCGHCK [Protein fate, Protein folding and
stabilization].
Length = 102
Score = 33.0 bits (76), Expect = 0.077
Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 6/86 (6%)
Query: 151 LILFYSDWCFACLQVEPIFKKLMDELS---PLGVGFFTVHVHNEQGLARRLGVGSQLPQI 207
L+ FY+ WC C + P ++KL EL + + V E+ LA R GV S P I
Sbjct: 17 LVEFYAPWCGHCKNLAPEYEKLAKELKKDPDIVLA--KVDATAEKDLASRFGV-SGFPTI 73
Query: 208 ALLTDGRTSFFKEPSFSVQKMVEFFR 233
G+ E ++ +VEF
Sbjct: 74 KFFPKGKKPVDYEGGRDLEAIVEFVN 99
>gnl|CDD|239300 cd03002, PDI_a_MPD1_like, PDI family, MPD1-like subfamily; composed
of eukaryotic proteins similar to Saccharomyces
cerevisiae MPD1 protein, which contains a single redox
active TRX domain located at the N-terminus, and an ER
retention signal at the C-terminus indicative of an
ER-resident protein. MPD1 has been shown to suppress the
maturation defect of carboxypeptidase Y caused by
deletion of the yeast PDI1 gene. Other characterized
members of this subfamily include the Aspergillus niger
prpA protein and Giardia PDI-1. PrpA is non-essential to
strain viability, however, its transcript level is
induced by heterologous protein expression suggesting a
possible role in oxidative protein folding during high
protein production. Giardia PDI-1 has the ability to
refold scrambled RNase and exhibits transglutaminase
activity.
Length = 109
Score = 33.1 bits (76), Expect = 0.091
Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 2/44 (4%)
Query: 136 NFEKNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSPL 179
NF+K +YTT L+ FY+ WC C ++P + K EL L
Sbjct: 9 NFDKVVHNTNYTT--LVEFYAPWCGHCKNLKPEYAKAAKELDGL 50
>gnl|CDD|182889 PRK10996, PRK10996, thioredoxin 2; Provisional.
Length = 139
Score = 33.5 bits (77), Expect = 0.094
Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 2/67 (2%)
Query: 148 TPHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQI 207
P +I F++ WC C PIF+ + E S V F V+ E+ L+ R + S +P I
Sbjct: 53 LPVVIDFWAPWCGPCRNFAPIFEDVAAERSG-KVRFVKVNTEAERELSARFRIRS-IPTI 110
Query: 208 ALLTDGR 214
+ +G+
Sbjct: 111 MIFKNGQ 117
>gnl|CDD|239298 cd03000, PDI_a_TMX3, PDIa family, TMX3 subfamily; composed of
eukaryotic proteins similar to human TMX3, a TRX related
transmembrane protein containing one redox active TRX
domain at the N-terminus and a classical ER retrieval
sequence for type I transmembrane proteins at the
C-terminus. The TMX3 transcript is found in a variety of
tissues with the highest levels detected in skeletal
muscle and the heart. In vitro, TMX3 showed oxidase
activity albeit slightly lower than that of protein
disulfide isomerase.
Length = 104
Score = 32.0 bits (73), Expect = 0.21
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLG 180
L+ FY+ WC C ++EP++ ++ EL G
Sbjct: 19 LVDFYAPWCGHCKKLEPVWNEVGAELKSSG 48
>gnl|CDD|240266 PTZ00102, PTZ00102, disulphide isomerase; Provisional.
Length = 477
Score = 33.6 bits (77), Expect = 0.25
Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 2/52 (3%)
Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLG--VGFFTVHVHNEQGLARRLGV 200
L+ FY+ WC C ++ P +KK L + +V E LA+ GV
Sbjct: 53 LVKFYAPWCGHCKRLAPEYKKAAKMLKEKKSEIVLASVDATEEMELAQEFGV 104
>gnl|CDD|166894 PRK00294, hscB, co-chaperone HscB; Provisional.
Length = 173
Score = 32.5 bits (74), Expect = 0.30
Identities = 19/56 (33%), Positives = 25/56 (44%), Gaps = 6/56 (10%)
Query: 33 LQEIRKNYKRLVVEWHPDKNNDPT------AQEKFLQLTEAYNILSDAERRKQYDL 82
L ++ Y+ L E HPD+ D A E+ L EAY L RR +Y L
Sbjct: 20 LDQLATRYRELAREVHPDRFADAPEREQRLALERSASLNEAYQTLKSPPRRARYLL 75
>gnl|CDD|239290 cd02992, PDI_a_QSOX, PDIa family, Quiescin-sulfhydryl oxidase
(QSOX) subfamily; QSOX is a eukaryotic protein
containing an N-terminal redox active TRX domain,
similar to that of PDI, and a small C-terminal flavin
adenine dinucleotide (FAD)-binding domain homologous to
the yeast ERV1p protein. QSOX oxidizes thiol groups to
disulfides like PDI, however, unlike PDI, this oxidation
is accompanied by the reduction of oxygen to hydrogen
peroxide. QSOX is localized in high concentrations in
cells with heavy secretory load and prefers peptides and
proteins as substrates, not monothiols like glutathione.
Inside the cell, QSOX is found in the endoplasmic
reticulum and Golgi. The flow of reducing equivalents in
a QSOX-catalyzed reaction goes from the dithiol
substrate -> dithiol of the QSOX TRX domain -> dithiols
of the QSOX ERV1p domain -> FAD -> oxygen.
Length = 114
Score = 31.5 bits (72), Expect = 0.35
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 150 HLILFYSDWCFACLQVEPIFKKL 172
L+ FY+ WC C P +KKL
Sbjct: 22 WLVEFYASWCGHCRAFAPTWKKL 44
>gnl|CDD|233282 TIGR01130, ER_PDI_fam, protein disulfide isomerase, eukaryotic.
This model represents eukaryotic protein disulfide
isomerases retained in the endoplasmic reticulum (ER)
and closely related forms. Some members have been
assigned alternative or additional functions such as
prolyl 4-hydroxylase and
dolichyl-diphosphooligosaccharide-protein
glycotransferase. Members of this family have at least
two protein-disulfide domains, each similar to
thioredoxin but with the redox-active disulfide in the
motif PWCGHCK, and an ER retention signal at the extreme
C-terminus (KDEL, HDEL, and similar motifs).
Length = 462
Score = 33.1 bits (76), Expect = 0.39
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 151 LILFYSDWCFACLQVEPIFKKLMDELS 177
L+ FY+ WC C + PI+++L ++
Sbjct: 368 LVEFYAPWCGHCKNLAPIYEELAEKYK 394
>gnl|CDD|211601 TIGR00714, hscB, Fe-S protein assembly co-chaperone HscB. This
model describes the small subunit, Hsc20 (20K heat
shock cognate protein) of a pair of proteins
Hsc66-Hsc20, related to the DnaK-DnaJ heat shock
proteins, which also serve as molecular chaperones.
Hsc20, unlike DnaJ, appears not to have chaperone
activity on its own, but to act solely as a regulatory
subunit for Hsc66 (i.e., to be a co-chaperone). The
gene for Hsc20 in E. coli, hscB, is not induced by heat
shock [Protein fate, Protein folding and
stabilization].
Length = 155
Score = 31.8 bits (72), Expect = 0.46
Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 8/55 (14%)
Query: 34 QEIRKNYKRLVVEWHPDKNNDPTAQEKFL------QLTEAYNILSDAERRKQYDL 82
+RK Y++L ++HPD + AQE+ L +AY+ L D RR +Y L
Sbjct: 6 SRLRKRYRQLQAQYHPDASG--MAQEQLAASQQSTTLNQAYHTLKDPLRRAEYML 58
>gnl|CDD|239291 cd02993, PDI_a_APS_reductase, PDIa family, 5'-Adenylylsulfate (APS)
reductase subfamily; composed of plant-type APS
reductases containing a C-terminal redox active TRX
domain and an N-terminal reductase domain which is part
of a superfamily that includes N type ATP PPases. APS
reductase catalyzes the reduction of activated sulfate
to sulfite, a key step in the biosynthesis of
sulfur-containing metabolites. Sulfate is first
activated by ATP sulfurylase, forming APS, which can be
phosphorylated to 3'-phosphoadenosine-5'-phosphosulfate
(PAPS). Depending on the organism, either APS or PAPS
can be used for sulfate reduction. Prokaryotes and fungi
use PAPS, whereas plants use both APS and PAPS. Since
plant-type APS reductase uses glutathione (GSH) as its
electron donor, the C-terminal domain may function like
glutaredoxin, a GSH-dependent member of the TRX
superfamily. The flow of reducing equivalents goes from
GSH -> C-terminal TRX domain -> N-terminal reductase
domain -> APS. Plant-type APS reductase shows no
homology to that of dissimilatory sulfate-reducing
bacteria, which is an iron-sulfur flavoenzyme. Also
included in the alignment is EYE2 from Chlamydomonas
reinhardtii, a protein required for eyespot assembly.
Length = 109
Score = 30.5 bits (69), Expect = 0.61
Identities = 17/75 (22%), Positives = 29/75 (38%), Gaps = 1/75 (1%)
Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQG-LARRLGVGSQLPQI 207
L++ Y+ WC C +E +++L ++L+ V + EQ A+ P I
Sbjct: 23 STLVVLYAPWCPFCQAMEASYEELAEKLAGSNVKVAKFNADGEQREFAKEELQLKSFPTI 82
Query: 208 ALLTDGRTSFFKEPS 222
K PS
Sbjct: 83 LFFPKNSRQPIKYPS 97
>gnl|CDD|239250 cd02952, TRP14_like, Human TRX-related protein 14 (TRP14)-like
family; composed of proteins similar to TRP14, a 14kD
cytosolic protein that shows disulfide reductase
activity in vitro with a different substrate specificity
compared with another human cytosolic protein, TRX1.
TRP14 catalyzes the reduction of small
disulfide-containing peptides but does not reduce
disulfides of ribonucleotide reductase, peroxiredoxin
and methionine sulfoxide reductase, which are TRX1
substrates. TRP14 also plays a role in tumor necrosis
factor (TNF)-alpha signaling pathways, distinct from
that of TRX1. Its depletion promoted TNF-alpha induced
activation of c-Jun N-terminal kinase and
mitogen-activated protein kinases.
Length = 119
Score = 30.8 bits (70), Expect = 0.66
Identities = 13/48 (27%), Positives = 20/48 (41%), Gaps = 8/48 (16%)
Query: 152 ILFYSD-------WCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQ 192
ILFY D WC C++ EP+ ++ + P F V +
Sbjct: 26 ILFYGDKDPDGQSWCPDCVKAEPVVREALKAA-PEDCVFIYCDVGDRP 72
>gnl|CDD|240265 PTZ00100, PTZ00100, DnaJ chaperone protein; Provisional.
Length = 116
Score = 29.8 bits (67), Expect = 1.5
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 22 YETLGVPRTASLQEIRKNYKRLVVEWHPD 50
Y+ L + TAS + IR+ +K+L++ HPD
Sbjct: 68 YKILNISPTASKERIREAHKQLMLRNHPD 96
>gnl|CDD|239303 cd03005, PDI_a_ERp46, PDIa family, endoplasmic reticulum protein 46
(ERp46) subfamily; ERp46 is an ER-resident protein
containing three redox active TRX domains. Yeast
complementation studies show that ERp46 can substitute
for protein disulfide isomerase (PDI) function in vivo.
It has been detected in many tissues, however,
transcript and protein levels do not correlate in all
tissues, suggesting regulation at a posttranscriptional
level. An identical protein, named endoPDI, has been
identified as an endothelial PDI that is highly
expressed in the endothelium of tumors and hypoxic
lesions. It has a protective effect on cells exposed to
hypoxia.
Length = 102
Score = 29.2 bits (66), Expect = 1.6
Identities = 7/30 (23%), Positives = 17/30 (56%)
Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSP 178
H + F++ WC C ++ P +++L + +
Sbjct: 18 NHFVKFFAPWCGHCKRLAPTWEQLAKKFNN 47
>gnl|CDD|239295 cd02997, PDI_a_PDIR, PDIa family, PDIR subfamily; composed of
proteins similar to human PDIR (for Protein Disulfide
Isomerase Related). PDIR is composed of three redox
active TRX (a) domains and an N-terminal redox inactive
TRX-like (b) domain. Similar to PDI, it is involved in
oxidative protein folding in the endoplasmic reticulum
(ER) through its isomerase and chaperone activities.
These activities are lower compared to PDI, probably due
to PDIR acting only on a subset of proteins. PDIR is
preferentially expressed in cells actively secreting
proteins and its expression is induced by stress.
Similar to PDI, the isomerase and chaperone activities
of PDIR are independent; CXXC mutants lacking isomerase
activity retain chaperone activity.
Length = 104
Score = 28.8 bits (65), Expect = 2.2
Identities = 20/86 (23%), Positives = 32/86 (37%), Gaps = 4/86 (4%)
Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFT---VHVHNEQGLARRLGVGSQLP 205
L++FY+ WC C +++P F K EL G G L V P
Sbjct: 19 HVLVMFYAPWCGHCKKMKPEFTKAATELKEDGKGVLAAVDCTKPEHDALKEEYNV-KGFP 77
Query: 206 QIALLTDGRTSFFKEPSFSVQKMVEF 231
+G+ E + + ++EF
Sbjct: 78 TFKYFENGKFVEKYEGERTAEDIIEF 103
>gnl|CDD|222448 pfam13905, Thioredoxin_8, Thioredoxin-like. Thioredoxins are small
enzymes that participate in redox reactions, via the
reversible oxidation of an active centre disulfide bond.
Length = 94
Score = 28.4 bits (64), Expect = 2.7
Identities = 11/42 (26%), Positives = 20/42 (47%)
Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQ 192
L+ F++ WC C P K+L ++L V V + ++
Sbjct: 5 LLYFWASWCPPCRAFTPELKELYEKLKKPKVEIVYVSLDRDE 46
>gnl|CDD|239296 cd02998, PDI_a_ERp38, PDIa family, endoplasmic reticulum protein 38
(ERp38) subfamily; composed of proteins similar to the
P5-like protein first isolated from alfalfa, which
contains two redox active TRX (a) domains at the
N-terminus, like human P5, and a C-terminal domain with
homology to the C-terminal domain of ERp29, unlike human
P5. The cDNA clone of this protein (named G1) was
isolated from an alfalfa cDNA library by screening with
human protein disulfide isomerase (PDI) cDNA. The G1
protein is constitutively expressed in all major organs
of the plant and its expression is induced by treatment
with tunicamycin, indicating that it may be a
glucose-regulated protein. The G1 homolog in the
eukaryotic social amoeba Dictyostelium discoideum is
also described as a P5-like protein, which is located in
the endoplasmic reticulum (ER) despite the absence of an
ER-retrieval signal. G1 homologs from Aspergillus niger
and Neurospora crassa have also been characterized, and
are named TIGA and ERp38, respectively. Also included in
the alignment is an atypical PDI from Leishmania
donovani containing a single a domain, and the
C-terminal a domain of a P5-like protein from Entamoeba
histolytica.
Length = 105
Score = 28.8 bits (65), Expect = 2.8
Identities = 14/41 (34%), Positives = 20/41 (48%), Gaps = 4/41 (9%)
Query: 133 THWNFEKNYI-PKSYTTPHLILFYSDWCFACLQVEPIFKKL 172
T NF+K K L+ FY+ WC C + P ++KL
Sbjct: 6 TDSNFDKVVGDDKKDV---LVEFYAPWCGHCKNLAPEYEKL 43
>gnl|CDD|239299 cd03001, PDI_a_P5, PDIa family, P5 subfamily; composed of
eukaryotic proteins similar to human P5, a PDI-related
protein with a domain structure of aa'b (where a and a'
are redox active TRX domains and b is a redox inactive
TRX-like domain). Like PDI, P5 is located in the
endoplasmic reticulum (ER) and displays both isomerase
and chaperone activities, which are independent of each
other. Compared to PDI, the isomerase and chaperone
activities of P5 are lower. The first cysteine in the
CXXC motif of both redox active domains in P5 is
necessary for isomerase activity. The P5 gene was first
isolated as an amplified gene from a
hydroxyurea-resistant hamster cell line. The zebrafish
P5 homolog has been implicated to play a critical role
in establishing left/right asymmetries in the embryonic
midline. Some members of this subfamily are P5-like
proteins containing only one redox active TRX domain.
Length = 103
Score = 28.4 bits (64), Expect = 2.9
Identities = 18/53 (33%), Positives = 25/53 (47%), Gaps = 7/53 (13%)
Query: 151 LILFYSDWCFACLQVEPIFKKLMDELS---PLGVGFFTVHVHNEQGLARRLGV 200
L+ FY+ WC C + P +KK L +G VH Q LA++ GV
Sbjct: 22 LVEFYAPWCGHCKNLAPEWKKAAKALKGIVKVGAVDADVH----QSLAQQYGV 70
>gnl|CDD|221921 pfam13098, Thioredoxin_2, Thioredoxin-like domain.
Length = 105
Score = 28.5 bits (64), Expect = 2.9
Identities = 21/83 (25%), Positives = 32/83 (38%), Gaps = 17/83 (20%)
Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVH---------------NEQG 193
P L++F C C ++ K D L F ++V+ +E+
Sbjct: 7 PVLVVFTDPDCPYCKKLHKELLKDPDVQEYLKDNFVVIYVNVDDSKEVTDFDGETLSEKE 66
Query: 194 LARRLGVGSQLPQIALLT-DGRT 215
LAR+ GV P I L DG+
Sbjct: 67 LARKYGVRGT-PTIVFLDGDGKE 88
>gnl|CDD|239248 cd02950, TxlA, TRX-like protein A (TxlA) family; TxlA was
originally isolated from the cyanobacterium
Synechococcus. It is found only in oxygenic
photosynthetic organisms. TRX is a small enzyme that
participate in redox reactions, via the reversible
oxidation of an active site dithiol present in a CXXC
motif. Disruption of the txlA gene suggests that the
protein is involved in the redox regulation of the
structure and function of photosynthetic apparatus. The
plant homolog (designated as HCF164) is localized in the
chloroplast and is involved in the assembly of the
cytochrome b6f complex, which takes a central position
in photosynthetic electron transport.
Length = 142
Score = 28.8 bits (65), Expect = 3.4
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 149 PHLILFYSDWCFACLQVEPIFKKL 172
P L+ FY+DWC C ++ P KL
Sbjct: 22 PTLVEFYADWCTVCQEMAPDVAKL 45
>gnl|CDD|224081 COG1159, Era, GTPase [General function prediction only].
Length = 298
Score = 29.9 bits (68), Expect = 3.4
Identities = 14/40 (35%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Query: 224 SVQKMVEFFRLKLPYKLIVPLSATNVDAFLDNWREDNKVH 263
+ K++ F + LP+K IVP+SA D +D E K +
Sbjct: 132 VLLKLIAFLKKLLPFKEIVPISALKGDN-VDTLLEIIKEY 170
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 30.1 bits (68), Expect = 3.7
Identities = 14/79 (17%), Positives = 26/79 (32%), Gaps = 10/79 (12%)
Query: 16 AVSLDPYETLGVPRTASLQEIRKNYKRLV------VEWHPDKNNDPTAQEKFL----QLT 65
++L P L + A + KNY + + +E PD + L +
Sbjct: 627 LLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTE 686
Query: 66 EAYNILSDAERRKQYDLFG 84
A I +++ G
Sbjct: 687 SAKKIAKSLQKQHPKAALG 705
>gnl|CDD|219888 pfam08534, Redoxin, Redoxin. This family of redoxins includes
peroxiredoxin, thioredoxin and glutaredoxin proteins.
Length = 142
Score = 28.8 bits (65), Expect = 3.8
Identities = 14/72 (19%), Positives = 25/72 (34%), Gaps = 2/72 (2%)
Query: 149 PHLILFY-SDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQI 207
++ F+ +C C P +KL GV V+ N+ + +
Sbjct: 29 KVVLNFWPGAFCPTCSAEHPYLEKLSKLYKAKGVDVVAVNASNDPFFVMN-FWAKEGLKY 87
Query: 208 ALLTDGRTSFFK 219
+L D +F K
Sbjct: 88 PVLADRDGAFTK 99
>gnl|CDD|221069 pfam11304, DUF3106, Protein of unknown function (DUF3106). Some
members in this family of proteins are annotated as
transmembrane proteins however this cannot be confirmed.
Currently no function is known.
Length = 107
Score = 28.1 bits (63), Expect = 4.4
Identities = 13/45 (28%), Positives = 19/45 (42%), Gaps = 7/45 (15%)
Query: 34 QEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRK 78
++ R NY+R + P +E EAY LS ER+
Sbjct: 65 EQARLNYQRF-------RQLPPEQKEALRARWEAYQALSPEERQA 102
>gnl|CDD|239246 cd02948, TRX_NDPK, TRX domain, TRX and NDP-kinase (NDPK) fusion
protein family; most members of this group are fusion
proteins which contain one redox active TRX domain
containing a CXXC motif and three NDPK domains, and are
characterized as intermediate chains (ICs) of axonemal
outer arm dynein. Dyneins are molecular motors that
generate force against microtubules to produce cellular
movement, and are divided into two classes: axonemal and
cytoplasmic. They are supramolecular complexes
consisting of three protein groups classified according
to size: dynein heavy, intermediate and light chains.
Axonemal dyneins form two structures, the inner and
outer arms, which are attached to doublet microtubules
throughout the cilia and flagella. The human homolog is
the sperm-specific Sptrx-2, presumed to be a component
of the human sperm axoneme architecture. Included in
this group is another human protein, TRX-like protein 2,
a smaller fusion protein containing one TRX and one NDPK
domain, which is also associated with microtubular
structures. The other members of this group are
hypothetical insect proteins containing a TRX domain and
outer arm dynein light chains (14 and 16kDa) of
Chlamydomonas reinhardtii. Using standard assays, the
fusion proteins have shown no TRX enzymatic activity.
Length = 102
Score = 27.7 bits (62), Expect = 5.2
Identities = 12/32 (37%), Positives = 17/32 (53%)
Query: 155 YSDWCFACLQVEPIFKKLMDELSPLGVGFFTV 186
Y +WC C V +FKK+ +EL + F T
Sbjct: 25 YQEWCGPCKAVVSLFKKIKNELGDDLLHFATA 56
>gnl|CDD|239302 cd03004, PDI_a_ERdj5_C, PDIa family, C-terminal ERdj5 subfamily;
ERdj5, also known as JPDI and macrothioredoxin, is a
protein containing an N-terminal DnaJ domain and four
redox active TRX domains. This subfamily is composed of
the three TRX domains located at the C-terminal half of
the protein. ERdj5 is a ubiquitous protein localized in
the endoplasmic reticulum (ER) and is abundant in
secretory cells. It's transcription is induced during ER
stress. It interacts with BiP through its DnaJ domain in
an ATP-dependent manner. BiP, an ER-resident member of
the Hsp70 chaperone family, functions in ER-associated
degradation and protein translocation. Also included in
the alignment is the single complete TRX domain of an
uncharacterized protein from Tetraodon nigroviridis,
which also contains a DnaJ domain at its N-terminus.
Length = 104
Score = 27.6 bits (62), Expect = 6.7
Identities = 15/74 (20%), Positives = 28/74 (37%), Gaps = 2/74 (2%)
Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQIALL 210
L+ FY+ WC C + P +K L V +V + L ++ + + P I L
Sbjct: 23 LVDFYAPWCGPCQALLPELRKAARALKGK-VKVGSVDCQKYESLCQQANIRA-YPTIRLY 80
Query: 211 TDGRTSFFKEPSFS 224
+ + +
Sbjct: 81 PGNASKYHSYNGWH 94
>gnl|CDD|239282 cd02984, TRX_PICOT, TRX domain, PICOT (for PKC-interacting cousin
of TRX) subfamily; PICOT is a protein that interacts
with protein kinase C (PKC) theta, a calcium independent
PKC isoform selectively expressed in skeletal muscle and
T lymphocytes. PICOT contains an N-terminal TRX-like
domain, which does not contain the catalytic CXXC motif,
followed by one to three glutaredoxin domains. The
TRX-like domain is required for interaction with PKC
theta. PICOT inhibits the activation of c-Jun N-terminal
kinase and the transcription factors, AP-1 and NF-kB,
induced by PKC theta or T-cell activating stimuli.
Length = 97
Score = 27.2 bits (61), Expect = 7.6
Identities = 8/40 (20%), Positives = 19/40 (47%)
Query: 139 KNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSP 178
+ + + ++ F++ W C Q+ +F++L E P
Sbjct: 6 EELLKSDASKLLVLHFWAPWAEPCKQMNQVFEELAKEAFP 45
>gnl|CDD|216185 pfam00909, Ammonium_transp, Ammonium Transporter Family.
Length = 399
Score = 28.8 bits (65), Expect = 8.0
Identities = 12/42 (28%), Positives = 20/42 (47%), Gaps = 3/42 (7%)
Query: 516 QSLSRQHILPAVSLIFTVIIIVVLAMIMNHYMKL---EEEEI 554
+ L Q I AV L + + +L +++ + L EEEE
Sbjct: 345 KQLGVQLIGIAVILAYAFGVTFILGLLLGLTLGLRVSEEEEE 386
>gnl|CDD|129372 TIGR00271, TIGR00271, uncharacterized hydrophobic domain. This
domain is in a family of archaeal proteins that includes
AF0785 of Archaeoglobus fulgidus and in several
eubacterial proteins, including the much longer protein
sll1151 from Synechocystis PCC6803.
Length = 175
Score = 28.1 bits (63), Expect = 8.7
Identities = 7/27 (25%), Positives = 14/27 (51%)
Query: 526 AVSLIFTVIIIVVLAMIMNHYMKLEEE 552
A +L+ + ++V AMIM + +
Sbjct: 62 AKNLLLAMAFVIVFAMIMGLFHPSKLT 88
>gnl|CDD|222416 pfam13848, Thioredoxin_6, Thioredoxin-like domain.
Length = 183
Score = 28.1 bits (63), Expect = 9.7
Identities = 14/52 (26%), Positives = 24/52 (46%), Gaps = 3/52 (5%)
Query: 183 FFTVHVHNEQGLARRLGV-GSQLPQIALL--TDGRTSFFKEPSFSVQKMVEF 231
F TV + + LG+ + LP I ++ + +F E FS + + EF
Sbjct: 129 FATVDGKSFGRVLEYLGLSSADLPVIVIVDSASHKKKYFPEDEFSPESLKEF 180
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.138 0.416
Gapped
Lambda K H
0.267 0.0769 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 29,095,126
Number of extensions: 2911702
Number of successful extensions: 2894
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2868
Number of HSP's successfully gapped: 105
Length of query: 554
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 452
Effective length of database: 6,413,494
Effective search space: 2898899288
Effective search space used: 2898899288
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 62 (27.6 bits)