RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16770
         (554 letters)



>gnl|CDD|239261 cd02963, TRX_DnaJ, TRX domain, DnaJ domain containing protein
           family; composed of uncharacterized proteins of about
           500-800 amino acids, containing an N-terminal DnaJ
           domain followed by one redox active TRX domain. DnaJ is
           a member of the 40 kDa heat-shock protein (Hsp40) family
           of molecular chaperones, which regulate the activity of
           Hsp70s. TRX is involved in the redox regulation of many
           protein substrates through the reduction of disulfide
           bonds. TRX has been implicated to catalyse the reduction
           of Hsp33, a chaperone holdase that binds to unfolded
           protein intermediates. The presence of DnaJ and TRX
           domains in members of this family suggests that they
           could be involved in a redox-regulated chaperone
           network.
          Length = 111

 Score =  130 bits (328), Expect = 4e-36
 Identities = 45/112 (40%), Positives = 72/112 (64%), Gaps = 1/112 (0%)

Query: 124 ISLFHKLSTTHWNFEKNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGF 183
            S  +K S T   +E   +PKS+  P+LI   SDWCF+C+ +EP++K+++ EL PLGVG 
Sbjct: 1   DSFDYKYSLTFSQYENEIVPKSFKKPYLIKITSDWCFSCIHIEPVWKEVIQELEPLGVGI 60

Query: 184 FTVHVHNEQGLARRLGVGSQLPQIALLTDGRTSFFKEPSFSVQKMVEFFRLK 235
            TV+  +E+ LAR+LG    +P I  + +G+ +F+ + SF+ Q +V+F R  
Sbjct: 61  ATVNAGHERRLARKLGA-HSVPAIVGIINGQVTFYHDSSFTKQHVVDFVRKL 111


>gnl|CDD|233829 TIGR02349, DnaJ_bact, chaperone protein DnaJ.  This model
           represents bacterial forms of DnaJ, part of the
           DnaK-DnaJ-GrpE chaperone system. The three components
           typically are encoded by consecutive genes. DnaJ
           homologs occur in many genomes, typically not near DnaK
           and GrpE-like genes; most such genes are not included by
           this family. Eukaryotic (mitochondrial and chloroplast)
           forms are not included in the scope of this family.
          Length = 354

 Score =  120 bits (302), Expect = 8e-30
 Identities = 42/103 (40%), Positives = 58/103 (56%), Gaps = 3/103 (2%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
           D YE LGV + AS +EI+K Y++L  ++HPD+N D  A+EKF ++ EAY +LSD E+R Q
Sbjct: 1   DYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNKDKEAEEKFKEINEAYEVLSDPEKRAQ 60

Query: 80  YDLFGTTD---GFSGQDSASRNFHNHMYNPFDDVFSEGFNFPF 119
           YD FG      G  G       F    +  F D+F + F    
Sbjct: 61  YDQFGHAGFNGGGGGGGGGFNGFDIGFFGDFGDIFGDFFGGGG 103


>gnl|CDD|223560 COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn
           finger domain [Posttranslational modification, protein
           turnover, chaperones].
          Length = 371

 Score =  120 bits (302), Expect = 1e-29
 Identities = 41/97 (42%), Positives = 56/97 (57%), Gaps = 3/97 (3%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNN-DPTAQEKFLQLTEAYNILSDAERRK 78
           D YE LGV + AS +EI+K Y++L  ++HPD+N  D  A+EKF ++ EAY +LSD E+R 
Sbjct: 5   DYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNPGDKEAEEKFKEINEAYEVLSDPEKRA 64

Query: 79  QYDLFGTTDGFSGQDSASRNFHNHMYNPFDDVFSEGF 115
            YD FG   GF         F       F D+F + F
Sbjct: 65  AYDQFG-HAGFKAGGFGGFGF-GGFGGDFGDIFEDFF 99


>gnl|CDD|237661 PRK14291, PRK14291, chaperone protein DnaJ; Provisional.
          Length = 382

 Score =  108 bits (272), Expect = 9e-26
 Identities = 42/104 (40%), Positives = 58/104 (55%), Gaps = 7/104 (6%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
           D YE LGV R A+ +EI+K Y+RL  ++HPD N +P A+EKF ++ EAY +LSD E+RK 
Sbjct: 4   DYYEILGVSRNATQEEIKKAYRRLARKYHPDFNKNPEAEEKFKEINEAYQVLSDPEKRKL 63

Query: 80  YDLFGTTDGFSGQDSASRNFHNHMYNPF---DDVFSE---GFNF 117
           YD FG    FSG     +             +D+  +    F F
Sbjct: 64  YDQFG-HAAFSGSGQQQQGQEGFSDFGGGNIEDILEDVFDIFGF 106


>gnl|CDD|215804 pfam00226, DnaJ, DnaJ domain.  DnaJ domains (J-domains) are
          associated with hsp70 heat-shock system and it is
          thought that this domain mediates the interaction.
          DnaJ-domain is therefore part of a chaperone (protein
          folding) system. The T-antigens, although not in
          Prosite are confirmed as DnaJ containing domains from
          literature.
          Length = 63

 Score = 99.1 bits (248), Expect = 2e-25
 Identities = 36/63 (57%), Positives = 48/63 (76%), Gaps = 1/63 (1%)

Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
          D YE LGVPR AS +EI+K Y++L +++HPDKN  DP A+EKF ++ EAY +LSD E+R 
Sbjct: 1  DYYEILGVPRDASDEEIKKAYRKLALKYHPDKNPGDPAAEEKFKEINEAYEVLSDPEKRA 60

Query: 79 QYD 81
           YD
Sbjct: 61 IYD 63


>gnl|CDD|184612 PRK14298, PRK14298, chaperone protein DnaJ; Provisional.
          Length = 377

 Score =  104 bits (262), Expect = 2e-24
 Identities = 45/100 (45%), Positives = 66/100 (66%), Gaps = 1/100 (1%)

Query: 16  AVSLDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAE 75
           A + D YE LG+ + AS+++I+K Y++L +++HPDKN +P A+EKF +++EAY +LSDAE
Sbjct: 2   ATTRDYYEILGLSKDASVEDIKKAYRKLAMKYHPDKNKEPDAEEKFKEISEAYAVLSDAE 61

Query: 76  RRKQYDLFGTTDGFSGQDSASRNFHNHMYNPFDDVFSEGF 115
           +R QYD FG   G   Q SA   F    +  F D+F   F
Sbjct: 62  KRAQYDRFGHA-GIDNQYSAEDIFRGADFGGFGDIFEMFF 100


>gnl|CDD|184604 PRK14283, PRK14283, chaperone protein DnaJ; Provisional.
          Length = 378

 Score =  102 bits (257), Expect = 9e-24
 Identities = 44/103 (42%), Positives = 65/103 (63%), Gaps = 10/103 (9%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
           D YE LGV R A  +EI+K Y++L  ++HPD + +  A+EKF +++EAY +LSD E+R++
Sbjct: 6   DYYEVLGVDRNADKKEIKKAYRKLARKYHPDVSEEEGAEEKFKEISEAYAVLSDDEKRQR 65

Query: 80  YDLFGTT--DGFSGQDSASRNFHNHMYNPFDDVFSEGFNFPFE 120
           YD FG    DGFS +D       N++   F+D+F  GF F   
Sbjct: 66  YDQFGHAGMDGFSQED-----IFNNI--NFEDIFQ-GFGFGIG 100


>gnl|CDD|236757 PRK10767, PRK10767, chaperone protein DnaJ; Provisional.
          Length = 371

 Score =  101 bits (254), Expect = 3e-23
 Identities = 41/105 (39%), Positives = 58/105 (55%), Gaps = 17/105 (16%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
           D YE LGV R AS  EI+K Y++L +++HPD+N  D  A+EKF ++ EAY +LSD ++R 
Sbjct: 5   DYYEVLGVSRNASEDEIKKAYRKLAMKYHPDRNPGDKEAEEKFKEIKEAYEVLSDPQKRA 64

Query: 79  QYDLFG--------TTDGFSGQDSASRNFHNHMYNPFDDVFSEGF 115
            YD +G           GF G       F     + F D+F + F
Sbjct: 65  AYDQYGHAAFEQGGGGGGFGGGG----GFG----DIFGDIFGDIF 101


>gnl|CDD|237662 PRK14292, PRK14292, chaperone protein DnaJ; Provisional.
          Length = 371

 Score = 99.6 bits (248), Expect = 1e-22
 Identities = 45/99 (45%), Positives = 61/99 (61%), Gaps = 7/99 (7%)

Query: 19  LDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRK 78
           +D YE LGV RTAS  EI+  Y++L +++HPD+N +  A EKF Q+ EAY +LSDAE+R 
Sbjct: 2   MDYYELLGVSRTASADEIKSAYRKLALKYHPDRNKEKGAAEKFAQINEAYAVLSDAEKRA 61

Query: 79  QYDLFGTTD--GFSGQDSASRNFHNHMYNPFDDVFSEGF 115
            YD FGT    G  G D     F    ++P  D+F + F
Sbjct: 62  HYDRFGTAPGAGMPGGDP----FGGMGFDPM-DIFEQLF 95


>gnl|CDD|237653 PRK14276, PRK14276, chaperone protein DnaJ; Provisional.
          Length = 380

 Score = 97.5 bits (243), Expect = 7e-22
 Identities = 38/106 (35%), Positives = 58/106 (54%), Gaps = 16/106 (15%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
           + Y+ LGV + AS  EI+K Y++L  ++HPD N +P A+EK+ ++ EAY  LSD ++R  
Sbjct: 5   EYYDRLGVSKDASQDEIKKAYRKLSKKYHPDINKEPGAEEKYKEVQEAYETLSDPQKRAA 64

Query: 80  YDLFGTTD----------GFSGQDSASRNFHNHMYNPFDDVFSEGF 115
           YD +G             GF G D +        +  F+D+FS  F
Sbjct: 65  YDQYGAAGANGGFGGGAGGFGGFDGSGG------FGGFEDIFSSFF 104


>gnl|CDD|237656 PRK14280, PRK14280, chaperone protein DnaJ; Provisional.
          Length = 376

 Score = 96.7 bits (241), Expect = 1e-21
 Identities = 42/99 (42%), Positives = 59/99 (59%), Gaps = 5/99 (5%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
           D YE LGV ++AS  EI+K Y++L  ++HPD N +  A EKF +++EAY +LSD ++R Q
Sbjct: 5   DYYEVLGVSKSASKDEIKKAYRKLSKKYHPDINKEEGADEKFKEISEAYEVLSDDQKRAQ 64

Query: 80  YDLFGTTD---GFSGQDSASRNFHNHMYNPFDDVFSEGF 115
           YD FG      GF G      +F       F+D+FS  F
Sbjct: 65  YDQFGHAGPNQGFGGGGFGGGDFGGGF--GFEDIFSSFF 101


>gnl|CDD|237668 PRK14301, PRK14301, chaperone protein DnaJ; Provisional.
          Length = 373

 Score = 96.0 bits (239), Expect = 2e-21
 Identities = 42/105 (40%), Positives = 64/105 (60%), Gaps = 5/105 (4%)

Query: 16  AVSLDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDA 74
               D YE LGV R AS  EI+K Y++L +++HPD+N ++P A++KF +  EAY +L DA
Sbjct: 1   MSQRDYYEVLGVSRDASEDEIKKAYRKLALQYHPDRNPDNPEAEQKFKEAAEAYEVLRDA 60

Query: 75  ERRKQYDLFGTTDGFSGQDSASRNFHNH--MYNPFDDVFSEGFNF 117
           E+R +YD FG   G +G       F +   +++ F D+F + F F
Sbjct: 61  EKRARYDRFG-HAGVNGNGGFG-GFSSAEDIFSHFSDIFGDLFGF 103


>gnl|CDD|237667 PRK14299, PRK14299, chaperone protein DnaJ; Provisional.
          Length = 291

 Score = 94.6 bits (235), Expect = 2e-21
 Identities = 41/105 (39%), Positives = 57/105 (54%), Gaps = 6/105 (5%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
           D Y  LGVP+ AS  EI+K +K+L  ++HPD N  P A+EKF ++ EAY +LSD E+R+ 
Sbjct: 5   DYYAILGVPKNASQDEIKKAFKKLARKYHPDVNKSPGAEEKFKEINEAYTVLSDPEKRRI 64

Query: 80  YDLFGTTDGFSGQDS------ASRNFHNHMYNPFDDVFSEGFNFP 118
           YD +GTT   +G            +F       F D F + F   
Sbjct: 65  YDTYGTTAASAGWQGPPPGPPGGGDFSGFNVGDFSDFFQQLFGGR 109


>gnl|CDD|237664 PRK14294, PRK14294, chaperone protein DnaJ; Provisional.
          Length = 366

 Score = 94.4 bits (235), Expect = 5e-21
 Identities = 42/100 (42%), Positives = 64/100 (64%), Gaps = 4/100 (4%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
           D YE LGV R AS +EI+K+Y++L +++HPD+N  D  A+E F +  EAY +LSD ++R 
Sbjct: 5   DYYEILGVTRDASEEEIKKSYRKLAMKYHPDRNPGDKEAEELFKEAAEAYEVLSDPKKRG 64

Query: 79  QYDLFGTTDGFSGQD-SASRNFHNHMYNPFDDVFSEGFNF 117
            YD +G  +G SG   S    F + +++ F D+F + F F
Sbjct: 65  IYDQYG-HEGLSGTGFSGFSGFDD-IFSSFGDIFEDFFGF 102


>gnl|CDD|184603 PRK14282, PRK14282, chaperone protein DnaJ; Provisional.
          Length = 369

 Score = 94.5 bits (235), Expect = 5e-21
 Identities = 40/102 (39%), Positives = 59/102 (57%), Gaps = 11/102 (10%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN--NDPTAQEKFLQLTEAYNILSDAERR 77
           D YE LGV R A+ +EI++ YKRLV EWHPD++  N   A++KF ++ EAY +LSD ++R
Sbjct: 5   DYYEILGVSRNATQEEIKRAYKRLVKEWHPDRHPENRKEAEQKFKEIQEAYEVLSDPQKR 64

Query: 78  KQYDLFG-TTDG-FSGQDSASRNFHNHMYNPFDDVFSEGFNF 117
             YD FG   +     +  +   F       F+D+F +  N 
Sbjct: 65  AMYDRFGYVGEQPPYQETESGGGF-------FEDIFKDFENI 99


>gnl|CDD|197617 smart00271, DnaJ, DnaJ molecular chaperone homology domain. 
          Length = 60

 Score = 83.4 bits (207), Expect = 5e-20
 Identities = 32/59 (54%), Positives = 44/59 (74%), Gaps = 2/59 (3%)

Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN--NDPTAQEKFLQLTEAYNILSDAER 76
          D YE LGVPR ASL EI+K Y++L +++HPDKN  +   A+EKF ++ EAY +LSD E+
Sbjct: 2  DYYEILGVPRDASLDEIKKAYRKLALKYHPDKNPGDKEEAEEKFKEINEAYEVLSDPEK 60


>gnl|CDD|237657 PRK14281, PRK14281, chaperone protein DnaJ; Provisional.
          Length = 397

 Score = 91.8 bits (228), Expect = 5e-20
 Identities = 42/117 (35%), Positives = 68/117 (58%), Gaps = 16/117 (13%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
           D YE LGV R+A   EI+K Y++L +++HPDKN ++  A+E F ++ EAY +LS+ ++R+
Sbjct: 4   DYYEVLGVSRSADKDEIKKAYRKLALKYHPDKNPDNKEAEEHFKEVNEAYEVLSNDDKRR 63

Query: 79  QYDLFG--------TTDGFSGQDSASRNFHNHMYNPFDDVFSEGFN------FPFEE 121
           +YD FG         + G  G      +F N +++ F+D+F  G        F FE+
Sbjct: 64  RYDQFGHAGVGSSAASGGGPGYGGGGGDF-NDIFSAFNDMFGGGARRGGGSPFGFED 119


>gnl|CDD|99751 cd06257, DnaJ, DnaJ domain or J-domain.  DnaJ/Hsp40 (heat shock
          protein 40) proteins are highly conserved and play
          crucial roles in protein translation, folding,
          unfolding, translocation, and degradation. They act
          primarily by stimulating the ATPase activity of Hsp70s,
          an important chaperonine family. Hsp40 proteins are
          characterized by the presence of a J domain, which
          mediates the interaction with Hsp70. They may contain
          other domains as well, and the architectures provide a
          means of classification.
          Length = 55

 Score = 83.0 bits (206), Expect = 6e-20
 Identities = 30/55 (54%), Positives = 42/55 (76%), Gaps = 1/55 (1%)

Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNND-PTAQEKFLQLTEAYNILSD 73
          D Y+ LGVP  AS +EI+K Y++L +++HPDKN D P A+EKF ++ EAY +LSD
Sbjct: 1  DYYDILGVPPDASDEEIKKAYRKLALKYHPDKNPDDPEAEEKFKEINEAYEVLSD 55


>gnl|CDD|184599 PRK14277, PRK14277, chaperone protein DnaJ; Provisional.
          Length = 386

 Score = 90.6 bits (225), Expect = 1e-19
 Identities = 42/110 (38%), Positives = 58/110 (52%), Gaps = 11/110 (10%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
           D YE LGV R A+ +EI+K Y+RL  ++HPD N  D  A++KF ++ EAY ILSD ++R 
Sbjct: 6   DYYEILGVDRNATEEEIKKAYRRLAKKYHPDLNPGDKEAEQKFKEINEAYEILSDPQKRA 65

Query: 79  QYDLFG---------TTDGFSGQDSASRNFHNHMYNPFDDVFSEGFNFPF 119
           QYD FG            GF           +  +  F D+F + F   F
Sbjct: 66  QYDQFGHAAFDPGGFGQGGFGQGG-FGGGGFDFDFGGFGDIFEDIFGDFF 114


>gnl|CDD|237665 PRK14295, PRK14295, chaperone protein DnaJ; Provisional.
          Length = 389

 Score = 89.5 bits (222), Expect = 3e-19
 Identities = 34/70 (48%), Positives = 51/70 (72%), Gaps = 5/70 (7%)

Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
          D Y+ LGVP+ A+  EI+K Y++L  E+HPD N  D  A+E+F +++EAY++LSD ++RK
Sbjct: 10 DYYKVLGVPKDATEAEIKKAYRKLAREYHPDANKGDAKAEERFKEISEAYDVLSDEKKRK 69

Query: 79 QYD----LFG 84
          +YD    LFG
Sbjct: 70 EYDEARSLFG 79


>gnl|CDD|184611 PRK14297, PRK14297, chaperone protein DnaJ; Provisional.
          Length = 380

 Score = 89.1 bits (221), Expect = 3e-19
 Identities = 41/100 (41%), Positives = 59/100 (59%), Gaps = 4/100 (4%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
           D YE LG+ + AS  EI+K +++L +++HPDKN  +  A+EKF ++ EAY +LSD +++ 
Sbjct: 5   DYYEVLGLEKGASDDEIKKAFRKLAIKYHPDKNKGNKEAEEKFKEINEAYQVLSDPQKKA 64

Query: 79  QYDLFGTTD--GFSGQDSAS-RNFHNHMYNPFDDVFSEGF 115
           QYD FGT D  G  G  S     F       F D+F   F
Sbjct: 65  QYDQFGTADFNGAGGFGSGGFGGFDFSDMGGFGDIFDSFF 104


>gnl|CDD|237666 PRK14296, PRK14296, chaperone protein DnaJ; Provisional.
          Length = 372

 Score = 88.9 bits (220), Expect = 5e-19
 Identities = 44/105 (41%), Positives = 63/105 (60%), Gaps = 9/105 (8%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
           D YE LGV +TAS QEIR+ Y++L  ++HPD N  P A +K +++ EA ++L D ++RKQ
Sbjct: 5   DYYEVLGVSKTASEQEIRQAYRKLAKQYHPDLNKSPDAHDKMVEINEAADVLLDKDKRKQ 64

Query: 80  YDLFGTT--DGFSGQDSASRNFHNHMYN-------PFDDVFSEGF 115
           YD FG    DG SG  S   +F +   N        F ++FS+ F
Sbjct: 65  YDQFGHAAFDGSSGFSSNFGDFEDLFSNMGSSGFSSFTNIFSDFF 109


>gnl|CDD|237658 PRK14284, PRK14284, chaperone protein DnaJ; Provisional.
          Length = 391

 Score = 87.2 bits (216), Expect = 2e-18
 Identities = 35/75 (46%), Positives = 55/75 (73%), Gaps = 1/75 (1%)

Query: 19 LDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERR 77
          +D Y  LGV +TAS +EI+K Y++L V++HPDKN  D  A+++F +++EAY +LSDA++R
Sbjct: 1  MDYYTILGVSKTASPEEIKKAYRKLAVKYHPDKNPGDAEAEKRFKEVSEAYEVLSDAQKR 60

Query: 78 KQYDLFGTTDGFSGQ 92
          + YD +G    F+G 
Sbjct: 61 ESYDRYGKDGPFAGA 75


>gnl|CDD|172778 PRK14290, PRK14290, chaperone protein DnaJ; Provisional.
          Length = 365

 Score = 86.9 bits (215), Expect = 2e-18
 Identities = 42/103 (40%), Positives = 66/103 (64%), Gaps = 7/103 (6%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN--NDPTAQEKFLQLTEAYNILSDAERR 77
           D Y+ LGV R AS ++I+K ++ L  +WHPD +  N   A+EKF +++EAY +LSD ++R
Sbjct: 4   DYYKILGVDRNASQEDIKKAFRELAKKWHPDLHPGNKAEAEEKFKEISEAYEVLSDPQKR 63

Query: 78  KQYDLFGTTD-GFSGQDSASRNFHNHMYNPFDDVFSEGF--NF 117
           +QYD  GT D G  G +    NF +  ++  +D+F++ F  NF
Sbjct: 64  RQYDQTGTVDFGAGGSNFNWDNFTH--FSDINDIFNQIFGGNF 104


>gnl|CDD|237659 PRK14287, PRK14287, chaperone protein DnaJ; Provisional.
          Length = 371

 Score = 84.7 bits (209), Expect = 1e-17
 Identities = 40/99 (40%), Positives = 58/99 (58%), Gaps = 10/99 (10%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
           D YE LGV R AS+ E++K Y++L  ++HPD N  P A++KF ++ EAY+ LSD +++  
Sbjct: 5   DYYEVLGVDRNASVDEVKKAYRKLARKYHPDVNKAPDAEDKFKEVKEAYDTLSDPQKKAH 64

Query: 80  YDLFGTTD---GFSGQDSASRNFHNHMYNPFDDVFSEGF 115
           YD FG TD   GF G  +         +  F D+F   F
Sbjct: 65  YDQFGHTDPNQGFGGGGAGD-------FGGFSDIFDMFF 96


>gnl|CDD|172774 PRK14286, PRK14286, chaperone protein DnaJ; Provisional.
          Length = 372

 Score = 83.9 bits (207), Expect = 2e-17
 Identities = 35/98 (35%), Positives = 59/98 (60%), Gaps = 4/98 (4%)

Query: 22  YETLGVPRTASLQEIRKNYKRLVVEWHPDKNN-DPTAQEKFLQLTEAYNILSDAERRKQY 80
           Y+ LGV ++A+ +EI+  Y++L +++HPDKN  +  ++EKF + TEAY IL D ++R+ Y
Sbjct: 7   YDILGVSKSANDEEIKSAYRKLAIKYHPDKNKGNKESEEKFKEATEAYEILRDPKKRQAY 66

Query: 81  DLF---GTTDGFSGQDSASRNFHNHMYNPFDDVFSEGF 115
           D F   G   G  G    +    + ++  F D+F + F
Sbjct: 67  DQFGKAGVNAGAGGFGQGAYTDFSDIFGDFGDIFGDFF 104


>gnl|CDD|237660 PRK14289, PRK14289, chaperone protein DnaJ; Provisional.
          Length = 386

 Score = 83.7 bits (207), Expect = 3e-17
 Identities = 39/94 (41%), Positives = 59/94 (62%), Gaps = 4/94 (4%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
           D YE LGV +TA++ EI+K Y++  +++HPDKN  D  A+EKF +  EAY++LSD ++R 
Sbjct: 6   DYYEVLGVSKTATVDEIKKAYRKKAIQYHPDKNPGDKEAEEKFKEAAEAYDVLSDPDKRS 65

Query: 79  QYDLFGTTDGFSGQDSASRNFHNHMYNPFDDVFS 112
           +YD FG   G  G  +    F     +  +D+FS
Sbjct: 66  RYDQFGHA-GVGGA-AGGGGFSGEGMS-MEDIFS 96


>gnl|CDD|237654 PRK14278, PRK14278, chaperone protein DnaJ; Provisional.
          Length = 378

 Score = 83.2 bits (206), Expect = 3e-17
 Identities = 33/65 (50%), Positives = 43/65 (66%)

Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
          D Y  LGV R AS  EI++ Y++L  E HPD N D  AQEKF +++ AY +LSD E+R+ 
Sbjct: 4  DYYGLLGVSRNASDAEIKRAYRKLARELHPDVNPDEEAQEKFKEISVAYEVLSDPEKRRI 63

Query: 80 YDLFG 84
           DL G
Sbjct: 64 VDLGG 68


>gnl|CDD|237663 PRK14293, PRK14293, chaperone protein DnaJ; Provisional.
          Length = 374

 Score = 82.3 bits (204), Expect = 5e-17
 Identities = 33/72 (45%), Positives = 47/72 (65%), Gaps = 1/72 (1%)

Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
          D YE LGV R A   E+++ Y+RL  ++HPD N +P A+++F ++  AY +LSD E R +
Sbjct: 4  DYYEILGVSRDADKDELKRAYRRLARKYHPDVNKEPGAEDRFKEINRAYEVLSDPETRAR 63

Query: 80 YDLFGTTDGFSG 91
          YD FG   G SG
Sbjct: 64 YDQFGEA-GVSG 74


>gnl|CDD|225124 COG2214, CbpA, DnaJ-class molecular chaperone [Posttranslational
           modification, protein turnover, chaperones].
          Length = 237

 Score = 79.9 bits (196), Expect = 7e-17
 Identities = 43/117 (36%), Positives = 61/117 (52%), Gaps = 3/117 (2%)

Query: 18  SLDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDP--TAQEKFLQLTEAYNILSDAE 75
            LD YE LGVP  ASL+EI+K Y++L +++HPD+N      A+EKF ++ EAY ILSD E
Sbjct: 5   LLDYYEILGVPPNASLEEIKKAYRKLALKYHPDRNPGDPKVAEEKFKEINEAYEILSDPE 64

Query: 76  RRKQYDLFGTTDGFSGQDS-ASRNFHNHMYNPFDDVFSEGFNFPFEEHDISLFHKLS 131
           RR +YD  G  + F G+    S  F          ++ E      +    +    L 
Sbjct: 65  RRAEYDKIGADNYFPGKSYRESAAFPIEGSLQLTGLYEEWRGKRNKTERKASLADLL 121


>gnl|CDD|172773 PRK14285, PRK14285, chaperone protein DnaJ; Provisional.
          Length = 365

 Score = 78.9 bits (194), Expect = 8e-16
 Identities = 39/102 (38%), Positives = 57/102 (55%), Gaps = 7/102 (6%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
           D YE LG+ + AS  EI+K Y+++ +++HPDKN  +  A+  F + TEAY +L D  +R 
Sbjct: 4   DYYEILGLSKGASKDEIKKAYRKIAIKYHPDKNKGNKEAESIFKEATEAYEVLIDDNKRA 63

Query: 79  QYDLFGTT-----DGFSGQDSASRNFHNHMYNPFDDVFSEGF 115
           QYD FG T      GF G       F + ++  F D+F   F
Sbjct: 64  QYDRFGHTAFEGGGGFEGFSGGFSGFSD-IFEDFGDIFDSFF 104


>gnl|CDD|237655 PRK14279, PRK14279, chaperone protein DnaJ; Provisional.
          Length = 392

 Score = 78.6 bits (194), Expect = 1e-15
 Identities = 31/63 (49%), Positives = 45/63 (71%), Gaps = 1/63 (1%)

Query: 20 DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAERRK 78
          D Y+ LGV   AS +EI+K Y++L  E HPD N  DP A+E+F  ++EA+++LSD  +RK
Sbjct: 10 DFYKELGVSSDASAEEIKKAYRKLARELHPDANPGDPAAEERFKAVSEAHDVLSDPAKRK 69

Query: 79 QYD 81
          +YD
Sbjct: 70 EYD 72


>gnl|CDD|172788 PRK14300, PRK14300, chaperone protein DnaJ; Provisional.
          Length = 372

 Score = 75.8 bits (186), Expect = 1e-14
 Identities = 37/102 (36%), Positives = 58/102 (56%), Gaps = 6/102 (5%)

Query: 17  VSLDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAER 76
           +S D Y+ LGV +TAS  +++K Y +L  ++HPD  +   A++KF ++  AY++L D ++
Sbjct: 1   MSQDYYQILGVSKTASQADLKKAYLKLAKQYHPDTTDAKDAEKKFKEINAAYDVLKDEQK 60

Query: 77  RKQYDLFGTTDGFSGQDS-----ASRNFHNHMYNPFDDVFSE 113
           R  YD FG  D F  Q S         FH  + + F D FS+
Sbjct: 61  RAAYDRFG-HDAFQNQQSRGGGGNHGGFHPDINDIFGDFFSD 101


>gnl|CDD|240236 PTZ00037, PTZ00037, DnaJ_C chaperone protein; Provisional.
          Length = 421

 Score = 75.2 bits (185), Expect = 2e-14
 Identities = 30/70 (42%), Positives = 45/70 (64%), Gaps = 4/70 (5%)

Query: 22 YETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQYD 81
          YE L + +  +  EI+K Y++L ++ HPDK  DP   EKF +++ AY +LSD E+RK YD
Sbjct: 31 YEVLNLSKDCTTSEIKKAYRKLAIKHHPDKGGDP---EKFKEISRAYEVLSDPEKRKIYD 87

Query: 82 LFGTTDGFSG 91
           +G  +G  G
Sbjct: 88 EYG-EEGLEG 96


>gnl|CDD|182347 PRK10266, PRK10266, curved DNA-binding protein CbpA; Provisional.
          Length = 306

 Score = 72.2 bits (177), Expect = 8e-14
 Identities = 32/99 (32%), Positives = 52/99 (52%), Gaps = 4/99 (4%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
           D Y  +GV  T  L+ I+  Y+RL  ++HPD + +P A+ +F ++ EA+ +LSD +RR +
Sbjct: 5   DYYAIMGVKPTDDLKTIKTAYRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAE 64

Query: 80  YD---LFGTTDGFSGQDSASRNFHNHMYNPFDDVFSEGF 115
           YD          F  +     +  +     FDD+FS  F
Sbjct: 65  YDQLWQHRNDPQF-NRQFQHGDGQSFNAEDFDDIFSSIF 102


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
           TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
           homolog and probable assembly protein of the Mycoplasma
           terminal organelle. The terminal organelle is involved
           in both cytadherence and gliding motility [Cellular
           processes, Chemotaxis and motility].
          Length = 871

 Score = 72.5 bits (177), Expect = 2e-13
 Identities = 33/88 (37%), Positives = 53/88 (60%), Gaps = 3/88 (3%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQ 79
           D YE LG+ R A  QEI+K +++L  ++HPD+N  P A   F ++ EA ++LS+ ++R  
Sbjct: 3   DYYEVLGIDRDADEQEIKKAFRKLAKKYHPDRNKAPDAASIFAEINEANDVLSNPKKRAN 62

Query: 80  YDLFGTTDGFSGQDSASRNFHNHMYNPF 107
           YD +G  DG   +D    +F   ++N F
Sbjct: 63  YDKYG-HDGVDREDDF--DFQADVFNSF 87


>gnl|CDD|172776 PRK14288, PRK14288, chaperone protein DnaJ; Provisional.
          Length = 369

 Score = 65.1 bits (158), Expect = 2e-11
 Identities = 35/102 (34%), Positives = 60/102 (58%), Gaps = 3/102 (2%)

Query: 17  VSLDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKN-NDPTAQEKFLQLTEAYNILSDAE 75
           + L  YE L V + ++ + I+K+Y++L +++HPD+N  D  A+EKF  + EAY +LSD +
Sbjct: 1   MELSYYEILEVEKHSNQETIKKSYRKLALKYHPDRNAGDKEAEEKFKLINEAYGVLSDEK 60

Query: 76  RRKQYDLFGTTDGFSGQDSASRNFHNHMYNPFDDVFSEGFNF 117
           +R  YD +G   G + Q  AS++  +  +      F + F F
Sbjct: 61  KRALYDRYG-KKGLN-QAGASQSDFSDFFEDLGSFFEDAFGF 100


>gnl|CDD|227694 COG5407, SEC63, Preprotein translocase subunit Sec63 [Intracellular
           trafficking and secretion].
          Length = 610

 Score = 65.0 bits (158), Expect = 5e-11
 Identities = 28/75 (37%), Positives = 46/75 (61%), Gaps = 6/75 (8%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDK--NNDPTA----QEKFLQLTEAYNILSD 73
           DPYE LG+ +  S ++I+K Y+ L +++HPDK            +EK+  +T+AY +L+D
Sbjct: 99  DPYEILGIDQDTSERDIKKRYRMLSMKFHPDKAPPMVNELRSEYEEKYKTITKAYGLLTD 158

Query: 74  AERRKQYDLFGTTDG 88
            +RR+ Y  +GT D 
Sbjct: 159 KKRRENYLNYGTPDS 173


>gnl|CDD|225660 COG3118, COG3118, Thioredoxin domain-containing protein
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 304

 Score = 51.2 bits (123), Expect = 6e-07
 Identities = 26/83 (31%), Positives = 38/83 (45%), Gaps = 2/83 (2%)

Query: 132 TTHWNFEKNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNE 191
            T  NFE+  I  S   P L+ F++ WC  C Q+ P  +KL  E          V+   E
Sbjct: 28  VTEANFEQEVIQSSREVPVLVDFWAPWCGPCKQLTPTLEKLAAEYKG-KFKLAKVNCDAE 86

Query: 192 QGLARRLGVGSQLPQIALLTDGR 214
             +A + GV S +P +    DG+
Sbjct: 87  PMVAAQFGVQS-IPTVYAFKDGQ 108


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
           Provisional.
          Length = 1136

 Score = 50.9 bits (121), Expect = 2e-06
 Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 22  YETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQYD 81
           Y+ LGV   A ++EI + Y +L   ++P K +      KF ++ EAY IL D +++K Y+
Sbjct: 576 YDILGVGVNADMKEISERYFKLAENYYPPKRSGNEGFHKFKKINEAYQILGDIDKKKMYN 635

Query: 82  LFGTTDGFSG 91
            FG  DG  G
Sbjct: 636 KFG-YDGIKG 644


>gnl|CDD|239259 cd02961, PDI_a_family, Protein Disulfide Isomerase (PDIa) family,
           redox active TRX domains; composed of eukaryotic
           proteins involved in oxidative protein folding in the
           endoplasmic reticulum (ER) by acting as catalysts and
           folding assistants. Members of this family include PDI
           and PDI-related proteins like ERp72, ERp57 (or ERp60),
           ERp44, P5, PDIR, ERp46 and the transmembrane PDIs. PDI,
           ERp57, ERp72, P5, PDIR and ERp46 are all oxidases,
           catalyzing the formation of disulfide bonds of newly
           synthesized polypeptides in the ER. They also exhibit
           reductase activity in acting as isomerases to correct
           any non-native disulfide bonds, as well as chaperone
           activity to prevent protein aggregation and facilitate
           the folding of newly synthesized proteins. These
           proteins usually contain multiple copies of a redox
           active TRX (a) domain containing a CXXC motif, and may
           also contain one or more redox inactive TRX-like (b)
           domains. Only one a domain is required for the oxidase
           function but multiple copies are necessary for the
           isomerase function. The different types of PDIs may show
           different substrate specificities and tissue-specific
           expression, or may be induced by stress. PDIs are in
           their reduced form at steady state and are oxidized to
           the active form by Ero1, which is localized in the ER
           through ERp44. Some members of this family also contain
           a DnaJ domain in addition to the redox active a domains;
           examples are ERdj5 and Pfj2. Also included in the family
           is the redox inactive N-terminal TRX-like domain of
           ERp29.
          Length = 101

 Score = 43.8 bits (104), Expect = 1e-05
 Identities = 28/98 (28%), Positives = 41/98 (41%), Gaps = 6/98 (6%)

Query: 136 NFEKNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSP-LGVGFFTVHVHNEQGL 194
           NF++  +  S     L+ FY+ WC  C  + P ++KL  EL     V    V       L
Sbjct: 7   NFDE-LVKDS--KDVLVEFYAPWCGHCKALAPEYEKLAKELKGDGKVVVAKVDCTANNDL 63

Query: 195 ARRLGVGSQLPQIALLTDGRTSFFK-EPSFSVQKMVEF 231
               GV    P I L  +G     K E   +++ +VEF
Sbjct: 64  CSEYGV-RGYPTIKLFPNGSKEPVKYEGPRTLESLVEF 100


>gnl|CDD|239245 cd02947, TRX_family, TRX family; composed of two groups: Group I,
           which includes proteins that exclusively encode a TRX
           domain; and Group II, which are composed of fusion
           proteins of TRX and additional domains. Group I TRX is a
           small ancient protein that alter the redox state of
           target proteins via the reversible oxidation of an
           active site dithiol, present in a CXXC motif, partially
           exposed at the protein's surface. TRX reduces protein
           disulfide bonds, resulting in a disulfide bond at its
           active site. Oxidized TRX is converted to the active
           form by TRX reductase, using reducing equivalents
           derived from either NADPH or ferredoxins. By altering
           their redox state, TRX regulates the functions of at
           least 30 target proteins, some of which are enzymes and
           transcription factors. It also plays an important role
           in the defense against oxidative stress by directly
           reducing hydrogen peroxide and certain radicals, and by
           serving as a reductant for peroxiredoxins. At least two
           major types of functional TRXs have been reported in
           most organisms; in eukaryotes, they are located in the
           cytoplasm and the mitochondria. Higher plants contain
           more types (at least 20 TRX genes have been detected in
           the genome of Arabidopsis thaliana), two of which (types
           f amd m) are located in the same compartment, the
           chloroplast. Also included in the alignment are TRX-like
           domains which show sequence homology to TRX but do not
           contain the redox active CXXC motif. Group II proteins,
           in addition to either a redox active TRX or a TRX-like
           domain, also contain additional domains, which may or
           may not possess homology to known proteins.
          Length = 93

 Score = 43.3 bits (103), Expect = 2e-05
 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 3/66 (4%)

Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQIA 208
           P ++ F++ WC  C  + P+ ++L +E     V F  V V     LA   GV   +P   
Sbjct: 12  PVVVDFWAPWCGPCKAIAPVLEELAEEYP--KVKFVKVDVDENPELAEEYGV-RSIPTFL 68

Query: 209 LLTDGR 214
              +G+
Sbjct: 69  FFKNGK 74


>gnl|CDD|239254 cd02956, ybbN, ybbN protein family; ybbN is a hypothetical protein
           containing a redox-inactive TRX-like domain. Its gene
           has been sequenced from several gammaproteobacteria and
           actinobacteria.
          Length = 96

 Score = 41.9 bits (99), Expect = 5e-05
 Identities = 16/79 (20%), Positives = 37/79 (46%), Gaps = 3/79 (3%)

Query: 136 NFEKNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLA 195
           NF+   + +S   P ++ F++       ++ P+ ++L +E          V+   +  +A
Sbjct: 2   NFQ-QVLQESTQVPVVVDFWAPRSPPSKELLPLLERLAEE-YQGQFVLAKVNCDAQPQIA 59

Query: 196 RRLGVGSQLPQIALLTDGR 214
           ++ GV + LP + L   G+
Sbjct: 60  QQFGVQA-LPTVYLFAAGQ 77


>gnl|CDD|239292 cd02994, PDI_a_TMX, PDIa family, TMX subfamily; composed of
           proteins similar to the TRX-related human transmembrane
           protein, TMX. TMX is a type I integral membrane protein;
           the N-terminal redox active TRX domain is present in the
           endoplasmic reticulum (ER) lumen while the C-terminus is
           oriented towards the cytoplasm. It is expressed in many
           cell types and its active site motif (CPAC) is unique.
           In vitro, TMX reduces interchain disulfides of insulin
           and renatures inactive RNase containing incorrect
           disulfide bonds. The C. elegans homolog, DPY-11, is
           expressed only in the hypodermis and resides in the
           cytoplasm. It is required for body and sensory organ
           morphogeneis. Another uncharacterized TRX-related
           transmembrane protein, human TMX4, is included in the
           alignment. The active site sequence of TMX4 is CPSC.
          Length = 101

 Score = 41.2 bits (97), Expect = 1e-04
 Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 1/63 (1%)

Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQIALL 210
           +I FY+ WC AC Q++P +++  D    LG+    V V  E GL+ R  V + LP I   
Sbjct: 20  MIEFYAPWCPACQQLQPEWEEFADWSDDLGINVAKVDVTQEPGLSGRFFVTA-LPTIYHA 78

Query: 211 TDG 213
            DG
Sbjct: 79  KDG 81


>gnl|CDD|200072 TIGR01068, thioredoxin, thioredoxin.  Several proteins, such as
           protein disulfide isomerase, have two or more copies of
           a domain closely related to thioredoxin. This model is
           designed to recognize authentic thioredoxin, a small
           protein that should be hit exactly once by This model.
           Any protein that hits once with a score greater than the
           second (per domain) trusted cutoff may be taken as
           thioredoxin [Energy metabolism, Electron transport].
          Length = 101

 Score = 40.7 bits (96), Expect = 2e-04
 Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 2/65 (3%)

Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQIA 208
           P L+ F++ WC  C  + PI ++L  E     V F  ++V     +A + G+ S +P + 
Sbjct: 16  PVLVDFWAPWCGPCKMIAPILEELAKEYEG-KVKFVKLNVDENPDIAAKYGIRS-IPTLL 73

Query: 209 LLTDG 213
           L  +G
Sbjct: 74  LFKNG 78


>gnl|CDD|224002 COG1076, DjlA, DnaJ-domain-containing proteins 1 [Posttranslational
           modification, protein turnover, chaperones].
          Length = 174

 Score = 41.2 bits (97), Expect = 3e-04
 Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 5/62 (8%)

Query: 19  LDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTAQEKFL-----QLTEAYNILSD 73
            D  + LGV   A    I+K Y++L+ E HPDK      + +F+     +L E      D
Sbjct: 113 EDALKVLGVEIKADQDAIKKAYRKLLSEQHPDKAAAKGLKLEFIEKLKEKLQEIQEAYED 172

Query: 74  AE 75
            +
Sbjct: 173 IK 174



 Score = 28.8 bits (65), Expect = 5.6
 Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 8/69 (11%)

Query: 20 DPYETLGVPRTAS--LQEIRKNYKRLVVEWHPDKNNDPTAQEKFL------QLTEAYNIL 71
          D +   G+PR     L  ++  Y+ L   +HPD+    +  E+        ++  AY  L
Sbjct: 2  DGFVLFGLPRAFQIDLDALKLQYRELQRAYHPDRFGKASEAEQRKALQQSAEVNPAYQTL 61

Query: 72 SDAERRKQY 80
           D   R +Y
Sbjct: 62 KDPLLRAEY 70


>gnl|CDD|236512 PRK09430, djlA, Dna-J like membrane chaperone protein; Provisional.
          Length = 267

 Score = 42.1 bits (100), Expect = 3e-04
 Identities = 15/32 (46%), Positives = 22/32 (68%)

Query: 20  DPYETLGVPRTASLQEIRKNYKRLVVEWHPDK 51
           D Y+ LGV  +   QEI++ Y++L+ E HPDK
Sbjct: 201 DAYKVLGVSESDDDQEIKRAYRKLMSEHHPDK 232


>gnl|CDD|215704 pfam00085, Thioredoxin, Thioredoxin.  Thioredoxins are small
           enzymes that participate in redox reactions, via the
           reversible oxidation of an active centre disulfide bond.
           Some members with only the active site are not separated
           from the noise.
          Length = 104

 Score = 38.7 bits (91), Expect = 8e-04
 Identities = 25/78 (32%), Positives = 34/78 (43%), Gaps = 4/78 (5%)

Query: 136 NFEKNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLA 195
           NF++  + KS   P L+ FY+ WC  C  + P ++KL  E     V F  V       LA
Sbjct: 9   NFDE-EVAKS-DKPVLVDFYAPWCGPCKALAPEYEKLAQEYKD-DVKFAKVDADENPDLA 65

Query: 196 RRLGVGSQLPQIALLTDG 213
              GV    P I    +G
Sbjct: 66  SEYGVRG-FPTIKFFKNG 82


>gnl|CDD|185622 PTZ00443, PTZ00443, Thioredoxin domain-containing protein;
           Provisional.
          Length = 224

 Score = 40.0 bits (93), Expect = 0.002
 Identities = 34/148 (22%), Positives = 59/148 (39%), Gaps = 10/148 (6%)

Query: 128 HKLSTTHWNFEK--NYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFT 185
             +     NFEK       + T P  + FY+ WC  C ++ P +++L   L    V    
Sbjct: 31  ALVLLNDKNFEKLTQASTGATTGPWFVKFYAPWCSHCRKMAPAWERLAKALKGQ-VNVAD 89

Query: 186 VHVHNEQGLARRLGVGSQLPQIALLTDGRTSFFKEPSFSVQKMVEF----FRLKLPYKLI 241
           +       LA+R  +    P + L   G+   ++    S +K+  F    F+  L   + 
Sbjct: 90  LDATRALNLAKRFAIKG-YPTLLLFDKGKMYQYEGGDRSTEKLAAFALGDFKKALGAPVP 148

Query: 242 VPLS--ATNVDAFLDNWREDNKVHALLF 267
            PLS  A  +D F+    E  +++   F
Sbjct: 149 APLSFFALTIDFFVSGTNEALRIYDAAF 176


>gnl|CDD|167217 PRK01356, hscB, co-chaperone HscB; Provisional.
          Length = 166

 Score = 39.1 bits (91), Expect = 0.002
 Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 6/67 (8%)

Query: 22 YETLGVPRTAS--LQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQ----LTEAYNILSDAE 75
          ++ LG+P+  +  L+ + K Y  + V++HPDK      +E+ L     L  AY+ L DA 
Sbjct: 5  FQLLGLPQEYNIDLKILEKQYFAMQVKYHPDKAKTLQEKEQNLIIASELNNAYSTLKDAL 64

Query: 76 RRKQYDL 82
          +R +Y L
Sbjct: 65 KRAEYML 71


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 39.6 bits (92), Expect = 0.003
 Identities = 21/69 (30%), Positives = 32/69 (46%), Gaps = 8/69 (11%)

Query: 16  AVSLDPYETLGVPRTASLQEIRKNYKRLVVEWHPDKNNDPTA---QEKFLQLTEAYNILS 72
            + L  Y T  +P      +I K +K+ V ++HPDK          E F  + +A  +L 
Sbjct: 48  LLGLSKYRTKAIP-----PQILKAHKKKVYKYHPDKTAAGGNKGCDEFFKLIQKAREVLG 102

Query: 73  DAERRKQYD 81
           D + R QYD
Sbjct: 103 DRKLRLQYD 111


>gnl|CDD|222986 PHA03102, PHA03102, Small T antigen; Reviewed.
          Length = 153

 Score = 37.3 bits (87), Expect = 0.006
 Identities = 29/146 (19%), Positives = 53/146 (36%), Gaps = 15/146 (10%)

Query: 23  ETLGVPRTA--SLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQY 80
           + LG+PR+A  +L  +RK Y R  +E+HPDK  D    EK  +L   Y    ++ +  + 
Sbjct: 9   DLLGLPRSAWGNLPLMRKAYLRKCLEFHPDKGGDE---EKMKELNTLYKKFRESVKSLRD 65

Query: 81  DLFGTTDGFSGQDSASRNFHNHMYNPFDDVFSEGF---NFPFEEHDISLFHKLSTTHWNF 137
                      +D  S        +  + ++ + +             +F  L   H   
Sbjct: 66  LDGEEDSSSEEEDVPSGYVGATFGDRVNALYCKDWDTCLKNLSSLCKCIFCLLRKQHRL- 124

Query: 138 EKNYIPKSYTTPHLILFYSDWCFACL 163
            K  +        L+     +C+ C 
Sbjct: 125 -KQKLRPEP----LVWGEC-YCYKCF 144


>gnl|CDD|223600 COG0526, TrxA, Thiol-disulfide isomerase and thioredoxins
           [Posttranslational modification, protein turnover,
           chaperones / Energy production and conversion].
          Length = 127

 Score = 36.8 bits (84), Expect = 0.007
 Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 3/68 (4%)

Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNE-QGLARRLGV-GSQLPQ 206
           P L+ F++ WC  C    P+ ++L +E     V    V+V +E   LA   GV    +P 
Sbjct: 34  PVLVDFWAPWCPPCRAEAPLLEELAEEY-GGDVEVVAVNVDDENPDLAAEFGVAVRSIPT 92

Query: 207 IALLTDGR 214
           + L  DG+
Sbjct: 93  LLLFKDGK 100


>gnl|CDD|238829 cd01659, TRX_superfamily, Thioredoxin (TRX) superfamily; a large,
           diverse group of proteins containing a TRX-fold. Many
           members contain a classic TRX domain with a redox active
           CXXC motif. They function as protein disulfide
           oxidoreductases (PDOs), altering the redox state of
           target proteins via the reversible oxidation of their
           active site dithiol. The PDO members of this superfamily
           include TRX, protein disulfide isomerase (PDI),
           tlpA-like, glutaredoxin, NrdH redoxin, and the bacterial
           Dsb (DsbA, DsbC, DsbG, DsbE, DsbDgamma) protein
           families. Members of the superfamily that do not
           function as PDOs but contain a TRX-fold domain include
           phosducins, peroxiredoxins and glutathione (GSH)
           peroxidases, SCO proteins, GSH transferases (GST,
           N-terminal domain), arsenic reductases, TRX-like
           ferredoxins and calsequestrin, among others.
          Length = 69

 Score = 34.2 bits (78), Expect = 0.014
 Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 6/66 (9%)

Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHV---HNEQGLARRLGVGSQLPQI 207
           L+LFY+ WC  C  + P+  +L       GV F  V V      +   +R GVG  +P +
Sbjct: 1   LVLFYAPWCPFCQALRPVLAELALLN--KGVKFEAVDVDEDPALEKELKRYGVG-GVPTL 57

Query: 208 ALLTDG 213
            +   G
Sbjct: 58  VVFGPG 63


>gnl|CDD|173347 PTZ00051, PTZ00051, thioredoxin; Provisional.
          Length = 98

 Score = 34.1 bits (78), Expect = 0.028
 Identities = 14/64 (21%), Positives = 31/64 (48%), Gaps = 3/64 (4%)

Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQIALL 210
           ++ FY++WC  C ++ P +++   E + +   F  V V     +A +  + S +P   + 
Sbjct: 22  IVDFYAEWCGPCKRIAPFYEECSKEYTKM--VFVKVDVDELSEVAEKENITS-MPTFKVF 78

Query: 211 TDGR 214
            +G 
Sbjct: 79  KNGS 82


>gnl|CDD|181812 PRK09381, trxA, thioredoxin; Provisional.
          Length = 109

 Score = 34.3 bits (78), Expect = 0.038
 Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 8/89 (8%)

Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTV---HVHNEQGLARRLGVGSQLPQI 207
           L+ F+++WC  C  + PI  ++ DE      G  TV   ++    G A + G+   +P +
Sbjct: 25  LVDFWAEWCGPCKMIAPILDEIADEYQ----GKLTVAKLNIDQNPGTAPKYGI-RGIPTL 79

Query: 208 ALLTDGRTSFFKEPSFSVQKMVEFFRLKL 236
            L  +G  +  K  + S  ++ EF    L
Sbjct: 80  LLFKNGEVAATKVGALSKGQLKEFLDANL 108


>gnl|CDD|222912 PHA02624, PHA02624, large T antigen; Provisional.
          Length = 647

 Score = 36.1 bits (84), Expect = 0.044
 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 6/68 (8%)

Query: 23 ETLGVPRTA--SLQEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRKQY 80
          + LG+P  A  +L  +RK Y R   E+HPDK  D    EK  +L   Y  L +   +   
Sbjct: 15 DLLGLPMAAWGNLPLMRKAYLRKCKEYHPDKGGDE---EKMKRLNSLYKKLQE-GVKSAR 70

Query: 81 DLFGTTDG 88
            FGT D 
Sbjct: 71 QSFGTQDS 78


>gnl|CDD|239293 cd02995, PDI_a_PDI_a'_C, PDIa family, C-terminal TRX domain (a')
           subfamily; composed of the C-terminal redox active a'
           domains of PDI, ERp72, ERp57 (or ERp60) and EFP1. PDI,
           ERp72 and ERp57 are endoplasmic reticulum (ER)-resident
           eukaryotic proteins involved in oxidative protein
           folding. They are oxidases, catalyzing the formation of
           disulfide bonds of newly synthesized polypeptides in the
           ER. They also exhibit reductase activity in acting as
           isomerases to correct any non-native disulfide bonds, as
           well as chaperone activity to prevent protein
           aggregation and facilitate the folding of newly
           synthesized proteins. PDI and ERp57 have the abb'a'
           domain structure (where a and a' are redox active TRX
           domains while b and b' are redox inactive TRX-like
           domains). PDI also contains an acidic region (c domain)
           after the a' domain that is absent in ERp57. ERp72 has
           an additional a domain at the N-terminus (a"abb'a'
           domain structure). ERp57 interacts with the lectin
           chaperones, calnexin and calreticulin, and specifically
           promotes the oxidative folding of glycoproteins, while
           PDI shows a wider substrate specificity. ERp72
           associates with several ER chaperones and folding
           factors to form complexes in the ER that bind nascent
           proteins. EFP1 is a binding partner protein of thyroid
           oxidase, which is responsible for the generation of
           hydrogen peroxide, a crucial substrate of
           thyroperoxidase, which functions to iodinate
           thyroglobulin and synthesize thyroid hormones.
          Length = 104

 Score = 33.3 bits (77), Expect = 0.070
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 151 LILFYSDWCFACLQVEPIFKKLMDELS 177
           L+ FY+ WC  C  + PI+++L ++L 
Sbjct: 22  LVEFYAPWCGHCKALAPIYEELAEKLK 48


>gnl|CDD|200074 TIGR01126, pdi_dom, protein disulfide-isomerase domain.  This model
           describes a domain of eukaryotic protein disulfide
           isomerases, generally found in two copies. The high
           cutoff for total score reflects the expectation of
           finding both copies. The domain is similar to
           thioredoxin but the redox-active disulfide region motif
           is APWCGHCK [Protein fate, Protein folding and
           stabilization].
          Length = 102

 Score = 33.0 bits (76), Expect = 0.077
 Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 6/86 (6%)

Query: 151 LILFYSDWCFACLQVEPIFKKLMDELS---PLGVGFFTVHVHNEQGLARRLGVGSQLPQI 207
           L+ FY+ WC  C  + P ++KL  EL     + +    V    E+ LA R GV S  P I
Sbjct: 17  LVEFYAPWCGHCKNLAPEYEKLAKELKKDPDIVLA--KVDATAEKDLASRFGV-SGFPTI 73

Query: 208 ALLTDGRTSFFKEPSFSVQKMVEFFR 233
                G+     E    ++ +VEF  
Sbjct: 74  KFFPKGKKPVDYEGGRDLEAIVEFVN 99


>gnl|CDD|239300 cd03002, PDI_a_MPD1_like, PDI family, MPD1-like subfamily; composed
           of eukaryotic proteins similar to Saccharomyces
           cerevisiae MPD1 protein, which contains a single redox
           active TRX domain located at the N-terminus, and an ER
           retention signal at the C-terminus indicative of an
           ER-resident protein. MPD1 has been shown to suppress the
           maturation defect of carboxypeptidase Y caused by
           deletion of the yeast PDI1 gene. Other characterized
           members of this subfamily include the Aspergillus niger
           prpA protein and Giardia PDI-1. PrpA is non-essential to
           strain viability, however, its transcript level is
           induced by heterologous protein expression suggesting a
           possible role in oxidative protein folding during high
           protein production. Giardia PDI-1 has the ability to
           refold scrambled RNase and exhibits transglutaminase
           activity.
          Length = 109

 Score = 33.1 bits (76), Expect = 0.091
 Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 2/44 (4%)

Query: 136 NFEKNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSPL 179
           NF+K     +YTT  L+ FY+ WC  C  ++P + K   EL  L
Sbjct: 9   NFDKVVHNTNYTT--LVEFYAPWCGHCKNLKPEYAKAAKELDGL 50


>gnl|CDD|182889 PRK10996, PRK10996, thioredoxin 2; Provisional.
          Length = 139

 Score = 33.5 bits (77), Expect = 0.094
 Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 2/67 (2%)

Query: 148 TPHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQI 207
            P +I F++ WC  C    PIF+ +  E S   V F  V+   E+ L+ R  + S +P I
Sbjct: 53  LPVVIDFWAPWCGPCRNFAPIFEDVAAERSG-KVRFVKVNTEAERELSARFRIRS-IPTI 110

Query: 208 ALLTDGR 214
            +  +G+
Sbjct: 111 MIFKNGQ 117


>gnl|CDD|239298 cd03000, PDI_a_TMX3, PDIa family, TMX3 subfamily; composed of
           eukaryotic proteins similar to human TMX3, a TRX related
           transmembrane protein containing one redox active TRX
           domain at the N-terminus and a classical ER retrieval
           sequence for type I transmembrane proteins at the
           C-terminus. The TMX3 transcript is found in a variety of
           tissues with the highest levels detected in skeletal
           muscle and the heart. In vitro, TMX3 showed oxidase
           activity albeit slightly lower than that of protein
           disulfide isomerase.
          Length = 104

 Score = 32.0 bits (73), Expect = 0.21
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLG 180
           L+ FY+ WC  C ++EP++ ++  EL   G
Sbjct: 19  LVDFYAPWCGHCKKLEPVWNEVGAELKSSG 48


>gnl|CDD|240266 PTZ00102, PTZ00102, disulphide isomerase; Provisional.
          Length = 477

 Score = 33.6 bits (77), Expect = 0.25
 Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 2/52 (3%)

Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLG--VGFFTVHVHNEQGLARRLGV 200
           L+ FY+ WC  C ++ P +KK    L      +   +V    E  LA+  GV
Sbjct: 53  LVKFYAPWCGHCKRLAPEYKKAAKMLKEKKSEIVLASVDATEEMELAQEFGV 104


>gnl|CDD|166894 PRK00294, hscB, co-chaperone HscB; Provisional.
          Length = 173

 Score = 32.5 bits (74), Expect = 0.30
 Identities = 19/56 (33%), Positives = 25/56 (44%), Gaps = 6/56 (10%)

Query: 33 LQEIRKNYKRLVVEWHPDKNNDPT------AQEKFLQLTEAYNILSDAERRKQYDL 82
          L ++   Y+ L  E HPD+  D        A E+   L EAY  L    RR +Y L
Sbjct: 20 LDQLATRYRELAREVHPDRFADAPEREQRLALERSASLNEAYQTLKSPPRRARYLL 75


>gnl|CDD|239290 cd02992, PDI_a_QSOX, PDIa family, Quiescin-sulfhydryl oxidase
           (QSOX) subfamily; QSOX is a eukaryotic protein
           containing an N-terminal redox active TRX domain,
           similar to that of PDI, and a small C-terminal flavin
           adenine dinucleotide (FAD)-binding domain homologous to
           the yeast ERV1p protein. QSOX oxidizes thiol groups to
           disulfides like PDI, however, unlike PDI, this oxidation
           is accompanied by the reduction of oxygen to hydrogen
           peroxide. QSOX is localized in high concentrations in
           cells with heavy secretory load and prefers peptides and
           proteins as substrates, not monothiols like glutathione.
           Inside the cell, QSOX is found in the endoplasmic
           reticulum and Golgi. The flow of reducing equivalents in
           a QSOX-catalyzed reaction goes from the dithiol
           substrate -> dithiol of the QSOX TRX domain -> dithiols
           of the QSOX ERV1p domain -> FAD -> oxygen.
          Length = 114

 Score = 31.5 bits (72), Expect = 0.35
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 150 HLILFYSDWCFACLQVEPIFKKL 172
            L+ FY+ WC  C    P +KKL
Sbjct: 22  WLVEFYASWCGHCRAFAPTWKKL 44


>gnl|CDD|233282 TIGR01130, ER_PDI_fam, protein disulfide isomerase, eukaryotic.
           This model represents eukaryotic protein disulfide
           isomerases retained in the endoplasmic reticulum (ER)
           and closely related forms. Some members have been
           assigned alternative or additional functions such as
           prolyl 4-hydroxylase and
           dolichyl-diphosphooligosaccharide-protein
           glycotransferase. Members of this family have at least
           two protein-disulfide domains, each similar to
           thioredoxin but with the redox-active disulfide in the
           motif PWCGHCK, and an ER retention signal at the extreme
           C-terminus (KDEL, HDEL, and similar motifs).
          Length = 462

 Score = 33.1 bits (76), Expect = 0.39
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 151 LILFYSDWCFACLQVEPIFKKLMDELS 177
           L+ FY+ WC  C  + PI+++L ++  
Sbjct: 368 LVEFYAPWCGHCKNLAPIYEELAEKYK 394


>gnl|CDD|211601 TIGR00714, hscB, Fe-S protein assembly co-chaperone HscB.  This
          model describes the small subunit, Hsc20 (20K heat
          shock cognate protein) of a pair of proteins
          Hsc66-Hsc20, related to the DnaK-DnaJ heat shock
          proteins, which also serve as molecular chaperones.
          Hsc20, unlike DnaJ, appears not to have chaperone
          activity on its own, but to act solely as a regulatory
          subunit for Hsc66 (i.e., to be a co-chaperone). The
          gene for Hsc20 in E. coli, hscB, is not induced by heat
          shock [Protein fate, Protein folding and
          stabilization].
          Length = 155

 Score = 31.8 bits (72), Expect = 0.46
 Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 8/55 (14%)

Query: 34 QEIRKNYKRLVVEWHPDKNNDPTAQEKFL------QLTEAYNILSDAERRKQYDL 82
            +RK Y++L  ++HPD +    AQE+         L +AY+ L D  RR +Y L
Sbjct: 6  SRLRKRYRQLQAQYHPDASG--MAQEQLAASQQSTTLNQAYHTLKDPLRRAEYML 58


>gnl|CDD|239291 cd02993, PDI_a_APS_reductase, PDIa family, 5'-Adenylylsulfate (APS)
           reductase subfamily; composed of plant-type APS
           reductases containing a C-terminal redox active TRX
           domain and an N-terminal reductase domain which is part
           of a superfamily that includes N type ATP PPases. APS
           reductase catalyzes the reduction of activated sulfate
           to sulfite, a key step in the biosynthesis of
           sulfur-containing metabolites. Sulfate is first
           activated by ATP sulfurylase, forming APS, which can be
           phosphorylated to 3'-phosphoadenosine-5'-phosphosulfate
           (PAPS). Depending on the organism, either APS or PAPS
           can be used for sulfate reduction. Prokaryotes and fungi
           use PAPS, whereas plants use both APS and PAPS. Since
           plant-type APS reductase uses glutathione (GSH) as its
           electron donor, the C-terminal domain may function like
           glutaredoxin, a GSH-dependent member of the TRX
           superfamily. The flow of reducing equivalents goes from
           GSH -> C-terminal TRX domain -> N-terminal reductase
           domain -> APS. Plant-type APS reductase shows no
           homology to that of dissimilatory sulfate-reducing
           bacteria, which is an iron-sulfur flavoenzyme. Also
           included in the alignment is EYE2 from Chlamydomonas
           reinhardtii, a protein required for eyespot assembly.
          Length = 109

 Score = 30.5 bits (69), Expect = 0.61
 Identities = 17/75 (22%), Positives = 29/75 (38%), Gaps = 1/75 (1%)

Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQG-LARRLGVGSQLPQI 207
             L++ Y+ WC  C  +E  +++L ++L+   V     +   EQ   A+        P I
Sbjct: 23  STLVVLYAPWCPFCQAMEASYEELAEKLAGSNVKVAKFNADGEQREFAKEELQLKSFPTI 82

Query: 208 ALLTDGRTSFFKEPS 222
                      K PS
Sbjct: 83  LFFPKNSRQPIKYPS 97


>gnl|CDD|239250 cd02952, TRP14_like, Human TRX-related protein 14 (TRP14)-like
           family; composed of proteins similar to TRP14, a 14kD
           cytosolic protein that shows disulfide reductase
           activity in vitro with a different substrate specificity
           compared with another human cytosolic protein, TRX1.
           TRP14 catalyzes the reduction of small
           disulfide-containing peptides but does not reduce
           disulfides of ribonucleotide reductase, peroxiredoxin
           and methionine sulfoxide reductase, which are TRX1
           substrates. TRP14 also plays a role in tumor necrosis
           factor (TNF)-alpha signaling pathways, distinct from
           that of TRX1. Its depletion promoted TNF-alpha induced
           activation of c-Jun N-terminal kinase and
           mitogen-activated protein kinases.
          Length = 119

 Score = 30.8 bits (70), Expect = 0.66
 Identities = 13/48 (27%), Positives = 20/48 (41%), Gaps = 8/48 (16%)

Query: 152 ILFYSD-------WCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQ 192
           ILFY D       WC  C++ EP+ ++ +    P    F    V +  
Sbjct: 26  ILFYGDKDPDGQSWCPDCVKAEPVVREALKAA-PEDCVFIYCDVGDRP 72


>gnl|CDD|240265 PTZ00100, PTZ00100, DnaJ chaperone protein; Provisional.
          Length = 116

 Score = 29.8 bits (67), Expect = 1.5
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 22 YETLGVPRTASLQEIRKNYKRLVVEWHPD 50
          Y+ L +  TAS + IR+ +K+L++  HPD
Sbjct: 68 YKILNISPTASKERIREAHKQLMLRNHPD 96


>gnl|CDD|239303 cd03005, PDI_a_ERp46, PDIa family, endoplasmic reticulum protein 46
           (ERp46) subfamily; ERp46 is an ER-resident protein
           containing three redox active TRX domains. Yeast
           complementation studies show that ERp46 can substitute
           for protein disulfide isomerase (PDI) function in vivo.
           It has been detected in many tissues, however,
           transcript and protein levels do not correlate in all
           tissues, suggesting regulation at a posttranscriptional
           level. An identical protein, named endoPDI, has been
           identified as an endothelial PDI that is highly
           expressed in the endothelium of tumors and hypoxic
           lesions. It has a protective effect on cells exposed to
           hypoxia.
          Length = 102

 Score = 29.2 bits (66), Expect = 1.6
 Identities = 7/30 (23%), Positives = 17/30 (56%)

Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSP 178
            H + F++ WC  C ++ P +++L  + + 
Sbjct: 18  NHFVKFFAPWCGHCKRLAPTWEQLAKKFNN 47


>gnl|CDD|239295 cd02997, PDI_a_PDIR, PDIa family, PDIR subfamily; composed of
           proteins similar to human PDIR (for Protein Disulfide
           Isomerase Related). PDIR is composed of three redox
           active TRX (a) domains and an N-terminal redox inactive
           TRX-like (b) domain. Similar to PDI, it is involved in
           oxidative protein folding in the endoplasmic reticulum
           (ER) through its isomerase and chaperone activities.
           These activities are lower compared to PDI, probably due
           to PDIR acting only on a subset of proteins. PDIR is
           preferentially expressed in cells actively secreting
           proteins and its expression is induced by stress.
           Similar to PDI, the isomerase and chaperone activities
           of PDIR are independent; CXXC mutants lacking isomerase
           activity retain chaperone activity.
          Length = 104

 Score = 28.8 bits (65), Expect = 2.2
 Identities = 20/86 (23%), Positives = 32/86 (37%), Gaps = 4/86 (4%)

Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFT---VHVHNEQGLARRLGVGSQLP 205
             L++FY+ WC  C +++P F K   EL   G G              L     V    P
Sbjct: 19  HVLVMFYAPWCGHCKKMKPEFTKAATELKEDGKGVLAAVDCTKPEHDALKEEYNV-KGFP 77

Query: 206 QIALLTDGRTSFFKEPSFSVQKMVEF 231
                 +G+     E   + + ++EF
Sbjct: 78  TFKYFENGKFVEKYEGERTAEDIIEF 103


>gnl|CDD|222448 pfam13905, Thioredoxin_8, Thioredoxin-like.  Thioredoxins are small
           enzymes that participate in redox reactions, via the
           reversible oxidation of an active centre disulfide bond.
          Length = 94

 Score = 28.4 bits (64), Expect = 2.7
 Identities = 11/42 (26%), Positives = 20/42 (47%)

Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQ 192
           L+ F++ WC  C    P  K+L ++L    V    V +  ++
Sbjct: 5   LLYFWASWCPPCRAFTPELKELYEKLKKPKVEIVYVSLDRDE 46


>gnl|CDD|239296 cd02998, PDI_a_ERp38, PDIa family, endoplasmic reticulum protein 38
           (ERp38) subfamily; composed of proteins similar to the
           P5-like protein first isolated from alfalfa, which
           contains two redox active TRX (a) domains at the
           N-terminus, like human P5, and a C-terminal domain with
           homology to the C-terminal domain of ERp29, unlike human
           P5. The cDNA clone of this protein (named G1) was
           isolated from an alfalfa cDNA library by screening with
           human protein disulfide isomerase (PDI) cDNA. The G1
           protein is constitutively expressed in all major organs
           of the plant and its expression is induced by treatment
           with tunicamycin, indicating that it may be a
           glucose-regulated protein. The G1 homolog in the
           eukaryotic social amoeba Dictyostelium discoideum is
           also described as a P5-like protein, which is located in
           the endoplasmic reticulum (ER) despite the absence of an
           ER-retrieval signal. G1 homologs from Aspergillus niger
           and Neurospora crassa have also been characterized, and
           are named TIGA and ERp38, respectively. Also included in
           the alignment is an atypical PDI from Leishmania
           donovani containing a single a domain, and the
           C-terminal a domain of a P5-like protein from Entamoeba
           histolytica.
          Length = 105

 Score = 28.8 bits (65), Expect = 2.8
 Identities = 14/41 (34%), Positives = 20/41 (48%), Gaps = 4/41 (9%)

Query: 133 THWNFEKNYI-PKSYTTPHLILFYSDWCFACLQVEPIFKKL 172
           T  NF+K     K      L+ FY+ WC  C  + P ++KL
Sbjct: 6   TDSNFDKVVGDDKKDV---LVEFYAPWCGHCKNLAPEYEKL 43


>gnl|CDD|239299 cd03001, PDI_a_P5, PDIa family, P5 subfamily; composed of
           eukaryotic proteins similar to human P5, a PDI-related
           protein with a domain structure of aa'b (where a and a'
           are redox active TRX domains and b is a redox inactive
           TRX-like domain). Like PDI, P5 is located in the
           endoplasmic reticulum (ER) and displays both isomerase
           and chaperone activities, which are independent of each
           other. Compared to PDI, the isomerase and chaperone
           activities of P5 are lower. The first cysteine in the
           CXXC motif of both redox active domains in P5 is
           necessary for isomerase activity. The P5 gene was first
           isolated as an amplified gene from a
           hydroxyurea-resistant hamster cell line. The zebrafish
           P5 homolog has been implicated to play a critical role
           in establishing left/right asymmetries in the embryonic
           midline. Some members of this subfamily are P5-like
           proteins containing only one redox active TRX domain.
          Length = 103

 Score = 28.4 bits (64), Expect = 2.9
 Identities = 18/53 (33%), Positives = 25/53 (47%), Gaps = 7/53 (13%)

Query: 151 LILFYSDWCFACLQVEPIFKKLMDELS---PLGVGFFTVHVHNEQGLARRLGV 200
           L+ FY+ WC  C  + P +KK    L     +G     VH    Q LA++ GV
Sbjct: 22  LVEFYAPWCGHCKNLAPEWKKAAKALKGIVKVGAVDADVH----QSLAQQYGV 70


>gnl|CDD|221921 pfam13098, Thioredoxin_2, Thioredoxin-like domain. 
          Length = 105

 Score = 28.5 bits (64), Expect = 2.9
 Identities = 21/83 (25%), Positives = 32/83 (38%), Gaps = 17/83 (20%)

Query: 149 PHLILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVH---------------NEQG 193
           P L++F    C  C ++     K  D    L   F  ++V+               +E+ 
Sbjct: 7   PVLVVFTDPDCPYCKKLHKELLKDPDVQEYLKDNFVVIYVNVDDSKEVTDFDGETLSEKE 66

Query: 194 LARRLGVGSQLPQIALLT-DGRT 215
           LAR+ GV    P I  L  DG+ 
Sbjct: 67  LARKYGVRGT-PTIVFLDGDGKE 88


>gnl|CDD|239248 cd02950, TxlA, TRX-like protein A (TxlA) family; TxlA was
           originally isolated from the cyanobacterium
           Synechococcus. It is found only in oxygenic
           photosynthetic organisms. TRX is a small enzyme that
           participate in redox reactions, via the reversible
           oxidation of an active site dithiol present in a CXXC
           motif. Disruption of the txlA gene suggests that the
           protein is involved in the redox regulation  of the
           structure and function of photosynthetic apparatus. The
           plant homolog (designated as HCF164) is localized in the
           chloroplast and is involved in the assembly of the
           cytochrome b6f complex, which takes a central position
           in photosynthetic electron transport.
          Length = 142

 Score = 28.8 bits (65), Expect = 3.4
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 149 PHLILFYSDWCFACLQVEPIFKKL 172
           P L+ FY+DWC  C ++ P   KL
Sbjct: 22  PTLVEFYADWCTVCQEMAPDVAKL 45


>gnl|CDD|224081 COG1159, Era, GTPase [General function prediction only].
          Length = 298

 Score = 29.9 bits (68), Expect = 3.4
 Identities = 14/40 (35%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 224 SVQKMVEFFRLKLPYKLIVPLSATNVDAFLDNWREDNKVH 263
            + K++ F +  LP+K IVP+SA   D  +D   E  K +
Sbjct: 132 VLLKLIAFLKKLLPFKEIVPISALKGDN-VDTLLEIIKEY 170


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 30.1 bits (68), Expect = 3.7
 Identities = 14/79 (17%), Positives = 26/79 (32%), Gaps = 10/79 (12%)

Query: 16  AVSLDPYETLGVPRTASLQEIRKNYKRLV------VEWHPDKNNDPTAQEKFL----QLT 65
            ++L P   L +   A    + KNY + +      +E  PD         + L    +  
Sbjct: 627 LLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTE 686

Query: 66  EAYNILSDAERRKQYDLFG 84
            A  I    +++      G
Sbjct: 687 SAKKIAKSLQKQHPKAALG 705


>gnl|CDD|219888 pfam08534, Redoxin, Redoxin.  This family of redoxins includes
           peroxiredoxin, thioredoxin and glutaredoxin proteins.
          Length = 142

 Score = 28.8 bits (65), Expect = 3.8
 Identities = 14/72 (19%), Positives = 25/72 (34%), Gaps = 2/72 (2%)

Query: 149 PHLILFY-SDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQI 207
             ++ F+   +C  C    P  +KL       GV    V+  N+           +  + 
Sbjct: 29  KVVLNFWPGAFCPTCSAEHPYLEKLSKLYKAKGVDVVAVNASNDPFFVMN-FWAKEGLKY 87

Query: 208 ALLTDGRTSFFK 219
            +L D   +F K
Sbjct: 88  PVLADRDGAFTK 99


>gnl|CDD|221069 pfam11304, DUF3106, Protein of unknown function (DUF3106).  Some
           members in this family of proteins are annotated as
           transmembrane proteins however this cannot be confirmed.
           Currently no function is known.
          Length = 107

 Score = 28.1 bits (63), Expect = 4.4
 Identities = 13/45 (28%), Positives = 19/45 (42%), Gaps = 7/45 (15%)

Query: 34  QEIRKNYKRLVVEWHPDKNNDPTAQEKFLQLTEAYNILSDAERRK 78
           ++ R NY+R        +   P  +E      EAY  LS  ER+ 
Sbjct: 65  EQARLNYQRF-------RQLPPEQKEALRARWEAYQALSPEERQA 102


>gnl|CDD|239246 cd02948, TRX_NDPK, TRX domain, TRX and NDP-kinase (NDPK) fusion
           protein family; most members of this group are fusion
           proteins which contain one redox active TRX domain
           containing a CXXC motif and three NDPK domains, and are
           characterized as intermediate chains (ICs) of axonemal
           outer arm dynein. Dyneins are molecular motors that
           generate force against microtubules to produce cellular
           movement, and are divided into two classes: axonemal and
           cytoplasmic. They are supramolecular complexes
           consisting of three protein groups classified according
           to size: dynein heavy, intermediate and light chains.
           Axonemal dyneins form two structures, the inner and
           outer arms, which are attached to doublet microtubules
           throughout the cilia and flagella. The human homolog is
           the sperm-specific Sptrx-2, presumed to be a  component
           of the human sperm axoneme architecture. Included in
           this group is another human protein, TRX-like protein 2,
           a smaller fusion protein containing one TRX and one NDPK
           domain, which is also associated with microtubular
           structures. The other members of this group are
           hypothetical insect proteins containing a TRX domain and
           outer arm dynein light chains (14 and 16kDa) of
           Chlamydomonas reinhardtii. Using standard assays, the
           fusion proteins have shown no TRX enzymatic activity.
          Length = 102

 Score = 27.7 bits (62), Expect = 5.2
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 155 YSDWCFACLQVEPIFKKLMDELSPLGVGFFTV 186
           Y +WC  C  V  +FKK+ +EL    + F T 
Sbjct: 25  YQEWCGPCKAVVSLFKKIKNELGDDLLHFATA 56


>gnl|CDD|239302 cd03004, PDI_a_ERdj5_C, PDIa family, C-terminal ERdj5 subfamily;
           ERdj5, also known as  JPDI and macrothioredoxin, is a
           protein containing an N-terminal DnaJ domain and four
           redox active TRX domains. This subfamily is composed of
           the three TRX domains located at the C-terminal half of
           the protein. ERdj5 is a ubiquitous protein localized in
           the endoplasmic reticulum (ER) and is abundant in
           secretory cells. It's transcription is induced during ER
           stress. It interacts with BiP through its DnaJ domain in
           an ATP-dependent manner. BiP, an ER-resident member of
           the Hsp70 chaperone family, functions in ER-associated
           degradation and protein translocation. Also included in
           the alignment is the single complete TRX domain of an
           uncharacterized protein from Tetraodon nigroviridis,
           which also contains a DnaJ domain at its N-terminus.
          Length = 104

 Score = 27.6 bits (62), Expect = 6.7
 Identities = 15/74 (20%), Positives = 28/74 (37%), Gaps = 2/74 (2%)

Query: 151 LILFYSDWCFACLQVEPIFKKLMDELSPLGVGFFTVHVHNEQGLARRLGVGSQLPQIALL 210
           L+ FY+ WC  C  + P  +K    L    V   +V     + L ++  + +  P I L 
Sbjct: 23  LVDFYAPWCGPCQALLPELRKAARALKGK-VKVGSVDCQKYESLCQQANIRA-YPTIRLY 80

Query: 211 TDGRTSFFKEPSFS 224
               + +     + 
Sbjct: 81  PGNASKYHSYNGWH 94


>gnl|CDD|239282 cd02984, TRX_PICOT, TRX domain, PICOT (for PKC-interacting cousin
           of TRX) subfamily; PICOT is a protein that interacts
           with protein kinase C (PKC) theta, a calcium independent
           PKC isoform selectively expressed in skeletal muscle and
           T lymphocytes. PICOT contains an N-terminal TRX-like
           domain, which does not contain the catalytic CXXC motif,
           followed by one to three glutaredoxin domains. The
           TRX-like domain is required for interaction with PKC
           theta. PICOT inhibits the activation of c-Jun N-terminal
           kinase and the transcription factors, AP-1 and NF-kB,
           induced by PKC theta or T-cell activating stimuli.
          Length = 97

 Score = 27.2 bits (61), Expect = 7.6
 Identities = 8/40 (20%), Positives = 19/40 (47%)

Query: 139 KNYIPKSYTTPHLILFYSDWCFACLQVEPIFKKLMDELSP 178
           +  +    +   ++ F++ W   C Q+  +F++L  E  P
Sbjct: 6   EELLKSDASKLLVLHFWAPWAEPCKQMNQVFEELAKEAFP 45


>gnl|CDD|216185 pfam00909, Ammonium_transp, Ammonium Transporter Family. 
          Length = 399

 Score = 28.8 bits (65), Expect = 8.0
 Identities = 12/42 (28%), Positives = 20/42 (47%), Gaps = 3/42 (7%)

Query: 516 QSLSRQHILPAVSLIFTVIIIVVLAMIMNHYMKL---EEEEI 554
           + L  Q I  AV L +   +  +L +++   + L   EEEE 
Sbjct: 345 KQLGVQLIGIAVILAYAFGVTFILGLLLGLTLGLRVSEEEEE 386


>gnl|CDD|129372 TIGR00271, TIGR00271, uncharacterized hydrophobic domain.  This
           domain is in a family of archaeal proteins that includes
           AF0785 of Archaeoglobus fulgidus and in several
           eubacterial proteins, including the much longer protein
           sll1151 from Synechocystis PCC6803.
          Length = 175

 Score = 28.1 bits (63), Expect = 8.7
 Identities = 7/27 (25%), Positives = 14/27 (51%)

Query: 526 AVSLIFTVIIIVVLAMIMNHYMKLEEE 552
           A +L+  +  ++V AMIM  +   +  
Sbjct: 62  AKNLLLAMAFVIVFAMIMGLFHPSKLT 88


>gnl|CDD|222416 pfam13848, Thioredoxin_6, Thioredoxin-like domain. 
          Length = 183

 Score = 28.1 bits (63), Expect = 9.7
 Identities = 14/52 (26%), Positives = 24/52 (46%), Gaps = 3/52 (5%)

Query: 183 FFTVHVHNEQGLARRLGV-GSQLPQIALL--TDGRTSFFKEPSFSVQKMVEF 231
           F TV   +   +   LG+  + LP I ++     +  +F E  FS + + EF
Sbjct: 129 FATVDGKSFGRVLEYLGLSSADLPVIVIVDSASHKKKYFPEDEFSPESLKEF 180


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.138    0.416 

Gapped
Lambda     K      H
   0.267   0.0769    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 29,095,126
Number of extensions: 2911702
Number of successful extensions: 2894
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2868
Number of HSP's successfully gapped: 105
Length of query: 554
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 452
Effective length of database: 6,413,494
Effective search space: 2898899288
Effective search space used: 2898899288
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 62 (27.6 bits)