RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= 042284
         (430 letters)



>gnl|CDD|215175 PLN02309, PLN02309, 5'-adenylylsulfate reductase.
          Length = 457

 Score =  776 bits (2006), Expect = 0.0
 Identities = 297/384 (77%), Positives = 339/384 (88%), Gaps = 10/384 (2%)

Query: 47  EDYEKLARGMESASPLEIMDKAFQKFGNDIAIAFSGAEDVVLIEYAKLTGRPFRVFSLDT 106
           ED+EKLA+ +E+ASPLEIMDKA +KFGNDIAIAFSGAEDV LIEYA LTGRPFRVFSLDT
Sbjct: 84  EDFEKLAKELENASPLEIMDKALEKFGNDIAIAFSGAEDVALIEYAHLTGRPFRVFSLDT 143

Query: 107 GRLNPETHQFFDTVEKHYGIRIEYTFPNAVEVQALVRTKGLFSFYEDGHQECCRIRKVRP 166
           GRLNPET++ FD VEKHYGIRIEY FP+AVEVQALVR KGLFSFYEDGHQECCR+RKVRP
Sbjct: 144 GRLNPETYRLFDAVEKHYGIRIEYMFPDAVEVQALVRNKGLFSFYEDGHQECCRVRKVRP 203

Query: 167 LKRALKGLRAWITGQRKDQSPGTRAEIPVVQIDTSFEGIDGGKGSLVKWNPLANVKGQDI 226
           L+RALKGLRAWITGQRKDQSPGTRAE+PVVQ+D  FEG+DGG GSLVKWNPLANV G ++
Sbjct: 204 LRRALKGLRAWITGQRKDQSPGTRAEVPVVQVDPVFEGLDGGPGSLVKWNPLANVTGNEV 263

Query: 227 WNFLRAMNIPINSLHSQGYISIGCEPCTRPVLPGQHEREGRWWWEDAKAKECGLHNGNIK 286
           WNFLR M++P+NSLH+QGY+SIGCEPCTRPVLPGQHEREGRWWWEDAKAKECGLH GNIK
Sbjct: 264 WNFLRTMDVPVNSLHAQGYVSIGCEPCTRPVLPGQHEREGRWWWEDAKAKECGLHKGNIK 323

Query: 287 QEELSQHININGNGVAQHTNGSAPASDLFNSQKLVSFRRTGIENLARLQNREDPWLIVLY 346
           +E+            A + NG+A  +D+FNSQ +V+  R GIENL +L+NR++PWL+VLY
Sbjct: 324 EED----------NGAANDNGNAAVADIFNSQNVVALSRAGIENLLKLENRKEPWLVVLY 373

Query: 347 APWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQKLQLVSFPTILFFPKHS 406
           APWC FCQAME SY ELAE+L G GVKV KFRADGD KEFAKQ+LQL SFPTIL FPK+S
Sbjct: 374 APWCPFCQAMEASYEELAEKLAGSGVKVAKFRADGDQKEFAKQELQLGSFPTILLFPKNS 433

Query: 407 AKPVKYPSEKRDVDSLMAFVNALR 430
           ++P+KYPSEKRDVDSL++FVN+LR
Sbjct: 434 SRPIKYPSEKRDVDSLLSFVNSLR 457


>gnl|CDD|232970 TIGR00424, APS_reduc, 5'-adenylylsulfate reductase,
           thioredoxin-independent.  This enzyme, involved in the
           assimilation of inorganic sulfate, is closely related to
           the thioredoxin-dependent PAPS reductase of Bacteria
           (CysH) and Saccharomyces cerevisiae. However, it has its
           own C-terminal thioredoxin-like domain and is not
           thioredoxin-dependent. Also, it has a substrate
           preference for 5'-adenylylsulfate (APS) over
           3'-phosphoadenylylsulfate (PAPS) so the pathway does not
           require an APS kinase (CysC) to convert APS to PAPS.
           Arabidopsis thaliana appears to have three isozymes, all
           able to complement E. coli CysH mutants (even in
           backgrounds lacking thioredoxin or APS kinase) but
           likely localized to different compartments in
           Arabidopsis [Central intermediary metabolism, Sulfur
           metabolism].
          Length = 463

 Score =  697 bits (1799), Expect = 0.0
 Identities = 297/390 (76%), Positives = 342/390 (87%), Gaps = 9/390 (2%)

Query: 41  ESTNDHEDYEKLARGMESASPLEIMDKAFQKFGNDIAIAFSGAEDVVLIEYAKLTGRPFR 100
           E   + ED+EKLA+ +E+ASPLEIMDKA +KFGNDIAIAFSGAEDV LIEYA LTGRPFR
Sbjct: 83  EKVVEVEDFEKLAKKLENASPLEIMDKALEKFGNDIAIAFSGAEDVALIEYAHLTGRPFR 142

Query: 101 VFSLDTGRLNPETHQFFDTVEKHYGIRIEYTFPNAVEVQALVRTKGLFSFYEDGHQECCR 160
           VFSLDTGRLNPET++FFD VEK YGIRIEY FP+AVEVQALVR+KGLFSFYEDGHQECCR
Sbjct: 143 VFSLDTGRLNPETYRFFDAVEKQYGIRIEYMFPDAVEVQALVRSKGLFSFYEDGHQECCR 202

Query: 161 IRKVRPLKRALKGLRAWITGQRKDQSPGTRAEIPVVQIDTSFEGIDGGKGSLVKWNPLAN 220
           +RKVRPL+RALKGL+AWITGQRKDQSPGTR+EIPVVQ+D  FEG+DGG GSLVKWNP+AN
Sbjct: 203 VRKVRPLRRALKGLKAWITGQRKDQSPGTRSEIPVVQVDPVFEGLDGGVGSLVKWNPVAN 262

Query: 221 VKGQDIWNFLRAMNIPINSLHSQGYISIGCEPCTRPVLPGQHEREGRWWWEDAKAKECGL 280
           V+G+D+WNFLR M++P+N+LH+QGY+SIGCEPCTRPVLPGQHEREGRWWWEDAKAKECGL
Sbjct: 263 VEGKDVWNFLRTMDVPVNTLHAQGYVSIGCEPCTRPVLPGQHEREGRWWWEDAKAKECGL 322

Query: 281 HNGNIKQEELSQHININGNGVAQHTNGSAPASDLFNSQKLVSFRRTGIENLARLQNREDP 340
           H GNIK+E          +G A + NGS   +D+F+S  +VS  R GIENL +L+ R++ 
Sbjct: 323 HKGNIKEE--------TLDG-AVNGNGSDAVADIFDSNNVVSLSRPGIENLLKLEERKEA 373

Query: 341 WLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQKLQLVSFPTIL 400
           WL+VLYAPWC FCQAME SY+ELAE+L G GVKV KFRADGD KEFAKQ+LQL SFPTIL
Sbjct: 374 WLVVLYAPWCPFCQAMEASYLELAEKLAGSGVKVAKFRADGDQKEFAKQELQLGSFPTIL 433

Query: 401 FFPKHSAKPVKYPSEKRDVDSLMAFVNALR 430
           FFPKHS++P+KYPSEKRDVDSLM+FVN LR
Sbjct: 434 FFPKHSSRPIKYPSEKRDVDSLMSFVNLLR 463


>gnl|CDD|234997 PRK02090, PRK02090, phosphoadenosine phosphosulfate reductase;
           Provisional.
          Length = 241

 Score =  281 bits (722), Expect = 1e-93
 Identities = 105/242 (43%), Positives = 138/242 (57%), Gaps = 14/242 (5%)

Query: 46  HEDYEKLARGMESASPLEIMDKAFQKFGNDIAIAFS-GAEDVVLIEYAKLTGRPFRVFSL 104
             D  +L   +E AS  E +  A + FG  +A+  S GAED VL+           V  L
Sbjct: 13  ALDLAELNAELEGASAQERLAWALENFGGRLALVSSFGAEDAVLLHLVAQVDPDIPVIFL 72

Query: 105 DTGRLNPETHQFFDTVEKHYGIRIEYTFPNAVEVQALVRTKGLFSFYEDGHQECCRIRKV 164
           DTG L PET++F D + +   + ++   P+A   +   R  GL+    +   ECCRIRKV
Sbjct: 73  DTGYLFPETYRFIDELTERLLLNLKVYRPDASAAEQEARYGGLWEQSVEDRDECCRIRKV 132

Query: 165 RPLKRALKGLRAWITGQRKDQSPGTRAEIPVVQIDTSFEGIDGGKGSLVKWNPLANVKGQ 224
            PL RAL GL AWITG R++QS  TRA +PV++ID          G   K NPLA+   +
Sbjct: 133 EPLNRALAGLDAWITGLRREQSG-TRANLPVLEID----------GGRFKINPLADWTNE 181

Query: 225 DIWNFLRAMNIPINSLHSQGYISIGCEPCTRPVLPGQHEREGRWWWEDAKAKECGLHNGN 284
           D+W +L+  ++P + L  QGY SIGCEPCTRPV PG+ ER GR WW   K KECGLH GN
Sbjct: 182 DVWAYLKEHDLPYHPLVDQGYPSIGCEPCTRPVEPGEDERAGR-WWGGLK-KECGLHEGN 239

Query: 285 IK 286
           + 
Sbjct: 240 LP 241


>gnl|CDD|233701 TIGR02055, APS_reductase, thioredoxin-dependent adenylylsulfate APS
           reductase.  This model describes recently identified
           adenosine 5'-phosphosulfate (APS) reductase activity
           found in sulfate-assimilatory prokaryotes, thus
           separating it from the traditionally described
           phosphoadenosine 5'-phosphosulfate (PAPS) reductases
           found in bacteria and fungi. Homologous to PAPS
           reductase in enterobacteria, cyanobacteria, and yeast,
           APS reductase here clusters with, and demonstrates
           greater homology to plant APS reductase. Additionally,
           the presence of two conserved C-terminal motifs
           (CCXXRKXXPL & SXGCXXCT) distinguishes APS substrate
           specificity and serves as a FeS cluster [Central
           intermediary metabolism, Sulfur metabolism].
          Length = 191

 Score =  261 bits (669), Expect = 1e-86
 Identities = 101/200 (50%), Positives = 125/200 (62%), Gaps = 9/200 (4%)

Query: 81  SGAEDVVLIEYAKLTGRPFRVFSLDTGRLNPETHQFFDTVEKHYGIRIEYTFPNAVEVQA 140
            GAEDVVL++ A       +VF LDTGRL  ET++  D V + Y I I+   P  + V+ 
Sbjct: 1   LGAEDVVLVDLAAKVRPDVKVFFLDTGRLFKETYETIDQVRERYDILIDVLSPPPLTVEE 60

Query: 141 LVRTKGLFSFYEDGHQECCRIRKVRPLKRALKGLRAWITGQRKDQSPGTRAEIPVVQIDT 200
            V+  GL  FY     ECC IRKV PLKRAL G+ AWITG R+DQSP TRA+ P ++ID 
Sbjct: 61  QVKEYGLNLFYRSVPHECCGIRKVEPLKRALAGVSAWITGLRRDQSP-TRAQAPFLEID- 118

Query: 201 SFEGIDGGKGSLVKWNPLANVKGQDIWNFLRAMNIPINSLHSQGYISIGCEPCTRPVLPG 260
                      LVK NPLA+   +D+W ++    +P N LH +GY SIGCEPCTRPV PG
Sbjct: 119 -------EAFGLVKINPLADWTSEDVWEYIADNELPYNPLHDRGYPSIGCEPCTRPVAPG 171

Query: 261 QHEREGRWWWEDAKAKECGL 280
           +  R GRWWWE+A  KECGL
Sbjct: 172 EDPRAGRWWWEEAAKKECGL 191


>gnl|CDD|223253 COG0175, CysH, 3'-phosphoadenosine 5'-phosphosulfate
           sulfotransferase (PAPS reductase)/FAD synthetase and
           related enzymes [Amino acid transport and metabolism /
           Coenzyme metabolism].
          Length = 261

 Score =  221 bits (564), Expect = 9e-70
 Identities = 101/259 (38%), Positives = 139/259 (53%), Gaps = 14/259 (5%)

Query: 37  EGRIESTNDHEDYEKLARGMESASPLEIMDKAFQKFGNDIAIAFS-GAEDVVLIEYAKLT 95
            G        E+   L   +E+ SP+EI+  A ++F N + ++FS G +  VL+  A   
Sbjct: 3   TGSPTKDRMSENLASLLDKLEAESPIEILRWAAEEFSNPVVVSFSGGKDSTVLLHLAAKA 62

Query: 96  GRPFRVFSLDTGRLNPETHQFFDTVEKHYGIRIEYTFPNAVEVQALVRTKGLFSFYEDGH 155
              F V  LDTG   PET++F D + + YG+ ++  +    EV A     G   +     
Sbjct: 63  FPDFPVIFLDTGYHFPETYEFRDRLAEEYGLDLK-VYRPDDEV-AEGEKYGGKLWEPSVE 120

Query: 156 QECCRIRKVRPLKRALKGL--RAWITGQRKDQSPGTRAEIPVVQIDTSFEGIDGGKGSLV 213
           + CC IRKV PLKRAL      AW TG R+D+SP TRA++PVV  D+ F          +
Sbjct: 121 RWCCDIRKVEPLKRALDEYGFDAWFTGLRRDESP-TRAKLPVVSFDSEFGES-------I 172

Query: 214 KWNPLANVKGQDIWNFLRAMNIPINSLHSQGYISIGCEPCTRPVLP-GQHEREGRWWWED 272
           + NPLA+    D+W ++ A N+P N L+ QGY SIGC PCTRPV P  + ER GRW  E 
Sbjct: 173 RVNPLADWTELDVWLYILANNLPYNPLYDQGYRSIGCWPCTRPVEPLAEDERAGRWEGEL 232

Query: 273 AKAKECGLHNGNIKQEELS 291
           A+  ECGLH  +     LS
Sbjct: 233 AEKTECGLHRADDPDSALS 251


>gnl|CDD|201832 pfam01507, PAPS_reduct, Phosphoadenosine phosphosulfate reductase
           family.  This domain is found in phosphoadenosine
           phosphosulfate (PAPS) reductase enzymes or PAPS
           sulfotransferase. PAPS reductase is part of the adenine
           nucleotide alpha hydrolases superfamily also including N
           type ATP PPases and ATP sulphurylases. The enzyme uses
           thioredoxin as an electron donor for the reduction of
           PAPS to phospho-adenosine-phosphate (PAP). It is also
           found in NodP nodulation protein P from Rhizobium which
           has ATP sulfurylase activity (sulfate adenylate
           transferase).
          Length = 173

 Score =  196 bits (500), Expect = 2e-61
 Identities = 73/184 (39%), Positives = 104/184 (56%), Gaps = 15/184 (8%)

Query: 76  IAIAFS-GAEDVVLIEYAKLTGRPFRVFSLDTGRLNPETHQFFDTVEKHYGIRIEYTFPN 134
           + ++FS G + +VL+  A     P  V  +DTG   PET++F D +E+ YG+ ++   P 
Sbjct: 2   LVVSFSGGKDSLVLLHLASKAFPPGPVIFIDTGYEFPETYEFVDELEEKYGLNLKVYLPE 61

Query: 135 AVEVQALVRTKGLFSFYEDGHQECCRIRKVRPLKRALK--GLRAWITGQRKDQSPGTRAE 192
               + +       S Y    + CCR+RKV PLKRALK  G  AW TG R+D+SP +RA+
Sbjct: 62  DSFAEGINPEGIPSSLY----RRCCRLRKVEPLKRALKELGFDAWFTGLRRDESP-SRAK 116

Query: 193 IPVVQIDTSFEGIDGGKGSLVKWNPLANVKGQDIWNFLRAMNIPINSLHSQGYISIGCEP 252
           +P+V ID  F         ++K  PL N    D+W ++ A N+P N L+ QGY SIGC P
Sbjct: 117 LPIVSIDGDFPK-------VIKVFPLLNWTETDVWQYILANNVPYNPLYDQGYRSIGCYP 169

Query: 253 CTRP 256
           CT P
Sbjct: 170 CTGP 173


>gnl|CDD|239291 cd02993, PDI_a_APS_reductase, PDIa family, 5'-Adenylylsulfate (APS)
           reductase subfamily; composed of plant-type APS
           reductases containing a C-terminal redox active TRX
           domain and an N-terminal reductase domain which is part
           of a superfamily that includes N type ATP PPases. APS
           reductase catalyzes the reduction of activated sulfate
           to sulfite, a key step in the biosynthesis of
           sulfur-containing metabolites. Sulfate is first
           activated by ATP sulfurylase, forming APS, which can be
           phosphorylated to 3'-phosphoadenosine-5'-phosphosulfate
           (PAPS). Depending on the organism, either APS or PAPS
           can be used for sulfate reduction. Prokaryotes and fungi
           use PAPS, whereas plants use both APS and PAPS. Since
           plant-type APS reductase uses glutathione (GSH) as its
           electron donor, the C-terminal domain may function like
           glutaredoxin, a GSH-dependent member of the TRX
           superfamily. The flow of reducing equivalents goes from
           GSH -> C-terminal TRX domain -> N-terminal reductase
           domain -> APS. Plant-type APS reductase shows no
           homology to that of dissimilatory sulfate-reducing
           bacteria, which is an iron-sulfur flavoenzyme. Also
           included in the alignment is EYE2 from Chlamydomonas
           reinhardtii, a protein required for eyespot assembly.
          Length = 109

 Score =  179 bits (455), Expect = 1e-55
 Identities = 69/107 (64%), Positives = 84/107 (78%)

Query: 320 LVSFRRTGIENLARLQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRA 379
           +V+  R  IE LA+ + R    L+VLYAPWC FCQAME SY ELAE+L G  VKV KF A
Sbjct: 3   VVTLSRAEIEALAKGERRNQSTLVVLYAPWCPFCQAMEASYEELAEKLAGSNVKVAKFNA 62

Query: 380 DGDHKEFAKQKLQLVSFPTILFFPKHSAKPVKYPSEKRDVDSLMAFV 426
           DG+ +EFAK++LQL SFPTILFFPK+S +P+KYPSE+RDVDSL+ FV
Sbjct: 63  DGEQREFAKEELQLKSFPTILFFPKNSRQPIKYPSEQRDVDSLLMFV 109


>gnl|CDD|238846 cd01713, PAPS_reductase, This domain is found in phosphoadenosine
           phosphosulphate (PAPS) reductase enzymes or PAPS
           sulphotransferase. PAPS reductase is part of the adenine
           nucleotide alpha hydrolases superfamily also including N
           type ATP PPases and ATP sulphurylases. A highly modified
           version of the P loop, the fingerprint peptide of
           mononucleotide-binding proteins, is present in the
           active site of the protein, which appears to be a
           positively charged cleft containing a number of
           conserved arginine and lysine residues. Although PAPS
           reductase has no ATPase activity, it shows a striking
           similarity to the structure of the ATP pyrophosphatase
           (ATP PPase) domain of GMP synthetase, indicating that
           both enzyme families have evolved from a common
           ancestral nucleotide-binding fold.   The enzyme uses
           thioredoxin as an electron donor for the reduction of
           PAPS to phospho-adenosine-phosphate (PAP) . It is also
           found in NodP nodulation protein P from Rhizobium
           meliloti which has ATP sulphurylase activity (sulphate
           adenylate transferase) .
          Length = 173

 Score =  174 bits (444), Expect = 4e-53
 Identities = 69/181 (38%), Positives = 94/181 (51%), Gaps = 14/181 (7%)

Query: 75  DIAIAFS-GAEDVVLIEYAKLTGR---PFRVFSLDTGRLNPETHQFFDTVEKHYGIRIEY 130
           ++ ++FS G +  VL+  A        P  V  LDTG   PET++F D V + YG+ +  
Sbjct: 1   NVVVSFSGGKDSTVLLHLALKALPELKPVPVIFLDTGYEFPETYEFVDRVAERYGLPLVV 60

Query: 131 TFPNAVEVQALVRTKGLFSFYEDGHQECCRIRKVRPLKRALKGLR--AWITGQRKDQSPG 188
             P     + L      F       + CCRI KV PL+RALK L   AWITG R+D+S  
Sbjct: 61  VRPPDSPAEGLALGLKGFPLPSPDRRWCCRILKVEPLRRALKELGVVAWITGIRRDESA- 119

Query: 189 TRAEIPVVQIDTSFEGIDGGKGSLVKWNPLANVKGQDIWNFLRAMNIPINSLHSQGYISI 248
            RA +PVV         D GKG ++K NPL +   +D+W +L    +P N L+ QGY SI
Sbjct: 120 RRALLPVV-------WTDDGKGGILKVNPLLDWTYEDVWAYLARHGLPYNPLYDQGYRSI 172

Query: 249 G 249
           G
Sbjct: 173 G 173


>gnl|CDD|129526 TIGR00434, cysH, phosophoadenylyl-sulfate reductase (thioredoxin). 
           This enzyme, involved in the assimilation of inorganic
           sulfate, is designated cysH in Bacteria and MET16 in
           Saccharomyces cerevisiae. Synonyms include
           phosphoadenosine phosphosulfate reductase, PAPS
           reductase, and PAPS reductase, thioredoxin-dependent. In
           a reaction requiring reduced thioredoxin and NADPH, it
           converts 3(prime)-phosphoadenylylsulfate (PAPS) to
           sulfite and adenosine 3(prime),5(prime) diphosphate
           (PAP). A related family of plant enzymes, scoring below
           the trusted cutoff, differs in having a thioredoxin-like
           C-terminal domain, not requiring thioredoxin, and in
           having a preference for 5(prime)-adenylylsulfate (APS)
           over PAPS [Central intermediary metabolism, Sulfur
           metabolism].
          Length = 212

 Score =  158 bits (400), Expect = 4e-46
 Identities = 81/226 (35%), Positives = 115/226 (50%), Gaps = 20/226 (8%)

Query: 61  PLEIMDKAFQKFGNDIAIAFS-GAEDVVLIEYAKLTGRPFRVFSLDTGRLNPETHQFFDT 119
             EI+  A+  FG  +  + S G +  VL++          V  LDTG   PET++  D 
Sbjct: 1   AQEIIAWAYVTFGGHLVYSTSFGIQGAVLLDLVSKISPDIPVIFLDTGYHFPETYELIDE 60

Query: 120 VEKHYGIRIE-YTFPNAVEVQALVRTKGLFSFYEDGHQECCRIRKVRPLKRALKGL--RA 176
           + + Y + I+ Y    ++  QA    K     +E    +   +RKV P+ RALK L   A
Sbjct: 61  LTERYPLNIKVYKPDLSLAEQA---AKYGDKLWEQDPNKYDYLRKVEPMHRALKELHASA 117

Query: 177 WITGQRKDQSPGTRAEIPVVQIDTSFEGIDGGKGSLVKWNPLANVKGQDIWNFLRAMNIP 236
           W TG R+DQ P +RA + ++ ID  F         ++K  PL +   +D++ ++ A N+P
Sbjct: 118 WFTGLRRDQGP-SRANLSILNIDEKFG--------ILKVLPLIDWTWKDVYQYIDAHNLP 168

Query: 237 INSLHSQGYISIGCEPCTRPVLPGQHEREGRWWWEDAKAK-ECGLH 281
            N LH QGY SIG    TRPV  G+ ER GRW     KAK ECGLH
Sbjct: 169 YNPLHDQGYPSIGDYHSTRPVKEGEDERAGRW---KGKAKTECGLH 211


>gnl|CDD|131112 TIGR02057, PAPS_reductase, phosphoadenosine phosphosulfate
           reductase, thioredoxin dependent.  Requiring thioredoxin
           as an electron donor, phosphoadenosine phosphosulfate
           reductase catalyzes the reduction of
           3'-phosphoadenylylsulfate (PAPS) to sulfite and
           phospho-adenosine-phosphate (PAP). Found in
           enterobacteria, cyanobacteria, and yeast, PAPS reductase
           is related to a group of plant (TIGR00424) and bacterial
           (TIGR02055) enzymes preferring 5'-adenylylsulfate (APS)
           over PAPS as a substrate for reduction to sulfite
           [Central intermediary metabolism, Sulfur metabolism].
          Length = 226

 Score =  136 bits (343), Expect = 1e-37
 Identities = 72/232 (31%), Positives = 109/232 (46%), Gaps = 19/232 (8%)

Query: 56  MESASPLEIMDKAFQKFGNDIAIAFS-GAEDVVLIEY-AKLTGRPFRVFSLDTGRLNPET 113
           +E  +P EI+  +   F + +    + G + +V +   + ++     V  +DT    P+T
Sbjct: 8   LEKLTPQEIIAWSIVTFPHGLVQTSAFGIQALVTLHLLSSISEPMIPVIFIDTLYHFPQT 67

Query: 114 HQFFDTVEKHYGIRIEYTFPNAVEVQALVRTKGLFSFYEDGHQECC-RIRKVRPLKRALK 172
               D + K Y   +     +  E +A    K           E    I KV P++RALK
Sbjct: 68  LTLKDELTKKYYQTLNLYKYDGCESEADFEAK-YGKLLWQKDIEKYDYIAKVEPMQRALK 126

Query: 173 GLR--AWITGQRKDQSPGTRAEIPVVQIDTSFEGIDGGKGSLVKWNPLANVKGQDIWNFL 230
            L   AW TG+R+DQ    RA +PV++ID         +  ++K NPL +   + ++ +L
Sbjct: 127 ELNASAWFTGRRRDQG-SARANLPVIEID--------EQNGILKVNPLIDWTFEQVYQYL 177

Query: 231 RAMNIPINSLHSQGYISIGCEPCTRPVLPGQHEREGRWWWEDAKAK-ECGLH 281
            A N+P N L  QGY SIG    TR V  G+ ER GRW     K K ECG+H
Sbjct: 178 DAHNVPYNPLLDQGYRSIGDYHSTRKVKEGEDERAGRWK---GKLKTECGIH 226


>gnl|CDD|239259 cd02961, PDI_a_family, Protein Disulfide Isomerase (PDIa) family,
           redox active TRX domains; composed of eukaryotic
           proteins involved in oxidative protein folding in the
           endoplasmic reticulum (ER) by acting as catalysts and
           folding assistants. Members of this family include PDI
           and PDI-related proteins like ERp72, ERp57 (or ERp60),
           ERp44, P5, PDIR, ERp46 and the transmembrane PDIs. PDI,
           ERp57, ERp72, P5, PDIR and ERp46 are all oxidases,
           catalyzing the formation of disulfide bonds of newly
           synthesized polypeptides in the ER. They also exhibit
           reductase activity in acting as isomerases to correct
           any non-native disulfide bonds, as well as chaperone
           activity to prevent protein aggregation and facilitate
           the folding of newly synthesized proteins. These
           proteins usually contain multiple copies of a redox
           active TRX (a) domain containing a CXXC motif, and may
           also contain one or more redox inactive TRX-like (b)
           domains. Only one a domain is required for the oxidase
           function but multiple copies are necessary for the
           isomerase function. The different types of PDIs may show
           different substrate specificities and tissue-specific
           expression, or may be induced by stress. PDIs are in
           their reduced form at steady state and are oxidized to
           the active form by Ero1, which is localized in the ER
           through ERp44. Some members of this family also contain
           a DnaJ domain in addition to the redox active a domains;
           examples are ERdj5 and Pfj2. Also included in the family
           is the redox inactive N-terminal TRX-like domain of
           ERp29.
          Length = 101

 Score = 88.8 bits (221), Expect = 1e-21
 Identities = 33/99 (33%), Positives = 51/99 (51%), Gaps = 4/99 (4%)

Query: 329 ENLARLQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMG-VKVGKFRADGDHKEFA 387
           +N   L       L+  YAPWC  C+A+   Y +LA++L+G G V V K      + +  
Sbjct: 6   DNFDELVKDSKDVLVEFYAPWCGHCKALAPEYEKLAKELKGDGKVVVAKVDCT-ANNDLC 64

Query: 388 KQKLQLVSFPTILFFPKHSAKPVKYPSEKRDVDSLMAFV 426
            +   +  +PTI  FP  S +PVKY    R ++SL+ F+
Sbjct: 65  SEY-GVRGYPTIKLFPNGSKEPVKYEGP-RTLESLVEFI 101


>gnl|CDD|239296 cd02998, PDI_a_ERp38, PDIa family, endoplasmic reticulum protein 38
           (ERp38) subfamily; composed of proteins similar to the
           P5-like protein first isolated from alfalfa, which
           contains two redox active TRX (a) domains at the
           N-terminus, like human P5, and a C-terminal domain with
           homology to the C-terminal domain of ERp29, unlike human
           P5. The cDNA clone of this protein (named G1) was
           isolated from an alfalfa cDNA library by screening with
           human protein disulfide isomerase (PDI) cDNA. The G1
           protein is constitutively expressed in all major organs
           of the plant and its expression is induced by treatment
           with tunicamycin, indicating that it may be a
           glucose-regulated protein. The G1 homolog in the
           eukaryotic social amoeba Dictyostelium discoideum is
           also described as a P5-like protein, which is located in
           the endoplasmic reticulum (ER) despite the absence of an
           ER-retrieval signal. G1 homologs from Aspergillus niger
           and Neurospora crassa have also been characterized, and
           are named TIGA and ERp38, respectively. Also included in
           the alignment is an atypical PDI from Leishmania
           donovani containing a single a domain, and the
           C-terminal a domain of a P5-like protein from Entamoeba
           histolytica.
          Length = 105

 Score = 77.7 bits (192), Expect = 1e-17
 Identities = 35/86 (40%), Positives = 48/86 (55%), Gaps = 3/86 (3%)

Query: 342 LIVLYAPWCHFCQAMEGSYIELAEQLEGM-GVKVGKFRADGDHKEFAKQKLQLVSFPTIL 400
           L+  YAPWC  C+ +   Y +LA        V + K  AD  +K+ AK K  +  FPT+ 
Sbjct: 22  LVEFYAPWCGHCKNLAPEYEKLAAVFANEDDVVIAKVDADEANKDLAK-KYGVSGFPTLK 80

Query: 401 FFPKHSAKPVKYPSEKRDVDSLMAFV 426
           FFPK S +PVKY    RD++ L+ FV
Sbjct: 81  FFPKGSTEPVKY-EGGRDLEDLVKFV 105


>gnl|CDD|239293 cd02995, PDI_a_PDI_a'_C, PDIa family, C-terminal TRX domain (a')
           subfamily; composed of the C-terminal redox active a'
           domains of PDI, ERp72, ERp57 (or ERp60) and EFP1. PDI,
           ERp72 and ERp57 are endoplasmic reticulum (ER)-resident
           eukaryotic proteins involved in oxidative protein
           folding. They are oxidases, catalyzing the formation of
           disulfide bonds of newly synthesized polypeptides in the
           ER. They also exhibit reductase activity in acting as
           isomerases to correct any non-native disulfide bonds, as
           well as chaperone activity to prevent protein
           aggregation and facilitate the folding of newly
           synthesized proteins. PDI and ERp57 have the abb'a'
           domain structure (where a and a' are redox active TRX
           domains while b and b' are redox inactive TRX-like
           domains). PDI also contains an acidic region (c domain)
           after the a' domain that is absent in ERp57. ERp72 has
           an additional a domain at the N-terminus (a"abb'a'
           domain structure). ERp57 interacts with the lectin
           chaperones, calnexin and calreticulin, and specifically
           promotes the oxidative folding of glycoproteins, while
           PDI shows a wider substrate specificity. ERp72
           associates with several ER chaperones and folding
           factors to form complexes in the ER that bind nascent
           proteins. EFP1 is a binding partner protein of thyroid
           oxidase, which is responsible for the generation of
           hydrogen peroxide, a crucial substrate of
           thyroperoxidase, which functions to iodinate
           thyroglobulin and synthesize thyroid hormones.
          Length = 104

 Score = 68.7 bits (169), Expect = 2e-14
 Identities = 34/91 (37%), Positives = 46/91 (50%), Gaps = 14/91 (15%)

Query: 342 LIVLYAPWCHFCQAMEGSYIELAEQLEGM-GVKVGKFRADGD----HKEFAKQKLQLVSF 396
           L+  YAPWC  C+A+   Y ELAE+L+G   V + K   D        EF         F
Sbjct: 22  LVEFYAPWCGHCKALAPIYEELAEKLKGDDNVVIAKM--DATANDVPSEFVVD-----GF 74

Query: 397 PTILFFPK-HSAKPVKYPSEKRDVDSLMAFV 426
           PTILFFP    + P+KY    R ++ L+ F+
Sbjct: 75  PTILFFPAGDKSNPIKY-EGDRTLEDLIKFI 104


>gnl|CDD|239302 cd03004, PDI_a_ERdj5_C, PDIa family, C-terminal ERdj5 subfamily;
           ERdj5, also known as  JPDI and macrothioredoxin, is a
           protein containing an N-terminal DnaJ domain and four
           redox active TRX domains. This subfamily is composed of
           the three TRX domains located at the C-terminal half of
           the protein. ERdj5 is a ubiquitous protein localized in
           the endoplasmic reticulum (ER) and is abundant in
           secretory cells. It's transcription is induced during ER
           stress. It interacts with BiP through its DnaJ domain in
           an ATP-dependent manner. BiP, an ER-resident member of
           the Hsp70 chaperone family, functions in ER-associated
           degradation and protein translocation. Also included in
           the alignment is the single complete TRX domain of an
           uncharacterized protein from Tetraodon nigroviridis,
           which also contains a DnaJ domain at its N-terminus.
          Length = 104

 Score = 67.7 bits (166), Expect = 3e-14
 Identities = 32/91 (35%), Positives = 51/91 (56%), Gaps = 3/91 (3%)

Query: 336 NREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQKLQLVS 395
           NR++PWL+  YAPWC  CQA+     + A  L+G  VKVG       ++    Q+  + +
Sbjct: 17  NRKEPWLVDFYAPWCGPCQALLPELRKAARALKGK-VKVGSVDCQ-KYESLC-QQANIRA 73

Query: 396 FPTILFFPKHSAKPVKYPSEKRDVDSLMAFV 426
           +PTI  +P +++K   Y    RD DS++ F+
Sbjct: 74  YPTIRLYPGNASKYHSYNGWHRDADSILEFI 104


>gnl|CDD|200074 TIGR01126, pdi_dom, protein disulfide-isomerase domain.  This model
           describes a domain of eukaryotic protein disulfide
           isomerases, generally found in two copies. The high
           cutoff for total score reflects the expectation of
           finding both copies. The domain is similar to
           thioredoxin but the redox-active disulfide region motif
           is APWCGHCK [Protein fate, Protein folding and
           stabilization].
          Length = 102

 Score = 65.0 bits (159), Expect = 4e-13
 Identities = 30/90 (33%), Positives = 48/90 (53%), Gaps = 5/90 (5%)

Query: 339 DPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMG-VKVGKFRADGDHKEFAKQKLQLVSFP 397
              L+  YAPWC  C+ +   Y +LA++L+    + + K  A  +    ++  +    FP
Sbjct: 14  KDVLVEFYAPWCGHCKNLAPEYEKLAKELKKDPDIVLAKVDATAEKDLASRFGVS--GFP 71

Query: 398 TILFFPKHSAKPVKYPSEKRDVDSLMAFVN 427
           TI FFPK   KPV Y    RD+++++ FVN
Sbjct: 72  TIKFFPKGK-KPVDYEGG-RDLEAIVEFVN 99


>gnl|CDD|239299 cd03001, PDI_a_P5, PDIa family, P5 subfamily; composed of
           eukaryotic proteins similar to human P5, a PDI-related
           protein with a domain structure of aa'b (where a and a'
           are redox active TRX domains and b is a redox inactive
           TRX-like domain). Like PDI, P5 is located in the
           endoplasmic reticulum (ER) and displays both isomerase
           and chaperone activities, which are independent of each
           other. Compared to PDI, the isomerase and chaperone
           activities of P5 are lower. The first cysteine in the
           CXXC motif of both redox active domains in P5 is
           necessary for isomerase activity. The P5 gene was first
           isolated as an amplified gene from a
           hydroxyurea-resistant hamster cell line. The zebrafish
           P5 homolog has been implicated to play a critical role
           in establishing left/right asymmetries in the embryonic
           midline. Some members of this subfamily are P5-like
           proteins containing only one redox active TRX domain.
          Length = 103

 Score = 62.7 bits (153), Expect = 2e-12
 Identities = 27/91 (29%), Positives = 43/91 (47%), Gaps = 4/91 (4%)

Query: 336 NREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQKLQLVS 395
           N +D WL+  YAPWC  C+ +   + + A+ L+G+ VKVG   AD       +  ++   
Sbjct: 16  NSDDVWLVEFYAPWCGHCKNLAPEWKKAAKALKGI-VKVGAVDADVHQSLAQQYGVR--G 72

Query: 396 FPTILFFPKHSAKPVKYPSEKRDVDSLMAFV 426
           FPTI  F      P  Y    R   ++++  
Sbjct: 73  FPTIKVFGAGKNSPQDYQGG-RTAKAIVSAA 102


>gnl|CDD|215704 pfam00085, Thioredoxin, Thioredoxin.  Thioredoxins are small
           enzymes that participate in redox reactions, via the
           reversible oxidation of an active centre disulfide bond.
           Some members with only the active site are not separated
           from the noise.
          Length = 104

 Score = 60.7 bits (148), Expect = 1e-11
 Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 5/96 (5%)

Query: 334 LQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQKLQL 393
           +   + P L+  YAPWC  C+A+   Y +LA++ +   VK  K  AD ++ + A +   +
Sbjct: 14  VAKSDKPVLVDFYAPWCGPCKALAPEYEKLAQEYKD-DVKFAKVDAD-ENPDLASE-YGV 70

Query: 394 VSFPTILFFPKHSAKPVKYPSEKRDVDSLMAFVNAL 429
             FPTI FF     K   Y    R  D L+AF+   
Sbjct: 71  RGFPTIKFFKN-GKKVSDY-VGARTKDDLVAFIKKH 104


>gnl|CDD|233282 TIGR01130, ER_PDI_fam, protein disulfide isomerase, eukaryotic.
           This model represents eukaryotic protein disulfide
           isomerases retained in the endoplasmic reticulum (ER)
           and closely related forms. Some members have been
           assigned alternative or additional functions such as
           prolyl 4-hydroxylase and
           dolichyl-diphosphooligosaccharide-protein
           glycotransferase. Members of this family have at least
           two protein-disulfide domains, each similar to
           thioredoxin but with the redox-active disulfide in the
           motif PWCGHCK, and an ER retention signal at the extreme
           C-terminus (KDEL, HDEL, and similar motifs).
          Length = 462

 Score = 65.1 bits (159), Expect = 2e-11
 Identities = 27/90 (30%), Positives = 39/90 (43%), Gaps = 10/90 (11%)

Query: 342 LIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADG---DHKEFAKQKLQLVSFPT 398
           L+  YAPWC  C+ +   Y ELAE+ +     V   + D    D   F  +      FPT
Sbjct: 368 LVEFYAPWCGHCKNLAPIYEELAEKYKDAESDVVIAKMDATANDVPPFEVE-----GFPT 422

Query: 399 ILFFPKHS-AKPVKYPSEKRDVDSLMAFVN 427
           I F P    ++PV Y    R ++    F+ 
Sbjct: 423 IKFVPAGKKSEPVPY-DGDRTLEDFSKFIA 451



 Score = 43.9 bits (104), Expect = 1e-04
 Identities = 26/96 (27%), Positives = 49/96 (51%), Gaps = 10/96 (10%)

Query: 337 REDPWLIV-LYAPWCHFCQAMEGSYIELAEQL--EGMGVKVGKFRADGDHKEFAKQKLQL 393
           +   +++V  YAPWC  C+++   Y + A++L  +G  +K+ K  A  + K+ A QK  +
Sbjct: 16  KSHEFVLVEFYAPWCGHCKSLAPEYEKAADELKKKGPPIKLAKVDAT-EEKDLA-QKYGV 73

Query: 394 VSFPTILFFP--KHSAKPVKYPSEKRDVDSLMAFVN 427
             +PT+  F   + S      P   RD D ++ ++ 
Sbjct: 74  SGYPTLKIFRNGEDSVSDYNGP---RDADGIVKYMK 106


>gnl|CDD|240266 PTZ00102, PTZ00102, disulphide isomerase; Provisional.
          Length = 477

 Score = 58.6 bits (142), Expect = 3e-09
 Identities = 30/94 (31%), Positives = 44/94 (46%), Gaps = 4/94 (4%)

Query: 335 QNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMG-VKVGKFRADGDHKEFAKQKLQL 393
              +   L+ +YAPWC  C+ +E  Y EL E+ +    + V K   +G   E   ++   
Sbjct: 372 FKSDKDVLLEIYAPWCGHCKNLEPVYNELGEKYKDNDSIIVAKM--NGTANETPLEEFSW 429

Query: 394 VSFPTILFFPKHSAKPVKYPSEKRDVDSLMAFVN 427
            +FPTILF       P+ Y  E R V+    FVN
Sbjct: 430 SAFPTILFVKAGERTPIPYEGE-RTVEGFKEFVN 462



 Score = 49.7 bits (119), Expect = 1e-06
 Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 7/90 (7%)

Query: 342 LIVLYAPWCHFCQAMEGSYIELAEQL--EGMGVKVGKFRADGDHKEFAKQKLQLVSFPTI 399
           L+  YAPWC  C+ +   Y + A+ L  +   + +    A  +  E A Q+  +  +PTI
Sbjct: 53  LVKFYAPWCGHCKRLAPEYKKAAKMLKEKKSEIVLASVDATEE-MELA-QEFGVRGYPTI 110

Query: 400 LFFPKHSAKPVKYPSEKRDVDSLMAFVNAL 429
            FF K    PV Y    R  D +++++  L
Sbjct: 111 KFFNK--GNPVNYSG-GRTADGIVSWIKKL 137


>gnl|CDD|239298 cd03000, PDI_a_TMX3, PDIa family, TMX3 subfamily; composed of
           eukaryotic proteins similar to human TMX3, a TRX related
           transmembrane protein containing one redox active TRX
           domain at the N-terminus and a classical ER retrieval
           sequence for type I transmembrane proteins at the
           C-terminus. The TMX3 transcript is found in a variety of
           tissues with the highest levels detected in skeletal
           muscle and the heart. In vitro, TMX3 showed oxidase
           activity albeit slightly lower than that of protein
           disulfide isomerase.
          Length = 104

 Score = 51.3 bits (123), Expect = 2e-08
 Identities = 30/97 (30%), Positives = 45/97 (46%), Gaps = 8/97 (8%)

Query: 334 LQNR-EDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMG--VKVGKFRADGDHKEFAKQK 390
              R ED WL+  YAPWC  C+ +E  + E+  +L+  G  V+VGK  A   +   A  +
Sbjct: 10  KDVRKEDIWLVDFYAPWCGHCKKLEPVWNEVGAELKSSGSPVRVGKLDATA-YSSIAS-E 67

Query: 391 LQLVSFPTILFFPKHSAKPVKYPSEKRDVDSLMAFVN 427
             +  +PTI       A   + P  K   D ++ F N
Sbjct: 68  FGVRGYPTIKLLKGDLAYNYRGPRTK---DDIVEFAN 101


>gnl|CDD|237509 PRK13794, PRK13794, hypothetical protein; Provisional.
          Length = 479

 Score = 54.3 bits (131), Expect = 6e-08
 Identities = 60/250 (24%), Positives = 99/250 (39%), Gaps = 64/250 (25%)

Query: 29  YGPHHKGVEGRIESTND-HEDYEKLARGMESASPLEIMDKAFQKFGNDIAIAFSGAED-- 85
           YGP  +  +  +E+  +  + YE+ + G         +    +K    + +A+SG +D  
Sbjct: 210 YGPGEETWKDMVEANKNVLDKYERNSIGF--------IRNTAEKINKPVTVAYSGGKDSL 261

Query: 86  VVLIEYAKLTGRPFRVFSLDTGRLNPETHQFFDTVEKHYGIRIEYTFPNAVEVQALVRTK 145
             L+   K  G  F V   DTG   PET +  + VEKHYG+ I             +RTK
Sbjct: 262 ATLLLALKALGINFPVLFNDTGLEFPETLENVEDVEKHYGLEI-------------IRTK 308

Query: 146 GLFSFYEDGHQE---------CCRIRKVRPLKRAL-----KGLRAWITGQRKDQSPGTRA 191
               F+E   +          C  + K+ PL + +         +++ GQRK +S   R+
Sbjct: 309 SE-EFWEKLEEYGPPARDNRWCSEVCKLEPLGKLIDEKYEGECLSFV-GQRKYES-FNRS 365

Query: 192 EIPVV--------QIDTSFEGIDGGKGSLVKWNPLANVKGQDIWNFLRAMNIPINSLHSQ 243
           + P +        QI  +               P+ +     +W +L     P N L+ Q
Sbjct: 366 KKPRIWRNPYIKKQILAA---------------PILHWTAMHVWIYLFREKAPYNKLYEQ 410

Query: 244 GYISIGCEPC 253
           G+  IGC  C
Sbjct: 411 GFDRIGCFMC 420


>gnl|CDD|239300 cd03002, PDI_a_MPD1_like, PDI family, MPD1-like subfamily; composed
           of eukaryotic proteins similar to Saccharomyces
           cerevisiae MPD1 protein, which contains a single redox
           active TRX domain located at the N-terminus, and an ER
           retention signal at the C-terminus indicative of an
           ER-resident protein. MPD1 has been shown to suppress the
           maturation defect of carboxypeptidase Y caused by
           deletion of the yeast PDI1 gene. Other characterized
           members of this subfamily include the Aspergillus niger
           prpA protein and Giardia PDI-1. PrpA is non-essential to
           strain viability, however, its transcript level is
           induced by heterologous protein expression suggesting a
           possible role in oxidative protein folding during high
           protein production. Giardia PDI-1 has the ability to
           refold scrambled RNase and exhibits transglutaminase
           activity.
          Length = 109

 Score = 48.9 bits (117), Expect = 2e-07
 Identities = 27/101 (26%), Positives = 47/101 (46%), Gaps = 14/101 (13%)

Query: 334 LQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGD-HKEF-AKQKL 391
           + N     L+  YAPWC  C+ ++  Y + A++L+G+ V+V     D D +K    K  +
Sbjct: 14  VHNTNYTTLVEFYAPWCGHCKNLKPEYAKAAKELDGL-VQVAAVDCDEDKNKPLCGKYGV 72

Query: 392 QLVSFPTILFFP------KHSAKPVKYPSEKRDVDSLMAFV 426
           Q   FPT+  F       KH+ +        +   +++ FV
Sbjct: 73  Q--GFPTLKVFRPPKKASKHAVEDYNGERSAK---AIVDFV 108


>gnl|CDD|185622 PTZ00443, PTZ00443, Thioredoxin domain-containing protein;
           Provisional.
          Length = 224

 Score = 51.2 bits (122), Expect = 2e-07
 Identities = 26/86 (30%), Positives = 39/86 (45%), Gaps = 5/86 (5%)

Query: 340 PWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQKLQLVSFPTI 399
           PW +  YAPWC  C+ M  ++  LA+ L+G  V V     D        ++  +  +PT+
Sbjct: 54  PWFVKFYAPWCSHCRKMAPAWERLAKALKGQ-VNVADL--DATRALNLAKRFAIKGYPTL 110

Query: 400 LFFPKHSAKPVKYPSEKRDVDSLMAF 425
           L F     K  +Y    R  + L AF
Sbjct: 111 LLF--DKGKMYQYEGGDRSTEKLAAF 134


>gnl|CDD|239301 cd03003, PDI_a_ERdj5_N, PDIa family, N-terminal ERdj5 subfamily;
           ERdj5, also known as JPDI and macrothioredoxin, is a
           protein containing an N-terminal DnaJ domain and four
           redox active TRX domains. This subfamily is comprised of
           the first TRX domain of ERdj5 located after the DnaJ
           domain at the N-terminal half of the protein. ERdj5 is a
           ubiquitous protein localized in the endoplasmic
           reticulum (ER) and is abundant in secretory cells. It's
           transcription is induced during ER stress. It interacts
           with BiP through its DnaJ domain in an ATP-dependent
           manner. BiP, an ER-resident member of the Hsp70
           chaperone family, functions in ER-associated degradation
           and protein translocation.
          Length = 101

 Score = 46.7 bits (111), Expect = 1e-06
 Identities = 23/90 (25%), Positives = 47/90 (52%), Gaps = 5/90 (5%)

Query: 336 NREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQKLQLVS 395
           N  + W +  Y+P C  C  +  ++ E A++++G+ +++G     GD +   + +  + S
Sbjct: 16  NSGEIWFVNFYSPRCSHCHDLAPTWREFAKEMDGV-IRIGAVNC-GDDRMLCRSQ-GVNS 72

Query: 396 FPTILFFPKHSAKPVKYPSEKRDVDSLMAF 425
           +P++  FP     P KY  + R  +SL+ F
Sbjct: 73  YPSLYVFPS-GMNPEKYYGD-RSKESLVKF 100


>gnl|CDD|239292 cd02994, PDI_a_TMX, PDIa family, TMX subfamily; composed of
           proteins similar to the TRX-related human transmembrane
           protein, TMX. TMX is a type I integral membrane protein;
           the N-terminal redox active TRX domain is present in the
           endoplasmic reticulum (ER) lumen while the C-terminus is
           oriented towards the cytoplasm. It is expressed in many
           cell types and its active site motif (CPAC) is unique.
           In vitro, TMX reduces interchain disulfides of insulin
           and renatures inactive RNase containing incorrect
           disulfide bonds. The C. elegans homolog, DPY-11, is
           expressed only in the hypodermis and resides in the
           cytoplasm. It is required for body and sensory organ
           morphogeneis. Another uncharacterized TRX-related
           transmembrane protein, human TMX4, is included in the
           alignment. The active site sequence of TMX4 is CPSC.
          Length = 101

 Score = 46.2 bits (110), Expect = 1e-06
 Identities = 15/39 (38%), Positives = 23/39 (58%)

Query: 338 EDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGK 376
           E  W+I  YAPWC  CQ ++  + E A+  + +G+ V K
Sbjct: 16  EGEWMIEFYAPWCPACQQLQPEWEEFADWSDDLGINVAK 54


>gnl|CDD|239245 cd02947, TRX_family, TRX family; composed of two groups: Group I,
           which includes proteins that exclusively encode a TRX
           domain; and Group II, which are composed of fusion
           proteins of TRX and additional domains. Group I TRX is a
           small ancient protein that alter the redox state of
           target proteins via the reversible oxidation of an
           active site dithiol, present in a CXXC motif, partially
           exposed at the protein's surface. TRX reduces protein
           disulfide bonds, resulting in a disulfide bond at its
           active site. Oxidized TRX is converted to the active
           form by TRX reductase, using reducing equivalents
           derived from either NADPH or ferredoxins. By altering
           their redox state, TRX regulates the functions of at
           least 30 target proteins, some of which are enzymes and
           transcription factors. It also plays an important role
           in the defense against oxidative stress by directly
           reducing hydrogen peroxide and certain radicals, and by
           serving as a reductant for peroxiredoxins. At least two
           major types of functional TRXs have been reported in
           most organisms; in eukaryotes, they are located in the
           cytoplasm and the mitochondria. Higher plants contain
           more types (at least 20 TRX genes have been detected in
           the genome of Arabidopsis thaliana), two of which (types
           f amd m) are located in the same compartment, the
           chloroplast. Also included in the alignment are TRX-like
           domains which show sequence homology to TRX but do not
           contain the redox active CXXC motif. Group II proteins,
           in addition to either a redox active TRX or a TRX-like
           domain, also contain additional domains, which may or
           may not possess homology to known proteins.
          Length = 93

 Score = 44.9 bits (107), Expect = 3e-06
 Identities = 26/74 (35%), Positives = 36/74 (48%), Gaps = 4/74 (5%)

Query: 329 ENLARLQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAK 388
           E    L     P ++  +APWC  C+A+     ELAE  E   VK  K   D ++ E A 
Sbjct: 1   EEFEELIKSAKPVVVDFWAPWCGPCKAIAPVLEELAE--EYPKVKFVKVDVD-ENPELA- 56

Query: 389 QKLQLVSFPTILFF 402
           ++  + S PT LFF
Sbjct: 57  EEYGVRSIPTFLFF 70


>gnl|CDD|237510 PRK13795, PRK13795, hypothetical protein; Provisional.
          Length = 636

 Score = 48.8 bits (117), Expect = 4e-06
 Identities = 54/197 (27%), Positives = 89/197 (45%), Gaps = 45/197 (22%)

Query: 76  IAIAFSGAED--VVLIEYAKLTGRPFRVFSLDTGRLNPETHQFFDTVEKHYGIRIEY--- 130
           ++++FSG +D  VVL + A+   + F+ F  +TG   PET +    V + YGI +     
Sbjct: 246 VSVSFSGGKDSLVVL-DLAREALKDFKAFFNNTGLEFPETVENVKEVAEEYGIELIEADA 304

Query: 131 --TFPNAVEVQALVRTKGLFSFYEDGHQECCRIRKVRPLKRALK--GLRAWIT--GQRKD 184
              F  AVE          F      ++ CC++ K+ P+ RA+K    +  +T  GQRK 
Sbjct: 305 GDAFWRAVEK---------FGPPARDYRWCCKVCKLGPITRAIKENFPKGCLTFVGQRKY 355

Query: 185 QSPGTRAEIPVV--------QIDTSFEGIDGGKGSLVKWNPLANVKGQDIWNFLRAMNIP 236
           +S  +RA+ P V        QI  S               P+ +    ++W ++    +P
Sbjct: 356 ES-FSRAKSPRVWRNPWVPNQIGAS---------------PIQDWTALEVWLYIFWRKLP 399

Query: 237 INSLHSQGYISIGCEPC 253
            N L+ +G+  IGC  C
Sbjct: 400 YNPLYERGFDRIGCWLC 416


>gnl|CDD|239303 cd03005, PDI_a_ERp46, PDIa family, endoplasmic reticulum protein 46
           (ERp46) subfamily; ERp46 is an ER-resident protein
           containing three redox active TRX domains. Yeast
           complementation studies show that ERp46 can substitute
           for protein disulfide isomerase (PDI) function in vivo.
           It has been detected in many tissues, however,
           transcript and protein levels do not correlate in all
           tissues, suggesting regulation at a posttranscriptional
           level. An identical protein, named endoPDI, has been
           identified as an endothelial PDI that is highly
           expressed in the endothelium of tumors and hypoxic
           lesions. It has a protective effect on cells exposed to
           hypoxia.
          Length = 102

 Score = 44.6 bits (106), Expect = 4e-06
 Identities = 26/87 (29%), Positives = 43/87 (49%), Gaps = 6/87 (6%)

Query: 342 LIVLYAPWCHFCQAMEGSYIELAEQLEGM--GVKVGKFRADGDHKEFAKQKLQLVSFPTI 399
            +  +APWC  C+ +  ++ +LA++       VK+ K       +  ++   Q+  +PT+
Sbjct: 20  FVKFFAPWCGHCKRLAPTWEQLAKKFNNENPSVKIAKVDCTQHRELCSEF--QVRGYPTL 77

Query: 400 LFFPKHSAKPVKYPSEKRDVDSLMAFV 426
           L F K   K  KY    RD+DSL  FV
Sbjct: 78  LLF-KDGEKVDKY-KGTRDLDSLKEFV 102


>gnl|CDD|181465 PRK08557, PRK08557, hypothetical protein; Provisional.
          Length = 417

 Score = 48.2 bits (115), Expect = 5e-06
 Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 40/210 (19%)

Query: 62  LEIMDKAFQKFGND---IAIAFSGAED-VVLIEYAKLTGRPFRVFSLDTGRLNPETHQFF 117
           L I+    +K+ N    I  +FSG +D  V    AK       V  +DTG   PET  + 
Sbjct: 167 LSILKDYIEKYKNKGYAINASFSGGKDSSVSTLLAKEVIPDLEVIFIDTGLEYPETINYV 226

Query: 118 DTVEKHYGIRIEYTFPNAVEVQALVRTKGLFSFYEDGHQE---------CCRIRKVRPLK 168
               K Y +              L    G  +F+E+  +E         C    K+ PLK
Sbjct: 227 KDFAKKYDLN-------------LDTLDGD-NFWENLEKEGIPTKDNRWCNSACKLMPLK 272

Query: 169 RALKGLRAW-----ITGQRKDQSPGTRAEIPVVQIDTSFEGIDGGKGSLVKWNPLANVKG 223
             LK          I G RK +S  TRA +        +E   G         P+ +   
Sbjct: 273 EYLKKKYGNKKVLTIDGSRKYES-FTRANL-------DYERKSGFIDFQTNVFPILDWNS 324

Query: 224 QDIWNFLRAMNIPINSLHSQGYISIGCEPC 253
            DIW+++   +I  N L+ +G+  IGC  C
Sbjct: 325 LDIWSYIYLNDILYNPLYDKGFERIGCYLC 354


>gnl|CDD|239295 cd02997, PDI_a_PDIR, PDIa family, PDIR subfamily; composed of
           proteins similar to human PDIR (for Protein Disulfide
           Isomerase Related). PDIR is composed of three redox
           active TRX (a) domains and an N-terminal redox inactive
           TRX-like (b) domain. Similar to PDI, it is involved in
           oxidative protein folding in the endoplasmic reticulum
           (ER) through its isomerase and chaperone activities.
           These activities are lower compared to PDI, probably due
           to PDIR acting only on a subset of proteins. PDIR is
           preferentially expressed in cells actively secreting
           proteins and its expression is induced by stress.
           Similar to PDI, the isomerase and chaperone activities
           of PDIR are independent; CXXC mutants lacking isomerase
           activity retain chaperone activity.
          Length = 104

 Score = 44.6 bits (106), Expect = 6e-06
 Identities = 19/77 (24%), Positives = 35/77 (45%), Gaps = 1/77 (1%)

Query: 329 ENLARLQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKV-GKFRADGDHKEFA 387
           E+  +   +E   L++ YAPWC  C+ M+  + + A +L+  G  V           +  
Sbjct: 8   EDFRKFLKKEKHVLVMFYAPWCGHCKKMKPEFTKAATELKEDGKGVLAAVDCTKPEHDAL 67

Query: 388 KQKLQLVSFPTILFFPK 404
           K++  +  FPT  +F  
Sbjct: 68  KEEYNVKGFPTFKYFEN 84


>gnl|CDD|200072 TIGR01068, thioredoxin, thioredoxin.  Several proteins, such as
           protein disulfide isomerase, have two or more copies of
           a domain closely related to thioredoxin. This model is
           designed to recognize authentic thioredoxin, a small
           protein that should be hit exactly once by This model.
           Any protein that hits once with a score greater than the
           second (per domain) trusted cutoff may be taken as
           thioredoxin [Energy metabolism, Electron transport].
          Length = 101

 Score = 43.4 bits (103), Expect = 1e-05
 Identities = 25/78 (32%), Positives = 39/78 (50%), Gaps = 5/78 (6%)

Query: 334 LQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQKLQL 393
           + + + P L+  +APWC  C+ +     ELA++ EG  VK  K   D ++ + A  K  +
Sbjct: 10  IASSDKPVLVDFWAPWCGPCKMIAPILEELAKEYEG-KVKFVKLNVD-ENPDIA-AKYGI 66

Query: 394 VSFPTILFFPKHSAKPVK 411
            S PT+L F     K V 
Sbjct: 67  RSIPTLLLFKN--GKEVD 82


>gnl|CDD|223600 COG0526, TrxA, Thiol-disulfide isomerase and thioredoxins
           [Posttranslational modification, protein turnover,
           chaperones / Energy production and conversion].
          Length = 127

 Score = 44.1 bits (103), Expect = 2e-05
 Identities = 20/61 (32%), Positives = 29/61 (47%)

Query: 342 LIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQKLQLVSFPTILF 401
           L+  +APWC  C+A      ELAE+  G    V     D +    A+  + + S PT+L 
Sbjct: 36  LVDFWAPWCPPCRAEAPLLEELAEEYGGDVEVVAVNVDDENPDLAAEFGVAVRSIPTLLL 95

Query: 402 F 402
           F
Sbjct: 96  F 96


>gnl|CDD|236300 PRK08576, PRK08576, hypothetical protein; Provisional.
          Length = 438

 Score = 45.5 bits (108), Expect = 4e-05
 Identities = 50/202 (24%), Positives = 85/202 (42%), Gaps = 26/202 (12%)

Query: 62  LEIMDKAFQKF-----GNDIAIAFSGAED--VVLIEYAKLTGRPFRVFSLDTGRLNPETH 114
           LE  +KA  KF        + + +SG +D    L+   K  G    V+ +DTG   P T 
Sbjct: 218 LEAFEKASIKFLRKFEEWTVIVPWSGGKDSTAALLLAKKAFGDVTAVY-VDTGYEMPLTD 276

Query: 115 QFFDTVEKHYGIRIEYTFPNAVEVQALVRTKGLFSFYEDGHQE-CCRIRKVRPLKRALKG 173
           ++ + V +  G+ +       V+V   +   G+ +     H    C   KV  L+ A++ 
Sbjct: 277 EYVEKVAEKLGVDLIRA---GVDVPMPIEKYGMPT-----HSNRWCTKLKVEALEEAIRE 328

Query: 174 LR--AWITGQRKDQSPGTRAEIPVVQIDTSFEGIDGGKGSLVKWNPLANVKGQDIWNFLR 231
           L     + G R  +S   R   PVV+  T+F  I      LV   P+    G  +  ++ 
Sbjct: 329 LEDGLLVVGDRDGESARRRLRPPVVERKTNFGKI------LVV-MPIKFWSGAMVQLYIL 381

Query: 232 AMNIPINSLHSQGYISIGCEPC 253
              + +N L+ +G+  +GC  C
Sbjct: 382 MNGLELNPLYYKGFYRLGCYIC 403


>gnl|CDD|239261 cd02963, TRX_DnaJ, TRX domain, DnaJ domain containing protein
           family; composed of uncharacterized proteins of about
           500-800 amino acids, containing an N-terminal DnaJ
           domain followed by one redox active TRX domain. DnaJ is
           a member of the 40 kDa heat-shock protein (Hsp40) family
           of molecular chaperones, which regulate the activity of
           Hsp70s. TRX is involved in the redox regulation of many
           protein substrates through the reduction of disulfide
           bonds. TRX has been implicated to catalyse the reduction
           of Hsp33, a chaperone holdase that binds to unfolded
           protein intermediates. The presence of DnaJ and TRX
           domains in members of this family suggests that they
           could be involved in a redox-regulated chaperone
           network.
          Length = 111

 Score = 42.0 bits (99), Expect = 4e-05
 Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 4/88 (4%)

Query: 313 DLFNSQKLVSFRRTGIENLARLQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGV 372
           D F+ +  ++F +   EN    ++ + P+LI + + WC  C  +E  + E+ ++LE +GV
Sbjct: 1   DSFDYKYSLTFSQ--YENEIVPKSFKKPYLIKITSDWCFSCIHIEPVWKEVIQELEPLGV 58

Query: 373 KVGKFRADGDHKEFAKQKLQLVSFPTIL 400
            +     +  H+    +KL   S P I+
Sbjct: 59  GIAT--VNAGHERRLARKLGAHSVPAIV 84


>gnl|CDD|238829 cd01659, TRX_superfamily, Thioredoxin (TRX) superfamily; a large,
           diverse group of proteins containing a TRX-fold. Many
           members contain a classic TRX domain with a redox active
           CXXC motif. They function as protein disulfide
           oxidoreductases (PDOs), altering the redox state of
           target proteins via the reversible oxidation of their
           active site dithiol. The PDO members of this superfamily
           include TRX, protein disulfide isomerase (PDI),
           tlpA-like, glutaredoxin, NrdH redoxin, and the bacterial
           Dsb (DsbA, DsbC, DsbG, DsbE, DsbDgamma) protein
           families. Members of the superfamily that do not
           function as PDOs but contain a TRX-fold domain include
           phosducins, peroxiredoxins and glutathione (GSH)
           peroxidases, SCO proteins, GSH transferases (GST,
           N-terminal domain), arsenic reductases, TRX-like
           ferredoxins and calsequestrin, among others.
          Length = 69

 Score = 41.1 bits (96), Expect = 5e-05
 Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 3/62 (4%)

Query: 342 LIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGD-HKEFAKQKLQLVSFPTIL 400
           L++ YAPWC FCQA+     ELA   +  GVK      D D   E   ++  +   PT++
Sbjct: 1   LVLFYAPWCPFCQALRPVLAELALLNK--GVKFEAVDVDEDPALEKELKRYGVGGVPTLV 58

Query: 401 FF 402
            F
Sbjct: 59  VF 60


>gnl|CDD|131094 TIGR02039, CysD, sulfate adenylyltransferase, small subunit.
           Metabolic assimilation of sulfur from inorganic sulfate,
           requires sulfate activation by coupling to a nucleoside,
           for the production of high-energy nucleoside
           phosphosulfates. This pathway appears to be similar in
           all prokaryotic organisms. Activation is first achieved
           through sulfation of sulfate with ATP by sulfate
           adenylyltransferase (ATP sulfurylase) to produce
           5'-phosphosulfate (APS), coupled by GTP hydrolysis.
           Subsequently, APS is phosphorylated by an APS kinase to
           produce 3'-phosphoadenosine-5'-phosphosulfate (PAPS). In
           Escherichia coli, ATP sulfurylase is a heterodimer
           composed of two subunits encoded by cysD and cysN, with
           APS kinase encoded by cysC. These genes are located in a
           unidirectionally transcribed gene cluster, and have been
           shown to be required for the synthesis of
           sulfur-containing amino acids. Homologous to this E.coli
           activation pathway are nodPQH gene products found among
           members of the Rhizobiaceae family. These gene products
           have been shown to exhibit ATP sulfurase and APS kinase
           activity, yet are involved in Nod factor sulfation, and
           sulfation of other macromolecules [Central intermediary
           metabolism, Sulfur metabolism].
          Length = 294

 Score = 40.9 bits (96), Expect = 8e-04
 Identities = 60/238 (25%), Positives = 97/238 (40%), Gaps = 47/238 (19%)

Query: 54  RGMESASPLEIMDKAFQKFGNDIAIAFSGAED-VVLIEYAKLT---GR-PFRVFSLDTGR 108
           R +ES + + I+ +   +F   + + +S  +D  VL+  A+     G  PF +  +DTG 
Sbjct: 2   RALESEA-IHIIREVAAEFERPV-MLYSIGKDSSVLLHLARKAFYPGPLPFPLLHVDTGW 59

Query: 109 LNPETHQFFDTVEKHYGIR-IEYTFPNAVEVQALVRTKGLFSFYEDGHQECCRIRKVRPL 167
              E   F D +   YG+R I ++    +         G+  F  +G      I K   L
Sbjct: 60  KFREMIAFRDHMVAKYGLRLIVHSNEEGIA-------DGINPF-TEGSALHTDIMKTEAL 111

Query: 168 KRALK--GLRAWITGQRKDQSPGTRAEIPVVQIDTSFEGIDG---------------GKG 210
           ++AL      A   G R+D+   +RA+  +     +F   D                 KG
Sbjct: 112 RQALDKNQFDAAFGGARRDEEK-SRAKERIFSFRNAFHQWDPKKQRPELWNLYNGRISKG 170

Query: 211 SLVKWNPLANVKGQDIWNFLRAMNIPINSLHSQGYISIGCEPCTRPVLPGQHEREGRW 268
             V+  PL+N    DIW ++ A NIPI  L    Y +       RPV+    +R+G  
Sbjct: 171 ESVRVFPLSNWTELDIWRYIAAENIPIVPL----YFA-----AKRPVV----QRDGML 215


>gnl|CDD|239297 cd02999, PDI_a_ERp44_like, PDIa family, endoplasmic reticulum
           protein 44 (ERp44)-like subfamily; composed of
           uncharacterized PDI-like eukaryotic proteins containing
           only one redox active TRX (a) domain with a CXXS motif,
           similar to ERp44. CXXS is still a redox active motif;
           however, the mixed disulfide formed with the substrate
           is more stable than those formed by CXXC motif proteins.
           PDI-related proteins are usually involved in the
           oxidative protein folding in the ER by acting as
           catalysts and folding assistants. ERp44 is involved in
           thiol-mediated retention in the ER.
          Length = 100

 Score = 37.3 bits (87), Expect = 0.002
 Identities = 26/93 (27%), Positives = 39/93 (41%), Gaps = 6/93 (6%)

Query: 334 LQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQKLQL 393
             NRED   ++ YA WC F  +    +  L+       ++                +  +
Sbjct: 14  AFNREDYTAVLFYASWCPFSASFRPHFNALSSMFP--QIRHLAIEESSIKP-SLLSRYGV 70

Query: 394 VSFPTILFFPKHSAKPVKYPSEKRDVDSLMAFV 426
           V FPTIL F  +S   V+Y +  R +DSL AF 
Sbjct: 71  VGFPTILLF--NSTPRVRY-NGTRTLDSLAAFY 100


>gnl|CDD|226685 COG4232, COG4232, Thiol:disulfide interchange protein
           [Posttranslational modification, protein turnover,
           chaperones / Energy production and conversion].
          Length = 569

 Score = 39.7 bits (93), Expect = 0.003
 Identities = 20/88 (22%), Positives = 34/88 (38%), Gaps = 9/88 (10%)

Query: 332 ARLQNREDPWLIVLYAPWCHFCQAMEG-SYIELAEQLEGMGVKVGKFRAD-----GDHKE 385
           A  + +  P ++  YA WC  C+  E  ++ +   Q     V +   +AD          
Sbjct: 468 ALAEAKAKPVMLDFYADWCVTCKENEKYTFSDPQVQQALQDVVL--LQADVTANDPAITA 525

Query: 386 FAKQKLQLVSFPTILFFPKHSAKPVKYP 413
             K+ L +   PT LFF    ++P    
Sbjct: 526 LLKR-LGVFGVPTYLFFGPQGSEPEILT 552


>gnl|CDD|235375 PRK05253, PRK05253, sulfate adenylyltransferase subunit 2;
           Provisional.
          Length = 301

 Score = 38.6 bits (91), Expect = 0.004
 Identities = 60/238 (25%), Positives = 89/238 (37%), Gaps = 58/238 (24%)

Query: 64  IMDKAFQKFGNDIAIAFSGAED-VVLIEYAKLTGRPFRV-FSL---DTGRLNPETHQFFD 118
           I+ +   +F N   + +S  +D  V++  A+    P ++ F L   DTG   PE  +F D
Sbjct: 19  ILREVAAEFEN-PVMLYSIGKDSSVMLHLARKAFYPGKLPFPLLHVDTGWKFPEMIEFRD 77

Query: 119 TVEKHYGIR-IEYTFPNAVEVQALVRTKGLFSFYEDGHQECCRIRKVRPLKRAL--KGLR 175
              K  G+  I ++ P  +      R    F     G  +     K   LK+AL   G  
Sbjct: 78  RRAKELGLELIVHSNPEGIA-----RGINPF---RHGSAKHTNAMKTEGLKQALEKYGFD 129

Query: 176 AWITGQRKDQSPGTRAEIPVVQIDTSFEGIDGGKGSLVK------WN------------- 216
           A   G R+D+   +RA+  +      F   D       K      WN             
Sbjct: 130 AAFGGARRDEE-KSRAKERIFSFRDEFGQWD------PKNQRPELWNLYNGRINKGEHIR 182

Query: 217 --PLANVKGQDIWNFLRAMNIPINSLHSQGYISIGCEPCTRPVLPGQHEREGRWWWED 272
             PL+N    DIW ++   NIPI  L    Y +       RPV+    ER+G     D
Sbjct: 183 VFPLSNWTELDIWQYIERENIPIVPL----YFAHE-----RPVV----ERDGMLIMVD 227


>gnl|CDD|239251 cd02953, DsbDgamma, DsbD gamma family; DsbD gamma is the C-terminal
           periplasmic domain of the bacterial protein DsbD. It
           contains a CXXC motif in a TRX fold and shuttles the
           reducing potential from the membrane domain (DsbD beta)
           to the N-terminal periplasmic domain (DsbD alpha).  DsbD
           beta, a transmembrane domain comprising of eight
           helices, acquires its reducing potential from the
           cytoplasmic thioredoxin. DsbD alpha transfers the
           acquired reducing potential from DsbD gamma to target
           proteins such as the periplasmic protein disulphide
           isomerases, DsbC and DsbG. This flow of reducing
           potential from the cytoplasm through DsbD allows DsbC
           and DsbG to act as isomerases in the oxidizing
           environment of the bacterial periplasm. DsbD also
           transfers reducing potential from the cytoplasm to
           specific reductases in the periplasm which are involved
           in the maturation of cytochromes.
          Length = 104

 Score = 36.0 bits (84), Expect = 0.005
 Identities = 18/84 (21%), Positives = 27/84 (32%), Gaps = 6/84 (7%)

Query: 331 LARLQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADG-----DHKE 385
           LA+   +  P  +   A WC  C+  E       E    +   V   RAD      +   
Sbjct: 4   LAQALAQGKPVFVDFTADWCVTCKVNEKVVFSDPEVQAALKKDVVLLRADWTKNDPEITA 63

Query: 386 FAKQKLQLVSFPTILFFPKHSAKP 409
             K+   +   PT LF+       
Sbjct: 64  LLKR-FGVFGPPTYLFYGPGGEPE 86


>gnl|CDD|221921 pfam13098, Thioredoxin_2, Thioredoxin-like domain. 
          Length = 105

 Score = 35.1 bits (81), Expect = 0.011
 Identities = 15/86 (17%), Positives = 28/86 (32%), Gaps = 15/86 (17%)

Query: 342 LIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKV--------------GKFRADGDHKEFA 387
           L+V   P C +C+ +    ++  +  E +                           KE A
Sbjct: 9   LVVFTDPDCPYCKKLHKELLKDPDVQEYLKDNFVVIYVNVDDSKEVTDFDGETLSEKELA 68

Query: 388 KQKLQLVSFPTILFFPKHSAKPVKYP 413
            +K  +   PTI+F      +  + P
Sbjct: 69  -RKYGVRGTPTIVFLDGDGKEVARLP 93


>gnl|CDD|222442 pfam13899, Thioredoxin_7, Thioredoxin-like.  Thioredoxins are small
           enzymes that participate in redox reactions, via the
           reversible oxidation of an active centre disulfide bond.
          Length = 81

 Score = 32.6 bits (75), Expect = 0.051
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query: 329 ENLARLQNREDPWLIVLYAPWCHFCQAMEG 358
           E LA+ +    P L+   A WC  CQA++ 
Sbjct: 8   EALAKAREEGKPLLVDFGADWCPTCQALDR 37


>gnl|CDD|225660 COG3118, COG3118, Thioredoxin domain-containing protein
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 304

 Score = 35.0 bits (81), Expect = 0.065
 Identities = 21/66 (31%), Positives = 32/66 (48%), Gaps = 3/66 (4%)

Query: 337 REDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQKLQLVSF 396
           RE P L+  +APWC  C+ +  +  +LA + +G   K+ K   D      A Q   + S 
Sbjct: 42  REVPVLVDFWAPWCGPCKQLTPTLEKLAAEYKG-KFKLAKVNCD-AEPMVAAQ-FGVQSI 98

Query: 397 PTILFF 402
           PT+  F
Sbjct: 99  PTVYAF 104


>gnl|CDD|234717 PRK00293, dipZ, thiol:disulfide interchange protein precursor;
           Provisional.
          Length = 571

 Score = 35.2 bits (82), Expect = 0.067
 Identities = 26/110 (23%), Positives = 45/110 (40%), Gaps = 24/110 (21%)

Query: 309 APASDLFNSQKLVSFRRTGIENLARLQ-------NREDPWLIVLYAPWCHFCQAMEGSYI 361
             A+    +Q  ++F+R  I+ +A L         +  P ++ LYA WC  C+  E  Y 
Sbjct: 440 GGAAAGAQTQAHLNFQR--IKTVAELDQALAEAKGKGKPVMLDLYADWCVACKEFE-KYT 496

Query: 362 ----ELAEQLEGMGVKVGKFRAD-----GDHKEFAKQKLQLVSFPTILFF 402
               ++ + L    +     +AD      +     K    ++  PTILFF
Sbjct: 497 FSDPQVQQALADTVL----LQADVTANNAEDVALLKH-YNVLGLPTILFF 541


>gnl|CDD|239294 cd02996, PDI_a_ERp44, PDIa family, endoplasmic reticulum protein 44
           (ERp44) subfamily; ERp44 is an ER-resident protein,
           induced during stress, involved in thiol-mediated ER
           retention. It contains an N-terminal TRX domain, similar
           to that of PDIa, with a CXFS motif followed by two redox
           inactive TRX-like domains, homologous to the b and b'
           domains of PDI. The CXFS motif in the N-terminal domain
           allows ERp44 to form stable reversible mixed disulfides
           with its substrates. Through this activity, ERp44
           mediates the ER localization of Ero1alpha, a protein
           that oxidizes protein disulfide isomerases into their
           active form. ERp44 also prevents the secretion of
           unassembled cargo protein with unpaired cysteines. It
           also modulates the activity of inositol
           1,4,5-triphosphate type I receptor (IP3R1), an
           intracellular channel protein that mediates calcium
           release from the ER to the cytosol.
          Length = 108

 Score = 31.2 bits (71), Expect = 0.27
 Identities = 24/89 (26%), Positives = 41/89 (46%), Gaps = 6/89 (6%)

Query: 342 LIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKF---RADGDHKEFAKQKLQLVSFPT 398
           L+  YA WC F Q +   + E A +++      GK    + D D +     + ++  +PT
Sbjct: 22  LVNFYADWCRFSQMLHPIFEEAAAKIKEEFPDAGKVVWGKVDCDKESDIADRYRINKYPT 81

Query: 399 I-LFFPKHSAKPVKYPSEKRDVDSLMAFV 426
           + LF      K  +Y  + R V++L  FV
Sbjct: 82  LKLFRNGMMMK-REYRGQ-RSVEALAEFV 108


>gnl|CDD|182889 PRK10996, PRK10996, thioredoxin 2; Provisional.
          Length = 139

 Score = 31.2 bits (71), Expect = 0.43
 Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 11/78 (14%)

Query: 329 ENLARLQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAK 388
           E L +L   + P +I  +APWC  C+     + ++A +      + GK R    + E A+
Sbjct: 43  ETLDKLLQDDLPVVIDFWAPWCGPCRNFAPIFEDVAAE------RSGKVRFVKVNTE-AE 95

Query: 389 QKL----QLVSFPTILFF 402
           ++L    ++ S PTI+ F
Sbjct: 96  RELSARFRIRSIPTIMIF 113


>gnl|CDD|239248 cd02950, TxlA, TRX-like protein A (TxlA) family; TxlA was
           originally isolated from the cyanobacterium
           Synechococcus. It is found only in oxygenic
           photosynthetic organisms. TRX is a small enzyme that
           participate in redox reactions, via the reversible
           oxidation of an active site dithiol present in a CXXC
           motif. Disruption of the txlA gene suggests that the
           protein is involved in the redox regulation  of the
           structure and function of photosynthetic apparatus. The
           plant homolog (designated as HCF164) is localized in the
           chloroplast and is involved in the assembly of the
           cytochrome b6f complex, which takes a central position
           in photosynthetic electron transport.
          Length = 142

 Score = 30.8 bits (70), Expect = 0.61
 Identities = 10/30 (33%), Positives = 14/30 (46%)

Query: 340 PWLIVLYAPWCHFCQAMEGSYIELAEQLEG 369
           P L+  YA WC  CQ M     +L ++   
Sbjct: 22  PTLVEFYADWCTVCQEMAPDVAKLKQKYGD 51


>gnl|CDD|222416 pfam13848, Thioredoxin_6, Thioredoxin-like domain. 
          Length = 183

 Score = 30.8 bits (70), Expect = 0.78
 Identities = 19/46 (41%), Positives = 22/46 (47%), Gaps = 3/46 (6%)

Query: 383 HKEFAKQKLQLVSFPTILFFPKHSAKPVKYPSE-KRDVDSLMAFVN 427
            KE AK+    +  PTIL F K   K V YP E K   D L  F+ 
Sbjct: 30  DKEVAKK--YGIKEPTILLFRKFDEKQVTYPGEDKTTFDDLKKFIQ 73


>gnl|CDD|239264 cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA,
           ResA, DsbE and similar proteins. TlpA, ResA and DsbE are
           bacterial protein disulfide reductases with important
           roles in cytochrome maturation. They are
           membrane-anchored proteins with a soluble TRX domain
           containing a CXXC motif located in the periplasm. The
           TRX domains of this family contain an insert,
           approximately 25 residues in length, which correspond to
           an extra alpha helix and a beta strand when compared
           with TRX. TlpA catalyzes an essential reaction in the
           biogenesis of cytochrome aa3, while ResA and DsbE are
           essential proteins in cytochrome c maturation. Also
           included in this family are proteins containing a
           TlpA-like TRX domain with domain architectures similar
           to E. coli DipZ protein, and the N-terminal TRX domain
           of PilB protein from Neisseria which acts as a disulfide
           reductase that can recylce methionine sulfoxide
           reductases.
          Length = 116

 Score = 29.5 bits (67), Expect = 1.1
 Identities = 19/75 (25%), Positives = 34/75 (45%), Gaps = 8/75 (10%)

Query: 330 NLARLQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKV----GKFRADGDHKE 385
           +L+ L+ +    L+  +A WC  C+A       LA++ +  GV+V             K 
Sbjct: 13  SLSDLKGK--VVLVNFWASWCPPCRAEMPELEALAKEYKDDGVEVVGVNVDDDDPAAVKA 70

Query: 386 FAKQKLQLVSFPTIL 400
           F K+    ++FP +L
Sbjct: 71  FLKKY--GITFPVLL 83


>gnl|CDD|222346 pfam13728, TraF, F plasmid transfer operon protein.  TraF protein
           undergoes proteolytic processing associated with export.
           The 19 amino acids at the amino terminus of the
           polypeptides appear to constitute a typical membrane
           leader peptide - not included in this family, while the
           remainder of the molecule is predicted to be primarily
           hydrophilic in character. F plasmid TraF and TraH are
           required for F pilus assembly and F plasmid transfer,
           and they are both localised to the outer membrane in the
           presence of the complete F transfer region, especially
           TraV, the putative anchor.
          Length = 215

 Score = 29.9 bits (68), Expect = 1.9
 Identities = 19/83 (22%), Positives = 28/83 (33%), Gaps = 16/83 (19%)

Query: 342 LIVLYAPWCHFCQAMEGSYIELAEQ--LEGMGVKV-GKF-------RADGDHKEFAKQKL 391
           L   Y   C +CQA        A++     + V V G         R D        Q+L
Sbjct: 124 LFFFYRGDCPYCQAQAPILQAFADKYGFSVIPVSVDGGPLPGFPNNRVDTGQ----AQRL 179

Query: 392 QLVSFPTILFFPKHSA--KPVKY 412
            + + P +      +   KPV Y
Sbjct: 180 GVKTTPALFLVDPKTGKVKPVAY 202


>gnl|CDD|234119 TIGR03140, AhpF, alkyl hydroperoxide reductase subunit F.  This
           enzyme is the partner of the peroxiredoxin (alkyl
           hydroperoxide reductase) AhpC which contains the
           peroxide-reactive cysteine. AhpF contains the reductant
           (NAD(P)H) binding domain (pfam00070) and presumably acts
           to resolve the disulfide which forms after oxidation of
           the active site cysteine in AphC. This proteins contains
           two paired conserved cysteine motifs, CxxCP and CxHCDGP
           [Cellular processes, Detoxification, Cellular processes,
           Adaptations to atypical conditions].
          Length = 515

 Score = 30.0 bits (68), Expect = 2.3
 Identities = 22/78 (28%), Positives = 30/78 (38%), Gaps = 17/78 (21%)

Query: 186 SPGTRAEIPVVQIDTSFEGIDGGK--GSLVKWNPLA--NVKG------QDIWNFLRAMNI 235
            P              F GI GG    SLV    LA   V G      + I + +R +N 
Sbjct: 62  KPSFTILRDGADTGIRFAGIPGGHEFTSLV----LAILQVGGHGPKLDEGIIDRIRRLN- 116

Query: 236 PINSLHSQGYISIGCEPC 253
               LH + Y+S+ C+ C
Sbjct: 117 --GPLHFETYVSLTCQNC 132


>gnl|CDD|239290 cd02992, PDI_a_QSOX, PDIa family, Quiescin-sulfhydryl oxidase
           (QSOX) subfamily; QSOX is a eukaryotic protein
           containing an N-terminal redox active TRX domain,
           similar to that of PDI, and a small C-terminal flavin
           adenine dinucleotide (FAD)-binding domain homologous to
           the yeast ERV1p protein. QSOX oxidizes thiol groups to
           disulfides like PDI, however, unlike PDI, this oxidation
           is accompanied by the reduction of oxygen to hydrogen
           peroxide. QSOX is localized in high concentrations in
           cells with heavy secretory load and prefers peptides and
           proteins as substrates, not monothiols like glutathione.
           Inside the cell, QSOX is found in the endoplasmic
           reticulum and Golgi. The flow of reducing equivalents in
           a QSOX-catalyzed reaction goes from the dithiol
           substrate -> dithiol of the QSOX TRX domain -> dithiols
           of the QSOX ERV1p domain -> FAD -> oxygen.
          Length = 114

 Score = 28.0 bits (63), Expect = 3.4
 Identities = 12/38 (31%), Positives = 18/38 (47%)

Query: 332 ARLQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEG 369
           + L      WL+  YA WC  C+A   ++ +LA  L  
Sbjct: 13  SALLGSPSAWLVEFYASWCGHCRAFAPTWKKLARDLRK 50


>gnl|CDD|114761 pfam06057, VirJ, Bacterial virulence protein (VirJ).  This family
           consists of several bacterial VirJ virulence proteins.
           VirJ is thought to be involved in the type IV secretion
           system. It is thought that the substrate proteins
           localised to the periplasm may associate with the pilus
           in a manner that is mediated by VirJ, and suggest a
           two-step process for type IV secretion in Agrobacterium.
          Length = 192

 Score = 28.8 bits (65), Expect = 3.7
 Identities = 14/67 (20%), Positives = 20/67 (29%), Gaps = 14/67 (20%)

Query: 3   LSSSSSAAAALLNGSGSISSSFAVCYYGPHHK----------GVEGRIESTNDH---EDY 49
           L       A  +     +  +   C YG   K          G E  +     H    DY
Sbjct: 119 LGWDGEGKADPVPDLARLPPARVQCIYGQDEKDTACPSLRQRGAE-VVALPGGHHFDGDY 177

Query: 50  EKLARGM 56
           E LA+ +
Sbjct: 178 EALAKRI 184


>gnl|CDD|239254 cd02956, ybbN, ybbN protein family; ybbN is a hypothetical protein
           containing a redox-inactive TRX-like domain. Its gene
           has been sequenced from several gammaproteobacteria and
           actinobacteria.
          Length = 96

 Score = 27.6 bits (62), Expect = 4.3
 Identities = 15/77 (19%), Positives = 33/77 (42%), Gaps = 5/77 (6%)

Query: 334 LQNREDPWLIVLYAPWCHFCQAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQKLQL 393
            ++ + P ++  +AP     + +      LAE+ +G  V     + + D +    Q+  +
Sbjct: 8   QESTQVPVVVDFWAPRSPPSKELLPLLERLAEEYQGQFVLA---KVNCDAQPQIAQQFGV 64

Query: 394 VSFPTILFFPKHSAKPV 410
            + PT+  F     +PV
Sbjct: 65  QALPTVYLFAA--GQPV 79


>gnl|CDD|212094 cd11556, SLC6sbd_SERT-like_u1, uncharacterized subgroup of the
           SERT-like Na(+)- and Cl(-)-dependent monoamine
           transporter subfamily; solute binding domain.  SERT-like
           Na(+)- and Cl(-)-dependent monoamine transporters,
           transport monoamine neurotransmitters from synaptic
           spaces into presynaptic neurons. Members include: the
           norepinephrine transporter NET, the serotonin
           transporter SERT , and the dopamine transporter DAT1.
           These latter may play a role in diseases or disorders
           including depression, anxiety disorders, and
           attention-deficit hyperactivity disorder, and in the
           control of human behavior and emotional states. They
           belongs to the solute carrier 6 (SLC6) transporter
           family. Members of this subgroup are uncharacterized.
          Length = 552

 Score = 29.4 bits (66), Expect = 4.4
 Identities = 12/56 (21%), Positives = 23/56 (41%), Gaps = 6/56 (10%)

Query: 291 SQHINING-NGVAQHTNGSAPASDLFNSQKLVSFRRTGIENLARLQNREDPWLIVL 345
           S + + +  N  A   N ++ A + F    L      G+ +L  ++     W I+L
Sbjct: 146 SANQSFSATNISAAEENFTSAALEYFERGVLELHVSRGVTDLGNIR-----WQILL 196


>gnl|CDD|223848 COG0777, AccD, Acetyl-CoA carboxylase beta subunit [Lipid
           metabolism].
          Length = 294

 Score = 29.2 bits (66), Expect = 4.5
 Identities = 21/79 (26%), Positives = 33/79 (41%), Gaps = 2/79 (2%)

Query: 22  SSFAVCYYGPHHK--GVEGRIESTNDHEDYEKLARGMESASPLEIMDKAFQKFGNDIAIA 79
           S+  VC    HH       R+E+  D   +E+L   +E   PL+  D    K   + A  
Sbjct: 45  SNLKVCPKCGHHMRISARERLEALLDEGSFEELDSPLEPKDPLKFPDSKKYKDRLEAARK 104

Query: 80  FSGAEDVVLIEYAKLTGRP 98
            +G +D V+     + G P
Sbjct: 105 KTGLDDAVVTGEGTINGLP 123


>gnl|CDD|237138 PRK12563, PRK12563, sulfate adenylyltransferase subunit 2;
           Provisional.
          Length = 312

 Score = 29.0 bits (65), Expect = 4.6
 Identities = 45/208 (21%), Positives = 81/208 (38%), Gaps = 32/208 (15%)

Query: 56  MESASPLEIMDKAFQKFGNDIAIAFSGAEDVVLIEYAKLTGR----PFRVFSLDTGRLNP 111
           +E+ S + I+ +   +    + +   G + VV++  A    R    PF +  +DT     
Sbjct: 22  LEAES-IHILREVVAECSKPVMLYSIGKDSVVMLHLAMKAFRPTRPPFPLLHVDTTWKFR 80

Query: 112 ETHQFFDTVEKHYGIR-IEYTFPNAVEVQALVRTKGLFSFYEDGHQECCRIRKVRPLKRA 170
           E   F D   K  G+  + +  P+ +      R    F      H +   + K + LK+A
Sbjct: 81  EMIDFRDRRAKELGLDLVVHHNPDGIA-----RGIVPFRHGSALHTD---VAKTQGLKQA 132

Query: 171 LK--GLRAWITGQRKDQSPGTRAEIPVVQIDTSFEGIDGGKGSLVKWN------------ 216
           L   G  A I G R+D+   +RA+  +    ++F   D        W+            
Sbjct: 133 LDHHGFDAAIGGARRDEEK-SRAKERIFSFRSAFHRWDPKAQRPELWSLYNARLRRGESL 191

Query: 217 ---PLANVKGQDIWNFLRAMNIPINSLH 241
              PL+N    D+W ++    IP+  L+
Sbjct: 192 RVFPLSNWTELDVWQYIAREKIPLVPLY 219


>gnl|CDD|178645 PLN03097, FHY3, Protein FAR-RED ELONGATED HYPOCOTYL 3; Provisional.
          Length = 846

 Score = 29.2 bits (65), Expect = 5.0
 Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 1/50 (2%)

Query: 298 GNGVAQHTNGSAPASD-LFNSQKLVSFRRTGIENLARLQNREDPWLIVLY 346
           G  + QH N  A     ++ S     F +   + L R + +ED W+  LY
Sbjct: 388 GQVIKQHENFMAKFEKCIYRSWTEEEFGKRWWKILDRFELKEDEWMQSLY 437


>gnl|CDD|232945 TIGR00376, TIGR00376, DNA helicase, putative.  The gene product may
           represent a DNA helicase. Eukaryotic members of this
           family have been characterized as binding certain
           single-stranded G-rich DNA sequences (GGGGT and GGGCT).
           A number of related proteins are characterized as
           helicases [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 637

 Score = 29.0 bits (65), Expect = 6.4
 Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 6/53 (11%)

Query: 192 EIPVVQIDTSFEGIDGG----KGSLVKWNPLANVKGQDIWNFLRAMNIPINSL 240
            IP++ IDTS  G +        S  K+NP       +I   L  M +P N +
Sbjct: 480 GIPLLFIDTS--GCELFELKEADSTSKYNPGEAELVSEIIQALVKMGVPANDI 530


>gnl|CDD|223592 COG0518, GuaA, GMP synthase - Glutamine amidotransferase domain
           [Nucleotide transport and metabolism].
          Length = 198

 Score = 28.0 bits (63), Expect = 7.0
 Identities = 15/73 (20%), Positives = 22/73 (30%), Gaps = 20/73 (27%)

Query: 337 REDPWLIVLYAPWCHF---C---QAMEGSYIELAEQLEGMGVKVGKFRADGDHKEFAKQK 390
           RE   +     P       C   Q +       A+ L   G KV +    G  +E     
Sbjct: 67  REKDLIKDAGVPGKPVLGICLGHQLL-------AKAL---GGKVER----GPKREIGWTP 112

Query: 391 LQLVSFPTILFFP 403
           ++L      LF  
Sbjct: 113 VELTEGDDPLFAG 125


>gnl|CDD|232867 TIGR00194, uvrC, excinuclease ABC, C subunit.  This family consists
           of the DNA repair enzyme UvrC, an ABC excinuclease
           subunit which interacts with the UvrA/UvrB complex to
           excise UV-damaged nucleotide segments [DNA metabolism,
           DNA replication, recombination, and repair].
          Length = 574

 Score = 28.5 bits (64), Expect = 7.4
 Identities = 24/101 (23%), Positives = 43/101 (42%), Gaps = 12/101 (11%)

Query: 33  HKGVEGRIESTNDHEDYEKLARGMES-ASPLEIMDKAFQKFGNDI---AIAFSGAED--- 85
            K +E ++E  +++ ++E+ AR  +  A+  E+ +K      + I    IA +   +   
Sbjct: 196 IKELEQKMEKASENLEFEEAARIRDQIAAVRELNEKQHVSLTDLIDLDIIAVAFDGNVAA 255

Query: 86  --VVLIEYAKLTGRPFRVFSLDTGRLNPETHQFFDTVEKHY 124
             V  I   KL GR    FSL    L+     F   + + Y
Sbjct: 256 IQVFFIRQGKLIGRDQFDFSLPGTDLDELVETF---LIQFY 293


>gnl|CDD|236266 PRK08447, PRK08447, ribonucleotide-diphosphate reductase subunit
           alpha; Validated.
          Length = 789

 Score = 28.5 bits (64), Expect = 7.7
 Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 13/57 (22%)

Query: 87  VLIEYAKLTGRPFRVFSLDTGRLNPETHQ------------FFDTVEKHYGIRIEYT 131
           +L  Y + TG PF  F  +  R NP  H             F +T   HY I+IE+ 
Sbjct: 390 ILTNYFE-TGLPFLCFKDNANRANPNAHAGIIRSSNLCTEIFQNTAPNHYKIKIEFE 445


>gnl|CDD|239304 cd03006, PDI_a_EFP1_N, PDIa family, N-terminal EFP1 subfamily; EFP1
           is a binding partner protein of thyroid oxidase (ThOX),
           also called Duox. ThOX proteins are responsible for the
           generation of hydrogen peroxide, a crucial substrate of
           thyroperoxidase, which functions to iodinate
           thyroglobulin and synthesize thyroid hormones. EFP1 was
           isolated through a yeast two-hybrid method using the
           EF-hand fragment of dog Duox1 as a bait. It could be one
           of the partners in the assembly of a multiprotein
           complex constituting the thyroid hydrogen peroxide
           generating system. EFP1 contains two TRX domains related
           to the redox active TRX domains of protein disulfide
           isomerase (PDI). This subfamily is composed of the
           N-terminal TRX domain of EFP1, which contains a CXXS
           sequence in place of the typical CXXC motif, similar to
           ERp44. The CXXS motif allows the formation of stable
           mixed disulfides, crucial for the ER-retention function
           of ERp44.
          Length = 113

 Score = 27.0 bits (60), Expect = 8.3
 Identities = 13/62 (20%), Positives = 27/62 (43%), Gaps = 9/62 (14%)

Query: 306 NGSAPASDLFNSQKLVSFRRTGIENLARLQNREDPWLIVLYAPWCHFCQAMEGSYIELAE 365
           +  +P  D +  Q         ++    L+   +  L++ YAPW    QA    + ++A+
Sbjct: 6   SQRSPVLDFYKGQ---------LDYAEELRTDAEVSLVMYYAPWDAQSQAARQEFEQVAQ 56

Query: 366 QL 367
           +L
Sbjct: 57  KL 58


>gnl|CDD|233770 TIGR02188, Ac_CoA_lig_AcsA, acetate--CoA ligase.  This model
           describes acetate-CoA ligase (EC 6.2.1.1), also called
           acetyl-CoA synthetase and acetyl-activating enzyme. It
           catalyzes the reaction ATP + acetate + CoA = AMP +
           diphosphate + acetyl-CoA and belongs to the family of
           AMP-binding enzymes described by pfam00501.
          Length = 625

 Score = 28.4 bits (64), Expect = 9.3
 Identities = 10/26 (38%), Positives = 17/26 (65%), Gaps = 1/26 (3%)

Query: 111 PETHQFFDTVEKHYGIRIEYTFPNAV 136
           P+  +F++ +EKH  + I YT P A+
Sbjct: 317 PDPGRFWEIIEKH-KVTIFYTAPTAI 341


>gnl|CDD|129606 TIGR00515, accD, acetyl-CoA carboxylase, carboxyl transferase, beta
           subunit.  The enzyme acetyl-CoA carboxylase contains a
           biotin carboxyl carrier protein or domain, a biotin
           carboxylase, and a carboxyl transferase. This model
           represents the beta chain of the carboxyl transferase
           for cases in which the architecture of the protein is as
           in E. coli, in which the carboxyltransferase portion
           consists of two non-identical subnits, alpha and beta
           [Fatty acid and phospholipid metabolism, Biosynthesis].
          Length = 285

 Score = 27.8 bits (62), Expect = 9.8
 Identities = 26/88 (29%), Positives = 39/88 (44%), Gaps = 6/88 (6%)

Query: 22  SSFAVCYYGPHHKGVEG--RIESTNDHEDYEKLARGMESASPLEIMDKAFQKFGNDIAIA 79
            +  VC    HH  ++   RIES  D   +E+    +E   PL+  D   +K+ + IA A
Sbjct: 43  RNLEVCPKCDHHMRMDARERIESLLDEGSFEEFNSHLEPKDPLKFKDS--KKYKDRIAKA 100

Query: 80  --FSGAEDVVLIEYAKLTGRPFRVFSLD 105
              +G +D V+     L G P  V   D
Sbjct: 101 QKETGEKDAVVTGKGTLYGMPIVVAVFD 128


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.135    0.414 

Gapped
Lambda     K      H
   0.267   0.0741    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,887,147
Number of extensions: 2115722
Number of successful extensions: 2038
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1990
Number of HSP's successfully gapped: 77
Length of query: 430
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 330
Effective length of database: 6,502,202
Effective search space: 2145726660
Effective search space used: 2145726660
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (26.9 bits)