RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10444
         (341 letters)



>gnl|CDD|236963 PRK11720, PRK11720, galactose-1-phosphate uridylyltransferase;
           Provisional.
          Length = 346

 Score =  471 bits (1214), Expect = e-168
 Identities = 155/290 (53%), Positives = 201/290 (69%), Gaps = 16/290 (5%)

Query: 49  EHQHVRYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAEFDPKNPLCPTATRGNGEVNP 108
           +H H RYNPL G+W+LVSPHR KRPW GQ+E   ++ +  +DP   LCP  TR  G+VNP
Sbjct: 8   DHPHRRYNPLTGQWVLVSPHRAKRPWQGQQETPAKETLPAYDPDCFLCPGNTRVTGDVNP 67

Query: 109 NYDSVFVFNNDFPALLENVPSPPVQDDPLFKMGNASGVCKVICFHPKSNTTLPIMSIEEI 168
           +Y   +VF NDF AL+ + P  P  DDPLF+  +A G  +VICF P  + TLP +S+  +
Sbjct: 68  DYTGTYVFTNDFAALMPDTPDAPESDDPLFRCQSARGTSRVICFSPDHSKTLPELSVAAL 127

Query: 169 RKVIDTWVDETASLGKTYEWVQIFENKGAIMGCSNPHPHCQVWASSFLPNLPRIKDIHQR 228
           R+V+DTW ++TA LGKTY WVQ+FENKGA MGCSNPHPH Q+WA+SFLPN    +D  QR
Sbjct: 128 REVVDTWQEQTAELGKTYPWVQVFENKGAAMGCSNPHPHGQIWANSFLPNEAEREDRLQR 187

Query: 229 EYYQTHGKPLLMDYVSKELLKRKQPTSVAVWLKELLKRDRVVVRNPDWVVVVPFWAEWPF 288
            Y+  HG PLL+DYV                 +EL   +R+VV    W+ VVP+WA WPF
Sbjct: 188 AYFAEHGSPLLVDYVQ----------------RELADGERIVVETEHWLAVVPYWAAWPF 231

Query: 289 ETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 338
           ET++LP+ HVLR TDL D QR++LA  +KK+T++YDNLF+ SFPYSMG+H
Sbjct: 232 ETLLLPKAHVLRLTDLTDAQRDDLALALKKLTSRYDNLFQCSFPYSMGWH 281



 Score = 96.5 bits (241), Expect = 1e-22
 Identities = 32/59 (54%), Positives = 44/59 (74%), Gaps = 7/59 (11%)

Query: 1   MILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH-------EHQH 52
           ++LP+ HVLR TDL D QR++LA  +KK+T++YDNLF+ SFPYSMG+H       E+ H
Sbjct: 234 LLLPKAHVLRLTDLTDAQRDDLALALKKLTSRYDNLFQCSFPYSMGWHGAPFNGEENDH 292


>gnl|CDD|238341 cd00608, GalT, Galactose-1-phosphate uridyl transferase (GalT):
           This enzyme plays a key role in galactose metabolism by
           catalysing the transfer of a uridine 5'-phosphoryl group
           from UDP-galactose 1-phosphate. The structure of E.coli
           GalT reveals that the enzyme contains two identical
           subunits. It also demonstrates that the active site is
           formed by amino acid residues from both subunits of the
           dimer.
          Length = 329

 Score =  380 bits (977), Expect = e-132
 Identities = 151/290 (52%), Positives = 188/290 (64%), Gaps = 22/290 (7%)

Query: 52  HVRYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAEFDPKNPLCPTATR-GNGEVNPNY 110
           H RYNPL GEW+LVSPHR KRPW GQ+E   + P  E+DP  PLCP   R   GE NP+Y
Sbjct: 1   HRRYNPLTGEWVLVSPHRAKRPWQGQQEAPKKLP--EYDPDCPLCPGNERADTGEQNPDY 58

Query: 111 DSVFVFNNDFPALLENVPSPPVQDDPLFKMGNASGVCKVICFHPKSNTTLPIMSIEEIRK 170
           D V VF NDFPAL  + P+P   DD LF+   A G C+VICF P  N TL  MS+ EIR+
Sbjct: 59  D-VRVFENDFPALKPDAPAPEDSDDGLFRTAPARGRCEVICFSPDHNLTLAEMSVAEIRE 117

Query: 171 VIDTWVDETASLGK--TYEWVQIFENKGAIMGCSNPHPHCQVWASSFLPNLPRIKDIHQR 228
           V++ W + T  LGK    ++VQIFENKGA MG S PHPH Q+WA  FLP     +  +Q+
Sbjct: 118 VVEAWAERTRELGKNPRIKYVQIFENKGAEMGASLPHPHGQIWALPFLPPEVARELRNQK 177

Query: 229 EYYQTHGKPLLMDYVSKELLKRKQPTSVAVWLKELLKRDRVVVRNPDWVVVVPFWAEWPF 288
            YY+ HG+ LL DY+                  EL  ++R+VV N  +V VVPFWA WPF
Sbjct: 178 AYYEKHGRCLLCDYL----------------KLELESKERIVVENEHFVAVVPFWARWPF 221

Query: 289 ETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 338
           E  ILP+ HV RFTDL D++R +LA+I+K++  +YDNLF  SFPYSMG+H
Sbjct: 222 EVHILPKRHVSRFTDLTDEEREDLAEILKRLLARYDNLFNCSFPYSMGWH 271



 Score = 79.3 bits (196), Expect = 1e-16
 Identities = 29/48 (60%), Positives = 39/48 (81%)

Query: 1   MILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 48
            ILP+ HV RFTDL D++R +LA+I+K++  +YDNLF  SFPYSMG+H
Sbjct: 224 HILPKRHVSRFTDLTDEEREDLAEILKRLLARYDNLFNCSFPYSMGWH 271


>gnl|CDD|129313 TIGR00209, galT_1, galactose-1-phosphate uridylyltransferase,
           family 1.  This enzyme is involved in glucose and
           galactose interconversion. This model describes one of
           two extremely distantly related branches of the model
           pfam01087 [Energy metabolism, Sugars].
          Length = 347

 Score =  356 bits (914), Expect = e-122
 Identities = 149/292 (51%), Positives = 194/292 (66%), Gaps = 16/292 (5%)

Query: 47  FHEHQHVRYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAEFDPKNPLCPTATRGNGEV 106
             +H H RYNPL  +WILVSPHR KRPW GQ+E   +  +  +DP   LCP   R  G++
Sbjct: 6   PVDHPHRRYNPLTDQWILVSPHRAKRPWQGQQETPAKQVLPAYDPDCYLCPGNKRVTGDL 65

Query: 107 NPNYDSVFVFNNDFPALLENVPSPPVQDDPLFKMGNASGVCKVICFHPKSNTTLPIMSIE 166
           NP+Y   +VF NDF AL+ + P  P   DPL +  +A G  +VICF P  + TLP +S+ 
Sbjct: 66  NPDYTGTYVFTNDFAALMSDTPDAPESHDPLMRCQSARGTSRVICFSPDHSKTLPELSVA 125

Query: 167 EIRKVIDTWVDETASLGKTYEWVQIFENKGAIMGCSNPHPHCQVWASSFLPNLPRIKDIH 226
            + +++ TW ++TA LGKTY WVQIFENKGA MGCSNPHPH Q+WA+SFLPN    +D  
Sbjct: 126 ALTEIVKTWQEQTAELGKTYPWVQIFENKGAAMGCSNPHPHGQIWANSFLPNEVEREDRL 185

Query: 227 QREYYQTHGKPLLMDYVSKELLKRKQPTSVAVWLKELLKRDRVVVRNPDWVVVVPFWAEW 286
           Q+EY+  H  P+L+DYV                 +EL  + R VV    W+ VVP+WA W
Sbjct: 186 QKEYFAEHKSPMLVDYVK----------------RELADKSRTVVETEHWIAVVPYWAIW 229

Query: 287 PFETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 338
           PFET++LP+ HVLR TDL D QR++LA I+KK+T+KYDNLF+ SFPYSMG+H
Sbjct: 230 PFETLLLPKAHVLRITDLTDAQRSDLALILKKLTSKYDNLFETSFPYSMGWH 281



 Score = 77.3 bits (190), Expect = 7e-16
 Identities = 35/61 (57%), Positives = 45/61 (73%), Gaps = 6/61 (9%)

Query: 1   MILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFHEHQHVRYNPLKG 60
           ++LP+ HVLR TDL D QR++LA I+KK+T+KYDNLF+ SFPYSMG+H        P  G
Sbjct: 234 LLLPKAHVLRITDLTDAQRSDLALILKKLTSKYDNLFETSFPYSMGWHG------APFNG 287

Query: 61  E 61
           E
Sbjct: 288 E 288


>gnl|CDD|224010 COG1085, GalT, Galactose-1-phosphate uridylyltransferase [Energy
           production and conversion].
          Length = 338

 Score =  288 bits (740), Expect = 2e-96
 Identities = 128/291 (43%), Positives = 164/291 (56%), Gaps = 21/291 (7%)

Query: 50  HQHVRYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAEFDPKNPLCPTATRGNGEVNPN 109
               RYNPL G+W+LVSPHR KRPW G +EK  E      DP  P CP   R   E NP 
Sbjct: 1   MPERRYNPLTGQWVLVSPHRAKRPWQGAQEKIAEQTD--HDPTCPFCPGNERTT-EENPR 57

Query: 110 YDSVFVFNNDFPALLENVPSPPVQDDPLFKMGNASGVCKVICFHPKSNTTLPIMSIEEIR 169
           Y  V VF NDFPA+ E+ P  P  +DPLFK+  A G  +VI   P  + TLP + +EEI 
Sbjct: 58  YWHVRVFPNDFPAVSEDPPDAPGSEDPLFKIQEARGKSRVIVESPDHSKTLPELPVEEIE 117

Query: 170 KVIDTWVDETASLGKT--YEWVQIFENKGAIMGCSNPHPHCQVWASSFLPNLPRIKDIHQ 227
           +V+  W +    L +   Y++VQIFENKG   G S PHPH Q+ A   LP     +    
Sbjct: 118 EVVKLWQERVRELYEREKYKYVQIFENKGKAAGASLPHPHGQIVALPVLPLEVARELRSA 177

Query: 228 REYYQTHGKPLLMDYVSKELLKRKQPTSVAVWLKELLKRDRVVVRNPDWVVVVPFWAEWP 287
           REYY+ +G  +  D V +E    +                R+VV N  ++  VPFWA WP
Sbjct: 178 REYYEENGSCMYCDLVEREKGDGE----------------RIVVENDHFLAFVPFWARWP 221

Query: 288 FETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 338
           FE +I P+ HV   TDL D++  +LA+I+KK+  +YDNLF  SFPYSMGFH
Sbjct: 222 FEVLIYPKEHVSFLTDLSDEELKDLAEILKKLLARYDNLFGNSFPYSMGFH 272



 Score = 63.6 bits (155), Expect = 3e-11
 Identities = 27/47 (57%), Positives = 35/47 (74%)

Query: 2   ILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 48
           I P+ HV   TDL D++  +LA+I+KK+  +YDNLF  SFPYSMGFH
Sbjct: 226 IYPKEHVSFLTDLSDEELKDLAEILKKLLARYDNLFGNSFPYSMGFH 272


>gnl|CDD|216293 pfam01087, GalP_UDP_transf, Galactose-1-phosphate uridyl
           transferase, N-terminal domain.  SCOP reports fold
           duplication with C-terminal domain. Both involved in Zn
           and Fe binding.
          Length = 170

 Score =  210 bits (537), Expect = 4e-68
 Identities = 77/172 (44%), Positives = 95/172 (55%), Gaps = 16/172 (9%)

Query: 50  HQHVRYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAEFDPKNPLCPTATRGNGEVNPN 109
             + RYNPL GEW+LVSPHR+KRPW+GQ EK  ED + E D    LCP  +R NG+ NP 
Sbjct: 11  TSNRRYNPLTGEWVLVSPHRLKRPWAGQNEKI-EDTLPERDLMCYLCPGPSRINGDFNPK 69

Query: 110 YDSVFVFNNDFPALLENVPSPPVQD---DPLFKMGNASGVCKVICFHPKSNTTLPIMSIE 166
           Y S FVF NDF AL ++           + LFK   A G C+V    PK          +
Sbjct: 70  YKSPFVFTNDFYALSKDNNYIKTDAIAKNILFKAETAYGRCEVTINLPKPE--------K 121

Query: 167 EIRKVIDTWVDETASLGKTY-EWVQIFENKGAIMGCSNPHPHCQVWASSFLP 217
           + + V       TA L  +Y + V  FEN+G   GCSNPHPH Q+WAS  LP
Sbjct: 122 DPKAVAAA---RTAELASSYPKCVLCFENEGYAGGCSNPHPHGQIWASPHLP 170


>gnl|CDD|145739 pfam02744, GalP_UDP_tr_C, Galactose-1-phosphate uridyl transferase,
           C-terminal domain.  SCOP reports fold duplication with
           N-terminal domain. Both involved in Zn and Fe binding.
          Length = 168

 Score =  117 bits (295), Expect = 4e-32
 Identities = 52/117 (44%), Positives = 70/117 (59%), Gaps = 17/117 (14%)

Query: 223 KDIHQREYYQTH-GKPLLMDYVSKELLKRKQPTSVAVWLKELLKRDRVVVRNPDWVVVVP 281
           ++     Y+    G  LL DYV                  EL +++RVVV N  W  VVP
Sbjct: 1   EERIFPHYFAGSNGSILLHDYV----------------QMELAEKERVVVENEHWPAVVP 44

Query: 282 FWAEWPFETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 338
           +WA+WPFET+ LP+ HVL  T+L D++R +LA I+K +T +YDNLF+ SFPYSMG H
Sbjct: 45  YWAKWPFETLRLPKDHVLSLTELTDKEREDLAAIVKPLTRRYDNLFETSFPYSMGIH 101



 Score = 69.4 bits (170), Expect = 3e-14
 Identities = 27/48 (56%), Positives = 37/48 (77%)

Query: 1   MILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 48
           + LP+ HVL  T+L D++R +LA I+K +T +YDNLF+ SFPYSMG H
Sbjct: 54  LRLPKDHVLSLTELTDKEREDLAAIVKPLTRRYDNLFETSFPYSMGIH 101


>gnl|CDD|215346 PLN02643, PLN02643, ADP-glucose phosphorylase.
          Length = 336

 Score =  100 bits (250), Expect = 6e-24
 Identities = 69/289 (23%), Positives = 116/289 (40%), Gaps = 37/289 (12%)

Query: 53  VRYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAEFDPKNPLCPTATRGNG-------- 104
           +R +P+   W++ SP R KRP +  + KS ++P        P C                
Sbjct: 4   LRKDPVTNRWVIFSPARGKRP-TDFKSKSPQNPNGNHSSGCPFCIGHEHECAPEIFRVPD 62

Query: 105 -EVNPNYDSVFVFNNDFPALLENVPSPPVQDDPLFKMG---NASGVCKVICFHPKSNTTL 160
               P++  V V  N +PAL  ++  P  +       G      G   V+   P  +  L
Sbjct: 63  DASAPDW-KVRVIENLYPALSRDLEPPCTEGQGEDYGGRRLPGFGFHDVVIETPVHSVQL 121

Query: 161 PIMSIEEIRKVIDTWVDETASLG--KTYEWVQIFENKGAIMGCSNPHPHCQVWASSFLPN 218
             +    I +V+  +      L     +++VQ+F+N GA  G S  H H Q+ A   +P 
Sbjct: 122 SDLPARHIGEVLKAYKKRINQLQSDSRFKYVQVFKNHGASAGASMSHSHSQIIALPVVPP 181

Query: 219 LPRIKDIHQREYYQTHGKPLLMDYVSKELLKRKQPTSVAVWLKELLKRDRVVVRNPDWVV 278
               +    +EY++  GK  L + V K+LL                     +  +  +V 
Sbjct: 182 SVSARLDGSKEYFEKTGKCSLCEVVKKDLL---------------------IDESSHFVS 220

Query: 279 VVPFWAEWPFETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLF 327
           + PF A +PFE  I+PR H   F ++DD +  +L  ++K +  K     
Sbjct: 221 IAPFAATFPFEIWIIPRDHSSNFHEIDDDKAVDLGGLLKLMLQKISKQL 269


>gnl|CDD|238263 cd00468, HIT_like, HIT family: HIT (Histidine triad) proteins,
           named for a motif related to the sequence HxHxH/Qxx (x,
           a hydrophobic amino acid), are a superfamily of
           nucleotide hydrolases and transferases, which act on the
           alpha-phosphate of ribonucleotides. On the basis of
           sequence, substrate specificity, structure, evolution
           and mechanism, HIT proteins are classified in the
           literacture into three major branches: the Hint branch,
           which consists of adenosine 5' -monophosphoramide
           hydrolases, the Fhit branch, that consists of
           diadenosine polyphosphate hydrolases, and the GalT
           branch consisting of specific nucloside monophosphate
           transferases. Further sequence analysis reveals several
           new closely related, yet uncharacterized subgroups.
          Length = 86

 Score = 54.0 bits (130), Expect = 1e-09
 Identities = 24/108 (22%), Positives = 36/108 (33%), Gaps = 24/108 (22%)

Query: 107 NPNYDSVFVFNNDFPALLENVPSPPVQDDPLFKMGNASGVCKVICFHPKSNTTLPIMSIE 166
            P+ +  F F N  PA   +V                     ++C   +   TLP +   
Sbjct: 1   VPDDEHSFAFVNLKPAAPGHV---------------------LVCP-KRHVETLPDLDEA 38

Query: 167 EIRKVIDTWVDETASL--GKTYEWVQIFENKGAIMGCSNPHPHCQVWA 212
            +  ++ T     A L        + +F N GA  G S PH H  V  
Sbjct: 39  LLADLVITAQRVAAELEKHGNVPSLTVFVNDGAAAGQSVPHVHLHVLP 86


>gnl|CDD|223611 COG0537, Hit, Diadenosine tetraphosphate (Ap4A) hydrolase and other
           HIT family hydrolases [Nucleotide transport and
           metabolism / Carbohydrate transport and metabolism /
           General function prediction only].
          Length = 138

 Score = 31.6 bits (72), Expect = 0.27
 Identities = 12/53 (22%), Positives = 24/53 (45%), Gaps = 2/53 (3%)

Query: 268 RVVVRNPDWVVVV-PFWAEWPFETMILPRTHVLRFTDLDDQQRNNLADIIKKI 319
             V  +   +  +  + A     T+++P+ HV    DLD ++   L  + +KI
Sbjct: 16  NKVYEDEHVLAFLDIYPAAPGH-TLVIPKRHVSDLEDLDPEELAELFLLAQKI 67



 Score = 31.2 bits (71), Expect = 0.38
 Identities = 21/100 (21%), Positives = 38/100 (38%), Gaps = 15/100 (15%)

Query: 1   MILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVS-FPYSM---------GFHEH 50
           +++P+ HV    DLD ++   L  + +KI       F    +   +          FH H
Sbjct: 39  LVIPKRHVSDLEDLDPEELAELFLLAQKIAKALKEAFGADGYNIGINNGKAAGQEVFHLH 98

Query: 51  QHV--RYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAE 88
            H+  RY   KG+     P    +    +E +   + I +
Sbjct: 99  IHIIPRY---KGDDNFPGPGWGTKVEPNEELEELAEKIRK 135


>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
          Length = 389

 Score = 30.9 bits (71), Expect = 0.82
 Identities = 11/60 (18%), Positives = 27/60 (45%), Gaps = 12/60 (20%)

Query: 219 LPRIKDIHQ------------REYYQTHGKPLLMDYVSKELLKRKQPTSVAVWLKELLKR 266
           LP++ + +Q            R   + +    L+  +++ L +++ P +    L+E L+R
Sbjct: 252 LPKLMECYQALGDEAEGLEFLRRALEEYPGADLLLALAQLLEEQEGPEAAQALLREQLRR 311


>gnl|CDD|226551 COG4065, COG4065, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 480

 Score = 30.2 bits (68), Expect = 1.9
 Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 4/36 (11%)

Query: 218 NLPRIKDIHQREYYQTHGKPLLM---DYVSKELLKR 250
           +L ++ +I  RE Y+ HG P L    D V   + KR
Sbjct: 348 DLGKLTEI-GREIYEEHGIPTLFAVLDEVMALIAKR 382


>gnl|CDD|226489 COG3981, COG3981, Predicted acetyltransferase [General function
           prediction only].
          Length = 174

 Score = 28.5 bits (64), Expect = 3.2
 Identities = 7/28 (25%), Positives = 13/28 (46%)

Query: 259 WLKELLKRDRVVVRNPDWVVVVPFWAEW 286
           WL++L +++        WV    +WA  
Sbjct: 48  WLEDLTRQEPGNNLPEGWVPASTYWAVD 75


>gnl|CDD|234156 TIGR03285, methan_mark_14, putative methanogenesis marker protein
           14.  Members of this protein family, to date, are found
           in a completed prokaryotic genome if and only if the
           species is one of the archaeal methanogens. The exact
           function is unknown, but likely is linked to
           methanogenesis or a process closely connected to it
           [Energy metabolism, Methanogenesis].
          Length = 445

 Score = 29.0 bits (65), Expect = 3.5
 Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 4/36 (11%)

Query: 218 NLPRIKDIHQREYYQTHGKPLLM---DYVSKELLKR 250
           +L ++ +I  RE Y++ G P L    D V   + KR
Sbjct: 315 DLTKLTEI-GREIYESEGIPTLFAVLDDVMALIAKR 349


>gnl|CDD|226803 COG4360, APA2, ATP adenylyltransferase
           (5',5'''-P-1,P-4-tetraphosphate phosphorylase II)
           [Nucleotide transport and metabolism].
          Length = 298

 Score = 29.1 bits (65), Expect = 3.6
 Identities = 10/32 (31%), Positives = 15/32 (46%)

Query: 191 IFENKGAIMGCSNPHPHCQVWASSFLPNLPRI 222
           +F N G I G S  H H Q+    F+    ++
Sbjct: 141 VFYNSGPIAGASQDHKHLQIVPMPFVAFQDQL 172


>gnl|CDD|216377 pfam01230, HIT, HIT domain. 
          Length = 97

 Score = 27.7 bits (62), Expect = 3.6
 Identities = 10/53 (18%), Positives = 24/53 (45%)

Query: 267 DRVVVRNPDWVVVVPFWAEWPFETMILPRTHVLRFTDLDDQQRNNLADIIKKI 319
            +VV  +   +  +    + P   +++P+ H+    DL  ++  +L    KK+
Sbjct: 6   SKVVYEDDLVLAFLDIDPQAPGHVLVIPKKHITELHDLTPEELGDLMSEAKKV 58


>gnl|CDD|239280 cd02982, PDI_b'_family, Protein Disulfide Isomerase (PDIb') family,
           redox inactive TRX-like domain b'; composed of
           eukaryotic proteins involved in oxidative protein
           folding in the endoplasmic reticulum (ER) by acting as
           catalysts and folding assistants. Members of this family
           include PDI, calsequestrin and other PDI-related
           proteins like ERp72, ERp57 (or ERp60), ERp44, P5 and
           PDIR. PDI, ERp57, ERp72, P5 and PDIR are all oxidases,
           catalyzing the formation of disulfide bonds of newly
           synthesized polypeptides in the ER. They also exhibit
           reductase activity in acting as isomerases to correct
           any non-native disulfide bonds, as well as chaperone
           activity to prevent protein aggregation and facilitate
           the folding of newly synthesized proteins. These
           proteins contain multiple copies of a redox active TRX
           (a) domain containing a CXXC motif, and one or more
           redox inactive TRX-like (b) domains. The molecular
           structure of PDI is abb'a'. Also included in this family
           is the PDI-related protein ERp27, which contains only
           redox-inactive TRX-like (b and b') domains. The redox
           inactive domains are implicated in substrate recognition
           with the b' domain serving as the primary substrate
           binding site. Only the b' domain is necessary for the
           binding of small peptide substrates. In addition to the
           b' domain, other domains are required for the binding of
           larger polypeptide substrates. The b' domain is also
           implicated in chaperone activity.
          Length = 103

 Score = 27.2 bits (61), Expect = 5.2
 Identities = 7/36 (19%), Positives = 16/36 (44%)

Query: 288 FETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKY 323
           F      +  ++ F + DD +   L +  K++  K+
Sbjct: 6   FNYEESGKPLLVLFYNKDDSESEELRERFKEVAKKF 41


>gnl|CDD|238608 cd01277, HINT_subgroup, HINT (histidine triad nucleotide-binding
           protein) subgroup: Members of this CD belong to the
           superfamily of histidine triad hydrolases that act on
           alpha-phosphate of ribonucleotides. This subgroup
           includes members from all three forms of cellular life.
           Although the biochemical function has not been
           characterised for many of the members of this subgroup,
           the proteins from Yeast have been shown to be involved
           in secretion, peroxisome formation and gene expression.
          Length = 103

 Score = 27.2 bits (61), Expect = 5.5
 Identities = 10/40 (25%), Positives = 17/40 (42%)

Query: 290 TMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKV 329
           T+++P+ H     DLD ++   L    KK+        K 
Sbjct: 37  TLVIPKKHYENLLDLDPEELAELILAAKKVARALKKALKA 76


>gnl|CDD|182309 PRK10213, nepI, ribonucleoside transporter; Reviewed.
          Length = 394

 Score = 28.3 bits (63), Expect = 5.6
 Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 6/59 (10%)

Query: 182 LGKTYEWVQIFENKGAIMGCSNPHPHCQVWASSFLPNLPRIKDIHQREYYQTHGKPLLM 240
           LG+   W  +F N  A+MG       C  W    LP+LP      ++  ++   +P +M
Sbjct: 167 LGELIGWRNVF-NAAAVMGVL-----CIFWIIKSLPSLPGEPSHQKQNTFRLLQRPGVM 219


>gnl|CDD|236300 PRK08576, PRK08576, hypothetical protein; Provisional.
          Length = 438

 Score = 28.5 bits (64), Expect = 5.7
 Identities = 20/112 (17%), Positives = 36/112 (32%), Gaps = 15/112 (13%)

Query: 89  FDPKNPLCPTATRGNGEVNPNYDSVFVFNNDFPALLENVPSPPVQDDPLF--KMGNA--- 143
             P+N           E++P+YD        F   ++ +    + ++ L   K+G     
Sbjct: 121 LSPENGE-------GIEIHPDYDIFLAIGEGFRKNMKKLLGVDIGENALVLRKLGGEHEV 173

Query: 144 -SGVCKVICFHPKSNTTLPIMSIEEIRKVIDTWVDETASLGKTYEWVQIFEN 194
            SG  KV           P   +  I    +  V     +    E ++ FE 
Sbjct: 174 YSGPRKVAELEISKKIGKPTEVLWRIPYPEE--VSLEKLIEANREVLEAFEK 223


>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972).  This
           family consists of several hypothetical bacterial
           sequences. The function of this family is unknown.
          Length = 106

 Score = 26.9 bits (60), Expect = 7.3
 Identities = 11/24 (45%), Positives = 15/24 (62%), Gaps = 3/24 (12%)

Query: 317 KKITTKYDNLFKVSFPYSMGFHVC 340
           KK+   YDNL ++   Y  GFH+C
Sbjct: 67  KKLGEGYDNLARL---YQEGFHIC 87


>gnl|CDD|237489 PRK13745, PRK13745, anaerobic sulfatase-maturase; Provisional.
          Length = 412

 Score = 27.9 bits (62), Expect = 8.1
 Identities = 13/35 (37%), Positives = 20/35 (57%), Gaps = 3/35 (8%)

Query: 230 YYQ--THGKPLLMDYVSKELLKRKQPTSVAVWLKE 262
           Y+Q   H  P  MD++ KEL+ ++ P +V    KE
Sbjct: 374 YHQFFKHVAPY-MDFMKKELMNQRPPANVMDAAKE 407


>gnl|CDD|227858 COG5571, COG5571, Autotransporter protein or domain, integral
           membrane beta-barrel involved in protein secretion [Cell
           motility and secretion].
          Length = 239

 Score = 27.8 bits (62), Expect = 8.4
 Identities = 7/31 (22%), Positives = 10/31 (32%)

Query: 209 QVWASSFLPNLPRIKDIHQREYYQTHGKPLL 239
             W SS LP    I    ++      G  + 
Sbjct: 50  PTWLSSHLPVYNTILSTKRQSAQLGAGARVQ 80


>gnl|CDD|200072 TIGR01068, thioredoxin, thioredoxin.  Several proteins, such as
           protein disulfide isomerase, have two or more copies of
           a domain closely related to thioredoxin. This model is
           designed to recognize authentic thioredoxin, a small
           protein that should be hit exactly once by This model.
           Any protein that hits once with a score greater than the
           second (per domain) trusted cutoff may be taken as
           thioredoxin [Energy metabolism, Electron transport].
          Length = 101

 Score = 26.5 bits (59), Expect = 8.8
 Identities = 7/22 (31%), Positives = 10/22 (45%)

Query: 265 KRDRVVVRNPDWVVVVPFWAEW 286
                 + + D  V+V FWA W
Sbjct: 4   ANFDETIASSDKPVLVDFWAPW 25


>gnl|CDD|224976 COG2065, PyrR, Pyrimidine operon attenuation protein/uracil
           phosphoribosyltransferase [Nucleotide transport and
           metabolism].
          Length = 179

 Score = 27.2 bits (61), Expect = 8.9
 Identities = 16/43 (37%), Positives = 22/43 (51%), Gaps = 9/43 (20%)

Query: 237 PLLMDYVSKELLKRKQPTS----VAVWLKELLKRDRVVVRNPD 275
           P+  DYV K +     PTS    V V L+E+  RD V++  P 
Sbjct: 142 PIRADYVGKNI-----PTSRSEEVKVRLEEVDGRDAVLISEPT 179


>gnl|CDD|227349 COG5016, COG5016, Pyruvate/oxaloacetate carboxyltransferase [Energy
           production and conversion].
          Length = 472

 Score = 27.7 bits (62), Expect = 9.6
 Identities = 10/45 (22%), Positives = 18/45 (40%)

Query: 287 PFETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSF 331
             ETM+         T LD +    +A+  +++  KY  L +   
Sbjct: 243 ATETMVAALRGTGYDTGLDLELLEEIAEYFREVRKKYKGLLEPQA 287


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.138    0.442 

Gapped
Lambda     K      H
   0.267   0.0735    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,101,584
Number of extensions: 1731863
Number of successful extensions: 1883
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1866
Number of HSP's successfully gapped: 35
Length of query: 341
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 243
Effective length of database: 6,590,910
Effective search space: 1601591130
Effective search space used: 1601591130
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.5 bits)