RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10444
(341 letters)
>gnl|CDD|236963 PRK11720, PRK11720, galactose-1-phosphate uridylyltransferase;
Provisional.
Length = 346
Score = 471 bits (1214), Expect = e-168
Identities = 155/290 (53%), Positives = 201/290 (69%), Gaps = 16/290 (5%)
Query: 49 EHQHVRYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAEFDPKNPLCPTATRGNGEVNP 108
+H H RYNPL G+W+LVSPHR KRPW GQ+E ++ + +DP LCP TR G+VNP
Sbjct: 8 DHPHRRYNPLTGQWVLVSPHRAKRPWQGQQETPAKETLPAYDPDCFLCPGNTRVTGDVNP 67
Query: 109 NYDSVFVFNNDFPALLENVPSPPVQDDPLFKMGNASGVCKVICFHPKSNTTLPIMSIEEI 168
+Y +VF NDF AL+ + P P DDPLF+ +A G +VICF P + TLP +S+ +
Sbjct: 68 DYTGTYVFTNDFAALMPDTPDAPESDDPLFRCQSARGTSRVICFSPDHSKTLPELSVAAL 127
Query: 169 RKVIDTWVDETASLGKTYEWVQIFENKGAIMGCSNPHPHCQVWASSFLPNLPRIKDIHQR 228
R+V+DTW ++TA LGKTY WVQ+FENKGA MGCSNPHPH Q+WA+SFLPN +D QR
Sbjct: 128 REVVDTWQEQTAELGKTYPWVQVFENKGAAMGCSNPHPHGQIWANSFLPNEAEREDRLQR 187
Query: 229 EYYQTHGKPLLMDYVSKELLKRKQPTSVAVWLKELLKRDRVVVRNPDWVVVVPFWAEWPF 288
Y+ HG PLL+DYV +EL +R+VV W+ VVP+WA WPF
Sbjct: 188 AYFAEHGSPLLVDYVQ----------------RELADGERIVVETEHWLAVVPYWAAWPF 231
Query: 289 ETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 338
ET++LP+ HVLR TDL D QR++LA +KK+T++YDNLF+ SFPYSMG+H
Sbjct: 232 ETLLLPKAHVLRLTDLTDAQRDDLALALKKLTSRYDNLFQCSFPYSMGWH 281
Score = 96.5 bits (241), Expect = 1e-22
Identities = 32/59 (54%), Positives = 44/59 (74%), Gaps = 7/59 (11%)
Query: 1 MILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH-------EHQH 52
++LP+ HVLR TDL D QR++LA +KK+T++YDNLF+ SFPYSMG+H E+ H
Sbjct: 234 LLLPKAHVLRLTDLTDAQRDDLALALKKLTSRYDNLFQCSFPYSMGWHGAPFNGEENDH 292
>gnl|CDD|238341 cd00608, GalT, Galactose-1-phosphate uridyl transferase (GalT):
This enzyme plays a key role in galactose metabolism by
catalysing the transfer of a uridine 5'-phosphoryl group
from UDP-galactose 1-phosphate. The structure of E.coli
GalT reveals that the enzyme contains two identical
subunits. It also demonstrates that the active site is
formed by amino acid residues from both subunits of the
dimer.
Length = 329
Score = 380 bits (977), Expect = e-132
Identities = 151/290 (52%), Positives = 188/290 (64%), Gaps = 22/290 (7%)
Query: 52 HVRYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAEFDPKNPLCPTATR-GNGEVNPNY 110
H RYNPL GEW+LVSPHR KRPW GQ+E + P E+DP PLCP R GE NP+Y
Sbjct: 1 HRRYNPLTGEWVLVSPHRAKRPWQGQQEAPKKLP--EYDPDCPLCPGNERADTGEQNPDY 58
Query: 111 DSVFVFNNDFPALLENVPSPPVQDDPLFKMGNASGVCKVICFHPKSNTTLPIMSIEEIRK 170
D V VF NDFPAL + P+P DD LF+ A G C+VICF P N TL MS+ EIR+
Sbjct: 59 D-VRVFENDFPALKPDAPAPEDSDDGLFRTAPARGRCEVICFSPDHNLTLAEMSVAEIRE 117
Query: 171 VIDTWVDETASLGK--TYEWVQIFENKGAIMGCSNPHPHCQVWASSFLPNLPRIKDIHQR 228
V++ W + T LGK ++VQIFENKGA MG S PHPH Q+WA FLP + +Q+
Sbjct: 118 VVEAWAERTRELGKNPRIKYVQIFENKGAEMGASLPHPHGQIWALPFLPPEVARELRNQK 177
Query: 229 EYYQTHGKPLLMDYVSKELLKRKQPTSVAVWLKELLKRDRVVVRNPDWVVVVPFWAEWPF 288
YY+ HG+ LL DY+ EL ++R+VV N +V VVPFWA WPF
Sbjct: 178 AYYEKHGRCLLCDYL----------------KLELESKERIVVENEHFVAVVPFWARWPF 221
Query: 289 ETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 338
E ILP+ HV RFTDL D++R +LA+I+K++ +YDNLF SFPYSMG+H
Sbjct: 222 EVHILPKRHVSRFTDLTDEEREDLAEILKRLLARYDNLFNCSFPYSMGWH 271
Score = 79.3 bits (196), Expect = 1e-16
Identities = 29/48 (60%), Positives = 39/48 (81%)
Query: 1 MILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 48
ILP+ HV RFTDL D++R +LA+I+K++ +YDNLF SFPYSMG+H
Sbjct: 224 HILPKRHVSRFTDLTDEEREDLAEILKRLLARYDNLFNCSFPYSMGWH 271
>gnl|CDD|129313 TIGR00209, galT_1, galactose-1-phosphate uridylyltransferase,
family 1. This enzyme is involved in glucose and
galactose interconversion. This model describes one of
two extremely distantly related branches of the model
pfam01087 [Energy metabolism, Sugars].
Length = 347
Score = 356 bits (914), Expect = e-122
Identities = 149/292 (51%), Positives = 194/292 (66%), Gaps = 16/292 (5%)
Query: 47 FHEHQHVRYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAEFDPKNPLCPTATRGNGEV 106
+H H RYNPL +WILVSPHR KRPW GQ+E + + +DP LCP R G++
Sbjct: 6 PVDHPHRRYNPLTDQWILVSPHRAKRPWQGQQETPAKQVLPAYDPDCYLCPGNKRVTGDL 65
Query: 107 NPNYDSVFVFNNDFPALLENVPSPPVQDDPLFKMGNASGVCKVICFHPKSNTTLPIMSIE 166
NP+Y +VF NDF AL+ + P P DPL + +A G +VICF P + TLP +S+
Sbjct: 66 NPDYTGTYVFTNDFAALMSDTPDAPESHDPLMRCQSARGTSRVICFSPDHSKTLPELSVA 125
Query: 167 EIRKVIDTWVDETASLGKTYEWVQIFENKGAIMGCSNPHPHCQVWASSFLPNLPRIKDIH 226
+ +++ TW ++TA LGKTY WVQIFENKGA MGCSNPHPH Q+WA+SFLPN +D
Sbjct: 126 ALTEIVKTWQEQTAELGKTYPWVQIFENKGAAMGCSNPHPHGQIWANSFLPNEVEREDRL 185
Query: 227 QREYYQTHGKPLLMDYVSKELLKRKQPTSVAVWLKELLKRDRVVVRNPDWVVVVPFWAEW 286
Q+EY+ H P+L+DYV +EL + R VV W+ VVP+WA W
Sbjct: 186 QKEYFAEHKSPMLVDYVK----------------RELADKSRTVVETEHWIAVVPYWAIW 229
Query: 287 PFETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 338
PFET++LP+ HVLR TDL D QR++LA I+KK+T+KYDNLF+ SFPYSMG+H
Sbjct: 230 PFETLLLPKAHVLRITDLTDAQRSDLALILKKLTSKYDNLFETSFPYSMGWH 281
Score = 77.3 bits (190), Expect = 7e-16
Identities = 35/61 (57%), Positives = 45/61 (73%), Gaps = 6/61 (9%)
Query: 1 MILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFHEHQHVRYNPLKG 60
++LP+ HVLR TDL D QR++LA I+KK+T+KYDNLF+ SFPYSMG+H P G
Sbjct: 234 LLLPKAHVLRITDLTDAQRSDLALILKKLTSKYDNLFETSFPYSMGWHG------APFNG 287
Query: 61 E 61
E
Sbjct: 288 E 288
>gnl|CDD|224010 COG1085, GalT, Galactose-1-phosphate uridylyltransferase [Energy
production and conversion].
Length = 338
Score = 288 bits (740), Expect = 2e-96
Identities = 128/291 (43%), Positives = 164/291 (56%), Gaps = 21/291 (7%)
Query: 50 HQHVRYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAEFDPKNPLCPTATRGNGEVNPN 109
RYNPL G+W+LVSPHR KRPW G +EK E DP P CP R E NP
Sbjct: 1 MPERRYNPLTGQWVLVSPHRAKRPWQGAQEKIAEQTD--HDPTCPFCPGNERTT-EENPR 57
Query: 110 YDSVFVFNNDFPALLENVPSPPVQDDPLFKMGNASGVCKVICFHPKSNTTLPIMSIEEIR 169
Y V VF NDFPA+ E+ P P +DPLFK+ A G +VI P + TLP + +EEI
Sbjct: 58 YWHVRVFPNDFPAVSEDPPDAPGSEDPLFKIQEARGKSRVIVESPDHSKTLPELPVEEIE 117
Query: 170 KVIDTWVDETASLGKT--YEWVQIFENKGAIMGCSNPHPHCQVWASSFLPNLPRIKDIHQ 227
+V+ W + L + Y++VQIFENKG G S PHPH Q+ A LP +
Sbjct: 118 EVVKLWQERVRELYEREKYKYVQIFENKGKAAGASLPHPHGQIVALPVLPLEVARELRSA 177
Query: 228 REYYQTHGKPLLMDYVSKELLKRKQPTSVAVWLKELLKRDRVVVRNPDWVVVVPFWAEWP 287
REYY+ +G + D V +E + R+VV N ++ VPFWA WP
Sbjct: 178 REYYEENGSCMYCDLVEREKGDGE----------------RIVVENDHFLAFVPFWARWP 221
Query: 288 FETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 338
FE +I P+ HV TDL D++ +LA+I+KK+ +YDNLF SFPYSMGFH
Sbjct: 222 FEVLIYPKEHVSFLTDLSDEELKDLAEILKKLLARYDNLFGNSFPYSMGFH 272
Score = 63.6 bits (155), Expect = 3e-11
Identities = 27/47 (57%), Positives = 35/47 (74%)
Query: 2 ILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 48
I P+ HV TDL D++ +LA+I+KK+ +YDNLF SFPYSMGFH
Sbjct: 226 IYPKEHVSFLTDLSDEELKDLAEILKKLLARYDNLFGNSFPYSMGFH 272
>gnl|CDD|216293 pfam01087, GalP_UDP_transf, Galactose-1-phosphate uridyl
transferase, N-terminal domain. SCOP reports fold
duplication with C-terminal domain. Both involved in Zn
and Fe binding.
Length = 170
Score = 210 bits (537), Expect = 4e-68
Identities = 77/172 (44%), Positives = 95/172 (55%), Gaps = 16/172 (9%)
Query: 50 HQHVRYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAEFDPKNPLCPTATRGNGEVNPN 109
+ RYNPL GEW+LVSPHR+KRPW+GQ EK ED + E D LCP +R NG+ NP
Sbjct: 11 TSNRRYNPLTGEWVLVSPHRLKRPWAGQNEKI-EDTLPERDLMCYLCPGPSRINGDFNPK 69
Query: 110 YDSVFVFNNDFPALLENVPSPPVQD---DPLFKMGNASGVCKVICFHPKSNTTLPIMSIE 166
Y S FVF NDF AL ++ + LFK A G C+V PK +
Sbjct: 70 YKSPFVFTNDFYALSKDNNYIKTDAIAKNILFKAETAYGRCEVTINLPKPE--------K 121
Query: 167 EIRKVIDTWVDETASLGKTY-EWVQIFENKGAIMGCSNPHPHCQVWASSFLP 217
+ + V TA L +Y + V FEN+G GCSNPHPH Q+WAS LP
Sbjct: 122 DPKAVAAA---RTAELASSYPKCVLCFENEGYAGGCSNPHPHGQIWASPHLP 170
>gnl|CDD|145739 pfam02744, GalP_UDP_tr_C, Galactose-1-phosphate uridyl transferase,
C-terminal domain. SCOP reports fold duplication with
N-terminal domain. Both involved in Zn and Fe binding.
Length = 168
Score = 117 bits (295), Expect = 4e-32
Identities = 52/117 (44%), Positives = 70/117 (59%), Gaps = 17/117 (14%)
Query: 223 KDIHQREYYQTH-GKPLLMDYVSKELLKRKQPTSVAVWLKELLKRDRVVVRNPDWVVVVP 281
++ Y+ G LL DYV EL +++RVVV N W VVP
Sbjct: 1 EERIFPHYFAGSNGSILLHDYV----------------QMELAEKERVVVENEHWPAVVP 44
Query: 282 FWAEWPFETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 338
+WA+WPFET+ LP+ HVL T+L D++R +LA I+K +T +YDNLF+ SFPYSMG H
Sbjct: 45 YWAKWPFETLRLPKDHVLSLTELTDKEREDLAAIVKPLTRRYDNLFETSFPYSMGIH 101
Score = 69.4 bits (170), Expect = 3e-14
Identities = 27/48 (56%), Positives = 37/48 (77%)
Query: 1 MILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSFPYSMGFH 48
+ LP+ HVL T+L D++R +LA I+K +T +YDNLF+ SFPYSMG H
Sbjct: 54 LRLPKDHVLSLTELTDKEREDLAAIVKPLTRRYDNLFETSFPYSMGIH 101
>gnl|CDD|215346 PLN02643, PLN02643, ADP-glucose phosphorylase.
Length = 336
Score = 100 bits (250), Expect = 6e-24
Identities = 69/289 (23%), Positives = 116/289 (40%), Gaps = 37/289 (12%)
Query: 53 VRYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAEFDPKNPLCPTATRGNG-------- 104
+R +P+ W++ SP R KRP + + KS ++P P C
Sbjct: 4 LRKDPVTNRWVIFSPARGKRP-TDFKSKSPQNPNGNHSSGCPFCIGHEHECAPEIFRVPD 62
Query: 105 -EVNPNYDSVFVFNNDFPALLENVPSPPVQDDPLFKMG---NASGVCKVICFHPKSNTTL 160
P++ V V N +PAL ++ P + G G V+ P + L
Sbjct: 63 DASAPDW-KVRVIENLYPALSRDLEPPCTEGQGEDYGGRRLPGFGFHDVVIETPVHSVQL 121
Query: 161 PIMSIEEIRKVIDTWVDETASLG--KTYEWVQIFENKGAIMGCSNPHPHCQVWASSFLPN 218
+ I +V+ + L +++VQ+F+N GA G S H H Q+ A +P
Sbjct: 122 SDLPARHIGEVLKAYKKRINQLQSDSRFKYVQVFKNHGASAGASMSHSHSQIIALPVVPP 181
Query: 219 LPRIKDIHQREYYQTHGKPLLMDYVSKELLKRKQPTSVAVWLKELLKRDRVVVRNPDWVV 278
+ +EY++ GK L + V K+LL + + +V
Sbjct: 182 SVSARLDGSKEYFEKTGKCSLCEVVKKDLL---------------------IDESSHFVS 220
Query: 279 VVPFWAEWPFETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLF 327
+ PF A +PFE I+PR H F ++DD + +L ++K + K
Sbjct: 221 IAPFAATFPFEIWIIPRDHSSNFHEIDDDKAVDLGGLLKLMLQKISKQL 269
>gnl|CDD|238263 cd00468, HIT_like, HIT family: HIT (Histidine triad) proteins,
named for a motif related to the sequence HxHxH/Qxx (x,
a hydrophobic amino acid), are a superfamily of
nucleotide hydrolases and transferases, which act on the
alpha-phosphate of ribonucleotides. On the basis of
sequence, substrate specificity, structure, evolution
and mechanism, HIT proteins are classified in the
literacture into three major branches: the Hint branch,
which consists of adenosine 5' -monophosphoramide
hydrolases, the Fhit branch, that consists of
diadenosine polyphosphate hydrolases, and the GalT
branch consisting of specific nucloside monophosphate
transferases. Further sequence analysis reveals several
new closely related, yet uncharacterized subgroups.
Length = 86
Score = 54.0 bits (130), Expect = 1e-09
Identities = 24/108 (22%), Positives = 36/108 (33%), Gaps = 24/108 (22%)
Query: 107 NPNYDSVFVFNNDFPALLENVPSPPVQDDPLFKMGNASGVCKVICFHPKSNTTLPIMSIE 166
P+ + F F N PA +V ++C + TLP +
Sbjct: 1 VPDDEHSFAFVNLKPAAPGHV---------------------LVCP-KRHVETLPDLDEA 38
Query: 167 EIRKVIDTWVDETASL--GKTYEWVQIFENKGAIMGCSNPHPHCQVWA 212
+ ++ T A L + +F N GA G S PH H V
Sbjct: 39 LLADLVITAQRVAAELEKHGNVPSLTVFVNDGAAAGQSVPHVHLHVLP 86
>gnl|CDD|223611 COG0537, Hit, Diadenosine tetraphosphate (Ap4A) hydrolase and other
HIT family hydrolases [Nucleotide transport and
metabolism / Carbohydrate transport and metabolism /
General function prediction only].
Length = 138
Score = 31.6 bits (72), Expect = 0.27
Identities = 12/53 (22%), Positives = 24/53 (45%), Gaps = 2/53 (3%)
Query: 268 RVVVRNPDWVVVV-PFWAEWPFETMILPRTHVLRFTDLDDQQRNNLADIIKKI 319
V + + + + A T+++P+ HV DLD ++ L + +KI
Sbjct: 16 NKVYEDEHVLAFLDIYPAAPGH-TLVIPKRHVSDLEDLDPEELAELFLLAQKI 67
Score = 31.2 bits (71), Expect = 0.38
Identities = 21/100 (21%), Positives = 38/100 (38%), Gaps = 15/100 (15%)
Query: 1 MILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVS-FPYSM---------GFHEH 50
+++P+ HV DLD ++ L + +KI F + + FH H
Sbjct: 39 LVIPKRHVSDLEDLDPEELAELFLLAQKIAKALKEAFGADGYNIGINNGKAAGQEVFHLH 98
Query: 51 QHV--RYNPLKGEWILVSPHRMKRPWSGQEEKSNEDPIAE 88
H+ RY KG+ P + +E + + I +
Sbjct: 99 IHIIPRY---KGDDNFPGPGWGTKVEPNEELEELAEKIRK 135
>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
Length = 389
Score = 30.9 bits (71), Expect = 0.82
Identities = 11/60 (18%), Positives = 27/60 (45%), Gaps = 12/60 (20%)
Query: 219 LPRIKDIHQ------------REYYQTHGKPLLMDYVSKELLKRKQPTSVAVWLKELLKR 266
LP++ + +Q R + + L+ +++ L +++ P + L+E L+R
Sbjct: 252 LPKLMECYQALGDEAEGLEFLRRALEEYPGADLLLALAQLLEEQEGPEAAQALLREQLRR 311
>gnl|CDD|226551 COG4065, COG4065, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 480
Score = 30.2 bits (68), Expect = 1.9
Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 4/36 (11%)
Query: 218 NLPRIKDIHQREYYQTHGKPLLM---DYVSKELLKR 250
+L ++ +I RE Y+ HG P L D V + KR
Sbjct: 348 DLGKLTEI-GREIYEEHGIPTLFAVLDEVMALIAKR 382
>gnl|CDD|226489 COG3981, COG3981, Predicted acetyltransferase [General function
prediction only].
Length = 174
Score = 28.5 bits (64), Expect = 3.2
Identities = 7/28 (25%), Positives = 13/28 (46%)
Query: 259 WLKELLKRDRVVVRNPDWVVVVPFWAEW 286
WL++L +++ WV +WA
Sbjct: 48 WLEDLTRQEPGNNLPEGWVPASTYWAVD 75
>gnl|CDD|234156 TIGR03285, methan_mark_14, putative methanogenesis marker protein
14. Members of this protein family, to date, are found
in a completed prokaryotic genome if and only if the
species is one of the archaeal methanogens. The exact
function is unknown, but likely is linked to
methanogenesis or a process closely connected to it
[Energy metabolism, Methanogenesis].
Length = 445
Score = 29.0 bits (65), Expect = 3.5
Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 4/36 (11%)
Query: 218 NLPRIKDIHQREYYQTHGKPLLM---DYVSKELLKR 250
+L ++ +I RE Y++ G P L D V + KR
Sbjct: 315 DLTKLTEI-GREIYESEGIPTLFAVLDDVMALIAKR 349
>gnl|CDD|226803 COG4360, APA2, ATP adenylyltransferase
(5',5'''-P-1,P-4-tetraphosphate phosphorylase II)
[Nucleotide transport and metabolism].
Length = 298
Score = 29.1 bits (65), Expect = 3.6
Identities = 10/32 (31%), Positives = 15/32 (46%)
Query: 191 IFENKGAIMGCSNPHPHCQVWASSFLPNLPRI 222
+F N G I G S H H Q+ F+ ++
Sbjct: 141 VFYNSGPIAGASQDHKHLQIVPMPFVAFQDQL 172
>gnl|CDD|216377 pfam01230, HIT, HIT domain.
Length = 97
Score = 27.7 bits (62), Expect = 3.6
Identities = 10/53 (18%), Positives = 24/53 (45%)
Query: 267 DRVVVRNPDWVVVVPFWAEWPFETMILPRTHVLRFTDLDDQQRNNLADIIKKI 319
+VV + + + + P +++P+ H+ DL ++ +L KK+
Sbjct: 6 SKVVYEDDLVLAFLDIDPQAPGHVLVIPKKHITELHDLTPEELGDLMSEAKKV 58
>gnl|CDD|239280 cd02982, PDI_b'_family, Protein Disulfide Isomerase (PDIb') family,
redox inactive TRX-like domain b'; composed of
eukaryotic proteins involved in oxidative protein
folding in the endoplasmic reticulum (ER) by acting as
catalysts and folding assistants. Members of this family
include PDI, calsequestrin and other PDI-related
proteins like ERp72, ERp57 (or ERp60), ERp44, P5 and
PDIR. PDI, ERp57, ERp72, P5 and PDIR are all oxidases,
catalyzing the formation of disulfide bonds of newly
synthesized polypeptides in the ER. They also exhibit
reductase activity in acting as isomerases to correct
any non-native disulfide bonds, as well as chaperone
activity to prevent protein aggregation and facilitate
the folding of newly synthesized proteins. These
proteins contain multiple copies of a redox active TRX
(a) domain containing a CXXC motif, and one or more
redox inactive TRX-like (b) domains. The molecular
structure of PDI is abb'a'. Also included in this family
is the PDI-related protein ERp27, which contains only
redox-inactive TRX-like (b and b') domains. The redox
inactive domains are implicated in substrate recognition
with the b' domain serving as the primary substrate
binding site. Only the b' domain is necessary for the
binding of small peptide substrates. In addition to the
b' domain, other domains are required for the binding of
larger polypeptide substrates. The b' domain is also
implicated in chaperone activity.
Length = 103
Score = 27.2 bits (61), Expect = 5.2
Identities = 7/36 (19%), Positives = 16/36 (44%)
Query: 288 FETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKY 323
F + ++ F + DD + L + K++ K+
Sbjct: 6 FNYEESGKPLLVLFYNKDDSESEELRERFKEVAKKF 41
>gnl|CDD|238608 cd01277, HINT_subgroup, HINT (histidine triad nucleotide-binding
protein) subgroup: Members of this CD belong to the
superfamily of histidine triad hydrolases that act on
alpha-phosphate of ribonucleotides. This subgroup
includes members from all three forms of cellular life.
Although the biochemical function has not been
characterised for many of the members of this subgroup,
the proteins from Yeast have been shown to be involved
in secretion, peroxisome formation and gene expression.
Length = 103
Score = 27.2 bits (61), Expect = 5.5
Identities = 10/40 (25%), Positives = 17/40 (42%)
Query: 290 TMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKV 329
T+++P+ H DLD ++ L KK+ K
Sbjct: 37 TLVIPKKHYENLLDLDPEELAELILAAKKVARALKKALKA 76
>gnl|CDD|182309 PRK10213, nepI, ribonucleoside transporter; Reviewed.
Length = 394
Score = 28.3 bits (63), Expect = 5.6
Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 6/59 (10%)
Query: 182 LGKTYEWVQIFENKGAIMGCSNPHPHCQVWASSFLPNLPRIKDIHQREYYQTHGKPLLM 240
LG+ W +F N A+MG C W LP+LP ++ ++ +P +M
Sbjct: 167 LGELIGWRNVF-NAAAVMGVL-----CIFWIIKSLPSLPGEPSHQKQNTFRLLQRPGVM 219
>gnl|CDD|236300 PRK08576, PRK08576, hypothetical protein; Provisional.
Length = 438
Score = 28.5 bits (64), Expect = 5.7
Identities = 20/112 (17%), Positives = 36/112 (32%), Gaps = 15/112 (13%)
Query: 89 FDPKNPLCPTATRGNGEVNPNYDSVFVFNNDFPALLENVPSPPVQDDPLF--KMGNA--- 143
P+N E++P+YD F ++ + + ++ L K+G
Sbjct: 121 LSPENGE-------GIEIHPDYDIFLAIGEGFRKNMKKLLGVDIGENALVLRKLGGEHEV 173
Query: 144 -SGVCKVICFHPKSNTTLPIMSIEEIRKVIDTWVDETASLGKTYEWVQIFEN 194
SG KV P + I + V + E ++ FE
Sbjct: 174 YSGPRKVAELEISKKIGKPTEVLWRIPYPEE--VSLEKLIEANREVLEAFEK 223
>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972). This
family consists of several hypothetical bacterial
sequences. The function of this family is unknown.
Length = 106
Score = 26.9 bits (60), Expect = 7.3
Identities = 11/24 (45%), Positives = 15/24 (62%), Gaps = 3/24 (12%)
Query: 317 KKITTKYDNLFKVSFPYSMGFHVC 340
KK+ YDNL ++ Y GFH+C
Sbjct: 67 KKLGEGYDNLARL---YQEGFHIC 87
>gnl|CDD|237489 PRK13745, PRK13745, anaerobic sulfatase-maturase; Provisional.
Length = 412
Score = 27.9 bits (62), Expect = 8.1
Identities = 13/35 (37%), Positives = 20/35 (57%), Gaps = 3/35 (8%)
Query: 230 YYQ--THGKPLLMDYVSKELLKRKQPTSVAVWLKE 262
Y+Q H P MD++ KEL+ ++ P +V KE
Sbjct: 374 YHQFFKHVAPY-MDFMKKELMNQRPPANVMDAAKE 407
>gnl|CDD|227858 COG5571, COG5571, Autotransporter protein or domain, integral
membrane beta-barrel involved in protein secretion [Cell
motility and secretion].
Length = 239
Score = 27.8 bits (62), Expect = 8.4
Identities = 7/31 (22%), Positives = 10/31 (32%)
Query: 209 QVWASSFLPNLPRIKDIHQREYYQTHGKPLL 239
W SS LP I ++ G +
Sbjct: 50 PTWLSSHLPVYNTILSTKRQSAQLGAGARVQ 80
>gnl|CDD|200072 TIGR01068, thioredoxin, thioredoxin. Several proteins, such as
protein disulfide isomerase, have two or more copies of
a domain closely related to thioredoxin. This model is
designed to recognize authentic thioredoxin, a small
protein that should be hit exactly once by This model.
Any protein that hits once with a score greater than the
second (per domain) trusted cutoff may be taken as
thioredoxin [Energy metabolism, Electron transport].
Length = 101
Score = 26.5 bits (59), Expect = 8.8
Identities = 7/22 (31%), Positives = 10/22 (45%)
Query: 265 KRDRVVVRNPDWVVVVPFWAEW 286
+ + D V+V FWA W
Sbjct: 4 ANFDETIASSDKPVLVDFWAPW 25
>gnl|CDD|224976 COG2065, PyrR, Pyrimidine operon attenuation protein/uracil
phosphoribosyltransferase [Nucleotide transport and
metabolism].
Length = 179
Score = 27.2 bits (61), Expect = 8.9
Identities = 16/43 (37%), Positives = 22/43 (51%), Gaps = 9/43 (20%)
Query: 237 PLLMDYVSKELLKRKQPTS----VAVWLKELLKRDRVVVRNPD 275
P+ DYV K + PTS V V L+E+ RD V++ P
Sbjct: 142 PIRADYVGKNI-----PTSRSEEVKVRLEEVDGRDAVLISEPT 179
>gnl|CDD|227349 COG5016, COG5016, Pyruvate/oxaloacetate carboxyltransferase [Energy
production and conversion].
Length = 472
Score = 27.7 bits (62), Expect = 9.6
Identities = 10/45 (22%), Positives = 18/45 (40%)
Query: 287 PFETMILPRTHVLRFTDLDDQQRNNLADIIKKITTKYDNLFKVSF 331
ETM+ T LD + +A+ +++ KY L +
Sbjct: 243 ATETMVAALRGTGYDTGLDLELLEEIAEYFREVRKKYKGLLEPQA 287
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.138 0.442
Gapped
Lambda K H
0.267 0.0735 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,101,584
Number of extensions: 1731863
Number of successful extensions: 1883
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1866
Number of HSP's successfully gapped: 35
Length of query: 341
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 243
Effective length of database: 6,590,910
Effective search space: 1601591130
Effective search space used: 1601591130
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.5 bits)