RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8678
(394 letters)
>gnl|CDD|217725 pfam03781, FGE-sulfatase, Formylglycine-generating sulfatase
enzyme. This domain is found in eukaryotic proteins
required for post-translational sulphatase modification
(SUMF1). These proteins are associated with the rare
disorder multiple sulphatase deficiency (MSD). The
protein product of the SUMF1 gene is FGE, formylglycine
(FGly),-generating enzyme, which is a sulfatase.
Sulfatases are enzymes essential for degradation and
remodelling of sulfate esters, and formylglycine (FGly),
the key catalytic in the active site, is unique to
sulfatases. FGE is localised to the endoplasmic
reticulum (ER) and interacts with and modifies the
unfolded form of newly synthesised sulfatases. FGE is a
single-domain monomer with a surprising paucity of
secondary structure that adopts a unique fold which is
stabilised by two Ca2+ ions. The effect of all mutations
found in MSD patients is explained by the FGE structure,
providing a molecular basis for MSD. A redox-active
disulfide bond is present in the active site of FGE. An
oxidized cysteine residue, possibly cysteine sulfenic
acid, has been detected that may allow formulation of a
structure-based mechanism for FGly formation from
cysteine residues in all sulfatases.
Length = 259
Score = 190 bits (483), Expect = 3e-58
Identities = 84/190 (44%), Positives = 107/190 (56%), Gaps = 15/190 (7%)
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLEN 215
W HP G S I+ +HPV VSW DAVAY W G RLPTEAEWEY RGG +
Sbjct: 75 WRHPSGGLSDIDDGADHPVTGVSWYDAVAYARWLGKRTGNGYRLPTEAEWEYAARGGSKG 134
Query: 216 RLFPWGNNLTPRGEHRANVWQG-EFPTNNTAADGY-LSTAPVMSYKENKFGLYNMVGNVW 273
R +PWG+ L P G N+WQG +FP + AD + T+PV S+ N GLY+M GNVW
Sbjct: 135 RRYPWGDELYPAG----NIWQGADFPNEHAGADSFNGRTSPVGSFPPNALGLYDMAGNVW 190
Query: 274 EWTADWWNVHHHPAPSYNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAAR-SQNTPDSS 331
EWT+DW+ H+ AP G +V +GGS+ C+ R R A R + TP +
Sbjct: 191 EWTSDWYKPHYSFAPYDELSRDNFGGGYRVVRGGSWACSVY-PSRLRPAFRGNCQTPGTR 249
Query: 332 AGNLGFRCAA 341
A ++GFR
Sbjct: 250 ADDVGFRLVR 259
>gnl|CDD|224182 COG1262, COG1262, Uncharacterized conserved protein [Function
unknown].
Length = 314
Score = 153 bits (389), Expect = 1e-43
Identities = 95/348 (27%), Positives = 125/348 (35%), Gaps = 104/348 (29%)
Query: 15 MVLLPGDTFRMGTNKPILIKD--GEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MVL+PG F MG+ + E P VT+ F +D++ V+N QF FV A GY T
Sbjct: 53 MVLIPGGEFTMGSPDDEWERFDRNEAPVHKVTVPPFEIDKYPVTNAQFARFVEAGGYTTA 112
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
E+ G E S + + R+ HPVV +SW DA AY
Sbjct: 113 WEEDG------------EPVYPSYWKGE---------GGRLRLEHPVVGVSWYDAQAYAA 151
Query: 133 WRG-----ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
W G RLPTEAEWEY R G +PWG + ++
Sbjct: 152 WLGVKTGEYRLPTEAEWEYAARAGTTTDSYPWG-----DELEP----------------G 190
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
AY WEY + P G N
Sbjct: 191 LNAYAG----------TWEY----LRAAAGWARERETAPVGAFPPN-------------- 222
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA---------PSYNPKGPTT- 297
+GLY+M GNVWEWTADW H+ Y+ +
Sbjct: 223 -------------AAYGLYDMHGNVWEWTADWEKEWHYDNYGPAPSDGSAWYDGNSGSKF 269
Query: 298 -GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
G+ +V +GGS+ R A R+ PD ++GFRCA
Sbjct: 270 FGSLRVVRGGSWASYPGVL---RPAFRNFLVPDYRQAHVGFRCARLIK 314
>gnl|CDD|234246 TIGR03529, GldK_short, gliding motility-associated lipoprotein
GldK. Members of this protein family are exclusive to
the Bacteroidetes phylum (previously
Cytophaga-Flavobacteria-Bacteroides). GldK is a
lipoprotein linked to a type of rapid surface gliding
motility found in certain Bacteroidetes, such as
Flavobacterium johnsoniae. Knockouts of GldK abolish the
gliding phenotype. GldK is homologous to GldJ. This
model represents a GldK homolog in Cytophaga
hutchinsonii and several other species that has a
different, shorter architecture than that found in
Flavobacterium johnsoniae and related species
(represented by (TIGR03525). Gliding motility appears
closely linked to chitin utilization in the model
species Flavobacterium johnsoniae. Bacteroidetes with
members of this protein family appear to have all of the
genes associated with gliding motility.
Length = 344
Score = 120 bits (303), Expect = 5e-31
Identities = 89/342 (26%), Positives = 127/342 (37%), Gaps = 74/342 (21%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA--TGYVT 71
MV++P TF MG + ++ +T+ F++D+ EV+N ++++F+ G +
Sbjct: 53 GMVVIPAGTFHMGQADEDVPATQINLNKQITISEFFMDKTEVTNNKYRQFLEVVLEGQLA 112
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHP------VVHISWN 125
+ E L + + H M D +E+ HP VV + WN
Sbjct: 113 TGTPLPPEYDMEELYPDTTVW-STSFSHHMG-----DPLMEYYFDHPAFDNYPVVGVDWN 166
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
A +C WR + MN S
Sbjct: 167 AAKQFCEWR--------------------------------------TYHMNAYRNEESQ 188
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL--TPRGEHRANVWQGEFPTNN 243
D + RLP+EAEWEY RGG + +PWG RG AN G N
Sbjct: 189 YDMPRF------RLPSEAEWEYAARGGRDMAKYPWGGPYLRNKRGCMLANFKPGR---GN 239
Query: 244 TAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----PSYNPKGPTTG 298
DG+ TAPV Y N FGLY+M GNV EW D + P P Y
Sbjct: 240 YYDDGFPYTAPVAVYFPNDFGLYDMAGNVAEWVLDAYAATSVPIVWDLNPVYEDPNEVR- 298
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
K+ +GGS+ Y R+ D S ++GFR
Sbjct: 299 --KIIRGGSWKDIAYYL---ETGTRTFEYEDVSQAHIGFRTV 335
>gnl|CDD|132564 TIGR03525, GldK, gliding motility-associated lipoprotein GldK.
Members of this protein family are exclusive to the
Bacteroidetes phylum (previously
Cytophaga-Flavobacteria-Bacteroides). GldK is a
lipoprotein linked to a type of rapid surface gliding
motility found in certain Bacteroidetes, such as
Flavobacterium johnsoniae. Knockouts of GldK abolish the
gliding phenotype. GldK is homologous to GldJ. There is
a GldK homolog in Cytophaga hutchinsonii and several
other species that has a different, shorter architecture
and is represented by a separate model. Gliding motility
appears closely linked to chitin utilization in the
model species Flavobacterium johnsoniae. Bacteroidetes
with members of this protein family appear to have all
of the genes associated with gliding motility.
Length = 449
Score = 105 bits (264), Expect = 4e-25
Identities = 69/189 (36%), Positives = 89/189 (47%), Gaps = 31/189 (16%)
Query: 177 NHPVVHVSWNDAVAYCTWRGA-----------------RLPTEAEWEYGCRGGLENRLFP 219
++PVV V+W A A+C WR RLPTEAEWEY RGGLE +P
Sbjct: 262 DYPVVGVTWKQARAFCNWRTKYKNDFRKKKGPANVNTFRLPTEAEWEYAARGGLEGATYP 321
Query: 220 WGNNLT--PRGEHRANVWQGEFPTN--NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
WG T RG AN F + AAD L T SY+ N +GLYNM GNV EW
Sbjct: 322 WGGPYTKNDRGCFMAN-----FKPVRGDYAADEALYTVEAKSYEPNDYGLYNMAGNVSEW 376
Query: 276 TADWWNVHHHP-APSYNPK-GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
T ++ + + NP + T KV +GGS+ + Y + + R DS+
Sbjct: 377 TNSSYDPSSYEYMSTMNPNVNDSENTRKVVRGGSW---KDVAYFLQVSTRDYEYADSARS 433
Query: 334 NLGFRCAAD 342
+GFR D
Sbjct: 434 YIGFRTVQD 442
Score = 63.0 bits (153), Expect = 8e-11
Identities = 27/61 (44%), Positives = 32/61 (52%), Gaps = 17/61 (27%)
Query: 117 HPVVHISWNDAVAYCTWRGA-----------------RLPTEAEWEYGCRGGLENRLFPW 159
+PVV ++W A A+C WR RLPTEAEWEY RGGLE +PW
Sbjct: 263 YPVVGVTWKQARAFCNWRTKYKNDFRKKKGPANVNTFRLPTEAEWEYAARGGLEGATYPW 322
Query: 160 G 160
G
Sbjct: 323 G 323
Score = 47.2 bits (112), Expect = 8e-06
Identities = 18/51 (35%), Positives = 35/51 (68%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
MVL+PG +F MG + + P++ VT+ +FY+D+ E++N+++++FV
Sbjct: 41 MVLVPGGSFIMGKSDEDIAGVMNAPTKTVTVRSFYMDETEITNSEYRQFVE 91
>gnl|CDD|234211 TIGR03440, egtB_TIGR03440, ergothioneine biosynthesis protein EgtB.
Members of this family include EgtB, and enzyme of the
ergothioneine biosynthesis, as found in numerous
Actinobacteria. Characterized homologs to this family
include a formylglycine-generating enzyme that serves as
a maturase for an aerobic sulfatase (cf. the radical SAM
enzymes that serve as anaerobic sulfatase maturases)
[Biosynthesis of cofactors, prosthetic groups, and
carriers, Glutathione and analogs].
Length = 406
Score = 102 bits (255), Expect = 6e-24
Identities = 78/313 (24%), Positives = 101/313 (32%), Gaps = 102/313 (32%)
Query: 3 LLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
+PAP V PG F +G++ D E P V + F +D V+N ++ E
Sbjct: 158 PVPAPASAPPLRWVAFPGGEFEIGSDADGFAFDNERPRHRVLVPPFEIDARPVTNGEYLE 217
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
F+ GY L + A + P
Sbjct: 218 FIEDGGYRRPE-----------LWLSDGWAWVQA----------------EGWQAP---- 246
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
Y WR + W GGL L P + PV H
Sbjct: 247 ------LY--WR----RDDGTWWVFTLGGLRP--------LDP-----------DAPVCH 275
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
VS+ +A AY W GARLPTEAEWE R G F AN+
Sbjct: 276 VSYYEADAYARWAGARLPTEAEWEKAARWGDAPPNF-----------AEANLG------- 317
Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP--------KG 294
APV +Y GL + G+VWEWTA + P P + P G
Sbjct: 318 ----------APVGAYPAGAQGLGQLFGDVWEWTASPY----EPYPGFRPPPGAYGEYNG 363
Query: 295 PTTGTDKVKKGGS 307
V +GGS
Sbjct: 364 KFMDGQMVLRGGS 376
>gnl|CDD|132563 TIGR03524, GldJ, gliding motility-associated lipoprotein GldJ.
Members of this protein family are exclusive to the
Bacteroidetes phylum (previously
Cytophaga-Flavobacteria-Bacteroides). GldJ is a
lipoprotein linked to a type of rapid surface gliding
motility found in certain Bacteroidetes, such as
Flavobacterium johnsoniae. Knockouts of GldJ abolish the
gliding phenotype. GldJ is homologous to GldK. There is
a GldJ homolog in Cytophaga hutchinsonii and several
other species that has a different, shorter architecture
and is represented by a separate model. Gliding motility
appears closely linked to chitin utilization in the
model species Flavobacterium johnsoniae. Bacteroidetes
with members of this protein family appear to have all
of the genes associated with gliding motility.
Length = 559
Score = 91.2 bits (226), Expect = 7e-20
Identities = 82/315 (26%), Positives = 121/315 (38%), Gaps = 63/315 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV---T 71
+V + G TF MG + ++ D + +FY+D+ EV+N+ + E++ V +
Sbjct: 61 LVFVEGGTFTMGQVQDDVMHDWNNTPTQQHVQSFYMDETEVTNSMYLEYLQYLKDVFPPS 120
Query: 72 EAE-------KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
EA DT V+ L E + +RH +PVV +SW
Sbjct: 121 EANYKNIYSGALPDTLVWRNRLGNNETMTENYLRH------------PAYADYPVVGVSW 168
Query: 125 NDAVAYCTWR-----------------GARLPTEAEWEYGCRGGL--ENRLFPWGSWLHP 165
AV + WR GA++ A + L ++ + + ++
Sbjct: 169 IQAVEFSKWRTNRVNEKVLEDKGNIKKGAKIDVTASSFFDTDVYLVDPSKTYGGDTTVYK 228
Query: 166 EGIDSTIEHRMNHPVVHVSWNDAVAY----CTWRGARLPTEAEWEYGCRGGLENR----- 216
GI T + DA + RLPTEAEWEY + + NR
Sbjct: 229 RGIGRTRRKKGEARPAVPEEKDAYQQRKDGIITQRYRLPTEAEWEYAAKANVGNREYNNY 288
Query: 217 ----LFPW------GNNLTPRGEHRANVWQGEFPTNNTAA---DGYLSTAPVMSYKENKF 263
+PW N RG+ AN QG+ A DG T + SY N F
Sbjct: 289 RGRKKYPWNGKYTRSKNRRNRGDQLANFKQGKGDYGGIAGWSDDGADITNEIKSYPPNDF 348
Query: 264 GLYNMVGNVWEWTAD 278
GLY+M GNV EW AD
Sbjct: 349 GLYDMAGNVAEWVAD 363
>gnl|CDD|132569 TIGR03530, GldJ_short, gliding motility-associated lipoprotein
GldJ. Members of this protein family are exclusive to
the Bacteroidetes phylum (previously
Cytophaga-Flavobacteria-Bacteroides). GldJ is a
lipoprotein linked to a type of rapid surface gliding
motility found in certain Bacteroidetes, such as
Flavobacterium johnsoniae. Knockouts of GldJ abolish the
gliding phenotype. GldJ is homologous to GldK. This
model represents the GldJ homolog in Cytophaga
hutchinsonii and several other species which is of
shorter architecture than that found in Flavobacterium
johnsoniae and is represented by a separate model
(TIGR03524). Gliding motility appears closely linked to
chitin utilization in the model species Flavobacterium
johnsoniae. Bacteroidetes with members of this protein
family appear to have all of the genes associated with
gliding motility.
Length = 402
Score = 77.0 bits (189), Expect = 2e-15
Identities = 89/371 (23%), Positives = 132/371 (35%), Gaps = 84/371 (22%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++ + G +G+ + L+ G+ R VT+ FY+D+ E++N + EF+ + A
Sbjct: 65 NLKFIEGGRAVLGSFEEDLMAFGDNLERTVTIANFYMDETEIANIDWLEFLFNMKKDSSA 124
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ E + + E H RF G + PV ++W A AYC W
Sbjct: 125 DFIEKAEPDEDVWAGELAFNDLYQDHYF-RFPGFN-------FFPVAGVNWIQANAYCIW 176
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R E E GIDS + VA
Sbjct: 177 RT---------EVVNELLAEEA-----------GIDSPEGGG------QIPIERGVALAD 210
Query: 194 WRGARLPTEAEWEYGCRGGLEN----------RLFPW-GNNL-TPRGEHRANVWQGEFPT 241
+ RLP EAEWEY + + N R++PW G+ L P R G+F
Sbjct: 211 F---RLPNEAEWEYAAKALIGNQWLDENQEHGRIYPWDGHALRNPYNVKRKGKQMGDFLA 267
Query: 242 N-------------NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW-------- 280
N N DG + + + N FGLY M GN+ EW D +
Sbjct: 268 NFKRGRGDYAGIAGNKLNDGAIIPTNIYDFAPNDFGLYCMAGNMNEWVYDVYRPLSFQDF 327
Query: 281 --------NVHHHPAPSYNPKGPTTGTD---KVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
+ Y+ G + D +V KGGS+ + Y R D
Sbjct: 328 DDLNPLRKDGFFDEEEGYDKAGFQSLLDDEFRVYKGGSW---KDVAYWLSPGTRRFLAED 384
Query: 330 SSAGNLGFRCA 340
S+ +GFRCA
Sbjct: 385 SATAAIGFRCA 395
>gnl|CDD|234555 TIGR04344, ovoA_Nterm, 5-histidylcysteine sulfoxide synthase.
Ovothiol A is N1-methyl-4-mercaptohistidine. In the
absence of S-adenosylmethione, a methyl donor, the
intermediate produced is 4-mercaptohistidine. In both
Erwinia tasmaniensis and Trypanosoma cruzi, a protein
occurs with 5-histidylcysteine sulfoxide synthase
activity, but these two enzymes and most homologs share
an additional C-terminal methyltransferase domain. Thus
OvoA may be a bifunctional enzyme with
5-histidylcysteine sulfoxide synthase and
4-mercaptohistidine N1-methyltranferase activity. This
model describes the 5-histidylcysteine sulfoxide
synthase domain, a homolog of the ergothioneine
biosynthesis protein EgtB [Biosynthesis of cofactors,
prosthetic groups, and carriers, Glutathione and
analogs].
Length = 442
Score = 67.3 bits (165), Expect = 3e-12
Identities = 68/276 (24%), Positives = 96/276 (34%), Gaps = 95/276 (34%)
Query: 14 DMVLLPGDTFRMG--TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+++ +PG ++G + P D E+ S + F ++ VSN +F EFV A GY
Sbjct: 189 ELIQVPGGEVKLGKPDDDPSYGWDNEYGSHRAEVKPFKASKYLVSNGEFLEFVKAGGYEN 248
Query: 72 EAEKFGDTFVFEPLLSEEERA--KISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
+ SEE + +H F
Sbjct: 249 QR-----------YWSEEGWEWKQFRNAKH--PTF------------------------- 270
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
W P + R E PW W PV V++ +A
Sbjct: 271 ---WV----PDSHGYRL--RTMFEEIPMPW-DW----------------PV-EVNYLEAK 303
Query: 190 AYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN- 242
A+C W+ A RLPTEAEW +RL G VW E P N
Sbjct: 304 AFCRWKSAKTGKPIRLPTEAEW---------HRL------RDVSGYSDDPVWTDEAPANL 348
Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD 278
N A S+ PV + + G Y++VGNVW+WT
Sbjct: 349 NLAY--GASSCPVDRFAQG--GFYDVVGNVWQWTET 380
>gnl|CDD|233760 TIGR02171, Fb_sc_TIGR02171, Fibrobacter succinogenes paralogous
family TIGR02171. This model describes a paralogous
family of the rumen bacterium Fibrobacter succinogenes.
Eleven members are found in Fibrobacter succinogenes
S85, averaging over 900 amino acids in length. More than
half are predicted lipoproteins. The function is
unknown.
Length = 912
Score = 45.6 bits (108), Expect = 4e-05
Identities = 36/155 (23%), Positives = 50/155 (32%), Gaps = 30/155 (19%)
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G RLPTEAEW Y W E N+ +
Sbjct: 157 KGYRLPTEAEWIYVASQS----------------------WDPEKSWNSDNSSSEAHE-- 192
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT--DKVKKGGSYL--C 310
V + +N + +M GNV EW DW + G+ ++V KGGSY
Sbjct: 193 VCTSPDNPGNVCDMAGNVLEWVNDWLASFKDTTLTNYVGSSDPGSLGERVVKGGSYRNSP 252
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+ Y T + +GFR P
Sbjct: 253 SAINLYTRGDVYPV--TSSTKGDYVGFRLVLGAIP 285
Score = 32.5 bits (74), Expect = 0.50
Identities = 34/139 (24%), Positives = 52/139 (37%), Gaps = 34/139 (24%)
Query: 24 RMGTNKPILIKDGEFPSRNVTLD-AFYLDQHEVSNTQFQEFVSATGYVTEAEK-----FG 77
+GT+ I K E P V L FY+ +HEV+ +F + + K
Sbjct: 49 TLGTDD-ISAKSNESPKMTVQLTYDFYIGRHEVTCGEFNDLMKGETGFKVPCKEDKLPAT 107
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW---- 133
+ ++ +L +K EGLD+ + + DA +C
Sbjct: 108 NVTFYDAVLYANALSKS----------EGLDTVYTY-------TSANFDASGHCVNLEGL 150
Query: 134 ------RGARLPTEAEWEY 146
+G RLPTEAEW Y
Sbjct: 151 AFHPEVKGYRLPTEAEWIY 169
>gnl|CDD|220778 pfam10487, Nup188, Nucleoporin subcomplex protein binding to Pom34.
This is one of the many peptides that make up the
nucleoporin complex (NPC), and is found across
eukaryotes. The Nup188 subcomplex
(Nic96p-Nup188p-Nup192p-Pom152p) is one of at least six
that make up the NPC, and as such is symmetrically
localised on both faces of the NPC at the nuclear end,
being integrally bound to the C-terminus of Pom34p.
Length = 931
Score = 32.5 bits (74), Expect = 0.49
Identities = 26/97 (26%), Positives = 44/97 (45%), Gaps = 16/97 (16%)
Query: 32 LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEER 91
LIK F +R + ++HEVS + +S G + T L S ER
Sbjct: 222 LIKSTGFSTR-------FTNRHEVSESM-DHLISRIGSLFTII----TIEGLGLNSSHER 269
Query: 92 AKI---SQVRHDMKRFEGLDSTIEHRM-HHPVVHISW 124
A S++ +D+ F+ ++S + + + H+P V SW
Sbjct: 270 ANDDRTSKLANDVHTFQEVNSALLNDVSHNPPVLYSW 306
>gnl|CDD|215265 PLN02478, PLN02478, alternative oxidase.
Length = 328
Score = 30.6 bits (69), Expect = 1.5
Identities = 20/72 (27%), Positives = 31/72 (43%), Gaps = 15/72 (20%)
Query: 142 AEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPT 201
EW++ C PW ++ ID H V + D +AY T + R+PT
Sbjct: 95 TEWKWNC-------FRPWETYKADLSIDLKKHH------VPKTLLDKIAYWTVKSLRVPT 141
Query: 202 EAEWE--YGCRG 211
+ ++ YGCR
Sbjct: 142 DLFFQRRYGCRA 153
Score = 29.1 bits (65), Expect = 4.6
Identities = 16/45 (35%), Positives = 26/45 (57%), Gaps = 4/45 (8%)
Query: 108 DSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWE--YGCRG 150
D +I+ + HH V + D +AY T + R+PT+ ++ YGCR
Sbjct: 111 DLSIDLKKHH--VPKTLLDKIAYWTVKSLRVPTDLFFQRRYGCRA 153
>gnl|CDD|153243 cd03169, GATase1_PfpI_1, Type 1 glutamine amidotransferase
(GATase1)-like domain found in a subgroup of proteins
similar to PfpI from Pyrococcus furiosus. Type 1
glutamine amidotransferase (GATase1)-like domain found
in a subgroup of proteins similar to PfpI from
Pyrococcus furiosus. PfpI is an ATP-independent
intracellular proteases which may hydrolyze small
peptides to provide a nutritional source. Only Cys of
the catalytic triad typical of GATase1 domains is
conserved in this group. This Cys residue is found in
the sharp turn between a beta strand and an alpha helix
termed the nucleophile elbow.
Length = 180
Score = 28.4 bits (64), Expect = 4.3
Identities = 7/38 (18%), Positives = 13/38 (34%)
Query: 90 ERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
+ K V + F G + E H V +++
Sbjct: 35 GKKKGDTVVTAIHDFPGWQTYTEKPGHRFAVTADFDEV 72
>gnl|CDD|239029 cd02111, eukary_SO_Moco, molybdopterin binding domain of sulfite
oxidase (SO). SO catalyzes the terminal reaction in the
oxidative degradation of the sulfur-containing amino
acids cysteine and methionine. Common features of all
known members of the sulfite oxidase (SO) family of
molybdopterin binding domains are that they contain one
single pterin cofactor and part of the coordination of
the metal (Mo) is a cysteine ligand of the protein and
that they catalyze the transfer of an oxygen to or from
a lone pair of electrons on the substrate.
Length = 365
Score = 28.9 bits (65), Expect = 4.8
Identities = 11/42 (26%), Positives = 16/42 (38%), Gaps = 13/42 (30%)
Query: 98 RHDMKRFEGLDSTIEHRMHHPVVHISWND-AVAYCTWRGARL 138
R +M + V + W D A++ W GARL
Sbjct: 85 RSEMTK------------VKKVKGLQWGDGAISNAEWGGARL 114
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.135 0.450
Gapped
Lambda K H
0.267 0.0597 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,170,838
Number of extensions: 1876015
Number of successful extensions: 1380
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1345
Number of HSP's successfully gapped: 37
Length of query: 394
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 295
Effective length of database: 6,546,556
Effective search space: 1931234020
Effective search space used: 1931234020
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.2 bits)