RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8678
         (394 letters)



>gnl|CDD|217725 pfam03781, FGE-sulfatase, Formylglycine-generating sulfatase
           enzyme.  This domain is found in eukaryotic proteins
           required for post-translational sulphatase modification
           (SUMF1). These proteins are associated with the rare
           disorder multiple sulphatase deficiency (MSD). The
           protein product of the SUMF1 gene is FGE, formylglycine
           (FGly),-generating enzyme, which is a sulfatase.
           Sulfatases are enzymes essential for degradation and
           remodelling of sulfate esters, and formylglycine (FGly),
           the key catalytic in the active site, is unique to
           sulfatases. FGE is localised to the endoplasmic
           reticulum (ER) and interacts with and modifies the
           unfolded form of newly synthesised sulfatases. FGE is a
           single-domain monomer with a surprising paucity of
           secondary structure that adopts a unique fold which is
           stabilised by two Ca2+ ions. The effect of all mutations
           found in MSD patients is explained by the FGE structure,
           providing a molecular basis for MSD. A redox-active
           disulfide bond is present in the active site of FGE. An
           oxidized cysteine residue, possibly cysteine sulfenic
           acid, has been detected that may allow formulation of a
           structure-based mechanism for FGly formation from
           cysteine residues in all sulfatases.
          Length = 259

 Score =  190 bits (483), Expect = 3e-58
 Identities = 84/190 (44%), Positives = 107/190 (56%), Gaps = 15/190 (7%)

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLEN 215
           W HP G  S I+   +HPV  VSW DAVAY  W G       RLPTEAEWEY  RGG + 
Sbjct: 75  WRHPSGGLSDIDDGADHPVTGVSWYDAVAYARWLGKRTGNGYRLPTEAEWEYAARGGSKG 134

Query: 216 RLFPWGNNLTPRGEHRANVWQG-EFPTNNTAADGY-LSTAPVMSYKENKFGLYNMVGNVW 273
           R +PWG+ L P G    N+WQG +FP  +  AD +   T+PV S+  N  GLY+M GNVW
Sbjct: 135 RRYPWGDELYPAG----NIWQGADFPNEHAGADSFNGRTSPVGSFPPNALGLYDMAGNVW 190

Query: 274 EWTADWWNVHHHPAPSYNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAAR-SQNTPDSS 331
           EWT+DW+  H+  AP         G   +V +GGS+ C+     R R A R +  TP + 
Sbjct: 191 EWTSDWYKPHYSFAPYDELSRDNFGGGYRVVRGGSWACSVY-PSRLRPAFRGNCQTPGTR 249

Query: 332 AGNLGFRCAA 341
           A ++GFR   
Sbjct: 250 ADDVGFRLVR 259


>gnl|CDD|224182 COG1262, COG1262, Uncharacterized conserved protein [Function
           unknown].
          Length = 314

 Score =  153 bits (389), Expect = 1e-43
 Identities = 95/348 (27%), Positives = 125/348 (35%), Gaps = 104/348 (29%)

Query: 15  MVLLPGDTFRMGTNKPILIKD--GEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MVL+PG  F MG+      +    E P   VT+  F +D++ V+N QF  FV A GY T 
Sbjct: 53  MVLIPGGEFTMGSPDDEWERFDRNEAPVHKVTVPPFEIDKYPVTNAQFARFVEAGGYTTA 112

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
            E+ G            E    S  + +             R+ HPVV +SW DA AY  
Sbjct: 113 WEEDG------------EPVYPSYWKGE---------GGRLRLEHPVVGVSWYDAQAYAA 151

Query: 133 WRG-----ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
           W G      RLPTEAEWEY  R G     +PWG     + ++                  
Sbjct: 152 WLGVKTGEYRLPTEAEWEYAARAGTTTDSYPWG-----DELEP----------------G 190

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
             AY             WEY          +       P G    N              
Sbjct: 191 LNAYAG----------TWEY----LRAAAGWARERETAPVGAFPPN-------------- 222

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA---------PSYNPKGPTT- 297
                          +GLY+M GNVWEWTADW    H+             Y+    +  
Sbjct: 223 -------------AAYGLYDMHGNVWEWTADWEKEWHYDNYGPAPSDGSAWYDGNSGSKF 269

Query: 298 -GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
            G+ +V +GGS+          R A R+   PD    ++GFRCA    
Sbjct: 270 FGSLRVVRGGSWASYPGVL---RPAFRNFLVPDYRQAHVGFRCARLIK 314


>gnl|CDD|234246 TIGR03529, GldK_short, gliding motility-associated lipoprotein
           GldK.  Members of this protein family are exclusive to
           the Bacteroidetes phylum (previously
           Cytophaga-Flavobacteria-Bacteroides). GldK is a
           lipoprotein linked to a type of rapid surface gliding
           motility found in certain Bacteroidetes, such as
           Flavobacterium johnsoniae. Knockouts of GldK abolish the
           gliding phenotype. GldK is homologous to GldJ. This
           model represents a GldK homolog in Cytophaga
           hutchinsonii and several other species that has a
           different, shorter architecture than that found in
           Flavobacterium johnsoniae and related species
           (represented by (TIGR03525). Gliding motility appears
           closely linked to chitin utilization in the model
           species Flavobacterium johnsoniae. Bacteroidetes with
           members of this protein family appear to have all of the
           genes associated with gliding motility.
          Length = 344

 Score =  120 bits (303), Expect = 5e-31
 Identities = 89/342 (26%), Positives = 127/342 (37%), Gaps = 74/342 (21%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA--TGYVT 71
            MV++P  TF MG     +       ++ +T+  F++D+ EV+N ++++F+     G + 
Sbjct: 53  GMVVIPAGTFHMGQADEDVPATQINLNKQITISEFFMDKTEVTNNKYRQFLEVVLEGQLA 112

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHP------VVHISWN 125
                   +  E L  +      +   H M      D  +E+   HP      VV + WN
Sbjct: 113 TGTPLPPEYDMEELYPDTTVW-STSFSHHMG-----DPLMEYYFDHPAFDNYPVVGVDWN 166

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
            A  +C WR                                       + MN      S 
Sbjct: 167 AAKQFCEWR--------------------------------------TYHMNAYRNEESQ 188

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL--TPRGEHRANVWQGEFPTNN 243
            D   +      RLP+EAEWEY  RGG +   +PWG       RG   AN   G     N
Sbjct: 189 YDMPRF------RLPSEAEWEYAARGGRDMAKYPWGGPYLRNKRGCMLANFKPGR---GN 239

Query: 244 TAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----PSYNPKGPTTG 298
              DG+  TAPV  Y  N FGLY+M GNV EW  D +     P      P Y        
Sbjct: 240 YYDDGFPYTAPVAVYFPNDFGLYDMAGNVAEWVLDAYAATSVPIVWDLNPVYEDPNEVR- 298

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
             K+ +GGS+     Y        R+    D S  ++GFR  
Sbjct: 299 --KIIRGGSWKDIAYYL---ETGTRTFEYEDVSQAHIGFRTV 335


>gnl|CDD|132564 TIGR03525, GldK, gliding motility-associated lipoprotein GldK.
           Members of this protein family are exclusive to the
           Bacteroidetes phylum (previously
           Cytophaga-Flavobacteria-Bacteroides). GldK is a
           lipoprotein linked to a type of rapid surface gliding
           motility found in certain Bacteroidetes, such as
           Flavobacterium johnsoniae. Knockouts of GldK abolish the
           gliding phenotype. GldK is homologous to GldJ. There is
           a GldK homolog in Cytophaga hutchinsonii and several
           other species that has a different, shorter architecture
           and is represented by a separate model. Gliding motility
           appears closely linked to chitin utilization in the
           model species Flavobacterium johnsoniae. Bacteroidetes
           with members of this protein family appear to have all
           of the genes associated with gliding motility.
          Length = 449

 Score =  105 bits (264), Expect = 4e-25
 Identities = 69/189 (36%), Positives = 89/189 (47%), Gaps = 31/189 (16%)

Query: 177 NHPVVHVSWNDAVAYCTWRGA-----------------RLPTEAEWEYGCRGGLENRLFP 219
           ++PVV V+W  A A+C WR                   RLPTEAEWEY  RGGLE   +P
Sbjct: 262 DYPVVGVTWKQARAFCNWRTKYKNDFRKKKGPANVNTFRLPTEAEWEYAARGGLEGATYP 321

Query: 220 WGNNLT--PRGEHRANVWQGEFPTN--NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
           WG   T   RG   AN     F     + AAD  L T    SY+ N +GLYNM GNV EW
Sbjct: 322 WGGPYTKNDRGCFMAN-----FKPVRGDYAADEALYTVEAKSYEPNDYGLYNMAGNVSEW 376

Query: 276 TADWWNVHHHP-APSYNPK-GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
           T   ++   +    + NP    +  T KV +GGS+   +   Y  + + R     DS+  
Sbjct: 377 TNSSYDPSSYEYMSTMNPNVNDSENTRKVVRGGSW---KDVAYFLQVSTRDYEYADSARS 433

Query: 334 NLGFRCAAD 342
            +GFR   D
Sbjct: 434 YIGFRTVQD 442



 Score = 63.0 bits (153), Expect = 8e-11
 Identities = 27/61 (44%), Positives = 32/61 (52%), Gaps = 17/61 (27%)

Query: 117 HPVVHISWNDAVAYCTWRGA-----------------RLPTEAEWEYGCRGGLENRLFPW 159
           +PVV ++W  A A+C WR                   RLPTEAEWEY  RGGLE   +PW
Sbjct: 263 YPVVGVTWKQARAFCNWRTKYKNDFRKKKGPANVNTFRLPTEAEWEYAARGGLEGATYPW 322

Query: 160 G 160
           G
Sbjct: 323 G 323



 Score = 47.2 bits (112), Expect = 8e-06
 Identities = 18/51 (35%), Positives = 35/51 (68%)

Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
          MVL+PG +F MG +   +      P++ VT+ +FY+D+ E++N+++++FV 
Sbjct: 41 MVLVPGGSFIMGKSDEDIAGVMNAPTKTVTVRSFYMDETEITNSEYRQFVE 91


>gnl|CDD|234211 TIGR03440, egtB_TIGR03440, ergothioneine biosynthesis protein EgtB.
            Members of this family include EgtB, and enzyme of the
           ergothioneine biosynthesis, as found in numerous
           Actinobacteria. Characterized homologs to this family
           include a formylglycine-generating enzyme that serves as
           a maturase for an aerobic sulfatase (cf. the radical SAM
           enzymes that serve as anaerobic sulfatase maturases)
           [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Glutathione and analogs].
          Length = 406

 Score =  102 bits (255), Expect = 6e-24
 Identities = 78/313 (24%), Positives = 101/313 (32%), Gaps = 102/313 (32%)

Query: 3   LLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
            +PAP        V  PG  F +G++      D E P   V +  F +D   V+N ++ E
Sbjct: 158 PVPAPASAPPLRWVAFPGGEFEIGSDADGFAFDNERPRHRVLVPPFEIDARPVTNGEYLE 217

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           F+   GY               L   +  A +                       P    
Sbjct: 218 FIEDGGYRRPE-----------LWLSDGWAWVQA----------------EGWQAP---- 246

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
                  Y  WR      +  W     GGL          L P           + PV H
Sbjct: 247 ------LY--WR----RDDGTWWVFTLGGLRP--------LDP-----------DAPVCH 275

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           VS+ +A AY  W GARLPTEAEWE   R G     F             AN+        
Sbjct: 276 VSYYEADAYARWAGARLPTEAEWEKAARWGDAPPNF-----------AEANLG------- 317

Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP--------KG 294
                     APV +Y     GL  + G+VWEWTA  +     P P + P         G
Sbjct: 318 ----------APVGAYPAGAQGLGQLFGDVWEWTASPY----EPYPGFRPPPGAYGEYNG 363

Query: 295 PTTGTDKVKKGGS 307
                  V +GGS
Sbjct: 364 KFMDGQMVLRGGS 376


>gnl|CDD|132563 TIGR03524, GldJ, gliding motility-associated lipoprotein GldJ.
           Members of this protein family are exclusive to the
           Bacteroidetes phylum (previously
           Cytophaga-Flavobacteria-Bacteroides). GldJ is a
           lipoprotein linked to a type of rapid surface gliding
           motility found in certain Bacteroidetes, such as
           Flavobacterium johnsoniae. Knockouts of GldJ abolish the
           gliding phenotype. GldJ is homologous to GldK. There is
           a GldJ homolog in Cytophaga hutchinsonii and several
           other species that has a different, shorter architecture
           and is represented by a separate model. Gliding motility
           appears closely linked to chitin utilization in the
           model species Flavobacterium johnsoniae. Bacteroidetes
           with members of this protein family appear to have all
           of the genes associated with gliding motility.
          Length = 559

 Score = 91.2 bits (226), Expect = 7e-20
 Identities = 82/315 (26%), Positives = 121/315 (38%), Gaps = 63/315 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV---T 71
           +V + G TF MG  +  ++ D         + +FY+D+ EV+N+ + E++     V   +
Sbjct: 61  LVFVEGGTFTMGQVQDDVMHDWNNTPTQQHVQSFYMDETEVTNSMYLEYLQYLKDVFPPS 120

Query: 72  EAE-------KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
           EA           DT V+   L   E    + +RH                 +PVV +SW
Sbjct: 121 EANYKNIYSGALPDTLVWRNRLGNNETMTENYLRH------------PAYADYPVVGVSW 168

Query: 125 NDAVAYCTWR-----------------GARLPTEAEWEYGCRGGL--ENRLFPWGSWLHP 165
             AV +  WR                 GA++   A   +     L   ++ +   + ++ 
Sbjct: 169 IQAVEFSKWRTNRVNEKVLEDKGNIKKGAKIDVTASSFFDTDVYLVDPSKTYGGDTTVYK 228

Query: 166 EGIDSTIEHRMNHPVVHVSWNDAVAY----CTWRGARLPTEAEWEYGCRGGLENR----- 216
            GI  T   +           DA          +  RLPTEAEWEY  +  + NR     
Sbjct: 229 RGIGRTRRKKGEARPAVPEEKDAYQQRKDGIITQRYRLPTEAEWEYAAKANVGNREYNNY 288

Query: 217 ----LFPW------GNNLTPRGEHRANVWQGEFPTNNTAA---DGYLSTAPVMSYKENKF 263
                +PW        N   RG+  AN  QG+      A    DG   T  + SY  N F
Sbjct: 289 RGRKKYPWNGKYTRSKNRRNRGDQLANFKQGKGDYGGIAGWSDDGADITNEIKSYPPNDF 348

Query: 264 GLYNMVGNVWEWTAD 278
           GLY+M GNV EW AD
Sbjct: 349 GLYDMAGNVAEWVAD 363


>gnl|CDD|132569 TIGR03530, GldJ_short, gliding motility-associated lipoprotein
           GldJ.  Members of this protein family are exclusive to
           the Bacteroidetes phylum (previously
           Cytophaga-Flavobacteria-Bacteroides). GldJ is a
           lipoprotein linked to a type of rapid surface gliding
           motility found in certain Bacteroidetes, such as
           Flavobacterium johnsoniae. Knockouts of GldJ abolish the
           gliding phenotype. GldJ is homologous to GldK. This
           model represents the GldJ homolog in Cytophaga
           hutchinsonii and several other species which is of
           shorter architecture than that found in Flavobacterium
           johnsoniae and is represented by a separate model
           (TIGR03524). Gliding motility appears closely linked to
           chitin utilization in the model species Flavobacterium
           johnsoniae. Bacteroidetes with members of this protein
           family appear to have all of the genes associated with
           gliding motility.
          Length = 402

 Score = 77.0 bits (189), Expect = 2e-15
 Identities = 89/371 (23%), Positives = 132/371 (35%), Gaps = 84/371 (22%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++  + G    +G+ +  L+  G+   R VT+  FY+D+ E++N  + EF+      + A
Sbjct: 65  NLKFIEGGRAVLGSFEEDLMAFGDNLERTVTIANFYMDETEIANIDWLEFLFNMKKDSSA 124

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +        E + + E         H   RF G +         PV  ++W  A AYC W
Sbjct: 125 DFIEKAEPDEDVWAGELAFNDLYQDHYF-RFPGFN-------FFPVAGVNWIQANAYCIW 176

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R          E       E             GIDS            +     VA   
Sbjct: 177 RT---------EVVNELLAEEA-----------GIDSPEGGG------QIPIERGVALAD 210

Query: 194 WRGARLPTEAEWEYGCRGGLEN----------RLFPW-GNNL-TPRGEHRANVWQGEFPT 241
           +   RLP EAEWEY  +  + N          R++PW G+ L  P    R     G+F  
Sbjct: 211 F---RLPNEAEWEYAAKALIGNQWLDENQEHGRIYPWDGHALRNPYNVKRKGKQMGDFLA 267

Query: 242 N-------------NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW-------- 280
           N             N   DG +    +  +  N FGLY M GN+ EW  D +        
Sbjct: 268 NFKRGRGDYAGIAGNKLNDGAIIPTNIYDFAPNDFGLYCMAGNMNEWVYDVYRPLSFQDF 327

Query: 281 --------NVHHHPAPSYNPKGPTTGTD---KVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
                   +        Y+  G  +  D   +V KGGS+   +   Y      R     D
Sbjct: 328 DDLNPLRKDGFFDEEEGYDKAGFQSLLDDEFRVYKGGSW---KDVAYWLSPGTRRFLAED 384

Query: 330 SSAGNLGFRCA 340
           S+   +GFRCA
Sbjct: 385 SATAAIGFRCA 395


>gnl|CDD|234555 TIGR04344, ovoA_Nterm, 5-histidylcysteine sulfoxide synthase.
           Ovothiol A is N1-methyl-4-mercaptohistidine. In the
           absence of S-adenosylmethione, a methyl donor, the
           intermediate produced is 4-mercaptohistidine. In both
           Erwinia tasmaniensis and Trypanosoma cruzi, a protein
           occurs with 5-histidylcysteine sulfoxide synthase
           activity, but these two enzymes and most homologs share
           an additional C-terminal methyltransferase domain. Thus
           OvoA may be a bifunctional enzyme with
           5-histidylcysteine sulfoxide synthase and
           4-mercaptohistidine N1-methyltranferase activity. This
           model describes the 5-histidylcysteine sulfoxide
           synthase domain, a homolog of the ergothioneine
           biosynthesis protein EgtB [Biosynthesis of cofactors,
           prosthetic groups, and carriers, Glutathione and
           analogs].
          Length = 442

 Score = 67.3 bits (165), Expect = 3e-12
 Identities = 68/276 (24%), Positives = 96/276 (34%), Gaps = 95/276 (34%)

Query: 14  DMVLLPGDTFRMG--TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +++ +PG   ++G   + P    D E+ S    +  F   ++ VSN +F EFV A GY  
Sbjct: 189 ELIQVPGGEVKLGKPDDDPSYGWDNEYGSHRAEVKPFKASKYLVSNGEFLEFVKAGGYEN 248

Query: 72  EAEKFGDTFVFEPLLSEEERA--KISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           +              SEE     +    +H    F                         
Sbjct: 249 QR-----------YWSEEGWEWKQFRNAKH--PTF------------------------- 270

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
              W     P    +    R   E    PW  W                PV  V++ +A 
Sbjct: 271 ---WV----PDSHGYRL--RTMFEEIPMPW-DW----------------PV-EVNYLEAK 303

Query: 190 AYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN- 242
           A+C W+ A      RLPTEAEW         +RL          G     VW  E P N 
Sbjct: 304 AFCRWKSAKTGKPIRLPTEAEW---------HRL------RDVSGYSDDPVWTDEAPANL 348

Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD 278
           N A     S+ PV  + +   G Y++VGNVW+WT  
Sbjct: 349 NLAY--GASSCPVDRFAQG--GFYDVVGNVWQWTET 380


>gnl|CDD|233760 TIGR02171, Fb_sc_TIGR02171, Fibrobacter succinogenes paralogous
           family TIGR02171.  This model describes a paralogous
           family of the rumen bacterium Fibrobacter succinogenes.
           Eleven members are found in Fibrobacter succinogenes
           S85, averaging over 900 amino acids in length. More than
           half are predicted lipoproteins. The function is
           unknown.
          Length = 912

 Score = 45.6 bits (108), Expect = 4e-05
 Identities = 36/155 (23%), Positives = 50/155 (32%), Gaps = 30/155 (19%)

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G RLPTEAEW Y                           W  E   N+  +        
Sbjct: 157 KGYRLPTEAEWIYVASQS----------------------WDPEKSWNSDNSSSEAHE-- 192

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT--DKVKKGGSYL--C 310
           V +  +N   + +M GNV EW  DW         +        G+  ++V KGGSY    
Sbjct: 193 VCTSPDNPGNVCDMAGNVLEWVNDWLASFKDTTLTNYVGSSDPGSLGERVVKGGSYRNSP 252

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           +    Y          T  +    +GFR      P
Sbjct: 253 SAINLYTRGDVYPV--TSSTKGDYVGFRLVLGAIP 285



 Score = 32.5 bits (74), Expect = 0.50
 Identities = 34/139 (24%), Positives = 52/139 (37%), Gaps = 34/139 (24%)

Query: 24  RMGTNKPILIKDGEFPSRNVTLD-AFYLDQHEVSNTQFQEFVSATGYVTEAEK-----FG 77
            +GT+  I  K  E P   V L   FY+ +HEV+  +F + +          K       
Sbjct: 49  TLGTDD-ISAKSNESPKMTVQLTYDFYIGRHEVTCGEFNDLMKGETGFKVPCKEDKLPAT 107

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW---- 133
           +   ++ +L     +K           EGLD+   +         +  DA  +C      
Sbjct: 108 NVTFYDAVLYANALSKS----------EGLDTVYTY-------TSANFDASGHCVNLEGL 150

Query: 134 ------RGARLPTEAEWEY 146
                 +G RLPTEAEW Y
Sbjct: 151 AFHPEVKGYRLPTEAEWIY 169


>gnl|CDD|220778 pfam10487, Nup188, Nucleoporin subcomplex protein binding to Pom34.
            This is one of the many peptides that make up the
           nucleoporin complex (NPC), and is found across
           eukaryotes. The Nup188 subcomplex
           (Nic96p-Nup188p-Nup192p-Pom152p) is one of at least six
           that make up the NPC, and as such is symmetrically
           localised on both faces of the NPC at the nuclear end,
           being integrally bound to the C-terminus of Pom34p.
          Length = 931

 Score = 32.5 bits (74), Expect = 0.49
 Identities = 26/97 (26%), Positives = 44/97 (45%), Gaps = 16/97 (16%)

Query: 32  LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEER 91
           LIK   F +R       + ++HEVS +     +S  G +        T     L S  ER
Sbjct: 222 LIKSTGFSTR-------FTNRHEVSESM-DHLISRIGSLFTII----TIEGLGLNSSHER 269

Query: 92  AKI---SQVRHDMKRFEGLDSTIEHRM-HHPVVHISW 124
           A     S++ +D+  F+ ++S + + + H+P V  SW
Sbjct: 270 ANDDRTSKLANDVHTFQEVNSALLNDVSHNPPVLYSW 306


>gnl|CDD|215265 PLN02478, PLN02478, alternative oxidase.
          Length = 328

 Score = 30.6 bits (69), Expect = 1.5
 Identities = 20/72 (27%), Positives = 31/72 (43%), Gaps = 15/72 (20%)

Query: 142 AEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPT 201
            EW++ C         PW ++     ID    H      V  +  D +AY T +  R+PT
Sbjct: 95  TEWKWNC-------FRPWETYKADLSIDLKKHH------VPKTLLDKIAYWTVKSLRVPT 141

Query: 202 EAEWE--YGCRG 211
           +  ++  YGCR 
Sbjct: 142 DLFFQRRYGCRA 153



 Score = 29.1 bits (65), Expect = 4.6
 Identities = 16/45 (35%), Positives = 26/45 (57%), Gaps = 4/45 (8%)

Query: 108 DSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWE--YGCRG 150
           D +I+ + HH  V  +  D +AY T +  R+PT+  ++  YGCR 
Sbjct: 111 DLSIDLKKHH--VPKTLLDKIAYWTVKSLRVPTDLFFQRRYGCRA 153


>gnl|CDD|153243 cd03169, GATase1_PfpI_1, Type 1 glutamine amidotransferase
           (GATase1)-like domain found in a subgroup of proteins
           similar to PfpI from Pyrococcus furiosus.  Type 1
           glutamine amidotransferase (GATase1)-like domain found
           in a subgroup of proteins similar to PfpI from
           Pyrococcus furiosus.   PfpI is an ATP-independent
           intracellular proteases which may hydrolyze small
           peptides to provide a nutritional source.  Only Cys of
           the catalytic triad typical of GATase1 domains is
           conserved in this group. This Cys residue is found in
           the sharp turn between a beta strand and an alpha helix
           termed the nucleophile elbow.
          Length = 180

 Score = 28.4 bits (64), Expect = 4.3
 Identities = 7/38 (18%), Positives = 13/38 (34%)

Query: 90  ERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
            + K   V   +  F G  +  E   H   V   +++ 
Sbjct: 35  GKKKGDTVVTAIHDFPGWQTYTEKPGHRFAVTADFDEV 72


>gnl|CDD|239029 cd02111, eukary_SO_Moco, molybdopterin binding domain of sulfite
           oxidase (SO). SO catalyzes the terminal reaction in the
           oxidative degradation of the sulfur-containing amino
           acids cysteine and methionine. Common features of all
           known members of the sulfite oxidase (SO) family of
           molybdopterin binding domains are that they contain one
           single pterin cofactor and part of the coordination of
           the metal (Mo) is a cysteine ligand of the protein and
           that they catalyze the transfer of an oxygen to or from
           a lone pair of electrons on the substrate.
          Length = 365

 Score = 28.9 bits (65), Expect = 4.8
 Identities = 11/42 (26%), Positives = 16/42 (38%), Gaps = 13/42 (30%)

Query: 98  RHDMKRFEGLDSTIEHRMHHPVVHISWND-AVAYCTWRGARL 138
           R +M +               V  + W D A++   W GARL
Sbjct: 85  RSEMTK------------VKKVKGLQWGDGAISNAEWGGARL 114


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.135    0.450 

Gapped
Lambda     K      H
   0.267   0.0597    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,170,838
Number of extensions: 1876015
Number of successful extensions: 1380
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1345
Number of HSP's successfully gapped: 37
Length of query: 394
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 295
Effective length of database: 6,546,556
Effective search space: 1931234020
Effective search space used: 1931234020
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.2 bits)