RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7014
         (500 letters)



>gnl|CDD|216930 pfam02210, Laminin_G_2, Laminin G domain.  This family includes the
           Thrombospondin N-terminal-like domain, a Laminin G
           subfamily.
          Length = 124

 Score = 52.8 bits (127), Expect = 1e-08
 Identities = 24/71 (33%), Positives = 34/71 (47%), Gaps = 3/71 (4%)

Query: 222 YTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHS 281
           + V V ++G+   L VD    V+   PG  + LN    LYLGG    +       LP+  
Sbjct: 57  HRVSVSRDGRSLTLSVDGGTVVSEALPGSSSILNLNGPLYLGGLPEDSGL---SLLPVTE 113

Query: 282 GFSGCIFDVEL 292
           GF GCI +V +
Sbjct: 114 GFVGCIRNVRV 124



 Score = 34.7 bits (80), Expect = 0.029
 Identities = 13/44 (29%), Positives = 18/44 (40%)

Query: 454 IRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLMLGDRPKT 497
            R T+  GLL+Y G       + L L+ G L LR  +       
Sbjct: 1   FRTTQPNGLLLYAGGEDGLDFLALELEDGRLVLRYDLGSGGSVL 44



 Score = 29.3 bits (66), Expect = 1.9
 Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 4/37 (10%)

Query: 106 ALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSG 142
            LL + G +   D     LA+    G +VL ++LGSG
Sbjct: 8   GLLLYAGGEDGLD----FLALELEDGRLVLRYDLGSG 40


>gnl|CDD|214598 smart00282, LamG, Laminin G domain. 
          Length = 132

 Score = 50.8 bits (122), Expect = 8e-08
 Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 222 YTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHS 281
           + V V +NG+   L VD    V+ +SPG LT LN    LYLGG       +    LP+  
Sbjct: 62  HRVAVERNGRSVTLSVDGGNRVSGESPGGLTILNLDGPLYLGGL---PEDLKLPPLPVTP 118

Query: 282 GFSGCIFDVEL 292
           GF GCI ++++
Sbjct: 119 GFRGCIRNLKV 129



 Score = 40.4 bits (95), Expect = 3e-04
 Identities = 17/48 (35%), Positives = 20/48 (41%), Gaps = 1/48 (2%)

Query: 450 IDLEIRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLELRVLMLGDRPKT 497
           I    R T   GLL+Y G       + L L+ G L LR   LG  P  
Sbjct: 2   ISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVLRY-DLGSGPAR 48



 Score = 36.2 bits (84), Expect = 0.010
 Identities = 19/64 (29%), Positives = 27/64 (42%), Gaps = 10/64 (15%)

Query: 91  FELKFR-FVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVLTWNLGSGWYLVYFE 149
               FR   PN      LL + G     D    +LA+    G +VL ++LGSG   +  +
Sbjct: 2   ISFSFRTTSPN-----GLLLYAGSKGGGD----YLALELRDGRLVLRYDLGSGPARLTSD 52

Query: 150 HTYL 153
            T L
Sbjct: 53  PTPL 56


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 45.7 bits (109), Expect = 5e-07
 Identities = 18/37 (48%), Positives = 21/37 (56%), Gaps = 1/37 (2%)

Query: 356 NLCDSTRHNCSFGATCVPLTHSYECDCPPGRTGKFCE 392
           + C S  + C  G TCV    SY C CPPG TG+ CE
Sbjct: 3   DECAS-GNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38



 Score = 38.0 bits (89), Expect = 2e-04
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 322 NHTCSHGGACMNHGATFSCLCADGWFGPLC 351
            + C +GG C+N   ++ C C  G+ G  C
Sbjct: 8   GNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37


>gnl|CDD|238058 cd00110, LamG, Laminin G domain; Laminin G-like domains are usually
           Ca++ mediated receptors that can have binding sites for
           steroids, beta1 integrins, heparin, sulfatides,
           fibulin-1, and alpha-dystroglycans. Proteins that
           contain LamG domains serve a variety of purposes
           including signal transduction via cell-surface steroid
           receptors, adhesion, migration and differentiation
           through mediation of cell adhesion molecules.
          Length = 151

 Score = 47.4 bits (113), Expect = 1e-06
 Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 3/71 (4%)

Query: 222 YTVRVGKNGQQCWLMVDNMGNVTSKSPGRLTQLNTKPMLYLGGHFSKNFSILPHDLPLHS 281
           ++V V +NG+   L VD    V S SPG    LN    LYLGG       +    LP+  
Sbjct: 83  HSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGL---PEDLKSPGLPVSP 139

Query: 282 GFSGCIFDVEL 292
           GF GCI D+++
Sbjct: 140 GFVGCIRDLKV 150



 Score = 44.3 bits (105), Expect = 2e-05
 Identities = 22/62 (35%), Positives = 28/62 (45%), Gaps = 1/62 (1%)

Query: 427 ISFSGRRSYISLPSSELHLHEACIDLEIRPTKDKGLLMYFGHPQKNSMMTLSLQGGVLEL 486
           +SFSG  SY+ LP+         I    R T   GLL+Y G       + L L+ G L L
Sbjct: 2   VSFSGS-SYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVL 60

Query: 487 RV 488
           R 
Sbjct: 61  RY 62



 Score = 33.2 bits (76), Expect = 0.12
 Identities = 17/67 (25%), Positives = 25/67 (37%), Gaps = 6/67 (8%)

Query: 76  SSFSAYVIPANIHHCFELKFRFVPNSFDQIALLAFIGQDYQHDAITDHLAVSFIKGYVVL 135
           SS+              + F F   +     LL + G     D     LA+    G +VL
Sbjct: 7   SSYVRLPTLPAPRTRLSISFSF--RTTSPNGLLLYAGSQNGGD----FLALELEDGRLVL 60

Query: 136 TWNLGSG 142
            ++LGSG
Sbjct: 61  RYDLGSG 67


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) presents in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at  least  one 
           is  present  in  most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 40.5 bits (95), Expect = 3e-05
 Identities = 17/34 (50%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 360 STRHNCSFGATCVPLTHSYECDCPPGRTGKF-CE 392
           +  + CS G TCV    SY C CPPG TG   CE
Sbjct: 3   AASNPCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36



 Score = 36.3 bits (84), Expect = 0.001
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 322 NHTCSHGGACMNHGATFSCLCADGWFGP 349
           ++ CS+GG C+N   ++ C+C  G+ G 
Sbjct: 5   SNPCSNGGTCVNTPGSYRCVCPPGYTGD 32


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 39.9 bits (94), Expect = 5e-05
 Identities = 17/38 (44%), Positives = 21/38 (55%), Gaps = 2/38 (5%)

Query: 356 NLCDSTRHNCSFGATCVPLTHSYECDCPPG-RTGKFCE 392
           + C S  + C  G TCV    SY C+CPPG   G+ CE
Sbjct: 3   DECAS-GNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39



 Score = 33.0 bits (76), Expect = 0.018
 Identities = 10/31 (32%), Positives = 16/31 (51%), Gaps = 1/31 (3%)

Query: 322 NHTCSHGGACMNHGATFSCLCADGW-FGPLC 351
            + C +GG C+N   ++ C C  G+  G  C
Sbjct: 8   GNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38


>gnl|CDD|215681 pfam00054, Laminin_G_1, Laminin G domain. 
          Length = 131

 Score = 42.3 bits (100), Expect = 6e-05
 Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 3/73 (4%)

Query: 222 YTVRVGKNGQQCWLMVDNMGNVTSKSP-GRLTQLNTKPMLYLGGHFSKNFSILPHDLPLH 280
           ++V + +NG+   L VD    VT +SP G  T L+    LY+GG    + ++    L + 
Sbjct: 56  HSVELERNGRSGTLSVDGEARVTGESPLGATTDLDVDGPLYVGG--LPSLAVKLRRLAIS 113

Query: 281 SGFSGCIFDVELS 293
             F GCI DV ++
Sbjct: 114 PSFDGCIRDVIVN 126


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
           between noise and signal. pfam00053 is very similar, but
           has 8 instead of 6 conserved cysteines. Includes some
           cytokine receptors. The EGF domain misses the N-terminus
           regions of the Ca2+ binding EGF domains (this is the
           main reason of discrepancy between swiss-prot domain
           start/end and Pfam). The family is hard to model due to
           many similar but different sub-types of EGF domains.
           Pfam certainly misses a number of EGF domains.
          Length = 32

 Score = 38.2 bits (89), Expect = 2e-04
 Identities = 15/33 (45%), Positives = 17/33 (51%), Gaps = 1/33 (3%)

Query: 358 CDSTRHNCSFGATCVPLTHSYECDCPPGRTGKF 390
           C    + CS G TCV     Y C+CP G TGK 
Sbjct: 1   CSPN-NPCSNGGTCVDTPGGYTCECPEGYTGKR 32



 Score = 36.3 bits (84), Expect = 0.001
 Identities = 10/31 (32%), Positives = 18/31 (58%), Gaps = 1/31 (3%)

Query: 320 CHNHT-CSHGGACMNHGATFSCLCADGWFGP 349
           C  +  CS+GG C++    ++C C +G+ G 
Sbjct: 1   CSPNNPCSNGGTCVDTPGGYTCECPEGYTGK 31


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 35.2 bits (81), Expect = 0.003
 Identities = 17/34 (50%), Positives = 20/34 (58%), Gaps = 2/34 (5%)

Query: 360 STRHNCSFGATCVPLTHSYECDCPPGRTG-KFCE 392
           ++   CS G TC+    SY C CPPG TG K CE
Sbjct: 3   ASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRCE 35



 Score = 30.9 bits (70), Expect = 0.080
 Identities = 9/27 (33%), Positives = 15/27 (55%), Gaps = 1/27 (3%)

Query: 322 NHTCSHGGACMNHGATFSCLCADGWFG 348
              CS+G  C+N   +++C C  G+ G
Sbjct: 5   GGPCSNGT-CINTPGSYTCSCPPGYTG 30


>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain. 
          Length = 42

 Score = 33.1 bits (76), Expect = 0.016
 Identities = 12/28 (42%), Positives = 13/28 (46%)

Query: 358 CDSTRHNCSFGATCVPLTHSYECDCPPG 385
           C    HNC     CV    S+EC CP G
Sbjct: 5   CADGTHNCPANTVCVNTIGSFECVCPDG 32



 Score = 30.0 bits (68), Expect = 0.21
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 323 HTCSHGGACMNHGATFSCLCADGW 346
           H C     C+N   +F C+C DG+
Sbjct: 10  HNCPANTVCVNTIGSFECVCPDGY 33


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 28.7 bits (65), Expect = 0.55
 Identities = 12/36 (33%), Positives = 13/36 (36%), Gaps = 2/36 (5%)

Query: 358 CDSTRHNCSFGATCVPLTHSYECDCPPGRT--GKFC 391
           C      C   ATC     S+ C C  G T  G  C
Sbjct: 1   CAENNGGCHPNATCTNTGGSFTCTCKSGYTGDGVTC 36


>gnl|CDD|204999 pfam12661, hEGF, Human growth factor-like EGF.  hEGF, or human
           growth factor-like EGF, domains have six conserved
           residues disulfide-bonded into the characteristic
           'ababcc' pattern. They are involved in growth and
           proliferation of cells, in proteins of the Notch/Delta
           pathway, neurogulin and selectins. hEGFs are also found
           in mosaic proteins with four-disulfide laminin EGFs such
           as aggrecan and perlecan. The core fold of the EGF
           domain consists of two small beta-hairpins packed
           against each other. Two major structural variants have
           been identified based on the structural context of the
           C-terminal Cys residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In hEGFs the
           C-terminal thiol resides in the beta-turn, resulting in
           shorter loop-lengths between the Cys residues of
           disulfide 'c', typically C[8-9]XC. These shorter
           loop-lengths are also typical of the four-disulfide EGF
           domains, laminin ad integrin. Tandem hEGF domains have
           six linking residues between terminal cysteines of
           adjacent domains. hEGF domains may or may not bind
           calcium in the linker region. hEGF domains with the
           consensus motif CXD4X[F,Y]XCXC are hydroxylated
           exclusively in the Asp residue.
          Length = 13

 Score = 28.1 bits (64), Expect = 0.68
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 379 ECDCPPGRTGKFC 391
           +C CPPG TG  C
Sbjct: 1   KCQCPPGYTGPRC 13


>gnl|CDD|116375 pfam07761, DUF1617, Protein of unknown function (DUF1617).  This is
           a family of sequences from hypothetical bacterial and
           bacteriophage proteins. The region in question is
           approximately 150 residues long and is highly conserved
           throughout the family.
          Length = 143

 Score = 30.7 bits (69), Expect = 0.81
 Identities = 13/52 (25%), Positives = 25/52 (48%), Gaps = 8/52 (15%)

Query: 226 VGKNGQQCWLMVDNMGNVTSKSPGRLTQLN-------TKPMLYLGGHFSKNF 270
             K+    ++ +D+ GN+    P +L +LN        + ++  GG +SK F
Sbjct: 57  AAKDDDDKFI-IDDHGNIKIADPAKLAELNDFLDELADEEIVIKGGEYSKRF 107


>gnl|CDD|219677 pfam07974, EGF_2, EGF-like domain.  This family contains EGF
           domains found in a variety of extracellular proteins.
          Length = 31

 Score = 28.2 bits (63), Expect = 0.86
 Identities = 10/31 (32%), Positives = 12/31 (38%), Gaps = 4/31 (12%)

Query: 325 CSHGGACMNHG----ATFSCLCADGWFGPLC 351
           CS  G C   G        C+C  G+ G  C
Sbjct: 1   CSASGICNGRGTCVRPCGKCVCDSGYQGATC 31



 Score = 26.6 bits (59), Expect = 2.7
 Identities = 10/32 (31%), Positives = 12/32 (37%), Gaps = 2/32 (6%)

Query: 360 STRHNCSFGATCVPLTHSYECDCPPGRTGKFC 391
           S    C+   TCV      +C C  G  G  C
Sbjct: 2   SASGICNGRGTCV--RPCGKCVCDSGYQGATC 31


>gnl|CDD|223157 COG0079, HisC, Histidinol-phosphate/aromatic aminotransferase and
           cobyric acid decarboxylase [Amino acid transport and
           metabolism].
          Length = 356

 Score = 31.5 bits (72), Expect = 0.88
 Identities = 14/70 (20%), Positives = 25/70 (35%), Gaps = 8/70 (11%)

Query: 45  INQLLSIIFTNFLPPDIEIGQASYSSSMSGLSSFSAYVIPANIHHCFELKFRFVPNSFDQ 104
            ++L+ ++   F+ P   +     +        FS Y I A +     +K        D 
Sbjct: 84  SDELIELLVRAFVEPGDTVLIPEPT--------FSMYEIAAQLAGAEVVKVPLKEFRLDL 135

Query: 105 IALLAFIGQD 114
            A+LA I   
Sbjct: 136 DAILAAIRDK 145


>gnl|CDD|220730 pfam10395, Utp8, Utp8 family.  Utp8 is an essential component of
           the nuclear tRNA export machinery in Saccharomyces
           cerevisiae. It is a tRNA binding protein that acts at a
           step between tRNA maturation /aminoacylation, and
           translocation of the tRNA across the nuclear pore
           complex.
          Length = 669

 Score = 30.0 bits (68), Expect = 3.1
 Identities = 11/34 (32%), Positives = 19/34 (55%), Gaps = 7/34 (20%)

Query: 27  EENDRIVNFQHYFDTNQPINQLLSIIFTNFLPPD 60
            ENDR ++        + ++Q+L +IF NF  P+
Sbjct: 450 TENDRFID-------PEFLDQVLDLIFGNFADPE 476


>gnl|CDD|214543 smart00180, EGF_Lam, Laminin-type epidermal growth factor-like
           domai. 
          Length = 46

 Score = 26.9 bits (60), Expect = 3.5
 Identities = 10/28 (35%), Positives = 16/28 (57%), Gaps = 2/28 (7%)

Query: 366 SFGATCVPLTHSYECDCPPGRTGKFCEK 393
           S   TC P   + +C+C P  TG+ C++
Sbjct: 8   SASGTCDP--DTGQCECKPNVTGRRCDR 33


>gnl|CDD|215558 PLN03073, PLN03073, ABC transporter F family; Provisional.
          Length = 718

 Score = 29.8 bits (67), Expect = 3.7
 Identities = 12/34 (35%), Positives = 15/34 (44%)

Query: 3   AHHLQSCPELTNPDNIKILGRHRQEENDRIVNFQ 36
                  P L+  D  KI  R R+EE  R V +Q
Sbjct: 113 KPEPDDGPLLSERDLAKIERRKRKEERQREVQYQ 146


>gnl|CDD|224036 COG1111, MPH1, ERCC4-like helicases [DNA replication,
           recombination, and repair].
          Length = 542

 Score = 29.6 bits (67), Expect = 4.4
 Identities = 15/49 (30%), Positives = 19/49 (38%), Gaps = 4/49 (8%)

Query: 27  EENDRIVNFQHYFDTNQPINQLLSIIFTNFLPPDIEIGQAS--YSSSMS 73
             + R++ F  Y DT + I   L  I          IGQAS      MS
Sbjct: 364 NGDSRVIVFTEYRDTAEEIVNFLKKIGIKA--RVRFIGQASREGDKGMS 410


>gnl|CDD|215633 PLN03210, PLN03210, Resistant to P. syringae 6; Provisional.
          Length = 1153

 Score = 29.5 bits (66), Expect = 5.6
 Identities = 19/53 (35%), Positives = 28/53 (52%), Gaps = 7/53 (13%)

Query: 412 PSSELHLIINESLSDISFSGRRSYISLPSSELHLH-------EACIDLEIRPT 457
           P + L  +++ SL+ +  S   S + LPSS  +LH       E CI+LE  PT
Sbjct: 768 PLTPLMTMLSPSLTRLFLSDIPSLVELPSSIQNLHKLEHLEIENCINLETLPT 820


>gnl|CDD|215767 pfam00171, Aldedh, Aldehyde dehydrogenase family.  This family of
           dehydrogenases act on aldehyde substrates. Members use
           NADP as a cofactor. The family includes the following
           members: The prototypical members are the aldehyde
           dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
           dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase
           EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28.
           Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27.
           Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9.
           Delta-1-pyrroline-5-carboxylate dehydrogenase EC:
           1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10.
           Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This
           family also includes omega crystallin, an eye lens
           protein from squid and octopus that has little aldehyde
           dehydrogenase activity.
          Length = 459

 Score = 29.0 bits (66), Expect = 7.1
 Identities = 16/49 (32%), Positives = 19/49 (38%), Gaps = 13/49 (26%)

Query: 282 GFSGCIF--DVE--------LSAGNVGINLYKTRAAEGR---GVGQCGT 317
           G +  +F  D+E        L AG V IN Y T   E     G  Q G 
Sbjct: 393 GLAAGVFTNDLERALRVARRLEAGMVWINDYTTGDPEALPFGGFKQSGF 441


>gnl|CDD|238012 cd00055, EGF_Lam, Laminin-type epidermal growth factor-like domain;
           laminins are the major noncollagenous components of
           basement membranes that mediate cell adhesion, growth
           migration, and differentiation; the laminin-type
           epidermal growth factor-like module occurs in tandem
           arrays; the domain contains 4 disulfide bonds (loops
           a-d) the first three resemble epidermal growth factor
           (EGF); the number of copies of this domain in the
           different forms of laminins is highly variable ranging
           from 3 up to 22 copies.
          Length = 50

 Score = 25.8 bits (57), Expect = 9.6
 Identities = 9/28 (32%), Positives = 15/28 (53%), Gaps = 2/28 (7%)

Query: 366 SFGATCVPLTHSYECDCPPGRTGKFCEK 393
           S    C P   + +C+C P  TG+ C++
Sbjct: 9   SLSGQCDP--GTGQCECKPNTTGRRCDR 34


>gnl|CDD|203773 pfam07819, PGAP1, PGAP1-like protein.  The sequences found in this
           family are similar to PGAP1. This is an endoplasmic
           reticulum membrane protein with a catalytic serine
           containing motif that is conserved in a number of
           lipases. PGAP1 functions as a GPI inositol-deacylase;
           this deacylation is important for the efficient
           transport of GPI-anchored proteins from the endoplasmic
           reticulum to the Golgi body.
          Length = 225

 Score = 28.1 bits (63), Expect = 9.8
 Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 7/49 (14%)

Query: 42  NQPINQLLSIIFTNFLPPD--IEIGQASYSSSMSGLSSFSAYVIPANIH 88
           N  I  +LS+  +N  PP   I IG      SM GL + +A  +P  I 
Sbjct: 67  NDAIRYILSLYNSNRPPPTSVILIGH-----SMGGLVARAALTLPNYIP 110


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.139    0.444 

Gapped
Lambda     K      H
   0.267   0.0715    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 25,090,470
Number of extensions: 2391920
Number of successful extensions: 1630
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1624
Number of HSP's successfully gapped: 49
Length of query: 500
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 399
Effective length of database: 6,457,848
Effective search space: 2576681352
Effective search space used: 2576681352
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (27.3 bits)