RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7854
         (153 letters)



>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 40.6 bits (95), Expect = 7e-06
 Identities = 15/31 (48%), Positives = 18/31 (58%)

Query: 81  ENPGICTNGACENLMGTYRCICNPGYEGDPT 111
            + G C+NG C N  G+Y C C PGY GD  
Sbjct: 3   ASGGPCSNGTCINTPGSYTCSCPPGYTGDKR 33



 Score = 24.4 bits (53), Expect = 5.6
 Identities = 10/15 (66%), Positives = 11/15 (73%)

Query: 127 ICSGGTCKNTPGSFQ 141
            CS GTC NTPGS+ 
Sbjct: 7   PCSNGTCINTPGSYT 21


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 38.0 bits (89), Expect = 5e-05
 Identities = 14/30 (46%), Positives = 17/30 (56%), Gaps = 1/30 (3%)

Query: 81  ENPGICTNGA-CENLMGTYRCICNPGYEGD 109
            +   C NG  C N +G+YRC C PGY G 
Sbjct: 6   ASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35



 Score = 29.5 bits (67), Expect = 0.089
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 117 DQNECLMDVMICSGGTCKNTPGSFQ 141
           D +EC       +GGTC NT GS++
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYR 25


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 38.0 bits (89), Expect = 6e-05
 Identities = 15/36 (41%), Positives = 19/36 (52%), Gaps = 4/36 (11%)

Query: 81  ENPGICTNGA-CENLMGTYRCICNPGYEGDPTGKLC 115
            +   C NG  C N +G+YRC C PGY     G+ C
Sbjct: 6   ASGNPCQNGGTCVNTVGSYRCECPPGYT---DGRNC 38



 Score = 29.1 bits (66), Expect = 0.11
 Identities = 12/26 (46%), Positives = 15/26 (57%), Gaps = 2/26 (7%)

Query: 117 DQNECLMDVMICS-GGTCKNTPGSFQ 141
           D +EC      C  GGTC NT GS++
Sbjct: 1   DIDECASGN-PCQNGGTCVNTVGSYR 25


>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like.  cEGF, or complement
           Clr-like EGF, domains have six conserved cysteine
           residues disulfide-bonded into the characteristic
           pattern 'ababcc'. They are found in blood coagulation
           proteins such as fibrillin, Clr and Cls, thrombomodulin,
           and the LDL receptor. The core fold of the EGF domain
           consists of two small beta-hairpins packed against each
           other. Two major structural variants have been
           identified based on the structural context of the
           C-terminal cysteine residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
           C-terminal thiol resides on the C-terminal beta-sheet,
           resulting in long loop-lengths between the cysteine
           residues of disulfide 'c', typically C[10+]XC. These
           longer loop-lengths may have arisen by selective
           cysteine loss from a four-disulfide EGF template such as
           laminin or integrin. Tandem cEGF domains have five
           linking residues between terminal cysteines of adjacent
           domains. cEGF domains may or may not bind calcium in the
           linker region. cEGF domains with the consensus motif
           CXN4X[F,Y]XCXC are hydroxylated exclusively on the
           asparagine residue.
          Length = 24

 Score = 36.3 bits (85), Expect = 2e-04
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 98  YRCICNPGYEGDPTGKLCSDQNE 120
           Y C C PGY+    G+ C D +E
Sbjct: 2   YTCSCPPGYQLSGDGRTCEDIDE 24


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) presents in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at  least  one 
           is  present  in  most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 36.3 bits (84), Expect = 2e-04
 Identities = 15/32 (46%), Positives = 19/32 (59%), Gaps = 1/32 (3%)

Query: 81  ENPGICTNGA-CENLMGTYRCICNPGYEGDPT 111
                C+NG  C N  G+YRC+C PGY GD +
Sbjct: 3   AASNPCSNGGTCVNTPGSYRCVCPPGYTGDRS 34



 Score = 24.7 bits (54), Expect = 3.8
 Identities = 12/27 (44%), Positives = 16/27 (59%), Gaps = 6/27 (22%)

Query: 115 CSDQNECLMDVMICSGGTCKNTPGSFQ 141
           C+  N C       +GGTC NTPGS++
Sbjct: 2   CAASNPCS------NGGTCVNTPGSYR 22


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 34.8 bits (81), Expect = 8e-04
 Identities = 16/37 (43%), Positives = 18/37 (48%), Gaps = 3/37 (8%)

Query: 80  TENPGIC-TNGACENLMGTYRCICNPGYEGDPTGKLC 115
            EN G C  N  C N  G++ C C  GY GD  G  C
Sbjct: 2   AENNGGCHPNATCTNTGGSFTCTCKSGYTGD--GVTC 36


>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain. 
          Length = 42

 Score = 33.9 bits (78), Expect = 0.003
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 88  NGACENLMGTYRCICNPGYEGDPTGKLC 115
           N  C N +G++ C+C  GYE +  G  C
Sbjct: 15  NTVCVNTIGSFECVCPDGYENNEDGTNC 42


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
           between noise and signal. pfam00053 is very similar, but
           has 8 instead of 6 conserved cysteines. Includes some
           cytokine receptors. The EGF domain misses the N-terminus
           regions of the Ca2+ binding EGF domains (this is the
           main reason of discrepancy between swiss-prot domain
           start/end and Pfam). The family is hard to model due to
           many similar but different sub-types of EGF domains.
           Pfam certainly misses a number of EGF domains.
          Length = 32

 Score = 28.6 bits (64), Expect = 0.15
 Identities = 11/25 (44%), Positives = 13/25 (52%), Gaps = 1/25 (4%)

Query: 86  CTNGA-CENLMGTYRCICNPGYEGD 109
           C+NG  C +  G Y C C  GY G 
Sbjct: 7   CSNGGTCVDTPGGYTCECPEGYTGK 31


>gnl|CDD|224454 COG1537, PelA, Predicted RNA-binding proteins [General function
           prediction only].
          Length = 352

 Score = 31.1 bits (71), Expect = 0.18
 Identities = 19/73 (26%), Positives = 26/73 (35%), Gaps = 25/73 (34%)

Query: 12  EGRLRIKGV----------SKHSRLAVALRRSISI--DEFTVHSSTLIALSQESQDLERL 59
             RLRIKG             +  + V +   I I  +E+                LERL
Sbjct: 75  ANRLRIKGPIVEGPEEVVKGSYHTINVTIGTEIEIEKEEW------------NKDQLERL 122

Query: 60  KFGLEPPTSRSEV 72
           K  +E  + R EV
Sbjct: 123 KEAVE-ASKRPEV 134


>gnl|CDD|213325 cd12117, A_NRPS_Srf_like, The adenylation domain of nonribosomal
          peptide synthetases (NRPS), including Bacillus subtilis
          termination module Surfactin (SrfA-C).  The adenylation
          (A) domain of NRPS recognizes a specific amino acid or
          hydroxy acid and activates it as an (amino) acyl
          adenylate by hydrolysis of ATP. The activated acyl
          moiety then forms a thioester to the enzyme-bound
          cofactor phosphopantetheine of a peptidyl carrier
          protein domain. NRPSs are large multifunctional enzymes
          which synthesize many therapeutically useful peptides
          in bacteria and fungi via a template-directed, nucleic
          acid independent nonribosomal mechanism. These natural
          products include antibiotics, immunosuppressants, plant
          and animal toxins, and enzyme inhibitors. NRPS has a
          distinct modular structure in which each module is
          responsible for the recognition, activation, and, in
          some cases, modification of a single amino acid residue
          of the final peptide product. The modules can be
          subdivided into domains that catalyze specific
          biochemical reactions. This family includes the
          adenylation domain of the Bacillus subtilis termination
          module (Surfactin domain, SrfA-C) which recognizes a
          specific amino acid building block, which is then
          activated and transferred to the terminal thiol of the
          4'-phosphopantetheine (Ppan) arm of the downstream
          peptidyl carrier protein (PCP) domain.
          Length = 474

 Score = 30.2 bits (69), Expect = 0.39
 Identities = 13/30 (43%), Positives = 16/30 (53%), Gaps = 1/30 (3%)

Query: 4  NANANMLTEGRLRIKGVSKHSRLAVALRRS 33
          N  AN L   RLR +GV     +A+ L RS
Sbjct: 19 NERANRLA-RRLRARGVGPGDVVALLLERS 47


>gnl|CDD|236075 PRK07683, PRK07683, aminotransferase A; Validated.
          Length = 387

 Score = 29.7 bits (67), Expect = 0.49
 Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 16/76 (21%)

Query: 35  SIDE-FT--VHSSTLIALSQESQDLERLKFGLEPPTSRSEVHGANHHTTENPGICTNGAC 91
           +I E +T   H++ L+ L + + +  + K+ L             H++ E+  I T GA 
Sbjct: 53  AITENYTSYTHNAGLLELRKAACNFVKDKYDL-------------HYSPESEIIVTIGAS 99

Query: 92  ENLMGTYRCICNPGYE 107
           E +   +R I  PG E
Sbjct: 100 EAIDIAFRTILEPGTE 115


>gnl|CDD|237054 PRK12316, PRK12316, peptide synthase; Provisional.
          Length = 5163

 Score = 29.2 bits (65), Expect = 1.2
 Identities = 13/33 (39%), Positives = 19/33 (57%), Gaps = 1/33 (3%)

Query: 4    NANANMLTEGRLRIKGVSKHSRLAVALRRSISI 36
            ++ AN L   RLR +GV    R+A+A  RS  +
Sbjct: 2035 DSRANRLAH-RLRARGVGPEVRVAIAAERSFEL 2066


>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein.  This
           family consists of REP proteins from Dictyostelium
           (Slime molds). REP protein is likely involved in
           transcription regulation and control of DNA replication,
           specifically amplification of plasmid at low copy
           numbers. The formation of homomultimers may be required
           for their regulatory activity.
          Length = 910

 Score = 28.6 bits (64), Expect = 1.3
 Identities = 18/60 (30%), Positives = 30/60 (50%), Gaps = 12/60 (20%)

Query: 16  RIKGVSKHSRLAVALR--------RSISIDEFTVHSSTLIALSQE----SQDLERLKFGL 63
           +IK + K  +L ++++        RSIS D F   S+ L+ +SQ       D+ +LK  L
Sbjct: 481 KIKSILKFIQLCISMKDINGFISIRSISFDTFKSISNELLGMSQRLLNLEADVRKLKDLL 540


>gnl|CDD|165214 PHA02887, PHA02887, EGF-like protein; Provisional.
          Length = 126

 Score = 27.2 bits (60), Expect = 2.3
 Identities = 12/25 (48%), Positives = 14/25 (56%), Gaps = 2/25 (8%)

Query: 86  CTNGACENLMGTYR--CICNPGYEG 108
           C NG C N++      CICN GY G
Sbjct: 94  CINGECMNIIDLDEKFCICNKGYTG 118


>gnl|CDD|213296 cd05930, A_NRPS, The adenylation domain of nonribosomal peptide
          synthetases (NRPS).  The adenylation (A) domain of NRPS
          recognizes a specific amino acid or hydroxy acid and
          activates it as an (amino) acyl adenylate by hydrolysis
          of ATP. The activated acyl moiety then forms a
          thioester bond to the enzyme-bound cofactor
          phosphopantetheine of a peptidyl carrier protein
          domain. NRPSs are large multifunctional enzymes which
          synthesize many therapeutically useful peptides in
          bacteria and fungi via a template-directed, nucleic
          acid independent nonribosomal mechanism. These natural
          products include antibiotics, immunosuppressants, plant
          and animal toxins, and enzyme inhibitors. NRPS has a
          distinct modular structure in which each module is
          responsible for the recognition, activation, and in
          some cases, modification of a single amino acid residue
          of the final peptide product. The modules can be
          subdivided into domains that catalyze specific
          biochemical reactions.
          Length = 445

 Score = 27.0 bits (61), Expect = 4.7
 Identities = 12/31 (38%), Positives = 15/31 (48%), Gaps = 1/31 (3%)

Query: 4  NANANMLTEGRLRIKGVSKHSRLAVALRRSI 34
          N  AN L    LR +GV     +A+ L RS 
Sbjct: 19 NERANRLA-HYLRARGVGPGDLVAICLERSP 48


>gnl|CDD|188999 cd06461, M2_ACE, Peptidase family M2 Angiotensin converting enzyme
           (ACE).  Peptidase family M2 Angiotensin converting
           enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc
           dependent dipeptidase that catalyzes the conversion of
           the decapeptide angiotensin I to the potent vasopressor
           octapeptide angiotensin II, by removing two C-terminal
           amino acids. There are two forms of the enzyme in
           humans, the ubiquitous somatic ACE and the
           sperm-specific germinal ACE, both encoded by the same
           gene through transcription from alternative promoters.
           Somatic ACE has two tandem active sites with distinct
           catalytic properties, whereas germinal ACE, the function
           of which is largely unknown, has just a single active
           site. Recently, an ACE homolog, ACE2, has been
           identified in humans that differs from ACE; it
           preferentially removes carboxy-terminal hydrophobic or
           basic amino acids and appears to be important in cardiac
           function. ACE homologs (also known as members of the M2
           gluzincin family) have been found in a wide variety of
           species, including those that neither have a
           cardiovascular system nor synthesize angiotensin. ACE is
           well-known as a key part of the renin-angiotensin system
           that regulates blood pressure and ACE inhibitors are
           important for the treatment of hypertension.
          Length = 562

 Score = 26.8 bits (60), Expect = 5.7
 Identities = 11/25 (44%), Positives = 15/25 (60%), Gaps = 4/25 (16%)

Query: 58  RLKF-GLEPPTSRSEVH---GANHH 78
           R K+ G++PP  RSE     GA +H
Sbjct: 444 REKYQGVKPPVPRSETDFDPGAKYH 468


>gnl|CDD|204999 pfam12661, hEGF, Human growth factor-like EGF.  hEGF, or human
           growth factor-like EGF, domains have six conserved
           residues disulfide-bonded into the characteristic
           'ababcc' pattern. They are involved in growth and
           proliferation of cells, in proteins of the Notch/Delta
           pathway, neurogulin and selectins. hEGFs are also found
           in mosaic proteins with four-disulfide laminin EGFs such
           as aggrecan and perlecan. The core fold of the EGF
           domain consists of two small beta-hairpins packed
           against each other. Two major structural variants have
           been identified based on the structural context of the
           C-terminal Cys residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In hEGFs the
           C-terminal thiol resides in the beta-turn, resulting in
           shorter loop-lengths between the Cys residues of
           disulfide 'c', typically C[8-9]XC. These shorter
           loop-lengths are also typical of the four-disulfide EGF
           domains, laminin ad integrin. Tandem hEGF domains have
           six linking residues between terminal cysteines of
           adjacent domains. hEGF domains may or may not bind
           calcium in the linker region. hEGF domains with the
           consensus motif CXD4X[F,Y]XCXC are hydroxylated
           exclusively in the Asp residue.
          Length = 13

 Score = 24.2 bits (54), Expect = 5.8
 Identities = 6/10 (60%), Positives = 6/10 (60%)

Query: 100 CICNPGYEGD 109
           C C PGY G 
Sbjct: 2   CQCPPGYTGP 11


>gnl|CDD|181036 PRK07568, PRK07568, aspartate aminotransferase; Provisional.
          Length = 397

 Score = 26.7 bits (60), Expect = 5.9
 Identities = 9/32 (28%), Positives = 14/32 (43%)

Query: 76  NHHTTENPGICTNGACENLMGTYRCICNPGYE 107
                 +  + TNG  E ++     IC+PG E
Sbjct: 83  GIDVEPDEILITNGGSEAILFAMMAICDPGDE 114


>gnl|CDD|213324 cd12116, A_NRPS_Ta1_like, The adenylation domain of nonribosomal
          peptide synthetases (NRPS), including salinosporamide A
          polyketide synthase.  The adenylation (A) domain of
          NRPS recognizes a specific amino acid or hydroxy acid
          and activates it as an (amino) acyl adenylate by
          hydrolysis of ATP. The activated acyl moiety then forms
          a thioester to the enzyme-bound cofactor
          phosphopantetheine of a peptidyl carrier protein
          domain. NRPSs are large multifunctional enzymes which
          synthesize many therapeutically useful peptides in
          bacteria and fungi via a template-directed, nucleic
          acid independent nonribosomal mechanism. These natural
          products include antibiotics, immunosuppressants, plant
          and animal toxins, and enzyme inhibitors. NRPS has a
          distinct modular structure in which each module is
          responsible for the recognition, activation, and in
          some cases, modification of a single amino acid residue
          of the final peptide product. The modules can be
          subdivided into domains that catalyze specific
          biochemical reactions. This family includes the
          myxovirescin (TA) antibiotic biosynthetic gene in
          Myxococcus xanthus; TA production plays a role in
          predation. It also includes the salinosporamide A
          polyketide synthase which is involved in the
          biosynthesis of salinosporamide A, a marine microbial
          metabolite whose chlorine atom is crucial for potent
          proteasome inhibition and anticancer activity.
          Length = 438

 Score = 26.4 bits (59), Expect = 6.7
 Identities = 10/23 (43%), Positives = 12/23 (52%)

Query: 14 RLRIKGVSKHSRLAVALRRSISI 36
          RLR  GV    R+ V L RS  +
Sbjct: 28 RLRALGVGPGDRVGVLLERSADL 50


>gnl|CDD|225281 COG2425, COG2425, Uncharacterized protein containing a von
          Willebrand factor type A (vWA) domain [General function
          prediction only].
          Length = 437

 Score = 26.6 bits (59), Expect = 6.8
 Identities = 11/53 (20%), Positives = 16/53 (30%)

Query: 12 EGRLRIKGVSKHSRLAVALRRSISIDEFTVHSSTLIALSQESQDLERLKFGLE 64
           G ++        R    LR  ++ D F  H    +   QES         L 
Sbjct: 24 FGIVKYILKKLIPRWREILRERLAADAFYPHLLEEVFRYQESNSEPEFLVQLP 76


>gnl|CDD|213262 cd03295, ABC_OpuCA_Osmoprotection, ATP-binding cassette domain of
          the osmoprotectant transporter.  OpuCA is a the ATP
          binding component of a bacterial solute transporter
          that serves a protective role to cells growing in a
          hyperosmolar environment. ABC (ATP-binding cassette)
          transporter nucleotide-binding domain; ABC transporters
          are a large family of proteins involved in the
          transport of a wide variety of different compounds,
          like sugars, ions, peptides, and more complex organic
          molecules. The nucleotide binding domain shows the
          highest similarity between all members of the family.
          ABC transporters are a subset of nucleotide hydrolases
          that contain a signature motif, Q-loop, and
          H-loop/switch region, in addition, to the Walker A
          motif/P-loop and Walker B motif commonly found in a
          number of ATP- and GTP-binding and hydrolyzing
          proteins.
          Length = 242

 Score = 26.1 bits (58), Expect = 7.1
 Identities = 9/24 (37%), Positives = 11/24 (45%)

Query: 11 TEGRLRIKGVSKHSRLAVALRRSI 34
          T G + I G     +  V LRR I
Sbjct: 54 TSGEIFIDGEDIREQDPVELRRKI 77


>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
           Pvs28.  This family consists of several ookinete surface
           protein (Pvs28) from several species of Plasmodium.
           Pvs25 and Pvs28 are expressed on the surface of
           ookinetes. These proteins are potential candidates for
           vaccine and induce antibodies that block the infectivity
           of Plasmodium vivax in immunised animals.
          Length = 196

 Score = 25.9 bits (57), Expect = 9.4
 Identities = 9/22 (40%), Positives = 11/22 (50%)

Query: 85  ICTNGACENLMGTYRCICNPGY 106
           IC NG    +   + C CN GY
Sbjct: 7   ICKNGYLIQMSNHFECKCNEGY 28


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.131    0.394 

Gapped
Lambda     K      H
   0.267   0.0632    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,182,725
Number of extensions: 590918
Number of successful extensions: 473
Number of sequences better than 10.0: 1
Number of HSP's gapped: 473
Number of HSP's successfully gapped: 36
Length of query: 153
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 64
Effective length of database: 6,990,096
Effective search space: 447366144
Effective search space used: 447366144
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 55 (25.2 bits)