RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy590
         (257 letters)



>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 31.0 bits (71), Expect = 0.039
 Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 110 PCAPNAVCINKNHATDCSCLEGFQGNGFV 138
            C PNA C N   +  C+C  G+ G+G  
Sbjct: 7   GCHPNATCTNTGGSFTCTCKSGYTGDGVT 35



 Score = 26.0 bits (58), Expect = 2.3
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query: 178 CGEHAICIPKNHGIDCQCQSGHQGN 202
           C  +A C        C C+SG+ G+
Sbjct: 8   CHPNATCTNTGGSFTCTCKSGYTGD 32


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) presents in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at  least  one 
           is  present  in  most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 29.8 bits (67), Expect = 0.14
 Identities = 10/33 (30%), Positives = 16/33 (48%)

Query: 104 PCAVHSPCAPNAVCINKNHATDCSCLEGFQGNG 136
            CA  +PC+    C+N   +  C C  G+ G+ 
Sbjct: 1   ECAASNPCSNGGTCVNTPGSYRCVCPPGYTGDR 33



 Score = 26.3 bits (58), Expect = 2.2
 Identities = 10/31 (32%), Positives = 14/31 (45%)

Query: 41 CAIASLCAPSAKCHVVKHRPVCTCPQGYEGN 71
          CA ++ C+    C        C CP GY G+
Sbjct: 2  CAASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 27.6 bits (62), Expect = 0.69
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 105 CAVHSPCAPNAVCINKNHATDCSCLEGFQG 134
           CA  +PC     C+N   +  CSC  G+ G
Sbjct: 5   CASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34



 Score = 25.7 bits (57), Expect = 3.3
 Identities = 11/32 (34%), Positives = 13/32 (40%)

Query: 39 DLCAIASLCAPSAKCHVVKHRPVCTCPQGYEG 70
          D CA  + C     C        C+CP GY G
Sbjct: 3  DECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34


>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family.  In
           Brassicaceae, self-incompatible plants have a
           self/non-self recognition system. This is
           sporophytically controlled by multiple alleles at a
           single locus (S). S-locus glycoproteins, as well as
           S-receptor kinases, are in linkage with the S-alleles.
          Length = 110

 Score = 28.4 bits (64), Expect = 1.4
 Identities = 10/32 (31%), Positives = 14/32 (43%), Gaps = 1/32 (3%)

Query: 38  EDLCAIASLCAPSAKCHVVKHRPVCTCPQGYE 69
           +D C +   C P   C  V   P C C +G+ 
Sbjct: 77  KDQCDVYGRCGPYGYC-DVNTSPKCNCIKGFV 107


>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
          Length = 1052

 Score = 30.0 bits (67), Expect = 1.5
 Identities = 12/34 (35%), Positives = 13/34 (38%), Gaps = 4/34 (11%)

Query: 207 CLSELICLCHLY----TEEPWHRLPMPERASGQP 236
           CL     L H +    TEE WH L     AS   
Sbjct: 690 CLQTSYRLLHPFMPFITEELWHVLRAQVGASAWA 723


>gnl|CDD|177468 PHA02686, PHA02686, hypothetical protein; Provisional.
          Length = 138

 Score = 28.4 bits (63), Expect = 2.1
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 212 ICLCHLYTEEPWHRLPMPERASGQ 235
           +CLC L  E P  R P  + ASG+
Sbjct: 22  VCLCQLRGEAPERRFPALDEASGE 45


>gnl|CDD|200537 cd11276, Sema_plexin_B2, The Sema domain, a protein interacting
           module, of Plexin B2.  Plexin B2 serves as the receptor
           of Sema4C and Sema4G. By signaling the effect of Sema4C
           and Sema4G, the plexin B2 receptor plays important roles
           in neural tube closure and cerebellar granule cell
           development. Mice lacking Plexin B2 demonstrated defects
           in closure of the neural tube and disorganization of the
           embryonic brain. In developing kidney, Sema4C-Plexin B2
           signaling modulates ureteric branching. Plexin B2 is
           expressed both in the pretubular aggregates and the
           ureteric epithelium in the developing kidney. Deletion
           of Plexin B2 results in renal hypoplasia and occasional
           double ureters. The Sema domain is located at the
           N-terminus and contains four disulfide bonds formed by
           eight conserved cysteine residues. It serves as a
           ligand-recognition and -binding module.
          Length = 449

 Score = 29.0 bits (65), Expect = 3.0
 Identities = 17/66 (25%), Positives = 24/66 (36%), Gaps = 7/66 (10%)

Query: 30  EACYSGQCEDLCAIASLCAPSAKCHVVKHRPVCTCPQGYEGNPATKCYLPNPLGCVSNEE 89
           EACY+G  +D                  H+      +  +  P    +LP PLG  S +E
Sbjct: 309 EACYTGTIDDRDVFYKPFHSQKDIICGSHQ-----QKNSKSFPCGSEHLPYPLG--SRDE 361

Query: 90  CASTEA 95
            A T  
Sbjct: 362 LALTAP 367


>gnl|CDD|181145 PRK07850, PRK07850, acetyl-CoA acetyltransferase; Provisional.
          Length = 387

 Score = 28.1 bits (63), Expect = 5.0
 Identities = 10/18 (55%), Positives = 11/18 (61%)

Query: 191 IDCQCQSGHQGNPYIACL 208
           IDCQC S  Q N  +A L
Sbjct: 85  IDCQCGSAQQANHLVAGL 102


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 27.9 bits (62), Expect = 6.4
 Identities = 11/43 (25%), Positives = 18/43 (41%), Gaps = 7/43 (16%)

Query: 91  ASTEACINSQCQKP---CAVHSPCAPNAVCI----NKNHATDC 126
             T AC+  +C+     C  +  CA N  C+     +   T+C
Sbjct: 111 LKTCACLLKECRLELAKCIANPKCAANVACLQTCNGRPDETEC 153


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 24.8 bits (54), Expect = 7.7
 Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 1/32 (3%)

Query: 104 PCAVHSPCAPNAVCINKNHATDCSCLEGFQGN 135
            CA   PC  N  CIN   +  CSC  G+ G+
Sbjct: 1   ECASGGPC-SNGTCINTPGSYTCSCPPGYTGD 31


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 24.9 bits (55), Expect = 7.8
 Identities = 9/29 (31%), Positives = 13/29 (44%)

Query: 105 CAVHSPCAPNAVCINKNHATDCSCLEGFQ 133
           CA  +PC     C+N   +  C C  G+ 
Sbjct: 5   CASGNPCQNGGTCVNTVGSYRCECPPGYT 33


>gnl|CDD|215505 PLN02935, PLN02935, Bifunctional NADH kinase/NAD(+) kinase.
          Length = 508

 Score = 27.5 bits (61), Expect = 9.2
 Identities = 11/62 (17%), Positives = 23/62 (37%), Gaps = 4/62 (6%)

Query: 141 LPVRNYQPICQYNEDCPPDKLCDRLNRICINPCQEDSCGEHAIC---IPKNHGIDCQCQS 197
           L  +      + NE+    +L +  N+  +     + CG   IC   + ++   D   +S
Sbjct: 81  LEHKELSS-GECNEESNDQRLENLANQPMLYNEAINCCGMEGICSHEVLQDGSTDSDNRS 139

Query: 198 GH 199
             
Sbjct: 140 VL 141


>gnl|CDD|239590 cd03513, CrtW_beta-carotene-ketolase, Beta-carotene
           ketolase/oxygenase (CrtW, also known as CrtO), the
           carotenoid astaxanthin biosynthetic enzyme, initially
           catalyzes the addition of two keto groups to carbons C4
           and C4' of beta-carotene. Carotenoids are important
           natural pigments produced by many microorganisms and
           plants. Astaxanthin is reported to be an antioxidant, an
           anti-cancer agent, and an immune system stimulant. A
           number of bacteria and green algae can convert
           beta-carotene into astaxanthin by using several
           ketocarotenoids as intermediates and CrtW and a
           beta-carotene hydroxylase (CrtZ). CrtW initially
           converts beta-carotene to canthaxanthin via echinenone,
           and CrtZ initially mediates the conversion of
           beta-carotene to zeaxanthin via beta-cryptoxanthin.
           After a few more intermediates are formed, CrtW and CrtZ
           act in combination to produce astaxanthin. Sequences of
           this domain family appear to be structurally related to
           membrane fatty acid desaturases and alkane hydroxylases.
           They all share in common extensive hydrophobic regions
           that are capable of spanning the membrane bilayer at
           least twice. Comparison of these sequences also reveals
           three regions of conserved histidine cluster motifs that
           contain eight histidine residues: HXXXH, HXXHH, and
           HXXHH. These histidine residues are reported to be
           catalytically essential and proposed to be the ligands
           for the iron atoms contained within homologs, stearoyl
           CoA desaturase and alkane hydroxylase.
          Length = 225

 Score = 26.9 bits (60), Expect = 9.9
 Identities = 7/12 (58%), Positives = 7/12 (58%)

Query: 216 HLYTEEPWHRLP 227
           HL    PW RLP
Sbjct: 211 HLSPSTPWWRLP 222


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.137    0.480 

Gapped
Lambda     K      H
   0.267   0.0632    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,797,190
Number of extensions: 991470
Number of successful extensions: 692
Number of sequences better than 10.0: 1
Number of HSP's gapped: 676
Number of HSP's successfully gapped: 58
Length of query: 257
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 162
Effective length of database: 6,723,972
Effective search space: 1089283464
Effective search space used: 1089283464
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 58 (26.3 bits)