RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13157
         (1434 letters)



>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
            EGF-like domain homologues. This family includes the
            C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 29.4 bits (67), Expect = 0.86
 Identities = 15/28 (53%), Positives = 15/28 (53%)

Query: 1132 PGTCGQNANCKVINHSPICTCKPGYTGD 1159
             G C  NA C     S  CTCK GYTGD
Sbjct: 5    NGGCHPNATCTNTGGSFTCTCKSGYTGD 32



 Score = 29.4 bits (67), Expect = 0.96
 Identities = 14/28 (50%), Positives = 15/28 (53%)

Query: 257 PGTCGQNANCRVINHSPICTCKPGFTGD 284
            G C  NA C     S  CTCK G+TGD
Sbjct: 5   NGGCHPNATCTNTGGSFTCTCKSGYTGD 32



 Score = 26.7 bits (60), Expect = 8.4
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 72 PGSCGQNANCRVINHSPVCSCKPGFTG 98
           G C  NA C     S  C+CK G+TG
Sbjct: 5  NGGCHPNATCTNTGGSFTCTCKSGYTG 31



 Score = 26.7 bits (60), Expect = 8.4
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 574 PGSCGQNANCRVINHSPVCSCKPGFTG 600
            G C  NA C     S  C+CK G+TG
Sbjct: 5   NGGCHPNATCTNTGGSFTCTCKSGYTG 31



 Score = 26.7 bits (60), Expect = 8.4
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 1006 PGSCGQNANCRVINHSPVCSCKPGFTG 1032
             G C  NA C     S  C+CK G+TG
Sbjct: 5    NGGCHPNATCTNTGGSFTCTCKSGYTG 31


>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
            modification].
          Length = 2365

 Score = 33.0 bits (75), Expect = 1.4
 Identities = 15/43 (34%), Positives = 20/43 (46%), Gaps = 12/43 (27%)

Query: 1168 PPPPPPQEPICTCKPGYTGDALSYCNRIPPPPPPQDDVPEPVN 1210
            PPPPPP        PG+   +     + PPPPPP  +V +   
Sbjct: 9    PPPPPP-------PPGFEPPS-----QPPPPPPPGVNVKKRSR 39



 Score = 31.1 bits (70), Expect = 6.3
 Identities = 12/29 (41%), Positives = 14/29 (48%)

Query: 1196 PPPPPPQDDVPEPVNPCYPSPCGLYSECR 1224
            PPPPPP      P  P  P P G+  + R
Sbjct: 9    PPPPPPPPGFEPPSQPPPPPPPGVNVKKR 37


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 32.7 bits (75), Expect = 1.7
 Identities = 15/46 (32%), Positives = 19/46 (41%), Gaps = 17/46 (36%)

Query: 1166 RIPPPPPPQEPICTCKPGYTGDALSYCNRIPPPPPPQDDVPEPVNP 1211
            R+P PP P E     +P             PPPPPP +D  +P   
Sbjct: 257  RLPAPPQPPEE----EP-------------PPPPPPPEDDDDPPED 285


>gnl|CDD|201524 pfam00954, S_locus_glycop, S-locus glycoprotein family.  In
           Brassicaceae, self-incompatible plants have a
           self/non-self recognition system. This is
           sporophytically controlled by multiple alleles at a
           single locus (S). S-locus glycoproteins, as well as
           S-receptor kinases, are in linkage with the S-alleles.
          Length = 110

 Score = 30.7 bits (70), Expect = 1.8
 Identities = 11/20 (55%), Positives = 14/20 (70%), Gaps = 1/20 (5%)

Query: 313 CGPYAQCRDINGSPSCSCLP 332
           CGPY  C D+N SP C+C+ 
Sbjct: 86  CGPYGYC-DVNTSPKCNCIK 104



 Score = 28.8 bits (65), Expect = 6.8
 Identities = 11/24 (45%), Positives = 12/24 (50%), Gaps = 1/24 (4%)

Query: 258 GTCGQNANCRVINHSPICTCKPGF 281
           G CG    C  +N SP C C  GF
Sbjct: 84  GRCGPYGYC-DVNTSPKCNCIKGF 106



 Score = 28.8 bits (65), Expect = 7.2
 Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 3/32 (9%)

Query: 67  PEHPCP--GSCGQNANCRVINHSPVCSCKPGF 96
           P+  C   G CG    C  +N SP C+C  GF
Sbjct: 76  PKDQCDVYGRCGPYGYC-DVNTSPKCNCIKGF 106



 Score = 28.4 bits (64), Expect = 9.0
 Identities = 11/24 (45%), Positives = 13/24 (54%), Gaps = 1/24 (4%)

Query: 575 GSCGQNANCRVINHSPVCSCKPGF 598
           G CG    C  +N SP C+C  GF
Sbjct: 84  GRCGPYGYC-DVNTSPKCNCIKGF 106



 Score = 28.4 bits (64), Expect = 9.0
 Identities = 11/24 (45%), Positives = 13/24 (54%), Gaps = 1/24 (4%)

Query: 1007 GSCGQNANCRVINHSPVCSCKPGF 1030
            G CG    C  +N SP C+C  GF
Sbjct: 84   GRCGPYGYC-DVNTSPKCNCIKGF 106


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
            model represents one of two ATPase subunits of the
            trimeric magnesium chelatase responsible for insertion of
            magnesium ion into protoporphyrin IX. This is an
            essential step in the biosynthesis of both chlorophyll
            and bacteriochlorophyll. This subunit is found in green
            plants, photosynthetic algae, cyanobacteria and other
            photosynthetic bacteria. Unlike subunit I (TIGR02030),
            this subunit is not found in archaea [Biosynthesis of
            cofactors, prosthetic groups, and carriers, Chlorophyll
            and bacteriochlorphyll].
          Length = 589

 Score = 32.5 bits (74), Expect = 1.8
 Identities = 13/43 (30%), Positives = 14/43 (32%), Gaps = 19/43 (44%)

Query: 1169 PPPPPQEPICTCKPGYTGDALSYCNRIPPPPPPQDDVPEPVNP 1211
            P P PQ P                   PPPPPP+   PE    
Sbjct: 267  PEPEPQPP-------------------PPPPPPEPPEPEEEPD 290


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 32.2 bits (73), Expect = 3.0
 Identities = 25/105 (23%), Positives = 30/105 (28%), Gaps = 15/105 (14%)

Query: 610  PPRPPPQEDVPEPVNPC--YPSPCGPYSQCRDIGGSPSCS--------CLPNYIGSPPN- 658
             P PPP    P    P    P    P  + RD       S               SPP  
Sbjct: 2623 APDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQR 2682

Query: 659  CRPECVMNSECPSHEASRPPPQEDVPEP----VNPCYPSPCGPYS 699
             R      +       + PPP    PEP    +    P P GP +
Sbjct: 2683 PRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAA 2727



 Score = 30.3 bits (68), Expect = 9.4
 Identities = 19/93 (20%), Positives = 27/93 (29%), Gaps = 1/93 (1%)

Query: 600  GEPRIRCNKIPPRPPPQEDVPEPVNPCYPSPCGPYSQCRDIGGSPSCSCLPNYIGSPPNC 659
               R    + PP P PQ   P P  P  P P  P  Q   +  +   +      G+ P  
Sbjct: 2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP-LAPTTDPAGAGEPSGAVPQP 2962

Query: 660  RPECVMNSECPSHEASRPPPQEDVPEPVNPCYP 692
                ++           P P      P +   P
Sbjct: 2963 WLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 28.0 bits (63), Expect = 3.1
 Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 305 VNPCV-PSPCGPYAQCRDINGSPSCSCLPNYIG 336
           ++ C   +PC     C +  GS  CSC P Y G
Sbjct: 2   IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34



 Score = 26.8 bits (60), Expect = 8.1
 Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 623 VNPCY-PSPCGPYSQCRDIGGSPSCSCLPNYIG 654
           ++ C   +PC     C +  GS  CSC P Y G
Sbjct: 2   IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34



 Score = 26.8 bits (60), Expect = 8.1
 Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 687 VNPCY-PSPCGPYSQCRDIGGSPSCSCLPNYIG 718
           ++ C   +PC     C +  GS  CSC P Y G
Sbjct: 2   IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34


>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein. 
          Length = 753

 Score = 30.9 bits (70), Expect = 6.6
 Identities = 10/55 (18%), Positives = 13/55 (23%)

Query: 1165 NRIPPPPPPQEPICTCKPGYTGDALSYCNRIPPPPPPQDDVPEPVNPCYPSPCGL 1219
              +  P  P+        G             P P     +  P  P  P P G 
Sbjct: 690  AALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGA 744


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.140    0.504 

Gapped
Lambda     K      H
   0.267   0.0656    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 68,333,304
Number of extensions: 6243832
Number of successful extensions: 6236
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5899
Number of HSP's successfully gapped: 363
Length of query: 1434
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1325
Effective length of database: 6,103,016
Effective search space: 8086496200
Effective search space used: 8086496200
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.0 bits)