RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy18237
         (1050 letters)



>gnl|CDD|201391 pfam00683, TB, TB domain.  This domain is also known as the 8
           cysteine domain. This family includes the hybrid
           domains. This cysteine rich repeat is found in TGF
           binding protein and fibrillin.
          Length = 42

 Score = 51.2 bits (123), Expect = 1e-08
 Identities = 19/41 (46%), Positives = 27/41 (65%)

Query: 658 TCTKEIAGSTTRSTCCCSIGKAWGPQCEECPAVGSDEHKTL 698
            C+  + G+ T+S CCCS+G+AWG  CE CP  G+ E + L
Sbjct: 2   RCSNPLPGNVTKSECCCSLGRAWGTPCEPCPVQGTAEFRQL 42



 Score = 44.2 bits (105), Expect = 4e-06
 Identities = 18/39 (46%), Positives = 27/39 (69%)

Query: 362 GLCSLPMSNEQTRMVCCCSMGQSWGKPCQPCPPPGSRDY 400
           G CS P+    T+  CCCS+G++WG PC+PCP  G+ ++
Sbjct: 1   GRCSNPLPGNVTKSECCCSLGRAWGTPCEPCPVQGTAEF 39



 Score = 38.9 bits (91), Expect = 4e-04
 Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 4/40 (10%)

Query: 462 GRCVLPTGPALLMEVTRMDCCCTMGMAWGPQCQLCPTRGS 501
           GRC  P        VT+ +CCC++G AWG  C+ CP +G+
Sbjct: 1   GRCSNPLPGN----VTKSECCCSLGRAWGTPCEPCPVQGT 36


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 40.7 bits (96), Expect = 8e-05
 Identities = 21/42 (50%), Positives = 25/42 (59%), Gaps = 3/42 (7%)

Query: 766 DLNECALMPSACQGGECINTDGSYRCECPAGYVKDETGKICI 807
           D++ECA       GG C+NT GSYRCECP GY     G+ C 
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGY---TDGRNCE 39



 Score = 39.9 bits (94), Expect = 1e-04
 Identities = 18/37 (48%), Positives = 22/37 (59%)

Query: 903 DIDECARHPNICNNGTCVNAIGSFKCHCYAGFKLSHN 939
           DIDECA      N GTCVN +GS++C C  G+    N
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRN 37



 Score = 39.9 bits (94), Expect = 1e-04
 Identities = 17/33 (51%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 973  DINECESPQACLYG-NCTNTLGSFSCTCPPNYQ 1004
            DI+EC S   C  G  C NT+GS+ C CPP Y 
Sbjct: 1    DIDECASGNPCQNGGTCVNTVGSYRCECPPGYT 33



 Score = 39.2 bits (92), Expect = 2e-04
 Identities = 15/32 (46%), Positives = 20/32 (62%)

Query: 148 EIDECNLMPNMCNHGTCMNTPGSFHCQCNRGF 179
           +IDEC       N GTC+NT GS+ C+C  G+
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGY 32



 Score = 37.2 bits (87), Expect = 0.001
 Identities = 16/33 (48%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 1017 DINECEEHDNICENG-HCTNTFGSFMCSCQDGF 1048
            DI+EC    N C+NG  C NT GS+ C C  G+
Sbjct: 1    DIDECAS-GNPCQNGGTCVNTVGSYRCECPPGY 32



 Score = 35.7 bits (83), Expect = 0.004
 Identities = 17/37 (45%), Positives = 19/37 (51%), Gaps = 2/37 (5%)

Query: 808 DDNECLSIPNICGNG-TCTNLNGGFECTCSEGYAPGP 843
           D +EC S  N C NG TC N  G + C C  GY  G 
Sbjct: 1   DIDECAS-GNPCQNGGTCVNTVGSYRCECPPGYTDGR 36



 Score = 35.3 bits (82), Expect = 0.005
 Identities = 22/43 (51%), Positives = 24/43 (55%), Gaps = 5/43 (11%)

Query: 307 DIDECATSIQRCGEG-FCVNDVGTYHCVCPDGYMLLPSGKECI 348
           DIDECA S   C  G  CVN VG+Y C CP GY     G+ C 
Sbjct: 1   DIDECA-SGNPCQNGGTCVNTVGSYRCECPPGYT---DGRNCE 39



 Score = 34.5 bits (80), Expect = 0.011
 Identities = 17/34 (50%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 861 DIDECYERPGICANG-DCANFQGSFQCTCANGYT 893
           DIDEC      C NG  C N  GS++C C  GYT
Sbjct: 1   DIDEC-ASGNPCQNGGTCVNTVGSYRCECPPGYT 33



 Score = 33.8 bits (78), Expect = 0.020
 Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 2/34 (5%)

Query: 576 DLDECRMMSYLCRNG-RCRNNIGSFFCECLQGYT 608
           D+DEC   +  C+NG  C N +GS+ CEC  GYT
Sbjct: 1   DIDECASGN-PCQNGGTCVNTVGSYRCECPPGYT 33



 Score = 33.0 bits (76), Expect = 0.033
 Identities = 13/32 (40%), Positives = 17/32 (53%)

Query: 234 INECEEMPEICGSGTCENNIGSFSCRCEDGYS 265
           I+EC         GTC N +GS+ C C  GY+
Sbjct: 2   IDECASGNPCQNGGTCVNTVGSYRCECPPGYT 33



 Score = 31.8 bits (73), Expect = 0.084
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 714 GICPSPGKCQNVMGSFICTCPPGYR 738
             C + G C N +GS+ C CPPGY 
Sbjct: 9   NPCQNGGTCVNTVGSYRCECPPGYT 33


>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain. 
          Length = 42

 Score = 40.0 bits (94), Expect = 1e-04
 Identities = 18/42 (42%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 766 DLNECALMPSAC-QGGECINTDGSYRCECPAGYVKDETGKIC 806
           D++ECA     C     C+NT GS+ C CP GY  +E G  C
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42



 Score = 37.3 bits (87), Expect = 0.001
 Identities = 14/40 (35%), Positives = 18/40 (45%), Gaps = 1/40 (2%)

Query: 234 INECEEMPEIC-GSGTCENNIGSFSCRCEDGYSVKPAEGP 272
           ++EC +    C  +  C N IGSF C C DGY        
Sbjct: 2   VDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTN 41



 Score = 36.6 bits (85), Expect = 0.002
 Identities = 18/42 (42%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 861 DIDECYERPGIC-ANGDCANFQGSFQCTCANGYTLNTARDSC 901
           D+DEC +    C AN  C N  GSF+C C +GY  N    +C
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42



 Score = 36.2 bits (84), Expect = 0.003
 Identities = 19/42 (45%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 307 DIDECATSIQRCGEGF-CVNDVGTYHCVCPDGYMLLPSGKEC 347
           D+DECA     C     CVN +G++ CVCPDGY     G  C
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42



 Score = 35.8 bits (83), Expect = 0.003
 Identities = 15/40 (37%), Positives = 19/40 (47%), Gaps = 1/40 (2%)

Query: 808 DDNECLSIPNIC-GNGTCTNLNGGFECTCSEGYAPGPLGS 846
           D +EC    + C  N  C N  G FEC C +GY     G+
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGT 40



 Score = 35.8 bits (83), Expect = 0.004
 Identities = 16/42 (38%), Positives = 20/42 (47%), Gaps = 2/42 (4%)

Query: 973  DINECESPQACLYGN--CTNTLGSFSCTCPPNYQLTPAGNAC 1012
            D++EC         N  C NT+GSF C CP  Y+    G  C
Sbjct: 1    DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42



 Score = 35.0 bits (81), Expect = 0.009
 Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 1/34 (2%)

Query: 903 DIDECARHPNIC-NNGTCVNAIGSFKCHCYAGFK 935
           D+DECA   + C  N  CVN IGSF+C C  G++
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDGYE 34



 Score = 34.2 bits (79), Expect = 0.013
 Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 1/34 (2%)

Query: 1017 DINECEEHDNIC-ENGHCTNTFGSFMCSCQDGFK 1049
            D++EC +  + C  N  C NT GSF C C DG++
Sbjct: 1    DVDECADGTHNCPANTVCVNTIGSFECVCPDGYE 34



 Score = 33.5 bits (77), Expect = 0.030
 Identities = 16/41 (39%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 149 IDECNLMPNMCNHGT-CMNTPGSFHCQCNRGFLYDSDTHQC 188
           +DEC    + C   T C+NT GSF C C  G+  + D   C
Sbjct: 2   VDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42



 Score = 32.7 bits (75), Expect = 0.054
 Identities = 14/32 (43%), Positives = 18/32 (56%)

Query: 715 ICPSPGKCQNVMGSFICTCPPGYRLSPDKNSC 746
            CP+   C N +GSF C CP GY  + D  +C
Sbjct: 11  NCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42



 Score = 31.6 bits (72), Expect = 0.12
 Identities = 18/42 (42%), Positives = 22/42 (52%), Gaps = 1/42 (2%)

Query: 576 DLDECRMMSYLC-RNGRCRNNIGSFFCECLQGYTLASEGQYC 616
           D+DEC   ++ C  N  C N IGSF C C  GY    +G  C
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42



 Score = 27.7 bits (62), Expect = 3.3
 Identities = 10/27 (37%), Positives = 14/27 (51%), Gaps = 1/27 (3%)

Query: 207 INECEEMPEIC-GSGTCENNIGSFSCR 232
           ++EC +    C  +  C N IGSF C 
Sbjct: 2   VDECADGTHNCPANTVCVNTIGSFECV 28


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 39.9 bits (94), Expect = 1e-04
 Identities = 17/36 (47%), Positives = 22/36 (61%)

Query: 903 DIDECARHPNICNNGTCVNAIGSFKCHCYAGFKLSH 938
           DIDECA      N GTCVN +GS++C C  G+   +
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRN 36



 Score = 39.5 bits (93), Expect = 2e-04
 Identities = 21/42 (50%), Positives = 25/42 (59%), Gaps = 4/42 (9%)

Query: 766 DLNECALMPSACQGGECINTDGSYRCECPAGYVKDETGKICI 807
           D++ECA       GG C+NT GSYRC CP GY    TG+ C 
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGY----TGRNCE 38



 Score = 38.8 bits (91), Expect = 3e-04
 Identities = 17/35 (48%), Positives = 21/35 (60%), Gaps = 1/35 (2%)

Query: 973  DINECESPQACLY-GNCTNTLGSFSCTCPPNYQLT 1006
            DI+EC S   C   G C NT+GS+ C+CPP Y   
Sbjct: 1    DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35



 Score = 37.6 bits (88), Expect = 7e-04
 Identities = 17/32 (53%), Positives = 20/32 (62%), Gaps = 2/32 (6%)

Query: 149 IDECNLMPNMC-NHGTCMNTPGSFHCQCNRGF 179
           IDEC    N C N GTC+NT GS+ C C  G+
Sbjct: 2   IDEC-ASGNPCQNGGTCVNTVGSYRCSCPPGY 32



 Score = 37.2 bits (87), Expect = 0.001
 Identities = 14/34 (41%), Positives = 18/34 (52%)

Query: 1017 DINECEEHDNICENGHCTNTFGSFMCSCQDGFKL 1050
            DI+EC   +     G C NT GS+ CSC  G+  
Sbjct: 1    DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34



 Score = 35.3 bits (82), Expect = 0.005
 Identities = 20/35 (57%), Positives = 21/35 (60%), Gaps = 2/35 (5%)

Query: 307 DIDECATSIQRCGE-GFCVNDVGTYHCVCPDGYML 340
           DIDECA S   C   G CVN VG+Y C CP GY  
Sbjct: 1   DIDECA-SGNPCQNGGTCVNTVGSYRCSCPPGYTG 34



 Score = 34.1 bits (79), Expect = 0.012
 Identities = 14/36 (38%), Positives = 19/36 (52%)

Query: 576 DLDECRMMSYLCRNGRCRNNIGSFFCECLQGYTLAS 611
           D+DEC   +     G C N +GS+ C C  GYT  +
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRN 36



 Score = 34.1 bits (79), Expect = 0.014
 Identities = 16/33 (48%), Positives = 19/33 (57%), Gaps = 2/33 (6%)

Query: 808 DDNECLSIPNICGN-GTCTNLNGGFECTCSEGY 839
           D +EC S  N C N GTC N  G + C+C  GY
Sbjct: 1   DIDECAS-GNPCQNGGTCVNTVGSYRCSCPPGY 32



 Score = 33.4 bits (77), Expect = 0.028
 Identities = 15/36 (41%), Positives = 18/36 (50%)

Query: 861 DIDECYERPGICANGDCANFQGSFQCTCANGYTLNT 896
           DIDEC         G C N  GS++C+C  GYT   
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRN 36



 Score = 32.2 bits (74), Expect = 0.062
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 711 TREGICPSPGKCQNVMGSFICTCPPGYRL 739
                C + G C N +GS+ C+CPPGY  
Sbjct: 6   ASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34



 Score = 32.2 bits (74), Expect = 0.075
 Identities = 13/32 (40%), Positives = 17/32 (53%)

Query: 234 INECEEMPEICGSGTCENNIGSFSCRCEDGYS 265
           I+EC         GTC N +GS+ C C  GY+
Sbjct: 2   IDECASGNPCQNGGTCVNTVGSYRCSCPPGYT 33


>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like.  cEGF, or complement
           Clr-like EGF, domains have six conserved cysteine
           residues disulfide-bonded into the characteristic
           pattern 'ababcc'. They are found in blood coagulation
           proteins such as fibrillin, Clr and Cls, thrombomodulin,
           and the LDL receptor. The core fold of the EGF domain
           consists of two small beta-hairpins packed against each
           other. Two major structural variants have been
           identified based on the structural context of the
           C-terminal cysteine residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
           C-terminal thiol resides on the C-terminal beta-sheet,
           resulting in long loop-lengths between the cysteine
           residues of disulfide 'c', typically C[10+]XC. These
           longer loop-lengths may have arisen by selective
           cysteine loss from a four-disulfide EGF template such as
           laminin or integrin. Tandem cEGF domains have five
           linking residues between terminal cysteines of adjacent
           domains. cEGF domains may or may not bind calcium in the
           linker region. cEGF domains with the consensus motif
           CXN4X[F,Y]XCXC are hydroxylated exclusively on the
           asparagine residue.
          Length = 24

 Score = 37.0 bits (87), Expect = 9e-04
 Identities = 9/24 (37%), Positives = 12/24 (50%)

Query: 170 SFHCQCNRGFLYDSDTHQCIDINE 193
           S+ C C  G+    D   C DI+E
Sbjct: 1   SYTCSCPPGYQLSGDGRTCEDIDE 24



 Score = 35.1 bits (82), Expect = 0.005
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 883 SFQCTCANGYTLNTARDSCVDIDE 906
           S+ C+C  GY L+    +C DIDE
Sbjct: 1   SYTCSCPPGYQLSGDGRTCEDIDE 24



 Score = 34.7 bits (81), Expect = 0.005
 Identities = 11/24 (45%), Positives = 13/24 (54%)

Query: 788 SYRCECPAGYVKDETGKICIDDNE 811
           SY C CP GY     G+ C D +E
Sbjct: 1   SYTCSCPPGYQLSGDGRTCEDIDE 24



 Score = 34.4 bits (80), Expect = 0.008
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 598 SFFCECLQGYTLASEGQYCRDVDE 621
           S+ C C  GY L+ +G+ C D+DE
Sbjct: 1   SYTCSCPPGYQLSGDGRTCEDIDE 24



 Score = 33.6 bits (78), Expect = 0.017
 Identities = 11/20 (55%), Positives = 16/20 (80%)

Query: 728 SFICTCPPGYRLSPDKNSCQ 747
           S+ C+CPPGY+LS D  +C+
Sbjct: 1   SYTCSCPPGYQLSGDGRTCE 20



 Score = 32.4 bits (75), Expect = 0.047
 Identities = 10/20 (50%), Positives = 11/20 (55%)

Query: 330 YHCVCPDGYMLLPSGKECID 349
           Y C CP GY L   G+ C D
Sbjct: 2   YTCSCPPGYQLSGDGRTCED 21



 Score = 30.9 bits (71), Expect = 0.15
 Identities = 10/27 (37%), Positives = 15/27 (55%), Gaps = 3/27 (11%)

Query: 925 SFKCHCYAGFKLSHNNDCIGNCTDINE 951
           S+ C C  G++LS +      C DI+E
Sbjct: 1   SYTCSCPPGYQLSGDG---RTCEDIDE 24



 Score = 30.9 bits (71), Expect = 0.17
 Identities = 14/27 (51%), Positives = 19/27 (70%), Gaps = 3/27 (11%)

Query: 994  SFSCTCPPNYQLTPAGNACVVLEDINE 1020
            S++C+CPP YQL+  G  C   EDI+E
Sbjct: 1    SYTCSCPPGYQLSGDGRTC---EDIDE 24



 Score = 29.3 bits (67), Expect = 0.55
 Identities = 12/21 (57%), Positives = 15/21 (71%), Gaps = 2/21 (9%)

Query: 99  CVCDEGFR--GDGYSCEDIDE 117
           C C  G++  GDG +CEDIDE
Sbjct: 4   CSCPPGYQLSGDGRTCEDIDE 24


>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
           extracellular matrix molecules mediate cell-matrix and
           matrix-matrix interactions thereby providing tissue
           integrity. Some members of the matrilin family are
           expressed specifically in developing cartilage
           rudiments. The matrilin family consists of at least four
           members. All the members of the matrilin family contain
           VWA domains, EGF-like domains and a heptad repeat
           coiled-coiled domain at the carboxy terminus which is
           responsible for the oligomerization of the matrilins.
           The VWA domains have been shown to be essential for
           matrilin network formation by interacting with matrix
           ligands.
          Length = 224

 Score = 38.9 bits (91), Expect = 0.007
 Identities = 17/41 (41%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 305 CVDIDECATSIQRCGEGFCVNDVGTYHCVCPDGYMLLPSGK 345
           CV  D CAT    C +  C++  G+Y C C +GY LL   K
Sbjct: 184 CVVPDLCATLSHVC-QQVCISTPGSYLCACTEGYALLEDNK 223



 Score = 38.1 bits (89), Expect = 0.014
 Identities = 12/39 (30%), Positives = 19/39 (48%), Gaps = 1/39 (2%)

Query: 147 EEIDECNLMPNMCNHGTCMNTPGSFHCQCNRGFLYDSDT 185
              D C  + ++C    C++TPGS+ C C  G+    D 
Sbjct: 185 VVPDLCATLSHVCQQV-CISTPGSYLCACTEGYALLEDN 222



 Score = 35.8 bits (83), Expect = 0.073
 Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 1/40 (2%)

Query: 901 CVDIDECARHPNICNNGTCVNAIGSFKCHCYAGFKLSHNN 940
           CV  D CA   ++C    C++  GS+ C C  G+ L  +N
Sbjct: 184 CVVPDLCATLSHVCQQV-CISTPGSYLCACTEGYALLEDN 222



 Score = 34.3 bits (79), Expect = 0.25
 Identities = 14/38 (36%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 803 GKICIDDNECLSIPNICGNGTCTNLNGGFECTCSEGYA 840
           GKIC+  + C ++ ++C    C +  G + C C+EGYA
Sbjct: 181 GKICVVPDLCATLSHVCQQV-CISTPGSYLCACTEGYA 217



 Score = 33.5 bits (77), Expect = 0.45
 Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 4/42 (9%)

Query: 1009 GNACVVLEDINECEEHDNICENGHCTNTFGSFMCSCQDGFKL 1050
            G  CVV  D+     H   C+   C +T GS++C+C +G+ L
Sbjct: 181  GKICVV-PDLCATLSHV--CQQV-CISTPGSYLCACTEGYAL 218



 Score = 30.4 bits (69), Expect = 4.0
 Identities = 13/40 (32%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 574 CIDLDECRMMSYLCRNGRCRNNIGSFFCECLQGYTLASEG 613
           C+  D C  +S++C+   C +  GS+ C C +GY L  + 
Sbjct: 184 CVVPDLCATLSHVCQQV-CISTPGSYLCACTEGYALLEDN 222



 Score = 29.3 bits (66), Expect = 8.9
 Identities = 16/36 (44%), Positives = 18/36 (50%), Gaps = 1/36 (2%)

Query: 769 ECALMPSACQGGECINTDGSYRCECPAGYVKDETGK 804
            CA +   CQ   CI+T GSY C C  GY   E  K
Sbjct: 189 LCATLSHVCQQV-CISTPGSYLCACTEGYALLEDNK 223


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 32.9 bits (75), Expect = 0.032
 Identities = 15/26 (57%), Positives = 16/26 (61%)

Query: 777 CQGGECINTDGSYRCECPAGYVKDET 802
           C  G CINT GSY C CP GY  D+ 
Sbjct: 8   CSNGTCINTPGSYTCSCPPGYTGDKR 33



 Score = 32.9 bits (75), Expect = 0.034
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 976  ECESPQACLYGNCTNTLGSFSCTCPPNYQLTP 1007
            EC S   C  G C NT GS++C+CPP Y    
Sbjct: 1    ECASGGPCSNGTCINTPGSYTCSCPPGYTGDK 32



 Score = 32.5 bits (74), Expect = 0.053
 Identities = 14/31 (45%), Positives = 16/31 (51%), Gaps = 1/31 (3%)

Query: 1020 ECEEHDNICENGHCTNTFGSFMCSCQDGFKL 1050
            EC      C NG C NT GS+ CSC  G+  
Sbjct: 1    ECASG-GPCSNGTCINTPGSYTCSCPPGYTG 30



 Score = 32.1 bits (73), Expect = 0.079
 Identities = 11/26 (42%), Positives = 14/26 (53%)

Query: 814 SIPNICGNGTCTNLNGGFECTCSEGY 839
           +    C NGTC N  G + C+C  GY
Sbjct: 3   ASGGPCSNGTCINTPGSYTCSCPPGY 28



 Score = 31.7 bits (72), Expect = 0.11
 Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 1/34 (2%)

Query: 906 ECARHPNICNNGTCVNAIGSFKCHCYAGFKLSHN 939
           ECA     C+NGTC+N  GS+ C C  G+     
Sbjct: 1   ECASG-GPCSNGTCINTPGSYTCSCPPGYTGDKR 33



 Score = 31.3 bits (71), Expect = 0.12
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 156 PNMCNHGTCMNTPGSFHCQCNRGF 179
              C++GTC+NTPGS+ C C  G+
Sbjct: 5   GGPCSNGTCINTPGSYTCSCPPGY 28



 Score = 30.2 bits (68), Expect = 0.31
 Identities = 14/34 (41%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 310 ECATSIQRCGEGFCVNDVGTYHCVCPDGYMLLPS 343
           ECA+    C  G C+N  G+Y C CP GY     
Sbjct: 1   ECASG-GPCSNGTCINTPGSYTCSCPPGYTGDKR 33



 Score = 30.2 bits (68), Expect = 0.37
 Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 587 CRNGRCRNNIGSFFCECLQGYTL 609
           C NG C N  GS+ C C  GYT 
Sbjct: 8   CSNGTCINTPGSYTCSCPPGYTG 30



 Score = 29.8 bits (67), Expect = 0.48
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 867 ERPGICANGDCANFQGSFQCTCANGYTLNT 896
              G C+NG C N  GS+ C+C  GYT + 
Sbjct: 3   ASGGPCSNGTCINTPGSYTCSCPPGYTGDK 32



 Score = 28.6 bits (64), Expect = 1.2
 Identities = 12/25 (48%), Positives = 14/25 (56%)

Query: 718 SPGKCQNVMGSFICTCPPGYRLSPD 742
           S G C N  GS+ C+CPPGY     
Sbjct: 9   SNGTCINTPGSYTCSCPPGYTGDKR 33



 Score = 27.9 bits (62), Expect = 2.2
 Identities = 13/30 (43%), Positives = 17/30 (56%), Gaps = 1/30 (3%)

Query: 236 ECEEMPEICGSGTCENNIGSFSCRCEDGYS 265
           EC      C +GTC N  GS++C C  GY+
Sbjct: 1   ECAS-GGPCSNGTCINTPGSYTCSCPPGYT 29


>gnl|CDD|234522 TIGR04249, SCM_chp_ScmC, SynChlorMet cassette protein ScmC.  A
           biosynthesis cassette found in Syntrophobacter
           fumaroxidans MPOB, Chlorobium limicola DSM 245,
           Methanocella paludicola SANAE, and delta proteobacterium
           NaphS2 contains two PqqE-like radical SAM/SPASM domain
           proteins, a PqqD homolog, and a conserved hypothetical
           protein. These components suggest modification of a
           ribosomally produced peptide precursor, but the
           precursor has not been identified. Members of this
           family are designated ScmC.
          Length = 292

 Score = 36.4 bits (84), Expect = 0.065
 Identities = 20/67 (29%), Positives = 24/67 (35%), Gaps = 9/67 (13%)

Query: 665 GSTTRSTCCCSIGKAWGPQCEECPAVGSDEHKTLCPGGSGYR--PNSATREGICPSPGKC 722
           G T +STCC  IG  W   C+       DE   +     GYR  P       +  S  K 
Sbjct: 146 GGTGKSTCCRRIGDPWRCHCD-------DESLVVPDTDKGYRAHPFPTWSNFLFGSEEKT 198

Query: 723 QNVMGSF 729
              M S 
Sbjct: 199 WQSMRSV 205


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 33.8 bits (78), Expect = 0.073
 Identities = 10/17 (58%), Positives = 13/17 (76%)

Query: 637 EEEEEEEEEEEGGYGGG 653
           ++EEEEEEEE+   G G
Sbjct: 70  KKEEEEEEEEDDDMGFG 86


>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
           Pvs28.  This family consists of several ookinete surface
           protein (Pvs28) from several species of Plasmodium.
           Pvs25 and Pvs28 are expressed on the surface of
           ookinetes. These proteins are potential candidates for
           vaccine and induce antibodies that block the infectivity
           of Plasmodium vivax in immunised animals.
          Length = 196

 Score = 35.1 bits (81), Expect = 0.092
 Identities = 35/138 (25%), Positives = 49/138 (35%), Gaps = 27/138 (19%)

Query: 818 ICGNGTCTNLNGGFECTCSEGYAPGPLGSCAILLTLPPISPSTDIDECYERPG---ICAN 874
           IC NG    ++  FEC C+EGY      +C              ++   +  G    C N
Sbjct: 7   ICKNGYLIQMSNHFECKCNEGYVLKNENTCE------EKVKCDKLENVNKVCGEYATCIN 60

Query: 875 GDCANFQGSFQCTCANGYTLNTARDSCVDIDECARHPNICNNGTCVNA--------IGSF 926
                 + + +C C NGYTL+  +  CV        PN CNN  C +           + 
Sbjct: 61  QANKAEEKALKCGCINGYTLS--QGVCV--------PNKCNNKVCGSGKCIVDPANPNNT 110

Query: 927 KCHCYAGFKLSHNNDCIG 944
            C C  G     N  C  
Sbjct: 111 TCSCNIGKVPDQNGKCTK 128


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) presents in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at  least  one 
           is  present  in  most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 31.3 bits (71), Expect = 0.15
 Identities = 14/34 (41%), Positives = 18/34 (52%)

Query: 906 ECARHPNICNNGTCVNAIGSFKCHCYAGFKLSHN 939
           ECA      N GTCVN  GS++C C  G+    +
Sbjct: 1   ECAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRS 34



 Score = 30.5 bits (69), Expect = 0.24
 Identities = 19/35 (54%), Positives = 22/35 (62%), Gaps = 2/35 (5%)

Query: 769 ECALMPSACQ-GGECINTDGSYRCECPAGYVKDET 802
           ECA   + C  GG C+NT GSYRC CP GY  D +
Sbjct: 1   ECAA-SNPCSNGGTCVNTPGSYRCVCPPGYTGDRS 34



 Score = 30.1 bits (68), Expect = 0.34
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 714 GICPSPGKCQNVMGSFICTCPPGYRLS 740
             C + G C N  GS+ C CPPGY   
Sbjct: 6   NPCSNGGTCVNTPGSYRCVCPPGYTGD 32



 Score = 29.8 bits (67), Expect = 0.58
 Identities = 14/31 (45%), Positives = 16/31 (51%), Gaps = 1/31 (3%)

Query: 976  ECESPQACL-YGNCTNTLGSFSCTCPPNYQL 1005
            EC +   C   G C NT GS+ C CPP Y  
Sbjct: 1    ECAASNPCSNGGTCVNTPGSYRCVCPPGYTG 31



 Score = 29.4 bits (66), Expect = 0.59
 Identities = 12/20 (60%), Positives = 15/20 (75%)

Query: 160 NHGTCMNTPGSFHCQCNRGF 179
           N GTC+NTPGS+ C C  G+
Sbjct: 10  NGGTCVNTPGSYRCVCPPGY 29



 Score = 27.8 bits (62), Expect = 2.1
 Identities = 12/27 (44%), Positives = 14/27 (51%), Gaps = 1/27 (3%)

Query: 814 SIPNICGN-GTCTNLNGGFECTCSEGY 839
           +  N C N GTC N  G + C C  GY
Sbjct: 3   AASNPCSNGGTCVNTPGSYRCVCPPGY 29



 Score = 27.8 bits (62), Expect = 2.5
 Identities = 11/31 (35%), Positives = 14/31 (45%)

Query: 1020 ECEEHDNICENGHCTNTFGSFMCSCQDGFKL 1050
            EC   +     G C NT GS+ C C  G+  
Sbjct: 1    ECAASNPCSNGGTCVNTPGSYRCVCPPGYTG 31



 Score = 27.8 bits (62), Expect = 2.6
 Identities = 17/32 (53%), Positives = 18/32 (56%), Gaps = 2/32 (6%)

Query: 310 ECATSIQRCGE-GFCVNDVGTYHCVCPDGYML 340
           ECA S   C   G CVN  G+Y CVCP GY  
Sbjct: 1   ECAAS-NPCSNGGTCVNTPGSYRCVCPPGYTG 31



 Score = 27.1 bits (60), Expect = 4.5
 Identities = 10/21 (47%), Positives = 11/21 (52%)

Query: 589 NGRCRNNIGSFFCECLQGYTL 609
            G C N  GS+ C C  GYT 
Sbjct: 11  GGTCVNTPGSYRCVCPPGYTG 31


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 33.0 bits (76), Expect = 0.17
 Identities = 11/18 (61%), Positives = 13/18 (72%)

Query: 637 EEEEEEEEEEEGGYGGGS 654
           EEEE+EE EEE   G G+
Sbjct: 86  EEEEKEESEEEAAAGLGA 103


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 31.0 bits (71), Expect = 0.18
 Identities = 11/22 (50%), Positives = 13/22 (59%)

Query: 161 HGTCMNTPGSFHCQCNRGFLYD 182
           + TC NT GSF C C  G+  D
Sbjct: 11  NATCTNTGGSFTCTCKSGYTGD 32



 Score = 29.0 bits (66), Expect = 0.92
 Identities = 11/18 (61%), Positives = 12/18 (66%)

Query: 987  NCTNTLGSFSCTCPPNYQ 1004
             CTNT GSF+CTC   Y 
Sbjct: 13   TCTNTGGSFTCTCKSGYT 30



 Score = 28.3 bits (64), Expect = 1.5
 Identities = 14/30 (46%), Positives = 19/30 (63%), Gaps = 1/30 (3%)

Query: 1021 CEEHDNIC-ENGHCTNTFGSFMCSCQDGFK 1049
            C E++  C  N  CTNT GSF C+C+ G+ 
Sbjct: 1    CAENNGGCHPNATCTNTGGSFTCTCKSGYT 30



 Score = 27.5 bits (62), Expect = 2.7
 Identities = 12/19 (63%), Positives = 12/19 (63%)

Query: 821 NGTCTNLNGGFECTCSEGY 839
           N TCTN  G F CTC  GY
Sbjct: 11  NATCTNTGGSFTCTCKSGY 29



 Score = 27.5 bits (62), Expect = 3.4
 Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 248 TCENNIGSFSCRCEDGYS 265
           TC N  GSF+C C+ GY+
Sbjct: 13  TCTNTGGSFTCTCKSGYT 30



 Score = 26.7 bits (60), Expect = 5.5
 Identities = 12/26 (46%), Positives = 14/26 (53%), Gaps = 2/26 (7%)

Query: 781 ECINTDGSYRCECPAGYVKDETGKIC 806
            C NT GS+ C C +GY  D  G  C
Sbjct: 13  TCTNTGGSFTCTCKSGYTGD--GVTC 36



 Score = 26.3 bits (59), Expect = 7.3
 Identities = 10/17 (58%), Positives = 10/17 (58%)

Query: 721 KCQNVMGSFICTCPPGY 737
            C N  GSF CTC  GY
Sbjct: 13  TCTNTGGSFTCTCKSGY 29


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 32.3 bits (74), Expect = 0.25
 Identities = 12/18 (66%), Positives = 13/18 (72%)

Query: 637 EEEEEEEEEEEGGYGGGS 654
           EEEEEEE EEE   G G+
Sbjct: 85  EEEEEEESEEEAMAGLGA 102


>gnl|CDD|224793 COG1881, COG1881, Phospholipid-binding protein [General function
           prediction only].
          Length = 174

 Score = 32.7 bits (75), Expect = 0.58
 Identities = 13/49 (26%), Positives = 18/49 (36%), Gaps = 5/49 (10%)

Query: 384 SWGKP--CQPCPPPGS--RDYILLCGKDFAKLC-PEGVGRGDKGEDLNE 427
            +G      PCPP G   R Y  +   D   L  P G    + G+ +  
Sbjct: 111 DFGSRGYGGPCPPKGHGHRYYFTVYALDVELLLLPAGASGAELGKAMEG 159


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 32.9 bits (76), Expect = 0.73
 Identities = 11/18 (61%), Positives = 12/18 (66%)

Query: 637 EEEEEEEEEEEGGYGGGS 654
           EEEEEE  EEE   G G+
Sbjct: 310 EEEEEEPSEEEAAAGLGA 327


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
           between noise and signal. pfam00053 is very similar, but
           has 8 instead of 6 conserved cysteines. Includes some
           cytokine receptors. The EGF domain misses the N-terminus
           regions of the Ca2+ binding EGF domains (this is the
           main reason of discrepancy between swiss-prot domain
           start/end and Pfam). The family is hard to model due to
           many similar but different sub-types of EGF domains.
           Pfam certainly misses a number of EGF domains.
          Length = 32

 Score = 28.9 bits (65), Expect = 0.81
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 152 CNLMPNMCNHGTCMNTPGSFHCQCNRGF 179
           C+      N GTC++TPG + C+C  G+
Sbjct: 1   CSPNNPCSNGGTCVDTPGGYTCECPEGY 28



 Score = 27.0 bits (60), Expect = 3.7
 Identities = 12/19 (63%), Positives = 14/19 (73%)

Query: 779 GGECINTDGSYRCECPAGY 797
           GG C++T G Y CECP GY
Sbjct: 10  GGTCVDTPGGYTCECPEGY 28



 Score = 27.0 bits (60), Expect = 3.8
 Identities = 10/32 (31%), Positives = 15/32 (46%)

Query: 907 CARHPNICNNGTCVNAIGSFKCHCYAGFKLSH 938
           C+ +    N GTCV+  G + C C  G+    
Sbjct: 1   CSPNNPCSNGGTCVDTPGGYTCECPEGYTGKR 32


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 32.0 bits (73), Expect = 1.7
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 620 DECKEVNKRESRCYLDTEEEEEEEEEEE 647
           +E +E  + E +     EE +EEEE+EE
Sbjct: 42  EEKEEKKEEEEKTTDKEEEVDEEEEKEE 69


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 31.5 bits (72), Expect = 2.1
 Identities = 9/15 (60%), Positives = 12/15 (80%)

Query: 637 EEEEEEEEEEEGGYG 651
             EEEEEEE++ G+G
Sbjct: 293 PAEEEEEEEDDMGFG 307


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 31.6 bits (72), Expect = 2.5
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query: 637 EEEEEEEEEEEGGYGGGSRRV 657
           EE+EEEEEEE+   G      
Sbjct: 393 EEDEEEEEEEDEDEGPSKEHS 413


>gnl|CDD|164576 CHL00201, syh, histidine-tRNA synthetase; Provisional.
          Length = 430

 Score = 31.4 bits (71), Expect = 2.8
 Identities = 14/49 (28%), Positives = 20/49 (40%)

Query: 158 MCNHGTCMNTPGSFHCQCNRGFLYDSDTHQCIDINECEEMPEICGSGTY 206
           +C +   +N P   + +  RG  Y +DT   I          ICG G Y
Sbjct: 239 VCTYLNLLNIPYKINYKLVRGLDYYNDTAFEIKTLSSNGQDTICGGGRY 287


>gnl|CDD|237071 PRK12353, PRK12353, putative amino acid kinase; Reviewed.
          Length = 314

 Score = 30.9 bits (71), Expect = 3.7
 Identities = 12/26 (46%), Positives = 14/26 (53%), Gaps = 6/26 (23%)

Query: 638 EEEEEEEEEEGGY------GGGSRRV 657
           EEE E+  +E GY      G G RRV
Sbjct: 138 EEEAEKLAKEKGYTFKEDAGRGYRRV 163


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 31.1 bits (70), Expect = 4.6
 Identities = 12/13 (92%), Positives = 13/13 (100%)

Query: 635 DTEEEEEEEEEEE 647
           D+EEEEEEEEEEE
Sbjct: 862 DSEEEEEEEEEEE 874


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 31.0 bits (70), Expect = 5.2
 Identities = 17/43 (39%), Positives = 24/43 (55%), Gaps = 1/43 (2%)

Query: 609 LASEG-QYCRDVDECKEVNKRESRCYLDTEEEEEEEEEEEGGY 650
           L  EG  + +++ E KE    E    + TEE +EE EEEE G+
Sbjct: 268 LGIEGWDWLKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGF 310


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.137    0.462 

Gapped
Lambda     K      H
   0.267   0.0720    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 50,023,367
Number of extensions: 4603760
Number of successful extensions: 4471
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4401
Number of HSP's successfully gapped: 163
Length of query: 1050
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 943
Effective length of database: 6,191,724
Effective search space: 5838795732
Effective search space used: 5838795732
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (28.4 bits)