RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5025
         (533 letters)



>gnl|CDD|238058 cd00110, LamG, Laminin G domain; Laminin G-like domains are usually
           Ca++ mediated receptors that can have binding sites for
           steroids, beta1 integrins, heparin, sulfatides,
           fibulin-1, and alpha-dystroglycans. Proteins that
           contain LamG domains serve a variety of purposes
           including signal transduction via cell-surface steroid
           receptors, adhesion, migration and differentiation
           through mediation of cell adhesion molecules.
          Length = 151

 Score =  127 bits (321), Expect = 9e-35
 Identities = 57/152 (37%), Positives = 85/152 (55%), Gaps = 4/152 (2%)

Query: 89  FNGSSYLRYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDF 148
           F+GSSY+R   L      L  ++  F+   PNG+LLY G      GDF+AL L D  +  
Sbjct: 4   FSGSSYVRLPTLPAPRTRL-SISFSFRTTSPNGLLLYAGS--QNGGDFLALELEDGRLVL 60

Query: 149 TFDLGTGAATLRSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLS 208
            +DLG+G+  L S  P++ G+WH + + R GR   L VD     +  SPG    L+L   
Sbjct: 61  RYDLGSGSLVLSSKTPLNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGP 120

Query: 209 LYLGGVPDYNIVSPKVKIKSSFIGCIQKVLIN 240
           LYLGG+P+ ++ SP + +   F+GCI+ + +N
Sbjct: 121 LYLGGLPE-DLKSPGLPVSPGFVGCIRDLKVN 151



 Score =  105 bits (265), Expect = 6e-27
 Identities = 43/168 (25%), Positives = 74/168 (44%), Gaps = 33/168 (19%)

Query: 333 RITGYEFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQ 392
                  +I    +T+S +GL+L+ GS    DFLAL + +G   + Y+LG+G ++L + +
Sbjct: 16  PAPRTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGSGSLVLSS-K 74

Query: 393 TKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDY 452
           T +N+G WH ++V RN                                G+   + +D + 
Sbjct: 75  TPLNDGQWHSVSVERN--------------------------------GRSVTLSVDGER 102

Query: 453 SMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLIL 500
            ++S S G    LN D  LY+GG+ +  +         F G I +L +
Sbjct: 103 VVESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCIRDLKV 150


>gnl|CDD|214598 smart00282, LamG, Laminin G domain. 
          Length = 132

 Score =  118 bits (297), Expect = 2e-31
 Identities = 49/135 (36%), Positives = 75/135 (55%), Gaps = 4/135 (2%)

Query: 109 DLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSS-NPISL 167
            ++  F+   PNG+LLY G +  G  D++AL L D  +   +DLG+G A L S   P++ 
Sbjct: 1   SISFSFRTTSPNGLLLYAGSKGGG--DYLALELRDGRLVLRYDLGSGPARLTSDPTPLND 58

Query: 168 GEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIK 227
           G+WH++ + R GR   L VD        SPG  T L+L   LYLGG+P+ ++  P + + 
Sbjct: 59  GQWHRVAVERNGRSVTLSVDGGNRVSGESPGGLTILNLDGPLYLGGLPE-DLKLPPLPVT 117

Query: 228 SSFIGCIQKVLINNH 242
             F GCI+ + +N  
Sbjct: 118 PGFRGCIRNLKVNGK 132



 Score =  106 bits (267), Expect = 2e-27
 Identities = 44/163 (26%), Positives = 72/163 (44%), Gaps = 32/163 (19%)

Query: 340 NIDMRIKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGV 399
           +I    +T+S +GL+L+ GS    D+LAL + +G   + Y+LG+G   L ++ T +N+G 
Sbjct: 1   SISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQ 60

Query: 400 WHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISG 459
           WHR+ V RN                                G+   + +D    +   S 
Sbjct: 61  WHRVAVERN--------------------------------GRSVTLSVDGGNRVSGESP 88

Query: 460 GKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLILSN 502
           G    LN D  LY+GG+ +  +L        F+G I NL ++ 
Sbjct: 89  GGLTILNLDGPLYLGGLPEDLKLPPLPVTPGFRGCIRNLKVNG 131


>gnl|CDD|216930 pfam02210, Laminin_G_2, Laminin G domain.  This family includes the
           Thrombospondin N-terminal-like domain, a Laminin G
           subfamily.
          Length = 124

 Score =  110 bits (276), Expect = 9e-29
 Identities = 44/127 (34%), Positives = 69/127 (54%), Gaps = 4/127 (3%)

Query: 114 FKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGA-ATLRSSNPISLGEWHK 172
           F+  +PNG+LLY G       DF+AL L D  +   +DLG+G    L S   ++ G+WH+
Sbjct: 1   FRTTQPNGLLLYAGGEDGL--DFLALELEDGRLVLRYDLGSGGSVLLLSGKKLNDGQWHR 58

Query: 173 LRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIG 232
           + ++R GR   L VD         PG  + L+L+  LYLGG+P+ +     + +   F+G
Sbjct: 59  VSVSRDGRSLTLSVDGGTVVSEALPGSSSILNLNGPLYLGGLPE-DSGLSLLPVTEGFVG 117

Query: 233 CIQKVLI 239
           CI+ V +
Sbjct: 118 CIRNVRV 124



 Score = 95.2 bits (237), Expect = 2e-23
 Identities = 38/156 (24%), Positives = 66/156 (42%), Gaps = 32/156 (20%)

Query: 345 IKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLT 404
            +T+  +GL+L+ G     DFLAL + +G   + Y+LG+G  +L  +  K+N+G WHR++
Sbjct: 1   FRTTQPNGLLLYAGGEDGLDFLALELEDGRLVLRYDLGSGGSVLLLSGKKLNDGQWHRVS 60

Query: 405 VHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQ 464
           V R+                                G+   + +D    +     G    
Sbjct: 61  VSRD--------------------------------GRSLTLSVDGGTVVSEALPGSSSI 88

Query: 465 LNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLIL 500
           LN +  LY+GG+ +   L+       F G I N+ +
Sbjct: 89  LNLNGPLYLGGLPEDSGLSLLPVTEGFVGCIRNVRV 124


>gnl|CDD|215681 pfam00054, Laminin_G_1, Laminin G domain. 
          Length = 131

 Score =  108 bits (272), Expect = 3e-28
 Identities = 52/133 (39%), Positives = 79/133 (59%), Gaps = 3/133 (2%)

Query: 114 FKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWHKL 173
           F+  EP+G+LLYNG + +   DF+AL L D  ++ ++DLG+GAA +RS + ++ G+WH +
Sbjct: 1   FRTTEPSGLLLYNGTQTER--DFLALELRDGRLEVSYDLGSGAAVVRSGDKLNDGKWHSV 58

Query: 174 RLTRTGRHAYLQVDRFPSSQILSP-GPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIG 232
            L R GR   L VD        SP G  T L +   LY+GG+P   +   ++ I  SF G
Sbjct: 59  ELERNGRSGTLSVDGEARVTGESPLGATTDLDVDGPLYVGGLPSLAVKLRRLAISPSFDG 118

Query: 233 CIQKVLINNHELN 245
           CI+ V++N   L+
Sbjct: 119 CIRDVIVNGKPLD 131



 Score = 81.6 bits (202), Expect = 2e-18
 Identities = 40/160 (25%), Positives = 67/160 (41%), Gaps = 35/160 (21%)

Query: 345 IKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLT 404
            +T+  SGL+L+ G+   +DFLAL + +G   V Y+LG+G  ++ +   K+N+G WH + 
Sbjct: 1   FRTTEPSGLLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGD-KLNDGKWHSVE 59

Query: 405 VHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRID-TDYSMQSISGGKFE 463
           + RN                                G+   + +D           G   
Sbjct: 60  LERN--------------------------------GRSGTLSVDGEARVTGESPLGATT 87

Query: 464 QLNTDSGLYVGGVEDIEELTRSKYH-HSFKGWISNLILSN 502
            L+ D  LYVGG+  +    R      SF G I ++I++ 
Sbjct: 88  DLDVDGPLYVGGLPSLAVKLRRLAISPSFDGCIRDVIVNG 127


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
          large number of membrane-bound and extracellular
          (mostly animal) proteins. Many of these proteins
          require calcium for their biological function and
          calcium-binding sites have been found to be located at
          the N-terminus of particular EGF-like domains;
          calcium-binding may be crucial for numerous
          protein-protein interactions. Six conserved core
          cysteines form three disulfide bridges as in non
          calcium-binding EGF domains, whose structures are very
          similar. EGF_CA can be found in tandem repeat
          arrangements.
          Length = 38

 Score = 46.5 bits (111), Expect = 3e-07
 Identities = 17/39 (43%), Positives = 22/39 (56%), Gaps = 4/39 (10%)

Query: 41 ECLEDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGDLCQ 79
          EC        PC++GG C+ T GS  C CP G+TG  C+
Sbjct: 4  ECASG----NPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38



 Score = 27.2 bits (61), Expect = 1.9
 Identities = 8/19 (42%), Positives = 10/19 (52%), Gaps = 1/19 (5%)

Query: 267 PCG-SGSCVPLYDGYKCVC 284
           PC   G+CV     Y+C C
Sbjct: 10  PCQNGGTCVNTVGSYRCSC 28


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
          growth factor (EGF) presents in a large number of
          proteins, mostly animal; the list of proteins currently
          known to contain one or more copies of an EGF-like
          pattern is large and varied; the functional
          significance of EGF-like domains in what appear to be
          unrelated proteins is not yet clear; a common feature
          is that these repeats are found in the extracellular
          domain of membrane-bound proteins or in proteins known
          to be secreted (exception: prostaglandin G/H synthase);
          the domain includes six cysteine residues which have
          been shown to be involved in disulfide bonds; the main
          structure is a two-stranded beta-sheet followed by a
          loop to a C-terminal short two-stranded sheet;
          Subdomains between the conserved cysteines vary in
          length; the region between the 5th and 6th cysteine
          contains two conserved glycines of which at  least  one
           is  present  in  most EGF-like domains; a subset of
          these bind calcium.
          Length = 36

 Score = 42.1 bits (99), Expect = 1e-05
 Identities = 17/30 (56%), Positives = 22/30 (73%), Gaps = 1/30 (3%)

Query: 51 PCEHGGKCITTPGSSYCLCPLGFTGDL-CQ 79
          PC +GG C+ TPGS  C+CP G+TGD  C+
Sbjct: 7  PCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36



 Score = 25.9 bits (57), Expect = 5.2
 Identities = 11/27 (40%), Positives = 14/27 (51%), Gaps = 4/27 (14%)

Query: 259 CAHPCITQPCGS-GSCVPLYDGYKCVC 284
           CA      PC + G+CV     Y+CVC
Sbjct: 2   CAASN---PCSNGGTCVNTPGSYRCVC 25


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 41.8 bits (99), Expect = 1e-05
 Identities = 17/40 (42%), Positives = 22/40 (55%), Gaps = 5/40 (12%)

Query: 41 ECLEDKCNKTPCEHGGKCITTPGSSYCLCPLGFT-GDLCQ 79
          EC        PC++GG C+ T GS  C CP G+T G  C+
Sbjct: 4  ECASG----NPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39



 Score = 26.4 bits (59), Expect = 4.7
 Identities = 8/19 (42%), Positives = 10/19 (52%), Gaps = 1/19 (5%)

Query: 267 PCGSGS-CVPLYDGYKCVC 284
           PC +G  CV     Y+C C
Sbjct: 10  PCQNGGTCVNTVGSYRCEC 28


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
          between noise and signal. pfam00053 is very similar,
          but has 8 instead of 6 conserved cysteines. Includes
          some cytokine receptors. The EGF domain misses the
          N-terminus regions of the Ca2+ binding EGF domains
          (this is the main reason of discrepancy between
          swiss-prot domain start/end and Pfam). The family is
          hard to model due to many similar but different
          sub-types of EGF domains. Pfam certainly misses a
          number of EGF domains.
          Length = 32

 Score = 41.7 bits (98), Expect = 1e-05
 Identities = 15/31 (48%), Positives = 19/31 (61%), Gaps = 1/31 (3%)

Query: 47 CNK-TPCEHGGKCITTPGSSYCLCPLGFTGD 76
          C+   PC +GG C+ TPG   C CP G+TG 
Sbjct: 1  CSPNNPCSNGGTCVDTPGGYTCECPEGYTGK 31


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 36.0 bits (83), Expect = 0.002
 Identities = 18/36 (50%), Positives = 20/36 (55%), Gaps = 5/36 (13%)

Query: 41 ECLEDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGD 76
          EC        PC +G  CI TPGS  C CP G+TGD
Sbjct: 1  ECAS----GGPCSNG-TCINTPGSYTCSCPPGYTGD 31



 Score = 28.3 bits (63), Expect = 0.98
 Identities = 8/24 (33%), Positives = 11/24 (45%), Gaps = 1/24 (4%)

Query: 262 PC-ITQPCGSGSCVPLYDGYKCVC 284
            C    PC +G+C+     Y C C
Sbjct: 1   ECASGGPCSNGTCINTPGSYTCSC 24


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
          EGF-like domain homologues. This family includes the
          C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 28.3 bits (64), Expect = 0.81
 Identities = 10/20 (50%), Positives = 11/20 (55%)

Query: 57 KCITTPGSSYCLCPLGFTGD 76
           C  T GS  C C  G+TGD
Sbjct: 13 TCTNTGGSFTCTCKSGYTGD 32


>gnl|CDD|204999 pfam12661, hEGF, Human growth factor-like EGF.  hEGF, or human
          growth factor-like EGF, domains have six conserved
          residues disulfide-bonded into the characteristic
          'ababcc' pattern. They are involved in growth and
          proliferation of cells, in proteins of the Notch/Delta
          pathway, neurogulin and selectins. hEGFs are also found
          in mosaic proteins with four-disulfide laminin EGFs
          such as aggrecan and perlecan. The core fold of the EGF
          domain consists of two small beta-hairpins packed
          against each other. Two major structural variants have
          been identified based on the structural context of the
          C-terminal Cys residue of disulfide 'c' in the
          C-terminal hairpin: hEGFs and cEGFs. In hEGFs the
          C-terminal thiol resides in the beta-turn, resulting in
          shorter loop-lengths between the Cys residues of
          disulfide 'c', typically C[8-9]XC. These shorter
          loop-lengths are also typical of the four-disulfide EGF
          domains, laminin ad integrin. Tandem hEGF domains have
          six linking residues between terminal cysteines of
          adjacent domains. hEGF domains may or may not bind
          calcium in the linker region. hEGF domains with the
          consensus motif CXD4X[F,Y]XCXC are hydroxylated
          exclusively in the Asp residue.
          Length = 13

 Score = 28.1 bits (64), Expect = 0.85
 Identities = 7/12 (58%), Positives = 8/12 (66%)

Query: 67 CLCPLGFTGDLC 78
          C CP G+TG  C
Sbjct: 2  CQCPPGYTGPRC 13


>gnl|CDD|109365 pfam00304, Gamma-thionin, Gamma-thionin family. 
          Length = 47

 Score = 28.4 bits (64), Expect = 1.0
 Identities = 10/31 (32%), Positives = 13/31 (41%), Gaps = 2/31 (6%)

Query: 258 NCAHPCITQPCGSGSCVPLYDGYKCVCTQSC 288
           NCA  C  +    G C       +C CT+ C
Sbjct: 19  NCAKVCRNEGFAGGHCS--GFRRRCFCTKPC 47


>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain. 
          Length = 42

 Score = 28.1 bits (63), Expect = 1.1
 Identities = 11/32 (34%), Positives = 15/32 (46%), Gaps = 2/32 (6%)

Query: 45 DKCN--KTPCEHGGKCITTPGSSYCLCPLGFT 74
          D+C      C     C+ T GS  C+CP G+ 
Sbjct: 3  DECADGTHNCPANTVCVNTIGSFECVCPDGYE 34


>gnl|CDD|219677 pfam07974, EGF_2, EGF-like domain.  This family contains EGF
          domains found in a variety of extracellular proteins.
          Length = 31

 Score = 26.6 bits (59), Expect = 2.5
 Identities = 9/31 (29%), Positives = 12/31 (38%), Gaps = 2/31 (6%)

Query: 48 NKTPCEHGGKCITTPGSSYCLCPLGFTGDLC 78
              C   G C+   G   C+C  G+ G  C
Sbjct: 3  ASGICNGRGTCVRPCGK--CVCDSGYQGATC 31


>gnl|CDD|219377 pfam07333, SLR1-BP, S locus-related glycoprotein 1 binding pollen
          coat protein (SLR1-BP).  This family consists of a
          number of cysteine rich SLR1 binding pollen coat like
          proteins. Adhesion of pollen grains to the stigmatic
          surface is a critical step during sexual reproduction
          in plants. In Brassica, S locus-related glycoprotein 1
          (SLR1), a stigma-specific protein belonging to the S
          gene family of proteins, has been shown to be involved
          in this step. SLR1-BP specifically binds SLR1 with high
          affinity. The SLR1-BP gene is specifically expressed in
          pollen at late stages of development and is a member of
          the class A pollen coat protein (PCP) family, which
          includes PCP-A1, an SLG (S locus glycoprotein)-binding
          protein.
          Length = 57

 Score = 27.5 bits (61), Expect = 2.7
 Identities = 14/53 (26%), Positives = 19/53 (35%), Gaps = 15/53 (28%)

Query: 32 GNKIKGRSKECLEDKCNKTPCEHG--------------GKCITTPGSS-YCLC 69
            + +G+ ++C E   N  PC                 G CI TP     CLC
Sbjct: 1  VKETQGQERQCHEILPNGGPCVPDECAALCKKKYKGGVGTCIPTPKGKKQCLC 53


>gnl|CDD|130153 TIGR01081, mpl,
           UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-
           diaminopimelate ligase.  Alternate name: murein
           tripeptide ligase [Cell envelope, Biosynthesis and
           degradation of murein sacculus and peptidoglycan].
          Length = 448

 Score = 30.2 bits (68), Expect = 2.8
 Identities = 24/76 (31%), Positives = 34/76 (44%), Gaps = 4/76 (5%)

Query: 295 EVMCASFDHGASFLHYLDPEIIHSSGLLHYKFQLHVDFRITGYEFNIDMRIKTSSSSGLI 354
           E   A FD  + F+HY  P  +  + L      +  D +    +F+    ++T    GLI
Sbjct: 160 EYDTAFFDKRSKFVHY-RPRTLVLNNLEFDHADIFDDLKAIQRQFH--HLVRTVPGEGLI 216

Query: 355 LWTGSSQS-KDFLALG 369
           L  G  QS KD LA G
Sbjct: 217 LCPGRDQSLKDTLAKG 232


>gnl|CDD|146221 pfam03465, eRF1_3, eRF1 domain 3.  The release factor eRF1
           terminates protein biosynthesis by recognising stop
           codons at the A site of the ribosome and stimulating
           peptidyl-tRNA bond hydrolysis at the peptidyl
           transferase centre. The crystal structure of human eRF1
           is known. The overall shape and dimensions of eRF1
           resemble a tRNA molecule with domains 1, 2, and 3 of
           eRF1 corresponding to the anticodon loop, aminoacyl
           acceptor stem, and T stem of a tRNA molecule,
           respectively. The position of the essential GGQ motif at
           an exposed tip of domain 2 suggests that the Gln residue
           coordinates a water molecule to mediate the hydrolytic
           activity at the peptidyl transferase centre. A conserved
           groove on domain 1, 80 A from the GGQ motif, is proposed
           to form the codon recognition site. This family also
           includes other proteins for which the precise molecular
           function is unknown. Many of them are from
           Archaebacteria. These proteins may also be involved in
           translation termination but this awaits experimental
           verification.
          Length = 100

 Score = 27.9 bits (63), Expect = 5.0
 Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 462 FEQLNTDSGLYVGGVEDIEE 481
            E+L  D+GL V GVE++ +
Sbjct: 12  LEELAKDTGLAVYGVEEVLK 31


>gnl|CDD|185677 cd00817, ValRS_core, catalytic core domain of valyl-tRNA
           synthetases.  Valine amino-acyl tRNA synthetase (ValRS)
           catalytic core domain. This enzyme is a monomer which
           aminoacylates the 2'-OH of the nucleotide at the 3' of
           the appropriate tRNA. The core domain is based on the
           Rossman fold and is responsible for the ATP-dependent
           formation of the enzyme bound aminoacyl-adenylate. It
           contains the characteristic class I HIGH and KMSKS
           motifs, which are involved in ATP binding.  ValRS has an
           insertion in the core domain, which is subject to both
           deletions and rearrangements. This editing region
           hydrolyzes mischarged cognate tRNAs and thus prevents
           the incorporation of chemically similar amino acids.
          Length = 382

 Score = 29.5 bits (67), Expect = 5.2
 Identities = 13/48 (27%), Positives = 19/48 (39%), Gaps = 2/48 (4%)

Query: 12  DWNTVR--VFRHRWDAWIQLNDGNKIKGRSKECLEDKCNKTPCEHGGK 57
           DW   R   + HR  AW   + G+ +  R ++   DK     C   G 
Sbjct: 211 DWCISRQLWWGHRIPAWYCKDGGHWVVAREEDEAIDKAAPEACVPCGG 258


>gnl|CDD|226766 COG4316, COG4316, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 138

 Score = 28.0 bits (62), Expect = 6.4
 Identities = 14/52 (26%), Positives = 20/52 (38%)

Query: 136 FIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWHKLRLTRTGRHAYLQVD 187
            IA    + Y+   FD    A   R+  P ++GEW K+ L        L   
Sbjct: 85  TIAGLCVNGYIHAVFDFVEQAGYCRNGFPEAIGEWVKVELDDELLDKILSRK 136


>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
           extracellular matrix molecules mediate cell-matrix and
           matrix-matrix interactions thereby providing tissue
           integrity. Some members of the matrilin family are
           expressed specifically in developing cartilage
           rudiments. The matrilin family consists of at least four
           members. All the members of the matrilin family contain
           VWA domains, EGF-like domains and a heptad repeat
           coiled-coiled domain at the carboxy terminus which is
           responsible for the oligomerization of the matrilins.
           The VWA domains have been shown to be essential for
           matrilin network formation by interacting with matrix
           ligands.
          Length = 224

 Score = 28.5 bits (64), Expect = 6.9
 Identities = 12/34 (35%), Positives = 16/34 (47%), Gaps = 4/34 (11%)

Query: 43  LEDKCNKTP--CEHGGKCITTPGSSYCLCPLGFT 74
           + D C      C+    CI+TPGS  C C  G+ 
Sbjct: 186 VPDLCATLSHVCQQV--CISTPGSYLCACTEGYA 217


>gnl|CDD|184896 PRK14903, PRK14903, 16S rRNA methyltransferase B; Provisional.
          Length = 431

 Score = 28.7 bits (64), Expect = 7.8
 Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 2/32 (6%)

Query: 209 LYLGGVPDYNIVSPKVKI--KSSFIGCIQKVL 238
           L++  VPDY  VS  VK+    +F   +  VL
Sbjct: 85  LFMNSVPDYAAVSETVKLVKNENFKKLVNAVL 116


>gnl|CDD|225333 COG2706, COG2706, 3-carboxymuconate cyclase [Carbohydrate transport
           and metabolism].
          Length = 346

 Score = 28.5 bits (64), Expect = 9.1
 Identities = 23/109 (21%), Positives = 33/109 (30%), Gaps = 7/109 (6%)

Query: 116 AIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWHKLRL 175
           A+ P+   LY  +     G   A  ++      TF       TL  S P  +      R 
Sbjct: 46  AVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTF---LNRQTLPGSPPCYVSVDEDGRF 102

Query: 176 TRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKV 224
                +    V  +P     S  P  Q    +  + G  P     SP V
Sbjct: 103 VFVANYHSGSVSVYPLQADGSLQPVVQ----VVKHTGSGPHERQESPHV 147


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.138    0.430 

Gapped
Lambda     K      H
   0.267   0.0715    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,269,302
Number of extensions: 2490646
Number of successful extensions: 1647
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1631
Number of HSP's successfully gapped: 37
Length of query: 533
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 432
Effective length of database: 6,457,848
Effective search space: 2789790336
Effective search space used: 2789790336
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (27.3 bits)