RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5025
(533 letters)
>gnl|CDD|238058 cd00110, LamG, Laminin G domain; Laminin G-like domains are usually
Ca++ mediated receptors that can have binding sites for
steroids, beta1 integrins, heparin, sulfatides,
fibulin-1, and alpha-dystroglycans. Proteins that
contain LamG domains serve a variety of purposes
including signal transduction via cell-surface steroid
receptors, adhesion, migration and differentiation
through mediation of cell adhesion molecules.
Length = 151
Score = 127 bits (321), Expect = 9e-35
Identities = 57/152 (37%), Positives = 85/152 (55%), Gaps = 4/152 (2%)
Query: 89 FNGSSYLRYQGLGEQYLSLLDLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDF 148
F+GSSY+R L L ++ F+ PNG+LLY G GDF+AL L D +
Sbjct: 4 FSGSSYVRLPTLPAPRTRL-SISFSFRTTSPNGLLLYAGS--QNGGDFLALELEDGRLVL 60
Query: 149 TFDLGTGAATLRSSNPISLGEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLS 208
+DLG+G+ L S P++ G+WH + + R GR L VD + SPG L+L
Sbjct: 61 RYDLGSGSLVLSSKTPLNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGP 120
Query: 209 LYLGGVPDYNIVSPKVKIKSSFIGCIQKVLIN 240
LYLGG+P+ ++ SP + + F+GCI+ + +N
Sbjct: 121 LYLGGLPE-DLKSPGLPVSPGFVGCIRDLKVN 151
Score = 105 bits (265), Expect = 6e-27
Identities = 43/168 (25%), Positives = 74/168 (44%), Gaps = 33/168 (19%)
Query: 333 RITGYEFNIDMRIKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQ 392
+I +T+S +GL+L+ GS DFLAL + +G + Y+LG+G ++L + +
Sbjct: 16 PAPRTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGSGSLVLSS-K 74
Query: 393 TKINNGVWHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDY 452
T +N+G WH ++V RN G+ + +D +
Sbjct: 75 TPLNDGQWHSVSVERN--------------------------------GRSVTLSVDGER 102
Query: 453 SMQSISGGKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLIL 500
++S S G LN D LY+GG+ + + F G I +L +
Sbjct: 103 VVESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCIRDLKV 150
>gnl|CDD|214598 smart00282, LamG, Laminin G domain.
Length = 132
Score = 118 bits (297), Expect = 2e-31
Identities = 49/135 (36%), Positives = 75/135 (55%), Gaps = 4/135 (2%)
Query: 109 DLTIVFKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSS-NPISL 167
++ F+ PNG+LLY G + G D++AL L D + +DLG+G A L S P++
Sbjct: 1 SISFSFRTTSPNGLLLYAGSKGGG--DYLALELRDGRLVLRYDLGSGPARLTSDPTPLND 58
Query: 168 GEWHKLRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIK 227
G+WH++ + R GR L VD SPG T L+L LYLGG+P+ ++ P + +
Sbjct: 59 GQWHRVAVERNGRSVTLSVDGGNRVSGESPGGLTILNLDGPLYLGGLPE-DLKLPPLPVT 117
Query: 228 SSFIGCIQKVLINNH 242
F GCI+ + +N
Sbjct: 118 PGFRGCIRNLKVNGK 132
Score = 106 bits (267), Expect = 2e-27
Identities = 44/163 (26%), Positives = 72/163 (44%), Gaps = 32/163 (19%)
Query: 340 NIDMRIKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGV 399
+I +T+S +GL+L+ GS D+LAL + +G + Y+LG+G L ++ T +N+G
Sbjct: 1 SISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQ 60
Query: 400 WHRLTVHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISG 459
WHR+ V RN G+ + +D + S
Sbjct: 61 WHRVAVERN--------------------------------GRSVTLSVDGGNRVSGESP 88
Query: 460 GKFEQLNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLILSN 502
G LN D LY+GG+ + +L F+G I NL ++
Sbjct: 89 GGLTILNLDGPLYLGGLPEDLKLPPLPVTPGFRGCIRNLKVNG 131
>gnl|CDD|216930 pfam02210, Laminin_G_2, Laminin G domain. This family includes the
Thrombospondin N-terminal-like domain, a Laminin G
subfamily.
Length = 124
Score = 110 bits (276), Expect = 9e-29
Identities = 44/127 (34%), Positives = 69/127 (54%), Gaps = 4/127 (3%)
Query: 114 FKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGA-ATLRSSNPISLGEWHK 172
F+ +PNG+LLY G DF+AL L D + +DLG+G L S ++ G+WH+
Sbjct: 1 FRTTQPNGLLLYAGGEDGL--DFLALELEDGRLVLRYDLGSGGSVLLLSGKKLNDGQWHR 58
Query: 173 LRLTRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIG 232
+ ++R GR L VD PG + L+L+ LYLGG+P+ + + + F+G
Sbjct: 59 VSVSRDGRSLTLSVDGGTVVSEALPGSSSILNLNGPLYLGGLPE-DSGLSLLPVTEGFVG 117
Query: 233 CIQKVLI 239
CI+ V +
Sbjct: 118 CIRNVRV 124
Score = 95.2 bits (237), Expect = 2e-23
Identities = 38/156 (24%), Positives = 66/156 (42%), Gaps = 32/156 (20%)
Query: 345 IKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLT 404
+T+ +GL+L+ G DFLAL + +G + Y+LG+G +L + K+N+G WHR++
Sbjct: 1 FRTTQPNGLLLYAGGEDGLDFLALELEDGRLVLRYDLGSGGSVLLLSGKKLNDGQWHRVS 60
Query: 405 VHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRIDTDYSMQSISGGKFEQ 464
V R+ G+ + +D + G
Sbjct: 61 VSRD--------------------------------GRSLTLSVDGGTVVSEALPGSSSI 88
Query: 465 LNTDSGLYVGGVEDIEELTRSKYHHSFKGWISNLIL 500
LN + LY+GG+ + L+ F G I N+ +
Sbjct: 89 LNLNGPLYLGGLPEDSGLSLLPVTEGFVGCIRNVRV 124
>gnl|CDD|215681 pfam00054, Laminin_G_1, Laminin G domain.
Length = 131
Score = 108 bits (272), Expect = 3e-28
Identities = 52/133 (39%), Positives = 79/133 (59%), Gaps = 3/133 (2%)
Query: 114 FKAIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWHKL 173
F+ EP+G+LLYNG + + DF+AL L D ++ ++DLG+GAA +RS + ++ G+WH +
Sbjct: 1 FRTTEPSGLLLYNGTQTER--DFLALELRDGRLEVSYDLGSGAAVVRSGDKLNDGKWHSV 58
Query: 174 RLTRTGRHAYLQVDRFPSSQILSP-GPFTQLSLSLSLYLGGVPDYNIVSPKVKIKSSFIG 232
L R GR L VD SP G T L + LY+GG+P + ++ I SF G
Sbjct: 59 ELERNGRSGTLSVDGEARVTGESPLGATTDLDVDGPLYVGGLPSLAVKLRRLAISPSFDG 118
Query: 233 CIQKVLINNHELN 245
CI+ V++N L+
Sbjct: 119 CIRDVIVNGKPLD 131
Score = 81.6 bits (202), Expect = 2e-18
Identities = 40/160 (25%), Positives = 67/160 (41%), Gaps = 35/160 (21%)
Query: 345 IKTSSSSGLILWTGSSQSKDFLALGIHNGYAHVMYNLGNGEVLLENNQTKINNGVWHRLT 404
+T+ SGL+L+ G+ +DFLAL + +G V Y+LG+G ++ + K+N+G WH +
Sbjct: 1 FRTTEPSGLLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGD-KLNDGKWHSVE 59
Query: 405 VHRNFMVVMHGSSHTRSNKSSKIEFKCLSIQRLHRTGKQCLIRID-TDYSMQSISGGKFE 463
+ RN G+ + +D G
Sbjct: 60 LERN--------------------------------GRSGTLSVDGEARVTGESPLGATT 87
Query: 464 QLNTDSGLYVGGVEDIEELTRSKYH-HSFKGWISNLILSN 502
L+ D LYVGG+ + R SF G I ++I++
Sbjct: 88 DLDVDGPLYVGGLPSLAVKLRRLAISPSFDGCIRDVIVNG 127
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular
(mostly animal) proteins. Many of these proteins
require calcium for their biological function and
calcium-binding sites have been found to be located at
the N-terminus of particular EGF-like domains;
calcium-binding may be crucial for numerous
protein-protein interactions. Six conserved core
cysteines form three disulfide bridges as in non
calcium-binding EGF domains, whose structures are very
similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 46.5 bits (111), Expect = 3e-07
Identities = 17/39 (43%), Positives = 22/39 (56%), Gaps = 4/39 (10%)
Query: 41 ECLEDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGDLCQ 79
EC PC++GG C+ T GS C CP G+TG C+
Sbjct: 4 ECASG----NPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
Score = 27.2 bits (61), Expect = 1.9
Identities = 8/19 (42%), Positives = 10/19 (52%), Gaps = 1/19 (5%)
Query: 267 PCG-SGSCVPLYDGYKCVC 284
PC G+CV Y+C C
Sbjct: 10 PCQNGGTCVNTVGSYRCSC 28
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional
significance of EGF-like domains in what appear to be
unrelated proteins is not yet clear; a common feature
is that these repeats are found in the extracellular
domain of membrane-bound proteins or in proteins known
to be secreted (exception: prostaglandin G/H synthase);
the domain includes six cysteine residues which have
been shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 42.1 bits (99), Expect = 1e-05
Identities = 17/30 (56%), Positives = 22/30 (73%), Gaps = 1/30 (3%)
Query: 51 PCEHGGKCITTPGSSYCLCPLGFTGDL-CQ 79
PC +GG C+ TPGS C+CP G+TGD C+
Sbjct: 7 PCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36
Score = 25.9 bits (57), Expect = 5.2
Identities = 11/27 (40%), Positives = 14/27 (51%), Gaps = 4/27 (14%)
Query: 259 CAHPCITQPCGS-GSCVPLYDGYKCVC 284
CA PC + G+CV Y+CVC
Sbjct: 2 CAASN---PCSNGGTCVNTPGSYRCVC 25
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 41.8 bits (99), Expect = 1e-05
Identities = 17/40 (42%), Positives = 22/40 (55%), Gaps = 5/40 (12%)
Query: 41 ECLEDKCNKTPCEHGGKCITTPGSSYCLCPLGFT-GDLCQ 79
EC PC++GG C+ T GS C CP G+T G C+
Sbjct: 4 ECASG----NPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39
Score = 26.4 bits (59), Expect = 4.7
Identities = 8/19 (42%), Positives = 10/19 (52%), Gaps = 1/19 (5%)
Query: 267 PCGSGS-CVPLYDGYKCVC 284
PC +G CV Y+C C
Sbjct: 10 PCQNGGTCVNTVGSYRCEC 28
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar,
but has 8 instead of 6 conserved cysteines. Includes
some cytokine receptors. The EGF domain misses the
N-terminus regions of the Ca2+ binding EGF domains
(this is the main reason of discrepancy between
swiss-prot domain start/end and Pfam). The family is
hard to model due to many similar but different
sub-types of EGF domains. Pfam certainly misses a
number of EGF domains.
Length = 32
Score = 41.7 bits (98), Expect = 1e-05
Identities = 15/31 (48%), Positives = 19/31 (61%), Gaps = 1/31 (3%)
Query: 47 CNK-TPCEHGGKCITTPGSSYCLCPLGFTGD 76
C+ PC +GG C+ TPG C CP G+TG
Sbjct: 1 CSPNNPCSNGGTCVDTPGGYTCECPEGYTGK 31
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 36.0 bits (83), Expect = 0.002
Identities = 18/36 (50%), Positives = 20/36 (55%), Gaps = 5/36 (13%)
Query: 41 ECLEDKCNKTPCEHGGKCITTPGSSYCLCPLGFTGD 76
EC PC +G CI TPGS C CP G+TGD
Sbjct: 1 ECAS----GGPCSNG-TCINTPGSYTCSCPPGYTGD 31
Score = 28.3 bits (63), Expect = 0.98
Identities = 8/24 (33%), Positives = 11/24 (45%), Gaps = 1/24 (4%)
Query: 262 PC-ITQPCGSGSCVPLYDGYKCVC 284
C PC +G+C+ Y C C
Sbjct: 1 ECASGGPCSNGTCINTPGSYTCSC 24
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 28.3 bits (64), Expect = 0.81
Identities = 10/20 (50%), Positives = 11/20 (55%)
Query: 57 KCITTPGSSYCLCPLGFTGD 76
C T GS C C G+TGD
Sbjct: 13 TCTNTGGSFTCTCKSGYTGD 32
>gnl|CDD|204999 pfam12661, hEGF, Human growth factor-like EGF. hEGF, or human
growth factor-like EGF, domains have six conserved
residues disulfide-bonded into the characteristic
'ababcc' pattern. They are involved in growth and
proliferation of cells, in proteins of the Notch/Delta
pathway, neurogulin and selectins. hEGFs are also found
in mosaic proteins with four-disulfide laminin EGFs
such as aggrecan and perlecan. The core fold of the EGF
domain consists of two small beta-hairpins packed
against each other. Two major structural variants have
been identified based on the structural context of the
C-terminal Cys residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In hEGFs the
C-terminal thiol resides in the beta-turn, resulting in
shorter loop-lengths between the Cys residues of
disulfide 'c', typically C[8-9]XC. These shorter
loop-lengths are also typical of the four-disulfide EGF
domains, laminin ad integrin. Tandem hEGF domains have
six linking residues between terminal cysteines of
adjacent domains. hEGF domains may or may not bind
calcium in the linker region. hEGF domains with the
consensus motif CXD4X[F,Y]XCXC are hydroxylated
exclusively in the Asp residue.
Length = 13
Score = 28.1 bits (64), Expect = 0.85
Identities = 7/12 (58%), Positives = 8/12 (66%)
Query: 67 CLCPLGFTGDLC 78
C CP G+TG C
Sbjct: 2 CQCPPGYTGPRC 13
>gnl|CDD|109365 pfam00304, Gamma-thionin, Gamma-thionin family.
Length = 47
Score = 28.4 bits (64), Expect = 1.0
Identities = 10/31 (32%), Positives = 13/31 (41%), Gaps = 2/31 (6%)
Query: 258 NCAHPCITQPCGSGSCVPLYDGYKCVCTQSC 288
NCA C + G C +C CT+ C
Sbjct: 19 NCAKVCRNEGFAGGHCS--GFRRRCFCTKPC 47
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 28.1 bits (63), Expect = 1.1
Identities = 11/32 (34%), Positives = 15/32 (46%), Gaps = 2/32 (6%)
Query: 45 DKCN--KTPCEHGGKCITTPGSSYCLCPLGFT 74
D+C C C+ T GS C+CP G+
Sbjct: 3 DECADGTHNCPANTVCVNTIGSFECVCPDGYE 34
>gnl|CDD|219677 pfam07974, EGF_2, EGF-like domain. This family contains EGF
domains found in a variety of extracellular proteins.
Length = 31
Score = 26.6 bits (59), Expect = 2.5
Identities = 9/31 (29%), Positives = 12/31 (38%), Gaps = 2/31 (6%)
Query: 48 NKTPCEHGGKCITTPGSSYCLCPLGFTGDLC 78
C G C+ G C+C G+ G C
Sbjct: 3 ASGICNGRGTCVRPCGK--CVCDSGYQGATC 31
>gnl|CDD|219377 pfam07333, SLR1-BP, S locus-related glycoprotein 1 binding pollen
coat protein (SLR1-BP). This family consists of a
number of cysteine rich SLR1 binding pollen coat like
proteins. Adhesion of pollen grains to the stigmatic
surface is a critical step during sexual reproduction
in plants. In Brassica, S locus-related glycoprotein 1
(SLR1), a stigma-specific protein belonging to the S
gene family of proteins, has been shown to be involved
in this step. SLR1-BP specifically binds SLR1 with high
affinity. The SLR1-BP gene is specifically expressed in
pollen at late stages of development and is a member of
the class A pollen coat protein (PCP) family, which
includes PCP-A1, an SLG (S locus glycoprotein)-binding
protein.
Length = 57
Score = 27.5 bits (61), Expect = 2.7
Identities = 14/53 (26%), Positives = 19/53 (35%), Gaps = 15/53 (28%)
Query: 32 GNKIKGRSKECLEDKCNKTPCEHG--------------GKCITTPGSS-YCLC 69
+ +G+ ++C E N PC G CI TP CLC
Sbjct: 1 VKETQGQERQCHEILPNGGPCVPDECAALCKKKYKGGVGTCIPTPKGKKQCLC 53
>gnl|CDD|130153 TIGR01081, mpl,
UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-
diaminopimelate ligase. Alternate name: murein
tripeptide ligase [Cell envelope, Biosynthesis and
degradation of murein sacculus and peptidoglycan].
Length = 448
Score = 30.2 bits (68), Expect = 2.8
Identities = 24/76 (31%), Positives = 34/76 (44%), Gaps = 4/76 (5%)
Query: 295 EVMCASFDHGASFLHYLDPEIIHSSGLLHYKFQLHVDFRITGYEFNIDMRIKTSSSSGLI 354
E A FD + F+HY P + + L + D + +F+ ++T GLI
Sbjct: 160 EYDTAFFDKRSKFVHY-RPRTLVLNNLEFDHADIFDDLKAIQRQFH--HLVRTVPGEGLI 216
Query: 355 LWTGSSQS-KDFLALG 369
L G QS KD LA G
Sbjct: 217 LCPGRDQSLKDTLAKG 232
>gnl|CDD|146221 pfam03465, eRF1_3, eRF1 domain 3. The release factor eRF1
terminates protein biosynthesis by recognising stop
codons at the A site of the ribosome and stimulating
peptidyl-tRNA bond hydrolysis at the peptidyl
transferase centre. The crystal structure of human eRF1
is known. The overall shape and dimensions of eRF1
resemble a tRNA molecule with domains 1, 2, and 3 of
eRF1 corresponding to the anticodon loop, aminoacyl
acceptor stem, and T stem of a tRNA molecule,
respectively. The position of the essential GGQ motif at
an exposed tip of domain 2 suggests that the Gln residue
coordinates a water molecule to mediate the hydrolytic
activity at the peptidyl transferase centre. A conserved
groove on domain 1, 80 A from the GGQ motif, is proposed
to form the codon recognition site. This family also
includes other proteins for which the precise molecular
function is unknown. Many of them are from
Archaebacteria. These proteins may also be involved in
translation termination but this awaits experimental
verification.
Length = 100
Score = 27.9 bits (63), Expect = 5.0
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 462 FEQLNTDSGLYVGGVEDIEE 481
E+L D+GL V GVE++ +
Sbjct: 12 LEELAKDTGLAVYGVEEVLK 31
>gnl|CDD|185677 cd00817, ValRS_core, catalytic core domain of valyl-tRNA
synthetases. Valine amino-acyl tRNA synthetase (ValRS)
catalytic core domain. This enzyme is a monomer which
aminoacylates the 2'-OH of the nucleotide at the 3' of
the appropriate tRNA. The core domain is based on the
Rossman fold and is responsible for the ATP-dependent
formation of the enzyme bound aminoacyl-adenylate. It
contains the characteristic class I HIGH and KMSKS
motifs, which are involved in ATP binding. ValRS has an
insertion in the core domain, which is subject to both
deletions and rearrangements. This editing region
hydrolyzes mischarged cognate tRNAs and thus prevents
the incorporation of chemically similar amino acids.
Length = 382
Score = 29.5 bits (67), Expect = 5.2
Identities = 13/48 (27%), Positives = 19/48 (39%), Gaps = 2/48 (4%)
Query: 12 DWNTVR--VFRHRWDAWIQLNDGNKIKGRSKECLEDKCNKTPCEHGGK 57
DW R + HR AW + G+ + R ++ DK C G
Sbjct: 211 DWCISRQLWWGHRIPAWYCKDGGHWVVAREEDEAIDKAAPEACVPCGG 258
>gnl|CDD|226766 COG4316, COG4316, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 138
Score = 28.0 bits (62), Expect = 6.4
Identities = 14/52 (26%), Positives = 20/52 (38%)
Query: 136 FIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWHKLRLTRTGRHAYLQVD 187
IA + Y+ FD A R+ P ++GEW K+ L L
Sbjct: 85 TIAGLCVNGYIHAVFDFVEQAGYCRNGFPEAIGEWVKVELDDELLDKILSRK 136
>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
extracellular matrix molecules mediate cell-matrix and
matrix-matrix interactions thereby providing tissue
integrity. Some members of the matrilin family are
expressed specifically in developing cartilage
rudiments. The matrilin family consists of at least four
members. All the members of the matrilin family contain
VWA domains, EGF-like domains and a heptad repeat
coiled-coiled domain at the carboxy terminus which is
responsible for the oligomerization of the matrilins.
The VWA domains have been shown to be essential for
matrilin network formation by interacting with matrix
ligands.
Length = 224
Score = 28.5 bits (64), Expect = 6.9
Identities = 12/34 (35%), Positives = 16/34 (47%), Gaps = 4/34 (11%)
Query: 43 LEDKCNKTP--CEHGGKCITTPGSSYCLCPLGFT 74
+ D C C+ CI+TPGS C C G+
Sbjct: 186 VPDLCATLSHVCQQV--CISTPGSYLCACTEGYA 217
>gnl|CDD|184896 PRK14903, PRK14903, 16S rRNA methyltransferase B; Provisional.
Length = 431
Score = 28.7 bits (64), Expect = 7.8
Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 2/32 (6%)
Query: 209 LYLGGVPDYNIVSPKVKI--KSSFIGCIQKVL 238
L++ VPDY VS VK+ +F + VL
Sbjct: 85 LFMNSVPDYAAVSETVKLVKNENFKKLVNAVL 116
>gnl|CDD|225333 COG2706, COG2706, 3-carboxymuconate cyclase [Carbohydrate transport
and metabolism].
Length = 346
Score = 28.5 bits (64), Expect = 9.1
Identities = 23/109 (21%), Positives = 33/109 (30%), Gaps = 7/109 (6%)
Query: 116 AIEPNGILLYNGHRADGVGDFIALYLNDRYVDFTFDLGTGAATLRSSNPISLGEWHKLRL 175
A+ P+ LY + G A ++ TF TL S P + R
Sbjct: 46 AVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTF---LNRQTLPGSPPCYVSVDEDGRF 102
Query: 176 TRTGRHAYLQVDRFPSSQILSPGPFTQLSLSLSLYLGGVPDYNIVSPKV 224
+ V +P S P Q + + G P SP V
Sbjct: 103 VFVANYHSGSVSVYPLQADGSLQPVVQ----VVKHTGSGPHERQESPHV 147
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.138 0.430
Gapped
Lambda K H
0.267 0.0715 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,269,302
Number of extensions: 2490646
Number of successful extensions: 1647
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1631
Number of HSP's successfully gapped: 37
Length of query: 533
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 432
Effective length of database: 6,457,848
Effective search space: 2789790336
Effective search space used: 2789790336
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (27.3 bits)