RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3619
(418 letters)
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular (mostly
animal) proteins. Many of these proteins require calcium
for their biological function and calcium-binding sites
have been found to be located at the N-terminus of
particular EGF-like domains; calcium-binding may be
crucial for numerous protein-protein interactions. Six
conserved core cysteines form three disulfide bridges as
in non calcium-binding EGF domains, whose structures are
very similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 51.9 bits (125), Expect = 3e-09
Identities = 21/38 (55%), Positives = 26/38 (68%), Gaps = 1/38 (2%)
Query: 173 NIDDCKS-NPCLNGGICRDGVAKFTCDCPPGWTGSRCE 209
+ID+C S NPC NGG C + V + C CPPG+TG CE
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
Score = 49.6 bits (119), Expect = 2e-08
Identities = 20/37 (54%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 96 IDDCASE-PCQNGGSCVDMLDGFMCQCRPGFVGLQCE 131
ID+CAS PCQNGG+CV+ + + C C PG+ G CE
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
Score = 43.4 bits (103), Expect = 3e-06
Identities = 19/40 (47%), Positives = 24/40 (60%), Gaps = 3/40 (7%)
Query: 133 DIDECLS-DPCSPEGTDKCLDLDNKFQCECNVGYTGVMCE 171
DIDEC S +PC GT C++ ++C C GYTG CE
Sbjct: 1 DIDECASGNPCQNGGT--CVNTVGSYRCSCPPGYTGRNCE 38
Score = 39.9 bits (94), Expect = 5e-05
Identities = 12/24 (50%), Positives = 18/24 (75%)
Query: 70 GTCTDLVGGFSCSCDLGFTGKHCQ 93
GTC + VG + CSC G+TG++C+
Sbjct: 15 GTCVNTVGSYRCSCPPGYTGRNCE 38
>gnl|CDD|222092 pfam13385, Laminin_G_3, Concanavalin A-like lectin/glucanases
superfamily. This domain belongs to the Concanavalin
A-like lectin/glucanases superfamily.
Length = 156
Score = 53.5 bits (129), Expect = 1e-08
Identities = 27/86 (31%), Positives = 41/86 (47%), Gaps = 8/86 (9%)
Query: 274 ATINDGQWHHVAIVWTQGTLTLITEGLIAGKVENYGSGKSLPQYGWVVLGKPRPDNIKAY 333
A + GQWHHVA+ + GTL L G++ G G+ + G + +G
Sbjct: 79 APLPPGQWHHVAVTYDGGTLKLYVNGVLVGSTTLSGTI-TSGTTGPLYIGA------SNG 131
Query: 334 TEAGFEGKLTKVQIWNRALDFTNEIQ 359
+ F G + +V+I+NRAL EI
Sbjct: 132 GDRYFNGAIDEVRIYNRALS-AAEIA 156
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 46.5 bits (111), Expect = 2e-07
Identities = 21/39 (53%), Positives = 27/39 (69%), Gaps = 2/39 (5%)
Query: 173 NIDDCKS-NPCLNGGICRDGVAKFTCDCPPGWT-GSRCE 209
+ID+C S NPC NGG C + V + C+CPPG+T G CE
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39
Score = 44.9 bits (107), Expect = 9e-07
Identities = 20/38 (52%), Positives = 27/38 (71%), Gaps = 2/38 (5%)
Query: 96 IDDCASE-PCQNGGSCVDMLDGFMCQCRPGFV-GLQCE 131
ID+CAS PCQNGG+CV+ + + C+C PG+ G CE
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39
Score = 39.2 bits (92), Expect = 9e-05
Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 4/41 (9%)
Query: 133 DIDECLS-DPCSPEGTDKCLDLDNKFQCECNVGYT-GVMCE 171
DIDEC S +PC GT C++ ++CEC GYT G CE
Sbjct: 1 DIDECASGNPCQNGGT--CVNTVGSYRCECPPGYTDGRNCE 39
Score = 34.9 bits (81), Expect = 0.003
Identities = 11/25 (44%), Positives = 17/25 (68%), Gaps = 1/25 (4%)
Query: 70 GTCTDLVGGFSCSCDLGFT-GKHCQ 93
GTC + VG + C C G+T G++C+
Sbjct: 15 GTCVNTVGSYRCECPPGYTDGRNCE 39
>gnl|CDD|216930 pfam02210, Laminin_G_2, Laminin G domain. This family includes the
Thrombospondin N-terminal-like domain, a Laminin G
subfamily.
Length = 124
Score = 47.1 bits (112), Expect = 1e-06
Identities = 28/115 (24%), Positives = 39/115 (33%), Gaps = 14/115 (12%)
Query: 247 LIQAHSSGVQVSLFPELQDAFLAFH------------EYATINDGQWHHVAIVWTQGTLT 294
L+ A L EL+D L +NDGQWH V++ +LT
Sbjct: 10 LLYAGGEDGLDFLALELEDGRLVLRYDLGSGGSVLLLSGKKLNDGQWHRVSVSRDGRSLT 69
Query: 295 LITEGLIAGKVENYGSGKSLPQYGWVVLGK--PRPDNIKAYTEAGFEGKLTKVQI 347
L +G GS L G + LG GF G + V++
Sbjct: 70 LSVDGGTVVSEALPGSSSILNLNGPLYLGGLPEDSGLSLLPVTEGFVGCIRNVRV 124
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 42.1 bits (99), Expect = 9e-06
Identities = 17/32 (53%), Positives = 20/32 (62%), Gaps = 1/32 (3%)
Query: 179 SNPCLNGGICRDGVAKFTCDCPPGWTGS-RCE 209
SNPC NGG C + + C CPPG+TG CE
Sbjct: 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36
Score = 37.8 bits (88), Expect = 3e-04
Identities = 14/28 (50%), Positives = 18/28 (64%)
Query: 100 ASEPCQNGGSCVDMLDGFMCQCRPGFVG 127
AS PC NGG+CV+ + C C PG+ G
Sbjct: 4 ASNPCSNGGTCVNTPGSYRCVCPPGYTG 31
Score = 31.3 bits (71), Expect = 0.050
Identities = 10/25 (40%), Positives = 15/25 (60%), Gaps = 1/25 (4%)
Query: 70 GTCTDLVGGFSCSCDLGFTG-KHCQ 93
GTC + G + C C G+TG + C+
Sbjct: 12 GTCVNTPGSYRCVCPPGYTGDRSCE 36
Score = 29.4 bits (66), Expect = 0.24
Identities = 17/38 (44%), Positives = 22/38 (57%), Gaps = 4/38 (10%)
Query: 136 EC-LSDPCSPEGTDKCLDLDNKFQCECNVGYTGVM-CE 171
EC S+PCS GT C++ ++C C GYTG CE
Sbjct: 1 ECAASNPCSNGGT--CVNTPGSYRCVCPPGYTGDRSCE 36
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar, but
has 8 instead of 6 conserved cysteines. Includes some
cytokine receptors. The EGF domain misses the N-terminus
regions of the Ca2+ binding EGF domains (this is the
main reason of discrepancy between swiss-prot domain
start/end and Pfam). The family is hard to model due to
many similar but different sub-types of EGF domains.
Pfam certainly misses a number of EGF domains.
Length = 32
Score = 39.7 bits (93), Expect = 4e-05
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 180 NPCLNGGICRDGVAKFTCDCPPGWTGSR 207
NPC NGG C D +TC+CP G+TG R
Sbjct: 5 NPCSNGGTCVDTPGGYTCECPEGYTGKR 32
Score = 34.3 bits (79), Expect = 0.004
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 103 PCQNGGSCVDMLDGFMCQCRPGFVG 127
PC NGG+CVD G+ C+C G+ G
Sbjct: 6 PCSNGGTCVDTPGGYTCECPEGYTG 30
Score = 29.3 bits (66), Expect = 0.28
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 70 GTCTDLVGGFSCSCDLGFTGKH 91
GTC D GG++C C G+TGK
Sbjct: 11 GTCVDTPGGYTCECPEGYTGKR 32
Score = 26.2 bits (58), Expect = 3.3
Identities = 14/27 (51%), Positives = 16/27 (59%), Gaps = 2/27 (7%)
Query: 141 PCSPEGTDKCLDLDNKFQCECNVGYTG 167
PCS GT C+D + CEC GYTG
Sbjct: 6 PCSNGGT--CVDTPGGYTCECPEGYTG 30
>gnl|CDD|238058 cd00110, LamG, Laminin G domain; Laminin G-like domains are usually
Ca++ mediated receptors that can have binding sites for
steroids, beta1 integrins, heparin, sulfatides,
fibulin-1, and alpha-dystroglycans. Proteins that
contain LamG domains serve a variety of purposes
including signal transduction via cell-surface steroid
receptors, adhesion, migration and differentiation
through mediation of cell adhesion molecules.
Length = 151
Score = 42.8 bits (101), Expect = 6e-05
Identities = 20/77 (25%), Positives = 37/77 (48%), Gaps = 4/77 (5%)
Query: 275 TINDGQWHHVAIVWTQGTLTLITEGLIAGKVENYGSGKSLPQYGWVVLGKPRPDNIKAYT 334
+NDGQWH V++ ++TL +G + + G L G + LG P+++K+
Sbjct: 76 PLNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGG-LPEDLKSPG 134
Query: 335 EA---GFEGKLTKVQIW 348
GF G + +++
Sbjct: 135 LPVSPGFVGCIRDLKVN 151
>gnl|CDD|184753 PRK14581, hmsF, outer membrane N-deacetylase; Provisional.
Length = 672
Score = 43.0 bits (101), Expect = 3e-04
Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 12/52 (23%)
Query: 327 PDNIKAYTEAGFEGKLTKV-------QIWNR-----ALDFTNEIQKQVRDCR 366
PD I+AY +AGF G +T + Q W R +DFTNE+ ++VRD R
Sbjct: 478 PDAIRAYEKAGFPGSITTIRQDPEMMQRWTRYKSKYLIDFTNELTREVRDIR 529
>gnl|CDD|238086 cd00152, PTX, Pentraxins are plasma proteins characterized by their
pentameric discoid assembly and their Ca2+ dependent
ligand binding, such as Serum amyloid P component (SAP)
and C-reactive Protein (CRP), which are
cytokine-inducible acute-phase proteins implicated in
innate immunity. CRP binds to ligands containing
phosphocholine, SAP binds to amyloid fibrils, DNA,
chromatin, fibronectin, C4-binding proteins and
glycosaminoglycans. "Long" pentraxins have N-terminal
extensions to the common pentraxin domain; one group,
the neuronal pentraxins, may be involved in synapse
formation and remodeling, and they may also be able to
form heteromultimers.
Length = 201
Score = 40.7 bits (96), Expect = 5e-04
Identities = 34/124 (27%), Positives = 59/124 (47%), Gaps = 24/124 (19%)
Query: 277 NDGQWHHVAIVWT--QGTLTLITEG-LIAGKVENYGSGKSLPQYGWVVLGKPRPDNIKAY 333
+DG WHH+ + W G L G L K G ++ G ++LG+ D+ Y
Sbjct: 88 SDGAWHHICVTWESTSGIAELWVNGKLSVRKS--LKKGYTVGPGGSIILGQ-EQDS---Y 141
Query: 334 ------TEAGFEGKLTKVQIWNRALDFTNEIQKQVRDCRTEPVLHTGLVLTWTG--YDDI 385
T++ F G+++ V +W+ L + E ++++ +E +G +L W Y +I
Sbjct: 142 GGGFDATQS-FVGEISDVNMWDSVL--SPE---EIKNVYSEGGTLSGNILNWRALNY-EI 194
Query: 386 NGGV 389
NGGV
Sbjct: 195 NGGV 198
>gnl|CDD|214598 smart00282, LamG, Laminin G domain.
Length = 132
Score = 39.2 bits (92), Expect = 7e-04
Identities = 22/79 (27%), Positives = 37/79 (46%), Gaps = 4/79 (5%)
Query: 275 TINDGQWHHVAIVWTQGTLTLITEGLIAGKVENYGSGKSLPQYGWVVLGKPRPDNIKAYT 334
+NDGQWH VA+ ++TL +G E+ G L G + LG P+++K
Sbjct: 55 PLNDGQWHRVAVERNGRSVTLSVDGGNRVSGESPGGLTILNLDGPLYLGG-LPEDLKLPP 113
Query: 335 EA---GFEGKLTKVQIWNR 350
GF G + +++ +
Sbjct: 114 LPVTPGFRGCIRNLKVNGK 132
>gnl|CDD|128463 smart00159, PTX, Pentraxin / C-reactive protein / pentaxin family.
This family form a doscoid pentameric structure. Human
serum amyloid P demonstrates calcium-mediated
ligand-binding.
Length = 206
Score = 38.8 bits (91), Expect = 0.003
Identities = 31/121 (25%), Positives = 56/121 (46%), Gaps = 16/121 (13%)
Query: 276 INDGQWHHVAIVWTQGTLTLITEGLIAGKV---ENYGSGKSLPQYGWVVLGKPRPDNIKA 332
+DG+WHH+ W + + I E + GK + G ++ G ++LG+ + D+
Sbjct: 87 ESDGKWHHICTTWE--SSSGIAELWVDGKPGVRKGLAKGYTVKPGGSIILGQEQ-DSYGG 143
Query: 333 YTEAG--FEGKLTKVQIWNRALDFTNEIQKQVRDCRTEPVLHTGLVLTW--TGYDDINGG 388
+A F G++ + +W+ L + E K V T G +L W Y +++GG
Sbjct: 144 GFDATQSFVGEIGDLNMWDSVL--SPEEIKSVYKGST---FSIGNILNWRALNY-EVHGG 197
Query: 389 V 389
V
Sbjct: 198 V 198
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 32.1 bits (73), Expect = 0.024
Identities = 16/32 (50%), Positives = 19/32 (59%), Gaps = 2/32 (6%)
Query: 179 SNPCLNGGICRDGVAKFTCDCPPGWTGS-RCE 209
PC NG C + +TC CPPG+TG RCE
Sbjct: 5 GGPCSNG-TCINTPGSYTCSCPPGYTGDKRCE 35
Score = 30.2 bits (68), Expect = 0.14
Identities = 13/31 (41%), Positives = 19/31 (61%), Gaps = 2/31 (6%)
Query: 98 DCASE-PCQNGGSCVDMLDGFMCQCRPGFVG 127
+CAS PC NG +C++ + C C PG+ G
Sbjct: 1 ECASGGPCSNG-TCINTPGSYTCSCPPGYTG 30
Score = 29.8 bits (67), Expect = 0.17
Identities = 12/26 (46%), Positives = 17/26 (65%), Gaps = 1/26 (3%)
Query: 69 PGTCTDLVGGFSCSCDLGFTG-KHCQ 93
GTC + G ++CSC G+TG K C+
Sbjct: 10 NGTCINTPGSYTCSCPPGYTGDKRCE 35
Score = 27.1 bits (60), Expect = 1.6
Identities = 15/38 (39%), Positives = 18/38 (47%), Gaps = 5/38 (13%)
Query: 136 ECLS-DPCSPEGTDKCLDLDNKFQCECNVGYTGV-MCE 171
EC S PCS C++ + C C GYTG CE
Sbjct: 1 ECASGGPCSN---GTCINTPGSYTCSCPPGYTGDKRCE 35
>gnl|CDD|204999 pfam12661, hEGF, Human growth factor-like EGF. hEGF, or human
growth factor-like EGF, domains have six conserved
residues disulfide-bonded into the characteristic
'ababcc' pattern. They are involved in growth and
proliferation of cells, in proteins of the Notch/Delta
pathway, neurogulin and selectins. hEGFs are also found
in mosaic proteins with four-disulfide laminin EGFs such
as aggrecan and perlecan. The core fold of the EGF
domain consists of two small beta-hairpins packed
against each other. Two major structural variants have
been identified based on the structural context of the
C-terminal Cys residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In hEGFs the
C-terminal thiol resides in the beta-turn, resulting in
shorter loop-lengths between the Cys residues of
disulfide 'c', typically C[8-9]XC. These shorter
loop-lengths are also typical of the four-disulfide EGF
domains, laminin ad integrin. Tandem hEGF domains have
six linking residues between terminal cysteines of
adjacent domains. hEGF domains may or may not bind
calcium in the linker region. hEGF domains with the
consensus motif CXD4X[F,Y]XCXC are hydroxylated
exclusively in the Asp residue.
Length = 13
Score = 30.0 bits (69), Expect = 0.14
Identities = 9/13 (69%), Positives = 10/13 (76%)
Query: 196 TCDCPPGWTGSRC 208
C CPPG+TG RC
Sbjct: 1 KCQCPPGYTGPRC 13
>gnl|CDD|165214 PHA02887, PHA02887, EGF-like protein; Provisional.
Length = 126
Score = 31.4 bits (71), Expect = 0.28
Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 7/44 (15%)
Query: 129 QCEADI-DECLSDPCSPEGTDKCLDLDNKFQCECNVGYTGVMCE 171
+C+ D D C++ C +DLD KF C CN GYTG+ C+
Sbjct: 85 KCKNDFNDFCINGECM-----NIIDLDEKF-CICNKGYTGIRCD 122
Score = 31.4 bits (71), Expect = 0.29
Identities = 13/38 (34%), Positives = 21/38 (55%), Gaps = 6/38 (15%)
Query: 177 CKS---NPCLNGGICRDGV--AKFTCDCPPGWTGSRCE 209
CK+ + C+NG C + + + C C G+TG RC+
Sbjct: 86 CKNDFNDFCING-ECMNIIDLDEKFCICNKGYTGIRCD 122
>gnl|CDD|214722 smart00560, LamGL, LamG-like jellyroll fold domain.
Length = 133
Score = 30.9 bits (70), Expect = 0.60
Identities = 25/80 (31%), Positives = 36/80 (45%), Gaps = 16/80 (20%)
Query: 279 GQWHHVAIVWTQ--GTLTLITEGLIAGKVEN--YGSGKSLPQYGWVVL--GKPRPDNIKA 332
G W H+A V+ G L+L G+ E S +LPQ G +L G
Sbjct: 61 GVWVHLAGVYDGGAGKLSLYVNGVEVATSETQPSPSSGNLPQ-GGRILLGGAGG------ 113
Query: 333 YTEAGFEGKLTKVQIWNRAL 352
F G+L +V+++NRAL
Sbjct: 114 ---ENFSGRLDEVRVYNRAL 130
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 28.5 bits (64), Expect = 0.66
Identities = 10/32 (31%), Positives = 15/32 (46%), Gaps = 2/32 (6%)
Query: 96 IDDCASEP--CQNGGSCVDMLDGFMCQCRPGF 125
+D+CA C CV+ + F C C G+
Sbjct: 2 VDECADGTHNCPANTVCVNTIGSFECVCPDGY 33
Score = 26.5 bits (59), Expect = 3.1
Identities = 10/34 (29%), Positives = 15/34 (44%)
Query: 133 DIDECLSDPCSPEGTDKCLDLDNKFQCECNVGYT 166
D+DEC + C++ F+C C GY
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYE 34
>gnl|CDD|184754 PRK14582, pgaB, outer membrane N-deacetylase; Provisional.
Length = 671
Score = 32.0 bits (73), Expect = 0.72
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 12/52 (23%)
Query: 327 PDNIKAYTEAGFEGKLTKV-------QIWNR----AL-DFTNEIQKQVRDCR 366
I AY +AGF G L+++ + W R AL DFT E+ +V+ R
Sbjct: 478 APAITAYQQAGFSGSLSEIRQNPEQFKQWTRFKSRALTDFTLELSARVKAIR 529
>gnl|CDD|165381 PHA03099, PHA03099, epidermal growth factor-like protein (EGF-like
protein); Provisional.
Length = 139
Score = 30.0 bits (67), Expect = 1.1
Identities = 15/38 (39%), Positives = 20/38 (52%), Gaps = 9/38 (23%)
Query: 142 CSPEGTDKCL--------DLDNKFQCECNVGYTGVMCE 171
C PEG CL D+D + C C+ GYTG+ C+
Sbjct: 45 CGPEGDGYCLHGDCIHARDIDGMY-CRCSHGYTGIRCQ 81
Score = 27.7 bits (61), Expect = 7.5
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 76 VGGFSCSCDLGFTGKHCQHLI 96
+ G C C G+TG CQH++
Sbjct: 64 IDGMYCRCSHGYTGIRCQHVV 84
>gnl|CDD|219677 pfam07974, EGF_2, EGF-like domain. This family contains EGF
domains found in a variety of extracellular proteins.
Length = 31
Score = 27.4 bits (61), Expect = 1.3
Identities = 10/30 (33%), Positives = 12/30 (40%), Gaps = 2/30 (6%)
Query: 179 SNPCLNGGICRDGVAKFTCDCPPGWTGSRC 208
S C G C K C C G+ G+ C
Sbjct: 4 SGICNGRGTCVRPCGK--CVCDSGYQGATC 31
Score = 25.9 bits (57), Expect = 4.1
Identities = 11/31 (35%), Positives = 13/31 (41%), Gaps = 2/31 (6%)
Query: 100 ASEPCQNGGSCVDMLDGFMCQCRPGFVGLQC 130
AS C G+CV C C G+ G C
Sbjct: 3 ASGICNGRGTCVR--PCGKCVCDSGYQGATC 31
>gnl|CDD|214560 smart00210, TSPN, Thrombospondin N-terminal -like domains.
Heparin-binding and cell adhesion domain of
thrombospondin.
Length = 184
Score = 30.0 bits (68), Expect = 1.5
Identities = 16/61 (26%), Positives = 21/61 (34%), Gaps = 15/61 (24%)
Query: 240 SVQGRRPLI----QAHSSGVQVSLFPELQDAFLAFHEYATINDGQWHHVAIVWTQGTLTL 295
V GR + Q F L LA DGQWH +A+ + + TL
Sbjct: 85 EVDGRANTLLLRYQGVDGKQHTVSFRNLP---LA--------DGQWHKLALSVSGSSATL 133
Query: 296 I 296
Sbjct: 134 Y 134
>gnl|CDD|113868 pfam05113, DUF693, Protein of unknown function (DUF693). This
family consists of several uncharacterized proteins from
Borrelia burgdorferi (Lyme disease spirochete).
Length = 311
Score = 30.6 bits (69), Expect = 1.5
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 341 KLTKVQIWNRALDFTNEIQ 359
K K+ +WN LDF++ I+
Sbjct: 61 KKAKIVLWNLPLDFSDHIK 79
>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like. cEGF, or complement
Clr-like EGF, domains have six conserved cysteine
residues disulfide-bonded into the characteristic
pattern 'ababcc'. They are found in blood coagulation
proteins such as fibrillin, Clr and Cls, thrombomodulin,
and the LDL receptor. The core fold of the EGF domain
consists of two small beta-hairpins packed against each
other. Two major structural variants have been
identified based on the structural context of the
C-terminal cysteine residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
C-terminal thiol resides on the C-terminal beta-sheet,
resulting in long loop-lengths between the cysteine
residues of disulfide 'c', typically C[10+]XC. These
longer loop-lengths may have arisen by selective
cysteine loss from a four-disulfide EGF template such as
laminin or integrin. Tandem cEGF domains have five
linking residues between terminal cysteines of adjacent
domains. cEGF domains may or may not bind calcium in the
linker region. cEGF domains with the consensus motif
CXN4X[F,Y]XCXC are hydroxylated exclusively on the
asparagine residue.
Length = 24
Score = 26.6 bits (60), Expect = 1.8
Identities = 11/25 (44%), Positives = 13/25 (52%), Gaps = 5/25 (20%)
Query: 116 GFMCQCRPGFV----GLQCEADIDE 136
+ C C PG+ G CE DIDE
Sbjct: 1 SYTCSCPPGYQLSGDGRTCE-DIDE 24
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 26.0 bits (58), Expect = 4.0
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 71 TCTDLVGGFSCSCDLGFTG 89
TCT+ G F+C+C G+TG
Sbjct: 13 TCTNTGGSFTCTCKSGYTG 31
Score = 26.0 bits (58), Expect = 4.1
Identities = 11/28 (39%), Positives = 12/28 (42%), Gaps = 2/28 (7%)
Query: 140 DPCSPEGTDKCLDLDNKFQCECNVGYTG 167
C P T C + F C C GYTG
Sbjct: 6 GGCHPNAT--CTNTGGSFTCTCKSGYTG 31
>gnl|CDD|215681 pfam00054, Laminin_G_1, Laminin G domain.
Length = 131
Score = 28.1 bits (63), Expect = 4.7
Identities = 9/24 (37%), Positives = 13/24 (54%), Gaps = 2/24 (8%)
Query: 274 ATINDGQWHHV--AIVWTQGTLTL 295
+NDG+WH V GTL++
Sbjct: 48 DKLNDGKWHSVELERNGRSGTLSV 71
>gnl|CDD|143593 cd07569, DCase, N-carbamyl-D-amino acid amidohydrolase (DCase,
class 6 nitrilases). DCase hydrolyses
N-carbamyl-D-amino acids to produce D-amino acids. It is
an important biocatalyst in the pharmaceutical industry,
producing useful D-amino acids for example in the
preparation of beta-lactam antibiotics. This subgroup
belongs to a larger nitrilase superfamily comprised of
nitrile- or amide-hydrolyzing enzymes and
amide-condensing enzymes, which depend on a Glu-Lys-Cys
catalytic triad. This superfamily has been classified in
the literature based on global and structure based
sequence analysis into thirteen different enzyme classes
(referred to as 1-13), this subgroup corresponds to
class 6. Members of this superfamily generally form
homomeric complexes, the basic building block of which
is a homodimer. Agrobacterium radiobacter DCase forms a
tetramer (dimer of dimers). Some DCases may form
trimers.
Length = 302
Score = 28.4 bits (64), Expect = 6.1
Identities = 11/24 (45%), Positives = 14/24 (58%), Gaps = 2/24 (8%)
Query: 247 LIQAHSSGVQVSLFPELQDAFLAF 270
L +A S G Q+ +FPEL A F
Sbjct: 31 LEEAASRGAQLVVFPEL--ALTTF 52
>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
Pvs28. This family consists of several ookinete surface
protein (Pvs28) from several species of Plasmodium.
Pvs25 and Pvs28 are expressed on the surface of
ookinetes. These proteins are potential candidates for
vaccine and induce antibodies that block the infectivity
of Plasmodium vivax in immunised animals.
Length = 196
Score = 28.2 bits (63), Expect = 7.4
Identities = 32/120 (26%), Positives = 45/120 (37%), Gaps = 20/120 (16%)
Query: 104 CQNGGSCVDMLDGFMCQCRPGFVGLQ---CE----ADIDECLSDPCSPEGT---DKCLDL 153
C+NG + M + F C+C G+V CE D E ++ C T
Sbjct: 8 CKNG-YLIQMSNHFECKCNEGYVLKNENTCEEKVKCDKLENVNKVCGEYATCINQANKAE 66
Query: 154 DNKFQCECNVGYTGVMCETNIDDCKSNPCLNGGICRDGVAKFTCDCPPGWTGSRCEKDIG 213
+ +C C GYT + C N C N +C G K D P + C +IG
Sbjct: 67 EKALKCGCINGYT-----LSQGVCVPNKC-NNKVCGSG--KCIVD-PANPNNTTCSCNIG 117
>gnl|CDD|235057 PRK02628, nadE, NAD synthetase; Reviewed.
Length = 679
Score = 28.3 bits (64), Expect = 9.5
Identities = 7/15 (46%), Positives = 11/15 (73%)
Query: 249 QAHSSGVQVSLFPEL 263
+A GV +++FPEL
Sbjct: 40 RAADDGVALAVFPEL 54
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.139 0.467
Gapped
Lambda K H
0.267 0.0676 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,531,180
Number of extensions: 1896631
Number of successful extensions: 1268
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1253
Number of HSP's successfully gapped: 86
Length of query: 418
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 319
Effective length of database: 6,546,556
Effective search space: 2088351364
Effective search space used: 2088351364
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (27.0 bits)