RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy18237
(1050 letters)
>gnl|CDD|201391 pfam00683, TB, TB domain. This domain is also known as the 8
cysteine domain. This family includes the hybrid
domains. This cysteine rich repeat is found in TGF
binding protein and fibrillin.
Length = 42
Score = 51.2 bits (123), Expect = 1e-08
Identities = 19/41 (46%), Positives = 27/41 (65%)
Query: 658 TCTKEIAGSTTRSTCCCSIGKAWGPQCEECPAVGSDEHKTL 698
C+ + G+ T+S CCCS+G+AWG CE CP G+ E + L
Sbjct: 2 RCSNPLPGNVTKSECCCSLGRAWGTPCEPCPVQGTAEFRQL 42
Score = 44.2 bits (105), Expect = 4e-06
Identities = 18/39 (46%), Positives = 27/39 (69%)
Query: 362 GLCSLPMSNEQTRMVCCCSMGQSWGKPCQPCPPPGSRDY 400
G CS P+ T+ CCCS+G++WG PC+PCP G+ ++
Sbjct: 1 GRCSNPLPGNVTKSECCCSLGRAWGTPCEPCPVQGTAEF 39
Score = 38.9 bits (91), Expect = 4e-04
Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 4/40 (10%)
Query: 462 GRCVLPTGPALLMEVTRMDCCCTMGMAWGPQCQLCPTRGS 501
GRC P VT+ +CCC++G AWG C+ CP +G+
Sbjct: 1 GRCSNPLPGN----VTKSECCCSLGRAWGTPCEPCPVQGT 36
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 40.7 bits (96), Expect = 8e-05
Identities = 21/42 (50%), Positives = 25/42 (59%), Gaps = 3/42 (7%)
Query: 766 DLNECALMPSACQGGECINTDGSYRCECPAGYVKDETGKICI 807
D++ECA GG C+NT GSYRCECP GY G+ C
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGY---TDGRNCE 39
Score = 39.9 bits (94), Expect = 1e-04
Identities = 18/37 (48%), Positives = 22/37 (59%)
Query: 903 DIDECARHPNICNNGTCVNAIGSFKCHCYAGFKLSHN 939
DIDECA N GTCVN +GS++C C G+ N
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRN 37
Score = 39.9 bits (94), Expect = 1e-04
Identities = 17/33 (51%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
Query: 973 DINECESPQACLYG-NCTNTLGSFSCTCPPNYQ 1004
DI+EC S C G C NT+GS+ C CPP Y
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYT 33
Score = 39.2 bits (92), Expect = 2e-04
Identities = 15/32 (46%), Positives = 20/32 (62%)
Query: 148 EIDECNLMPNMCNHGTCMNTPGSFHCQCNRGF 179
+IDEC N GTC+NT GS+ C+C G+
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGY 32
Score = 37.2 bits (87), Expect = 0.001
Identities = 16/33 (48%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 1017 DINECEEHDNICENG-HCTNTFGSFMCSCQDGF 1048
DI+EC N C+NG C NT GS+ C C G+
Sbjct: 1 DIDECAS-GNPCQNGGTCVNTVGSYRCECPPGY 32
Score = 35.7 bits (83), Expect = 0.004
Identities = 17/37 (45%), Positives = 19/37 (51%), Gaps = 2/37 (5%)
Query: 808 DDNECLSIPNICGNG-TCTNLNGGFECTCSEGYAPGP 843
D +EC S N C NG TC N G + C C GY G
Sbjct: 1 DIDECAS-GNPCQNGGTCVNTVGSYRCECPPGYTDGR 36
Score = 35.3 bits (82), Expect = 0.005
Identities = 22/43 (51%), Positives = 24/43 (55%), Gaps = 5/43 (11%)
Query: 307 DIDECATSIQRCGEG-FCVNDVGTYHCVCPDGYMLLPSGKECI 348
DIDECA S C G CVN VG+Y C CP GY G+ C
Sbjct: 1 DIDECA-SGNPCQNGGTCVNTVGSYRCECPPGYT---DGRNCE 39
Score = 34.5 bits (80), Expect = 0.011
Identities = 17/34 (50%), Positives = 19/34 (55%), Gaps = 2/34 (5%)
Query: 861 DIDECYERPGICANG-DCANFQGSFQCTCANGYT 893
DIDEC C NG C N GS++C C GYT
Sbjct: 1 DIDEC-ASGNPCQNGGTCVNTVGSYRCECPPGYT 33
Score = 33.8 bits (78), Expect = 0.020
Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 2/34 (5%)
Query: 576 DLDECRMMSYLCRNG-RCRNNIGSFFCECLQGYT 608
D+DEC + C+NG C N +GS+ CEC GYT
Sbjct: 1 DIDECASGN-PCQNGGTCVNTVGSYRCECPPGYT 33
Score = 33.0 bits (76), Expect = 0.033
Identities = 13/32 (40%), Positives = 17/32 (53%)
Query: 234 INECEEMPEICGSGTCENNIGSFSCRCEDGYS 265
I+EC GTC N +GS+ C C GY+
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCECPPGYT 33
Score = 31.8 bits (73), Expect = 0.084
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 714 GICPSPGKCQNVMGSFICTCPPGYR 738
C + G C N +GS+ C CPPGY
Sbjct: 9 NPCQNGGTCVNTVGSYRCECPPGYT 33
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 40.0 bits (94), Expect = 1e-04
Identities = 18/42 (42%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 766 DLNECALMPSAC-QGGECINTDGSYRCECPAGYVKDETGKIC 806
D++ECA C C+NT GS+ C CP GY +E G C
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42
Score = 37.3 bits (87), Expect = 0.001
Identities = 14/40 (35%), Positives = 18/40 (45%), Gaps = 1/40 (2%)
Query: 234 INECEEMPEIC-GSGTCENNIGSFSCRCEDGYSVKPAEGP 272
++EC + C + C N IGSF C C DGY
Sbjct: 2 VDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTN 41
Score = 36.6 bits (85), Expect = 0.002
Identities = 18/42 (42%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 861 DIDECYERPGIC-ANGDCANFQGSFQCTCANGYTLNTARDSC 901
D+DEC + C AN C N GSF+C C +GY N +C
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42
Score = 36.2 bits (84), Expect = 0.003
Identities = 19/42 (45%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 307 DIDECATSIQRCGEGF-CVNDVGTYHCVCPDGYMLLPSGKEC 347
D+DECA C CVN +G++ CVCPDGY G C
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42
Score = 35.8 bits (83), Expect = 0.003
Identities = 15/40 (37%), Positives = 19/40 (47%), Gaps = 1/40 (2%)
Query: 808 DDNECLSIPNIC-GNGTCTNLNGGFECTCSEGYAPGPLGS 846
D +EC + C N C N G FEC C +GY G+
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGT 40
Score = 35.8 bits (83), Expect = 0.004
Identities = 16/42 (38%), Positives = 20/42 (47%), Gaps = 2/42 (4%)
Query: 973 DINECESPQACLYGN--CTNTLGSFSCTCPPNYQLTPAGNAC 1012
D++EC N C NT+GSF C CP Y+ G C
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42
Score = 35.0 bits (81), Expect = 0.009
Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 1/34 (2%)
Query: 903 DIDECARHPNIC-NNGTCVNAIGSFKCHCYAGFK 935
D+DECA + C N CVN IGSF+C C G++
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYE 34
Score = 34.2 bits (79), Expect = 0.013
Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 1017 DINECEEHDNIC-ENGHCTNTFGSFMCSCQDGFK 1049
D++EC + + C N C NT GSF C C DG++
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYE 34
Score = 33.5 bits (77), Expect = 0.030
Identities = 16/41 (39%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 149 IDECNLMPNMCNHGT-CMNTPGSFHCQCNRGFLYDSDTHQC 188
+DEC + C T C+NT GSF C C G+ + D C
Sbjct: 2 VDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42
Score = 32.7 bits (75), Expect = 0.054
Identities = 14/32 (43%), Positives = 18/32 (56%)
Query: 715 ICPSPGKCQNVMGSFICTCPPGYRLSPDKNSC 746
CP+ C N +GSF C CP GY + D +C
Sbjct: 11 NCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42
Score = 31.6 bits (72), Expect = 0.12
Identities = 18/42 (42%), Positives = 22/42 (52%), Gaps = 1/42 (2%)
Query: 576 DLDECRMMSYLC-RNGRCRNNIGSFFCECLQGYTLASEGQYC 616
D+DEC ++ C N C N IGSF C C GY +G C
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42
Score = 27.7 bits (62), Expect = 3.3
Identities = 10/27 (37%), Positives = 14/27 (51%), Gaps = 1/27 (3%)
Query: 207 INECEEMPEIC-GSGTCENNIGSFSCR 232
++EC + C + C N IGSF C
Sbjct: 2 VDECADGTHNCPANTVCVNTIGSFECV 28
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular (mostly
animal) proteins. Many of these proteins require calcium
for their biological function and calcium-binding sites
have been found to be located at the N-terminus of
particular EGF-like domains; calcium-binding may be
crucial for numerous protein-protein interactions. Six
conserved core cysteines form three disulfide bridges as
in non calcium-binding EGF domains, whose structures are
very similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 39.9 bits (94), Expect = 1e-04
Identities = 17/36 (47%), Positives = 22/36 (61%)
Query: 903 DIDECARHPNICNNGTCVNAIGSFKCHCYAGFKLSH 938
DIDECA N GTCVN +GS++C C G+ +
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRN 36
Score = 39.5 bits (93), Expect = 2e-04
Identities = 21/42 (50%), Positives = 25/42 (59%), Gaps = 4/42 (9%)
Query: 766 DLNECALMPSACQGGECINTDGSYRCECPAGYVKDETGKICI 807
D++ECA GG C+NT GSYRC CP GY TG+ C
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGY----TGRNCE 38
Score = 38.8 bits (91), Expect = 3e-04
Identities = 17/35 (48%), Positives = 21/35 (60%), Gaps = 1/35 (2%)
Query: 973 DINECESPQACLY-GNCTNTLGSFSCTCPPNYQLT 1006
DI+EC S C G C NT+GS+ C+CPP Y
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35
Score = 37.6 bits (88), Expect = 7e-04
Identities = 17/32 (53%), Positives = 20/32 (62%), Gaps = 2/32 (6%)
Query: 149 IDECNLMPNMC-NHGTCMNTPGSFHCQCNRGF 179
IDEC N C N GTC+NT GS+ C C G+
Sbjct: 2 IDEC-ASGNPCQNGGTCVNTVGSYRCSCPPGY 32
Score = 37.2 bits (87), Expect = 0.001
Identities = 14/34 (41%), Positives = 18/34 (52%)
Query: 1017 DINECEEHDNICENGHCTNTFGSFMCSCQDGFKL 1050
DI+EC + G C NT GS+ CSC G+
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
Score = 35.3 bits (82), Expect = 0.005
Identities = 20/35 (57%), Positives = 21/35 (60%), Gaps = 2/35 (5%)
Query: 307 DIDECATSIQRCGE-GFCVNDVGTYHCVCPDGYML 340
DIDECA S C G CVN VG+Y C CP GY
Sbjct: 1 DIDECA-SGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
Score = 34.1 bits (79), Expect = 0.012
Identities = 14/36 (38%), Positives = 19/36 (52%)
Query: 576 DLDECRMMSYLCRNGRCRNNIGSFFCECLQGYTLAS 611
D+DEC + G C N +GS+ C C GYT +
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRN 36
Score = 34.1 bits (79), Expect = 0.014
Identities = 16/33 (48%), Positives = 19/33 (57%), Gaps = 2/33 (6%)
Query: 808 DDNECLSIPNICGN-GTCTNLNGGFECTCSEGY 839
D +EC S N C N GTC N G + C+C GY
Sbjct: 1 DIDECAS-GNPCQNGGTCVNTVGSYRCSCPPGY 32
Score = 33.4 bits (77), Expect = 0.028
Identities = 15/36 (41%), Positives = 18/36 (50%)
Query: 861 DIDECYERPGICANGDCANFQGSFQCTCANGYTLNT 896
DIDEC G C N GS++C+C GYT
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRN 36
Score = 32.2 bits (74), Expect = 0.062
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 711 TREGICPSPGKCQNVMGSFICTCPPGYRL 739
C + G C N +GS+ C+CPPGY
Sbjct: 6 ASGNPCQNGGTCVNTVGSYRCSCPPGYTG 34
Score = 32.2 bits (74), Expect = 0.075
Identities = 13/32 (40%), Positives = 17/32 (53%)
Query: 234 INECEEMPEICGSGTCENNIGSFSCRCEDGYS 265
I+EC GTC N +GS+ C C GY+
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYT 33
>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like. cEGF, or complement
Clr-like EGF, domains have six conserved cysteine
residues disulfide-bonded into the characteristic
pattern 'ababcc'. They are found in blood coagulation
proteins such as fibrillin, Clr and Cls, thrombomodulin,
and the LDL receptor. The core fold of the EGF domain
consists of two small beta-hairpins packed against each
other. Two major structural variants have been
identified based on the structural context of the
C-terminal cysteine residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
C-terminal thiol resides on the C-terminal beta-sheet,
resulting in long loop-lengths between the cysteine
residues of disulfide 'c', typically C[10+]XC. These
longer loop-lengths may have arisen by selective
cysteine loss from a four-disulfide EGF template such as
laminin or integrin. Tandem cEGF domains have five
linking residues between terminal cysteines of adjacent
domains. cEGF domains may or may not bind calcium in the
linker region. cEGF domains with the consensus motif
CXN4X[F,Y]XCXC are hydroxylated exclusively on the
asparagine residue.
Length = 24
Score = 37.0 bits (87), Expect = 9e-04
Identities = 9/24 (37%), Positives = 12/24 (50%)
Query: 170 SFHCQCNRGFLYDSDTHQCIDINE 193
S+ C C G+ D C DI+E
Sbjct: 1 SYTCSCPPGYQLSGDGRTCEDIDE 24
Score = 35.1 bits (82), Expect = 0.005
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 883 SFQCTCANGYTLNTARDSCVDIDE 906
S+ C+C GY L+ +C DIDE
Sbjct: 1 SYTCSCPPGYQLSGDGRTCEDIDE 24
Score = 34.7 bits (81), Expect = 0.005
Identities = 11/24 (45%), Positives = 13/24 (54%)
Query: 788 SYRCECPAGYVKDETGKICIDDNE 811
SY C CP GY G+ C D +E
Sbjct: 1 SYTCSCPPGYQLSGDGRTCEDIDE 24
Score = 34.4 bits (80), Expect = 0.008
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 598 SFFCECLQGYTLASEGQYCRDVDE 621
S+ C C GY L+ +G+ C D+DE
Sbjct: 1 SYTCSCPPGYQLSGDGRTCEDIDE 24
Score = 33.6 bits (78), Expect = 0.017
Identities = 11/20 (55%), Positives = 16/20 (80%)
Query: 728 SFICTCPPGYRLSPDKNSCQ 747
S+ C+CPPGY+LS D +C+
Sbjct: 1 SYTCSCPPGYQLSGDGRTCE 20
Score = 32.4 bits (75), Expect = 0.047
Identities = 10/20 (50%), Positives = 11/20 (55%)
Query: 330 YHCVCPDGYMLLPSGKECID 349
Y C CP GY L G+ C D
Sbjct: 2 YTCSCPPGYQLSGDGRTCED 21
Score = 30.9 bits (71), Expect = 0.15
Identities = 10/27 (37%), Positives = 15/27 (55%), Gaps = 3/27 (11%)
Query: 925 SFKCHCYAGFKLSHNNDCIGNCTDINE 951
S+ C C G++LS + C DI+E
Sbjct: 1 SYTCSCPPGYQLSGDG---RTCEDIDE 24
Score = 30.9 bits (71), Expect = 0.17
Identities = 14/27 (51%), Positives = 19/27 (70%), Gaps = 3/27 (11%)
Query: 994 SFSCTCPPNYQLTPAGNACVVLEDINE 1020
S++C+CPP YQL+ G C EDI+E
Sbjct: 1 SYTCSCPPGYQLSGDGRTC---EDIDE 24
Score = 29.3 bits (67), Expect = 0.55
Identities = 12/21 (57%), Positives = 15/21 (71%), Gaps = 2/21 (9%)
Query: 99 CVCDEGFR--GDGYSCEDIDE 117
C C G++ GDG +CEDIDE
Sbjct: 4 CSCPPGYQLSGDGRTCEDIDE 24
>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
extracellular matrix molecules mediate cell-matrix and
matrix-matrix interactions thereby providing tissue
integrity. Some members of the matrilin family are
expressed specifically in developing cartilage
rudiments. The matrilin family consists of at least four
members. All the members of the matrilin family contain
VWA domains, EGF-like domains and a heptad repeat
coiled-coiled domain at the carboxy terminus which is
responsible for the oligomerization of the matrilins.
The VWA domains have been shown to be essential for
matrilin network formation by interacting with matrix
ligands.
Length = 224
Score = 38.9 bits (91), Expect = 0.007
Identities = 17/41 (41%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 305 CVDIDECATSIQRCGEGFCVNDVGTYHCVCPDGYMLLPSGK 345
CV D CAT C + C++ G+Y C C +GY LL K
Sbjct: 184 CVVPDLCATLSHVC-QQVCISTPGSYLCACTEGYALLEDNK 223
Score = 38.1 bits (89), Expect = 0.014
Identities = 12/39 (30%), Positives = 19/39 (48%), Gaps = 1/39 (2%)
Query: 147 EEIDECNLMPNMCNHGTCMNTPGSFHCQCNRGFLYDSDT 185
D C + ++C C++TPGS+ C C G+ D
Sbjct: 185 VVPDLCATLSHVCQQV-CISTPGSYLCACTEGYALLEDN 222
Score = 35.8 bits (83), Expect = 0.073
Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 1/40 (2%)
Query: 901 CVDIDECARHPNICNNGTCVNAIGSFKCHCYAGFKLSHNN 940
CV D CA ++C C++ GS+ C C G+ L +N
Sbjct: 184 CVVPDLCATLSHVCQQV-CISTPGSYLCACTEGYALLEDN 222
Score = 34.3 bits (79), Expect = 0.25
Identities = 14/38 (36%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 803 GKICIDDNECLSIPNICGNGTCTNLNGGFECTCSEGYA 840
GKIC+ + C ++ ++C C + G + C C+EGYA
Sbjct: 181 GKICVVPDLCATLSHVCQQV-CISTPGSYLCACTEGYA 217
Score = 33.5 bits (77), Expect = 0.45
Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 4/42 (9%)
Query: 1009 GNACVVLEDINECEEHDNICENGHCTNTFGSFMCSCQDGFKL 1050
G CVV D+ H C+ C +T GS++C+C +G+ L
Sbjct: 181 GKICVV-PDLCATLSHV--CQQV-CISTPGSYLCACTEGYAL 218
Score = 30.4 bits (69), Expect = 4.0
Identities = 13/40 (32%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Query: 574 CIDLDECRMMSYLCRNGRCRNNIGSFFCECLQGYTLASEG 613
C+ D C +S++C+ C + GS+ C C +GY L +
Sbjct: 184 CVVPDLCATLSHVCQQV-CISTPGSYLCACTEGYALLEDN 222
Score = 29.3 bits (66), Expect = 8.9
Identities = 16/36 (44%), Positives = 18/36 (50%), Gaps = 1/36 (2%)
Query: 769 ECALMPSACQGGECINTDGSYRCECPAGYVKDETGK 804
CA + CQ CI+T GSY C C GY E K
Sbjct: 189 LCATLSHVCQQV-CISTPGSYLCACTEGYALLEDNK 223
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 32.9 bits (75), Expect = 0.032
Identities = 15/26 (57%), Positives = 16/26 (61%)
Query: 777 CQGGECINTDGSYRCECPAGYVKDET 802
C G CINT GSY C CP GY D+
Sbjct: 8 CSNGTCINTPGSYTCSCPPGYTGDKR 33
Score = 32.9 bits (75), Expect = 0.034
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 976 ECESPQACLYGNCTNTLGSFSCTCPPNYQLTP 1007
EC S C G C NT GS++C+CPP Y
Sbjct: 1 ECASGGPCSNGTCINTPGSYTCSCPPGYTGDK 32
Score = 32.5 bits (74), Expect = 0.053
Identities = 14/31 (45%), Positives = 16/31 (51%), Gaps = 1/31 (3%)
Query: 1020 ECEEHDNICENGHCTNTFGSFMCSCQDGFKL 1050
EC C NG C NT GS+ CSC G+
Sbjct: 1 ECASG-GPCSNGTCINTPGSYTCSCPPGYTG 30
Score = 32.1 bits (73), Expect = 0.079
Identities = 11/26 (42%), Positives = 14/26 (53%)
Query: 814 SIPNICGNGTCTNLNGGFECTCSEGY 839
+ C NGTC N G + C+C GY
Sbjct: 3 ASGGPCSNGTCINTPGSYTCSCPPGY 28
Score = 31.7 bits (72), Expect = 0.11
Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 1/34 (2%)
Query: 906 ECARHPNICNNGTCVNAIGSFKCHCYAGFKLSHN 939
ECA C+NGTC+N GS+ C C G+
Sbjct: 1 ECASG-GPCSNGTCINTPGSYTCSCPPGYTGDKR 33
Score = 31.3 bits (71), Expect = 0.12
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 156 PNMCNHGTCMNTPGSFHCQCNRGF 179
C++GTC+NTPGS+ C C G+
Sbjct: 5 GGPCSNGTCINTPGSYTCSCPPGY 28
Score = 30.2 bits (68), Expect = 0.31
Identities = 14/34 (41%), Positives = 17/34 (50%), Gaps = 1/34 (2%)
Query: 310 ECATSIQRCGEGFCVNDVGTYHCVCPDGYMLLPS 343
ECA+ C G C+N G+Y C CP GY
Sbjct: 1 ECASG-GPCSNGTCINTPGSYTCSCPPGYTGDKR 33
Score = 30.2 bits (68), Expect = 0.37
Identities = 12/23 (52%), Positives = 13/23 (56%)
Query: 587 CRNGRCRNNIGSFFCECLQGYTL 609
C NG C N GS+ C C GYT
Sbjct: 8 CSNGTCINTPGSYTCSCPPGYTG 30
Score = 29.8 bits (67), Expect = 0.48
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 867 ERPGICANGDCANFQGSFQCTCANGYTLNT 896
G C+NG C N GS+ C+C GYT +
Sbjct: 3 ASGGPCSNGTCINTPGSYTCSCPPGYTGDK 32
Score = 28.6 bits (64), Expect = 1.2
Identities = 12/25 (48%), Positives = 14/25 (56%)
Query: 718 SPGKCQNVMGSFICTCPPGYRLSPD 742
S G C N GS+ C+CPPGY
Sbjct: 9 SNGTCINTPGSYTCSCPPGYTGDKR 33
Score = 27.9 bits (62), Expect = 2.2
Identities = 13/30 (43%), Positives = 17/30 (56%), Gaps = 1/30 (3%)
Query: 236 ECEEMPEICGSGTCENNIGSFSCRCEDGYS 265
EC C +GTC N GS++C C GY+
Sbjct: 1 ECAS-GGPCSNGTCINTPGSYTCSCPPGYT 29
>gnl|CDD|234522 TIGR04249, SCM_chp_ScmC, SynChlorMet cassette protein ScmC. A
biosynthesis cassette found in Syntrophobacter
fumaroxidans MPOB, Chlorobium limicola DSM 245,
Methanocella paludicola SANAE, and delta proteobacterium
NaphS2 contains two PqqE-like radical SAM/SPASM domain
proteins, a PqqD homolog, and a conserved hypothetical
protein. These components suggest modification of a
ribosomally produced peptide precursor, but the
precursor has not been identified. Members of this
family are designated ScmC.
Length = 292
Score = 36.4 bits (84), Expect = 0.065
Identities = 20/67 (29%), Positives = 24/67 (35%), Gaps = 9/67 (13%)
Query: 665 GSTTRSTCCCSIGKAWGPQCEECPAVGSDEHKTLCPGGSGYR--PNSATREGICPSPGKC 722
G T +STCC IG W C+ DE + GYR P + S K
Sbjct: 146 GGTGKSTCCRRIGDPWRCHCD-------DESLVVPDTDKGYRAHPFPTWSNFLFGSEEKT 198
Query: 723 QNVMGSF 729
M S
Sbjct: 199 WQSMRSV 205
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 33.8 bits (78), Expect = 0.073
Identities = 10/17 (58%), Positives = 13/17 (76%)
Query: 637 EEEEEEEEEEEGGYGGG 653
++EEEEEEEE+ G G
Sbjct: 70 KKEEEEEEEEDDDMGFG 86
>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
Pvs28. This family consists of several ookinete surface
protein (Pvs28) from several species of Plasmodium.
Pvs25 and Pvs28 are expressed on the surface of
ookinetes. These proteins are potential candidates for
vaccine and induce antibodies that block the infectivity
of Plasmodium vivax in immunised animals.
Length = 196
Score = 35.1 bits (81), Expect = 0.092
Identities = 35/138 (25%), Positives = 49/138 (35%), Gaps = 27/138 (19%)
Query: 818 ICGNGTCTNLNGGFECTCSEGYAPGPLGSCAILLTLPPISPSTDIDECYERPG---ICAN 874
IC NG ++ FEC C+EGY +C ++ + G C N
Sbjct: 7 ICKNGYLIQMSNHFECKCNEGYVLKNENTCE------EKVKCDKLENVNKVCGEYATCIN 60
Query: 875 GDCANFQGSFQCTCANGYTLNTARDSCVDIDECARHPNICNNGTCVNA--------IGSF 926
+ + +C C NGYTL+ + CV PN CNN C + +
Sbjct: 61 QANKAEEKALKCGCINGYTLS--QGVCV--------PNKCNNKVCGSGKCIVDPANPNNT 110
Query: 927 KCHCYAGFKLSHNNDCIG 944
C C G N C
Sbjct: 111 TCSCNIGKVPDQNGKCTK 128
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 31.3 bits (71), Expect = 0.15
Identities = 14/34 (41%), Positives = 18/34 (52%)
Query: 906 ECARHPNICNNGTCVNAIGSFKCHCYAGFKLSHN 939
ECA N GTCVN GS++C C G+ +
Sbjct: 1 ECAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRS 34
Score = 30.5 bits (69), Expect = 0.24
Identities = 19/35 (54%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 769 ECALMPSACQ-GGECINTDGSYRCECPAGYVKDET 802
ECA + C GG C+NT GSYRC CP GY D +
Sbjct: 1 ECAA-SNPCSNGGTCVNTPGSYRCVCPPGYTGDRS 34
Score = 30.1 bits (68), Expect = 0.34
Identities = 12/27 (44%), Positives = 14/27 (51%)
Query: 714 GICPSPGKCQNVMGSFICTCPPGYRLS 740
C + G C N GS+ C CPPGY
Sbjct: 6 NPCSNGGTCVNTPGSYRCVCPPGYTGD 32
Score = 29.8 bits (67), Expect = 0.58
Identities = 14/31 (45%), Positives = 16/31 (51%), Gaps = 1/31 (3%)
Query: 976 ECESPQACL-YGNCTNTLGSFSCTCPPNYQL 1005
EC + C G C NT GS+ C CPP Y
Sbjct: 1 ECAASNPCSNGGTCVNTPGSYRCVCPPGYTG 31
Score = 29.4 bits (66), Expect = 0.59
Identities = 12/20 (60%), Positives = 15/20 (75%)
Query: 160 NHGTCMNTPGSFHCQCNRGF 179
N GTC+NTPGS+ C C G+
Sbjct: 10 NGGTCVNTPGSYRCVCPPGY 29
Score = 27.8 bits (62), Expect = 2.1
Identities = 12/27 (44%), Positives = 14/27 (51%), Gaps = 1/27 (3%)
Query: 814 SIPNICGN-GTCTNLNGGFECTCSEGY 839
+ N C N GTC N G + C C GY
Sbjct: 3 AASNPCSNGGTCVNTPGSYRCVCPPGY 29
Score = 27.8 bits (62), Expect = 2.5
Identities = 11/31 (35%), Positives = 14/31 (45%)
Query: 1020 ECEEHDNICENGHCTNTFGSFMCSCQDGFKL 1050
EC + G C NT GS+ C C G+
Sbjct: 1 ECAASNPCSNGGTCVNTPGSYRCVCPPGYTG 31
Score = 27.8 bits (62), Expect = 2.6
Identities = 17/32 (53%), Positives = 18/32 (56%), Gaps = 2/32 (6%)
Query: 310 ECATSIQRCGE-GFCVNDVGTYHCVCPDGYML 340
ECA S C G CVN G+Y CVCP GY
Sbjct: 1 ECAAS-NPCSNGGTCVNTPGSYRCVCPPGYTG 31
Score = 27.1 bits (60), Expect = 4.5
Identities = 10/21 (47%), Positives = 11/21 (52%)
Query: 589 NGRCRNNIGSFFCECLQGYTL 609
G C N GS+ C C GYT
Sbjct: 11 GGTCVNTPGSYRCVCPPGYTG 31
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 33.0 bits (76), Expect = 0.17
Identities = 11/18 (61%), Positives = 13/18 (72%)
Query: 637 EEEEEEEEEEEGGYGGGS 654
EEEE+EE EEE G G+
Sbjct: 86 EEEEKEESEEEAAAGLGA 103
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 31.0 bits (71), Expect = 0.18
Identities = 11/22 (50%), Positives = 13/22 (59%)
Query: 161 HGTCMNTPGSFHCQCNRGFLYD 182
+ TC NT GSF C C G+ D
Sbjct: 11 NATCTNTGGSFTCTCKSGYTGD 32
Score = 29.0 bits (66), Expect = 0.92
Identities = 11/18 (61%), Positives = 12/18 (66%)
Query: 987 NCTNTLGSFSCTCPPNYQ 1004
CTNT GSF+CTC Y
Sbjct: 13 TCTNTGGSFTCTCKSGYT 30
Score = 28.3 bits (64), Expect = 1.5
Identities = 14/30 (46%), Positives = 19/30 (63%), Gaps = 1/30 (3%)
Query: 1021 CEEHDNIC-ENGHCTNTFGSFMCSCQDGFK 1049
C E++ C N CTNT GSF C+C+ G+
Sbjct: 1 CAENNGGCHPNATCTNTGGSFTCTCKSGYT 30
Score = 27.5 bits (62), Expect = 2.7
Identities = 12/19 (63%), Positives = 12/19 (63%)
Query: 821 NGTCTNLNGGFECTCSEGY 839
N TCTN G F CTC GY
Sbjct: 11 NATCTNTGGSFTCTCKSGY 29
Score = 27.5 bits (62), Expect = 3.4
Identities = 10/18 (55%), Positives = 13/18 (72%)
Query: 248 TCENNIGSFSCRCEDGYS 265
TC N GSF+C C+ GY+
Sbjct: 13 TCTNTGGSFTCTCKSGYT 30
Score = 26.7 bits (60), Expect = 5.5
Identities = 12/26 (46%), Positives = 14/26 (53%), Gaps = 2/26 (7%)
Query: 781 ECINTDGSYRCECPAGYVKDETGKIC 806
C NT GS+ C C +GY D G C
Sbjct: 13 TCTNTGGSFTCTCKSGYTGD--GVTC 36
Score = 26.3 bits (59), Expect = 7.3
Identities = 10/17 (58%), Positives = 10/17 (58%)
Query: 721 KCQNVMGSFICTCPPGY 737
C N GSF CTC GY
Sbjct: 13 TCTNTGGSFTCTCKSGY 29
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 32.3 bits (74), Expect = 0.25
Identities = 12/18 (66%), Positives = 13/18 (72%)
Query: 637 EEEEEEEEEEEGGYGGGS 654
EEEEEEE EEE G G+
Sbjct: 85 EEEEEEESEEEAMAGLGA 102
>gnl|CDD|224793 COG1881, COG1881, Phospholipid-binding protein [General function
prediction only].
Length = 174
Score = 32.7 bits (75), Expect = 0.58
Identities = 13/49 (26%), Positives = 18/49 (36%), Gaps = 5/49 (10%)
Query: 384 SWGKP--CQPCPPPGS--RDYILLCGKDFAKLC-PEGVGRGDKGEDLNE 427
+G PCPP G R Y + D L P G + G+ +
Sbjct: 111 DFGSRGYGGPCPPKGHGHRYYFTVYALDVELLLLPAGASGAELGKAMEG 159
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 32.9 bits (76), Expect = 0.73
Identities = 11/18 (61%), Positives = 12/18 (66%)
Query: 637 EEEEEEEEEEEGGYGGGS 654
EEEEEE EEE G G+
Sbjct: 310 EEEEEEPSEEEAAAGLGA 327
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar, but
has 8 instead of 6 conserved cysteines. Includes some
cytokine receptors. The EGF domain misses the N-terminus
regions of the Ca2+ binding EGF domains (this is the
main reason of discrepancy between swiss-prot domain
start/end and Pfam). The family is hard to model due to
many similar but different sub-types of EGF domains.
Pfam certainly misses a number of EGF domains.
Length = 32
Score = 28.9 bits (65), Expect = 0.81
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 152 CNLMPNMCNHGTCMNTPGSFHCQCNRGF 179
C+ N GTC++TPG + C+C G+
Sbjct: 1 CSPNNPCSNGGTCVDTPGGYTCECPEGY 28
Score = 27.0 bits (60), Expect = 3.7
Identities = 12/19 (63%), Positives = 14/19 (73%)
Query: 779 GGECINTDGSYRCECPAGY 797
GG C++T G Y CECP GY
Sbjct: 10 GGTCVDTPGGYTCECPEGY 28
Score = 27.0 bits (60), Expect = 3.8
Identities = 10/32 (31%), Positives = 15/32 (46%)
Query: 907 CARHPNICNNGTCVNAIGSFKCHCYAGFKLSH 938
C+ + N GTCV+ G + C C G+
Sbjct: 1 CSPNNPCSNGGTCVDTPGGYTCECPEGYTGKR 32
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 32.0 bits (73), Expect = 1.7
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 620 DECKEVNKRESRCYLDTEEEEEEEEEEE 647
+E +E + E + EE +EEEE+EE
Sbjct: 42 EEKEEKKEEEEKTTDKEEEVDEEEEKEE 69
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 31.5 bits (72), Expect = 2.1
Identities = 9/15 (60%), Positives = 12/15 (80%)
Query: 637 EEEEEEEEEEEGGYG 651
EEEEEEE++ G+G
Sbjct: 293 PAEEEEEEEDDMGFG 307
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 31.6 bits (72), Expect = 2.5
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 637 EEEEEEEEEEEGGYGGGSRRV 657
EE+EEEEEEE+ G
Sbjct: 393 EEDEEEEEEEDEDEGPSKEHS 413
>gnl|CDD|164576 CHL00201, syh, histidine-tRNA synthetase; Provisional.
Length = 430
Score = 31.4 bits (71), Expect = 2.8
Identities = 14/49 (28%), Positives = 20/49 (40%)
Query: 158 MCNHGTCMNTPGSFHCQCNRGFLYDSDTHQCIDINECEEMPEICGSGTY 206
+C + +N P + + RG Y +DT I ICG G Y
Sbjct: 239 VCTYLNLLNIPYKINYKLVRGLDYYNDTAFEIKTLSSNGQDTICGGGRY 287
>gnl|CDD|237071 PRK12353, PRK12353, putative amino acid kinase; Reviewed.
Length = 314
Score = 30.9 bits (71), Expect = 3.7
Identities = 12/26 (46%), Positives = 14/26 (53%), Gaps = 6/26 (23%)
Query: 638 EEEEEEEEEEGGY------GGGSRRV 657
EEE E+ +E GY G G RRV
Sbjct: 138 EEEAEKLAKEKGYTFKEDAGRGYRRV 163
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 31.1 bits (70), Expect = 4.6
Identities = 12/13 (92%), Positives = 13/13 (100%)
Query: 635 DTEEEEEEEEEEE 647
D+EEEEEEEEEEE
Sbjct: 862 DSEEEEEEEEEEE 874
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 31.0 bits (70), Expect = 5.2
Identities = 17/43 (39%), Positives = 24/43 (55%), Gaps = 1/43 (2%)
Query: 609 LASEG-QYCRDVDECKEVNKRESRCYLDTEEEEEEEEEEEGGY 650
L EG + +++ E KE E + TEE +EE EEEE G+
Sbjct: 268 LGIEGWDWLKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGF 310
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.137 0.462
Gapped
Lambda K H
0.267 0.0720 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 50,023,367
Number of extensions: 4603760
Number of successful extensions: 4471
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4401
Number of HSP's successfully gapped: 163
Length of query: 1050
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 943
Effective length of database: 6,191,724
Effective search space: 5838795732
Effective search space used: 5838795732
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (28.4 bits)