RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11799
(483 letters)
>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like. cEGF, or complement
Clr-like EGF, domains have six conserved cysteine
residues disulfide-bonded into the characteristic
pattern 'ababcc'. They are found in blood coagulation
proteins such as fibrillin, Clr and Cls, thrombomodulin,
and the LDL receptor. The core fold of the EGF domain
consists of two small beta-hairpins packed against each
other. Two major structural variants have been
identified based on the structural context of the
C-terminal cysteine residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
C-terminal thiol resides on the C-terminal beta-sheet,
resulting in long loop-lengths between the cysteine
residues of disulfide 'c', typically C[10+]XC. These
longer loop-lengths may have arisen by selective
cysteine loss from a four-disulfide EGF template such as
laminin or integrin. Tandem cEGF domains have five
linking residues between terminal cysteines of adjacent
domains. cEGF domains may or may not bind calcium in the
linker region. cEGF domains with the consensus motif
CXN4X[F,Y]XCXC are hydroxylated exclusively on the
asparagine residue.
Length = 24
Score = 42.8 bits (102), Expect = 4e-06
Identities = 14/24 (58%), Positives = 17/24 (70%)
Query: 211 SFRCICPYGYALAPDGRHCIDINE 234
S+ C CP GY L+ DGR C DI+E
Sbjct: 1 SYTCSCPPGYQLSGDGRTCEDIDE 24
Score = 33.2 bits (77), Expect = 0.011
Identities = 16/25 (64%), Positives = 18/25 (72%), Gaps = 1/25 (4%)
Query: 252 SYTCGCHPGFELSSDGSGTCLDSDE 276
SYTC C PG++LS DG TC D DE
Sbjct: 1 SYTCSCPPGYQLSGDG-RTCEDIDE 24
Score = 32.0 bits (74), Expect = 0.024
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 151 SFRCLCPRGFKLDRTGTFCLDQDE 174
S+ C CP G++L G C D DE
Sbjct: 1 SYTCSCPPGYQLSGDGRTCEDIDE 24
Score = 32.0 bits (74), Expect = 0.024
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 448 SFRCLCPRGFKLDRTGTFCLDQDE 471
S+ C CP G++L G C D DE
Sbjct: 1 SYTCSCPPGYQLSGDGRTCEDIDE 24
Score = 29.0 bits (66), Expect = 0.35
Identities = 9/24 (37%), Positives = 10/24 (41%)
Query: 293 GYKCGCPEGYVRHPFSNECIDDNE 316
Y C CP GY C D +E
Sbjct: 1 SYTCSCPPGYQLSGDGRTCEDIDE 24
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 42.7 bits (101), Expect = 6e-06
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 3/45 (6%)
Query: 422 DIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFC 466
D+DEC +G CP C+NT+GSF C+CP G++ + GT C
Sbjct: 1 DVDEC---ADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42
Score = 41.2 bits (97), Expect = 2e-05
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 3/45 (6%)
Query: 125 DMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFC 169
D+DEC +G CP C+NT+GSF C+CP G++ + GT C
Sbjct: 1 DVDEC---ADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42
Score = 37.3 bits (87), Expect = 5e-04
Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 2/42 (4%)
Query: 190 DINECLELSNQC--AFRCHNVPGSFRCICPYGYALAPDGRHC 229
D++EC + ++ C C N GSF C+CP GY DG +C
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42
Score = 37.3 bits (87), Expect = 5e-04
Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 3/42 (7%)
Query: 273 DSDECATGRHICQQ--IC-NLLGGYKCGCPEGYVRHPFSNEC 311
D DECA G H C +C N +G ++C CP+GY + C
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42
Score = 35.8 bits (83), Expect = 0.002
Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 2/40 (5%)
Query: 231 DINECEQFPGLCAH--TCTNTEGSYTCGCHPGFELSSDGS 268
D++EC C C NT GS+ C C G+E + DG+
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGT 40
Score = 31.2 bits (71), Expect = 0.090
Identities = 15/42 (35%), Positives = 20/42 (47%), Gaps = 10/42 (23%)
Query: 313 DDNECA--RNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQ 352
D +ECA + C C NT GS+ C CPDG++
Sbjct: 1 DVDECADGTHNCP--------ANTVCVNTIGSFECVCPDGYE 34
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 42.6 bits (101), Expect = 6e-06
Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 4/41 (9%)
Query: 422 DIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRT 462
DIDEC C GTC+NT+GS+RC CP G+ R
Sbjct: 1 DIDECASG----NPCQNGGTCVNTVGSYRCECPPGYTDGRN 37
Score = 40.7 bits (96), Expect = 3e-05
Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 4/41 (9%)
Query: 125 DMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRT 165
D+DEC C GTC+NT+GS+RC CP G+ R
Sbjct: 1 DIDECASG----NPCQNGGTCVNTVGSYRCECPPGYTDGRN 37
Score = 39.9 bits (94), Expect = 6e-05
Identities = 16/34 (47%), Positives = 19/34 (55%), Gaps = 3/34 (8%)
Query: 231 DINECEQFPGLCAH--TCTNTEGSYTCGCHPGFE 262
DI+EC C + TC NT GSY C C PG+
Sbjct: 1 DIDECAS-GNPCQNGGTCVNTVGSYRCECPPGYT 33
Score = 39.5 bits (93), Expect = 9e-05
Identities = 19/42 (45%), Positives = 23/42 (54%), Gaps = 4/42 (9%)
Query: 190 DINECLELSN-QCAFRCHNVPGSFRCICPYGYALAPDGRHCI 230
DI+EC + Q C N GS+RC CP GY DGR+C
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYT---DGRNCE 39
Score = 36.5 bits (85), Expect = 0.001
Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 9/41 (21%)
Query: 313 DDNECA-RNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQ 352
D +ECA NPC+ GG C NT GSYRC CP G+
Sbjct: 1 DIDECASGNPCQN--------GGTCVNTVGSYRCECPPGYT 33
Score = 30.3 bits (69), Expect = 0.15
Identities = 17/33 (51%), Positives = 21/33 (63%), Gaps = 4/33 (12%)
Query: 273 DSDECATGRHICQQ--IC-NLLGGYKCGCPEGY 302
D DECA+G + CQ C N +G Y+C CP GY
Sbjct: 1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPGY 32
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular (mostly
animal) proteins. Many of these proteins require calcium
for their biological function and calcium-binding sites
have been found to be located at the N-terminus of
particular EGF-like domains; calcium-binding may be
crucial for numerous protein-protein interactions. Six
conserved core cysteines form three disulfide bridges as
in non calcium-binding EGF domains, whose structures are
very similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 42.2 bits (100), Expect = 9e-06
Identities = 18/40 (45%), Positives = 22/40 (55%), Gaps = 4/40 (10%)
Query: 422 DIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDR 461
DIDEC C GTC+NT+GS+RC CP G+
Sbjct: 1 DIDECASG----NPCQNGGTCVNTVGSYRCSCPPGYTGRN 36
Score = 40.3 bits (95), Expect = 4e-05
Identities = 17/40 (42%), Positives = 22/40 (55%), Gaps = 4/40 (10%)
Query: 125 DMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDR 164
D+DEC C GTC+NT+GS+RC CP G+
Sbjct: 1 DIDECASG----NPCQNGGTCVNTVGSYRCSCPPGYTGRN 36
Score = 38.0 bits (89), Expect = 2e-04
Identities = 16/35 (45%), Positives = 19/35 (54%), Gaps = 3/35 (8%)
Query: 231 DINECEQFPGLCAH--TCTNTEGSYTCGCHPGFEL 263
DI+EC C + TC NT GSY C C PG+
Sbjct: 1 DIDECAS-GNPCQNGGTCVNTVGSYRCSCPPGYTG 34
Score = 35.7 bits (83), Expect = 0.002
Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 9/41 (21%)
Query: 313 DDNECA-RNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQ 352
D +ECA NPC+ GG C NT GSYRC CP G+
Sbjct: 1 DIDECASGNPCQN--------GGTCVNTVGSYRCSCPPGYT 33
Score = 35.3 bits (82), Expect = 0.003
Identities = 18/42 (42%), Positives = 22/42 (52%), Gaps = 5/42 (11%)
Query: 190 DINECLELSN-QCAFRCHNVPGSFRCICPYGYALAPDGRHCI 230
DI+EC + Q C N GS+RC CP GY GR+C
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYT----GRNCE 38
Score = 30.3 bits (69), Expect = 0.15
Identities = 17/33 (51%), Positives = 21/33 (63%), Gaps = 4/33 (12%)
Query: 273 DSDECATGRHICQ--QIC-NLLGGYKCGCPEGY 302
D DECA+G + CQ C N +G Y+C CP GY
Sbjct: 1 DIDECASG-NPCQNGGTCVNTVGSYRCSCPPGY 32
>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
extracellular matrix molecules mediate cell-matrix and
matrix-matrix interactions thereby providing tissue
integrity. Some members of the matrilin family are
expressed specifically in developing cartilage
rudiments. The matrilin family consists of at least four
members. All the members of the matrilin family contain
VWA domains, EGF-like domains and a heptad repeat
coiled-coiled domain at the carboxy terminus which is
responsible for the oligomerization of the matrilins.
The VWA domains have been shown to be essential for
matrilin network formation by interacting with matrix
ligands.
Length = 224
Score = 43.1 bits (102), Expect = 1e-04
Identities = 17/34 (50%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 271 CLDSDECATGRHICQQIC-NLLGGYKCGCPEGYV 303
C+ D CAT H+CQQ+C + G Y C C EGY
Sbjct: 184 CVVPDLCATLSHVCQQVCISTPGSYLCACTEGYA 217
Score = 42.8 bits (101), Expect = 2e-04
Identities = 14/42 (33%), Positives = 20/42 (47%)
Query: 226 GRHCIDINECEQFPGLCAHTCTNTEGSYTCGCHPGFELSSDG 267
G+ C+ + C +C C +T GSY C C G+ L D
Sbjct: 181 GKICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDN 222
Score = 38.9 bits (91), Expect = 0.003
Identities = 16/40 (40%), Positives = 21/40 (52%)
Query: 188 CEDINECLELSNQCAFRCHNVPGSFRCICPYGYALAPDGR 227
C + C LS+ C C + PGS+ C C GYAL D +
Sbjct: 184 CVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNK 223
Score = 30.8 bits (70), Expect = 1.2
Identities = 20/61 (32%), Positives = 28/61 (45%), Gaps = 6/61 (9%)
Query: 103 LFYVIIFYVISLKKKKFY-SVFLDMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFK 161
+FYV F I KKF + + D C + C + CI+T GS+ C C G+
Sbjct: 163 VFYVEDFSTIEELTKKFQGKICVVPDLCATLSHV---CQQV--CISTPGSYLCACTEGYA 217
Query: 162 L 162
L
Sbjct: 218 L 218
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 34.0 bits (78), Expect = 0.008
Identities = 15/27 (55%), Positives = 20/27 (74%)
Query: 139 CPKPGTCINTMGSFRCLCPRGFKLDRT 165
C GTC+NT GS+RC+CP G+ DR+
Sbjct: 8 CSNGGTCVNTPGSYRCVCPPGYTGDRS 34
Score = 34.0 bits (78), Expect = 0.008
Identities = 15/27 (55%), Positives = 20/27 (74%)
Query: 436 CPKPGTCINTMGSFRCLCPRGFKLDRT 462
C GTC+NT GS+RC+CP G+ DR+
Sbjct: 8 CSNGGTCVNTPGSYRCVCPPGYTGDRS 34
Score = 30.5 bits (69), Expect = 0.14
Identities = 11/21 (52%), Positives = 12/21 (57%)
Query: 244 HTCTNTEGSYTCGCHPGFELS 264
TC NT GSY C C PG+
Sbjct: 12 GTCVNTPGSYRCVCPPGYTGD 32
Score = 29.4 bits (66), Expect = 0.29
Identities = 19/38 (50%), Positives = 20/38 (52%), Gaps = 9/38 (23%)
Query: 316 ECAR-NPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQ 352
ECA NPC GG C NT GSYRC CP G+
Sbjct: 1 ECAASNPCSN--------GGTCVNTPGSYRCVCPPGYT 30
Score = 27.4 bits (61), Expect = 1.3
Identities = 14/35 (40%), Positives = 16/35 (45%), Gaps = 5/35 (14%)
Query: 188 CEDINECLELSNQCAFRCHNVPGSFRCICPYGYAL 222
C N C C N PGS+RC+CP GY
Sbjct: 2 CAASNPCSNGG-----TCVNTPGSYRCVCPPGYTG 31
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 32.1 bits (73), Expect = 0.034
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 142 PGTCINTMGSFRCLCPRGFKLDRT 165
GTCINT GS+ C CP G+ D+
Sbjct: 10 NGTCINTPGSYTCSCPPGYTGDKR 33
Score = 32.1 bits (73), Expect = 0.034
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 439 PGTCINTMGSFRCLCPRGFKLDRT 462
GTCINT GS+ C CP G+ D+
Sbjct: 10 NGTCINTPGSYTCSCPPGYTGDKR 33
Score = 30.2 bits (68), Expect = 0.17
Identities = 16/32 (50%), Positives = 19/32 (59%), Gaps = 2/32 (6%)
Query: 234 ECEQFPGLCAH-TCTNTEGSYTCGCHPGFELS 264
EC G C++ TC NT GSYTC C PG+
Sbjct: 1 ECAS-GGPCSNGTCINTPGSYTCSCPPGYTGD 31
Score = 27.9 bits (62), Expect = 1.0
Identities = 10/22 (45%), Positives = 11/22 (50%)
Query: 204 RCHNVPGSFRCICPYGYALAPD 225
C N PGS+ C CP GY
Sbjct: 12 TCINTPGSYTCSCPPGYTGDKR 33
Score = 27.5 bits (61), Expect = 1.4
Identities = 11/19 (57%), Positives = 12/19 (63%)
Query: 334 GKCFNTAGSYRCGCPDGFQ 352
G C NT GSY C CP G+
Sbjct: 11 GTCINTPGSYTCSCPPGYT 29
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 30.6 bits (70), Expect = 0.12
Identities = 11/18 (61%), Positives = 13/18 (72%)
Query: 245 TCTNTEGSYTCGCHPGFE 262
TCTNT GS+TC C G+
Sbjct: 13 TCTNTGGSFTCTCKSGYT 30
Score = 29.8 bits (68), Expect = 0.24
Identities = 13/26 (50%), Positives = 14/26 (53%), Gaps = 2/26 (7%)
Query: 144 TCINTMGSFRCLCPRGFKLDRTGTFC 169
TC NT GSF C C G+ D G C
Sbjct: 13 TCTNTGGSFTCTCKSGYTGD--GVTC 36
Score = 29.8 bits (68), Expect = 0.24
Identities = 13/26 (50%), Positives = 14/26 (53%), Gaps = 2/26 (7%)
Query: 441 TCINTMGSFRCLCPRGFKLDRTGTFC 466
TC NT GSF C C G+ D G C
Sbjct: 13 TCTNTGGSFTCTCKSGYTGD--GVTC 36
Score = 29.0 bits (66), Expect = 0.41
Identities = 12/25 (48%), Positives = 12/25 (48%), Gaps = 2/25 (8%)
Query: 205 CHNVPGSFRCICPYGYALAPDGRHC 229
C N GSF C C GY DG C
Sbjct: 14 CTNTGGSFTCTCKSGYTG--DGVTC 36
>gnl|CDD|224596 COG1682, TagG, ABC-type polysaccharide/polyol phosphate export
systems, permease component [Carbohydrate transport and
metabolism / Cell envelope biogenesis, outer membrane].
Length = 263
Score = 32.9 bits (76), Expect = 0.31
Identities = 20/107 (18%), Positives = 34/107 (31%), Gaps = 32/107 (29%)
Query: 22 FLINMLAGIFVWSLASQWTGEILTNVSPYPTSVNMASVSTRKGL-TGVSVTRDTSYTMAI 80
FL +LAG+ +W S+ E SV L ++
Sbjct: 68 FLAYLLAGLILWFFFSEAISEG------------AGSVVANAALIKKINFPPL------- 108
Query: 81 LNVEQDKQGILIFSMLQTRYFNLFY---VIIFYVISLKKKKFYSVFL 124
IL + +R FN +I+ ++I L + + L
Sbjct: 109 ---------ILPVARTLSRLFNFLIHLIIILIFLIILGVEPSWHWLL 146
>gnl|CDD|240151 cd05024, S-100A10, S-100A10: A subgroup of the S-100A10 domain
found in proteins similar to S100A10. S100A10 is a
member of the S100 family of EF-hand superfamily of
calcium-binding proteins. Note that the S-100 hierarchy,
to which this S-100A10 group belongs, contains only
S-100 EF-hand domains, other EF-hands have been modeled
separately. S100 proteins are expressed exclusively in
vertebrates, and are implicated in intracellular and
extracellular regulatory activities. A unique feature of
S100A10 is that it contains mutation in both of the
calcium binding sites, making it calcium insensitive.
S100A10 has been detected in brain, heart,
gastrointestinal tract, kidney, liver, lung, spleen,
testes, epidermis, aorta, and thymus. Structural data
supports the homo- and hetero-dimeric as well as
hetero-tetrameric nature of the protein. S100A10 has
multiple binding partners in its calcium free state and
is therefore involved in many diverse biological
functions.
Length = 91
Score = 30.6 bits (69), Expect = 0.42
Identities = 11/33 (33%), Positives = 17/33 (51%)
Query: 273 DSDECATGRHICQQICNLLGGYKCGCPEGYVRH 305
D D+C G+ Q +L+ G C + YV+H
Sbjct: 56 DLDDCRDGKVGFQSFFSLIAGLLIACNDYYVKH 88
>gnl|CDD|233450 TIGR01528, NMN_trans_PnuC, nicotinamide mononucleotide transporter
PnuC. The PnuC protein of E. coli is membrane protein
responsible for nicotinamide mononucleotide transport,
subject to regulation by interaction with the NadR (also
called NadI) protein (see TIGR01526). This model defines
a region corresponding to most of the length of PnuC,
found primarily in pathogens. The extreme N- and
C-terminal regions are poorly conserved and not included
in the alignment and model [Transport and binding
proteins, Other, Biosynthesis of cofactors, prosthetic
groups, and carriers, Pyridine nucleotides].
Length = 189
Score = 32.0 bits (73), Expect = 0.43
Identities = 25/134 (18%), Positives = 45/134 (33%), Gaps = 33/134 (24%)
Query: 15 ELLIELLFLINMLAGIFVWSLASQWTGEILTNVSPYPTSVN------------MASVSTR 62
E+L++L F+I + G+F W SQ +T+ +
Sbjct: 49 EVLLQLFFVIANIYGLFAWKKNSQQEENGVTSGLKKLKGKGWTIILSTSVVLTVVIALIL 108
Query: 63 KGLTGVSVTRDTSYTMAILNVEQDKQGILIFSMLQTRYF-----NLFYVIIF-------Y 110
K + D S T I V Q IL+ + + N+ +I++
Sbjct: 109 KFIGDNFPWLD-SITFVIGIVAQ----ILMVRRYRENWILWLLSNVVSIILWFLAFTGSL 163
Query: 111 VISLKKKKFYSVFL 124
+ L Y ++L
Sbjct: 164 AMLLT----YIIYL 173
>gnl|CDD|238537 cd01117, YbiR_permease, Putative anion permease YbiR. Based on
sequence similarity, YbiR proteins are predicted to
function as anion translocating permeases in eubacteria,
archaea and plants. They belong to ArsB/NhaD superfamily
of permeases that have been shown to translocate sodium,
sulfate, arsenite and organic anions. A typical
ArsB/NhaD permease is composed of 8-13 transmembrane
domains.
Length = 384
Score = 31.5 bits (72), Expect = 1.0
Identities = 17/62 (27%), Positives = 25/62 (40%), Gaps = 6/62 (9%)
Query: 1 MFLLKARK-LETFDRELLIELLFLINMLAGIFVWSLASQWTGEILTNVSPYPTSVNMASV 59
+ LK R+ L+ D LL LF + I V A E L+ + P SV +
Sbjct: 241 LTRLKPRRVLKKVDWGLL---LFFAGLF--ILVGGFAQGLPLEGLSRLLPLLDSVLSLFL 295
Query: 60 ST 61
+
Sbjct: 296 VS 297
>gnl|CDD|178194 PLN02582, PLN02582, 1-deoxy-D-xylulose-5-phosphate synthase.
Length = 677
Score = 30.6 bits (69), Expect = 2.0
Identities = 14/29 (48%), Positives = 17/29 (58%)
Query: 380 GISGFSCECPSGYQRIGQGHCLATISSGL 408
G+SGF+ S Y G GH TIS+GL
Sbjct: 126 GLSGFTKRAESEYDCFGTGHSSTTISAGL 154
>gnl|CDD|214379 CHL00161, secY, preprotein translocase subunit SecY; Validated.
Length = 417
Score = 30.0 bits (68), Expect = 2.8
Identities = 30/138 (21%), Positives = 50/138 (36%), Gaps = 24/138 (17%)
Query: 16 LLIELLFLINMLAGIFVW---------SLASQWTGEILTNVSPYPTSVNMASVSTRKGLT 66
LL+ LF I ++ IF+ S L + S P +N A
Sbjct: 204 LLLLFLFFITLIGIIFLQEAIRKIPLISAKQLGKSSRLASNSYLPLRLNQA--------- 254
Query: 67 GVSVTRDTSYTMAILN--VEQDKQGIL---IFSMLQTRYFNLFYVIIFYVISLKKKKFYS 121
GV S + + L + LQ F + Y+++++V+ L FYS
Sbjct: 255 GVMPIIFASALLVLPGYLTNILLNQFLLPLLSLFLQLASFKILYLVLYFVLILFFSYFYS 314
Query: 122 VF-LDMDECKEEINGVAV 138
L+ + E + +AV
Sbjct: 315 TIVLNPKDISENLQKMAV 332
>gnl|CDD|232977 TIGR00440, glnS, glutaminyl-tRNA synthetase. This protein is a
relatively rare aminoacyl-tRNA synthetase, found in the
cytosolic compartment of eukaryotes, in E. coli and a
number of other Gram-negative Bacteria, and in
Deinococcus radiodurans. In contrast, the pathway to
Gln-tRNA in mitochondria, Archaea, Gram-positive
Bacteria, and a number of other lineages is by
misacylation with Glu followed by transamidation to
correct the aminoacylation to Gln. This enzyme is a
class I tRNA synthetase (hit by the pfam model
tRNA-synt_1c) and is quite closely related to
glutamyl-tRNA synthetases [Protein synthesis, tRNA
aminoacylation].
Length = 522
Score = 30.3 bits (68), Expect = 2.9
Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 6/75 (8%)
Query: 403 TISSGLYPGFRSYER----PNLGDIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFK 458
+ SS R Y+R PN G D+ IN ++ K G +++G + + F+
Sbjct: 434 SASSKYPTETRLYDRLFKVPNPGAPDDFLSVINPESLVIKQGFMEHSLGD--AVANKRFQ 491
Query: 459 LDRTGTFCLDQDECT 473
+R G FCLD E T
Sbjct: 492 FEREGYFCLDSKEST 506
>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein. The Polycystin
Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a
huge protein of 4303aas. Its repeated leucine-rich (LRR)
segment is found in many proteins. It contains 16
polycystic kidney disease (PKD) domains, one
LDL-receptor class A domain, one C-type lectin family
domain, and 16-18 putative TMSs in positions between
residues 2200 and 4100. Polycystin-L has been shown to
be a cation (Na+, K+ and Ca2+) channel that is activated
by Ca2+. Two members of the PCC family (polycystin 1 and
2) are mutated in autosomal dominant polycystic kidney
disease, and polycystin-L is deleted in mice with renal
and retinal defects. Note: this model is restricted to
the amino half for technical reasons.
Length = 2740
Score = 29.3 bits (65), Expect = 6.2
Identities = 23/123 (18%), Positives = 33/123 (26%), Gaps = 32/123 (26%)
Query: 259 PGFELSSDGSGTCLD--SDECATGRHICQQICNLLGGYK-CGCPEGYVRHPFSNECIDDN 315
P S + + + E HI QI L G P
Sbjct: 494 PALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLRCRLPGAGGPA--------------- 538
Query: 316 ECARNP-CEPSSNTTGERGGKCFNTAGSYRCGCPDGFQFEAALQLCTQLSSSCLSSPCSF 374
C C P N + + C G Q+ +C L + C C+
Sbjct: 539 -CGPEAECRPPDNRSADAPA------------CMKGEQWCPFAHICLPLDAPCHPQACAN 585
Query: 375 GCT 377
GC+
Sbjct: 586 GCS 588
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.324 0.140 0.469
Gapped
Lambda K H
0.267 0.0632 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 23,320,799
Number of extensions: 2141487
Number of successful extensions: 1789
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1767
Number of HSP's successfully gapped: 100
Length of query: 483
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 382
Effective length of database: 6,457,848
Effective search space: 2466897936
Effective search space used: 2466897936
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (27.5 bits)