RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15407
(604 letters)
>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat.
Length = 60
Score = 66.0 bits (162), Expect = 7e-14
Identities = 23/55 (41%), Positives = 32/55 (58%)
Query: 222 DLSNNQIGFLSNFTFSPLSKLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDI 276
DLSNN++ + + F L L L LS N L I +A +GL SLR + L GN++
Sbjct: 6 DLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 61.0 bits (149), Expect = 4e-12
Identities = 22/60 (36%), Positives = 34/60 (56%)
Query: 241 KLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDISMIPEGAFADLHAITHLAIGGNPL 300
L L LS N+L I A GL +L+++ L GN+++ I AF+ L ++ L + GN L
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 50.2 bits (121), Expect = 3e-08
Identities = 20/45 (44%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Query: 118 TELILNDNEIGKIKSDGLFGRLPNLIKLDLRRNQVTGIEDNAFEG 162
L L++N + I DG F LPNL LDL N +T I AF G
Sbjct: 3 KSLDLSNNRLTVI-PDGAFKGLPNLKVLDLSGNNLTSISPEAFSG 46
Score = 32.5 bits (75), Expect = 0.048
Identities = 16/49 (32%), Positives = 19/49 (38%), Gaps = 4/49 (8%)
Query: 106 LKEIPKDIPLYT---TELILNDNEIGKIKSDGLFGRLPNLIKLDLRRNQ 151
L IP L L+ N + I S F LP+L LDL N
Sbjct: 12 LTVIPDGAFKGLPNLKVLDLSGNNLTSI-SPEAFSGLPSLRSLDLSGNN 59
Score = 31.7 bits (73), Expect = 0.099
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 141 NLIKLDLRRNQVTGIEDNAFEG 162
NL LDL N++T I D AF+G
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKG 22
>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein. The Polycystin
Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a
huge protein of 4303aas. Its repeated leucine-rich (LRR)
segment is found in many proteins. It contains 16
polycystic kidney disease (PKD) domains, one
LDL-receptor class A domain, one C-type lectin family
domain, and 16-18 putative TMSs in positions between
residues 2200 and 4100. Polycystin-L has been shown to
be a cation (Na+, K+ and Ca2+) channel that is activated
by Ca2+. Two members of the PCC family (polycystin 1 and
2) are mutated in autosomal dominant polycystic kidney
disease, and polycystin-L is deleted in mice with renal
and retinal defects. Note: this model is restricted to
the amino half for technical reasons.
Length = 2740
Score = 62.8 bits (152), Expect = 5e-10
Identities = 29/82 (35%), Positives = 41/82 (50%), Gaps = 3/82 (3%)
Query: 271 LHGNDISMIPEGAFADLHAITHLAIGGNPLYCDCGLAWLSEWVKRDYV---EPGIARCTE 327
+ N IS I EG A+L ++ + + GNP CDCGLA L W + V +P A C
Sbjct: 2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAG 61
Query: 328 PASMRDKLLLTAPASSFQCKEK 349
P ++ + LL P C E+
Sbjct: 62 PGALAGQPLLGIPLLDSGCDEE 83
>gnl|CDD|227223 COG4886, COG4886, Leucine-rich repeat (LRR) protein [Function
unknown].
Length = 394
Score = 53.0 bits (127), Expect = 2e-07
Identities = 53/199 (26%), Positives = 73/199 (36%), Gaps = 41/199 (20%)
Query: 106 LKEIPKDIPLYT---TELILNDNEIGKIKSDGLFGRLPNLIKLDLRRNQVTGIEDNAFEG 162
+ +IP I L EL L+DN+I + S LPNL LDL N ++ +
Sbjct: 128 ITDIPPLIGLLKSNLKELDLSDNKIESLPSPLRN--LPNLKNLDLSFNDLSDLPKLLSNL 185
Query: 163 ENDQ-GCLGDNYCPPKCTCTVRVLFYDGRAGECYIAVELFSSRIRVSYYVGNYPVSTMYR 221
N L N + L EL
Sbjct: 186 SNLNNLDLSGN--------KISDL--PPEIELLSALEEL--------------------- 214
Query: 222 DLSNNQIGFLSNFTFSPLSKLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDISMIPE 281
DLSNN I + S L LS L LS NKL+ + +++ L +L + L N IS I
Sbjct: 215 DLSNNSI-IELLSSLSNLKNLSGLELSNNKLEDLP-ESIGNLSNLETLDLSNNQISSIS- 271
Query: 282 GAFADLHAITHLAIGGNPL 300
+ L + L + GN L
Sbjct: 272 -SLGSLTNLRELDLSGNSL 289
Score = 43.8 bits (103), Expect = 2e-04
Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 222 DLSNNQIGFLSNFTFSPLSKLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDISMIPE 281
DL NN I + S L +L LS NK++ + L L +L+ + L ND+S +P+
Sbjct: 122 DLDNNNITDIPPLIGLLKSNLKELDLSDNKIESLPSP-LRNLPNLKNLDLSFNDLSDLPK 180
Query: 282 GAFADLHAITHLAIGGN 298
++L + +L + GN
Sbjct: 181 LL-SNLSNLNNLDLSGN 196
Score = 32.2 bits (73), Expect = 0.77
Identities = 17/63 (26%), Positives = 31/63 (49%), Gaps = 4/63 (6%)
Query: 97 TRVDCSQRGLKEIPKDIPLYT--TELILNDNEIGKIKSDGLFGRLPNLIKLDLRRNQVTG 154
+D S + ++P +I L + EL L++N I I+ L NL L+L N++
Sbjct: 189 NNLDLSGNKISDLPPEIELLSALEELDLSNNSI--IELLSSLSNLKNLSGLELSNNKLED 246
Query: 155 IED 157
+ +
Sbjct: 247 LPE 249
>gnl|CDD|214507 smart00082, LRRCT, Leucine rich repeat C-terminal domain.
Length = 51
Score = 45.1 bits (107), Expect = 1e-06
Identities = 20/52 (38%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 298 NPLYCDCGLAWLSEWVKRDYV--EPGIARCTEPASMRDKLLLTAPASSFQCK 347
NP CDC L WL W++ + +P RC P+S+R LL S F+C
Sbjct: 1 NPFICDCELRWLLRWLQANEHLQDPVDLRCASPSSLRGP-LLELLHSEFKCP 51
>gnl|CDD|214470 smart00013, LRRNT, Leucine rich repeat N-terminal domain.
Length = 33
Score = 42.7 bits (101), Expect = 8e-06
Identities = 20/33 (60%), Positives = 24/33 (72%)
Query: 87 ACPSGCSCDGTRVDCSQRGLKEIPKDIPLYTTE 119
ACP+ C+C GT VDCS RGL E+P D+P TT
Sbjct: 1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular (mostly
animal) proteins. Many of these proteins require calcium
for their biological function and calcium-binding sites
have been found to be located at the N-terminus of
particular EGF-like domains; calcium-binding may be
crucial for numerous protein-protein interactions. Six
conserved core cysteines form three disulfide bridges as
in non calcium-binding EGF domains, whose structures are
very similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 42.6 bits (101), Expect = 9e-06
Identities = 16/38 (42%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 513 NVDDCV-NHMCQNGGTCVDGINDYTCKCEGDFVGKFCE 549
++D+C + CQNGGTCV+ + Y C C + G+ CE
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
Score = 41.9 bits (99), Expect = 2e-05
Identities = 16/38 (42%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 435 NIDDCLS-HKCENNATCVDLIQAYECRCAPGFQGEFCQ 471
+ID+C S + C+N TCV+ + +Y C C PG+ G C+
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
Score = 40.3 bits (95), Expect = 5e-05
Identities = 14/32 (43%), Positives = 19/32 (59%)
Query: 480 EYNPCRNGARCLDHFSHYTCECTPGFSGVNCT 511
NPC+NG C++ Y C C PG++G NC
Sbjct: 7 SGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
Score = 39.9 bits (94), Expect = 7e-05
Identities = 21/38 (55%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 397 IDACY-GNPCRNSGTCKLLEEGRFNCECAPGYTGDRCE 433
ID C GNPC+N GTC G + C C PGYTG CE
Sbjct: 2 IDECASGNPCQNGGTCVNTV-GSYRCSCPPGYTGRNCE 38
Score = 34.5 bits (80), Expect = 0.006
Identities = 14/31 (45%), Positives = 18/31 (58%), Gaps = 1/31 (3%)
Query: 364 SPCTNQGVCEPLPERQYQCRCTPGYHGQHCE 394
+PC N G C Y+C C PGY G++CE
Sbjct: 9 NPCQNGGTCVNTVGS-YRCSCPPGYTGRNCE 38
Score = 33.8 bits (78), Expect = 0.011
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 1 NGGTCVDGINDYTCKCEGDFVGKFCE 26
NGGTCV+ + Y C C + G+ CE
Sbjct: 13 NGGTCVNTVGSYRCSCPPGYTGRNCE 38
>gnl|CDD|201808 pfam01462, LRRNT, Leucine rich repeat N-terminal domain. Leucine
Rich Repeats pfam00560 are short sequence motifs present
in a number of proteins with diverse functions and
cellular locations. Leucine Rich Repeats are often
flanked by cysteine rich domains. This domain is often
found at the N-terminus of tandem leucine rich repeats.
Length = 28
Score = 39.5 bits (93), Expect = 8e-05
Identities = 18/28 (64%), Positives = 22/28 (78%)
Query: 87 ACPSGCSCDGTRVDCSQRGLKEIPKDIP 114
ACP+ C C GT VDCS RGL E+P+D+P
Sbjct: 1 ACPAPCVCSGTVVDCSGRGLTEVPRDLP 28
>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies). Leucine rich
repeats are short sequence motifs present in a number of
proteins with diverse functions and cellular locations.
These repeats are usually involved in protein-protein
interactions. Each Leucine Rich Repeat is composed of a
beta-alpha unit. These units form elongated non-globular
structures. Leucine Rich Repeats are often flanked by
cysteine rich domains.
Length = 43
Score = 39.8 bits (94), Expect = 9e-05
Identities = 13/39 (33%), Positives = 20/39 (51%), Gaps = 3/39 (7%)
Query: 118 TELILNDNEIGKIKSDGLFGRLPNLIKLDLRRNQVTGIE 156
L L++N+I + LPNL LDL N++T +
Sbjct: 4 ETLDLSNNQITDLPP---LSNLPNLETLDLSGNKITDLS 39
Score = 39.4 bits (93), Expect = 1e-04
Identities = 12/41 (29%), Positives = 20/41 (48%), Gaps = 2/41 (4%)
Query: 241 KLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDISMIPE 281
L L LS N++ + L+ L +L + L GN I+ +
Sbjct: 2 NLETLDLSNNQITDLP--PLSNLPNLETLDLSGNKITDLSP 40
Score = 35.5 bits (83), Expect = 0.002
Identities = 16/35 (45%), Positives = 18/35 (51%), Gaps = 2/35 (5%)
Query: 222 DLSNNQIGFLSNFTFSPLSKLSQLILSYNKLQCID 256
DLSNNQI L S L L L LS NK+ +
Sbjct: 7 DLSNNQITDLPPL--SNLPNLETLDLSGNKITDLS 39
Score = 32.5 bits (75), Expect = 0.033
Identities = 9/37 (24%), Positives = 18/37 (48%), Gaps = 2/37 (5%)
Query: 264 KSLRIISLHGNDISMIPEGAFADLHAITHLAIGGNPL 300
+L + L N I+ +P ++L + L + GN +
Sbjct: 1 TNLETLDLSNNQITDLPP--LSNLPNLETLDLSGNKI 35
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 37.6 bits (88), Expect = 5e-04
Identities = 16/39 (41%), Positives = 25/39 (64%), Gaps = 2/39 (5%)
Query: 513 NVDDCV-NHMCQNGGTCVDGINDYTCKCEGDFV-GKFCE 549
++D+C + CQNGGTCV+ + Y C+C + G+ CE
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39
Score = 36.8 bits (86), Expect = 8e-04
Identities = 16/39 (41%), Positives = 25/39 (64%), Gaps = 2/39 (5%)
Query: 435 NIDDCLS-HKCENNATCVDLIQAYECRCAPGFQ-GEFCQ 471
+ID+C S + C+N TCV+ + +Y C C PG+ G C+
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39
Score = 35.3 bits (82), Expect = 0.003
Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
Query: 480 EYNPCRNGARCLDHFSHYTCECTPGFS-GVNCT 511
NPC+NG C++ Y CEC PG++ G NC
Sbjct: 7 SGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39
Score = 32.6 bits (75), Expect = 0.030
Identities = 22/39 (56%), Positives = 24/39 (61%), Gaps = 3/39 (7%)
Query: 397 IDACY-GNPCRNSGTCKLLEEGRFNCECAPGYT-GDRCE 433
ID C GNPC+N GTC G + CEC PGYT G CE
Sbjct: 2 IDECASGNPCQNGGTCVNTV-GSYRCECPPGYTDGRNCE 39
Score = 29.5 bits (67), Expect = 0.31
Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 2/33 (6%)
Query: 363 KSPCTNQGVCEPLPERQYQCRCTPGYH-GQHCE 394
+PC N G C Y+C C PGY G++CE
Sbjct: 8 GNPCQNGGTCVNTVG-SYRCECPPGYTDGRNCE 39
Score = 28.8 bits (65), Expect = 0.58
Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Query: 1 NGGTCVDGINDYTCKCEGDFV-GKFCE 26
NGGTCV+ + Y C+C + G+ CE
Sbjct: 13 NGGTCVNTVGSYRCECPPGYTDGRNCE 39
>gnl|CDD|197688 smart00370, LRR, Leucine-rich repeats, outliers.
Length = 24
Score = 36.6 bits (86), Expect = 0.001
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 263 LKSLRIISLHGNDISMIPEGAFAD 286
L +LR + L N +S +P GAF
Sbjct: 1 LPNLRELDLSNNQLSSLPPGAFQG 24
Score = 34.2 bits (80), Expect = 0.006
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 139 LPNLIKLDLRRNQVTGIEDNAFEG 162
LPNL +LDL NQ++ + AF+G
Sbjct: 1 LPNLRELDLSNNQLSSLPPGAFQG 24
Score = 28.5 bits (65), Expect = 0.64
Identities = 9/24 (37%), Positives = 12/24 (50%)
Query: 239 LSKLSQLILSYNKLQCIDRDALAG 262
L L +L LS N+L + A G
Sbjct: 1 LPNLRELDLSNNQLSSLPPGAFQG 24
>gnl|CDD|197687 smart00369, LRR_TYP, Leucine-rich repeats, typical (most populated)
subfamily.
Length = 24
Score = 36.6 bits (86), Expect = 0.001
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 263 LKSLRIISLHGNDISMIPEGAFAD 286
L +LR + L N +S +P GAF
Sbjct: 1 LPNLRELDLSNNQLSSLPPGAFQG 24
Score = 34.2 bits (80), Expect = 0.006
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 139 LPNLIKLDLRRNQVTGIEDNAFEG 162
LPNL +LDL NQ++ + AF+G
Sbjct: 1 LPNLRELDLSNNQLSSLPPGAFQG 24
Score = 28.5 bits (65), Expect = 0.64
Identities = 9/24 (37%), Positives = 12/24 (50%)
Query: 239 LSKLSQLILSYNKLQCIDRDALAG 262
L L +L LS N+L + A G
Sbjct: 1 LPNLRELDLSNNQLSSLPPGAFQG 24
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar, but
has 8 instead of 6 conserved cysteines. Includes some
cytokine receptors. The EGF domain misses the N-terminus
regions of the Ca2+ binding EGF domains (this is the
main reason of discrepancy between swiss-prot domain
start/end and Pfam). The family is hard to model due to
many similar but different sub-types of EGF domains.
Pfam certainly misses a number of EGF domains.
Length = 32
Score = 34.7 bits (80), Expect = 0.004
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 482 NPCRNGARCLDHFSHYTCECTPGFSG 507
NPC NG C+D YTCEC G++G
Sbjct: 5 NPCSNGGTCVDTPGGYTCECPEGYTG 30
Score = 32.0 bits (73), Expect = 0.040
Identities = 11/26 (42%), Positives = 13/26 (50%)
Query: 444 CENNATCVDLIQAYECRCAPGFQGEF 469
C N TCVD Y C C G+ G+
Sbjct: 7 CSNGGTCVDTPGGYTCECPEGYTGKR 32
Score = 32.0 bits (73), Expect = 0.043
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 519 NHMCQNGGTCVDGINDYTCKCEGDFVGKF 547
N+ C NGGTCVD YTC+C + GK
Sbjct: 4 NNPCSNGGTCVDTPGGYTCECPEGYTGKR 32
Score = 28.9 bits (65), Expect = 0.44
Identities = 16/29 (55%), Positives = 17/29 (58%), Gaps = 1/29 (3%)
Query: 403 NPCRNSGTCKLLEEGRFNCECAPGYTGDR 431
NPC N GTC G + CEC GYTG R
Sbjct: 5 NPCSNGGTCVDTPGG-YTCECPEGYTGKR 32
Score = 28.6 bits (64), Expect = 0.67
Identities = 12/28 (42%), Positives = 14/28 (50%), Gaps = 1/28 (3%)
Query: 365 PCTNQGVCEPLPERQYQCRCTPGYHGQH 392
PC+N G C P Y C C GY G+
Sbjct: 6 PCSNGGTCVDTPGG-YTCECPEGYTGKR 32
Score = 26.6 bits (59), Expect = 3.5
Identities = 13/24 (54%), Positives = 15/24 (62%)
Query: 1 NGGTCVDGINDYTCKCEGDFVGKF 24
NGGTCVD YTC+C + GK
Sbjct: 9 NGGTCVDTPGGYTCECPEGYTGKR 32
Score = 26.6 bits (59), Expect = 3.7
Identities = 12/36 (33%), Positives = 16/36 (44%), Gaps = 6/36 (16%)
Query: 559 PQTSPCQHHDCKNGICFQPQGSNDYLCKCAPGYSGK 594
+PC + G C G Y C+C GY+GK
Sbjct: 2 SPNNPCSNG----GTCVDTPG--GYTCECPEGYTGK 31
Score = 25.5 bits (56), Expect = 9.1
Identities = 11/35 (31%), Positives = 15/35 (42%), Gaps = 6/35 (17%)
Query: 36 PQTSPCQHHDCKNGICFQPQGSNDYLCKCAPGYSG 70
+PC + G C G Y C+C GY+G
Sbjct: 2 SPNNPCSNG----GTCVDTPG--GYTCECPEGYTG 30
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 34.4 bits (79), Expect = 0.007
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 440 LSHKCENNATCVDLIQAYECRCAPGFQGEF 469
S+ C N TCV+ +Y C C PG+ G+
Sbjct: 4 ASNPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
Score = 33.6 bits (77), Expect = 0.011
Identities = 12/28 (42%), Positives = 16/28 (57%)
Query: 482 NPCRNGARCLDHFSHYTCECTPGFSGVN 509
NPC NG C++ Y C C PG++G
Sbjct: 6 NPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
Score = 32.1 bits (73), Expect = 0.043
Identities = 11/29 (37%), Positives = 15/29 (51%)
Query: 519 NHMCQNGGTCVDGINDYTCKCEGDFVGKF 547
++ C NGGTCV+ Y C C + G
Sbjct: 5 SNPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
Score = 30.9 bits (70), Expect = 0.12
Identities = 18/32 (56%), Positives = 19/32 (59%), Gaps = 2/32 (6%)
Query: 403 NPCRNSGTCKLLEEGRFNCECAPGYTGD-RCE 433
NPC N GTC G + C C PGYTGD CE
Sbjct: 6 NPCSNGGTCVNTP-GSYRCVCPPGYTGDRSCE 36
Score = 28.2 bits (63), Expect = 1.1
Identities = 15/32 (46%), Positives = 18/32 (56%), Gaps = 2/32 (6%)
Query: 364 SPCTNQGVCEPLPERQYQCRCTPGYHGQ-HCE 394
+PC+N G C P Y+C C PGY G CE
Sbjct: 6 NPCSNGGTCVNTPG-SYRCVCPPGYTGDRSCE 36
Score = 26.7 bits (59), Expect = 3.1
Identities = 10/24 (41%), Positives = 12/24 (50%)
Query: 1 NGGTCVDGINDYTCKCEGDFVGKF 24
NGGTCV+ Y C C + G
Sbjct: 10 NGGTCVNTPGSYRCVCPPGYTGDR 33
>gnl|CDD|178695 PLN03150, PLN03150, hypothetical protein; Provisional.
Length = 623
Score = 37.9 bits (88), Expect = 0.014
Identities = 20/56 (35%), Positives = 30/56 (53%)
Query: 222 DLSNNQIGFLSNFTFSPLSKLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDIS 277
+LS N I + ++ L L LSYN ++L L SLRI++L+GN +S
Sbjct: 448 NLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGNSLS 503
>gnl|CDD|215061 PLN00113, PLN00113, leucine-rich repeat receptor-like protein
kinase; Provisional.
Length = 968
Score = 36.4 bits (84), Expect = 0.052
Identities = 28/80 (35%), Positives = 38/80 (47%), Gaps = 2/80 (2%)
Query: 222 DLSNNQIGFLSNFTFSPLSKLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDIS-MIP 280
DLS NQ LS+L QL LS NKL D L+ K L + L N +S IP
Sbjct: 481 DLSRNQFSGAVPRKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIP 540
Query: 281 EGAFADLHAITHLAIGGNPL 300
F+++ ++ L + N L
Sbjct: 541 AS-FSEMPVLSQLDLSQNQL 559
>gnl|CDD|214598 smart00282, LamG, Laminin G domain.
Length = 132
Score = 33.5 bits (77), Expect = 0.087
Identities = 13/44 (29%), Positives = 20/44 (45%), Gaps = 1/44 (2%)
Query: 184 VLFYDGRAGEC-YIAVELFSSRIRVSYYVGNYPVSTMYRDLSNN 226
+L Y G G Y+A+EL R+ + Y +G+ P N
Sbjct: 14 LLLYAGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLN 57
>gnl|CDD|215681 pfam00054, Laminin_G_1, Laminin G domain.
Length = 131
Score = 33.4 bits (77), Expect = 0.10
Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 1/44 (2%)
Query: 184 VLFYDGRAGE-CYIAVELFSSRIRVSYYVGNYPVSTMYRDLSNN 226
+L Y+G E ++A+EL R+ VSY +G+ D N+
Sbjct: 9 LLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGDKLND 52
>gnl|CDD|176164 cd08475, PBP2_CrgA_like_6, The C-terminal substrate binding domain
of an uncharacterized LysR-type transcriptional
regulator CrgA-like, contains the type 2 periplasmic
binding fold. This CD represents the substrate binding
domain of an uncharacterized LysR-type transcriptional
regulator (LTTR) CrgA-like 6. The LTTRs are acting as
both auto-repressors and activators of target promoters,
controlling operons involved in a wide variety of
cellular processes such as amino acid biosynthesis, CO2
fixation, antibiotic resistance, degradation of aromatic
compounds, nodule formation of nitrogen-fixing bacteria,
and synthesis of virulence factors, to name a few. In
contrast to the tetrameric form of other LTTRs, CrgA
from Neisseria meningitides assembles into an octameric
ring, which can bind up to four 63-bp DNA
oligonucleotides. Phylogenetic cluster analysis showed
that the CrgA-like regulators form a subclass of the
LTTRs that function as octamers. The CrgA is an
auto-repressor of its own gene and activates the
expression of the mdaB gene which coding for an
NADPH-quinone reductase and that its action is increased
by MBL (alpha-methylene-gamma-butyrolactone), an inducer
of NADPH-quinone oxidoreductase. The structural
topology of this substrate-binding domain is most
similar to that of the type 2 periplasmic binding
proteins (PBP2), which are responsible for the uptake of
a variety of substrates such as phosphate, sulfate,
polysaccharides, lysine/arginine/ornithine, and
histidine. The PBP2 bind their ligand in the cleft
between these domains in a manner resembling a Venus
flytrap. After binding their specific ligand with high
affinity, they can interact with a cognate membrane
transport complex comprised of two integral membrane
domains and two cytoplasmically located ATPase domains.
This interaction triggers the ligand translocation
across the cytoplasmic membrane energized by ATP
hydrolysis.
Length = 199
Score = 33.7 bits (78), Expect = 0.14
Identities = 12/39 (30%), Positives = 18/39 (46%), Gaps = 6/39 (15%)
Query: 283 AFADLHAITHLAIGGNPLYCDCGLAWLSEWVKRDYVEPG 321
F D AI A+ G G+A L W+ D+++ G
Sbjct: 130 QFDDGEAIADAALAG------LGIAQLPTWLVADHLQRG 162
>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
Pvs28. This family consists of several ookinete surface
protein (Pvs28) from several species of Plasmodium.
Pvs25 and Pvs28 are expressed on the surface of
ookinetes. These proteins are potential candidates for
vaccine and induce antibodies that block the infectivity
of Plasmodium vivax in immunised animals.
Length = 196
Score = 31.3 bits (71), Expect = 1.1
Identities = 31/129 (24%), Positives = 43/129 (33%), Gaps = 10/129 (7%)
Query: 365 PCTNQGVCEPLP----ERQYQCRCTPGYHGQHCEFMIDACYGNPCRNSGTCKLLEEGRFN 420
C C E+ +C C GY + + C C SG C + N
Sbjct: 51 VCGEYATCINQANKAEEKALKCGCINGYTLSQGVCVPNKCNNKVC-GSGKCIVDPANPNN 109
Query: 421 --CECAPGYTGD---RCEINIDDCLSHKCENNATCVDLIQAYECRCAPGFQGEFCQTKTP 475
C C G D +C + S KC+ N C + YEC C GF G+ T +
Sbjct: 110 TTCSCNIGKVPDQNGKCTKTGETKCSLKCKENEECKLVGGYYECVCKEGFPGDGGGTGSG 169
Query: 476 FCAGEYNPC 484
+
Sbjct: 170 GPPTSSSVM 178
Score = 30.9 bits (70), Expect = 1.4
Identities = 20/93 (21%), Positives = 33/93 (35%), Gaps = 10/93 (10%)
Query: 457 YECRCAPGF--QGEFCQTKTPFCA---GEYNPCRNGARCLDHFSH-----YTCECTPGFS 506
+EC+C G+ + E + C C A C++ + C C G++
Sbjct: 20 FECKCNEGYVLKNENTCEEKVKCDKLENVNKVCGEYATCINQANKAEEKALKCGCINGYT 79
Query: 507 GVNCTVNVDDCVNHMCQNGGTCVDGINDYTCKC 539
+ C N +C +G VD N C
Sbjct: 80 LSQGVCVPNKCNNKVCGSGKCIVDPANPNNTTC 112
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 27.7 bits (62), Expect = 1.7
Identities = 12/33 (36%), Positives = 19/33 (57%), Gaps = 2/33 (6%)
Query: 436 IDDCLS--HKCENNATCVDLIQAYECRCAPGFQ 466
+D+C H C N CV+ I ++EC C G++
Sbjct: 2 VDECADGTHNCPANTVCVNTIGSFECVCPDGYE 34
>gnl|CDD|216930 pfam02210, Laminin_G_2, Laminin G domain. This family includes the
Thrombospondin N-terminal-like domain, a Laminin G
subfamily.
Length = 124
Score = 29.7 bits (67), Expect = 1.8
Identities = 12/47 (25%), Positives = 23/47 (48%), Gaps = 3/47 (6%)
Query: 184 VLFY-DGRAGECYIAVELFSSRIRVSYYVGNYPVSTMYRD--LSNNQ 227
+L Y G G ++A+EL R+ + Y +G+ + L++ Q
Sbjct: 9 LLLYAGGEDGLDFLALELEDGRLVLRYDLGSGGSVLLLSGKKLNDGQ 55
>gnl|CDD|185268 PRK15370, PRK15370, E3 ubiquitin-protein ligase SlrP; Provisional.
Length = 754
Score = 31.2 bits (70), Expect = 1.8
Identities = 19/61 (31%), Positives = 29/61 (47%), Gaps = 6/61 (9%)
Query: 240 SKLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDISMIPEGAFADLHAITHLAIGGNP 299
S L L L +NK+ C+ + L+ L S++ N I +P + ITHL + N
Sbjct: 262 SALQSLDLFHNKISCLPENLPEELRYL---SVYDNSIRTLPAHLPS---GITHLNVQSNS 315
Query: 300 L 300
L
Sbjct: 316 L 316
Score = 30.8 bits (69), Expect = 2.7
Identities = 21/59 (35%), Positives = 29/59 (49%), Gaps = 4/59 (6%)
Query: 97 TRVDCSQRGLKEIPKDIPLYTTELILNDNEIGKIKSDGLFGRLPNLIKLDLRRNQVTGI 155
T + GL IP IP T LIL++NE+ + + L G N+ L NQ+T I
Sbjct: 181 TELRLKILGLTTIPACIPEQITTLILDNNELKSL-PENLQG---NIKTLYANSNQLTSI 235
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 27.1 bits (60), Expect = 2.2
Identities = 10/27 (37%), Positives = 14/27 (51%), Gaps = 1/27 (3%)
Query: 441 SHKCENNATCVDLIQAYECRCAPGFQG 467
C N TC++ +Y C C PG+ G
Sbjct: 5 GGPCSNG-TCINTPGSYTCSCPPGYTG 30
Score = 27.1 bits (60), Expect = 2.3
Identities = 13/32 (40%), Positives = 15/32 (46%), Gaps = 2/32 (6%)
Query: 565 QHHDCKNGICFQPQGSNDYLCKCAPGYSGKES 596
C NG C GS Y C C PGY+G +
Sbjct: 4 SGGPCSNGTCINTPGS--YTCSCPPGYTGDKR 33
Score = 27.1 bits (60), Expect = 2.4
Identities = 13/31 (41%), Positives = 15/31 (48%), Gaps = 2/31 (6%)
Query: 42 QHHDCKNGICFQPQGSNDYLCKCAPGYSGVE 72
C NG C GS Y C C PGY+G +
Sbjct: 4 SGGPCSNGTCINTPGS--YTCSCPPGYTGDK 32
Score = 27.1 bits (60), Expect = 2.4
Identities = 12/28 (42%), Positives = 16/28 (57%), Gaps = 1/28 (3%)
Query: 482 NPCRNGARCLDHFSHYTCECTPGFSGVN 509
PC NG C++ YTC C PG++G
Sbjct: 6 GPCSNG-TCINTPGSYTCSCPPGYTGDK 32
Score = 25.6 bits (56), Expect = 7.7
Identities = 16/30 (53%), Positives = 18/30 (60%), Gaps = 2/30 (6%)
Query: 402 GNPCRNSGTCKLLEEGRFNCECAPGYTGDR 431
G PC N GTC G + C C PGYTGD+
Sbjct: 5 GGPCSN-GTCINTP-GSYTCSCPPGYTGDK 32
>gnl|CDD|204999 pfam12661, hEGF, Human growth factor-like EGF. hEGF, or human
growth factor-like EGF, domains have six conserved
residues disulfide-bonded into the characteristic
'ababcc' pattern. They are involved in growth and
proliferation of cells, in proteins of the Notch/Delta
pathway, neurogulin and selectins. hEGFs are also found
in mosaic proteins with four-disulfide laminin EGFs such
as aggrecan and perlecan. The core fold of the EGF
domain consists of two small beta-hairpins packed
against each other. Two major structural variants have
been identified based on the structural context of the
C-terminal Cys residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In hEGFs the
C-terminal thiol resides in the beta-turn, resulting in
shorter loop-lengths between the Cys residues of
disulfide 'c', typically C[8-9]XC. These shorter
loop-lengths are also typical of the four-disulfide EGF
domains, laminin ad integrin. Tandem hEGF domains have
six linking residues between terminal cysteines of
adjacent domains. hEGF domains may or may not bind
calcium in the linker region. hEGF domains with the
consensus motif CXD4X[F,Y]XCXC are hydroxylated
exclusively in the Asp residue.
Length = 13
Score = 26.5 bits (60), Expect = 2.5
Identities = 9/12 (75%), Positives = 10/12 (83%)
Query: 421 CECAPGYTGDRC 432
C+C PGYTG RC
Sbjct: 2 CQCPPGYTGPRC 13
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 26.7 bits (60), Expect = 3.8
Identities = 9/26 (34%), Positives = 13/26 (50%)
Query: 442 HKCENNATCVDLIQAYECRCAPGFQG 467
C NATC + ++ C C G+ G
Sbjct: 6 GGCHPNATCTNTGGSFTCTCKSGYTG 31
>gnl|CDD|215343 PLN02638, PLN02638, cellulose synthase A (UDP-forming), catalytic
subunit.
Length = 1079
Score = 30.3 bits (68), Expect = 4.2
Identities = 19/64 (29%), Positives = 27/64 (42%), Gaps = 3/64 (4%)
Query: 523 QNGGTCVDGINDYTCKCEGDFVGKFCEIAPFVA---MMYPQTSPCQHHDCKNGICFQPQG 579
+ G + C+ GD VGK + PFVA +P PC ++ K+G PQ
Sbjct: 6 ETGAKPMKHGGGQVCQICGDNVGKTVDGEPFVACDVCAFPVCRPCYEYERKDGNQSCPQC 65
Query: 580 SNDY 583
Y
Sbjct: 66 KTKY 69
Score = 29.9 bits (67), Expect = 5.4
Identities = 19/62 (30%), Positives = 26/62 (41%), Gaps = 3/62 (4%)
Query: 2 GGTCVDGINDYTCKCEGDFVGKFCEIAPFVA---MMYPQTSPCQHHDCKNGICFQPQGSN 58
G + C+ GD VGK + PFVA +P PC ++ K+G PQ
Sbjct: 8 GAKPMKHGGGQVCQICGDNVGKTVDGEPFVACDVCAFPVCRPCYEYERKDGNQSCPQCKT 67
Query: 59 DY 60
Y
Sbjct: 68 KY 69
>gnl|CDD|215680 pfam00053, Laminin_EGF, Laminin EGF-like (Domains III and V). This
family is like pfam00008 but has 8 conserved cysteines
instead of six.
Length = 49
Score = 26.2 bits (58), Expect = 7.9
Identities = 12/31 (38%), Positives = 14/31 (45%)
Query: 381 QCRCTPGYHGQHCEFMIDACYGNPCRNSGTC 411
QC C PG G+HC+ YG P C
Sbjct: 19 QCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49
>gnl|CDD|238064 cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease
inhibitor (RI)-like subfamily. LRRs are 20-29 residue
sequence motifs present in many proteins that
participate in protein-protein interactions and have
different functions and cellular locations. LRRs
correspond to structural units consisting of a beta
strand (LxxLxLxxN/CxL conserved pattern) and an alpha
helix. This alignment contains 12 strands corresponding
to 11 full repeats, consistent with the extent observed
in the subfamily acting as Ran GTPase Activating
Proteins (RanGAP1).
Length = 319
Score = 28.9 bits (65), Expect = 8.4
Identities = 14/57 (24%), Positives = 24/57 (42%), Gaps = 5/57 (8%)
Query: 259 ALAGLKSLRIISLHGNDI----SMIPEGAFADLHAITHLAIGGNPLYCDCGLAWLSE 311
L +L ++ L+ N + + A L ++ L +G N L D G A L+
Sbjct: 188 GLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLNLGDNNL-TDAGAAALAS 243
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.140 0.472
Gapped
Lambda K H
0.267 0.0686 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 29,368,236
Number of extensions: 2739643
Number of successful extensions: 1921
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1887
Number of HSP's successfully gapped: 130
Length of query: 604
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 502
Effective length of database: 6,413,494
Effective search space: 3219573988
Effective search space used: 3219573988
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 62 (27.7 bits)