RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15407
         (604 letters)



>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat. 
          Length = 60

 Score = 66.0 bits (162), Expect = 7e-14
 Identities = 23/55 (41%), Positives = 32/55 (58%)

Query: 222 DLSNNQIGFLSNFTFSPLSKLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDI 276
           DLSNN++  + +  F  L  L  L LS N L  I  +A +GL SLR + L GN++
Sbjct: 6   DLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 61.0 bits (149), Expect = 4e-12
 Identities = 22/60 (36%), Positives = 34/60 (56%)

Query: 241 KLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDISMIPEGAFADLHAITHLAIGGNPL 300
            L  L LS N+L  I   A  GL +L+++ L GN+++ I   AF+ L ++  L + GN L
Sbjct: 1   NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 50.2 bits (121), Expect = 3e-08
 Identities = 20/45 (44%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 118 TELILNDNEIGKIKSDGLFGRLPNLIKLDLRRNQVTGIEDNAFEG 162
             L L++N +  I  DG F  LPNL  LDL  N +T I   AF G
Sbjct: 3   KSLDLSNNRLTVI-PDGAFKGLPNLKVLDLSGNNLTSISPEAFSG 46



 Score = 32.5 bits (75), Expect = 0.048
 Identities = 16/49 (32%), Positives = 19/49 (38%), Gaps = 4/49 (8%)

Query: 106 LKEIPKDIPLYT---TELILNDNEIGKIKSDGLFGRLPNLIKLDLRRNQ 151
           L  IP            L L+ N +  I S   F  LP+L  LDL  N 
Sbjct: 12  LTVIPDGAFKGLPNLKVLDLSGNNLTSI-SPEAFSGLPSLRSLDLSGNN 59



 Score = 31.7 bits (73), Expect = 0.099
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 141 NLIKLDLRRNQVTGIEDNAFEG 162
           NL  LDL  N++T I D AF+G
Sbjct: 1   NLKSLDLSNNRLTVIPDGAFKG 22


>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein.  The Polycystin
           Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a
           huge protein of 4303aas. Its repeated leucine-rich (LRR)
           segment is found in many proteins. It contains 16
           polycystic kidney disease (PKD) domains, one
           LDL-receptor class A domain, one C-type lectin family
           domain, and 16-18 putative TMSs in positions between
           residues 2200 and 4100. Polycystin-L has been shown to
           be a cation (Na+, K+ and Ca2+) channel that is activated
           by Ca2+. Two members of the PCC family (polycystin 1 and
           2) are mutated in autosomal dominant polycystic kidney
           disease, and polycystin-L is deleted in mice with renal
           and retinal defects. Note: this model is restricted to
           the amino half for technical reasons.
          Length = 2740

 Score = 62.8 bits (152), Expect = 5e-10
 Identities = 29/82 (35%), Positives = 41/82 (50%), Gaps = 3/82 (3%)

Query: 271 LHGNDISMIPEGAFADLHAITHLAIGGNPLYCDCGLAWLSEWVKRDYV---EPGIARCTE 327
           +  N IS I EG  A+L  ++ + + GNP  CDCGLA L  W +   V   +P  A C  
Sbjct: 2   ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAG 61

Query: 328 PASMRDKLLLTAPASSFQCKEK 349
           P ++  + LL  P     C E+
Sbjct: 62  PGALAGQPLLGIPLLDSGCDEE 83


>gnl|CDD|227223 COG4886, COG4886, Leucine-rich repeat (LRR) protein [Function
           unknown].
          Length = 394

 Score = 53.0 bits (127), Expect = 2e-07
 Identities = 53/199 (26%), Positives = 73/199 (36%), Gaps = 41/199 (20%)

Query: 106 LKEIPKDIPLYT---TELILNDNEIGKIKSDGLFGRLPNLIKLDLRRNQVTGIEDNAFEG 162
           + +IP  I L      EL L+DN+I  + S      LPNL  LDL  N ++ +       
Sbjct: 128 ITDIPPLIGLLKSNLKELDLSDNKIESLPSPLRN--LPNLKNLDLSFNDLSDLPKLLSNL 185

Query: 163 ENDQ-GCLGDNYCPPKCTCTVRVLFYDGRAGECYIAVELFSSRIRVSYYVGNYPVSTMYR 221
            N     L  N         +  L             EL                     
Sbjct: 186 SNLNNLDLSGN--------KISDL--PPEIELLSALEEL--------------------- 214

Query: 222 DLSNNQIGFLSNFTFSPLSKLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDISMIPE 281
           DLSNN I      + S L  LS L LS NKL+ +  +++  L +L  + L  N IS I  
Sbjct: 215 DLSNNSI-IELLSSLSNLKNLSGLELSNNKLEDLP-ESIGNLSNLETLDLSNNQISSIS- 271

Query: 282 GAFADLHAITHLAIGGNPL 300
            +   L  +  L + GN L
Sbjct: 272 -SLGSLTNLRELDLSGNSL 289



 Score = 43.8 bits (103), Expect = 2e-04
 Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 222 DLSNNQIGFLSNFTFSPLSKLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDISMIPE 281
           DL NN I  +        S L +L LS NK++ +    L  L +L+ + L  ND+S +P+
Sbjct: 122 DLDNNNITDIPPLIGLLKSNLKELDLSDNKIESLPSP-LRNLPNLKNLDLSFNDLSDLPK 180

Query: 282 GAFADLHAITHLAIGGN 298
              ++L  + +L + GN
Sbjct: 181 LL-SNLSNLNNLDLSGN 196



 Score = 32.2 bits (73), Expect = 0.77
 Identities = 17/63 (26%), Positives = 31/63 (49%), Gaps = 4/63 (6%)

Query: 97  TRVDCSQRGLKEIPKDIPLYT--TELILNDNEIGKIKSDGLFGRLPNLIKLDLRRNQVTG 154
             +D S   + ++P +I L +   EL L++N I  I+       L NL  L+L  N++  
Sbjct: 189 NNLDLSGNKISDLPPEIELLSALEELDLSNNSI--IELLSSLSNLKNLSGLELSNNKLED 246

Query: 155 IED 157
           + +
Sbjct: 247 LPE 249


>gnl|CDD|214507 smart00082, LRRCT, Leucine rich repeat C-terminal domain. 
          Length = 51

 Score = 45.1 bits (107), Expect = 1e-06
 Identities = 20/52 (38%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 298 NPLYCDCGLAWLSEWVKRDYV--EPGIARCTEPASMRDKLLLTAPASSFQCK 347
           NP  CDC L WL  W++ +    +P   RC  P+S+R   LL    S F+C 
Sbjct: 1   NPFICDCELRWLLRWLQANEHLQDPVDLRCASPSSLRGP-LLELLHSEFKCP 51


>gnl|CDD|214470 smart00013, LRRNT, Leucine rich repeat N-terminal domain. 
          Length = 33

 Score = 42.7 bits (101), Expect = 8e-06
 Identities = 20/33 (60%), Positives = 24/33 (72%)

Query: 87  ACPSGCSCDGTRVDCSQRGLKEIPKDIPLYTTE 119
           ACP+ C+C GT VDCS RGL E+P D+P  TT 
Sbjct: 1   ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 42.6 bits (101), Expect = 9e-06
 Identities = 16/38 (42%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 513 NVDDCV-NHMCQNGGTCVDGINDYTCKCEGDFVGKFCE 549
           ++D+C   + CQNGGTCV+ +  Y C C   + G+ CE
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38



 Score = 41.9 bits (99), Expect = 2e-05
 Identities = 16/38 (42%), Positives = 25/38 (65%), Gaps = 1/38 (2%)

Query: 435 NIDDCLS-HKCENNATCVDLIQAYECRCAPGFQGEFCQ 471
           +ID+C S + C+N  TCV+ + +Y C C PG+ G  C+
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38



 Score = 40.3 bits (95), Expect = 5e-05
 Identities = 14/32 (43%), Positives = 19/32 (59%)

Query: 480 EYNPCRNGARCLDHFSHYTCECTPGFSGVNCT 511
             NPC+NG  C++    Y C C PG++G NC 
Sbjct: 7   SGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38



 Score = 39.9 bits (94), Expect = 7e-05
 Identities = 21/38 (55%), Positives = 23/38 (60%), Gaps = 2/38 (5%)

Query: 397 IDACY-GNPCRNSGTCKLLEEGRFNCECAPGYTGDRCE 433
           ID C  GNPC+N GTC     G + C C PGYTG  CE
Sbjct: 2   IDECASGNPCQNGGTCVNTV-GSYRCSCPPGYTGRNCE 38



 Score = 34.5 bits (80), Expect = 0.006
 Identities = 14/31 (45%), Positives = 18/31 (58%), Gaps = 1/31 (3%)

Query: 364 SPCTNQGVCEPLPERQYQCRCTPGYHGQHCE 394
           +PC N G C       Y+C C PGY G++CE
Sbjct: 9   NPCQNGGTCVNTVGS-YRCSCPPGYTGRNCE 38



 Score = 33.8 bits (78), Expect = 0.011
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 1  NGGTCVDGINDYTCKCEGDFVGKFCE 26
          NGGTCV+ +  Y C C   + G+ CE
Sbjct: 13 NGGTCVNTVGSYRCSCPPGYTGRNCE 38


>gnl|CDD|201808 pfam01462, LRRNT, Leucine rich repeat N-terminal domain.  Leucine
           Rich Repeats pfam00560 are short sequence motifs present
           in a number of proteins with diverse functions and
           cellular locations. Leucine Rich Repeats are often
           flanked by cysteine rich domains. This domain is often
           found at the N-terminus of tandem leucine rich repeats.
          Length = 28

 Score = 39.5 bits (93), Expect = 8e-05
 Identities = 18/28 (64%), Positives = 22/28 (78%)

Query: 87  ACPSGCSCDGTRVDCSQRGLKEIPKDIP 114
           ACP+ C C GT VDCS RGL E+P+D+P
Sbjct: 1   ACPAPCVCSGTVVDCSGRGLTEVPRDLP 28


>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies).  Leucine rich
           repeats are short sequence motifs present in a number of
           proteins with diverse functions and cellular locations.
           These repeats are usually involved in protein-protein
           interactions. Each Leucine Rich Repeat is composed of a
           beta-alpha unit. These units form elongated non-globular
           structures. Leucine Rich Repeats are often flanked by
           cysteine rich domains.
          Length = 43

 Score = 39.8 bits (94), Expect = 9e-05
 Identities = 13/39 (33%), Positives = 20/39 (51%), Gaps = 3/39 (7%)

Query: 118 TELILNDNEIGKIKSDGLFGRLPNLIKLDLRRNQVTGIE 156
             L L++N+I  +        LPNL  LDL  N++T + 
Sbjct: 4   ETLDLSNNQITDLPP---LSNLPNLETLDLSGNKITDLS 39



 Score = 39.4 bits (93), Expect = 1e-04
 Identities = 12/41 (29%), Positives = 20/41 (48%), Gaps = 2/41 (4%)

Query: 241 KLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDISMIPE 281
            L  L LS N++  +    L+ L +L  + L GN I+ +  
Sbjct: 2   NLETLDLSNNQITDLP--PLSNLPNLETLDLSGNKITDLSP 40



 Score = 35.5 bits (83), Expect = 0.002
 Identities = 16/35 (45%), Positives = 18/35 (51%), Gaps = 2/35 (5%)

Query: 222 DLSNNQIGFLSNFTFSPLSKLSQLILSYNKLQCID 256
           DLSNNQI  L     S L  L  L LS NK+  + 
Sbjct: 7   DLSNNQITDLPPL--SNLPNLETLDLSGNKITDLS 39



 Score = 32.5 bits (75), Expect = 0.033
 Identities = 9/37 (24%), Positives = 18/37 (48%), Gaps = 2/37 (5%)

Query: 264 KSLRIISLHGNDISMIPEGAFADLHAITHLAIGGNPL 300
            +L  + L  N I+ +P    ++L  +  L + GN +
Sbjct: 1   TNLETLDLSNNQITDLPP--LSNLPNLETLDLSGNKI 35


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 37.6 bits (88), Expect = 5e-04
 Identities = 16/39 (41%), Positives = 25/39 (64%), Gaps = 2/39 (5%)

Query: 513 NVDDCV-NHMCQNGGTCVDGINDYTCKCEGDFV-GKFCE 549
           ++D+C   + CQNGGTCV+ +  Y C+C   +  G+ CE
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39



 Score = 36.8 bits (86), Expect = 8e-04
 Identities = 16/39 (41%), Positives = 25/39 (64%), Gaps = 2/39 (5%)

Query: 435 NIDDCLS-HKCENNATCVDLIQAYECRCAPGFQ-GEFCQ 471
           +ID+C S + C+N  TCV+ + +Y C C PG+  G  C+
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39



 Score = 35.3 bits (82), Expect = 0.003
 Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 480 EYNPCRNGARCLDHFSHYTCECTPGFS-GVNCT 511
             NPC+NG  C++    Y CEC PG++ G NC 
Sbjct: 7   SGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39



 Score = 32.6 bits (75), Expect = 0.030
 Identities = 22/39 (56%), Positives = 24/39 (61%), Gaps = 3/39 (7%)

Query: 397 IDACY-GNPCRNSGTCKLLEEGRFNCECAPGYT-GDRCE 433
           ID C  GNPC+N GTC     G + CEC PGYT G  CE
Sbjct: 2   IDECASGNPCQNGGTCVNTV-GSYRCECPPGYTDGRNCE 39



 Score = 29.5 bits (67), Expect = 0.31
 Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 2/33 (6%)

Query: 363 KSPCTNQGVCEPLPERQYQCRCTPGYH-GQHCE 394
            +PC N G C       Y+C C PGY  G++CE
Sbjct: 8   GNPCQNGGTCVNTVG-SYRCECPPGYTDGRNCE 39



 Score = 28.8 bits (65), Expect = 0.58
 Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 1/27 (3%)

Query: 1  NGGTCVDGINDYTCKCEGDFV-GKFCE 26
          NGGTCV+ +  Y C+C   +  G+ CE
Sbjct: 13 NGGTCVNTVGSYRCECPPGYTDGRNCE 39


>gnl|CDD|197688 smart00370, LRR, Leucine-rich repeats, outliers. 
          Length = 24

 Score = 36.6 bits (86), Expect = 0.001
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 263 LKSLRIISLHGNDISMIPEGAFAD 286
           L +LR + L  N +S +P GAF  
Sbjct: 1   LPNLRELDLSNNQLSSLPPGAFQG 24



 Score = 34.2 bits (80), Expect = 0.006
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 139 LPNLIKLDLRRNQVTGIEDNAFEG 162
           LPNL +LDL  NQ++ +   AF+G
Sbjct: 1   LPNLRELDLSNNQLSSLPPGAFQG 24



 Score = 28.5 bits (65), Expect = 0.64
 Identities = 9/24 (37%), Positives = 12/24 (50%)

Query: 239 LSKLSQLILSYNKLQCIDRDALAG 262
           L  L +L LS N+L  +   A  G
Sbjct: 1   LPNLRELDLSNNQLSSLPPGAFQG 24


>gnl|CDD|197687 smart00369, LRR_TYP, Leucine-rich repeats, typical (most populated)
           subfamily. 
          Length = 24

 Score = 36.6 bits (86), Expect = 0.001
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 263 LKSLRIISLHGNDISMIPEGAFAD 286
           L +LR + L  N +S +P GAF  
Sbjct: 1   LPNLRELDLSNNQLSSLPPGAFQG 24



 Score = 34.2 bits (80), Expect = 0.006
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 139 LPNLIKLDLRRNQVTGIEDNAFEG 162
           LPNL +LDL  NQ++ +   AF+G
Sbjct: 1   LPNLRELDLSNNQLSSLPPGAFQG 24



 Score = 28.5 bits (65), Expect = 0.64
 Identities = 9/24 (37%), Positives = 12/24 (50%)

Query: 239 LSKLSQLILSYNKLQCIDRDALAG 262
           L  L +L LS N+L  +   A  G
Sbjct: 1   LPNLRELDLSNNQLSSLPPGAFQG 24


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
           between noise and signal. pfam00053 is very similar, but
           has 8 instead of 6 conserved cysteines. Includes some
           cytokine receptors. The EGF domain misses the N-terminus
           regions of the Ca2+ binding EGF domains (this is the
           main reason of discrepancy between swiss-prot domain
           start/end and Pfam). The family is hard to model due to
           many similar but different sub-types of EGF domains.
           Pfam certainly misses a number of EGF domains.
          Length = 32

 Score = 34.7 bits (80), Expect = 0.004
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 482 NPCRNGARCLDHFSHYTCECTPGFSG 507
           NPC NG  C+D    YTCEC  G++G
Sbjct: 5   NPCSNGGTCVDTPGGYTCECPEGYTG 30



 Score = 32.0 bits (73), Expect = 0.040
 Identities = 11/26 (42%), Positives = 13/26 (50%)

Query: 444 CENNATCVDLIQAYECRCAPGFQGEF 469
           C N  TCVD    Y C C  G+ G+ 
Sbjct: 7   CSNGGTCVDTPGGYTCECPEGYTGKR 32



 Score = 32.0 bits (73), Expect = 0.043
 Identities = 15/29 (51%), Positives = 18/29 (62%)

Query: 519 NHMCQNGGTCVDGINDYTCKCEGDFVGKF 547
           N+ C NGGTCVD    YTC+C   + GK 
Sbjct: 4   NNPCSNGGTCVDTPGGYTCECPEGYTGKR 32



 Score = 28.9 bits (65), Expect = 0.44
 Identities = 16/29 (55%), Positives = 17/29 (58%), Gaps = 1/29 (3%)

Query: 403 NPCRNSGTCKLLEEGRFNCECAPGYTGDR 431
           NPC N GTC     G + CEC  GYTG R
Sbjct: 5   NPCSNGGTCVDTPGG-YTCECPEGYTGKR 32



 Score = 28.6 bits (64), Expect = 0.67
 Identities = 12/28 (42%), Positives = 14/28 (50%), Gaps = 1/28 (3%)

Query: 365 PCTNQGVCEPLPERQYQCRCTPGYHGQH 392
           PC+N G C   P   Y C C  GY G+ 
Sbjct: 6   PCSNGGTCVDTPGG-YTCECPEGYTGKR 32



 Score = 26.6 bits (59), Expect = 3.5
 Identities = 13/24 (54%), Positives = 15/24 (62%)

Query: 1  NGGTCVDGINDYTCKCEGDFVGKF 24
          NGGTCVD    YTC+C   + GK 
Sbjct: 9  NGGTCVDTPGGYTCECPEGYTGKR 32



 Score = 26.6 bits (59), Expect = 3.7
 Identities = 12/36 (33%), Positives = 16/36 (44%), Gaps = 6/36 (16%)

Query: 559 PQTSPCQHHDCKNGICFQPQGSNDYLCKCAPGYSGK 594
              +PC +     G C    G   Y C+C  GY+GK
Sbjct: 2   SPNNPCSNG----GTCVDTPG--GYTCECPEGYTGK 31



 Score = 25.5 bits (56), Expect = 9.1
 Identities = 11/35 (31%), Positives = 15/35 (42%), Gaps = 6/35 (17%)

Query: 36 PQTSPCQHHDCKNGICFQPQGSNDYLCKCAPGYSG 70
             +PC +     G C    G   Y C+C  GY+G
Sbjct: 2  SPNNPCSNG----GTCVDTPG--GYTCECPEGYTG 30


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) presents in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at  least  one 
           is  present  in  most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 34.4 bits (79), Expect = 0.007
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 440 LSHKCENNATCVDLIQAYECRCAPGFQGEF 469
            S+ C N  TCV+   +Y C C PG+ G+ 
Sbjct: 4   ASNPCSNGGTCVNTPGSYRCVCPPGYTGDR 33



 Score = 33.6 bits (77), Expect = 0.011
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query: 482 NPCRNGARCLDHFSHYTCECTPGFSGVN 509
           NPC NG  C++    Y C C PG++G  
Sbjct: 6   NPCSNGGTCVNTPGSYRCVCPPGYTGDR 33



 Score = 32.1 bits (73), Expect = 0.043
 Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 519 NHMCQNGGTCVDGINDYTCKCEGDFVGKF 547
           ++ C NGGTCV+    Y C C   + G  
Sbjct: 5   SNPCSNGGTCVNTPGSYRCVCPPGYTGDR 33



 Score = 30.9 bits (70), Expect = 0.12
 Identities = 18/32 (56%), Positives = 19/32 (59%), Gaps = 2/32 (6%)

Query: 403 NPCRNSGTCKLLEEGRFNCECAPGYTGD-RCE 433
           NPC N GTC     G + C C PGYTGD  CE
Sbjct: 6   NPCSNGGTCVNTP-GSYRCVCPPGYTGDRSCE 36



 Score = 28.2 bits (63), Expect = 1.1
 Identities = 15/32 (46%), Positives = 18/32 (56%), Gaps = 2/32 (6%)

Query: 364 SPCTNQGVCEPLPERQYQCRCTPGYHGQ-HCE 394
           +PC+N G C   P   Y+C C PGY G   CE
Sbjct: 6   NPCSNGGTCVNTPG-SYRCVCPPGYTGDRSCE 36



 Score = 26.7 bits (59), Expect = 3.1
 Identities = 10/24 (41%), Positives = 12/24 (50%)

Query: 1  NGGTCVDGINDYTCKCEGDFVGKF 24
          NGGTCV+    Y C C   + G  
Sbjct: 10 NGGTCVNTPGSYRCVCPPGYTGDR 33


>gnl|CDD|178695 PLN03150, PLN03150, hypothetical protein; Provisional.
          Length = 623

 Score = 37.9 bits (88), Expect = 0.014
 Identities = 20/56 (35%), Positives = 30/56 (53%)

Query: 222 DLSNNQIGFLSNFTFSPLSKLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDIS 277
           +LS N I      +   ++ L  L LSYN       ++L  L SLRI++L+GN +S
Sbjct: 448 NLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGNSLS 503


>gnl|CDD|215061 PLN00113, PLN00113, leucine-rich repeat receptor-like protein
           kinase; Provisional.
          Length = 968

 Score = 36.4 bits (84), Expect = 0.052
 Identities = 28/80 (35%), Positives = 38/80 (47%), Gaps = 2/80 (2%)

Query: 222 DLSNNQIGFLSNFTFSPLSKLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDIS-MIP 280
           DLS NQ           LS+L QL LS NKL     D L+  K L  + L  N +S  IP
Sbjct: 481 DLSRNQFSGAVPRKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIP 540

Query: 281 EGAFADLHAITHLAIGGNPL 300
              F+++  ++ L +  N L
Sbjct: 541 AS-FSEMPVLSQLDLSQNQL 559


>gnl|CDD|214598 smart00282, LamG, Laminin G domain. 
          Length = 132

 Score = 33.5 bits (77), Expect = 0.087
 Identities = 13/44 (29%), Positives = 20/44 (45%), Gaps = 1/44 (2%)

Query: 184 VLFYDGRAGEC-YIAVELFSSRIRVSYYVGNYPVSTMYRDLSNN 226
           +L Y G  G   Y+A+EL   R+ + Y +G+ P          N
Sbjct: 14  LLLYAGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLN 57


>gnl|CDD|215681 pfam00054, Laminin_G_1, Laminin G domain. 
          Length = 131

 Score = 33.4 bits (77), Expect = 0.10
 Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 184 VLFYDGRAGE-CYIAVELFSSRIRVSYYVGNYPVSTMYRDLSNN 226
           +L Y+G   E  ++A+EL   R+ VSY +G+        D  N+
Sbjct: 9   LLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGDKLND 52


>gnl|CDD|176164 cd08475, PBP2_CrgA_like_6, The C-terminal substrate binding domain
           of an uncharacterized LysR-type transcriptional
           regulator CrgA-like, contains the type 2 periplasmic
           binding fold.  This CD represents the substrate binding
           domain of an uncharacterized LysR-type transcriptional
           regulator (LTTR) CrgA-like 6. The LTTRs are acting as
           both auto-repressors and activators of target promoters,
           controlling operons involved in a wide variety of
           cellular processes such as amino acid biosynthesis, CO2
           fixation, antibiotic resistance, degradation of aromatic
           compounds, nodule formation of nitrogen-fixing bacteria,
           and synthesis of virulence factors, to name a few. In
           contrast to the tetrameric form of other LTTRs, CrgA
           from Neisseria meningitides assembles into an octameric
           ring, which can bind up to four 63-bp DNA
           oligonucleotides. Phylogenetic cluster analysis showed
           that the CrgA-like regulators form a subclass of the
           LTTRs that function as octamers. The CrgA is an
           auto-repressor of its own gene and activates the
           expression of the mdaB gene which coding for an
           NADPH-quinone reductase and that its action is increased
           by MBL (alpha-methylene-gamma-butyrolactone), an inducer
           of NADPH-quinone oxidoreductase.  The structural
           topology of this substrate-binding domain is most
           similar to that of the type 2 periplasmic binding
           proteins (PBP2), which are responsible for the uptake of
           a variety of substrates such as phosphate, sulfate,
           polysaccharides, lysine/arginine/ornithine, and
           histidine. The PBP2 bind their ligand in the cleft
           between these domains in a manner resembling a Venus
           flytrap. After binding their specific ligand with high
           affinity, they can interact with a cognate membrane
           transport complex comprised of two integral membrane
           domains and two cytoplasmically located ATPase domains.
           This interaction triggers the ligand translocation
           across the cytoplasmic membrane energized by ATP
           hydrolysis.
          Length = 199

 Score = 33.7 bits (78), Expect = 0.14
 Identities = 12/39 (30%), Positives = 18/39 (46%), Gaps = 6/39 (15%)

Query: 283 AFADLHAITHLAIGGNPLYCDCGLAWLSEWVKRDYVEPG 321
            F D  AI   A+ G       G+A L  W+  D+++ G
Sbjct: 130 QFDDGEAIADAALAG------LGIAQLPTWLVADHLQRG 162


>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
           Pvs28.  This family consists of several ookinete surface
           protein (Pvs28) from several species of Plasmodium.
           Pvs25 and Pvs28 are expressed on the surface of
           ookinetes. These proteins are potential candidates for
           vaccine and induce antibodies that block the infectivity
           of Plasmodium vivax in immunised animals.
          Length = 196

 Score = 31.3 bits (71), Expect = 1.1
 Identities = 31/129 (24%), Positives = 43/129 (33%), Gaps = 10/129 (7%)

Query: 365 PCTNQGVCEPLP----ERQYQCRCTPGYHGQHCEFMIDACYGNPCRNSGTCKLLEEGRFN 420
            C     C        E+  +C C  GY       + + C    C  SG C +      N
Sbjct: 51  VCGEYATCINQANKAEEKALKCGCINGYTLSQGVCVPNKCNNKVC-GSGKCIVDPANPNN 109

Query: 421 --CECAPGYTGD---RCEINIDDCLSHKCENNATCVDLIQAYECRCAPGFQGEFCQTKTP 475
             C C  G   D   +C    +   S KC+ N  C  +   YEC C  GF G+   T + 
Sbjct: 110 TTCSCNIGKVPDQNGKCTKTGETKCSLKCKENEECKLVGGYYECVCKEGFPGDGGGTGSG 169

Query: 476 FCAGEYNPC 484
                 +  
Sbjct: 170 GPPTSSSVM 178



 Score = 30.9 bits (70), Expect = 1.4
 Identities = 20/93 (21%), Positives = 33/93 (35%), Gaps = 10/93 (10%)

Query: 457 YECRCAPGF--QGEFCQTKTPFCA---GEYNPCRNGARCLDHFSH-----YTCECTPGFS 506
           +EC+C  G+  + E    +   C         C   A C++  +        C C  G++
Sbjct: 20  FECKCNEGYVLKNENTCEEKVKCDKLENVNKVCGEYATCINQANKAEEKALKCGCINGYT 79

Query: 507 GVNCTVNVDDCVNHMCQNGGTCVDGINDYTCKC 539
                   + C N +C +G   VD  N     C
Sbjct: 80  LSQGVCVPNKCNNKVCGSGKCIVDPANPNNTTC 112


>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain. 
          Length = 42

 Score = 27.7 bits (62), Expect = 1.7
 Identities = 12/33 (36%), Positives = 19/33 (57%), Gaps = 2/33 (6%)

Query: 436 IDDCLS--HKCENNATCVDLIQAYECRCAPGFQ 466
           +D+C    H C  N  CV+ I ++EC C  G++
Sbjct: 2   VDECADGTHNCPANTVCVNTIGSFECVCPDGYE 34


>gnl|CDD|216930 pfam02210, Laminin_G_2, Laminin G domain.  This family includes the
           Thrombospondin N-terminal-like domain, a Laminin G
           subfamily.
          Length = 124

 Score = 29.7 bits (67), Expect = 1.8
 Identities = 12/47 (25%), Positives = 23/47 (48%), Gaps = 3/47 (6%)

Query: 184 VLFY-DGRAGECYIAVELFSSRIRVSYYVGNYPVSTMYRD--LSNNQ 227
           +L Y  G  G  ++A+EL   R+ + Y +G+     +     L++ Q
Sbjct: 9   LLLYAGGEDGLDFLALELEDGRLVLRYDLGSGGSVLLLSGKKLNDGQ 55


>gnl|CDD|185268 PRK15370, PRK15370, E3 ubiquitin-protein ligase SlrP; Provisional.
          Length = 754

 Score = 31.2 bits (70), Expect = 1.8
 Identities = 19/61 (31%), Positives = 29/61 (47%), Gaps = 6/61 (9%)

Query: 240 SKLSQLILSYNKLQCIDRDALAGLKSLRIISLHGNDISMIPEGAFADLHAITHLAIGGNP 299
           S L  L L +NK+ C+  +    L+ L   S++ N I  +P    +    ITHL +  N 
Sbjct: 262 SALQSLDLFHNKISCLPENLPEELRYL---SVYDNSIRTLPAHLPS---GITHLNVQSNS 315

Query: 300 L 300
           L
Sbjct: 316 L 316



 Score = 30.8 bits (69), Expect = 2.7
 Identities = 21/59 (35%), Positives = 29/59 (49%), Gaps = 4/59 (6%)

Query: 97  TRVDCSQRGLKEIPKDIPLYTTELILNDNEIGKIKSDGLFGRLPNLIKLDLRRNQVTGI 155
           T +     GL  IP  IP   T LIL++NE+  +  + L G   N+  L    NQ+T I
Sbjct: 181 TELRLKILGLTTIPACIPEQITTLILDNNELKSL-PENLQG---NIKTLYANSNQLTSI 235


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 27.1 bits (60), Expect = 2.2
 Identities = 10/27 (37%), Positives = 14/27 (51%), Gaps = 1/27 (3%)

Query: 441 SHKCENNATCVDLIQAYECRCAPGFQG 467
              C N  TC++   +Y C C PG+ G
Sbjct: 5   GGPCSNG-TCINTPGSYTCSCPPGYTG 30



 Score = 27.1 bits (60), Expect = 2.3
 Identities = 13/32 (40%), Positives = 15/32 (46%), Gaps = 2/32 (6%)

Query: 565 QHHDCKNGICFQPQGSNDYLCKCAPGYSGKES 596
               C NG C    GS  Y C C PGY+G + 
Sbjct: 4   SGGPCSNGTCINTPGS--YTCSCPPGYTGDKR 33



 Score = 27.1 bits (60), Expect = 2.4
 Identities = 13/31 (41%), Positives = 15/31 (48%), Gaps = 2/31 (6%)

Query: 42 QHHDCKNGICFQPQGSNDYLCKCAPGYSGVE 72
              C NG C    GS  Y C C PGY+G +
Sbjct: 4  SGGPCSNGTCINTPGS--YTCSCPPGYTGDK 32



 Score = 27.1 bits (60), Expect = 2.4
 Identities = 12/28 (42%), Positives = 16/28 (57%), Gaps = 1/28 (3%)

Query: 482 NPCRNGARCLDHFSHYTCECTPGFSGVN 509
            PC NG  C++    YTC C PG++G  
Sbjct: 6   GPCSNG-TCINTPGSYTCSCPPGYTGDK 32



 Score = 25.6 bits (56), Expect = 7.7
 Identities = 16/30 (53%), Positives = 18/30 (60%), Gaps = 2/30 (6%)

Query: 402 GNPCRNSGTCKLLEEGRFNCECAPGYTGDR 431
           G PC N GTC     G + C C PGYTGD+
Sbjct: 5   GGPCSN-GTCINTP-GSYTCSCPPGYTGDK 32


>gnl|CDD|204999 pfam12661, hEGF, Human growth factor-like EGF.  hEGF, or human
           growth factor-like EGF, domains have six conserved
           residues disulfide-bonded into the characteristic
           'ababcc' pattern. They are involved in growth and
           proliferation of cells, in proteins of the Notch/Delta
           pathway, neurogulin and selectins. hEGFs are also found
           in mosaic proteins with four-disulfide laminin EGFs such
           as aggrecan and perlecan. The core fold of the EGF
           domain consists of two small beta-hairpins packed
           against each other. Two major structural variants have
           been identified based on the structural context of the
           C-terminal Cys residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In hEGFs the
           C-terminal thiol resides in the beta-turn, resulting in
           shorter loop-lengths between the Cys residues of
           disulfide 'c', typically C[8-9]XC. These shorter
           loop-lengths are also typical of the four-disulfide EGF
           domains, laminin ad integrin. Tandem hEGF domains have
           six linking residues between terminal cysteines of
           adjacent domains. hEGF domains may or may not bind
           calcium in the linker region. hEGF domains with the
           consensus motif CXD4X[F,Y]XCXC are hydroxylated
           exclusively in the Asp residue.
          Length = 13

 Score = 26.5 bits (60), Expect = 2.5
 Identities = 9/12 (75%), Positives = 10/12 (83%)

Query: 421 CECAPGYTGDRC 432
           C+C PGYTG RC
Sbjct: 2   CQCPPGYTGPRC 13


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 26.7 bits (60), Expect = 3.8
 Identities = 9/26 (34%), Positives = 13/26 (50%)

Query: 442 HKCENNATCVDLIQAYECRCAPGFQG 467
             C  NATC +   ++ C C  G+ G
Sbjct: 6   GGCHPNATCTNTGGSFTCTCKSGYTG 31


>gnl|CDD|215343 PLN02638, PLN02638, cellulose synthase A (UDP-forming), catalytic
           subunit.
          Length = 1079

 Score = 30.3 bits (68), Expect = 4.2
 Identities = 19/64 (29%), Positives = 27/64 (42%), Gaps = 3/64 (4%)

Query: 523 QNGGTCVDGINDYTCKCEGDFVGKFCEIAPFVA---MMYPQTSPCQHHDCKNGICFQPQG 579
           + G   +       C+  GD VGK  +  PFVA     +P   PC  ++ K+G    PQ 
Sbjct: 6   ETGAKPMKHGGGQVCQICGDNVGKTVDGEPFVACDVCAFPVCRPCYEYERKDGNQSCPQC 65

Query: 580 SNDY 583
              Y
Sbjct: 66  KTKY 69



 Score = 29.9 bits (67), Expect = 5.4
 Identities = 19/62 (30%), Positives = 26/62 (41%), Gaps = 3/62 (4%)

Query: 2  GGTCVDGINDYTCKCEGDFVGKFCEIAPFVA---MMYPQTSPCQHHDCKNGICFQPQGSN 58
          G   +       C+  GD VGK  +  PFVA     +P   PC  ++ K+G    PQ   
Sbjct: 8  GAKPMKHGGGQVCQICGDNVGKTVDGEPFVACDVCAFPVCRPCYEYERKDGNQSCPQCKT 67

Query: 59 DY 60
           Y
Sbjct: 68 KY 69


>gnl|CDD|215680 pfam00053, Laminin_EGF, Laminin EGF-like (Domains III and V).  This
           family is like pfam00008 but has 8 conserved cysteines
           instead of six.
          Length = 49

 Score = 26.2 bits (58), Expect = 7.9
 Identities = 12/31 (38%), Positives = 14/31 (45%)

Query: 381 QCRCTPGYHGQHCEFMIDACYGNPCRNSGTC 411
           QC C PG  G+HC+      YG P      C
Sbjct: 19  QCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49


>gnl|CDD|238064 cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease
           inhibitor (RI)-like subfamily. LRRs are 20-29 residue
           sequence motifs present in many proteins that
           participate in protein-protein interactions and have
           different functions and cellular locations. LRRs
           correspond to structural units consisting of a beta
           strand (LxxLxLxxN/CxL conserved pattern) and an alpha
           helix. This alignment contains 12 strands corresponding
           to 11 full repeats, consistent with the extent observed
           in the subfamily acting as Ran GTPase Activating
           Proteins (RanGAP1).
          Length = 319

 Score = 28.9 bits (65), Expect = 8.4
 Identities = 14/57 (24%), Positives = 24/57 (42%), Gaps = 5/57 (8%)

Query: 259 ALAGLKSLRIISLHGNDI----SMIPEGAFADLHAITHLAIGGNPLYCDCGLAWLSE 311
            L    +L ++ L+ N +    +       A L ++  L +G N L  D G A L+ 
Sbjct: 188 GLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLNLGDNNL-TDAGAAALAS 243


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.140    0.472 

Gapped
Lambda     K      H
   0.267   0.0686    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 29,368,236
Number of extensions: 2739643
Number of successful extensions: 1921
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1887
Number of HSP's successfully gapped: 130
Length of query: 604
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 502
Effective length of database: 6,413,494
Effective search space: 3219573988
Effective search space used: 3219573988
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 62 (27.7 bits)