RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9046
         (215 letters)



>gnl|CDD|206637 cd11304, Cadherin_repeat, Cadherin tandem repeat domain.  Cadherins
           are glycoproteins involved in Ca2+-mediated cell-cell
           adhesion. The cadherin repeat domains occur as tandem
           repeats in the extracellular regions, which are thought
           to mediate cell-cell contact when bound to calcium. They
           play numerous roles in cell fate, signalling,
           proliferation, differentiation, and migration; members
           include E-, N-, P-, T-, VE-, CNR-, proto-, and
           FAT-family cadherin, desmocollin, and desmoglein, a
           large variety of domain architectures with varying
           repeat copy numbers. Cadherin-repeat containing proteins
           exist as monomers, homodimers, or heterodimers.
          Length = 98

 Score = 54.6 bits (132), Expect = 3e-10
 Identities = 31/79 (39%), Positives = 46/79 (58%), Gaps = 8/79 (10%)

Query: 142 TYRTSIQENIIITPSPT-LLTLEATDSDE--NPHLTYKILSGNDDDVFAVD-DTGILTIQ 197
           +Y  S+ EN    P  T +LT+ ATD D   N  +TY I+SGN+D +F++D  TG +T  
Sbjct: 1   SYEVSVPEN---APPGTVVLTVSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTA 57

Query: 198 GPL-AKGKTVYNLLVQVSD 215
            PL  + ++ Y L V  +D
Sbjct: 58  KPLDREEQSSYTLTVTATD 76



 Score = 41.1 bits (97), Expect = 3e-05
 Identities = 31/117 (26%), Positives = 44/117 (37%), Gaps = 31/117 (26%)

Query: 29  YLLEIEEEQPIGSLLGKFTALDKDSSIA---LYVLEQDGD--MFSVN----NITVCEFQL 79
           Y + + E  P G+++   +A D DS       Y +    +  +FS++     IT      
Sbjct: 2   YEVSVPENAPPGTVVLTVSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITT---AK 58

Query: 80  SVDLAAADNFQSEYNATIKENSPGDTKVVLVSAEDASTP--CSHFRVNIKVLDVNDN 134
            +D       QS Y  T+              A D   P   S   V I VLDVNDN
Sbjct: 59  PLDREE----QSSYTLTVT-------------ATDGGGPPLSSTATVTITVLDVNDN 98


>gnl|CDD|215665 pfam00028, Cadherin, Cadherin domain. 
          Length = 92

 Score = 47.3 bits (113), Expect = 2e-07
 Identities = 27/77 (35%), Positives = 37/77 (48%), Gaps = 6/77 (7%)

Query: 143 YRTSIQENIIITPSPTLLTLEATDSDENPH--LTYKILSGNDDDVFAVD-DTGILTIQGP 199
           Y  S+ EN        +LT+ ATD+D  P+  + Y IL G     F +D DTG L+   P
Sbjct: 1   YSASVPENA--PVGTEVLTVTATDADLGPNGRIFYSILGGGPGGWFRIDPDTGDLSTTKP 58

Query: 200 L-AKGKTVYNLLVQVSD 215
           L  +    Y L V  +D
Sbjct: 59  LDRESIGEYELTVLATD 75



 Score = 30.7 bits (70), Expect = 0.13
 Identities = 13/50 (26%), Positives = 20/50 (40%), Gaps = 5/50 (10%)

Query: 29 YLLEIEEEQPIGSLLGKFTALDKD----SSIALYVLEQD-GDMFSVNNIT 73
          Y   + E  P+G+ +   TA D D      I   +L    G  F ++  T
Sbjct: 1  YSASVPENAPVGTEVLTVTATDADLGPNGRIFYSILGGGPGGWFRIDPDT 50


>gnl|CDD|214520 smart00112, CA, Cadherin repeats.  Cadherins are glycoproteins
           involved in Ca2+-mediated cell-cell adhesion. Cadherin
           domains occur as repeats in the extracellular regions
           which are thought to mediate cell-cell contact when
           bound to calcium.
          Length = 81

 Score = 43.9 bits (104), Expect = 2e-06
 Identities = 25/56 (44%), Positives = 36/56 (64%), Gaps = 4/56 (7%)

Query: 164 ATDSDE--NPHLTYKILSGNDDDVFAVD-DTGILTIQGPL-AKGKTVYNLLVQVSD 215
           ATD+D   N  +TY ILSGNDD +F++D +TG +T   PL  + +  Y L V+ +D
Sbjct: 2   ATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATD 57



 Score = 30.4 bits (69), Expect = 0.16
 Identities = 15/29 (51%), Positives = 15/29 (51%), Gaps = 2/29 (6%)

Query: 110 VSAEDASTP--CSHFRVNIKVLDVNDNRP 136
           V A D   P   S   V I VLDVNDN P
Sbjct: 53  VEATDGGGPPLSSTATVTITVLDVNDNAP 81


>gnl|CDD|181629 PRK09057, PRK09057, coproporphyrinogen III oxidase; Provisional.
          Length = 380

 Score = 29.1 bits (66), Expect = 1.5
 Identities = 14/37 (37%), Positives = 17/37 (45%), Gaps = 2/37 (5%)

Query: 29  YLLEIEEEQPIGSLL--GKFTALDKDSSIALYVLEQD 63
           Y L IEE      L   GK    D+D +  LY L Q+
Sbjct: 190 YQLTIEEGTAFYGLHAAGKLILPDEDLAADLYELTQE 226


>gnl|CDD|133405 cd04778, HTH_MerR-like_sg2, Helix-Turn-Helix DNA binding domain of
           putative transcription regulators from the MerR
           superfamily.  Putative helix-turn-helix (HTH) MerR-like
           transcription regulators (subgroup 2). Based on sequence
           similarity, these proteins are predicted to function as
           transcription regulators that mediate responses to
           stress in eubacteria. They belong to the MerR
           superfamily of transcription regulators that promote
           transcription of various stress regulons by
           reconfiguring the operator sequence located between the
           -35 and -10 promoter elements. A typical MerR regulator
           is comprised of two distinct domains that harbor the
           regulatory (effector-binding) site and the active
           (DNA-binding) site. Their N-terminal domains are
           homologous and contain a DNA-binding winged HTH motif,
           while the C-terminal domains are often dissimilar and
           bind specific coactivator molecules such as metal ions,
           drugs, and organic substrates.
          Length = 219

 Score = 28.5 bits (64), Expect = 2.0
 Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 6/52 (11%)

Query: 20  PPWTRDEPNYLLEIEEEQPIGSLLGKFTALDKDSSIALYVLEQDGDMFSVNN 71
            PW+ +EP   L  E +    S  G   AL +  ++AL +LE DGD   V +
Sbjct: 88  GPWSTEEPGTALLAELQ----SEFGDERALAR--AVALGLLEPDGDRVLVPS 133


>gnl|CDD|177875 PLN02230, PLN02230, phosphoinositide phospholipase C 4.
          Length = 598

 Score = 28.1 bits (62), Expect = 3.5
 Identities = 32/127 (25%), Positives = 56/127 (44%), Gaps = 15/127 (11%)

Query: 96  TIKENSPGDTKV-VLVSAEDASTPCSHFRVNIKVLDV---------NDNRPEFKSKTYRT 145
           +IK N+   +K  V+++ ED  TP   F+V   +            ++   EF S     
Sbjct: 191 SIKANAFAISKYPVIITLEDHLTPKLQFKVAKMITQTFGDMLYYHDSEGCQEFPSPE--- 247

Query: 146 SIQENIIITPSPTLLTLEATDSDENPHLTYKILSGNDDDVFAVDDTGILTIQGPLAKGKT 205
            ++E I+I+  P    LEA D+ E      +    +D+DV+  +   +++ Q  L K  +
Sbjct: 248 ELKEKILISTKPPKEYLEANDAKEKD--NGEKGKDSDEDVWGKEPEDLISTQSDLDKVTS 305

Query: 206 VYNLLVQ 212
             N L Q
Sbjct: 306 SVNDLNQ 312


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 27.3 bits (61), Expect = 8.1
 Identities = 14/72 (19%), Positives = 25/72 (34%), Gaps = 7/72 (9%)

Query: 31  LEIEEEQPIGSLLGKFTALDKDSSIALY---VLEQDGDMFSVNNITVCEFQLSVDLAAAD 87
           LE + ++    L  + +A +   +       +LE     F   +I     QL  DL    
Sbjct: 300 LEDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAFEDADIE----QLQADLDQLP 355

Query: 88  NFQSEYNATIKE 99
           + +SE       
Sbjct: 356 SIRSELEEVEAR 367


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.133    0.377 

Gapped
Lambda     K      H
   0.267   0.0710    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,753,617
Number of extensions: 994212
Number of successful extensions: 597
Number of sequences better than 10.0: 1
Number of HSP's gapped: 595
Number of HSP's successfully gapped: 19
Length of query: 215
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 122
Effective length of database: 6,812,680
Effective search space: 831146960
Effective search space used: 831146960
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 57 (25.8 bits)