RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5886
         (184 letters)



>gnl|CDD|206637 cd11304, Cadherin_repeat, Cadherin tandem repeat domain.  Cadherins
           are glycoproteins involved in Ca2+-mediated cell-cell
           adhesion. The cadherin repeat domains occur as tandem
           repeats in the extracellular regions, which are thought
           to mediate cell-cell contact when bound to calcium. They
           play numerous roles in cell fate, signalling,
           proliferation, differentiation, and migration; members
           include E-, N-, P-, T-, VE-, CNR-, proto-, and
           FAT-family cadherin, desmocollin, and desmoglein, a
           large variety of domain architectures with varying
           repeat copy numbers. Cadherin-repeat containing proteins
           exist as monomers, homodimers, or heterodimers.
          Length = 98

 Score = 74.7 bits (184), Expect = 5e-18
 Identities = 32/68 (47%), Positives = 38/68 (55%), Gaps = 4/68 (5%)

Query: 106 PYEFSVPEGQD-GLAVGRVHATDADEGQNAVVYYSV---PEDIPFVVDANTGDIRTNKAL 161
            YE SVPE    G  V  V ATD D G+N  V YS+    ED  F +D +TG+I T K L
Sbjct: 1   SYEVSVPENAPPGTVVLTVSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTAKPL 60

Query: 162 DYEKQQLY 169
           D E+Q  Y
Sbjct: 61  DREEQSSY 68



 Score = 59.2 bits (144), Expect = 4e-12
 Identities = 34/87 (39%), Positives = 43/87 (49%), Gaps = 21/87 (24%)

Query: 30 GVTLMQVKATDKDSGKFGLVEYRLLDNNTDAASYFNLDPAT-----------EHQGPIQ- 77
          G  ++ V ATD DSG+ G V Y ++  N D    F++DP+T           E Q     
Sbjct: 13 GTVVLTVSATDPDSGENGEVTYSIVSGNED--GLFSIDPSTGEITTAKPLDREEQSSYTL 70

Query: 78 -----DEGG--LSSTATVNIRVTDIND 97
               D GG  LSSTATV I V D+ND
Sbjct: 71 TVTATDGGGPPLSSTATVTITVLDVND 97


>gnl|CDD|215665 pfam00028, Cadherin, Cadherin domain. 
          Length = 92

 Score = 67.7 bits (166), Expect = 2e-15
 Identities = 29/67 (43%), Positives = 35/67 (52%), Gaps = 4/67 (5%)

Query: 107 YEFSVPEGQ-DGLAVGRVHATDADEGQNAVVYYSVPEDIP---FVVDANTGDIRTNKALD 162
           Y  SVPE    G  V  V ATDAD G N  ++YS+    P   F +D +TGD+ T K LD
Sbjct: 1   YSASVPENAPVGTEVLTVTATDADLGPNGRIFYSILGGGPGGWFRIDPDTGDLSTTKPLD 60

Query: 163 YEKQQLY 169
            E    Y
Sbjct: 61  RESIGEY 67



 Score = 35.4 bits (82), Expect = 0.003
 Identities = 26/83 (31%), Positives = 32/83 (38%), Gaps = 21/83 (25%)

Query: 30 GVTLMQVKATDKDSGKFGLVEYRLLDNNTDAASYFNLDPAT-----------EHQGP--- 75
          G  ++ V ATD D G  G + Y +L        +F +DP T           E  G    
Sbjct: 12 GTEVLTVTATDADLGPNGRIFYSIL--GGGPGGWFRIDPDTGDLSTTKPLDRESIGEYEL 69

Query: 76 ---IQDEG--GLSSTATVNIRVT 93
               D G   LSST TV I V 
Sbjct: 70 TVLATDSGGPPLSSTTTVTITVL 92


>gnl|CDD|214520 smart00112, CA, Cadherin repeats.  Cadherins are glycoproteins
           involved in Ca2+-mediated cell-cell adhesion. Cadherin
           domains occur as repeats in the extracellular regions
           which are thought to mediate cell-cell contact when
           bound to calcium.
          Length = 81

 Score = 55.8 bits (135), Expect = 5e-11
 Identities = 23/49 (46%), Positives = 29/49 (59%), Gaps = 3/49 (6%)

Query: 124 HATDADEGQNAVVYYSV---PEDIPFVVDANTGDIRTNKALDYEKQQLY 169
            ATDAD G+N  V YS+    +D  F +D  TG+I T K LD E+Q  Y
Sbjct: 1   SATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKPLDREEQPEY 49



 Score = 50.0 bits (120), Expect = 9e-09
 Identities = 33/83 (39%), Positives = 41/83 (49%), Gaps = 21/83 (25%)

Query: 37  KATDKDSGKFGLVEYRLLDNNTDAASYFNLDPAT---------------EHQGPIQ--DE 79
            ATD DSG+ G V Y +L  N D    F++DP T               E+   ++  D 
Sbjct: 1   SATDADSGENGKVTYSILSGNDD--GLFSIDPETGEITTTKPLDREEQPEYTLTVEATDG 58

Query: 80  GG--LSSTATVNIRVTDINDKNP 100
           GG  LSSTATV I V D+ND  P
Sbjct: 59  GGPPLSSTATVTITVLDVNDNAP 81


>gnl|CDD|224082 COG1160, COG1160, Predicted GTPases [General function prediction
           only].
          Length = 444

 Score = 30.2 bits (69), Expect = 0.56
 Identities = 15/57 (26%), Positives = 19/57 (33%), Gaps = 3/57 (5%)

Query: 121 GRVHATDADEGQNAVVYYSVPEDIPFVVDANTGDIRTNKALDYEKQQLYLERVFLIS 177
           GR   T ADE    +         P ++  N  D    + L YE   L       IS
Sbjct: 93  GREGITPADE---EIAKILRRSKKPVILVVNKIDNLKAEELAYEFYSLGFGEPVPIS 146


>gnl|CDD|218696 pfam05692, Myco_haema, Mycoplasma haemagglutinin.  This family
           consists of several haemagglutinin sequences from
           Mycoplasma synoviae and Mycoplasma gallisepticum. The
           major plasma membrane proteins, pMGAs, of Mycoplasma
           gallisepticum are cell adhesin (hemagglutinin)
           molecules. It has been shown that the genetic
           determinants that code for the haemagglutinins are
           organised into a large family of genes and that only one
           of these genes is predominately expressed in any given
           strain.
          Length = 437

 Score = 28.0 bits (63), Expect = 3.3
 Identities = 28/116 (24%), Positives = 49/116 (42%), Gaps = 21/116 (18%)

Query: 36  VKATDKDSGKFGLVEYRLLDNNTDAASY-FNLDPATEHQGPIQDEGGLSS---------- 84
           VK++D +    GL +Y+L  NN +  +  FN                 S+          
Sbjct: 131 VKSSDSN--NVGL-QYKL--NNGNPKAIEFNSTTTASASSDRSSSEESSADETATETNTM 185

Query: 85  --TATVN-IRVTDINDKNPEFQSLPYEFSVPEGQDGLA--VGRVHATDADEGQNAV 135
             T TV+ I V  +   N +F     EFSVP G + +A  +G ++ T ++  ++ +
Sbjct: 186 NPTPTVDDINVAKVTLSNLKFGENTIEFSVPTGDNKVAPMIGNMYLTSSENNKDKI 241


>gnl|CDD|235965 PRK07205, PRK07205, hypothetical protein; Provisional.
          Length = 444

 Score = 27.0 bits (60), Expect = 6.3
 Identities = 14/36 (38%), Positives = 20/36 (55%)

Query: 101 EFQSLPYEFSVPEGQDGLAVGRVHATDADEGQNAVV 136
           E   L +E+ V E +  +    VHA DA +G NAV+
Sbjct: 227 ELDKLGFEYVVKENEVTVLGKSVHAKDAPQGINAVI 262


>gnl|CDD|241009 cd12565, RRM1_MRD1, RNA recognition motif 1 in yeast multiple
          RNA-binding domain-containing protein 1 (MRD1) and
          similar proteins.  This subgroup corresponds to the
          RRM1 of MRD1 which is encoded by a novel yeast gene
          MRD1 (multiple RNA-binding domain). It is
          well-conserved in yeast and its homologs exist in all
          eukaryotes. MRD1 is present in the nucleolus and the
          nucleoplasm. It interacts with the 35 S precursor rRNA
          (pre-rRNA) and U3 small nucleolar RNAs (snoRNAs). MRD1
          is essential for the initial processing at the A0-A2
          cleavage sites in the 35 S pre-rRNA. It contains 5
          conserved RNA recognition motifs (RRMs), also termed
          RBDs (RNA binding domains) or RNPs (ribonucleoprotein
          domains), which may play an important structural role
          in organizing specific rRNA processing events. .
          Length = 76

 Score = 25.6 bits (57), Expect = 6.3
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query: 36 VKATDKDSGKFGLVEYRLLDNNTDAASYFN 65
          ++  D  S +FG V ++  ++   A  YFN
Sbjct: 33 MRTRDGKSRRFGFVGFKSEEDAQQAVKYFN 62


>gnl|CDD|225157 COG2248, COG2248, Predicted hydrolase (metallo-beta-lactamase
           superfamily) [General function prediction only].
          Length = 304

 Score = 27.0 bits (60), Expect = 6.4
 Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 2/48 (4%)

Query: 101 EFQSLPYEFS--VPEGQDGLAVGRVHATDADEGQNAVVYYSVPEDIPF 146
           EF     EFS  VP G++G  +G V      +G++++V+ S  +    
Sbjct: 143 EFGGTVIEFSPPVPHGREGSKLGYVLMVAVTDGKSSIVFASDVQGPIN 190


>gnl|CDD|180760 PRK06934, PRK06934, flavodoxin; Provisional.
          Length = 221

 Score = 26.8 bits (59), Expect = 7.1
 Identities = 8/21 (38%), Positives = 13/21 (61%)

Query: 125 ATDADEGQNAVVYYSVPEDIP 145
           A D +  +  +VY+S PED+ 
Sbjct: 30  AADRNARRVLIVYFSQPEDVK 50


>gnl|CDD|201731 pfam01328, Peroxidase_2, Peroxidase, family 2.  The peroxidases in
           this family do not have similarity to other peroxidases.
          Length = 319

 Score = 26.6 bits (58), Expect = 7.8
 Identities = 24/92 (26%), Positives = 29/92 (31%), Gaps = 14/92 (15%)

Query: 61  ASYFNLDPATEHQGPIQDEGGLS------STATVNIRVTDINDKNPEFQSLPYEFSVPEG 114
           AS FNLD   EH  P + +G  S        A  N  + D N     F      F+  E 
Sbjct: 110 ASLFNLDDLAEHHAPFEHDGSFSRADYKQGPAHSNDFIDDRNFDAEIFDQFLDFFAGKEH 169

Query: 115 QDGLAVGRVHATDADEGQNAVVYYSVPEDIPF 146
            D          DA   + A        D  F
Sbjct: 170 FD--------IADAAAAKLAREKLEEEHDFHF 193


>gnl|CDD|181254 PRK08150, PRK08150, enoyl-CoA hydratase; Provisional.
          Length = 255

 Score = 26.5 bits (59), Expect = 8.8
 Identities = 25/85 (29%), Positives = 33/85 (38%), Gaps = 32/85 (37%)

Query: 80  GGLSSTATVNIRVTDINDKNPEFQSLPYEFSVPEGQDGLAVG-----RV----------- 123
           GGL   +  +IRV D +          Y F++PEGQ G+ VG     RV           
Sbjct: 106 GGLELASAAHIRVADES---------TY-FALPEGQRGIFVGGGGSVRVPRLIGVARMTD 155

Query: 124 -----HATDADEGQNA-VVYYSVPE 142
                   DA EG+   +  Y VP 
Sbjct: 156 MMLTGRVYDAQEGERLGLAQYLVPA 180


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.133    0.379 

Gapped
Lambda     K      H
   0.267   0.0710    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,242,117
Number of extensions: 839511
Number of successful extensions: 458
Number of sequences better than 10.0: 1
Number of HSP's gapped: 452
Number of HSP's successfully gapped: 23
Length of query: 184
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 93
Effective length of database: 6,901,388
Effective search space: 641829084
Effective search space used: 641829084
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 56 (25.4 bits)