RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6837
         (122 letters)



>gnl|CDD|206637 cd11304, Cadherin_repeat, Cadherin tandem repeat domain.  Cadherins
           are glycoproteins involved in Ca2+-mediated cell-cell
           adhesion. The cadherin repeat domains occur as tandem
           repeats in the extracellular regions, which are thought
           to mediate cell-cell contact when bound to calcium. They
           play numerous roles in cell fate, signalling,
           proliferation, differentiation, and migration; members
           include E-, N-, P-, T-, VE-, CNR-, proto-, and
           FAT-family cadherin, desmocollin, and desmoglein, a
           large variety of domain architectures with varying
           repeat copy numbers. Cadherin-repeat containing proteins
           exist as monomers, homodimers, or heterodimers.
          Length = 98

 Score = 83.1 bits (206), Expect = 3e-22
 Identities = 34/91 (37%), Positives = 53/91 (58%), Gaps = 8/91 (8%)

Query: 38  EYTISEDVPVGTLVAKVKAIDPDLP--GTIAYSLVAGDE-GKFVLESSSGNLKLKDTLDR 94
           E ++ E+ P GT+V  V A DPD    G + YS+V+G+E G F ++ S+G +     LDR
Sbjct: 3   EVSVPENAPPGTVVLTVSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTAKPLDR 62

Query: 95  EQKDKYRVQVRASDGVQ-----SADVVLTIL 120
           E++  Y + V A+DG       +A V +T+L
Sbjct: 63  EEQSSYTLTVTATDGGGPPLSSTATVTITVL 93


>gnl|CDD|215665 pfam00028, Cadherin, Cadherin domain. 
          Length = 92

 Score = 71.2 bits (175), Expect = 2e-17
 Identities = 28/73 (38%), Positives = 40/73 (54%), Gaps = 3/73 (4%)

Query: 40  TISEDVPVGTLVAKVKAIDPDLP--GTIAYSLVAGDE-GKFVLESSSGNLKLKDTLDREQ 96
           ++ E+ PVGT V  V A D DL   G I YS++ G   G F ++  +G+L     LDRE 
Sbjct: 4   SVPENAPVGTEVLTVTATDADLGPNGRIFYSILGGGPGGWFRIDPDTGDLSTTKPLDRES 63

Query: 97  KDKYRVQVRASDG 109
             +Y + V A+D 
Sbjct: 64  IGEYELTVLATDS 76


>gnl|CDD|214520 smart00112, CA, Cadherin repeats.  Cadherins are glycoproteins
           involved in Ca2+-mediated cell-cell adhesion. Cadherin
           domains occur as repeats in the extracellular regions
           which are thought to mediate cell-cell contact when
           bound to calcium.
          Length = 81

 Score = 55.0 bits (133), Expect = 2e-11
 Identities = 22/71 (30%), Positives = 39/71 (54%), Gaps = 6/71 (8%)

Query: 55  KAIDPDLP--GTIAYSLVAGDE-GKFVLESSSGNLKLKDTLDREQKDKYRVQVRASDG-- 109
            A D D    G + YS+++G++ G F ++  +G +     LDRE++ +Y + V A+DG  
Sbjct: 1   SATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGG 60

Query: 110 -VQSADVVLTI 119
              S+   +TI
Sbjct: 61  PPLSSTATVTI 71


>gnl|CDD|206635 cd00031, CA_like, Cadherin repeat-like domain.  Cadherins are
           glycoproteins involved in Ca2+-mediated cell-cell
           adhesion. The cadherin repeat domains occur as tandem
           repeats in the extracellular regions, which are thought
           to mediate cell-cell contact when bound to calcium. They
           play numerous roles in cell fate, signalling,
           proliferation, differentiation, and migration; members
           include E-, N-, P-, T-, VE-, CNR-, proto-, and
           FAT-family cadherin, desmocollin, and desmoglein, a
           large variety of domain architectures with varying
           repeat copy numbers. Cadherin-repeat containing proteins
           exist as monomers, homodimers, or heterodimers. This
           family also includes the cadherin-like repeats of
           extracellular alpha-dystroglycan.
          Length = 98

 Score = 33.5 bits (76), Expect = 0.005
 Identities = 18/71 (25%), Positives = 25/71 (35%), Gaps = 3/71 (4%)

Query: 39  YTISEDVPVGTLVAKVKAIDPDLPGTIAYSLVAGDE---GKFVLESSSGNLKLKDTLDRE 95
            +  E    G+    +        G I     AG E        E  SG L+  + LDRE
Sbjct: 4   GSAVEGRSRGSFRVSIPTDLIASSGEIIKISAAGKEALPSWLHWEPHSGILEGLEKLDRE 63

Query: 96  QKDKYRVQVRA 106
            K  + + V A
Sbjct: 64  DKGVHYISVSA 74


>gnl|CDD|203892 pfam08266, Cadherin_2, Cadherin-like.  This cadherin domain is
          usually the most N-terminal copy of the domain.
          Length = 84

 Score = 30.2 bits (69), Expect = 0.069
 Identities = 16/65 (24%), Positives = 31/65 (47%), Gaps = 10/65 (15%)

Query: 39 YTISEDVPVGTLVAKVKAIDPDL---PGTIA---YSLVAGDEGK-FVLESSSGNLKLKDT 91
          Y++ E+   G+ V  +     DL      +A     +V+G   + F L   +G+L + + 
Sbjct: 5  YSVPEETEPGSFVGNLAK---DLGLNVQELAARGARIVSGGNKQYFQLNPETGDLLVNER 61

Query: 92 LDREQ 96
          +DRE+
Sbjct: 62 IDREE 66


>gnl|CDD|218656 pfam05612, DUF781, Mouse protein of unknown function (DUF781).
           This family consists of uncharacterized mouse proteins
           of unknown function.
          Length = 356

 Score = 29.9 bits (67), Expect = 0.28
 Identities = 10/18 (55%), Positives = 11/18 (61%)

Query: 21  RILDRNDSPPSFADFHTE 38
           R+L   D PP  ADF TE
Sbjct: 273 RVLQNTDHPPFIADFTTE 290


>gnl|CDD|221854 pfam12920, TcdA_TcdB_pore, TcdA/TcdB pore forming domain.  This
           family represents the most conserved region within the
           C. difficile Toxin A and Toxin B pore forming region.
          Length = 654

 Score = 28.4 bits (64), Expect = 0.82
 Identities = 28/116 (24%), Positives = 46/116 (39%), Gaps = 20/116 (17%)

Query: 21  RILDRNDSPPSFADFHTEYTIS------EDVPVG--------TLVAKVKAIDPDLPGTIA 66
           R+ D       F  F +EYTI+       D PV         TLV  V  +  +  G ++
Sbjct: 409 RLEDDYRFDWDFYAFPSEYTITRLKPEYVDTPVSVILDNRNRTLV--VPTLPEEWRGKLS 466

Query: 67  YSLVAGDEGKFVLESSSGNLKLKDTLDREQKDKYRV---QVRASDGVQSADVVLTI 119
           Y L  G  G++ L  SSG   ++   D  +  K+ +    +     +Q  D  + +
Sbjct: 467 YELK-GAGGEYTLSLSSGAANIELNSDSTKNSKWIIDASALVKESTIQIEDNNIVV 521


>gnl|CDD|181947 PRK09553, tauD, taurine dioxygenase; Reviewed.
          Length = 277

 Score = 27.7 bits (62), Expect = 1.5
 Identities = 18/57 (31%), Positives = 30/57 (52%), Gaps = 11/57 (19%)

Query: 7   NLAHKFG----------VIGCVQLRILD-RNDSPPSFADFHTEYTISEDVPVGTLVA 52
           +LA +FG            G  ++ +LD  ND+PP   ++HT+ T  E  P+G ++A
Sbjct: 57  DLAARFGDLHIHPVYPHAEGVPEIIVLDTHNDNPPDNDNWHTDVTFIETPPLGAILA 113


>gnl|CDD|227885 COG5598, COG5598, Trimethylamine:corrinoid methyltransferase
           [Coenzyme metabolism].
          Length = 526

 Score = 27.7 bits (62), Expect = 1.8
 Identities = 8/39 (20%), Positives = 13/39 (33%), Gaps = 1/39 (2%)

Query: 2   RDLTLNLAHKFGVIGCVQLRILDRNDSPPSFADFHTEYT 40
           R+L   LA     I      + + +D   +     T Y 
Sbjct: 162 RNLA-KLAQHAPNIHFFGGPVCEPDDVDVNKRHLDTIYA 199


>gnl|CDD|237603 PRK14083, PRK14083, HSP90 family protein; Provisional.
          Length = 601

 Score = 27.2 bits (61), Expect = 2.1
 Identities = 9/28 (32%), Positives = 14/28 (50%)

Query: 55 KAIDPDLPGTIAYSLVAGDEGKFVLESS 82
          +A+DP  PG I   L     G  ++E +
Sbjct: 43 RALDPTAPGRIRIELTDAGGGTLIVEDN 70


>gnl|CDD|218090 pfam04450, BSP, Peptidase of plants and bacteria.  These basic
           secretory proteins (BSPs) are believed to be part of the
           plants defence mechanism against pathogens.
          Length = 203

 Score = 26.9 bits (60), Expect = 2.6
 Identities = 12/59 (20%), Positives = 26/59 (44%), Gaps = 2/59 (3%)

Query: 54  VKAIDPDLPGTIAYSLVAGDEGKFVLESSSGNLKL--KDTLDREQKDKYRVQVRASDGV 110
           ++ I+PD PG   +  +  D  + + ++S    ++  +   DR       + +   DGV
Sbjct: 6   LEIINPDTPGGKRFDRLVPDAEQTLSDASRFVWRILYQPPSDRPPVRSVTLTLDDIDGV 64


>gnl|CDD|225185 COG2302, COG2302, Uncharacterized conserved protein, contains
           S4-like domain [Function unknown].
          Length = 257

 Score = 26.9 bits (60), Expect = 2.8
 Identities = 12/44 (27%), Positives = 20/44 (45%), Gaps = 11/44 (25%)

Query: 66  AYSLVAGD----EGKFVLESSSGNLKLKDTLDREQKDKYRVQVR 105
           +Y +  GD     G        G LK+ +     +KDK+R+ +R
Sbjct: 218 SYEVQEGDLISIRGF-------GRLKILEINGVTKKDKFRITLR 254


>gnl|CDD|235405 PRK05320, PRK05320, rhodanese superfamily protein; Provisional.
          Length = 257

 Score = 26.9 bits (60), Expect = 2.9
 Identities = 7/26 (26%), Positives = 14/26 (53%)

Query: 30 PSFADFHTEYTISEDVPVGTLVAKVK 55
            FAD   + ++S+  P   ++ K+K
Sbjct: 66 ARFADLQVKESLSDSQPFRRMLVKLK 91


>gnl|CDD|176796 cd08818, CARD_MDA5_1, Caspase activation and recruitment domain
           found in MDA5, first repeat.  Caspase activation and
           recruitment domain (CARD) found in MDA5
           (melanoma-differentiation-associated gene 5), first
           repeat.  MDA5, also known as IFIH1, contains two
           N-terminal CARD domains and a C-terminal RNA helicase
           domain. MDA5 is a cytoplasmic DEAD box RNA helicase that
           plays an important role in host antiviral response by
           sensing incoming viral RNA. Upon activation, the signal
           is transferred to downstream pathways via the adaptor
           molecule IPS-1 (MAVS, VISA, CARDIF), leading to the
           induction of type I interferons. Although very similar
           in sequence, MDA5 recognizes different sets of viruses
           compared to RIG-I, a related RNA helicase. MDA5
           associates with IPS-1 through a CARD-CARD interaction.
           In general, CARDs are death domains (DDs) found
           associated with caspases. They are known to be important
           in the signaling pathways for apoptosis, inflammation,
           and host-defense mechanisms. DDs are protein-protein
           interaction domains found in a variety of domain
           architectures. Their common feature is that they form
           homodimers by self-association or heterodimers by
           associating with other members of the DD superfamily
           including PYRIN and DED (Death Effector Domain). They
           serve as adaptors in signaling pathways and can recruit
           other proteins into signaling complexes.
          Length = 88

 Score = 25.8 bits (57), Expect = 3.2
 Identities = 8/35 (22%), Positives = 17/35 (48%)

Query: 86  LKLKDTLDREQKDKYRVQVRASDGVQSADVVLTIL 120
           L     L+ E K++ R        + +A+++L+ L
Sbjct: 25  LDYLTFLEAEVKERIRAAAATRGNIAAAELLLSTL 59


>gnl|CDD|222696 pfam14344, DUF4397, Domain of unknown function (DUF4397).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria, archaea and
           eukaryotes, and is approximately 120 amino acids in
           length.
          Length = 120

 Score = 25.7 bits (57), Expect = 4.7
 Identities = 21/81 (25%), Positives = 29/81 (35%), Gaps = 5/81 (6%)

Query: 38  EYTISEDVPVGTLVAKVKAIDPDLPGTIAYSLVAGDEGKFVLESSSGNLKLKDTLDREQK 97
            YTI       T    V + D  +     Y++ A       L  S   L L D L     
Sbjct: 44  SYTIEVTPAGTTPTTVVLSADVTVEAGTNYTVAAVGT----LAGSLQLLVLTDDLSAVAA 99

Query: 98  DKYRVQVR-ASDGVQSADVVL 117
            + RV+V  A+    + DV L
Sbjct: 100 GEARVRVVHAAPDAPAVDVYL 120


>gnl|CDD|166697 PLN03058, PLN03058, dynein light chain type 1 family protein;
           Provisional.
          Length = 128

 Score = 25.0 bits (54), Expect = 8.6
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 87  KLKDTLDREQKDKYRVQVRASD 108
           K K    ++QKD+  V+VRASD
Sbjct: 21  KKKAKEQQDQKDELNVRVRASD 42


>gnl|CDD|129984 TIGR00906, 2A0303, cationic amino acid transport permease.
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 557

 Score = 25.5 bits (56), Expect = 9.8
 Identities = 11/39 (28%), Positives = 19/39 (48%), Gaps = 2/39 (5%)

Query: 49  TLVAKVKAIDPDLPGTIAYSLVAGDEGKFVLESSSGNLK 87
           TL+     +DPD P  +A+  V     K+++  + G L 
Sbjct: 287 TLMMPYYLLDPDAPFPVAFEYVGWGPAKYIV--AVGALC 323


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.136    0.386 

Gapped
Lambda     K      H
   0.267   0.0710    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,046,151
Number of extensions: 528062
Number of successful extensions: 515
Number of sequences better than 10.0: 1
Number of HSP's gapped: 511
Number of HSP's successfully gapped: 28
Length of query: 122
Length of database: 10,937,602
Length adjustment: 84
Effective length of query: 38
Effective length of database: 7,211,866
Effective search space: 274050908
Effective search space used: 274050908
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 53 (24.2 bits)