RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11641
         (277 letters)



>gnl|CDD|133004 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the formation of
           mucin-type O-linked glycans.  UDP-GalNAc: polypeptide
           alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T)
           initiate the formation of mucin-type, O-linked glycans
           by catalyzing the transfer of
           alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to
           hydroxyl groups of Ser or Thr residues of core proteins
           to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These
           enzymes are type II membrane proteins with a GT-A type
           catalytic domain and a lectin domain located on the
           lumen side of the Golgi apparatus. In human, there are
           15 isozymes of pp-GalNAc-Ts, representing the largest of
           all glycosyltransferase families. Each isozyme has
           unique but partially redundant substrate specificity for
           glycosylation sites on acceptor proteins.
          Length = 299

 Score = 59.9 bits (146), Expect = 2e-10
 Identities = 17/47 (36%), Positives = 29/47 (61%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVA 106
           W+  +K+      P    +D GD+SE+ ALR++L CK F W++++V 
Sbjct: 253 WMDEYKEYFYKARPELRNIDYGDLSERKALRERLKCKSFKWYLENVY 299


>gnl|CDD|238092 cd00161, RICIN, Ricin-type beta-trefoil; Carbohydrate-binding
           domain formed from presumed gene triplication. The
           domain is found in a variety of molecules serving
           diverse functions such as enzymatic activity, inhibitory
           toxicity and signal transduction. Highly specific ligand
           binding occurs on exposed surfaces of the compact domain
           sturcture.
          Length = 124

 Score = 55.6 bits (134), Expect = 5e-10
 Identities = 31/144 (21%), Positives = 52/144 (36%), Gaps = 33/144 (22%)

Query: 130 TQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG--ERCVDADKQEKLVEMLK 187
           T  CLD  G +    +    CHG+G++Q + L ++G +       C+D            
Sbjct: 9   TGLCLDVNGGSDGGPVQLYPCHGNGNNQKWTLTSDGTIRIKSSNLCLDVGGDAP------ 62

Query: 188 LPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVH- 246
                             G+ + L+ C  S G+ +  W F N    + +    +CL V  
Sbjct: 63  ------------------GSKVRLYTC--SGGSDNQRWTF-NKDGTIRNLKSGKCLDVKG 101

Query: 247 --PETNQLAMLRCDENNSYQQWRF 268
                  L +  CD     Q+W+F
Sbjct: 102 GNTNGTNLILWTCD-GGPNQKWKF 124



 Score = 37.9 bits (88), Expect = 0.001
 Identities = 12/61 (19%), Positives = 24/61 (39%), Gaps = 2/61 (3%)

Query: 123 GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE--RCVDADKQE 180
           G  +   +  CLD  G    + +    C G   +Q +  N +G + + +  +C+D     
Sbjct: 44  GTIRIKSSNLCLDVGGDAPGSKVRLYTCSGGSDNQRWTFNKDGTIRNLKSGKCLDVKGGN 103

Query: 181 K 181
            
Sbjct: 104 T 104



 Score = 37.5 bits (87), Expect = 0.002
 Identities = 12/67 (17%), Positives = 23/67 (34%), Gaps = 5/67 (7%)

Query: 206 GTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSV--HPETNQLAMLRCDENNSY 263
           G  + L+ C  +       W    S   +       CL V      +++ +  C   +  
Sbjct: 21  GGPVQLYPCHGNGNN--QKWTL-TSDGTIRIKSSNLCLDVGGDAPGSKVRLYTCSGGSDN 77

Query: 264 QQWRFKE 270
           Q+W F +
Sbjct: 78  QRWTFNK 84



 Score = 34.0 bits (78), Expect = 0.020
 Identities = 10/43 (23%), Positives = 20/43 (46%), Gaps = 2/43 (4%)

Query: 1  MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG--ERCVDAD 41
           G +    +    CHG+G++Q + L ++G +       C+D  
Sbjct: 16 NGGSDGGPVQLYPCHGNGNNQKWTLTSDGTIRIKSSNLCLDVG 58



 Score = 27.1 bits (60), Expect = 5.4
 Identities = 8/50 (16%), Positives = 19/50 (38%), Gaps = 2/50 (4%)

Query: 2   GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE--RCVDADKQVISTQK 49
           G    + +    C G   +Q +  N +G + + +  +C+D      +   
Sbjct: 59  GDAPGSKVRLYTCSGGSDNQRWTFNKDGTIRNLKSGKCLDVKGGNTNGTN 108


>gnl|CDD|216044 pfam00652, Ricin_B_lectin, Ricin-type beta-trefoil lectin domain. 
          Length = 124

 Score = 54.9 bits (132), Expect = 1e-09
 Identities = 37/153 (24%), Positives = 61/153 (39%), Gaps = 36/153 (23%)

Query: 121 FWGEAKNLG-TQKCLDSMGRTA-PAVIGTSYCHGSGSSQLFRLNAEGQL---GHGERCVD 175
             GE +N+  + KCLD  G +A    +G   CHG G +QL+ L  +G +   G+   C+D
Sbjct: 1   ASGEIRNVNRSGKCLDVPGGSADGGPVGLYPCHG-GGNQLWTLTGDGTIRSNGNSNLCLD 59

Query: 176 ADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLE 235
                                         G+ + L+ C   SG  +  W + +    + 
Sbjct: 60  VSGG------------------------GNGSKVVLWPCNGGSG--NQRWDY-DGDGTIR 92

Query: 236 HAILKRCLSVHPETN--QLAMLRCDENNSYQQW 266
           +    +CL V   +N  ++ +  CD  N  QQW
Sbjct: 93  NRKSGKCLDVKGASNGTKVILWTCD-GNPNQQW 124



 Score = 31.7 bits (72), Expect = 0.14
 Identities = 10/67 (14%), Positives = 22/67 (32%), Gaps = 5/67 (7%)

Query: 206 GTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHP--ETNQLAMLRCDENNSY 263
           G  + L+ C      +   W          +     CL V      +++ +  C+  +  
Sbjct: 24  GGPVGLYPCHGGGNQL---WTLTGDGTIRSNGNSNLCLDVSGGGNGSKVVLWPCNGGSGN 80

Query: 264 QQWRFKE 270
           Q+W +  
Sbjct: 81  QRWDYDG 87



 Score = 27.1 bits (60), Expect = 5.0
 Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 4/43 (9%)

Query: 2  GRTAPAVIGTSYCHGSGSSQLFRLNAEGQL---GHGERCVDAD 41
          G      +G   CHG G +QL+ L  +G +   G+   C+D  
Sbjct: 20 GSADGGPVGLYPCHG-GGNQLWTLTGDGTIRSNGNSNLCLDVS 61



 Score = 26.4 bits (58), Expect = 9.4
 Identities = 6/29 (20%), Positives = 16/29 (55%), Gaps = 2/29 (6%)

Query: 14  CHGSGSSQLFRLNAEGQLGHGE--RCVDA 40
           C+G   +Q +  + +G + + +  +C+D 
Sbjct: 74  CNGGSGNQRWDYDGDGTIRNRKSGKCLDV 102


>gnl|CDD|214672 smart00458, RICIN, Ricin-type beta-trefoil.  Carbohydrate-binding
           domain formed from presumed gene triplication.
          Length = 118

 Score = 51.7 bits (124), Expect = 1e-08
 Identities = 31/146 (21%), Positives = 54/146 (36%), Gaps = 33/146 (22%)

Query: 127 NLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQL--GHGERCVDADKQEKLVE 184
           +  T KCLD  G   P  +    CHG+G +QL++L ++G +     + C+ A+       
Sbjct: 3   SGNTGKCLDVNGNKNPVGLFD--CHGTGGNQLWKLTSDGAIRIKDTDLCLTANGN----- 55

Query: 185 MLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLS 244
                                G+ + L++C       +  W   N    + +    +CL 
Sbjct: 56  --------------------TGSTVTLYSC--DGTNDNQYWEV-NKDGTIRNPDSGKCLD 92

Query: 245 V-HPETNQLAMLRCDENNSYQQWRFK 269
           V    T    +L     N  Q+W F+
Sbjct: 93  VKDGNTGTKVILWTCSGNPNQKWIFE 118



 Score = 30.2 bits (68), Expect = 0.40
 Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 4/49 (8%)

Query: 1  MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQL--GHGERCVDADKQVIST 47
           G   P  +    CHG+G +QL++L ++G +     + C+ A+    ST
Sbjct: 13 NGNKNPVGLFD--CHGTGGNQLWKLTSDGAIRIKDTDLCLTANGNTGST 59


>gnl|CDD|220036 pfam08837, DUF1810, Protein of unknown function (DUF1810).  This
          is a family of uncharacterized proteins. The structure
          of one of the members in this family has been solved
          and it adopts a mainly alpha helical structure.
          Length = 139

 Score = 30.6 bits (70), Expect = 0.35
 Identities = 19/46 (41%), Positives = 24/46 (52%), Gaps = 3/46 (6%)

Query: 35 ERCVDADKQVISTQKSSLEL-RKRSSWLK-VFKQIRSLG-SPLAMF 77
          +R VDA   V  T  + L   RKRS W+  +F Q+  LG SP A  
Sbjct: 7  QRFVDAQDPVYETALAELRAGRKRSHWMWFIFPQLAGLGRSPTARR 52


>gnl|CDD|151410 pfam10963, DUF2765, Protein of unknown function (DUF2765).  This
           family of proteins has no known function.
          Length = 82

 Score = 28.0 bits (63), Expect = 1.7
 Identities = 8/19 (42%), Positives = 14/19 (73%)

Query: 171 ERCVDADKQEKLVEMLKLP 189
            R VD +++E L E+L++P
Sbjct: 40  MRTVDPEQKEALKELLEVP 58


>gnl|CDD|222968 PHA03065, PHA03065, Hypothetical protein; Provisional.
          Length = 438

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 10/36 (27%), Positives = 20/36 (55%)

Query: 57 RSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDK 92
          RS++ +  +Q       + +F+D G I  + +LR+K
Sbjct: 52 RSTFEEYVQQWVKKAGKVTLFVDRGSIPIKQSLREK 87


>gnl|CDD|236576 PRK09575, vmrA, multidrug efflux pump VmrA; Reviewed.
          Length = 453

 Score = 28.8 bits (65), Expect = 3.1
 Identities = 19/64 (29%), Positives = 29/64 (45%), Gaps = 17/64 (26%)

Query: 176 ADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLF-----NCFNSS------GTVDGP 224
           A + + + ++LKL +   VLAG+  +      +LNLF       FNS        T+ G 
Sbjct: 304 ARQYDNIKKLLKLAMKVTVLAGIAWV-----LLLNLFPETMIALFNSGDSELIAETIVG- 357

Query: 225 WRFH 228
            R H
Sbjct: 358 IRLH 361


>gnl|CDD|220127 pfam09149, DUF1935, Domain of unknown function (DUF1935).  Members
           of this family are found in various bacterial and
           eukaryotic hypothetical proteins, as well as in the
           cysteine protease calpain. Their exact function has not,
           as yet, been defined.
          Length = 97

 Score = 27.2 bits (61), Expect = 3.3
 Identities = 9/31 (29%), Positives = 11/31 (35%), Gaps = 7/31 (22%)

Query: 212 FNCFNSSG-------TVDGPWRFHNSTRQLE 235
             CF   G          G W F+N T+  E
Sbjct: 13  TPCFKEGGLLFRIVDEKGGRWAFYNDTKDYE 43


>gnl|CDD|236965 PRK11742, PRK11742, bifunctional NADH:ubiquinone oxidoreductase
           subunit C/D; Provisional.
          Length = 575

 Score = 28.7 bits (65), Expect = 3.8
 Identities = 16/47 (34%), Positives = 24/47 (51%), Gaps = 1/47 (2%)

Query: 56  KRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVAL-RDKLHCKDFSWF 101
           KR   L+V + +R L  P  M  D+  I E++   R+ L   DF+ F
Sbjct: 29  KREQLLEVLRFLRKLPKPYVMLFDLHGIDERLRTHREGLPAADFTVF 75


>gnl|CDD|214781 smart00703, NRF, N-terminal domain in C. elegans NRF-6 (Nose
           Resistant to Fluoxetine-4) and NDG-4 (resistant to
           nordihydroguaiaretic acid-4).  Also present in several
           other worm and fly proteins.
          Length = 110

 Score = 26.5 bits (59), Expect = 7.4
 Identities = 10/48 (20%), Positives = 17/48 (35%), Gaps = 5/48 (10%)

Query: 106 AYDVYTKFPELPPNLFWGEAKNLGT-QKC--LDSMGRTAPAVIGTSYC 150
           A      + + P  L  G    LG+ ++C  +   G +        YC
Sbjct: 4   ALKQLDAWGKFPSGLLEGNNLWLGSYEECLRISGPGTSTK--YQGKYC 49



 Score = 26.2 bits (58), Expect = 9.1
 Identities = 15/85 (17%), Positives = 20/85 (23%), Gaps = 14/85 (16%)

Query: 77  FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDS 136
            L +G   E         C   S       Y     +  L P           + K L S
Sbjct: 23  NLWLGSYEE---------CLRISGPGTSTKYQGKYCYLPLVPGKNLNC-----SIKSLTS 68

Query: 137 MGRTAPAVIGTSYCHGSGSSQLFRL 161
           +      V   S C  +    L   
Sbjct: 69  VLVLRTGVCLPSSCSAADLKTLLNQ 93


>gnl|CDD|153319 cd07635, BAR_GRAF2, The Bin/Amphiphysin/Rvs (BAR) domain of GTPase
           Regulator Associated with Focal adhesion 2.  BAR domains
           are dimerization, lipid binding and curvature sensing
           modules found in many different proteins with diverse
           functions. GTPase Regulator Associated with Focal
           adhesion kinase 2 (GRAF2), also called Rho GTPase
           activating protein 10 (ARHGAP10) or PS-GAP, is a GAP
           with activity towards Cdc42 and RhoA which regulates
           caspase-activated p21-activated protein kinase-2
           (PAK-2p34). GRAF2 interacts with PAK-2p34, leading to
           its stabilization and decrease of cell death. It is
           highly expressed in skeletal muscle and also interacts
           with PKNbeta, which is a target of Rho. GRAF2 contains
           an N-terminal BAR domain, followed by a Pleckstrin
           homology (PH) domain, a Rho GAP domain, and a C-terminal
           SH3 domain. BAR domains form dimers that bind to
           membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
           The BAR domain of the related protein GRAF directly
           interacts with its Rho GAP domain and inhibits its
           activity. Autoinhibited GRAF is capable of binding
           membranes and tubulating liposomes, showing that the
           membrane-tubulation and GAP-inhibitory functions of the
           BAR domain can occur simultaneously.
          Length = 207

 Score = 27.3 bits (60), Expect = 7.5
 Identities = 26/84 (30%), Positives = 38/84 (45%), Gaps = 5/84 (5%)

Query: 35  ERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFL--DMGDISEQVALRDK 92
           ERC+DA  Q  S    +LE ++    L V     +L  PL  F    +G + E+    DK
Sbjct: 62  ERCIDASLQEFSNFLKNLEEQREIMALNV---TETLIKPLERFRKEQLGAVKEEKKKFDK 118

Query: 93  LHCKDFSWFMKHVAYDVYTKFPEL 116
              K++S   KH+      K P+L
Sbjct: 119 ETEKNYSLLEKHLNLSAKKKEPQL 142


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.135    0.428 

Gapped
Lambda     K      H
   0.267   0.0713    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,660,132
Number of extensions: 1227050
Number of successful extensions: 796
Number of sequences better than 10.0: 1
Number of HSP's gapped: 788
Number of HSP's successfully gapped: 26
Length of query: 277
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 181
Effective length of database: 6,679,618
Effective search space: 1209010858
Effective search space used: 1209010858
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (26.2 bits)