RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6218
         (533 letters)



>gnl|CDD|218687 pfam05679, CHGN, Chondroitin N-acetylgalactosaminyltransferase. 
          Length = 504

 Score = 72.4 bits (178), Expect = 2e-13
 Identities = 25/35 (71%), Positives = 29/35 (82%)

Query: 490 HIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524
           H+  CL+N  + HEDVELGRCVQKFAGI CTWSY+
Sbjct: 1   HLDSCLRNALSAHEDVELGRCVQKFAGIGCTWSYQ 35



 Score = 57.8 bits (140), Expect = 7e-09
 Identities = 20/29 (68%), Positives = 23/29 (79%)

Query: 54 HIKYCLKNLYTTHEDVELGRCVQKFAGIP 82
          H+  CL+N  + HEDVELGRCVQKFAGI 
Sbjct: 1  HLDSCLRNALSAHEDVELGRCVQKFAGIG 29


>gnl|CDD|190308 pfam02434, Fringe, Fringe-like.  The drosophila protein fringe
           (FNG) is a glucosaminyltransferase that controls the
           response of the Notch receptor to specific ligands. FNG
           is localised to the Golgi apparatus (not secreted as
           previously thought). Modification of Notch occurs
           through glycosylation by FNG. The xenopus homologue,
           lunatic fringe, has been implicated in a variety of
           functions.
          Length = 248

 Score = 61.2 bits (149), Expect = 2e-10
 Identities = 42/197 (21%), Positives = 68/197 (34%), Gaps = 14/197 (7%)

Query: 336 LFVGVMTANSYLNTRATSVYDTWARDIPGRVAFFT-SESSTLPAARPDLPLVKLRGVDDS 394
           +F+ V T   +  TR   +  TW      +   FT  E   LP       +         
Sbjct: 6   IFIAVKTTKKFHKTRLPLLLKTWISRAKHQTYIFTDGEDEGLPTRTGGHLINT--NCSAG 63

Query: 395 YPPQKKSFLMLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQ-AGR 453
           +  +  S  M            +WF   DDD YV   RL R+L   + ++  ++G+ +  
Sbjct: 64  HCRKALSCKMAVEYDRFLESGKKWFCHVDDDNYVNVPRLVRLLSCYNHTQDVYLGKPSLY 123

Query: 454 GNQEEFGLLSLEYDE--NFCMGGPGVIMSRATLALVAPHIKYCLKNLYTT------HEDV 505
              E    +         F  GG G  +SR     ++P          +T       +D 
Sbjct: 124 RPIEATERVKNNRKVGFWFATGGAGFCISRGLALKMSPWASGG--RFMSTSEKIRLPDDC 181

Query: 506 ELGRCVQKFAGIPCTWS 522
            LG  ++   G+P T S
Sbjct: 182 TLGYIIENLLGVPLTHS 198



 Score = 28.4 bits (64), Expect = 8.8
 Identities = 14/60 (23%), Positives = 22/60 (36%), Gaps = 8/60 (13%)

Query: 33  NFCMGGPGVIMSRATLALVAPHIKYCLKNLYTT------HEDVELGRCVQKFAGIPFLSS 86
            F  GG G  +SR     ++P          +T       +D  LG  ++   G+P   S
Sbjct: 141 WFATGGAGFCISRGLALKMSPWASGG--RFMSTSEKIRLPDDCTLGYIIENLLGVPLTHS 198


>gnl|CDD|224126 COG1205, COG1205, Distinct helicase family with a unique C-terminal
           domain including a metal-binding cysteine cluster
           [General function prediction only].
          Length = 851

 Score = 33.1 bits (76), Expect = 0.38
 Identities = 37/151 (24%), Positives = 47/151 (31%), Gaps = 35/151 (23%)

Query: 358 WARDIPGRVAFFTSESSTLPAARPDLPLVKLRGVDDSYPPQKKSFL-----MLQYMWERY 412
              D+PG+V F      T P  R  +    +R      PP     L     ML Y+  R 
Sbjct: 138 LISDLPGKVTFGRYTGDTPPEERRAI----IRN-----PP---DILLTNPDMLHYLLLRN 185

Query: 413 GDKYEWFMRAD-----DDLYV-RG----------ERLARMLRSVDSSKPQFIG-QAGRGN 455
            D + W +R       D+L+  RG           RL R LR    S  Q I   A   N
Sbjct: 186 HDAWLWLLRNLKYLVVDELHTYRGVQGSEVALLLRRLLRRLR-RYGSPLQIICTSATLAN 244

Query: 456 QEEFGLLSLEYDENFCMGGPGVIMSRATLAL 486
             EF       D    +   G          
Sbjct: 245 PGEFAEELFGRDFEVPVDEDGSPRGLRYFVR 275


>gnl|CDD|200969 pfam00064, Neur, Neuraminidase.  Neuraminidases cleave sialic acid
           residues from glycoproteins. Belong to the sialidase
           family - but this alignment does not generalise to the
           other sialidases. Structure is a 6-sheet beta propeller.
          Length = 468

 Score = 32.4 bits (74), Expect = 0.58
 Identities = 16/56 (28%), Positives = 25/56 (44%), Gaps = 8/56 (14%)

Query: 255 GWSSSAGEKGRGISDVGLS--ESGLLFVGVMTANSYLNTRATSVYDTWARDIPATQ 308
            WSS+A   G+    +G+S  +         TA      R T   ++WA++I  TQ
Sbjct: 177 AWSSTACHDGKAWMTIGVSGPDDN------ATAVIKYGGRPTDTINSWAKNILRTQ 226


>gnl|CDD|131620 TIGR02569, TIGR02569_actnb, TIGR02569 family protein.  This protein
           family is found, so far, only in Actinobacteria,
           including as least five species of Mycobacterium, three
           of Corynebacterium, and Nocardia farcinica, always in a
           single copy per genome. The function is unknown
           [Hypothetical proteins, Conserved].
          Length = 272

 Score = 30.3 bits (68), Expect = 1.9
 Identities = 13/55 (23%), Positives = 22/55 (40%), Gaps = 3/55 (5%)

Query: 414 DKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQEEFGLLSLEYDE 468
            +  W  R  + L+  G R+ R +RS D    +++    R +    G      DE
Sbjct: 43  ARSAWSARVRETLFPDGVRVVRPVRSTDG---RYVVAGWRADTFVAGTPEPRVDE 94


>gnl|CDD|133028 cd04185, GT_2_like_b, Subfamily of Glycosyltransferase Family GT2
           of unknown function.  GT-2 includes diverse families of
           glycosyltransferases with a common GT-A type structural
           fold, which has two tightly associated beta/alpha/beta
           domains that tend to form a continuous central sheet of
           at least eight beta-strands. These are enzymes that
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. Glycosyltransferases have been
           classified into more than 90 distinct sequence based
           families.
          Length = 202

 Score = 29.5 bits (67), Expect = 3.0
 Identities = 17/73 (23%), Positives = 26/73 (35%), Gaps = 15/73 (20%)

Query: 412 YGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQEEFGLLSLEYDENFC 471
           Y   Y+W    DDD     + L ++L   D   PQF+               L  D +  
Sbjct: 76  YELGYDWIWLMDDDAIPDPDALEKLLAYADKDNPQFLA-------------PLVLDPDGS 122

Query: 472 MGGPGVIMSRATL 484
             G   ++SR  +
Sbjct: 123 FVGV--LISRRVV 133


>gnl|CDD|222183 pfam13506, Glyco_transf_21, Glycosyl transferase family 21.  This
           is a family of ceramide beta-glucosyltransferases -
           EC:2.4.1.80.
          Length = 171

 Score = 29.5 bits (67), Expect = 3.1
 Identities = 7/34 (20%), Positives = 15/34 (44%), Gaps = 1/34 (2%)

Query: 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLR 438
           L    +    +Y+  + +D D+ V  + L  +L 
Sbjct: 22  LLQALDAK-ARYDLLVISDSDVRVPPDYLRELLA 54


>gnl|CDD|177594 PHA03356, PHA03356, tegument protein UL11; Provisional.
          Length = 93

 Score = 27.9 bits (62), Expect = 3.7
 Identities = 15/41 (36%), Positives = 17/41 (41%)

Query: 163 SSCGRNFSVSPLSISQRQGPRPTPFQMRGPYSREGRRLSTG 203
           S  G NF +SP      Q P   P +  GP SR  R  S  
Sbjct: 40  SDAGGNFYISPELRVVTQPPGRQPPRPSGPPSRHTRHWSPS 80


>gnl|CDD|227152 COG4815, COG4815, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 145

 Score = 28.3 bits (63), Expect = 6.3
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query: 281 GVMTANSYLNTRATSVYDTWARDIPATQGQTDGWSSS 317
           G+ T++     R   VY +W   +     +T  W SS
Sbjct: 103 GITTSDGQPGKRGFRVYPSWCEGLNKQARKTQAWQSS 139


>gnl|CDD|224940 COG2029, COG2029, Uncharacterized conserved protein [Function
           unknown].
          Length = 189

 Score = 28.2 bits (63), Expect = 7.9
 Identities = 31/113 (27%), Positives = 43/113 (38%), Gaps = 22/113 (19%)

Query: 380 RPDLPLVKLRGVDDSYPPQKKSFLMLQYMWERYGDKYEWFMRADDDLYVRGERLARML-- 437
            PDL L  LR        Q+    +L+ + E+ G       R  DDLYV G +L   +  
Sbjct: 77  HPDLLLAYLR--------QRLLVAILKEVLEKLGVDLS---REGDDLYVEGRKLTVSIAT 125

Query: 438 RSVDSSKPQF---IGQAGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALV 487
            S  SSK      +   G  + +  GL  L Y +          M R  +A V
Sbjct: 126 VSPVSSKIHLGINVKTEGVPDVDAIGLEDLGYGD------ILEFMERVAVAYV 172


>gnl|CDD|226797 COG4347, COG4347, Predicted membrane protein [Function unknown].
          Length = 200

 Score = 28.2 bits (63), Expect = 8.1
 Identities = 8/28 (28%), Positives = 11/28 (39%)

Query: 134 TKFKLGLWSPAIELFFLCPQDFAWASTW 161
           T  K GLW+ A+ L         W   +
Sbjct: 83  TLVKYGLWAVAMNLLVNAEGGDIWWMGY 110


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.136    0.416 

Gapped
Lambda     K      H
   0.267   0.0772    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,986,740
Number of extensions: 2595446
Number of successful extensions: 1715
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1710
Number of HSP's successfully gapped: 18
Length of query: 533
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 432
Effective length of database: 6,457,848
Effective search space: 2789790336
Effective search space used: 2789790336
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (27.2 bits)