RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6218
(533 letters)
>gnl|CDD|218687 pfam05679, CHGN, Chondroitin N-acetylgalactosaminyltransferase.
Length = 504
Score = 72.4 bits (178), Expect = 2e-13
Identities = 25/35 (71%), Positives = 29/35 (82%)
Query: 490 HIKYCLKNLYTTHEDVELGRCVQKFAGIPCTWSYE 524
H+ CL+N + HEDVELGRCVQKFAGI CTWSY+
Sbjct: 1 HLDSCLRNALSAHEDVELGRCVQKFAGIGCTWSYQ 35
Score = 57.8 bits (140), Expect = 7e-09
Identities = 20/29 (68%), Positives = 23/29 (79%)
Query: 54 HIKYCLKNLYTTHEDVELGRCVQKFAGIP 82
H+ CL+N + HEDVELGRCVQKFAGI
Sbjct: 1 HLDSCLRNALSAHEDVELGRCVQKFAGIG 29
>gnl|CDD|190308 pfam02434, Fringe, Fringe-like. The drosophila protein fringe
(FNG) is a glucosaminyltransferase that controls the
response of the Notch receptor to specific ligands. FNG
is localised to the Golgi apparatus (not secreted as
previously thought). Modification of Notch occurs
through glycosylation by FNG. The xenopus homologue,
lunatic fringe, has been implicated in a variety of
functions.
Length = 248
Score = 61.2 bits (149), Expect = 2e-10
Identities = 42/197 (21%), Positives = 68/197 (34%), Gaps = 14/197 (7%)
Query: 336 LFVGVMTANSYLNTRATSVYDTWARDIPGRVAFFT-SESSTLPAARPDLPLVKLRGVDDS 394
+F+ V T + TR + TW + FT E LP +
Sbjct: 6 IFIAVKTTKKFHKTRLPLLLKTWISRAKHQTYIFTDGEDEGLPTRTGGHLINT--NCSAG 63
Query: 395 YPPQKKSFLMLQYMWERYGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQ-AGR 453
+ + S M +WF DDD YV RL R+L + ++ ++G+ +
Sbjct: 64 HCRKALSCKMAVEYDRFLESGKKWFCHVDDDNYVNVPRLVRLLSCYNHTQDVYLGKPSLY 123
Query: 454 GNQEEFGLLSLEYDE--NFCMGGPGVIMSRATLALVAPHIKYCLKNLYTT------HEDV 505
E + F GG G +SR ++P +T +D
Sbjct: 124 RPIEATERVKNNRKVGFWFATGGAGFCISRGLALKMSPWASGG--RFMSTSEKIRLPDDC 181
Query: 506 ELGRCVQKFAGIPCTWS 522
LG ++ G+P T S
Sbjct: 182 TLGYIIENLLGVPLTHS 198
Score = 28.4 bits (64), Expect = 8.8
Identities = 14/60 (23%), Positives = 22/60 (36%), Gaps = 8/60 (13%)
Query: 33 NFCMGGPGVIMSRATLALVAPHIKYCLKNLYTT------HEDVELGRCVQKFAGIPFLSS 86
F GG G +SR ++P +T +D LG ++ G+P S
Sbjct: 141 WFATGGAGFCISRGLALKMSPWASGG--RFMSTSEKIRLPDDCTLGYIIENLLGVPLTHS 198
>gnl|CDD|224126 COG1205, COG1205, Distinct helicase family with a unique C-terminal
domain including a metal-binding cysteine cluster
[General function prediction only].
Length = 851
Score = 33.1 bits (76), Expect = 0.38
Identities = 37/151 (24%), Positives = 47/151 (31%), Gaps = 35/151 (23%)
Query: 358 WARDIPGRVAFFTSESSTLPAARPDLPLVKLRGVDDSYPPQKKSFL-----MLQYMWERY 412
D+PG+V F T P R + +R PP L ML Y+ R
Sbjct: 138 LISDLPGKVTFGRYTGDTPPEERRAI----IRN-----PP---DILLTNPDMLHYLLLRN 185
Query: 413 GDKYEWFMRAD-----DDLYV-RG----------ERLARMLRSVDSSKPQFIG-QAGRGN 455
D + W +R D+L+ RG RL R LR S Q I A N
Sbjct: 186 HDAWLWLLRNLKYLVVDELHTYRGVQGSEVALLLRRLLRRLR-RYGSPLQIICTSATLAN 244
Query: 456 QEEFGLLSLEYDENFCMGGPGVIMSRATLAL 486
EF D + G
Sbjct: 245 PGEFAEELFGRDFEVPVDEDGSPRGLRYFVR 275
>gnl|CDD|200969 pfam00064, Neur, Neuraminidase. Neuraminidases cleave sialic acid
residues from glycoproteins. Belong to the sialidase
family - but this alignment does not generalise to the
other sialidases. Structure is a 6-sheet beta propeller.
Length = 468
Score = 32.4 bits (74), Expect = 0.58
Identities = 16/56 (28%), Positives = 25/56 (44%), Gaps = 8/56 (14%)
Query: 255 GWSSSAGEKGRGISDVGLS--ESGLLFVGVMTANSYLNTRATSVYDTWARDIPATQ 308
WSS+A G+ +G+S + TA R T ++WA++I TQ
Sbjct: 177 AWSSTACHDGKAWMTIGVSGPDDN------ATAVIKYGGRPTDTINSWAKNILRTQ 226
>gnl|CDD|131620 TIGR02569, TIGR02569_actnb, TIGR02569 family protein. This protein
family is found, so far, only in Actinobacteria,
including as least five species of Mycobacterium, three
of Corynebacterium, and Nocardia farcinica, always in a
single copy per genome. The function is unknown
[Hypothetical proteins, Conserved].
Length = 272
Score = 30.3 bits (68), Expect = 1.9
Identities = 13/55 (23%), Positives = 22/55 (40%), Gaps = 3/55 (5%)
Query: 414 DKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQEEFGLLSLEYDE 468
+ W R + L+ G R+ R +RS D +++ R + G DE
Sbjct: 43 ARSAWSARVRETLFPDGVRVVRPVRSTDG---RYVVAGWRADTFVAGTPEPRVDE 94
>gnl|CDD|133028 cd04185, GT_2_like_b, Subfamily of Glycosyltransferase Family GT2
of unknown function. GT-2 includes diverse families of
glycosyltransferases with a common GT-A type structural
fold, which has two tightly associated beta/alpha/beta
domains that tend to form a continuous central sheet of
at least eight beta-strands. These are enzymes that
catalyze the transfer of sugar moieties from activated
donor molecules to specific acceptor molecules, forming
glycosidic bonds. Glycosyltransferases have been
classified into more than 90 distinct sequence based
families.
Length = 202
Score = 29.5 bits (67), Expect = 3.0
Identities = 17/73 (23%), Positives = 26/73 (35%), Gaps = 15/73 (20%)
Query: 412 YGDKYEWFMRADDDLYVRGERLARMLRSVDSSKPQFIGQAGRGNQEEFGLLSLEYDENFC 471
Y Y+W DDD + L ++L D PQF+ L D +
Sbjct: 76 YELGYDWIWLMDDDAIPDPDALEKLLAYADKDNPQFLA-------------PLVLDPDGS 122
Query: 472 MGGPGVIMSRATL 484
G ++SR +
Sbjct: 123 FVGV--LISRRVV 133
>gnl|CDD|222183 pfam13506, Glyco_transf_21, Glycosyl transferase family 21. This
is a family of ceramide beta-glucosyltransferases -
EC:2.4.1.80.
Length = 171
Score = 29.5 bits (67), Expect = 3.1
Identities = 7/34 (20%), Positives = 15/34 (44%), Gaps = 1/34 (2%)
Query: 405 LQYMWERYGDKYEWFMRADDDLYVRGERLARMLR 438
L + +Y+ + +D D+ V + L +L
Sbjct: 22 LLQALDAK-ARYDLLVISDSDVRVPPDYLRELLA 54
>gnl|CDD|177594 PHA03356, PHA03356, tegument protein UL11; Provisional.
Length = 93
Score = 27.9 bits (62), Expect = 3.7
Identities = 15/41 (36%), Positives = 17/41 (41%)
Query: 163 SSCGRNFSVSPLSISQRQGPRPTPFQMRGPYSREGRRLSTG 203
S G NF +SP Q P P + GP SR R S
Sbjct: 40 SDAGGNFYISPELRVVTQPPGRQPPRPSGPPSRHTRHWSPS 80
>gnl|CDD|227152 COG4815, COG4815, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 145
Score = 28.3 bits (63), Expect = 6.3
Identities = 10/37 (27%), Positives = 16/37 (43%)
Query: 281 GVMTANSYLNTRATSVYDTWARDIPATQGQTDGWSSS 317
G+ T++ R VY +W + +T W SS
Sbjct: 103 GITTSDGQPGKRGFRVYPSWCEGLNKQARKTQAWQSS 139
>gnl|CDD|224940 COG2029, COG2029, Uncharacterized conserved protein [Function
unknown].
Length = 189
Score = 28.2 bits (63), Expect = 7.9
Identities = 31/113 (27%), Positives = 43/113 (38%), Gaps = 22/113 (19%)
Query: 380 RPDLPLVKLRGVDDSYPPQKKSFLMLQYMWERYGDKYEWFMRADDDLYVRGERLARML-- 437
PDL L LR Q+ +L+ + E+ G R DDLYV G +L +
Sbjct: 77 HPDLLLAYLR--------QRLLVAILKEVLEKLGVDLS---REGDDLYVEGRKLTVSIAT 125
Query: 438 RSVDSSKPQF---IGQAGRGNQEEFGLLSLEYDENFCMGGPGVIMSRATLALV 487
S SSK + G + + GL L Y + M R +A V
Sbjct: 126 VSPVSSKIHLGINVKTEGVPDVDAIGLEDLGYGD------ILEFMERVAVAYV 172
>gnl|CDD|226797 COG4347, COG4347, Predicted membrane protein [Function unknown].
Length = 200
Score = 28.2 bits (63), Expect = 8.1
Identities = 8/28 (28%), Positives = 11/28 (39%)
Query: 134 TKFKLGLWSPAIELFFLCPQDFAWASTW 161
T K GLW+ A+ L W +
Sbjct: 83 TLVKYGLWAVAMNLLVNAEGGDIWWMGY 110
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.136 0.416
Gapped
Lambda K H
0.267 0.0772 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,986,740
Number of extensions: 2595446
Number of successful extensions: 1715
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1710
Number of HSP's successfully gapped: 18
Length of query: 533
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 432
Effective length of database: 6,457,848
Effective search space: 2789790336
Effective search space used: 2789790336
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (27.2 bits)