RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1575
(242 letters)
>gnl|CDD|225661 COG3119, AslA, Arylsulfatase A and related enzymes [Inorganic ion
transport and metabolism].
Length = 475
Score = 110 bits (276), Expect = 6e-28
Identities = 58/182 (31%), Positives = 78/182 (42%), Gaps = 12/182 (6%)
Query: 22 QGWNDVG-FHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGI-D 78
G+ D+G + G PTPNID LA G+ YT P C PSRAA LTG+YPFR G+
Sbjct: 15 LGYGDLGAYGGPVVGPTPNIDRLAAEGVRFTNAYTTSPCCGPSRAALLTGRYPFRTGVGG 74
Query: 79 TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
G +P L + LKE GY T L GKWH+G E+ + GFD G+ G
Sbjct: 75 NAEPPGYPGGLPDEVPTLAELLKEAGYYTALFGKWHLGEKDEDPAGGDHGFDEFYGFLGG 134
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLT----DFFTDQSVHVIKSHNHSRPL 194
V + ++ + + Y+ D+ K +P
Sbjct: 135 LTD-----EWYPELVDVPPPGDVPEFDQEEGDPYVAGKDSADLADRFRRQAKEDAPDKPP 189
Query: 195 FL 196
FL
Sbjct: 190 FL 191
>gnl|CDD|216172 pfam00884, Sulfatase, Sulfatase.
Length = 332
Score = 96.0 bits (239), Expect = 2e-23
Identities = 50/196 (25%), Positives = 74/196 (37%), Gaps = 24/196 (12%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
D+G +G TP +D LA G++ + Y PSR A LTG P +G
Sbjct: 11 LRAPDLGLYGYPRPTTPFLDRLAEEGLLFSNFYSGGTLTAPSRFALLTGLPPHNFGSGVS 70
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G+ + TE LP LK GY+T IGKWH+ + + N GFD G G
Sbjct: 71 TPIGLPR----TEPSLPDLLKRAGYNTGAIGKWHLSWYNRQSVYKNLGFDKFFGRNTGED 126
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ E S + D+++ + N+ +P FL +
Sbjct: 127 LY----------------KDPEDVGYNCSGGGVSDEALLDEALEFLD--NNDKPFFLVLH 168
Query: 200 HAAVHTGTAGNAKLPT 215
H + P
Sbjct: 169 TMGSHGPPYYPDRYPE 184
>gnl|CDD|237491 PRK13759, PRK13759, arylsulfatase; Provisional.
Length = 485
Score = 65.8 bits (161), Expect = 2e-12
Identities = 54/202 (26%), Positives = 86/202 (42%), Gaps = 37/202 (18%)
Query: 28 GFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPVGAG-- 84
G +G + TPN+D LA G Y+ +P+CTP+RAA LTG + +G VG G
Sbjct: 23 GCNGNKAVETPNLDMLASEGYNFENAYSAVPSCTPARAALLTGLSQWHHGR---VGYGDV 79
Query: 85 VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYND 144
V T LPQ ++ GY T IGK H+ + L GF N + + +GYL
Sbjct: 80 VPWNYKNT---LPQEFRDAGYYTQCIGKMHVFPQRNLL-----GFHNVLLH-DGYLHSGR 130
Query: 145 SIHETDFAVGLDARRNMERYAPQMSSKYL----------------------TDFFTDQSV 182
+ ++ F D + AP T++ +S+
Sbjct: 131 NEDKSQFDFVSDYLAWLREKAPGKDPDLTDIGWDCNSWVARPWDLEERLHPTNWVGSESI 190
Query: 183 HVIKSHNHSRPLFLQITHAAVH 204
++ + ++P FL+++ A H
Sbjct: 191 EFLRRRDPTKPFFLKMSFARPH 212
>gnl|CDD|234202 TIGR03417, chol_sulfatase, choline-sulfatase.
Length = 500
Score = 52.4 bits (126), Expect = 6e-08
Identities = 29/88 (32%), Positives = 37/88 (42%), Gaps = 24/88 (27%)
Query: 37 TPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGI---------DTPVGAGVA 86
PN+ LA +V + Y P C PSRA+F++G+ P R G D P A
Sbjct: 29 APNLKRLAARSVVFDNAYCASPLCAPSRASFMSGQLPSRTGAYDNAAEFASDIPTYA--- 85
Query: 87 KAVPVTEKLLPQYLKELGYSTHLIGKWH 114
YL+ GY T L GK H
Sbjct: 86 -----------HYLRRAGYRTALSGKMH 102
>gnl|CDD|216635 pfam01663, Phosphodiest, Type I phosphodiesterase / nucleotide
pyrophosphatase. This family consists of
phosphodiesterases, including human plasma-cell
membrane glycoprotein PC-1 / alkaline phosphodiesterase
i / nucleotide pyrophosphatase (nppase). These enzymes
catalyze the cleavage of phosphodiester and
phosphosulfate bonds in NAD, deoxynucleotides and
nucleotide sugars. Also in this family is ATX an
autotaxin, tumour cell motility-stimulating protein
which exhibits type I phosphodiesterases activity. The
alignment encompasses the active site. Also present
with in this family is 60-kDa Ca2+-ATPase form F.
odoratum.
Length = 342
Score = 32.4 bits (74), Expect = 0.16
Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 2/43 (4%)
Query: 37 TPNIDALAYNGIVLNRHYT-LPTCT-PSRAAFLTGKYPFRYGI 77
TPN+ ALA G+ PT T P+ +TG YP +GI
Sbjct: 21 TPNLAALAKEGVSAPYLTPVFPTLTFPNHYTIVTGLYPGSHGI 63
>gnl|CDD|237444 PRK13607, PRK13607, proline dipeptidase; Provisional.
Length = 443
Score = 32.2 bits (74), Expect = 0.25
Identities = 12/32 (37%), Positives = 15/32 (46%), Gaps = 2/32 (6%)
Query: 30 HGENDIPTPNIDALAYNGIVLNRHYTLPTCTP 61
+ND+P NI AL + VL HYT
Sbjct: 207 QRDNDVPYGNIVALNEHAAVL--HYTKLDHQA 236
>gnl|CDD|224441 COG1524, COG1524, Uncharacterized proteins of the AP superfamily
[General function prediction only].
Length = 450
Score = 31.0 bits (70), Expect = 0.56
Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 2/43 (4%)
Query: 37 TPNIDALAYNGIVLNRHYT-LPTCT-PSRAAFLTGKYPFRYGI 77
P + +LA NG+ + + PT T P +TG YP +GI
Sbjct: 61 LPFLSSLAENGVHVAELISVFPTTTRPRHTTLITGSYPDEHGI 103
>gnl|CDD|216284 pfam01074, Glyco_hydro_38, Glycosyl hydrolases family 38 N-terminal
domain. Glycosyl hydrolases are key enzymes of
carbohydrate metabolism.
Length = 269
Score = 28.7 bits (65), Expect = 2.3
Identities = 19/88 (21%), Positives = 31/88 (35%), Gaps = 18/88 (20%)
Query: 96 LPQYLKELGYSTHLIGK--WHIGCNKEELLPFNRGFDNHVGYWNGY-----LTYNDSIHE 148
LPQ LK+ G L + W+ +K + P + W G LT+
Sbjct: 129 LPQILKQAGIDYFLTQRLHWN---DKNKFNP------HLEFIWRGPDGSEILTHMLPFDY 179
Query: 149 TDFAVGLDAR-RNMERYAPQMSSKYLTD 175
G R ++ A + + K T+
Sbjct: 180 -YPTYGAQFRADDLLDQAKKYADKTRTN 206
>gnl|CDD|181775 PRK09314, PRK09314, bifunctional 3,4-dihydroxy-2-butanone
4-phosphate synthase/GTP cyclohydrolase II protein;
Provisional.
Length = 339
Score = 27.6 bits (62), Expect = 7.0
Identities = 9/16 (56%), Positives = 11/16 (68%)
Query: 143 NDSIHETDFAVGLDAR 158
N S HET F V +DA+
Sbjct: 77 NTSNHETAFTVSIDAK 92
>gnl|CDD|219582 pfam07796, DUF1638, Protein of unknown function (DUF1638). This
family contains sequences covering an approximately 270
amino acid stretch of a group of hypothetical proteins.
These proteins are expressed by archaeal species of the
Methanosarcina genus.
Length = 166
Score = 26.8 bits (60), Expect = 7.1
Identities = 21/80 (26%), Positives = 26/80 (32%), Gaps = 17/80 (21%)
Query: 162 ERYAPQMSSK---YLTDFFTDQ-SVHVIK-----SHNHSRPLFLQITHAAVHTGTAGNAK 212
YA + YLT + Q VIK H R ++ H
Sbjct: 72 AFYAELLREPGTFYLTPMWARQWDAFVIKRLGLDRHPELRDMYFG------HYRKV--VY 123
Query: 213 LPTGLLQVPDMEENDRTFAH 232
+ TGL Q D E R FA
Sbjct: 124 IDTGLYQTDDFEAKAREFAD 143
>gnl|CDD|218931 pfam06189, 5-nucleotidase, 5'-nucleotidase. This family consists
of both eukaryotic and prokaryotic 5'-nucleotidase
sequences (EC:3.1.3.5).
Length = 263
Score = 27.1 bits (61), Expect = 7.2
Identities = 11/24 (45%), Positives = 14/24 (58%), Gaps = 3/24 (12%)
Query: 52 RHYTLPTCTPSRAAFLTGKYPFRY 75
HY L +RAAF G+ P+RY
Sbjct: 57 NHYGLDI---TRAAFTGGESPYRY 77
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.138 0.431
Gapped
Lambda K H
0.267 0.0721 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,890,041
Number of extensions: 1213244
Number of successful extensions: 922
Number of sequences better than 10.0: 1
Number of HSP's gapped: 914
Number of HSP's successfully gapped: 20
Length of query: 242
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 148
Effective length of database: 6,768,326
Effective search space: 1001712248
Effective search space used: 1001712248
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 58 (26.1 bits)