RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6259
(227 letters)
>gnl|CDD|214780 smart00702, P4Hc, Prolyl 4-hydroxylase alpha subunit homologues.
Mammalian enzymes catalyse hydroxylation of collagen,
for example. Prokaryotic enzymes might catalyse
hydroxylation of antibiotic peptides. These are
2-oxoglutarate-dependent dioxygenases, requiring
2-oxoglutarate and dioxygen as cosubstrates and ferrous
iron as a cofactor.
Length = 165
Score = 107 bits (270), Expect = 2e-29
Identities = 50/170 (29%), Positives = 75/170 (44%), Gaps = 18/170 (10%)
Query: 57 YDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRI 114
+E +++E ++ RG+V R S +L + +I+ R+
Sbjct: 1 SPAECQKLLEEAEPLGWRGEVTRGIGNPNETSQYRQSNGTWLELL--ERDLVIERIRQRL 58
Query: 115 QDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGA 174
D L+ G + Q+ YG GGHY H D R+A+F+ YL DVE GG
Sbjct: 59 ADFLGLLAGLPLSAE-DAQVARYGPGGHYGPHVD---NFLYGDRIATFILYLNDVEEGGE 114
Query: 175 TIFPSLNL----TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
+FP L L TV P+KG +F+ + H R H CPV G++W
Sbjct: 115 LVFPGLRLMVVATVKPKKGDLLFFPSGHG------RSLHGVCPVTRGSRW 158
>gnl|CDD|177683 PLN00052, PLN00052, prolyl 4-hydroxylase; Provisional.
Length = 310
Score = 70.9 bits (173), Expect = 2e-14
Identities = 56/206 (27%), Positives = 90/206 (43%), Gaps = 31/206 (15%)
Query: 39 KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL 96
+V+ + PR+ + D+E + +++L+K K++R V + G ++ + R S FL
Sbjct: 46 RVKAVSWQPRIFVYKGFLSDAECDHLVKLAKKKIQRSMVADNKSGKSVMSEVRTSSGMFL 105
Query: 97 YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPR 152
P + +I+ RI T L E +QI Y G H+D D +
Sbjct: 106 DKR---QDPVVSRIEERIAAWTFLPEENAEN----IQILRYEHGQKYEPHFDYFHDKINQ 158
Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
G R A+ + YL+ V+ GG T+FP+ L V P KG AV +
Sbjct: 159 ALGGHRYATVLMYLSTVDKGGETVFPNAEGWENQPKDDTFSECAHKGLAVKPVKGDAVLF 218
Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
++ H + + D H CPV G KW
Sbjct: 219 FSLHIDGVPDPLSLHGSCPVIEGEKW 244
>gnl|CDD|222280 pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily. This
family contains members of the 2-oxoglutarate (2OG) and
Fe(II)-dependent oxygenase superfamily.
Length = 93
Score = 39.3 bits (92), Expect = 2e-04
Identities = 18/96 (18%), Positives = 29/96 (30%), Gaps = 20/96 (20%)
Query: 132 LQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVEL---GGATIFPSLN-----LT 183
++ Y GG H D + R + + YL D GG +
Sbjct: 1 AGLSRYRPGGFLGPHTDNSK----GERRVTLLLYLNDDWWPEEGGELELYPSDDGGVVRV 56
Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
+ P+ G V + + H PV G +
Sbjct: 57 IAPKPGRLVLFRSG--------NSLHGVTPVGAGRR 84
>gnl|CDD|217403 pfam03171, 2OG-FeII_Oxy, 2OG-Fe(II) oxygenase superfamily. This
family contains members of the 2-oxoglutarate (2OG) and
Fe(II)-dependent oxygenase superfamily. This family
includes the C-terminal of prolyl 4-hydroxylase alpha
subunit. The holoenzyme has the activity EC:1.14.11.2
catalyzing the reaction: Procollagen L-proline +
2-oxoglutarate + O2 <=> procollagen trans-
4-hydroxy-L-proline + succinate + CO2. The full enzyme
consists of a alpha2 beta2 complex with the alpha
subunit contributing most of the parts of the active
site. The family also includes lysyl hydrolases,
isopenicillin synthases and AlkB.
Length = 96
Score = 31.6 bits (72), Expect = 0.075
Identities = 15/94 (15%), Positives = 27/94 (28%), Gaps = 15/94 (15%)
Query: 132 LQINNYG--LGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
L +N Y LG H D + + + L + + V P G
Sbjct: 5 LLVNYYPPDLGLGLGPHTDD--------SIITILLQDGGGGL--QFLKDGGWIDVPPGPG 54
Query: 190 S-AVFWYNA--HANTLLDYRMYHSGCPVALGNKW 220
+ V + + + + H P G +
Sbjct: 55 ALVVNFGDLLLVLSNGRYKSVLHRVLPNTGGERI 88
>gnl|CDD|162262 TIGR01229, rocF_arginase, arginase. This model helps resolve
arginases from known and putative agmatinases,
formiminoglutamases, and other related proteins of
unknown specifity. The pathway from arginine to the
polyamine putrescine may procede by hydrolysis to remove
urea (arginase) followed by decarboxylation (ornithine
decarboxylase), or by decarboxylation first (arginine
decarboxylase) followed by removal of urea (agmatinase).
Length = 300
Score = 29.7 bits (67), Expect = 1.3
Identities = 13/35 (37%), Positives = 17/35 (48%), Gaps = 3/35 (8%)
Query: 184 VFPEKGSAVFWYNAHA--NTLLDYRMYH-SGCPVA 215
V P+K V W +AHA NT + G P+A
Sbjct: 106 VHPDKKLGVLWLDAHADINTPETSDSGNIHGMPLA 140
>gnl|CDD|179543 PRK03137, PRK03137, 1-pyrroline-5-carboxylate dehydrogenase;
Provisional.
Length = 514
Score = 29.5 bits (67), Expect = 1.4
Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 26/81 (32%)
Query: 48 RVVKIHDAIYDSEINRIIELSK----GKVER----GKVVNYG--DTI--YVDT-----RL 90
R + +H+ +YD + +++EL+K G E G V+N D I Y++ RL
Sbjct: 325 RAI-VHEDVYDEVLEKVVELTKELTVGNPEDNAYMGPVINQASFDKIMSYIEIGKEEGRL 383
Query: 91 --------SKVYFLYPEIFGD 103
SK YF+ P IF D
Sbjct: 384 VLGGEGDDSKGYFIQPTIFAD 404
>gnl|CDD|144630 pfam01109, GM_CSF, Granulocyte-macrophage colony-stimulating
factor.
Length = 122
Score = 27.9 bits (62), Expect = 2.5
Identities = 14/42 (33%), Positives = 21/42 (50%), Gaps = 4/42 (9%)
Query: 110 IQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP 151
+QTR++ + G + KGPL + + HY HC TP
Sbjct: 55 LQTRLKLYKQGLRGSLTKLKGPLTM----MASHYKQHCPPTP 92
>gnl|CDD|143602 cd07578, nitrilase_1_R1, First nitrilase domain of an
uncharacterized subgroup of the nitrilase superfamily
(putative class 13 nitrilases). Members of this
subgroup have two nitrilase domains. This is the first
of those two domains. The nitrilase superfamily is
comprised of nitrile- or amide-hydrolyzing enzymes and
amide-condensing enzymes, which depend on a Glu-Lys-Cys
catalytic triad. This superfamily has been classified in
the literature based on global and structure based
sequence analysis into thirteen different enzyme classes
(referred to as 1-13). Class 13 represents proteins that
at the time were difficult to place in a distinct
similarity group; this subgroup represents either a new
class or one that was included previously in class 13.
Members of this superfamily generally form homomeric
complexes, the basic building block of which is a
homodimer.
Length = 258
Score = 28.3 bits (63), Expect = 3.5
Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 14/53 (26%)
Query: 171 LGGATI--FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
LGGA + S L E+ A +W N R + +GC + N+WG
Sbjct: 160 LGGADVICHISNWLA---ERTPAPYWIN---------RAFENGCYLIESNRWG 200
>gnl|CDD|143462 cd07144, ALDH_ALD2-YMR170C, Saccharomyces cerevisiae aldehyde
dehydrogenase 2 (YMR170c)-like. NAD(P)+-dependent
Saccharomyces cerevisiae aldehyde dehydrogenase 2
(YMR170c, ALD5, EC=1.2.1.5) and other similar sequences,
are present in this CD.
Length = 484
Score = 28.1 bits (63), Expect = 3.8
Identities = 33/98 (33%), Positives = 45/98 (45%), Gaps = 21/98 (21%)
Query: 34 KIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKV 93
K+G ++ + P+V K YD ++ I KGK E K+V YG + L K
Sbjct: 316 KVGSPFDDDTVVGPQVSKTQ---YDRVLSYI---EKGKKEGAKLV-YGGEKAPE-GLGKG 367
Query: 94 YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP 131
YF+ P IF D P QDM I +EE + GP
Sbjct: 368 YFIPPTIFTDVP---------QDMR---IVKEEIF-GP 392
>gnl|CDD|221808 pfam12851, Tet_JBP, Oxygenase domain of the 2OGFeDO superfamily. A
double-stranded beta helix (DSBH) fold domain of the
2-oxoglutarate (2OG)-Fe(II)-dependent dioxygenase
(2OGFeDO) superfamily found in various eukaryotes,
bacteria and bacteriophages. Members of this family
catalyze nucleic acid modifications, such as thymidine
hydroxylation during base J synthesis in kinetoplastids,
and the conversion of 5 methyl-cytosine (5-mC) to
5-hydroxymethyl-cytosine (hmC), or further oxidation to
5-formylcytosine (5fC) and 5-carboxylcytosine (5caC).
Metazoan TET proteins contain a cysteine-rich region
inserted into the core of the DSBH fold. Vertebrate TET
proteins are oncogenes that are mutated in various
myeloid cancers. Fungal and algal versions of this
family are linked to a predicted transposase and show
lineage-specific expansions.
Length = 162
Score = 26.9 bits (60), Expect = 6.2
Identities = 28/125 (22%), Positives = 37/125 (29%), Gaps = 26/125 (20%)
Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKG-----PLQIN-NYGLGGHYDLHCDATPRDEGLW 157
+P LY D IG + + +N N H D GL
Sbjct: 48 NPDLYNKL----DSLLEDIGNCKLLTEATPFSTVSVNKNRITALHRDKG---DFNGIGLL 100
Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALG 217
Y D E GG P L + P+ G +F+ N + H PV
Sbjct: 101 ----VTLYFGDFE-GGYLELPDLGVKFAPQPGDVLFF---AGNKYV-----HGVTPVTSS 147
Query: 218 NKWGK 222
N G
Sbjct: 148 NGDGT 152
>gnl|CDD|226084 COG3554, COG3554, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 190
Score = 27.1 bits (60), Expect = 6.5
Identities = 13/37 (35%), Positives = 14/37 (37%), Gaps = 9/37 (24%)
Query: 140 GGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
G Y L CDA WR F DV LG +
Sbjct: 51 GARYRLQCDAG------WRTRRFG---VDVTLGERQL 78
>gnl|CDD|226274 COG3751, EGL-9, Predicted proline hydroxylase [Posttranslational
modification, protein turnover, chaperones].
Length = 252
Score = 27.0 bits (60), Expect = 8.1
Identities = 25/107 (23%), Positives = 37/107 (34%), Gaps = 15/107 (14%)
Query: 80 YGDTI--YVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
GD TR + F + R ++G + + QI Y
Sbjct: 91 SGDLKDELDPTRSPNLPVQRYLEFSE-------SIRFILGLPQLLGILDVSEVEGQITVY 143
Query: 138 GLGGHYDLHCDATPRDEGLWRLASFMFYLTD---VELGGA-TIFPSL 180
G H D + RLA++++YLT E GG +F SL
Sbjct: 144 NPGCFLLKHDDNGRDKDI--RLATYVYYLTREWKPEYGGELRLFHSL 188
>gnl|CDD|235772 PRK06290, PRK06290, aspartate aminotransferase; Provisional.
Length = 410
Score = 26.9 bits (60), Expect = 8.8
Identities = 14/51 (27%), Positives = 24/51 (47%), Gaps = 5/51 (9%)
Query: 12 SVPEDIKSNLKCFYESY-NNTFLKIGPL----KVEELYLDPRVVKIHDAIY 57
S+P+DIK K Y +Y NN + +V + + ++ + DA Y
Sbjct: 171 SIPKDIKEKAKLLYLNYPNNPTGAVATKEFYEEVVDFAKENNIIVVQDAAY 221
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.143 0.443
Gapped
Lambda K H
0.267 0.0691 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,782,669
Number of extensions: 1121075
Number of successful extensions: 906
Number of sequences better than 10.0: 1
Number of HSP's gapped: 899
Number of HSP's successfully gapped: 19
Length of query: 227
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 134
Effective length of database: 6,812,680
Effective search space: 912899120
Effective search space used: 912899120
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 57 (25.8 bits)