RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6259
         (227 letters)



>gnl|CDD|214780 smart00702, P4Hc, Prolyl 4-hydroxylase alpha subunit homologues.
           Mammalian enzymes catalyse hydroxylation of collagen,
           for example. Prokaryotic enzymes might catalyse
           hydroxylation of antibiotic peptides. These are
           2-oxoglutarate-dependent dioxygenases, requiring
           2-oxoglutarate and dioxygen as cosubstrates and ferrous
           iron as a cofactor.
          Length = 165

 Score =  107 bits (270), Expect = 2e-29
 Identities = 50/170 (29%), Positives = 75/170 (44%), Gaps = 18/170 (10%)

Query: 57  YDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRI 114
             +E  +++E ++    RG+V             R S   +L          + +I+ R+
Sbjct: 1   SPAECQKLLEEAEPLGWRGEVTRGIGNPNETSQYRQSNGTWLELL--ERDLVIERIRQRL 58

Query: 115 QDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGA 174
            D   L+ G     +   Q+  YG GGHY  H D         R+A+F+ YL DVE GG 
Sbjct: 59  ADFLGLLAGLPLSAE-DAQVARYGPGGHYGPHVD---NFLYGDRIATFILYLNDVEEGGE 114

Query: 175 TIFPSLNL----TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +FP L L    TV P+KG  +F+ + H       R  H  CPV  G++W
Sbjct: 115 LVFPGLRLMVVATVKPKKGDLLFFPSGHG------RSLHGVCPVTRGSRW 158


>gnl|CDD|177683 PLN00052, PLN00052, prolyl 4-hydroxylase; Provisional.
          Length = 310

 Score = 70.9 bits (173), Expect = 2e-14
 Identities = 56/206 (27%), Positives = 90/206 (43%), Gaps = 31/206 (15%)

Query: 39  KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL 96
           +V+ +   PR+      + D+E + +++L+K K++R  V +   G ++  + R S   FL
Sbjct: 46  RVKAVSWQPRIFVYKGFLSDAECDHLVKLAKKKIQRSMVADNKSGKSVMSEVRTSSGMFL 105

Query: 97  YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPR 152
                   P + +I+ RI   T L     E     +QI  Y  G     H+D   D   +
Sbjct: 106 DKR---QDPVVSRIEERIAAWTFLPEENAEN----IQILRYEHGQKYEPHFDYFHDKINQ 158

Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
             G  R A+ + YL+ V+ GG T+FP+                    L V P KG AV +
Sbjct: 159 ALGGHRYATVLMYLSTVDKGGETVFPNAEGWENQPKDDTFSECAHKGLAVKPVKGDAVLF 218

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++ H + + D    H  CPV  G KW
Sbjct: 219 FSLHIDGVPDPLSLHGSCPVIEGEKW 244


>gnl|CDD|222280 pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily.  This
           family contains members of the 2-oxoglutarate (2OG) and
           Fe(II)-dependent oxygenase superfamily.
          Length = 93

 Score = 39.3 bits (92), Expect = 2e-04
 Identities = 18/96 (18%), Positives = 29/96 (30%), Gaps = 20/96 (20%)

Query: 132 LQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVEL---GGATIFPSLN-----LT 183
             ++ Y  GG    H D +       R  + + YL D      GG       +       
Sbjct: 1   AGLSRYRPGGFLGPHTDNSK----GERRVTLLLYLNDDWWPEEGGELELYPSDDGGVVRV 56

Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           + P+ G  V + +            H   PV  G +
Sbjct: 57  IAPKPGRLVLFRSG--------NSLHGVTPVGAGRR 84


>gnl|CDD|217403 pfam03171, 2OG-FeII_Oxy, 2OG-Fe(II) oxygenase superfamily.  This
           family contains members of the 2-oxoglutarate (2OG) and
           Fe(II)-dependent oxygenase superfamily. This family
           includes the C-terminal of prolyl 4-hydroxylase alpha
           subunit. The holoenzyme has the activity EC:1.14.11.2
           catalyzing the reaction: Procollagen L-proline +
           2-oxoglutarate + O2 <=> procollagen trans-
           4-hydroxy-L-proline + succinate + CO2. The full enzyme
           consists of a alpha2 beta2 complex with the alpha
           subunit contributing most of the parts of the active
           site. The family also includes lysyl hydrolases,
           isopenicillin synthases and AlkB.
          Length = 96

 Score = 31.6 bits (72), Expect = 0.075
 Identities = 15/94 (15%), Positives = 27/94 (28%), Gaps = 15/94 (15%)

Query: 132 LQINNYG--LGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
           L +N Y   LG     H D          + + +       L    +     + V P  G
Sbjct: 5   LLVNYYPPDLGLGLGPHTDD--------SIITILLQDGGGGL--QFLKDGGWIDVPPGPG 54

Query: 190 S-AVFWYNA--HANTLLDYRMYHSGCPVALGNKW 220
           +  V + +     +      + H   P   G + 
Sbjct: 55  ALVVNFGDLLLVLSNGRYKSVLHRVLPNTGGERI 88


>gnl|CDD|162262 TIGR01229, rocF_arginase, arginase.  This model helps resolve
           arginases from known and putative agmatinases,
           formiminoglutamases, and other related proteins of
           unknown specifity. The pathway from arginine to the
           polyamine putrescine may procede by hydrolysis to remove
           urea (arginase) followed by decarboxylation (ornithine
           decarboxylase), or by decarboxylation first (arginine
           decarboxylase) followed by removal of urea (agmatinase).
          Length = 300

 Score = 29.7 bits (67), Expect = 1.3
 Identities = 13/35 (37%), Positives = 17/35 (48%), Gaps = 3/35 (8%)

Query: 184 VFPEKGSAVFWYNAHA--NTLLDYRMYH-SGCPVA 215
           V P+K   V W +AHA  NT       +  G P+A
Sbjct: 106 VHPDKKLGVLWLDAHADINTPETSDSGNIHGMPLA 140


>gnl|CDD|179543 PRK03137, PRK03137, 1-pyrroline-5-carboxylate dehydrogenase;
           Provisional.
          Length = 514

 Score = 29.5 bits (67), Expect = 1.4
 Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 26/81 (32%)

Query: 48  RVVKIHDAIYDSEINRIIELSK----GKVER----GKVVNYG--DTI--YVDT-----RL 90
           R + +H+ +YD  + +++EL+K    G  E     G V+N    D I  Y++      RL
Sbjct: 325 RAI-VHEDVYDEVLEKVVELTKELTVGNPEDNAYMGPVINQASFDKIMSYIEIGKEEGRL 383

Query: 91  --------SKVYFLYPEIFGD 103
                   SK YF+ P IF D
Sbjct: 384 VLGGEGDDSKGYFIQPTIFAD 404


>gnl|CDD|144630 pfam01109, GM_CSF, Granulocyte-macrophage colony-stimulating
           factor. 
          Length = 122

 Score = 27.9 bits (62), Expect = 2.5
 Identities = 14/42 (33%), Positives = 21/42 (50%), Gaps = 4/42 (9%)

Query: 110 IQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP 151
           +QTR++     + G   + KGPL +    +  HY  HC  TP
Sbjct: 55  LQTRLKLYKQGLRGSLTKLKGPLTM----MASHYKQHCPPTP 92


>gnl|CDD|143602 cd07578, nitrilase_1_R1, First nitrilase domain of an
           uncharacterized subgroup of the nitrilase superfamily
           (putative class 13 nitrilases).  Members of this
           subgroup have two nitrilase domains. This is the first
           of those two domains. The nitrilase superfamily is
           comprised of nitrile- or amide-hydrolyzing enzymes and
           amide-condensing enzymes, which depend on a Glu-Lys-Cys
           catalytic triad. This superfamily has been classified in
           the literature based on global and structure based
           sequence analysis into thirteen different enzyme classes
           (referred to as 1-13). Class 13 represents proteins that
           at the time were difficult to place in a distinct
           similarity group; this subgroup represents either a new
           class or one that was included previously in class 13.
           Members of this superfamily generally form homomeric
           complexes, the basic building block of which is a
           homodimer.
          Length = 258

 Score = 28.3 bits (63), Expect = 3.5
 Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 14/53 (26%)

Query: 171 LGGATI--FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           LGGA +    S  L    E+  A +W N         R + +GC +   N+WG
Sbjct: 160 LGGADVICHISNWLA---ERTPAPYWIN---------RAFENGCYLIESNRWG 200


>gnl|CDD|143462 cd07144, ALDH_ALD2-YMR170C, Saccharomyces cerevisiae aldehyde
           dehydrogenase 2 (YMR170c)-like.  NAD(P)+-dependent
           Saccharomyces cerevisiae aldehyde dehydrogenase 2
           (YMR170c, ALD5, EC=1.2.1.5) and other similar sequences,
           are present in this CD.
          Length = 484

 Score = 28.1 bits (63), Expect = 3.8
 Identities = 33/98 (33%), Positives = 45/98 (45%), Gaps = 21/98 (21%)

Query: 34  KIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKV 93
           K+G    ++  + P+V K     YD  ++ I    KGK E  K+V YG     +  L K 
Sbjct: 316 KVGSPFDDDTVVGPQVSKTQ---YDRVLSYI---EKGKKEGAKLV-YGGEKAPE-GLGKG 367

Query: 94  YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP 131
           YF+ P IF D P         QDM    I +EE + GP
Sbjct: 368 YFIPPTIFTDVP---------QDMR---IVKEEIF-GP 392


>gnl|CDD|221808 pfam12851, Tet_JBP, Oxygenase domain of the 2OGFeDO superfamily.  A
           double-stranded beta helix (DSBH) fold domain of the
           2-oxoglutarate (2OG)-Fe(II)-dependent dioxygenase
           (2OGFeDO) superfamily found in various eukaryotes,
           bacteria and bacteriophages. Members of this family
           catalyze nucleic acid modifications, such as thymidine
           hydroxylation during base J synthesis in kinetoplastids,
           and the conversion of 5 methyl-cytosine (5-mC) to
           5-hydroxymethyl-cytosine (hmC), or further oxidation to
           5-formylcytosine (5fC) and 5-carboxylcytosine (5caC).
           Metazoan TET proteins contain a cysteine-rich region
           inserted into the core of the DSBH fold. Vertebrate TET
           proteins are oncogenes that are mutated in various
           myeloid cancers. Fungal and algal versions of this
           family are linked to a predicted transposase and show
           lineage-specific expansions.
          Length = 162

 Score = 26.9 bits (60), Expect = 6.2
 Identities = 28/125 (22%), Positives = 37/125 (29%), Gaps = 26/125 (20%)

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKG-----PLQIN-NYGLGGHYDLHCDATPRDEGLW 157
           +P LY       D     IG  +          + +N N     H D          GL 
Sbjct: 48  NPDLYNKL----DSLLEDIGNCKLLTEATPFSTVSVNKNRITALHRDKG---DFNGIGLL 100

Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALG 217
                  Y  D E GG    P L +   P+ G  +F+     N  +     H   PV   
Sbjct: 101 ----VTLYFGDFE-GGYLELPDLGVKFAPQPGDVLFF---AGNKYV-----HGVTPVTSS 147

Query: 218 NKWGK 222
           N  G 
Sbjct: 148 NGDGT 152


>gnl|CDD|226084 COG3554, COG3554, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 190

 Score = 27.1 bits (60), Expect = 6.5
 Identities = 13/37 (35%), Positives = 14/37 (37%), Gaps = 9/37 (24%)

Query: 140 GGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
           G  Y L CDA       WR   F     DV LG   +
Sbjct: 51  GARYRLQCDAG------WRTRRFG---VDVTLGERQL 78


>gnl|CDD|226274 COG3751, EGL-9, Predicted proline hydroxylase [Posttranslational
           modification, protein turnover, chaperones].
          Length = 252

 Score = 27.0 bits (60), Expect = 8.1
 Identities = 25/107 (23%), Positives = 37/107 (34%), Gaps = 15/107 (14%)

Query: 80  YGDTI--YVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
            GD       TR   +       F +         R       ++G  +  +   QI  Y
Sbjct: 91  SGDLKDELDPTRSPNLPVQRYLEFSE-------SIRFILGLPQLLGILDVSEVEGQITVY 143

Query: 138 GLGGHYDLHCDATPRDEGLWRLASFMFYLTD---VELGGA-TIFPSL 180
             G     H D     +   RLA++++YLT     E GG   +F SL
Sbjct: 144 NPGCFLLKHDDNGRDKDI--RLATYVYYLTREWKPEYGGELRLFHSL 188


>gnl|CDD|235772 PRK06290, PRK06290, aspartate aminotransferase; Provisional.
          Length = 410

 Score = 26.9 bits (60), Expect = 8.8
 Identities = 14/51 (27%), Positives = 24/51 (47%), Gaps = 5/51 (9%)

Query: 12  SVPEDIKSNLKCFYESY-NNTFLKIGPL----KVEELYLDPRVVKIHDAIY 57
           S+P+DIK   K  Y +Y NN    +       +V +   +  ++ + DA Y
Sbjct: 171 SIPKDIKEKAKLLYLNYPNNPTGAVATKEFYEEVVDFAKENNIIVVQDAAY 221


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.143    0.443 

Gapped
Lambda     K      H
   0.267   0.0691    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,782,669
Number of extensions: 1121075
Number of successful extensions: 906
Number of sequences better than 10.0: 1
Number of HSP's gapped: 899
Number of HSP's successfully gapped: 19
Length of query: 227
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 134
Effective length of database: 6,812,680
Effective search space: 912899120
Effective search space used: 912899120
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 57 (25.8 bits)