RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= 020985
         (319 letters)



>gnl|CDD|238635 cd01310, TatD_DNAse, TatD like proteins;  E.coli TatD is a
           cytoplasmic protein, shown to have magnesium dependent
           DNase activity.
          Length = 251

 Score =  252 bits (647), Expect = 9e-84
 Identities = 97/297 (32%), Positives = 135/297 (45%), Gaps = 48/297 (16%)

Query: 6   LIDIAVNFTDGMFKGIYHGKQCHASDIATVLSRAWSSGVDRIIVTGGSLEESKEALAIAE 65
           LID   +     F           +D   VL+RA  +GV +IIV G  L+ SK AL +A+
Sbjct: 1   LIDTHCHLDFPQFD----------ADRDDVLARAREAGVIKIIVVGTDLKSSKRALELAK 50

Query: 66  TDGRLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDRLHF 125
               ++  VG+HP    E  +          L  LA       KVVAIGE GLDY R   
Sbjct: 51  KYDNVYAAVGLHPHDADEHVDE-----DLDLLELLA----ANPKVVAIGEIGLDYYRDKS 101

Query: 126 CPSEIQRKYFEKQFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFTGSAED 185
            P E+Q++ F  Q ELA    LP+ +H R+A  D   I++        GV H F+GSAE+
Sbjct: 102 -PREVQKEVFRAQLELAKELNLPVVIHSRDAHEDVLEILKEYGPPK-RGVFHCFSGSAEE 159

Query: 186 RDKLLTFNMYIGINGCSLKTA--ENLDVVRGIPIERMMIETDSPYCEIKNAHAGISFVKS 243
             +LL    YI I+G        E  +VV+ IP+ER+++ETDSPY               
Sbjct: 160 AKELLDLGFYISISGIVTFKNANELREVVKEIPLERLLLETDSPYLAPVPFR-------- 211

Query: 244 TWPSKKKEKYDQDSLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTCRVFF 300
                             RNEP  V+ V E +A  KGI  +++++     N  R+F 
Sbjct: 212 ----------------GKRNEPAYVKHVAEKIAELKGI-SVEEVAEVTTENAKRLFG 251


>gnl|CDD|223162 COG0084, TatD, Mg-dependent DNase [DNA replication, recombination,
           and repair].
          Length = 256

 Score =  216 bits (552), Expect = 2e-69
 Identities = 101/297 (34%), Positives = 139/297 (46%), Gaps = 46/297 (15%)

Query: 5   RLIDIAVNFTDGMFKGIYHGKQCHASDIATVLSRAWSSGVDRIIVTGGSLEESKEALAIA 64
            LID   +     F            D   V++RA  +GV +++V G  LE+ K AL +A
Sbjct: 2   MLIDTHCHLDFEEFDE----------DRDEVIARAREAGVKKMVVVGTDLEDFKRALELA 51

Query: 65  ETDGRLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDRLH 124
           E    ++  VGVHP    E       E+  + L  LA+      KVVAIGE GLDY    
Sbjct: 52  EKYPNVYAAVGVHPLDADE-----HSEEDLEELEQLAEH---HPKVVAIGEIGLDYYWDK 103

Query: 125 FCPSEIQRKYFEKQFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFTGSAE 184
               E Q + FE Q ELA    LP+ +H R+A  D   I++       GGV H F+GSAE
Sbjct: 104 EPDKERQEEVFEAQLELAKELNLPVIIHTRDAHEDTLEILKEEGAPV-GGVLHCFSGSAE 162

Query: 185 DRDKLLTFNMYIGINGCSL-KTAENL-DVVRGIPIERMMIETDSPYCEIKNAHAGISFVK 242
           +  KLL    YI I+G    K AE L +V R +P++R+++ETD+PY              
Sbjct: 163 EARKLLDLGFYISISGIVTFKNAEKLREVARELPLDRLLLETDAPYLAP----------- 211

Query: 243 STWPSKKKEKYDQDSLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTCRVF 299
              P + K           RNEP  VR V E +A  KGI   ++++     N  R+F
Sbjct: 212 --VPYRGK-----------RNEPAYVRHVAEKLAELKGI-SAEEVAEITTENAKRLF 254


>gnl|CDD|216250 pfam01026, TatD_DNase, TatD related DNase.  This family of proteins
           are related to a large superfamily of metalloenzymes.
           TatD, a member of this family has been shown
           experimentally to be a DNase enzyme.
          Length = 255

 Score =  192 bits (489), Expect = 5e-60
 Identities = 95/296 (32%), Positives = 135/296 (45%), Gaps = 45/296 (15%)

Query: 7   IDIAVNFTDGMFKGIYHGKQCHASDIATVLSRAWSSGVDRIIVTGGSLEESKEALAIAET 66
           ID   +     F            D   V+ RA  +GV  ++V G  LE+   AL +A  
Sbjct: 1   IDAHCHLDFKDFDE----------DRDEVIERAREAGVTAVVVVGTDLEDFLRALELARK 50

Query: 67  -DGRLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDRLHF 125
             G+++  VGVHP    E  E     +  +AL  L     E  KVVAIGE GLDY  +  
Sbjct: 51  YPGKVYAAVGVHPHEADEASE-----EVLEALEKLLL--AEHPKVVAIGEIGLDYYYVDE 103

Query: 126 CPSEIQRKYFEKQFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFTGSAED 185
            P E Q + F +Q ELA    LP+ +H R+A  D   I++         V H FTGS E 
Sbjct: 104 SPKEAQEEVFRRQLELAKELDLPVVIHTRDAEEDLLEILKEAGAPGLRVVLHCFTGSLET 163

Query: 186 RDKLLTFNMYIGINGCSL-KTAENL-DVVRGIPIERMMIETDSPYCEIKNAHAGISFVKS 243
             ++L    YI I+G    K A+ L +VV  IP++R+++ETD+PY              +
Sbjct: 164 AKEILDLGFYISISGIVTFKNAKKLREVVAEIPLDRLLVETDAPYL-------------A 210

Query: 244 TWPSKKKEKYDQDSLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTCRVF 299
             P + K           RNEP  V  V E +A  KGI   +++++    N  R+F
Sbjct: 211 PVPYRGK-----------RNEPAYVPYVAEKLAELKGI-SEEEVAKITTENAKRLF 254


>gnl|CDD|232780 TIGR00010, TIGR00010, hydrolase, TatD family.  PSI-BLAST, starting
           with a urease alpha subunit, finds a large superfamily
           of proteins, including a number of different enzymes
           that act as hydrolases at C-N bonds other than peptide
           bonds (EC 3.5.-.-), many uncharacterized proteins, and
           the members of this family. Several genomes have
           multiple paralogs related to this family. However, a set
           of 17 proteins can be found, one each from 17 of the
           first 20 genomes, such that each member forms a
           bidirectional best hit across genomes with all other
           members of the set. This core set (and one other
           near-perfect member), but not the other paralogs, form
           the seed for this model. Additionally, members of the
           seed alignment and all trusted hits, but not all
           paralogs, have a conserved motif DxHxH near the amino
           end. The member from E. coli was recently shown to have
           DNase activity [Unknown function, Enzymes of unknown
           specificity].
          Length = 252

 Score =  171 bits (436), Expect = 5e-52
 Identities = 94/283 (33%), Positives = 136/283 (48%), Gaps = 48/283 (16%)

Query: 27  CH------ASDIATVLSRAWSSGVDRIIVTGGSLEESKEALAIAETDGRLFCTVGVHPTR 80
           CH        D+  V+ RA ++GV  ++  G  LE+   AL +AE    ++  VGVHP  
Sbjct: 6   CHLDFLDFEEDVEEVIERAKAAGVTAVVAVGTDLEDFLRALELAEKYPNVYAAVGVHPL- 64

Query: 81  CKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDRLHFCPSEI--QRKYFEKQ 138
                   D +   +  +   +      KVVAIGE GLDY   +        Q + F  Q
Sbjct: 65  --------DVDDDTKEDIKELERLAAHPKVVAIGETGLDY---YKADEYKRRQEEVFRAQ 113

Query: 139 FELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFTGSAEDRDKLLTFNMYIGI 198
            +LA    LP+ +H R+A  D   I+   K +  GGV H FTG AE   KLL    YI I
Sbjct: 114 LQLAEELNLPVIIHARDAEEDVLDILREEKPKV-GGVLHCFTGDAELAKKLLDLGFYISI 172

Query: 199 NG-CSLKTAENL-DVVRGIPIERMMIETDSPYCEIKNAHAGISFVKSTWPSKKKEKYDQD 256
           +G  + K A++L +VVR IP+ER+++ETDSPY              +  P + K      
Sbjct: 173 SGIVTFKNAKSLREVVRKIPLERLLVETDSPYL-------------APVPYRGK------ 213

Query: 257 SLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTCRVF 299
                RNEP  VR  +E +A  KG+ D+++L++    N  R+F
Sbjct: 214 -----RNEPAFVRYTVEAIAEIKGM-DVEELAQITTKNAKRLF 250


>gnl|CDD|182449 PRK10425, PRK10425, DNase TatD; Provisional.
          Length = 258

 Score =  150 bits (381), Expect = 7e-44
 Identities = 94/303 (31%), Positives = 143/303 (47%), Gaps = 60/303 (19%)

Query: 8   DIAVNFTDGMFKGIYHGKQCHASDIATVLSRAWSSGVDRIIVTGGSLEESKEALAIAETD 67
           DI VN T   F          A D   V++RA+++GV+ +++TG +L ES++A  +A   
Sbjct: 3   DIGVNLTSSQF----------AKDRDDVVARAFAAGVNGMLITGTNLRESQQAQKLARQY 52

Query: 68  GRLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDRLHFCP 127
              + T GVHP    ++      E   +A++ LA +     +VVAIGECGLD++R +F  
Sbjct: 53  PSCWSTAGVHPHDSSQW--QAATE---EAIIELAAQP----EVVAIGECGLDFNR-NFST 102

Query: 128 SEIQRKYFEKQFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFTGSAEDRD 187
            E Q + F  Q  +A    +P+F+H R+A   F A++E   D+  G V H FTG+ E+  
Sbjct: 103 PEEQERAFVAQLAIAAELNMPVFMHCRDAHERFMALLEPWLDKLPGAVLHCFTGTREEMQ 162

Query: 188 KLLTFNMYIGING--CSLKTAENLDVVRG---------IPIERMMIETDSPYCEIKNAHA 236
             L   +YIGI G  C        D  RG         IP ER+++ETD+PY   ++   
Sbjct: 163 ACLARGLYIGITGWVC--------DERRGLELRELLPLIPAERLLLETDAPYLLPRDLT- 213

Query: 237 GISFVKSTWPSKKKEKYDQDSLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTC 296
                    P               RNEP  +  +L+ +A  +G  D   L+ T   N  
Sbjct: 214 ---------PKPASR----------RNEPAFLPHILQRIAHWRG-EDAAWLAATTDANAR 253

Query: 297 RVF 299
            +F
Sbjct: 254 TLF 256


>gnl|CDD|236767 PRK10812, PRK10812, putative DNAse; Provisional.
          Length = 265

 Score = 98.3 bits (245), Expect = 5e-24
 Identities = 75/241 (31%), Positives = 113/241 (46%), Gaps = 44/241 (18%)

Query: 63  IAETDGRLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDR 122
           + E D  +F + GVHP        + D     + L  LA E      VVA+GE GLDY  
Sbjct: 54  VGERDNVVF-SCGVHPL-------NQDEPYDVEELRRLAAEE----GVVAMGETGLDY-- 99

Query: 123 LHFCPS--EIQRKYFEKQFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFT 180
            ++ P     Q++ F    ++      P+ +H R+A AD  AI+   K    GGV H FT
Sbjct: 100 -YYTPETKVRQQESFRHHIQIGRELNKPVIVHTRDARADTLAILREEKVTDCGGVLHCFT 158

Query: 181 GSAEDRDKLLTFNMYIGING-CSLKTAENL-DVVRGIPIERMMIETDSPYCEIKNAHAGI 238
              E   KLL    YI  +G  + + AE L D  R +P++R+++ETDSPY          
Sbjct: 159 EDRETAGKLLDLGFYISFSGIVTFRNAEQLRDAARYVPLDRLLVETDSPYL--------- 209

Query: 239 SFVKSTWPSKKKEKYDQDSLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTCRV 298
               +  P + KE           N+P +VR V E +A  KG++ +++L++    N  R+
Sbjct: 210 ----APVPHRGKE-----------NQPAMVRDVAEYMAVLKGVS-VEELAQVTTDNFARL 253

Query: 299 F 299
           F
Sbjct: 254 F 254



 Score = 33.2 bits (76), Expect = 0.13
 Identities = 22/61 (36%), Positives = 31/61 (50%), Gaps = 8/61 (13%)

Query: 117 GLDYDRLHFCPSEIQRKYFEK--QFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGG 174
           GLDY  LH    ++  K   +  +F LA AT LP + HMR+       + ER+   F+ G
Sbjct: 12  GLDYQSLHKDVDDVLAKAAARDVKFCLAVATTLPGYRHMRD------LVGERDNVVFSCG 65

Query: 175 V 175
           V
Sbjct: 66  V 66


>gnl|CDD|171118 PRK11449, PRK11449, putative deoxyribonuclease YjjV; Provisional.
          Length = 258

 Score = 86.6 bits (214), Expect = 1e-19
 Identities = 79/299 (26%), Positives = 122/299 (40%), Gaps = 50/299 (16%)

Query: 5   RLIDIAVNFTDGMFKGIYHGKQCHASDIATVLSRAWSSGVDRIIVTGGSLEESKEALAIA 64
           R ID   +F    F G          D    L RA  +GV +IIV     E     LA+A
Sbjct: 4   RFIDTHCHFDFPPFSG----------DEEASLQRAAQAGVGKIIVPATEAENFARVLALA 53

Query: 65  ETDGRLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDY--DR 122
           E    L+  +G+HP    E       ++  QAL           KVVA+GE GLD   D 
Sbjct: 54  ERYQPLYAALGLHPGML-EKHSDVSLDQLQQALER------RPAKVVAVGEIGLDLFGDD 106

Query: 123 LHFCPSEIQRKYFEKQFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFTGS 182
             F   E Q+   ++Q +LA    LP+ LH R    D  A+  +  D    GV H F+GS
Sbjct: 107 PQF---ERQQWLLDEQLKLAKRYDLPVILHSRR-THDKLAMHLKRHDLPRTGVVHGFSGS 162

Query: 183 AEDRDKLLTFNMYIGINGCSL--KTAENLDVVRGIPIERMMIETDSPYCEIKNAHAGISF 240
            +  ++ +     IG+ G     + ++  DV+  +P+  +++ETD+P   +         
Sbjct: 163 LQQAERFVQLGYKIGVGGTITYPRASKTRDVIAKLPLASLLLETDAPDMPLNGFQG---- 218

Query: 241 VKSTWPSKKKEKYDQDSLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTCRVF 299
            +   P +    +  D L + R EP                   D+++  L +NT  +F
Sbjct: 219 -QPNRPEQAARVF--DVLCELRPEPA------------------DEIAEVLLNNTYTLF 256


>gnl|CDD|224024 COG1099, COG1099, Predicted metal-dependent hydrolases with the
           TIM-barrel fold [General function prediction only].
          Length = 254

 Score = 48.9 bits (117), Expect = 1e-06
 Identities = 25/84 (29%), Positives = 40/84 (47%), Gaps = 15/84 (17%)

Query: 69  RLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDRLHFCPS 128
           +L   VGVHP           P +  + L  L +E +    VVAIGE GL+        +
Sbjct: 65  KLKVAVGVHP--------RAIPPELEEVLEEL-EELLSNEDVVAIGEIGLEEA------T 109

Query: 129 EIQRKYFEKQFELAYATKLPMFLH 152
           + +++ F +Q ELA    +P+ +H
Sbjct: 110 DEEKEVFREQLELARELDVPVIVH 133


>gnl|CDD|224744 COG1831, COG1831, Predicted metal-dependent hydrolase (urease
           superfamily) [General function prediction only].
          Length = 285

 Score = 46.6 bits (111), Expect = 6e-06
 Identities = 32/114 (28%), Positives = 45/114 (39%), Gaps = 13/114 (11%)

Query: 65  ETDGRLFCTVGVHPTRCKEFEESG----DPEKHFQALLSLAKEGIEKGKVVAIGECGLDY 120
           E     +  VGVHP       E+G    +  +  +  L LA + +E+GK VAIGE G   
Sbjct: 73  EGPVEAYAVVGVHPAEVSRLAEAGRSPEEALEEMRHALELAAKLVEEGKAVAIGEVG--- 129

Query: 121 DRLHF-CPSEIQRKY---FEKQFELAYATKLPMFLHM-REAAADFCAIVERNKD 169
            R H+    E+        E   ELA      + LH        +  I E  K+
Sbjct: 130 -RPHYPVSEEVWEASNEVLEYAMELAKDVDCAVQLHTESLDEETYEEIAEMAKE 182


>gnl|CDD|235680 PRK06034, PRK06034, hypothetical protein; Provisional.
          Length = 279

 Score = 30.1 bits (68), Expect = 1.5
 Identities = 9/32 (28%), Positives = 12/32 (37%)

Query: 51  GGSLEESKEALAIAETDGRLFCTVGVHPTRCK 82
              L+   +AL +A    R    VG H  R  
Sbjct: 243 FVQLDTIADALRLAGASVRSTALVGSHAARYD 274


>gnl|CDD|226561 COG4075, COG4075, Uncharacterized conserved protein, homolog of
           nitrogen regulatory protein PII [Function unknown].
          Length = 110

 Score = 28.2 bits (63), Expect = 2.1
 Identities = 13/37 (35%), Positives = 20/37 (54%)

Query: 75  GVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVV 111
           GV P + K F +  DPE   +A+  L+ + +  G VV
Sbjct: 38  GVSPDKWKGFSKEEDPESAIKAIRDLSDKAVLIGTVV 74


>gnl|CDD|118654 pfam10126, Nit_Regul_Hom, Uncharacterized protein, homolog of
           nitrogen regulatory protein PII.  This domain, found in
           various hypothetical archaeal proteins, has no known
           function. It is distantly similar to the nitrogen
           regulatory protein PII.
          Length = 110

 Score = 28.2 bits (63), Expect = 2.2
 Identities = 11/43 (25%), Positives = 20/43 (46%)

Query: 75  GVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECG 117
           G+ P   K F    DPE   +A+  L+++ +    VV+  +  
Sbjct: 38  GMSPDDWKGFSLREDPEMAIKAIRDLSEDAVMINTVVSEEKVE 80


>gnl|CDD|214910 smart00917, LeuA_dimer, LeuA allosteric (dimerisation) domain.
           This is the C-terminal regulatory (R) domain of
           alpha-isopropylmalate synthase, which catalyses the
           first committed step in the leucine biosynthetic
           pathway. This domain, is an internally duplicated
           structure with a novel fold. It comprises two similar
           units that are arranged such that the two -helices pack
           together in the centre, crossing at an angle of 34
           degrees, sandwiched between the two three-stranded,
           antiparallel beta-sheets. The overall domain is thus
           constructed as a beta-alpha-beta three-layer sandwich.
          Length = 131

 Score = 28.2 bits (64), Expect = 2.8
 Identities = 9/30 (30%), Positives = 13/30 (43%), Gaps = 4/30 (13%)

Query: 53  SLEESKEALAIA----ETDGRLFCTVGVHP 78
           +L    +ALA      E  GR+   VG+  
Sbjct: 85  ALTGGTDALAEVYVELEYGGRIVWGVGIDT 114


>gnl|CDD|219915 pfam08578, DUF1765, Protein of unknown function (DUF1765).  This
           region represents a conserved region found in
           hypothetical proteins from fungi, mycetozoa and
           entamoebidae.
          Length = 123

 Score = 27.5 bits (62), Expect = 4.4
 Identities = 7/28 (25%), Positives = 15/28 (53%), Gaps = 4/28 (14%)

Query: 219 RMMIETDSPYCEIKNAHAGISFVKSTWP 246
           RMM+ +D     ++     ++F+ + WP
Sbjct: 56  RMMLNSDHAQSLLRA----LAFLFNIWP 79


>gnl|CDD|173112 PRK14649, PRK14649, UDP-N-acetylenolpyruvoylglucosamine reductase;
           Provisional.
          Length = 295

 Score = 27.5 bits (61), Expect = 8.4
 Identities = 11/20 (55%), Positives = 12/20 (60%)

Query: 21  IYHGKQCHASDIATVLSRAW 40
           IY    C+  D ATVL RAW
Sbjct: 123 IYGNAGCYGGDTATVLIRAW 142


>gnl|CDD|183846 PRK13020, PRK13020, riboflavin synthase subunit alpha; Provisional.
          Length = 206

 Score = 27.1 bits (61), Expect = 8.8
 Identities = 8/13 (61%), Positives = 10/13 (76%)

Query: 195 YIGINGCSLKTAE 207
           +IG+NGCSL   E
Sbjct: 138 FIGVNGCSLTVGE 150


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.137    0.410 

Gapped
Lambda     K      H
   0.267   0.0844    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,376,754
Number of extensions: 1581789
Number of successful extensions: 1436
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1403
Number of HSP's successfully gapped: 25
Length of query: 319
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 222
Effective length of database: 6,635,264
Effective search space: 1473028608
Effective search space used: 1473028608
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.3 bits)