RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= 020985
(319 letters)
>gnl|CDD|238635 cd01310, TatD_DNAse, TatD like proteins; E.coli TatD is a
cytoplasmic protein, shown to have magnesium dependent
DNase activity.
Length = 251
Score = 252 bits (647), Expect = 9e-84
Identities = 97/297 (32%), Positives = 135/297 (45%), Gaps = 48/297 (16%)
Query: 6 LIDIAVNFTDGMFKGIYHGKQCHASDIATVLSRAWSSGVDRIIVTGGSLEESKEALAIAE 65
LID + F +D VL+RA +GV +IIV G L+ SK AL +A+
Sbjct: 1 LIDTHCHLDFPQFD----------ADRDDVLARAREAGVIKIIVVGTDLKSSKRALELAK 50
Query: 66 TDGRLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDRLHF 125
++ VG+HP E + L LA KVVAIGE GLDY R
Sbjct: 51 KYDNVYAAVGLHPHDADEHVDE-----DLDLLELLA----ANPKVVAIGEIGLDYYRDKS 101
Query: 126 CPSEIQRKYFEKQFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFTGSAED 185
P E+Q++ F Q ELA LP+ +H R+A D I++ GV H F+GSAE+
Sbjct: 102 -PREVQKEVFRAQLELAKELNLPVVIHSRDAHEDVLEILKEYGPPK-RGVFHCFSGSAEE 159
Query: 186 RDKLLTFNMYIGINGCSLKTA--ENLDVVRGIPIERMMIETDSPYCEIKNAHAGISFVKS 243
+LL YI I+G E +VV+ IP+ER+++ETDSPY
Sbjct: 160 AKELLDLGFYISISGIVTFKNANELREVVKEIPLERLLLETDSPYLAPVPFR-------- 211
Query: 244 TWPSKKKEKYDQDSLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTCRVFF 300
RNEP V+ V E +A KGI +++++ N R+F
Sbjct: 212 ----------------GKRNEPAYVKHVAEKIAELKGI-SVEEVAEVTTENAKRLFG 251
>gnl|CDD|223162 COG0084, TatD, Mg-dependent DNase [DNA replication, recombination,
and repair].
Length = 256
Score = 216 bits (552), Expect = 2e-69
Identities = 101/297 (34%), Positives = 139/297 (46%), Gaps = 46/297 (15%)
Query: 5 RLIDIAVNFTDGMFKGIYHGKQCHASDIATVLSRAWSSGVDRIIVTGGSLEESKEALAIA 64
LID + F D V++RA +GV +++V G LE+ K AL +A
Sbjct: 2 MLIDTHCHLDFEEFDE----------DRDEVIARAREAGVKKMVVVGTDLEDFKRALELA 51
Query: 65 ETDGRLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDRLH 124
E ++ VGVHP E E+ + L LA+ KVVAIGE GLDY
Sbjct: 52 EKYPNVYAAVGVHPLDADE-----HSEEDLEELEQLAEH---HPKVVAIGEIGLDYYWDK 103
Query: 125 FCPSEIQRKYFEKQFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFTGSAE 184
E Q + FE Q ELA LP+ +H R+A D I++ GGV H F+GSAE
Sbjct: 104 EPDKERQEEVFEAQLELAKELNLPVIIHTRDAHEDTLEILKEEGAPV-GGVLHCFSGSAE 162
Query: 185 DRDKLLTFNMYIGINGCSL-KTAENL-DVVRGIPIERMMIETDSPYCEIKNAHAGISFVK 242
+ KLL YI I+G K AE L +V R +P++R+++ETD+PY
Sbjct: 163 EARKLLDLGFYISISGIVTFKNAEKLREVARELPLDRLLLETDAPYLAP----------- 211
Query: 243 STWPSKKKEKYDQDSLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTCRVF 299
P + K RNEP VR V E +A KGI ++++ N R+F
Sbjct: 212 --VPYRGK-----------RNEPAYVRHVAEKLAELKGI-SAEEVAEITTENAKRLF 254
>gnl|CDD|216250 pfam01026, TatD_DNase, TatD related DNase. This family of proteins
are related to a large superfamily of metalloenzymes.
TatD, a member of this family has been shown
experimentally to be a DNase enzyme.
Length = 255
Score = 192 bits (489), Expect = 5e-60
Identities = 95/296 (32%), Positives = 135/296 (45%), Gaps = 45/296 (15%)
Query: 7 IDIAVNFTDGMFKGIYHGKQCHASDIATVLSRAWSSGVDRIIVTGGSLEESKEALAIAET 66
ID + F D V+ RA +GV ++V G LE+ AL +A
Sbjct: 1 IDAHCHLDFKDFDE----------DRDEVIERAREAGVTAVVVVGTDLEDFLRALELARK 50
Query: 67 -DGRLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDRLHF 125
G+++ VGVHP E E + +AL L E KVVAIGE GLDY +
Sbjct: 51 YPGKVYAAVGVHPHEADEASE-----EVLEALEKLLL--AEHPKVVAIGEIGLDYYYVDE 103
Query: 126 CPSEIQRKYFEKQFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFTGSAED 185
P E Q + F +Q ELA LP+ +H R+A D I++ V H FTGS E
Sbjct: 104 SPKEAQEEVFRRQLELAKELDLPVVIHTRDAEEDLLEILKEAGAPGLRVVLHCFTGSLET 163
Query: 186 RDKLLTFNMYIGINGCSL-KTAENL-DVVRGIPIERMMIETDSPYCEIKNAHAGISFVKS 243
++L YI I+G K A+ L +VV IP++R+++ETD+PY +
Sbjct: 164 AKEILDLGFYISISGIVTFKNAKKLREVVAEIPLDRLLVETDAPYL-------------A 210
Query: 244 TWPSKKKEKYDQDSLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTCRVF 299
P + K RNEP V V E +A KGI +++++ N R+F
Sbjct: 211 PVPYRGK-----------RNEPAYVPYVAEKLAELKGI-SEEEVAKITTENAKRLF 254
>gnl|CDD|232780 TIGR00010, TIGR00010, hydrolase, TatD family. PSI-BLAST, starting
with a urease alpha subunit, finds a large superfamily
of proteins, including a number of different enzymes
that act as hydrolases at C-N bonds other than peptide
bonds (EC 3.5.-.-), many uncharacterized proteins, and
the members of this family. Several genomes have
multiple paralogs related to this family. However, a set
of 17 proteins can be found, one each from 17 of the
first 20 genomes, such that each member forms a
bidirectional best hit across genomes with all other
members of the set. This core set (and one other
near-perfect member), but not the other paralogs, form
the seed for this model. Additionally, members of the
seed alignment and all trusted hits, but not all
paralogs, have a conserved motif DxHxH near the amino
end. The member from E. coli was recently shown to have
DNase activity [Unknown function, Enzymes of unknown
specificity].
Length = 252
Score = 171 bits (436), Expect = 5e-52
Identities = 94/283 (33%), Positives = 136/283 (48%), Gaps = 48/283 (16%)
Query: 27 CH------ASDIATVLSRAWSSGVDRIIVTGGSLEESKEALAIAETDGRLFCTVGVHPTR 80
CH D+ V+ RA ++GV ++ G LE+ AL +AE ++ VGVHP
Sbjct: 6 CHLDFLDFEEDVEEVIERAKAAGVTAVVAVGTDLEDFLRALELAEKYPNVYAAVGVHPL- 64
Query: 81 CKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDRLHFCPSEI--QRKYFEKQ 138
D + + + + KVVAIGE GLDY + Q + F Q
Sbjct: 65 --------DVDDDTKEDIKELERLAAHPKVVAIGETGLDY---YKADEYKRRQEEVFRAQ 113
Query: 139 FELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFTGSAEDRDKLLTFNMYIGI 198
+LA LP+ +H R+A D I+ K + GGV H FTG AE KLL YI I
Sbjct: 114 LQLAEELNLPVIIHARDAEEDVLDILREEKPKV-GGVLHCFTGDAELAKKLLDLGFYISI 172
Query: 199 NG-CSLKTAENL-DVVRGIPIERMMIETDSPYCEIKNAHAGISFVKSTWPSKKKEKYDQD 256
+G + K A++L +VVR IP+ER+++ETDSPY + P + K
Sbjct: 173 SGIVTFKNAKSLREVVRKIPLERLLVETDSPYL-------------APVPYRGK------ 213
Query: 257 SLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTCRVF 299
RNEP VR +E +A KG+ D+++L++ N R+F
Sbjct: 214 -----RNEPAFVRYTVEAIAEIKGM-DVEELAQITTKNAKRLF 250
>gnl|CDD|182449 PRK10425, PRK10425, DNase TatD; Provisional.
Length = 258
Score = 150 bits (381), Expect = 7e-44
Identities = 94/303 (31%), Positives = 143/303 (47%), Gaps = 60/303 (19%)
Query: 8 DIAVNFTDGMFKGIYHGKQCHASDIATVLSRAWSSGVDRIIVTGGSLEESKEALAIAETD 67
DI VN T F A D V++RA+++GV+ +++TG +L ES++A +A
Sbjct: 3 DIGVNLTSSQF----------AKDRDDVVARAFAAGVNGMLITGTNLRESQQAQKLARQY 52
Query: 68 GRLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDRLHFCP 127
+ T GVHP ++ E +A++ LA + +VVAIGECGLD++R +F
Sbjct: 53 PSCWSTAGVHPHDSSQW--QAATE---EAIIELAAQP----EVVAIGECGLDFNR-NFST 102
Query: 128 SEIQRKYFEKQFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFTGSAEDRD 187
E Q + F Q +A +P+F+H R+A F A++E D+ G V H FTG+ E+
Sbjct: 103 PEEQERAFVAQLAIAAELNMPVFMHCRDAHERFMALLEPWLDKLPGAVLHCFTGTREEMQ 162
Query: 188 KLLTFNMYIGING--CSLKTAENLDVVRG---------IPIERMMIETDSPYCEIKNAHA 236
L +YIGI G C D RG IP ER+++ETD+PY ++
Sbjct: 163 ACLARGLYIGITGWVC--------DERRGLELRELLPLIPAERLLLETDAPYLLPRDLT- 213
Query: 237 GISFVKSTWPSKKKEKYDQDSLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTC 296
P RNEP + +L+ +A +G D L+ T N
Sbjct: 214 ---------PKPASR----------RNEPAFLPHILQRIAHWRG-EDAAWLAATTDANAR 253
Query: 297 RVF 299
+F
Sbjct: 254 TLF 256
>gnl|CDD|236767 PRK10812, PRK10812, putative DNAse; Provisional.
Length = 265
Score = 98.3 bits (245), Expect = 5e-24
Identities = 75/241 (31%), Positives = 113/241 (46%), Gaps = 44/241 (18%)
Query: 63 IAETDGRLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDR 122
+ E D +F + GVHP + D + L LA E VVA+GE GLDY
Sbjct: 54 VGERDNVVF-SCGVHPL-------NQDEPYDVEELRRLAAEE----GVVAMGETGLDY-- 99
Query: 123 LHFCPS--EIQRKYFEKQFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFT 180
++ P Q++ F ++ P+ +H R+A AD AI+ K GGV H FT
Sbjct: 100 -YYTPETKVRQQESFRHHIQIGRELNKPVIVHTRDARADTLAILREEKVTDCGGVLHCFT 158
Query: 181 GSAEDRDKLLTFNMYIGING-CSLKTAENL-DVVRGIPIERMMIETDSPYCEIKNAHAGI 238
E KLL YI +G + + AE L D R +P++R+++ETDSPY
Sbjct: 159 EDRETAGKLLDLGFYISFSGIVTFRNAEQLRDAARYVPLDRLLVETDSPYL--------- 209
Query: 239 SFVKSTWPSKKKEKYDQDSLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTCRV 298
+ P + KE N+P +VR V E +A KG++ +++L++ N R+
Sbjct: 210 ----APVPHRGKE-----------NQPAMVRDVAEYMAVLKGVS-VEELAQVTTDNFARL 253
Query: 299 F 299
F
Sbjct: 254 F 254
Score = 33.2 bits (76), Expect = 0.13
Identities = 22/61 (36%), Positives = 31/61 (50%), Gaps = 8/61 (13%)
Query: 117 GLDYDRLHFCPSEIQRKYFEK--QFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGG 174
GLDY LH ++ K + +F LA AT LP + HMR+ + ER+ F+ G
Sbjct: 12 GLDYQSLHKDVDDVLAKAAARDVKFCLAVATTLPGYRHMRD------LVGERDNVVFSCG 65
Query: 175 V 175
V
Sbjct: 66 V 66
>gnl|CDD|171118 PRK11449, PRK11449, putative deoxyribonuclease YjjV; Provisional.
Length = 258
Score = 86.6 bits (214), Expect = 1e-19
Identities = 79/299 (26%), Positives = 122/299 (40%), Gaps = 50/299 (16%)
Query: 5 RLIDIAVNFTDGMFKGIYHGKQCHASDIATVLSRAWSSGVDRIIVTGGSLEESKEALAIA 64
R ID +F F G D L RA +GV +IIV E LA+A
Sbjct: 4 RFIDTHCHFDFPPFSG----------DEEASLQRAAQAGVGKIIVPATEAENFARVLALA 53
Query: 65 ETDGRLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDY--DR 122
E L+ +G+HP E ++ QAL KVVA+GE GLD D
Sbjct: 54 ERYQPLYAALGLHPGML-EKHSDVSLDQLQQALER------RPAKVVAVGEIGLDLFGDD 106
Query: 123 LHFCPSEIQRKYFEKQFELAYATKLPMFLHMREAAADFCAIVERNKDRFTGGVTHSFTGS 182
F E Q+ ++Q +LA LP+ LH R D A+ + D GV H F+GS
Sbjct: 107 PQF---ERQQWLLDEQLKLAKRYDLPVILHSRR-THDKLAMHLKRHDLPRTGVVHGFSGS 162
Query: 183 AEDRDKLLTFNMYIGINGCSL--KTAENLDVVRGIPIERMMIETDSPYCEIKNAHAGISF 240
+ ++ + IG+ G + ++ DV+ +P+ +++ETD+P +
Sbjct: 163 LQQAERFVQLGYKIGVGGTITYPRASKTRDVIAKLPLASLLLETDAPDMPLNGFQG---- 218
Query: 241 VKSTWPSKKKEKYDQDSLVKGRNEPCLVRQVLEVVAGCKGINDIDQLSRTLYHNTCRVF 299
+ P + + D L + R EP D+++ L +NT +F
Sbjct: 219 -QPNRPEQAARVF--DVLCELRPEPA------------------DEIAEVLLNNTYTLF 256
>gnl|CDD|224024 COG1099, COG1099, Predicted metal-dependent hydrolases with the
TIM-barrel fold [General function prediction only].
Length = 254
Score = 48.9 bits (117), Expect = 1e-06
Identities = 25/84 (29%), Positives = 40/84 (47%), Gaps = 15/84 (17%)
Query: 69 RLFCTVGVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECGLDYDRLHFCPS 128
+L VGVHP P + + L L +E + VVAIGE GL+ +
Sbjct: 65 KLKVAVGVHP--------RAIPPELEEVLEEL-EELLSNEDVVAIGEIGLEEA------T 109
Query: 129 EIQRKYFEKQFELAYATKLPMFLH 152
+ +++ F +Q ELA +P+ +H
Sbjct: 110 DEEKEVFREQLELARELDVPVIVH 133
>gnl|CDD|224744 COG1831, COG1831, Predicted metal-dependent hydrolase (urease
superfamily) [General function prediction only].
Length = 285
Score = 46.6 bits (111), Expect = 6e-06
Identities = 32/114 (28%), Positives = 45/114 (39%), Gaps = 13/114 (11%)
Query: 65 ETDGRLFCTVGVHPTRCKEFEESG----DPEKHFQALLSLAKEGIEKGKVVAIGECGLDY 120
E + VGVHP E+G + + + L LA + +E+GK VAIGE G
Sbjct: 73 EGPVEAYAVVGVHPAEVSRLAEAGRSPEEALEEMRHALELAAKLVEEGKAVAIGEVG--- 129
Query: 121 DRLHF-CPSEIQRKY---FEKQFELAYATKLPMFLHM-REAAADFCAIVERNKD 169
R H+ E+ E ELA + LH + I E K+
Sbjct: 130 -RPHYPVSEEVWEASNEVLEYAMELAKDVDCAVQLHTESLDEETYEEIAEMAKE 182
>gnl|CDD|235680 PRK06034, PRK06034, hypothetical protein; Provisional.
Length = 279
Score = 30.1 bits (68), Expect = 1.5
Identities = 9/32 (28%), Positives = 12/32 (37%)
Query: 51 GGSLEESKEALAIAETDGRLFCTVGVHPTRCK 82
L+ +AL +A R VG H R
Sbjct: 243 FVQLDTIADALRLAGASVRSTALVGSHAARYD 274
>gnl|CDD|226561 COG4075, COG4075, Uncharacterized conserved protein, homolog of
nitrogen regulatory protein PII [Function unknown].
Length = 110
Score = 28.2 bits (63), Expect = 2.1
Identities = 13/37 (35%), Positives = 20/37 (54%)
Query: 75 GVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVV 111
GV P + K F + DPE +A+ L+ + + G VV
Sbjct: 38 GVSPDKWKGFSKEEDPESAIKAIRDLSDKAVLIGTVV 74
>gnl|CDD|118654 pfam10126, Nit_Regul_Hom, Uncharacterized protein, homolog of
nitrogen regulatory protein PII. This domain, found in
various hypothetical archaeal proteins, has no known
function. It is distantly similar to the nitrogen
regulatory protein PII.
Length = 110
Score = 28.2 bits (63), Expect = 2.2
Identities = 11/43 (25%), Positives = 20/43 (46%)
Query: 75 GVHPTRCKEFEESGDPEKHFQALLSLAKEGIEKGKVVAIGECG 117
G+ P K F DPE +A+ L+++ + VV+ +
Sbjct: 38 GMSPDDWKGFSLREDPEMAIKAIRDLSEDAVMINTVVSEEKVE 80
>gnl|CDD|214910 smart00917, LeuA_dimer, LeuA allosteric (dimerisation) domain.
This is the C-terminal regulatory (R) domain of
alpha-isopropylmalate synthase, which catalyses the
first committed step in the leucine biosynthetic
pathway. This domain, is an internally duplicated
structure with a novel fold. It comprises two similar
units that are arranged such that the two -helices pack
together in the centre, crossing at an angle of 34
degrees, sandwiched between the two three-stranded,
antiparallel beta-sheets. The overall domain is thus
constructed as a beta-alpha-beta three-layer sandwich.
Length = 131
Score = 28.2 bits (64), Expect = 2.8
Identities = 9/30 (30%), Positives = 13/30 (43%), Gaps = 4/30 (13%)
Query: 53 SLEESKEALAIA----ETDGRLFCTVGVHP 78
+L +ALA E GR+ VG+
Sbjct: 85 ALTGGTDALAEVYVELEYGGRIVWGVGIDT 114
>gnl|CDD|219915 pfam08578, DUF1765, Protein of unknown function (DUF1765). This
region represents a conserved region found in
hypothetical proteins from fungi, mycetozoa and
entamoebidae.
Length = 123
Score = 27.5 bits (62), Expect = 4.4
Identities = 7/28 (25%), Positives = 15/28 (53%), Gaps = 4/28 (14%)
Query: 219 RMMIETDSPYCEIKNAHAGISFVKSTWP 246
RMM+ +D ++ ++F+ + WP
Sbjct: 56 RMMLNSDHAQSLLRA----LAFLFNIWP 79
>gnl|CDD|173112 PRK14649, PRK14649, UDP-N-acetylenolpyruvoylglucosamine reductase;
Provisional.
Length = 295
Score = 27.5 bits (61), Expect = 8.4
Identities = 11/20 (55%), Positives = 12/20 (60%)
Query: 21 IYHGKQCHASDIATVLSRAW 40
IY C+ D ATVL RAW
Sbjct: 123 IYGNAGCYGGDTATVLIRAW 142
>gnl|CDD|183846 PRK13020, PRK13020, riboflavin synthase subunit alpha; Provisional.
Length = 206
Score = 27.1 bits (61), Expect = 8.8
Identities = 8/13 (61%), Positives = 10/13 (76%)
Query: 195 YIGINGCSLKTAE 207
+IG+NGCSL E
Sbjct: 138 FIGVNGCSLTVGE 150
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.137 0.410
Gapped
Lambda K H
0.267 0.0844 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,376,754
Number of extensions: 1581789
Number of successful extensions: 1436
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1403
Number of HSP's successfully gapped: 25
Length of query: 319
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 222
Effective length of database: 6,635,264
Effective search space: 1473028608
Effective search space used: 1473028608
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.3 bits)