RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4448
         (487 letters)



>gnl|CDD|238635 cd01310, TatD_DNAse, TatD like proteins;  E.coli TatD is a
           cytoplasmic protein, shown to have magnesium dependent
           DNase activity.
          Length = 251

 Score =  194 bits (495), Expect = 6e-59
 Identities = 88/265 (33%), Positives = 124/265 (46%), Gaps = 58/265 (21%)

Query: 3   YIDIGANLKDAMYEGFYSSKNQKHEPDIDHVLNRAWNAGLEKIIVTGTNVEDSISSLKLA 62
            ID   +L    ++            D D VL RA  AG+ KIIV GT+++ S  +L+LA
Sbjct: 1   LIDTHCHLDFPQFD-----------ADRDDVLARAREAGVIKIIVVGTDLKSSKRALELA 49

Query: 63  QSDERLYSTVGCHPTRCSEFENDPEGYLQSLDKIIKEGGKKVVAFGEFGLDYDRVQYCPV 122
           +  + +Y+ VG HP    E  ++    L  L+ +      KVVA GE GLDY R +  P 
Sbjct: 50  KKYDNVYAAVGLHPHDADEHVDE---DLDLLELLAAN--PKVVAIGEIGLDYYRDKS-PR 103

Query: 123 ETQLKYFRKQLDLSVTHKLPLFLHCRNAKSDFIEIMKEYAPKLPRKGVIHSFDGTPFQA- 181
           E Q + FR QL+L+    LP+ +H R+A  D +EI+KEY    P++GV H F G+  +A 
Sbjct: 104 EVQKEVFRAQLELAKELNLPVVIHSRDAHEDVLEILKEY--GPPKRGVFHCFSGSAEEAK 161

Query: 182 --VD-----SL-------KTKENLETVKSIPEDRLLLETDCPWCEVKPSHAGFAYIRTQH 227
             +D     S+          E  E VK IP +RLLLETD P+    P            
Sbjct: 162 ELLDLGFYISISGIVTFKNANELREVVKEIPLERLLLETDSPYLAPVPFR---------- 211

Query: 228 EKVKKEQWKPDKMVKSRNEPANIVY 252
                           RNEPA + +
Sbjct: 212 --------------GKRNEPAYVKH 222



 Score =  133 bits (337), Expect = 5e-36
 Identities = 54/183 (29%), Positives = 79/183 (43%), Gaps = 41/183 (22%)

Query: 238 DKMVKSRNEPANIVYFRKQLDLSVTHKLPLFLHCRNAKSDFIEIMKEYAPKLPRKGVIHS 297
           D              FR QL+L+    LP+ +H R+A  D +EI+KEY    P++GV H 
Sbjct: 95  DYYRDKSPREVQKEVFRAQLELAKELNLPVVIHSRDAHEDVLEILKEY--GPPKRGVFHC 152

Query: 298 FDGTPFQA---VD-----SL-------KTKENLETVKSIPEDRLLLETDCPWCEVKPSHA 342
           F G+  +A   +D     S+          E  E VK IP +RLLLETD P+    P   
Sbjct: 153 FSGSAEEAKELLDLGFYISISGIVTFKNANELREVVKEIPLERLLLETDSPYLAPVPFR- 211

Query: 343 GFAYIRTQHEKVKKEQWKPDKMVKSRNEPANIVQILEIVAAVRGVEREKLGPIIHQNTLR 402
                                    RNEPA +  + E +A ++G+  E++  +  +N  R
Sbjct: 212 -----------------------GKRNEPAYVKHVAEKIAELKGISVEEVAEVTTENAKR 248

Query: 403 LFF 405
           LF 
Sbjct: 249 LFG 251



 Score = 49.9 bits (120), Expect = 8e-07
 Identities = 22/59 (37%), Positives = 29/59 (49%), Gaps = 6/59 (10%)

Query: 428 RLYSTVGCHPTRCSEFENDPEGYLQSLDKIIKEGGKKVVAFGEFGLDYDRVQYCPVETQ 486
            +Y+ VG HP    E  ++    L  L+ +      KVVA GE GLDY R +  P E Q
Sbjct: 54  NVYAAVGLHPHDADEHVDE---DLDLLELLAAN--PKVVAIGEIGLDYYRDKS-PREVQ 106


>gnl|CDD|223162 COG0084, TatD, Mg-dependent DNase [DNA replication, recombination,
           and repair].
          Length = 256

 Score =  177 bits (452), Expect = 2e-52
 Identities = 80/267 (29%), Positives = 122/267 (45%), Gaps = 56/267 (20%)

Query: 1   MKYIDIGANLKDAMYEGFYSSKNQKHEPDIDHVLNRAWNAGLEKIIVTGTNVEDSISSLK 60
           M  ID   +L    ++            D D V+ RA  AG++K++V GT++ED   +L+
Sbjct: 1   MMLIDTHCHLDFEEFDE-----------DRDEVIARAREAGVKKMVVVGTDLEDFKRALE 49

Query: 61  LAQSDERLYSTVGCHPTRCSEFENDPEGYLQSLDKIIKEGGKKVVAFGEFGLDYDRVQYC 120
           LA+    +Y+ VG HP    + +   E  L+ L+++ +    KVVA GE GLDY   +  
Sbjct: 50  LAEKYPNVYAAVGVHPL---DADEHSEEDLEELEQLAEHH-PKVVAIGEIGLDYYWDKEP 105

Query: 121 PVETQLKYFRKQLDLSVTHKLPLFLHCRNAKSDFIEIMKEYAPKLPRKGVIHSFDGTPFQ 180
             E Q + F  QL+L+    LP+ +H R+A  D +EI+KE     P  GV+H F G+  +
Sbjct: 106 DKERQEEVFEAQLELAKELNLPVIIHTRDAHEDTLEILKEE--GAPVGGVLHCFSGSAEE 163

Query: 181 AVDSLKT---------------KENLETVKSIPEDRLLLETDCPWCEVKPSHAGFAYIRT 225
           A   L                 ++  E  + +P DRLLLETD P+    P + G      
Sbjct: 164 ARKLLDLGFYISISGIVTFKNAEKLREVARELPLDRLLLETDAPYLAPVP-YRG------ 216

Query: 226 QHEKVKKEQWKPDKMVKSRNEPANIVY 252
                             RNEPA + +
Sbjct: 217 -----------------KRNEPAYVRH 226



 Score =  121 bits (305), Expect = 2e-31
 Identities = 49/170 (28%), Positives = 74/170 (43%), Gaps = 41/170 (24%)

Query: 250 IVYFRKQLDLSVTHKLPLFLHCRNAKSDFIEIMKEYAPKLPRKGVIHSFDGTPFQAVDSL 309
              F  QL+L+    LP+ +H R+A  D +EI+KE     P  GV+H F G+  +A   L
Sbjct: 111 EEVFEAQLELAKELNLPVIIHTRDAHEDTLEILKEE--GAPVGGVLHCFSGSAEEARKLL 168

Query: 310 KT---------------KENLETVKSIPEDRLLLETDCPWCEVKPSHAGFAYIRTQHEKV 354
                            ++  E  + +P DRLLLETD P+    P + G           
Sbjct: 169 DLGFYISISGIVTFKNAEKLREVARELPLDRLLLETDAPYLAPVP-YRG----------- 216

Query: 355 KKEQWKPDKMVKSRNEPANIVQILEIVAAVRGVEREKLGPIIHQNTLRLF 404
                        RNEPA +  + E +A ++G+  E++  I  +N  RLF
Sbjct: 217 ------------KRNEPAYVRHVAEKLAELKGISAEEVAEITTENAKRLF 254



 Score = 47.6 bits (114), Expect = 5e-06
 Identities = 20/60 (33%), Positives = 30/60 (50%), Gaps = 4/60 (6%)

Query: 427 ERLYSTVGCHPTRCSEFENDPEGYLQSLDKIIKEGGKKVVAFGEFGLDYDRVQYCPVETQ 486
             +Y+ VG HP    + +   E  L+ L+++ +    KVVA GE GLDY   +    E Q
Sbjct: 55  PNVYAAVGVHPL---DADEHSEEDLEELEQLAEHH-PKVVAIGEIGLDYYWDKEPDKERQ 110


>gnl|CDD|216250 pfam01026, TatD_DNase, TatD related DNase.  This family of proteins
           are related to a large superfamily of metalloenzymes.
           TatD, a member of this family has been shown
           experimentally to be a DNase enzyme.
          Length = 255

 Score =  156 bits (396), Expect = 2e-44
 Identities = 79/242 (32%), Positives = 111/242 (45%), Gaps = 44/242 (18%)

Query: 27  EPDIDHVLNRAWNAGLEKIIVTGTNVEDSISSLKLA-QSDERLYSTVGCHPTRCSEFEND 85
           + D D V+ RA  AG+  ++V GT++ED + +L+LA +   ++Y+ VG HP    E   +
Sbjct: 13  DEDRDEVIERAREAGVTAVVVVGTDLEDFLRALELARKYPGKVYAAVGVHPHEADEASEE 72

Query: 86  PEGYLQSLDKIIKEGGKKVVAFGEFGLDYDRVQYCPVETQLKYFRKQLDLSVTHKLPLFL 145
               L+ L         KVVA GE GLDY  V   P E Q + FR+QL+L+    LP+ +
Sbjct: 73  VLEALEKLLLAEHP---KVVAIGEIGLDYYYVDESPKEAQEEVFRRQLELAKELDLPVVI 129

Query: 146 HCRNAKSDFIEIMKEYAPKLPRKGVIHSFDGTPFQA---------------VDSLKTKEN 190
           H R+A+ D +EI+KE      R  V+H F G+   A               V     K+ 
Sbjct: 130 HTRDAEEDLLEILKEAGAPGLR-VVLHCFTGSLETAKEILDLGFYISISGIVTFKNAKKL 188

Query: 191 LETVKSIPEDRLLLETDCPWCEVKPSHAGFAYIRTQHEKVKKEQWKPDKMVKSRNEPANI 250
            E V  IP DRLL+ETD P+                          P      RNEPA +
Sbjct: 189 REVVAEIPLDRLLVETDAPYL------------------------APVPYRGKRNEPAYV 224

Query: 251 VY 252
            Y
Sbjct: 225 PY 226



 Score =  109 bits (274), Expect = 4e-27
 Identities = 50/170 (29%), Positives = 73/170 (42%), Gaps = 40/170 (23%)

Query: 250 IVYFRKQLDLSVTHKLPLFLHCRNAKSDFIEIMKEYAPKLPRKGVIHSFDGTPFQA---- 305
              FR+QL+L+    LP+ +H R+A+ D +EI+KE      R  V+H F G+   A    
Sbjct: 110 EEVFRRQLELAKELDLPVVIHTRDAEEDLLEILKEAGAPGLR-VVLHCFTGSLETAKEIL 168

Query: 306 -----------VDSLKTKENLETVKSIPEDRLLLETDCPWCEVKPSHAGFAYIRTQHEKV 354
                      V     K+  E V  IP DRLL+ETD P+                    
Sbjct: 169 DLGFYISISGIVTFKNAKKLREVVAEIPLDRLLVETDAPYL------------------- 209

Query: 355 KKEQWKPDKMVKSRNEPANIVQILEIVAAVRGVEREKLGPIIHQNTLRLF 404
                 P      RNEPA +  + E +A ++G+  E++  I  +N  RLF
Sbjct: 210 -----APVPYRGKRNEPAYVPYVAEKLAELKGISEEEVAKITTENAKRLF 254



 Score = 51.9 bits (125), Expect = 2e-07
 Identities = 24/76 (31%), Positives = 30/76 (39%), Gaps = 3/76 (3%)

Query: 411 TPTRFNTCVSFAPYEIERLYSTVGCHPTRCSEFENDPEGYLQSLDKIIKEGGKKVVAFGE 470
               F   +  A     ++Y+ VG HP    E   +    L+ L         KVVA GE
Sbjct: 37  DLEDFLRALELARKYPGKVYAAVGVHPHEADEASEEVLEALEKLLLAEHP---KVVAIGE 93

Query: 471 FGLDYDRVQYCPVETQ 486
            GLDY  V   P E Q
Sbjct: 94  IGLDYYYVDESPKEAQ 109


>gnl|CDD|182449 PRK10425, PRK10425, DNase TatD; Provisional.
          Length = 258

 Score =  127 bits (322), Expect = 7e-34
 Identities = 71/222 (31%), Positives = 109/222 (49%), Gaps = 34/222 (15%)

Query: 5   DIGANLKDAMYEGFYSSKNQKHEPDIDHVLNRAWNAGLEKIIVTGTNVEDSISSLKLAQS 64
           DIG NL  + +             D D V+ RA+ AG+  +++TGTN+ +S  + KLA+ 
Sbjct: 3   DIGVNLTSSQFAK-----------DRDDVVARAFAAGVNGMLITGTNLRESQQAQKLARQ 51

Query: 65  DERLYSTVGCHPTRCSEFENDPEGYLQSLDKIIKEGGKKVVAFGEFGLDYDRVQYCPVET 124
               +ST G HP   S+++   E  +  L         +VVA GE GLD++R    P E 
Sbjct: 52  YPSCWSTAGVHPHDSSQWQAATEEAIIEL-----AAQPEVVAIGECGLDFNRNFSTPEE- 105

Query: 125 QLKYFRKQLDLSVTHKLPLFLHCRNAKSDFIEIMKEYAPKLPRKGVIHSFDGTPFQAVDS 184
           Q + F  QL ++    +P+F+HCR+A   F+ +++ +  KLP   V+H F GT  +    
Sbjct: 106 QERAFVAQLAIAAELNMPVFMHCRDAHERFMALLEPWLDKLP-GAVLHCFTGTREEMQAC 164

Query: 185 L----------------KTKENLETVKSIPEDRLLLETDCPW 210
           L                +  E  E +  IP +RLLLETD P+
Sbjct: 165 LARGLYIGITGWVCDERRGLELRELLPLIPAERLLLETDAPY 206



 Score = 81.3 bits (201), Expect = 3e-17
 Identities = 49/172 (28%), Positives = 72/172 (41%), Gaps = 45/172 (26%)

Query: 253 FRKQLDLSVTHKLPLFLHCRNAKSDFIEIMKEYAPKLPRKGVIHSFDGTPFQAVDSL--- 309
           F  QL ++    +P+F+HCR+A   F+ +++ +  KLP   V+H F GT  +    L   
Sbjct: 110 FVAQLAIAAELNMPVFMHCRDAHERFMALLEPWLDKLP-GAVLHCFTGTREEMQACLARG 168

Query: 310 -------------KTKENLETVKSIPEDRLLLETDCPWC---EVKPSHAGFAYIRTQHEK 353
                        +  E  E +  IP +RLLLETD P+    ++ P              
Sbjct: 169 LYIGITGWVCDERRGLELRELLPLIPAERLLLETDAPYLLPRDLTP-------------- 214

Query: 354 VKKEQWKPDKMVKS-RNEPANIVQILEIVAAVRGVEREKLGPIIHQNTLRLF 404
                 KP     S RNEPA +  IL+ +A  RG +   L      N   LF
Sbjct: 215 ------KP----ASRRNEPAFLPHILQRIAHWRGEDAAWLAATTDANARTLF 256



 Score = 33.5 bits (77), Expect = 0.17
 Identities = 17/48 (35%), Positives = 25/48 (52%), Gaps = 5/48 (10%)

Query: 430 YSTVGCHPTRCSEFENDPEGYLQSLDKIIKEGGKKVVAFGEFGLDYDR 477
           +ST G HP   S+++   E  +  L         +VVA GE GLD++R
Sbjct: 56  WSTAGVHPHDSSQWQAATEEAIIEL-----AAQPEVVAIGECGLDFNR 98


>gnl|CDD|232780 TIGR00010, TIGR00010, hydrolase, TatD family.  PSI-BLAST, starting
           with a urease alpha subunit, finds a large superfamily
           of proteins, including a number of different enzymes
           that act as hydrolases at C-N bonds other than peptide
           bonds (EC 3.5.-.-), many uncharacterized proteins, and
           the members of this family. Several genomes have
           multiple paralogs related to this family. However, a set
           of 17 proteins can be found, one each from 17 of the
           first 20 genomes, such that each member forms a
           bidirectional best hit across genomes with all other
           members of the set. This core set (and one other
           near-perfect member), but not the other paralogs, form
           the seed for this model. Additionally, members of the
           seed alignment and all trusted hits, but not all
           paralogs, have a conserved motif DxHxH near the amino
           end. The member from E. coli was recently shown to have
           DNase activity [Unknown function, Enzymes of unknown
           specificity].
          Length = 252

 Score =  123 bits (311), Expect = 3e-32
 Identities = 73/243 (30%), Positives = 113/243 (46%), Gaps = 47/243 (19%)

Query: 25  KHEPDIDHVLNRAWNAGLEKIIVTGTNVEDSISSLKLAQSDERLYSTVGCHPTRCSEFEN 84
             E D++ V+ RA  AG+  ++  GT++ED + +L+LA+    +Y+ VG HP    +   
Sbjct: 12  DFEEDVEEVIERAKAAGVTAVVAVGTDLEDFLRALELAEKYPNVYAAVGVHPLDVDDDTK 71

Query: 85  DPEGYLQSLDKIIKEGGKKVVAFGEFGLDYDRVQYCPVETQLKYFRKQLDLSVTHKLPLF 144
           +    L+ L         KVVA GE GLDY +        Q + FR QL L+    LP+ 
Sbjct: 72  EDIKELERL-----AAHPKVVAIGETGLDYYKADE-YKRRQEEVFRAQLQLAEELNLPVI 125

Query: 145 LHCRNAKSDFIEIMKEYAPKLPRKGVIHSFDGTP--------------FQAVDSLKTKEN 190
           +H R+A+ D ++I++E  PK+   GV+H F G                   + + K  ++
Sbjct: 126 IHARDAEEDVLDILREEKPKVG--GVLHCFTGDAELAKKLLDLGFYISISGIVTFKNAKS 183

Query: 191 L-ETVKSIPEDRLLLETDCPWCEVKPSHAGFAYIRTQHEKVKKEQWKPDKMVKSRNEPAN 249
           L E V+ IP +RLL+ETD P+    P + G                        RNEPA 
Sbjct: 184 LREVVRKIPLERLLVETDSPYLAPVP-YRG-----------------------KRNEPAF 219

Query: 250 IVY 252
           + Y
Sbjct: 220 VRY 222



 Score = 93.1 bits (232), Expect = 2e-21
 Identities = 50/167 (29%), Positives = 79/167 (47%), Gaps = 41/167 (24%)

Query: 253 FRKQLDLSVTHKLPLFLHCRNAKSDFIEIMKEYAPKLPRKGVIHSFDGTP---------- 302
           FR QL L+    LP+ +H R+A+ D ++I++E  PK+   GV+H F G            
Sbjct: 110 FRAQLQLAEELNLPVIIHARDAEEDVLDILREEKPKVG--GVLHCFTGDAELAKKLLDLG 167

Query: 303 ----FQAVDSLKTKENL-ETVKSIPEDRLLLETDCPWCEVKPSHAGFAYIRTQHEKVKKE 357
                  + + K  ++L E V+ IP +RLL+ETD P+    P + G              
Sbjct: 168 FYISISGIVTFKNAKSLREVVRKIPLERLLVETDSPYLAPVP-YRG-------------- 212

Query: 358 QWKPDKMVKSRNEPANIVQILEIVAAVRGVEREKLGPIIHQNTLRLF 404
                     RNEPA +   +E +A ++G++ E+L  I  +N  RLF
Sbjct: 213 ---------KRNEPAFVRYTVEAIAEIKGMDVEELAQITTKNAKRLF 250



 Score = 34.2 bits (79), Expect = 0.11
 Identities = 17/49 (34%), Positives = 22/49 (44%), Gaps = 5/49 (10%)

Query: 427 ERLYSTVGCHPTRCSEFENDPEGYLQSLDKIIKEGGKKVVAFGEFGLDY 475
             +Y+ VG HP    +   +    L+ L         KVVA GE GLDY
Sbjct: 53  PNVYAAVGVHPLDVDDDTKEDIKELERL-----AAHPKVVAIGETGLDY 96


>gnl|CDD|171118 PRK11449, PRK11449, putative deoxyribonuclease YjjV; Provisional.
          Length = 258

 Score = 83.1 bits (205), Expect = 7e-18
 Identities = 71/237 (29%), Positives = 104/237 (43%), Gaps = 36/237 (15%)

Query: 34  LNRAWNAGLEKIIVTGTNVEDSISSLKLAQSDERLYSTVGCHPTRCSEFENDPEGYLQSL 93
           L RA  AG+ KIIV  T  E+    L LA+  + LY+ +G HP    +  +     L  L
Sbjct: 25  LQRAAQAGVGKIIVPATEAENFARVLALAERYQPLYAALGLHPGMLEKHSDVS---LDQL 81

Query: 94  DKIIKEGGKKVVAFGEFGLDY--DRVQYCPVETQLKYFRKQLDLSVTHKLPLFLHCRNAK 151
            + ++    KVVA GE GLD   D  Q+   E Q     +QL L+  + LP+ LH R   
Sbjct: 82  QQALERRPAKVVAVGEIGLDLFGDDPQF---ERQQWLLDEQLKLAKRYDLPVILHSRRTH 138

Query: 152 SDFIEIMKEYAPKLPRKGVIHSFDGTPFQA---------------VDSLKTKENLETVKS 196
                 +K +   LPR GV+H F G+  QA               +   +  +  + +  
Sbjct: 139 DKLAMHLKRHD--LPRTGVVHGFSGSLQQAERFVQLGYKIGVGGTITYPRASKTRDVIAK 196

Query: 197 IPEDRLLLETDCPWCEVKPSHAGFAYIRTQHEKVKKEQWKP--DKMVKSRNEPANIV 251
           +P   LLLETD P   +     GF     Q +  + EQ     D + + R EPA+ +
Sbjct: 197 LPLASLLLETDAPDMPLN----GF-----QGQPNRPEQAARVFDVLCELRPEPADEI 244



 Score = 46.1 bits (109), Expect = 2e-05
 Identities = 38/164 (23%), Positives = 64/164 (39%), Gaps = 41/164 (25%)

Query: 256 QLDLSVTHKLPLFLHCRNAKSDFIEIMKEYAPKLPRKGVIHSFDGTPFQA---------- 305
           QL L+  + LP+ LH R         +K +   LPR GV+H F G+  QA          
Sbjct: 119 QLKLAKRYDLPVILHSRRTHDKLAMHLKRHD--LPRTGVVHGFSGSLQQAERFVQLGYKI 176

Query: 306 -----VDSLKTKENLETVKSIPEDRLLLETDCPWCEVKPSHAGFAYIRTQHEKVKKEQWK 360
                +   +  +  + +  +P   LLLETD P   +     GF             Q +
Sbjct: 177 GVGGTITYPRASKTRDVIAKLPLASLLLETDAPDMPLN----GF-------------QGQ 219

Query: 361 PDKMVKSRNEPANIVQILEIVAAVRGVEREKLGPIIHQNTLRLF 404
           P       N P    ++ +++  +R    +++  ++  NT  LF
Sbjct: 220 P-------NRPEQAARVFDVLCELRPEPADEIAEVLLNNTYTLF 256



 Score = 30.3 bits (68), Expect = 2.0
 Identities = 24/96 (25%), Positives = 37/96 (38%), Gaps = 14/96 (14%)

Query: 382 AAVRGVEREKLGPIIHQNTLRLFFPHELPTPTRFNTCVSFAPYEIERLYSTVGCHPTRCS 441
           A+++   +  +G II   T    F   L    R+             LY+ +G HP    
Sbjct: 23  ASLQRAAQAGVGKIIVPATEAENFARVLALAERYQP-----------LYAALGLHPGMLE 71

Query: 442 EFENDPEGYLQSLDKIIKEGGKKVVAFGEFGLDYDR 477
           +  +     L  L + ++    KVVA GE GLD   
Sbjct: 72  KHSDVS---LDQLQQALERRPAKVVAVGEIGLDLFG 104


>gnl|CDD|236767 PRK10812, PRK10812, putative DNAse; Provisional.
          Length = 265

 Score = 52.1 bits (125), Expect = 2e-07
 Identities = 58/222 (26%), Positives = 92/222 (41%), Gaps = 43/222 (19%)

Query: 29  DIDHVLNRAWNAGLEKIIVTGTNVEDSISSLKLAQSDERLYSTVGCHPTRCSEFENDPEG 88
           D+D VL +A    ++  +   T +        L    + +  + G HP       N  E 
Sbjct: 21  DVDDVLAKAAARDVKFCLAVATTLPGYRHMRDLVGERDNVVFSCGVHPL------NQDEP 74

Query: 89  Y-LQSLDKIIKEGGKKVVAFGEFGLDYDRVQYCP--VETQLKYFRKQLDLSVTHKLPLFL 145
           Y ++ L ++     + VVA GE GLDY    Y P     Q + FR  + +      P+ +
Sbjct: 75  YDVEELRRL--AAEEGVVAMGETGLDY---YYTPETKVRQQESFRHHIQIGRELNKPVIV 129

Query: 146 HCRNAKSDFIEIM------------------KEYAPKLPRKGVIHSFDG-TPFQAVDSLK 186
           H R+A++D + I+                  +E A KL   G   SF G   F+  + L+
Sbjct: 130 HTRDARADTLAILREEKVTDCGGVLHCFTEDRETAGKLLDLGFYISFSGIVTFRNAEQLR 189

Query: 187 TKENLETVKSIPEDRLLLETDCPWCEVKPSHAGF----AYIR 224
                +  + +P DRLL+ETD P+    P H G     A +R
Sbjct: 190 -----DAARYVPLDRLLVETDSPYLAPVP-HRGKENQPAMVR 225



 Score = 47.8 bits (114), Expect = 5e-06
 Identities = 43/158 (27%), Positives = 67/158 (42%), Gaps = 48/158 (30%)

Query: 266 PLFLHCRNAKSDFIEIM------------------KEYAPKLPRKGVIHSFDG-TPFQAV 306
           P+ +H R+A++D + I+                  +E A KL   G   SF G   F+  
Sbjct: 126 PVIVHTRDARADTLAILREEKVTDCGGVLHCFTEDRETAGKLLDLGFYISFSGIVTFRNA 185

Query: 307 DSLKTKENLETVKSIPEDRLLLETDCPWCEVKPSHAGFAYIRTQHEKVKKEQWKPDKMVK 366
           + L+     +  + +P DRLL+ETD P+    P H G            KE         
Sbjct: 186 EQLR-----DAARYVPLDRLLVETDSPYLAPVP-HRG------------KE--------- 218

Query: 367 SRNEPANIVQILEIVAAVRGVEREKLGPIIHQNTLRLF 404
             N+PA +  + E +A ++GV  E+L  +   N  RLF
Sbjct: 219 --NQPAMVRDVAEYMAVLKGVSVEELAQVTTDNFARLF 254


>gnl|CDD|224024 COG1099, COG1099, Predicted metal-dependent hydrolases with the
           TIM-barrel fold [General function prediction only].
          Length = 254

 Score = 44.7 bits (106), Expect = 4e-05
 Identities = 41/170 (24%), Positives = 70/170 (41%), Gaps = 36/170 (21%)

Query: 67  RLYSTVGCHPTRCSEFENDPEGYLQSLDKIIKEGGKKVVAFGEFGLDYDRVQYCPVETQL 126
           +L   VG HP        + E  L+ L++++      VVA GE GL+         + + 
Sbjct: 65  KLKVAVGVHP---RAIPPELEEVLEELEELLSNED--VVAIGEIGLEEA------TDEEK 113

Query: 127 KYFRKQLDLSVTHKLPLFLHC-----RNAKSDFIEIMKEYAPKLPRKGVIHSFDGTPFQA 181
           + FR+QL+L+    +P+ +H      + A S  ++I+ E   K P   VI   D    + 
Sbjct: 114 EVFREQLELARELDVPVIVHTPRRNKKEATSKILDILIESGLK-PSLVVI---DHVNEET 169

Query: 182 VDS----------------LKTKENLETVKSIPEDRLLLETDCPWCEVKP 215
           VD                 L  +E +E V+    +R++L +D       P
Sbjct: 170 VDEVLDEEFYVGLTVQPGKLTVEEAVEIVREYGAERIILNSDAGSAASDP 219



 Score = 31.2 bits (71), Expect = 0.90
 Identities = 29/173 (16%), Positives = 62/173 (35%), Gaps = 42/173 (24%)

Query: 245 NEPANIVYFRKQLDLSVTHKLPLFLHC--RNAKSDFIEIMKEYAPKLPRKGVIHSFDGTP 302
                   FR+QL+L+    +P+ +H   RN K    +I+           ++      P
Sbjct: 108 ATDEEKEVFREQLELARELDVPVIVHTPRRNKKEATSKILD----------ILIESGLKP 157

Query: 303 FQAV-DSLKTKENLETVKSIPEDRLLLETDCPWC--EVKP---SHAGFAYIRTQHEKVKK 356
              V D +    N ETV        +L+ +  +    V+P   +      I  ++     
Sbjct: 158 SLVVIDHV----NEETVDE------VLDEEF-YVGLTVQPGKLTVEEAVEIVREY----- 201

Query: 357 EQWKPDKMVKSRNEPANIVQILEIVAAV-----RGVEREKLGPIIHQNTLRLF 404
                ++++ + +  +     L +         RGV  E++  ++ +N L  +
Sbjct: 202 ---GAERIILNSDAGSAASDPLAVPRTALEMEERGVGEEEIEKVVRENALSFY 251


>gnl|CDD|133044 cd06422, NTP_transferase_like_1, NTP_transferase_like_1 is a
          member of the nucleotidyl transferase family.  This is
          a subfamily of nucleotidyl transferases. Nucleotidyl
          transferases transfer nucleotides onto phosphosugars.
          The activated sugars are precursors for synthesis of
          lipopolysaccharide, glycolipids and polysaccharides.
          Other subfamilies of nucleotidyl transferases include
          Alpha-D-Glucose-1-Phosphate Cytidylyltransferase,
          Mannose-1-phosphate guanyltransferase, and
          Glucose-1-phosphate thymidylyltransferase.
          Length = 221

 Score = 32.2 bits (74), Expect = 0.42
 Identities = 17/53 (32%), Positives = 24/53 (45%), Gaps = 14/53 (26%)

Query: 26 HEPDIDHVLNRAWNAGLEKIIVTGTNV-----------EDSISSLKLAQSDER 67
           +P IDH L+R   AG+ +I+V   N             DS   L++  SDE 
Sbjct: 29 GKPLIDHALDRLAAAGIRRIVV---NTHHLADQIEAHLGDSRFGLRITISDEP 78


>gnl|CDD|181585 PRK08936, PRK08936, glucose-1-dehydrogenase; Provisional.
          Length = 261

 Score = 31.2 bits (71), Expect = 1.0
 Identities = 22/99 (22%), Positives = 40/99 (40%), Gaps = 27/99 (27%)

Query: 82  FENDPEGYLQSLDKIIKEGGKKVVAFGEFGLDYDRVQYCPVETQLKYFRKQLDLSV---- 137
           + +D E      ++I K GG+ +   G+  ++ D V    ++T +K F   LD+ +    
Sbjct: 38  YRSDEEEANDVAEEIKKAGGEAIAVKGDVTVESDVVNL--IQTAVKEF-GTLDVMINNAG 94

Query: 138 ------THKLPL--------------FLHCRNAKSDFIE 156
                 +H++ L              FL  R A   F+E
Sbjct: 95  IENAVPSHEMSLEDWNKVINTNLTGAFLGSREAIKYFVE 133


>gnl|CDD|184139 PRK13555, PRK13555, azoreductase; Provisional.
          Length = 208

 Score = 30.1 bits (67), Expect = 2.1
 Identities = 18/46 (39%), Positives = 20/46 (43%), Gaps = 9/46 (19%)

Query: 83  ENDPEGYLQSLDKIIKEGGKKVVAFGEFGLDYDRVQYCPVETQLKY 128
            N PEG           GGKKVV  G  G DY   Q  P+E  + Y
Sbjct: 126 ANGPEGLA---------GGKKVVVLGARGSDYSSEQMAPMEMAVNY 162


>gnl|CDD|226332 COG3811, COG3811, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 85

 Score = 27.8 bits (62), Expect = 3.4
 Identities = 12/38 (31%), Positives = 21/38 (55%), Gaps = 5/38 (13%)

Query: 164 KLPRKGVIHSFDGTPFQAVDSLKTKENLETVKSIPEDR 201
           KL RK +I S  G P++      T+  L+ V++  ++R
Sbjct: 53  KLKRKKLIKSSQGGPYRI-----TRRGLKNVRAQLDNR 85



 Score = 27.8 bits (62), Expect = 3.4
 Identities = 12/38 (31%), Positives = 21/38 (55%), Gaps = 5/38 (13%)

Query: 288 KLPRKGVIHSFDGTPFQAVDSLKTKENLETVKSIPEDR 325
           KL RK +I S  G P++      T+  L+ V++  ++R
Sbjct: 53  KLKRKKLIKSSQGGPYRI-----TRRGLKNVRAQLDNR 85


>gnl|CDD|224028 COG1103, COG1103, Archaea-specific pyridoxal phosphate-dependent
           enzymes [General function prediction only].
          Length = 382

 Score = 29.4 bits (66), Expect = 4.1
 Identities = 8/25 (32%), Positives = 15/25 (60%)

Query: 81  EFENDPEGYLQSLDKIIKEGGKKVV 105
           E++  PEGY + ++++  EGG    
Sbjct: 134 EYKITPEGYAEVIEEVKDEGGDPPA 158



 Score = 29.4 bits (66), Expect = 4.1
 Identities = 8/25 (32%), Positives = 15/25 (60%)

Query: 442 EFENDPEGYLQSLDKIIKEGGKKVV 466
           E++  PEGY + ++++  EGG    
Sbjct: 134 EYKITPEGYAEVIEEVKDEGGDPPA 158


>gnl|CDD|219573 pfam07781, Reovirus_Mu2, Reovirus minor core protein Mu-2.  This
           family represents the Reovirus core protein Mu-2. Mu-2
           is a microtubule associated protein and is thought to
           play a key role in the formation and structural
           organisation of reovirus inclusion bodies.
          Length = 735

 Score = 29.4 bits (66), Expect = 4.9
 Identities = 31/167 (18%), Positives = 47/167 (28%), Gaps = 19/167 (11%)

Query: 300 GTPFQAVDSLKTKENLETVKSIPEDRLLLETDCPWCEVKPSHAGFAYIRTQHE---KVKK 356
           GT FQ + S   K    +  S P   LL +   PW E  P  A         E    +  
Sbjct: 562 GTLFQVLLSYCYKMQATSTASKPAGLLLKQLFGPWLEAAPLLAPLTPNSLSAELAWYIPA 621

Query: 357 EQWKPD----------KMVKSRNEPANIVQILEIV----AAVRGVERE--KLGPIIHQNT 400
           +  + +               R  P  I  +         A   V+R   ++G  I    
Sbjct: 622 KVLRANGWCICEDRHVTFSFIRGPPLVISVLRLYDWSRFRATIIVDRTLVEVGADIRAVA 681

Query: 401 LRLFFPHELPTPTRFNTCVSFAPYEIERLYSTVGCHPTRCSEFENDP 447
           + + +    P     +    F       L     C   R    +N P
Sbjct: 682 VSVIWTEYAPPVPLISNPALFTLPATYHLSLNCSCTSGRGWAAKNMP 728


>gnl|CDD|236522 PRK09450, cyaA, adenylate cyclase; Provisional.
          Length = 830

 Score = 29.1 bits (66), Expect = 5.7
 Identities = 14/41 (34%), Positives = 15/41 (36%), Gaps = 5/41 (12%)

Query: 401 LRLFFPHELPTPTRFNTCVSFAPYEIERL--YSTVGCHPTR 439
           LR  FP  LP PT         P EI  L     +   PT 
Sbjct: 529 LRQHFPLRLPAPTPQALL---QPCEIRHLLIIVNLEVDPTA 566


>gnl|CDD|132929 cd07018, S49_SppA_67K_type, Signal peptide peptidase A (SppA) 67K
           type, a serine protease, has catalytic Ser-Lys dyad.
           Signal peptide peptidase A (SppA; Peptidase S49;
           Protease IV) 67K type: SppA is found in all three
           domains of life and is involved in the cleavage of
           signal peptides after their removal from the precursor
           proteins by signal peptidases. Members in this subfamily
           contain an amino-terminal domain in addition to the
           carboxyl-terminal protease domain that is conserved in
           all the S49 family members (sometimes referred to as 67K
           type), similar to E. coli and Arabidopsis thaliana SppA
           peptidases. Unlike the eukaryotic functional homologs
           that are proposed to be aspartic proteases,
           site-directed mutagenesis and sequence analysis have
           shown that members in this subfamily, mostly bacterial,
           are serine proteases. The predicted active site serine
           for members in this family occurs in a transmembrane
           domain. Mutagenesis studies also suggest that the
           catalytic center comprises a Ser-Lys dyad (both residues
           absolutely conserved within bacteria, chloroplast and
           mitochondrial signal peptidase family members) and not
           the usual Ser-His-Asp catalytic triad found in the
           majority of serine proteases. Interestingly, the single
           membrane spanning E. coli SppA carries out catalysis
           using a Ser-Lys dyad with the serine located in the
           conserved carboxy-terminal protease domain and the
           lysine in the non-conserved amino-terminal domain.
          Length = 222

 Score = 28.3 bits (64), Expect = 8.3
 Identities = 14/80 (17%), Positives = 31/80 (38%), Gaps = 23/80 (28%)

Query: 49  GTNVEDSISSLKLAQSDER---LY---STVGCHPTRCSEFENDPEGYLQSLDKII---KE 99
             ++ D + +L+ A  D+R   +      +     +           L+ L + +   + 
Sbjct: 28  ELSLRDLLEALEKAAEDDRIKGIVLDLDGLSGGLAK-----------LEELRQALERFRA 76

Query: 100 GGKKVVAFGEFGLDYDRVQY 119
            GK V+A+ +    Y + QY
Sbjct: 77  SGKPVIAYADG---YSQGQY 93


>gnl|CDD|173651 cd05095, PTKc_DDR2, Catalytic domain of the Protein Tyrosine
           Kinase, Discoidin Domain Receptor 2.  Protein Tyrosine
           Kinase (PTK) family; mammalian Discoidin Domain Receptor
           2 (DDR2) and homologs; catalytic (c) domain. The PTKc
           family is part of a larger superfamily that includes the
           catalytic domains of other kinases such as protein
           serine/threonine kinases, RIO kinases, and
           phosphoinositide 3-kinase (PI3K). PTKs catalyze the
           transfer of the gamma-phosphoryl group from ATP to
           tyrosine (tyr) residues in protein substrates. DDR2 is a
           member of the DDR subfamily, which are receptor tyr
           kinases (RTKs) containing an extracellular discoidin
           homology domain, a transmembrane segment, an extended
           juxtamembrane region, and an intracellular catalytic
           domain. The binding of the ligand, collagen, to DDRs
           results in a slow but sustained receptor activation.
           DDR2 binds mostly to fibrillar collagens. More recently,
           it has been reported to also bind collagen X. DDR2 is
           widely expressed in many tissues with the highest levels
           found in skeletal muscle, skin, kidney and lung. It is
           important in cell proliferation and development. Mice,
           with a deletion of DDR2, suffer from dwarfism and
           delayed healing of epidermal wounds. DDR2 also
           contributes to collagen (type I) regulation by
           inhibiting fibrillogenesis and altering the morphology
           of collagen fibers. It is also expressed in immature
           dendritic cells (DCs), where it plays a role in DC
           activation and function.
          Length = 296

 Score = 28.3 bits (63), Expect = 8.3
 Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 15/62 (24%)

Query: 108 GEFGLDYDRVQYCPVETQLKYFRKQLDLSVTHKLPLFLHC--------RNAKSDF---IE 156
           G+FG     V  C  E   K+  K   L V+   P+ +          +NA++DF   I+
Sbjct: 16  GQFG----EVHLCEAEGMEKFMDKDFSLDVSGNQPVLVAVKMLREDANKNARNDFLKEIK 71

Query: 157 IM 158
           IM
Sbjct: 72  IM 73


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.136    0.414 

Gapped
Lambda     K      H
   0.267   0.0764    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,711,750
Number of extensions: 2376037
Number of successful extensions: 2333
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2286
Number of HSP's successfully gapped: 50
Length of query: 487
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 386
Effective length of database: 6,457,848
Effective search space: 2492729328
Effective search space used: 2492729328
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.2 bits)