RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3808
         (330 letters)



>gnl|CDD|223540 COG0464, SpoVK, ATPases of the AAA+ class [Posttranslational
           modification, protein turnover, chaperones].
          Length = 494

 Score =  204 bits (521), Expect = 6e-62
 Identities = 124/351 (35%), Positives = 173/351 (49%), Gaps = 36/351 (10%)

Query: 1   MTTFDKGINIAIKATEADKNKNYEEALKLYTH-------SVQYFLHALKYEITSAEGKQS 53
           M    +G  I I AT        + A +           ++      L+           
Sbjct: 111 MDGLKRGQVIVIGATNRPDGL--DPAKRRPGRFDREIEVNLPDEAGRLEILQIHTRLMFL 168

Query: 54  IREKCDEYLARAE--KLKEYLKKGKKQPVKDGESRTKDDKKESDEDDSEDPDKKKMQANL 111
                 + LA     K    L    K+       R  D   E     +ED D ++    +
Sbjct: 169 GPPGTGKTLAARTVGKSGADLGALAKEAALRELRRAIDLVGE-YIGVTED-DFEEALKKV 226

Query: 112 EGA--IVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV-PWKGILLFGPPG 168
             +  ++ E  +V   D+ GLE AKE LKEA+  P+K P+LF    + P KG+LL+GPPG
Sbjct: 227 LPSRGVLFEDEDVTLDDIGGLEEAKEELKEAIETPLKRPELFRKLGLRPPKGVLLYGPPG 286

Query: 169 TGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDE-- 226
           TGK+ LAKAVA E+  S F SV  S+L+SKW+GESEK ++ LFE AR   PSIIFIDE  
Sbjct: 287 TGKTLLAKAVALESR-SRFISVKGSELLSKWVGESEKNIRELFEKARKLAPSIIFIDEID 345

Query: 227 ------------RIDRGISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAIRR--RFEK 272
                          R +   L  L        +G+LV+ ATN P  LD A+ R  RF++
Sbjct: 346 SLASGRGPSEDGSGRRVVGQLLTEL--DGIEKAEGVLVIAATNRPDDLDPALLRPGRFDR 403

Query: 273 RIYIALPELNARLFMFRHHLGNTRNTIS-DENLKELAKRTEGYSGADISTL 322
            IY+ LP+L  RL +F+ HL + +  ++ D +L+ELA+ TEGYSGADI+ L
Sbjct: 404 LIYVPLPDLEERLEIFKIHLRDKKPPLAEDVDLEELAEITEGYSGADIAAL 454



 Score = 81.8 bits (202), Expect = 3e-17
 Identities = 70/179 (39%), Positives = 97/179 (54%), Gaps = 16/179 (8%)

Query: 159 KGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHR 218
           KG+LL GPPGTGK+ LA+A+A E     F S++  +++SK++GESE  ++ LFE A    
Sbjct: 19  KGVLLHGPPGTGKTLLARALANEGAE--FLSINGPEILSKYVGESELRLRELFEEAEKLA 76

Query: 219 PSIIFIDERI----DRGISNALFLL--LSGVGNDMDG-----ILVLGATNIPWVLDAAIR 267
           PSIIFIDE       R           ++ +   MDG     ++V+GATN P  LD A R
Sbjct: 77  PSIIFIDEIDALAPKRSSDQGEVERRVVAQLLALMDGLKRGQVIVIGATNRPDGLDPAKR 136

Query: 268 R--RFEKRIYIALPELNARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTLER 324
           R  RF++ I + LP+   RL + + H             K LA RT G SGAD+  L +
Sbjct: 137 RPGRFDREIEVNLPDEAGRLEILQIHT-RLMFLGPPGTGKTLAARTVGKSGADLGALAK 194


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score =  192 bits (490), Expect = 2e-58
 Identities = 108/250 (43%), Positives = 144/250 (57%), Gaps = 43/250 (17%)

Query: 97  DDSEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV 156
              +DP  + M+       V+E PNV + D+ GLE     ++EAV LP+K P+LF  + V
Sbjct: 110 PSEKDPRVQAME-------VIESPNVTYEDIGGLEEQIREVREAVELPLKKPELF--EEV 160

Query: 157 ---PWKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFEL 213
              P KG+LL+GPPGTGK+ LAKAVA E N +TF  V  S+LV K++GE  +LV+ LFEL
Sbjct: 161 GIEPPKGVLLYGPPGTGKTLLAKAVAHETN-ATFIRVVGSELVQKFIGEGARLVRELFEL 219

Query: 214 ARAHRPSIIFIDE-------RIDRGIS------NALFLLLSGVGNDMDG------ILVLG 254
           AR   PSIIFIDE       R D G S        L  LL+    +MDG      + ++ 
Sbjct: 220 AREKAPSIIFIDEIDAIAAKRTDSGTSGDREVQRTLMQLLA----EMDGFDPRGNVKIIA 275

Query: 255 ATNIPWVLDAAIRR--RFEKRIYIALPELNARLFMFRHHLGNTR--NTISDENLKELAKR 310
           ATN   +LD AI R  RF++ I + LP+   RL + + H   TR  N   D +L+ELA+ 
Sbjct: 276 ATNRIDILDPAILRPGRFDRIIEVPLPDEEGRLEILKIH---TRKMNLADDVDLEELAEL 332

Query: 311 TEGYSGADIS 320
           TEG SGAD+ 
Sbjct: 333 TEGASGADLK 342


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score =  178 bits (453), Expect = 1e-52
 Identities = 100/249 (40%), Positives = 141/249 (56%), Gaps = 43/249 (17%)

Query: 97  DDSEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV 156
               DP    M+       V EKP+V + D+ GL+   + ++E V LP+K P+LF  + +
Sbjct: 130 PPEVDPRVSVME-------VEEKPDVTYEDIGGLDEQIQEIREVVELPLKNPELF--EEL 180

Query: 157 ---PWKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFEL 213
              P KG+LL+GPPGTGK+ LAKAVA + + +TF  V  S+LV K++GE  +LV+ LFEL
Sbjct: 181 GIDPPKGVLLYGPPGTGKTLLAKAVANQTD-ATFIRVVGSELVQKYIGEGARLVRELFEL 239

Query: 214 ARAHRPSIIFIDE-------RI------DRGISNALFLLLSGVGNDMDG------ILVLG 254
           AR   PSIIFIDE       R       DR +   +  LL    N +DG      + V+ 
Sbjct: 240 AREKAPSIIFIDEIDAIGAKRFDSGTSGDREVQRTMLELL----NQLDGFDPRGNVKVIM 295

Query: 255 ATNIPWVLDAAIRR--RFEKRIYIALPELNARLFMFRHHLGNTR--NTISDENLKELAKR 310
           ATN P +LD A+ R  RF+++I   LP+   R  + + H   TR  N   D +L+ LA+ 
Sbjct: 296 ATNRPDILDPALLRPGRFDRKIEFPLPDEEGRAEILKIH---TRKMNLADDVDLELLARL 352

Query: 311 TEGYSGADI 319
           TEG+SGAD+
Sbjct: 353 TEGFSGADL 361


>gnl|CDD|233328 TIGR01243, CDC48, AAA family ATPase, CDC48 subfamily.  This
           subfamily of the AAA family ATPases includes two members
           each from three archaeal species. It also includes yeast
           CDC48 (cell division control protein 48) and the human
           ortholog, transitional endoplasmic reticulum ATPase
           (valosin-containing protein). These proteins in
           eukaryotes are involved in the budding and transfer of
           membrane from the transitional endoplasmic reticulum to
           the Golgi apparatus.
          Length = 733

 Score =  178 bits (452), Expect = 2e-50
 Identities = 103/257 (40%), Positives = 152/257 (59%), Gaps = 33/257 (12%)

Query: 87  TKDDKKESDEDDSEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIK 146
           T  D  E+          K ++ +    +++E PNV+W+D+ GLE  K+ L+EAV  P+K
Sbjct: 424 TMKDFMEA---------LKMVEPSAIREVLVEVPNVRWSDIGGLEEVKQELREAVEWPLK 474

Query: 147 FPQLFT--GKRVPWKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESE 204
            P++F   G R P KG+LLFGPPGTGK+ LAKAVATE+  + F +V   +++SKW+GESE
Sbjct: 475 HPEIFEKMGIRPP-KGVLLFGPPGTGKTLLAKAVATES-GANFIAVRGPEILSKWVGESE 532

Query: 205 KLVKTLFELARAHRPSIIFIDERID-----RG----------ISNALFLLLSGVGNDMDG 249
           K ++ +F  AR   P+IIF DE ID     RG          I N L   + G+  ++  
Sbjct: 533 KAIREIFRKARQAAPAIIFFDE-IDAIAPARGARFDTSVTDRIVNQLLTEMDGI-QELSN 590

Query: 250 ILVLGATNIPWVLDAAIRR--RFEKRIYIALPELNARLFMFRHHLGNTRNTISDENLKEL 307
           ++V+ ATN P +LD A+ R  RF++ I +  P+  AR  +F+ H   +     D +L+EL
Sbjct: 591 VVVIAATNRPDILDPALLRPGRFDRLILVPPPDEEARKEIFKIHT-RSMPLAEDVDLEEL 649

Query: 308 AKRTEGYSGADISTLER 324
           A+ TEGY+GADI  + R
Sbjct: 650 AEMTEGYTGADIEAVCR 666



 Score =  149 bits (378), Expect = 3e-40
 Identities = 87/221 (39%), Positives = 134/221 (60%), Gaps = 19/221 (8%)

Query: 120 PNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV-PWKGILLFGPPGTGKSYLAKAV 178
           P V + D+ GL+ AKE ++E V LP+K P+LF    + P KG+LL+GPPGTGK+ LAKAV
Sbjct: 173 PKVTYEDIGGLKEAKEKIREMVELPMKHPELFEHLGIEPPKGVLLYGPPGTGKTLLAKAV 232

Query: 179 ATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDE------------ 226
           A EA  + F S++  +++SK+ GESE+ ++ +F+ A  + PSIIFIDE            
Sbjct: 233 ANEA-GAYFISINGPEIMSKYYGESEERLREIFKEAEENAPSIIFIDEIDAIAPKREEVT 291

Query: 227 -RIDRGISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAIRR--RFEKRIYIALPELNA 283
             +++ +   L  L+ G+      ++V+GATN P  LD A+RR  RF++ I I +P+  A
Sbjct: 292 GEVEKRVVAQLLTLMDGL-KGRGRVIVIGATNRPDALDPALRRPGRFDREIVIRVPDKRA 350

Query: 284 RLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTLER 324
           R  + + H  N      D +L +LA+ T G+ GAD++ L +
Sbjct: 351 RKEILKVHTRN-MPLAEDVDLDKLAEVTHGFVGADLAALAK 390


>gnl|CDD|130309 TIGR01242, 26Sp45, 26S proteasome subunit P45 family.  Many
           proteins may score above the trusted cutoff because an
           internal.
          Length = 364

 Score =  168 bits (427), Expect = 2e-49
 Identities = 104/245 (42%), Positives = 143/245 (58%), Gaps = 39/245 (15%)

Query: 99  SEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV-P 157
           S+DP  K M+       V E+PNV + D+ GLE     ++EAV LP+K P+LF    + P
Sbjct: 103 SKDPLVKGME-------VEERPNVSYEDIGGLEEQIREIREAVELPLKHPELFEEVGIEP 155

Query: 158 WKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAH 217
            KG+LL+GPPGTGK+ LAKAVA E N +TF  V  S+LV K++GE  +LV+ +FELA+  
Sbjct: 156 PKGVLLYGPPGTGKTLLAKAVAHETN-ATFIRVVGSELVRKYIGEGARLVREIFELAKEK 214

Query: 218 RPSIIFIDE-------RI------DRGISNALFLLLSGVGNDMDG------ILVLGATNI 258
            PSIIFIDE       R       DR +   L  LL+    ++DG      + V+ ATN 
Sbjct: 215 APSIIFIDEIDAIAAKRTDSGTSGDREVQRTLMQLLA----ELDGFDPRGNVKVIAATNR 270

Query: 259 PWVLDAAIRR--RFEKRIYIALPELNARLFMFRHHLGNTR--NTISDENLKELAKRTEGY 314
           P +LD A+ R  RF++ I + LP+   RL + + H   TR      D +L+ +AK TEG 
Sbjct: 271 PDILDPALLRPGRFDRIIEVPLPDFEGRLEILKIH---TRKMKLAEDVDLEAIAKMTEGA 327

Query: 315 SGADI 319
           SGAD+
Sbjct: 328 SGADL 332


>gnl|CDD|233327 TIGR01241, FtsH_fam, ATP-dependent metalloprotease FtsH.
           HflB(FtsH) is a pleiotropic protein required for correct
           cell division in bacteria. It has ATP-dependent zinc
           metalloprotease activity. It was formerly designated
           cell division protein FtsH [Cellular processes, Cell
           division, Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 495

 Score =  167 bits (426), Expect = 4e-48
 Identities = 97/226 (42%), Positives = 133/226 (58%), Gaps = 27/226 (11%)

Query: 118 EKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFT--GKRVPWKGILLFGPPGTGKSYLA 175
           EKP V + DVAG++ AKE L E V   +K P  FT  G ++P KG+LL GPPGTGK+ LA
Sbjct: 48  EKPKVTFKDVAGIDEAKEELMEIVDF-LKNPSKFTKLGAKIP-KGVLLVGPPGTGKTLLA 105

Query: 176 KAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDERID-----R 230
           KAVA EA    FFS+S SD V  ++G     V+ LFE A+ + P IIFIDE ID     R
Sbjct: 106 KAVAGEAGVP-FFSISGSDFVEMFVGVGASRVRDLFEQAKKNAPCIIFIDE-IDAVGRQR 163

Query: 231 GIS------------NALFLLLSGVGNDMDGILVLGATNIPWVLDAAIRR--RFEKRIYI 276
           G              N L + + G G +  G++V+ ATN P VLD A+ R  RF++++ +
Sbjct: 164 GAGLGGGNDEREQTLNQLLVEMDGFGTN-TGVIVIAATNRPDVLDPALLRPGRFDRQVVV 222

Query: 277 ALPELNARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTL 322
            LP++  R  + + H  N      D +LK +A+RT G+SGAD++ L
Sbjct: 223 DLPDIKGREEILKVHAKNK-KLAPDVDLKAVARRTPGFSGADLANL 267


>gnl|CDD|215649 pfam00004, AAA, ATPase family associated with various cellular
           activities (AAA).  AAA family proteins often perform
           chaperone-like functions that assist in the assembly,
           operation, or disassembly of protein complexes.
          Length = 131

 Score =  139 bits (353), Expect = 4e-41
 Identities = 57/131 (43%), Positives = 78/131 (59%), Gaps = 17/131 (12%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPS 220
           +LL+GPPGTGK+ LAKAVA E   + F  +S S+LVSK++GESEK ++ LFE A+   P 
Sbjct: 1   LLLYGPPGTGKTTLAKAVAKELG-APFIEISGSELVSKYVGESEKRLRELFEAAKKLAPC 59

Query: 221 IIFIDERID--------------RGISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAI 266
           +IFIDE ID              R + N L   L G  + +  ++V+ ATN P  LD A+
Sbjct: 60  VIFIDE-IDALAGSRGSGGDSESRRVVNQLLTELDGFTSSLSKVIVIAATNRPDKLDPAL 118

Query: 267 RR-RFEKRIYI 276
            R RF++ I  
Sbjct: 119 LRGRFDRIIEF 129


>gnl|CDD|240423 PTZ00454, PTZ00454, 26S protease regulatory subunit 6B-like
           protein; Provisional.
          Length = 398

 Score =  138 bits (349), Expect = 1e-37
 Identities = 85/227 (37%), Positives = 128/227 (56%), Gaps = 28/227 (12%)

Query: 118 EKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV-PWKGILLFGPPGTGKSYLAK 176
           EKP+V ++D+ GL+  K+ ++EAV LP+  P+L+    + P +G+LL+GPPGTGK+ LAK
Sbjct: 138 EKPDVTYSDIGGLDIQKQEIREAVELPLTCPELYEQIGIDPPRGVLLYGPPGTGKTMLAK 197

Query: 177 AVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDE-------RI- 228
           AVA     +TF  V  S+ V K+LGE  ++V+ +F LAR + PSIIFIDE       R  
Sbjct: 198 AVAHHTT-ATFIRVVGSEFVQKYLGEGPRMVRDVFRLARENAPSIIFIDEVDSIATKRFD 256

Query: 229 -----DRGISNALFLLLSGVGNDMDG------ILVLGATNIPWVLDAAIRR--RFEKRIY 275
                DR +   L  LL    N MDG      + V+ ATN    LD A+ R  R +++I 
Sbjct: 257 AQTGADREVQRILLELL----NQMDGFDQTTNVKVIMATNRADTLDPALLRPGRLDRKIE 312

Query: 276 IALPELNARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTL 322
             LP+   +  +F+  + +  N   + +L++   R E  S ADI+ +
Sbjct: 313 FPLPDRRQKRLIFQ-TITSKMNLSEEVDLEDFVSRPEKISAADIAAI 358


>gnl|CDD|223541 COG0465, HflB, ATP-dependent Zn proteases [Posttranslational
           modification, protein turnover, chaperones].
          Length = 596

 Score =  138 bits (349), Expect = 1e-36
 Identities = 93/232 (40%), Positives = 132/232 (56%), Gaps = 39/232 (16%)

Query: 118 EKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFT--GKRVPWKGILLFGPPGTGKSYLA 175
           ++  V + DVAG++ AKE L E ++  +K P+ +   G ++P KG+LL GPPGTGK+ LA
Sbjct: 143 DQVKVTFADVAGVDEAKEELSE-LVDFLKNPKKYQALGAKIP-KGVLLVGPPGTGKTLLA 200

Query: 176 KAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDERID-----R 230
           KAVA EA    FFS+S SD V  ++G     V+ LFE A+ + P IIFIDE ID     R
Sbjct: 201 KAVAGEAGVP-FFSISGSDFVEMFVGVGASRVRDLFEQAKKNAPCIIFIDE-IDAVGRQR 258

Query: 231 GISNALFLLLSGVGND------------MDG------ILVLGATNIPWVLDAAIRR--RF 270
           G          G GND            MDG      ++V+ ATN P VLD A+ R  RF
Sbjct: 259 GAG-------LGGGNDEREQTLNQLLVEMDGFGGNEGVIVIAATNRPDVLDPALLRPGRF 311

Query: 271 EKRIYIALPELNARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTL 322
           +++I + LP++  R  + + H  N      D +LK++A+ T G+SGAD++ L
Sbjct: 312 DRQILVELPDIKGREQILKVHAKNKPLA-EDVDLKKIARGTPGFSGADLANL 362


>gnl|CDD|214386 CHL00176, ftsH, cell division protein; Validated.
          Length = 638

 Score =  138 bits (349), Expect = 2e-36
 Identities = 90/226 (39%), Positives = 132/226 (58%), Gaps = 43/226 (19%)

Query: 126 DVAGLEAAKEALKEAVILPIKFPQLFT--GKRVPWKGILLFGPPGTGKSYLAKAVATEAN 183
           D+AG+E AKE  +E V   +K P+ FT  G ++P KG+LL GPPGTGK+ LAKA+A EA 
Sbjct: 184 DIAGIEEAKEEFEEVVSF-LKKPERFTAVGAKIP-KGVLLVGPPGTGKTLLAKAIAGEA- 240

Query: 184 NSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDERID-----RGISNALFL 238
              FFS+S S+ V  ++G     V+ LF+ A+ + P I+FIDE ID     RG       
Sbjct: 241 EVPFFSISGSEFVEMFVGVGAARVRDLFKKAKENSPCIVFIDE-IDAVGRQRG------- 292

Query: 239 LLSGVG--ND------------MDG------ILVLGATNIPWVLDAAIRR--RFEKRIYI 276
             +G+G  ND            MDG      ++V+ ATN   +LDAA+ R  RF+++I +
Sbjct: 293 --AGIGGGNDEREQTLNQLLTEMDGFKGNKGVIVIAATNRVDILDAALLRPGRFDRQITV 350

Query: 277 ALPELNARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTL 322
           +LP+   RL + + H  N +    D +L+ +A+RT G+SGAD++ L
Sbjct: 351 SLPDREGRLDILKVHARN-KKLSPDVSLELIARRTPGFSGADLANL 395


>gnl|CDD|224144 COG1223, COG1223, Predicted ATPase (AAA+ superfamily) [General
           function prediction only].
          Length = 368

 Score =  132 bits (334), Expect = 8e-36
 Identities = 75/218 (34%), Positives = 115/218 (52%), Gaps = 22/218 (10%)

Query: 117 MEKPNVKWTDVAGLEAAKEALKEAVILP-IKFPQLFTGKRVPWKGILLFGPPGTGKSYLA 175
               ++   DV G E AK   +  +I+  ++ P+ F G   P K +L +GPPGTGK+ +A
Sbjct: 113 EIISDITLDDVIGQEEAKRKCR--LIMEYLENPERF-GDWAP-KNVLFYGPPGTGKTMMA 168

Query: 176 KAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDE----RIDRG 231
           KA+A EA       V +++L+ + +G+  + +  L+E AR   P I+FIDE     +DR 
Sbjct: 169 KALANEAK-VPLLLVKATELIGEHVGDGARRIHELYERARKAAPCIVFIDELDAIALDRR 227

Query: 232 ----------ISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAIRRRFEKRIYIALPEL 281
                     I NAL   L G+  + +G++ + ATN P +LD AIR RFE+ I   LP  
Sbjct: 228 YQELRGDVSEIVNALLTELDGI-KENEGVVTIAATNRPELLDPAIRSRFEEEIEFKLPND 286

Query: 282 NARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADI 319
             RL +  ++         D +L+ LA +T+G SG DI
Sbjct: 287 EERLEILEYYA-KKFPLPVDADLRYLAAKTKGMSGRDI 323


>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
           protein; Provisional.
          Length = 438

 Score =  121 bits (306), Expect = 2e-31
 Identities = 87/247 (35%), Positives = 132/247 (53%), Gaps = 39/247 (15%)

Query: 97  DDSEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLF--TGK 154
            D  DP    M+       V + P   + D+ GLE   + +KEAV LP+  P+L+   G 
Sbjct: 162 LDEVDPLVSVMK-------VDKAPLESYADIGGLEQQIQEIKEAVELPLTHPELYDDIGI 214

Query: 155 RVPWKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELA 214
           + P KG++L+GPPGTGK+ LAKAVA E + +TF  V  S+L+ K+LG+  KLV+ LF +A
Sbjct: 215 KPP-KGVILYGPPGTGKTLLAKAVANETS-ATFLRVVGSELIQKYLGDGPKLVRELFRVA 272

Query: 215 RAHRPSIIFIDE-------RID------RGISNALFLLLSGVGNDMDG------ILVLGA 255
             + PSI+FIDE       R D      + I   +  LL    N +DG      + V+ A
Sbjct: 273 EENAPSIVFIDEIDAIGTKRYDATSGGEKEIQRTMLELL----NQLDGFDSRGDVKVIMA 328

Query: 256 TNIPWVLDAAIRR--RFEKRIYIALPELNARLFMFRHHLGNTRNTISDE-NLKELAKRTE 312
           TN    LD A+ R  R +++I    P+   +  +F  H   ++ T++++ +L+E     +
Sbjct: 329 TNRIESLDPALIRPGRIDRKIEFPNPDEKTKRRIFEIH--TSKMTLAEDVDLEEFIMAKD 386

Query: 313 GYSGADI 319
             SGADI
Sbjct: 387 ELSGADI 393


>gnl|CDD|182683 PRK10733, hflB, ATP-dependent metalloprotease; Reviewed.
          Length = 644

 Score =  115 bits (289), Expect = 2e-28
 Identities = 84/222 (37%), Positives = 125/222 (56%), Gaps = 31/222 (13%)

Query: 124 WTDVAGLEAAKEALKEAVIL---PIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVAT 180
           + DVAG + AKE + E V     P +F +L  G ++P KG+L+ GPPGTGK+ LAKA+A 
Sbjct: 151 FADVAGCDEAKEEVAELVEYLREPSRFQKL--GGKIP-KGVLMVGPPGTGKTLLAKAIAG 207

Query: 181 EANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDERID-----RGIS-- 233
           EA    FF++S SD V  ++G     V+ +FE A+   P IIFIDE ID     RG    
Sbjct: 208 EA-KVPFFTISGSDFVEMFVGVGASRVRDMFEQAKKAAPCIIFIDE-IDAVGRQRGAGLG 265

Query: 234 ----------NALFLLLSGV-GNDMDGILVLGATNIPWVLDAAIRR--RFEKRIYIALPE 280
                     N + + + G  GN  +GI+V+ ATN P VLD A+ R  RF++++ + LP+
Sbjct: 266 GGHDEREQTLNQMLVEMDGFEGN--EGIIVIAATNRPDVLDPALLRPGRFDRQVVVGLPD 323

Query: 281 LNARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGADISTL 322
           +  R  + + H+        D +   +A+ T G+SGAD++ L
Sbjct: 324 VRGREQILKVHMRRV-PLAPDIDAAIIARGTPGFSGADLANL 364


>gnl|CDD|239141 cd02678, MIT_VPS4, MIT: domain contained within Microtubule
          Interacting and Trafficking molecules. This sub-family
          of MIT domains is found in intracellular protein
          transport proteins of the AAA-ATPase family. The
          molecular function of the MIT domain is unclear.
          Length = 75

 Score =  101 bits (255), Expect = 2e-27
 Identities = 43/73 (58%), Positives = 53/73 (72%), Gaps = 1/73 (1%)

Query: 5  DKGINIAIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLAR 64
           K I +  KA E D   NYEEAL+LY H+++YF+HALKYE  + + K+SIR KC EYL R
Sbjct: 4  QKAIELVKKAIEEDNAGNYEEALRLYQHALEYFMHALKYE-KNPKSKESIRAKCTEYLDR 62

Query: 65 AEKLKEYLKKGKK 77
          AEKLKEYL K +K
Sbjct: 63 AEKLKEYLAKKEK 75


>gnl|CDD|200312 TIGR03689, pup_AAA, proteasome ATPase.  In the Actinobacteria, as
           shown for Mycobacterium tuberculosis, some proteins are
           modified by ligation between an epsilon-amino group of a
           lysine side chain and the C-terminal carboxylate of the
           ubiquitin-like protein Pup. This modification leads to
           protein degradation by the archaeal-like proteasome
           found in the Actinobacteria. Members of this protein
           family belong to the AAA family of ATPases and tend to
           be clustered with the genes for Pup, the Pup ligase
           PafA, and structural components of the proteasome. This
           protein forms hexameric rings with ATPase activity
           [Protein fate, Degradation of proteins, peptides, and
           glycopeptides].
          Length = 512

 Score =  105 bits (264), Expect = 3e-25
 Identities = 67/213 (31%), Positives = 112/213 (52%), Gaps = 41/213 (19%)

Query: 115 IVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRV-PWKGILLFGPPGTGKSY 173
           ++ E P+V + D+ GL +  E +++AV LP   P+L+    + P KG+LL+GPPG GK+ 
Sbjct: 172 VLEEVPDVTYADIGGLGSQIEQIRDAVELPFLHPELYREYGLKPPKGVLLYGPPGCGKTL 231

Query: 174 LAKAVATE---------ANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAH----RPS 220
           +AKAVA              S F ++   +L++K++GE+E+ ++ +F+ AR      RP 
Sbjct: 232 IAKAVANSLAARIGAEGGGKSYFLNIKGPELLNKYVGETERQIRLIFQRAREKASEGRPV 291

Query: 221 IIFIDERIDRGISNALFLLL-SGVGNDM------------------DGILVLGATNIPWV 261
           I+F DE       ++LF    SGV +D+                  D ++V+GA+N   +
Sbjct: 292 IVFFDEM------DSLFRTRGSGVSSDVETTVVPQLLAEIDGVESLDNVIVIGASNREDM 345

Query: 262 LDAAIRR--RFEKRIYIALPELNARLFMFRHHL 292
           +D AI R  R + +I I  P+  A   +F  +L
Sbjct: 346 IDPAILRPGRLDVKIRIERPDAEAAADIFAKYL 378


>gnl|CDD|99707 cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of
           cellular Activities) superfamily represents an ancient
           group of ATPases belonging to the ASCE (for additional
           strand, catalytic E) division of the P-loop NTPase fold.
           The ASCE division also includes ABC, RecA-like,
           VirD4-like, PilT-like, and SF1/2 helicases. Members of
           the AAA+ ATPases function as molecular chaperons, ATPase
           subunits of proteases, helicases, or nucleic-acid
           stimulated ATPases. The AAA+ proteins contain several
           distinct features in addition to the conserved
           alpha-beta-alpha core domain structure and the Walker A
           and B motifs of the P-loop NTPases.
          Length = 151

 Score = 95.7 bits (238), Expect = 4e-24
 Identities = 62/163 (38%), Positives = 89/163 (54%), Gaps = 24/163 (14%)

Query: 128 AGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEA--NNS 185
            G E A EAL+EA+ LP            P K +LL+GPPGTGK+ LA+A+A E     +
Sbjct: 1   VGQEEAIEALREALELP------------PPKNLLLYGPPGTGKTTLARAIANELFRPGA 48

Query: 186 TFFSVSSSDLVSKWLGESEK---LVKTLFELARAHRPSIIFIDE--RIDRGISNALFLLL 240
            F  +++SDL+   +        LV+ LFELA   +P ++FIDE   + RG  NAL  +L
Sbjct: 49  PFLYLNASDLLEGLVVAELFGHFLVRLLFELAEKAKPGVLFIDEIDSLSRGAQNALLRVL 108

Query: 241 SGVGNDM---DGILVLGATNIPWV--LDAAIRRRFEKRIYIAL 278
             + +     + + V+GATN P +  LD A+  R + RI I L
Sbjct: 109 ETLNDLRIDRENVRVIGATNRPLLGDLDRALYDRLDIRIVIPL 151


>gnl|CDD|146707 pfam04212, MIT, MIT (microtubule interacting and transport)
          domain.  The MIT domain forms an asymmetric three-helix
          bundle and binds ESCRT-III (endosomal sorting complexes
          required for transport) substrates.
          Length = 69

 Score = 78.4 bits (194), Expect = 1e-18
 Identities = 33/69 (47%), Positives = 48/69 (69%), Gaps = 1/69 (1%)

Query: 4  FDKGINIAIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLA 63
           +K + +  KA EAD+  NYEEAL+LY  +++Y L ALKYE    + ++++R+K  EYL 
Sbjct: 2  LEKALELVKKAVEADEAGNYEEALELYKEAIEYLLQALKYE-PDPKRREALRQKIAEYLD 60

Query: 64 RAEKLKEYL 72
          RAE+LKE L
Sbjct: 61 RAEELKELL 69


>gnl|CDD|239121 cd02656, MIT, MIT: domain contained within Microtubule
          Interacting and Trafficking molecules. The MIT domain
          is found in sorting nexins, the nuclear thiol protease
          PalBH, the AAA protein spastin and archaebacterial
          proteins with similar domain architecture, vacuolar
          sorting proteins and others. The molecular function of
          the MIT domain is unclear.
          Length = 75

 Score = 75.8 bits (187), Expect = 1e-17
 Identities = 31/73 (42%), Positives = 46/73 (63%), Gaps = 1/73 (1%)

Query: 5  DKGINIAIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLAR 64
           +   +  +A + D++ NYEEAL+LY  ++ Y L ALK E    + ++ +R+K  EYL R
Sbjct: 4  QQAKELIKQAVKEDEDGNYEEALELYKEALDYLLQALKAEKEP-KLRKLLRKKVKEYLDR 62

Query: 65 AEKLKEYLKKGKK 77
          AE LKE LKK K+
Sbjct: 63 AEFLKELLKKQKQ 75


>gnl|CDD|214640 smart00382, AAA, ATPases associated with a variety of cellular
           activities.  AAA - ATPases associated with a variety of
           cellular activities. This profile/alignment only detects
           a fraction of this vast family. The poorly conserved
           N-terminal helix is missing from the alignment.
          Length = 148

 Score = 73.2 bits (179), Expect = 7e-16
 Identities = 41/148 (27%), Positives = 64/148 (43%), Gaps = 24/148 (16%)

Query: 157 PWKGILLFGPPGTGKSYLAKAVATEAN--NSTFFSVSSSDL--------------VSKWL 200
           P + IL+ GPPG+GK+ LA+A+A E          +   D+                K  
Sbjct: 1   PGEVILIVGPPGSGKTTLARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVGGKKAS 60

Query: 201 GESEKLVKTLFELARAHRPSIIFIDE--RIDRGISNALFLLLSGVGNDM-----DGILVL 253
           G  E  ++    LAR  +P ++ +DE   +      AL LLL  +   +       + V+
Sbjct: 61  GSGELRLRLALALARKLKPDVLILDEITSLLDAEQEALLLLLEELRLLLLLKSEKNLTVI 120

Query: 254 GATNIPWVLD-AAIRRRFEKRIYIALPE 280
             TN    L  A +RRRF++RI + L  
Sbjct: 121 LTTNDEKDLGPALLRRRFDRRIVLLLIL 148


>gnl|CDD|177094 CHL00195, ycf46, Ycf46; Provisional.
          Length = 489

 Score = 75.4 bits (186), Expect = 4e-15
 Identities = 66/222 (29%), Positives = 103/222 (46%), Gaps = 31/222 (13%)

Query: 120 PNVKWTDVAGLEAAKEALKEAVILPIKFPQLFT------GKRVPWKGILLFGPPGTGKSY 173
            N K +D+ GL+  K+ LK       K    F+      G   P +G+LL G  GTGKS 
Sbjct: 223 VNEKISDIGGLDNLKDWLK-------KRSTSFSKQASNYGLPTP-RGLLLVGIQGTGKSL 274

Query: 174 LAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDERIDRGIS 233
            AKA+A +        +    L    +GESE  ++ +  +A A  P I++IDE ID+  S
Sbjct: 275 TAKAIANDW-QLPLLRLDVGKLFGGIVGESESRMRQMIRIAEALSPCILWIDE-IDKAFS 332

Query: 234 NALFLLLSGVGNDMDGIL-------------VLGATNIPWVLDAAIRR-RFEKRIYIALP 279
           N+     SG  N +                 V  A NI  +    +R+ RF++  ++ LP
Sbjct: 333 NSESKGDSGTTNRVLATFITWLSEKKSPVFVVATANNIDLLPLEILRKGRFDEIFFLDLP 392

Query: 280 ELNARLFMFRHHLGNTR-NTISDENLKELAKRTEGYSGADIS 320
            L  R  +F+ HL   R  +    ++K+L+K +  +SGA+I 
Sbjct: 393 SLEEREKIFKIHLQKFRPKSWKKYDIKKLSKLSNKFSGAEIE 434


>gnl|CDD|197854 smart00745, MIT, Microtubule Interacting and Trafficking molecule
          domain. 
          Length = 77

 Score = 68.1 bits (167), Expect = 8e-15
 Identities = 26/72 (36%), Positives = 45/72 (62%), Gaps = 1/72 (1%)

Query: 2  TTFDKGINIAIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEY 61
              K   +  KA +AD+  NYEEAL+LY  +++Y L  +K E + ++ +++++ K  EY
Sbjct: 3  DYLSKAKELISKALKADEAGNYEEALELYKKAIEYLLEGIKVE-SDSKRREALKAKAAEY 61

Query: 62 LARAEKLKEYLK 73
          L RAE++K+ L 
Sbjct: 62 LDRAEEIKKSLL 73


>gnl|CDD|239147 cd02684, MIT_2, MIT: domain contained within Microtubule
          Interacting and Trafficking molecules. This sub-family
          of MIT domains is found in proteins with an n-terminal
          serine/threonine kinase domain. The molecular function
          of the MIT domain is unclear.
          Length = 75

 Score = 56.7 bits (137), Expect = 1e-10
 Identities = 26/70 (37%), Positives = 47/70 (67%), Gaps = 1/70 (1%)

Query: 3  TFDKGINIAIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYL 62
          + +K I + ++A + D+  +   AL LY  ++QYF+ AL YE T A+ K+++R+K  +Y+
Sbjct: 2  SLEKAIALVVQAVKKDQRGDAAAALSLYCSALQYFVPALHYE-TDAQRKEALRQKVLQYV 60

Query: 63 ARAEKLKEYL 72
          +RAE+LK  +
Sbjct: 61 SRAEELKALI 70


>gnl|CDD|219538 pfam07728, AAA_5, AAA domain (dynein-related subfamily).  This Pfam
           entry includes some of the AAA proteins not detected by
           the pfam00004 model.
          Length = 135

 Score = 54.6 bits (132), Expect = 2e-09
 Identities = 41/136 (30%), Positives = 54/136 (39%), Gaps = 26/136 (19%)

Query: 160 GILLFGPPGTGKSYLAKAVATEANNSTFFSV------SSSDLVSKWLGESEKLVKTLFEL 213
           G+LL GPPGTGKS LA+ +A   +N   F V      +  DL  +   +          L
Sbjct: 1   GVLLVGPPGTGKSELAERLAAALSNRPVFYVQLTRDTTEEDLKGRRNIDPGGASWVDGPL 60

Query: 214 ARAHRPS-IIFIDERIDRGISNAL-----------FLLLSG---VGNDMDGILVLGATNI 258
            RA R   I  +DE I+R   + L            LL  G   V    DG  ++   N 
Sbjct: 61  VRAAREGEIAVLDE-INRANPDVLNSLLSLLDERRLLLPEGGELVKAAPDGFRLIATMNP 119

Query: 259 P----WVLDAAIRRRF 270
                  L  A+R RF
Sbjct: 120 LDRGLNELSPALRSRF 135


>gnl|CDD|239146 cd02683, MIT_1, MIT: domain contained within Microtubule
          Interacting and Trafficking molecules. This sub-family
          of MIT domains is found in proteins with unknown
          function, co-occuring with an as yet undescribed
          domain. The molecular function of the MIT domain is
          unclear.
          Length = 77

 Score = 45.5 bits (108), Expect = 9e-07
 Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 1/71 (1%)

Query: 6  KGINIAIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARA 65
              +  +A E D+   ++EAL  Y   +   +  LK     A+ K+++R+K  EY+ RA
Sbjct: 5  AAKEVLKRAVELDQEGRFQEALVCYQEGIDLLMQVLKGTKDEAK-KKNLRQKISEYMDRA 63

Query: 66 EKLKEYLKKGK 76
          E +K+ L + K
Sbjct: 64 EAIKKRLDQEK 74


>gnl|CDD|188437 TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA.  This
           model represents the AAA family ATPase, EccA, of the
           actinobacterial flavor of type VII secretion systems.
           Species such as Mycobacterium tuberculosis have several
           instances of this system per genome, designated EccA1,
           EccA2, etc [Protein fate, Protein and peptide secretion
           and trafficking].
          Length = 557

 Score = 47.9 bits (114), Expect = 4e-06
 Identities = 54/219 (24%), Positives = 91/219 (41%), Gaps = 38/219 (17%)

Query: 84  ESRTK--DDKKESDEDDSEDPDKKKMQANL--EGAIVMEKPNVKWTDVAGLEAAKE---A 136
           E+RT   D        +  DP   + +A L  E    + +         GLE  K    A
Sbjct: 238 EARTDPWDPSSAPSRAEFVDPAAAERKAKLLAEAEAELAE-------QIGLERVKRQVAA 290

Query: 137 LKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVA------TEANNSTFFSV 190
           LK +  + +   +           +L  GPPGTGK+ +A+ VA                V
Sbjct: 291 LKSSTAMALARAERGLPVAQTSNHMLFAGPPGTGKTTIARVVAKIYCGLGVLRKPLVREV 350

Query: 191 SSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDER---IDRGISN-------ALFLLL 240
           S +DL+ +++GESE     + + A      ++F+DE    ++ G          A+  LL
Sbjct: 351 SRADLIGQYIGESEAKTNEIIDSALG---GVLFLDEAYTLVETGYGQKDPFGLEAIDTLL 407

Query: 241 SGVGNDMDGILVLGA---TNIPWVLDA--AIRRRFEKRI 274
           + + ND D ++V+GA    ++   L+    +R RF + I
Sbjct: 408 ARMENDRDRLVVIGAGYRKDLDKFLEVNEGLRSRFTRVI 446


>gnl|CDD|225165 COG2256, MGS1, ATPase related to the helicase subunit of the
           Holliday junction resolvase [DNA replication,
           recombination, and repair].
          Length = 436

 Score = 47.2 bits (113), Expect = 6e-06
 Identities = 47/184 (25%), Positives = 79/184 (42%), Gaps = 45/184 (24%)

Query: 163 LFGPPGTGKSYLAKAVATEAN------NSTFFSVSSSDLVSKWLGESEKLVKTLFELARA 216
           L+GPPGTGK+ LA+ +A   N      ++    V              K ++ + E AR 
Sbjct: 53  LWGPPGTGKTTLARLIAGTTNAAFEALSAVTSGV--------------KDLREIIEEARK 98

Query: 217 HRPS----IIFIDE--RIDRGISNALFLLLSGVGNDMDGILVLGAT--NIPWVLDAAIRR 268
           +R      I+F+DE  R ++   +AL   L  V N    I+++GAT  N  + L+ A+  
Sbjct: 99  NRLLGRRTILFLDEIHRFNKAQQDAL---LPHVENGT--IILIGATTENPSFELNPALLS 153

Query: 269 RFEKRIYIALPELNARLFMFR---------HHLGNTRNTISDENLKELAKRTEGYSGADI 319
           R   R++  L  L++                 LG     + +E L  L + + G +   +
Sbjct: 154 R--ARVFE-LKPLSSEDIKKLLKRALLDEERGLGGQIIVLDEEALDYLVRLSNGDARRAL 210

Query: 320 STLE 323
           + LE
Sbjct: 211 NLLE 214


>gnl|CDD|237355 PRK13342, PRK13342, recombination factor protein RarA; Reviewed.
          Length = 413

 Score = 47.0 bits (113), Expect = 7e-06
 Identities = 41/164 (25%), Positives = 73/164 (44%), Gaps = 54/164 (32%)

Query: 161 ILLFGPPGTGKSYLAKAVATEAN------NSTFFSVSSSDLVSKWLGESEKLVKTLFELA 214
           ++L+GPPGTGK+ LA+ +A   +      ++    V              K ++ + E A
Sbjct: 39  MILWGPPGTGKTTLARIIAGATDAPFEALSAVTSGV--------------KDLREVIEEA 84

Query: 215 RAHRPS----IIFIDE--RIDRGISNALFLLLSGVGNDMDGILVLGAT--NIPWVLDAAI 266
           R  R +    I+FIDE  R ++   +A   LL  V +    I ++GAT  N  + ++ A+
Sbjct: 85  RQRRSAGRRTILFIDEIHRFNKAQQDA---LLPHVEDGT--ITLIGATTENPSFEVNPAL 139

Query: 267 RRRFEKRIYIALPELNARLFMFRHHLGNTRNTISDENLKELAKR 310
             R             A++F  +          S+E++++L KR
Sbjct: 140 LSR-------------AQVFELKPL--------SEEDIEQLLKR 162


>gnl|CDD|223786 COG0714, COG0714, MoxR-like ATPases [General function prediction
           only].
          Length = 329

 Score = 45.5 bits (108), Expect = 2e-05
 Identities = 41/165 (24%), Positives = 58/165 (35%), Gaps = 30/165 (18%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVS------SSDLVSKWLGESEKLVKTLFE-- 212
           +LL GPPG GK+ LA+A+A  A    F  +        SDL+  +   +  L    F   
Sbjct: 46  VLLEGPPGVGKTLLARALA-RALGLPFVRIQCTPDLLPSDLLGTYAYAALLLEPGEFRFV 104

Query: 213 ---LARAHRPSIIFIDE--RIDRGISNALFLLLS-------GVGND--MDGILVLGATNI 258
              L  A R  I+ +DE  R    + NAL   L        G+         +V+   N 
Sbjct: 105 PGPLFAAVRV-ILLLDEINRAPPEVQNALLEALEERQVTVPGLTTIRLPPPFIVIATQNP 163

Query: 259 PWV-----LDAAIRRRFEKRIYIA-LPELNARLFMFRHHLGNTRN 297
                   L  A+  RF  RIY+           +     G    
Sbjct: 164 GEYEGTYPLPEALLDRFLLRIYVDYPDSEEEERIILARVGGVDEL 208


>gnl|CDD|219536 pfam07724, AAA_2, AAA domain (Cdc48 subfamily).  This Pfam entry
           includes some of the AAA proteins not detected by the
           pfam00004 model.
          Length = 168

 Score = 44.1 bits (105), Expect = 2e-05
 Identities = 29/96 (30%), Positives = 39/96 (40%), Gaps = 14/96 (14%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNS--TFFSVSSSDL-----VSKWLGESEKLVK----- 208
            L  GP G GK+ LAKA+A            +  S+      VS+ +G     V      
Sbjct: 6   FLFLGPTGVGKTELAKALAELLFGDERALIRIDMSEYMEEHSVSRLIGAPPGYVGYEEGG 65

Query: 209 TLFELARAHRPSIIFIDE--RIDRGISNALFLLLSG 242
            L E  R    SI+ IDE  +   G+ N L  +L G
Sbjct: 66  QLTEAVRRKPYSIVLIDEIEKAHPGVQNDLLQILEG 101


>gnl|CDD|222104 pfam13401, AAA_22, AAA domain. 
          Length = 124

 Score = 43.0 bits (102), Expect = 2e-05
 Identities = 17/81 (20%), Positives = 34/81 (41%), Gaps = 15/81 (18%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSV------SSSDLVSKWLGE---------SEK 205
            +L G  G+GK+ L + +A +  N     V      +  DL+ K L           + +
Sbjct: 7   GVLTGESGSGKTTLLRRLARQLPNRRVVYVEAPSLGTPKDLLRKILRALGLPLSGGTTAE 66

Query: 206 LVKTLFELARAHRPSIIFIDE 226
           L++ + +  +     ++ IDE
Sbjct: 67  LLEAILDALKRRGRPLLIIDE 87


>gnl|CDD|224145 COG1224, TIP49, DNA helicase TIP49, TBP-interacting protein
           [Transcription].
          Length = 450

 Score = 43.1 bits (102), Expect = 1e-04
 Identities = 28/83 (33%), Positives = 46/83 (55%), Gaps = 8/83 (9%)

Query: 129 GLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEANNST-F 187
           G E A+EA    VI+     ++    ++  +GIL+ GPPGTGK+ LA  +A E      F
Sbjct: 43  GQEEAREAA--GVIV-----KMIKQGKMAGRGILIVGPPGTGKTALAMGIARELGEDVPF 95

Query: 188 FSVSSSDLVSKWLGESEKLVKTL 210
            ++S S++ S  + ++E L + L
Sbjct: 96  VAISGSEIYSLEVKKTEALTQAL 118


>gnl|CDD|147949 pfam06068, TIP49, TIP49 C-terminus.  This family consists of the
           C-terminal region of several eukaryotic and archaeal
           RuvB-like 1 (Pontin or TIP49a) and RuvB-like 2 (Reptin
           or TIP49b) proteins. The N-terminal domain contains the
           pfam00004 domain. In zebrafish, the liebeskummer (lik)
           mutation, causes development of hyperplastic embryonic
           hearts. lik encodes Reptin, a component of a
           DNA-stimulated ATPase complex. Beta-catenin and Pontin,
           a DNA-stimulated ATPase that is often part of complexes
           with Reptin, are in the same genetic pathways. The
           Reptin/Pontin ratio serves to regulate heart growth
           during development, at least in part via the
           beta-catenin pathway. TBP-interacting protein 49 (TIP49)
           was originally identified as a TBP-binding protein, and
           two related proteins are encoded by individual genes,
           tip49a and b. Although the function of this gene family
           has not been elucidated, they are supposed to play a
           critical role in nuclear events because they interact
           with various kinds of nuclear factors and have DNA
           helicase activities.TIP49a has been suggested to act as
           an autoantigen in some patients with autoimmune
           diseases.
          Length = 395

 Score = 43.1 bits (102), Expect = 1e-04
 Identities = 26/79 (32%), Positives = 44/79 (55%), Gaps = 8/79 (10%)

Query: 129 GLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEANNST-F 187
           G E A+EA    VI+     ++    ++  + +L+ GPPGTGK+ LA A++ E    T F
Sbjct: 28  GQEKAREAA--GVIV-----EMIKEGKIAGRAVLIAGPPGTGKTALAIAISKELGEDTPF 80

Query: 188 FSVSSSDLVSKWLGESEKL 206
             +S S++ S  + ++E L
Sbjct: 81  CPISGSEVYSLEMKKTEAL 99


>gnl|CDD|221983 pfam13207, AAA_17, AAA domain. 
          Length = 114

 Score = 39.2 bits (91), Expect = 3e-04
 Identities = 26/124 (20%), Positives = 46/124 (37%), Gaps = 21/124 (16%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPS 220
           IL+ GPPG+GKS LAK +A +        +S  DL+           + L EL       
Sbjct: 2   ILITGPPGSGKSTLAKKLAEKLG---IPVISLDDLL---------REEGLAELDDGELDD 49

Query: 221 IIFIDERIDRGISNALFL--LLSGVGNDMDGILVLGATNIPW-------VLDAAIRRRFE 271
           I    E ++  +        ++ GV      + +  A  + +            ++RR +
Sbjct: 50  IDIDLELLEEILDELAKQEWVIDGVRESTLELRLEEADLVVFLDLPLPACRFRLLKRRLQ 109

Query: 272 KRIY 275
           +   
Sbjct: 110 RGRG 113


>gnl|CDD|223679 COG0606, COG0606, Predicted ATPase with chaperone activity
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 490

 Score = 41.5 bits (98), Expect = 4e-04
 Identities = 21/51 (41%), Positives = 24/51 (47%), Gaps = 14/51 (27%)

Query: 126 DVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAK 176
           DV G E AK AL+ A            G       +LL GPPGTGK+ LA 
Sbjct: 180 DVKGQEQAKRALEIAA----------AGG----HNLLLVGPPGTGKTMLAS 216


>gnl|CDD|224401 COG1484, DnaC, DNA replication protein [DNA replication,
           recombination, and repair].
          Length = 254

 Score = 40.5 bits (95), Expect = 7e-04
 Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 14/93 (15%)

Query: 159 KGILLFGPPGTGKSYLAKAVATEANN---STFFSVSSSDLVSKW---LGESEKLVKTLFE 212
           + ++L GPPG GK++LA A+  E      S  F +++ DL+SK      E     K L E
Sbjct: 106 ENLVLLGPPGVGKTHLAIAIGNELLKAGISVLF-ITAPDLLSKLKAAFDEGRLEEKLLRE 164

Query: 213 LARAHRPSIIFIDE----RIDRGISNALFLLLS 241
           L +     ++ ID+       +  ++ LF L+S
Sbjct: 165 LKKV---DLLIIDDIGYEPFSQEEADLLFQLIS 194


>gnl|CDD|144608 pfam01078, Mg_chelatase, Magnesium chelatase, subunit ChlI.
           Magnesium-chelatase is a three-component enzyme that
           catalyzes the insertion of Mg2+ into protoporphyrin IX.
           This is the first unique step in the synthesis of
           (bacterio)chlorophyll. Due to this, it is thought that
           Mg-chelatase has an important role in channelling inter-
           mediates into the (bacterio)chlorophyll branch in
           response to conditions suitable for photosynthetic
           growth. ChlI and BchD have molecular weight between
           38-42 kDa.
          Length = 207

 Score = 39.8 bits (94), Expect = 7e-04
 Identities = 20/60 (33%), Positives = 26/60 (43%), Gaps = 20/60 (33%)

Query: 124 WTDVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKG---ILLFGPPGTGKSYLAKAVAT 180
             DV G E AK AL+ A                   G   +L+ GPPG+GK+ LAK +  
Sbjct: 2   LADVKGQEQAKRALEIAAA-----------------GGHNLLMIGPPGSGKTMLAKRLPG 44


>gnl|CDD|214396 CHL00206, ycf2, Ycf2; Provisional.
          Length = 2281

 Score = 41.0 bits (96), Expect = 8e-04
 Identities = 41/163 (25%), Positives = 62/163 (38%), Gaps = 57/163 (34%)

Query: 157  PWKGILLFGPPGTGKSYLAKAVAT------------------------------------ 180
            P +GIL+ G  GTG+SYL K +AT                                    
Sbjct: 1629 PSRGILVIGSIGTGRSYLVKYLATNSYVPFITVFLNKFLDNKPKGFLIDDIDIDDSDDID 1688

Query: 181  ---------EANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFIDERIDRG 231
                     +    T  +  + D++ K        +   FELA+A  P II+I    D  
Sbjct: 1689 DSDDIDRDLDTELLTMMNALTMDMMPK---IDRFYITLQFELAKAMSPCIIWIPNIHDLN 1745

Query: 232  I--SNALFL--LLSGVGNDMD-----GILVLGATNIPWVLDAA 265
            +  SN L L  L++ +  D +      ILV+ +T+IP  +D A
Sbjct: 1746 VNESNYLSLGLLVNSLSRDCERCSTRNILVIASTHIPQKVDPA 1788


>gnl|CDD|110677 pfam01695, IstB_IS21, IstB-like ATP binding protein.  This protein
           contains an ATP/GTP binding P-loop motif. It is found
           associated with IS21 family insertion sequences. The
           function of this protein is unknown, but it may perform
           a transposase function.
          Length = 178

 Score = 38.4 bits (90), Expect = 0.002
 Identities = 29/92 (31%), Positives = 49/92 (53%), Gaps = 15/92 (16%)

Query: 160 GILLFGPPGTGKSYLAKAVATEA---NNSTFFSVSSSDLVSKWLGESE---KLVKTLFEL 213
            +LL GPPG GK++LA A+  +A     S  F   + DLV + L  +    +L +TL  L
Sbjct: 49  NLLLLGPPGVGKTHLACALGHQACRAGYSVLF-TRTPDLVEQ-LKRARGDGRLARTLQRL 106

Query: 214 ARAHRPSIIFIDE----RIDRGISNALFLLLS 241
           A+A    ++ +D+     + +  ++ LF L+S
Sbjct: 107 AKA---DLLILDDIGYLPLSQEAAHLLFELIS 135


>gnl|CDD|223637 COG0563, Adk, Adenylate kinase and related kinases [Nucleotide
           transport and metabolism].
          Length = 178

 Score = 38.0 bits (89), Expect = 0.002
 Identities = 23/87 (26%), Positives = 37/87 (42%), Gaps = 8/87 (9%)

Query: 159 KGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHR 218
             IL+ GPPG GKS LAK +   A       + + D++   + E  +L + + +      
Sbjct: 1   MRILILGPPGAGKSTLAKKL---AKKLGLPHLDTGDILRAAIAERTELGEEIKKYIDKGE 57

Query: 219 --PS---IIFIDERIDRGISNALFLLL 240
             P       + ER+D     A F+L 
Sbjct: 58  LVPDEIVNGLVKERLDEADCKAGFILD 84


>gnl|CDD|236153 PRK08116, PRK08116, hypothetical protein; Validated.
          Length = 268

 Score = 38.1 bits (89), Expect = 0.003
 Identities = 13/22 (59%), Positives = 17/22 (77%)

Query: 160 GILLFGPPGTGKSYLAKAVATE 181
           G+LL+G  GTGK+YLA  +A E
Sbjct: 116 GLLLWGSVGTGKTYLAACIANE 137


>gnl|CDD|216186 pfam00910, RNA_helicase, RNA helicase.  This family includes RNA
           helicases thought to be involved in duplex unwinding
           during viral RNA replication. Members of this family are
           found in a variety of single stranded RNA viruses.
          Length = 105

 Score = 36.1 bits (84), Expect = 0.004
 Identities = 14/33 (42%), Positives = 16/33 (48%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSS 193
           I L+GPPG GKS LAK +A             S
Sbjct: 1   IWLYGPPGCGKSTLAKYLARALLKHLGLPKKDS 33


>gnl|CDD|136670 PRK08181, PRK08181, transposase; Validated.
          Length = 269

 Score = 36.8 bits (85), Expect = 0.011
 Identities = 27/87 (31%), Positives = 47/87 (54%), Gaps = 7/87 (8%)

Query: 161 ILLFGPPGTGKSYLAKAV--ATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHR 218
           +LLFGPPG GKS+LA A+  A   N        ++DLV K L  + + ++    +A+  +
Sbjct: 109 LLLFGPPGGGKSHLAAAIGLALIENGWRVLFTRTTDLVQK-LQVARRELQLESAIAKLDK 167

Query: 219 PSIIFIDERI----DRGISNALFLLLS 241
             ++ +D+      D+  ++ LF L+S
Sbjct: 168 FDLLILDDLAYVTKDQAETSVLFELIS 194


>gnl|CDD|237495 PRK13765, PRK13765, ATP-dependent protease Lon; Provisional.
          Length = 637

 Score = 36.5 bits (85), Expect = 0.015
 Identities = 26/85 (30%), Positives = 40/85 (47%), Gaps = 14/85 (16%)

Query: 95  DEDDSEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGK 154
           +E++ ED +      + E    +E P      V G E A E +K+A             +
Sbjct: 1   EEEEDEDEEDLLGGLDFETTSDIEVPERLIDQVIGQEHAVEVIKKAAK-----------Q 49

Query: 155 RVPWKGILLFGPPGTGKSYLAKAVA 179
           R   + +++ G PGTGKS LAKA+A
Sbjct: 50  R---RHVMMIGSPGTGKSMLAKAMA 71


>gnl|CDD|234619 PRK00080, ruvB, Holliday junction DNA helicase RuvB; Reviewed.
          Length = 328

 Score = 35.5 bits (83), Expect = 0.027
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 161 ILLFGPPGTGKSYLAKAVATE 181
           +LL+GPPG GK+ LA  +A E
Sbjct: 54  VLLYGPPGLGKTTLANIIANE 74


>gnl|CDD|225164 COG2255, RuvB, Holliday junction resolvasome, helicase subunit [DNA
           replication, recombination, and repair].
          Length = 332

 Score = 35.6 bits (83), Expect = 0.028
 Identities = 13/23 (56%), Positives = 16/23 (69%)

Query: 161 ILLFGPPGTGKSYLAKAVATEAN 183
           +LLFGPPG GK+ LA  +A E  
Sbjct: 55  VLLFGPPGLGKTTLAHIIANELG 77


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 35.7 bits (83), Expect = 0.029
 Identities = 44/192 (22%), Positives = 82/192 (42%), Gaps = 39/192 (20%)

Query: 35  QYFLHALKYEITSAEGKQSIREKCDEYLARAEK---LKEYLKKGKKQPVKDGESRTKDDK 91
           +  L  LK E+   + +  I +K +E + + ++   L+E LK  KK+   + + + + +K
Sbjct: 192 KKALELLKKELELLKLQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEK 251

Query: 92  KESDEDDSEDPD--KKKMQANLEGAIVMEKP---------------NVKWTDV----AGL 130
            +   ++ + P+  KK ++  L    ++E                 ++ W         L
Sbjct: 252 LKEKLEELKLPEEVKKVIEKELTKLSLLEPSSSEFTVTRNYLDWLTDLPWGKYSKENLDL 311

Query: 131 EAAKEAL----------KEAVILPIKFPQLFTGKRVPWKGILLF-GPPGTGKSYLAKAVA 179
           + AKE L          KE ++  +   +L    + P   IL   GPPG GK+ L K++A
Sbjct: 312 KRAKEILDEDHYGLKKVKERILEYLAVQKLRGKMKGP---ILCLVGPPGVGKTSLGKSIA 368

Query: 180 TEANNSTFFSVS 191
            +A N  F   S
Sbjct: 369 -KALNRKFVRFS 379


>gnl|CDD|213179 cd00267, ABC_ATPase, ATP-binding cassette transporter
           nucleotide-binding domain.  ABC transporters are a large
           family of proteins involved in the transport of a wide
           variety of different compounds, like sugars, ions,
           peptides, and more complex organic molecules. The
           nucleotide-binding domain shows the highest similarity
           between all members of the family. ABC transporters are
           a subset of nucleotide hydrolases that contain a
           signature motif, Q-loop, and H-loop/switch region, in
           addition to, the Walker A motif/P-loop and Walker B
           motif commonly found in a number of ATP- and GTP-binding
           and hydrolyzing proteins.
          Length = 157

 Score = 34.5 bits (80), Expect = 0.030
 Identities = 20/84 (23%), Positives = 34/84 (40%), Gaps = 23/84 (27%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNS----TFFSVSSSDLVSKWL------------GESE 204
           + L GP G+GKS L +A+A     +           + L  + L            G+ +
Sbjct: 28  VALVGPNGSGKSTLLRAIAGLLKPTSGEILIDGKDIAKLPLEELRRRIGYVPQLSGGQRQ 87

Query: 205 KLVKTLFELARA--HRPSIIFIDE 226
           ++      LARA    P ++ +DE
Sbjct: 88  RV-----ALARALLLNPDLLLLDE 106


>gnl|CDD|225806 COG3267, ExeA, Type II secretory pathway, component ExeA (predicted
           ATPase) [Intracellular trafficking and secretion].
          Length = 269

 Score = 35.1 bits (81), Expect = 0.031
 Identities = 41/189 (21%), Positives = 73/189 (38%), Gaps = 33/189 (17%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSD-----------LVSKWLGESEKLVKT 209
           + + G  G+GK+ L +A+    N      V               +V+    + +  V  
Sbjct: 54  LAVTGEVGSGKTVLRRALLASLNEDQVAVVVIDKPTLSDATLLEAIVADLESQPKVNVNA 113

Query: 210 LFE---------LARAHRPSIIFIDERIDRGIS--NALFLLLSGV--GNDMDGILVLGAT 256
           + E         + +  RP ++ +DE  D   S   AL LL +     + +  I+++G  
Sbjct: 114 VLEQIDRELAALVKKGKRPVVLMVDEAHDLNDSALEALRLLTNLEEDSSKLLSIVLIGQP 173

Query: 257 NIPWVLDAAIRRRFEKR--IYIALPELN---ARLFMFRHHL---GNTRNTISDENLKELA 308
            +   L   + R  E+R  I I LP L      L++ RH L   G      SD+ L  + 
Sbjct: 174 KLRPRLRLPVLRELEQRIDIRIELPPLTEAETGLYL-RHRLEGAGLPEPLFSDDALLLIH 232

Query: 309 KRTEGYSGA 317
           + ++G    
Sbjct: 233 EASQGIPRL 241


>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
           chaperones [Posttranslational modification, protein
           turnover, chaperones].
          Length = 786

 Score = 35.7 bits (83), Expect = 0.033
 Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 11/76 (14%)

Query: 162 LLFGPPGTGKSYLAKAVATE-AN--------NSTFFSVSSSDLVS--KWLGESEKLVKTL 210
           +L G PG GK+ + + +A    N        +   +S+    LV+  K+ GE E+ +K +
Sbjct: 195 VLVGEPGVGKTAIVEGLAQRIVNGDVPESLKDKRIYSLDLGSLVAGAKYRGEFEERLKAV 254

Query: 211 FELARAHRPSIIFIDE 226
            +     +  I+FIDE
Sbjct: 255 LKEVEKSKNVILFIDE 270



 Score = 28.0 bits (63), Expect = 9.2
 Identities = 54/212 (25%), Positives = 83/212 (39%), Gaps = 43/212 (20%)

Query: 38  LHALKYEITSAEGKQSIREKCDEYLARAEKLKEYLKKGKKQPVKDGESRTKDDKKESDED 97
           L  L+ E  + E +Q  +EK         KL + + K K+  + + E   K+ + E DED
Sbjct: 410 LAQLEIEKEALEREQDEKEK---------KLIDEIIKLKEGRIPELE---KELEAEVDED 457

Query: 98  DSED--------PDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQ 149
           D  +        P  K ++   E   ++         V G + A EA+ +A+        
Sbjct: 458 DIAEVVARWTGIPVAKLLED--EKEKLLNLERRLKKRVIGQDEAVEAVSDAIRR------ 509

Query: 150 LFTGKRVPWK--GILLF-GPPGTGKSYLAKAVATEANNST--FFSVSSSDL-----VSKW 199
              G   P +  G  LF GP G GK+ LAKA+A            +  S+      VS+ 
Sbjct: 510 ARAGLGDPNRPIGSFLFLGPTGVGKTELAKALAEALFGDEQALIRIDMSEYMEKHSVSRL 569

Query: 200 LGESEKLV-----KTLFELARAHRPSIIFIDE 226
           +G     V       L E  R    S+I +DE
Sbjct: 570 IGAPPGYVGYEEGGQLTEAVRRKPYSVILLDE 601


>gnl|CDD|239140 cd02677, MIT_SNX15, MIT: domain contained within Microtubule
          Interacting and Trafficking molecules. This MIT domain
          sub-family is found in sorting nexin 15 and related
          proteins. The molecular function of the MIT domain is
          unclear.
          Length = 75

 Score = 32.7 bits (75), Expect = 0.035
 Identities = 17/56 (30%), Positives = 32/56 (57%), Gaps = 1/56 (1%)

Query: 13 KATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKL 68
           A E ++  +YE A + Y   V   L  ++ + +S E +++++ K  EYL RAE++
Sbjct: 12 LALEKEEEGDYEAAFEFYRAGVDLLLKGVQGD-SSPERREAVKRKIAEYLKRAEEI 66


>gnl|CDD|223989 COG1061, SSL2, DNA or RNA helicases of superfamily II
           [Transcription / DNA replication, recombination, and
           repair].
          Length = 442

 Score = 35.1 bits (81), Expect = 0.049
 Identities = 12/40 (30%), Positives = 19/40 (47%)

Query: 166 PPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEK 205
           P G GK+ +A     E   ST   V + +L+ +W    +K
Sbjct: 63  PTGAGKTVVAAEAIAELKRSTLVLVPTKELLDQWAEALKK 102


>gnl|CDD|234203 TIGR03420, DnaA_homol_Hda, DnaA regulatory inactivator Hda.
           Members of this protein family are Hda (Homologous to
           DnaA). These proteins are about half the length of DnaA
           and homologous over length of Hda. In the model species
           Escherichia coli, the initiation of DNA replication
           requires DnaA bound to ATP rather than ADP; Hda helps
           facilitate the conversion of DnaA-ATP to DnaA-ADP [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 226

 Score = 34.1 bits (79), Expect = 0.055
 Identities = 30/129 (23%), Positives = 49/129 (37%), Gaps = 25/129 (19%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPS 220
           + L+G  G+GKS+L +A    A         S+  +   L E  +    + E     +  
Sbjct: 41  LYLWGESGSGKSHLLQAACAAAEERG----KSAIYLP--LAELAQADPEVLEGLE--QAD 92

Query: 221 IIFID--ERID--RGISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAIRRRFEKRIYI 276
           ++ +D  E I        ALF L + V  +  G L++     P  L             +
Sbjct: 93  LVCLDDVEAIAGQPEWQEALFHLYNRV-REAGGRLLIAGRAAPAQLP------------L 139

Query: 277 ALPELNARL 285
            LP+L  RL
Sbjct: 140 RLPDLRTRL 148


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 34.9 bits (81), Expect = 0.060
 Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 8/58 (13%)

Query: 126 DVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEAN 183
           DV G E AKE L+E +     +      K  P K +LL+GPPG GK+ LA A+A +  
Sbjct: 15  DVVGNEKAKEQLREWIE---SWL-----KGKPKKALLLYGPPGVGKTSLAHALANDYG 64


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 34.8 bits (81), Expect = 0.070
 Identities = 26/67 (38%), Positives = 33/67 (49%), Gaps = 15/67 (22%)

Query: 129 GLEAAKEALKE--AVILPIKFPQLFTGKRVPWKG--ILLFGPPGTGKSYLAKAVATEANN 184
           GLE  KE + E  AV       Q  T K    KG  + L GPPG GK+ L K++A +A  
Sbjct: 327 GLEKVKERILEYLAV-------QKLTKKL---KGPILCLVGPPGVGKTSLGKSIA-KALG 375

Query: 185 STFFSVS 191
             F  +S
Sbjct: 376 RKFVRIS 382


>gnl|CDD|237354 PRK13341, PRK13341, recombination factor protein RarA/unknown
           domain fusion protein; Reviewed.
          Length = 725

 Score = 34.6 bits (80), Expect = 0.071
 Identities = 34/97 (35%), Positives = 47/97 (48%), Gaps = 10/97 (10%)

Query: 162 LLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFE-LARAHRPS 220
           +L+GPPG GK+ LA+ +A    N T    SS + V   + +    V    E L R  + +
Sbjct: 56  ILYGPPGVGKTTLARIIA----NHTRAHFSSLNAVLAGVKDLRAEVDRAKERLERHGKRT 111

Query: 221 IIFIDERIDRGISNALFLLLSGVGNDMDGILVL-GAT 256
           I+FIDE + R        LL  V N   G + L GAT
Sbjct: 112 ILFIDE-VHRFNKAQQDALLPWVEN---GTITLIGAT 144


>gnl|CDD|180607 PRK06526, PRK06526, transposase; Provisional.
          Length = 254

 Score = 34.1 bits (78), Expect = 0.072
 Identities = 23/89 (25%), Positives = 45/89 (50%), Gaps = 9/89 (10%)

Query: 161 ILLFGPPGTGKSYLAKAV---ATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAH 217
           ++  GPPGTGK++LA  +   A +A +   F+ +++  V++ L  +    +   EL +  
Sbjct: 101 VVFLGPPGTGKTHLAIGLGIRACQAGHRVLFA-TAAQWVAR-LAAAHHAGRLQAELVKLG 158

Query: 218 RPSIIFIDE----RIDRGISNALFLLLSG 242
           R  ++ +DE      +   +N  F L+S 
Sbjct: 159 RYPLLIVDEVGYIPFEPEAANLFFQLVSS 187


>gnl|CDD|221970 pfam13191, AAA_16, AAA ATPase domain.  This family of domains
           contain a P-loop motif that is characteristic of the AAA
           superfamily.
          Length = 154

 Score = 33.3 bits (76), Expect = 0.076
 Identities = 27/122 (22%), Positives = 42/122 (34%), Gaps = 37/122 (30%)

Query: 161 ILLFGPPGTGKSYLAKAVAT------------EANNSTFFSVSSSDLVSKWLGESEK--- 205
           +LL GP GTGK+ L + +              E N    FS +  +L+ + L E      
Sbjct: 27  VLLTGPSGTGKTSLLRELLEGLLVAAGKCDQAERNPPYAFSQALRELLRQLLRELAAELL 86

Query: 206 --------------------LVKTLFELARAHRPSIIFIDE--RIDRGISNALFLLLSGV 243
                               LV+ L  L    RP ++ +D+    D    + L  LL  +
Sbjct: 87  LLREALLAALGAELIEGLQDLVELLERLLARARPLVLVLDDLQWADEESLDLLAALLRRL 146

Query: 244 GN 245
             
Sbjct: 147 ER 148


>gnl|CDD|216270 pfam01057, Parvo_NS1, Parvovirus non-structural protein NS1.  This
           family also contains the NS2 protein. Parvoviruses
           encode two non-structural proteins, NS1 and NS2. The
           mRNA for NS2 contains the coding sequence for the first
           87 amino acids of NS1, then by an alternative splicing
           mechanism mRNA from a different reading frame, encoding
           the last 78 amino acids, makes up the full length of the
           NS2 mRNA. NS1, is the major non-structural protein. It
           is essential for DNA replication. It is an 83-kDa
           nuclear phosphoprotein. It has DNA helicase and ATPase
           activity.
          Length = 271

 Score = 34.2 bits (79), Expect = 0.077
 Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 161 ILLFGPPGTGKSYLAKAVAT 180
           +  +GP  TGK+ LA+A+A 
Sbjct: 116 VWFYGPASTGKTNLAQAIAH 135


>gnl|CDD|224160 COG1239, ChlI, Mg-chelatase subunit ChlI [Coenzyme metabolism].
          Length = 423

 Score = 34.3 bits (79), Expect = 0.087
 Identities = 17/46 (36%), Positives = 23/46 (50%), Gaps = 8/46 (17%)

Query: 135 EALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVAT 180
           + LK A+ L    PQ+         G L+ G  GT KS LA+A+A 
Sbjct: 23  DPLKLALGLNAVDPQI--------GGALIAGEKGTAKSTLARALAD 60


>gnl|CDD|239143 cd02680, MIT_calpain7_2, MIT: domain contained within Microtubule
          Interacting and Trafficking molecules. This sub-family
          of MIT domains is found in the nuclear thiol protease
          PalBH. The molecular function of the MIT domain is
          unclear.
          Length = 75

 Score = 31.5 bits (71), Expect = 0.10
 Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 13 KATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKLKEYL 72
          +A + D+  N EEA++LYT +V+  ++    E      +  +++   + L RAE LKE +
Sbjct: 12 QAFDEDEKGNAEEAIELYTEAVELCINT-SNETMDQALQTKLKQLARQALDRAEALKESM 70

Query: 73 KK 74
           K
Sbjct: 71 SK 72


>gnl|CDD|238026 cd00071, GMPK, Guanosine monophosphate kinase (GMPK, EC 2.7.4.8),
           also known as guanylate kinase (GKase), catalyzes the
           reversible phosphoryl transfer from adenosine
           triphosphate (ATP) to guanosine monophosphate (GMP) to
           yield adenosine diphosphate (ADP) and guanosine
           diphosphate (GDP). It plays an essential role in the
           biosynthesis of guanosine triphosphate (GTP). This
           enzyme is also important for the activation of some
           antiviral and anticancer agents, such as acyclovir,
           ganciclovir, carbovir, and thiopurines.
          Length = 137

 Score = 32.5 bits (75), Expect = 0.10
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVS 191
           I+L GP G GKS L K +  E + +  FSVS
Sbjct: 2   IVLSGPSGVGKSTLLKRLLEEFDPNFGFSVS 32


>gnl|CDD|129721 TIGR00635, ruvB, Holliday junction DNA helicase, RuvB subunit.  All
           proteins in this family for which functions are known
           are 5'-3' DNA helicases that, as part of a complex with
           RuvA homologs serve as a 5'-3' Holliday junction
           helicase. RuvA specifically binds Holliday junctions as
           a sandwich of two tetramers and maintains the
           configuration of the junction. It forms a complex with
           two hexameric rings of RuvB, the subunit that contains
           helicase activity. The complex drives ATP-dependent
           branch migration of the Holliday junction recombination
           intermediate. The endonuclease RuvC resolves junctions
           [DNA metabolism, DNA replication, recombination, and
           repair].
          Length = 305

 Score = 33.8 bits (78), Expect = 0.11
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 161 ILLFGPPGTGKSYLAKAVATEAN 183
           +LL+GPPG GK+ LA  +A E  
Sbjct: 33  LLLYGPPGLGKTTLAHIIANEMG 55


>gnl|CDD|181681 PRK09183, PRK09183, transposase/IS protein; Provisional.
          Length = 259

 Score = 33.5 bits (77), Expect = 0.11
 Identities = 23/87 (26%), Positives = 37/87 (42%), Gaps = 20/87 (22%)

Query: 161 ILLFGPPGTGKSYLAKAV---ATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAH 217
           I+L GP G GK++LA A+   A  A     F  +++DL              L +L+ A 
Sbjct: 105 IVLLGPSGVGKTHLAIALGYEAVRAGIKVRF-TTAADL--------------LLQLSTAQ 149

Query: 218 RPSIIFIDERIDRGISNALFLLLSGVG 244
           R         + RG+     L++  +G
Sbjct: 150 RQG--RYKTTLQRGVMAPRLLIIDEIG 174


>gnl|CDD|235422 PRK05342, clpX, ATP-dependent protease ATP-binding subunit ClpX;
           Provisional.
          Length = 412

 Score = 33.6 bits (78), Expect = 0.11
 Identities = 29/90 (32%), Positives = 42/90 (46%), Gaps = 38/90 (42%)

Query: 161 ILLFGPPGTGKSYLAK----------AVA-----TEANNSTFFSVSSSDLVSKWLGES-E 204
           ILL GP G+GK+ LA+          A+A     TEA                ++GE  E
Sbjct: 111 ILLIGPTGSGKTLLAQTLARILDVPFAIADATTLTEA---------------GYVGEDVE 155

Query: 205 K-LVKTL----FELARAHRPSIIFIDERID 229
             L+K L    +++ +A R  I++IDE ID
Sbjct: 156 NILLKLLQAADYDVEKAQR-GIVYIDE-ID 183


>gnl|CDD|184311 PRK13764, PRK13764, ATPase; Provisional.
          Length = 602

 Score = 33.7 bits (78), Expect = 0.13
 Identities = 16/51 (31%), Positives = 23/51 (45%), Gaps = 18/51 (35%)

Query: 160 GILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTL 210
           GIL+ G PG GKS  A+A+A                   +  +  K+VKT+
Sbjct: 259 GILIAGAPGAGKSTFAQALAE------------------FYADMGKIVKTM 291


>gnl|CDD|190347 pfam02562, PhoH, PhoH-like protein.  PhoH is a cytoplasmic protein
           and predicted ATPase that is induced by phosphate
           starvation.
          Length = 205

 Score = 32.8 bits (76), Expect = 0.14
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 161 ILLFGPPGTGKSYLAKAVATEA 182
           +   GP GTGK+YLA A A +A
Sbjct: 22  VFGIGPAGTGKTYLAVAAAVDA 43


>gnl|CDD|237090 PRK12402, PRK12402, replication factor C small subunit 2; Reviewed.
          Length = 337

 Score = 33.4 bits (77), Expect = 0.14
 Identities = 24/72 (33%), Positives = 33/72 (45%), Gaps = 18/72 (25%)

Query: 126 DVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEA--- 182
           D+ G +   E L  AV  P   P L           L+ GPPG+GK+   +A+A E    
Sbjct: 16  DILGQDEVVERLSRAVDSPN-LPHL-----------LVQGPPGSGKTAAVRALARELYGD 63

Query: 183 ---NNSTFFSVS 191
              NN T F+V+
Sbjct: 64  PWENNFTEFNVA 75


>gnl|CDD|221955 pfam13173, AAA_14, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily.
          Length = 127

 Score = 31.8 bits (73), Expect = 0.15
 Identities = 14/66 (21%), Positives = 25/66 (37%), Gaps = 2/66 (3%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRPS 220
           I++ GP   GK+ L      E  +     ++  D     L     L++   +L       
Sbjct: 5   IVITGPRQVGKTTLLLQFLKELLSENILYINLDDPRFLRL-ALLDLLEEFLKLLLP-GKK 62

Query: 221 IIFIDE 226
            +F+DE
Sbjct: 63  YLFLDE 68


>gnl|CDD|203260 pfam05496, RuvB_N, Holliday junction DNA helicase ruvB N-terminus. 
           The RuvB protein makes up part of the RuvABC revolvasome
           which catalyzes the resolution of Holliday junctions
           that arise during genetic recombination and DNA repair.
           Branch migration is catalyzed by the RuvB protein that
           is targeted to the Holliday junction by the structure
           specific RuvA protein. This family contains the
           N-terminal region of the protein.
          Length = 231

 Score = 32.4 bits (75), Expect = 0.24
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 161 ILLFGPPGTGKSYLAKAVATE 181
           +LL+GPPG GK+ LA  +A E
Sbjct: 53  VLLYGPPGLGKTTLANIIANE 73


>gnl|CDD|222866 PHA02544, 44, clamp loader, small subunit; Provisional.
          Length = 316

 Score = 32.7 bits (75), Expect = 0.24
 Identities = 22/61 (36%), Positives = 30/61 (49%), Gaps = 8/61 (13%)

Query: 139 EAVILPIKFPQLFTG----KRVPWKGILLFGP-PGTGKSYLAKAVATEANNSTFFSVSSS 193
           +  ILP    + F       R+P   +LL  P PGTGK+ +AKA+  E      F V+ S
Sbjct: 21  DECILPAADKETFKSIVKKGRIP--NMLLHSPSPGTGKTTVAKALCNEVGAEVLF-VNGS 77

Query: 194 D 194
           D
Sbjct: 78  D 78


>gnl|CDD|234053 TIGR02903, spore_lon_C, ATP-dependent protease, Lon family.
           Members of this protein family resemble the widely
           distributed ATP-dependent protease La, also called Lon
           and LonA. It resembles even more closely LonB, which is
           a LonA paralog found in genomes if and only if the
           species is capable of endospore formation (as in
           Bacillus subtilis, Clostridium tetani, and select other
           members of the Firmicutes) and expressed specifically in
           the forespore compartment. Members of this family are
           restricted to a subset of spore-forming species, and are
           very likely to participate in the program of endospore
           formation. We propose the designation LonC [Protein
           fate, Degradation of proteins, peptides, and
           glycopeptides, Cellular processes, Sporulation and
           germination].
          Length = 615

 Score = 32.9 bits (75), Expect = 0.28
 Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 22/126 (17%)

Query: 66  EKLKEYLKKGKKQPVKD-GESRTKDDKKESDEDDSEDPDKKKMQANLEGAIVMEKPNVKW 124
           E+  +YL++ + Q +K+          K+ +    E  +KKK+  + +  +   +P   +
Sbjct: 100 ERQNKYLEEIRLQVLKEEKGPENSSTLKKLER--LEKLEKKKLHKSAQSLL---RPR-AF 153

Query: 125 TDVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEA-- 182
           +++ G E A +AL   V  P  FPQ           I+L+GPPG GK+  A+    EA  
Sbjct: 154 SEIVGQERAIKALLAKVASP--FPQ----------HIILYGPPGVGKTTAARLALEEAKK 201

Query: 183 -NNSTF 187
             ++ F
Sbjct: 202 LKHTPF 207


>gnl|CDD|222307 pfam13671, AAA_33, AAA domain.  This family of domains contain only
           a P-loop motif, that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           just short fragments so there is no Walker B motif.
          Length = 143

 Score = 31.5 bits (72), Expect = 0.30
 Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 20/78 (25%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLV------------SKWLGESEKLVK 208
           IL+ G PG+GKS  A+ +  E        V SSD +            S +   S ++ +
Sbjct: 2   ILMVGLPGSGKSTFARRLLRELG----AVVLSSDTLRKRLRGDGPPDISYYARASGRVYQ 57

Query: 209 TLFELA----RAHRPSII 222
            L ELA    RA RP I+
Sbjct: 58  RLLELAREALRAGRPVIV 75


>gnl|CDD|236353 PRK08939, PRK08939, primosomal protein DnaI; Reviewed.
          Length = 306

 Score = 32.1 bits (74), Expect = 0.32
 Identities = 45/182 (24%), Positives = 73/182 (40%), Gaps = 30/182 (16%)

Query: 15  TEADKNKNYEEALK-LYTH-SVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKLKE-- 70
              D  K YE+  + +     VQ FL   + E+T     +S+  K  EY+ + +K     
Sbjct: 13  KRQDFRKRYEKLKQEVLADPDVQAFLQEHQEELTQEMIDRSL-SKLYEYIDQRKKCDRGC 71

Query: 71  --YLKKGKKQPVKDGES------RTKDDKKESDEDDSEDPDKKKMQANLEGAIVMEKP-- 120
             Y K  + + V +G         T +  +          D++K       +I M K   
Sbjct: 72  PNYAKGYQPKLVMNGGYIDVSYLPTPEKIEA---------DEEKAIKKRIQSIYMPKDLL 122

Query: 121 NVKWTDVAGLEAA-KEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVA 179
                D+   +    +AL  A+     +P    G++V  KG+ L+G  G GKSYL  A+A
Sbjct: 123 QASLADIDLDDRDRLDALMAALDFLEAYPP---GEKV--KGLYLYGDFGVGKSYLLAAIA 177

Query: 180 TE 181
            E
Sbjct: 178 NE 179


>gnl|CDD|215947 pfam00493, MCM, MCM2/3/5 family. 
          Length = 327

 Score = 32.1 bits (74), Expect = 0.33
 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 14/68 (20%)

Query: 135 EALKEAVILPIKFPQLF--TGKRVPWKG-------ILLFGPPGTGKSYLAKAVATEANNS 185
           E +K+A++L     QLF    K +P          +LL G PGT KS L K VA  A  +
Sbjct: 30  EDVKKAILL-----QLFGGVKKNLPDGTRLRGDINVLLVGDPGTAKSQLLKYVAKLAPRA 84

Query: 186 TFFSVSSS 193
            + S   S
Sbjct: 85  VYTSGKGS 92


>gnl|CDD|224140 COG1219, ClpX, ATP-dependent protease Clp, ATPase subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 408

 Score = 32.2 bits (74), Expect = 0.33
 Identities = 28/91 (30%), Positives = 40/91 (43%), Gaps = 38/91 (41%)

Query: 161 ILLFGPPGTGKSYLAKAVA---------------TEANNSTFFSVSSSDLVSKWLGES-- 203
           ILL GP G+GK+ LA+ +A               TEA                ++GE   
Sbjct: 100 ILLIGPTGSGKTLLAQTLAKILNVPFAIADATTLTEAG---------------YVGEDVE 144

Query: 204 ---EKLV-KTLFELARAHRPSIIFIDERIDR 230
               KL+    +++ RA R  II+IDE ID+
Sbjct: 145 NILLKLLQAADYDVERAER-GIIYIDE-IDK 173


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 32.5 bits (74), Expect = 0.34
 Identities = 18/91 (19%), Positives = 39/91 (42%), Gaps = 4/91 (4%)

Query: 23  YEEALKL--YTHSVQYFLHAL--KYEITSAEGKQSIREKCDEYLARAEKLKEYLKKGKKQ 78
           ++EA+ +          L AL   YE   +    S  ++ +      +KL++Y KK  + 
Sbjct: 329 FKEAINIARNLDDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEV 388

Query: 79  PVKDGESRTKDDKKESDEDDSEDPDKKKMQA 109
             +   S   + K + ++   E+ +KK+ + 
Sbjct: 389 KDETDASEEAEAKAKEEKLKQEENEKKQKEQ 419


>gnl|CDD|224768 COG1855, COG1855, ATPase (PilT family) [General function prediction
           only].
          Length = 604

 Score = 32.3 bits (74), Expect = 0.34
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 159 KGILLFGPPGTGKSYLAKAVAT 180
           +GIL+ G PG GKS  A+A+A 
Sbjct: 264 EGILIAGAPGAGKSTFAQALAE 285


>gnl|CDD|233120 TIGR00764, lon_rel, lon-related putative ATP-dependent protease.
           This model represents a set of proteins with extensive
           C-terminal homology to the ATP-dependent protease La,
           product of the lon gene of E. coli. The model is based
           on a seed alignment containing only archaeal members,
           but several bacterial proteins match the model well.
           Because several species, including Thermotoga maritima
           and Treponema pallidum, contain both a close homolog of
           the lon protease and nearly full-length homolog of the
           members of this family, we suggest there may also be a
           functional division between the two families. Members of
           this family from Pyrococcus horikoshii and Pyrococcus
           abyssi each contain a predicted intein [Protein fate,
           Degradation of proteins, peptides, and glycopeptides].
          Length = 608

 Score = 32.5 bits (74), Expect = 0.35
 Identities = 20/53 (37%), Positives = 25/53 (47%), Gaps = 14/53 (26%)

Query: 127 VAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVA 179
           V G E A E +K+A                  + +LL G PG GKS LAKA+A
Sbjct: 20  VIGQEEAVEIIKKAA--------------KQKRNVLLIGEPGVGKSMLAKAMA 58


>gnl|CDD|232949 TIGR00382, clpX, endopeptidase Clp ATP-binding regulatory subunit
           (clpX).  A member of the ATP-dependent proteases, ClpX
           has ATP-dependent chaperone activity and is required for
           specific ATP-dependent proteolytic activities expressed
           by ClpPX. The gene is also found to be involved in
           stress tolerance in Bacillus subtilis and is essential
           for the efficient acquisition of genes specifying type
           IA and IB restriction [Protein fate, Protein folding and
           stabilization, Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 413

 Score = 32.0 bits (73), Expect = 0.39
 Identities = 25/76 (32%), Positives = 43/76 (56%), Gaps = 8/76 (10%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGES-EKLVKTLFELA----- 214
           ILL GP G+GK+ LA+ +A   N     + +++   + ++GE  E ++  L + A     
Sbjct: 119 ILLIGPTGSGKTLLAQTLARILNVPFAIADATTLTEAGYVGEDVENILLKLLQAADYDVE 178

Query: 215 RAHRPSIIFIDERIDR 230
           +A +  II+IDE ID+
Sbjct: 179 KAQK-GIIYIDE-IDK 192


>gnl|CDD|235878 PRK06851, PRK06851, hypothetical protein; Provisional.
          Length = 367

 Score = 31.9 bits (73), Expect = 0.39
 Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 3/37 (8%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTF---FSVSSSD 194
            +L G PGTGKS L K +  E     +   F   SSD
Sbjct: 33  FILKGGPGTGKSTLMKKIGEEFLEKGYDVEFLHCSSD 69


>gnl|CDD|224142 COG1221, PspF, Transcriptional regulators containing an AAA-type
           ATPase domain and a DNA-binding domain [Transcription /
           Signal transduction mechanisms].
          Length = 403

 Score = 32.0 bits (73), Expect = 0.41
 Identities = 20/83 (24%), Positives = 36/83 (43%), Gaps = 12/83 (14%)

Query: 161 ILLFGPPGTGKSYLAKAV---ATEANNSTFFSVSSSDLVSKWLGESEKL--VKTLFELAR 215
           +L+ G  GTGK   A+ +   +     + F + + +   S+ L E+E     K  F  A+
Sbjct: 104 VLIIGETGTGKELFARLIHALSARRAEAPFIAFNCAAY-SENLQEAELFGHEKGAFTGAQ 162

Query: 216 AHRPSIIFIDERIDRGISNALFL 238
             +  +    E+ + G    LFL
Sbjct: 163 GGKAGLF---EQANGGT---LFL 179


>gnl|CDD|239144 cd02681, MIT_calpain7_1, MIT: domain contained within Microtubule
          Interacting and Trafficking molecules. This sub-family
          of MIT domains is found in the nuclear thiol protease
          PalBH. The molecular function of the MIT domain is
          unclear.
          Length = 76

 Score = 29.5 bits (66), Expect = 0.42
 Identities = 17/63 (26%), Positives = 33/63 (52%)

Query: 14 ATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKLKEYLK 73
          A + D+   Y EA+  Y  + Q  ++A      +    ++I+EK +EYL RA+ L + ++
Sbjct: 13 AVQRDQEGRYSEAVFYYKEAAQLLIYAEMAGTLNDSHLKTIQEKSNEYLDRAQALHQLVQ 72

Query: 74 KGK 76
            +
Sbjct: 73 GQQ 75


>gnl|CDD|219014 pfam06414, Zeta_toxin, Zeta toxin.  This family consists of several
           bacterial zeta toxin proteins. Zeta toxin is thought to
           be part of a postregulational killing system in
           bacteria. It relies on antitoxin/toxin systems that
           secure stable inheritance of low and medium copy number
           plasmids during cell division and kill cells that have
           lost the plasmid.
          Length = 191

 Score = 31.5 bits (72), Expect = 0.45
 Identities = 19/86 (22%), Positives = 29/86 (33%), Gaps = 24/86 (27%)

Query: 159 KGILLFGPPGTGKSYLAKAVATEANNS-----------TFF----------SVSSSDLVS 197
             +LL G PG GK+ LA+A+  E               T+              +S+L  
Sbjct: 13  VAVLLGGQPGAGKTELARALLEELGGGNVVRIDPDELRTYHPDYDELQKADPKDASELTQ 72

Query: 198 KWLGESEKLVKTLFELARAHRPSIIF 223
                    V+ L + A     +II 
Sbjct: 73  PDASR---WVEKLIDYAIERGYNIIL 95


>gnl|CDD|224616 COG1702, PhoH, Phosphate starvation-inducible protein PhoH,
           predicted ATPase [Signal transduction mechanisms].
          Length = 348

 Score = 31.9 bits (73), Expect = 0.45
 Identities = 12/18 (66%), Positives = 14/18 (77%)

Query: 165 GPPGTGKSYLAKAVATEA 182
           GP GTGK+YLA A A +A
Sbjct: 150 GPAGTGKTYLAVAKAVDA 167


>gnl|CDD|218685 pfam05673, DUF815, Protein of unknown function (DUF815).  This
           family consists of several bacterial proteins of unknown
           function.
          Length = 248

 Score = 31.7 bits (73), Expect = 0.46
 Identities = 30/106 (28%), Positives = 39/106 (36%), Gaps = 18/106 (16%)

Query: 122 VKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATE 181
           V   D+ G++  KEAL           Q   G   P   +LL+G  GTGKS L KA+  E
Sbjct: 25  VDLDDLVGIDRQKEALLRNTE------QFLAGL--PANNVLLWGARGTGKSSLVKALLNE 76

Query: 182 ANNST--FFSVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFID 225
             +       V   DL          L   +  L       I+F D
Sbjct: 77  YADQGLRLIEVDKDDLGD--------LPDIVDLLRGRPYRFILFCD 114


>gnl|CDD|183521 PRK12422, PRK12422, chromosomal replication initiation protein;
           Provisional.
          Length = 445

 Score = 31.7 bits (72), Expect = 0.48
 Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 5/42 (11%)

Query: 143 LPIKFPQLFTG-----KRVPWKGILLFGPPGTGKSYLAKAVA 179
           LP +  Q FT      K  P+  I LFGP G+GK++L +A  
Sbjct: 121 LPHRILQEFTKVSEQGKGFPFNPIYLFGPEGSGKTHLMQAAV 162


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 31.8 bits (73), Expect = 0.50
 Identities = 29/128 (22%), Positives = 39/128 (30%), Gaps = 20/128 (15%)

Query: 34  VQYFLHALKYEI--TSAEGKQSIREKC----------DEYLARAEKLK---EYLKKGKKQ 78
           VQY+  A K EI     E K++   K            E  AR  + K   E      K 
Sbjct: 428 VQYYRQA-KAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKD 486

Query: 79  PVKDGESRTKDDKKESDE----DDSEDPDKKKMQANLEGAIVMEKPNVKWTDVAGLEAAK 134
            V    +R K  K  + +         PD   + A  E      +        A     K
Sbjct: 487 AVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQAAAAADPK 546

Query: 135 EALKEAVI 142
           +A   A I
Sbjct: 547 KAAVAAAI 554


>gnl|CDD|223666 COG0593, DnaA, ATPase involved in DNA replication initiation [DNA
           replication, recombination, and repair].
          Length = 408

 Score = 31.5 bits (72), Expect = 0.62
 Identities = 11/42 (26%), Positives = 22/42 (52%), Gaps = 4/42 (9%)

Query: 161 ILLFGPPGTGKSYLAKAVATEAN----NSTFFSVSSSDLVSK 198
           + ++G  G GK++L +A+  EA     N+    ++S D  + 
Sbjct: 116 LFIYGGVGLGKTHLLQAIGNEALANGPNARVVYLTSEDFTND 157


>gnl|CDD|239145 cd02682, MIT_AAA_Arch, MIT: domain contained within Microtubule
          Interacting and Trafficking molecules. This sub-family
          of MIT domains is found in mostly archaebacterial
          AAA-ATPases. The molecular function of the MIT domain
          is unclear.
          Length = 75

 Score = 29.0 bits (65), Expect = 0.65
 Identities = 16/60 (26%), Positives = 29/60 (48%), Gaps = 1/60 (1%)

Query: 11 AIKATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKLKE 70
          AI A +A+K  N E+A+  Y  +++     +K    S   +    +  +EY  R E L++
Sbjct: 10 AINAVKAEKEGNAEDAITNYKKAIEVLSQIVKNYPDS-PTRLIYEQMINEYKRRIEVLEK 68


>gnl|CDD|222194 pfam13521, AAA_28, AAA domain. 
          Length = 162

 Score = 30.6 bits (70), Expect = 0.65
 Identities = 9/19 (47%), Positives = 14/19 (73%)

Query: 161 ILLFGPPGTGKSYLAKAVA 179
           I++ G P TGK+ L +A+A
Sbjct: 2   IVITGGPSTGKTTLLEALA 20


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 30.4 bits (69), Expect = 0.71
 Identities = 12/56 (21%), Positives = 22/56 (39%), Gaps = 1/56 (1%)

Query: 60  EYLARAEKLKEYLKKGKK-QPVKDGESRTKDDKKESDEDDSEDPDKKKMQANLEGA 114
           EY  + +   +  K  KK    KD +   KDDK E  ++   +   + +  +    
Sbjct: 77  EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSET 132



 Score = 29.7 bits (67), Expect = 1.6
 Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 1/61 (1%)

Query: 53  SIREKCDEYLARAEKL-KEYLKKGKKQPVKDGESRTKDDKKESDEDDSEDPDKKKMQANL 111
             ++K  E     EK+ KEY +K K +  K    + KD  K+  +D  +D  +KK +   
Sbjct: 59  EAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118

Query: 112 E 112
           E
Sbjct: 119 E 119


>gnl|CDD|222163 pfam13479, AAA_24, AAA domain.  This AAA domain is found in a wide
           variety of presumed phage proteins.
          Length = 201

 Score = 30.7 bits (70), Expect = 0.72
 Identities = 11/20 (55%), Positives = 15/20 (75%), Gaps = 1/20 (5%)

Query: 157 PWKGILLFGPPGTGKSYLAK 176
             K +L++GPPG GK+ LAK
Sbjct: 3   NIK-VLIYGPPGIGKTSLAK 21


>gnl|CDD|213788 TIGR03263, guanyl_kin, guanylate kinase.  Members of this family
           are the enzyme guanylate kinase, also called GMP kinase.
           This enzyme transfers a phosphate from ATP to GMP,
           yielding ADP and GDP [Purines, pyrimidines, nucleosides,
           and nucleotides, Nucleotide and nucleoside
           interconversions].
          Length = 179

 Score = 30.5 bits (70), Expect = 0.74
 Identities = 14/32 (43%), Positives = 21/32 (65%), Gaps = 1/32 (3%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSS 192
           I++ GP G GKS L KA+  E + +  FS+S+
Sbjct: 3   IVISGPSGAGKSTLVKAL-LEEDPNLKFSISA 33


>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
          Length = 329

 Score = 31.0 bits (71), Expect = 0.74
 Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 159 KGILLFGPPGTGKSYLAKAVATE 181
           + +L +G  GTGK++L+  +A E
Sbjct: 184 ENLLFYGNTGTGKTFLSNCIAKE 206


>gnl|CDD|222000 pfam13238, AAA_18, AAA domain. 
          Length = 128

 Score = 29.7 bits (67), Expect = 0.84
 Identities = 17/52 (32%), Positives = 24/52 (46%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFE 212
           IL+ G PG+GK+ LAK +A    +        + LV +   E     K L E
Sbjct: 1   ILITGTPGSGKTTLAKELAERLGDVLRDLAKENGLVLELDEEITDESKRLDE 52


>gnl|CDD|129465 TIGR00368, TIGR00368, Mg chelatase-related protein.  The N-terminal
           end matches very strongly a pfam Mg_chelatase domain
           [Unknown function, General].
          Length = 499

 Score = 31.0 bits (70), Expect = 0.85
 Identities = 23/73 (31%), Positives = 31/73 (42%), Gaps = 19/73 (26%)

Query: 104 KKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILL 163
           K K   N    I ++       D+ G + AK AL+ A            G       +LL
Sbjct: 176 KPKSIINKSYIIDLD-----LKDIKGQQHAKRALEIAA----------AGGH----NLLL 216

Query: 164 FGPPGTGKSYLAK 176
           FGPPG+GK+ LA 
Sbjct: 217 FGPPGSGKTMLAS 229


>gnl|CDD|222165 pfam13481, AAA_25, AAA domain.  This AAA domain is found in a wide
           variety of presumed DNA repair proteins.
          Length = 154

 Score = 29.9 bits (68), Expect = 0.86
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 161 ILLFGPPGTGKSYLAKAVA 179
            LL G PGTGKS LA  +A
Sbjct: 36  TLLAGAPGTGKSTLALDLA 54


>gnl|CDD|236828 PRK11034, clpA, ATP-dependent Clp protease ATP-binding subunit;
           Provisional.
          Length = 758

 Score = 31.0 bits (70), Expect = 0.94
 Identities = 23/76 (30%), Positives = 36/76 (47%), Gaps = 11/76 (14%)

Query: 162 LLFGPPGTGKSYLAKAVATE---------ANNSTFFSVSSSDLV--SKWLGESEKLVKTL 210
           LL G  G GK+ +A+ +A             + T +S+    L+  +K+ G+ EK  K L
Sbjct: 211 LLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKAL 270

Query: 211 FELARAHRPSIIFIDE 226
            +       SI+FIDE
Sbjct: 271 LKQLEQDTNSILFIDE 286


>gnl|CDD|234763 PRK00440, rfc, replication factor C small subunit; Reviewed.
          Length = 319

 Score = 30.6 bits (70), Expect = 1.0
 Identities = 22/57 (38%), Positives = 27/57 (47%), Gaps = 14/57 (24%)

Query: 126 DVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLF-GPPGTGKSYLAKAVATE 181
           ++ G E   E LK  V            K +P    LLF GPPGTGK+  A A+A E
Sbjct: 18  EIVGQEEIVERLKSYV----------KEKNMPH---LLFAGPPGTGKTTAALALARE 61


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 31.2 bits (71), Expect = 1.0
 Identities = 25/89 (28%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 51  KQSIREKCDEYLARAEKLKEYLKKGKKQPVKDGESRTKDDKKESDEDDSEDPDKKKMQAN 110
            +SI ++ +    + E+L+E L++ +   ++D ESR  D KKE DE +++  + ++    
Sbjct: 849 IKSIEKEIENLNGKKEELEEELEELEAA-LRDLESRLGDLKKERDELEAQLRELERKIEE 907

Query: 111 LEGAIVMEKPNVKWTDVAGLEAAKEALKE 139
           LE  I  ++  +       LEA +E L E
Sbjct: 908 LEAQIEKKRKRLSELKAK-LEALEEELSE 935


>gnl|CDD|233369 TIGR01351, adk, adenylate kinase.  Adenylate kinase (EC 2.7.4.3)
           converts ATP + AMP to ADP + ADP, that is, uses ATP as a
           phosphate donor for AMP. Most members of this family are
           known or believed to be adenylate kinase. However, some
           members accept other nucleotide triphosphates as donors,
           may be unable to use ATP, and may fail to complement
           adenylate kinase mutants. An example of a
           nucleoside-triphosphate--adenylate kinase (EC 2.7.4.10)
           is a GTP:AMP phosphotransferase. This family is
           designated subfamily rather than equivalog for this
           reason [Purines, pyrimidines, nucleosides, and
           nucleotides, Nucleotide and nucleoside
           interconversions].
          Length = 210

 Score = 30.3 bits (69), Expect = 1.0
 Identities = 13/35 (37%), Positives = 20/35 (57%), Gaps = 3/35 (8%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDL 195
           ++L GPPG+GK   AK +A +        +S+ DL
Sbjct: 2   LVLLGPPGSGKGTQAKRIAEKYG---LPHISTGDL 33


>gnl|CDD|233959 TIGR02639, ClpA, ATP-dependent Clp protease ATP-binding subunit
           clpA.  [Protein fate, Degradation of proteins, peptides,
           and glycopeptides].
          Length = 730

 Score = 30.8 bits (70), Expect = 1.1
 Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 11/76 (14%)

Query: 162 LLFGPPGTGKSYLA---------KAVATEANNSTFFSVSSSDLV--SKWLGESEKLVKTL 210
           LL G PG GK+ +            V     N+  +S+    L+  +K+ G+ E+ +K +
Sbjct: 206 LLVGEPGVGKTAIVEGLALRIAEGKVPERLKNAKIYSLDMGTLLAGTKYRGDFEERLKAV 265

Query: 211 FELARAHRPSIIFIDE 226
                    +I+FIDE
Sbjct: 266 VSEIEKEPNAILFIDE 281


>gnl|CDD|201353 pfam00625, Guanylate_kin, Guanylate kinase. 
          Length = 183

 Score = 30.0 bits (68), Expect = 1.1
 Identities = 14/33 (42%), Positives = 20/33 (60%)

Query: 159 KGILLFGPPGTGKSYLAKAVATEANNSTFFSVS 191
           + I+L GP G GKS++ KA+  E      +SVS
Sbjct: 3   RPIVLSGPSGVGKSHIKKALLDEYPEKFGYSVS 35


>gnl|CDD|238540 cd01120, RecA-like_NTPases, RecA-like NTPases. This family includes
           the NTP binding domain of F1 and V1 H+ATPases, DnaB and
           related helicases as well as bacterial RecA and related
           eukaryotic and archaeal recombinases. This group also
           includes bacterial conjugation proteins and related DNA
           transfer proteins involved in type II and type IV
           secretion.
          Length = 165

 Score = 29.8 bits (67), Expect = 1.3
 Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 8/69 (11%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTF------FSVSSSDLVSKWLGESEK--LVKTLFE 212
           IL+FGP G+GK+ LA  +A                    +L  + +GES K  L   +  
Sbjct: 2   ILVFGPTGSGKTTLALQLALNIATKGGKVVYVDIEEEIEELTERLIGESLKGALDNLIIV 61

Query: 213 LARAHRPSI 221
            A A  P+ 
Sbjct: 62  FATADDPAA 70


>gnl|CDD|216506 pfam01443, Viral_helicase1, Viral (Superfamily 1) RNA helicase.
           Helicase activity for this family has been demonstrated
           and NTPase activity. This helicase has multiple roles at
           different stages of viral RNA replication, as dissected
           by mutational analysis.
          Length = 226

 Score = 30.0 bits (68), Expect = 1.3
 Identities = 15/70 (21%), Positives = 27/70 (38%), Gaps = 7/70 (10%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKL----VKTLFELARA 216
           I++ G PG GKS L + +             +++L ++   +   L    V T       
Sbjct: 1   IVVHGVPGCGKSTLIRKLLRTDLTV---IRPTAELRTEGKPDLPNLNVRTVDTFLMALLK 57

Query: 217 HRPSIIFIDE 226
               I+ +DE
Sbjct: 58  PTGKILILDE 67


>gnl|CDD|214361 CHL00095, clpC, Clp protease ATP binding subunit.
          Length = 821

 Score = 30.8 bits (70), Expect = 1.3
 Identities = 48/193 (24%), Positives = 77/193 (39%), Gaps = 54/193 (27%)

Query: 161 ILLFGPPGTGKSYLAKAVA---------TEANNSTFFSVSSSDLV--SKWLGESEKLVKT 209
            +L G PG GK+ +A+ +A             +    ++    L+  +K+ GE E+ +K 
Sbjct: 203 PILIGEPGVGKTAIAEGLAQRIVNRDVPDILEDKLVITLDIGLLLAGTKYRGEFEERLKR 262

Query: 210 LFELARAHRPSIIFIDERIDRGISNALFLLLSGVGN-----DMDGIL----------VLG 254
           +F+  + +   I+ IDE             L G G      D   IL           +G
Sbjct: 263 IFDEIQENNNIILVIDE----------VHTLIGAGAAEGAIDAANILKPALARGELQCIG 312

Query: 255 ATNIPWV-----LDAAIRRRFEKRIYIALP---ELNARLFMFR-----HHLGNTRNTISD 301
           AT +         D A+ RRF+  +Y+  P   E    LF  R     HH      +ISD
Sbjct: 313 ATTLDEYRKHIEKDPALERRFQP-VYVGEPSVEETIEILFGLRSRYEKHH----NLSISD 367

Query: 302 ENLKELAKRTEGY 314
           + L+  AK ++ Y
Sbjct: 368 KALEAAAKLSDQY 380


>gnl|CDD|187579 cd05271, NDUFA9_like_SDR_a, NADH dehydrogenase (ubiquinone) 1 alpha
           subcomplex, subunit 9, 39 kDa, (NDUFA9) -like, atypical
           (a) SDRs.  This subgroup of extended SDR-like proteins
           are atypical SDRs. They have a glycine-rich
           NAD(P)-binding motif similar to the typical SDRs,
           GXXGXXG, and have the YXXXK active site motif (though
           not the other residues of the SDR tetrad). Members
           identified include NDUFA9 (mitochondrial) and putative
           nucleoside-diphosphate-sugar epimerase. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Atypical SDRs include
           biliverdin IX beta reductase (BVR-B,aka flavin
           reductase), NMRa (a negative transcriptional regulator
           of various fungi), progesterone 5-beta-reductase like
           proteins, phenylcoumaran benzylic ether and
           pinoresinol-lariciresinol reductases, phenylpropene
           synthases, eugenol synthase, triphenylmethane reductase,
           isoflavone reductases, and others. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. In addition to the
           Rossmann fold core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids, and typically have a
           TGXXGXXG cofactor binding motif. Complex (multidomain)
           SDRs such as ketoreductase domains of fatty acid
           synthase have a GGXGXXG NAD(P)-binding motif and an
           altered active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 273

 Score = 30.3 bits (69), Expect = 1.4
 Identities = 23/69 (33%), Positives = 30/69 (43%), Gaps = 14/69 (20%)

Query: 174 LAKAVATEANNSTFFSVS---------SSDLVSKWLGESEKLVKTLFELARAHRPSIIFI 224
           LAKA   EA       +S         S  L SK   E E+ V+  F  A   RPS++F 
Sbjct: 98  LAKAAK-EAGVERLIHISALGADANSPSKYLRSK--AEGEEAVREAFPEATIVRPSVVFG 154

Query: 225 DERIDRGIS 233
            E  DR ++
Sbjct: 155 RE--DRFLN 161


>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
          Length = 613

 Score = 30.2 bits (69), Expect = 1.4
 Identities = 10/23 (43%), Positives = 15/23 (65%), Gaps = 1/23 (4%)

Query: 154 KRVPWKG-ILLFGPPGTGKSYLA 175
           K +P K  ++++GPP TGKS   
Sbjct: 429 KGIPKKNCLVIYGPPDTGKSMFC 451


>gnl|CDD|183114 PRK11388, PRK11388, DNA-binding transcriptional regulator DhaR;
           Provisional.
          Length = 638

 Score = 30.4 bits (69), Expect = 1.4
 Identities = 47/203 (23%), Positives = 80/203 (39%), Gaps = 56/203 (27%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNST--FFSV-----SSSDLVSKWLG----ESEKLVKT 209
           +LL G  G GK+ LA+A+  E+  +   + +V         L  ++LG    +SE    +
Sbjct: 351 VLLCGEEGVGKALLAQAIHNESERAAGPYIAVNCQLYPDEALAEEFLGSDRTDSENGRLS 410

Query: 210 LFELARAHRPSIIFID--ERIDRGISNALFLLL-SGVGNDMDG-------ILVLGATNIP 259
            FELA       +F++  E +   + +AL  +L +GV   +D        + V+  T   
Sbjct: 411 KFELAHG---GTLFLEKVEYLSPELQSALLQVLKTGVITRLDSRRLIPVDVRVIATTTAD 467

Query: 260 WVLDAAI-RRRFEKRIY-------IALPELNARL----FMFRHHLG------NTRNTISD 301
             L   + + RF +++Y       I +P L  R      +  + L       +TR  I D
Sbjct: 468 --LAMLVEQNRFSRQLYYALHAFEITIPPLRMRREDIPALVNNKLRSLEKRFSTRLKIDD 525

Query: 302 E------------NLKELAKRTE 312
           +            N  EL    E
Sbjct: 526 DALARLVSYRWPGNDFELRSVIE 548


>gnl|CDD|236442 PRK09270, PRK09270, nucleoside triphosphate hydrolase
           domain-containing protein; Reviewed.
          Length = 229

 Score = 29.9 bits (68), Expect = 1.6
 Identities = 14/56 (25%), Positives = 20/56 (35%), Gaps = 12/56 (21%)

Query: 131 EAAKEALKEAVILPIKFPQLFTGKRVPWKGILLF--GPPGTGKSYLAKAVATEANN 184
              K  L+    L     Q    +R      ++   GPPG GKS LA+ +      
Sbjct: 14  AVHKPLLRRLAAL-----QAEPQRR-----TIVGIAGPPGAGKSTLAEFLEALLQQ 59


>gnl|CDD|148114 pfam06309, Torsin, Torsin.  This family consists of several
           eukaryotic torsin proteins. Torsion dystonia is an
           autosomal dominant movement disorder characterized by
           involuntary, repetitive muscle contractions and twisted
           postures. The most severe early-onset form of dystonia
           has been linked to mutations in the human DYT1 (TOR1A)
           gene encoding a protein termed torsinA. While causative
           genetic alterations have been identified, the function
           of torsin proteins and the molecular mechanism
           underlying dystonia remain unknown. Phylogenetic
           analysis of the torsin protein family indicates these
           proteins share distant sequence similarity with the
           large and diverse family of (pfam00004) proteins. It has
           been suggested that torsins play a role in effectively
           managing protein folding and that possible breakdown in
           a neuroprotective mechanism that is, in part, mediated
           by torsins may be responsible for the neuronal
           dysfunction associated with dystonia.
          Length = 127

 Score = 28.9 bits (65), Expect = 1.6
 Identities = 9/37 (24%), Positives = 19/37 (51%), Gaps = 4/37 (10%)

Query: 165 GPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLG 201
           G  GTGK+++A+ +A    ++ +     S  V  ++ 
Sbjct: 60  GWTGTGKNFVAEIIA----DNLYRDGLRSPYVHHFVA 92


>gnl|CDD|131948 TIGR02902, spore_lonB, ATP-dependent protease LonB.  Members of
           this protein are LonB, a paralog of the ATP-dependent
           protease La (LonA, TIGR00763). LonB proteins are found
           strictly, and almost universally, in endospore-forming
           bacteria. This protease was shown, in Bacillus subtilis,
           to be expressed specifically in the forespore, during
           sporulation, under control of sigma(F). The lonB gene,
           despite location immediately upstream of lonA, was shown
           to be monocistronic. LonB appears able to act on
           sigma(H) for post-translation control, but lonB mutation
           did not produce an obvious sporulation defect under the
           conditions tested. Note that additional paralogs of LonA
           and LonB occur in the Clostridium lineage and this model
           selects only one per species as the protein that
           corresponds to LonB in B. subtilis [Protein fate,
           Degradation of proteins, peptides, and glycopeptides,
           Cellular processes, Sporulation and germination].
          Length = 531

 Score = 30.1 bits (68), Expect = 1.6
 Identities = 19/62 (30%), Positives = 30/62 (48%), Gaps = 12/62 (19%)

Query: 126 DVAGLEAAKEALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEANNS 185
           ++ G E   +ALK A+  P   PQ           ++++GPPG GK+  A+ V  EA  +
Sbjct: 66  EIIGQEEGIKALKAALCGP--NPQ----------HVIIYGPPGVGKTAAARLVLEEAKKN 113

Query: 186 TF 187
             
Sbjct: 114 PA 115


>gnl|CDD|172992 PRK14526, PRK14526, adenylate kinase; Provisional.
          Length = 211

 Score = 29.8 bits (67), Expect = 1.6
 Identities = 13/36 (36%), Positives = 23/36 (63%), Gaps = 3/36 (8%)

Query: 160 GILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDL 195
            ++  GPPG+GK  +AK ++ E N   ++ +S+ DL
Sbjct: 2   KLVFLGPPGSGKGTIAKILSNELN---YYHISTGDL 34


>gnl|CDD|215651 pfam00006, ATP-synt_ab, ATP synthase alpha/beta family,
           nucleotide-binding domain.  This family includes the ATP
           synthase alpha and beta subunits, the ATP synthase
           associated with flagella and the termination factor Rho.
          Length = 213

 Score = 29.7 bits (68), Expect = 1.7
 Identities = 9/19 (47%), Positives = 12/19 (63%)

Query: 161 ILLFGPPGTGKSYLAKAVA 179
           I +FG  GTGK+ L   +A
Sbjct: 18  IGIFGGSGTGKTVLLGMIA 36


>gnl|CDD|214631 smart00350, MCM, minichromosome maintenance proteins. 
          Length = 509

 Score = 29.9 bits (68), Expect = 2.0
 Identities = 21/53 (39%), Positives = 26/53 (49%), Gaps = 14/53 (26%)

Query: 135 EALKEAVILPIKFPQLFTG--KRVPWKG-------ILLFGPPGTGKSYLAKAV 178
           E +K+A++L      LF G  K +P          ILL G PGT KS L K V
Sbjct: 209 EDIKKAILL-----LLFGGVHKNLPDGMKIRGDINILLCGDPGTAKSQLLKYV 256


>gnl|CDD|238979 cd02021, GntK, Gluconate kinase (GntK) catalyzes the phosphoryl
           transfer from ATP to gluconate. The resulting product
           gluconate-6-phoshate is an important precursor of
           gluconate metabolism. GntK acts as a dimmer composed of
           two identical subunits.
          Length = 150

 Score = 29.1 bits (66), Expect = 2.0
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 161 ILLFGPPGTGKSYLAKAVATEAN 183
           I++ G  G+GKS + KA+A    
Sbjct: 2   IVVMGVSGSGKSTVGKALAERLG 24


>gnl|CDD|165222 PHA02896, PHA02896, A-type inclusion like protein; Provisional.
          Length = 616

 Score = 30.0 bits (67), Expect = 2.0
 Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 10/69 (14%)

Query: 52  QSIREKCDEYLARAEKLKEY--------LKKGKKQPVKDGESRTKDDKKESDEDDSEDPD 103
           +SIR + + YL + E    Y         +K  ++P  D +    +DK +SD+D   D D
Sbjct: 498 KSIRNRRNGYLLKDEDRYYYDRGHRNLGAEKDPEKPHPDKDEEYPEDKDKSDDDRLNDAD 557

Query: 104 K--KKMQAN 110
           K  +K++A+
Sbjct: 558 KAMRKLEAD 566


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 30.1 bits (67), Expect = 2.1
 Identities = 25/100 (25%), Positives = 43/100 (43%), Gaps = 4/100 (4%)

Query: 13   KATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKLK--E 70
             A EA+  +   EA +      +    A K +    +     ++K +E   +A++LK   
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414

Query: 71   YLKKGKKQPVKDGESRTKDD--KKESDEDDSEDPDKKKMQ 108
              KK   +  K  E + K D  KK+++E    D  KKK +
Sbjct: 1415 AAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAE 1454



 Score = 29.3 bits (65), Expect = 3.3
 Identities = 24/95 (25%), Positives = 51/95 (53%), Gaps = 1/95 (1%)

Query: 14   ATEADKNKNYEEALKLYTHSVQYFLHALKYEITSAEGKQSIREKCDEYLARAEKLKEYLK 73
            A +A+++K   E  K      +    ALK E   A+  + +++K  E   +AE+LK+  +
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKA-E 1725

Query: 74   KGKKQPVKDGESRTKDDKKESDEDDSEDPDKKKMQ 108
            +  K   ++ +   ++DKK+++E   ++ +KKK+ 
Sbjct: 1726 EENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIA 1760


>gnl|CDD|226997 COG4650, RtcR, Sigma54-dependent transcription regulator containing
           an AAA-type ATPase domain and a DNA-binding domain
           [Transcription / Signal transduction mechanisms].
          Length = 531

 Score = 29.9 bits (67), Expect = 2.3
 Identities = 11/18 (61%), Positives = 14/18 (77%)

Query: 161 ILLFGPPGTGKSYLAKAV 178
           ILL GP G GKS+LA+ +
Sbjct: 211 ILLNGPTGAGKSFLARRI 228


>gnl|CDD|180215 PRK05707, PRK05707, DNA polymerase III subunit delta'; Validated.
          Length = 328

 Score = 29.7 bits (67), Expect = 2.3
 Identities = 14/31 (45%), Positives = 16/31 (51%), Gaps = 1/31 (3%)

Query: 149 QLFTGKRVPWKGILLFGPPGTGKSYLAKAVA 179
           QL    R P    LL GP G GK  LA+ +A
Sbjct: 14  QLAGRGRHP-HAYLLHGPAGIGKRALAERLA 43


>gnl|CDD|116340 pfam07726, AAA_3, ATPase family associated with various cellular
           activities (AAA).  This Pfam entry includes some of the
           AAA proteins not detected by the pfam00004 model.
          Length = 131

 Score = 28.7 bits (65), Expect = 2.4
 Identities = 9/19 (47%), Positives = 13/19 (68%)

Query: 161 ILLFGPPGTGKSYLAKAVA 179
           +LL G PG  K+ LA+ +A
Sbjct: 2   VLLEGVPGLAKTLLARTLA 20


>gnl|CDD|223272 COG0194, Gmk, Guanylate kinase [Nucleotide transport and
           metabolism].
          Length = 191

 Score = 29.0 bits (66), Expect = 2.4
 Identities = 15/32 (46%), Positives = 20/32 (62%), Gaps = 2/32 (6%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSS 192
           I+L GP G GKS L KA+    ++   FSVS+
Sbjct: 7   IVLSGPSGVGKSTLVKAL--LEDDKLRFSVSA 36


>gnl|CDD|184150 PRK13569, PRK13569, anthranilate synthase component I; Provisional.
          Length = 506

 Score = 29.6 bits (67), Expect = 2.5
 Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 11/71 (15%)

Query: 15  TEADKNKNYEEA---LKLYTHS-------VQYFLHALKYEITSAEGKQSIREKCDEYLAR 64
           TE +K + Y+EA   ++             +  L A      S EG  S  EK +++L  
Sbjct: 178 TEEEKIEKYKEAQAEIETLIEKLARRKAEKELLLPADSERTVSFEGVTSNYEK-EQFLRD 236

Query: 65  AEKLKEYLKKG 75
            EK+KEY+K G
Sbjct: 237 VEKIKEYIKAG 247


>gnl|CDD|233443 TIGR01509, HAD-SF-IA-v3, haloacid dehalogenase superfamily,
           subfamily IA, variant 3 with third motif having DD or
           ED.  This model represents part of one structural
           subfamily of the Haloacid Dehalogenase (HAD) superfamily
           of aspartate-nucleophile hydrolases. The superfamily is
           defined by the presence of three short catalytic motifs.
           The subfamilies are defined based on the location and
           the observed or predicted fold of a so-called "capping
           domain", or the absence of such a domain. Subfamily I
           consists of sequences in which the capping domain is
           found in between the first and second catalytic motifs.
           Subfamily II consists of sequences in which the capping
           domain is found between the second and third motifs.
           Subfamily III sequences have no capping domain in either
           of these positions.The Subfamily IA and IB capping
           domains are predicted by PSI-PRED to consist of an alpha
           helical bundle. Subfamily I encompasses such a wide
           region of sequence space (the sequences are highly
           divergent) that representing it with a single model is
           impossible, resulting in an overly broad description
           which allows in many unrelated sequences. Subfamily IA
           and IB are separated based on an aparrent phylogenetic
           bifurcation. Subfamily IA is still too broad to model,
           but cannot be further subdivided into large chunks based
           on phylogenetic trees. Of the three motifs defining the
           HAD superfamily, the third has three variant forms : (1)
           hhhhsDxxx(x)D, (2) hhhhssxxx(x)D and (3) hhhhDDxxx(x)s
           where _s_ refers to a small amino acid and _h_ to a
           hydrophobic one. All three of these variants are found
           in subfamily IA. Individual models were made based on
           seeds exhibiting only one of the variants each. Variant
           3 (this model) is found in the enzymes
           beta-phosphoglucomutase (TIGR01990) and
           deoxyglucose-6-phosphatase, while many other enzymes of
           subfamily IA exhibit this variant as well as variant 1
           (TIGR01549). These three variant models were created
           withthe knowledge that there will be overlap among them
           - this is by design and serves the purpose of
           eliminating the overlap with models of more distantly
           relatedHAD subfamilies caused by an overly broad single
           model [Unknown function, Enzymes of unknown
           specificity].
          Length = 177

 Score = 28.9 bits (65), Expect = 2.6
 Identities = 17/68 (25%), Positives = 30/68 (44%), Gaps = 8/68 (11%)

Query: 246 DMDGILVLGATNIPWVLDAAIRRRFEKRIYI---ALPELNARLFMFRHHLGNTRNTISDE 302
           D+DG+LV  A  I  ++D A        + +      EL  R +  ++       T+S E
Sbjct: 5   DLDGVLVDTAHAIAKLVDRAEFPLVPDELGVSEVGSLELALRRWKAKYGR-----TMSAE 59

Query: 303 NLKELAKR 310
           + + L K+
Sbjct: 60  DAQLLYKQ 67


>gnl|CDD|223718 COG0645, COG0645, Predicted kinase [General function prediction
           only].
          Length = 170

 Score = 28.9 bits (65), Expect = 2.6
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 159 KGILLFGPPGTGKSYLAKAVATE 181
           + +L+ G PG+GKS LA+ +A  
Sbjct: 2   RLVLVGGLPGSGKSTLARGLAEL 24


>gnl|CDD|218292 pfam04851, ResIII, Type III restriction enzyme, res subunit. 
          Length = 100

 Score = 28.0 bits (63), Expect = 2.8
 Identities = 12/46 (26%), Positives = 18/46 (39%), Gaps = 3/46 (6%)

Query: 158 WKGILLFGPPGTGKSYLAKAVATEANN---STFFSVSSSDLVSKWL 200
            K  L+    G+GK+  A A+            F V   DL+ + L
Sbjct: 18  KKRGLIVMATGSGKTLTAAALIARLAKGKKKVLFVVPRKDLLEQAL 63


>gnl|CDD|223401 COG0324, MiaA, tRNA delta(2)-isopentenylpyrophosphate transferase
           [Translation, ribosomal structure and biogenesis].
          Length = 308

 Score = 29.1 bits (66), Expect = 2.9
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 161 ILLFGPPGTGKSYLAKAVATEAN 183
           I++ GP  +GK+ LA A+A    
Sbjct: 6   IVIAGPTASGKTALAIALAKRLG 28


>gnl|CDD|238713 cd01428, ADK, Adenylate kinase (ADK) catalyzes the reversible
           phosphoryl transfer from adenosine triphosphates (ATP)
           to adenosine monophosphates (AMP) and to yield adenosine
           diphosphates (ADP). This enzyme is required for the
           biosynthesis of ADP and is essential for homeostasis of
           adenosine phosphates.
          Length = 194

 Score = 28.7 bits (65), Expect = 2.9
 Identities = 14/36 (38%), Positives = 21/36 (58%), Gaps = 3/36 (8%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLV 196
           ILL GPPG+GK   A+ +A +        +S+ DL+
Sbjct: 2   ILLLGPPGSGKGTQAERLAKKYG---LPHISTGDLL 34


>gnl|CDD|215967 pfam00519, PPV_E1_C, Papillomavirus helicase.  This protein is a
           DNA helicase that is required for initiation of viral
           DNA replication. This protein forms a complex with the
           E2 protein pfam00508.
          Length = 432

 Score = 29.4 bits (67), Expect = 3.1
 Identities = 7/15 (46%), Positives = 11/15 (73%)

Query: 161 ILLFGPPGTGKSYLA 175
           ++++GPP TGKS   
Sbjct: 265 LVIYGPPDTGKSMFC 279


>gnl|CDD|215758 pfam00158, Sigma54_activat, Sigma-54 interaction domain. 
          Length = 168

 Score = 28.5 bits (65), Expect = 3.4
 Identities = 8/18 (44%), Positives = 12/18 (66%)

Query: 161 ILLFGPPGTGKSYLAKAV 178
           +L+ G  GTGK   A+A+
Sbjct: 25  VLITGESGTGKELFARAI 42


>gnl|CDD|237464 PRK13670, PRK13670, hypothetical protein; Provisional.
          Length = 388

 Score = 29.0 bits (66), Expect = 3.5
 Identities = 22/83 (26%), Positives = 36/83 (43%), Gaps = 17/83 (20%)

Query: 7   GINIAIKATEADKNKNYEEALKL-----YTHS------VQYFLHALKYEITSAEGKQSIR 55
           G+   IK     K  +++E ++L     YT++          L+A K ++        IR
Sbjct: 268 GLENRIKK-AIKKASSFDEFVELVKTKRYTYTRIQRLLTYILLNATKEDVEKTSLPPYIR 326

Query: 56  EKCDEYLARAEKLKEYLKKGKKQ 78
                 L   EK ++YLKK KK+
Sbjct: 327 -----VLGFTEKGQQYLKKIKKK 344


>gnl|CDD|234063 TIGR02928, TIGR02928, orc1/cdc6 family replication initiation
           protein.  Members of this protein family are found
           exclusively in the archaea. This set of DNA binding
           proteins shows homology to the origin recognition
           complex subunit 1/cell division control protein 6 family
           in eukaryotes. Several members may be found in genome
           and interact with each other [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 365

 Score = 29.1 bits (66), Expect = 3.6
 Identities = 8/26 (30%), Positives = 14/26 (53%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNST 186
           + ++G  GTGK+ + K V  E   + 
Sbjct: 43  VFIYGKTGTGKTAVTKYVMKELEEAA 68


>gnl|CDD|182120 PRK09862, PRK09862, putative ATP-dependent protease; Provisional.
          Length = 506

 Score = 29.2 bits (65), Expect = 3.7
 Identities = 11/15 (73%), Positives = 13/15 (86%)

Query: 161 ILLFGPPGTGKSYLA 175
           +LL GPPGTGK+ LA
Sbjct: 213 LLLIGPPGTGKTMLA 227


>gnl|CDD|215300 PLN02545, PLN02545, 3-hydroxybutyryl-CoA dehydrogenase.
          Length = 295

 Score = 28.9 bits (65), Expect = 3.8
 Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 6/59 (10%)

Query: 171 KSYLAKAVATEA-----NNSTFFSVSSSDLVSKWLGESEKLVKTLF-ELARAHRPSIIF 223
           K  +++  A          +    +  +D + + + ESE L K LF EL R  +PS I 
Sbjct: 57  KGKMSQEEADATLGRIRCTTNLEELRDADFIIEAIVESEDLKKKLFSELDRICKPSAIL 115


>gnl|CDD|183088 PRK11331, PRK11331, 5-methylcytosine-specific restriction enzyme
           subunit McrB; Provisional.
          Length = 459

 Score = 28.9 bits (64), Expect = 3.8
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 159 KGILLFGPPGTGKSYLAKAVA 179
           K I+L GPPG GK+++A+ +A
Sbjct: 195 KNIILQGPPGVGKTFVARRLA 215


>gnl|CDD|202726 pfam03691, UPF0167, Uncharacterized protein family (UPF0167).  The
           proteins in this family are about 200 amino acids long
           and each contain 3 CXXC motifs.
          Length = 177

 Score = 28.5 bits (64), Expect = 3.9
 Identities = 8/19 (42%), Positives = 13/19 (68%)

Query: 298 TISDENLKELAKRTEGYSG 316
            + +E +KE+ +RT GYS 
Sbjct: 79  GVDEELVKEVLERTPGYSS 97


>gnl|CDD|219468 pfam07565, Band_3_cyto, Band 3 cytoplasmic domain.  This family
           contains the cytoplasmic domain of the Band 3 anion
           exchange proteins that exchange Cl-/HCO3-. Band 3
           constitutes the most abundant polypeptide in the red
           blood cell membrane, comprising 25% of the total
           membrane protein. The cytoplasmic domain of band 3
           functions primarily as an anchoring site for other
           membrane-associated proteins. Included among the protein
           ligands of cdb3 are ankyrin, protein 4.2, protein 4.1,
           glyceraldehyde-3-phosphate dehydrogenase (GAPDH),
           phosphofructokinase, aldolase, hemoglobin, hemichromes,
           and the protein tyrosine kinase (p72syk).
          Length = 250

 Score = 28.7 bits (65), Expect = 4.0
 Identities = 23/107 (21%), Positives = 40/107 (37%), Gaps = 29/107 (27%)

Query: 86  RTKDDKKESDEDDSEDPDKKKMQANLEGAIV-------MEKPN---VKWTDVAGLEAAKE 135
                              +K+    E + V       +E+P    V+ ++   LE+  E
Sbjct: 119 LNSQSGSLPSMSKFNSTLMRKIPPGAEASNVLVGEVDFLEQPVLAFVRLSEAVVLESVLE 178

Query: 136 ALKEAVILPIKFPQLFTGKRVPWKGILLFGPPGTGKSY--LAKAVAT 180
                V +P++F  LF          +L GP G+G  Y  + +A+AT
Sbjct: 179 -----VPVPVRF--LF----------VLLGPSGSGLDYHEIGRAIAT 208


>gnl|CDD|221803 pfam12846, AAA_10, AAA-like domain.  This family of domains contain
           a P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins.
          Length = 316

 Score = 28.9 bits (65), Expect = 4.1
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 161 ILLFGPPGTGKSYLAKAVA 179
           +L+ GP G+GKS L K +A
Sbjct: 4   MLIVGPSGSGKSTLLKLLA 22


>gnl|CDD|206680 cd01893, Miro1, Mitochondrial Rho family 1 (Miro1), N-terminal.
           Miro1 subfamily. Miro (mitochondrial Rho) proteins have
           tandem GTP-binding domains separated by a linker region
           containing putative calcium-binding EF hand motifs.
           Genes encoding Miro-like proteins were found in several
           eukaryotic organisms. This CD represents the N-terminal
           GTPase domain of Miro proteins. These atypical Rho
           GTPases have roles in mitochondrial homeostasis and
           apoptosis. Most Rho proteins contain a lipid
           modification site at the C-terminus; however, Miro is
           one of few Rho subfamilies that lack this feature.
          Length = 168

 Score = 28.1 bits (63), Expect = 4.4
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 161 ILLFGPPGTGKSYLAKAVATEAN 183
           I+L G  G GKS L  ++ +E  
Sbjct: 5   IVLIGDEGVGKSSLIMSLVSEEF 27


>gnl|CDD|218719 pfam05729, NACHT, NACHT domain.  This NTPase domain is found in
           apoptosis proteins as well as those involved in MHC
           transcription activation. This family is closely related
           to pfam00931.
          Length = 165

 Score = 28.0 bits (63), Expect = 4.8
 Identities = 16/88 (18%), Positives = 27/88 (30%), Gaps = 14/88 (15%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNST---------FFSVSSSDLVSKWLGESEKLVKTLF 211
           ++L G  G+GK+ L + +A                F          +    ++ L     
Sbjct: 3   VILQGEAGSGKTTLLQKIALLWAQGKLPQDFDFVFFLPCRELSRSGE-ASLADLLFSQWP 61

Query: 212 ELARAHRPSIIFIDERIDRGISNALFLL 239
           E A         I E  +R     L +L
Sbjct: 62  EPAAPVSEVWAVILELPER----LLLIL 85


>gnl|CDD|235825 PRK06547, PRK06547, hypothetical protein; Provisional.
          Length = 172

 Score = 28.2 bits (63), Expect = 4.8
 Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 6/61 (9%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLG---ESEKLVKTLFELARAH 217
           +L+ G  G+GK+ LA A+A       F  V   DL   W G    SE + + + +  R  
Sbjct: 18  VLIDGRSGSGKTTLAGALAART---GFQLVHLDDLYPGWHGLAAASEHVAEAVLDEGRPG 74

Query: 218 R 218
           R
Sbjct: 75  R 75


>gnl|CDD|222258 pfam13604, AAA_30, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. There is a Walker A and
           Walker B.
          Length = 195

 Score = 28.3 bits (64), Expect = 4.8
 Identities = 8/19 (42%), Positives = 11/19 (57%)

Query: 161 ILLFGPPGTGKSYLAKAVA 179
            ++ GP GTGK+   KA  
Sbjct: 21  AVVQGPAGTGKTTSLKAAR 39


>gnl|CDD|213965 TIGR04352, HprK_rel_A, HprK-related kinase A.  A number of protein
           families resemble HPr kinase (see TIGR00679) but do not
           belong to that system. They include this family, which
           appears instead to be the marker for a different type of
           gene neighborhood, in which one of the conserved
           neighboring proteins resembles (but is distinct from)
           PqqD.
          Length = 280

 Score = 28.4 bits (64), Expect = 4.9
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 159 KGILLFGPPGTGKSYLAKAVA 179
           + ++L  PPG+GKS L  A+A
Sbjct: 109 RALILPAPPGSGKSTLCAALA 129


>gnl|CDD|238978 cd02020, CMPK, Cytidine monophosphate kinase (CMPK) catalyzes the
           reversible phosphorylation of cytidine monophosphate
           (CMP) to produce cytidine diphosphate (CDP), using ATP
           as the preferred phosphoryl donor.
          Length = 147

 Score = 27.8 bits (63), Expect = 5.0
 Identities = 9/15 (60%), Positives = 12/15 (80%)

Query: 165 GPPGTGKSYLAKAVA 179
           GP G+GKS +AK +A
Sbjct: 6   GPAGSGKSTVAKLLA 20


>gnl|CDD|214432 MTH00144, ND2, NADH dehydrogenase subunit 2; Provisional.
          Length = 328

 Score = 28.3 bits (64), Expect = 5.0
 Identities = 7/17 (41%), Positives = 10/17 (58%)

Query: 185 STFFSVSSSDLVSKWLG 201
            T  S+SSS  +  W+G
Sbjct: 16  GTLLSLSSSHWLGVWVG 32


>gnl|CDD|224292 COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General
           function prediction only].
          Length = 398

 Score = 28.5 bits (64), Expect = 5.1
 Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 7/68 (10%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLFELARAHRP- 219
           IL+ GP   GK+ L K +         + ++  DL    +    +L+  L          
Sbjct: 40  ILILGPRQVGKTTLLKLLIKGLLEEIIY-INFDDLRLDRI----ELLDLLRAYIELKERE 94

Query: 220 -SIIFIDE 226
            S IF+DE
Sbjct: 95  KSYIFLDE 102


>gnl|CDD|197284 cd09188, PLDc_FAM83H_N, N-terminal phospholipase D-like domain of
           the uncharacterized protein, Family with sequence
           similarity 83H.  N-terminal phospholipase D (PLD)-like
           domain of the protein, Family with sequence similarity
           83H (FAM83H) on chromosome 8q24.3, which localizes in
           the intracellular environment and is associated with
           vesicles, can be regulated by kinases, and plays
           important roles during ameloblast differentiation and
           enamel matrix calcification. The gene encoding protein
           FAM83H is the first gene involved in the etiology of
           amelogenesis imperfecta (AI), that encodes a
           non-secreted protein due to the absence of a signal
           peptide. Defects in gene FAM83H cause autosomal dominant
           hypocalcified amelogenesis imperfecta (ADHCAI). Since
           the N-terminal PLD-like domain of FAM83H shows only
           trace similarity to the PLD catalytic domain and lacks
           the functionally important histidine residue, FAM83H may
           share a similar three-dimensional fold with PLD enzymes,
           but is most unlikely to carry PLD activity.
          Length = 265

 Score = 28.3 bits (63), Expect = 5.2
 Identities = 25/81 (30%), Positives = 37/81 (45%), Gaps = 12/81 (14%)

Query: 207 VKTLFELARAHRPSIIFIDERIDRGISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAI 266
           V TL +      PSI    E   R I +A  +    +   MD  +      +  +L+AA 
Sbjct: 106 VTTLVQPPPPDNPSI---KEEARRMIRSAQQV----IAVVMD--IFTDVDILSELLEAAA 156

Query: 267 RRRFEKRIYIALPELNARLFM 287
           RR     +YI L E+NA+LF+
Sbjct: 157 RR---VPVYILLDEMNAQLFL 174


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 28.1 bits (63), Expect = 5.2
 Identities = 15/42 (35%), Positives = 20/42 (47%)

Query: 65  AEKLKEYLKKGKKQPVKDGESRTKDDKKESDEDDSEDPDKKK 106
           AE  +E  K+ KK+     E + K DKKE   +      KKK
Sbjct: 147 AEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188


>gnl|CDD|218528 pfam05272, VirE, Virulence-associated protein E.  This family
           contains several bacterial virulence-associated protein
           E like proteins. These proteins contain a P-loop motif.
          Length = 198

 Score = 28.1 bits (63), Expect = 5.3
 Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 4/50 (8%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSK----WLGESEKL 206
           ++L G  G+GKS   K +  E    +  S    D   K    W+ E  +L
Sbjct: 55  LILQGAQGSGKSTFLKKLGGEWFTDSIRSFEGKDAYEKLQGVWIVEIAEL 104


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 28.7 bits (64), Expect = 5.3
 Identities = 19/101 (18%), Positives = 34/101 (33%), Gaps = 14/101 (13%)

Query: 56   EKCDEYLARAEKLKEYLKKGK------------KQPVKDGESRTKDDKKESDEDDSEDPD 103
            +K D+  A+AE+ +E L++              +Q  K    +    K    E   E   
Sbjct: 1166 DKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYG 1225

Query: 104  KKKMQANLEGAIVMEKPNVKWTDVAGLEAAKEALKEAVILP 144
               M+      +V  KP  +        AA +  +E   + 
Sbjct: 1226 SSAMETENVAEVV--KPKGRAGAKKKAPAAAKEKEEEDEIL 1264


>gnl|CDD|238899 cd01918, HprK_C, HprK/P, the bifunctional histidine-containing
           protein kinase/phosphatase, controls the phosphorylation
           state of the phosphocarrier protein HPr and regulates
           the utilization of carbon sources by gram-positive
           bacteria. It catalyzes both the ATP-dependent
           phosphorylation of Ser-46 of HPr and its
           dephosphorylation by phosphorolysis. The latter reaction
           uses inorganic phosphate as substrate and produces
           pyrophosphate. Phosphoenolpyruvate carboxykinase (PEPCK)
           and the C-terminal catalytic domain of HprK/P are
           structurally similar with conserved active site residues
           suggesting these two phosphotransferases have related
           functions.  The HprK/P N-terminal domain is structurally
           similar to the N-terminal domains of the MurE and MurF
           amino acid ligases.
          Length = 149

 Score = 27.6 bits (62), Expect = 5.3
 Identities = 10/17 (58%), Positives = 12/17 (70%)

Query: 159 KGILLFGPPGTGKSYLA 175
            G+L+ GP G GKS LA
Sbjct: 15  IGVLITGPSGIGKSELA 31


>gnl|CDD|222074 pfam13362, Toprim_3, Toprim domain.  The toprim domain is found in
           a wide variety of enzymes involved in nucleic acid
           manipulation.
          Length = 95

 Score = 26.9 bits (60), Expect = 5.5
 Identities = 15/48 (31%), Positives = 18/48 (37%), Gaps = 6/48 (12%)

Query: 231 GISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAIRRRFEKRIYIAL 278
           GI  AL   L         + VLGA N+  V      R    R+ IA 
Sbjct: 7   GIETAL--SLRRALPGAPVVAVLGANNLKAVAPPERGR----RVIIAA 48


>gnl|CDD|224391 COG1474, CDC6, Cdc6-related protein, AAA superfamily ATPase [DNA
           replication, recombination, and repair /
           Posttranslational modification, protein turnover,
           chaperones].
          Length = 366

 Score = 28.5 bits (64), Expect = 5.6
 Identities = 26/132 (19%), Positives = 50/132 (37%), Gaps = 25/132 (18%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNS---------------TFFSVSSSDLVSKWLGESE- 204
           I+++GP GTGK+   K V  E   S               T + V S  +++K       
Sbjct: 45  IIIYGPTGTGKTATVKFVMEELEESSANVEVVYINCLELRTPYQVLSK-ILNKLGKVPLT 103

Query: 205 -----KLVKTLFELARAHRPSIIFIDERIDRGISNALFLL--LSGVGNDMD-GILVLGAT 256
                +++K L++       ++I I + +D  +     +L  L     +    + ++  +
Sbjct: 104 GDSSLEILKRLYDNLSKKGKTVIVILDEVDALVDKDGEVLYSLLRAPGENKVKVSIIAVS 163

Query: 257 NIPWVLDAAIRR 268
           N    LD    R
Sbjct: 164 NDDKFLDYLDPR 175


>gnl|CDD|225713 COG3172, NadR, Predicted ATPase/kinase involved in NAD metabolism
           [Coenzyme metabolism].
          Length = 187

 Score = 27.8 bits (62), Expect = 5.6
 Identities = 18/78 (23%), Positives = 30/78 (38%), Gaps = 12/78 (15%)

Query: 159 KGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLVKTLF------- 211
           K + + G   +GKS L   +A    N+T       + V + LG  E L  + +       
Sbjct: 9   KTVAILGGESSGKSTLVNKLA-NIFNTTSAWEYGREYVFEHLGGDEALQYSDYAKIALGQ 67

Query: 212 ----ELARAHRPSIIFID 225
               + A  +   + FID
Sbjct: 68  AAYEDAAVRYANKVAFID 85


>gnl|CDD|107157 PHA02244, PHA02244, ATPase-like protein.
          Length = 383

 Score = 28.5 bits (63), Expect = 5.7
 Identities = 24/88 (27%), Positives = 44/88 (50%), Gaps = 4/88 (4%)

Query: 161 ILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSD--LVSKWLGESEKLVKTLFELARAHR 218
           + L G  G+GK+++A+ +A   +   +F  +  D   +  ++  + K  +T F  A   +
Sbjct: 122 VFLKGGAGSGKNHIAEQIAEALDLDFYFMNAIMDEFELKGFIDANGKFHETPFYEAFK-K 180

Query: 219 PSIIFIDERIDRGISNALFLLLSGVGND 246
             + FIDE ID  I  AL ++ S + N 
Sbjct: 181 GGLFFIDE-IDASIPEALIIINSAIANK 207


>gnl|CDD|183452 PRK12337, PRK12337, 2-phosphoglycerate kinase; Provisional.
          Length = 475

 Score = 28.2 bits (63), Expect = 6.1
 Identities = 10/19 (52%), Positives = 13/19 (68%)

Query: 161 ILLFGPPGTGKSYLAKAVA 179
           +L+ G  G GKS LA A+A
Sbjct: 258 VLIGGVSGVGKSVLASALA 276


>gnl|CDD|226126 COG3598, RepA, RecA-family ATPase [DNA replication, recombination,
           and repair].
          Length = 402

 Score = 28.3 bits (63), Expect = 6.2
 Identities = 34/134 (25%), Positives = 53/134 (39%), Gaps = 15/134 (11%)

Query: 148 PQLFTGKRVPWKGILLFGPPGTGKSYLAKAVATEANNSTFFSVSSSDLVSKWLGESEKLV 207
           PQL           +L+G  G GK+ L   +            +  +L    + E  K++
Sbjct: 79  PQLIDEFFRKGYVSILYGDSGVGKTTLLLYLCIAL-------AAGKNLFGNKVKEPGKVL 131

Query: 208 KTLFELARAHRPSIIFIDERIDRGISNALFLLLSGVGNDMDGILVLGATNIPWVLDAAIR 267
               EL   +R  I+   ER+   +   + L  + V N MD   V GA +   VL   + 
Sbjct: 132 YVSLEL---YREDIL---ERL-EPVRARMGLSPADVRN-MDLTDVSGAADESDVLSPKLY 183

Query: 268 RRFEKRIYIALPEL 281
           RRFEK +    P+ 
Sbjct: 184 RRFEKILEQKRPDF 197


>gnl|CDD|179615 PRK03634, PRK03634, rhamnulose-1-phosphate aldolase; Provisional.
          Length = 274

 Score = 28.0 bits (63), Expect = 6.4
 Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 283 ARLFMFRHHLGNTRNTISDENLKELAKR 310
           A +++    +G  + TI+DE L  L +R
Sbjct: 235 AEIYVKVLSMGGMKQTITDEELIALGER 262


>gnl|CDD|236134 PRK07940, PRK07940, DNA polymerase III subunit delta'; Validated.
          Length = 394

 Score = 28.3 bits (64), Expect = 6.5
 Identities = 18/61 (29%), Positives = 23/61 (37%), Gaps = 10/61 (16%)

Query: 124 WTDVAGLEAAKEALKEAVILPIKFPQLFTGKRVP----WKGILLFGPPGTGKSYLAKAVA 179
           W D+ G EA    L+ A                     W   L  GPPG+G+S  A+A A
Sbjct: 4   WDDLVGQEAVVAELRAAARAA---RADVAAAGSGMTHAW---LFTGPPGSGRSVAARAFA 57

Query: 180 T 180
            
Sbjct: 58  A 58


>gnl|CDD|221912 pfam13086, AAA_11, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins.
          Length = 220

 Score = 27.7 bits (62), Expect = 6.6
 Identities = 8/11 (72%), Positives = 10/11 (90%)

Query: 162 LLFGPPGTGKS 172
           L+ GPPGTGK+
Sbjct: 21  LIQGPPGTGKT 31


>gnl|CDD|238260 cd00464, SK, Shikimate kinase (SK) is the fifth enzyme in the
           shikimate pathway, a seven-step biosynthetic pathway
           which converts erythrose-4-phosphate to chorismic acid,
           found in bacteria, fungi and plants. Chorismic acid is a
           important intermediate in the synthesis of aromatic
           compounds, such as aromatic amino acids, p-aminobenzoic
           acid, folate and ubiquinone. Shikimate kinase catalyses
           the phosphorylation of the 3-hydroxyl group of shikimic
           acid using ATP.
          Length = 154

 Score = 27.5 bits (62), Expect = 6.6
 Identities = 7/19 (36%), Positives = 12/19 (63%)

Query: 161 ILLFGPPGTGKSYLAKAVA 179
           I+L G  G GK+ + + +A
Sbjct: 2   IVLIGMMGAGKTTVGRLLA 20


>gnl|CDD|236347 PRK08903, PRK08903, DnaA regulatory inactivator Hda; Validated.
          Length = 227

 Score = 28.0 bits (63), Expect = 6.7
 Identities = 24/109 (22%), Positives = 41/109 (37%), Gaps = 22/109 (20%)

Query: 161 ILLFGPPGTGKSYLAKAVATEAN----NSTFFSVSSSDLVSKWLGESEKLVKTLFELARA 216
             L+G  G+G+S+L +A+  +A+    N+ +   +S  L                     
Sbjct: 45  FYLWGEAGSGRSHLLQALVADASYGGRNARYLDAASPLL----------------AFDFD 88

Query: 217 HRPSIIFID--ERIDRGISNALFLLLSGVGNDMDGILVLGATNIPWVLD 263
               +  +D  ER+D     ALF L + V     G L++     P  L 
Sbjct: 89  PEAELYAVDDVERLDDAQQIALFNLFNRVRAHGQGALLVAGPAAPLALP 137


>gnl|CDD|213191 cd03224, ABC_TM1139_LivF_branched, ATP-binding cassette domain of
           branched-chain amino acid transporter.  LivF (TM1139) is
           part of the LIV-I bacterial ABC-type two-component
           transport system that imports neutral, branched-chain
           amino acids. The E. coli branched-chain amino acid
           transporter comprises a heterodimer of ABC transporters
           (LivF and LivG), a heterodimer of six-helix TM domains
           (LivM and LivH), and one of two alternative soluble
           periplasmic substrate binding proteins (LivK or LivJ).
           ABC transporters are a large family of proteins involved
           in the transport of a wide variety of different
           compounds, like sugars, ions, peptides, and more complex
           organic molecules.
          Length = 222

 Score = 27.8 bits (63), Expect = 6.8
 Identities = 30/104 (28%), Positives = 37/104 (35%), Gaps = 27/104 (25%)

Query: 206 LVKTLFELARAHRPSIIF----I-----DERIDRGIS-----NALFLLLSGVGNDMDGIL 251
           L+KT+  L      SI F    I      ER   GI        +F  L+   N     L
Sbjct: 42  LLKTIMGLLPPRSGSIRFDGRDITGLPPHERARAGIGYVPEGRRIFPELTVEEN-----L 96

Query: 252 VLGATNIPWVLDAAIRRRFEKRIYIALPELNARLFMFRHHLGNT 295
           +LGA        A  + R E R+Y   P L  R    R  L  T
Sbjct: 97  LLGAYARR---RAKRKARLE-RVYELFPRLKER----RKQLAGT 132


>gnl|CDD|227034 COG4690, PepD, Dipeptidase [Amino acid transport and metabolism].
          Length = 464

 Score = 28.2 bits (63), Expect = 6.9
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query: 282 NARLFMFRHHLGNTRNTISDENLKELAKRTEGYSGAD 318
               F   H+  N  + I  E+LK+ AK+   +   D
Sbjct: 182 APNQFRIDHYEFNDEDYICSEDLKDFAKKQGLWDPFD 218


>gnl|CDD|178593 PLN03021, PLN03021, Low-temperature-induced protein; Provisional.
          Length = 619

 Score = 28.4 bits (62), Expect = 6.9
 Identities = 14/38 (36%), Positives = 20/38 (52%)

Query: 66  EKLKEYLKKGKKQPVKDGESRTKDDKKESDEDDSEDPD 103
           +K+KE  KK K    K G     D + + DE D +DP+
Sbjct: 38  KKVKEKAKKIKNSLTKHGNGHDHDVEDDDDEYDEQDPE 75


>gnl|CDD|223775 COG0703, AroK, Shikimate kinase [Amino acid transport and
           metabolism].
          Length = 172

 Score = 27.6 bits (62), Expect = 7.2
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 161 ILLFGPPGTGKSYLAKAVATEAN 183
           I+L G  G GKS + +A+A   N
Sbjct: 5   IVLIGFMGAGKSTIGRALAKALN 27


>gnl|CDD|213244 cd03277, ABC_SMC5_euk, ATP-binding cassette domain of eukaryotic
          SMC5 proteins.  The structural maintenance of
          chromosomes (SMC) proteins are large (approximately 110
          to 170 kDa), and each is arranged into five
          recognizable domains. Amino-acid sequence homology of
          SMC proteins between species is largely confined to the
          amino- and carboxy-terminal globular domains. The
          amino-terminal domain contains a 'Walker A'
          nucleotide-binding domain (GxxGxGKS/T, in the
          single-letter amino-acid code), which by mutational
          studies has been shown to be essential in several
          proteins. The carboxy-terminal domain contains a
          sequence (the DA-box) that resembles a 'Walker B'
          motif, and a motif with homology to the signature
          sequence of the ATP-binding cassette (ABC) family of
          ATPases. The sequence homology within the
          carboxy-terminal domain is relatively high within the
          SMC1-SMC4 group, whereas SMC5 and SMC6 show some
          divergence in both of these sequences. In eukaryotic
          cells, the proteins are found as heterodimers of SMC1
          paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6
          (formerly known as Rad18).
          Length = 213

 Score = 27.6 bits (62), Expect = 7.5
 Identities = 7/19 (36%), Positives = 14/19 (73%)

Query: 60 EYLARAEKLKEYLKKGKKQ 78
          + L RA+K+ E++K+G  +
Sbjct: 52 KLLGRAKKVGEFVKRGCDE 70


>gnl|CDD|219839 pfam08433, KTI12, Chromatin associated protein KTI12.  This is a
           family of chromatin associated proteins which interact
           with the Elongator complex, a component of the
           elongating form of RNA polymerase II. The Elongator
           complex has histone acetyltransferase activity.
          Length = 266

 Score = 27.6 bits (62), Expect = 7.9
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 161 ILLFGPPGTGKSYLAKAVATE 181
           I+L G P +GKS  AK +A  
Sbjct: 2   IILTGLPSSGKSTRAKELAKY 22


>gnl|CDD|233087 TIGR00678, holB, DNA polymerase III, delta' subunit.  This model
           describes the N-terminal half of the delta' subunit of
           DNA polymerase III. Delta' is homologous to the gamma
           and tau subunits, which form an outgroup for
           phylogenetic comparison. The gamma/tau branch of the
           tree is much more tighly conserved than the delta'
           branch, and some members of that branch score more
           highly against this model than some proteins
           classisified as delta'. The noise cutoff is set to
           detect weakly scoring delta' subunits rather than to
           exclude gamma/tau subunits. At position 126-127 of the
           seed alignment, this family lacks the HM motif of
           gamma/tau; at 132 it has a near-invariant A vs. an
           invariant F in gamma/tau [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 188

 Score = 27.6 bits (62), Expect = 8.1
 Identities = 11/19 (57%), Positives = 12/19 (63%), Gaps = 1/19 (5%)

Query: 162 LLF-GPPGTGKSYLAKAVA 179
            LF GP G GK  LA A+A
Sbjct: 17  YLFAGPEGVGKELLALALA 35


>gnl|CDD|227429 COG5098, COG5098, Chromosome condensation complex Condensin,
           subunit D2 [Chromatin structure and dynamics / Cell
           division and chromosome partitioning].
          Length = 1128

 Score = 28.0 bits (62), Expect = 8.3
 Identities = 17/78 (21%), Positives = 32/78 (41%), Gaps = 7/78 (8%)

Query: 41  LKYEITSAEGKQSIREKCDEYLARAEKLKEYLKKGKKQPVKDGESRTKDDKKESDEDDSE 100
           +++   S  G Q      ++ L  AE     LK G ++ + DGE       KE ++D+ +
Sbjct: 415 MRHPFASEHGSQLRLTLWEKNLGSAENQLNSLKSGLQETLCDGE-------KEVEQDEGQ 467

Query: 101 DPDKKKMQANLEGAIVME 118
              + +   N      +E
Sbjct: 468 CRTELEGSFNKSAESRIE 485


>gnl|CDD|215749 pfam00148, Oxidored_nitro, Nitrogenase component 1 type
           Oxidoreductase. 
          Length = 397

 Score = 28.0 bits (63), Expect = 8.3
 Identities = 10/36 (27%), Positives = 20/36 (55%), Gaps = 3/36 (8%)

Query: 189 SVSSSDLVSKWLGESEKLVKTLFELARAHRPSIIFI 224
            ++  D+V    G  EKL + + E+ + ++P  IF+
Sbjct: 48  DLTEKDVV---FGGEEKLEEAIREIDKRYKPKAIFV 80


>gnl|CDD|226372 COG3854, SpoIIIAA, ncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 308

 Score = 27.9 bits (62), Expect = 8.4
 Identities = 10/31 (32%), Positives = 15/31 (48%), Gaps = 4/31 (12%)

Query: 158 WKGILLFGPPGTGKSYL----AKAVATEANN 184
           W   L+ GPP  GK+ L    A+ ++   N 
Sbjct: 137 WLNTLIIGPPQVGKTTLLRDIARLLSDGINQ 167


>gnl|CDD|234579 PRK00023, cmk, cytidylate kinase; Provisional.
          Length = 225

 Score = 27.4 bits (62), Expect = 8.6
 Identities = 8/17 (47%), Positives = 12/17 (70%)

Query: 165 GPPGTGKSYLAKAVATE 181
           GP G+GK  +AK +A +
Sbjct: 11  GPAGSGKGTVAKILAKK 27


>gnl|CDD|225327 COG2607, COG2607, Predicted ATPase (AAA+ superfamily) [General
           function prediction only].
          Length = 287

 Score = 27.7 bits (62), Expect = 8.9
 Identities = 12/21 (57%), Positives = 15/21 (71%)

Query: 161 ILLFGPPGTGKSYLAKAVATE 181
           +LL+G  GTGKS L KA+  E
Sbjct: 88  VLLWGARGTGKSSLVKALLNE 108


>gnl|CDD|236022 PRK07471, PRK07471, DNA polymerase III subunit delta'; Validated.
          Length = 365

 Score = 27.7 bits (62), Expect = 9.5
 Identities = 18/57 (31%), Positives = 26/57 (45%), Gaps = 15/57 (26%)

Query: 125 TDVAGLEAAKEALKEAVILPIKFPQLFTGKRVP--WKGILLFGPPGTGKSYLAKAVA 179
           T + G  AA+ AL +A          +   R+   W   L+ GP G GK+ LA  +A
Sbjct: 19  TALFGHAAAEAALLDA----------YRSGRLHHAW---LIGGPQGIGKATLAYRMA 62


>gnl|CDD|182791 PRK10865, PRK10865, protein disaggregation chaperone; Provisional.
          Length = 857

 Score = 27.9 bits (62), Expect = 9.7
 Identities = 39/154 (25%), Positives = 66/154 (42%), Gaps = 31/154 (20%)

Query: 162 LLFGPPGTGKSYLAKAVATEA---------NNSTFFSVSSSDLV--SKWLGE-SEKLVKT 209
           +L G PG GK+ + + +A                  ++    LV  +K+ GE  E+L   
Sbjct: 203 VLIGEPGVGKTAIVEGLAQRIINGEVPEGLKGRRVLALDMGALVAGAKYRGEFEERLKGV 262

Query: 210 LFELARAHRPSIIFIDE---RIDRGISNALFLLLSGVGNDMDGILV------LGATNIPW 260
           L +LA+     I+FIDE    +  G ++         GN +   L       +GAT +  
Sbjct: 263 LNDLAKQEGNVILFIDELHTMVGAGKADGAM----DAGNMLKPALARGELHCVGATTLDE 318

Query: 261 VL-----DAAIRRRFEKRIYIALPELNARLFMFR 289
                  DAA+ RRF+K +++A P +   + + R
Sbjct: 319 YRQYIEKDAALERRFQK-VFVAEPSVEDTIAILR 351


>gnl|CDD|224162 COG1241, MCM2, Predicted ATPase involved in replication control,
           Cdc46/Mcm family [DNA replication, recombination, and
           repair].
          Length = 682

 Score = 27.7 bits (62), Expect = 10.0
 Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 14/54 (25%)

Query: 135 EALKEAVILPIKFPQLF--TGKRVPWKG-------ILLFGPPGTGKSYLAKAVA 179
           E +K+A++L     QLF    K +P          ILL G PGT KS L K VA
Sbjct: 292 EDVKKAILL-----QLFGGVKKNLPDGTRIRGDIHILLVGDPGTAKSQLLKYVA 340


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.133    0.377 

Gapped
Lambda     K      H
   0.267   0.0740    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,148,804
Number of extensions: 1715775
Number of successful extensions: 3567
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3410
Number of HSP's successfully gapped: 304
Length of query: 330
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 233
Effective length of database: 6,635,264
Effective search space: 1546016512
Effective search space used: 1546016512
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.5 bits)