RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4538
         (343 letters)



>gnl|CDD|223540 COG0464, SpoVK, ATPases of the AAA+ class [Posttranslational
           modification, protein turnover, chaperones].
          Length = 494

 Score =  205 bits (523), Expect = 5e-62
 Identities = 100/219 (45%), Positives = 132/219 (60%), Gaps = 13/219 (5%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LL+GPPGTGKTL+ K VA + ++ F  +  S L SKW GE EK +R LF  A    PSI
Sbjct: 279 VLLYGPPGTGKTLLAKAVALESRSRFISVKGSELLSKWVGESEKNIRELFEKARKLAPSI 338

Query: 61  IFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAAR 120
           IFIDEIDSL   R   E+ + RR+  + L  LDG    +   VLVI ATNRP +LD A  
Sbjct: 339 IFIDEIDSLASGRGPSEDGSGRRVVGQLLTELDGIEKAEG--VLVIAATNRPDDLDPALL 396

Query: 121 R--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVD-DIEEVGRMTTDFSGADMASLCR 177
           R  R  + +Y+PLPD + R EI    LR+    L  D D+EE+  +T  +SGAD+A+L R
Sbjct: 397 RPGRFDRLIYVPLPDLEERLEIFKIHLRDKKPPLAEDVDLEELAEITEGYSGADIAALVR 456

Query: 178 EASLGPVRSIDLSRIDALDVRPISIDDFRDALKTVRPSV 216
           EA+L  +R            R +++DDF DALK ++PSV
Sbjct: 457 EAALEALRE--------ARRREVTLDDFLDALKKIKPSV 487



 Score =  118 bits (296), Expect = 1e-29
 Identities = 91/238 (38%), Positives = 125/238 (52%), Gaps = 19/238 (7%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LL GPPGTGKTL+ + +A    A F  I+   + SK+ GE E  +R LF  A    PSI
Sbjct: 21  VLLHGPPGTGKTLLARALA-NEGAEFLSINGPEILSKYVGESELRLRELFEEAEKLAPSI 79

Query: 61  IFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAAR 120
           IFIDEID+L  +RS  + E  RR+  + L  +DG   L    V+VIGATNRP  LD A R
Sbjct: 80  IFIDEIDALAPKRSSDQGEVERRVVAQLLALMDG---LKRGQVIVIGATNRPDGLDPAKR 136

Query: 121 R--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASLCRE 178
           R  R  + + + LPDE  R EI+    R +         + +   T   SGAD+ +L +E
Sbjct: 137 RPGRFDREIEVNLPDEAGRLEILQIHTRLMFLGPPG-TGKTLAARTVGKSGADLGALAKE 195

Query: 179 ASLGPVRSIDLSRIDALDVRPISIDDFRDALKTVRPSVCQADFVHQPSIIFIDEIDSL 236
           A+L  +R       + + V   + DDF +ALK V PS           ++F DE  +L
Sbjct: 196 AALRELRRAIDLVGEYIGV---TEDDFEEALKKVLPS---------RGVLFEDEDVTL 241



 Score = 88.7 bits (220), Expect = 2e-19
 Identities = 39/81 (48%), Positives = 48/81 (59%), Gaps = 4/81 (4%)

Query: 223 HQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQEL 282
             PSIIFIDEIDSL   R   E+ + RR+  + L  LDG    +   VLVI ATNRP +L
Sbjct: 334 LAPSIIFIDEIDSLASGRGPSEDGSGRRVVGQLLTELDGIEKAEG--VLVIAATNRPDDL 391

Query: 283 DEAARR--RLVKRLYIPLPDE 301
           D A  R  R  + +Y+PLPD 
Sbjct: 392 DPALLRPGRFDRLIYVPLPDL 412



 Score = 53.3 bits (128), Expect = 7e-08
 Identities = 44/120 (36%), Positives = 59/120 (49%), Gaps = 11/120 (9%)

Query: 192 IDALDVRPISIDDFRDALKTVRPS------VCQADFVHQPSIIFIDEIDSLLCQRSDQEN 245
           +       +SI+      K V  S      + +      PSIIFIDEID+L  +RS  + 
Sbjct: 38  LANEGAEFLSINGPEILSKYVGESELRLRELFEEAEKLAPSIIFIDEIDALAPKRSSDQG 97

Query: 246 ETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAARR--RLVKRLYIPLPDEQK 303
           E  RR+  + L  +DG   L    V+VIGATNRP  LD A RR  R  + + + LPDE  
Sbjct: 98  EVERRVVAQLLALMDG---LKRGQVIVIGATNRPDGLDPAKRRPGRFDREIEVNLPDEAG 154


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score =  165 bits (421), Expect = 7e-48
 Identities = 88/221 (39%), Positives = 123/221 (55%), Gaps = 25/221 (11%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LL+GPPGTGKTL+ K VA Q  ATF  +  S L  K+ GEG ++VR LF +A    PSI
Sbjct: 188 VLLYGPPGTGKTLLAKAVANQTDATFIRVVGSELVQKYIGEGARLVRELFELAREKAPSI 247

Query: 61  IFIDEIDSLLCQRSDQ----ENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELD 116
           IFIDEID++  +R D     + E  R +  E L  LDG     +  V VI ATNRP  LD
Sbjct: 248 IFIDEIDAIGAKRFDSGTSGDREVQRTM-LELLNQLDGFDPRGN--VKVIMATNRPDILD 304

Query: 117 EAARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVD---DIEEVGRMTTDFSGAD 171
            A  R  R  +++  PLPDE+ R EI    L+  T  + +    D+E + R+T  FSGAD
Sbjct: 305 PALLRPGRFDRKIEFPLPDEEGRAEI----LKIHTRKMNLADDVDLELLARLTEGFSGAD 360

Query: 172 MASLCREASLGPVRSIDLSRIDALDVRPISIDDFRDALKTV 212
           + ++C EA +  +R              ++++DF  A++ V
Sbjct: 361 LKAICTEAGMFAIRE---------RRDEVTMEDFLKAVEKV 392



 Score = 65.7 bits (161), Expect = 6e-12
 Identities = 37/86 (43%), Positives = 47/86 (54%), Gaps = 9/86 (10%)

Query: 223 HQPSIIFIDEIDSLLCQRSDQ----ENETSRRLKTEFLISLDGASTLDDDLVLVIGATNR 278
             PSIIFIDEID++  +R D     + E  R +  E L  LDG     +  V VI ATNR
Sbjct: 243 KAPSIIFIDEIDAIGAKRFDSGTSGDREVQRTM-LELLNQLDGFDPRGN--VKVIMATNR 299

Query: 279 PQELDEAARR--RLVKRLYIPLPDEQ 302
           P  LD A  R  R  +++  PLPDE+
Sbjct: 300 PDILDPALLRPGRFDRKIEFPLPDEE 325


>gnl|CDD|215649 pfam00004, AAA, ATPase family associated with various cellular
           activities (AAA).  AAA family proteins often perform
           chaperone-like functions that assist in the assembly,
           operation, or disassembly of protein complexes.
          Length = 131

 Score =  152 bits (386), Expect = 6e-46
 Identities = 62/132 (46%), Positives = 77/132 (58%), Gaps = 2/132 (1%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LL+GPPGTGKT + K VA +  A F  IS S L SK+ GE EK +R LF  A    P +
Sbjct: 1   LLLYGPPGTGKTTLAKAVAKELGAPFIEISGSELVSKYVGESEKRLRELFEAAKKLAPCV 60

Query: 61  IFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAAR 120
           IFIDEID+L   R    +  SRR+  + L  LDG  T     V+VI ATNRP +LD A  
Sbjct: 61  IFIDEIDALAGSRGSGGDSESRRVVNQLLTELDGF-TSSLSKVIVIAATNRPDKLDPALL 119

Query: 121 R-RLVKRLYIPL 131
           R R  + +  PL
Sbjct: 120 RGRFDRIIEFPL 131



 Score = 85.7 bits (213), Expect = 1e-20
 Identities = 40/126 (31%), Positives = 57/126 (45%), Gaps = 12/126 (9%)

Query: 174 SLCREASLGPVRSIDLSRIDALDVRPISIDDFRDALKTVRPSVCQADFVHQPSIIFIDEI 233
           ++ +E    P   I  S + +  V   S    R+  +  +           P +IFIDEI
Sbjct: 17  AVAKELGA-PFIEISGSELVSKYVG-ESEKRLRELFEAAK--------KLAPCVIFIDEI 66

Query: 234 DSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAARR-RLVK 292
           D+L   R    +  SRR+  + L  LDG  T     V+VI ATNRP +LD A  R R  +
Sbjct: 67  DALAGSRGSGGDSESRRVVNQLLTELDGF-TSSLSKVIVIAATNRPDKLDPALLRGRFDR 125

Query: 293 RLYIPL 298
            +  PL
Sbjct: 126 IIEFPL 131


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score =  159 bits (405), Expect = 1e-45
 Identities = 83/218 (38%), Positives = 121/218 (55%), Gaps = 17/218 (7%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LL+GPPGTGKTL+ K VA +  ATF  +  S L  K+ GEG ++VR LF +A    PSI
Sbjct: 168 VLLYGPPGTGKTLLAKAVAHETNATFIRVVGSELVQKFIGEGARLVRELFELAREKAPSI 227

Query: 61  IFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDLVLVIGATNRPQELDE 117
           IFIDEID++  +R+D      R   R   + L  +DG     +  V +I ATNR   LD 
Sbjct: 228 IFIDEIDAIAAKRTDSGTSGDREVQRTLMQLLAEMDGFDPRGN--VKIIAATNRIDILDP 285

Query: 118 AARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASL 175
           A  R  R  + + +PLPDE+ R EI+    R +  +  V D+EE+  +T   SGAD+ ++
Sbjct: 286 AILRPGRFDRIIEVPLPDEEGRLEILKIHTRKMNLADDV-DLEELAELTEGASGADLKAI 344

Query: 176 CREASLGPVRSIDLSRIDALDVRPISIDDFRDALKTVR 213
           C EA +  +R          D   ++++DF  A++ V 
Sbjct: 345 CTEAGMFAIRD---------DRTEVTMEDFLKAIEKVM 373



 Score = 57.9 bits (141), Expect = 2e-09
 Identities = 33/85 (38%), Positives = 45/85 (52%), Gaps = 7/85 (8%)

Query: 223 HQPSIIFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDLVLVIGATNRP 279
             PSIIFIDEID++  +R+D      R   R   + L  +DG     +  V +I ATNR 
Sbjct: 223 KAPSIIFIDEIDAIAAKRTDSGTSGDREVQRTLMQLLAEMDGFDPRGN--VKIIAATNRI 280

Query: 280 QELDEAARR--RLVKRLYIPLPDEQ 302
             LD A  R  R  + + +PLPDE+
Sbjct: 281 DILDPAILRPGRFDRIIEVPLPDEE 305


>gnl|CDD|233328 TIGR01243, CDC48, AAA family ATPase, CDC48 subfamily.  This
           subfamily of the AAA family ATPases includes two members
           each from three archaeal species. It also includes yeast
           CDC48 (cell division control protein 48) and the human
           ortholog, transitional endoplasmic reticulum ATPase
           (valosin-containing protein). These proteins in
           eukaryotes are involved in the budding and transfer of
           membrane from the transitional endoplasmic reticulum to
           the Golgi apparatus.
          Length = 733

 Score =  163 bits (413), Expect = 7e-45
 Identities = 92/243 (37%), Positives = 141/243 (58%), Gaps = 15/243 (6%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LL+GPPGTGKTL+ K VA +  A F  I+   + SK+YGE E+ +R +F  A  + PSI
Sbjct: 215 VLLYGPPGTGKTLLAKAVANEAGAYFISINGPEIMSKYYGESEERLREIFKEAEENAPSI 274

Query: 61  IFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAAR 120
           IFIDEID++  +R +   E  +R+  + L  +DG        V+VIGATNRP  LD A R
Sbjct: 275 IFIDEIDAIAPKREEVTGEVEKRVVAQLLTLMDGLK--GRGRVIVIGATNRPDALDPALR 332

Query: 121 R--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASLCRE 178
           R  R  + + I +PD++AR EI+    RN+  +  V D++++  +T  F GAD+A+L +E
Sbjct: 333 RPGRFDREIVIRVPDKRARKEILKVHTRNMPLAEDV-DLDKLAEVTHGFVGADLAALAKE 391

Query: 179 ASLGPVRS-IDLSRID---------ALDVRPISIDDFRDALKTVRPSVCQADFVHQPSII 228
           A++  +R  I   +I+          L    +++ DF +ALK V PS  +   V  P++ 
Sbjct: 392 AAMAALRRFIREGKINFEAEEIPAEVLKELKVTMKDFMEALKMVEPSAIREVLVEVPNVR 451

Query: 229 FID 231
           + D
Sbjct: 452 WSD 454



 Score =  156 bits (396), Expect = 1e-42
 Identities = 87/232 (37%), Positives = 132/232 (56%), Gaps = 15/232 (6%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LLFGPPGTGKTL+ K VA +  A F  +    + SKW GE EK +R +F  A    P+I
Sbjct: 490 VLLFGPPGTGKTLLAKAVATESGANFIAVRGPEILSKWVGESEKAIREIFRKARQAAPAI 549

Query: 61  IFIDEIDSLLCQRSDQENET-SRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDEAA 119
           IF DEID++   R  + + + + R+  + L  +DG   L +  V+VI ATNRP  LD A 
Sbjct: 550 IFFDEIDAIAPARGARFDTSVTDRIVNQLLTEMDGIQELSN--VVVIAATNRPDILDPAL 607

Query: 120 RR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASLCR 177
            R  R  + + +P PDE+AR EI     R++  +  V D+EE+  MT  ++GAD+ ++CR
Sbjct: 608 LRPGRFDRLILVPPPDEEARKEIFKIHTRSMPLAEDV-DLEELAEMTEGYTGADIEAVCR 666

Query: 178 EASLGPVRS---------IDLSRIDALDVRPISIDDFRDALKTVRPSVCQAD 220
           EA++  +R          +++   + L    + +  F +ALK V+PSV + D
Sbjct: 667 EAAMAALRESIGSPAKEKLEVGEEEFLKDLKVEMRHFLEALKKVKPSVSKED 718



 Score = 61.8 bits (150), Expect = 2e-10
 Identities = 34/80 (42%), Positives = 48/80 (60%), Gaps = 4/80 (5%)

Query: 225 PSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDE 284
           PSIIFIDEID++  +R +   E  +R+  + L  +DG        V+VIGATNRP  LD 
Sbjct: 272 PSIIFIDEIDAIAPKREEVTGEVEKRVVAQLLTLMDGLK--GRGRVIVIGATNRPDALDP 329

Query: 285 AARR--RLVKRLYIPLPDEQ 302
           A RR  R  + + I +PD++
Sbjct: 330 ALRRPGRFDREIVIRVPDKR 349



 Score = 58.4 bits (141), Expect = 2e-09
 Identities = 31/81 (38%), Positives = 47/81 (58%), Gaps = 5/81 (6%)

Query: 225 PSIIFIDEIDSLLCQRSDQENET-SRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELD 283
           P+IIF DEID++   R  + + + + R+  + L  +DG   L +  V+VI ATNRP  LD
Sbjct: 547 PAIIFFDEIDAIAPARGARFDTSVTDRIVNQLLTEMDGIQELSN--VVVIAATNRPDILD 604

Query: 284 EAARR--RLVKRLYIPLPDEQ 302
            A  R  R  + + +P PDE+
Sbjct: 605 PALLRPGRFDRLILVPPPDEE 625


>gnl|CDD|130309 TIGR01242, 26Sp45, 26S proteasome subunit P45 family.  Many
           proteins may score above the trusted cutoff because an
           internal.
          Length = 364

 Score =  145 bits (367), Expect = 2e-40
 Identities = 84/217 (38%), Positives = 121/217 (55%), Gaps = 17/217 (7%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LL+GPPGTGKTL+ K VA +  ATF  +  S L  K+ GEG ++VR +F +A    PSI
Sbjct: 159 VLLYGPPGTGKTLLAKAVAHETNATFIRVVGSELVRKYIGEGARLVREIFELAKEKAPSI 218

Query: 61  IFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDLVLVIGATNRPQELDE 117
           IFIDEID++  +R+D      R   R   + L  LDG     +  V VI ATNRP  LD 
Sbjct: 219 IFIDEIDAIAAKRTDSGTSGDREVQRTLMQLLAELDGFDPRGN--VKVIAATNRPDILDP 276

Query: 118 AARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASL 175
           A  R  R  + + +PLPD + R EI+    R +  +  V D+E + +MT   SGAD+ ++
Sbjct: 277 ALLRPGRFDRIIEVPLPDFEGRLEILKIHTRKMKLAEDV-DLEAIAKMTEGASGADLKAI 335

Query: 176 CREASLGPVRSIDLSRIDALDVRPISIDDFRDALKTV 212
           C EA +  +R  +           +++DDF  A++ V
Sbjct: 336 CTEAGMFAIR--EERDY-------VTMDDFIKAVEKV 363



 Score = 57.5 bits (139), Expect = 3e-09
 Identities = 35/83 (42%), Positives = 45/83 (54%), Gaps = 7/83 (8%)

Query: 225 PSIIFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDLVLVIGATNRPQE 281
           PSIIFIDEID++  +R+D      R   R   + L  LDG     +  V VI ATNRP  
Sbjct: 216 PSIIFIDEIDAIAAKRTDSGTSGDREVQRTLMQLLAELDGFDPRGN--VKVIAATNRPDI 273

Query: 282 LDEAARR--RLVKRLYIPLPDEQ 302
           LD A  R  R  + + +PLPD +
Sbjct: 274 LDPALLRPGRFDRIIEVPLPDFE 296


>gnl|CDD|233327 TIGR01241, FtsH_fam, ATP-dependent metalloprotease FtsH.
           HflB(FtsH) is a pleiotropic protein required for correct
           cell division in bacteria. It has ATP-dependent zinc
           metalloprotease activity. It was formerly designated
           cell division protein FtsH [Cellular processes, Cell
           division, Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 495

 Score =  132 bits (335), Expect = 7e-35
 Identities = 82/206 (39%), Positives = 113/206 (54%), Gaps = 17/206 (8%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LL GPPGTGKTL+ K VA +    FF IS S     + G G   VR LF  A  + P I
Sbjct: 91  VLLVGPPGTGKTLLAKAVAGEAGVPFFSISGSDFVEMFVGVGASRVRDLFEQAKKNAPCI 150

Query: 61  IFIDEIDSLLCQR-------SDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQ 113
           IFIDEID++  QR       +D+  +T  +L    L+ +DG  T  +  V+VI ATNRP 
Sbjct: 151 IFIDEIDAVGRQRGAGLGGGNDEREQTLNQL----LVEMDGFGT--NTGVIVIAATNRPD 204

Query: 114 ELDEAARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGAD 171
            LD A  R  R  +++ + LPD + R EI+    +N   +  V D++ V R T  FSGAD
Sbjct: 205 VLDPALLRPGRFDRQVVVDLPDIKGREEILKVHAKNKKLAPDV-DLKAVARRTPGFSGAD 263

Query: 172 MASLCREASLGPVRSIDLSRIDALDV 197
           +A+L  EA+L   R  + + I   D+
Sbjct: 264 LANLLNEAALLAARK-NKTEITMNDI 288



 Score = 52.3 bits (126), Expect = 2e-07
 Identities = 34/87 (39%), Positives = 49/87 (56%), Gaps = 15/87 (17%)

Query: 223 HQPSIIFIDEIDSLLCQR-------SDQENETSRRLKTEFLISLDGASTLDDDLVLVIGA 275
           + P IIFIDEID++  QR       +D+  +T  +L    L+ +DG  T  +  V+VI A
Sbjct: 146 NAPCIIFIDEIDAVGRQRGAGLGGGNDEREQTLNQL----LVEMDGFGT--NTGVIVIAA 199

Query: 276 TNRPQELDEAARR--RLVKRLYIPLPD 300
           TNRP  LD A  R  R  +++ + LPD
Sbjct: 200 TNRPDVLDPALLRPGRFDRQVVVDLPD 226


>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
           protein; Provisional.
          Length = 438

 Score =  130 bits (328), Expect = 2e-34
 Identities = 84/227 (37%), Positives = 116/227 (51%), Gaps = 33/227 (14%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           ++L+GPPGTGKTL+ K VA +  ATF  +  S L  K+ G+G K+VR LF VA  + PSI
Sbjct: 220 VILYGPPGTGKTLLAKAVANETSATFLRVVGSELIQKYLGDGPKLVRELFRVAEENAPSI 279

Query: 61  IFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDLVLVIGATNRPQELDE 117
           +FIDEID++  +R D  +   +   R   E L  LDG  +  D  V VI ATNR + LD 
Sbjct: 280 VFIDEIDAIGTKRYDATSGGEKEIQRTMLELLNQLDGFDSRGD--VKVIMATNRIESLDP 337

Query: 118 AARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASL 175
           A  R  R+ +++  P PDE+ +  I       +T +  V D+EE      + SGAD+ ++
Sbjct: 338 ALIRPGRIDRKIEFPNPDEKTKRRIFEIHTSKMTLAEDV-DLEEFIMAKDELSGADIKAI 396

Query: 176 CREASLGPVRSIDLSRIDALDVRPISIDDFRDALKTVRPSVCQADFV 222
           C EA L                          AL+  R  V QADF 
Sbjct: 397 CTEAGL-------------------------LALRERRMKVTQADFR 418



 Score = 55.5 bits (134), Expect = 1e-08
 Identities = 34/85 (40%), Positives = 48/85 (56%), Gaps = 7/85 (8%)

Query: 223 HQPSIIFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDLVLVIGATNRP 279
           + PSI+FIDEID++  +R D  +   +   R   E L  LDG  +  D  V VI ATNR 
Sbjct: 275 NAPSIVFIDEIDAIGTKRYDATSGGEKEIQRTMLELLNQLDGFDSRGD--VKVIMATNRI 332

Query: 280 QELDEAARR--RLVKRLYIPLPDEQ 302
           + LD A  R  R+ +++  P PDE+
Sbjct: 333 ESLDPALIRPGRIDRKIEFPNPDEK 357


>gnl|CDD|240423 PTZ00454, PTZ00454, 26S protease regulatory subunit 6B-like
           protein; Provisional.
          Length = 398

 Score =  124 bits (314), Expect = 1e-32
 Identities = 76/191 (39%), Positives = 106/191 (55%), Gaps = 10/191 (5%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LL+GPPGTGKT++ K VA    ATF  +  S    K+ GEG +MVR +F +A  + PSI
Sbjct: 182 VLLYGPPGTGKTMLAKAVAHHTTATFIRVVGSEFVQKYLGEGPRMVRDVFRLARENAPSI 241

Query: 61  IFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDL-VLVIGATNRPQELD 116
           IFIDE+DS+  +R D +    R   R+  E L  +DG    D    V VI ATNR   LD
Sbjct: 242 IFIDEVDSIATKRFDAQTGADREVQRILLELLNQMDG---FDQTTNVKVIMATNRADTLD 298

Query: 117 EAARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMAS 174
            A  R  RL +++  PLPD + +  I   +   +  S  V D+E+        S AD+A+
Sbjct: 299 PALLRPGRLDRKIEFPLPDRRQKRLIFQTITSKMNLSEEV-DLEDFVSRPEKISAADIAA 357

Query: 175 LCREASLGPVR 185
           +C+EA +  VR
Sbjct: 358 ICQEAGMQAVR 368



 Score = 55.5 bits (134), Expect = 1e-08
 Identities = 36/85 (42%), Positives = 47/85 (55%), Gaps = 9/85 (10%)

Query: 225 PSIIFIDEIDSLLCQRSDQENETSR---RLKTEFLISLDGASTLDDDL-VLVIGATNRPQ 280
           PSIIFIDE+DS+  +R D +    R   R+  E L  +DG    D    V VI ATNR  
Sbjct: 239 PSIIFIDEVDSIATKRFDAQTGADREVQRILLELLNQMDG---FDQTTNVKVIMATNRAD 295

Query: 281 ELDEAARR--RLVKRLYIPLPDEQK 303
            LD A  R  RL +++  PLPD ++
Sbjct: 296 TLDPALLRPGRLDRKIEFPLPDRRQ 320


>gnl|CDD|223541 COG0465, HflB, ATP-dependent Zn proteases [Posttranslational
           modification, protein turnover, chaperones].
          Length = 596

 Score =  114 bits (288), Expect = 3e-28
 Identities = 78/214 (36%), Positives = 115/214 (53%), Gaps = 19/214 (8%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWY-GEGEKMVRALFAVASVHQPS 59
           +LL GPPGTGKTL+ K VA +    FF IS S    + + G G   VR LF  A  + P 
Sbjct: 186 VLLVGPPGTGKTLLAKAVAGEAGVPFFSISGSD-FVEMFVGVGASRVRDLFEQAKKNAPC 244

Query: 60  IIFIDEIDSLLCQRSDQE---NETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELD 116
           IIFIDEID++  QR       N+   +   + L+ +DG     ++ V+VI ATNRP  LD
Sbjct: 245 IIFIDEIDAVGRQRGAGLGGGNDEREQTLNQLLVEMDGFGG--NEGVIVIAATNRPDVLD 302

Query: 117 EAARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMAS 174
            A  R  R  +++ + LPD + R +I+    +N   +  V D++++ R T  FSGAD+A+
Sbjct: 303 PALLRPGRFDRQILVELPDIKGREQILKVHAKNKPLAEDV-DLKKIARGTPGFSGADLAN 361

Query: 175 LCREASLGPVRSIDLSRIDALDVRPISIDDFRDA 208
           L  EA+L   R          + + I++ D  +A
Sbjct: 362 LLNEAALLAARR---------NKKEITMRDIEEA 386



 Score = 52.3 bits (126), Expect = 2e-07
 Identities = 31/83 (37%), Positives = 46/83 (55%), Gaps = 7/83 (8%)

Query: 223 HQPSIIFIDEIDSLLCQRSDQE---NETSRRLKTEFLISLDGASTLDDDLVLVIGATNRP 279
           + P IIFIDEID++  QR       N+   +   + L+ +DG     ++ V+VI ATNRP
Sbjct: 241 NAPCIIFIDEIDAVGRQRGAGLGGGNDEREQTLNQLLVEMDGFGG--NEGVIVIAATNRP 298

Query: 280 QELDEAARR--RLVKRLYIPLPD 300
             LD A  R  R  +++ + LPD
Sbjct: 299 DVLDPALLRPGRFDRQILVELPD 321


>gnl|CDD|182683 PRK10733, hflB, ATP-dependent metalloprotease; Reviewed.
          Length = 644

 Score =  112 bits (282), Expect = 2e-27
 Identities = 72/194 (37%), Positives = 106/194 (54%), Gaps = 16/194 (8%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +L+ GPPGTGKTL+ K +A + K  FF IS S     + G G   VR +F  A    P I
Sbjct: 188 VLMVGPPGTGKTLLAKAIAGEAKVPFFTISGSDFVEMFVGVGASRVRDMFEQAKKAAPCI 247

Query: 61  IFIDEIDSLLCQRS-------DQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQ 113
           IFIDEID++  QR        D+  +T  ++    L+ +DG     ++ ++VI ATNRP 
Sbjct: 248 IFIDEIDAVGRQRGAGLGGGHDEREQTLNQM----LVEMDGFE--GNEGIIVIAATNRPD 301

Query: 114 ELDEAARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGAD 171
            LD A  R  R  +++ + LPD + R +I+   +R +  +  + D   + R T  FSGAD
Sbjct: 302 VLDPALLRPGRFDRQVVVGLPDVRGREQILKVHMRRVPLAPDI-DAAIIARGTPGFSGAD 360

Query: 172 MASLCREASLGPVR 185
           +A+L  EA+L   R
Sbjct: 361 LANLVNEAALFAAR 374



 Score = 44.3 bits (104), Expect = 6e-05
 Identities = 31/85 (36%), Positives = 47/85 (55%), Gaps = 15/85 (17%)

Query: 225 PSIIFIDEIDSLLCQRS-------DQENETSRRLKTEFLISLDGASTLDDDLVLVIGATN 277
           P IIFIDEID++  QR        D+  +T  ++    L+ +DG     ++ ++VI ATN
Sbjct: 245 PCIIFIDEIDAVGRQRGAGLGGGHDEREQTLNQM----LVEMDGFE--GNEGIIVIAATN 298

Query: 278 RPQELDEAARR--RLVKRLYIPLPD 300
           RP  LD A  R  R  +++ + LPD
Sbjct: 299 RPDVLDPALLRPGRFDRQVVVGLPD 323


>gnl|CDD|224144 COG1223, COG1223, Predicted ATPase (AAA+ superfamily) [General
           function prediction only].
          Length = 368

 Score =  109 bits (275), Expect = 2e-27
 Identities = 71/221 (32%), Positives = 107/221 (48%), Gaps = 22/221 (9%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +L +GPPGTGKT++ K +A + K     + A+ L  +  G+G + +  L+  A    P I
Sbjct: 154 VLFYGPPGTGKTMMAKALANEAKVPLLLVKATELIGEHVGDGARRIHELYERARKAAPCI 213

Query: 61  IFIDEIDSLLCQRSDQE-----NETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQEL 115
           +FIDE+D++   R  QE     +E    L TE    LDG    +++ V+ I ATNRP+ L
Sbjct: 214 VFIDELDAIALDRRYQELRGDVSEIVNALLTE----LDG--IKENEGVVTIAATNRPELL 267

Query: 116 DEAARRRLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVD-DIEEVGRMTTDFSGADMAS 174
           D A R R  + +   LP+++ R EI+    +     L VD D+  +   T   SG D+  
Sbjct: 268 DPAIRSRFEEEIEFKLPNDEERLEILEYYAK--KFPLPVDADLRYLAAKTKGMSGRDI-- 323

Query: 175 LCREASLGPVRSIDLSRIDALDVRPISIDDFRDALKTVRPS 215
             +E  L       L R  A D   +  +D   ALK  R  
Sbjct: 324 --KEKVLKTA----LHRAIAEDREKVEREDIEKALKKERKR 358



 Score = 50.2 bits (120), Expect = 7e-07
 Identities = 31/84 (36%), Positives = 47/84 (55%), Gaps = 11/84 (13%)

Query: 225 PSIIFIDEIDSLLCQRSDQE-----NETSRRLKTEFLISLDGASTLDDDLVLVIGATNRP 279
           P I+FIDE+D++   R  QE     +E    L TE    LDG    +++ V+ I ATNRP
Sbjct: 211 PCIVFIDELDAIALDRRYQELRGDVSEIVNALLTE----LDG--IKENEGVVTIAATNRP 264

Query: 280 QELDEAARRRLVKRLYIPLPDEQK 303
           + LD A R R  + +   LP++++
Sbjct: 265 ELLDPAIRSRFEEEIEFKLPNDEE 288


>gnl|CDD|214386 CHL00176, ftsH, cell division protein; Validated.
          Length = 638

 Score =  107 bits (270), Expect = 8e-26
 Identities = 76/203 (37%), Positives = 106/203 (52%), Gaps = 12/203 (5%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LL GPPGTGKTL+ K +A + +  FF IS S     + G G   VR LF  A  + P I
Sbjct: 219 VLLVGPPGTGKTLLAKAIAGEAEVPFFSISGSEFVEMFVGVGAARVRDLFKKAKENSPCI 278

Query: 61  IFIDEIDSLLCQRSDQ---ENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELDE 117
           +FIDEID++  QR       N+   +   + L  +DG        V+VI ATNR   LD 
Sbjct: 279 VFIDEIDAVGRQRGAGIGGGNDEREQTLNQLLTEMDGFKGNKG--VIVIAATNRVDILDA 336

Query: 118 AARR--RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASL 175
           A  R  R  +++ + LPD + R +I+    RN   S  V  +E + R T  FSGAD+A+L
Sbjct: 337 ALLRPGRFDRQITVSLPDREGRLDILKVHARNKKLSPDV-SLELIARRTPGFSGADLANL 395

Query: 176 CREASLGPVR----SIDLSRIDA 194
             EA++   R    +I +  ID 
Sbjct: 396 LNEAAILTARRKKATITMKEIDT 418



 Score = 45.0 bits (107), Expect = 4e-05
 Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 7/81 (8%)

Query: 225 PSIIFIDEIDSLLCQRSDQ---ENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQE 281
           P I+FIDEID++  QR       N+   +   + L  +DG        V+VI ATNR   
Sbjct: 276 PCIVFIDEIDAVGRQRGAGIGGGNDEREQTLNQLLTEMDGFKGNKG--VIVIAATNRVDI 333

Query: 282 LDEAARR--RLVKRLYIPLPD 300
           LD A  R  R  +++ + LPD
Sbjct: 334 LDAALLRPGRFDRQITVSLPD 354


>gnl|CDD|200312 TIGR03689, pup_AAA, proteasome ATPase.  In the Actinobacteria, as
           shown for Mycobacterium tuberculosis, some proteins are
           modified by ligation between an epsilon-amino group of a
           lysine side chain and the C-terminal carboxylate of the
           ubiquitin-like protein Pup. This modification leads to
           protein degradation by the archaeal-like proteasome
           found in the Actinobacteria. Members of this protein
           family belong to the AAA family of ATPases and tend to
           be clustered with the genes for Pup, the Pup ligase
           PafA, and structural components of the proteasome. This
           protein forms hexameric rings with ATPase activity
           [Protein fate, Degradation of proteins, peptides, and
           glycopeptides].
          Length = 512

 Score = 92.1 bits (229), Expect = 1e-20
 Identities = 63/166 (37%), Positives = 85/166 (51%), Gaps = 27/166 (16%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQC----------KATFFCISASTLTSKWYGEGEKMVRALF 50
           +LL+GPPG GKTLI K VA             K+ F  I    L +K+ GE E+ +R +F
Sbjct: 219 VLLYGPPGCGKTLIAKAVANSLAARIGAEGGGKSYFLNIKGPELLNKYVGETERQIRLIF 278

Query: 51  A----VASVHQPSIIFIDEIDSLLCQR-----SDQENETSRRLKTEFLISLDGASTLDDD 101
                 AS  +P I+F DE+DSL   R     SD E     +L  E    +DG  +LD+ 
Sbjct: 279 QRAREKASEGRPVIVFFDEMDSLFRTRGSGVSSDVETTVVPQLLAE----IDGVESLDN- 333

Query: 102 LVLVIGATNRPQELDEAARR--RLVKRLYIPLPDEQARCEIVTKLL 145
            V+VIGA+NR   +D A  R  RL  ++ I  PD +A  +I  K L
Sbjct: 334 -VIVIGASNREDMIDPAILRPGRLDVKIRIERPDAEAAADIFAKYL 378



 Score = 46.2 bits (110), Expect = 2e-05
 Identities = 33/85 (38%), Positives = 45/85 (52%), Gaps = 13/85 (15%)

Query: 225 PSIIFIDEIDSLLCQR-----SDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRP 279
           P I+F DE+DSL   R     SD E     +L  E    +DG  +LD+  V+VIGA+NR 
Sbjct: 290 PVIVFFDEMDSLFRTRGSGVSSDVETTVVPQLLAE----IDGVESLDN--VIVIGASNRE 343

Query: 280 QELDEAARR--RLVKRLYIPLPDEQ 302
             +D A  R  RL  ++ I  PD +
Sbjct: 344 DMIDPAILRPGRLDVKIRIERPDAE 368


>gnl|CDD|99707 cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of
           cellular Activities) superfamily represents an ancient
           group of ATPases belonging to the ASCE (for additional
           strand, catalytic E) division of the P-loop NTPase fold.
           The ASCE division also includes ABC, RecA-like,
           VirD4-like, PilT-like, and SF1/2 helicases. Members of
           the AAA+ ATPases function as molecular chaperons, ATPase
           subunits of proteases, helicases, or nucleic-acid
           stimulated ATPases. The AAA+ proteins contain several
           distinct features in addition to the conserved
           alpha-beta-alpha core domain structure and the Walker A
           and B motifs of the P-loop NTPases.
          Length = 151

 Score = 86.4 bits (214), Expect = 1e-20
 Identities = 50/139 (35%), Positives = 67/139 (48%), Gaps = 17/139 (12%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQC---KATFFCISASTLTSKWYGEGEK---MVRALFAVAS 54
           +LL+GPPGTGKT + + +A +     A F  ++AS L             +VR LF +A 
Sbjct: 22  LLLYGPPGTGKTTLARAIANELFRPGAPFLYLNASDLLEGLVVAELFGHFLVRLLFELAE 81

Query: 55  VHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRP-- 112
             +P ++FIDEIDSL              L        D     ++  V VIGATNRP  
Sbjct: 82  KAKPGVLFIDEIDSL-------SRGAQNALLRVLETLNDLRIDREN--VRVIGATNRPLL 132

Query: 113 QELDEAARRRLVKRLYIPL 131
            +LD A   RL  R+ IPL
Sbjct: 133 GDLDRALYDRLDIRIVIPL 151



 Score = 44.4 bits (105), Expect = 1e-05
 Identities = 29/78 (37%), Positives = 36/78 (46%), Gaps = 11/78 (14%)

Query: 223 HQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRP--Q 280
            +P ++FIDEIDSL              L        D     ++  V VIGATNRP   
Sbjct: 83  AKPGVLFIDEIDSL-------SRGAQNALLRVLETLNDLRIDREN--VRVIGATNRPLLG 133

Query: 281 ELDEAARRRLVKRLYIPL 298
           +LD A   RL  R+ IPL
Sbjct: 134 DLDRALYDRLDIRIVIPL 151


>gnl|CDD|214640 smart00382, AAA, ATPases associated with a variety of cellular
           activities.  AAA - ATPases associated with a variety of
           cellular activities. This profile/alignment only detects
           a fraction of this vast family. The poorly conserved
           N-terminal helix is missing from the alignment.
          Length = 148

 Score = 67.4 bits (164), Expect = 1e-13
 Identities = 38/151 (25%), Positives = 61/151 (40%), Gaps = 25/151 (16%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFC---ISASTLTS--------------KWYGEGE 43
           IL+ GPPG+GKT + + +A +          I    +                K  G GE
Sbjct: 5   ILIVGPPGSGKTTLARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVGGKKASGSGE 64

Query: 44  KMVRALFAVASVHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLV 103
             +R   A+A   +P ++ +DEI SLL    +        L+   L+  +         +
Sbjct: 65  LRLRLALALARKLKPDVLILDEITSLLDAEQEALLLLLEELRLLLLLKSEKN-------L 117

Query: 104 LVIGATNRPQELDEAA-RRRLVKRLYIPLPD 133
            VI  TN  ++L  A  RRR  +R+ + L  
Sbjct: 118 TVILTTNDEKDLGPALLRRRFDRRIVLLLIL 148



 Score = 39.3 bits (91), Expect = 6e-04
 Identities = 20/79 (25%), Positives = 34/79 (43%), Gaps = 8/79 (10%)

Query: 223 HQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQEL 282
            +P ++ +DEI SLL    +        L+   L+  +         + VI  TN  ++L
Sbjct: 77  LKPDVLILDEITSLLDAEQEALLLLLEELRLLLLLKSEKN-------LTVILTTNDEKDL 129

Query: 283 DEAA-RRRLVKRLYIPLPD 300
             A  RRR  +R+ + L  
Sbjct: 130 GPALLRRRFDRRIVLLLIL 148


>gnl|CDD|177094 CHL00195, ycf46, Ycf46; Provisional.
          Length = 489

 Score = 51.2 bits (123), Expect = 4e-07
 Identities = 47/175 (26%), Positives = 78/175 (44%), Gaps = 8/175 (4%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LL G  GTGK+L  K +A   +     +    L     GE E  +R +  +A    P I
Sbjct: 262 LLLVGIQGTGKSLTAKAIANDWQLPLLRLDVGKLFGGIVGESESRMRQMIRIAEALSPCI 321

Query: 61  IFIDEIDSLLCQ-RSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQELD-EA 118
           ++IDEID       S  ++ T+ R+   F+  L    +     V V+   N    L  E 
Sbjct: 322 LWIDEIDKAFSNSESKGDSGTTNRVLATFITWLSEKKS----PVFVVATANNIDLLPLEI 377

Query: 119 ARR-RLVKRLYIPLPDEQARCEIVTKLLRNI-TNSLTVDDIEEVGRMTTDFSGAD 171
            R+ R  +  ++ LP  + R +I    L+     S    DI+++ +++  FSGA+
Sbjct: 378 LRKGRFDEIFFLDLPSLEEREKIFKIHLQKFRPKSWKKYDIKKLSKLSNKFSGAE 432


>gnl|CDD|237355 PRK13342, PRK13342, recombination factor protein RarA; Reviewed.
          Length = 413

 Score = 50.5 bits (122), Expect = 6e-07
 Identities = 43/171 (25%), Positives = 64/171 (37%), Gaps = 69/171 (40%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVA----SVH 56
           ++L+GPPGTGKT + + +A    A F  +SA T        G K +R +   A    S  
Sbjct: 39  MILWGPPGTGKTTLARIIAGATDAPFEALSAVT-------SGVKDLREVIEEARQRRSAG 91

Query: 57  QPSIIFIDEI--------DSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGA 108
           + +I+FIDEI        D+LL                           ++D  + +IGA
Sbjct: 92  RRTILFIDEIHRFNKAQQDALL-------------------------PHVEDGTITLIGA 126

Query: 109 T--------NRP----------QELDEAARRRLVKR-------LYIPLPDE 134
           T        N            + L E    +L+KR         + L DE
Sbjct: 127 TTENPSFEVNPALLSRAQVFELKPLSEEDIEQLLKRALEDKERGLVELDDE 177


>gnl|CDD|232949 TIGR00382, clpX, endopeptidase Clp ATP-binding regulatory subunit
           (clpX).  A member of the ATP-dependent proteases, ClpX
           has ATP-dependent chaperone activity and is required for
           specific ATP-dependent proteolytic activities expressed
           by ClpPX. The gene is also found to be involved in
           stress tolerance in Bacillus subtilis and is essential
           for the efficient acquisition of genes specifying type
           IA and IB restriction [Protein fate, Protein folding and
           stabilization, Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 413

 Score = 46.3 bits (110), Expect = 1e-05
 Identities = 30/73 (41%), Positives = 38/73 (52%), Gaps = 6/73 (8%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWY-GEG-EKMVRALFAVA----S 54
           ILL GP G+GKTL+ + +A      F    A+TLT   Y GE  E ++  L   A     
Sbjct: 119 ILLIGPTGSGKTLLAQTLARILNVPFAIADATTLTEAGYVGEDVENILLKLLQAADYDVE 178

Query: 55  VHQPSIIFIDEID 67
             Q  II+IDEID
Sbjct: 179 KAQKGIIYIDEID 191


>gnl|CDD|225165 COG2256, MGS1, ATPase related to the helicase subunit of the
           Holliday junction resolvase [DNA replication,
           recombination, and repair].
          Length = 436

 Score = 42.6 bits (101), Expect = 2e-04
 Identities = 30/111 (27%), Positives = 52/111 (46%), Gaps = 28/111 (25%)

Query: 3   LFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVA----SVHQP 58
           L+GPPGTGKT + + +A    A F  +SA T        G K +R +   A     + + 
Sbjct: 53  LWGPPGTGKTTLARLIAGTTNAAFEALSAVT-------SGVKDLREIIEEARKNRLLGRR 105

Query: 59  SIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGAT 109
           +I+F+DEI           N+     + + L+       +++  +++IGAT
Sbjct: 106 TILFLDEIHRF--------NKA----QQDALLPH-----VENGTIILIGAT 139


>gnl|CDD|235422 PRK05342, clpX, ATP-dependent protease ATP-binding subunit ClpX;
           Provisional.
          Length = 412

 Score = 42.1 bits (100), Expect = 3e-04
 Identities = 32/78 (41%), Positives = 42/78 (53%), Gaps = 16/78 (20%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATF----FCIS-ASTLT-SKWYGEG-EKMVRALFAVA 53
           ILL GP G+GKTL+     AQ  A      F I+ A+TLT + + GE  E ++  L   A
Sbjct: 111 ILLIGPTGSGKTLL-----AQTLARILDVPFAIADATTLTEAGYVGEDVENILLKLLQAA 165

Query: 54  --SVH--QPSIIFIDEID 67
              V   Q  I++IDEID
Sbjct: 166 DYDVEKAQRGIVYIDEID 183


>gnl|CDD|223786 COG0714, COG0714, MoxR-like ATPases [General function prediction
           only].
          Length = 329

 Score = 41.3 bits (97), Expect = 4e-04
 Identities = 44/182 (24%), Positives = 72/182 (39%), Gaps = 32/182 (17%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCIS------ASTLTSKW--------YGEGEKMV 46
           +LL GPPG GKTL+ + +A      F  I        S L   +         GE   + 
Sbjct: 46  VLLEGPPGVGKTLLARALARALGLPFVRIQCTPDLLPSDLLGTYAYAALLLEPGEFRFVP 105

Query: 47  RALFAVASVHQPSIIFIDEIDSLLCQRSDQENETS--RRLKTEFLISLDGASTLD-DDLV 103
             LFA   V    I+ +DEI+     R+  E + +    L+ E  +++ G +T+      
Sbjct: 106 GPLFAAVRV----ILLLDEIN-----RAPPEVQNALLEALE-ERQVTVPGLTTIRLPPPF 155

Query: 104 LVIGATNRP-----QELDEAARRRLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIE 158
           +VI   N         L EA   R + R+Y+  PD +    I+   +  +        ++
Sbjct: 156 IVIATQNPGEYEGTYPLPEALLDRFLLRIYVDYPDSEEEERIILARVGGVDELDLESLVK 215

Query: 159 EV 160
            V
Sbjct: 216 PV 217


>gnl|CDD|224140 COG1219, ClpX, ATP-dependent protease Clp, ATPase subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 408

 Score = 41.0 bits (97), Expect = 6e-04
 Identities = 29/74 (39%), Positives = 40/74 (54%), Gaps = 8/74 (10%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWY-GEG-----EKMVRAL-FAVA 53
           ILL GP G+GKTL+ + +A      F    A+TLT   Y GE       K+++A  + V 
Sbjct: 100 ILLIGPTGSGKTLLAQTLAKILNVPFAIADATTLTEAGYVGEDVENILLKLLQAADYDVE 159

Query: 54  SVHQPSIIFIDEID 67
              +  II+IDEID
Sbjct: 160 RA-ERGIIYIDEID 172


>gnl|CDD|213179 cd00267, ABC_ATPase, ATP-binding cassette transporter
           nucleotide-binding domain.  ABC transporters are a large
           family of proteins involved in the transport of a wide
           variety of different compounds, like sugars, ions,
           peptides, and more complex organic molecules. The
           nucleotide-binding domain shows the highest similarity
           between all members of the family. ABC transporters are
           a subset of nucleotide hydrolases that contain a
           signature motif, Q-loop, and H-loop/switch region, in
           addition to, the Walker A motif/P-loop and Walker B
           motif commonly found in a number of ATP- and GTP-binding
           and hydrolyzing proteins.
          Length = 157

 Score = 38.0 bits (89), Expect = 0.002
 Identities = 29/139 (20%), Positives = 50/139 (35%), Gaps = 29/139 (20%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQC-----KATFFCISASTLTSKWYGE----------GEKM 45
           + L GP G+GK+ + + +A        +        + L  +              G++ 
Sbjct: 28  VALVGPNGSGKSTLLRAIAGLLKPTSGEILIDGKDIAKLPLEELRRRIGYVPQLSGGQRQ 87

Query: 46  VRALFAVASVHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLV 105
            R   A A +  P ++ +DE  S L    D  +    RL        +   T       V
Sbjct: 88  -RVALARALLLNPDLLLLDEPTSGL----DPASR--ERLLELLRELAEEGRT-------V 133

Query: 106 IGATNRPQELDEAARRRLV 124
           I  T+ P+  + AA R +V
Sbjct: 134 IIVTHDPELAELAADRVIV 152



 Score = 27.6 bits (62), Expect = 6.1
 Identities = 17/71 (23%), Positives = 27/71 (38%), Gaps = 13/71 (18%)

Query: 221 FVHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQ 280
            +  P ++ +DE  S L    D  +    RL        +   T       VI  T+ P+
Sbjct: 95  LLLNPDLLLLDEPTSGL----DPASR--ERLLELLRELAEEGRT-------VIIVTHDPE 141

Query: 281 ELDEAARRRLV 291
             + AA R +V
Sbjct: 142 LAELAADRVIV 152


>gnl|CDD|131948 TIGR02902, spore_lonB, ATP-dependent protease LonB.  Members of
           this protein are LonB, a paralog of the ATP-dependent
           protease La (LonA, TIGR00763). LonB proteins are found
           strictly, and almost universally, in endospore-forming
           bacteria. This protease was shown, in Bacillus subtilis,
           to be expressed specifically in the forespore, during
           sporulation, under control of sigma(F). The lonB gene,
           despite location immediately upstream of lonA, was shown
           to be monocistronic. LonB appears able to act on
           sigma(H) for post-translation control, but lonB mutation
           did not produce an obvious sporulation defect under the
           conditions tested. Note that additional paralogs of LonA
           and LonB occur in the Clostridium lineage and this model
           selects only one per species as the protein that
           corresponds to LonB in B. subtilis [Protein fate,
           Degradation of proteins, peptides, and glycopeptides,
           Cellular processes, Sporulation and germination].
          Length = 531

 Score = 37.8 bits (88), Expect = 0.007
 Identities = 57/245 (23%), Positives = 99/245 (40%), Gaps = 56/245 (22%)

Query: 1   ILLFGPPGTGKTLIGKCV---AAQC-------KATFFCISASTLTSKWYGEGEKMVRAL- 49
           ++++GPPG GKT   + V   A +         A F  I A+T      G  + ++ ++ 
Sbjct: 89  VIIYGPPGVGKTAAARLVLEEAKKNPASPFKEGAAFVEIDATTARFDERGIADPLIGSVH 148

Query: 50  ---------FAVASVHQPS----------IIFIDEIDSLLCQRSDQENETSRRLKTEFLI 90
                      +A + QP           ++FIDEI  L      Q N+  + L+    +
Sbjct: 149 DPIYQGAGPLGIAGIPQPKPGAVTRAHGGVLFIDEIGEL---HPVQMNKLLKVLEDR-KV 204

Query: 91  SLDGASTLDDDLVL-----------------VIGATNR-PQELDEAARRRLVKRLYIPLP 132
            LD A    ++  +                 +IGAT R P+E+  A R R V+  + PL 
Sbjct: 205 FLDSAYYNSENPNIPSHIHDIFQNGLPADFRLIGATTRNPEEIPPALRSRCVEIFFRPLL 264

Query: 133 DEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASLCREASLGPVRSIDLSRI 192
           DE+ + EI       I  +L    +E + +  +  +G +  ++ + A+ G        RI
Sbjct: 265 DEEIK-EIAKNAAEKIGINLEKHALELIVKYAS--NGREAVNIVQLAA-GIALGEGRKRI 320

Query: 193 DALDV 197
            A D+
Sbjct: 321 LAEDI 325


>gnl|CDD|188437 TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA.  This
           model represents the AAA family ATPase, EccA, of the
           actinobacterial flavor of type VII secretion systems.
           Species such as Mycobacterium tuberculosis have several
           instances of this system per genome, designated EccA1,
           EccA2, etc [Protein fate, Protein and peptide secretion
           and trafficking].
          Length = 557

 Score = 37.9 bits (88), Expect = 0.008
 Identities = 40/148 (27%), Positives = 62/148 (41%), Gaps = 21/148 (14%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQC-------KATFFCISASTLTSKWYGEGEKMVRALFAVA 53
           +L  GPPGTGKT I + VA          K     +S + L  ++ GE E     +   A
Sbjct: 315 MLFAGPPGTGKTTIARVVAKIYCGLGVLRKPLVREVSRADLIGQYIGESEAKTNEIIDSA 374

Query: 54  SVHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQ 113
                 ++F+DE  +L+     Q++          L  ++     D D ++VIGA  R  
Sbjct: 375 ---LGGVLFLDEAYTLVETGYGQKDPFGLEAIDTLLARMEN----DRDRLVVIGAGYRKD 427

Query: 114 -----ELDEAARRRLVKRLYIP--LPDE 134
                E++E  R R  + +  P   PDE
Sbjct: 428 LDKFLEVNEGLRSRFTRVIEFPSYSPDE 455


>gnl|CDD|219538 pfam07728, AAA_5, AAA domain (dynein-related subfamily).  This Pfam
           entry includes some of the AAA proteins not detected by
           the pfam00004 model.
          Length = 135

 Score = 35.0 bits (81), Expect = 0.018
 Identities = 31/137 (22%), Positives = 45/137 (32%), Gaps = 17/137 (12%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQ-CKATFFCI------SASTLTSKW-YGEGEKMVRALFAV 52
           +LL GPPGTGK+ + + +AA       F +      +   L  +     G         V
Sbjct: 2   VLLVGPPGTGKSELAERLAAALSNRPVFYVQLTRDTTEEDLKGRRNIDPGGASWVDGPLV 61

Query: 53  ASVHQPSIIFIDEIDSLLCQRSDQENETSRRL--KTEFLISLDGASTLDDDLVLVIGATN 110
            +  +  I  +DEI+       D  N     L  +   L           D   +I   N
Sbjct: 62  RAAREGEIAVLDEINRA---NPDVLNSLLSLLDERRLLLPEGGELVKAAPDGFRLIATMN 118

Query: 111 RP----QELDEAARRRL 123
                  EL  A R R 
Sbjct: 119 PLDRGLNELSPALRSRF 135


>gnl|CDD|129721 TIGR00635, ruvB, Holliday junction DNA helicase, RuvB subunit.
          All proteins in this family for which functions are
          known are 5'-3' DNA helicases that, as part of a
          complex with RuvA homologs serve as a 5'-3' Holliday
          junction helicase. RuvA specifically binds Holliday
          junctions as a sandwich of two tetramers and maintains
          the configuration of the junction. It forms a complex
          with two hexameric rings of RuvB, the subunit that
          contains helicase activity. The complex drives
          ATP-dependent branch migration of the Holliday junction
          recombination intermediate. The endonuclease RuvC
          resolves junctions [DNA metabolism, DNA replication,
          recombination, and repair].
          Length = 305

 Score = 35.7 bits (83), Expect = 0.022
 Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 8/69 (11%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
          +LL+GPPG GKT +   +A +        S   L        EK       + ++ +  +
Sbjct: 33 LLLYGPPGLGKTTLAHIIANEMGVNLKITSGPAL--------EKPGDLAAILTNLEEGDV 84

Query: 61 IFIDEIDSL 69
          +FIDEI  L
Sbjct: 85 LFIDEIHRL 93


>gnl|CDD|234619 PRK00080, ruvB, Holliday junction DNA helicase RuvB; Reviewed.
          Length = 328

 Score = 35.5 bits (83), Expect = 0.031
 Identities = 22/70 (31%), Positives = 32/70 (45%), Gaps = 10/70 (14%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQP-S 59
           +LL+GPPG GKT +   +A +        S   L        EK    L A+ +  +   
Sbjct: 54  VLLYGPPGLGKTTLANIIANEMGVNIRITSGPAL--------EK-PGDLAAILTNLEEGD 104

Query: 60  IIFIDEIDSL 69
           ++FIDEI  L
Sbjct: 105 VLFIDEIHRL 114


>gnl|CDD|225164 COG2255, RuvB, Holliday junction resolvasome, helicase subunit [DNA
           replication, recombination, and repair].
          Length = 332

 Score = 35.2 bits (82), Expect = 0.042
 Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 8/66 (12%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQPSI 60
           +LLFGPPG GKT +   +A +        S   L        EK       + ++ +  +
Sbjct: 55  VLLFGPPGLGKTTLAHIIANELGVNLKITSGPAL--------EKPGDLAAILTNLEEGDV 106

Query: 61  IFIDEI 66
           +FIDEI
Sbjct: 107 LFIDEI 112


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 35.0 bits (81), Expect = 0.061
 Identities = 22/83 (26%), Positives = 33/83 (39%), Gaps = 20/83 (24%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGE--------------KMV 46
           + L GPPG GKT +GK +A      F   S   +      E E              +++
Sbjct: 350 LCLVGPPGVGKTSLGKSIAKALNRKFVRFSLGGVRD----EAEIRGHRRTYVGAMPGRII 405

Query: 47  RALFAVASVHQPSIIFIDEIDSL 69
           + L   A    P +  +DEID +
Sbjct: 406 QGL-KKAKTKNP-LFLLDEIDKI 426


>gnl|CDD|238979 cd02021, GntK, Gluconate kinase (GntK) catalyzes the phosphoryl
          transfer from ATP to gluconate. The resulting product
          gluconate-6-phoshate is an important precursor of
          gluconate metabolism. GntK acts as a dimmer composed of
          two identical subunits.
          Length = 150

 Score = 33.4 bits (77), Expect = 0.065
 Identities = 10/27 (37%), Positives = 17/27 (62%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQCKATFF 27
          I++ G  G+GK+ +GK +A +  A F 
Sbjct: 2  IVVMGVSGSGKSTVGKALAERLGAPFI 28


>gnl|CDD|219536 pfam07724, AAA_2, AAA domain (Cdc48 subfamily).  This Pfam entry
          includes some of the AAA proteins not detected by the
          pfam00004 model.
          Length = 168

 Score = 33.7 bits (78), Expect = 0.069
 Identities = 21/84 (25%), Positives = 30/84 (35%), Gaps = 21/84 (25%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQ---CKATFFCISASTLTSKW--------------YGEGE 43
           L  GP G GKT + K +A      +     I  S    +               Y EG 
Sbjct: 6  FLFLGPTGVGKTELAKALAELLFGDERALIRIDMSEYMEEHSVSRLIGAPPGYVGYEEGG 65

Query: 44 KMVRALFAVASVHQPSIIFIDEID 67
          ++  A+         SI+ IDEI+
Sbjct: 66 QLTEAV----RRKPYSIVLIDEIE 85


>gnl|CDD|222104 pfam13401, AAA_22, AAA domain. 
          Length = 124

 Score = 32.6 bits (75), Expect = 0.089
 Identities = 16/85 (18%), Positives = 30/85 (35%), Gaps = 16/85 (18%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQ-CKATFFCISASTLTS---------------KWYGEGEK 44
           +L G  G+GKT + + +A Q        + A +L +                  G   +
Sbjct: 7  GVLTGESGSGKTTLLRRLARQLPNRRVVYVEAPSLGTPKDLLRKILRALGLPLSGGTTAE 66

Query: 45 MVRALFAVASVHQPSIIFIDEIDSL 69
          ++ A+          ++ IDE   L
Sbjct: 67 LLEAILDALKRRGRPLLIIDEAQHL 91


>gnl|CDD|144608 pfam01078, Mg_chelatase, Magnesium chelatase, subunit ChlI.
          Magnesium-chelatase is a three-component enzyme that
          catalyzes the insertion of Mg2+ into protoporphyrin IX.
          This is the first unique step in the synthesis of
          (bacterio)chlorophyll. Due to this, it is thought that
          Mg-chelatase has an important role in channelling
          inter- mediates into the (bacterio)chlorophyll branch
          in response to conditions suitable for photosynthetic
          growth. ChlI and BchD have molecular weight between
          38-42 kDa.
          Length = 207

 Score = 33.3 bits (77), Expect = 0.10
 Identities = 9/17 (52%), Positives = 14/17 (82%)

Query: 1  ILLFGPPGTGKTLIGKC 17
          +L+ GPPG+GKT++ K 
Sbjct: 25 LLMIGPPGSGKTMLAKR 41


>gnl|CDD|129465 TIGR00368, TIGR00368, Mg chelatase-related protein.  The N-terminal
           end matches very strongly a pfam Mg_chelatase domain
           [Unknown function, General].
          Length = 499

 Score = 33.7 bits (77), Expect = 0.13
 Identities = 10/18 (55%), Positives = 15/18 (83%)

Query: 1   ILLFGPPGTGKTLIGKCV 18
           +LLFGPPG+GKT++   +
Sbjct: 214 LLLFGPPGSGKTMLASRL 231


>gnl|CDD|238260 cd00464, SK, Shikimate kinase (SK) is the fifth enzyme in the
          shikimate pathway, a seven-step biosynthetic pathway
          which converts erythrose-4-phosphate to chorismic acid,
          found in bacteria, fungi and plants. Chorismic acid is
          a important intermediate in the synthesis of aromatic
          compounds, such as aromatic amino acids, p-aminobenzoic
          acid, folate and ubiquinone. Shikimate kinase catalyses
          the phosphorylation of the 3-hydroxyl group of shikimic
          acid using ATP.
          Length = 154

 Score = 32.5 bits (75), Expect = 0.15
 Identities = 10/27 (37%), Positives = 14/27 (51%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQCKATFF 27
          I+L G  G GKT +G+ +A      F 
Sbjct: 2  IVLIGMMGAGKTTVGRLLAKALGLPFV 28


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 33.7 bits (78), Expect = 0.17
 Identities = 16/33 (48%), Positives = 19/33 (57%), Gaps = 6/33 (18%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKAT---FFCIS 30
           + L GPPG GKT +GK +A   KA    F  IS
Sbjct: 353 LCLVGPPGVGKTSLGKSIA---KALGRKFVRIS 382


>gnl|CDD|204202 pfam09336, Vps4_C, Vps4 C terminal oligomerisation domain.  This
           domain is found at the C terminal of ATPase proteins
           involved in vacuolar sorting. It forms an alpha helix
           structure and is required for oligomerisation.
          Length = 62

 Score = 29.9 bits (68), Expect = 0.24
 Identities = 9/22 (40%), Positives = 15/22 (68%)

Query: 199 PISIDDFRDALKTVRPSVCQAD 220
           P+++ DF  AL T +P+V + D
Sbjct: 29  PLTMKDFIKALATTKPTVSKDD 50


>gnl|CDD|222165 pfam13481, AAA_25, AAA domain.  This AAA domain is found in a wide
           variety of presumed DNA repair proteins.
          Length = 154

 Score = 31.9 bits (73), Expect = 0.26
 Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 10/80 (12%)

Query: 1   ILLFGPPGTGKT--LIGKCVAAQCKATFFCISASTLTSKWYGEGE--------KMVRALF 50
            LL G PGTGK+   +    A      F            Y +GE        ++     
Sbjct: 36  TLLAGAPGTGKSTLALDLAAAVATGRPFLGPFPVEPGRVLYLDGEDSEAGLRRRLRALGE 95

Query: 51  AVASVHQPSIIFIDEIDSLL 70
           A+  +  P ++ ID + SLL
Sbjct: 96  ALEEIEGPDLVVIDPLASLL 115


>gnl|CDD|213527 TIGR00390, hslU, ATP-dependent protease HslVU, ATPase subunit.
           This model represents the ATPase subunit of HslVU, while
           the proteasome-related peptidase subunit is HslV.
           Residues 54-61 of the model contain a P-loop ATP-binding
           motif. Cys-287 of E. coli (position 308 in the seed
           alignment), studied in MEDLINE:98389714, is Ser in other
           members of the seed alignment [Protein fate, Protein
           folding and stabilization].
          Length = 441

 Score = 32.5 bits (74), Expect = 0.31
 Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 2/55 (3%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWY--GEGEKMVRALFAVA 53
           IL+ GP G GKT I + +A    A F  + A+  T   Y   + E MVR L  +A
Sbjct: 50  ILMIGPTGVGKTEIARRLAKLANAPFIKVEATKFTEVGYVGRDVESMVRDLVDIA 104



 Score = 28.6 bits (64), Expect = 5.6
 Identities = 33/121 (27%), Positives = 50/121 (41%), Gaps = 31/121 (25%)

Query: 51  AVASVHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEFLISLDGAST-------LDDDLV 103
           A+  V Q  IIFIDEID +  +      + SR      L+ +   ST       +  D +
Sbjct: 241 AIDRVEQSGIIFIDEIDKIAKKGESSGADVSREGVQRDLLPIVEGSTVNTKYGSVKTDHI 300

Query: 104 LVI--GATN--RPQELDEAARRRLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEE 159
           L I  GA +  +P +L               +P+ Q R  I  +L      +LT+DD E 
Sbjct: 301 LFIAAGAFHLAKPSDL---------------IPELQGRFPIRVEL-----KALTIDDFER 340

Query: 160 V 160
           +
Sbjct: 341 I 341


>gnl|CDD|187579 cd05271, NDUFA9_like_SDR_a, NADH dehydrogenase (ubiquinone) 1 alpha
           subcomplex, subunit 9, 39 kDa, (NDUFA9) -like, atypical
           (a) SDRs.  This subgroup of extended SDR-like proteins
           are atypical SDRs. They have a glycine-rich
           NAD(P)-binding motif similar to the typical SDRs,
           GXXGXXG, and have the YXXXK active site motif (though
           not the other residues of the SDR tetrad). Members
           identified include NDUFA9 (mitochondrial) and putative
           nucleoside-diphosphate-sugar epimerase. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Atypical SDRs include
           biliverdin IX beta reductase (BVR-B,aka flavin
           reductase), NMRa (a negative transcriptional regulator
           of various fungi), progesterone 5-beta-reductase like
           proteins, phenylcoumaran benzylic ether and
           pinoresinol-lariciresinol reductases, phenylpropene
           synthases, eugenol synthase, triphenylmethane reductase,
           isoflavone reductases, and others. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. In addition to the
           Rossmann fold core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids, and typically have a
           TGXXGXXG cofactor binding motif. Complex (multidomain)
           SDRs such as ketoreductase domains of fatty acid
           synthase have a GGXGXXG NAD(P)-binding motif and an
           altered active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 273

 Score = 32.2 bits (74), Expect = 0.32
 Identities = 14/32 (43%), Positives = 20/32 (62%), Gaps = 2/32 (6%)

Query: 34  LTSKWYGEGEKMVRALFAVASVHQPSIIFIDE 65
           L SK   EGE+ VR  F  A++ +PS++F  E
Sbjct: 127 LRSK--AEGEEAVREAFPEATIVRPSVVFGRE 156


>gnl|CDD|222866 PHA02544, 44, clamp loader, small subunit; Provisional.
          Length = 316

 Score = 32.3 bits (74), Expect = 0.37
 Identities = 10/32 (31%), Positives = 16/32 (50%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQCKATFFCISAS 32
          +L    PGTGKT + K +  +  A    ++ S
Sbjct: 46 LLHSPSPGTGKTTVAKALCNEVGAEVLFVNGS 77


>gnl|CDD|224401 COG1484, DnaC, DNA replication protein [DNA replication,
           recombination, and repair].
          Length = 254

 Score = 32.0 bits (73), Expect = 0.38
 Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 11/73 (15%)

Query: 1   ILLFGPPGTGKTL----IGKCVAAQCKATFFCISASTLTSKW---YGEGEKMVRALFAVA 53
           ++L GPPG GKT     IG  +     +  F I+A  L SK    + EG    + L  + 
Sbjct: 108 LVLLGPPGVGKTHLAIAIGNELLKAGISVLF-ITAPDLLSKLKAAFDEGRLEEKLLRELK 166

Query: 54  SVHQPSIIFIDEI 66
            V    ++ ID+I
Sbjct: 167 KV---DLLIIDDI 176


>gnl|CDD|236153 PRK08116, PRK08116, hypothetical protein; Validated.
          Length = 268

 Score = 31.5 bits (72), Expect = 0.55
 Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 2   LLFGPPGTGKTLIGKCVA 19
           LL+G  GTGKT +  C+A
Sbjct: 118 LLWGSVGTGKTYLAACIA 135


>gnl|CDD|237354 PRK13341, PRK13341, recombination factor protein RarA/unknown
           domain fusion protein; Reviewed.
          Length = 725

 Score = 31.9 bits (73), Expect = 0.58
 Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 16/72 (22%)

Query: 2   LLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQP--- 58
           +L+GPPG GKT + + +A   +A F  ++A  L       G K +RA   V    +    
Sbjct: 56  ILYGPPGVGKTTLARIIANHTRAHFSSLNA-VLA------GVKDLRA--EVDRAKERLER 106

Query: 59  ----SIIFIDEI 66
               +I+FIDE+
Sbjct: 107 HGKRTILFIDEV 118


>gnl|CDD|224142 COG1221, PspF, Transcriptional regulators containing an AAA-type
           ATPase domain and a DNA-binding domain [Transcription /
           Signal transduction mechanisms].
          Length = 403

 Score = 31.2 bits (71), Expect = 0.70
 Identities = 18/83 (21%), Positives = 30/83 (36%), Gaps = 21/83 (25%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQC-------KATFFC--ISASTLTSKWYGEGEKMVRALFA 51
           +L+ G  GTGK L  + + A            F C   S +   ++ +G      +  F 
Sbjct: 104 VLIIGETGTGKELFARLIHALSARRAEAPFIAFNCAAYSENLQEAELFG----HEKGAFT 159

Query: 52  VASVHQPSII--------FIDEI 66
            A   +  +         F+DEI
Sbjct: 160 GAQGGKAGLFEQANGGTLFLDEI 182


>gnl|CDD|236828 PRK11034, clpA, ATP-dependent Clp protease ATP-binding subunit;
           Provisional.
          Length = 758

 Score = 31.3 bits (71), Expect = 0.81
 Identities = 25/81 (30%), Positives = 36/81 (44%), Gaps = 12/81 (14%)

Query: 2   LLFGPPGTGKTLIGKCVA------------AQCKATFFCISASTLTSKWYGEGEKMVRAL 49
           LL G  G GKT I + +A            A C      I +    +K+ G+ EK  +AL
Sbjct: 211 LLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKAL 270

Query: 50  FAVASVHQPSIIFIDEIDSLL 70
                    SI+FIDEI +++
Sbjct: 271 LKQLEQDTNSILFIDEIHTII 291


>gnl|CDD|233959 TIGR02639, ClpA, ATP-dependent Clp protease ATP-binding subunit
           clpA.  [Protein fate, Degradation of proteins, peptides,
           and glycopeptides].
          Length = 730

 Score = 31.1 bits (71), Expect = 0.82
 Identities = 24/77 (31%), Positives = 36/77 (46%), Gaps = 12/77 (15%)

Query: 2   LLFGPPGTGKTLI----------GKCVAAQCKATFFCISASTLT--SKWYGEGEKMVRAL 49
           LL G PG GKT I          GK       A  + +   TL   +K+ G+ E+ ++A+
Sbjct: 206 LLVGEPGVGKTAIVEGLALRIAEGKVPERLKNAKIYSLDMGTLLAGTKYRGDFEERLKAV 265

Query: 50  FAVASVHQPSIIFIDEI 66
            +       +I+FIDEI
Sbjct: 266 VSEIEKEPNAILFIDEI 282


>gnl|CDD|233355 TIGR01313, therm_gnt_kin, carbohydrate kinase, thermoresistant
          glucokinase family.  This model represents a subfamily
          of proteins that includes thermoresistant and
          thermosensitve isozymes of gluconate kinase
          (gluconokinase) in E. coli and other related proteins;
          members of this family are often named by similarity to
          the thermostable isozyme. These proteins show homology
          to shikimate kinases and adenylate kinases but not to
          gluconate kinases from the FGGY family of carbohydrate
          kinases.
          Length = 163

 Score = 30.1 bits (68), Expect = 0.97
 Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQCKATF 26
           +L G  G+GK+ I   +A +  A F
Sbjct: 1  FVLMGVAGSGKSTIASALAHRLGAKF 26


>gnl|CDD|234053 TIGR02903, spore_lon_C, ATP-dependent protease, Lon family.
           Members of this protein family resemble the widely
           distributed ATP-dependent protease La, also called Lon
           and LonA. It resembles even more closely LonB, which is
           a LonA paralog found in genomes if and only if the
           species is capable of endospore formation (as in
           Bacillus subtilis, Clostridium tetani, and select other
           members of the Firmicutes) and expressed specifically in
           the forespore compartment. Members of this family are
           restricted to a subset of spore-forming species, and are
           very likely to participate in the program of endospore
           formation. We propose the designation LonC [Protein
           fate, Degradation of proteins, peptides, and
           glycopeptides, Cellular processes, Sporulation and
           germination].
          Length = 615

 Score = 30.9 bits (70), Expect = 1.1
 Identities = 56/219 (25%), Positives = 80/219 (36%), Gaps = 75/219 (34%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCK----------ATFFCISASTLTSKWYGEGEKMVRALF 50
           I+L+GPPG GKT   +    + K          A F  +  +TL  +W  +  ++   L 
Sbjct: 178 IILYGPPGVGKTTAARLALEEAKKLKHTPFAEDAPFVEVDGTTL--RW--DPREVTNPL- 232

Query: 51  AVASVHQP---------------------------SIIFIDEI---DSLLCQR------- 73
            + SVH P                            ++FIDEI   D LL  +       
Sbjct: 233 -LGSVHDPIYQGARRDLAETGVPEPKTGLVTDAHGGVLFIDEIGELDPLLQNKLLKVLED 291

Query: 74  -----------SDQENETSRRLKTEFLISLDGASTLDDDLVLVIGATNR-PQELDEAARR 121
                       D  N   + +K  F    +GA     D VL IGAT R P+E++ A R 
Sbjct: 292 KRVEFSSSYYDPDDPN-VPKYIKKLF---EEGAPA---DFVL-IGATTRDPEEINPALRS 343

Query: 122 RLVKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEV 160
           R  +  + PL  E     IV      I N      +EE+
Sbjct: 344 RCAEVFFEPLTPEDIA-LIVLNAAEKI-NVHLAAGVEEL 380


>gnl|CDD|213190 cd03223, ABCD_peroxisomal_ALDP, ATP-binding cassette domain of
           peroxisomal transporter, subfamily D.  Peroxisomal
           ATP-binding cassette transporter (Pat) is involved in
           the import of very long-chain fatty acids (VLCFA) into
           the peroxisome. The peroxisomal membrane forms a
           permeability barrier for a wide variety of metabolites
           required for and formed during fatty acid
           beta-oxidation. To communicate with the cytoplasm and
           mitochondria, peroxisomes need dedicated proteins to
           transport such hydrophilic molecules across their
           membranes. X-linked adrenoleukodystrophy (X-ALD) is
           caused by mutations in the ALD gene, which encodes ALDP
           (adrenoleukodystrophy protein ), a peroxisomal integral
           membrane protein that is a member of the ATP-binding
           cassette (ABC) transporter protein family. The disease
           is characterized by a striking and unpredictable
           variation in phenotypic expression. Phenotypes include
           the rapidly progressive childhood cerebral form (CCALD),
           the milder adult form, adrenomyeloneuropathy (AMN), and
           variants without neurologic involvement (i.e.
           asymptomatic).
          Length = 166

 Score = 29.8 bits (68), Expect = 1.1
 Identities = 26/107 (24%), Positives = 41/107 (38%), Gaps = 37/107 (34%)

Query: 1   ILLFGPPGTGKTLI------------GKCVAAQCKATFFC-----ISASTLTSK----WY 39
           +L+ GP GTGK+ +            G+    + +   F      +   TL  +    W 
Sbjct: 30  LLITGPSGTGKSSLFRALAGLWPWGSGRIGMPEGEDLLFLPQRPYLPLGTLREQLIYPWD 89

Query: 40  ---GEGEK----MVRALFAVASVHQPSIIFIDEIDSLLCQRSDQENE 79
                GE+      R L      H+P  +F+DE  S L    D+E+E
Sbjct: 90  DVLSGGEQQRLAFARLLL-----HKPKFVFLDEATSAL----DEESE 127


>gnl|CDD|223272 COG0194, Gmk, Guanylate kinase [Nucleotide transport and
          metabolism].
          Length = 191

 Score = 30.2 bits (69), Expect = 1.1
 Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 1/33 (3%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQCKATFFCISAST 33
          I+L GP G GK+ + K +        F +SA+T
Sbjct: 7  IVLSGPSGVGKSTLVKALLED-DKLRFSVSATT 38


>gnl|CDD|238540 cd01120, RecA-like_NTPases, RecA-like NTPases. This family
          includes the NTP binding domain of F1 and V1 H+ATPases,
          DnaB and related helicases as well as bacterial RecA
          and related eukaryotic and archaeal recombinases. This
          group also includes bacterial conjugation proteins and
          related DNA transfer proteins involved in type II and
          type IV secretion.
          Length = 165

 Score = 29.8 bits (67), Expect = 1.2
 Identities = 21/84 (25%), Positives = 31/84 (36%), Gaps = 9/84 (10%)

Query: 1  ILLFGPPGTGKT-----LIGKCVAAQCKATFFCI--SASTLTSKWYGEGEK--MVRALFA 51
          IL+FGP G+GKT     L         K  +  I      LT +  GE  K  +   +  
Sbjct: 2  ILVFGPTGSGKTTLALQLALNIATKGGKVVYVDIEEEIEELTERLIGESLKGALDNLIIV 61

Query: 52 VASVHQPSIIFIDEIDSLLCQRSD 75
           A+   P+   +      L +R  
Sbjct: 62 FATADDPAAARLLSKAERLRERGG 85


>gnl|CDD|215651 pfam00006, ATP-synt_ab, ATP synthase alpha/beta family,
          nucleotide-binding domain.  This family includes the
          ATP synthase alpha and beta subunits, the ATP synthase
          associated with flagella and the termination factor
          Rho.
          Length = 213

 Score = 30.1 bits (69), Expect = 1.2
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQCKATFFCI 29
          I +FG  GTGKT++   +A   KA    +
Sbjct: 18 IGIFGGSGTGKTVLLGMIARNAKADVVEV 46


>gnl|CDD|222163 pfam13479, AAA_24, AAA domain.  This AAA domain is found in a wide
           variety of presumed phage proteins.
          Length = 201

 Score = 29.9 bits (68), Expect = 1.2
 Identities = 15/52 (28%), Positives = 23/52 (44%), Gaps = 8/52 (15%)

Query: 186 SIDLSRIDALDVRPISIDDFRDALKTVRPSVCQADFVHQPSIIFIDEIDSLL 237
           +  LSR D +D+R  S  D RD L  +       + + +   + ID I  L 
Sbjct: 37  TKVLSRGDNVDIR--SWQDLRDFLDELA------EDLAEYDTLVIDTITKLE 80



 Score = 29.9 bits (68), Expect = 1.5
 Identities = 20/77 (25%), Positives = 30/77 (38%), Gaps = 9/77 (11%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYGEG------EKMVRALFAVAS 54
          +L++GPPG GKT + K      K  F  + A T      G+       + +   L  +A 
Sbjct: 6  VLIYGPPGIGKTSLAK--TLPPKTLFLDLDAGTTKVLSRGDNVDIRSWQDLRDFLDELAE 63

Query: 55 VHQP-SIIFIDEIDSLL 70
                 + ID I  L 
Sbjct: 64 DLAEYDTLVIDTITKLE 80


>gnl|CDD|223679 COG0606, COG0606, Predicted ATPase with chaperone activity
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 490

 Score = 30.3 bits (69), Expect = 1.5
 Identities = 10/17 (58%), Positives = 13/17 (76%)

Query: 1   ILLFGPPGTGKTLIGKC 17
           +LL GPPGTGKT++   
Sbjct: 201 LLLVGPPGTGKTMLASR 217


>gnl|CDD|216186 pfam00910, RNA_helicase, RNA helicase.  This family includes RNA
          helicases thought to be involved in duplex unwinding
          during viral RNA replication. Members of this family
          are found in a variety of single stranded RNA viruses.
          Length = 105

 Score = 28.7 bits (65), Expect = 1.6
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 1  ILLFGPPGTGKTLIGK 16
          I L+GPPG GK+ + K
Sbjct: 1  IWLYGPPGCGKSTLAK 16


>gnl|CDD|224141 COG1220, HslU, ATP-dependent protease HslVU (ClpYQ), ATPase subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 444

 Score = 30.3 bits (69), Expect = 1.7
 Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 2/55 (3%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWY--GEGEKMVRALFAVA 53
           IL+ GP G GKT I + +A    A F  + A+  T   Y   + E ++R L  +A
Sbjct: 53  ILMIGPTGVGKTEIARRLAKLAGAPFIKVEATKFTEVGYVGRDVESIIRDLVEIA 107



 Score = 28.8 bits (65), Expect = 5.2
 Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 2/33 (6%)

Query: 51  AVASVHQPSIIFIDEIDSLLCQRSD-QENETSR 82
           A+ +  Q  I+FIDEID  + +R      + SR
Sbjct: 244 AIDAAEQNGIVFIDEIDK-IAKRGGSGGPDVSR 275


>gnl|CDD|235364 PRK05201, hslU, ATP-dependent protease ATP-binding subunit HslU;
           Provisional.
          Length = 443

 Score = 30.0 bits (69), Expect = 1.7
 Identities = 10/17 (58%), Positives = 12/17 (70%)

Query: 51  AVASVHQPSIIFIDEID 67
           A+  V Q  I+FIDEID
Sbjct: 243 AIERVEQNGIVFIDEID 259


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
          Provisional.
          Length = 482

 Score = 30.3 bits (69), Expect = 1.8
 Identities = 9/12 (75%), Positives = 11/12 (91%)

Query: 1  ILLFGPPGTGKT 12
          +LL+GPPG GKT
Sbjct: 42 LLLYGPPGVGKT 53


>gnl|CDD|235335 PRK05057, aroK, shikimate kinase I; Reviewed.
          Length = 172

 Score = 29.3 bits (66), Expect = 1.9
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQCKATFF 27
          I L GP G GK+ IG+ +A Q    F+
Sbjct: 7  IFLVGPMGAGKSTIGRQLAQQLNMEFY 33


>gnl|CDD|182120 PRK09862, PRK09862, putative ATP-dependent protease; Provisional.
          Length = 506

 Score = 29.9 bits (67), Expect = 2.3
 Identities = 14/49 (28%), Positives = 24/49 (48%), Gaps = 11/49 (22%)

Query: 1   ILLFGPPGTGKTLIGKCV-----------AAQCKATFFCISASTLTSKW 38
           +LL GPPGTGKT++   +           A +  A    ++A ++  +W
Sbjct: 213 LLLIGPPGTGKTMLASRINGLLPDLSNEEALESAAILSLVNAESVQKQW 261


>gnl|CDD|221912 pfam13086, AAA_11, AAA domain.  This family of domains contain a
          P-loop motif that is characteristic of the AAA
          superfamily. Many of the proteins in this family are
          conjugative transfer proteins.
          Length = 220

 Score = 29.2 bits (66), Expect = 2.4
 Identities = 9/11 (81%), Positives = 10/11 (90%)

Query: 2  LLFGPPGTGKT 12
          L+ GPPGTGKT
Sbjct: 21 LIQGPPGTGKT 31


>gnl|CDD|116340 pfam07726, AAA_3, ATPase family associated with various cellular
          activities (AAA).  This Pfam entry includes some of the
          AAA proteins not detected by the pfam00004 model.
          Length = 131

 Score = 28.7 bits (65), Expect = 2.4
 Identities = 9/19 (47%), Positives = 13/19 (68%)

Query: 1  ILLFGPPGTGKTLIGKCVA 19
          +LL G PG  KTL+ + +A
Sbjct: 2  VLLEGVPGLAKTLLARTLA 20


>gnl|CDD|222036 pfam13304, AAA_21, AAA domain. 
          Length = 256

 Score = 29.3 bits (65), Expect = 2.6
 Identities = 11/40 (27%), Positives = 22/40 (55%), Gaps = 2/40 (5%)

Query: 33  TLTSKWYGEGEKMVRALFA--VASVHQPSIIFIDEIDSLL 70
            L  K   +G K + AL    ++++ + S++ IDE ++ L
Sbjct: 184 KLLIKGISDGTKRLLALLLALLSALPKGSLLLIDEPENGL 223


>gnl|CDD|203260 pfam05496, RuvB_N, Holliday junction DNA helicase ruvB N-terminus. 
           The RuvB protein makes up part of the RuvABC revolvasome
           which catalyzes the resolution of Holliday junctions
           that arise during genetic recombination and DNA repair.
           Branch migration is catalyzed by the RuvB protein that
           is targeted to the Holliday junction by the structure
           specific RuvA protein. This family contains the
           N-terminal region of the protein.
          Length = 231

 Score = 29.0 bits (66), Expect = 2.8
 Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 18/71 (25%)

Query: 1   ILLFGPPGTGKTLIGKCVA----AQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVH 56
           +LL+GPPG GKT +   +A       + T    S   L        EK    L A+ +  
Sbjct: 53  VLLYGPPGLGKTTLANIIANEMGVNIRIT----SGPAL--------EK-PGDLAAILTNL 99

Query: 57  QP-SIIFIDEI 66
           +P  ++FIDEI
Sbjct: 100 EPGDVLFIDEI 110


>gnl|CDD|222000 pfam13238, AAA_18, AAA domain. 
          Length = 128

 Score = 28.2 bits (63), Expect = 2.8
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 1  ILLFGPPGTGKTLIGK 16
          IL+ G PG+GKT + K
Sbjct: 1  ILITGTPGSGKTTLAK 16


>gnl|CDD|226583 COG4098, comFA, Superfamily II DNA/RNA helicase required for DNA
           uptake (late competence protein) [DNA replication,
           recombination, and repair].
          Length = 441

 Score = 29.4 bits (66), Expect = 2.9
 Identities = 15/58 (25%), Positives = 24/58 (41%), Gaps = 11/58 (18%)

Query: 18  VAAQCKATFFCISASTLTSKWYGEGEKMVRALFAVASVHQP-------SIIFIDEIDS 68
           +  + K  F       L    YG+ +   RA   VA+ HQ         ++ IDE+D+
Sbjct: 160 LYPRLKQAFSNCDIDLL----YGDSDSYFRAPLVVATTHQLLRFKQAFDLLIIDEVDA 213


>gnl|CDD|224145 COG1224, TIP49, DNA helicase TIP49, TBP-interacting protein
           [Transcription].
          Length = 450

 Score = 29.2 bits (66), Expect = 3.1
 Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 2/51 (3%)

Query: 1   ILLFGPPGTGKTLIGKCVAAQC-KATFFC-ISASTLTSKWYGEGEKMVRAL 49
           IL+ GPPGTGKT +   +A +  +   F  IS S + S    + E + +AL
Sbjct: 68  ILIVGPPGTGKTALAMGIARELGEDVPFVAISGSEIYSLEVKKTEALTQAL 118


>gnl|CDD|238713 cd01428, ADK, Adenylate kinase (ADK) catalyzes the reversible
          phosphoryl transfer from adenosine triphosphates (ATP)
          to adenosine monophosphates (AMP) and to yield
          adenosine diphosphates (ADP). This enzyme is required
          for the biosynthesis of ADP and is essential for
          homeostasis of adenosine phosphates.
          Length = 194

 Score = 28.4 bits (64), Expect = 4.0
 Identities = 9/16 (56%), Positives = 11/16 (68%)

Query: 1  ILLFGPPGTGKTLIGK 16
          ILL GPPG+GK    +
Sbjct: 2  ILLLGPPGSGKGTQAE 17


>gnl|CDD|187652 cd08948, 5beta-POR_like_SDR_a, progesterone 5-beta-reductase-like
           proteins (5beta-POR), atypical (a) SDRs.  5beta-POR
           catalyzes the reduction of progesterone to
           5beta-pregnane-3,20-dione in Digitalis plants. This
           subgroup of atypical-extended SDRs, shares the structure
           of an extended SDR, but has a different glycine-rich
           nucleotide binding motif  (GXXGXXG) and lacks the YXXXK
           active site motif of classical and extended SDRs.
           Tyr-179 and Lys 147 are present in the active site, but
           not in the usual SDR configuration. Given these
           differences, it has been proposed that this subfamily
           represents a new SDR class. Other atypical SDRs include
           biliverdin IX beta reductase (BVR-B,aka flavin
           reductase), NMRa (a negative transcriptional regulator
           of various fungi), phenylcoumaran benzylic ether and
           pinoresinol-lariciresinol reductases, phenylpropene
           synthases, eugenol synthase, triphenylmethane reductase,
           isoflavone reductases, and others. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. In addition to the
           Rossmann fold core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids, and typically have a
           TGXXGXXG cofactor binding motif. Complex (multidomain)
           SDRs such as ketoreductase domains of fatty acid
           synthase have a GGXGXXG NAD(P)-binding motif and an
           altered active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 308

 Score = 28.8 bits (65), Expect = 4.0
 Identities = 18/71 (25%), Positives = 24/71 (33%), Gaps = 13/71 (18%)

Query: 99  DDDLVLVIGA--TNRPQELDEAARRRLVK------RLYIPLPDEQARCEIVTKLLRNITN 150
           D  LV  IG    +    +  A    L          YI  PDE    E+   +LRN   
Sbjct: 42  DPRLVEHIGIDLLDPADTVLRAKLPGLEDVTHVFYAAYIERPDEAELVEVNGAMLRN--- 98

Query: 151 SLTVDDIEEVG 161
              +D +E   
Sbjct: 99  --FLDALEPAS 107


>gnl|CDD|184195 PRK13635, cbiO, cobalt transporter ATP-binding subunit;
           Provisional.
          Length = 279

 Score = 28.8 bits (65), Expect = 4.3
 Identities = 23/65 (35%), Positives = 33/65 (50%), Gaps = 17/65 (26%)

Query: 57  QPSIIFIDEIDSLLCQRSDQEN-ETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQEL 115
           QP II +DE  S+L  R  +E  ET R+LK +  I+             V+  T+   +L
Sbjct: 158 QPDIIILDEATSMLDPRGRREVLETVRQLKEQKGIT-------------VLSITH---DL 201

Query: 116 DEAAR 120
           DEAA+
Sbjct: 202 DEAAQ 206



 Score = 28.8 bits (65), Expect = 4.3
 Identities = 23/65 (35%), Positives = 33/65 (50%), Gaps = 17/65 (26%)

Query: 224 QPSIIFIDEIDSLLCQRSDQEN-ETSRRLKTEFLISLDGASTLDDDLVLVIGATNRPQEL 282
           QP II +DE  S+L  R  +E  ET R+LK +  I+             V+  T+   +L
Sbjct: 158 QPDIIILDEATSMLDPRGRREVLETVRQLKEQKGIT-------------VLSITH---DL 201

Query: 283 DEAAR 287
           DEAA+
Sbjct: 202 DEAAQ 206


>gnl|CDD|190347 pfam02562, PhoH, PhoH-like protein.  PhoH is a cytoplasmic
          protein and predicted ATPase that is induced by
          phosphate starvation.
          Length = 205

 Score = 28.2 bits (64), Expect = 4.5
 Identities = 8/21 (38%), Positives = 10/21 (47%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQ 21
          +   GP GTGKT +    A  
Sbjct: 22 VFGIGPAGTGKTYLAVAAAVD 42


>gnl|CDD|233120 TIGR00764, lon_rel, lon-related putative ATP-dependent protease. 
          This model represents a set of proteins with extensive
          C-terminal homology to the ATP-dependent protease La,
          product of the lon gene of E. coli. The model is based
          on a seed alignment containing only archaeal members,
          but several bacterial proteins match the model well.
          Because several species, including Thermotoga maritima
          and Treponema pallidum, contain both a close homolog of
          the lon protease and nearly full-length homolog of the
          members of this family, we suggest there may also be a
          functional division between the two families. Members
          of this family from Pyrococcus horikoshii and
          Pyrococcus abyssi each contain a predicted intein
          [Protein fate, Degradation of proteins, peptides, and
          glycopeptides].
          Length = 608

 Score = 28.7 bits (64), Expect = 4.8
 Identities = 9/19 (47%), Positives = 14/19 (73%)

Query: 1  ILLFGPPGTGKTLIGKCVA 19
          +LL G PG GK+++ K +A
Sbjct: 40 VLLIGEPGVGKSMLAKAMA 58


>gnl|CDD|225804 COG3265, GntK, Gluconate kinase [Carbohydrate transport and
          metabolism].
          Length = 161

 Score = 28.0 bits (63), Expect = 4.8
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 5  GPPGTGKTLIGKCVAAQCKATF 26
          G  G+GK+ +G  +A +  A F
Sbjct: 2  GVSGSGKSTVGSALAERLGAKF 23


>gnl|CDD|178613 PLN03055, PLN03055, AMP deaminase; Provisional.
          Length = 602

 Score = 28.7 bits (64), Expect = 4.8
 Identities = 19/69 (27%), Positives = 30/69 (43%), Gaps = 2/69 (2%)

Query: 128 YIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASLCREASLGPVRSI 187
           Y P P  Q    +V  ++         +++  V   TT F+  DM  + R  SLG VR+ 
Sbjct: 53  YEPEPPSQHVFRMVDGVMHVYAPDDAKEELFPVPDATTFFT--DMHRILRIVSLGNVRTF 110

Query: 188 DLSRIDALD 196
              R+  L+
Sbjct: 111 CHHRLKLLE 119


>gnl|CDD|213188 cd03221, ABCF_EF-3, ATP-binding cassette domain of elongation
          factor 3, subfamily F.  Elongation factor 3 (EF-3) is a
          cytosolic protein required by fungal ribosomes for in
          vitro protein synthesis and for in vivo growth. EF-3
          stimulates the binding of the EF-1: GTP: aa-tRNA
          ternary complex to the ribosomal A site by facilitated
          release of the deacylated tRNA from the E site. The
          reaction requires ATP hydrolysis. EF-3 contains two ATP
          nucleotide binding sequence (NBS) motifs. NBSI is
          sufficient for the intrinsic ATPase activity. NBSII is
          essential for the ribosome-stimulated functions.
          Length = 144

 Score = 27.8 bits (63), Expect = 4.9
 Identities = 17/65 (26%), Positives = 31/65 (47%), Gaps = 5/65 (7%)

Query: 5  GPPGTGKTLIGKCVAAQCKATF-FCISASTLTSKWYGE---GEKMVRALFAVASVHQPSI 60
          G  G GK+ + K +A + +         ST+   ++ +   GEKM R   A   +  P++
Sbjct: 33 GRNGAGKSTLLKLIAGELEPDEGIVTWGSTVKIGYFEQLSGGEKM-RLALAKLLLENPNL 91

Query: 61 IFIDE 65
          + +DE
Sbjct: 92 LLLDE 96


>gnl|CDD|147949 pfam06068, TIP49, TIP49 C-terminus.  This family consists of the
          C-terminal region of several eukaryotic and archaeal
          RuvB-like 1 (Pontin or TIP49a) and RuvB-like 2 (Reptin
          or TIP49b) proteins. The N-terminal domain contains the
          pfam00004 domain. In zebrafish, the liebeskummer (lik)
          mutation, causes development of hyperplastic embryonic
          hearts. lik encodes Reptin, a component of a
          DNA-stimulated ATPase complex. Beta-catenin and Pontin,
          a DNA-stimulated ATPase that is often part of complexes
          with Reptin, are in the same genetic pathways. The
          Reptin/Pontin ratio serves to regulate heart growth
          during development, at least in part via the
          beta-catenin pathway. TBP-interacting protein 49
          (TIP49) was originally identified as a TBP-binding
          protein, and two related proteins are encoded by
          individual genes, tip49a and b. Although the function
          of this gene family has not been elucidated, they are
          supposed to play a critical role in nuclear events
          because they interact with various kinds of nuclear
          factors and have DNA helicase activities.TIP49a has
          been suggested to act as an autoantigen in some
          patients with autoimmune diseases.
          Length = 395

 Score = 28.8 bits (65), Expect = 5.0
 Identities = 16/38 (42%), Positives = 24/38 (63%), Gaps = 2/38 (5%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQC-KATFFC-ISASTLTS 36
          +L+ GPPGTGKT +   ++ +  + T FC IS S + S
Sbjct: 53 VLIAGPPGTGKTALAIAISKELGEDTPFCPISGSEVYS 90


>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
          Length = 329

 Score = 28.7 bits (65), Expect = 5.0
 Identities = 9/19 (47%), Positives = 13/19 (68%)

Query: 1   ILLFGPPGTGKTLIGKCVA 19
           +L +G  GTGKT +  C+A
Sbjct: 186 LLFYGNTGTGKTFLSNCIA 204


>gnl|CDD|227629 COG5317, COG5317, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 175

 Score = 27.9 bits (62), Expect = 5.2
 Identities = 27/105 (25%), Positives = 38/105 (36%), Gaps = 23/105 (21%)

Query: 236 LLCQRSDQENETSRR--LKTEFLISLDGASTLDD---------DLVLVIGATNRPQELDE 284
           LL QR+    E +R   L  +  + LD  S   D             ++  + R Q    
Sbjct: 79  LLLQRAVNSGEMTREQVLSEKSKVRLDTPSADRDAPGWNELPESFRDLVERSLRLQAR-- 136

Query: 285 AARRRLVKRLYIPLPDEQKVP-GSNPTMDKIDQENETSRRLKTEF 328
              RRL + +Y       +VP G NP   +ID        L T F
Sbjct: 137 --VRRLDREIYGESEATDEVPRGRNPVQAQID-------LLTTAF 172


>gnl|CDD|223775 COG0703, AroK, Shikimate kinase [Amino acid transport and
          metabolism].
          Length = 172

 Score = 27.9 bits (63), Expect = 5.5
 Identities = 10/27 (37%), Positives = 14/27 (51%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQCKATFF 27
          I+L G  G GK+ IG+ +A      F 
Sbjct: 5  IVLIGFMGAGKSTIGRALAKALNLPFI 31


>gnl|CDD|237495 PRK13765, PRK13765, ATP-dependent protease Lon; Provisional.
          Length = 637

 Score = 28.8 bits (65), Expect = 5.6
 Identities = 8/19 (42%), Positives = 15/19 (78%)

Query: 1  ILLFGPPGTGKTLIGKCVA 19
          +++ G PGTGK+++ K +A
Sbjct: 53 VMMIGSPGTGKSMLAKAMA 71


>gnl|CDD|233208 TIGR00956, 3a01205, Pleiotropic Drug Resistance (PDR) Family
           protein.  [Transport and binding proteins, Other].
          Length = 1394

 Score = 28.9 bits (65), Expect = 5.6
 Identities = 9/19 (47%), Positives = 13/19 (68%)

Query: 52  VASVHQPSIIFIDEIDSLL 70
           + ++HQPS I  +E D LL
Sbjct: 956 LCTIHQPSAILFEEFDRLL 974



 Score = 28.2 bits (63), Expect = 8.1
 Identities = 9/16 (56%), Positives = 11/16 (68%)

Query: 222 VHQPSIIFIDEIDSLL 237
           +HQPS I  +E D LL
Sbjct: 959 IHQPSAILFEEFDRLL 974


>gnl|CDD|234063 TIGR02928, TIGR02928, orc1/cdc6 family replication initiation
          protein.  Members of this protein family are found
          exclusively in the archaea. This set of DNA binding
          proteins shows homology to the origin recognition
          complex subunit 1/cell division control protein 6
          family in eukaryotes. Several members may be found in
          genome and interact with each other [DNA metabolism,
          DNA replication, recombination, and repair].
          Length = 365

 Score = 28.4 bits (64), Expect = 6.0
 Identities = 8/18 (44%), Positives = 12/18 (66%)

Query: 1  ILLFGPPGTGKTLIGKCV 18
          + ++G  GTGKT + K V
Sbjct: 43 VFIYGKTGTGKTAVTKYV 60


>gnl|CDD|238687 cd01393, recA_like, RecA is a  bacterial enzyme which has roles
          in homologous recombination, DNA repair, and the
          induction of the SOS response.  RecA couples ATP
          hydrolysis to DNA strand exchange. While prokaryotes
          have a single RecA protein, eukaryotes have multiple
          RecA homologs such as Rad51, DMC1 and Rad55/57.
          Archaea have the RecA-like homologs radA and radB.
          Length = 226

 Score = 27.9 bits (63), Expect = 6.1
 Identities = 7/20 (35%), Positives = 12/20 (60%)

Query: 3  LFGPPGTGKTLIGKCVAAQC 22
          +FG  G+GKT +   +A + 
Sbjct: 24 IFGEFGSGKTQLCLQLAVEA 43


>gnl|CDD|224054 COG1131, CcmA, ABC-type multidrug transport system, ATPase
           component [Defense mechanisms].
          Length = 293

 Score = 28.0 bits (63), Expect = 6.2
 Identities = 9/24 (37%), Positives = 14/24 (58%), Gaps = 1/24 (4%)

Query: 42  GEKMVRALFAVASVHQPSIIFIDE 65
           G K  R   A+A +H P ++ +DE
Sbjct: 140 GMKQ-RLSIALALLHDPELLILDE 162


>gnl|CDD|219158 pfam06745, KaiC, KaiC.  This family represents a conserved region
          within bacterial and archaeal proteins, most of which
          are hypothetical. More than one copy is sometimes found
          in each protein. This family includes KaiC, which is
          one of the Kai proteins among which direct
          protein-protein association may be a critical process
          in the generation of circadian rhythms in
          cyanobacteria.
          Length = 231

 Score = 28.0 bits (63), Expect = 6.3
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 1  ILLFGPPGTGKTLIGK 16
          +L+ G PGTGKT+ G 
Sbjct: 22 VLITGGPGTGKTIFGL 37


>gnl|CDD|238544 cd01124, KaiC, KaiC is a circadian clock protein primarily found
          in cyanobacteria KaiC is a RecA-like ATPase, having
          both Walker A and Walker B motifs. A related protein is
          found in archaea.
          Length = 187

 Score = 27.7 bits (62), Expect = 6.4
 Identities = 9/15 (60%), Positives = 9/15 (60%)

Query: 1  ILLFGPPGTGKTLIG 15
           LL G PGTGKT   
Sbjct: 2  TLLSGGPGTGKTTFA 16


>gnl|CDD|221983 pfam13207, AAA_17, AAA domain. 
          Length = 114

 Score = 26.9 bits (59), Expect = 6.7
 Identities = 10/40 (25%), Positives = 17/40 (42%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQCKATFFCISASTLTSKWYG 40
          IL+ GPPG+GK+ + K +A +       +           
Sbjct: 2  ILITGPPGSGKSTLAKKLAEKLGIPVISLDDLLREEGLAE 41


>gnl|CDD|214364 CHL00100, ilvH, acetohydroxyacid synthase small subunit.
          Length = 174

 Score = 27.8 bits (62), Expect = 6.8
 Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 12/77 (15%)

Query: 124 VKRLYIPLPDEQARCEIVTKLLRNITNSLTVDDIEEVGRMTTDFSGADMASLCREASLGP 183
           + R+ + +P +    E +TK L  + N L V DI             ++  + RE  L  
Sbjct: 43  ISRITMVVPGDDRTIEQLTKQLYKLVNILKVQDI------------TNIPCVERELMLIK 90

Query: 184 VRSIDLSRIDALDVRPI 200
           +     +R + L++  I
Sbjct: 91  INVNSQTRPEILEIAQI 107


>gnl|CDD|173927 cd07766, DHQ_Fe-ADH, Dehydroquinate synthase-like (DHQ-like) and
           iron-containing alcohol dehydrogenases (Fe-ADH).
           Dehydroquinate synthase-like. This superfamily divides
           into two subgroups: the dehydroquinate synthase-like,
           and a large metal-containing  alcohol dehydrogenases
           (ADH), known as iron-containing alcohol dehydrogenases.
           Dehydroquinate synthase (DHQS) catalyzes the conversion
           of 3-deoxy-D-arabino-heptulosonate-7-phosphate (DAHP) to
           dehydroquinate (DHQ) in the second step of the shikimate
           pathway. This pathway involves seven sequential
           enzymatic steps in the conversion of erythrose
           4-phosphate and phosphoenolpyruvate into chorismate for
           subsequent synthesis of aromatic compounds.
           Dehydroquinate synthase-like group includes
           dehydroquinate synthase, 2-deoxy-scyllo-inosose
           synthase, and 2-epi-5-epi-valiolone synthase. The
           alcohol dehydrogenases in this superfamily contain a
           dehydroquinate synthase-like protein structural fold and
           mostly contain iron. They are distinct from other
           alcohol dehydrogenases which contains different protein
           domains. There are several distinct families of alcohol
           dehydrogenases: Zinc-containing long-chain alcohol
           dehydrogenases; insect-type, or short-chain alcohol
           dehydrogenases; iron-containing alcohol dehydrogenases,
           and others. The iron-containing family has a Rossmann
           fold-like topology that resembles the fold of the
           zinc-dependent alcohol dehydrogenases, but lacks
           sequence homology, and differs in strand arrangement.
           ADH catalyzes the reversible oxidation of alcohol to
           acetaldehyde with the simultaneous reduction of NAD(P)+
           to NAD(P)H.
          Length = 332

 Score = 28.1 bits (63), Expect = 6.8
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 301 EQKVPGSNPTMDKIDQENETSRRLKTEFLISLDGASTLD 339
                G NPT +++ +  E +R  + + +I++ G STLD
Sbjct: 54  IFDGVGPNPTFEEVKEAVERARAAEVDAVIAVGGGSTLD 92


>gnl|CDD|213197 cd03230, ABC_DR_subfamily_A, ATP-binding cassette domain of the
           drug resistance transporter and related proteins,
           subfamily A.  This family of ATP-binding proteins
           belongs to a multi-subunit transporter involved in drug
           resistance (BcrA and DrrA), nodulation, lipid transport,
           and lantibiotic immunity. In bacteria and archaea, these
           transporters usually include an ATP-binding protein and
           one or two integral membrane proteins. Eukaryotic
           systems of the ABCA subfamily display ABC domains that
           are quite similar to this family. The ATP-binding domain
           shows the highest similarity between all members of the
           ABC transporter family. ABC transporters are a subset of
           nucleotide hydrolases that contain a signature motif,
           Q-loop, and H-loop/switch region, in addition to, the
           Walker A motif/P-loop and Walker B motif commonly found
           in a number of ATP- and GTP-binding and hydrolyzing
           proteins.
          Length = 173

 Score = 27.7 bits (63), Expect = 7.0
 Identities = 9/24 (37%), Positives = 13/24 (54%), Gaps = 1/24 (4%)

Query: 42  GEKMVRALFAVASVHQPSIIFIDE 65
           G K  R   A A +H P ++ +DE
Sbjct: 99  GMKQ-RLALAQALLHDPELLILDE 121


>gnl|CDD|216270 pfam01057, Parvo_NS1, Parvovirus non-structural protein NS1.  This
           family also contains the NS2 protein. Parvoviruses
           encode two non-structural proteins, NS1 and NS2. The
           mRNA for NS2 contains the coding sequence for the first
           87 amino acids of NS1, then by an alternative splicing
           mechanism mRNA from a different reading frame, encoding
           the last 78 amino acids, makes up the full length of the
           NS2 mRNA. NS1, is the major non-structural protein. It
           is essential for DNA replication. It is an 83-kDa
           nuclear phosphoprotein. It has DNA helicase and ATPase
           activity.
          Length = 271

 Score = 28.1 bits (63), Expect = 7.3
 Identities = 6/14 (42%), Positives = 9/14 (64%)

Query: 1   ILLFGPPGTGKTLI 14
           +  +GP  TGKT +
Sbjct: 116 VWFYGPASTGKTNL 129


>gnl|CDD|234654 PRK00131, aroK, shikimate kinase; Reviewed.
          Length = 175

 Score = 27.5 bits (62), Expect = 7.4
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 1  ILLFGPPGTGKTLIGKCVAAQCKATFF 27
          I+L G  G GK+ IG+ +A +    F 
Sbjct: 7  IVLIGFMGAGKSTIGRLLAKRLGYDFI 33


>gnl|CDD|216285 pfam01075, Glyco_transf_9, Glycosyltransferase family 9
           (heptosyltransferase).  Members of this family belong to
           glycosyltransferase family 9. Lipopolysaccharide is a
           major component of the outer leaflet of the outer
           membrane in Gram-negative bacteria. It is composed of
           three domains; lipid A, Core oligosaccharide and the
           O-antigen. All of these enzymes transfer heptose to the
           lipopolysaccharide core.
          Length = 249

 Score = 27.7 bits (62), Expect = 7.5
 Identities = 21/98 (21%), Positives = 30/98 (30%), Gaps = 20/98 (20%)

Query: 27  FCISASTLTSKWYGEGEKMVRALFAVASVHQPSIIFIDEIDSLLCQRSDQENETSRRLKT 86
           FC  AS    +W  E       L          ++       L     D+E E ++RL  
Sbjct: 112 FCPGASRPAKRWPAEH---FAELAQALQERGWQVV-------LFGGAEDREEEIAKRLAA 161

Query: 87  EFLISLDGASTLDDDLVLVIGATNRPQELDEAARRRLV 124
                      L    V ++G T+  Q     A   LV
Sbjct: 162 G----------LRTPCVNLLGKTSLEQAAALLAGADLV 189


>gnl|CDD|236783 PRK10875, recD, exonuclease V subunit alpha; Provisional.
          Length = 615

 Score = 28.0 bits (63), Expect = 8.0
 Identities = 13/35 (37%), Positives = 18/35 (51%), Gaps = 3/35 (8%)

Query: 273 IGATNRPQELDEAARRRLVKRLYIPLPDE---QKV 304
               N   E+DEA  R+ +  L+ P+ DE   QKV
Sbjct: 125 FNEVNHAIEVDEALLRQTLDALFGPVTDEVDWQKV 159


>gnl|CDD|234763 PRK00440, rfc, replication factor C small subunit; Reviewed.
          Length = 319

 Score = 27.5 bits (62), Expect = 9.5
 Identities = 11/12 (91%), Positives = 11/12 (91%), Gaps = 1/12 (8%)

Query: 2  LLF-GPPGTGKT 12
          LLF GPPGTGKT
Sbjct: 41 LLFAGPPGTGKT 52


>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
          Length = 985

 Score = 27.8 bits (62), Expect = 9.8
 Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 4/42 (9%)

Query: 47  RALFAVASVHQPSIIFIDEIDSLLCQRSDQENETSRRLKTEF 88
            ALF    +   S+ FID+I S        +NE   +LK  F
Sbjct: 375 EALFDKRGIKALSLFFIDKIASYR----SYDNELPAKLKGLF 412


>gnl|CDD|214940 smart00962, SRP54, SRP54-type protein, GTPase domain.  This entry
          represents the GTPase domain of the 54 kDa SRP54
          component, a GTP-binding protein that interacts with
          the signal sequence when it emerges from the ribosome.
          SRP54 of the signal recognition particle has a
          three-domain structure: an N-terminal helical bundle
          domain, a GTPase domain, and the M-domain that binds
          the 7s RNA and also binds the signal sequence. The
          extreme C-terminal region is glycine-rich and lower in
          complexity and poorly conserved between species. The
          GTPase domain is evolutionary related to P-loop NTPase
          domains found in a variety of other proteins.
          Length = 197

 Score = 27.4 bits (62), Expect = 9.8
 Identities = 14/24 (58%), Positives = 16/24 (66%), Gaps = 2/24 (8%)

Query: 1  ILLFGPPGTGKT-LIGKCVAAQCK 23
          ILL GP G GKT  I K +AA+ K
Sbjct: 4  ILLVGPNGVGKTTTIAK-LAARLK 26


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.136    0.385 

Gapped
Lambda     K      H
   0.267   0.0889    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,819,087
Number of extensions: 1774297
Number of successful extensions: 2679
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2587
Number of HSP's successfully gapped: 186
Length of query: 343
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 245
Effective length of database: 6,590,910
Effective search space: 1614772950
Effective search space used: 1614772950
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.2 bits)