RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy568
         (1614 letters)



>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
          Length = 3164

 Score =  830 bits (2145), Expect = 0.0
 Identities = 349/1616 (21%), Positives = 587/1616 (36%), Gaps = 140/1616 (8%)

Query: 21   EGNEEGGPWMEKVLQLYQISNLNHGLMMVGPSGSGKSTAWKVLLKALERYEGVEGVAHII 80
            E  +  G  + + +   +  +              K   +         Y       ++ 
Sbjct: 1160 ESQQVSGLDVAQFVSFLRSVDTGAFHAEYFRVFLCKIKHYTDACD----YLWHVKSPYVK 1215

Query: 81   DPKAISKEALYGVLDPNTREWTDGLFTHILRRIIDNVRGEIS-KRQWIIFDGDVDPEWVE 139
                          D    E     F    R   +++   ++  +     +      +VE
Sbjct: 1216 KK----------YFDA-DMELR-QFFLMFNR---EDMEARLADSKMEYEVE-----RYVE 1255

Query: 140  NLNSVLDDNKLLTLPNGERLSLPPNIRIMFEVQDLKYATLATVSRCGMIWFSEDVLSTEM 199
               + +   KL     GE        +++        +    V RC + + S   LST+ 
Sbjct: 1256 KTKAEVSSLKLELSSVGEG-------QVVVSNLG---SIGDKVGRCLVEYDSISRLSTKG 1305

Query: 200  IFENYLSRLRNIALDDIDDDSSLLITVDATGKAPDDVLSPALTLQQDVASILSTHFAPDG 259
            +F + L       LD+  D  S                      +  +            
Sbjct: 1306 VFLDELGD-TKRYLDECLDFFSC------------FEEVQKEIDELSMVFCAD------- 1345

Query: 260  LVVRALDYAMQQEHIMD-------FTRLRALGSLFSMLNQGVRNVLQYNHSHSDFPLSQD 312
                AL ++    HI+             A  SL     +                L   
Sbjct: 1346 ----ALRFSADLYHIVKERRFSGVLAGSDASESLGGKSIELAAI------------LEHK 1389

Query: 313  VVERYIPRILVYSLLWSFAGDGKLKMRSDFGNFLRSVTTITLPATSS----DIVDFEVNI 368
             +   + R +   L     GD   +    F           L   S      I+ F ++ 
Sbjct: 1390 DLIVEMKRGINDVLKLRIFGDKCRESTPRFYLISDGDLIKDLNERSDYEEMLIMMFNISA 1449

Query: 369  KNGEWVPWSNKVPQIEVETQKVA-ASDVVVPTLDTVRHESLLYTWLAEHKPLVLCGPPGS 427
                       +   E+  ++V    +VV+PT DT   +S     L   +  + CGPPGS
Sbjct: 1450 VITNNGS----IAGFELRGERVMLRKEVVIPTSDTGFVDSFSNEALNTLRSYIYCGPPGS 1505

Query: 428  GKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYRKTPNGVILSPIQLGKWL 487
            GK M +  +LR+    EV   NFS+ T     L   +   EY      V L P  + K L
Sbjct: 1506 GKEMLMCPSLRSELITEVKYFNFSTCTMTPSKLSVLERETEYYPNTGVVRLYPKPVVKDL 1565

Query: 488  VLFCDEINLPDMDKYATQRVISFLRQLIEQRGFYRPADKQWVSLERIQCVGACNPPTDPG 547
            VLFCDEINLP   +Y    VI FLR L+E++GF+      WV++  I   GACNP TD G
Sbjct: 1566 VLFCDEINLPYGFEYYPPTVIVFLRPLVERQGFWSSIAVSWVTICGIILYGACNPGTDEG 1625

Query: 548  RKPLSHRFLRHVPVIYVDYPGETSLKQIYGTFSRAMLRLIPPLRGYADALTNAMVELYLA 607
            R     RF+R    ++  YP   SL+ IY                 ++   +A VELYL+
Sbjct: 1626 RVKYYERFIRKPVFVFCCYPELASLRNIYEAVLMGSYLCFDEFNRLSEETMSASVELYLS 1685

Query: 608  SQEKFTQDMQPHYVYSPREMTRWVRGICEAIRPLESLTVEGLVRLWAHEALRLFQDRLVN 667
            S++K    +Q +Y Y PRE+TR +R I              L+  W  EA+R   DRLV 
Sbjct: 1686 SKDKTKFFLQMNYGYKPRELTRSLRAIFGYAETRIDTPDVSLIIDWYCEAIREKIDRLVQ 1745

Query: 668  DVERQWTNENIDAVAMKYFSNIDKEVLA-RPILYSNWLSKNYVPVGTTELREYVQARLKV 726
              E   + +++    ++    +    +    I +S  L      +   +L  +V+   K+
Sbjct: 1746 QKESSTSRQDLYDFGLRAIREMIAGHIGEAEITFSMILFFGMACLLKKDLAVFVEEVRKI 1805

Query: 727  FYEEELDVQLVLFDEVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAFMNGLSVF 786
            F    LDV+ V + + L H+LR  R      GH +L GV   G      FV ++N  ++ 
Sbjct: 1806 FGSSHLDVEAVAYKDALLHILRSRRGLLVVGGHGVLKGVLIRGACDAREFVCWLNPRNMR 1865

Query: 787  QIRAHNKYTGADFDEDLRTVLRRSGCKNEKIAFLLDESNVLESGFLERMNTLLANGEIPG 846
            +I  H      DF + L+    R      +    + ES  +ES FLE  N LL N     
Sbjct: 1866 EIFGHRDELTGDFRDSLKVQDLRRNIHGGRECLFIFESIPVESSFLEDFNPLLDNNRFLC 1925

Query: 847  LFEGDEYTTLMTQCKEGAQREGLMLDSNEELYKWFTQQVMKNLHVVFTMNPSSEGLKDRA 906
            LF G+E   +    +   +   L  D+   L + F   + +NL VVF+   S +      
Sbjct: 1926 LFSGNERIRIPENLRFVFESTSLEKDTEATLTRVFLVYMEENLPVVFSACCSQDTSVLAG 1985

Query: 907  ATSPALFNRCVLNWFGDWSDTALYQVAKEFTSKIDLDGPQNWKAPDFFPSVCSLVSTTPS 966
              SPAL NRC +++   W    + Q A    +     G   +   +       L     +
Sbjct: 1986 IRSPALKNRCFIDFKKLWDTEEMSQYANSVETLSRDGGRVFFINGE-------LGVGKGA 2038

Query: 967  --HRDSVINACVYVHQTLHKANARLSKRGSRTMAITPRHYLDFINHFVKLYREKCSELEE 1024
                    +A V            +S     ++  +   ++  +  +         EL+ 
Sbjct: 2039 LISEVFGDDAVVI-----EGRGFEISMIEG-SLGESKIKFIGGLKVYDARCVIYIEELDC 2092

Query: 1025 QQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQ 1084
              ++L  G+ K  E    + E+++ L+     L  K + A+  L     ++ E E + V 
Sbjct: 2093 TNVNLVEGVRKYNEYGRGMGELKEQLSNTVVILGVKEKNADDALSGTPGERLEREVKSVF 2152

Query: 1085 SQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSV 1144
             +  +  +     E+ +++  VM+  +  +PAV++A   V +IKK  L E+RS   PP  
Sbjct: 2153 VEAPRDMLFLLEEEVRKRKGSVMKFKSSKKPAVLEAVLFVYKIKKASLREIRSFIRPPGD 2212

Query: 1145 VKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITD-EVREKMHSRYLSN 1203
            + + +E +C LLG  A  W   +  + R++FI  I   +  E+  D E R    +R  S+
Sbjct: 2213 LCIEMEDVCDLLGFEAKIWFGEQQSLRRDDFIRIIG-KYPDEIEFDLEARRFREARECSD 2271

Query: 1204 PDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETK 1263
            P ++    NRAS ACGP+ +W + + + + +L+   PLR E K ++ +A   + +    K
Sbjct: 2272 PSFTGSILNRASKACGPLKRWLVRECNRSKVLEVKIPLREEEKRIDGEAFLVEDRLTLGK 2331

Query: 1264 DLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQA 1323
             L + L       +                          Y+  I +      D+D V  
Sbjct: 2332 GLSSDLMTFKLRRRS------------------------YYSLDILRVHGKIADMDTVHK 2367

Query: 1324 KVERSMALLKSLGIERERWEATSETFRSQMATIIGDVLLSSAYLAYAGYFDQHYRQSLFS 1383
             V RS+ + + L  E   W          M  + GD   SS    Y G      R   F 
Sbjct: 2368 DVLRSIFVSEILINEDSEWGGVFSEVPKLMVELDGDGHPSSCLHPYIGTLGFLCRAIEF- 2426

Query: 1384 TWNSHLIAAGIQFR-PEIA---LTEYLSSPDERLRWQGNALPSDHLCTENA-IMLRRFNR 1438
                  I    +FR  EI             E  + +     S     EN+ I     + 
Sbjct: 2427 --GMSFIRISKEFRDKEIRRRQFITEGVQKIEDFKEEAC---STDYGLENSRIRKDLQDL 2481

Query: 1439 YPLIIDPSGQATEFILKEFESRKITKTSFLDDAFRKNLESALRFGNPLLVQDVENYDTIL 1498
              ++ DPS +      + ++ +K    SF +  F   L  A R G+  ++ D E  D  +
Sbjct: 2482 TAVLNDPSSKIVTSQRQMYDEKKAILGSFREMEFAFGLSQARREGSDKIIGDAEALDEEI 2541

Query: 1499 NPVLNRELRRTGGRVLITLGDQDIDISPTFVIFLSTRDPTVEFPPDICSRVTFVNFTVTR 1558
              ++  E +     V + +   +I  S    +F  +   + +       ++  V F    
Sbjct: 2542 GRLIKEEFKSNLSEVKVMINPPEIVRSTVEAVFWLSEGRSGDMGSIEWKQLIQVMFVSKV 2601

Query: 1559 SSLQSQCLNRVLKAERPDIDTKRSDLLKLQGEFHLRLRHLEKSLLGALNESKGKLL 1614
               +++  + + K     +      L  L+    L L  L + LL  L  S   + 
Sbjct: 2602 LGCETEIPDALEKLVSGPLFVHEKALNALKACGSLFLWVLARYLLAKLMLSISNME 2657


>gnl|CDD|193256 pfam12780, AAA_8, P-loop containing dynein motor region D4.  The 380
            kDa motor unit of dynein belongs to the AAA class of
            chaperone-like ATPases. The core of the 380 kDa motor
            unit contains a concatenated chain of six AAA modules, of
            which four correspond to the ATP binding sites with
            P-loop signatures described previously, and two are
            modules in which the P loop has been lost in evolution.
            This particular family is the D4 ATP-binding region of
            the motor.
          Length = 268

 Score =  156 bits (396), Expect = 2e-42
 Identities = 86/283 (30%), Positives = 141/283 (49%), Gaps = 25/283 (8%)

Query: 734  VQLVLFDEVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNK 793
            + LVLF++ + HV RI+RI   P+G+ LL+GV G+GK +LSR  A+++ L VFQI     
Sbjct: 8    MNLVLFEDAMAHVCRINRILESPRGNALLVGVGGSGKQSLSRLAAYISSLDVFQITLRKG 67

Query: 794  YTGADFDEDLRTVLRRSGCKNEKIAFLLDESNVLESGFLERMNTLLANGEIPGLFEGDEY 853
            Y   D   DL     ++G KN    FL+ ++ V +  FL  +N LLA+GEIP LF  DE 
Sbjct: 68   YGIPDLKIDLAAQCIKAGVKNVPTVFLMTDAQVADEQFLVLINDLLASGEIPDLFMDDEV 127

Query: 854  TTLMTQCKEGAQREGLMLDSNEELYKWFTQQVMKNLHVVFTMNPSSEGLKDRAATSPALF 913
              +++  +   +  G + D+ E  +K+F  +V + L V+   +P    L+ R+   PA+ 
Sbjct: 128  ENIISSVRNEVKSLG-LNDTRENCWKFFIDRVRRQLKVILCFSPVGSVLRVRSRKFPAVV 186

Query: 914  NRCVLNWFGDWSDTALYQVAKEFTSKIDLDGPQNWKAPDFFPSVCSLVSTTPSHRDSVIN 973
            N   ++WF +W   AL  V+  F  + +    +   +  FF +                 
Sbjct: 187  NCTAIDWFHEWPQDALVSVSARFLEETEGIPWEVKASISFFMA----------------- 229

Query: 974  ACVYVHQTLHKANARLSKRGSRTMAITPRHYLDFINHFVKLYR 1016
               YVH ++++ +        R    TP+ +L+ I    KLY+
Sbjct: 230  ---YVHTSVNEMSRVYLANERRYNYTTPKSFLEQI----KLYQ 265


>gnl|CDD|193253 pfam12777, MT, Microtubule-binding stalk of dynein motor.  the 380
            kDa motor unit of dynein belongs to the AAA class of
            chaperone-like ATPases. The core of the 380 kDa motor
            unit contains a concatenated chain of six AAA modules, of
            which four correspond to the ATP binding sites with
            P-loop signatures described previously, and two are
            modules in which the P loop has been lost in evolution.
            This family is the region between D4 and D5 and is the
            two predicted alpha-helical coiled coil segments that
            form the stalk supporting the ATP-sensitive microtubule
            binding component.
          Length = 344

 Score =  142 bits (358), Expect = 1e-36
 Identities = 100/368 (27%), Positives = 173/368 (47%), Gaps = 33/368 (8%)

Query: 1029 LNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDI 1088
            L  GL K+  T  QV++++  LA++  EL+ KNE A+  ++ +  + ++  K K  +   
Sbjct: 3    LENGLMKLQSTASQVDDLKAKLAIQEVELKQKNEDADKLIQVVGVETEKVSKEKAIADQE 62

Query: 1089 QAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSVVKLA 1148
            + ++     E+ +K+    EDLA+ EPA++ AQ+A+  + K  L EL+S  +PP  V   
Sbjct: 63   EVKVAVINKEVTEKQKDCEEDLAKAEPALLAAQEALNTLNKNNLTELKSFGSPPDAVVNV 122

Query: 1149 LESICLLLGENAT-----DWKAIRAVVMR-ENFINSIVSNFNTEMITDEVREKMHSRYLS 1202
              ++ +L            WKA + ++ + + F++S++ NF+ E I +    K    YL 
Sbjct: 123  TAAVMVLTAPGGKIPKDRSWKAAKVMMGKVDTFLDSLI-NFDKENIHENCL-KAIKPYLG 180

Query: 1203 NPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEET 1262
            +P++  E     S A   +  W I  + + ++   VEP R  L+    + +  + K    
Sbjct: 181  DPEFDPEFIRSKSTAAAGLCSWVINIVRFYEVYCDVEPKRQALEKANAELAAAQEKLARI 240

Query: 1263 KDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQ 1322
            K  I +L  ++A+    + +  A A  +K              Q  A+AT     L N  
Sbjct: 241  KAKIAELNANLANLTAAFEK--ATADKLK-------------CQQEAEATNRTILLAN-- 283

Query: 1323 AKVERSMALLKSLGIERERWEATSETFRSQMATIIGDVLLSSAYLAYAGYFDQHYRQSLF 1382
                    L+  L  E  RW    E F+SQ  T+ GDVLL +A+++Y GYF + YR  L 
Sbjct: 284  -------RLVGGLASENVRWAEAVENFKSQEVTLCGDVLLITAFVSYVGYFTKKYRNELM 336

Query: 1383 S-TWNSHL 1389
               W  +L
Sbjct: 337  ERFWIPYL 344


>gnl|CDD|193257 pfam12781, AAA_9, ATP-binding dynein motor region D5.  The 380 kDa
            motor unit of dynein belongs to the AAA class of
            chaperone-like ATPases. The core of the 380 kDa motor
            unit contains a concatenated chain of six AAA modules, of
            which four correspond to the ATP binding sites with
            P-loop signatures described previously, and two are
            modules in which the P loop has been lost in evolution.
            This particular family is the D5 ATP-binding region of
            the motor, but has lost its P-loop.
          Length = 228

 Score =  129 bits (325), Expect = 2e-33
 Identities = 71/205 (34%), Positives = 119/205 (58%), Gaps = 12/205 (5%)

Query: 1415 WQGNALPSDHLCTENAIMLRRFNRYPLIIDPSGQATEFILKEFESR----KITKTSFLDD 1470
            W    LPSD + TENA +L    R+PL++DP  Q  ++I  ++ S     +I +  +LD 
Sbjct: 11   WNNEGLPSDRMSTENATILGNCERWPLMVDPQLQGIKWIKNKYGSDLRVIRIGQKGYLD- 69

Query: 1471 AFRKNLESALRFGNPLLVQDV-ENYDTILNPVLNRELRRTGGRVLITLGDQDIDISPTFV 1529
                 +E A+  G+ +L++++ E+ D +L+P+L R   + G    I +GD++++ +P F 
Sbjct: 70   ----VIEQAISEGDTVLIENIEESIDPVLDPLLGRNTIKKGR--YIKIGDKEVEYNPKFR 123

Query: 1530 IFLSTRDPTVEFPPDICSRVTFVNFTVTRSSLQSQCLNRVLKAERPDIDTKRSDLLKLQG 1589
            + L T+     + P++ ++ T +NFTVTR  L+ Q L  V+  ERPD++  +SDL K Q 
Sbjct: 124  LILHTKLANPHYKPEMQAQCTLINFTVTRDGLEDQLLAAVVAKERPDLEQLKSDLTKQQN 183

Query: 1590 EFHLRLRHLEKSLLGALNESKGKLL 1614
            EF + L+ LE SLL  L+ + G  L
Sbjct: 184  EFKIVLKELEDSLLSRLSSASGNFL 208


>gnl|CDD|193251 pfam12775, AAA_7, P-loop containing dynein motor region D3.  the
           380 kDa motor unit of dynein belongs to the AAA class of
           chaperone-like ATPases. The core of the 380 kDa motor
           unit contains a concatenated chain of six AAA modules,
           of which four correspond to the ATP binding sites with
           P-loop signatures described previously, and two are
           modules in which the P loop has been lost in evolution.
           This particular family is the D3 and is an ATP binding
           site.
          Length = 272

 Score = 93.8 bits (233), Expect = 1e-20
 Identities = 75/277 (27%), Positives = 132/277 (47%), Gaps = 31/277 (11%)

Query: 395 VVVPTLDTVRHESLLYTWLAEHKPLVLCGPPGSGKTMTLLSALRAL-PDMEVV-SLNFSS 452
            +V T +T+R    +   +   +P++L G  G+GK++ +   L +L PD  +V ++ F+ 
Sbjct: 12  CLVHTSETIRVRYFMDLLMERQRPVMLVGNAGTGKSVLVGDKLASLDPDAYLVKNVPFNY 71

Query: 453 ATTPELLLKTFDHYCEYRKTPNGVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLR 512
            TT  +L    +   E +   N     P    K LV F D++N+P++D Y T +  + +R
Sbjct: 72  YTTSAMLQAVLEKPLEKKAGRN---YGPPGTKK-LVYFIDDMNMPEVDAYGTVQPHTLIR 127

Query: 513 QLIEQRGFYRPADKQWVSLERI---QCVGACNPPTDPGRKPLSHRFLRHVPVIYVDYPGE 569
           Q ++   +Y   D+  ++L+ I   Q V   NP    G   ++ R  RH  V  + +PG+
Sbjct: 128 QHMDYGHWY---DRSKLTLKEIHNCQYVSCMNPTA--GSFTINSRLQRHFCVFALSFPGQ 182

Query: 570 TSLKQIY----------GTFSRAMLRLIPPLRGYADALTNAMVELYLASQEKFTQDMQPH 619
            +L  IY          G+ S A+ +  PPL   A AL   +   +L +  KF      H
Sbjct: 183 DALSTIYNIILTQHLALGSVSAALQKSSPPLVDAALALHQKIATTFLPTAIKF------H 236

Query: 620 YVYSPREMTRWVRGICEAIRPLESLTVEGLVRLWAHE 656
           YV++ R+++   +G+  +         + L+RLW HE
Sbjct: 237 YVFNLRDLSNIFQGLLFSSVECLKSPWD-LIRLWLHE 272


>gnl|CDD|219538 pfam07728, AAA_5, AAA domain (dynein-related subfamily).  This Pfam
           entry includes some of the AAA proteins not detected by
           the pfam00004 model.
          Length = 135

 Score = 67.7 bits (166), Expect = 3e-13
 Identities = 34/151 (22%), Positives = 59/151 (39%), Gaps = 26/151 (17%)

Query: 45  GLMMVGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEALYGVLDPNTR--EWT 102
           G+++VGP G+GKS   + L  AL               +  ++E L G  + +     W 
Sbjct: 1   GVLLVGPPGTGKSELAERLAAALSNRPVFYVQLT----RDTTEEDLKGRRNIDPGGASWV 56

Query: 103 DGLFTHILRRIIDNVRGEISKRQWIIFDGDVD---PEWVENLNSVLDDNKLLTLPNGERL 159
           DG      R               I    +++   P+ + +L S+LD+ +LL    GE +
Sbjct: 57  DGPLVRAAR------------EGEIAVLDEINRANPDVLNSLLSLLDERRLLLPEGGELV 104

Query: 160 SLPP-NIRIMFEV----QDLKYATLATVSRC 185
              P   R++  +    + L   + A  SR 
Sbjct: 105 KAAPDGFRLIATMNPLDRGLNELSPALRSRF 135



 Score = 44.2 bits (105), Expect = 5e-05
 Identities = 33/142 (23%), Positives = 56/142 (39%), Gaps = 11/142 (7%)

Query: 418 PLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYRKTPNGVI 477
            ++L GPPG+GK+        AL +  V  +  +  TT E L    +         +G +
Sbjct: 1   GVLLVGPPGTGKSELAERLAAALSNRPVFYVQLTRDTTEEDLKGRRNIDPGGASWVDGPL 60

Query: 478 LSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLIEQRGFYRPADKQWVSLER--IQ 535
           +   + G+  +   DEIN  + D      V++ L  L+++R    P   + V       +
Sbjct: 61  VRAAREGE--IAVLDEINRANPD------VLNSLLSLLDERRLLLPEGGELVKAAPDGFR 112

Query: 536 CVGACNPPTDPGRKPLSHRFLR 557
            +   NP  D G   LS     
Sbjct: 113 LIATMNPL-DRGLNELSPALRS 133



 Score = 41.9 bits (99), Expect = 4e-04
 Identities = 37/169 (21%), Positives = 59/169 (34%), Gaps = 45/169 (26%)

Query: 759 HLLLIGVSGAGKTTLS-RFVAFMNGLSVFQIRAHNKYTGADF----------DEDLRTVL 807
            +LL+G  G GK+ L+ R  A ++   VF ++     T  D              +   L
Sbjct: 1   GVLLVGPPGTGKSELAERLAAALSNRPVFYVQLTRDTTEEDLKGRRNIDPGGASWVDGPL 60

Query: 808 RRSGCKNEKIAFLLDESNVLESGFLERMNTLLANGEIPGLFEGDEYTTLMTQCKEGAQRE 867
            R+  +  +IA  LDE N      L  + +LL    +                       
Sbjct: 61  VRAA-REGEIAV-LDEINRANPDVLNSLLSLLDERRL----------------------- 95

Query: 868 GLMLDSNEELYKWFTQQVMKNLHVVFTMNPSSEGLKDRAATSPALFNRC 916
            L+ +  E +             ++ TMNP   GL +    SPAL +R 
Sbjct: 96  -LLPEGGELVKA-----APDGFRLIATMNPLDRGLNE---LSPALRSRF 135


>gnl|CDD|99707 cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of
           cellular Activities) superfamily represents an ancient
           group of ATPases belonging to the ASCE (for additional
           strand, catalytic E) division of the P-loop NTPase fold.
           The ASCE division also includes ABC, RecA-like,
           VirD4-like, PilT-like, and SF1/2 helicases. Members of
           the AAA+ ATPases function as molecular chaperons, ATPase
           subunits of proteases, helicases, or nucleic-acid
           stimulated ATPases. The AAA+ proteins contain several
           distinct features in addition to the conserved
           alpha-beta-alpha core domain structure and the Walker A
           and B motifs of the P-loop NTPases.
          Length = 151

 Score = 67.9 bits (166), Expect = 5e-13
 Identities = 36/167 (21%), Positives = 58/167 (34%), Gaps = 21/167 (12%)

Query: 398 PTLDTVRHESLLYTWLAEHKPLVLCGPPGSGKTMTLLSALRAL--PDMEVVSLNFSSATT 455
              +           L   K L+L GPPG+GKT    +    L  P    + LN S    
Sbjct: 1   VGQEEAIEALREALELPPPKNLLLYGPPGTGKTTLARAIANELFRPGAPFLYLNASDLLE 60

Query: 456 PELLLKTFDHYCEYRKTPNGVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLI 515
             ++ + F H+               +  K  VLF DEI+   + + A   ++  L  L 
Sbjct: 61  GLVVAELFGHFLVRLLF------ELAEKAKPGVLFIDEID--SLSRGAQNALLRVLETLN 112

Query: 516 EQRGFYRPADKQWVSLERIQCVGACNPPTDPG-RKPLSHRFLRHVPV 561
           + R             E ++ +GA N P      + L  R    + +
Sbjct: 113 DLRIDR----------ENVRVIGATNRPLLGDLDRALYDRLDIRIVI 149



 Score = 47.1 bits (112), Expect = 8e-06
 Identities = 26/98 (26%), Positives = 38/98 (38%), Gaps = 7/98 (7%)

Query: 749 IDRIFRQPQGHLLLIGVSGAGKTTLSRFVA---FMNGLSVFQIRAHNKYTGADFDEDLRT 805
            + +   P  +LLL G  G GKTTL+R +A   F  G     + A +   G    E    
Sbjct: 11  REALELPPPKNLLLYGPPGTGKTTLARAIANELFRPGAPFLYLNASDLLEGLVVAELFGH 70

Query: 806 VLRRS----GCKNEKIAFLLDESNVLESGFLERMNTLL 839
            L R       K +     +DE + L  G    +  +L
Sbjct: 71  FLVRLLFELAEKAKPGVLFIDEIDSLSRGAQNALLRVL 108



 Score = 29.4 bits (66), Expect = 7.6
 Identities = 10/40 (25%), Positives = 18/40 (45%)

Query: 32 KVLQLYQISNLNHGLMMVGPSGSGKSTAWKVLLKALERYE 71
          + L+          L++ GP G+GK+T  + +   L R  
Sbjct: 8  EALREALELPPPKNLLLYGPPGTGKTTLARAIANELFRPG 47


>gnl|CDD|214640 smart00382, AAA, ATPases associated with a variety of cellular
           activities.  AAA - ATPases associated with a variety of
           cellular activities. This profile/alignment only detects
           a fraction of this vast family. The poorly conserved
           N-terminal helix is missing from the alignment.
          Length = 148

 Score = 54.3 bits (130), Expect = 3e-08
 Identities = 29/153 (18%), Positives = 49/153 (32%), Gaps = 17/153 (11%)

Query: 416 HKPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYRKTPNG 475
            + +++ GPPGSGKT    +  R L       +        E +L          K  +G
Sbjct: 2   GEVILIVGPPGSGKTTLARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVGGKKASG 61

Query: 476 -------VILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLIEQRGFYRPADKQW 528
                  + L+  +  K  VL  DEI      +          ++ +             
Sbjct: 62  SGELRLRLALALARKLKPDVLILDEITSLLDAE----------QEALLLLLEELRLLLLL 111

Query: 529 VSLERIQCVGACNPPTDPGRKPLSHRFLRHVPV 561
            S + +  +   N   D G   L  RF R + +
Sbjct: 112 KSEKNLTVILTTNDEKDLGPALLRRRFDRRIVL 144



 Score = 35.4 bits (81), Expect = 0.068
 Identities = 36/163 (22%), Positives = 54/163 (33%), Gaps = 19/163 (11%)

Query: 756 PQGHLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLRRSGCKNE 815
           P   +L++G  G+GKTTL+R +A   G     +     Y   +   +             
Sbjct: 1   PGEVILIVGPPGSGKTTLARALARELGPPGGGV----IYIDGEDILEEVLDQLLLIIVGG 56

Query: 816 KIAFLLDESNVLESGFLERMNTLLANGEIPGLFEGDEYTTLMTQCKEGAQREGLMLDSNE 875
           K A    E  +           L    +   L   DE T+L+         +  +L   E
Sbjct: 57  KKASGSGELRL------RLALALARKLKPDVLIL-DEITSLLD------AEQEALLLLLE 103

Query: 876 ELYKWFTQQVMKNLHVVFTMNPSSEGLKDRAATSPALFNRCVL 918
           EL      +  KNL V+ T N   +     A        R VL
Sbjct: 104 ELRLLLLLKSEKNLTVILTTNDEKD--LGPALLRRRFDRRIVL 144



 Score = 35.4 bits (81), Expect = 0.091
 Identities = 26/90 (28%), Positives = 36/90 (40%), Gaps = 8/90 (8%)

Query: 43  NHGLMMVGPSGSGKSTAWKVLLKALERYEGVEGV-AHIIDPKAISKEALYGVLDPNTREW 101
              +++VGP GSGK+T    L +AL R  G  G     ID + I +E L  +L       
Sbjct: 2   GEVILIVGPPGSGKTT----LARALARELGPPGGGVIYIDGEDILEEVLDQLLLIIVGGK 57

Query: 102 TDGLFT-HILRRIIDNVRGEISKRQWIIFD 130
                    LR  +   R    K   +I D
Sbjct: 58  KASGSGELRLRLALALARK--LKPDVLILD 85


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
            bacterial type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. This family
            represents the SMC protein of most bacteria. The smc gene
            is often associated with scpB (TIGR00281) and scpA genes,
            where scp stands for segregation and condensation
            protein. SMC was shown (in Caulobacter crescentus) to be
            induced early in S phase but present and bound to DNA
            throughout the cell cycle [Cellular processes, Cell
            division, DNA metabolism, Chromosome-associated
            proteins].
          Length = 1179

 Score = 52.0 bits (125), Expect = 3e-06
 Identities = 60/331 (18%), Positives = 135/331 (40%), Gaps = 34/331 (10%)

Query: 1015 YREKCSELEEQQLHLNV--------GLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANL 1066
            Y+E  +EL E +L L V         L ++ E +++ EE  + L  + QEL+ K E   L
Sbjct: 215  YKELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRL 274

Query: 1067 KLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQ---- 1122
            ++ E+ ++ +E +K      +  + +E+Q   + ++   +   L ++E  + + +     
Sbjct: 275  EVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334

Query: 1123 ------AVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIR-AVVMRENF 1175
                   ++E  ++   EL S+      ++  LE +   L E     + +R  V   E  
Sbjct: 335  LAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQ 394

Query: 1176 INSIVSNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADML 1235
            I S+ +         E  E    R     +   +K   A +                ++ 
Sbjct: 395  IASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAEL---------------KELQ 439

Query: 1236 KKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLD 1295
             ++E L  EL+ L+ +    +   EE ++ + + E+++ + + E AQL A+  +++   +
Sbjct: 440  AELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQE 499

Query: 1296 NVQAKFYEYAQLIAQATAIKTDLDNVQAKVE 1326
            N++        L+   + +   L  +   + 
Sbjct: 500  NLEGFSEGVKALLKNQSGLSGILGVLSELIS 530



 Score = 46.6 bits (111), Expect = 1e-04
 Identities = 64/326 (19%), Positives = 131/326 (40%), Gaps = 28/326 (8%)

Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
             EK  ELEE+   L   L ++ + +E++EE  + L  + +EL  +  A    L  +  + 
Sbjct: 683  EEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEV 742

Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVEL 1135
            ++ E+R  Q      E+E +  E+ ++     E+LA+ E A ++  +A  E  K++L  L
Sbjct: 743  EQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAE-AEIEELEAQIEQLKEELKAL 801

Query: 1136 RSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREK 1195
            R   +        L +   LL E A +    R   +      +     + E   +E+ E 
Sbjct: 802  REALD-------ELRAELTLLNEEAAN-LRERLESLERRIAATERRLEDLEEQIEELSED 853

Query: 1196 MHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASEN 1255
            + S      +                     +++      +      L L   E++    
Sbjct: 854  IESLAAEIEELEELIEE------------LESELEALLNERASLEEALALLRSELEELSE 901

Query: 1256 KAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQL-----IAQ 1310
            + +  E+K   ++L + +   +++ AQL  +   ++  +DN+Q +  E   L      A 
Sbjct: 902  ELRELESK--RSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEAL 959

Query: 1311 ATAIKTDLDNVQAKVERSMALLKSLG 1336
               I+ D +  + +++R    +K LG
Sbjct: 960  ENKIEDDEEEARRRLKRLENKIKELG 985



 Score = 40.8 bits (96), Expect = 0.006
 Identities = 21/96 (21%), Positives = 49/96 (51%), Gaps = 3/96 (3%)

Query: 1008 INHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLK 1067
            +    ++ RE+ + LE Q   L   L ++   ++++ E    L  K +EL+ + E+   +
Sbjct: 300  LEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAE 359

Query: 1068 LKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKR 1103
            L+E+  + +E E R    ++++ ++E    ++AQ  
Sbjct: 360  LEELEAELEELESR---LEELEEQLETLRSKVAQLE 392



 Score = 40.0 bits (94), Expect = 0.013
 Identities = 53/320 (16%), Positives = 122/320 (38%), Gaps = 41/320 (12%)

Query: 1036 IAETVEQVE--EMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQ------- 1086
            + E   Q++  E Q   A + +EL+++     L L  +  ++   E  ++Q +       
Sbjct: 195  LNELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAEEE 254

Query: 1087 --DIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAV-------------KEIKKQQ 1131
              ++ AE+++   ++ + R+ V E   ++E    + Q+ +             K+I +++
Sbjct: 255  LEELTAELQELEEKLEELRLEVSELEEEIE----ELQKELYALANEISRLEQQKQILRER 310

Query: 1132 LVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDE 1191
            L  L             LES    L E   + +     +  E  + S+ +        + 
Sbjct: 311  LANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEE--LESLEAELEEL---EA 365

Query: 1192 VREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKK-----VEPLRLELK 1246
              E++ SR L   +   E            +     +I   +   +      E L+ E++
Sbjct: 366  ELEELESR-LEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIE 424

Query: 1247 SLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQ 1306
             L  +  E + K  E +  + +LE+ +   ++E  +L      ++ +L+  +       +
Sbjct: 425  ELLKKLEEAELK--ELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAER 482

Query: 1307 LIAQATAIKTDLDNVQAKVE 1326
             +AQ  A    L+ +Q  +E
Sbjct: 483  ELAQLQARLDSLERLQENLE 502



 Score = 38.9 bits (91), Expect = 0.026
 Identities = 28/116 (24%), Positives = 57/116 (49%), Gaps = 6/116 (5%)

Query: 1242 RLELKSLEVQASENKAKGE--ETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQA 1299
             L L+  E++    + + E     + I++LE+     ++  A L  Q   ++  L+ +++
Sbjct: 271  ELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELES 330

Query: 1300 KFYEYAQLIAQATA----IKTDLDNVQAKVERSMALLKSLGIERERWEATSETFRS 1351
            K  E A+ +A+       +K +L++++A++E   A L+ L    E  E   ET RS
Sbjct: 331  KLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRS 386



 Score = 33.9 bits (78), Expect = 0.96
 Identities = 15/105 (14%), Positives = 49/105 (46%), Gaps = 3/105 (2%)

Query: 1243 LELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFY 1302
            L L+ L  +  E + + +E ++ + +L   +   +++  +L  + + ++ +++ +Q + Y
Sbjct: 232  LRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELY 291

Query: 1303 EYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSE 1347
              A  I+    ++     ++ ++      L+ L  + E  E+  +
Sbjct: 292  ALANEIS---RLEQQKQILRERLANLERQLEELEAQLEELESKLD 333



 Score = 30.8 bits (70), Expect = 7.2
 Identities = 24/127 (18%), Positives = 52/127 (40%), Gaps = 11/127 (8%)

Query: 1242 RLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKF 1301
            R E++ LE +  E + K  E +  + +L K +   ++E  QL  +   +   +  ++   
Sbjct: 676  RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDL 735

Query: 1302 YE-------YAQLIAQATAIKTDLDN----VQAKVERSMALLKSLGIERERWEATSETFR 1350
                       + IAQ +   T+L+     ++ ++E +   L     E E  EA  E  +
Sbjct: 736  ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLK 795

Query: 1351 SQMATII 1357
             ++  + 
Sbjct: 796  EELKALR 802


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
            chromosome partitioning].
          Length = 1163

 Score = 51.6 bits (124), Expect = 4e-06
 Identities = 56/351 (15%), Positives = 140/351 (39%), Gaps = 65/351 (18%)

Query: 1015 YREKCSELEEQQL-----HLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKN---EAANL 1066
            Y+E  +EL E +L      L     ++ E  E++  +++ L    +EL+      E    
Sbjct: 215  YQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKS 274

Query: 1067 KLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKE 1126
            +L+E+ ++ +E ++  ++ ++   E+E +   + ++   +  +L ++E  + + ++ ++ 
Sbjct: 275  ELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEA 334

Query: 1127 IKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTE 1186
            +K++                  LE    LL E       +           S +     E
Sbjct: 335  LKEE------------------LEERETLLEELEQLLAELEEAKEELEEKLSALLEELEE 376

Query: 1187 MITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELK 1246
            +  + +RE++                         ++  +A+I       ++E L+ E++
Sbjct: 377  LF-EALREELAE-----------------------LEAELAEIR-----NELEELKREIE 407

Query: 1247 SLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQ 1306
            SLE +      + E+ K+ + +LE  +   + E  +L         +L+ ++ +     +
Sbjct: 408  SLEERLERLSERLEDLKEELKELEAELEELQTELEEL-------NEELEELEEQ---LEE 457

Query: 1307 LIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSETFRSQMATII 1357
            L  +   ++ +L  +Q +++R    L SL    +R EA     +   A + 
Sbjct: 458  LRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLE 508



 Score = 46.2 bits (110), Expect = 2e-04
 Identities = 45/310 (14%), Positives = 118/310 (38%), Gaps = 54/310 (17%)

Query: 1020 SELEEQQLHLNVGLGKIAETVEQVEEMQKSLA---VKSQELQSKNEAANLKLKEMIKDQQ 1076
             EL+E +  L     ++ +  E+++ ++  L       +EL+ + E    +L+E+ ++  
Sbjct: 667  RELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELA 726

Query: 1077 EAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELR 1136
              E+   Q Q    E+E++  E+ ++   + E L ++E  +   ++A+ ++K++      
Sbjct: 727  ALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEE------ 780

Query: 1137 SMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREKM 1196
                        +E +           + +  +          +     E+ + E R + 
Sbjct: 781  ------------IEELEEKRQALQ---EELEELEEELEEAERRLDALERELESLEQRRER 825

Query: 1197 HSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENK 1256
              + +                          +    ++ +K++ L  EL+ LE +  E K
Sbjct: 826  LEQEIEE-----------------------LEEEIEELEEKLDELEEELEELEKELEELK 862

Query: 1257 AKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKT 1316
             + EE +    +LE  +   ++E  +L  +       L  ++++  E  + I +      
Sbjct: 863  EELEELEAEKEELEDELKELEEEKEELEEE-------LRELESELAELKEEIEKLRERLE 915

Query: 1317 DLDNVQAKVE 1326
            +L+    ++E
Sbjct: 916  ELEAKLERLE 925



 Score = 42.8 bits (101), Expect = 0.002
 Identities = 51/339 (15%), Positives = 133/339 (39%), Gaps = 26/339 (7%)

Query: 1035 KIAETVEQVEEMQKSLAVKSQELQS-KNEAANLKLKEM-IKDQQEAEKRKVQSQDIQAEI 1092
            ++ E  E++ E++  L    +EL+S KNE  +L+     ++ Q E  +R+++    +   
Sbjct: 668  ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAA 727

Query: 1093 EKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSVVKLALESI 1152
             ++ +E  Q R+  +E+  +     ++  Q   E  +++L  L         +K  +E +
Sbjct: 728  LEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALA---KLKEEIEEL 784

Query: 1153 CLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREKMHSRYLSNPDYSYEKAN 1212
                       + +  +          +     E+ + E R +   + +   +   E+  
Sbjct: 785  EEKRQALQ---EELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELE 841

Query: 1213 RASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGE-------ETKDL 1265
                      +    +    ++ +++E L  E + LE +  E + + E       E +  
Sbjct: 842  EKLDEL--EEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESE 899

Query: 1266 ITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKV 1325
            + +L++ I   ++   +L A+   ++ +L  ++ +  E  +       ++T+L+    ++
Sbjct: 900  LAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDT-----LETELEREIERL 954

Query: 1326 ERSMALLKSLGIER-ERWEATSETFR---SQMATIIGDV 1360
            E  +  L  + +   E +E   E +    SQ   +    
Sbjct: 955  EEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAK 993



 Score = 41.6 bits (98), Expect = 0.004
 Identities = 44/252 (17%), Positives = 95/252 (37%), Gaps = 38/252 (15%)

Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
            +E+  ELEE+       L  +   +E +E+ ++ L  + +EL+ + E    KL E+ ++ 
Sbjct: 792  QEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEEL 851

Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVEL 1135
            +E EK   + ++   E+E +  E+  +   + E+  ++E  + + +  + E+ K+++ +L
Sbjct: 852  EELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAEL-KEEIEKL 910

Query: 1136 RSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREK 1195
            R            LE     L E   +                       + +  E+  +
Sbjct: 911  RERLEELEAKLERLEVELPELEEELEE--------------------EYEDTLETELERE 950

Query: 1196 MHSRYLSNPDYSYEKANRASMACGPMVKWAIAQIS-----YADMLKKVEPLRLELKSLEV 1250
            +            E+      A GP+   AI +       Y ++  + E L    + L  
Sbjct: 951  I------------ERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLE 998

Query: 1251 QASENKAKGEET 1262
               E   +  E 
Sbjct: 999  VIEELDKEKRER 1010



 Score = 41.6 bits (98), Expect = 0.004
 Identities = 59/318 (18%), Positives = 136/318 (42%), Gaps = 51/318 (16%)

Query: 1015 YREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAAN------LKL 1068
             +E   ELEE +  L+    ++ E  E++EE +K +     EL+   E         L+L
Sbjct: 234  LKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLEL 293

Query: 1069 KEMIKD--------QQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDA 1120
            KE I++        ++  E+ + + ++++  +E+   +I   +    E+L + E  + + 
Sbjct: 294  KEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALK----EELEERETLLEEL 349

Query: 1121 QQAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIV 1180
            +Q + E+++    EL       S +   LE +   L E   +      +    N +  + 
Sbjct: 350  EQLLAELEE-AKEELEEKL---SALLEELEELFEALREELAE--LEAELAEIRNELEEL- 402

Query: 1181 SNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEP 1240
                 E+                 +   E+  R S      ++    ++   ++  ++E 
Sbjct: 403  ---KREI-----------------ESLEERLERLSE----RLEDLKEEL--KELEAELEE 436

Query: 1241 LRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAK 1300
            L+ EL+ L  +  E + + EE +D + +LE+ +A  ++E  +L  + ++++  LD ++A+
Sbjct: 437  LQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAE 496

Query: 1301 FYEYAQLIAQATAIKTDL 1318
                  + A   A+++ L
Sbjct: 497  QRASQGVRAVLEALESGL 514



 Score = 34.7 bits (80), Expect = 0.53
 Identities = 27/126 (21%), Positives = 57/126 (45%), Gaps = 2/126 (1%)

Query: 1008 INHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLK 1067
            I+   +   E  +ELEE +  L     KI    E++EE +  L    Q L    EA   +
Sbjct: 304  ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKE-E 362

Query: 1068 LKEMIKDQQEAEKRKVQS-QDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKE 1126
            L+E +    E  +   ++ ++  AE+E +  EI  +   +  ++  +E  +    + +++
Sbjct: 363  LEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLED 422

Query: 1127 IKKQQL 1132
            +K++  
Sbjct: 423  LKEELK 428



 Score = 34.3 bits (79), Expect = 0.79
 Identities = 59/354 (16%), Positives = 127/354 (35%), Gaps = 54/354 (15%)

Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
            + +  ELEE+   L   L ++ E +E++EE  +SL     +L+ + E    K       Q
Sbjct: 736  QSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKR------Q 789

Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVEL 1135
               E+ +    +++ E+E+    +      +     + E      +Q ++E++++     
Sbjct: 790  ALQEELE----ELEEELEEAERRLDALERELESLEQRRE----RLEQEIEELEEEI---- 837

Query: 1136 RSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREK 1195
                     ++  L+ +   L E     K +  +      + +       E+   E  ++
Sbjct: 838  -------EELEEKLDELEEELEELE---KELEELKEELEELEAEKEELEDELKELEEEKE 887

Query: 1196 MHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASEN 1255
                 L   +    +            +    +    ++  K+E L +EL  LE +  E 
Sbjct: 888  ELEEELRELESELAELKE---------EIEKLRERLEELEAKLERLEVELPELEEELEE- 937

Query: 1256 KAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIK 1315
                E    L T+LE+ I   ++E   L         + + V+     Y +L +Q   ++
Sbjct: 938  ----EYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEE---RYEELKSQREDLE 990

Query: 1316 TDLDNVQAKVERSMALLKSLGIER--ERWEATSETFRSQMATIIGDVLLSSAYL 1367
               + +   +E     L     ER  E ++  +E F      + G     +A L
Sbjct: 991  EAKEKLLEVIEE----LDKEKRERFKETFDKINENFSEIFKELFGG---GTAEL 1037


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 51.2 bits (122), Expect = 6e-06
 Identities = 82/384 (21%), Positives = 140/384 (36%), Gaps = 68/384 (17%)

Query: 416  HKPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYC-------- 467
             KP++L G PG GKT +L++AL      +++ +N S  T    L   F            
Sbjct: 1543 GKPILLEGSPGVGKT-SLITALARKTGKKLIRINLSEQTD---LCDLFGSDLPVEEGGEF 1598

Query: 468  EYRKTPNGVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLIEQRG--FYRPAD 525
             +   P    L  ++ G W++L  DEINL      A+Q V+  L   ++ R   +    D
Sbjct: 1599 RWMDAP---FLHAMRDGGWVLL--DEINL------ASQSVLEGLNACLDHRREAYIPELD 1647

Query: 526  KQWVSLERIQCVGACNPP-TDPGRKPLSHRFLRHVPVIYVDYPGETSLKQIYGTFSRAML 584
            K +      +   A NP     GRK L   FL    V+ +D      +  I    +  M 
Sbjct: 1648 KTFDVHPNFRVFAAQNPQDQGGGRKGLPKSFLNRFSVVKMDGLTTDDITHI----ANKMY 1703

Query: 585  RLIPPLRGYADALTNAMVELYLASQEKFTQDMQPHYVYSPREMTRWVR-----GICEAIR 639
              +            + ++  +     F     P + ++ R+  RW+      G  E + 
Sbjct: 1704 PQVNEDWRLKIIKFMSRLQDNIEKDISFGSFGSP-WEFNLRDTLRWLILLNQVGTLEDVD 1762

Query: 640  PLESLTVEGLVRLWAHEALRLFQDRLVNDVERQWTNENIDAVAMKYFSNIDKEVLARPIL 699
              + +    + R+   E      +                +++ + F  +   V  R   
Sbjct: 1763 TSDFIDESVVRRMRTVEDRVRTCELFKEVFGDYEPRTIGFSLSSQCF-KVGHSVTVRMKE 1821

Query: 700  YSNWLSKNYVPVGTTELREYVQARLKVFYEEELDVQLVLFDEVLDHVLR-IDRIFRQPQG 758
                L  ++V                      L  QL    +VL+ V+R I+     P  
Sbjct: 1822 RRPRLDDSFVL---------------------LHSQL----QVLESVMRCINM--NWP-- 1852

Query: 759  HLLLIGVSGAGKTTLSRFVAFMNG 782
             L+L+G +G GKT+L RF+A + G
Sbjct: 1853 -LILVGDTGVGKTSLLRFLASIFG 1875



 Score = 41.1 bits (96), Expect = 0.006
 Identities = 38/137 (27%), Positives = 63/137 (45%), Gaps = 16/137 (11%)

Query: 397 VPTLDTVRHESLLYTWLAEHKPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTP 456
           V TL+T+   +     L    P+ L G  GSGK+  L+S L       +V ++    T  
Sbjct: 130 VGTLETLDRLAKEAVDLKIAVPIYLEGGRGSGKSF-LISELCDEGGQRIVEIHLREITDA 188

Query: 457 ELLLKTFDHY----CEYRKTPNGVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLR 512
           ++L+ T+        E+ K   GV++  +  G W +LF        +DK A   V+S+L 
Sbjct: 189 KVLIGTYTSPKPGDFEWMK---GVLIEAVVSGDW-ILF------KRIDK-APHGVLSYLL 237

Query: 513 QLIEQRGFYRPADKQWV 529
            L+E+R    P+  + V
Sbjct: 238 TLLEKRRLLIPSRGETV 254



 Score = 36.1 bits (83), Expect = 0.23
 Identities = 106/501 (21%), Positives = 178/501 (35%), Gaps = 105/501 (20%)

Query: 475  GVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFL------RQLIEQRGFYRPADKQW 528
            G+++  ++ G+WL+L  DEINL   +   T  +I  L        L+ + G  RP     
Sbjct: 627  GILVKCMRNGEWLLL--DEINLASEE---TLELIDGLLGKKESGILLSESGDLRPIRIH- 680

Query: 529  VSLERIQCVGACNPPTDPGRKPLSHRFLRHVPVIYVDYPGETSLKQIYGTFSRAMLRLIP 588
                  +  G  NP TD G++ L   F      IYV  P  TSLK +     + + R   
Sbjct: 681  ---PGFRLFGCMNPGTDVGKRKLEGSFRSRFTEIYVHSP-STSLKDLLSIIHKYIGRYEI 736

Query: 589  PLRGYADALTNAMVELYLASQEKF-------TQDMQPHYVYSPREMTRWVRGICEAIRPL 641
                  + ++  + ELYL ++              +PH   S R + R +  +       
Sbjct: 737  S----EERVSRKIAELYLEAKSLSLGKELVDGAVQEPH--VSGRTLVRTLNYVTNIGE-- 788

Query: 642  ESLTVEGLVRLWAHEALRLFQDRLVNDVERQWTNENID---AVAMKYFSNIDKEVLARPI 698
                + GL R   +E   +    L++ V        +       ++  + I+K+    P 
Sbjct: 789  ----IYGLRRS-LYEGFCMSFLTLLDFVSESILVSLLSKFTLGLLESQNAIEKQKPKVP- 842

Query: 699  LYSNWLSKNYVPVGTTELREYVQARLKVFYEEELDVQLVLFDEVLDHVLRIDRIFRQPQG 758
                    +Y+         Y +       EE+     ++   V  + L   R       
Sbjct: 843  ------DHSYI-----AFCHYWKHGGSFPVEEQ--EHYIITPFVQKNYLNTMRAASLSNF 889

Query: 759  HLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHN-----KYTGADFDEDLRTVLRRSGCK 813
             LL+ G + +GKT++  ++A   G    +I  H      +Y G    +D  ++  + G  
Sbjct: 890  PLLIQGPTSSGKTSMILYLARETGHKFVRINNHEHTDLQEYIGTYVTDDDGSLSFKEGVL 949

Query: 814  NEKIA----FLLDESNVLESGFLERMNTLLANGEIPGLFEGDEYTTLMTQCKEGAQREGL 869
             E +      +LDE N+  +  LE +N L                               
Sbjct: 950  VEALRRGYWIVLDELNLAPTDVLEALNRL------------------------------- 978

Query: 870  MLDSNEELYKWFTQQVMK---NLHVVFTMNPSSEGLKDRAATSPALFNRCVLNWFGDWSD 926
             LD N EL+   TQ+V+    N  +  T NP   G   R   S A  NR +   F D  +
Sbjct: 979  -LDDNRELFIPETQEVVVPHPNFRLFATQNPPG-GYGGRKGLSRAFRNRFLEMHFDDIPE 1036

Query: 927  TALY-------QVAKEFTSKI 940
              L        ++A  +  KI
Sbjct: 1037 DELEEILHGRCEIAPSYAKKI 1057



 Score = 32.3 bits (73), Expect = 2.7
 Identities = 44/165 (26%), Positives = 75/165 (45%), Gaps = 19/165 (11%)

Query: 418  PLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTF----DHYCEYRKTP 473
            PL++ GP  SGKT +++  L      + V +N    T  +  + T+    D    +++  
Sbjct: 890  PLLIQGPTSSGKT-SMILYLARETGHKFVRINNHEHTDLQEYIGTYVTDDDGSLSFKE-- 946

Query: 474  NGVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLI-EQRGFYRPADKQWV-SL 531
             GV++  ++ G W+VL  DE+NL   D      V+  L +L+ + R  + P  ++ V   
Sbjct: 947  -GVLVEALRRGYWIVL--DELNLAPTD------VLEALNRLLDDNRELFIPETQEVVVPH 997

Query: 532  ERIQCVGACNPPTD-PGRKPLSHRFLRHVPVIYVDYPGETSLKQI 575
               +     NPP    GRK LS  F      ++ D   E  L++I
Sbjct: 998  PNFRLFATQNPPGGYGGRKGLSRAFRNRFLEMHFDDIPEDELEEI 1042


>gnl|CDD|222104 pfam13401, AAA_22, AAA domain. 
          Length = 124

 Score = 42.3 bits (100), Expect = 2e-04
 Identities = 29/105 (27%), Positives = 40/105 (38%), Gaps = 14/105 (13%)

Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKT----FDHYCEYRKTPN 474
            VL G  GSGKT  L    R LP+  VV +   S  TP+ LL+              T  
Sbjct: 7   GVLTGESGSGKTTLLRRLARQLPNRRVVYVEAPSLGTPKDLLRKILRALGLPLSGGTTAE 66

Query: 475 GV--ILSPIQLGKWLVLFCDEI-NLPDMDKYATQRVISFLRQLIE 516
            +  IL  ++     +L  DE  +L           +  LR L +
Sbjct: 67  LLEAILDALKRRGRPLLIIDEAQHLSL-------EALEELRDLYD 104



 Score = 36.1 bits (84), Expect = 0.029
 Identities = 21/64 (32%), Positives = 28/64 (43%), Gaps = 4/64 (6%)

Query: 754 RQPQGHLLLIGVSGAGKTTLSR-FVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLRRSGC 812
           R+  G  +L G SG+GKTTL R     +    V  + A +  T  D    LR +LR  G 
Sbjct: 1   RRGAGIGVLTGESGSGKTTLLRRLARQLPNRRVVYVEAPSLGTPKDL---LRKILRALGL 57

Query: 813 KNEK 816
               
Sbjct: 58  PLSG 61



 Score = 29.2 bits (66), Expect = 7.0
 Identities = 20/83 (24%), Positives = 29/83 (34%), Gaps = 18/83 (21%)

Query: 49  VGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEALYGVL-----DPNTREWTD 103
            G SGSGK+T  + L + L        V ++  P   + + L   +      P +   T 
Sbjct: 10  TGESGSGKTTLLRRLARQLPNRR----VVYVEAPSLGTPKDLLRKILRALGLPLSGGTTA 65

Query: 104 GLFTHILRR---------IIDNV 117
            L   IL           IID  
Sbjct: 66  ELLEAILDALKRRGRPLLIIDEA 88


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
            represents a relatively rare but broadly distributed
            uncharacterized protein family, distributed in 1-2
            percent of bacterial genomes, all of which have outer
            membranes. In many of these genomes, it is part of a
            two-gene pair.
          Length = 555

 Score = 45.5 bits (108), Expect = 2e-04
 Identities = 33/165 (20%), Positives = 68/165 (41%), Gaps = 23/165 (13%)

Query: 1038 ETVEQVEEMQKSLAVKSQ-------ELQSKNEAANLK-----LKEM-IKDQQEAEKRKVQ 1084
            +TVE  EE++KSL    Q       +L +K +    K     +K+  IK+  E +K K +
Sbjct: 161  KTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEE 220

Query: 1085 SQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSV 1144
               ++ E +       ++++   ++  Q +   + A   + E+KK    +L+ + N  ++
Sbjct: 221  FDKLKKEGKA-----DKQKIKSAKNDLQNDKKQLKAD--LAELKKAPQNDLKRLENKYAI 273

Query: 1145 VKLALESIC-LLLGENATDW--KAIRAVVMRENFINSIVSNFNTE 1186
                L++    L G     +  K ++     E  +N        E
Sbjct: 274  KSGDLKNFAVDLFGPEIRKYLQKFLKYYDQAEPLLNKSKKEPKEE 318


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
            P-loop motif that is characteristic of the AAA
            superfamily. Many of the proteins in this family are
            conjugative transfer proteins. This family includes the
            PrrC protein that is thought to be the active component
            of the anticodon nuclease.
          Length = 713

 Score = 44.2 bits (105), Expect = 6e-04
 Identities = 16/67 (23%), Positives = 30/67 (44%)

Query: 1260 EETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLD 1319
            EE ++LI QLE+ I  Y+    + + +  +I            +  +L A   A++  L+
Sbjct: 278  EEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILE 337

Query: 1320 NVQAKVE 1326
                K+E
Sbjct: 338  KNLQKLE 344



 Score = 32.7 bits (75), Expect = 2.0
 Identities = 19/89 (21%), Positives = 42/89 (47%), Gaps = 3/89 (3%)

Query: 1251 QASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQ 1310
            +  +NKAK +    L+ +L++ I +Y+ E   L     +++ ++  ++A   E   L  +
Sbjct: 388  KKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEA---EIKALEKE 444

Query: 1311 ATAIKTDLDNVQAKVERSMALLKSLGIER 1339
               ++  L N++   +    LLK+ G   
Sbjct: 445  IKELEKQLTNIEPTADEINKLLKAYGFGN 473


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
            Mitofilin controls mitochondrial cristae morphology.
            Mitofilin is enriched in the narrow space between the
            inner boundary and the outer membranes, where it forms a
            homotypic interaction and assembles into a large
            multimeric protein complex. The first 78 amino acids
            contain a typical amino-terminal-cleavable mitochondrial
            presequence rich in positive-charged and hydroxylated
            residues and a membrane anchor domain. In addition, it
            has three centrally located coiled coil domains.
          Length = 493

 Score = 43.1 bits (102), Expect = 0.001
 Identities = 32/168 (19%), Positives = 63/168 (37%), Gaps = 36/168 (21%)

Query: 1012 VKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM 1071
            +     K + LE+Q            E  E  ++ ++ L    QEL+ + EA   KLK  
Sbjct: 210  LARLESKEAALEKQLRLEF-----EREKEELRKKYEEKL---RQELERQAEAHEQKLKNE 261

Query: 1072 IKDQQEAEKRKVQSQDIQAEIEKQ------TVEIAQKRVFVMEDLAQVEPAVMDA----- 1120
            +   Q  E ++  +++I+ ++E++       +     R+  +E          D      
Sbjct: 262  LA-LQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSEAEDENHKVQ 320

Query: 1121 --QQAVKEIK--------------KQQLVELRSMANPPSVVKLALESI 1152
                AV+ +K               ++L  L+ +A    +V  AL S+
Sbjct: 321  QLWLAVEALKSALKSGSAGSPRPLVKELDALKELAKDDELVDAALASL 368


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
            recombination, and repair].
          Length = 908

 Score = 42.8 bits (101), Expect = 0.002
 Identities = 55/367 (14%), Positives = 127/367 (34%), Gaps = 46/367 (12%)

Query: 1002 RHYLDFINHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKN 1061
                +      +L +     L+E +  L     ++ + +E+  E  K L    QEL+ + 
Sbjct: 343  SELEELAEEKNELAKLLEERLKELEERL----EELEKELEKALERLKQLEEAIQELKEEL 398

Query: 1062 EAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQ 1121
               +  L+E+   Q+E E+ + + ++++ E+E+   EI +                 + Q
Sbjct: 399  AELSAALEEI---QEELEELEKELEELERELEELEEEIKKL----------------EEQ 439

Query: 1122 QAVKEIKKQQLVELRSMANPPSVVKLAL----ESICLLLGENATDWKAIR-AVVMRENFI 1176
                E K+  + EL        V    L    E   L L E   +      +    E  +
Sbjct: 440  INQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAEL 499

Query: 1177 NSIVSNFNTEMITDE----VREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYA 1232
               +     E+   E       ++        +   EK               + ++   
Sbjct: 500  REEIEELEKELRELEEELIELLELEEALKEELEEKLEKLEN-----------LLEELEEL 548

Query: 1233 DMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKT 1292
                +++ L+ EL+ LE +  E K    E   L+   ++ +   ++   +L  +   ++ 
Sbjct: 549  KEKLQLQQLKEELRQLEDRLQELKEL-LEELRLLRTRKEELEELRERLKELKKKLKELEE 607

Query: 1293 DLDNVQA--KFYEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSETFR 1350
             L  ++   +  E ++   +    + +L++   K+     L + L    E  E   E   
Sbjct: 608  RLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELE 667

Query: 1351 SQMATII 1357
            +++   +
Sbjct: 668  AEIRREL 674



 Score = 41.7 bits (98), Expect = 0.004
 Identities = 68/350 (19%), Positives = 130/350 (37%), Gaps = 74/350 (21%)

Query: 1010 HFVKLYREKCSELE--EQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLK 1067
            H  +L      ELE  E++L       ++ E +E++E+  + L  +  EL    EA   +
Sbjct: 471  HEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEE 530

Query: 1068 LKEMIKD------QQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQ 1121
            L+E ++       + E  K K+Q Q ++ E+     ++  +   + E L          +
Sbjct: 531  LEEKLEKLENLLEELEELKEKLQLQQLKEELR----QLEDRLQELKELL---------EE 577

Query: 1122 QAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVS 1181
              +   +K++L ELR                   L E     K +     R + +  ++ 
Sbjct: 578  LRLLRTRKEELEELRER-----------------LKELKKKLKELEE---RLSQLEELLQ 617

Query: 1182 NFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPL 1241
            +       +E+ E       S      EK N            A  +      L+++E  
Sbjct: 618  SLELSEAENELEEA-EEELESE----LEKLN----------LQAELEELLQAALEELEEK 662

Query: 1242 RLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKF 1301
              EL++ E++    + + EE      QLE+      +E  QL  +   ++ +L+ +  K 
Sbjct: 663  VEELEA-EIRRELQRIENEE------QLEE----KLEELEQLEEELEQLREELEELLKKL 711

Query: 1302 YEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSETFRS 1351
             E  QLI +       L++ +A++E     L+ L    E  E   E    
Sbjct: 712  GEIEQLIEE-------LESRKAELEELKKELEKLEKALELLEELREKLGK 754



 Score = 40.5 bits (95), Expect = 0.008
 Identities = 55/381 (14%), Positives = 139/381 (36%), Gaps = 55/381 (14%)

Query: 982  LHKANARLSKRGSRTMAI--TPRHYLDFINHFVKLYREKCSELEEQQLHLNVGLGKIAET 1039
            L +A   L +  +   A     +  L+ +   ++    +  ELEE+   L   + ++   
Sbjct: 387  LEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESK 446

Query: 1040 VEQVEEMQKSLA---VKSQELQ--SKNEAANLK------LKEMIKDQQEAEKRKVQSQDI 1088
               + E+  +     V  QEL    + E   L       L+E +  ++E  + + + +++
Sbjct: 447  ELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEEL 506

Query: 1089 QAEI---EKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVE-LRSMANPPSV 1144
            + E+   E++ +E+ +    + E+L +    + +  + ++E+K++  ++ L+        
Sbjct: 507  EKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLED 566

Query: 1145 VKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNT-EMITDEVREKMHSRYLSN 1203
                L+   LL        +      +RE  +  +       E    ++ E + S     
Sbjct: 567  RLQELKE--LLEELRLLRTRKEELEELRER-LKELKKKLKELEERLSQLEELLQSL---E 620

Query: 1204 PDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETK 1263
               +  +   A                        E L  EL+ L +QA   +      +
Sbjct: 621  LSEAENELEEAE-----------------------EELESELEKLNLQAELEELL----Q 653

Query: 1264 DLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQA 1323
              + +LE+ +   + E  + + +    +   + ++    E  QL  +   ++ +L+ +  
Sbjct: 654  AALEELEEKVEELEAEIRRELQRIENEEQLEEKLE----ELEQLEEELEQLREELEELLK 709

Query: 1324 KVERSMALLKSLGIERERWEA 1344
            K+     L++ L   +   E 
Sbjct: 710  KLGEIEQLIEELESRKAELEE 730



 Score = 35.5 bits (82), Expect = 0.33
 Identities = 63/369 (17%), Positives = 146/369 (39%), Gaps = 45/369 (12%)

Query: 1013 KLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMI 1072
            +   E   E+EE +  L  GL  + E +E++ E  KSL  + ++L+ K E    +L+E+ 
Sbjct: 291  ERLEELEREIEELEEELE-GLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELA 349

Query: 1073 KDQ----QEAEKRKVQSQDIQAEIEKQTV---------EIAQKRVFVMEDLAQVEPAVMD 1119
            +++    +  E+R  + ++   E+EK+           E A +   + E+LA++  A+ +
Sbjct: 350  EEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQE--LKEELAELSAALEE 407

Query: 1120 AQQAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSI 1179
             Q+ ++E++K +L EL         ++   E I  L        + I  +  +E  I  +
Sbjct: 408  IQEELEELEK-ELEELE------RELEELEEEIKKLE-------EQINQLESKELMIAEL 453

Query: 1180 VSNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVE 1239
                    +  +   + H + L        +     ++     +         ++ +  +
Sbjct: 454  AGAGEKCPVCGQELPEEHEKELLELYELELEELEEELS-----REKEEAELREEIEELEK 508

Query: 1240 PLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQA 1299
             LR   + L       +A  EE ++ + +LE  +   ++   +L  Q    +        
Sbjct: 509  ELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELR------ 562

Query: 1300 KFYEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSETFRSQMATIIGD 1359
               +    + +   +  +L  ++ + E    L + L   +++ +   E   SQ+  ++  
Sbjct: 563  ---QLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERL-SQLEELLQS 618

Query: 1360 VLLSSAYLA 1368
            + LS A   
Sbjct: 619  LELSEAENE 627



 Score = 31.7 bits (72), Expect = 4.5
 Identities = 56/351 (15%), Positives = 126/351 (35%), Gaps = 28/351 (7%)

Query: 1012 VKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM 1071
            ++   E+ +ELEE++  L     ++ E      E  K    + +EL+   E    K++ +
Sbjct: 234  IEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERL 293

Query: 1072 ---IK----DQQEAEKRKVQSQDIQAEIEKQTVEIAQ---------KRVFVMEDLAQVEP 1115
                +     ++E E  +   ++++  +EK      +         K    +E+LA+ + 
Sbjct: 294  EELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKN 353

Query: 1116 AVMDAQQAVKEIKKQQLVELRSMANPPSVVKLAL-ESICLLLGENATDWKAIRAVVMREN 1174
             +    +   +  +++L EL             L E+I  L  E A    A+  +     
Sbjct: 354  ELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELE 413

Query: 1175 FINSIVSNFNTEM--ITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYA 1232
             +   +     E+  + +E+++                    +    P V        + 
Sbjct: 414  ELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCP-VCGQELPEEHE 472

Query: 1233 DMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKT 1292
              L   E   LEL+ LE + S  K    E  +L  ++E+     ++   +LI      + 
Sbjct: 473  KELL--ELYELELEELEEELSREK----EEAELREEIEELEKELRELEEELIELLELEEA 526

Query: 1293 DLDNVQAKFYEYAQLIAQATAIKTDL--DNVQAKVERSMALLKSLGIERER 1341
              + ++ K  +   L+ +   +K  L    ++ ++ +    L+ L    E 
Sbjct: 527  LKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEE 577



 Score = 31.3 bits (71), Expect = 5.4
 Identities = 21/117 (17%), Positives = 60/117 (51%), Gaps = 9/117 (7%)

Query: 1015 YREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKD 1074
               + SEL E    L   L +  + ++++EE+Q+    + +EL+ + EA    L+E + +
Sbjct: 190  LEGQLSELLEDIEDLLEALEEELKELKKLEEIQE--EQEEEELEQEIEA----LEERLAE 243

Query: 1075 QQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQ 1131
             +E ++R    ++++A + +      +      E+L ++E  + + ++ ++ +++ +
Sbjct: 244  LEEEKER---LEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELE 297


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
            protein; Reviewed.
          Length = 782

 Score = 41.4 bits (98), Expect = 0.004
 Identities = 28/111 (25%), Positives = 51/111 (45%), Gaps = 4/111 (3%)

Query: 1036 IAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM--IKDQQEAEKRKVQSQDIQAEIE 1093
            I E  E++ E+  SL    +EL+ K E A   LKE   +K++ E +K K+Q +     +E
Sbjct: 511  IGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE-EDKLLE 569

Query: 1094 KQTVEIAQKRVFVMEDLAQVEPAVMDAQQA-VKEIKKQQLVELRSMANPPS 1143
            +   E  Q      ++  ++   +   Q+     +K  +L+E R   N  +
Sbjct: 570  EAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKAN 620



 Score = 31.3 bits (72), Expect = 4.8
 Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 16/81 (19%)

Query: 1233 DMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKT 1292
            +++  +E L  EL+    +A E +A  +E + L  +LE+     ++E  +L+ +A     
Sbjct: 520  ELIASLEELERELEQ---KAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEK--- 573

Query: 1293 DLDNVQAKFYEYAQLIAQATA 1313
                      E  Q I +A  
Sbjct: 574  ----------EAQQAIKEAKK 584


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
            archaeal type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. It is found
            in a single copy and is homodimeric in prokaryotes, but
            six paralogs (excluded from this family) are found in
            eukarotes, where SMC proteins are heterodimeric. This
            family represents the SMC protein of archaea and a few
            bacteria (Aquifex, Synechocystis, etc); the SMC of other
            bacteria is described by TIGR02168. The N- and C-terminal
            domains of this protein are well conserved, but the
            central hinge region is skewed in composition and highly
            divergent [Cellular processes, Cell division, DNA
            metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 41.6 bits (98), Expect = 0.004
 Identities = 50/324 (15%), Positives = 122/324 (37%), Gaps = 43/324 (13%)

Query: 1032 GLGKIAETVEQVEEMQKSLAV----------KSQELQSKNEAANLKLKEMIKDQQEAEKR 1081
               ++    E++E +++ L+           +  EL  +   A+ K+ E+ K+ ++ E+ 
Sbjct: 672  EPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQE 731

Query: 1082 KVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANP 1141
            + + ++   E+E+    + Q+   V  +L ++E A ++  +      ++ L +L +  + 
Sbjct: 732  EEKLKERLEELEEDLSSLEQEIENVKSELKELE-ARIEELEEDLHKLEEALNDLEARLSH 790

Query: 1142 PSVVKLALESICLLLGENATDWKAIRAVVMR-ENFINSIVSNFNTEMITDEVREKMHSRY 1200
              + ++  E               +   V R E  +  I    N   +  E  EK     
Sbjct: 791  SRIPEIQAEL------------SKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQEL 838

Query: 1201 LSNPDY--SYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAK 1258
                       K+    +      K  + +    ++   +  L   L  L+ +  E +A+
Sbjct: 839  QEQRIDLKEQIKSIEKEIENLNGKKEELEEE-LEELEAALRDLESRLGDLKKERDELEAQ 897

Query: 1259 GEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKT-- 1316
              E +  I +LE  I   +   ++L A+           +A   E +++       +   
Sbjct: 898  LRELERKIEELEAQIEKKRKRLSELKAKL----------EALEEELSEIEDPKGEDEEIP 947

Query: 1317 ----DLDNVQAKVERSMALLKSLG 1336
                 L++VQA+++R    +++L 
Sbjct: 948  EEELSLEDVQAELQRVEEEIRALE 971



 Score = 40.1 bits (94), Expect = 0.012
 Identities = 53/338 (15%), Positives = 128/338 (37%), Gaps = 44/338 (13%)

Query: 1021 ELEEQQLHLNVGLGKIAETVEQVEEMQ--KSLAVKSQELQSKNEAANLKLKEMIKDQQEA 1078
            ELEE + ++      I E  +Q+E ++  +  A + Q L  +            K+  E 
Sbjct: 178  ELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALER 237

Query: 1079 EKRKVQSQ--DIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELR 1136
            +K  ++ Q   ++ E+EK T EI++          ++E      ++  K+IK     E  
Sbjct: 238  QKEAIERQLASLEEELEKLTEEISELE-------KRLEEIEQLLEELNKKIKDLGEEEQL 290

Query: 1137 SMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREKM 1196
             +      ++  + S+     E +   K        E                +E+  ++
Sbjct: 291  RVKEKIGELEAEIASL-----ERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREI 345

Query: 1197 HSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENK 1256
                        E+  R  +              YA++ +++E LR EL+ ++ + +E +
Sbjct: 346  EE----------ERKRRDKL-----------TEEYAELKEELEDLRAELEEVDKEFAETR 384

Query: 1257 AKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKT 1316
             + ++ ++ + +L++ I   K E  +L  +   +  +L ++ A     A + A+   ++ 
Sbjct: 385  DELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAI---AGIEAKINELEE 441

Query: 1317 DLDNVQAKVERS----MALLKSLGIERERWEATSETFR 1350
            + ++   ++++       L   L    +      E + 
Sbjct: 442  EKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYD 479



 Score = 39.3 bits (92), Expect = 0.018
 Identities = 56/305 (18%), Positives = 112/305 (36%), Gaps = 40/305 (13%)

Query: 1015 YREKCSELEEQQLHLNVGLGKIAET--------VEQVEEMQKSLAVKSQELQSKNEAANL 1066
            Y+    E  E + +  +   +  E         +  +EE  + L  +  EL+ + E    
Sbjct: 213  YQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQ 272

Query: 1067 KLKEM---IKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQA 1123
             L+E+   IKD  E E+ +     ++ +I +   EIA     + E   ++E A     + 
Sbjct: 273  LLEELNKKIKDLGEEEQLR-----VKEKIGELEAEIASLERSIAEKERELEDAEERLAKL 327

Query: 1124 VKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNF 1183
              EI K    E+  +       +   + +     E   + + +RA +   +         
Sbjct: 328  EAEIDKLLA-EIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVD-----KEFA 381

Query: 1184 NTEMITDEVREKM------HSRYLSNPDYSYEKANRASMACGPM---VKWAIAQISYADM 1234
             T     + REK+       +      D   E+  R S     +   +    A+I   ++
Sbjct: 382  ETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKI--NEL 439

Query: 1235 LKKVEPLRLELKSLEVQASENKAKGEE-------TKDLITQLEKSIASYKDEYAQLIAQA 1287
             ++ E   LE+K  E +  +  A   +        K+   ++EK ++  + E A+  AQA
Sbjct: 440  EEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQA 499

Query: 1288 TAIKT 1292
             A + 
Sbjct: 500  RASEE 504



 Score = 35.4 bits (82), Expect = 0.29
 Identities = 25/118 (21%), Positives = 52/118 (44%), Gaps = 12/118 (10%)

Query: 1227 AQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQ 1286
                 +D  +K+  +  E++ LE +  + K + EE ++ ++ LE+ I + K E  +L A+
Sbjct: 707  LSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEAR 766

Query: 1287 ATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEA 1344
               ++ DL  ++    +          ++  L    +++    A L  L  E  R EA
Sbjct: 767  IEELEEDLHKLEEALND----------LEARLS--HSRIPEIQAELSKLEEEVSRIEA 812



 Score = 33.9 bits (78), Expect = 0.92
 Identities = 23/128 (17%), Positives = 61/128 (47%), Gaps = 17/128 (13%)

Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
             E+ S +E +   +   L ++    E +E+       + QELQ +      +  + I+ +
Sbjct: 804  EEEVSRIEARLREIEQKLNRLTLEKEYLEK-------EIQELQEQRIDLKEQ-IKSIEKE 855

Query: 1076 QE---AEKRKVQSQ--DIQAEI---EKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEI 1127
             E    +K +++ +  +++A +   E +  ++ ++R  +   L ++E  + + +  + E 
Sbjct: 856  IENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQI-EK 914

Query: 1128 KKQQLVEL 1135
            K+++L EL
Sbjct: 915  KRKRLSEL 922



 Score = 32.7 bits (75), Expect = 1.9
 Identities = 30/143 (20%), Positives = 68/143 (47%), Gaps = 10/143 (6%)

Query: 1010 HFVKLY-REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKL 1068
               K Y  ++  EL+EQ++ L     +I    +++E +        +EL+   EAA   L
Sbjct: 825  TLEKEYLEKEIQELQEQRIDLKE---QIKSIEKEIENLNGKKEELEEELEEL-EAALRDL 880

Query: 1069 KEMIKD-QQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEI 1127
            +  + D ++E ++ + Q ++++ +IE+   +I +KR  + E  A++E      ++ + EI
Sbjct: 881  ESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE----ALEEELSEI 936

Query: 1128 KKQQLVELRSMANPPSVVKLALE 1150
            +  +  +        S+  +  E
Sbjct: 937  EDPKGEDEEIPEEELSLEDVQAE 959


>gnl|CDD|223786 COG0714, COG0714, MoxR-like ATPases [General function prediction
           only].
          Length = 329

 Score = 40.5 bits (95), Expect = 0.005
 Identities = 61/274 (22%), Positives = 96/274 (35%), Gaps = 30/274 (10%)

Query: 393 SDVVVPTLDTVRHESLLYTWLAEHKPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSS 452
             VVV   + +    L    L     ++L GPPG GKT+   +  RAL  +  V +  + 
Sbjct: 23  EKVVVGDEEVIE---LALLALLAGGHVLLEGPPGVGKTLLARALARAL-GLPFVRIQCTP 78

Query: 453 ATTPELLLKTFD---HYCEYRKTPNGVILS-PIQLGKWLVLFCDEINLPDMDKYATQRVI 508
              P  LL T+       E  +      +  P+     ++L  DEIN       A   V 
Sbjct: 79  DLLPSDLLGTYAYAALLLEPGEF---RFVPGPLFAAVRVILLLDEINR------APPEVQ 129

Query: 509 SFLRQLIEQRGFYRPADKQWVSLERIQCVGA-CNPPTDPGRKPLSHRFLRHVPV-IYVDY 566
           + L + +E+R    P     + L     V A  NP    G  PL    L    + IYVDY
Sbjct: 130 NALLEALEERQVTVP-GLTTIRLPPPFIVIATQNPGEYEGTYPLPEALLDRFLLRIYVDY 188

Query: 567 PGETSLKQIYGTFSRAMLRLIPPLRGYADALTNAMVELYLASQEKFTQDMQPHYVYSPRE 626
           P     ++I    +R        L      +     E  L  Q++  +      V    E
Sbjct: 189 PDSEEEERII--LARVGGVDELDLESLVKPVL--SDEELLRLQKEVKK------VPVSDE 238

Query: 627 MTRWVRGICEAIRPLESLTVEGLVRLWAHEALRL 660
           +  ++  +  A+R    + +    R        L
Sbjct: 239 VIDYIVTLVAALREAPDVALGASPRASLALLAAL 272



 Score = 37.0 bits (86), Expect = 0.068
 Identities = 31/123 (25%), Positives = 49/123 (39%), Gaps = 19/123 (15%)

Query: 729 EEELDVQLVLFDEVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAFMNGLSVFQI 788
             EL+  +V  +EV++  L          GH+LL G  G GKT L+R +A   GL   +I
Sbjct: 19  RSELEKVVVGDEEVIELALLALLA----GGHVLLEGPPGVGKTLLARALARALGLPFVRI 74

Query: 789 RAHNKYTGADFD--EDLRTVLRRSGCK---------NEKIAFLLDESNV----LESGFLE 833
           +       +D         +L   G             ++  LLDE N     +++  LE
Sbjct: 75  QCTPDLLPSDLLGTYAYAALLLEPGEFRFVPGPLFAAVRVILLLDEINRAPPEVQNALLE 134

Query: 834 RMN 836
            + 
Sbjct: 135 ALE 137


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein TolA;
            Provisional.
          Length = 387

 Score = 40.6 bits (95), Expect = 0.006
 Identities = 26/115 (22%), Positives = 48/115 (41%), Gaps = 12/115 (10%)

Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
            +E+  +LE+++L          E  +Q EE  K    ++   Q + E A  K     K +
Sbjct: 100  QERLKQLEKERL-------AAQEQKKQAEEAAK----QAALKQKQAEEAAAKAAAAAKAK 148

Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
             EAE ++  +   +A  E +    A+       + A+ +     A +A  E KK+
Sbjct: 149  AEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE-AKKKAEAEAAAKAAAEAKKK 202



 Score = 38.2 bits (89), Expect = 0.031
 Identities = 27/96 (28%), Positives = 46/96 (47%), Gaps = 5/96 (5%)

Query: 1044 EEMQKSLAVKSQELQSKNEAANLKLKEMIKD---QQEAEKRKVQSQDIQAEIEKQTVEIA 1100
            E+ +K    +++ELQ K  A   +LK++ K+    QE +K+  ++    A  +KQ  E A
Sbjct: 79   EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAA 138

Query: 1101 QKRVFVMEDLAQVEPAVMD--AQQAVKEIKKQQLVE 1134
             K     +  A+ E       A++A  E KK+   E
Sbjct: 139  AKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAE 174



 Score = 33.6 bits (77), Expect = 0.81
 Identities = 28/126 (22%), Positives = 49/126 (38%), Gaps = 13/126 (10%)

Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
            R+K  + + ++L         AE     +  ++ LA + Q+ Q   EAA        K +
Sbjct: 81   RKKKEQQQAEELQQKQ----AAEQERLKQLEKERLAAQEQKKQ-AEEAAKQAAL---KQK 132

Query: 1076 QEAEKRKVQSQD--IQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLV 1133
            Q  E     +     +AE E +    A K+       A+ +     A++A  E KK+   
Sbjct: 133  QAEEAAAKAAAAAKAKAEAEAKRAAAAAKKA---AAEAKKKAEAEAAKKAAAEAKKKAEA 189

Query: 1134 ELRSMA 1139
            E  + A
Sbjct: 190  EAAAKA 195



 Score = 30.5 bits (69), Expect = 8.1
 Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 11/80 (13%)

Query: 1055 QELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVE 1114
            Q+ Q K+     + ++  + QQ  E +    Q   AE E+   ++ ++R+   E   Q E
Sbjct: 67   QQQQQKSAKRAEEQRKKKEQQQAEELQ----QKQAAEQERLK-QLEKERLAAQEQKKQAE 121

Query: 1115 PAVMDAQQAVKEIKKQQLVE 1134
                  + A +   KQ+  E
Sbjct: 122  ------EAAKQAALKQKQAE 135


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
            the bacterial cell cycle, the tubulin-like cell-division
            protein FtsZ polymerises into a ring structure that
            establishes the location of the nascent division site.
            EzrA modulates the frequency and position of FtsZ ring
            formation.
          Length = 559

 Score = 40.6 bits (96), Expect = 0.007
 Identities = 26/126 (20%), Positives = 56/126 (44%), Gaps = 14/126 (11%)

Query: 1222 VKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYA 1281
            +K     +    + + +E +  EL  L+    +N+ + EE KD   +L K++ +    Y 
Sbjct: 103  IKEIEELL--DTIEEDIEQILEELNELKESEEKNRKEVEELKDKYRELRKTLLAKSFSYG 160

Query: 1282 QLIAQATAIKTDLDNVQAKFYEYAQL-----IAQAT----AIKTDLDNVQAKVERSMALL 1332
              +     ++  LD ++ +F ++ +L       +A      ++ + D ++ K+E    LL
Sbjct: 161  PAL---DELEKQLDELEEEFEQFVELTESGDYLEAREVLLKLEEETDALEQKMEEIPPLL 217

Query: 1333 KSLGIE 1338
            K L  E
Sbjct: 218  KELQNE 223



 Score = 31.8 bits (73), Expect = 3.8
 Identities = 23/100 (23%), Positives = 53/100 (53%), Gaps = 8/100 (8%)

Query: 1038 ETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQE-----AEKRKVQS--QDIQA 1090
            + + +++ +Q+S  +   EL++  E    +L+E+     +     AEK+   S  Q+   
Sbjct: 324  QLLLELDRLQQSYTLNEDELETVRELEK-QLEELEAQYDQLVERIAEKKVAYSELQEELE 382

Query: 1091 EIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
            EIEKQ  EI +++  + E L  +    ++A++ ++E +++
Sbjct: 383  EIEKQLEEIEKEQEELSESLQGLRKDELEAREKLQEYRQK 422



 Score = 30.6 bits (70), Expect = 9.0
 Identities = 26/125 (20%), Positives = 54/125 (43%), Gaps = 26/125 (20%)

Query: 1020 SELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAE 1079
            SEL+E+         +I + +E++E+ Q+ L+   Q L+     A  KL+E  +   E  
Sbjct: 375  SELQEELE-------EIEKQLEEIEKEQEELSESLQGLRKDELEAREKLQEYRQKLHEI- 426

Query: 1080 KRKVQSQDI--------------QAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVK 1125
            KR ++  ++                EIE    E+ +  +    ++  V   + +A   V+
Sbjct: 427  KRYLEKSNLPGLPEDYLEYFFVVSDEIEALADELNEVPI----NMDAVNRQLEEATDDVE 482

Query: 1126 EIKKQ 1130
             +K++
Sbjct: 483  TLKEK 487


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
            and conversion].
          Length = 161

 Score = 38.8 bits (91), Expect = 0.007
 Identities = 17/102 (16%), Positives = 41/102 (40%), Gaps = 8/102 (7%)

Query: 1035 KIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKD------QQEAEKRKVQSQDI 1088
            KIA+ + + E +++       E + + E A  +  E+I+       Q   E +    +++
Sbjct: 41   KIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEEL 100

Query: 1089 QAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
            +   E    EI  ++   +E+L         A    +++  +
Sbjct: 101  ERIKEAAEAEIEAEKERALEELRAE--VAELAVAIAEKLLGK 140



 Score = 34.6 bits (80), Expect = 0.16
 Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 7/71 (9%)

Query: 1069 KEMIKDQ-QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEI 1127
            +  I D   EAE+ K ++Q + AE E++ +E A+++   + + A+ E     A+Q  +EI
Sbjct: 39   QAKIADDLAEAERLKEEAQALLAEYEQE-LEEAREQASEIIEQAKKE-----AEQIAEEI 92

Query: 1128 KKQQLVELRSM 1138
            K +   EL  +
Sbjct: 93   KAEAEEELERI 103


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
            includes outer membrane proteins such as OmpH among
            others. Skp (OmpH) has been characterized as a molecular
            chaperone that interacts with unfolded proteins as they
            emerge in the periplasm from the Sec translocation
            machinery.
          Length = 157

 Score = 37.6 bits (88), Expect = 0.014
 Identities = 20/95 (21%), Positives = 49/95 (51%), Gaps = 5/95 (5%)

Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQS--QDIQAEIEKQTVE 1098
            ++ +++Q  L  K +ELQ + +    +   + ++ ++A+++++Q   Q++Q + +    E
Sbjct: 42   KEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQE 101

Query: 1099 IAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLV 1133
            + QK+    E L  +   +  A + V + K   LV
Sbjct: 102  LQQKQ---QELLQPIYDKIDKAIKEVAKEKGYDLV 133



 Score = 29.5 bits (67), Expect = 7.0
 Identities = 19/96 (19%), Positives = 48/96 (50%), Gaps = 12/96 (12%)

Query: 1046 MQKSLAVKS--QELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKR 1103
            + +S A K+  ++L+ + +    +L++  K+ Q+ E      Q +Q +    + E  + +
Sbjct: 27   LSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEE------QKLQKQAATLSEEARKAK 80

Query: 1104 VFVMEDLAQVEPAVMDAQQAV-KEIKKQQLVELRSM 1138
                ++L Q +  +   QQA  +E++++Q   L+ +
Sbjct: 81   ---QQELQQKQQELQQKQQAAQQELQQKQQELLQPI 113


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
            or mitostatin, was first defined as a meiosis-specific
            nuclear structural protein. It has since been linked with
            mitochondrial movement. It is associated with the
            mitochondrial outer membrane, and over-expression leads
            to reduction in mitochondrial motility whereas lack of it
            enhances mitochondrial movement. The activity appears to
            be mediated through binding the mitochondria to the actin
            intermediate filaments (IFs).
          Length = 349

 Score = 39.1 bits (92), Expect = 0.016
 Identities = 24/101 (23%), Positives = 52/101 (51%), Gaps = 5/101 (4%)

Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIK-DQQEAEKRKVQSQDIQAEIEKQTVEI 1099
            EQ+EE ++ L  +  E +++ E    K  E  + +Q+ AEKR+++  + + E+E+Q  E 
Sbjct: 241  EQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEK 300

Query: 1100 AQKRVFVMEDLAQVEPAVMDA----QQAVKEIKKQQLVELR 1136
             ++R    E+  +    + +     Q  ++E +++ L E  
Sbjct: 301  EERRAAEREEELEEGERLREEEAERQARIEEERQRLLKEHA 341



 Score = 33.3 bits (77), Expect = 1.0
 Identities = 20/97 (20%), Positives = 50/97 (51%), Gaps = 1/97 (1%)

Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAE-KRKVQSQDIQAEIEKQTVEI 1099
             Q+EE ++  A + +E +  +E    +  + + +++E E KRK + ++ +A +++Q  E 
Sbjct: 24   AQIEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEER 83

Query: 1100 AQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELR 1136
             ++R    E+  Q    + +  + ++E  + +  E R
Sbjct: 84   EKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKR 120



 Score = 32.2 bits (74), Expect = 2.5
 Identities = 16/60 (26%), Positives = 31/60 (51%)

Query: 1044 EEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKR 1103
            EE +K    K +  +++ E    K + + +++ E E  + +  + QAE E+   E A+KR
Sbjct: 223  EEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKR 282



 Score = 31.8 bits (73), Expect = 3.1
 Identities = 28/105 (26%), Positives = 48/105 (45%), Gaps = 2/105 (1%)

Query: 1036 IAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQ 1095
            +A    Q EE +       +      +    + KE  K+++EAEKR+ Q Q++Q   E+Q
Sbjct: 184  VARLRAQQEEAEDEREELDELRADLYQEEYER-KERQKEKEEAEKRRRQKQELQRAREEQ 242

Query: 1096 TVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMAN 1140
             +E  ++R+       + E   M  +QA  E  +Q+  E R M  
Sbjct: 243  -IEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKR 286


>gnl|CDD|144608 pfam01078, Mg_chelatase, Magnesium chelatase, subunit ChlI.
           Magnesium-chelatase is a three-component enzyme that
           catalyzes the insertion of Mg2+ into protoporphyrin IX.
           This is the first unique step in the synthesis of
           (bacterio)chlorophyll. Due to this, it is thought that
           Mg-chelatase has an important role in channelling inter-
           mediates into the (bacterio)chlorophyll branch in
           response to conditions suitable for photosynthetic
           growth. ChlI and BchD have molecular weight between
           38-42 kDa.
          Length = 207

 Score = 38.3 bits (90), Expect = 0.016
 Identities = 44/156 (28%), Positives = 57/156 (36%), Gaps = 54/156 (34%)

Query: 419 LVLCGPPGSGKTMTLLSALRA-LPDME---------VVSLNFSSATTPELLLKTFDHYCE 468
           L++ GPPGSGKTM L   L   LP +          + S+         +  + F     
Sbjct: 25  LLMIGPPGSGKTM-LAKRLPGILPPLTEQEALEVTAIHSVAGLGGDGGLIRRRPFRA--- 80

Query: 469 YRKTPN-----------GVILSP--IQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLI 515
               P+           G I  P  I L    VLF DE  LP+  +    RV+  LRQ +
Sbjct: 81  ----PHHSASAAALVGGGSIPRPGEISLAHNGVLFLDE--LPEFSR----RVLESLRQPL 130

Query: 516 E------QRGFYR---PADKQWVSLERIQCVGACNP 542
           E       R   +   PA        R Q V A NP
Sbjct: 131 EDGEITISRARAKVTFPA--------RFQLVAAMNP 158



 Score = 31.3 bits (72), Expect = 3.1
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 43 NHGLMMVGPSGSGKST 58
           H L+M+GP GSGK+ 
Sbjct: 22 GHNLLMIGPPGSGKTM 37


>gnl|CDD|178532 PLN02946, PLN02946, cysteine-tRNA ligase.
          Length = 557

 Score = 39.1 bits (91), Expect = 0.018
 Identities = 43/169 (25%), Positives = 73/169 (43%), Gaps = 32/169 (18%)

Query: 977  YVHQTLHKANARLSKRGSR-TMAITPRHYLDFINHFVKLYREKCSELEEQQLHLNVGLGK 1035
            Y++QTLH     L +  S       P   L+ IN F   +    S+     LH  V L  
Sbjct: 375  YIYQTLHDCEESLQQHDSTFEKDSVPPDTLNCINKFHDEFVTSMSD----DLHTPVALAA 430

Query: 1036 IAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKD----------------QQEAE 1079
            ++E ++ + ++  +   K QE + ++ AA   L++ I+D                QQ  E
Sbjct: 431  LSEPLKTINDLLHTRKGKKQEKRLESLAA---LEKKIRDVLSVLGLMPTSYSEALQQLRE 487

Query: 1080 K----RKVQSQDIQAEIEKQTVEIAQKRV----FVMEDLAQVEPAVMDA 1120
            K     K+  + +  +IE++TV    K       + +DLA V  A+MD+
Sbjct: 488  KALRRAKLTEEQVLQKIEERTVARKNKEYEKSDAIRKDLAAVGIALMDS 536


>gnl|CDD|238026 cd00071, GMPK, Guanosine monophosphate kinase (GMPK, EC 2.7.4.8),
           also known as guanylate kinase (GKase), catalyzes the
           reversible phosphoryl transfer from adenosine
           triphosphate (ATP) to guanosine monophosphate (GMP) to
           yield adenosine diphosphate (ADP) and guanosine
           diphosphate (GDP). It plays an essential role in the
           biosynthesis of guanosine triphosphate (GTP). This
           enzyme is also important for the activation of some
           antiviral and anticancer agents, such as acyclovir,
           ganciclovir, carbovir, and thiopurines.
          Length = 137

 Score = 37.1 bits (87), Expect = 0.018
 Identities = 17/39 (43%), Positives = 20/39 (51%), Gaps = 5/39 (12%)

Query: 418 PLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFS-SATT 455
            +VL GP G GK+ TLL  L    D    +  FS S TT
Sbjct: 1   LIVLSGPSGVGKS-TLLKRLLEEFDP---NFGFSVSHTT 35



 Score = 29.4 bits (67), Expect = 7.9
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 46 LMMVGPSGSGKSTAWKVLLK 65
          +++ GPSG GKST  K LL+
Sbjct: 2  IVLSGPSGVGKSTLLKRLLE 21


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 265

 Score = 38.5 bits (90), Expect = 0.019
 Identities = 50/272 (18%), Positives = 102/272 (37%), Gaps = 52/272 (19%)

Query: 1052 VKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLA 1111
            V +  L  K +  + KL E+ K+++  +    + + +  +IE    EI  K   + +++ 
Sbjct: 24   VFAALLSDKIQNQDSKLSELQKEKKNIQN---EIESLDNQIE----EIQSKIDELQKEID 76

Query: 1112 QVEPAVMDAQQAVKEIKK-----QQLVE--LRSM---ANPPSVVKLALESICLLLGENAT 1161
            Q +  +   Q+ + E+K+     Q+L++   R+M       S + + L S          
Sbjct: 77   QSKAEIKKLQKEIAELKENIVERQELLKKRARAMQVNGTATSYIDVILNS---------- 126

Query: 1162 DWKAIRAVVMRENFINSIVSNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPM 1221
              K+   ++ R   I+ IV   + +++  +  +K   + L     + E            
Sbjct: 127  --KSFSDLISRVTAISVIVD-ADKKILEQQKEDK---KSLEEKQAALEDKLE-------- 172

Query: 1222 VKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYA 1281
                  Q      L  +   + E  +L    +  +A     K  + + +K++A      A
Sbjct: 173  -TLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALGEKAALEE-QKALAE---AAA 227

Query: 1282 QLIAQATAIKTDLDNVQAKFYEYAQLIAQATA 1313
               A+  A         A+  E A L A ATA
Sbjct: 228  AEAAKQEAAA----KAAAQ--EQAALQAAATA 253



 Score = 35.5 bits (82), Expect = 0.17
 Identities = 24/92 (26%), Positives = 38/92 (41%), Gaps = 7/92 (7%)

Query: 1040 VEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEI 1099
            +EQ +E +KSL  K   L+ K E        ++  Q E E +       +AE       +
Sbjct: 150  LEQQKEDKKSLEEKQAALEDKLE-------TLVALQNELETQLNSLNSQKAEKNALIAAL 202

Query: 1100 AQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQ 1131
            A K    + + A +E     A+ A  E  KQ+
Sbjct: 203  AAKEASALGEKAALEEQKALAEAAAAEAAKQE 234



 Score = 32.4 bits (74), Expect = 1.7
 Identities = 24/106 (22%), Positives = 37/106 (34%), Gaps = 3/106 (2%)

Query: 1021 ELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEK 1080
             LEE+Q  L   L  +     ++E    SL  +  E  +   A   K    + ++   E+
Sbjct: 159  SLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALGEKAALEE 218

Query: 1081 RKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKE 1126
            +K  ++   AE  KQ            E  A    A      AV E
Sbjct: 219  QKALAEAAAAEAAKQEAAAKAAA---QEQAALQAAATAAQPSAVTE 261


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 39.6 bits (93), Expect = 0.020
 Identities = 63/343 (18%), Positives = 123/343 (35%), Gaps = 69/343 (20%)

Query: 1014 LYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIK 1073
            LY  +     EQ   +        E   ++ E+ ++ +   Q+ Q+ ++  NL    +  
Sbjct: 295  LYTSRRQLAAEQYRLV--------EMARELAELNEAESDLEQDYQAASDHLNLVQTALR- 345

Query: 1074 DQQEAEKRKVQSQD---IQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
             QQE  +R     +    + E + + VE A       E   + E     A++ V E+K Q
Sbjct: 346  -QQEKIERYQADLEELEERLEEQNEVVEEAD------EQQEENEARAEAAEEEVDELKSQ 398

Query: 1131 ------QLVELRSMANPPSVVKLALESICLLLG------ENATDWKAIRAVVMRENFINS 1178
                   L   ++ A        ALE    L G      +NA DW  +     +E     
Sbjct: 399  LADYQQALDVQQTRAIQYQQAVQALERAKQLCGLPDLTADNAEDW--LEEFQAKE----- 451

Query: 1179 IVSNFNTEMITDEVRE-----KMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQ----- 1228
                   +  T+E+        +     S  + +Y+   + +        W +A+     
Sbjct: 452  -------QEATEELLSLEQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRR 504

Query: 1229 -ISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEK-SIASYKDE--YAQLI 1284
                  + ++++ LR+ L  LE Q    + + E    L+ +  K    +  DE    QL 
Sbjct: 505  LREQRHLAEQLQQLRMRLSELE-QRLRQQQRAER---LLAEFCKRLGKNLDDEDELEQLQ 560

Query: 1285 AQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVER 1327
             +  A    L          ++   +  A++  L+ +QA+++R
Sbjct: 561  EELEARLESLSESV------SEARERRMALRQQLEQLQARIQR 597


>gnl|CDD|222307 pfam13671, AAA_33, AAA domain.  This family of domains contain only
           a P-loop motif, that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           just short fragments so there is no Walker B motif.
          Length = 143

 Score = 36.9 bits (86), Expect = 0.025
 Identities = 10/31 (32%), Positives = 16/31 (51%), Gaps = 1/31 (3%)

Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLN 449
           +++ G PGSGK+ T    L       V+S +
Sbjct: 2   ILMVGLPGSGKS-TFARRLLRELGAVVLSSD 31



 Score = 33.4 bits (77), Expect = 0.30
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 46 LMMVGPSGSGKSTAWKVLLKAL 67
          ++MVG  GSGKST  + LL+ L
Sbjct: 2  ILMVGLPGSGKSTFARRLLREL 23



 Score = 30.0 bits (68), Expect = 5.6
 Identities = 7/16 (43%), Positives = 14/16 (87%)

Query: 760 LLLIGVSGAGKTTLSR 775
           +L++G+ G+GK+T +R
Sbjct: 2   ILMVGLPGSGKSTFAR 17


>gnl|CDD|221983 pfam13207, AAA_17, AAA domain. 
          Length = 114

 Score = 36.1 bits (83), Expect = 0.029
 Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 2/82 (2%)

Query: 761 LLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTG--ADFDEDLRTVLRRSGCKNEKIA 818
           L+ G  G+GK+TL++ +A   G+ V  +    +  G     D +L  +        E + 
Sbjct: 3   LITGPPGSGKSTLAKKLAEKLGIPVISLDDLLREEGLAELDDGELDDIDIDLELLEEILD 62

Query: 819 FLLDESNVLESGFLERMNTLLA 840
            L  +  V++      +   L 
Sbjct: 63  ELAKQEWVIDGVRESTLELRLE 84



 Score = 35.3 bits (81), Expect = 0.053
 Identities = 22/82 (26%), Positives = 37/82 (45%), Gaps = 13/82 (15%)

Query: 47  MMVGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEALYGVLDPNTREWTDGLF 106
           ++ GP GSGKST    L K L    G      +I    + +E     LD    +  D   
Sbjct: 3   LITGPPGSGKST----LAKKLAEKLG----IPVISLDDLLREEGLAELDDGELDDID--- 51

Query: 107 THILRRIIDNVRGEISKRQWII 128
             I   +++ +  E++K++W+I
Sbjct: 52  --IDLELLEEILDELAKQEWVI 71



 Score = 30.7 bits (69), Expect = 1.8
 Identities = 13/29 (44%), Positives = 18/29 (62%), Gaps = 1/29 (3%)

Query: 420 VLCGPPGSGKTMTLLSALRALPDMEVVSL 448
           ++ GPPGSGK+ TL   L     + V+SL
Sbjct: 3   LITGPPGSGKS-TLAKKLAEKLGIPVISL 30


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
            (CALCOCO1) like.  Proteins found in this family are
            similar to the coiled-coil transcriptional coactivator
            protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
            This protein binds to a highly conserved N-terminal
            domain of p160 coactivators, such as GRIP1, and thus
            enhances transcriptional activation by a number of
            nuclear receptors. CALCOCO1 has a central coiled-coil
            region with three leucine zipper motifs, which is
            required for its interaction with GRIP1 and may regulate
            the autonomous transcriptional activation activity of the
            C-terminal region.
          Length = 546

 Score = 38.4 bits (89), Expect = 0.030
 Identities = 31/167 (18%), Positives = 66/167 (39%), Gaps = 26/167 (15%)

Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQEL----------QSKNEAANL 1066
               S++  +   L   L    E  E++EE  K     S+ +          +++N+   L
Sbjct: 164  GDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAENQQRIL 223

Query: 1067 KLKE----MIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQ 1122
            +L++    + + +QE ++    +QDI+AE+E+   E+ Q+   + +   Q      +   
Sbjct: 224  ELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQR---LKKMTIQRRDEETERID 280

Query: 1123 AVKEIK--KQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIR 1167
               E +   + L  L+           + +    LLGE      ++R
Sbjct: 281  LQLENEQLHEDLRTLQERLE-------SSQQKAGLLGEELASLGSLR 320


>gnl|CDD|215649 pfam00004, AAA, ATPase family associated with various cellular
           activities (AAA).  AAA family proteins often perform
           chaperone-like functions that assist in the assembly,
           operation, or disassembly of protein complexes.
          Length = 131

 Score = 36.0 bits (84), Expect = 0.037
 Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 5/67 (7%)

Query: 760 LLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAH---NKYTGADFDEDLRTVLRRSGCKNEK 816
           LLL G  G GKTTL++ VA   G    +I      +KY G + ++ LR +   +      
Sbjct: 1   LLLYGPPGTGKTTLAKAVAKELGAPFIEISGSELVSKYVG-ESEKRLRELFEAAKKLAPC 59

Query: 817 IAFLLDE 823
           +   +DE
Sbjct: 60  V-IFIDE 65



 Score = 35.3 bits (82), Expect = 0.067
 Identities = 32/159 (20%), Positives = 54/159 (33%), Gaps = 39/159 (24%)

Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSAT------TPELLLKTFDHYCEYRKT 472
           L+L GPPG+GKT    +  + L     + ++ S         + + L + F+   +    
Sbjct: 1   LLLYGPPGTGKTTLAKAVAKEL-GAPFIEISGSELVSKYVGESEKRLRELFEAAKKLAPC 59

Query: 473 PNGVILSPIQLGKWLVLFCDEI-----NLPDMDKYATQRVISFLRQLIEQRGFYRPADKQ 527
                          V+F DEI     +        ++RV++ L   +         D  
Sbjct: 60  ---------------VIFIDEIDALAGSRGSGGDSESRRVVNQLLTEL---------DGF 95

Query: 528 WVSLERIQCVGACNPPTDPGRKPLSHRFLRHVPVIYVDY 566
             SL ++  + A N P       L  RF R    I    
Sbjct: 96  TSSLSKVIVIAATNRPDKLDPALLRGRFDRI---IEFPL 131


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 38.6 bits (89), Expect = 0.038
 Identities = 52/277 (18%), Positives = 106/277 (38%), Gaps = 27/277 (9%)

Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLA--VKSQELQSKNEAANLKLKEMIKD 1074
            E+  + EE +      L K AE  ++ EE++K  A   K  E   K E  N    E  K 
Sbjct: 1678 EEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKK 1737

Query: 1075 QQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVE 1134
            + E +K+K +      E +K+   + ++     E++ + + AV++ +   ++ K++  V+
Sbjct: 1738 EAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797

Query: 1135 LR---SMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDE 1191
             +      N  ++++   E   ++      +  AI+ V   +N        F        
Sbjct: 1798 KKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKN 1857

Query: 1192 VREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQ 1251
                         D++ EK  +              +I  AD ++K     ++   +E +
Sbjct: 1858 NENGEDGN--KEADFNKEKDLKEDD---------EEEIEEADEIEK-----IDKDDIERE 1901

Query: 1252 ASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQAT 1288
               N   G+    +  +L+      KDEY +  A+ T
Sbjct: 1902 IPNNNMAGKNNDIIDDKLD------KDEYIKRDAEET 1932



 Score = 37.0 bits (85), Expect = 0.10
 Identities = 25/105 (23%), Positives = 47/105 (44%), Gaps = 1/105 (0%)

Query: 1035 KIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEK 1094
            K AE  ++ +E +K+   K  + ++K +A   K  +  K + E  K+K  +   +AE  K
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKAD-EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAK 1342

Query: 1095 QTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMA 1139
            +  E A+       D A+      +A +  KE  K++    +  A
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKA 1387



 Score = 37.0 bits (85), Expect = 0.11
 Identities = 26/123 (21%), Positives = 54/123 (43%), Gaps = 9/123 (7%)

Query: 1013 KLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKS---LAVKSQELQSKNEAANLKLK 1069
            K+  E+  + EE++  +     K AE  ++ EE++K+     +K+ E   K E    K +
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678

Query: 1070 EMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKK 1129
            E  K   E +++K      +   E +  E  +K+    E+  + E   +   +   +IK 
Sbjct: 1679 EAKK--AEEDEKKAAEALKKEAEEAKKAEELKKK--EAEEKKKAEE--LKKAEEENKIKA 1732

Query: 1130 QQL 1132
            ++ 
Sbjct: 1733 EEA 1735



 Score = 34.3 bits (78), Expect = 0.70
 Identities = 24/110 (21%), Positives = 45/110 (40%), Gaps = 10/110 (9%)

Query: 1033 LGKIAETVEQVEEMQKSL--AVKSQELQSKNE----AANLKLKEMIKDQQEAEKRKVQSQ 1086
              K AE  ++ +E +K    A K+ E + K E    A   K K     + +  K+K +  
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEA 1482

Query: 1087 DIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELR 1136
                E +K+  E  +K     ++  +   A   A +A K  + ++  E +
Sbjct: 1483 KKADEAKKKAEEAKKK----ADEAKKAAEAKKKADEAKKAEEAKKADEAK 1528



 Score = 33.6 bits (76), Expect = 1.2
 Identities = 26/103 (25%), Positives = 50/103 (48%), Gaps = 10/103 (9%)

Query: 1035 KIAETVEQVEEMQK---SLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAE 1091
            K  E  ++ EE +K   +   K++E +   EAA  +  E   D+ EA + K ++ + + E
Sbjct: 1316 KADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAE-AEAAADEAEAAEEKAEAAEKKKE 1374

Query: 1092 IEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVK---EIKKQQ 1131
              K+  + A+K+    E+  + + A   A++  K   E+KK  
Sbjct: 1375 EAKKKADAAKKKA---EEKKKADEAKKKAEEDKKKADELKKAA 1414



 Score = 30.9 bits (69), Expect = 7.1
 Identities = 31/103 (30%), Positives = 49/103 (47%), Gaps = 12/103 (11%)

Query: 1033 LGKIAETVEQVEEMQK---SLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQ 1089
              K AE  ++ +E +K       K+ EL+ K  AA  K  E  K  +  EK+K      +
Sbjct: 1383 AKKKAEEKKKADEAKKKAEEDKKKADELK-KAAAAKKKADEAKKKAE--EKKKADEAKKK 1439

Query: 1090 AEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVK--EIKKQ 1130
            AE E +  + A+K+    E+  + E A   A++A K  E KK+
Sbjct: 1440 AE-EAKKADEAKKKA---EEAKKAEEAKKKAEEAKKADEAKKK 1478



 Score = 30.5 bits (68), Expect = 9.8
 Identities = 22/104 (21%), Positives = 48/104 (46%), Gaps = 2/104 (1%)

Query: 1037 AETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQT 1096
            A+ +++ EE +K+   K  E + K + A  K +E    + +  K+K +    +A+  K+ 
Sbjct: 1280 ADELKKAEEKKKADEAKKAEEKKKADEAKKKAEE--AKKADEAKKKAEEAKKKADAAKKK 1337

Query: 1097 VEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMAN 1140
             E A+K     +  A+      +A +   E  +++  E +  A+
Sbjct: 1338 AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381


>gnl|CDD|213195 cd03228, ABCC_MRP_Like, ATP-binding cassette domain of multidrug
          resistance protein-like transporters.  The MRP
          (Multidrug Resistance Protein)-like transporters are
          involved in drug, peptide, and lipid export. They
          belong to the subfamily C of the ATP-binding cassette
          (ABC) superfamily of transport proteins. The ABCC
          subfamily contains transporters with a diverse
          functional spectrum that includes ion transport, cell
          surface receptor, and toxin secretion activities. The
          MRP-like family, similar to all ABC proteins, have a
          common four-domain core structure constituted by two
          membrane-spanning domains, each composed of six
          transmembrane (TM) helices, and two nucleotide-binding
          domains (NBD). ABC transporters are a subset of
          nucleotide hydrolases that contain a signature motif,
          Q-loop, and H-loop/switch region, in addition to, the
          Walker A motif/P-loop and Walker B motif commonly found
          in a number of ATP- and GTP-binding and hydrolyzing
          proteins.
          Length = 171

 Score = 36.6 bits (86), Expect = 0.043
 Identities = 14/24 (58%), Positives = 17/24 (70%)

Query: 49 VGPSGSGKSTAWKVLLKALERYEG 72
          VGPSGSGKST  K+LL+  +   G
Sbjct: 34 VGPSGSGKSTLLKLLLRLYDPTSG 57


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of proteins
            contain a band 4.1 domain (pfam00373), at their amino
            terminus. This family represents the rest of these
            proteins.
          Length = 244

 Score = 37.4 bits (87), Expect = 0.044
 Identities = 29/148 (19%), Positives = 65/148 (43%), Gaps = 6/148 (4%)

Query: 1021 ELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQS---KNEAANLKLKEMI-KDQQ 1076
            ELEE+   +   + +  + +E+ EE    L  K ++ +      E    +L+E   + ++
Sbjct: 9    ELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEE 68

Query: 1077 EAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQ--QAVKEIKKQQLVE 1134
            EA   + + + ++AE+++ T E+A+      +  A+      + +  Q   E  +Q+L+E
Sbjct: 69   EAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHERARQELLE 128

Query: 1135 LRSMANPPSVVKLALESICLLLGENATD 1162
              +    P  V   +    L   +N  +
Sbjct: 129  AAAAPTAPPHVAAPVNGEQLEPDDNGEE 156


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
            autophagy-related subunit 14.  The Atg14 or Apg14
            proteins are hydrophilic proteins with a predicted
            molecular mass of 40.5 kDa, and have a coiled-coil motif
            at the N terminus region. Yeast cells with mutant Atg14
            are defective not only in autophagy but also in sorting
            of carboxypeptidase Y (CPY), a vacuolar-soluble
            hydrolase, to the vacuole. Subcellular fractionation
            indicate that Apg14p and Apg6p are peripherally
            associated with a membrane structure(s). Apg14p was
            co-immunoprecipitated with Apg6p, suggesting that they
            form a stable protein complex. These results imply that
            Apg6/Vps30p has two distinct functions: in the autophagic
            process and in the vacuolar protein sorting pathway.
            Apg14p may be a component specifically required for the
            function of Apg6/Vps30p through the autophagic pathway.
            There are 17 auto-phagosomal component proteins which are
            categorized into six functional units, one of which is
            the AS-PI3K complex (Vps30/Atg6 and Atg14). The AS-PI3K
            complex and the Atg2-Atg18 complex are essential for
            nucleation, and the specific function of the AS-PI3K
            apparently is to produce phosphatidylinositol 3-phosphate
            (PtdIns(3)P) at the pre-autophagosomal structure (PAS).
            The localisation of this complex at the PAS is controlled
            by Atg14. Autophagy mediates the cellular response to
            nutrient deprivation, protein aggregation, and pathogen
            invasion in humans, and malfunction of autophagy has been
            implicated in multiple human diseases including cancer.
            This effect seems to be mediated through direct
            interaction of the human Atg14 with Beclin 1 in the human
            phosphatidylinositol 3-kinase class III complex.
          Length = 307

 Score = 37.3 bits (87), Expect = 0.048
 Identities = 40/220 (18%), Positives = 80/220 (36%), Gaps = 33/220 (15%)

Query: 958  CSL--VSTTPSHRDSVINACVYVHQT----LHKANARLSKRGSRTMAITPRHYLDFINHF 1011
            C +   S  P +        +Y  +     L   N  L ++    +              
Sbjct: 2    CPICHRSRRPFYCAHCARNSLYELRLDLARLLLENEELKQKVEEALEGATNEDGKLAADL 61

Query: 1012 VKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM 1071
            +KL   +  E   Q         +I++  E++E+ ++ +    + L  +    +      
Sbjct: 62   LKLEVARKKERLNQIRA------RISQLKEEIEQKRERIEELKRALAQRRSDLSSAS--- 112

Query: 1072 IKDQQEAEKRKVQSQDIQAEIEK-------QTVEIAQKRVFVMEDLAQVEPAVM------ 1118
                Q  ++R  Q + +Q EI++           +A+KR F+  +LA++ P         
Sbjct: 113  ---YQLEKRRASQLEKLQDEIKRTRSKLNALHSLLAEKRSFLCRELAKLFPLRRVIRGRK 169

Query: 1119 -DAQQAVKEIKKQQLVELRSM-ANPPSVVKLALESICLLL 1156
             D+      I    LV+LR + + PP V+  +L  +  LL
Sbjct: 170  GDSSSEPYTIGGVPLVDLRDLNSLPPEVISASLGYMAQLL 209


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
            Provisional.
          Length = 569

 Score = 37.9 bits (89), Expect = 0.049
 Identities = 24/123 (19%), Positives = 58/123 (47%), Gaps = 14/123 (11%)

Query: 1222 VKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYA 1281
            +    + +   +  + +E +  EL+ L     +N+ + E+ KDL  +L KS+ + +  + 
Sbjct: 107  INEIESLLDLIE--EDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSLLANRFSFG 164

Query: 1282 QLIAQATAIKTDLDNVQAKFYEYAQLIA-----QATAI----KTDLDNVQAKVERSMALL 1332
              + +   ++  L+N++ +F ++ +L       +A  I    + +L  ++  +E    LL
Sbjct: 165  PALDE---LEKQLENLEEEFSQFVELTESGDYVEAREILDQLEEELAALEQIMEEIPELL 221

Query: 1333 KSL 1335
            K L
Sbjct: 222  KEL 224



 Score = 32.9 bits (76), Expect = 1.6
 Identities = 21/113 (18%), Positives = 51/113 (45%), Gaps = 17/113 (15%)

Query: 1035 KIAETVEQVEEMQKSLAVKSQELQ-----SKNEAANLK-----LKEMIKDQQE-----AE 1079
             + + +E  +E  K L  +   ++     +++E  +++     L+ + K   E     AE
Sbjct: 314  TLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAE 373

Query: 1080 KRKVQS--QDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
            +    S  Q+   EI KQ  EI +++  + E L  +    ++A++ ++  + +
Sbjct: 374  QEIAYSELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNK 426


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 37.3 bits (87), Expect = 0.053
 Identities = 24/134 (17%), Positives = 63/134 (47%), Gaps = 19/134 (14%)

Query: 1009 NHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKL 1068
             H ++ YR K   ++E++           E  EQVE++++    +++E+  + E    ++
Sbjct: 19   THEIQKYRFKVLSIKEKE-------RLREEEEEQVEQLREEANDEAKEIIEEAEEDAFEI 71

Query: 1069 KEMIKDQ--QEAEKRKVQS----QDIQAEIEKQTVEIAQKRVFVMEDLAQ------VEPA 1116
             E  +++  +E  K+  ++    + +Q +IE++  E  ++   ++E+          E  
Sbjct: 72   VEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEGYEKG 131

Query: 1117 VMDAQQAVKEIKKQ 1130
              +  + V+E+ +Q
Sbjct: 132  REEGLEEVRELIEQ 145



 Score = 31.9 bits (73), Expect = 2.6
 Identities = 26/135 (19%), Positives = 53/135 (39%), Gaps = 18/135 (13%)

Query: 1020 SELEEQQLHLNVGL-GKIAETVEQVEEMQKSLAVKSQEL--QSKNEAANLKLKEMIKDQQ 1076
             E +E+ L         I +   Q+E  Q+    + + L  ++K E      ++  ++  
Sbjct: 77   EEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEGYEKGREEGL 136

Query: 1077 E------------AEKRKVQSQDIQAEIEKQTVEIA---QKRVFVMEDLAQVEPAVMDAQ 1121
            E             EK   + ++I    E++ VE+A    K+V         E A+   +
Sbjct: 137  EEVRELIEQLNKIIEKLIKKREEILESSEEEIVELALDIAKKVIKEISENSKEIALALVK 196

Query: 1122 QAVKEIKKQQLVELR 1136
            + +KE+K    + +R
Sbjct: 197  ELLKEVKDATDITIR 211


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
            includes outer membrane proteins such as OmpH among
            others. Skp (OmpH) has been characterized as a molecular
            chaperone that interacts with unfolded proteins as they
            emerge in the periplasm from the Sec translocation
            machinery.
          Length = 140

 Score = 35.6 bits (83), Expect = 0.055
 Identities = 19/102 (18%), Positives = 47/102 (46%), Gaps = 6/102 (5%)

Query: 1035 KIAETVEQ-VEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQS--QDIQAE 1091
               + +E+  ++ Q  L    +ELQ   E        + +  +E +++++Q   Q+ Q +
Sbjct: 18   AAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRK 77

Query: 1092 IEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLV 1133
             +K   ++ +++    E+L ++   +  A + V + K   LV
Sbjct: 78   QQKLQQDLQKRQ---QEELQKILDKINKAIKEVAKKKGYDLV 116


>gnl|CDD|221955 pfam13173, AAA_14, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily.
          Length = 127

 Score = 35.2 bits (82), Expect = 0.056
 Identities = 25/111 (22%), Positives = 36/111 (32%), Gaps = 24/111 (21%)

Query: 415 EHKPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYRKTPN 474
            +K +V+ GP   GKT  LL  L+ L    ++ +N        L L            P 
Sbjct: 1   NNKIIVITGPRQVGKTTLLLQFLKELLSENILYINLDDPRFLRLALLDLLEEFLKLLLPG 60

Query: 475 GVILSPIQLGKWLVLFCDEINLPDMDKYATQRV---ISFLRQLIEQRGFYR 522
                         LF DEI          Q+V      L++L +     R
Sbjct: 61  KKY-----------LFLDEI----------QKVPDWEDALKRLYDDGRNLR 90


>gnl|CDD|236694 PRK10436, PRK10436, hypothetical protein; Provisional.
          Length = 462

 Score = 37.2 bits (87), Expect = 0.079
 Identities = 13/18 (72%), Positives = 15/18 (83%)

Query: 423 GPPGSGKTMTLLSALRAL 440
           GP GSGKT+TL SAL+ L
Sbjct: 225 GPTGSGKTVTLYSALQTL 242


>gnl|CDD|180941 PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated.
          Length = 174

 Score = 35.7 bits (83), Expect = 0.081
 Identities = 30/106 (28%), Positives = 46/106 (43%), Gaps = 22/106 (20%)

Query: 1035 KIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQD----IQA 1090
             I + +++ EE  +  A    E Q K   A          QQEAE+ +  ++     I+A
Sbjct: 54   AILQALKEAEERLRQAAQALAEAQQKLAQA----------QQEAERIRADAKARAEAIRA 103

Query: 1091 EIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQA-VKEIKKQQLVEL 1135
            EIEKQ +E        M  L Q   A + A+Q  V    +++  EL
Sbjct: 104  EIEKQAIE-------DMARLKQTAAADLSAEQERVIAQLRREAAEL 142


>gnl|CDD|129465 TIGR00368, TIGR00368, Mg chelatase-related protein.  The N-terminal
           end matches very strongly a pfam Mg_chelatase domain
           [Unknown function, General].
          Length = 499

 Score = 37.1 bits (86), Expect = 0.085
 Identities = 40/154 (25%), Positives = 55/154 (35%), Gaps = 23/154 (14%)

Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYR--KTPN-- 474
           L+L GPPGSGKTM        LP +       ++     +         + R  ++P+  
Sbjct: 214 LLLFGPPGSGKTMLASRLQGILPPLTNEEAIETARIWSLVGKLIDRKQIKQRPFRSPHHS 273

Query: 475 ---------GVILSP--IQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLIEQRGFYRP 523
                    G I  P  I L    VLF DE  LP+  +     V+  LR+ IE       
Sbjct: 274 ASKPALVGGGPIPLPGEISLAHNGVLFLDE--LPEFKR----SVLDALREPIEDGSISIS 327

Query: 524 ADKQWVSL-ERIQCVGACNP-PTDPGRKPLSHRF 555
                +    R Q V A NP P        +H  
Sbjct: 328 RASAKIFYPARFQLVAAMNPCPCGHYGGKNTHCR 361


>gnl|CDD|221970 pfam13191, AAA_16, AAA ATPase domain.  This family of domains
           contain a P-loop motif that is characteristic of the AAA
           superfamily.
          Length = 154

 Score = 35.3 bits (81), Expect = 0.085
 Identities = 19/70 (27%), Positives = 27/70 (38%), Gaps = 5/70 (7%)

Query: 740 DEVLDHVLR-IDRIFRQPQGHLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGAD 798
           +E L+ +L  + R        +LL G SG GKT+L R    + GL V            +
Sbjct: 6   EEELERLLDALRRARSGGPPSVLLTGPSGTGKTSLLR--ELLEGLLV--AAGKCDQAERN 61

Query: 799 FDEDLRTVLR 808
                   LR
Sbjct: 62  PPYAFSQALR 71


>gnl|CDD|227900 COG5613, COG5613, Uncharacterized conserved protein [Function
            unknown].
          Length = 400

 Score = 36.9 bits (85), Expect = 0.086
 Identities = 19/107 (17%), Positives = 37/107 (34%), Gaps = 4/107 (3%)

Query: 1012 VKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM 1071
               Y  + +    QQ  +   L K+   VE  +++ K L      +    +      +++
Sbjct: 99   SYPYLSQVNLASPQQFEIE--LAKLLSLVESAQKVSKLLNSNEIYMDQNTKKMENNQQKI 156

Query: 1072 IKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVM 1118
             K  QE E +K +     ++I           V ++   A V  A  
Sbjct: 157  KK--QEEEAKKAEKSGQASKIFGWVSAFGSLIVALIMVAAGVGAAAG 201


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
            multi-subunit complex made up of two heavy chains and
            four light chains it is a fundamental contractile protein
            found in all eukaryote cell types. This family consists
            of the coiled-coil myosin heavy chain tail region. The
            coiled-coil is composed of the tail from two molecules of
            myosin. These can then assemble into the macromolecular
            thick filament. The coiled-coil region provides the
            structural backbone the thick filament.
          Length = 859

 Score = 37.3 bits (87), Expect = 0.090
 Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 11/89 (12%)

Query: 1016 REKCSELEEQQLHLNVGLGKIAETVE-------QVEEMQKSLAVKSQELQSKNEAANLKL 1068
              +  ELEE +  LN  + ++ E  E        +E+ +  L  + ++LQ + E AN   
Sbjct: 313  ALRAEELEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAA 372

Query: 1069 KEMIKDQQEAEK----RKVQSQDIQAEIE 1093
             E+ K Q+  +K     K +  ++QAE++
Sbjct: 373  SELEKKQKNFDKILAEWKRKVDELQAELD 401



 Score = 35.0 bits (81), Expect = 0.36
 Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 7/67 (10%)

Query: 1037 AETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQ--DIQAEIEK 1094
            A   E++EE++K L  K  EL+   EAAN K      D  E  K ++QS+  D+Q E+E+
Sbjct: 313  ALRAEELEELKKKLNQKISELEEAAEAANAKC-----DSLEKTKSRLQSELEDLQIELER 367

Query: 1095 QTVEIAQ 1101
                 ++
Sbjct: 368  ANAAASE 374



 Score = 32.7 bits (75), Expect = 2.0
 Identities = 54/308 (17%), Positives = 119/308 (38%), Gaps = 62/308 (20%)

Query: 1033 LGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM--IKDQQEAEKRKVQSQDIQA 1090
            L ++ + VE +    K+L  +  +L  +       + E+   + + EAEK ++Q+   +A
Sbjct: 421  LEELKDQVEALRRENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEA 480

Query: 1091 EIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSVVKLALE 1150
            E                  L   E  V+ AQ  + +I+ +  +E R              
Sbjct: 481  E----------------AALELEESKVLRAQVELSQIRSE--IERR-------------- 508

Query: 1151 SICLLLGENATDWKAIRAVVMRENFINSIVSNFNTE--MITDEVR--EKMHSRYLSNPDY 1206
                 L E   +++  R    R   I S+ +    E     +  R  +K+    ++  + 
Sbjct: 509  -----LAEKEEEFENTRKNHQRA--IESLQATLEAEAKGKAEASRLKKKLEGD-INELEI 560

Query: 1207 SYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLI 1266
            + + AN+A+                A+  K V+  + ++K L+ Q  E +   E+ ++ +
Sbjct: 561  ALDHANKAN----------------AEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQL 604

Query: 1267 TQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVE 1326
               E+   + + E  +L +     +      + +  E ++ + + TA  + L   + K+E
Sbjct: 605  AVAERRATALEAELEELRSALEQAERARKQAETELAEASERVNELTAQNSSLIAQKRKLE 664

Query: 1327 RSMALLKS 1334
              +A L+S
Sbjct: 665  GELAALQS 672


>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
            This model describes a conserved domain found in surface
            proteins of a number of Firmutes. Many members have LPXTG
            C-terminal anchoring motifs and a substantial number have
            the KxYKxGKxW putative sorting signal at the N-terminus.
            The tetracycline resistance plasmid pCF10 in Enterococcus
            faecalis promotes conjugal plasmid transfer in response
            to sex pheromones, but PgrA/Sec10 encoded by that
            plasmid, a member of this family, specifically inhibits
            the ability of cells to receive homologous plasmids. The
            phenomenon is called surface exclusion.
          Length = 356

 Score = 36.6 bits (85), Expect = 0.090
 Identities = 13/65 (20%), Positives = 25/65 (38%), Gaps = 3/65 (4%)

Query: 1262 TKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNV 1321
            T + +  L+ ++A+ + + A         +  L   Q     YA   A     + +L N 
Sbjct: 252  TPNSLKALQAALATAQADLAAAQTALNTAQAALTTAQT---AYAAAQAALATAQKELANA 308

Query: 1322 QAKVE 1326
            QA+  
Sbjct: 309  QAQAL 313


>gnl|CDD|237355 PRK13342, PRK13342, recombination factor protein RarA; Reviewed.
          Length = 413

 Score = 36.6 bits (86), Expect = 0.092
 Identities = 21/90 (23%), Positives = 35/90 (38%), Gaps = 15/90 (16%)

Query: 407 SLLYTWLAEHKP--LVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFD 464
             L   +   +   ++L GPPG+GKT TL   +    D    +L+  ++   + L +  +
Sbjct: 25  KPLRRMIEAGRLSSMILWGPPGTGKT-TLARIIAGATDAPFEALSAVTSGVKD-LREVIE 82

Query: 465 HYCEYRKTPNGVILSPIQLGKWLVLFCDEI 494
              + R      I           LF DEI
Sbjct: 83  EARQRRSAGRRTI-----------LFIDEI 101



 Score = 31.6 bits (73), Expect = 3.9
 Identities = 28/104 (26%), Positives = 41/104 (39%), Gaps = 33/104 (31%)

Query: 739 FDEVL--DHVLRIDRIFRQP--QGHL---LLIGVSGAGKTTLSRFVAFMNGLSVFQIRAH 791
            DEV+  +H+L   +  R+    G L   +L G  G GKTTL+R +A             
Sbjct: 11  LDEVVGQEHLLGPGKPLRRMIEAGRLSSMILWGPPGTGKTTLARIIA------------- 57

Query: 792 NKYTGADFDE---------DLRTVL---RRSGCKNEKIAFLLDE 823
              T A F+          DLR V+   R+      +    +DE
Sbjct: 58  -GATDAPFEALSAVTSGVKDLREVIEEARQRRSAGRRTILFIDE 100


>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.  This
            model represents the MFP (membrane fusion protein)
            component of the RND family of transporters. RND refers
            to Resistance, Nodulation, and cell Division. It is, in
            part, a subfamily of pfam00529 (Pfam release 7.5) but
            hits substantial numbers of proteins missed by that
            model. The related HlyD secretion protein, for which
            pfam00529 is named, is outside the scope of this model.
            Attributed functions imply outward transport. These
            functions include nodulation, acriflavin resistance,
            heavy metal efflux, and multidrug resistance proteins.
            Most members of this family are found in Gram-negative
            bacteria. The proposed function of MFP proteins is to
            bring the inner and outer membranes together and enable
            transport to the outside of the outer membrane. Note,
            however, that a few members of this family are found in
            Gram-positive bacteria, where there is no outer membrane
            [Transport and binding proteins, Unknown substrate].
          Length = 322

 Score = 36.5 bits (85), Expect = 0.092
 Identities = 20/97 (20%), Positives = 37/97 (38%), Gaps = 6/97 (6%)

Query: 1233 DMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKT 1292
              +KK + L   L   + Q +   A  +       QLE +  S+ +   +L+ +    + 
Sbjct: 46   QKVKKGQVL-ARLDDDDYQLALQAALAQ-LAAAEAQLELAQRSF-ERAERLVKRNAVSQA 102

Query: 1293 DLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVERSM 1329
            DLD+ +A         A   A K  L + Q  +  + 
Sbjct: 103  DLDDAKA---AVEAAQADLEAAKASLASAQLNLRYTE 136



 Score = 33.8 bits (78), Expect = 0.59
 Identities = 20/91 (21%), Positives = 41/91 (45%), Gaps = 6/91 (6%)

Query: 1245 LKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEY 1304
            +  + V+  +   KG     ++ +L+     Y+      +AQ  A +  L+  Q  F   
Sbjct: 37   ITKISVREGQKVKKG----QVLARLDD--DDYQLALQAALAQLAAAEAQLELAQRSFERA 90

Query: 1305 AQLIAQATAIKTDLDNVQAKVERSMALLKSL 1335
             +L+ +    + DLD+ +A VE + A L++ 
Sbjct: 91   ERLVKRNAVSQADLDDAKAAVEAAQADLEAA 121


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
            family is the highly conserved central region of a number
            of metazoan proteins referred to as growth-arrest
            proteins. In mouse, Gas8 is predominantly a testicular
            protein, whose expression is developmentally regulated
            during puberty and spermatogenesis. In humans, it is
            absent in infertile males who lack the ability to
            generate gametes. The localisation of Gas8 in the
            motility apparatus of post-meiotic gametocytes and mature
            spermatozoa, together with the detection of Gas8 also in
            cilia at the apical surfaces of epithelial cells lining
            the pulmonary bronchi and Fallopian tubes suggests that
            the Gas8 protein may have a role in the functioning of
            motile cellular appendages. Gas8 is a microtubule-binding
            protein localised to regions of dynein regulation in
            mammalian cells.
          Length = 201

 Score = 36.0 bits (84), Expect = 0.093
 Identities = 26/118 (22%), Positives = 48/118 (40%), Gaps = 18/118 (15%)

Query: 1235 LKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDL 1294
             ++VE LR +LK       ++K   +  K  + +LEK + + K E   L  +   ++ + 
Sbjct: 68   EEEVEELRKKLK----DYEKDKQSLKNLKARLKELEKELKNLKWESEVLEQRFEKVERER 123

Query: 1295 DNVQAKF----YEYAQLIAQATAIKTDLDNV--QAKVERSMALLKSLGIERERWEATS 1346
            D +  KF     +  Q        KT L N+  + K+E     L+    +     A +
Sbjct: 124  DELYDKFEAAIQDVQQ--------KTGLKNLLLEQKLEALNEELEKKEAQLNEVLAAA 173


>gnl|CDD|233230 TIGR01000, bacteriocin_acc, bacteriocin secretion accessory protein. 
            This family represents an accessory protein that works
            with the bacteriocin maturation and ABC transport
            secretion protein described by TIGR01193 [Transport and
            binding proteins, Other].
          Length = 457

 Score = 36.6 bits (85), Expect = 0.10
 Identities = 22/129 (17%), Positives = 46/129 (35%), Gaps = 5/129 (3%)

Query: 1230 SYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATA 1289
             +   L +VE L  E +    ++       E+TK  + Q          +Y Q +  A +
Sbjct: 141  LFNGYLAQVESLTSETQQQNDKSQTQNEAAEKTKAQLDQQISKTDQKLQDY-QALKNAIS 199

Query: 1290 IKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSETF 1349
              T + N       Y    AQ  +        Q K      +L ++  + ++ + +  ++
Sbjct: 200  NGTKVANFNPYQSLYENYQAQLKSASDKDQKNQVKS----TILATIQQQIDQLQKSIASY 255

Query: 1350 RSQMATIIG 1358
            + Q A +  
Sbjct: 256  QVQKAGLTK 264



 Score = 36.2 bits (84), Expect = 0.14
 Identities = 25/96 (26%), Positives = 43/96 (44%), Gaps = 4/96 (4%)

Query: 1242 RLELKSLEVQASENKAKGEETKDL---ITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQ 1298
            + +LKS   +  +N+ K      +   I QL+KSIASY+ + A L     +      N +
Sbjct: 218  QAQLKSASDKDQKNQVKSTILATIQQQIDQLQKSIASYQVQKAGLTKSTASNYASSQNSK 277

Query: 1299 AKFYEYAQLIAQATAIKTDLDNVQAKVERSMALLKS 1334
                +  Q +A+     TDL+    ++E  +  LK 
Sbjct: 278  LAQLK-EQQLAKVKQEITDLNQKLLELESKIKSLKE 312


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 36.3 bits (85), Expect = 0.13
 Identities = 31/132 (23%), Positives = 63/132 (47%), Gaps = 21/132 (15%)

Query: 1004 YLDFINHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEA 1063
                 N F K  RE+ +EL++ +           + + Q EE   +L  K + L+ + E 
Sbjct: 66   IHKLRNEFEKELRERRNELQKLE-----------KRLLQKEE---NLDRKLELLEKREEE 111

Query: 1064 ANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIA-----QKRVFVMEDLAQVEPAVM 1118
               K KE+ + QQE EK++ + +++  E  ++   I+     + +  ++E +   E A  
Sbjct: 112  LEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKV--EEEARH 169

Query: 1119 DAQQAVKEIKKQ 1130
            +A   +KEI+++
Sbjct: 170  EAAVLIKEIEEE 181


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA binding. 
            This is the middle region of a family of TATA element
            modulatory factor 1 proteins conserved in eukaryotes that
            contains at its N-terminal section a number of leucine
            zippers that could potentially form coiled coil
            structures. The whole proteins bind to the TATA element
            of some RNA polymerase II promoters and repress their
            activity. by competing with the binding of TATA binding
            protein. TMFs are evolutionarily conserved golgins that
            bind Rab6, a ubiquitous ras-like GTP-binding Golgi
            protein, and contribute to Golgi organisation in animal
            and plant cells.
          Length = 74

 Score = 33.1 bits (76), Expect = 0.13
 Identities = 15/69 (21%), Positives = 33/69 (47%), Gaps = 7/69 (10%)

Query: 1033 LGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEI 1092
              K+AE  EQ+ ++ +      ++L  K    N  +K++    +E EK   +  +++ ++
Sbjct: 4    EKKLAEKDEQIAQLMEEG----EKLSKKELKHNNTIKKLRAKNKELEK---EIAELKKKL 56

Query: 1093 EKQTVEIAQ 1101
            EK   E+  
Sbjct: 57   EKLEKELEN 65


>gnl|CDD|223703 COG0630, VirB11, Type IV secretory pathway, VirB11 components, and
           related ATPases involved in archaeal flagella
           biosynthesis [Cell motility and secretion /
           Intracellular trafficking and secretion].
          Length = 312

 Score = 36.2 bits (84), Expect = 0.13
 Identities = 16/43 (37%), Positives = 22/43 (51%), Gaps = 2/43 (4%)

Query: 401 DTVRHESLLYTWLA--EHKPLVLCGPPGSGKTMTLLSALRALP 441
            T+  E   Y WLA    K +++CG   SGKT  L + L  +P
Sbjct: 126 GTISPEQAAYLWLAIEARKSIIICGGTASGKTTLLNALLDFIP 168


>gnl|CDD|234659 PRK00137, rplI, 50S ribosomal protein L9; Reviewed.
          Length = 147

 Score = 34.7 bits (81), Expect = 0.14
 Identities = 11/42 (26%), Positives = 20/42 (47%), Gaps = 2/42 (4%)

Query: 1063 AANLKLKEMIKDQQEA--EKRKVQSQDIQAEIEKQTVEIAQK 1102
              NLK  E  + + EA   +   +++ +  ++E  TV I  K
Sbjct: 41   KGNLKQLEARRAELEAKAAEELAEAEALAEKLEGLTVTIKAK 82



 Score = 31.6 bits (73), Expect = 1.6
 Identities = 20/85 (23%), Positives = 36/85 (42%), Gaps = 16/85 (18%)

Query: 1032 GLGKIA--ETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRK------- 1082
            G    A    ++Q+E  +  L  K+ E     EA    L E ++      K K       
Sbjct: 34   GKAVRATKGNLKQLEARRAELEAKAAEE--LAEA--EALAEKLEGLTVTIKAKAGEDGKL 89

Query: 1083 ---VQSQDIQAEIEKQTVEIAQKRV 1104
               V ++DI   ++KQ +EI ++++
Sbjct: 90   FGSVTTKDIAEALKKQGIEIDKRKI 114


>gnl|CDD|131368 TIGR02315, ABC_phnC, phosphonate ABC transporter, ATP-binding
           protein.  Phosphonates are a class of
           phosphorus-containing organic compound with a stable
           direct C-P bond rather than a C-O-P linkage. A number of
           bacterial species have operons, typically about 14 genes
           in size, with genes for ATP-dependent transport of
           phosphonates, degradation, and regulation of the
           expression of the system. Members of this protein family
           are the ATP-binding cassette component of tripartite ABC
           transporters of phosphonates [Transport and binding
           proteins, Anions].
          Length = 243

 Score = 35.4 bits (82), Expect = 0.15
 Identities = 24/80 (30%), Positives = 33/80 (41%), Gaps = 4/80 (5%)

Query: 762 LIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLRRSGCKNEKIAFLL 821
           +IG SGAGK+TL R +  +   S   I            + LR + RR G   +    L+
Sbjct: 33  IIGPSGAGKSTLLRCINRLVEPSSGSILLEGTDITKLRGKKLRKLRRRIGMIFQHYN-LI 91

Query: 822 DESNVLE---SGFLERMNTL 838
           +   VLE    G L    T 
Sbjct: 92  ERLTVLENVLHGRLGYKPTW 111


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 36.1 bits (84), Expect = 0.16
 Identities = 21/46 (45%), Positives = 24/46 (52%), Gaps = 9/46 (19%)

Query: 411 TWLA--EHKPLVLCGPPGSGKTMTLLSALRALP-DM--EVVSLNFS 451
           +WL     K L+L GPPG GKT    S   AL  D   EV+ LN S
Sbjct: 32  SWLKGKPKKALLLYGPPGVGKT----SLAHALANDYGWEVIELNAS 73


>gnl|CDD|224401 COG1484, DnaC, DNA replication protein [DNA replication,
           recombination, and repair].
          Length = 254

 Score = 35.5 bits (82), Expect = 0.16
 Identities = 32/115 (27%), Positives = 44/115 (38%), Gaps = 21/115 (18%)

Query: 409 LYTWLAEHKPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSS--ATTPEL---LLKTF 463
           L  +    + LVL GPPG GKT  L  A+      E++    S    T P+L   L   F
Sbjct: 98  LVEFFERGENLVLLGPPGVGKT-HLAIAI----GNELLKAGISVLFITAPDLLSKLKAAF 152

Query: 464 DHYCEYRKTPNGVILSPIQLGKWLVLFCDEINLPDMDKYATQRVISFLRQLIEQR 518
           D            +L   +L K  +L  D+I      +         L QLI +R
Sbjct: 153 D-----EGRLEEKLLR--ELKKVDLLIIDDIGYEPFSQEEA----DLLFQLISRR 196


>gnl|CDD|148750 pfam07321, YscO, Type III secretion protein YscO.  This family
            contains the bacterial type III secretion protein YscO,
            which is approximately 150 residues long. YscO has been
            shown to be required for high-level expression and
            secretion of the anti-host proteins V antigen and Yops in
            Yersinia pestis.
          Length = 152

 Score = 34.3 bits (79), Expect = 0.17
 Identities = 20/95 (21%), Positives = 38/95 (40%), Gaps = 11/95 (11%)

Query: 1005 LDFINHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAA 1064
            L+     V L RE  + LE+Q       L +  + +E   +  +    + QE +   E  
Sbjct: 69   LEKWQQQVGLLRENEASLEQQ-------LAEAKQRLEAERQRLRQARQQLQEARKAQE-- 119

Query: 1065 NLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEI 1099
              K  E+ + +Q   + + Q  +   + E +TV  
Sbjct: 120  --KFAELARQEQAEAQAQRQYLEELEQEEFRTVSR 152


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
            proteins-interacting protein 1.  Members of this family
            of proteins act as negative regulators of G1 to S cell
            cycle phase progression by inhibiting cyclin-dependent
            kinases. Inhibitory effects are additive with GADD45
            proteins but occur also in the absence of GADD45
            proteins. Furthermore, they act as a repressor of the
            orphan nuclear receptor NR4A1 by inhibiting AB
            domain-mediated transcriptional activity.
          Length = 217

 Score = 35.2 bits (81), Expect = 0.18
 Identities = 35/144 (24%), Positives = 63/144 (43%), Gaps = 33/144 (22%)

Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEA------ANL-KL 1068
             EK  ELEE++           E    + +M +    + +E +++ +A       N+ K+
Sbjct: 84   AEKLEELEEEER----------EWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKM 133

Query: 1069 KEMIKDQQEAEKRKVQSQDIQAEIEKQT--VEIAQKRV-FVM--------EDLAQVEPAV 1117
             +MI D   A+KRK + Q  +A  E++   V  A++   + +        E L Q E   
Sbjct: 134  PQMIAD-WRAQKRK-REQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKE- 190

Query: 1118 MDAQQAVKEIKKQQLVELRSMANP 1141
               ++ VKE K+++  E R  A  
Sbjct: 191  --EKKKVKEAKRREKEEKRMAALV 212


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with
            the myosin-like domain [Function unknown].
          Length = 499

 Score = 35.8 bits (82), Expect = 0.19
 Identities = 25/160 (15%), Positives = 56/160 (35%), Gaps = 21/160 (13%)

Query: 1020 SELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAE 1079
            + L +Q   L   L  +AE   Q+E   +SL    ++LQ+       ++ ++     + E
Sbjct: 140  ARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIE 199

Query: 1080 K---------------------RKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVM 1118
            +                     R   +Q     I+++  +I+QK   +     Q+     
Sbjct: 200  QEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERER 259

Query: 1119 DAQQAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGE 1158
              Q+      + +    +  A   + V+L  ++     G+
Sbjct: 260  QLQRLETAQARLEQEVAQLEAYYQAYVRLRQQAAATQRGQ 299



 Score = 35.0 bits (80), Expect = 0.37
 Identities = 25/125 (20%), Positives = 54/125 (43%), Gaps = 5/125 (4%)

Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQ 1076
             +   L+  Q  L     ++   V  ++     +  ++Q L ++  AA  + +E+ +   
Sbjct: 165  AQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAA 224

Query: 1077 EAEKRKVQSQDIQAEIEKQTVEIAQKRVFVME---DLAQVEPAVMDAQQAVKEIKK--QQ 1131
             A++     Q   A+I ++  +IA +   + E    L ++E A    +Q V +++   Q 
Sbjct: 225  AAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQARLEQEVAQLEAYYQA 284

Query: 1132 LVELR 1136
             V LR
Sbjct: 285  YVRLR 289



 Score = 33.8 bits (77), Expect = 0.81
 Identities = 16/99 (16%), Positives = 39/99 (39%), Gaps = 6/99 (6%)

Query: 1038 ETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTV 1097
              V Q+++++  L     EL +          E    + E +K + + +     + ++  
Sbjct: 71   SGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQERE----AVRQELA 126

Query: 1098 EIAQKRVFVMEDLAQVEPAVMDAQQAVKEI--KKQQLVE 1134
               Q      ++LA++     D Q  +K +  +++QL  
Sbjct: 127  AARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEA 165



 Score = 30.8 bits (69), Expect = 6.9
 Identities = 30/153 (19%), Positives = 61/153 (39%), Gaps = 37/153 (24%)

Query: 1233 DMLKKVEPLRLELKSLEVQ-----------ASENKAKGEETKDLITQLE---KSIASYKD 1278
            D+  ++  LR EL + + +            SE +   +E + +  +L    +++A  + 
Sbjct: 78   DIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQ 137

Query: 1279 EYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKT-----------------DLDNV 1321
            E A+L  QA  ++T L  +     +  QL AQA +++                  DL   
Sbjct: 138  ELARLTKQAQDLQTRLKTLAE---QRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLR 194

Query: 1322 QAKVERSMALLKSLGIERERWEATSETFRSQMA 1354
             A++E+     ++L       +A +E    + A
Sbjct: 195  SAQIEQEA---QNLATRANAAQARTEELARRAA 224


>gnl|CDD|233914 TIGR02538, type_IV_pilB, type IV-A pilus assembly ATPase PilB.
           This model describes a protein of type IV pilus
           biogenesis designated PilB in Pseudomonas aeruginosa but
           PilF in Neisseria gonorrhoeae; the more common usage,
           reflected here, is PilB. This protein is an ATPase
           involved in protein export for pilin assembly and is
           closely related to GspE (TIGR02533) of type II
           secretion, also called the main terminal branch of the
           general secretion pathway. Note that type IV pilus
           systems are often divided into type IV-A and IV-B, with
           the latter group including bundle-forming pilus,
           mannose-sensitive hemagglutinin, etc. Members of this
           family are found in type IV-A systems [Cell envelope,
           Surface structures, Protein fate, Protein and peptide
           secretion and trafficking].
          Length = 564

 Score = 36.1 bits (84), Expect = 0.19
 Identities = 23/46 (50%), Positives = 28/46 (60%), Gaps = 6/46 (13%)

Query: 414 AEHKP--LVL-CGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTP 456
           A HKP  +VL  GP GSGKT++L +AL  L   EV   N S+A  P
Sbjct: 311 AIHKPQGMVLVTGPTGSGKTVSLYTALNILNTEEV---NISTAEDP 353


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
            protein is found in bacteria and eukaryotes. Proteins in
            this family are typically between 943 to 1234 amino acids
            in length. This family contains a P-loop motif suggesting
            it is a nucleotide binding protein. It may be involved in
            replication.
          Length = 1198

 Score = 36.2 bits (84), Expect = 0.19
 Identities = 62/301 (20%), Positives = 105/301 (34%), Gaps = 72/301 (23%)

Query: 1016 REKCSELEEQ--QLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIK 1073
              K  + EEQ  Q +  +   K AE   +    Q  L +  Q LQ++ ++   KL+  I 
Sbjct: 620  VAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDL--QRLQNEQQSLKDKLELAIA 677

Query: 1074 DQQEAEKRKVQSQDIQAE-IEKQ---------------TVEIAQKRVFVMEDL------- 1110
            ++++  + +++  D Q + + +Q                 E   K   V  +L       
Sbjct: 678  ERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQVVEGELDNQLAQL 737

Query: 1111 -AQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAV 1169
             A +E A   A+  +KE+KKQ   EL S+   P+ VK                       
Sbjct: 738  SAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVK--------------------ELK 777

Query: 1170 VMRENFINSIVSNFNTEMITDEVREK---MHSRYLSNPDYSYEKANRASMACGPMVKWAI 1226
               E    +I       +   EVRE    M   +L       E+ N              
Sbjct: 778  RQIEELETTIE---RIAVRRPEVREYRAFMQETWLHRDSLREERPNL------------- 821

Query: 1227 AQISYADMLKKVEPLRLELKSLEVQASENKAKGEE----TKDLITQLEKSIASYKDEYAQ 1282
              I   ++    E L+ EL  L       + K E+     +  + QL++ +   +DE  Q
Sbjct: 822  -AIQLRELESSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDEMRQ 880

Query: 1283 L 1283
            L
Sbjct: 881  L 881


>gnl|CDD|222000 pfam13238, AAA_18, AAA domain. 
          Length = 128

 Score = 34.0 bits (78), Expect = 0.20
 Identities = 13/48 (27%), Positives = 21/48 (43%)

Query: 761 LLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLR 808
           L+ G  G+GKTTL++ +A   G  +  +   N       +E      R
Sbjct: 2   LITGTPGSGKTTLAKELAERLGDVLRDLAKENGLVLELDEEITDESKR 49



 Score = 28.9 bits (65), Expect = 9.5
 Identities = 19/86 (22%), Positives = 32/86 (37%), Gaps = 13/86 (15%)

Query: 49  VGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEALYGVLDPNTREWTDGLFTH 108
            G  GSGK+T  K L + L            +      +  L   LD    + +  L   
Sbjct: 4   TGTPGSGKTTLAKELAERL----------GDVLRDLAKENGLVLELDEEITDESKRLDED 53

Query: 109 ILRRIIDNVRGEISKR---QWIIFDG 131
            L +++D +   I +    + +I DG
Sbjct: 54  KLAKLLDKLEKIIEELAEGENVIIDG 79


>gnl|CDD|236982 PRK11784, PRK11784, tRNA 2-selenouridine synthase; Provisional.
          Length = 345

 Score = 35.6 bits (83), Expect = 0.21
 Identities = 14/32 (43%), Positives = 16/32 (50%), Gaps = 1/32 (3%)

Query: 409 LYTWLAEHKPLVLCGPPGSGKTMTLLSALRAL 440
           L    A+   +VL G  GSGKT  LL AL   
Sbjct: 134 LEEAPAQFPLVVLGGNTGSGKT-ELLQALANA 164


>gnl|CDD|151277 pfam10828, DUF2570, Protein of unknown function (DUF2570).  This is a
            family of proteins with unknown function.
          Length = 110

 Score = 33.3 bits (76), Expect = 0.23
 Identities = 21/70 (30%), Positives = 30/70 (42%), Gaps = 10/70 (14%)

Query: 1062 EAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQ 1121
             A N    + I+ QQ+A +R         E E+Q VE  QK         ++E    +  
Sbjct: 31   RAENKAQAQTIEQQQKANQRLT----DALEQERQAVEEQQKIA------NEIERKAEENA 80

Query: 1122 QAVKEIKKQQ 1131
            + VK I KQQ
Sbjct: 81   EEVKTILKQQ 90


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 35.8 bits (83), Expect = 0.24
 Identities = 29/121 (23%), Positives = 63/121 (52%), Gaps = 6/121 (4%)

Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
            +E+  ELE++   L     K+ E + ++EE  + L  + +EL+ K +    +LKE  ++ 
Sbjct: 237  KEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELK-ELKEKAEEY 295

Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVEL 1135
             +  +   +  D   EIEK+   + ++   + E + ++E    + ++ ++E+KK +L EL
Sbjct: 296  IKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELE----EKEERLEELKK-KLKEL 350

Query: 1136 R 1136
             
Sbjct: 351  E 351



 Score = 33.9 bits (78), Expect = 0.87
 Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 3/85 (3%)

Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQ 1076
            EK   LEE +  L     ++ E  E+ E  +++ A K +  + K     L  +++ K+ +
Sbjct: 335  EKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELE 394

Query: 1077 EAEKRKVQSQDIQAEIEKQTVEIAQ 1101
            E EK K   ++I+ EI K T  I +
Sbjct: 395  ELEKAK---EEIEEEISKITARIGE 416



 Score = 31.6 bits (72), Expect = 4.8
 Identities = 57/317 (17%), Positives = 130/317 (41%), Gaps = 61/317 (19%)

Query: 1012 VKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANL---KL 1068
            +K   E+  +L E        L +I + + ++EE    +  + +EL+ K E       KL
Sbjct: 288  LKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKL 347

Query: 1069 KEMIKDQQEAEKRKVQSQDIQA------------------EIEKQTVEIAQKRVFVMEDL 1110
            KE+ K  +E E+R    ++ +A                  ++EK+  E+ + +  + E++
Sbjct: 348  KELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEI 407

Query: 1111 AQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVV 1170
            +++   + + ++ +KE+KK  + EL+             +  C + G   T+        
Sbjct: 408  SKITARIGELKKEIKELKK-AIEELKKA-----------KGKCPVCGRELTEEH------ 449

Query: 1171 MRENFINSI---VSNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIA 1227
             R+  +      +     E+   E +E+   + L   +   +K +               
Sbjct: 450  -RKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESEL------------- 495

Query: 1228 QISYADMLKKVEPLRLELKSLEVQASENKAKG-EETKDLITQLEKSIASYKDEY---AQL 1283
             I   ++ ++++ L  +LK   ++  E KA+  E+ K+ + +L+  I S K E     +L
Sbjct: 496  -IKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEEL 554

Query: 1284 IAQATAIKTDLDNVQAK 1300
              +   ++  LD ++ +
Sbjct: 555  KKKLAELEKKLDELEEE 571


>gnl|CDD|222005 pfam13245, AAA_19, Part of AAA domain. 
          Length = 73

 Score = 32.3 bits (74), Expect = 0.24
 Identities = 9/27 (33%), Positives = 14/27 (51%)

Query: 414 AEHKPLVLCGPPGSGKTMTLLSALRAL 440
           +     V+ G PG+GKT T  + +  L
Sbjct: 8   SGRSLFVVDGGPGTGKTATAAAIIARL 34


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
            [Function unknown].
          Length = 294

 Score = 35.0 bits (81), Expect = 0.26
 Identities = 54/333 (16%), Positives = 121/333 (36%), Gaps = 78/333 (23%)

Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQ 1076
                +L+E +L       ++ E +E+++E +  L  ++ EL  K +  N K++E+ +  Q
Sbjct: 3    AMLDKLDELELKRK----QLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQ 58

Query: 1077 E-AEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVEL 1135
            E  E+R         EI ++  E+ +KR  +   L               +  +++  EL
Sbjct: 59   ELREERD--------EINEEVQELKEKRDEINAKL---------------QELRKEYREL 95

Query: 1136 RSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVREK 1195
            +   N  ++   +++S+                    E  I  +     T ++T E   +
Sbjct: 96   KEKRNEFNLGGRSIKSL--------------------EREIERLEKKQQTSVLTPEEERE 135

Query: 1196 MHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKV-EPLRLELKSLEVQASE 1254
            +  +                          I ++       K       +LK L+ +  E
Sbjct: 136  LVQK--------------------------IKELRKELEDAKKALEENEKLKELKAEIDE 169

Query: 1255 NKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAI 1314
             K K  E  + I +L      Y +E  +L  +A  ++ + D +  +F E      +   +
Sbjct: 170  LKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVEL---SKKIDEL 226

Query: 1315 KTDLDNVQAKVERSMALLKSLGIERERWEATSE 1347
              +  N+Q ++      +K+L  + +  +   +
Sbjct: 227  HEEFRNLQNELRELEKKIKALRAKEKAAKRREK 259


>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
            partitioning [Cell division and chromosome partitioning].
          Length = 1480

 Score = 35.6 bits (82), Expect = 0.26
 Identities = 77/439 (17%), Positives = 157/439 (35%), Gaps = 103/439 (23%)

Query: 1014 LYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIK 1073
            LY  +     EQ  H+++     +  + ++   +  L     + Q+ ++  NL    + +
Sbjct: 295  LYTSRQQLAAEQYRHVDM-----SRELAELNGAEGDL---EADYQAASDHLNLVQTAL-R 345

Query: 1074 DQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPA----------VMDAQQA 1123
             Q++ E+ +   +++   +E+Q   + +      E+ A+ E A          + D QQA
Sbjct: 346  QQEKIERYQADLEELTIRLEEQNEVVEEANERQEENEARAEAAELEVDELKSQLADYQQA 405

Query: 1124 V-----KEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINS 1178
            +     + I+ QQ +     A         L  +  L  ++A +W         E     
Sbjct: 406  LDVQQTRAIQYQQAIAALERAK-------ELCHLPDLTADSAEEWLETFQAKEEE----- 453

Query: 1179 IVSNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVK---WAIAQ------I 1229
                   ++++ E +  M     S  + +Y+     +   G + +   W +A+       
Sbjct: 454  ----ATEKLLSLEQKMSMAQAAHSQFEQAYQLVVAIA---GELARSEAWDVARELLREGP 506

Query: 1230 SYADMLKKVEPLRLELKSLEVQASE-----------NKAKG-----EETKDLITQLEKSI 1273
                + ++V+PLR+ L  LE +  +            K +G     EE + L  +LE  I
Sbjct: 507  DQRHLAEQVQPLRMRLSELEQRLRQQQSAERLLADFCKRQGKNLDAEELEALHQELEALI 566

Query: 1274 ASYKDEYAQLIAQATAIKTDLDNVQAKFYEYA--------------QLIAQATAIKTDLD 1319
             S  D  +    Q  A++ + + +Q++                   QL  Q+    TD  
Sbjct: 567  ESLSDSVSNAREQRMALRQEQEQLQSRIQSLMQRAPVWLAAQNALEQLSEQSGEEFTDSQ 626

Query: 1320 NVQAKVERSMALLKSLGIERERWEATSETFRSQ-----------------MATIIGDVLL 1362
            +V   +++ +   +   +ER+   A       +                 +A   G VLL
Sbjct: 627  DVTEYMQQLLEREREATVERDELGARKNALDEEIERLSQPGGSEDQRLNALAERFGGVLL 686

Query: 1363 SSAY----LAYAGYFDQHY 1377
            S  Y    +  A YF   Y
Sbjct: 687  SEIYDDVTIEDAPYFSALY 705


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
            envelope biogenesis, outer membrane].
          Length = 387

 Score = 35.3 bits (81), Expect = 0.27
 Identities = 17/86 (19%), Positives = 44/86 (51%), Gaps = 1/86 (1%)

Query: 1044 EEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIA-QK 1102
            ++ +K     ++EL+ K  A   +LK++ K++ +A++++ Q+++ + + + +  +   Q 
Sbjct: 79   QQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQA 138

Query: 1103 RVFVMEDLAQVEPAVMDAQQAVKEIK 1128
            R    E   + E A   A     ++K
Sbjct: 139  RKAAAEQKKKAEAAKAKAAAEAAKLK 164



 Score = 33.4 bits (76), Expect = 1.1
 Identities = 25/119 (21%), Positives = 47/119 (39%), Gaps = 1/119 (0%)

Query: 1013 KLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMI 1072
            +L +E+    E+Q+           E  +Q EE  +  A + ++     +A        +
Sbjct: 105  QLEKERLKAQEQQKQAEEAEKQAQLEQKQQ-EEQARKAAAEQKKKAEAAKAKAAAEAAKL 163

Query: 1073 KDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQ 1131
            K   EA+K+  ++     E + +    A K+    E  A  E A  +A+   K  KK +
Sbjct: 164  KAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAE 222



 Score = 32.2 bits (73), Expect = 2.4
 Identities = 18/62 (29%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 1042 QVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQ 1101
              EE++   A + QE   + E   LK +E  K  +EAEK+    Q  Q E  ++     +
Sbjct: 88   VAEELKPKQAAE-QERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQK 146

Query: 1102 KR 1103
            K+
Sbjct: 147  KK 148



 Score = 30.3 bits (68), Expect = 8.1
 Identities = 20/102 (19%), Positives = 41/102 (40%), Gaps = 1/102 (0%)

Query: 1034 GKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIE 1093
            G + +   +++  Q S     Q+ + K E    +LK     +QE  K+  + +    E +
Sbjct: 58   GAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQ 117

Query: 1094 KQTVEIA-QKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVE 1134
            KQ  E   Q ++   +   Q   A  + ++  +  K +   E
Sbjct: 118  KQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAE 159



 Score = 30.3 bits (68), Expect = 9.1
 Identities = 23/103 (22%), Positives = 42/103 (40%), Gaps = 10/103 (9%)

Query: 1038 ETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIE--KQ 1095
            E ++Q+E+ +    +K+QE Q + E A  + +   K Q+E  ++    Q  +AE    K 
Sbjct: 101  ERLKQLEKER----LKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKA 156

Query: 1096 TVEIAQKRVFV----MEDLAQVEPAVMDAQQAVKEIKKQQLVE 1134
              E A+ +         + A        A+      KK+   E
Sbjct: 157  AAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAE 199


>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b.  This model
            describes the F1/F0 ATP synthase b subunit in bacteria
            only. Scoring just below the trusted cutoff are the
            N-terminal domains of Mycobacterial b/delta fusion
            proteins and a subunit from an archaeon, Methanosarcina
            barkeri, in which the ATP synthase homolog differs in
            architecture and is not experimentally confirmed. This
            model helps resolve b from the related b' subunit. Within
            the family is an example from a sodium-translocating
            rather than proton-translocating ATP synthase [Energy
            metabolism, ATP-proton motive force interconversion].
          Length = 147

 Score = 33.5 bits (77), Expect = 0.30
 Identities = 24/96 (25%), Positives = 45/96 (46%), Gaps = 11/96 (11%)

Query: 1035 KIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEK 1094
            KIA+ +   E  +K  A+  ++ Q   + A  + +E+I   + A KR  +   I  E + 
Sbjct: 30   KIADGLASAERAKKEAALAQKKAQVILKEAKDEAQEII---ENANKRGSE---ILEEAKA 83

Query: 1095 QTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
            +  E  +K    ++  A+ E       QA +E++KQ
Sbjct: 84   EAREEREK----IKAQARAEIEAEKE-QAREELRKQ 114


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
            consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 35.0 bits (80), Expect = 0.32
 Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 5/77 (6%)

Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIA 1100
            +QV E QK    K+Q    KN+   LK K+      +A   K +S+  + E E + +E  
Sbjct: 273  KQVAENQKREIEKAQIEIKKNDEEALKAKD-----HKAFDLKQESKASEKEAEDKELEAQ 327

Query: 1101 QKRVFVMEDLAQVEPAV 1117
            +KR  V EDL + +P V
Sbjct: 328  KKREPVAEDLQKTKPQV 344



 Score = 35.0 bits (80), Expect = 0.34
 Identities = 19/89 (21%), Positives = 40/89 (44%), Gaps = 9/89 (10%)

Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIA 1100
            E++++ Q       Q+     + A+ +  E+ + QQEA+    +  D  +  E + V   
Sbjct: 220  EELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLP-KPADTSSPKEDKQVAEN 278

Query: 1101 QKRVFVMEDLAQVEPAVMDAQQAVKEIKK 1129
            QKR        ++E A ++ ++  +E  K
Sbjct: 279  QKR--------EIEKAQIEIKKNDEEALK 299



 Score = 31.9 bits (72), Expect = 3.2
 Identities = 20/85 (23%), Positives = 38/85 (44%), Gaps = 5/85 (5%)

Query: 1052 VKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQ--AEIEKQTVEIAQKRVFVMED 1109
            +K +E Q   + A    +E+ K Q +A+K + ++   Q  A+ ++  V   Q+    +  
Sbjct: 203  LKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPK 262

Query: 1110 LAQVEPAVMD---AQQAVKEIKKQQ 1131
             A       D   A+   +EI+K Q
Sbjct: 263  PADTSSPKEDKQVAENQKREIEKAQ 287


>gnl|CDD|222165 pfam13481, AAA_25, AAA domain.  This AAA domain is found in a wide
           variety of presumed DNA repair proteins.
          Length = 154

 Score = 33.4 bits (77), Expect = 0.37
 Identities = 14/60 (23%), Positives = 20/60 (33%), Gaps = 5/60 (8%)

Query: 381 PQIEVETQKVAASDVVVPTLDTVRHESLLYTWLAEHKPLVLCGPPGSGKTMTLLSALRAL 440
              E+        D   P L       L+   L      +L G PG+GK+   L    A+
Sbjct: 3   EPSELLPSSAEDLDAPPPPLRW-----LVKGLLPRGGLTLLAGAPGTGKSTLALDLAAAV 57


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found in
            cell division proteins which are required for
            kinetochore-spindle association.
          Length = 312

 Score = 34.6 bits (80), Expect = 0.38
 Identities = 18/129 (13%), Positives = 54/129 (41%), Gaps = 12/129 (9%)

Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQEL-QSKNEAANL------KLK 1069
            E    L+E    L   L  +     ++ + + +L  + ++L Q ++E  +       + K
Sbjct: 151  ENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAK 210

Query: 1070 EMIKDQQEAEKRKVQSQ-DIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIK 1128
            E +K   +    KV+   +++ E+++   +I           +++   + +A++ +++ +
Sbjct: 211  EKLKKLLQEIMIKVKKLEELEEELQELESKIEDLT----NKKSELNTEIAEAEKKLEQCR 266

Query: 1129 KQQLVELRS 1137
                 E+  
Sbjct: 267  GFTFKEIEK 275



 Score = 30.4 bits (69), Expect = 8.1
 Identities = 12/73 (16%), Positives = 32/73 (43%), Gaps = 8/73 (10%)

Query: 1262 TKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKF----YEYAQLIAQATAIK-- 1315
             + L   L++++   K++Y  L+ +   + +    ++ +      E  QL      ++  
Sbjct: 142  LEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDC 201

Query: 1316 --TDLDNVQAKVE 1326
              T+LD  + K++
Sbjct: 202  DPTELDRAKEKLK 214


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 35.1 bits (82), Expect = 0.38
 Identities = 21/96 (21%), Positives = 42/96 (43%), Gaps = 7/96 (7%)

Query: 1036 IAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQ 1095
            + E    ++E+   L  K  EL  + EA   +LKE+ K+ ++ + +   +     ++  Q
Sbjct: 696  LNEQEALLKELAALLKAKPSELPERVEALLEELKELEKELEQLKAKLAAAA--AGDLLAQ 753

Query: 1096 TVEIAQKRVFVMEDLAQVEPAVMDA-QQAVKEIKKQ 1130
              E+   +V      AQVE     A +    ++K +
Sbjct: 754  AKEVNGVKVLA----AQVEGVDAKALRTLADDLKDK 785


>gnl|CDD|224046 COG1121, ZnuC, ABC-type Mn/Zn transport systems, ATPase component
          [Inorganic ion transport and metabolism].
          Length = 254

 Score = 34.5 bits (80), Expect = 0.39
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 49 VGPSGSGKSTAWKVLLKALERYEG 72
          +GP+G+GKST  K +L  L+   G
Sbjct: 36 IGPNGAGKSTLLKAILGLLKPSSG 59


>gnl|CDD|221803 pfam12846, AAA_10, AAA-like domain.  This family of domains
          contain a P-loop motif that is characteristic of the
          AAA superfamily. Many of the proteins in this family
          are conjugative transfer proteins.
          Length = 316

 Score = 34.3 bits (79), Expect = 0.44
 Identities = 19/41 (46%), Positives = 25/41 (60%), Gaps = 3/41 (7%)

Query: 43 NHGLMMVGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPK 83
          N  +++VGPSGSGKST  K+L   L    G   +  +IDPK
Sbjct: 1  NPNMLIVGPSGSGKSTLLKLLALRLLAR-GGRVI--VIDPK 38


>gnl|CDD|213223 cd03256, ABC_PhnC_transporter, ATP-binding cassette domain of the
           binding protein-dependent phosphonate transport system. 
           Phosphonates are a class of organophosphorus compounds
           characterized by a chemically stable
           carbon-to-phosphorus (C-P) bond. Phosphonates are
           widespread among naturally occurring compounds in all
           kingdoms of wildlife, but only prokaryotic
           microorganisms are able to cleave this bond. Certain
           bacteria such as E. coli can use alkylphosphonates as a
           phosphorus source. ABC transporters are a subset of
           nucleotide hydrolases that contain a signature motif,
           Q-loop, and H-loop/switch region, in addition to, the
           Walker A motif/P-loop and Walker B motif commonly found
           in a number of ATP- and GTP-binding and hydrolyzing
           proteins.
          Length = 241

 Score = 34.1 bits (79), Expect = 0.45
 Identities = 31/94 (32%), Positives = 45/94 (47%), Gaps = 15/94 (15%)

Query: 762 LIGVSGAGKTTL----SRFVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLRRSGCKNEKI 817
           LIG SGAGK+TL    +  V   +G  +      NK  G      LR + R+ G   ++ 
Sbjct: 32  LIGPSGAGKSTLLRCLNGLVEPTSGSVLIDGTDINKLKGK----ALRQLRRQIGMIFQQF 87

Query: 818 AFLLDESNVLE---SGFLERMNTLLANGEIPGLF 848
             L++  +VLE   SG L R +T  +   + GLF
Sbjct: 88  N-LIERLSVLENVLSGRLGRRSTWRS---LFGLF 117



 Score = 31.4 bits (72), Expect = 2.8
 Identities = 16/44 (36%), Positives = 22/44 (50%), Gaps = 7/44 (15%)

Query: 41 NLNHGLM--MVGPSGSGKSTAWKVLLKALERY-EGVEGVAHIID 81
          ++N G    ++GPSG+GKST    LL+ L    E   G   I  
Sbjct: 23 SINPGEFVALIGPSGAGKST----LLRCLNGLVEPTSGSVLIDG 62


>gnl|CDD|238550 cd01130, VirB11-like_ATPase, Type IV secretory pathway component
           VirB11, and related ATPases. The homohexamer, VirB11 is
           one of eleven Vir proteins, which are required for
           T-pilus biogenesis and virulence in the transfer of
           T-DNA from the Ti (tumor-inducing) plasmid of bacterial
           to plant cells. The pilus is a fibrous cell surface
           organelle, which mediates adhesion between bacteria
           during conjugative transfer or between bacteria and host
           eukaryotic cells during infection. VirB11- related
           ATPases include the archaeal flagella biosynthesis
           protein and the pilus assembly proteins CpaF/TadA and
           TrbB.  This alignment contains the C-terminal domain,
           which is the ATPase.
          Length = 186

 Score = 33.8 bits (78), Expect = 0.45
 Identities = 23/55 (41%), Positives = 31/55 (56%), Gaps = 9/55 (16%)

Query: 410 YTWLA--EHKPLVLCGPPGSGKTMTLLSALRAL--PDMEVVSLNFSSATTPELLL 460
           Y WLA    K +++ G  GSGKT TLL+AL A   PD  ++++      T EL L
Sbjct: 17  YLWLAVEARKNILISGGTGSGKT-TLLNALLAFIPPDERIITIE----DTAELQL 66


>gnl|CDD|236686 PRK10416, PRK10416, signal recognition particle-docking protein
           FtsY; Provisional.
          Length = 318

 Score = 34.3 bits (80), Expect = 0.46
 Identities = 15/64 (23%), Positives = 36/64 (56%), Gaps = 4/64 (6%)

Query: 713 TTELREYVQARLKVF-YEEELDVQLVLFDEVLDHVLRIDRIFRQPQGHL---LLIGVSGA 768
           T E+ E ++ R+K    ++  +++ +L +E+ + +  +++     +      L++GV+G 
Sbjct: 66  TEEIIEELRERVKRKNLKDPEELKELLKEELAEILEPVEKPLNIEEKKPFVILVVGVNGV 125

Query: 769 GKTT 772
           GKTT
Sbjct: 126 GKTT 129


>gnl|CDD|213788 TIGR03263, guanyl_kin, guanylate kinase.  Members of this family
           are the enzyme guanylate kinase, also called GMP kinase.
           This enzyme transfers a phosphate from ATP to GMP,
           yielding ADP and GDP [Purines, pyrimidines, nucleosides,
           and nucleotides, Nucleotide and nucleoside
           interconversions].
          Length = 179

 Score = 33.6 bits (78), Expect = 0.46
 Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 6/40 (15%)

Query: 417 KPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFS-SATT 455
             +V+ GP G+GK+ TL+ AL      E  +L FS SATT
Sbjct: 1   LLIVISGPSGAGKS-TLVKALLE----EDPNLKFSISATT 35


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This family
            consists of several eukaryotic proteins of unknown
            function.
          Length = 536

 Score = 34.7 bits (80), Expect = 0.47
 Identities = 22/144 (15%), Positives = 61/144 (42%), Gaps = 12/144 (8%)

Query: 1036 IAETVEQVEEMQKSLAVKSQELQSKNEAANLKLK-EMIKDQQEAEKRKVQSQDIQ---AE 1091
            +          + S   ++++++    AA      EM  ++   ++R+ + + +Q    E
Sbjct: 222  LLSKFRLDSWDKGSRFTRTEKVEQAATAAAEVPPAEMDTEEDRTKEREAELEALQEQIDE 281

Query: 1092 IEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRS---MANPPSVVKLA 1148
            +E    E+  +   +   + QV   +   +Q  +E++++  ++ R+   + +  + V   
Sbjct: 282  LESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENNVA-K 340

Query: 1149 LESICLLLGEN----ATDWKAIRA 1168
            L+++ +   E     A  W+A R 
Sbjct: 341  LQALVVASSERLLELAQQWEAHRT 364


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
            This domain is found associated with the Pfam tRNA
            synthetase class II domain (pfam00587) and represents the
            N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 32.2 bits (74), Expect = 0.48
 Identities = 13/71 (18%), Positives = 39/71 (54%), Gaps = 3/71 (4%)

Query: 1034 GKIAETVEQVEEM---QKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQA 1090
            G     V+++ E+   ++ L V+ +ELQ++    + ++ +  K +++AE    + ++++ 
Sbjct: 22   GVDVLDVDELLELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAEALIAEVKELKD 81

Query: 1091 EIEKQTVEIAQ 1101
            E++    E+ +
Sbjct: 82   ELKALEAELRE 92


>gnl|CDD|223637 COG0563, Adk, Adenylate kinase and related kinases [Nucleotide
           transport and metabolism].
          Length = 178

 Score = 33.4 bits (77), Expect = 0.51
 Identities = 9/19 (47%), Positives = 16/19 (84%)

Query: 760 LLLIGVSGAGKTTLSRFVA 778
           +L++G  GAGK+TL++ +A
Sbjct: 3   ILILGPPGAGKSTLAKKLA 21


>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family.  This family includes PspA a
            protein that suppresses sigma54-dependent transcription.
            The PspA protein, a negative regulator of the Escherichia
            coli phage shock psp operon, is produced when virulence
            factors are exported through secretins in many
            Gram-negative pathogenic bacteria and its homologue in
            plants, VIPP1, plays a critical role in thylakoid
            biogenesis, essential for photosynthesis. Activation of
            transcription by the enhancer-dependent bacterial
            sigma(54) containing RNA polymerase occurs through ATP
            hydrolysis-driven protein conformational changes enabled
            by activator proteins that belong to the large AAA(+)
            mechanochemical protein family. It has been shown that
            PspA directly and specifically acts upon and binds to the
            AAA(+) domain of the PspF transcription activator.
          Length = 220

 Score = 33.9 bits (78), Expect = 0.51
 Identities = 25/112 (22%), Positives = 47/112 (41%), Gaps = 10/112 (8%)

Query: 1236 KKVEPLRLELKSLEVQASENKAKGEE--TKDLITQ---LEKSIASYKDEYAQLIAQATAI 1290
            +K+E  + + K LE +A     KG E   ++ + +   LEK   + + +  Q  +    +
Sbjct: 58   RKLEEQKEQAKKLENKARAALTKGNEELAREALAEIATLEKQAEALETQLTQQRSAVEQL 117

Query: 1291 KTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQA--KVERSMALLKSLGIERE 1340
            +  L  ++ K     QL A+ TA+K  L   +A   V  S+    +      
Sbjct: 118  RKQLAALETK---IQQLKAKKTALKARLKAAKAQEAVNTSLGSASTESATDS 166


>gnl|CDD|206350 pfam14182, YgaB, YgaB-like protein.  The YgaB-like protein family
            includes the B. subtilis YgaB protein, which is
            functionally uncharacterized. This family of proteins is
            found in bacteria. Proteins in this family are
            approximately 90 amino acids in length.
          Length = 79

 Score = 31.5 bits (72), Expect = 0.52
 Identities = 22/70 (31%), Positives = 35/70 (50%), Gaps = 14/70 (20%)

Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQS---------QDIQAE 1091
            EQ++ M K L      LQS+ E      KE+++ +++AE   +Q          ++IQ  
Sbjct: 8    EQMKTMDKLL-----FLQSELERCQEIEKELLELEKKAELESIQQEISRMRKELKEIQEL 62

Query: 1092 IEKQTVEIAQ 1101
             EKQT E+ Q
Sbjct: 63   FEKQTEEVIQ 72


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
          family of uncharacterized proteins found by clustering
          human gut metagenomic sequences.
          Length = 379

 Score = 34.2 bits (79), Expect = 0.53
 Identities = 11/26 (42%), Positives = 15/26 (57%)

Query: 46 LMMVGPSGSGKSTAWKVLLKALERYE 71
          L++V  SG  KS   K+ +K LE  E
Sbjct: 42 LLVVAESGERKSAVDKLAMKPLEEIE 67


>gnl|CDD|177744 PLN00135, PLN00135, malate dehydrogenase.
          Length = 309

 Score = 34.0 bits (78), Expect = 0.53
 Identities = 27/94 (28%), Positives = 46/94 (48%), Gaps = 20/94 (21%)

Query: 235 DVLSPALTLQQDVASILSTHFAPD-------------GLVVRALDYAMQQEHIMDFTRL- 280
           DV+S  +++ +  AS L  H APD              L+++    ++ +++I   TRL 
Sbjct: 78  DVMSKNVSIYKSQASALEKHAAPDCKVLVVANPANTNALILKEFAPSIPEKNITCLTRLD 137

Query: 281 --RALGSLFSMLN---QGVRNVLQY-NHSHSDFP 308
             RALG +   L      V+NV+ + NHS + +P
Sbjct: 138 HNRALGQISERLGVPVSDVKNVIIWGNHSSTQYP 171


>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
          Length = 156

 Score = 32.8 bits (76), Expect = 0.59
 Identities = 29/108 (26%), Positives = 50/108 (46%), Gaps = 24/108 (22%)

Query: 1035 KIAETVEQVEEMQKSLA-----VKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQS-QDI 1088
            KIA+ +   E  +K L       ++Q  +++ EAA     E+I   ++A+KR  Q  ++ 
Sbjct: 39   KIADGLAAAERAKKELELAQAKYEAQLAEARAEAA-----EII---EQAKKRAAQIIEEA 90

Query: 1089 QAE--------IEKQTVEIAQKRVFVMEDL-AQVEP-AVMDAQQAVKE 1126
            +AE          +   EI Q+R    E+L  QV   AV  A++ +  
Sbjct: 91   KAEAEAEAARIKAQAQAEIEQERKRAREELRKQVADLAVAGAEKILGR 138


>gnl|CDD|234033 TIGR02857, CydD, thiol reductant ABC exporter, CydD subunit.  The
           gene pair cydCD encodes an ABC-family transporter in
           which each gene contains an N-terminal membrane-spanning
           domain (pfam00664) and a C-terminal ATP-binding domain
           (pfam00005). In E. coli these genes were discovered as
           mutants which caused the terminal heme-copper oxidase
           complex cytochrome bd to fail to assemble. Recent work
           has shown that the transporter is involved in export of
           redox-active thiol compounds such as cysteine and
           glutathione. The linkage to assembly of the cytochrome
           bd complex is further supported by the conserved operon
           structure found outside the gammaproteobacteria
           (cydABCD) containing both the transporter and oxidase
           genes components. The genes used as the seed members for
           this model are all either found in the
           gammproteobacterial context or the CydABCD context. All
           members of this family scoring above trusted at the time
           of its creation were from genomes which encode a
           cytochrome bd complex. Unfortunately, the gene symbol
           nomenclature adopted based on this operon in B. subtilis
           assigns cydC to the third gene in the operon where this
           gene is actually homologous to the E. coli cydD gene. We
           have chosen to name all homologs in this family in
           accordance with the precedence of publication of the E.
           coli name, CydD.
          Length = 529

 Score = 34.2 bits (79), Expect = 0.59
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 49  VGPSGSGKSTAWKVLLKALERYEGV 73
           VGPSG+GKST   +LL  ++  EG 
Sbjct: 354 VGPSGAGKSTLLNLLLGFVDPTEGS 378



 Score = 30.3 bits (69), Expect = 9.8
 Identities = 10/19 (52%), Positives = 13/19 (68%)

Query: 755 QPQGHLLLIGVSGAGKTTL 773
            P   + L+G SGAGK+TL
Sbjct: 346 PPGERVALVGPSGAGKSTL 364


>gnl|CDD|217429 pfam03215, Rad17, Rad17 cell cycle checkpoint protein. 
          Length = 490

 Score = 34.2 bits (78), Expect = 0.66
 Identities = 24/62 (38%), Positives = 32/62 (51%), Gaps = 3/62 (4%)

Query: 6  EKIKEVCREEFLVCGEGNEEGGPWMEKVLQLYQISNLNHGLMMVGPSGSGKSTAWKVLLK 65
          EK K   R+E  +  +   E   W++ V   +  SN    L++ GPSG GKST  KVL K
Sbjct: 11 EKYKPRRRDELAIHKKKIAEVDHWLKAV---FLESNKQLILLLTGPSGCGKSTTVKVLSK 67

Query: 66 AL 67
           L
Sbjct: 68 EL 69



 Score = 30.4 bits (68), Expect = 9.0
 Identities = 18/71 (25%), Positives = 30/71 (42%), Gaps = 14/71 (19%)

Query: 368 IKNGEWVPWS-----NKVPQIEVETQKVAASDVVVPTLDTVRHESLLYTWLAEHKPLVLC 422
           +K  E   W+      +  ++ +  +K+A    V   L  V  ES       +   L+L 
Sbjct: 1   MKADEHELWTEKYKPRRRDELAIHKKKIAE---VDHWLKAVFLES------NKQLILLLT 51

Query: 423 GPPGSGKTMTL 433
           GP G GK+ T+
Sbjct: 52  GPSGCGKSTTV 62


>gnl|CDD|225165 COG2256, MGS1, ATPase related to the helicase subunit of the
           Holliday junction resolvase [DNA replication,
           recombination, and repair].
          Length = 436

 Score = 33.7 bits (78), Expect = 0.74
 Identities = 22/76 (28%), Positives = 36/76 (47%), Gaps = 13/76 (17%)

Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYRKTPNGVIL 478
           ++L GPPG+GKT TL   +    +    +L+  ++   + L +  +   E RK       
Sbjct: 51  MILWGPPGTGKT-TLARLIAGTTNAAFEALSAVTSGVKD-LREIIE---EARKN------ 99

Query: 479 SPIQLGKWLVLFCDEI 494
               LG+  +LF DEI
Sbjct: 100 --RLLGRRTILFLDEI 113


>gnl|CDD|106979 PHA00671, PHA00671, hypothetical protein.
          Length = 135

 Score = 32.3 bits (72), Expect = 0.75
 Identities = 28/115 (24%), Positives = 55/115 (47%), Gaps = 10/115 (8%)

Query: 1054 SQELQSKNEAANLKLKEMIKDQ--QEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLA 1111
            S  + +K E A    ++  + Q  Q+A++ K +S  +  E +KQT E  Q++  +M+   
Sbjct: 19   SMYMSNKQEKAQRSAQQQAQQQAQQQADQAKAESDRLTQEYQKQT-EAYQQQAKIMQQ-- 75

Query: 1112 QVEPAVMDAQQAVKEIKKQQLVELRSMANPPSVVKLALESICLLLGENATDWKAI 1166
                 ++D+QQ+     ++Q    R+++   S  K   +S  +L G    D  A+
Sbjct: 76   ----QLLDSQQSFNRANQKQPNSQRALSAVGSAAKSG-QSGTMLTGPQGVDQSAL 125


>gnl|CDD|129721 TIGR00635, ruvB, Holliday junction DNA helicase, RuvB subunit.  All
           proteins in this family for which functions are known
           are 5'-3' DNA helicases that, as part of a complex with
           RuvA homologs serve as a 5'-3' Holliday junction
           helicase. RuvA specifically binds Holliday junctions as
           a sandwich of two tetramers and maintains the
           configuration of the junction. It forms a complex with
           two hexameric rings of RuvB, the subunit that contains
           helicase activity. The complex drives ATP-dependent
           branch migration of the Holliday junction recombination
           intermediate. The endonuclease RuvC resolves junctions
           [DNA metabolism, DNA replication, recombination, and
           repair].
          Length = 305

 Score = 33.4 bits (77), Expect = 0.76
 Identities = 14/39 (35%), Positives = 21/39 (53%)

Query: 741 EVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAF 779
           E L   +   ++ ++   HLLL G  G GKTTL+  +A 
Sbjct: 14  EQLQLFIEAAKMRQEALDHLLLYGPPGLGKTTLAHIIAN 52


>gnl|CDD|213190 cd03223, ABCD_peroxisomal_ALDP, ATP-binding cassette domain of
           peroxisomal transporter, subfamily D.  Peroxisomal
           ATP-binding cassette transporter (Pat) is involved in
           the import of very long-chain fatty acids (VLCFA) into
           the peroxisome. The peroxisomal membrane forms a
           permeability barrier for a wide variety of metabolites
           required for and formed during fatty acid
           beta-oxidation. To communicate with the cytoplasm and
           mitochondria, peroxisomes need dedicated proteins to
           transport such hydrophilic molecules across their
           membranes. X-linked adrenoleukodystrophy (X-ALD) is
           caused by mutations in the ALD gene, which encodes ALDP
           (adrenoleukodystrophy protein ), a peroxisomal integral
           membrane protein that is a member of the ATP-binding
           cassette (ABC) transporter protein family. The disease
           is characterized by a striking and unpredictable
           variation in phenotypic expression. Phenotypes include
           the rapidly progressive childhood cerebral form (CCALD),
           the milder adult form, adrenomyeloneuropathy (AMN), and
           variants without neurologic involvement (i.e.
           asymptomatic).
          Length = 166

 Score = 32.9 bits (76), Expect = 0.76
 Identities = 15/36 (41%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 744 DHVLRIDRIFRQPQG-HLLLIGVSGAGKTTLSRFVA 778
             VL  D  F    G  LL+ G SG GK++L R +A
Sbjct: 13  GRVLLKDLSFEIKPGDRLLITGPSGTGKSSLFRALA 48



 Score = 29.4 bits (67), Expect = 8.4
 Identities = 11/22 (50%), Positives = 16/22 (72%), Gaps = 4/22 (18%)

Query: 46 LMMVGPSGSGKSTAWKVLLKAL 67
          L++ GPSG+GKS+    L +AL
Sbjct: 30 LLITGPSGTGKSS----LFRAL 47


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
            family is found in eukaryotes. It is a coiled-coil domain
            of unknwon function.
          Length = 126

 Score = 31.8 bits (73), Expect = 0.82
 Identities = 16/105 (15%), Positives = 51/105 (48%), Gaps = 5/105 (4%)

Query: 1013 KLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMI 1072
            + +  +   L++++  L     ++ E++ + ++  K    K +      + A  + K   
Sbjct: 21   EEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRA---EKKAEEEKKLRK 77

Query: 1073 KDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVF--VMEDLAQVEP 1115
            + ++E ++ K + ++++AEIEK   ++ + + +   +E + +  P
Sbjct: 78   EKEEEIKELKAELEELKAEIEKLEEKLEEYQPYEEFLESVVERSP 122



 Score = 31.8 bits (73), Expect = 0.90
 Identities = 21/99 (21%), Positives = 46/99 (46%), Gaps = 12/99 (12%)

Query: 1042 QVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQ 1101
            ++EE+Q +L  K +E + + E    + +E+ K ++E ++  +       + +K   E   
Sbjct: 8    EMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLI-------KFDKFLKENEA 60

Query: 1102 KRVFVMEDLAQVEPAVMDAQQAVKEIK--KQQLVELRSM 1138
            KR    +   +        ++  +EIK  K +L EL++ 
Sbjct: 61   KRRRAEKKAEE---EKKLRKEKEEEIKELKAELEELKAE 96


>gnl|CDD|223256 COG0178, UvrA, Excinuclease ATPase subunit [DNA replication,
           recombination, and repair].
          Length = 935

 Score = 34.1 bits (79), Expect = 0.82
 Identities = 29/82 (35%), Positives = 34/82 (41%), Gaps = 26/82 (31%)

Query: 45  GLMMV--GPSGSGKST-AWKVLLKALER-----------YEGVEGVAHI-----IDPKAI 85
           G+     G SGSGKST     L+ AL R           Y+ +EG+ HI     ID   I
Sbjct: 627 GVFTCVTGVSGSGKSTLINDTLVPALARHLNGTKEEPGPYKKIEGLEHIDKVIDIDQSPI 686

Query: 86  -----SKEALY-GVLDPNTREW 101
                S  A Y GV D   RE 
Sbjct: 687 GRTPRSNPATYTGVFDD-IREL 707


>gnl|CDD|223540 COG0464, SpoVK, ATPases of the AAA+ class [Posttranslational
           modification, protein turnover, chaperones].
          Length = 494

 Score = 33.6 bits (77), Expect = 0.84
 Identities = 11/37 (29%), Positives = 18/37 (48%)

Query: 754 RQPQGHLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRA 790
            +P   +LL G  G GKT L++ VA  +      ++ 
Sbjct: 273 LRPPKGVLLYGPPGTGKTLLAKAVALESRSRFISVKG 309


>gnl|CDD|225164 COG2255, RuvB, Holliday junction resolvasome, helicase subunit [DNA
           replication, recombination, and repair].
          Length = 332

 Score = 33.7 bits (78), Expect = 0.86
 Identities = 21/67 (31%), Positives = 30/67 (44%), Gaps = 22/67 (32%)

Query: 716 LREYV-QARLKVFYEEELDVQL---VLFDEVLDHVLRIDRIFRQPQGHLLLIGVSGAGKT 771
           L E++ Q ++K    E+L + +       E LDHVL              L G  G GKT
Sbjct: 25  LDEFIGQEKVK----EQLQIFIKAAKKRGEALDHVL--------------LFGPPGLGKT 66

Query: 772 TLSRFVA 778
           TL+  +A
Sbjct: 67  TLAHIIA 73


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 34.1 bits (78), Expect = 0.86
 Identities = 30/140 (21%), Positives = 63/140 (45%), Gaps = 5/140 (3%)

Query: 1006 DFINHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAV----KSQELQSKN 1061
            +  NH +  Y EK S LEE+   + + +  I E +  +++ ++ L      KS    +K 
Sbjct: 465  EKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNKI 524

Query: 1062 EAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEI-AQKRVFVMEDLAQVEPAVMDA 1120
            E+A   L+++     E + +  + ++I+   +   +E    KR   +  LA +    ++ 
Sbjct: 525  ESARADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLNALAVISLIDIET 584

Query: 1121 QQAVKEIKKQQLVELRSMAN 1140
             ++     K+QL +L S   
Sbjct: 585  NRSRSNEIKKQLNDLESRLQ 604


>gnl|CDD|234814 PRK00698, tmk, thymidylate kinase; Validated.
          Length = 205

 Score = 32.8 bits (76), Expect = 0.87
 Identities = 17/75 (22%), Positives = 26/75 (34%), Gaps = 21/75 (28%)

Query: 50  GPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEALYGVLDPNTREWTDGLFTHI 109
           G  G+GKST  ++L + LE            D           V    TRE         
Sbjct: 10  GIDGAGKSTQIELLKELLE--------QQGRDV----------VF---TREPGGTPLGEK 48

Query: 110 LRRIIDNVRGEISKR 124
           LR ++ +   E+  +
Sbjct: 49  LRELLLDPNEEMDDK 63


>gnl|CDD|223436 COG0359, RplI, Ribosomal protein L9 [Translation, ribosomal structure
            and biogenesis].
          Length = 148

 Score = 32.2 bits (74), Expect = 0.87
 Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 15/79 (18%)

Query: 1032 GLGKIAETVEQVEEMQ------KSLAVKSQELQSKNEAANLKLKEMIKDQQEAEK--RKV 1083
            GLGK  + VE  +         K LAV + +        NLKL E  + + E +      
Sbjct: 11   GLGKKGDIVEVKDGYARNFLIPKGLAVPATK-------GNLKLLEARRAKLEKKAAEELA 63

Query: 1084 QSQDIQAEIEKQTVEIAQK 1102
            +++ ++ ++E +TVEIA K
Sbjct: 64   EAEALKEKLEGKTVEIAVK 82



 Score = 29.5 bits (67), Expect = 8.0
 Identities = 15/85 (17%), Positives = 34/85 (40%), Gaps = 16/85 (18%)

Query: 1032 GLGKIA--ETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRK------- 1082
            GL   A    ++ +E  +  L  + +  +   EA    LKE ++ +      K       
Sbjct: 34   GLAVPATKGNLKLLEARRAKL--EKKAAEELAEA--EALKEKLEGKTVEIAVKAGEDGKL 89

Query: 1083 ---VQSQDIQAEIEKQTVEIAQKRV 1104
               V S+DI   ++    ++ ++++
Sbjct: 90   FGSVTSKDIAEALKAAGFKLDKRKI 114


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner membrane
            complex of itself with TolQ and TolR to the outer
            membrane complex of TolB and OprL (also called Pal). Most
            of the length of the protein consists of low-complexity
            sequence that may differ in both length and composition
            from one species to another, complicating efforts to
            discriminate TolA (the most divergent gene in the tol-pal
            system) from paralogs such as TonB. Selection of members
            of the seed alignment and criteria for setting scoring
            cutoffs are based largely conserved operon struction.
            //The Tol-Pal complex is required for maintaining outer
            membrane integrity. Also involved in transport (uptake)
            of colicins and filamentous DNA, and implicated in
            pathogenesis. Transport is energized by the proton motive
            force. TolA is an inner membrane protein that interacts
            with periplasmic TolB and with outer membrane porins
            ompC, phoE and lamB [Transport and binding proteins,
            Other, Cellular processes, Pathogenesis].
          Length = 346

 Score = 33.7 bits (77), Expect = 0.88
 Identities = 23/123 (18%), Positives = 47/123 (38%), Gaps = 8/123 (6%)

Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMI--- 1072
             ++  E E+Q+        ++ +     +  +++     Q  + + +A   K K+     
Sbjct: 74   EQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAK 133

Query: 1073 -KDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQ 1131
             K + EAEK+  +    QAE E +    A+ +    E   + E       +A  E K + 
Sbjct: 134  AKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEA----EAKAKAEAKAKA 189

Query: 1132 LVE 1134
              E
Sbjct: 190  KAE 192



 Score = 32.1 bits (73), Expect = 2.5
 Identities = 21/109 (19%), Positives = 42/109 (38%), Gaps = 7/109 (6%)

Query: 1033 LGKIAETVEQVEEMQKSLAVKSQEL-QSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAE 1091
              K+ +  E+ E+ + +   + +EL Q        K  E    Q E ++++ +    +  
Sbjct: 70   QKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQA 129

Query: 1092 IEKQTVEIAQKRVFVMEDLA-QVEP-----AVMDAQQAVKEIKKQQLVE 1134
             E +    A+      E+   Q E      A  +A++   E KK+   E
Sbjct: 130  AEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAE 178


>gnl|CDD|238549 cd01129, PulE-GspE, PulE/GspE The type II secretory pathway is the
           main terminal branch of the general secretory pathway
           (GSP).  It is responsible for the export the majority of
           Gram-negative bacterial exoenzymes and toxins. PulE is a
           cytoplasmic protein of the GSP, which contains an ATP
           binding site and a tetracysteine motif. This subgroup
           also includes PillB and HofB.
          Length = 264

 Score = 33.0 bits (76), Expect = 0.93
 Identities = 15/46 (32%), Positives = 23/46 (50%), Gaps = 1/46 (2%)

Query: 27  GPWMEKVLQLYQISNLNHGLMMV-GPSGSGKSTAWKVLLKALERYE 71
           G   E +    ++    HG+++V GP+GSGK+T     L  L   E
Sbjct: 63  GLKPENLEIFRKLLEKPHGIILVTGPTGSGKTTTLYSALSELNTPE 108


>gnl|CDD|153364 cd07680, F-BAR_PACSIN1, The F-BAR (FES-CIP4 Homology and
            Bin/Amphiphysin/Rvs) domain of Protein kinase C and
            Casein kinase Substrate in Neurons 1 (PACSIN1).  F-BAR
            domains are dimerization modules that bind and bend
            membranes and are found in proteins involved in membrane
            dynamics and actin reorganization. Protein kinase C and
            Casein kinase Substrate in Neurons (PACSIN) proteins,
            also called Synaptic dynamin-associated proteins
            (Syndapins), act as regulators of cytoskeletal and
            membrane dynamics. Vetebrates harbor three isoforms with
            distinct expression patterns and specific functions.
            PACSIN 1 or Syndapin I is expressed specifically in the
            brain and is localized in neurites and synaptic boutons.
            It binds the brain-specific proteins dynamin I,
            synaptojanin, synapsin I, and neural Wiskott-Aldrich
            syndrome protein (nWASP), and functions as a link between
            the cytoskeletal machinery and synaptic vesicle
            endocytosis. PACSIN 1 interacts with huntingtin and may
            be implicated in the neuropathology of Huntington's
            disease. It contains an N-terminal F-BAR domain and a
            C-terminal SH3 domain. F-BAR domains form banana-shaped
            dimers with a positively-charged concave surface that
            binds to negatively-charged lipid membranes. They can
            induce membrane deformation in the form of long tubules.
          Length = 258

 Score = 33.1 bits (75), Expect = 0.94
 Identities = 35/140 (25%), Positives = 66/140 (47%), Gaps = 13/140 (9%)

Query: 1009 NHFVKLYREKCSELEEQQLHLNV--GLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANL 1066
            N+ +    EK    ++   H  +  G  +  E  +   + QK  A K +EL++  +A +L
Sbjct: 85   NNLLNEDLEKVKNWQKDAYHKQIMGGFKETKEAEDGFRKAQKPWAKKMKELEAAKKAYHL 144

Query: 1067 KLK-EMIKDQQEAEKRKVQS------QDIQAEIE--KQTVEIAQKRV-FVMEDLAQVEPA 1116
              K E +   +EA  +  QS      + +Q +++  KQ V+  Q++   V++D+ +  P 
Sbjct: 145  ACKEEKLAMTREANSKAEQSVTPEQQKKLQDKVDKCKQDVQKTQEKYEKVLDDVGKTTPQ 204

Query: 1117 VMDAQQAVKEIKKQQLVELR 1136
             M+  + V E + QQ  E R
Sbjct: 205  YMENMEQVFE-QCQQFEEKR 223


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 33.4 bits (77), Expect = 0.98
 Identities = 25/112 (22%), Positives = 53/112 (47%), Gaps = 14/112 (12%)

Query: 1235 LKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDL 1294
            L+K++    EL+ +  + +E   K  E K+ I+  ++S+ +        + +A  +K  +
Sbjct: 315  LEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITL-------VDKAKKVKAAI 367

Query: 1295 DNVQAKFYEYAQLIAQATAIKTDLDN-----VQAKVERSMA--LLKSLGIER 1339
            + +QA+F + A+ +A+       +       V+ K  R +   LLK  GI+ 
Sbjct: 368  EELQAEFVDNAEELAKLQDELDKIVKTKSELVKEKYHRGIVTDLLKDSGIKA 419


>gnl|CDD|218380 pfam05010, TACC, Transforming acidic coiled-coil-containing protein
            (TACC).  This family contains the proteins TACC 1, 2 and
            3 the genes for which are found concentrated in the
            centrosomes of eukaryotic and may play a conserved role
            in organising centrosomal microtubules. The human TACC
            proteins have been linked to cancer and TACC2 has been
            identified as a possible tumour suppressor (AZU-1). The
            functional homologue (Alp7) in Schizosaccharomyces pombe
            has been shown to be required for organisation of bipolar
            spindles.
          Length = 207

 Score = 32.8 bits (75), Expect = 0.99
 Identities = 21/107 (19%), Positives = 52/107 (48%), Gaps = 3/107 (2%)

Query: 1228 QISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQA 1287
            + S  DM   ++ ++ E+ S E+++ E K K E+ +    ++ K +  ++   AQ+I + 
Sbjct: 1    KYSQKDMDAALKKIKREVISSELESEELKKKYEKLRSENLEMGKIVDEFEKTIAQMI-EE 59

Query: 1288 TAIKTDL--DNVQAKFYEYAQLIAQATAIKTDLDNVQAKVERSMALL 1332
            +  + +L    +Q    E  Q  A   +++T   ++  + E+   ++
Sbjct: 60   SQKQKELSKRELQEVLAEKDQAYADLNSLETSFSDLFKRYEKYKEVI 106


>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus.  This family contains the N
            terminal region of several LUC7 protein homologues and
            only contains eukaryotic proteins. LUC7 has been shown to
            be a U1 snRNA associated protein with a role in splice
            site recognition. The family also contains human and
            mouse LUC7 like (LUC7L) proteins and human cisplatin
            resistance-associated overexpressed protein (CROP).
          Length = 252

 Score = 33.2 bits (76), Expect = 1.0
 Identities = 32/163 (19%), Positives = 61/163 (37%), Gaps = 26/163 (15%)

Query: 974  ACVYVHQTLHKANARLSKRGSRTMAITPRHYLDFINHFVKLYREKCSELEEQQLHLNVGL 1033
             C  VH    KA+   + +            L+ +  FV     K          ++   
Sbjct: 55   PCPKVHDLKLKADYERASKSQDYFPYE-VEALEILERFVHDCDRK----------IDKAK 103

Query: 1034 GKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKV-QSQDIQAEI 1092
             ++  T E  E+ + +   K++EL   +E     L E    +   E+ KV ++  +  E+
Sbjct: 104  QRLELTQE--EQTKIAADSKAEELAELDEEIGKLLAEA---EALGEEGKVDEAMKLMKEV 158

Query: 1093 EKQTVEIAQKRVFVMEDLAQVE-PAVMDAQQAVKEIKKQQLVE 1134
            E+     A+K+     D  +   P+   AQQ     +K ++ E
Sbjct: 159  EEL---KAKKKELEDSDEVRNAAPSSAQAQQ-----QKLRVCE 193


>gnl|CDD|234719 PRK00300, gmk, guanylate kinase; Provisional.
          Length = 205

 Score = 32.8 bits (76), Expect = 1.0
 Identities = 17/38 (44%), Positives = 22/38 (57%), Gaps = 6/38 (15%)

Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLNFS-SATT 455
           +VL GP G+GK+ TL+ AL         +L  S SATT
Sbjct: 8   IVLSGPSGAGKS-TLVKALLERDP----NLQLSVSATT 40


>gnl|CDD|153042 pfam12608, DUF3773, Protein of unknown function (DUF3773).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are approximately 110 amino
           acids in length.
          Length = 101

 Score = 31.4 bits (71), Expect = 1.1
 Identities = 10/50 (20%), Positives = 20/50 (40%), Gaps = 3/50 (6%)

Query: 192 EDVLSTEMIFENYLSRLRNIALDDIDDDSSLLITVDATGKAPDDVLSPAL 241
              +  E    ++    R ++ D++  D+ LL    A  +  D + S  L
Sbjct: 6   MLDILAEEAGADFYLDGREVSYDEVFSDTGLL---PALARRADQLASLCL 52


>gnl|CDD|227321 COG4988, CydD, ABC-type transport system involved in cytochrome bd
           biosynthesis, ATPase and permease components [Energy
           production and conversion / Posttranslational
           modification, protein turnover, chaperones].
          Length = 559

 Score = 33.4 bits (77), Expect = 1.1
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 49  VGPSGSGKSTAWKVLLKALERYEG 72
           VG SG+GKST   +LL  L   +G
Sbjct: 353 VGASGAGKSTLLNLLLGFLAPTQG 376


>gnl|CDD|184375 PRK13891, PRK13891, conjugal transfer protein TrbE; Provisional.
          Length = 852

 Score = 33.6 bits (77), Expect = 1.1
 Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 1/34 (2%)

Query: 39  ISNLNHGLMMVGPSGSGKSTAWKVLLKALERYEG 72
           + +L H  M  GP+G+GKST   ++   L RY G
Sbjct: 485 VRDLGHTFMF-GPTGAGKSTHLGIIAAQLRRYAG 517


>gnl|CDD|216584 pfam01580, FtsK_SpoIIIE, FtsK/SpoIIIE family.  FtsK has extensive
          sequence similarity to wide variety of proteins from
          prokaryotes and plasmids, termed the FtsK/SpoIIIE
          family. This domain contains a putative ATP binding
          P-loop motif. It is found in the FtsK cell division
          protein from E. coli and the stage III sporulation
          protein E SpoIIIE, which has roles in regulation of
          prespore specific gene expression in B. subtilis. A
          mutation in FtsK causes a temperature sensitive block
          in cell division and it is involved in peptidoglycan
          synthesis or modification. The SpoIIIE protein is
          implicated in intercellular chromosomal DNA transfer.
          Length = 201

 Score = 32.4 bits (74), Expect = 1.2
 Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 2/58 (3%)

Query: 27 GPWMEKVLQLYQISNLNHGLMMVGPSGSGKSTAWKVLLKALERYEGVEGVA-HIIDPK 83
          G  +     +  +  + H L+  G +GSGKST    L+ +L      E V  ++IDPK
Sbjct: 23 GKDISGNPVVADLVKMPHLLI-AGATGSGKSTFLNTLILSLAARHSPEEVRLYLIDPK 79


>gnl|CDD|233328 TIGR01243, CDC48, AAA family ATPase, CDC48 subfamily.  This
           subfamily of the AAA family ATPases includes two members
           each from three archaeal species. It also includes yeast
           CDC48 (cell division control protein 48) and the human
           ortholog, transitional endoplasmic reticulum ATPase
           (valosin-containing protein). These proteins in
           eukaryotes are involved in the budding and transfer of
           membrane from the transitional endoplasmic reticulum to
           the Golgi apparatus.
          Length = 733

 Score = 33.3 bits (76), Expect = 1.3
 Identities = 49/176 (27%), Positives = 73/176 (41%), Gaps = 34/176 (19%)

Query: 417 KPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFSSATTPELLLKTFDHYCEYRKTPNGV 476
           K ++L GPPG+GK  TLL+   A         NF +   PE+L K      E  K    +
Sbjct: 488 KGVLLFGPPGTGK--TLLAKAVATES----GANFIAVRGPEILSKWVG---ESEKAIREI 538

Query: 477 ILSPIQLGKWLVLFCDEINL------PDMDKYATQRVISFLRQLIEQRGFYRPADKQWVS 530
                Q    ++ F DEI+          D   T R+++ L  L E  G    ++     
Sbjct: 539 FRKARQAAPAIIFF-DEIDAIAPARGARFDTSVTDRIVNQL--LTEMDGIQELSN----- 590

Query: 531 LERIQCVGACNPP--TDPG-RKPLSHRFLRHVPVIYVDYPGETSLKQIYGTFSRAM 583
              +  + A N P   DP   +P   RF R   +I V  P E + K+I+   +R+M
Sbjct: 591 ---VVVIAATNRPDILDPALLRP--GRFDR---LILVPPPDEEARKEIFKIHTRSM 638


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists of
            several eukaryotic mitotic checkpoint (Mitotic arrest
            deficient or MAD) proteins. The mitotic spindle
            checkpoint monitors proper attachment of the bipolar
            spindle to the kinetochores of aligned sister chromatids
            and causes a cell cycle arrest in prometaphase when
            failures occur. Multiple components of the mitotic
            spindle checkpoint have been identified in yeast and
            higher eukaryotes. In S.cerevisiae, the existence of a
            Mad1-dependent complex containing Mad2, Mad3, Bub3 and
            Cdc20 has been demonstrated.
          Length = 722

 Score = 33.4 bits (76), Expect = 1.3
 Identities = 64/354 (18%), Positives = 132/354 (37%), Gaps = 46/354 (12%)

Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVK--SQELQSKNEAANLKLKE---- 1070
            ++  ELE +   L     K      + EE  K L  K  ++ L+ +NE  +   +     
Sbjct: 107  DRNLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESI 166

Query: 1071 -MIKDQQEAEKRKVQSQDIQ-----AEIE--KQTVEIAQKRVFVMEDLAQVEPAVMDAQQ 1122
              IK+     + + Q+ D +     +E+E  ++ +E  QK +   E   Q       A  
Sbjct: 167  SRIKNDLSEMQCRAQNADTELKLLESELEELREQLEECQKELAEAEKKLQ-SLTSEQASS 225

Query: 1123 AVKEIKKQQL-VELRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMREN-FINSIV 1180
            A   +K + L  EL+       VVK   E +  +      + +   A +  EN  + S+ 
Sbjct: 226  ADNSVKIKHLEEELKRYEQDAEVVKSMKEQLLQI-----PELERELAALREENRKLRSM- 279

Query: 1181 SNFNTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEP 1240
               + E++ +E+                +  +R         K A  ++    +  +++ 
Sbjct: 280  -KEDNELLKEEL---------------EDLQSRLERFEKMREKLADLELEKEKLENELK- 322

Query: 1241 LRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAK 1300
                 KSL      N    ++    I  L+      K++   + + A  ++T L  +Q  
Sbjct: 323  ---SWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSAKQLETTLQQLQL- 378

Query: 1301 FYEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGIERERWEATSETFRSQMA 1354
              E  + +++   +K  L+ ++A V R    L  +  ER+   A   ++  ++ 
Sbjct: 379  --ERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNSYDKELT 430


>gnl|CDD|234734 PRK00349, uvrA, excinuclease ABC subunit A; Reviewed.
          Length = 943

 Score = 33.5 bits (78), Expect = 1.3
 Identities = 26/75 (34%), Positives = 33/75 (44%), Gaps = 24/75 (32%)

Query: 50  GPSGSGKST-----AWKVLLKALER-------YEGVEGVAHI-----IDPKAI-----SK 87
           G SGSGKST      +K L + L         ++ +EG+ H+     ID   I     S 
Sbjct: 642 GVSGSGKSTLINETLYKALARKLNGAKKVPGKHKEIEGLEHLDKVIDIDQSPIGRTPRSN 701

Query: 88  EALY-GVLDPNTREW 101
            A Y GV DP  RE 
Sbjct: 702 PATYTGVFDP-IREL 715


>gnl|CDD|235290 PRK04351, PRK04351, hypothetical protein; Provisional.
          Length = 149

 Score = 31.7 bits (73), Expect = 1.3
 Identities = 12/24 (50%), Positives = 15/24 (62%), Gaps = 2/24 (8%)

Query: 1503 NRELRRTGGRVLITLGDQDIDISP 1526
            N+ LR TGGR L  L D  I+ +P
Sbjct: 29   NKRLRTTGGRYL--LKDHHIEFNP 50


>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0).  Part of the CF(0)
            (base unit) of the ATP synthase. The base unit is thought
            to translocate protons through membrane (inner membrane
            in mitochondria, thylakoid membrane in plants,
            cytoplasmic membrane in bacteria). The B subunits are
            thought to interact with the stalk of the CF(1) subunits.
            This domain should not be confused with the ab CF(1)
            proteins (in the head of the ATP synthase) which are
            found in pfam00006.
          Length = 132

 Score = 31.5 bits (72), Expect = 1.4
 Identities = 25/98 (25%), Positives = 48/98 (48%), Gaps = 15/98 (15%)

Query: 1035 KIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKD-QQEAEKRKVQSQDIQAEIE 1093
            KIA  +++ EE  K  A    E + +   A  +  E+I + ++EA+K K   ++I AE +
Sbjct: 34   KIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLK---EEILAEAQ 90

Query: 1094 KQ-TVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
            K     +   R     ++ Q +      +QA+ E+++Q
Sbjct: 91   KDAERLLESAR----AEIEQEK------EQALAELRQQ 118


>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
           secretion].
          Length = 407

 Score = 33.1 bits (76), Expect = 1.4
 Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 2/47 (4%)

Query: 728 YEEELDVQLVLFDEVLDHVLR--IDRIFRQPQGHLLLIGVSGAGKTT 772
            ++E    L  F E L  +L   I+ +  + +  + L+G +G GKTT
Sbjct: 172 MKDESYEDLRYFSEKLRKLLLSLIENLIVEQKRVIALVGPTGVGKTT 218


>gnl|CDD|213200 cd03233, ABCG_PDR_domain1, First domain of the pleiotropic drug
          resistance-like subfamily G of ATP-binding cassette
          transporters.  The pleiotropic drug resistance (PDR) is
          a well-described phenomenon occurring in fungi and
          shares several similarities with processes in bacteria
          and higher eukaryotes. This PDR subfamily represents
          domain I of its (ABC-IM)2 organization. ABC
          transporters are a large family of proteins involved in
          the transport of a wide variety of different compounds
          including sugars, ions, peptides, and more complex
          organic molecules. The nucleotide-binding domain shows
          the highest similarity between all members of the
          family. ABC transporters are a subset of nucleotide
          hydrolases that contain a signature motif, Q-loop, and
          H-loop/switch region, in addition to, the Walker A
          motif/P-loop and Walker B motif commonly found in a
          number of ATP- and GTP-binding and hydrolyzing
          proteins.
          Length = 202

 Score = 32.2 bits (74), Expect = 1.4
 Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 46 LMMVGPSGSGKSTAWKVLLKALERYEGVEG 75
          ++++G  GSG ST  K L    E    VEG
Sbjct: 36 VLVLGRPGSGCSTLLKALANRTEGNVSVEG 65


>gnl|CDD|145928 pfam03040, CemA, CemA family.  Members of this family are probable
            integral membrane proteins. Their molecular function is
            unknown. CemA proteins are found in the inner envelope
            membrane of chloroplasts but not in the thylakoid
            membrane. A cyanobacterial member of this family has been
            implicated in CO2 transport, but is probably not a CO2
            transporter itself. They are predicted to be haem-binding
            however this has not been proven experimentally.
          Length = 230

 Score = 32.5 bits (75), Expect = 1.4
 Identities = 14/58 (24%), Positives = 27/58 (46%), Gaps = 10/58 (17%)

Query: 1044 EEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQ 1101
               ++    K QE +       L+L E+IK+           Q+++ EI K+ +E+A+
Sbjct: 51   SIQEERALEKLQEFEEL-----LELDELIKESPP-----TSLQELRIEIHKKAIELAK 98


>gnl|CDD|235450 PRK05416, PRK05416, glmZ(sRNA)-inactivating NTPase; Provisional.
          Length = 288

 Score = 32.4 bits (75), Expect = 1.6
 Identities = 22/71 (30%), Positives = 31/71 (43%), Gaps = 17/71 (23%)

Query: 50  GPSGSGKSTAWKVLLKALE--RYEGVEG---------VAHIIDPKAISKEALYGVLDPNT 98
           G SG+GKS A    L+ALE   Y  V+          V  +     I K A+  V+D  +
Sbjct: 13  GLSGAGKSVA----LRALEDLGYYCVDNLPPSLLPKLVELLAQSGGIRKVAV--VIDVRS 66

Query: 99  REWTDGLFTHI 109
           R + D L   +
Sbjct: 67  RPFFDDLPEAL 77


>gnl|CDD|223272 COG0194, Gmk, Guanylate kinase [Nucleotide transport and
           metabolism].
          Length = 191

 Score = 32.1 bits (74), Expect = 1.7
 Identities = 19/40 (47%), Positives = 22/40 (55%), Gaps = 7/40 (17%)

Query: 417 KPLVLCGPPGSGKTMTLLSALRALPDMEVVSLNFS-SATT 455
             +VL GP G GK+ TL+ AL      E   L FS SATT
Sbjct: 5   LLIVLSGPSGVGKS-TLVKALL-----EDDKLRFSVSATT 38


>gnl|CDD|224055 COG1132, MdlB, ABC-type multidrug transport system, ATPase and
           permease components [Defense mechanisms].
          Length = 567

 Score = 32.8 bits (75), Expect = 1.7
 Identities = 14/27 (51%), Positives = 19/27 (70%)

Query: 46  LMMVGPSGSGKSTAWKVLLKALERYEG 72
           + +VGPSGSGKST  K+LL+  +   G
Sbjct: 358 VAIVGPSGSGKSTLIKLLLRLYDPTSG 384


>gnl|CDD|238899 cd01918, HprK_C, HprK/P, the bifunctional histidine-containing
          protein kinase/phosphatase, controls the
          phosphorylation state of the phosphocarrier protein HPr
          and regulates the utilization of carbon sources by
          gram-positive bacteria. It catalyzes both the
          ATP-dependent phosphorylation of Ser-46 of HPr and its
          dephosphorylation by phosphorolysis. The latter
          reaction uses inorganic phosphate as substrate and
          produces pyrophosphate. Phosphoenolpyruvate
          carboxykinase (PEPCK) and the C-terminal catalytic
          domain of HprK/P are structurally similar with
          conserved active site residues suggesting these two
          phosphotransferases have related functions.  The HprK/P
          N-terminal domain is structurally similar to the
          N-terminal domains of the MurE and MurF amino acid
          ligases.
          Length = 149

 Score = 31.4 bits (72), Expect = 1.8
 Identities = 11/25 (44%), Positives = 14/25 (56%), Gaps = 6/25 (24%)

Query: 44 HGLMMVGPSGSGKSTAWKVLLKALE 68
           G+++ GPSG GKS        ALE
Sbjct: 15 IGVLITGPSGIGKSEL------ALE 33


>gnl|CDD|213184 cd03217, ABC_FeS_Assembly, ABC-type transport system involved in
          Fe-S cluster assembly, ATPase component.  Biosynthesis
          of iron-sulfur clusters (Fe-S) depends on multi-protein
          systems. The SUF system of E. coli and Erwinia
          chrysanthemi is important for Fe-S biogenesis under
          stressful conditions. The SUF system is made of six
          proteins: SufC is an atypical cytoplasmic ABC-ATPase,
          which forms a complex with SufB and SufD; SufA plays
          the role of a scaffold protein for assembly of
          iron-sulfur clusters and delivery to target proteins;
          SufS is a cysteine desulfurase which mobilizes the
          sulfur atom from cysteine and provides it to the
          cluster; SufE has no associated function yet.
          Length = 200

 Score = 31.7 bits (73), Expect = 1.8
 Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 9/35 (25%)

Query: 44 HGLMMVGPSGSGKSTAWKVLLKAL---ERYEGVEG 75
          H LM  GP+GSGKST    L K +    +YE  EG
Sbjct: 29 HALM--GPNGSGKST----LAKTIMGHPKYEVTEG 57


>gnl|CDD|226164 COG3638, COG3638, ABC-type phosphate/phosphonate transport system,
           ATPase component [Inorganic ion transport and
           metabolism].
          Length = 258

 Score = 32.2 bits (74), Expect = 1.8
 Identities = 24/90 (26%), Positives = 39/90 (43%), Gaps = 5/90 (5%)

Query: 753 FRQPQG-HLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLRRSG 811
               QG  + +IG SGAGK+TL R +  +   +  +I  +         ++LR + R  G
Sbjct: 25  LEINQGEMVAIIGPSGAGKSTLLRSLNGLVDPTSGEILFNGVQITKLKGKELRKLRRDIG 84

Query: 812 CKNEKIAFLLDES---NVLESGFLERMNTL 838
              ++   +   S   NVL  G L   +T 
Sbjct: 85  MIFQQFNLVPRLSVLENVL-LGRLGYTSTW 113



 Score = 31.5 bits (72), Expect = 3.2
 Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 6/49 (12%)

Query: 41 NLNHGLM--MVGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISK 87
           +N G M  ++GPSG+GKST    LL++L           + +   I+K
Sbjct: 26 EINQGEMVAIIGPSGAGKST----LLRSLNGLVDPTSGEILFNGVQITK 70


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and
            chromosome partitioning].
          Length = 420

 Score = 32.8 bits (75), Expect = 1.8
 Identities = 22/89 (24%), Positives = 39/89 (43%), Gaps = 7/89 (7%)

Query: 1244 ELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYE 1303
            +LK ++ + +  + K  E +D   +LEK + S + E A L AQ      DL  ++ +   
Sbjct: 39   QLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQ--- 95

Query: 1304 YAQLIAQATAIKTDLDNVQAKVERSMALL 1332
                IA   A    L+  + +  R +A  
Sbjct: 96   ----IADLNARLNALEVQEREQRRRLAEQ 120



 Score = 30.8 bits (70), Expect = 5.6
 Identities = 21/91 (23%), Positives = 35/91 (38%), Gaps = 8/91 (8%)

Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIA 1100
            EQ  +  K LA   +E +      N    E+  DQ++ E+ +     ++ EI       A
Sbjct: 193  EQRAQQAK-LAQLLEERKKTLAQLN---SELSADQKKLEELRANESRLKNEIASAEAAAA 248

Query: 1101 QKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQ 1131
            + R    E  A  E A   A+ A  +   + 
Sbjct: 249  KAR----EAAAAAEAAAARARAAEAKRTGET 275


>gnl|CDD|213179 cd00267, ABC_ATPase, ATP-binding cassette transporter
          nucleotide-binding domain.  ABC transporters are a
          large family of proteins involved in the transport of a
          wide variety of different compounds, like sugars, ions,
          peptides, and more complex organic molecules. The
          nucleotide-binding domain shows the highest similarity
          between all members of the family. ABC transporters are
          a subset of nucleotide hydrolases that contain a
          signature motif, Q-loop, and H-loop/switch region, in
          addition to, the Walker A motif/P-loop and Walker B
          motif commonly found in a number of ATP- and
          GTP-binding and hydrolyzing proteins.
          Length = 157

 Score = 31.4 bits (72), Expect = 1.8
 Identities = 18/42 (42%), Positives = 23/42 (54%), Gaps = 4/42 (9%)

Query: 49 VGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEAL 90
          VGP+GSGKST    LL+A+           +ID K I+K  L
Sbjct: 31 VGPNGSGKST----LLRAIAGLLKPTSGEILIDGKDIAKLPL 68



 Score = 29.9 bits (68), Expect = 6.2
 Identities = 14/46 (30%), Positives = 24/46 (52%)

Query: 759 HLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDEDLR 804
            + L+G +G+GK+TL R +A +   +  +I    K       E+LR
Sbjct: 27  IVALVGPNGSGKSTLLRAIAGLLKPTSGEILIDGKDIAKLPLEELR 72


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
            domain.  This domain is found in a number of different
            types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 31.2 bits (71), Expect = 1.9
 Identities = 18/81 (22%), Positives = 33/81 (40%), Gaps = 9/81 (11%)

Query: 1060 KNEAANLKLKEMIKDQQEAEKRKVQ---SQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPA 1116
            K     L+  ++   ++EAEK K +        AE EK+  E+ +K+        + +  
Sbjct: 66   KKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEEKKIM 125

Query: 1117 VMD------AQQAVKEIKKQQ 1131
              D       Q+   EI ++Q
Sbjct: 126  FADTSSLSPEQRQYYEIMQKQ 146


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
            seryl-tRNA synthetase found in most organisms. This
            protein is a class II tRNA synthetase, and is recognized
            by the pfam model tRNA-synt_2b. The seryl-tRNA
            synthetases of two archaeal species, Methanococcus
            jannaschii and Methanobacterium thermoautotrophicum,
            differ considerably and are included in a different model
            [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 32.3 bits (74), Expect = 2.0
 Identities = 19/102 (18%), Positives = 44/102 (43%), Gaps = 15/102 (14%)

Query: 1013 KLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSK-NEAA------- 1064
            KL R     ++E      + +    E +  +++ +K L  + +ELQ+K NE +       
Sbjct: 5    KLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAK 64

Query: 1065 -------NLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEI 1099
                       KE+ + ++E  +     + ++AE++ + + I
Sbjct: 65   GQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSI 106


>gnl|CDD|110677 pfam01695, IstB_IS21, IstB-like ATP binding protein.  This protein
           contains an ATP/GTP binding P-loop motif. It is found
           associated with IS21 family insertion sequences. The
           function of this protein is unknown, but it may perform
           a transposase function.
          Length = 178

 Score = 31.5 bits (72), Expect = 2.1
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 412 WLAEHKPLVLCGPPGSGKT 430
           W+ + + L+L GPPG GKT
Sbjct: 43  WIEQAENLLLLGPPGVGKT 61


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
            Validated.
          Length = 605

 Score = 32.5 bits (74), Expect = 2.2
 Identities = 14/54 (25%), Positives = 22/54 (40%)

Query: 1525 SPTFVIFLSTRDPTVEFPPDICSRVTFVNFTVTRSSLQSQCLNRVLKAERPDID 1578
             P  V+F+       + P  I SR    NF    +S   + L  + K E+  I+
Sbjct: 146  PPKHVVFIFATTEFQKIPLTIISRCQRYNFKKLNNSELQELLKSIAKKEKIKIE 199


>gnl|CDD|237917 PRK15134, PRK15134, microcin C ABC transporter ATP-binding protein
           YejF; Provisional.
          Length = 529

 Score = 32.4 bits (74), Expect = 2.3
 Identities = 16/28 (57%), Positives = 18/28 (64%)

Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVV 446
           L L G  GSGK++T LS LR LP   VV
Sbjct: 38  LALVGESGSGKSVTALSILRLLPSPPVV 65


>gnl|CDD|183133 PRK11432, fbpC, ferric transporter ATP-binding subunit;
           Provisional.
          Length = 351

 Score = 32.0 bits (73), Expect = 2.3
 Identities = 13/24 (54%), Positives = 17/24 (70%), Gaps = 1/24 (4%)

Query: 756 PQGHLL-LIGVSGAGKTTLSRFVA 778
            QG ++ L+G SG GKTT+ R VA
Sbjct: 30  KQGTMVTLLGPSGCGKTTVLRLVA 53


>gnl|CDD|132313 TIGR03269, met_CoM_red_A2, methyl coenzyme M reductase system,
           component A2.  The enzyme that catalyzes the final step
           in methanogenesis, methyl coenzyme M reductase, contains
           alpha, beta, and gamma chains. In older literature, the
           complex of alpha, beta, and gamma chains was termed
           component C, while this single chain protein was termed
           methyl coenzyme M reductase system component A2 [Energy
           metabolism, Methanogenesis].
          Length = 520

 Score = 32.5 bits (74), Expect = 2.4
 Identities = 11/17 (64%), Positives = 15/17 (88%)

Query: 762 LIGVSGAGKTTLSRFVA 778
           ++G SGAGKTTLS+ +A
Sbjct: 315 IVGTSGAGKTTLSKIIA 331


>gnl|CDD|234357 TIGR03797, NHLM_micro_ABC2, NHLM bacteriocin system ABC
           transporter, ATP-binding protein.  Members of this
           protein family are ABC transporter ATP-binding subunits,
           part of a three-gene putative bacteriocin transport
           operon. The other subunits include another ATP-binding
           subunit (TIGR03796), which has an N-terminal leader
           sequence cleavage domain, and an HlyD homolog
           (TIGR03794). In a number of genomes, members of protein
           families related to nitrile hydratase alpha subunit or
           to nif11 have undergone paralogous family expansions,
           with members possessing a putative bacteriocin cleavage
           region ending with a classic Gly-Gly motif. Those sets
           of putative bacteriocins, members of this protein family
           and its partners TIGR03794 and TIGR03796, and
           cyclodehydratase/docking scaffold fusion proteins of
           thiazole/oxazole biosynthesis frequently show correlated
           species distribution and co-clustering within many of
           those genomes [Transport and binding proteins, Amino
           acids, peptides and amines, Cellular processes,
           Biosynthesis of natural products].
          Length = 686

 Score = 32.2 bits (74), Expect = 2.4
 Identities = 12/16 (75%), Positives = 14/16 (87%)

Query: 49  VGPSGSGKSTAWKVLL 64
           VGPSGSGKST  ++LL
Sbjct: 485 VGPSGSGKSTLLRLLL 500


>gnl|CDD|224059 COG1136, SalX, ABC-type antimicrobial peptide transport system,
          ATPase component [Defense mechanisms].
          Length = 226

 Score = 31.7 bits (73), Expect = 2.5
 Identities = 10/13 (76%), Positives = 12/13 (92%)

Query: 46 LMMVGPSGSGKST 58
          + +VGPSGSGKST
Sbjct: 34 VAIVGPSGSGKST 46


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 32.3 bits (74), Expect = 2.5
 Identities = 17/99 (17%), Positives = 33/99 (33%), Gaps = 4/99 (4%)

Query: 1008 INHFVKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANL- 1066
            +        E     EE +L          E   ++ E+Q++        ++KNE     
Sbjct: 819  LKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFK 878

Query: 1067 ---KLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQK 1102
               K    +   ++   + V S D   + EK T +  + 
Sbjct: 879  NDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKD 917


>gnl|CDD|223455 COG0378, HypB, Ni2+-binding GTPase involved in regulation of
           expression and maturation of urease and hydrogenase
           [Posttranslational modification, protein turnover,
           chaperones / Transcription].
          Length = 202

 Score = 31.4 bits (72), Expect = 2.6
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 421 LCGPPGSGKTMTLLSALRALPD 442
           + GPPGSGKT  +   LRAL D
Sbjct: 18  VGGPPGSGKTALIEKTLRALKD 39


>gnl|CDD|233265 TIGR01085, murE, UDP-N-acetylmuramyl-tripeptide synthetase.  Most
           members of this family are EC 6.3.2.13,
           UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,
           6-diaminopimelate ligase. An exception is Staphylococcus
           aureus, in which diaminopimelate is replaced by lysine
           in the peptidoglycan and MurE is EC 6.3.2.7. The
           Mycobacteria, part of the closest neighboring branch
           outside of the low-GC Gram-positive bacteria, use
           diaminopimelate. A close homolog, scoring just below the
           trusted cutoff, is found (with introns) in Arabidopsis
           thaliana. Its role is unknown [Cell envelope,
           Biosynthesis and degradation of murein sacculus and
           peptidoglycan].
          Length = 464

 Score = 32.3 bits (74), Expect = 2.6
 Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 7/60 (11%)

Query: 726 VFYEEELDVQLVLFDEV----LDHVLRI--DRIFRQPQGHLLLIGVSGA-GKTTLSRFVA 778
           V  E ++D  +     +    L H L       +  P   L +IGV+G  GKTT +  +A
Sbjct: 45  VVVERDVDFYVAPVPVIIVPDLRHALSSLAAAFYGHPSKKLKVIGVTGTNGKTTTTSLIA 104


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
            Provisional.
          Length = 1123

 Score = 32.2 bits (74), Expect = 2.6
 Identities = 30/145 (20%), Positives = 54/145 (37%), Gaps = 32/145 (22%)

Query: 1219 GPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKD 1278
            GP V     +     + ++V  L+ +L+    + ++++A  E  +  +  LE   A  ++
Sbjct: 132  GPFVPPEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEE 191

Query: 1279 EYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVERSMALLKSLGI- 1337
            +  +L AQ                   QL  +A    T  +  Q + E +    K L + 
Sbjct: 192  KQQELEAQ-----------------LEQLQEKAAE--TSQERKQKRKEITDQAAKRLELS 232

Query: 1338 ERER------------WEATSETFR 1350
            E E             WEA S+T R
Sbjct: 233  EEETRILIDQQLRKAGWEADSKTLR 257


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
          Provisional.
          Length = 472

 Score = 32.0 bits (73), Expect = 2.6
 Identities = 14/39 (35%), Positives = 26/39 (66%), Gaps = 2/39 (5%)

Query: 38 QISNLNHGLMMVGPSGSGKSTAWKVLLKAL--ERYEGVE 74
          + ++++H  +  GP G+GK+T  ++L K+L  E  +GVE
Sbjct: 31 KKNSISHAYIFAGPRGTGKTTVARILAKSLNCENRKGVE 69


>gnl|CDD|131721 TIGR02673, FtsE, cell division ATP-binding protein FtsE.  This
          model describes FtsE, a member of the ABC transporter
          ATP-binding protein family. This protein, and its
          permease partner FtsX, localize to the division site.
          In a number of species, the ftsEX gene pair is located
          next to FtsY, the signal recognition particle-docking
          protein [Cellular processes, Cell division].
          Length = 214

 Score = 31.4 bits (72), Expect = 2.6
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 46 LMMVGPSGSGKSTAWKVLLKALERYEG 72
          L + GPSG+GK+T  K+L  AL    G
Sbjct: 31 LFLTGPSGAGKTTLLKLLYGALTPSRG 57


>gnl|CDD|224048 COG1123, COG1123, ATPase components of various ABC-type transport
           systems, contain duplicated ATPase [General function
           prediction only].
          Length = 539

 Score = 32.2 bits (74), Expect = 2.8
 Identities = 10/17 (58%), Positives = 15/17 (88%)

Query: 762 LIGVSGAGKTTLSRFVA 778
           L+G SG+GK+TL+R +A
Sbjct: 322 LVGESGSGKSTLARILA 338



 Score = 31.4 bits (72), Expect = 5.0
 Identities = 12/26 (46%), Positives = 14/26 (53%)

Query: 49  VGPSGSGKSTAWKVLLKALERYEGVE 74
           VG SGSGKST  ++L   L    G  
Sbjct: 323 VGESGSGKSTLARILAGLLPPSSGSI 348


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).  This
            family consists of several plant proteins of unknown
            function. Several sequences in this family are described
            as being "myosin heavy chain-like".
          Length = 484

 Score = 31.9 bits (72), Expect = 2.9
 Identities = 66/337 (19%), Positives = 130/337 (38%), Gaps = 35/337 (10%)

Query: 1023 EEQQLHL-NVGLGKIAETVEQVEEMQKSLAVKSQELQS-KNEAANLKLKEMIKDQQEAEK 1080
            +E +L L      K+ + +E V E +K   +K  EL+S K    +LKLK      ++AEK
Sbjct: 34   KETELMLAQEEANKLKKELE-VAEKEKLQVLK--ELESTKRTVEDLKLK-----LEKAEK 85

Query: 1081 RKVQSQDIQAEIEKQTVEIAQKRVFVME------DLAQVEPAVMDAQQAVKEIKKQQLVE 1134
             + Q+        KQ  E+A+ R   +E      ++ +   A  +     +E++K +   
Sbjct: 86   EEQQA--------KQDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEY 137

Query: 1135 LRSMANPPSVVKLALESICLLLGENATDWKAIRAVVMRENFINSIVSNFNTEMITDEVRE 1194
               +    + +K A E+IC      A+     +   + +  I    S         E  E
Sbjct: 138  DALVEERDAALKRAEEAIC------ASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEE 191

Query: 1195 KMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASE 1254
            +     L               A    ++     +     L+K+     EL+SL+ + S 
Sbjct: 192  ERIGAALEKDQDRETYEKELKEA-EKELERLKQDLDPEKDLEKLAEASAELESLQKEISI 250

Query: 1255 NKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAKFYEYAQLIAQATAI 1314
              +     K+L  + + ++    +E   L     ++K +L+  + +  E  +   +A   
Sbjct: 251  MASVASVLKEL-EEAKANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKEGEAEEA 309

Query: 1315 KTDL--DNVQAKVERSM-ALLKSLGIERERWEATSET 1348
             + L  +  + K E+   A  K + I  +  +A+SE 
Sbjct: 310  ASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEA 346



 Score = 30.3 bits (68), Expect = 9.2
 Identities = 35/142 (24%), Positives = 60/142 (42%), Gaps = 12/142 (8%)

Query: 1019 CSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKL--------KE 1070
              ELEE++  L     K  E  E    ++  L     E +SK     +++         E
Sbjct: 286  KQELEEEKKELEELREKEGEAEEAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSE 345

Query: 1071 MIKDQQEAEKRKVQSQDIQAEIE--KQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIK 1128
              + ++EAE  + + + ++ E E  K  +E A+ R+       + E A      A+ EIK
Sbjct: 346  AEEARKEAEAAREELRKLKEEAEQTKAALETAELRLEAALK--EAEAAKAAEALALAEIK 403

Query: 1129 KQQLVELRSMANPPSVVKLALE 1150
              Q  E  + A+ P  + L+LE
Sbjct: 404  ALQESEESAKADSPRKITLSLE 425


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members of
            this protein family are the bacterial ATP-dependent
            chaperone ClpB. This protein belongs to the AAA family,
            ATPases associated with various cellular activities
            (pfam00004). This molecular chaperone does not act as a
            protease, but rather serves to disaggregate misfolded and
            aggregated proteins [Protein fate, Protein folding and
            stabilization].
          Length = 852

 Score = 32.2 bits (74), Expect = 3.0
 Identities = 23/86 (26%), Positives = 41/86 (47%), Gaps = 20/86 (23%)

Query: 1241 LRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAIKTDLDNVQAK 1300
            L +E ++L+      K K E +K+ +  LEK +A  ++EYA L  Q  A K  +  +Q  
Sbjct: 417  LEIEREALK------KEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQ-- 468

Query: 1301 FYEYAQLIAQATAIKTDLDNVQAKVE 1326
                         IK +++ V+ ++E
Sbjct: 469  ------------QIKEEIEQVRLELE 482


>gnl|CDD|233062 TIGR00630, uvra, excinuclease ABC, A subunit.  This family is a
           member of the ABC transporter superfamily of proteins of
           which all members for which functions are known except
           the UvrA proteins are involved in the transport of
           material through membranes. UvrA orthologs are involved
           in the recognition of DNA damage as a step in nucleotide
           excision repair. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 925

 Score = 32.3 bits (74), Expect = 3.0
 Identities = 26/77 (33%), Positives = 29/77 (37%), Gaps = 25/77 (32%)

Query: 45  GLMMV--GPSGSGKST-AWKVLLKALER-----------YEGVEGVAHI-----IDPKAI 85
           GL     G SGSGKST     L  AL             Y  +EG+ H+     ID   I
Sbjct: 634 GLFTCITGVSGSGKSTLINDTLYPALANRLNGAKTVPGRYTSIEGLEHLDKVIHIDQSPI 693

Query: 86  -----SKEALY-GVLDP 96
                S  A Y GV D 
Sbjct: 694 GRTPRSNPATYTGVFDE 710


>gnl|CDD|234132 TIGR03167, tRNA_sel_U_synt, tRNA 2-selenouridine synthase.  The
           Escherichia coli YbbB protein was shown to encode a
           selenophosphate-dependent tRNA 2-selenouridine synthase,
           essential for modification of some tRNAs to replace a
           sulfur atom with selenium. This enzyme works with SelD,
           the selenium donor protein, which also acts in
           selenocysteine incorporation. Although the members of
           this protein family show a fairly deep split, sequences
           from both sides of the split are supported by
           co-occurence with, and often proximity to, the selD gene
           [Protein synthesis, tRNA and rRNA base modification].
          Length = 311

 Score = 31.8 bits (73), Expect = 3.1
 Identities = 13/32 (40%), Positives = 14/32 (43%), Gaps = 1/32 (3%)

Query: 409 LYTWLAEHKPLVLCGPPGSGKTMTLLSALRAL 440
           L         +VL G  GSGKT  LL AL   
Sbjct: 120 LEELPQPFPLIVLGGMTGSGKT-ELLHALANA 150


>gnl|CDD|213233 cd03266, ABC_NatA_sodium_exporter, ATP-binding cassette domain of
           the Na+ transporter.  NatA is the ATPase component of a
           bacterial ABC-type Na+ transport system called NatAB,
           which catalyzes ATP-dependent electrogenic Na+ extrusion
           without mechanically coupled proton or K+ uptake. NatB
           possess six putative membrane spanning regions at its
           C-terminus. In B. subtilis, NatAB is inducible by agents
           such as ethanol and protonophores, which lower the
           proton-motive force across the membrane. The closest
           sequence similarity to NatA is exhibited by DrrA of the
           two-component daunorubicin- and doxorubicin-efflux
           system. Hence, the functional NatAB is presumably
           assembled with two copies of a single ATP-binding
           protein and a single integral membrane protein.
          Length = 218

 Score = 31.2 bits (71), Expect = 3.1
 Identities = 16/42 (38%), Positives = 23/42 (54%), Gaps = 2/42 (4%)

Query: 739 FDEVLDHVLRIDRI-FRQPQGHLL-LIGVSGAGKTTLSRFVA 778
           F +V   V  +D + F    G +  L+G +GAGKTT  R +A
Sbjct: 11  FRDVKKTVQAVDGVSFTVKPGEVTGLLGPNGAGKTTTLRMLA 52


>gnl|CDD|225982 COG3451, VirB4, Type IV secretory pathway, VirB4 components
           [Intracellular trafficking and secretion].
          Length = 796

 Score = 32.0 bits (73), Expect = 3.2
 Identities = 14/63 (22%), Positives = 22/63 (34%), Gaps = 11/63 (17%)

Query: 21  EGNEEGGPWMEKVLQLYQISNLNHGL-----------MMVGPSGSGKSTAWKVLLKALER 69
            G   G PW E +  L                     +++GP+G+GK+     LL    +
Sbjct: 403 LGPFRGNPWGEPLTVLKTEDGSPFYFNFHVGEDVGHTLIIGPTGAGKTVLLSFLLAQALK 462

Query: 70  YEG 72
           Y  
Sbjct: 463 YGN 465


>gnl|CDD|219014 pfam06414, Zeta_toxin, Zeta toxin.  This family consists of several
           bacterial zeta toxin proteins. Zeta toxin is thought to
           be part of a postregulational killing system in
           bacteria. It relies on antitoxin/toxin systems that
           secure stable inheritance of low and medium copy number
           plasmids during cell division and kill cells that have
           lost the plasmid.
          Length = 191

 Score = 31.1 bits (71), Expect = 3.2
 Identities = 18/96 (18%), Positives = 30/96 (31%), Gaps = 27/96 (28%)

Query: 48  MVGPSGSGKSTAWKVLLKALERYEGVEGVAHII-D------PKAISKEALYG-----VLD 95
           + G  G+GK+   + LL+ L        V  I  D      P     +         +  
Sbjct: 17  LGGQPGAGKTELARALLEELGG----GNVVRIDPDELRTYHPDYDELQKADPKDASELTQ 72

Query: 96  PNTREWTDGLFTHILRRIIDNVRGEISKRQWIIFDG 131
           P+   W + L  + + R              II +G
Sbjct: 73  PDASRWVEKLIDYAIER-----------GYNIILEG 97



 Score = 30.3 bits (69), Expect = 5.9
 Identities = 11/31 (35%), Positives = 17/31 (54%)

Query: 419 LVLCGPPGSGKTMTLLSALRALPDMEVVSLN 449
           ++L G PG+GKT    + L  L    VV ++
Sbjct: 15  VLLGGQPGAGKTELARALLEELGGGNVVRID 45


>gnl|CDD|213227 cd03260, ABC_PstB_phosphate_transporter, ATP-binding cassette
          domain of the phosphate transport system.  Phosphate
          uptake is of fundamental importance in the cell
          physiology of bacteria because phosphate is required as
          a nutrient. The Pst system of E. coli comprises four
          distinct subunits encoded by the pstS, pstA, pstB, and
          pstC genes. The PstS protein is a phosphate-binding
          protein located in the periplasmic space. PstA and PstC
          are hydrophobic and they form the transmembrane portion
          of the Pst system. PstB is the catalytic subunit, which
          couples the energy of ATP hydrolysis to the import of
          phosphate across cellular membranes through the Pst
          system, often referred as ABC-protein. PstB belongs to
          one of the largest superfamilies of proteins
          characterized by a highly conserved adenosine
          triphosphate (ATP) binding cassette (ABC), which is
          also a nucleotide binding domain (NBD).
          Length = 227

 Score = 31.4 bits (72), Expect = 3.3
 Identities = 12/21 (57%), Positives = 14/21 (66%), Gaps = 4/21 (19%)

Query: 49 VGPSGSGKSTAWKVLLKALER 69
          +GPSG GKST    LL+ L R
Sbjct: 32 IGPSGCGKST----LLRLLNR 48



 Score = 30.2 bits (69), Expect = 6.5
 Identities = 15/45 (33%), Positives = 19/45 (42%), Gaps = 6/45 (13%)

Query: 740 DEVLDHV-LRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAFMNGL 783
              L  + L I            LIG SG GK+TL R +  +N L
Sbjct: 13  KHALKDISLDI-----PKGEITALIGPSGCGKSTLLRLLNRLNDL 52


>gnl|CDD|216246 pfam01017, STAT_alpha, STAT protein, all-alpha domain.  STAT proteins
            (Signal Transducers and Activators of Transcription) are
            a family of transcription factors that are specifically
            activated to regulate gene transcription when cells
            encounter cytokines and growth factors. STAT proteins
            also include an SH2 domain pfam00017.
          Length = 182

 Score = 31.1 bits (71), Expect = 3.4
 Identities = 17/123 (13%), Positives = 44/123 (35%), Gaps = 22/123 (17%)

Query: 1017 EKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQ 1076
            EK  E++ +   L   +    + ++Q+E++Q     + + LQS  E              
Sbjct: 2    EKQLEIDSKVEDLRNSVQDTEQDIKQLEDLQDEFDFRYKTLQSLIET-----------PA 50

Query: 1077 EAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEI--------K 1128
                 K   +  +  +++   ++ QKR  + +   +    +   +     +        K
Sbjct: 51   NGTSLKEHLKQEKLTLQQMLNKLDQKRKELADKHQET---LGLLEALQNALLDEELIEWK 107

Query: 1129 KQQ 1131
            ++Q
Sbjct: 108  RRQ 110


>gnl|CDD|132050 TIGR03005, ectoine_ehuA, ectoine/hydroxyectoine ABC transporter,
          ATP-binding protein.  Members of this family are the
          ATP-binding protein of a conserved four gene ABC
          transporter operon found next to ectoine unilization
          operons and ectoine biosynthesis operons. Ectoine is a
          compatible solute that protects enzymes from high
          osmolarity. It is released by some species in response
          to hypoosmotic shock, and it is taken up by a number of
          bacteria as a compatible solute or for consumption.
          This family shows strong sequence similiarity to a
          number of amino acid ABC transporter ATP-binding
          proteins.
          Length = 252

 Score = 31.3 bits (71), Expect = 3.4
 Identities = 12/29 (41%), Positives = 19/29 (65%), Gaps = 1/29 (3%)

Query: 44 HGLMMVGPSGSGKSTAWKVLLKALERYEG 72
            + ++GPSGSGKST  ++L+  LE  + 
Sbjct: 27 EKVALIGPSGSGKSTILRILM-TLEPIDE 54


>gnl|CDD|226967 COG4608, AppF, ABC-type oligopeptide transport system, ATPase
          component [Amino acid transport and metabolism].
          Length = 268

 Score = 31.4 bits (72), Expect = 3.5
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 48 MVGPSGSGKSTAWKVLLKALERYEG 72
          +VG SG GKST  +++L   E   G
Sbjct: 44 LVGESGCGKSTLGRLILGLEEPTSG 68


>gnl|CDD|224051 COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase
          component [Amino acid transport and metabolism].
          Length = 240

 Score = 31.3 bits (72), Expect = 3.5
 Identities = 13/23 (56%), Positives = 15/23 (65%), Gaps = 4/23 (17%)

Query: 49 VGPSGSGKSTAWKVLLKALERYE 71
          +GPSGSGKST    LL+ L   E
Sbjct: 34 IGPSGSGKST----LLRCLNGLE 52



 Score = 30.9 bits (71), Expect = 4.8
 Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 16/77 (20%)

Query: 741 EVLDHV-LRIDRIFRQPQGH-LLLIGVSGAGKTTLSRFVAFMNGLSVFQ---IRAHNKYT 795
           EVL  + L +++      G  +++IG SG+GK+TL R    +NGL       I    +  
Sbjct: 16  EVLKGISLSVEK------GEVVVIIGPSGSGKSTLLR---CLNGLEEPDSGSITVDGEDV 66

Query: 796 GADFDEDLRTVLRRSGC 812
           G    +D+  + R+ G 
Sbjct: 67  GD--KKDILKLRRKVGM 81


>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
            protein, PEP-CTERM locus subfamily.  Members of this
            protein family belong to the family of polysaccharide
            chain length determinant proteins (pfam02706). All are
            found in species that encode the PEP-CTERM/exosortase
            system predicted to act in protein sorting in a number of
            Gram-negative bacteria, and are found near the epsH
            homolog that is the putative exosortase gene [Cell
            envelope, Biosynthesis and degradation of surface
            polysaccharides and lipopolysaccharides].
          Length = 498

 Score = 31.6 bits (72), Expect = 3.7
 Identities = 21/123 (17%), Positives = 48/123 (39%), Gaps = 19/123 (15%)

Query: 1237 KVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSI--------------ASYKDEYAQ 1282
            ++E L  +L +L ++ ++       TK  I QLE+                      Y Q
Sbjct: 255  RIEALEKQLDALRLRYTDKHPDVIATKREIAQLEEQKEEEGSAKNGGPERGEIANPVYQQ 314

Query: 1283 LIAQATAIKTDLDNVQAKFYEYAQLIAQATAIKTDLDNVQAKVERSMA-LLKSLGIERER 1341
            L  +    + ++ +++A+  E    I +   +++ L  +   VE  +  L +   + +  
Sbjct: 315  LQIELAEAEAEIASLEARVAELTARIER---LESLLRTIPE-VEAELTQLNRDYEVNKSN 370

Query: 1342 WEA 1344
            +E 
Sbjct: 371  YEQ 373


>gnl|CDD|213224 cd03257, ABC_NikE_OppD_transporters, ATP-binding cassette domain
          of nickel/oligopeptides specific transporters.  The ABC
          transporter subfamily specific for the transport of
          dipeptides, oligopeptides (OppD), and nickel (NikDE).
          The NikABCDE system of E. coli belongs to this family
          and is composed of the periplasmic binding protein
          NikA, two integral membrane components (NikB and NikC),
          and two ATPase (NikD and NikE). The NikABCDE
          transporter is synthesized under anaerobic conditions
          to meet the increased demand for nickel resulting from
          hydrogenase synthesis. The molecular mechanism of
          nickel uptake in many bacteria and most archaea is not
          known. Many other members of this ABC family are also
          involved in the uptake of dipeptides and oligopeptides.
          The oligopeptide transport system (Opp) is a
          five-component ABC transport composed of a
          membrane-anchored substrate binding proteins (SRP),
          OppA, two transmembrane proteins, OppB and OppC, and
          two ATP-binding domains, OppD and OppF.
          Length = 228

 Score = 30.9 bits (71), Expect = 3.7
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 48 MVGPSGSGKSTAWKVLLKALERYEG 72
          +VG SGSGKST  + +L  L+   G
Sbjct: 36 LVGESGSGKSTLARAILGLLKPTSG 60


>gnl|CDD|213229 cd03262, ABC_HisP_GlnQ, ATP-binding cassette domain of the
          histidine and glutamine transporters.  HisP and GlnQ
          are the ATP-binding components of the bacterial
          periplasmic histidine and glutamine permeases,
          respectively. Histidine permease is a multi-subunit
          complex containing the HisQ and HisM integral membrane
          subunits and two copies of HisP. HisP has properties
          intermediate between those of integral and peripheral
          membrane proteins and is accessible from both sides of
          the membrane, presumably by its interaction with HisQ
          and HisM. The two HisP subunits form a homodimer within
          the complex. The domain structure of the amino acid
          uptake systems is typical for prokaryotic extracellular
          solute binding protein-dependent uptake systems. All of
          the amino acid uptake systems also have at least one,
          and in a few cases, two extracellular solute binding
          proteins located in the periplasm of Gram-negative
          bacteria, or attached to the cell membrane of
          Gram-positive bacteria. The best-studied member of the
          PAAT (polar amino acid transport) family is the HisJQMP
          system of S. typhimurium, where HisJ is the
          extracellular solute binding proteins and HisP is the
          ABC protein.
          Length = 213

 Score = 31.0 bits (71), Expect = 3.8
 Identities = 15/41 (36%), Positives = 23/41 (56%), Gaps = 4/41 (9%)

Query: 48 MVGPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKE 88
          ++GPSGSGKST    LL+ +   E  +    IID   ++ +
Sbjct: 31 IIGPSGSGKST----LLRCINLLEEPDSGTIIIDGLKLTDD 67


>gnl|CDD|224985 COG2074, COG2074, 2-phosphoglycerate kinase [Carbohydrate transport
           and metabolism].
          Length = 299

 Score = 31.2 bits (71), Expect = 3.9
 Identities = 25/101 (24%), Positives = 38/101 (37%), Gaps = 25/101 (24%)

Query: 711 VGTTELREYVQARLKVFYEEELDVQLVLFDEVLDHVLRIDRIFRQPQGHL--LLIGVSGA 768
           V   E+RE  Q  L+    E     L+             R  R+ +  L  L+ G SG 
Sbjct: 53  VTKDEIREVYQKLLEKGDPEVAKRYLLW------------RRIRKMKRPLIILIGGASGV 100

Query: 769 GKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDEDLRTVLRR 809
           GK+T++  +A   G+              D    +R VLR+
Sbjct: 101 GKSTIAGELARRLGIRSV--------ISTDS---IREVLRK 130


>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein.  This family includes FlxA from
            E. coli. The expression of FlxA is regulated by the FliA
            sigma factor, a transcription factor specific for class 3
            flagellar operons. However FlxA is not required for
            flagellar function or formation.
          Length = 106

 Score = 29.7 bits (67), Expect = 3.9
 Identities = 14/62 (22%), Positives = 35/62 (56%), Gaps = 3/62 (4%)

Query: 1042 QVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQ 1101
            Q++ +QK +   +++L+  + + +L  +E  K QQ  ++   Q Q +QA+I +   + ++
Sbjct: 20   QIKSLQKQIQSLTKQLKELSSSEDLTAEEKQKQQQLIQQ---QIQMLQAQIAQLQQQQSE 76

Query: 1102 KR 1103
            + 
Sbjct: 77   EA 78


>gnl|CDD|225363 COG2804, PulE, Type II secretory pathway, ATPase PulE/Tfp pilus
           assembly pathway, ATPase PilB [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 500

 Score = 31.5 bits (72), Expect = 3.9
 Identities = 23/92 (25%), Positives = 40/92 (43%), Gaps = 21/92 (22%)

Query: 735 QLVLFDEVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTL------SRFVAFMNGLSV--- 785
           +L +    L    R+ R+  +PQG +L+ G +G+GKTT              N +++   
Sbjct: 239 KLGMSPFQLA---RLLRLLNRPQGLILVTGPTGSGKTTTLYAALSELNTPERNIITIEDP 295

Query: 786 --------FQIRAHNKYTGADFDEDLRTVLRR 809
                    Q++ + K  G  F   LR +LR+
Sbjct: 296 VEYQLPGINQVQVNPK-IGLTFARALRAILRQ 326


>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
           trafficking and secretion].
          Length = 340

 Score = 31.5 bits (72), Expect = 3.9
 Identities = 21/67 (31%), Positives = 33/67 (49%), Gaps = 9/67 (13%)

Query: 713 TTELREYVQARLKVFYEEELDVQLVLFD---EVLDHVLRIDRIFRQPQGH----LLLIGV 765
             ELR+    + K+  EE   V+  L +   E+L  V ++D     P+      +L +GV
Sbjct: 90  IEELRKREGKKKKIKDEE--TVKEALREALIEILRPVDKVDLPLEIPKEKKPFVILFVGV 147

Query: 766 SGAGKTT 772
           +G GKTT
Sbjct: 148 NGVGKTT 154


>gnl|CDD|226447 COG3937, COG3937, Uncharacterized conserved protein [Function
            unknown].
          Length = 108

 Score = 29.7 bits (67), Expect = 4.0
 Identities = 27/104 (25%), Positives = 50/104 (48%), Gaps = 16/104 (15%)

Query: 1032 GLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAE 1091
            G G  AET E+V+++   L VK  EL +  EA    + ++++  +EA          Q E
Sbjct: 15   GAGLAAETAEKVQKLVDEL-VKKGEL-NAEEAKRF-VDDLLRQAKEA----------QGE 61

Query: 1092 IEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVEL 1135
            +E++   I +K   ++ DL     + MD      +  ++Q+ +L
Sbjct: 62   LEEK---IPRKIEEMLSDLEVARQSEMDELTERVDALERQVADL 102


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
            unknown. It should not be considered a DNA mismatch
            repair protein. It is likely a DNA mismatch binding
            protein of unknown cellular function [DNA metabolism,
            Other].
          Length = 771

 Score = 31.7 bits (72), Expect = 4.0
 Identities = 16/113 (14%), Positives = 49/113 (43%), Gaps = 8/113 (7%)

Query: 1041 EQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIE-KQTVEI 1099
            E++  + + L+   +EL+ KNE     LKE  K ++E E+   + ++ +   + +   E 
Sbjct: 511  EEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEA 570

Query: 1100 AQKRVFVMEDLAQV-------EPAVMDAQQAVKEIKKQQLVELRSMANPPSVV 1145
             +    + +++  +       +       ++++++ K +  + +    P +  
Sbjct: 571  QEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQ 623


>gnl|CDD|237421 PRK13539, PRK13539, cytochrome c biogenesis protein CcmA;
           Provisional.
          Length = 207

 Score = 31.0 bits (71), Expect = 4.2
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 755 QPQGHLLLIGVSGAGKTTLSRFVA 778
                L+L G +G+GKTTL R +A
Sbjct: 26  AAGEALVLTGPNGSGKTTLLRLIA 49


>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
           of proteins involved in ATP-dependent RNA or DNA
           unwinding. This domain contains the ATP-binding region.
          Length = 144

 Score = 30.4 bits (69), Expect = 4.3
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query: 419 LVLCGPPGSGKTMTLLSAL 437
           ++L  P GSGKT+  L  +
Sbjct: 3   VLLAAPTGSGKTLAALLPI 21


>gnl|CDD|234401 TIGR03925, T7SS_EccC_b, type VII secretion protein EccCb.  This
           model represents the C-terminal domain or EccCb subunit
           of the type VII secretion protein EccC as found in the
           Actinobacteria. Type VII secretion is defined more
           broadly as including secretion systems for ESAT-6-like
           proteins in the Firmicutes as well as in the
           Actinobacteria, but this family does not show close
           homologs in the Firmicutes [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 566

 Score = 31.5 bits (72), Expect = 4.4
 Identities = 12/53 (22%), Positives = 25/53 (47%), Gaps = 5/53 (9%)

Query: 46  LMMVGPSGSGKSTAWKVLLKAL-ERYEGVEGVAHIIDPKAISKEALYGVLDPN 97
           L++ G S SGK+T  + + + +  RY   +    ++D     + +L G +   
Sbjct: 366 LLVFGDSESGKTTLLRAIARGITRRYSPQQARLVLVDY----RRSLLGAVPEE 414


>gnl|CDD|234277 TIGR03604, TOMM_cyclo_SagD, thiazole/oxazole-forming peptide
           maturase, SagD family component.  Members of this
           protein family include enzymes related to SagD,
           previously referred to as a scaffold or docking protein
           involved in the biosynthesis of streptolysin S in
           Streptococcus pyogenes from the protoxin polypeptide
           (product of the sagA gene). Newer evidence describes an
           enzymatic activity, an ATP-dependent cyclodehydration
           reaction, previously ascribed to the SagC component.
           This protein family serves as a marker for widely
           distributed prokaryotic systems for making a general
           class of heterocycle-containing bacteriocins.
          Length = 377

 Score = 31.1 bits (71), Expect = 4.5
 Identities = 11/59 (18%), Positives = 25/59 (42%), Gaps = 12/59 (20%)

Query: 575 IYGTFSRAMLRLIPPLRGYADALTNAMVELYLASQEKFTQDMQPHYVYSPREMTRWVRG 633
           ++ +++    R + PL           + L+  S+E++ +   P   + P    +WV G
Sbjct: 33  VFASYAELGERALNPLS----------LGLH--SEEQYAEPGFPFRPFDPDTPIQWVPG 79


>gnl|CDD|213220 cd03253, ABCC_ATM1_transporter, ATP-binding cassette domain of
          iron-sulfur clusters transporter, subfamily C.  ATM1 is
          an ABC transporter that is expressed in the
          mitochondria. Although the specific function of ATM1 is
          unknown, its disruption results in the accumulation of
          excess mitochondrial iron, loss of mitochondrial
          cytochromes, oxidative damage to mitochondrial DNA, and
          decreased levels of cytosolic heme proteins. ABC
          transporters are a large family of proteins involved in
          the transport of a wide variety of different compounds,
          like sugars, ions, peptides, and more complex organic
          molecules. The nucleotide binding domain shows the
          highest similarity between all members of the family.
          ABC transporters are a subset of nucleotide hydrolases
          that contain a signature motif, Q-loop, and
          H-loop/switch region, in addition to, the Walker A
          motif/P-loop and Walker B motif commonly found in a
          number of ATP- and GTP-binding and hydrolyzing
          proteins.
          Length = 236

 Score = 31.0 bits (71), Expect = 4.5
 Identities = 11/17 (64%), Positives = 14/17 (82%)

Query: 49 VGPSGSGKSTAWKVLLK 65
          VGPSGSGKST  ++L +
Sbjct: 33 VGPSGSGKSTILRLLFR 49


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
            family consists of several Eukaryotic rRNA processing
            protein EBP2 sequences. Ebp2p is required for the
            maturation of 25S rRNA and 60S subunit assembly. Ebp2p
            may be one of the target proteins of Rrs1p for executing
            the signal to regulate ribosome biogenesis. This family
            also plays a role in chromosome segregation.
          Length = 270

 Score = 31.2 bits (71), Expect = 4.6
 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 10/66 (15%)

Query: 1037 AETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEK--RKVQSQDIQAEIE- 1093
            AE V+  E MQK   V+ ++L +  EAA  K  E  + Q+E +K  ++VQ + +Q   + 
Sbjct: 108  AEMVKSDEHMQK---VR-KKLLA--EAAAKKASEEARKQRELKKFGKQVQVEKLQERAKE 161

Query: 1094 -KQTVE 1098
             K  +E
Sbjct: 162  KKDMLE 167


>gnl|CDD|224045 COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport
          systems, ATPase components [Inorganic ion transport and
          metabolism / Coenzyme metabolism].
          Length = 258

 Score = 31.0 bits (71), Expect = 4.6
 Identities = 12/22 (54%), Positives = 14/22 (63%), Gaps = 4/22 (18%)

Query: 49 VGPSGSGKSTAWKVLLKALERY 70
          +GP+GSGKST    LLK L   
Sbjct: 34 LGPNGSGKST----LLKCLAGL 51


>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region.  This family
            represents an N-terminal conserved region found in
            several huntingtin-associated protein 1 (HAP1)
            homologues. HAP1 binds to huntingtin in a polyglutamine
            repeat-length-dependent manner. However, its possible
            role in the pathogenesis of Huntington's disease is
            unclear. This family also includes a similar N-terminal
            conserved region from hypothetical protein products of
            ALS2CR3 genes found in the human juvenile amyotrophic
            lateral sclerosis critical region 2q33-2q34.
          Length = 307

 Score = 31.3 bits (71), Expect = 4.6
 Identities = 29/147 (19%), Positives = 63/147 (42%), Gaps = 39/147 (26%)

Query: 1016 REKCSELEEQQLHL----------------------NVGLGKIAETVEQVEEMQKSLAVK 1053
            +EK   LEE+  HL                      N  + ++ E  +Q+  + + LA K
Sbjct: 167  QEKLKLLEEENEHLRSEASHLKTETVTYEEKEQQLVNDCVKQLREANDQIASLSEELAKK 226

Query: 1054 SQELQSKNEAANLKLKEMIKDQQE-----AEKRKVQ-----SQDIQAEIEKQTVEIAQKR 1103
            +++L+ + E     L +++  Q++      E  ++Q     ++D Q +++ +  E+  K 
Sbjct: 227  TEDLERQQEEITHLLSQIVDLQKKCKSYALENEELQQHLAAAKDAQRQLQAELQELQDKY 286

Query: 1104 VFVMEDLAQVEPAVMDAQQAVKEIKKQ 1130
               ME L        +AQ+ +K ++ +
Sbjct: 287  AECMEML-------HEAQEELKNLRNK 306


>gnl|CDD|213192 cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding
          cassette component of cobalt transport system.  Domain
          I of the ABC component of a cobalt transport family
          found in bacteria, archaea, and eukaryota. The
          transition metal cobalt is an essential component of
          many enzymes and must be transported into cells in
          appropriate amounts when needed. This ABC transport
          system of the CbiMNQO family is involved in cobalt
          transport in association with the cobalamin (vitamin
          B12) biosynthetic pathways. Most of cobalt (Cbi)
          transport systems possess a separate CbiN component,
          the cobalt-binding periplasmic protein, and they are
          encoded by the conserved gene cluster cbiMNQO. Both the
          CbiM and CbiQ proteins are integral cytoplasmic
          membrane proteins, and the CbiO protein has the linker
          peptide and the Walker A and B motifs commonly found in
          the ATPase components of the ABC-type transport
          systems.
          Length = 211

 Score = 30.5 bits (70), Expect = 4.8
 Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 7/35 (20%)

Query: 36 LYQIS-NLNHG--LMMVGPSGSGKSTAWKVLLKAL 67
          L  IS  +  G  +++VGP+GSGKST    LL+ L
Sbjct: 17 LDDISLTIKKGEFVLIVGPNGSGKST----LLRLL 47



 Score = 30.5 bits (70), Expect = 5.3
 Identities = 15/43 (34%), Positives = 24/43 (55%), Gaps = 9/43 (20%)

Query: 742 VLDHV-LRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVAFMNGL 783
            LD + L I     +    +L++G +G+GK+TL R    +NGL
Sbjct: 16  ALDDISLTI-----KKGEFVLIVGPNGSGKSTLLR---LLNGL 50


>gnl|CDD|215558 PLN03073, PLN03073, ABC transporter F family; Provisional.
          Length = 718

 Score = 31.4 bits (71), Expect = 4.8
 Identities = 20/53 (37%), Positives = 26/53 (49%), Gaps = 14/53 (26%)

Query: 26  GGPWMEKVLQLYQISNLNHG------LMMVGPSGSGKSTAWKVLLKALERYEG 72
           GGP + K        NLN G      + MVGP+G GKST  K++   L+   G
Sbjct: 520 GGPLLFK--------NLNFGIDLDSRIAMVGPNGIGKSTILKLISGELQPSSG 564


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This family
            includes several uncharacterized proteins from
            Caenorhabditis elegans.
          Length = 517

 Score = 31.2 bits (69), Expect = 4.9
 Identities = 30/118 (25%), Positives = 63/118 (53%), Gaps = 11/118 (9%)

Query: 1035 KIAETVEQVEEMQKSLAVKSQELQSKNEAANLKL-------KEMIKDQQEAEKRKVQSQD 1087
            ++ E  +QVE+++KS   K+Q+  +K+  A  K+       KE  + + EA+KR  Q ++
Sbjct: 59   ELTEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEE 118

Query: 1088 -IQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSV 1144
              Q + E++ ++  Q++     DL +++ A    ++AVK  K ++  + +  + P  V
Sbjct: 119  HKQWKAEQERIQKEQEK--KEADLKKLQ-AEKKKEKAVKAEKAEKAEKTKKASTPAPV 173


>gnl|CDD|233193 TIGR00929, VirB4_CagE, type IV secretion/conjugal transfer ATPase,
           VirB4 family.  Type IV secretion systems are found in
           Gram-negative pathogens. They export proteins, DNA, or
           complexes in different systems and are related to
           plasmid conjugation systems. This model represents
           related ATPases that include VirB4 in Agrobacterium
           tumefaciens (DNA export) CagE in Helicobacter pylori
           (protein export) and plasmid TraB (conjugation).
          Length = 785

 Score = 31.5 bits (72), Expect = 4.9
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query: 758 GHLLLIGVSGAGKTTLSRFVA 778
           GH L+ G +G+GKTTL  F+ 
Sbjct: 435 GHTLIFGPTGSGKTTLLNFLL 455


>gnl|CDD|172734 PRK14246, PRK14246, phosphate ABC transporter ATP-binding
          protein; Provisional.
          Length = 257

 Score = 30.8 bits (69), Expect = 4.9
 Identities = 14/24 (58%), Positives = 19/24 (79%)

Query: 48 MVGPSGSGKSTAWKVLLKALERYE 71
          ++GPSGSGKST  KVL + +E Y+
Sbjct: 41 IMGPSGSGKSTLLKVLNRLIEIYD 64


>gnl|CDD|223473 COG0396, sufC, Cysteine desulfurase activator ATPase
          [Posttranslational modification, protein turnover,
          chaperones].
          Length = 251

 Score = 30.6 bits (70), Expect = 5.0
 Identities = 14/32 (43%), Positives = 19/32 (59%), Gaps = 3/32 (9%)

Query: 44 HGLMMVGPSGSGKSTAWKVLLKALERYEGVEG 75
          H +M  GP+GSGKST    ++    +YE  EG
Sbjct: 33 HAIM--GPNGSGKSTLAYTIM-GHPKYEVTEG 61


>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein.  The Bacillus subtilis divIVA1
            mutation causes misplacement of the septum during cell
            division, resulting in the formation of small, circular,
            anucleate mini-cells. Inactivation of divIVA produces a
            mini-cell phenotype, whereas overproduction of DivIVA
            results in a filamentation phenotype. These proteins
            appear to contain coiled-coils.
          Length = 131

 Score = 29.9 bits (68), Expect = 5.1
 Identities = 24/97 (24%), Positives = 46/97 (47%), Gaps = 6/97 (6%)

Query: 1013 KLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMI 1072
             LY+E   EL+E+   L     K+ E  E  E +QK+L V  +  +     A  + + +I
Sbjct: 36   ALYKEN-EELKEEIERLE---EKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAELII 91

Query: 1073 KD-QQEAEKRKVQSQDIQAEIEKQTVEI-AQKRVFVM 1107
            K+ + +AE+    + +   ++  +  E+  Q R+F  
Sbjct: 92   KEAEAKAERIVNDANEEAKKLATEYDELKRQSRLFRT 128


>gnl|CDD|213196 cd03229, ABC_Class3, ATP-binding cassette domain of the binding
          protein-dependent transport systems.  This class is
          comprised of all BPD (Binding Protein Dependent)
          systems that are largely represented in archaea and
          eubacteria and are primarily involved in scavenging
          solutes from the environment. ABC transporters are a
          large family of proteins involved in the transport of a
          wide variety of different compounds, like sugars, ions,
          peptides, and more complex organic molecules. The
          nucleotide binding domain shows the highest similarity
          between all members of the family. ABC transporters are
          a subset of nucleotide hydrolases that contain a
          signature motif, Q-loop, and H-loop/switch region, in
          addition to, the Walker A motif/P-loop and Walker B
          motif commonly found in a number of ATP- and
          GTP-binding and hydrolyzing proteins.
          Length = 178

 Score = 30.2 bits (69), Expect = 5.1
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 49 VGPSGSGKSTAWKVLLKALERYEG 72
          +GPSGSGKST  + +    E   G
Sbjct: 32 LGPSGSGKSTLLRCIAGLEEPDSG 55


>gnl|CDD|237422 PRK13548, hmuV, hemin importer ATP-binding subunit; Provisional.
          Length = 258

 Score = 30.9 bits (71), Expect = 5.2
 Identities = 11/19 (57%), Positives = 15/19 (78%), Gaps = 4/19 (21%)

Query: 49 VGPSGSGKSTAWKVLLKAL 67
          +GP+G+GKST    LL+AL
Sbjct: 34 LGPNGAGKST----LLRAL 48


>gnl|CDD|178596 PLN03025, PLN03025, replication factor C subunit; Provisional.
          Length = 319

 Score = 30.9 bits (70), Expect = 5.5
 Identities = 11/22 (50%), Positives = 16/22 (72%)

Query: 419 LVLCGPPGSGKTMTLLSALRAL 440
           L+L GPPG+GKT ++L+    L
Sbjct: 37  LILSGPPGTGKTTSILALAHEL 58


>gnl|CDD|213216 cd03249, ABC_MTABC3_MDL1_MDL2, ATP-binding cassette domain of a
          mitochondrial protein MTABC3 and related proteins.
          MTABC3 (also known as ABCB6) is a mitochondrial
          ATP-binding cassette protein involved in iron
          homeostasis and one of four ABC transporters expressed
          in the mitochondrial inner membrane, the other three
          being MDL1(ABC7), MDL2, and ATM1. In fact, the yeast
          MDL1 (multidrug resistance-like protein 1) and MDL2
          (multidrug resistance-like protein 2) transporters are
          also included in this CD. MDL1 is an ATP-dependent
          permease that acts as a high-copy suppressor of ATM1
          and is thought to have a role in resistance to
          oxidative stress. Interestingly, subfamily B is more
          closely related to the carboxyl-terminal component of
          subfamily C than the two halves of ABCC molecules are
          with one another.
          Length = 238

 Score = 30.6 bits (70), Expect = 5.5
 Identities = 11/22 (50%), Positives = 14/22 (63%), Gaps = 4/22 (18%)

Query: 49 VGPSGSGKSTAWKVLLKALERY 70
          VG SG GKST    ++  LER+
Sbjct: 35 VGSSGCGKST----VVSLLERF 52


>gnl|CDD|236442 PRK09270, PRK09270, nucleoside triphosphate hydrolase
           domain-containing protein; Reviewed.
          Length = 229

 Score = 30.7 bits (70), Expect = 5.6
 Identities = 11/20 (55%), Positives = 14/20 (70%), Gaps = 1/20 (5%)

Query: 421 LCGPPGSGKTMTLLSALRAL 440
           + GPPG+GK+ TL   L AL
Sbjct: 38  IAGPPGAGKS-TLAEFLEAL 56


>gnl|CDD|116340 pfam07726, AAA_3, ATPase family associated with various cellular
           activities (AAA).  This Pfam entry includes some of the
           AAA proteins not detected by the pfam00004 model.
          Length = 131

 Score = 29.8 bits (68), Expect = 5.6
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 759 HLLLIGVSGAGKTTLSRFVA 778
           H+LL GV G  KT L+R +A
Sbjct: 1   HVLLEGVPGLAKTLLARTLA 20


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 548

 Score = 31.0 bits (70), Expect = 5.6
 Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 8/109 (7%)

Query: 1033 LGKIAETVEQ-VEEMQKSLAVKSQELQSKNEA-ANLKLKEMIKDQQEAEKRKVQSQDIQA 1090
            +G I    E  +E  ++S  + + + Q++ EA A   + E I  Q EA     +  +   
Sbjct: 324  VGLIERQKETELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQAEAAVETARETEEAE 383

Query: 1091 EIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMA 1139
              E+  +  A       E   Q +  +    +A K   + Q  E+++ A
Sbjct: 384  RAEQAALVAAA------EAAEQEQVEIAVRAEAAKAEAEAQAAEIKAEA 426


>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450).  This
            family of proteins are functionally uncharacterized. This
            protein is found in bacteria and eukaryotes. Proteins in
            this family are about 260 amino acids in length.
          Length = 250

 Score = 30.7 bits (70), Expect = 5.6
 Identities = 19/83 (22%), Positives = 38/83 (45%), Gaps = 8/83 (9%)

Query: 1036 IAETVEQV-EEMQKSL--AVKSQELQSKNEAANLKLKEMIKD-QQEAEKRKVQSQDIQAE 1091
             A +++Q     Q+++  A +SQ+   K      +L   I+   +E E  +V +  +Q  
Sbjct: 20   AAASLDQALAAAQQTVAAAAQSQKKIDKWADQTQELLAEIRQLLKEIENLRVYNDQLQRL 79

Query: 1092 IEKQTVEIAQKRVFVMEDLAQVE 1114
            +  Q  EIA     + + + Q+E
Sbjct: 80   VANQQQEIAS----LQQQIEQIE 98


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 31.2 bits (71), Expect = 5.7
 Identities = 24/118 (20%), Positives = 59/118 (50%), Gaps = 10/118 (8%)

Query: 1015 YREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKD 1074
             RE+  ELE +   L   + ++ E +E+ E++ +    + + L+ + E       E+I +
Sbjct: 473  DRERVEELEAELEDLEEEVEEVEERLERAEDLVE-AEDRIERLEERREDLE----ELIAE 527

Query: 1075 QQE-AEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQ 1131
            ++E  E+++ ++++++    +   E  +KR    E  A+ E    +A++ V E+  + 
Sbjct: 528  RRETIEEKRERAEELRERAAELEAEAEEKR----EAAAEAEEEAEEAREEVAELNSKL 581


>gnl|CDD|223521 COG0444, DppD, ABC-type dipeptide/oligopeptide/nickel transport
          system, ATPase component [Amino acid transport and
          metabolism / Inorganic ion transport and metabolism].
          Length = 316

 Score = 31.0 bits (71), Expect = 5.8
 Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 2/33 (6%)

Query: 42 LNHG--LMMVGPSGSGKSTAWKVLLKALERYEG 72
          L  G  L +VG SGSGKS   K ++  L +   
Sbjct: 28 LKKGEILGIVGESGSGKSVLAKAIMGLLPKPNA 60


>gnl|CDD|233878 TIGR02467, CbiE, precorrin-6y C5,15-methyltransferase
            (decarboxylating), CbiE subunit.  This model recognizes
            the CbiE methylase which is responsible, in part (along
            with CbiT), for methylating precorrin-6y (or
            cobalt-precorrin-6y) at both the 5 and 15 positions as
            well as the concomitant decarbozylation at C-12. In many
            organisms, this protein is fused to the CbiT subunit. The
            fused protein, when found in organisms catalyzing the
            oxidative version of the cobalamin biosynthesis pathway,
            is called CobL.
          Length = 204

 Score = 30.4 bits (69), Expect = 5.8
 Identities = 10/29 (34%), Positives = 13/29 (44%)

Query: 1388 HLIAAGIQFRPEIALTEYLSSPDERLRWQ 1416
             LI  GI    E+ + E L   DER+   
Sbjct: 158  ELIELGIGGSYELTVGENLGYEDERITEG 186


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
            model describes the DndB protein encoded by an operon
            associated with a sulfur-containing modification to DNA.
            The operon is sporadically distributed in bacteria, much
            like some restriction enzyme operons. DndD is described
            as a putative ATPase. The small number of examples known
            so far include species from among the Firmicutes,
            Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
            metabolism, Restriction/modification].
          Length = 650

 Score = 31.2 bits (71), Expect = 5.8
 Identities = 46/258 (17%), Positives = 84/258 (32%), Gaps = 56/258 (21%)

Query: 1034 GKIAETVEQVEEMQKSLAVKSQELQSKNE---------AANLKLKEMIKDQQEAEKRKVQ 1084
            G + E  EQ+E   K +    +  +++                L +  K Q + E++  Q
Sbjct: 258  GDLFEEREQLERQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQ 317

Query: 1085 SQDIQAEIEKQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRSMANPPSV 1144
            +Q  Q E+E++  E+ +     +  LA            VKEI  +       +A     
Sbjct: 318  NQLTQEELEERDKELLE----SLPKLALPAE-------HVKEIAAE-------LAEIDKP 359

Query: 1145 VKLALESICLLLGENATDWK-AIRAVVMRENFINSIVSNFNTEMIT--DEVREKMHSRYL 1201
                 E    L G   T  +  I+ V        S +     E+     EV +K+     
Sbjct: 360  ATTDSEIPHRLSGSELTQLEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKIS---- 415

Query: 1202 SNPDYSYEKANRASMACGPMVKWAIAQISYADMLKKVEPLRLELKSLEVQASENKAKGEE 1261
            + P                           A +L+++   + EL   E +  E   + E 
Sbjct: 416  TIPSEE----------------------QIAQLLEELGEAQNELFRSEAEIEELLRQLET 453

Query: 1262 TKDLITQLEKSIASYKDE 1279
             K+ I  L K++     +
Sbjct: 454  LKEAIEALRKTLDEKTKQ 471


>gnl|CDD|222194 pfam13521, AAA_28, AAA domain. 
          Length = 162

 Score = 29.8 bits (68), Expect = 5.9
 Identities = 12/21 (57%), Positives = 14/21 (66%), Gaps = 1/21 (4%)

Query: 420 VLCGPPGSGKTMTLLSALRAL 440
           V+ G P +GKT TLL AL A 
Sbjct: 3   VITGGPSTGKT-TLLEALAAR 22


>gnl|CDD|162242 TIGR01187, potA, spermidine/putrescine ABC transporter ATP-binding
           subunit.  This model describes spermidine/putrescine ABC
           transporter, ATP binding subunit in bacteria and its
           equivalents in archaea. This transport system belong to
           the larger ATP-Binding Cassette (ABC) transporter
           superfamily. The characteristic feature of these
           transporter is the obligatory coupling of ATP hydrolysis
           to substrate translocation. The minimal configuration of
           bacterial ABC transport system: an ATPase or ATP binding
           subunit; An integral membrane protein; a hydrophilic
           polypetpide, which likely functions as substrate binding
           protein. Polyamines like spermidine and putrescine play
           vital role in cell proliferation, differentiation, and
           ion homeostasis. The concentration of polyamines within
           the cell are regulated by biosynthesis, degradation and
           transport (uptake and efflux included) [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 325

 Score = 30.9 bits (70), Expect = 5.9
 Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 10/48 (20%)

Query: 762 LIGVSGAGKTTLSRFVA----------FMNGLSVFQIRAHNKYTGADF 799
           L+G SG GKTTL R +A           ++G  V  +  H ++    F
Sbjct: 1   LLGPSGCGKTTLLRLLAGFEQPDSGSIMLDGEDVTNVPPHLRHINMVF 48


>gnl|CDD|224042 COG1117, PstB, ABC-type phosphate transport system, ATPase
          component [Inorganic ion transport and metabolism].
          Length = 253

 Score = 30.6 bits (70), Expect = 6.0
 Identities = 12/22 (54%), Positives = 15/22 (68%), Gaps = 4/22 (18%)

Query: 48 MVGPSGSGKSTAWKVLLKALER 69
          ++GPSG GKST    LL+ L R
Sbjct: 38 LIGPSGCGKST----LLRCLNR 55


>gnl|CDD|213264 cd03297, ABC_ModC_molybdenum_transporter, ATP-binding cassette
           domain of the molybdenum transport system.  ModC is an
           ABC-type transporter and the ATPase component of a
           molybdate transport system that also includes the
           periplasmic binding protein ModA and the membrane
           protein ModB. ABC transporters are a large family of
           proteins involved in the transport of a wide variety of
           different compounds, like sugars, ions, peptides and
           more complex organic molecules. The nucleotide binding
           domain shows the highest similarity between all members
           of the family. ABC transporters are a subset of
           nucleotide hydrolases that contain a signature motif,
           Q-loop, and H-loop/switch region, in addition to, the
           Walker A motif/P-loop and Walker B motif commonly found
           in a number of ATP- and GTP-binding and hydrolyzing
           proteins.
          Length = 214

 Score = 30.3 bits (69), Expect = 6.2
 Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 2/38 (5%)

Query: 741 EVLDHVLRIDRIFRQPQGHLLLIGVSGAGKTTLSRFVA 778
            + D  L+ID  F   +    + G SGAGK+TL R +A
Sbjct: 9   RLPDFTLKID--FDLNEEVTGIFGASGAGKSTLLRCIA 44


>gnl|CDD|213222 cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of
          the transporters involved in export of lipoprotein and
          macrolide, and cell division protein.  This family is
          comprised of MJ0796 ATP-binding cassette,
          macrolide-specific ABC-type efflux carrier (MacAB), and
          proteins involved in cell division (FtsE), and release
          of lipoproteins from the cytoplasmic membrane (LolCDE).
          They are clustered together phylogenetically. MacAB is
          an exporter that confers resistance to macrolides,
          while the LolCDE system is not a transporter at all. An
          FtsE null mutants showed filamentous growth and
          appeared viable on high salt medium only, indicating a
          role for FtsE in cell division and/or salt transport.
          The LolCDE complex catalyzes the release of
          lipoproteins from the cytoplasmic membrane prior to
          their targeting to the outer membrane.
          Length = 218

 Score = 30.5 bits (70), Expect = 6.2
 Identities = 10/13 (76%), Positives = 12/13 (92%)

Query: 46 LMMVGPSGSGKST 58
          + +VGPSGSGKST
Sbjct: 33 VAIVGPSGSGKST 45



 Score = 29.8 bits (68), Expect = 8.7
 Identities = 12/35 (34%), Positives = 20/35 (57%), Gaps = 8/35 (22%)

Query: 741 EVLDHV-LRIDRIFRQPQG-HLLLIGVSGAGKTTL 773
           + L  V L I++      G  + ++G SG+GK+TL
Sbjct: 18  QALKGVSLSIEK------GEFVAIVGPSGSGKSTL 46


>gnl|CDD|179157 PRK00889, PRK00889, adenylylsulfate kinase; Provisional.
          Length = 175

 Score = 30.0 bits (68), Expect = 6.3
 Identities = 10/17 (58%), Positives = 14/17 (82%)

Query: 762 LIGVSGAGKTTLSRFVA 778
             G+SGAGKTT++R +A
Sbjct: 9   FTGLSGAGKTTIARALA 25


>gnl|CDD|224043 COG1118, CysA, ABC-type sulfate/molybdate transport systems, ATPase
           component [Inorganic ion transport and metabolism].
          Length = 345

 Score = 30.8 bits (70), Expect = 6.4
 Identities = 13/24 (54%), Positives = 17/24 (70%), Gaps = 1/24 (4%)

Query: 756 PQGHLL-LIGVSGAGKTTLSRFVA 778
             G L+ L+G SGAGK+TL R +A
Sbjct: 26  KSGELVALLGPSGAGKSTLLRIIA 49


>gnl|CDD|222036 pfam13304, AAA_21, AAA domain. 
          Length = 256

 Score = 30.5 bits (68), Expect = 6.5
 Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 1/33 (3%)

Query: 420 VLCGPPGSGKTMTLLSALRALPDMEVVSLNFSS 452
           V+ GP GSGK+  LL AL  L  +  + L    
Sbjct: 3   VIIGPNGSGKS-NLLKALALLLLLLSLGLTLDR 34


>gnl|CDD|234015 TIGR02788, VirB11, P-type DNA transfer ATPase VirB11.  The VirB11
           protein is found in the vir locus of Agrobacterium Ti
           plasmids where it is involved in the type IV secretion
           system for DNA transfer. VirB11 is believed to be an
           ATPase. VirB11 is a homolog of the P-like conjugation
           system TrbB protein and the Flp pilus sytem protein
           TadA.
          Length = 308

 Score = 30.8 bits (70), Expect = 6.7
 Identities = 15/46 (32%), Positives = 22/46 (47%)

Query: 399 TLDTVRHESLLYTWLAEHKPLVLCGPPGSGKTMTLLSALRALPDME 444
            LD    +  L   +A  K +++ G  GSGKT  L S +  +P  E
Sbjct: 127 LLDAGDIKEFLRLAIASRKNIIISGGTGSGKTTFLKSLVDEIPKDE 172


>gnl|CDD|225183 COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain
           an N-terminal double-glycine peptidase domain [Defense
           mechanisms].
          Length = 709

 Score = 31.0 bits (71), Expect = 6.7
 Identities = 13/24 (54%), Positives = 16/24 (66%)

Query: 49  VGPSGSGKSTAWKVLLKALERYEG 72
           VG SGSGKST  K+LL   +  +G
Sbjct: 505 VGRSGSGKSTLLKLLLGLYKPQQG 528


>gnl|CDD|218640 pfam05565, Sipho_Gp157, Siphovirus Gp157.  This family contains both
            viral and bacterial proteins which are related to the
            Gp157 protein of the Streptococcus thermophilus SFi
            bacteriophages. It is thought that bacteria possessing
            the gene coding for this protein have an increased
            resistance to the bacteriophage.
          Length = 162

 Score = 29.9 bits (68), Expect = 6.7
 Identities = 16/64 (25%), Positives = 26/64 (40%), Gaps = 2/64 (3%)

Query: 1233 DMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDE-YAQLIAQA-TAI 1290
            +   KVE +   +K+LE      KA+ +   +    +E  +   KD     + A     I
Sbjct: 37   EFEDKVENIAKVIKNLEADIEAIKAEIKRLAERKKSIENKVKRLKDYLEEAMEATGIKKI 96

Query: 1291 KTDL 1294
            KT L
Sbjct: 97   KTPL 100


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
            protein family have a signal peptide, a strongly
            conserved SH3 domain, a variable region, and then a
            C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 30.0 bits (68), Expect = 6.7
 Identities = 14/86 (16%), Positives = 42/86 (48%)

Query: 1016 REKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQ 1075
            RE+  EL+++   L   L ++ E + ++++  + L  +   L+++ E    +L  + +  
Sbjct: 65   RERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLS 124

Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQ 1101
              A +   ++++++ E+ +   E   
Sbjct: 125  ANAIELDEENRELREELAELKQENEA 150


>gnl|CDD|213202 cd03235, ABC_Metallic_Cations, ATP-binding cassette domain of the
          metal-type transporters.  This family includes
          transporters involved in the uptake of various metallic
          cations such as iron, manganese, and zinc. The ATPases
          of this group of transporters are very similar to
          members of iron-siderophore uptake family suggesting
          that they share a common ancestor. The best
          characterized metal-type ABC transporters are the
          YfeABCD system of Y. pestis, the SitABCD system of
          Salmonella enterica serovar Typhimurium, and the
          SitABCD transporter of Shigella flexneri. Moreover
          other uncharacterized homologs of these metal-type
          transporters are mainly found in pathogens like
          Haemophilus or enteroinvasive E. coli isolates.
          Length = 213

 Score = 30.2 bits (69), Expect = 6.7
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 49 VGPSGSGKSTAWKVLLKALERYEG 72
          VGP+G+GKST  K +L  L+   G
Sbjct: 31 VGPNGAGKSTLLKAILGLLKPTSG 54


>gnl|CDD|223873 COG0802, COG0802, Predicted ATPase or kinase [General function
           prediction only].
          Length = 149

 Score = 29.5 bits (67), Expect = 6.8
 Identities = 12/19 (63%), Positives = 14/19 (73%)

Query: 760 LLLIGVSGAGKTTLSRFVA 778
           +LL G  GAGKTTL R +A
Sbjct: 28  VLLSGDLGAGKTTLVRGIA 46


>gnl|CDD|226969 COG4618, ArpD, ABC-type protease/lipase transport system, ATPase
           and permease components [General function prediction
           only].
          Length = 580

 Score = 30.7 bits (70), Expect = 6.9
 Identities = 11/20 (55%), Positives = 16/20 (80%)

Query: 45  GLMMVGPSGSGKSTAWKVLL 64
            L ++GPSGSGKST  ++L+
Sbjct: 364 ALGIIGPSGSGKSTLARLLV 383


>gnl|CDD|226359 COG3839, MalK, ABC-type sugar transport systems, ATPase
          components [Carbohydrate transport and metabolism].
          Length = 338

 Score = 30.7 bits (70), Expect = 7.0
 Identities = 10/20 (50%), Positives = 14/20 (70%), Gaps = 4/20 (20%)

Query: 48 MVGPSGSGKSTAWKVLLKAL 67
          ++GPSG GKST    LL+ +
Sbjct: 34 LLGPSGCGKST----LLRMI 49


>gnl|CDD|213180 cd03213, ABCG_EPDR, Eye pigment and drug resistance transporter
           subfamily G of the ATP-binding cassette superfamily.
           ABCG transporters are involved in eye pigment (EP)
           precursor transport, regulation of lipid-trafficking
           mechanisms, and pleiotropic drug resistance (DR). DR is
           a well-described phenomenon occurring in fungi and
           shares several similarities with processes in bacteria
           and higher eukaryotes. Compared to other members of the
           ABC transporter subfamilies, the ABCG transporter family
           is composed of proteins that have an ATP-binding
           cassette domain at the N-terminus and a TM
           (transmembrane) domain at the C-terminus.
          Length = 194

 Score = 30.2 bits (69), Expect = 7.0
 Identities = 12/22 (54%), Positives = 14/22 (63%), Gaps = 1/22 (4%)

Query: 758 GHLLLI-GVSGAGKTTLSRFVA 778
           G L  I G SGAGK+TL   +A
Sbjct: 35  GELTAIMGPSGAGKSTLLNALA 56


>gnl|CDD|188099 TIGR00972, 3a0107s01c2, phosphate ABC transporter, ATP-binding
          protein.  This model represents the ATP-binding protein
          of a family of ABC transporters for inorganic
          phosphate. In the model species Escherichia coli, a
          constitutive transporter for inorganic phosphate, with
          low affinity, is also present. The high affinity
          transporter that includes this polypeptide is induced
          when extracellular phosphate concentrations are low.
          The proteins most similar to the members of this family
          but not included appear to be amino acid transporters
          [Transport and binding proteins, Anions].
          Length = 247

 Score = 30.3 bits (69), Expect = 7.1
 Identities = 15/30 (50%), Positives = 20/30 (66%), Gaps = 5/30 (16%)

Query: 48 MVGPSGSGKSTAWKVLLKALER-YEGVEGV 76
          ++GPSG GKST    LL++L R  + V GV
Sbjct: 32 LIGPSGCGKST----LLRSLNRMNDLVPGV 57


>gnl|CDD|226646 COG4178, COG4178, ABC-type uncharacterized transport system,
           permease and ATPase components [General function
           prediction only].
          Length = 604

 Score = 30.7 bits (70), Expect = 7.2
 Identities = 13/24 (54%), Positives = 17/24 (70%)

Query: 755 QPQGHLLLIGVSGAGKTTLSRFVA 778
           +P   LL+ G SGAGKT+L R +A
Sbjct: 417 RPGERLLITGESGAGKTSLLRALA 440


>gnl|CDD|129690 TIGR00602, rad24, checkpoint protein rad24.  All proteins in this
           family for which functions are known are involved in DNA
           damage tolerance (likely cell cycle checkpoints).This
           family is based on the phylogenomic analysis of JA Eisen
           (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 637

 Score = 30.7 bits (69), Expect = 7.2
 Identities = 23/63 (36%), Positives = 33/63 (52%), Gaps = 5/63 (7%)

Query: 6   EKIKEVCREEFLVCGEGNEEGGPWME-KVLQLYQISNLNHGLMMVGPSGSGKSTAWKVLL 64
           EK K   + E  V  +  EE   W++ +VL+    +     L++ GPSG GKST  K+L 
Sbjct: 76  EKYKPETQHELAVHKKKIEEVETWLKAQVLE----NAPKRILLITGPSGCGKSTTIKILS 131

Query: 65  KAL 67
           K L
Sbjct: 132 KEL 134


>gnl|CDD|233327 TIGR01241, FtsH_fam, ATP-dependent metalloprotease FtsH.
           HflB(FtsH) is a pleiotropic protein required for correct
           cell division in bacteria. It has ATP-dependent zinc
           metalloprotease activity. It was formerly designated
           cell division protein FtsH [Cellular processes, Cell
           division, Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 495

 Score = 30.7 bits (70), Expect = 7.3
 Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 7/48 (14%)

Query: 754 RQPQGHLLLIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNKYTGADFDE 801
           + P+G +LL+G  G GKT L++ VA   G+  F I      +G+DF E
Sbjct: 86  KIPKG-VLLVGPPGTGKTLLAKAVAGEAGVPFFSI------SGSDFVE 126


>gnl|CDD|203260 pfam05496, RuvB_N, Holliday junction DNA helicase ruvB N-terminus. 
           The RuvB protein makes up part of the RuvABC revolvasome
           which catalyzes the resolution of Holliday junctions
           that arise during genetic recombination and DNA repair.
           Branch migration is catalyzed by the RuvB protein that
           is targeted to the Holliday junction by the structure
           specific RuvA protein. This family contains the
           N-terminal region of the protein.
          Length = 231

 Score = 30.1 bits (69), Expect = 7.4
 Identities = 23/71 (32%), Positives = 29/71 (40%), Gaps = 30/71 (42%)

Query: 716 LREYV-QARLKVFYEEELDVQLVLF-------DEVLDHVLRIDRIFRQPQGHLLLIGVSG 767
           L EY+ Q ++K    E L +    F        E LDHVL              L G  G
Sbjct: 23  LDEYIGQEKVK----ENLKI----FIEAAKKRGEALDHVL--------------LYGPPG 60

Query: 768 AGKTTLSRFVA 778
            GKTTL+  +A
Sbjct: 61  LGKTTLANIIA 71


>gnl|CDD|181906 PRK09493, glnQ, glutamine ABC transporter ATP-binding protein;
          Reviewed.
          Length = 240

 Score = 30.1 bits (68), Expect = 7.6
 Identities = 21/53 (39%), Positives = 28/53 (52%), Gaps = 16/53 (30%)

Query: 48 MVGPSGSGKSTAWKVLLK---ALERYEG----VEGVAHIIDPKA----ISKEA 89
          ++GPSGSGKST    LL+    LE        V+G+  + DPK     I +EA
Sbjct: 32 IIGPSGSGKST----LLRCINKLEEITSGDLIVDGL-KVNDPKVDERLIRQEA 79


>gnl|CDD|213199 cd03232, ABCG_PDR_domain2, Second domain of the pleiotropic drug
           resistance-like (PDR) subfamily G of ATP-binding
           cassette transporters.  The pleiotropic drug resistance
           (PDR) is a well-described phenomenon occurring in fungi
           and shares several similarities with processes in
           bacteria and higher eukaryotes. This PDR subfamily
           represents domain I of its (ABC-IM)2 organization. ABC
           transporters are a large family of proteins involved in
           the transport of a wide variety of different compounds
           including sugars, ions, peptides, and more complex
           organic molecules. The nucleotide binding domain shows
           the highest similarity between all members of the
           family. ABC transporters are a subset of nucleotide
           hydrolases that contain a signature motif, Q-loop, and
           H-loop/switch region, in addition to, the Walker A
           motif/P-loop and Walker B motif commonly found in a
           number of ATP- and GTP-binding and hydrolyzing proteins.
          Length = 192

 Score = 29.9 bits (68), Expect = 7.7
 Identities = 10/12 (83%), Positives = 11/12 (91%)

Query: 762 LIGVSGAGKTTL 773
           L+G SGAGKTTL
Sbjct: 38  LMGESGAGKTTL 49


>gnl|CDD|223203 COG0125, Tmk, Thymidylate kinase [Nucleotide transport and
           metabolism].
          Length = 208

 Score = 30.0 bits (68), Expect = 7.8
 Identities = 40/244 (16%), Positives = 76/244 (31%), Gaps = 86/244 (35%)

Query: 50  GPSGSGKSTAWKVLLKALERYEGVEGVAHIIDPKAISKEALYGVLDPNTREWTDGLFTHI 109
           G  G+GK+T  ++L + LE   G++ V                     TRE         
Sbjct: 10  GIDGAGKTTQAELLKERLEE-RGIKVVL--------------------TREPGGTPIGEK 48

Query: 110 LRRIIDNVRGEISKR-------------------------QWIIFD-------------G 131
           +R ++ N   ++S +                         + +I D              
Sbjct: 49  IRELLLNGEEKLSPKAEALLFAADRAQHLEEVIKPALKEGKVVICDRYVDSSLAYQGGGR 108

Query: 132 DVDPEWVENLNSVLDDNKLLTLPNGERLSLPPNIRIMFEVQDLKYATLATVSRCGMIWFS 191
            +D +WV  LN                  L P++ +  +V   + A L  + + G     
Sbjct: 109 GLDLDWVLALNEFA------------PGGLKPDLTLYLDV-PPEVA-LERIRKRGE---L 151

Query: 192 EDVLSTEMIFENYLSRLRNIALDDIDDDSSLLITVDATGKAPDDVLSPALTLQQDVASIL 251
            D    E     +L ++R   L+        +I +DA+    ++V        +++  IL
Sbjct: 152 RDRFEKEDD--EFLEKVREGYLELAAKFPERIIVIDASRP-LEEV-------HEEILKIL 201

Query: 252 STHF 255
               
Sbjct: 202 KERL 205


>gnl|CDD|215414 PLN02772, PLN02772, guanylate kinase.
          Length = 398

 Score = 30.6 bits (69), Expect = 7.8
 Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 7/41 (17%)

Query: 417 KPLVLCGPPGSGKTMTLLSAL-RALPDMEVVSLNFS-SATT 455
           KP+V+ GP G GK  TL+S L +  P M      FS S TT
Sbjct: 136 KPIVISGPSGVGKG-TLISMLMKEFPSM----FGFSVSHTT 171


>gnl|CDD|132302 TIGR03258, PhnT, 2-aminoethylphosphonate ABC transport system,
           ATP-binding component PhnT.  This ATP-binding component
           of an ABC transport system is found in Salmonella and
           Burkholderia lineages in the vicinity of enzymes for the
           breakdown of 2-aminoethylphosphonate.
          Length = 362

 Score = 30.3 bits (68), Expect = 7.8
 Identities = 13/19 (68%), Positives = 14/19 (73%)

Query: 760 LLLIGVSGAGKTTLSRFVA 778
           L LIG SG GKTTL R +A
Sbjct: 34  LALIGKSGCGKTTLLRAIA 52


>gnl|CDD|179118 PRK00771, PRK00771, signal recognition particle protein Srp54;
           Provisional.
          Length = 437

 Score = 30.6 bits (70), Expect = 7.8
 Identities = 25/73 (34%), Positives = 38/73 (52%), Gaps = 19/73 (26%)

Query: 709 VPVGTTELREYVQARLKVFYEEELDVQLV-LFDEVLDHVLRIDRIFRQPQGHLLLIGVSG 767
            P G T  RE+V   +K+ YEE     LV L  E  +       +  +PQ  ++L+G+ G
Sbjct: 60  PPKGLTP-REHV---IKIVYEE-----LVKLLGEETE----PLVLPLKPQ-TIMLVGLQG 105

Query: 768 AGKTT----LSRF 776
           +GKTT    L+R+
Sbjct: 106 SGKTTTAAKLARY 118


>gnl|CDD|129124 TIGR00012, L29, ribosomal protein L29.  This model describes a
            ribosomal large subunit protein, called L29 in
            prokaryotic (50S) large subunits and L35 in eukaryotic
            (60S) large subunits [Protein synthesis, Ribosomal
            proteins: synthesis and modification].
          Length = 55

 Score = 27.6 bits (62), Expect = 8.0
 Identities = 12/49 (24%), Positives = 22/49 (44%), Gaps = 3/49 (6%)

Query: 1226 IAQISYADMLKKVEPLRLELKSLEVQASENKAKGEETKDLITQLEKSIA 1274
            + + S  ++ KK++ L+ EL  L  Q +            I Q+ + IA
Sbjct: 2    LREKSKEELAKKLDELKKELFELRFQKA---TGQLAKPHRIRQVRRDIA 47


>gnl|CDD|224041 COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport
          system, ATPase component [Inorganic ion transport and
          metabolism].
          Length = 248

 Score = 30.2 bits (69), Expect = 8.0
 Identities = 10/22 (45%), Positives = 15/22 (68%), Gaps = 4/22 (18%)

Query: 46 LMMVGPSGSGKSTAWKVLLKAL 67
          + ++GPSG GKST    LL+ +
Sbjct: 32 VAILGPSGCGKST----LLRLI 49


>gnl|CDD|184430 PRK13975, PRK13975, thymidylate kinase; Provisional.
          Length = 196

 Score = 29.7 bits (67), Expect = 8.0
 Identities = 16/58 (27%), Positives = 27/58 (46%), Gaps = 7/58 (12%)

Query: 764 GVSGAGKTTLSRFVA-FMNGLSVFQIRAHNKYTGADFDEDLRTVLRRSGCKNEKIAFL 820
           G+ G+GKTT ++ +A  +N     +       T     + +R +L  S C  E +A L
Sbjct: 9   GIDGSGKTTQAKLLAEKLNAFWTCEP------TDGKIGKLIREILSGSKCDKETLALL 60


>gnl|CDD|224058 COG1135, AbcC, ABC-type metal ion transport system, ATPase
           component [Inorganic ion transport and metabolism].
          Length = 339

 Score = 30.3 bits (69), Expect = 8.1
 Identities = 12/23 (52%), Positives = 15/23 (65%), Gaps = 3/23 (13%)

Query: 762 LIGVSGAGKTTLSRFVAFMNGLS 784
           +IG SGAGK+TL R +   N L 
Sbjct: 37  IIGYSGAGKSTLLRLI---NLLE 56


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
            (cytomatrix at the active zone) complex which is involved
            in determining the site of synaptic vesicle fusion. The
            C-terminus is a PDZ-binding motif that binds directly to
            RIM (a small G protein Rab-3A effector). The family also
            contains four coiled-coil domains.
          Length = 774

 Score = 30.8 bits (69), Expect = 8.1
 Identities = 23/129 (17%), Positives = 57/129 (44%), Gaps = 14/129 (10%)

Query: 1012 VKLYREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEM 1071
            + L R+   +L+ +   L   L    E ++++ EM +S  + ++  +  +        E 
Sbjct: 130  LFLLRDTLEKLQLRMERLQQTLDARDEEIKKLLEMLQSKGLSAKAEEEDS--------ER 181

Query: 1072 IKDQQEAEKRKVQSQDIQAEIEKQTVEIAQK--RVFVMEDLAQVEPAVMDAQQAVKEIKK 1129
            +   +EAE ++   + +  + EK+ + + ++  +   ME     + A  +A Q + + K 
Sbjct: 182  LGRIREAEDQQSHLEVLLEQKEKEHMMLREEIHQKLQMER----DDAKTEASQKLIDEKD 237

Query: 1130 QQLVELRSM 1138
             ++ E   M
Sbjct: 238  TKIKEFEKM 246


>gnl|CDD|213260 cd03293, ABC_NrtD_SsuB_transporters, ATP-binding cassette domain
          of the nitrate and sulfonate transporters.  NrtD and
          SsuB are the ATP-binding subunits of the bacterial
          ABC-type nitrate and sulfonate transport systems,
          respectively. ABC transporters are a large family of
          proteins involved in the transport of a wide variety of
          different compounds, like sugars, ions, peptides, and
          more complex organic molecules. The nucleotide binding
          domain shows the highest similarity between all members
          of the family. ABC transporters are a subset of
          nucleotide hydrolases that contain a signature motif,
          Q-loop, and H-loop/switch region, in addition to, the
          Walker A motif/P-loop and Walker B motif commonly found
          in a number of ATP- and GTP-binding and hydrolyzing
          proteins.
          Length = 220

 Score = 30.1 bits (69), Expect = 8.2
 Identities = 11/22 (50%), Positives = 15/22 (68%), Gaps = 4/22 (18%)

Query: 46 LMMVGPSGSGKSTAWKVLLKAL 67
          + +VGPSG GKST    LL+ +
Sbjct: 33 VALVGPSGCGKST----LLRII 50


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
            functions are known are involvedin recombination,
            recombinational repair, and/or non-homologous end
            joining.They are components of an exonuclease complex
            with MRE11 homologs. This family is distantly related to
            the SbcC family of bacterial proteins.This family is
            based on the phylogenomic analysis of JA Eisen (1999,
            Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 30.8 bits (69), Expect = 8.2
 Identities = 25/140 (17%), Positives = 60/140 (42%), Gaps = 9/140 (6%)

Query: 1003 HYLDFINHFVKLYREKCSELEEQQLHLNVGLGK-------IAETVEQVEEMQKSLAVKSQ 1055
            H LD +   ++L R+   + +EQ  HL     +       I   +++ ++ ++ L   S 
Sbjct: 836  HELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELST 895

Query: 1056 ELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVM-EDLAQVE 1114
            E+QS         ++    +   EK + + +++ +  E    + AQ +V  + E +  + 
Sbjct: 896  EVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSN-KKAQDKVNDIKEKVKNIH 954

Query: 1115 PAVMDAQQAVKEIKKQQLVE 1134
              + D +  +++ K   L +
Sbjct: 955  GYMKDIENKIQDGKDDYLKQ 974


>gnl|CDD|213213 cd03246, ABCC_Protease_Secretion, ATP-binding cassette domain of
          PrtD, subfamily C.  This family represents the ABC
          component of the protease secretion system PrtD, a
          60-kDa integral membrane protein sharing 37% identity
          with HlyB, the ABC component of the alpha-hemolysin
          secretion pathway, in the C-terminal domain. They
          export degradative enzymes by using a type I protein
          secretion system and lack an N-terminal signal peptide,
          but contain a C-terminal secretion signal. The Type I
          secretion apparatus is made up of three components, an
          ABC transporter, a membrane fusion protein (MFP), and
          an outer membrane protein (OMP). For the HlyA
          transporter complex, HlyB (ABC transporter) and HlyD
          (MFP) reside in the inner membrane of E. coli. The OMP
          component is TolC, which is thought to interact with
          the MFP to form a continuous channel across the
          periplasm from the cytoplasm to the exterior. HlyB
          belongs to the family of ABC transporters, which are
          ubiquitous, ATP-dependent transmembrane pumps or
          channels. The spectrum of transport substrates ranges
          from inorganic ions, nutrients such as amino acids,
          sugars, or peptides, hydrophobic drugs, to large
          polypeptides, such as HlyA.
          Length = 173

 Score = 29.5 bits (67), Expect = 8.2
 Identities = 10/16 (62%), Positives = 14/16 (87%)

Query: 49 VGPSGSGKSTAWKVLL 64
          +GPSGSGKST  +++L
Sbjct: 34 IGPSGSGKSTLARLIL 49


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
            unknown].
          Length = 984

 Score = 30.6 bits (69), Expect = 8.3
 Identities = 21/88 (23%), Positives = 45/88 (51%), Gaps = 6/88 (6%)

Query: 1020 SELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQE---LQSKNEAANLKLK-EMIKDQ 1075
             EL+E++L L        +  E+VEE+   +A  S++   L+     A L+ + E +K+ 
Sbjct: 769  RELKEEELALLEEAIDALD--EEVEELHAQVAALSRQIAQLEGGGTVAELRQRRESLKED 826

Query: 1076 QEAEKRKVQSQDIQAEIEKQTVEIAQKR 1103
             E + RK  S  +  ++ ++ + + ++R
Sbjct: 827  LEEKARKWASLRLAVQVLEEALRLFKER 854


>gnl|CDD|237870 PRK14964, PRK14964, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 491

 Score = 30.5 bits (69), Expect = 8.3
 Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 8/42 (19%)

Query: 741 EVLDHVLR----IDRIFRQPQGHLLLIGVSGAGKTTLSRFVA 778
           +VL  +LR    +++I   PQ  +LL+G SG GKTT +R ++
Sbjct: 19  DVLVRILRNAFTLNKI---PQS-ILLVGASGVGKTTCARIIS 56


>gnl|CDD|238260 cd00464, SK, Shikimate kinase (SK) is the fifth enzyme in the
           shikimate pathway, a seven-step biosynthetic pathway
           which converts erythrose-4-phosphate to chorismic acid,
           found in bacteria, fungi and plants. Chorismic acid is a
           important intermediate in the synthesis of aromatic
           compounds, such as aromatic amino acids, p-aminobenzoic
           acid, folate and ubiquinone. Shikimate kinase catalyses
           the phosphorylation of the 3-hydroxyl group of shikimic
           acid using ATP.
          Length = 154

 Score = 29.4 bits (67), Expect = 8.4
 Identities = 11/17 (64%), Positives = 14/17 (82%)

Query: 762 LIGVSGAGKTTLSRFVA 778
           LIG+ GAGKTT+ R +A
Sbjct: 4   LIGMMGAGKTTVGRLLA 20


>gnl|CDD|172735 PRK14247, PRK14247, phosphate ABC transporter ATP-binding
          protein; Provisional.
          Length = 250

 Score = 30.3 bits (68), Expect = 8.4
 Identities = 12/23 (52%), Positives = 17/23 (73%)

Query: 48 MVGPSGSGKSTAWKVLLKALERY 70
          ++GPSGSGKST  +V  + +E Y
Sbjct: 34 LMGPSGSGKSTLLRVFNRLIELY 56


>gnl|CDD|107346 cd06351, PBP1_iGluR_N_LIVBP_like, N-terminal
           leucine/isoleucine/valine-binding protein (LIVBP)-like
           domain of the NMDA, AMPA, and kainate receptor subtypes
           of ionotropic glutamate receptors (iGluRs).  N-terminal
           leucine/isoleucine/valine-binding protein (LIVBP)-like
           domain of the NMDA, AMPA, and kainate receptor subtypes
           of ionotropic glutamate receptors (iGluRs). While this
           N-terminal domain belongs to the periplasmic-binding
           fold type I superfamily, the glutamate-binding domain of
           the iGluR is structurally homologous to the
           periplasmic-binding fold type II. The LIVBP-like domain
           of iGluRs is thought to play a role in the initial
           assembly of iGluR subunits, but it is not well
           understood how this domain is arranged and functions in
           intact iGluR. Glutamate mediates the majority of
           excitatory synaptic transmission in the central nervous
           system via two broad classes of ionotropic receptors
           characterized by their response to glutamate agonists:
           N-methyl-aspartate (NMDA) and non-NMDA receptors. NMDA
           receptors have intrinsically slow kinetics, are highly
           permeable to Ca2+, and are blocked by extracellular Mg2+
           in a voltage-dependent manner. On the other hand,
           non-NMDA receptors have faster kinetics, are weakly
           permeable to Ca2+, and are not blocked by extracellular
           Mg2+. While non-NMDA receptors typically mediate
           excitatory synaptic responses at resting membrane
           potentials, NMDA receptors contribute to several forms
           of synaptic plasticity and are suggested to play an
           important role in the development of synaptic pathways.
          Length = 328

 Score = 30.2 bits (68), Expect = 8.5
 Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 3/48 (6%)

Query: 558 HVPVIYVDYPGETSLKQIYGTFSRAMLRLIPPLRGYADALTNAMVELY 605
            +P I +    E    +   + +   L+L P L   ADAL + +    
Sbjct: 87  EIPHISISGGSEGLSDKEESSTT---LQLYPSLEDLADALLDLLEYYN 131


>gnl|CDD|130309 TIGR01242, 26Sp45, 26S proteasome subunit P45 family.  Many
           proteins may score above the trusted cutoff because an
           internal.
          Length = 364

 Score = 30.5 bits (69), Expect = 8.5
 Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 8/66 (12%)

Query: 720 VQARLKVFYEE--ELDVQLVLFDEVLDHVLRIDRIFRQ-----PQGHLLLIGVSGAGKTT 772
           V+ R  V YE+   L+ Q+    E ++  L+   +F +     P+G +LL G  G GKT 
Sbjct: 113 VEERPNVSYEDIGGLEEQIREIREAVELPLKHPELFEEVGIEPPKG-VLLYGPPGTGKTL 171

Query: 773 LSRFVA 778
           L++ VA
Sbjct: 172 LAKAVA 177


>gnl|CDD|221816 pfam12866, DUF3823, Protein of unknown function (DUF3823).  This is
           a family of uncharacterized proteins from Bacteroidetes.
           It has characteristic DN and DR sequence-motifs. The
           function is not known.
          Length = 220

 Score = 29.9 bits (68), Expect = 8.6
 Identities = 22/68 (32%), Positives = 26/68 (38%), Gaps = 14/68 (20%)

Query: 330 FAGDGKLKMRSDFGNFLRSVTTITLPATSSDIVDFEVN----IKNGEWVPWSNKVPQIEV 385
           F GD KL +    G +   V TI +    +   DFEV     IKN            I  
Sbjct: 75  FDGDYKLVLVRGNGPWEYPVDTIDVEVKGNTTQDFEVTPYYRIKN----------ASISK 124

Query: 386 ETQKVAAS 393
           E  KV AS
Sbjct: 125 EGNKVTAS 132


>gnl|CDD|173809 cd07483, Peptidases_S8_Subtilisin_Novo-like, Peptidase S8 family
            domain in Subtilisin_Novo-like proteins.  Subtilisins are
            a group of alkaline proteinases originating from
            different strains of Bacillus subtilis.  Novo is one of
            the strains that produced enzymes belonging to this
            group.  The enzymes obtained from the Novo and BPN'
            strains are identical.  The Carlsburg and Novo
            subtilisins are thought to have arisen from a common
            ancestral protein.  They have similar peptidase and
            esterase activities, pH profiles, catalyze
            transesterification reactions, and are both inhibited by
            diispropyl fluorophosphate, though they differ in 85
            positions in the amino acid sequence.  Members of the
            peptidases S8 and S35 clan include endopeptidases,
            exopeptidases and also a tripeptidyl-peptidase. The S8
            family has an Asp/His/Ser catalytic triad similar to that
            found in trypsin-like proteases, but do not share their
            three-dimensional structure and are not homologous to
            trypsin. The S53 family contains a catalytic triad
            Glu/Asp/Ser with an additional acidic residue Asp in the
            oxyanion hole, similar to that of subtilisin.. The
            stability of these enzymes may be enhanced by calcium,
            some members have been shown to bind up to 4 ions via
            binding sites with different affinity. Some members of
            this clan contain disulfide bonds. These enzymes can be
            intra- and extracellular, some function at extreme
            temperatures and pH values.
          Length = 291

 Score = 30.4 bits (69), Expect = 8.6
 Identities = 20/57 (35%), Positives = 27/57 (47%), Gaps = 3/57 (5%)

Query: 1177 NSIVSNF-NTEMITDEVREKMHSRYLSNPDYSYEKANRASMACGPMVKWAIAQI-SY 1231
            N++V+NF N      +V       Y + PD  YE  +  SMA  P+V    A I SY
Sbjct: 218  NNLVANFSNYGKKNVDVFAPGERIYSTTPDNEYETDSGTSMAA-PVVSGVAALIWSY 273


>gnl|CDD|213197 cd03230, ABC_DR_subfamily_A, ATP-binding cassette domain of the
          drug resistance transporter and related proteins,
          subfamily A.  This family of ATP-binding proteins
          belongs to a multi-subunit transporter involved in drug
          resistance (BcrA and DrrA), nodulation, lipid
          transport, and lantibiotic immunity. In bacteria and
          archaea, these transporters usually include an
          ATP-binding protein and one or two integral membrane
          proteins. Eukaryotic systems of the ABCA subfamily
          display ABC domains that are quite similar to this
          family. The ATP-binding domain shows the highest
          similarity between all members of the ABC transporter
          family. ABC transporters are a subset of nucleotide
          hydrolases that contain a signature motif, Q-loop, and
          H-loop/switch region, in addition to, the Walker A
          motif/P-loop and Walker B motif commonly found in a
          number of ATP- and GTP-binding and hydrolyzing
          proteins.
          Length = 173

 Score = 29.7 bits (68), Expect = 8.6
 Identities = 15/48 (31%), Positives = 27/48 (56%), Gaps = 5/48 (10%)

Query: 44 HGLMMVGPSGSGKSTAWKVLLKALERYEG---VEGVAHIIDPKAISKE 88
          +GL  +GP+G+GK+T  K++L  L+   G   V G     +P+ + + 
Sbjct: 29 YGL--LGPNGAGKTTLIKIILGLLKPDSGEIKVLGKDIKKEPEEVKRR 74


>gnl|CDD|184585 PRK14239, PRK14239, phosphate transporter ATP-binding protein;
          Provisional.
          Length = 252

 Score = 30.1 bits (68), Expect = 8.7
 Identities = 12/22 (54%), Positives = 17/22 (77%), Gaps = 4/22 (18%)

Query: 48 MVGPSGSGKSTAWKVLLKALER 69
          ++GPSGSGKST    LL+++ R
Sbjct: 36 LIGPSGSGKST----LLRSINR 53


>gnl|CDD|221912 pfam13086, AAA_11, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins.
          Length = 220

 Score = 30.0 bits (68), Expect = 8.8
 Identities = 9/18 (50%), Positives = 14/18 (77%)

Query: 423 GPPGSGKTMTLLSALRAL 440
           GPPG+GKT T++  ++ L
Sbjct: 24  GPPGTGKTTTIVEIIKQL 41


>gnl|CDD|225438 COG2884, FtsE, Predicted ATPase involved in cell division [Cell
          division and chromosome partitioning].
          Length = 223

 Score = 29.9 bits (68), Expect = 9.0
 Identities = 9/20 (45%), Positives = 12/20 (60%)

Query: 49 VGPSGSGKSTAWKVLLKALE 68
           GPSG+GKST  K++     
Sbjct: 34 TGPSGAGKSTLLKLIYGEER 53


>gnl|CDD|218528 pfam05272, VirE, Virulence-associated protein E.  This family
          contains several bacterial virulence-associated protein
          E like proteins. These proteins contain a P-loop motif.
          Length = 198

 Score = 29.7 bits (67), Expect = 9.0
 Identities = 11/20 (55%), Positives = 13/20 (65%)

Query: 44 HGLMMVGPSGSGKSTAWKVL 63
          H L++ G  GSGKST  K L
Sbjct: 53 HVLILQGAQGSGKSTFLKKL 72


>gnl|CDD|182778 PRK10851, PRK10851, sulfate/thiosulfate transporter subunit;
           Provisional.
          Length = 353

 Score = 30.4 bits (69), Expect = 9.1
 Identities = 14/32 (43%), Positives = 19/32 (59%)

Query: 762 LIGVSGAGKTTLSRFVAFMNGLSVFQIRAHNK 793
           L+G SG+GKTTL R +A +   +   IR H  
Sbjct: 33  LLGPSGSGKTTLLRIIAGLEHQTSGHIRFHGT 64


>gnl|CDD|224260 COG1341, COG1341, Predicted GTPase or GTP-binding protein [General
           function prediction only].
          Length = 398

 Score = 30.4 bits (69), Expect = 9.1
 Identities = 17/73 (23%), Positives = 29/73 (39%), Gaps = 9/73 (12%)

Query: 714 TELREYVQARLKVFYEEELDVQLV--------LFDEVLDHVLRIDRIFRQPQGHLLLIGV 765
             LRE     L V        QLV          +E+ D          +  G ++++G 
Sbjct: 23  LGLREVEGGALSVARVRTTYHQLVEVPEDRSEPLEEIADTWESKSESAGKV-GVVMVVGP 81

Query: 766 SGAGKTTLSRFVA 778
             +GK+TL+ ++A
Sbjct: 82  VDSGKSTLTTYLA 94


>gnl|CDD|206649 cd01850, CDC_Septin, CDC/Septin GTPase family.  Septins are a
          conserved family of GTP-binding proteins associated
          with diverse processes in dividing and non-dividing
          cells. They were first discovered in the budding yeast
          S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and
          CDC12) required for normal bud morphology. Septins are
          also present in metazoan cells, where they are required
          for cytokinesis in some systems, and implicated in a
          variety of other processes involving organization of
          the cell cortex and exocytosis. In humans, 12 septin
          genes generate dozens of polypeptides, many of which
          comprise heterooligomeric complexes. Since septin
          mutants are commonly defective in cytokinesis and
          formation of the neck formation of the neck
          filaments/septin rings, septins have been considered to
          be the primary constituents of the neck filaments.
          Septins belong to the GTPase superfamily for their
          conserved GTPase motifs and enzymatic activities.
          Length = 275

 Score = 30.2 bits (69), Expect = 9.1
 Identities = 9/13 (69%), Positives = 11/13 (84%)

Query: 46 LMMVGPSGSGKST 58
          +M+VG SG GKST
Sbjct: 7  IMVVGESGLGKST 19


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 30.4 bits (69), Expect = 9.2
 Identities = 24/104 (23%), Positives = 50/104 (48%), Gaps = 6/104 (5%)

Query: 1034 GKIAETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIE 1093
             K+     Q+EE +K+L+ +  EL+   +  + +L   +      + RK+  + + AE+E
Sbjct: 98   EKLDNLENQLEEREKALSARELELEELEKQLDNELYR-VAGLTPEQARKLLLKLLDAELE 156

Query: 1094 KQTVEIAQKRVFVMEDLAQVEPAVMDAQQAVKEIKKQQLVELRS 1137
            ++      +RV  +E+ A +E A   AQ  + +  ++   E  S
Sbjct: 157  EE----KAQRVKKIEEEADLE-AERKAQNILAQAMQRIASETAS 195


>gnl|CDD|213181 cd03214, ABC_Iron-Siderophores_B12_Hemin, ATP-binding component
          of iron-siderophores, vitamin B12 and hemin
          transporters and related proteins.  ABC transporters,
          involved in the uptake of siderophores, heme, and
          vitamin B12, are widely conserved in bacteria and
          archaea. Only very few species lack representatives of
          the siderophore family transporters. The E. coli BtuCD
          protein is an ABC transporter mediating vitamin B12
          uptake. The two ATP-binding cassettes (BtuD) are in
          close contact with each other, as are the two
          membrane-spanning subunits (BtuC); this arrangement is
          distinct from that observed for the E. coli lipid
          flippase MsbA. The BtuC subunits provide 20
          transmembrane helices grouped around a translocation
          pathway that is closed to the cytoplasm by a gate
          region, whereas the dimer arrangement of the BtuD
          subunits resembles the ATP-bound form of the Rad50 DNA
          repair enzyme. A prominent cytoplasmic loop of BtuC
          forms the contact region with the ATP-binding cassette
          and represent a conserved motif among the ABC
          transporters.
          Length = 180

 Score = 29.7 bits (68), Expect = 9.2
 Identities = 11/19 (57%), Positives = 14/19 (73%), Gaps = 4/19 (21%)

Query: 49 VGPSGSGKSTAWKVLLKAL 67
          +GP+G+GKST    LLK L
Sbjct: 31 LGPNGAGKST----LLKTL 45


>gnl|CDD|234041 TIGR02868, CydC, thiol reductant ABC exporter, CydC subunit.  The
           gene pair cydCD encodes an ABC-family transporter in
           which each gene contains an N-terminal membrane-spanning
           domain (pfam00664) and a C-terminal ATP-binding domain
           (pfam00005). In E. coli these genes were discovered as
           mutants which caused the terminal heme-copper oxidase
           complex cytochrome bd to fail to assemble. Recent work
           has shown that the transporter is involved in export of
           redox-active thiol compounds such as cysteine and
           glutathione. The linkage to assembly of the cytochrome
           bd complex is further supported by the conserved operon
           structure found outside the gammaproteobacteria
           (cydABCD) containing both the transporter and oxidase
           genes components. The genes used as the seed members for
           this model are all either found in the
           gammproteobacterial context or the CydABCD context. All
           members of this family scoring above trusted at the time
           of its creation were from genomes which encode a
           cytochrome bd complex.
          Length = 530

 Score = 30.4 bits (69), Expect = 9.2
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 49  VGPSGSGKSTAWKVLLKALERYEG 72
           +GPSGSGKST    L   L+  +G
Sbjct: 367 LGPSGSGKSTLLATLAGLLDPLQG 390


>gnl|CDD|153370 cd07686, F-BAR_Fer, The F-BAR (FES-CIP4 Homology and
            Bin/Amphiphysin/Rvs) domain of Fer (Fes related) tyrosine
            kinase.  F-BAR domains are dimerization modules that bind
            and bend membranes and are found in proteins involved in
            membrane dynamics and actin reorganization. Fer (Fes
            related) is a cytoplasmic (or nonreceptor) tyrosine
            kinase expressed in a wide variety of tissues, and is
            found to reside in both the cytoplasm and the nucleus. It
            plays important roles in neuronal polarization and
            neurite development, cytoskeletal reorganization, cell
            migration, growth factor signaling, and the regulation of
            cell-cell interactions mediated by adherens junctions and
            focal adhesions. Fer kinase also regulates cell cycle
            progression in malignant cells. It contains an N-terminal
            F-BAR domain, an SH2 domain, and a C-terminal catalytic
            kinase domain. F-BAR domains form banana-shaped dimers
            with a positively-charged concave surface that binds to
            negatively-charged lipid membranes. They can induce
            membrane deformation in the form of long tubules.
          Length = 234

 Score = 30.0 bits (67), Expect = 9.4
 Identities = 20/58 (34%), Positives = 33/58 (56%), Gaps = 4/58 (6%)

Query: 1040 VEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQ-SQDIQAEIEKQT 1096
            V+Q E++ K +   ++EL   N     +L  MIKD+Q+ +K  +   Q I+AE+ K T
Sbjct: 71   VQQTEQLSKIMKTHAEEL---NSGPLHRLTMMIKDKQQVKKSYIGVHQQIEAEMYKVT 125


>gnl|CDD|227584 COG5259, RSC8, RSC chromatin remodeling complex subunit RSC8
            [Chromatin structure and dynamics / Transcription].
          Length = 531

 Score = 30.2 bits (68), Expect = 9.4
 Identities = 23/133 (17%), Positives = 49/133 (36%), Gaps = 14/133 (10%)

Query: 1005 LDFINHFVKLYREKCSELEEQQLH------LNVGLGKIAETVEQVEEMQKSLAVKSQELQ 1058
            LD      KL        EE+++       +   L K+   +  ++E++KS +++ QEL 
Sbjct: 401  LDSGKEKAKLQATN----EERKMERLRNVLIQAQLEKLKMKLGHLKELEKSTSLERQELD 456

Query: 1059 SKNEAANLKLKE---MIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQKRV-FVMEDLAQVE 1114
            +      L  +E    I +        +     +    K + E  +  +   + +   V+
Sbjct: 457  ANLLLRRLNAEEKLFAIDEVLSKCLNLISDDQGKNPDNKPSTETGENHLGQEISNETDVK 516

Query: 1115 PAVMDAQQAVKEI 1127
            P   +A Q  +  
Sbjct: 517  PISKEAPQFYRYW 529


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
            SseC is a secreted protein that forms a complex together
            with SecB and SecD on the surface of Salmonella. All
            these proteins are secreted by the type III secretion
            system. Many mucosal pathogens use type III secretion
            systems for the injection of effector proteins into
            target cells. SecB, SseC and SecD are inserted into the
            target cell membrane. where they form a small pore or
            translocon. In addition to SseC, this family includes the
            bacterial secreted proteins PopB, PepB, YopB and EspD
            which are thought to be directly involved in pore
            formation, and type III secretion system translocon.
          Length = 303

 Score = 30.1 bits (68), Expect = 9.5
 Identities = 10/50 (20%), Positives = 21/50 (42%)

Query: 1241 LRLELKSLEVQASENKAKGEETKDLITQLEKSIASYKDEYAQLIAQATAI 1290
                +  L+ +A + +A     +  + QL+  I   K+E+ +L  Q    
Sbjct: 232  TGAAVAVLQKEAGKLEADLALLRAEMEQLQAVIKQLKEEFKKLQEQFQQA 281


>gnl|CDD|214345 CHL00043, cemA, envelope membrane protein.
          Length = 261

 Score = 29.8 bits (68), Expect = 9.6
 Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 10/55 (18%)

Query: 1047 QKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQAEIEKQTVEIAQ 1101
            +K    K  EL+       L L EMIK+  E        Q ++  I K+T+++A+
Sbjct: 85   EKRALEKFIELEEL-----LLLDEMIKEYPE-----THLQKLRIGIHKETIQLAK 129


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
            proteins consist of three coiled-coiled domains and are
            involved in dynein-mediated minus end-directed transport
            from the Golgi apparatus to the endoplasmic reticulum
            (ER). For full functioning they bind with GSK-3beta
            pfam05350 to maintain the anchoring of microtubules to
            the centromere. It appears that amino-acid residues
            437-617 of BicD and the kinase activity of GSK-3 are
            necessary for the formation of a complex between BicD and
            GSK-3beta in intact cells.
          Length = 711

 Score = 30.6 bits (69), Expect = 9.6
 Identities = 28/100 (28%), Positives = 54/100 (54%), Gaps = 8/100 (8%)

Query: 1018 KCSELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANL---KLKEMIKD 1074
            K +   ++ L L   L ++   +  V+   + L+  SQEL+ +NE   L   +L++ IK+
Sbjct: 28   KEAYYLQRILELQAELKQLRAELSNVQAENERLSSLSQELKEENEMLELQRGRLRDEIKE 87

Query: 1075 QQEAEKRKVQSQDIQAEIEKQTVEIAQKRVFVMEDLAQVE 1114
             +  E R +Q     +E+E++ + + QK+V V+   +QVE
Sbjct: 88   YKFREARLLQD---YSELEEENISL-QKQVSVLRQ-SQVE 122


>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 424

 Score = 30.2 bits (69), Expect = 9.7
 Identities = 22/128 (17%), Positives = 42/128 (32%), Gaps = 18/128 (14%)

Query: 1020 SELEEQQLHLNVGLGKIAETVEQVEEMQKSLAVKSQELQSKNEAANLK--------LKEM 1071
            S    Q   L           E  +  +          +SK     L         LK +
Sbjct: 86   SRTNSQDALLQAENALPEWKKELEKPSEPKEEEPKAAAESKVVQKELDELRDELKELKNL 145

Query: 1072 IKDQQEAEKRKVQSQDIQAEIEKQTV------EIAQKRVFVMEDLAQVEPAVMD-AQQAV 1124
            ++DQ    ++  +     AE+ K+        EIA+K   +++ L +  P     A + +
Sbjct: 146  LEDQLSGLRQVERIPPEFAELYKRLKRSGLSPEIAEK---LLKLLLEHMPPRERTAWRYL 202

Query: 1125 KEIKKQQL 1132
             E+    +
Sbjct: 203  LELLANMI 210


>gnl|CDD|226127 COG3599, DivIVA, Cell division initiation protein [Cell division and
            chromosome partitioning].
          Length = 212

 Score = 29.7 bits (67), Expect = 9.7
 Identities = 17/136 (12%), Positives = 52/136 (38%), Gaps = 16/136 (11%)

Query: 1232 ADMLKKVEPLRLELKSLEVQASENKAK------GEETKDLI---TQLEKSIASYKDEYAQ 1282
             ++ ++++       S  +Q +E +A+        E  D++   +   + +       A 
Sbjct: 54   DELKEELKEAADAEDSQAIQQAETEAEELKQAAEAEADDILKRASAQAQRVFGKAQYKAD 113

Query: 1283 LIAQATAIKTDLDNVQAKF-YEYAQLIAQATA----IKTD--LDNVQAKVERSMALLKSL 1335
               +   I  +L++++ +   E  +L +   A     K +   + +++ V+   A  +  
Sbjct: 114  RYLEDAKIAQELEDLKRQAQVERQRLRSDIEAQLASAKQEDWDEILRSTVDEVEAANEEA 173

Query: 1336 GIERERWEATSETFRS 1351
                +  +A ++  R 
Sbjct: 174  ERLADAAQADADRLRD 189


>gnl|CDD|181939 PRK09544, znuC, high-affinity zinc transporter ATPase; Reviewed.
          Length = 251

 Score = 30.1 bits (68), Expect = 9.7
 Identities = 17/41 (41%), Positives = 25/41 (60%), Gaps = 3/41 (7%)

Query: 36 LYQIS-NLNHG--LMMVGPSGSGKSTAWKVLLKALERYEGV 73
          L  +S  L  G  L ++GP+G+GKST  +V+L  +   EGV
Sbjct: 20 LSDVSLELKPGKILTLLGPNGAGKSTLVRVVLGLVAPDEGV 60


>gnl|CDD|184697 PRK14475, PRK14475, F0F1 ATP synthase subunit B; Provisional.
          Length = 167

 Score = 29.5 bits (66), Expect = 9.8
 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 5/66 (7%)

Query: 1037 AETVEQVEEMQKSLAVKSQELQSKNEAANLKLKEMIKDQQEAEKRKVQSQDIQ--AEIEK 1094
             E   Q   M   LA    + +     A  KL+E IK + E  +RK+   + Q  A+++ 
Sbjct: 72   EEAERQAAAM---LAAAKADARRMEAEAKEKLEEQIKRRAEMAERKIAQAEAQAAADVKA 128

Query: 1095 QTVEIA 1100
              V++A
Sbjct: 129  AAVDLA 134


>gnl|CDD|213262 cd03295, ABC_OpuCA_Osmoprotection, ATP-binding cassette domain of
          the osmoprotectant transporter.  OpuCA is a the ATP
          binding component of a bacterial solute transporter
          that serves a protective role to cells growing in a
          hyperosmolar environment. ABC (ATP-binding cassette)
          transporter nucleotide-binding domain; ABC transporters
          are a large family of proteins involved in the
          transport of a wide variety of different compounds,
          like sugars, ions, peptides, and more complex organic
          molecules. The nucleotide binding domain shows the
          highest similarity between all members of the family.
          ABC transporters are a subset of nucleotide hydrolases
          that contain a signature motif, Q-loop, and
          H-loop/switch region, in addition, to the Walker A
          motif/P-loop and Walker B motif commonly found in a
          number of ATP- and GTP-binding and hydrolyzing
          proteins.
          Length = 242

 Score = 30.0 bits (68), Expect = 9.9
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query: 46 LMMVGPSGSGKSTAWKVLLKALERYEG 72
          L+++GPSGSGK+T  K++ + +E   G
Sbjct: 30 LVLIGPSGSGKTTTMKMINRLIEPTSG 56


>gnl|CDD|227320 COG4987, CydC, ABC-type transport system involved in cytochrome bd
           biosynthesis, fused ATPase and permease components
           [Energy production and conversion / Posttranslational
           modification, protein turnover, chaperones].
          Length = 573

 Score = 30.4 bits (69), Expect = 9.9
 Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 8/43 (18%)

Query: 49  VGPSGSGKSTAWKVLLKALE------RYEGVEGVAHIIDPKAI 85
           +G SGSGKST  ++L  A +         GVE +A  +D +A+
Sbjct: 370 LGRSGSGKSTLLQLLAGAWDPQQGSITLNGVE-IAS-LDEQAL 410


>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
            Synaptonemal complex protein 1 (SCP-1) is the major
            component of the transverse filaments of the synaptonemal
            complex. Synaptonemal complexes are structures that are
            formed between homologous chromosomes during meiotic
            prophase.
          Length = 787

 Score = 30.5 bits (68), Expect = 9.9
 Identities = 43/183 (23%), Positives = 78/183 (42%), Gaps = 24/183 (13%)

Query: 1000 TPRHYLDFINHFVKL---YREKCSELEEQQLHLNVGLGKIAETVEQVEEMQKS------- 1049
            T + Y+D  N+  K+   + E   + E  +L ++  L +  E ++ +EE  K        
Sbjct: 181  TRQVYMDLNNNIEKMILAFEELRVQAENARLEMHFKLKEDHEKIQHLEEEYKKEINDKEK 240

Query: 1050 ----LAVKSQELQSKNEAANLKLKEMI-KDQQEAEKRKVQSQDIQAEIEKQ---TVEIAQ 1101
                L ++  E ++K +     L+E   K  Q  EK K+Q ++++  IEK+   T E+  
Sbjct: 241  QVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELED 300

Query: 1102 KRVFVMEDLAQVEPAVMDAQQAVKEI------KKQQLVELRSMANPPSVVKLALESICLL 1155
             ++ +   ++  +    D Q A K I      K+ Q+ EL       S V    E+    
Sbjct: 301  IKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCS 360

Query: 1156 LGE 1158
            L E
Sbjct: 361  LEE 363


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.393 

Gapped
Lambda     K      H
   0.267   0.0889    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 83,718,427
Number of extensions: 8537761
Number of successful extensions: 12083
Number of sequences better than 10.0: 1
Number of HSP's gapped: 11705
Number of HSP's successfully gapped: 642
Length of query: 1614
Length of database: 10,937,602
Length adjustment: 110
Effective length of query: 1504
Effective length of database: 6,058,662
Effective search space: 9112227648
Effective search space used: 9112227648
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 66 (28.9 bits)