RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5601
         (270 letters)



>gnl|CDD|241242 cd01206, EVH1_Homer_Vesl, Homer/Vesl family proteins EVH1 domain.
           Homer/Vesl proteins are synaptic scaffolding proteins,
          required for long-term potentiation, a form of synaptic
          plasticity thought to underlie memory formation. They
          contains an N-terminal EVH1 domain and bind to both
          neurotransmitter receptors, such as the metabotropic
          group 1 glutamate receptor (mGluR) and to other
          scaffolding proteins via PPXXF motifs, in order to
          target them to the synaptic junction.  These mGluRs
          possess a long C-terminal intracellular tail that may
          be important for subcellular localization of the
          receptor. The C-terminus is also the site of binding by
          the immediate early gene (IEG), Homer 1a. In contrast
          to Homer 1a, other Homer members additionally encode a
          C-terminal coiled-coil (CC) domain and form multivalent
          complexes that bind group 1 mGluRs. Homer 1a competes
          with constitutively expressed CC-Homers to modify the
          association of group 1 mGluRs with CC-Homer complexes.
          Since Homer proteins are strikingly enriched at the
          postsynaptic density (PSD), these observations suggest
          a role for the Homer family in regulating synaptic
          metabotropic receptor function.  PSD-Zip45 (also named
          Homer 1c/Vesl-1L) has an EVH1 domain with a longer
          alpha-helix and its linking part included in the
          conserved region of Homer 1 (CRH1) interacts with the
          EVH1 domain of the neighbour CRH1 molecule in the
          crystal, suggesting that the EVH1 domain recognizes the
          PPXXF motif found in the binding partners, and the
          SPLTP sequence (P-motif) in the linking region of the
          CRH1. The two types of binding are partly overlapped in
          the EVH1 domain, implying a mechanism to regulate
          multimerization of Homer 1 family proteins.  Homer 2
          and Homer 3 are negative regulators of T cell
          activation. They bind the nuclear factor of activated T
          cells (NFAT) and compete with calcineurin binding. NFAT
          plays a critical role in calcium-dependent signaling in
          other cell types, including muscle and neurons.
          Homer-NFAT binding is also antagonized by active
          serine-threonine kinase AKT, enhancing TCR signaling
          via calcineurin-dependent dephosphorylation of NFAT
          resulting in changes in cytokine expression and an
          increase in effector-memory T cell populations in
          Homer-deficient mice. The EVH1 domains are part of the
          PH domain superamily.  There are 5 EVH1 subfamilies:
          Enables/VASP, Homer/Vesl, WASP, Dcp1, and Spred.
          Ligands are known for three of the EVH1 subfamilies,
          all of which bind proline-rich sequences: the
          Enabled/VASP family binds to FPPPP peptides, the
          Homer/Vesl family binds PPxxF peptides, and the WASP
          family binds LPPPEP peptides. EVH1 has a PH-like fold,
          despite having minimal sequence similarity to PH or PTB
          domains.
          Length = 109

 Score = 57.7 bits (140), Expect = 7e-11
 Identities = 19/21 (90%), Positives = 20/21 (95%)

Query: 1  MTFTKTSQKFGQWSDIRANTV 21
          MTFTKTSQKFGQW+D RANTV
Sbjct: 63 MTFTKTSQKFGQWADSRANTV 83


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 48.1 bits (115), Expect = 3e-06
 Identities = 53/246 (21%), Positives = 100/246 (40%), Gaps = 9/246 (3%)

Query: 25  QAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQEST 84
           + +L+ L          ++ L  +L          +    E++ L+     L   L    
Sbjct: 238 REELEELQEELKEAEEELEELTAELQELEEKLEELRL---EVSELEEEIEELQKELYALA 294

Query: 85  ANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAA-VELRKELTSVRSKNETLDSELK 143
             +   ++Q Q  +E    L+ +  ELEA     E+   EL +EL  +  K E L    +
Sbjct: 295 NEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL---KE 351

Query: 144 CRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHA 203
             +    +L   +  +E+      EL ++L ++RSK   L+ ++   + E+++L      
Sbjct: 352 ELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLER 411

Query: 204 VEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHRE 263
           +ED+ + L QE  EL     L +A+L  L A  +     ++ L  +L      L  +  E
Sbjct: 412 LEDRRERLQQEIEELLK--KLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469

Query: 264 INTALQ 269
           +  A Q
Sbjct: 470 LEEAEQ 475



 Score = 40.0 bits (94), Expect = 0.001
 Identities = 51/194 (26%), Positives = 93/194 (47%), Gaps = 6/194 (3%)

Query: 72  NNLRLTSA-LQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTS 130
             L  T   L      + E ++QL++ + + +K + +Y EL+A     E A+ L   L  
Sbjct: 179 RKLERTRENLDRLEDILNELERQLKSLERQAEKAE-RYKELKAELRELELAL-LVLRLEE 236

Query: 131 VRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCR 190
           +R + E L  ELK  +EE+++LT     +E+K     ELR E++ +  + E L  EL   
Sbjct: 237 LREELEELQEELKEAEEELEELTAELQELEEK---LEELRLEVSELEEEIEELQKELYAL 293

Query: 191 DEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQL 250
             E+ +L   K  + ++   L ++  EL+A +  ++++LD L          ++ L  +L
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353

Query: 251 AGRVAELATIHREI 264
               AEL  +  E+
Sbjct: 354 ESLEAELEELEAEL 367



 Score = 39.3 bits (92), Expect = 0.002
 Identities = 43/242 (17%), Positives = 101/242 (41%), Gaps = 9/242 (3%)

Query: 37  AQRRIIDTLNHQLAGRVADSANAKK----WEIELANLKSNNLRLTSALQESTANVEEWKK 92
            +RR I+ L  ++       A  +K       EL  L+    +L   L+E +  +   +K
Sbjct: 674 ERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRK 733

Query: 93  QLQAYKEENQKLKTKYIELEATKGVSEA-AVELRKELTSVRSKNETLDSELKCRDEEVKK 151
            L   + E ++L+ +  +L       EA   EL + L     +    ++E++  + ++++
Sbjct: 734 DLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ 793

Query: 152 LTNAKHAVEDKCKAA----VELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDK 207
           L     A+ +           L +E  ++R + E+L+  +   +  ++ L      + + 
Sbjct: 794 LKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSED 853

Query: 208 CKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINTA 267
            + L+ E  EL+  I  ++++L+ LL  + +    +  L  +L     EL  +  + +  
Sbjct: 854 IESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSEL 913

Query: 268 LQ 269
            +
Sbjct: 914 RR 915



 Score = 35.4 bits (82), Expect = 0.033
 Identities = 42/230 (18%), Positives = 102/230 (44%), Gaps = 22/230 (9%)

Query: 42  IDTLNHQLAGRVADSANAKKWEIELANLKSNNLR--LTSALQESTANVEEWKKQLQAYKE 99
           +  L   L     +    +  E++    ++      LT+ LQE    +EE + ++   +E
Sbjct: 222 LRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEE 281

Query: 100 ENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAV 159
           E ++L+ +   L        + +E +K++   R  N  L+ +L+  + ++++L +     
Sbjct: 282 EIEELQKELYALANEI----SRLEQQKQILRERLAN--LERQLEELEAQLEELESKL--- 332

Query: 160 EDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQ 219
           ++  +   EL ++L  ++ + E+L++EL+   E   +L   +  +E+  + L    +++ 
Sbjct: 333 DELAEELAELEEKLEELKEELESLEAELE---ELEAELEELESRLEELEEQLETLRSKV- 388

Query: 220 AAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINTALQ 269
                  AQL+  +AS + +   ++    +L  R   L     E+   L+
Sbjct: 389 -------AQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLE 431



 Score = 34.6 bits (80), Expect = 0.057
 Identities = 46/224 (20%), Positives = 97/224 (43%), Gaps = 19/224 (8%)

Query: 27  QLDTLLASQDAQRRIIDTLNHQLAGRVAD-SANAKKWEIELANLKSNNLRLTSALQESTA 85
           +L   +++       ++    QL  R+A  S    + E E+  L+         L E+ A
Sbjct: 723 ELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782

Query: 86  NVEEWKKQLQAYKEENQKLKTKYIELEATKG--------VSEAAVELRKELTSVRSKNET 137
            +EE + Q++  KEE + L+    EL A           + E    L + + +   + E 
Sbjct: 783 EIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLED 842

Query: 138 LDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELT-------SVRSKNETLDSELKCR 190
           L+ +++   E+++ L      +E+  +   EL  EL        S+      L SEL+  
Sbjct: 843 LEEQIEELSEDIESLA---AEIEELEELIEELESELEALLNERASLEEALALLRSELEEL 899

Query: 191 DEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLA 234
            EE+++L + +  +  + + L ++ A+L+  +  ++ ++D L  
Sbjct: 900 SEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQE 943



 Score = 31.6 bits (72), Expect = 0.54
 Identities = 27/163 (16%), Positives = 64/163 (39%), Gaps = 4/163 (2%)

Query: 112 EATKGVSEAAVELRKELTSVRSKNETLDSELKCRD----EEVKKLTNAKHAVEDKCKAAV 167
           E  + + E   EL K L  +R + E L+ EL+       E  ++++  +  +        
Sbjct: 684 EKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVE 743

Query: 168 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQA 227
           +L + +  +  +   L++E++  +E +++        E + + L  +  +L+  +  ++ 
Sbjct: 744 QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALRE 803

Query: 228 QLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINTALQT 270
            LD L A           L  +L      +A   R +    + 
Sbjct: 804 ALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQ 846


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 48.1 bits (115), Expect = 3e-06
 Identities = 49/226 (21%), Positives = 88/226 (38%), Gaps = 21/226 (9%)

Query: 24  VQAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQES 83
           V+ ++  L A   +  R I     +L          +  E  LA L++   +L + ++E 
Sbjct: 292 VKEKIGELEAEIASLERSIAEKEREL----------EDAEERLAKLEAEIDKLLAEIEEL 341

Query: 84  TANVEEWKKQLQAYKEENQKLKTKYIELEATKG-VSEAAVELRKELTSVRSKNETLDSEL 142
              +EE +K+     EE  +LK +  +L A    V +   E R EL   R K E L  E+
Sbjct: 342 EREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREI 401

Query: 143 KCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLT-NAK 201
                E+ +L       +   +   +L   +  + +K   L+ E + +  E+KK     +
Sbjct: 402 NELKRELDRLQEEL---QRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLE 458

Query: 202 HAVEDKCKL------LSQENAELQAAISLVQAQLDTLLASQDAQRR 241
               D  K       L +E   ++  +S +Q +L    A   A   
Sbjct: 459 QLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEE 504



 Score = 41.6 bits (98), Expect = 4e-04
 Identities = 54/255 (21%), Positives = 122/255 (47%), Gaps = 18/255 (7%)

Query: 28  LDTLLASQDAQRRIIDTLNHQLAGRVAD-SANAKKWEIELANLKSNNLRLTSALQESTAN 86
           L   L+S  ++ R I+    +L+  ++D S    + E E+  L+    +L   L+E   +
Sbjct: 686 LKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEED 745

Query: 87  VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAV-------------ELRKELTSVRS 133
           +   +++++  K E ++L+ +  ELE      E A+             E++ EL+ +  
Sbjct: 746 LSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEE 805

Query: 134 KNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELT----SVRSKNETLDSELKC 189
           +   +++ L+  ++++ +LT  K  +E + +   E R +L     S+  + E L+ + + 
Sbjct: 806 EVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEE 865

Query: 190 RDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQ 249
            +EE+++L  A   +E +   L +E  EL+A +  ++ +++ L A  + +R+ +  L  +
Sbjct: 866 LEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAK 925

Query: 250 LAGRVAELATIHREI 264
           L     EL+ I    
Sbjct: 926 LEALEEELSEIEDPK 940



 Score = 40.1 bits (94), Expect = 0.001
 Identities = 54/246 (21%), Positives = 103/246 (41%), Gaps = 21/246 (8%)

Query: 24  VQAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANA-----KKWEIELANLKSNNLRLTS 78
           ++ QL +L    +     I  L  +L                  E E   +K     L +
Sbjct: 242 IERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEA 301

Query: 79  ALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETL 138
            +     ++ E +++L+  +E   KL+ +  +L A         EL +E+   R + + L
Sbjct: 302 EIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIE------ELEREIEEERKRRDKL 355

Query: 139 DSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLT 198
             E     EE++ L      +E+  K   E R EL   R K E L  E+     E+ +L 
Sbjct: 356 TEEYAELKEELEDLRA---ELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRL- 411

Query: 199 NAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELA 258
                 +++ + LS+E A+L AAI+ ++A+++ L   ++ +   I     +L    A+L+
Sbjct: 412 ------QEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLS 465

Query: 259 TIHREI 264
              +E+
Sbjct: 466 KYEQEL 471



 Score = 36.6 bits (85), Expect = 0.013
 Identities = 36/162 (22%), Positives = 64/162 (39%), Gaps = 5/162 (3%)

Query: 113 ATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAA----VE 168
           A +G    +     EL  +R + E L  EL     E++++ N    +  +   A     E
Sbjct: 661 APRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGE 720

Query: 169 LRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQ 228
           + KE+  +  + E L   L+  +E++  L      V+ + K L     EL+  +  ++  
Sbjct: 721 IEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEA 780

Query: 229 LDTLLASQDAQR-RIIDTLNHQLAGRVAELATIHREINTALQ 269
           L+ L A     R   I     +L   V+ +    REI   L 
Sbjct: 781 LNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLN 822


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 41.6 bits (98), Expect = 3e-04
 Identities = 49/203 (24%), Positives = 91/203 (44%), Gaps = 15/203 (7%)

Query: 63  EIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAV 122
           E E++ L+     L + L+E    +EE K++++A KEE ++ +T   ELE      E A 
Sbjct: 301 EGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAK 360

Query: 123 E-----LRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVR 177
           E     L   L  +    E L  EL   + E+ ++ N             EL++E+ S+ 
Sbjct: 361 EELEEKLSALLEELEELFEALREELAELEAELAEIRNELE----------ELKREIESLE 410

Query: 178 SKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQD 237
            + E L   L+   EE+K+L      ++ + + L++E  EL+  +  ++ +L  L     
Sbjct: 411 ERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELA 470

Query: 238 AQRRIIDTLNHQLAGRVAELATI 260
             +  +  L  +L+   A L  +
Sbjct: 471 ELQEELQRLEKELSSLEARLDRL 493



 Score = 39.7 bits (93), Expect = 0.001
 Identities = 46/180 (25%), Positives = 86/180 (47%), Gaps = 7/180 (3%)

Query: 87  VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVE-LRKELTSVRSKNETLDSELKCR 145
           ++E +K+L+  +EE  +L+ +  EL+     +E  +E L+ EL  +R + E L  EL   
Sbjct: 234 LKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLEL 293

Query: 146 DEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVE 205
            EE+++L      + ++ +   EL  EL  +  + E L  +++   EE+++       +E
Sbjct: 294 KEEIEELEGEISLLRERLE---ELENELEELEERLEELKEKIEALKEELEERETLLEELE 350

Query: 206 DKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREIN 265
                L +   EL+  +S +  +L+ L     A R  +  L  +LA    EL  + REI 
Sbjct: 351 QLLAELEEAKEELEEKLSALLEELEELFE---ALREELAELEAELAEIRNELEELKREIE 407



 Score = 35.5 bits (82), Expect = 0.027
 Identities = 46/194 (23%), Positives = 91/194 (46%), Gaps = 8/194 (4%)

Query: 80  LQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLD 139
           L+     +EE +KQL+  + + +K + +Y EL+A     E A+ L  +L  +R + E L+
Sbjct: 188 LERLEDLLEELEKQLEKLERQAEKAE-RYQELKAELRELELAL-LLAKLKELRKELEELE 245

Query: 140 SELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTN 199
            EL   +EE+++L       E + +   EL+ EL  +R + E L  EL    EE+++L  
Sbjct: 246 EELSRLEEELEELQEELEEAEKEIE---ELKSELEELREELEELQEELLELKEEIEELEG 302

Query: 200 AKHAVEDKCKLLSQENAELQAAISLVQAQLD---TLLASQDAQRRIIDTLNHQLAGRVAE 256
               + ++ + L  E  EL+  +  ++ +++     L  ++     ++ L  +L     E
Sbjct: 303 EISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEE 362

Query: 257 LATIHREINTALQT 270
           L      +   L+ 
Sbjct: 363 LEEKLSALLEELEE 376



 Score = 34.7 bits (80), Expect = 0.053
 Identities = 33/156 (21%), Positives = 80/156 (51%), Gaps = 3/156 (1%)

Query: 114 TKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKEL 173
            K + E   EL  +L  +  + ++L +EL+  ++ +++L      +E +     EL++EL
Sbjct: 669 LKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQ---LEELKREL 725

Query: 174 TSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLL 233
            ++  + E L S L+  +EE+++L      ++++ + L +E   L+ A++ ++ +++ L 
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELE 785

Query: 234 ASQDAQRRIIDTLNHQLAGRVAELATIHREINTALQ 269
             + A +  ++ L  +L      L  + RE+ +  Q
Sbjct: 786 EKRQALQEELEELEEELEEAERRLDALERELESLEQ 821



 Score = 33.1 bits (76), Expect = 0.15
 Identities = 35/170 (20%), Positives = 77/170 (45%), Gaps = 1/170 (0%)

Query: 98  KEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKH 157
            E   +L+    EL++ K    +  +L +EL     + E    ELK     +++      
Sbjct: 677 AELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQ 736

Query: 158 A-VEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENA 216
           + +E+  +   EL +EL  ++ + E L+ EL+  +E + KL      +E+K + L +E  
Sbjct: 737 SRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796

Query: 217 ELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINT 266
           EL+  +   + +LD L    ++  +  + L  ++     E+  +  +++ 
Sbjct: 797 ELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDE 846



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 38/172 (22%), Positives = 71/172 (41%), Gaps = 8/172 (4%)

Query: 27  QLDTLLASQDAQRRIIDTLNHQLAGRVADSANAK-KWEIELANLKSNNLRLTSALQESTA 85
            L   L   + +    +     L   +      + + E E+  L+     L   L E   
Sbjct: 790 ALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEE 849

Query: 86  NVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCR 145
            +EE +K+L+  KEE ++L+ +  ELE      +   EL +E   +  +   L+SEL   
Sbjct: 850 ELEELEKELEELKEELEELEAEKEELE------DELKELEEEKEELEEELRELESELAEL 903

Query: 146 DEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKL 197
            EE++KL      +E K +  +E+         + E  D+     + E+++L
Sbjct: 904 KEEIEKLRERLEELEAKLE-RLEVELPELEEELEEEYEDTLETELEREIERL 954


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 39.8 bits (93), Expect = 0.001
 Identities = 50/188 (26%), Positives = 82/188 (43%), Gaps = 8/188 (4%)

Query: 63  EIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAV 122
           E ELA L+  N  LTS   ES A     K ++   +E + K K    ++E  K   E   
Sbjct: 273 EKELAELQHRNDELTSLAAESQA----LKDEIDVLRESSDKAKKLEAQVETYKKKLEDLN 328

Query: 123 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNET 182
           +LR+++  +  +N          +EE+KK   A+  +E   +   EL  +L+    K + 
Sbjct: 329 DLRRQVKLLEERNAMYMQNTVQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKADK 388

Query: 183 LDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRI 242
           L+ E K  +E+++ L   K  +  +   L + N EL+      QAQ D L  +       
Sbjct: 389 LEFEYKRLEEKLEALQKEKERLLAERDSLRETNEELRCG----QAQQDQLTQADAGTSPS 444

Query: 243 IDTLNHQL 250
            D L  +L
Sbjct: 445 GDNLAAEL 452



 Score = 28.6 bits (64), Expect = 4.0
 Identities = 47/223 (21%), Positives = 84/223 (37%), Gaps = 30/223 (13%)

Query: 54  ADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIE--- 110
            +S  A K E E   L+     L           E    +  + +E N++L+    +   
Sbjct: 381 EESKKADKLEFEYKRLEEKLEALQKE-------KERLLAERDSLRETNEELRCGQAQQDQ 433

Query: 111 -LEATKGVSEAAVELRKELTS--VRSKNETLDSE---LKCRDE--EVKKLTNAKHAVEDK 162
             +A  G S +   L  EL     R K   L  E   L+   E  E +++T  +  +ED 
Sbjct: 434 LTQADAGTSPSGDNLAAELLPSEYREKLIRLQHENKMLRLGQEGSENERITELQQLLEDA 493

Query: 163 CKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENA------ 216
            +   EL  +L   R  N+ +  EL+ + E+++K    + +  +   LL  +        
Sbjct: 494 NRRNNELETQL---RLANQRI-LELQQQVEDLQKALQEQGSKSEDSSLLKSKLEEHLEQL 549

Query: 217 -ELQAAISLVQAQLDTLLASQDAQR-RIIDTLNHQLAGRVAEL 257
            E    +   + Q++ L   QD    R I  L   L  +  ++
Sbjct: 550 HEANEELQKKREQIEELEPDQDQNLSRKIAELEAALQKKDEDM 592


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 39.1 bits (91), Expect = 0.002
 Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 10/161 (6%)

Query: 60  KKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSE 119
           KK + EL   + N   L + +   T+     K  L   K+EN  L+TK   + + K    
Sbjct: 416 KKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAK---- 471

Query: 120 AAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNA-KHAVEDKCKAAVELRKELTSVRS 178
                +K+  S++S  + L SE   R    K+L    K   E++  AA    +   S   
Sbjct: 472 -----QKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREE 526

Query: 179 KNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQ 219
             E+L    +  + E+KKL +     E++C++L +E  EL+
Sbjct: 527 CAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELR 567


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 39.3 bits (92), Expect = 0.002
 Identities = 36/184 (19%), Positives = 74/184 (40%), Gaps = 15/184 (8%)

Query: 87  VEEWKKQLQAYKEENQKLKTKYIELEAT-KGVSEAAVELRKELTSVRSKNETLDSELKCR 145
           V ++       +E  Q+ +       A  K   E  V+   EL   +       + LK  
Sbjct: 595 VPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQA 654

Query: 146 DEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVE 205
             ++++L N + +++DK + A+  RK+     ++   LD++LK   E+ +     +   +
Sbjct: 655 RLDLQRLQNEQQSLKDKLELAIAERKQ--QAETQLRQLDAQLKQLLEQQQAF--LEALKD 710

Query: 206 DKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREIN 265
           D  +L ++  A+ Q     +  QL  L A+ +A R              A L  + ++ +
Sbjct: 711 DFRELRTERLAKWQVVEGELDNQLAQLSAAIEAAR----------TQAKARLKELKKQYD 760

Query: 266 TALQ 269
             L 
Sbjct: 761 RELA 764



 Score = 36.6 bits (85), Expect = 0.012
 Identities = 48/244 (19%), Positives = 94/244 (38%), Gaps = 17/244 (6%)

Query: 27  QLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQESTAN 86
              + L  QD +  I D     L      +   +K + +   L S  LRL     E  A+
Sbjct: 218 PPKSRLNPQDVEHWIADI--QALRAIQKVAPEIEKLQEDFEQLLSLELRLQHLHGELVAD 275

Query: 87  VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRD 146
            E   ++ +  +E   +L+ +   LE    + EA  EL +EL++  +K     SEL+  +
Sbjct: 276 EERLAEEQEERQEAKNRLRQQLRTLE--DQLKEARDELNQELSAANAKLAADRSELELLE 333

Query: 147 EEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVED 206
           ++            D  +   +L  +L S+RS+ E +++ L     + + +      ++ 
Sbjct: 334 DQKG-----AFEDADIEQLQADL-DQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQ 387

Query: 207 KCKLLSQENAELQAAISLVQAQLDTLLASQDAQR-RIIDTLNHQLAGRVAELATIHREIN 265
           K K       +L+  +     +L  +   +D Q+  I + L    +    +L     E N
Sbjct: 388 KIKE------QLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFN 441

Query: 266 TALQ 269
               
Sbjct: 442 EEEY 445



 Score = 30.4 bits (69), Expect = 1.2
 Identities = 38/207 (18%), Positives = 72/207 (34%), Gaps = 26/207 (12%)

Query: 8   QKFGQWSDIRANTVSLVQAQLDTLLASQDAQRRIIDTL--NHQLAGRVADSANAKKWEIE 65
            + G + D     +     QL ++ +  +     +D L   HQ   R        K+E  
Sbjct: 334 DQKGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQR--------KYERL 385

Query: 66  LANLKSNNLRLTSALQESTANVEEWKKQLQAYKEE-----NQKLKTKYIELEATKGVSEA 120
              +K    R      E  A + E K + +A  EE       +L+ +    +      E 
Sbjct: 386 KQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEY 445

Query: 121 AVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVE-LRKELTSVRSK 179
            +ELR      R  + T   E   + E   +        +++ +A VE L+ EL  +R +
Sbjct: 446 ELELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQLQSELRQLRKR 505

Query: 180 NETLD----------SELKCRDEEVKK 196
            +              +L+   +E++ 
Sbjct: 506 RDEALEALQRAERRLLQLRQALDELEL 532


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 37.7 bits (88), Expect = 0.002
 Identities = 23/78 (29%), Positives = 36/78 (46%), Gaps = 3/78 (3%)

Query: 65  ELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVEL 124
           ELA L+     L   L E     +E K++L   + E ++L+    EL   K +S  A+EL
Sbjct: 74  ELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQK---ELARIKQLSANAIEL 130

Query: 125 RKELTSVRSKNETLDSEL 142
            +E   +R +   L  E 
Sbjct: 131 DEENRELREELAELKQEN 148



 Score = 33.4 bits (77), Expect = 0.077
 Identities = 16/92 (17%), Positives = 45/92 (48%), Gaps = 4/92 (4%)

Query: 150 KKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCK 209
           ++L   +  + +  +   EL++E   ++ +  TL++EL+   +E+ ++        +   
Sbjct: 73  QELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIE--- 129

Query: 210 LLSQENAELQAAISLVQAQLDTLLASQDAQRR 241
            L +EN EL+  ++ ++ + + L A  +  + 
Sbjct: 130 -LDEENRELREELAELKQENEALEAENERLQE 160



 Score = 29.6 bits (67), Expect = 1.2
 Identities = 26/101 (25%), Positives = 56/101 (55%), Gaps = 13/101 (12%)

Query: 87  VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRD 146
           + E +++L   +EE  +L+ +  EL+          EL++EL+++ ++ E L  EL    
Sbjct: 68  LPELQQELAELQEELAELQEQLAELQQENQ------ELKQELSTLEAELERLQKEL---- 117

Query: 147 EEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSEL 187
             +K+L+     ++++ +   ELR+EL  ++ +NE L++E 
Sbjct: 118 ARIKQLSANAIELDEENR---ELREELAELKQENEALEAEN 155



 Score = 29.2 bits (66), Expect = 1.7
 Identities = 17/80 (21%), Positives = 29/80 (36%)

Query: 24  VQAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQES 83
           +Q QL  L       ++ + TL  +L     + A  K+       L   N  L   L E 
Sbjct: 85  LQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELREELAEL 144

Query: 84  TANVEEWKKQLQAYKEENQK 103
               E  + + +  +E  Q+
Sbjct: 145 KQENEALEAENERLQENEQR 164



 Score = 28.4 bits (64), Expect = 2.8
 Identities = 14/89 (15%), Positives = 34/89 (38%)

Query: 24  VQAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQES 83
           +Q +L  L       +  +  L  +      + +  +     L    +   +L++   E 
Sbjct: 71  LQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIEL 130

Query: 84  TANVEEWKKQLQAYKEENQKLKTKYIELE 112
                E +++L   K+EN+ L+ +   L+
Sbjct: 131 DEENRELREELAELKQENEALEAENERLQ 159


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 37.0 bits (86), Expect = 0.010
 Identities = 40/186 (21%), Positives = 65/186 (34%), Gaps = 13/186 (6%)

Query: 38   QRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAY 97
            + + +           A+    K      A  + + L+  +   +S   VE  ++QLQ  
Sbjct: 828  REKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQEL 887

Query: 98   KEEN---QKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTN 154
            K +      LK   +ELE         +EL+K L+S   +N    +EL      +KKL N
Sbjct: 888  KIDVKSISSLKLVNLELE------SEIIELKKSLSSDLIENLEFKTEL---IARLKKLLN 938

Query: 155  AKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQE 214
                 E      V+L  EL  +      L    +  ++ +KK T            L   
Sbjct: 939  NIDLEEGPSIEYVKL-PELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNF 997

Query: 215  NAELQA 220
              EL  
Sbjct: 998  KKELAE 1003


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 35.0 bits (81), Expect = 0.031
 Identities = 16/94 (17%), Positives = 39/94 (41%), Gaps = 2/94 (2%)

Query: 168 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQE--NAELQAAISLV 225
           ELR +L  +  +NE L  +++   E          A   K ++  ++    +++A IS +
Sbjct: 24  ELRLDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQL 83

Query: 226 QAQLDTLLASQDAQRRIIDTLNHQLAGRVAELAT 259
           + +++      +  +R +      L+    +L  
Sbjct: 84  KEEIEQKRERIEELKRALAQRRSDLSSASYQLEK 117



 Score = 27.7 bits (62), Expect = 6.8
 Identities = 26/128 (20%), Positives = 55/128 (42%), Gaps = 10/128 (7%)

Query: 123 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNET 182
           ELR +L  +  +NE L  +++   E      + K A +         ++ L  +R++   
Sbjct: 24  ELRLDLARLLLENEELKQKVEEALEGATN-EDGKLAADLLKLEVARKKERLNQIRARISQ 82

Query: 183 LDSELKCRDEEVKKL--------TNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLA 234
           L  E++ + E +++L        ++   A     K  + +  +LQ  I   +++L   L 
Sbjct: 83  LKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKL-NALH 141

Query: 235 SQDAQRRI 242
           S  A++R 
Sbjct: 142 SLLAEKRS 149



 Score = 27.7 bits (62), Expect = 6.9
 Identities = 23/157 (14%), Positives = 60/157 (38%), Gaps = 15/157 (9%)

Query: 14  SDIRANTVSLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNN 73
              R +   L       LL +++ ++++ + L          +A+  K E+     + N 
Sbjct: 16  HCARNSLYELRLDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQ 75

Query: 74  LRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRS 133
           +R  + + +    +E+ +++++  K    + ++          +S A+ +L K   S   
Sbjct: 76  IR--ARISQLKEEIEQKRERIEELKRALAQRRSD---------LSSASYQLEKRRASQLE 124

Query: 134 KNETLDSELKCRDEEVKKLTN-AKHAVEDKCKAAVEL 169
           K +    E+K    ++  L +         C+   +L
Sbjct: 125 KLQ---DEIKRTRSKLNALHSLLAEKRSFLCRELAKL 158


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 35.1 bits (80), Expect = 0.037
 Identities = 39/149 (26%), Positives = 66/149 (44%), Gaps = 2/149 (1%)

Query: 81   QESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDS 140
              +    +E KK+ +  K+ ++  K K  E +      + A E +K   + +   E   +
Sbjct: 1414 AAAKKKADEAKKKAEEKKKADE-AKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKA 1472

Query: 141  -ELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTN 199
             E K + EE KK   AK   E+  K A E +K   + +  +E   +E   + +E KK   
Sbjct: 1473 DEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE 1532

Query: 200  AKHAVEDKCKLLSQENAELQAAISLVQAQ 228
            AK A E K     ++  EL+ A  L +A+
Sbjct: 1533 AKKADEAKKAEEKKKADELKKAEELKKAE 1561



 Score = 34.0 bits (77), Expect = 0.092
 Identities = 36/159 (22%), Positives = 67/159 (42%), Gaps = 4/159 (2%)

Query: 54   ADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEA 113
            A  A+  K + E A  K++  +  +   +  A   + + +  A + E  + K +  E + 
Sbjct: 1314 AKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKK- 1372

Query: 114  TKGVSEAAVELRKELTSVRSKNETLD--SELKCRDEEVKKLTNAKHAVEDKCKAAVELRK 171
             +   + A   +K+    +  +E      E K + +E+KK   AK   ++  K A E +K
Sbjct: 1373 KEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432

Query: 172  ELTSVRSKNETLDS-ELKCRDEEVKKLTNAKHAVEDKCK 209
               + +   E   + E K + EE KK   AK   E+  K
Sbjct: 1433 ADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471



 Score = 33.6 bits (76), Expect = 0.13
 Identities = 50/170 (29%), Positives = 71/170 (41%), Gaps = 13/170 (7%)

Query: 50   AGRVADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYI 109
            A + AD A  K  E + A+             E+    EE KK  +A K+  +  K    
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD-- 1473

Query: 110  ELEATKGVSEA--AVELRKELTSVRSKNETLD--SELKCRDEEVKKLTNAKHAVE----D 161
              EA K   EA  A E +K+    + K +     +E K + +E KK   AK A E    +
Sbjct: 1474 --EAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531

Query: 162  KCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLL 211
            + K A E +K     +  +E   +E   + EE KK   AK A EDK   L
Sbjct: 1532 EAKKADEAKKA-EEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMAL 1580



 Score = 32.8 bits (74), Expect = 0.23
 Identities = 48/195 (24%), Positives = 71/195 (36%), Gaps = 17/195 (8%)

Query: 33   ASQDAQRRIIDTLNHQLAGRVADSANAKKWE---IELANLKSNNLRLTSALQESTANVEE 89
            A +  + RI + +      +   +  AKK E   I+   LK          Q      EE
Sbjct: 1586 AKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEE 1645

Query: 90   WKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSEL------- 142
             KK  +  K E +       E +  +   + A E +K     +   E L  E        
Sbjct: 1646 KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705

Query: 143  ---KCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTN 199
               K   EE KK    K A E+    A E +KE    + K E    E K  +EE KK+ +
Sbjct: 1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE----EAKKDEEEKKKIAH 1761

Query: 200  AKHAVEDKCKLLSQE 214
             K   E K + + +E
Sbjct: 1762 LKKEEEKKAEEIRKE 1776



 Score = 30.9 bits (69), Expect = 0.76
 Identities = 30/127 (23%), Positives = 50/127 (39%), Gaps = 1/127 (0%)

Query: 82   ESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVS-EAAVELRKELTSVRSKNETLDS 140
            E   N EE +K  +A      + +      EA K    + A E +K   + +++ +    
Sbjct: 1246 EEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD 1305

Query: 141  ELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNA 200
            E K + EE KK   AK   E+  K A   +K+    +   E   +E +   +E +     
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEK 1365

Query: 201  KHAVEDK 207
              A E K
Sbjct: 1366 AEAAEKK 1372



 Score = 30.9 bits (69), Expect = 0.94
 Identities = 41/159 (25%), Positives = 63/159 (39%), Gaps = 11/159 (6%)

Query: 58   NAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYK-EENQKLKTKYIELEATKG 116
            N +  + E A +     R  +   E     +E KK  +  K +E +K + K    EA K 
Sbjct: 1251 NEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKK 1310

Query: 117  VSEA--AVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVE----DKCKAAVELR 170
              EA  A E +K+    + K +      K + EE KK   A  A      D+ +AA E  
Sbjct: 1311 AEEAKKADEAKKKAEEAKKKAD----AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKA 1366

Query: 171  KELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCK 209
            +     + + +      K + EE KK   AK   E+  K
Sbjct: 1367 EAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405



 Score = 30.5 bits (68), Expect = 1.0
 Identities = 38/132 (28%), Positives = 57/132 (43%), Gaps = 6/132 (4%)

Query: 81   QESTANVEEWKKQLQAYK--EENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETL 138
            +E+    +E KK  +A K  +E +K +      EA K       +  K+    +  +E  
Sbjct: 1493 EEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELK 1552

Query: 139  DSELKCRDEEVKKLTNAKHAVEDKCKA---AVELRK-ELTSVRSKNETLDSELKCRDEEV 194
             +E   + EE KK   AK A EDK  A   A E +K E   +    +  + E K + EE 
Sbjct: 1553 KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA 1612

Query: 195  KKLTNAKHAVED 206
            KK   AK   E+
Sbjct: 1613 KKAEEAKIKAEE 1624



 Score = 30.5 bits (68), Expect = 1.1
 Identities = 42/150 (28%), Positives = 63/150 (42%), Gaps = 13/150 (8%)

Query: 82   ESTANVEEWKKQLQAYKEE---NQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETL 138
            E     EE KK  +A K E   N  L+      +A +   E  ++L +E   ++++    
Sbjct: 1555 EELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKK 1614

Query: 139  DSELKCRDEEVKKLTNAKHAVE-------DKCKAAVELRKELTSVRSKNETLDSELKCRD 191
              E K + EE+KK    K  VE       ++ K A EL+K     + K      E K  +
Sbjct: 1615 AEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAA---EEAKKAE 1671

Query: 192  EEVKKLTNAKHAVEDKCKLLSQENAELQAA 221
            E+ KK   AK A ED+ K       E + A
Sbjct: 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701



 Score = 30.5 bits (68), Expect = 1.1
 Identities = 27/129 (20%), Positives = 49/129 (37%), Gaps = 5/129 (3%)

Query: 81   QESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDS 140
            +E+  + EE KK  +    E  +   +       +  +    E  ++   ++   E   +
Sbjct: 1233 EEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKA 1292

Query: 141  ELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNA 200
            +   + EE KK   AK   E+  KA  E +K+    + K +      K + EE KK   A
Sbjct: 1293 DEAKKAEEKKKADEAKKKAEEAKKAD-EAKKKAEEAKKKAD----AAKKKAEEAKKAAEA 1347

Query: 201  KHAVEDKCK 209
              A  +   
Sbjct: 1348 AKAEAEAAA 1356



 Score = 29.0 bits (64), Expect = 4.0
 Identities = 36/152 (23%), Positives = 54/152 (35%), Gaps = 29/152 (19%)

Query: 82   ESTANVEEWKKQLQAYK------------------------EENQKLKTKYIELEATKGV 117
            E     +E KK+ +  K                        E  +  +    E EA    
Sbjct: 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358

Query: 118  SEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVR 177
            +EAA E  +     + + +      K + EE KK   AK   E+  K A EL+K   + +
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKK 1418

Query: 178  SKNETLDSELKCRDEEVKKLTNAKHAVEDKCK 209
              +     E K + EE KK   AK   E+  K
Sbjct: 1419 KAD-----EAKKKAEEKKKADEAKKKAEEAKK 1445


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 34.4 bits (79), Expect = 0.060
 Identities = 41/193 (21%), Positives = 88/193 (45%), Gaps = 1/193 (0%)

Query: 60  KKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSE 119
            + E E   L+    RL          ++  +++L+  +   ++L+ K   LE  +   E
Sbjct: 242 AELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIE 301

Query: 120 AAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSK 179
              E  + L ++  + E L  +LK  +E ++KL      +E + +   E + EL  +  +
Sbjct: 302 ELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEE 361

Query: 180 N-ETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDA 238
             + L+  L+  ++E++K       +E+  + L +E AEL AA+  +Q +L+ L    + 
Sbjct: 362 RLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEE 421

Query: 239 QRRIIDTLNHQLA 251
             R ++ L  ++ 
Sbjct: 422 LERELEELEEEIK 434



 Score = 30.5 bits (69), Expect = 1.0
 Identities = 44/208 (21%), Positives = 90/208 (43%), Gaps = 8/208 (3%)

Query: 63  EIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKT-KYIELEATKGVSEAA 121
           ++E    +     L   ++     + E +++ +  +E   +L   + +ELEA K   E  
Sbjct: 217 KLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEEL 276

Query: 122 VELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNE 181
            EL + L  +  K E     L+  + E+++L      +    +   EL ++L S+  + E
Sbjct: 277 RELERLLEELEEKIE----RLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLE 332

Query: 182 TLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRR 241
            L+ +L+  + E+++L   K+ +    KLL +   EL+  +  ++ +L+  L        
Sbjct: 333 KLEEKLEKLESELEELAEEKNELA---KLLEERLKELEERLEELEKELEKALERLKQLEE 389

Query: 242 IIDTLNHQLAGRVAELATIHREINTALQ 269
            I  L  +LA   A L  I  E+    +
Sbjct: 390 AIQELKEELAELSAALEEIQEELEELEK 417



 Score = 28.6 bits (64), Expect = 4.1
 Identities = 22/101 (21%), Positives = 48/101 (47%)

Query: 85  ANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKC 144
           A +EE +++++  + E ++   +    E  +   E   +L +EL  +R + E L  +L  
Sbjct: 654 AALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGE 713

Query: 145 RDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDS 185
            ++ +++L + K  +E+  K   +L K L  +    E L  
Sbjct: 714 IEQLIEELESRKAELEELKKELEKLEKALELLEELREKLGK 754



 Score = 27.4 bits (61), Expect = 9.6
 Identities = 32/183 (17%), Positives = 79/183 (43%), Gaps = 2/183 (1%)

Query: 87  VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRD 146
           ++E K +++  + +  +L     +L          ++  +E+     + E L+ E++  +
Sbjct: 180 IKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQE-EQEEEELEQEIEALE 238

Query: 147 EEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVED 206
           E + +L   K  +E+     +E+           E    EL+   EE+++       +E 
Sbjct: 239 ERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELER 298

Query: 207 KCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINT 266
           + + L +E   L+A +  ++  L+ L + ++   ++ + L  +L   + ELA    E+  
Sbjct: 299 EIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLE-KLESELEELAEEKNELAK 357

Query: 267 ALQ 269
            L+
Sbjct: 358 LLE 360


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 33.5 bits (77), Expect = 0.11
 Identities = 28/110 (25%), Positives = 49/110 (44%), Gaps = 12/110 (10%)

Query: 92  KQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVR---SKNETLDSELKCRDEE 148
           + ++  +EEN +LK +  EL+          +L  EL   R         D E++ RD  
Sbjct: 429 ETVERLEEENSELKRELEELKREI------EKLESELERFRREVRDKVRKDREIRARDRR 482

Query: 149 VKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLT 198
           +++L      +E+K K   EL ++L  +R   +   S      + V+KLT
Sbjct: 483 IERLEK---ELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLT 529



 Score = 30.4 bits (69), Expect = 0.96
 Identities = 16/75 (21%), Positives = 37/75 (49%)

Query: 123 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNET 182
           +L + +  +  +N  L  EL+    E++KL +       + +  V   +E+ +   + E 
Sbjct: 426 KLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIER 485

Query: 183 LDSELKCRDEEVKKL 197
           L+ EL+ + + V++L
Sbjct: 486 LEKELEEKKKRVEEL 500


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 33.3 bits (77), Expect = 0.14
 Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 3/109 (2%)

Query: 68  NLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKE 127
            L+       + L+E+    EE +++ +  +EE  KL  +  E EA + + EA  E  + 
Sbjct: 531 ELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLL-EEAEKEAQQAIKEAKKEADEI 589

Query: 128 LTSVRSKNETLDSELKCRD--EEVKKLTNAKHAVEDKCKAAVELRKELT 174
           +  +R   +   + +K  +  E  K+L  A    E K K   E ++EL 
Sbjct: 590 IKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 32.7 bits (75), Expect = 0.20
 Identities = 34/135 (25%), Positives = 64/135 (47%), Gaps = 4/135 (2%)

Query: 59  AKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVS 118
            K  E EL   +    +L   L ++   + E +K+L+  ++E ++L+ KY E E  + + 
Sbjct: 607 LKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE-EEYEELR 665

Query: 119 EAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRK---ELTS 175
           E  +EL +EL  +R++ E L+   +   + ++KL       E   K   +L K    +  
Sbjct: 666 EEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEE 725

Query: 176 VRSKNETLDSELKCR 190
           +R K +   + LK R
Sbjct: 726 LREKVKKYKALLKER 740



 Score = 32.0 bits (73), Expect = 0.36
 Identities = 28/158 (17%), Positives = 67/158 (42%), Gaps = 1/158 (0%)

Query: 65  ELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVEL 124
           E+  L+     L  + ++    + E +++++  K+E ++L+ K  EL+  K  +E  ++L
Sbjct: 239 EIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKL 298

Query: 125 RKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLD 184
            +       +   ++  L   +EE+  +      +E+K +   EL+K+L  +  + E L+
Sbjct: 299 SEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELE 358

Query: 185 SELKCRDEEVKKLTNA-KHAVEDKCKLLSQENAELQAA 221
              +  +E   K     +           +   EL+  
Sbjct: 359 ERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEEL 396



 Score = 31.6 bits (72), Expect = 0.43
 Identities = 34/126 (26%), Positives = 59/126 (46%), Gaps = 9/126 (7%)

Query: 78  SALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNET 137
           S L+E    +EE  K+L+  +E  ++LK K  ELE      E   EL +E  + + + E 
Sbjct: 317 SRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELER 376

Query: 138 LDSELKCRDEE--VKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVK 195
           L   L     E   K+L   +       KA  E+ +E++ + ++   L  E+K   + ++
Sbjct: 377 LKKRLTGLTPEKLEKELEELE-------KAKEEIEEEISKITARIGELKKEIKELKKAIE 429

Query: 196 KLTNAK 201
           +L  AK
Sbjct: 430 ELKKAK 435



 Score = 31.2 bits (71), Expect = 0.60
 Identities = 40/179 (22%), Positives = 78/179 (43%), Gaps = 17/179 (9%)

Query: 60  KKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKE-----------ENQKLKTKY 108
           +++  EL  ++     +    ++    + E +K L+   E           +  + K K 
Sbjct: 455 EEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKK 514

Query: 109 IELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVE 168
             LE  +  +E   +L+++L  ++ + ++L  EL+  +E  KKL   +  +++  +   E
Sbjct: 515 YNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAE 574

Query: 169 LRKEL-----TSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAI 222
           L KEL      SV    E L  EL+    E  +L +A+  +E + K L +   EL  A 
Sbjct: 575 LLKELEELGFESVEELEERL-KELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAF 632



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 30/120 (25%), Positives = 58/120 (48%)

Query: 90  WKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEV 149
           +K   +  KE  ++++     ++ T+ + E   E  KEL  V  +   + SEL    EE+
Sbjct: 164 YKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREEL 223

Query: 150 KKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCK 209
           +KL      +E+  +   EL KEL S+      L+ +++  +E +++L      +E+K K
Sbjct: 224 EKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVK 283



 Score = 28.5 bits (64), Expect = 4.3
 Identities = 39/195 (20%), Positives = 84/195 (43%), Gaps = 17/195 (8%)

Query: 88  EEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDE 147
           E  +K ++  +   + +K K  ELE    V     E+  EL  +R + E L+ E+K  +E
Sbjct: 179 ERLEKFIKRTENIEELIKEKEKELEE---VLREINEISSELPELREELEKLEKEVKELEE 235

Query: 148 EVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDK 207
             +++   +  +E    +  +L +++  +  + E L  E++  +E+VK+L   K   E+ 
Sbjct: 236 LKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEY 295

Query: 208 CKL-------------LSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRV 254
            KL             + +  + L+  I+ ++ ++   L  ++ +   +     +L  R+
Sbjct: 296 IKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKE-LEEKEERLEELKKKLKELEKRL 354

Query: 255 AELATIHREINTALQ 269
            EL   H     A  
Sbjct: 355 EELEERHELYEEAKA 369



 Score = 28.1 bits (63), Expect = 6.7
 Identities = 39/139 (28%), Positives = 74/139 (53%), Gaps = 17/139 (12%)

Query: 80  LQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLD 139
           L+E TA ++  +K+L+  +E+ +KL+ +  ELE           L+KE  S   K + L 
Sbjct: 454 LEEYTAELKRIEKELKEIEEKERKLRKELRELEK---------VLKKE--SELIKLKELA 502

Query: 140 SELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTN 199
            +LK  +E++KK    +  +E K +   +L+++L  ++ + ++L  EL    E++++L  
Sbjct: 503 EQLKELEEKLKKYNLEE--LEKKAEEYEKLKEKLIKLKGEIKSLKKEL----EKLEELKK 556

Query: 200 AKHAVEDKCKLLSQENAEL 218
               +E K   L +E AEL
Sbjct: 557 KLAELEKKLDELEEELAEL 575


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 32.2 bits (73), Expect = 0.20
 Identities = 25/104 (24%), Positives = 45/104 (43%), Gaps = 8/104 (7%)

Query: 67  ANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRK 126
           A LK+  L+      +   + EE K++L+  ++E ++L  +  ELEA         E+++
Sbjct: 119 AGLKT--LQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYE------EVQE 170

Query: 127 ELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELR 170
            L  +  +N  L+  LK    EV  L      +E   +   E  
Sbjct: 171 RLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPEEEL 214



 Score = 31.4 bits (71), Expect = 0.46
 Identities = 18/92 (19%), Positives = 35/92 (38%), Gaps = 3/92 (3%)

Query: 126 KELTSVRSKNETLDSELKCR---DEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNET 182
           K L  V    +  +   + +   +E  K+       +E+      E+++ L  +  +N  
Sbjct: 122 KTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSR 181

Query: 183 LDSELKCRDEEVKKLTNAKHAVEDKCKLLSQE 214
           L+  LK    EV  L      +E   +L  +E
Sbjct: 182 LEEMLKKLPGEVYDLKKRWDELEPGVELPEEE 213


>gnl|CDD|241234 cd00837, EVH1_family, EVH1 (Drosophila Enabled
          (Ena)/Vasodilator-stimulated phosphoprotein (VASP)
          homology 1) domain.  The EVH1 domains are part of the
          PH domain superfamily.  EVH1 subfamilies include
          Enables/VASP, Homer/Vesl, WASP,  and Spred.  Ligands
          are known for three of the EVH1 subfamilies, all of
          which bind proline-rich sequences: the Enabled/VASP
          family binds to FPPPP peptides, the Homer/Vesl family
          binds PPxxF peptides, and the WASP family binds LPPPEP
          peptides. EVH1 has a PH-like fold, despite having
          minimal sequence similarity to PH or PTB domains.
          Length = 102

 Score = 30.9 bits (70), Expect = 0.22
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query: 1  MTFTKTSQKFGQWSDIR 17
          + + K +Q F QW+D R
Sbjct: 62 LVYNKATQTFHQWADDR 78


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 32.2 bits (74), Expect = 0.25
 Identities = 38/177 (21%), Positives = 83/177 (46%), Gaps = 30/177 (16%)

Query: 90  WKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEV 149
            KK   A +   + ++    E E  K   EA +E ++E+  +R++   L+ ELK R  E+
Sbjct: 24  EKKLGSAEELAKRIIEEAKKEAETLK--KEALLEAKEEVHKLRAE---LERELKERRNEL 78

Query: 150 KKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCK 209
           ++L       E +             +  + ETLD +++  D++ + L   +  + +K K
Sbjct: 79  QRL-------ERR-------------LLQREETLDRKMESLDKKEENLEKKEKELSNKEK 118

Query: 210 LLSQENAELQAAISLVQAQLDTL--LASQDAQRRIIDTLNHQLAGRVAELATIHREI 264
            L ++  EL+  I+  + +L+ +  L  ++A+  +++ +  +      E A + +EI
Sbjct: 119 NLDEKEEELEELIAEQREELERISGLTQEEAKEILLEEVEEEAR---HEAAKLIKEI 172


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 32.3 bits (74), Expect = 0.26
 Identities = 49/256 (19%), Positives = 98/256 (38%), Gaps = 37/256 (14%)

Query: 38  QRRIIDTLNHQLAGRVA----DSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQ 93
            R  ++ L  +    +A    D A+A+  E     L+  +  L   L+E     +   ++
Sbjct: 284 LRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEE 343

Query: 94  LQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEV---- 149
            ++ +E+   L+ +  EL       E A EL  EL   R   E    E++  +EE+    
Sbjct: 344 AESLREDADDLEERAEELR------EEAAELESELEEAREAVEDRREEIEELEEEIEELR 397

Query: 150 KKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKK------------- 196
           ++  +A   + +      ELR+E   +R +   L++ L+   E V++             
Sbjct: 398 ERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAGKCPEC 457

Query: 197 ---LTNAKHA-----VEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNH 248
              +  + H        ++ + L  E  +L+  +  V+ +L+      +A+ RI      
Sbjct: 458 GQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEER 517

Query: 249 QLAGRVAELATIHREI 264
                + EL    RE 
Sbjct: 518 --REDLEELIAERRET 531



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 51/240 (21%), Positives = 95/240 (39%), Gaps = 36/240 (15%)

Query: 49  LAGRVADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKY 108
           +  R+  + +  + E  +  L+     L   + E    +EE +++ +  +E   +L+ + 
Sbjct: 494 VEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEA 553

Query: 109 IELEATKGVSEAAVELRKELTSVRSKNETLDS---ELKCRDEEVKKLTNAKHAVEDKCKA 165
            E        EAA E  +E    R +   L+S   ELK R E ++++     A+ D    
Sbjct: 554 EEKR------EAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIADAEDE 607

Query: 166 AVELRKELTSVRSKNETLDSELKCRDEEVKKL----------------TNAKHA---VED 206
              LR++  ++   N+     L  + E  ++L                  A+     VE+
Sbjct: 608 IERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEE 667

Query: 207 KCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINT 266
           K   L +E  +LQA I  V+ +L+ L   ++  RR        L  RV  L  ++ E   
Sbjct: 668 KLDELREERDDLQAEIGAVENELEELEELRE--RR------EALENRVEALEALYDEAEE 719



 Score = 29.6 bits (67), Expect = 2.3
 Identities = 36/177 (20%), Positives = 75/177 (42%), Gaps = 19/177 (10%)

Query: 87  VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRD 146
           +EE +++++  + E + L+ +  E+E     +E  VE    +  +  + E L+       
Sbjct: 470 IEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLE------- 522

Query: 147 EEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVED 206
              + +   +  +E+K + A ELR+           L++E + + E   +         +
Sbjct: 523 ---ELIAERRETIEEKRERAEELRER-------AAELEAEAEEKREAAAEAEEEAEEARE 572

Query: 207 KCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHRE 263
           +   L+ + AEL+  I  +  ++ TLLA+       I+ L  +     AEL    RE
Sbjct: 573 EVAELNSKLAELKERIESL-ERIRTLLAAIADAEDEIERLREKREAL-AELNDERRE 627



 Score = 28.9 bits (65), Expect = 3.7
 Identities = 37/149 (24%), Positives = 65/149 (43%), Gaps = 10/149 (6%)

Query: 125 RKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLD 184
            KE   +  +   L+SEL   DEE+++        E++ + A E R E   V  ++E   
Sbjct: 198 EKEEKDLHERLNGLESELAELDEEIERY-------EEQREQARETRDEADEVLEEHEERR 250

Query: 185 SELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLAS---QDAQRR 241
            EL+  + E++ L       E + + L++E  +L+  +  ++ + D LLA     DA   
Sbjct: 251 EELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAE 310

Query: 242 IIDTLNHQLAGRVAELATIHREINTALQT 270
            ++    +L  R  EL     E   A Q 
Sbjct: 311 AVEARREELEDRDEELRDRLEECRVAAQA 339



 Score = 28.1 bits (63), Expect = 6.2
 Identities = 36/167 (21%), Positives = 63/167 (37%), Gaps = 3/167 (1%)

Query: 101 NQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVE 160
             +L     E+E  +   E A E R E   V  ++E    EL+  + E++ L       E
Sbjct: 212 ESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETE 271

Query: 161 DKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQA 220
            + +   EL +E+  +R + E L+ E      E         AVE + + L   + EL+ 
Sbjct: 272 RERE---ELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRD 328

Query: 221 AISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINTA 267
            +   +          ++ R   D L  +      E A +  E+  A
Sbjct: 329 RLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEA 375



 Score = 27.7 bits (62), Expect = 8.5
 Identities = 48/206 (23%), Positives = 80/206 (38%), Gaps = 15/206 (7%)

Query: 65  ELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEA---A 121
           EL  L++    L   + E+    EE  ++++  +E  ++L+ +  +L A  G+ +A   A
Sbjct: 252 ELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEA 311

Query: 122 VELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNE 181
           VE R+E    R   E  D   +CR         A+   ED    A +L +    +R +  
Sbjct: 312 VEARREELEDR-DEELRDRLEECRVAAQAHNEEAESLRED----ADDLEERAEELREEAA 366

Query: 182 TLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRR 241
            L+SEL+   E V+        +E+       E  EL+         L       +  R 
Sbjct: 367 ELESELEEAREAVEDRREEIEELEE-------EIEELRERFGDAPVDLGNAEDFLEELRE 419

Query: 242 IIDTLNHQLAGRVAELATIHREINTA 267
             D L  + A   A L T    +  A
Sbjct: 420 ERDELREREAELEATLRTARERVEEA 445


>gnl|CDD|144235 pfam00568, WH1, WH1 domain.  WASp Homology domain 1 (WH1) domain.
          WASP is the protein that is defective in
          Wiskott-Aldrich syndrome (WAS). The majority of point
          mutations occur within the amino- terminal WH1 domain.
          The metabotropic glutamate receptors mGluR1alpha and
          mGluR5 bind a protein called homer, which is a WH1
          domain homologue. A subset of WH1 domains has been
          termed a "EVH1" domain and appear to bind a polyproline
          motif.
          Length = 111

 Score = 30.5 bits (69), Expect = 0.33
 Identities = 4/18 (22%), Positives = 8/18 (44%)

Query: 1  MTFTKTSQKFGQWSDIRA 18
          M + +    F  ++D R 
Sbjct: 71 MEYNQARPFFHTFADSRC 88


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 31.7 bits (73), Expect = 0.42
 Identities = 40/159 (25%), Positives = 71/159 (44%), Gaps = 18/159 (11%)

Query: 70  KSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELT 129
           K    ++  A +E+   +EE KK+ +A K+E          LEA + + +   E  KEL 
Sbjct: 27  KIAEAKIKEAEEEAKRILEEAKKEAEAIKKE--------ALLEAKEEIHKLRNEFEKELR 78

Query: 130 SVRSKNETLDSELKCRDEEVKK----LTNAKHAVEDKCKAAVELRKELTSVRSKNETLDS 185
             R++ + L+  L  ++E + +    L   +  +E K K   + ++EL     + E L  
Sbjct: 79  ERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIE 138

Query: 186 ELKCRDEEVKKLTN--AKHAVEDKCKLLSQENAELQAAI 222
           E     E +  LT   AK  + +K +    E A  +AA+
Sbjct: 139 EQLQELERISGLTAEEAKEILLEKVE----EEARHEAAV 173



 Score = 29.7 bits (68), Expect = 1.9
 Identities = 38/176 (21%), Positives = 80/176 (45%), Gaps = 30/176 (17%)

Query: 91  KKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVK 150
            K  +A +E  + L+    E EA K   EA +E ++E+  +R++    + EL+ R  E++
Sbjct: 31  AKIKEAEEEAKRILEEAKKEAEAIK--KEALLEAKEEIHKLRNE---FEKELRERRNELQ 85

Query: 151 KLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKL 210
           KL                       +  K E LD +L+  ++  ++L   +  +E K + 
Sbjct: 86  KLEK--------------------RLLQKEENLDRKLELLEKREEELEKKEKELEQKQQE 125

Query: 211 LSQENAELQAAISLVQAQLDTL--LASQDAQRRIIDTLNHQLAGRVAELATIHREI 264
           L ++  EL+  I     +L+ +  L +++A+  +++ +  +      E A + +EI
Sbjct: 126 LEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEA---RHEAAVLIKEI 178


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 31.1 bits (70), Expect = 0.62
 Identities = 39/217 (17%), Positives = 93/217 (42%), Gaps = 13/217 (5%)

Query: 61  KWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLK----TKYIELEATKG 116
           K+E  +  +K  +      L++  + +E  +++++A +    +L      + I  E  + 
Sbjct: 313 KYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQFEL 372

Query: 117 VSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSV 176
           +++   +L +EL  +  +++ L   +K R  E + +     ++E   +    L + +T  
Sbjct: 373 MNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFK---SLEKTLRQYDSLIQNITRS 429

Query: 177 RSK--NETLDSELKCRDEEV-KKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLL 233
           RS+  +   DS LK   E++  K +    +++   K + + N E+Q  I   + +  TL 
Sbjct: 430 RSQIGHNVNDSSLKINIEQLFPKGSGINESIK---KSILELNDEIQERIKTEENKSITLE 486

Query: 234 ASQDAQRRIIDTLNHQLAGRVAELATIHREINTALQT 270
                 +  I+ L   L     EL+  + +   + + 
Sbjct: 487 EDIKNLKHDINELTQILEKLELELSEANSKFELSKEE 523


>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein. 
          Length = 326

 Score = 31.0 bits (70), Expect = 0.64
 Identities = 17/120 (14%), Positives = 46/120 (38%), Gaps = 4/120 (3%)

Query: 124 LRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETL 183
           + + LT++ S   +L   +        +L++    ++D   +  + +  L  + +  + L
Sbjct: 33  IYERLTNLESATASLSESV---STISSRLSDLSADLQDMTTSLDDAKATLNGLITTVQAL 89

Query: 184 DSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTL-LASQDAQRRI 242
            + +     E+  L+    A +     L          IS +++ +    L   D ++R+
Sbjct: 90  QTSVDSLSSELADLSLTVSAHDAAISDLQTTVHANSTDISNLKSSVSANGLNITDLEQRV 149


>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase.  The model describes
           alanine--tRNA ligase. This enzyme catalyzes the reaction
           (tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
           pyrophosphate + AMP) [Protein synthesis, tRNA
           aminoacylation].
          Length = 851

 Score = 30.8 bits (70), Expect = 0.81
 Identities = 23/108 (21%), Positives = 42/108 (38%), Gaps = 6/108 (5%)

Query: 154 NAKHAVEDKCK-AAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLS 212
              +  EDK K  A  L+     +  K E    E K   +E++ L   K A +   +L  
Sbjct: 701 EYLNEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELESL-KLKIADQKIDELKD 759

Query: 213 QENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATI 260
           +  AE    + ++   +D       + + + D L  +L   +  LA+ 
Sbjct: 760 K--AETINGVKVLVEVVDA--KDMKSLKTMADRLKSKLGSAIYVLASF 803


>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
           Provisional.
          Length = 276

 Score = 30.3 bits (69), Expect = 0.91
 Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 66  LANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELR 125
           + ++ S   R+ +A +E  + V E    L   +EEN++LK + +ELE+     +   +L 
Sbjct: 43  VGDVVSPVQRVVNAPREFVSGVFESLASLFDLREENEELKKELLELES---RLQELEQLE 99

Query: 126 KELTSVRSKNETLDSELKCRDEEVKKLTNAK 156
            E       N  L   L  ++    +   A+
Sbjct: 100 AE-------NARLRELLNLKESLDYQFITAR 123


>gnl|CDD|234309 TIGR03683, A-tRNA_syn_arch, alanyl-tRNA synthetase.  This family of
           alanyl-tRNA synthetases is limited to the archaea, and
           is a subset of those sequences identified by the model
           pfam07973 covering the second additional domain (SAD) of
           alanyl and threonyl tRNA synthetases.
          Length = 902

 Score = 30.7 bits (70), Expect = 0.97
 Identities = 14/34 (41%), Positives = 22/34 (64%), Gaps = 4/34 (11%)

Query: 87  VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEA 120
            EEWK+Q    ++E ++LK K  EL+  + +SEA
Sbjct: 771 FEEWKEQ----RKEIERLKKKLAELKIYELISEA 800


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 29.9 bits (68), Expect = 1.1
 Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 57  ANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAY---KEENQKLKTKYIELE 112
            N +  E  +A +   N RL   L+++   VEE +K+L+ Y   K+  + LK +  ELE
Sbjct: 41  KNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKARLKELE 99


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 30.4 bits (69), Expect = 1.2
 Identities = 21/85 (24%), Positives = 38/85 (44%), Gaps = 13/85 (15%)

Query: 168 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQA 227
           ELRK L  + S+NE     LK  +E ++K             +  Q    +Q+    +  
Sbjct: 70  ELRKRLAKLISENE----ALKAENERLQKREQ---------SIDQQIQQAVQSETQELTK 116

Query: 228 QLDTLLASQDAQRRIIDTLNHQLAG 252
           +++ L + +   + +ID L  +LAG
Sbjct: 117 EIEQLKSERQQLQGLIDQLQRRLAG 141


>gnl|CDD|237110 PRK12472, PRK12472, hypothetical protein; Provisional.
          Length = 508

 Score = 30.2 bits (68), Expect = 1.3
 Identities = 33/143 (23%), Positives = 53/143 (37%), Gaps = 27/143 (18%)

Query: 75  RLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSK 134
            L    +++    +E K    A   E   LK    +LE  K  ++A ++ R +     +K
Sbjct: 194 TLAREAEDAARAADEAKTAAAAAAREAAPLKASLRKLERAKARADAELK-RADKALAAAK 252

Query: 135 NETLDSELKCRDEEVK------------KLTNAKHAVEDKCKAAVELRKELTSVRSKNET 182
            +    E K R EE +            +L  AK   E K  AA   ++   +  +K   
Sbjct: 253 TD----EAKARAEERQQKAAQQAAEAATQLDTAKADAEAKRAAAAATKEAAKAAAAKKA- 307

Query: 183 LDSELKCRDEEVKKLTNAKHAVE 205
                    E  K  T+AK A+E
Sbjct: 308 ---------ETAKAATDAKLALE 321


>gnl|CDD|222790 PHA00430, PHA00430, tail fiber protein.
          Length = 568

 Score = 30.2 bits (68), Expect = 1.3
 Identities = 23/129 (17%), Positives = 42/129 (32%), Gaps = 8/129 (6%)

Query: 54  ADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEA 113
            + AN  + E + A  ++   R  +    S  N ++W+ +      E  + K       A
Sbjct: 165 RNEANRSRNEADRARNQAE--RFNNESGASATNTKQWRSEADGSNSEANRFKG-----YA 217

Query: 114 TKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVEL-RKE 172
               S       +  +S +  N   D   K          +  +A      AA    R +
Sbjct: 218 DSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHASEVNAANSATAAATSANRAK 277

Query: 173 LTSVRSKNE 181
             + R+K E
Sbjct: 278 QQADRAKTE 286


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
           binding.  This is the C-terminal conserved coiled coil
           region of a family of TATA element modulatory factor 1
           proteins conserved in eukaryotes. The proteins bind to
           the TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMF1_TATA_bd is the most conserved
           part of the TMFs. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells. The Rab6-binding
           domain appears to be the same region as this C-terminal
           family.
          Length = 121

 Score = 28.7 bits (65), Expect = 1.3
 Identities = 21/73 (28%), Positives = 35/73 (47%)

Query: 127 ELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSE 186
           EL S++ +   L++E     +E+ KLT     + +  K   EL KEL  +  + ET    
Sbjct: 32  ELASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKELEDLEQRYETTLEL 91

Query: 187 LKCRDEEVKKLTN 199
           L  + E V++L  
Sbjct: 92  LGEKSERVEELRA 104


>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein.  This family of proteins
            is found in bacteria. Proteins in this family are
            typically between 397 and 1543 amino acids in length.
            This family is the SidE protein in the Dot/Icm pathway of
            Legionella pneumophila bacteria. There is little
            literature describing the family.
          Length = 1443

 Score = 30.2 bits (68), Expect = 1.4
 Identities = 22/71 (30%), Positives = 36/71 (50%), Gaps = 8/71 (11%)

Query: 85   ANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSK----NETLDS 140
            +++E+ +KQLQ     + KL   Y+ LE TK +S    E  K  + ++SK     +T   
Sbjct: 1164 SDIEKLEKQLQ---VIDTKLADAYL-LEVTKQISALEKEKPKNQSELKSKIAKFFDTTAD 1219

Query: 141  ELKCRDEEVKK 151
                R+E +KK
Sbjct: 1220 IEVLRNERIKK 1230


>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
          Length = 900

 Score = 30.2 bits (69), Expect = 1.4
 Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 4/34 (11%)

Query: 87  VEEWKKQLQAYKEENQKLKTKYIELEATKGVSEA 120
            EEWK+Q    K+E +KL+ +  EL A++ +S+A
Sbjct: 767 FEEWKEQ----KKEIEKLRKELAELLASELLSKA 796


>gnl|CDD|183218 PRK11590, PRK11590, hypothetical protein; Provisional.
          Length = 211

 Score = 29.6 bits (67), Expect = 1.5
 Identities = 10/31 (32%), Positives = 13/31 (41%)

Query: 8   QKFGQWSDIRANTVSLVQAQLDTLLASQDAQ 38
             F +W         +VQ +L T L S DA 
Sbjct: 84  ADFVRWFRDNVTAFPVVQERLTTYLLSSDAD 114


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 30.0 bits (67), Expect = 1.6
 Identities = 27/103 (26%), Positives = 54/103 (52%), Gaps = 3/103 (2%)

Query: 95  QAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTN 154
           Q  + + QK++   +EL+  K   E A E+R ++TS  ++ E+    +K  + E+  L N
Sbjct: 193 QVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELDPLKN 252

Query: 155 AKHAVEDKCKAAVELRKELTSVRSKN---ETLDSELKCRDEEV 194
               +E      ++L  E+ +++S+    E  +SEL+ + E+V
Sbjct: 253 RLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKV 295


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 29.8 bits (67), Expect = 1.7
 Identities = 21/127 (16%), Positives = 49/127 (38%), Gaps = 8/127 (6%)

Query: 56  SANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATK 115
           SA  K+ E +  +L+         L+E     +E +++++  KE  +  K   +E EA +
Sbjct: 521 SALEKELEQKNEHLEK-------LLKEQEKLKKELEQEMEELKERERNKKL-ELEKEAQE 572

Query: 116 GVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTS 175
            +     E+   +  ++ K      E+K  ++ VK     +   +       +   +   
Sbjct: 573 ALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDKVR 632

Query: 176 VRSKNET 182
           +R   + 
Sbjct: 633 IRYFGQK 639


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 29.6 bits (66), Expect = 2.0
 Identities = 41/232 (17%), Positives = 80/232 (34%), Gaps = 20/232 (8%)

Query: 26  AQLDTLLASQDAQRRIIDTLNHQL------AGRVADSANAKKWEIELANLKSNNLR--LT 77
            QLD +     A R  + T   +                  + E E    +    R  L 
Sbjct: 74  FQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLA 133

Query: 78  SALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATK-GVSEAAVELRKELTSVRSKNE 136
            A QE    +    KQ Q  +   + L  +  +LEA    +  +  +L+   T ++S+  
Sbjct: 134 KAQQE----LARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVL 189

Query: 137 TLDSELKCRDEEVKKLTNAKHAVEDKCK-------AAVELRKELTSVRSKNETLDSELKC 189
            L       ++E + L    +A + + +       AA +  + +    ++      ++  
Sbjct: 190 DLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAA 249

Query: 190 RDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRR 241
           R E++++       +E     L QE A+L+A             A+Q  Q  
Sbjct: 250 RAEQIRERERQLQRLETAQARLEQEVAQLEAYYQAYVRLRQQAAATQRGQVL 301


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 29.5 bits (67), Expect = 2.1
 Identities = 28/144 (19%), Positives = 54/144 (37%), Gaps = 18/144 (12%)

Query: 63  EIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIEL------EATKG 116
            IE    + +N RL       T  + E   +L++Y  +   L+ +  ++      E  K 
Sbjct: 32  HIEDLKEELSNERLRKLRSLLTK-LSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKD 90

Query: 117 VSEAAVELRKELTSVRSKNETLDSELK-CRDE--EVKKLTNAKHAVEDKCKAAVELRKEL 173
           V E   ++ KE+  +  +   L++E+K    E   ++   N    ++        L  + 
Sbjct: 91  VEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFD--LDLSL----LLGFKY 144

Query: 174 TSVR--SKNETLDSELKCRDEEVK 195
            SV   +  E    ELK   +   
Sbjct: 145 VSVFVGTVPEDKLEELKLESDVEN 168



 Score = 28.0 bits (63), Expect = 6.8
 Identities = 20/90 (22%), Positives = 36/90 (40%), Gaps = 19/90 (21%)

Query: 76  LTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKN 135
            +  ++E    +EE +K+ ++  EE ++L  KY+E           + L + L     + 
Sbjct: 213 PSELIREIKEELEEIEKERESLLEELKELAKKYLEEL---------LALYEYLEIELERA 263

Query: 136 ETLDSELKCR----------DEEVKKLTNA 155
           E L   LK            ++ VKKL   
Sbjct: 264 EALSKFLKTDKTFAIEGWVPEDRVKKLKEL 293



 Score = 28.0 bits (63), Expect = 7.3
 Identities = 27/102 (26%), Positives = 39/102 (38%), Gaps = 23/102 (22%)

Query: 109 IELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVE 168
           +ELE     SE   E+++EL  +  + E+L  ELK   E  KK         ++  A  E
Sbjct: 205 LELEEEGTPSELIREIKEELEEIEKERESLLEELK---ELAKKY-------LEELLALYE 254

Query: 169 LRKELTSVRSKNETLDSELKCR----------DEEVKKLTNA 200
               L     + E L   LK            ++ VKKL   
Sbjct: 255 Y---LEIELERAEALSKFLKTDKTFAIEGWVPEDRVKKLKEL 293


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 29.6 bits (67), Expect = 2.2
 Identities = 31/140 (22%), Positives = 50/140 (35%), Gaps = 15/140 (10%)

Query: 65  ELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVEL 124
            L  L      +   L+E    VE+   +++     N   KT  +EL+A + V  A + L
Sbjct: 505 LLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKA-QIVVLAHMGL 563

Query: 125 RKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLD 184
                  + +       LK      KK      A E        L+ +   V  + +   
Sbjct: 564 PVPAEEAKVEAVDEVKFLK-----KKKGILDAGAFEST------LKDKKNKVLPEAKK-- 610

Query: 185 SELKCRDEEVKKLTNAKHAV 204
            +LK  D EV+ +T    AV
Sbjct: 611 RKLKLGD-EVEVITGEPGAV 629


>gnl|CDD|220717 pfam10360, DUF2433, Protein of unknown function (DUF2433).  This
          is a conserved 120 residue region of a family of
          proteins found in fungi. The function is not known.
          Length = 132

 Score = 28.3 bits (63), Expect = 2.3
 Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 6/76 (7%)

Query: 8  QKFGQWSDIRANTVSLVQAQLDTLLASQDAQRRIID-TLNHQLAGRVADSAN---AKKWE 63
           K  +   I       V+ +++TLL   ++QR +++  L+  L  +V  S N    K   
Sbjct: 19 SKLSRAKAIFGEVWDTVKDEVETLLTPNESQRNLLENALS--LFEKVPTSINGGIPKIGP 76

Query: 64 IELANLKSNNLRLTSA 79
          + L +  +  +R  +A
Sbjct: 77 LSLQSAAAEAMRDETA 92


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 29.2 bits (66), Expect = 2.4
 Identities = 19/53 (35%), Positives = 30/53 (56%), Gaps = 4/53 (7%)

Query: 210 LLSQENAELQAAISLVQAQLDTLLA-SQDAQRRIID---TLNHQLAGRVAELA 258
           LL+Q+ A L+  ++ ++A LD      +++Q +I D    LN  LA RV EL 
Sbjct: 141 LLNQQIAALRRQLAALEAALDASEKRDRESQAKIADLGRRLNVALAQRVQELN 193


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 29.2 bits (66), Expect = 2.5
 Identities = 22/109 (20%), Positives = 51/109 (46%), Gaps = 20/109 (18%)

Query: 84  TANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELK 143
             ++E+ +++++A K + ++LK   +  E          +L+++L   +SK E  + +L 
Sbjct: 276 EKSMEKLQEKIKALKYQLKRLKKMILLFEMIS-------DLKRKL---KSKFERDNEKLD 325

Query: 144 CRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDE 192
              +E KK        E+K K      K++  +  + E L+ +   ++E
Sbjct: 326 AEVKEKKKEKK----KEEKKK------KQIERLEERIEKLEVQATDKEE 364


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 29.0 bits (66), Expect = 2.8
 Identities = 13/48 (27%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 92  KQLQAYKEENQKLKTKYIELEATK-GVSEAAVELRKELTSVRSKNETL 138
           ++L+A +E N +L+ +  +LE     +     +L +EL  ++S+ E L
Sbjct: 1   ERLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKL 48



 Score = 28.6 bits (65), Expect = 3.3
 Identities = 11/42 (26%), Positives = 23/42 (54%)

Query: 65  ELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKT 106
             + L+    +L   L++  A  E+ +++L+  K E +KLK+
Sbjct: 9   RNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLKS 50


>gnl|CDD|220671 pfam10275, Peptidase_C65, Peptidase C65 Otubain.  This family of
           proteins conserved from plants to humans is a highly
           specific ubiquitin iso-peptidase that removes ubiquitin
           from proteins. The modification of cellular proteins by
           ubiquitin (Ub) is an important event that underlies
           protein stability and function in eukaryote being a
           dynamic and reversible process. Otubain carries several
           key conserved domains: (i) the OTU (ovarian tumour
           domain) in which there is an active cysteine protease
           triad (ii) a nuclear localisation signal, (iii) a Ub
           interaction motif (UIM)-like motif phi-xx-A-xxxs-xx-Ac
           (where phi indicates an aromatic amino acid, x indicates
           any amino acid and Ac indicates an acidic amino acid),
           (iv) a Ub-associated (UBA)-like domain and (v) the LxxLL
           motif.
          Length = 237

 Score = 28.8 bits (65), Expect = 2.8
 Identities = 21/131 (16%), Positives = 38/131 (29%), Gaps = 32/131 (24%)

Query: 75  RLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKG------------------ 116
            L    +  +A  +E+      Y ++ + L  KY  +   +G                  
Sbjct: 7   PLVGEKEPLSALTKEYALADPIYLKKIKDLSKKYSGIRRVRGDGNCFYRAFGFSYLEILL 66

Query: 117 VSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSV 176
             +   E+ + +  + S  E L        +E           ED     +EL   L  +
Sbjct: 67  RKKDPAEINRFIARIESLKEKLV---ALGFDE--------FTFEDFLDEFIEL---LEKL 112

Query: 177 RSKNETLDSEL 187
                T   EL
Sbjct: 113 EDGGSTSREEL 123


>gnl|CDD|182809 PRK10884, PRK10884, SH3 domain-containing protein; Provisional.
          Length = 206

 Score = 28.5 bits (64), Expect = 3.0
 Identities = 29/121 (23%), Positives = 49/121 (40%), Gaps = 19/121 (15%)

Query: 4   TKTSQKFGQWSDIRANTVSLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVADSANAKKWE 63
              +  + Q  D +  T  +   QL T      + R  +  L +Q+          K   
Sbjct: 61  VNANTNYAQIRDSKGRTAWIPLKQLST----TPSLRTRVPDLENQV----------KTLT 106

Query: 64  IELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVE 123
            +L N+ +   + T+ +Q+  A  +     +   KEENQKLK + I   A K V  A ++
Sbjct: 107 DKLNNIDNTWNQRTAEMQQKVAQSD---SVINGLKEENQKLKNQLI--VAQKKVDAANLQ 161

Query: 124 L 124
           L
Sbjct: 162 L 162


>gnl|CDD|178768 PLN03229, PLN03229, acetyl-coenzyme A carboxylase carboxyl
           transferase subunit alpha; Provisional.
          Length = 762

 Score = 29.1 bits (65), Expect = 3.3
 Identities = 38/164 (23%), Positives = 70/164 (42%), Gaps = 32/164 (19%)

Query: 60  KKWEIELAN-LKSNNLRLTSALQESTANVEEW-----KKQLQAYKEE-NQKL-------- 104
           K+ E+ELA  LKS  L +    +++    E+      ++++++  EE N+K+        
Sbjct: 611 KEIELELAGVLKSMGLEVIGVTKKNKDTAEQTPPPNLQEKIESLNEEINKKIERVIRSSD 670

Query: 105 ---KTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVED 161
              K + ++LE  K      V  ++       K E L+ ++K      +K+  A ++ E 
Sbjct: 671 LKSKIELLKLEVAKASKTPDVTEKE-------KIEALEQQIK------QKIAEALNSSEL 717

Query: 162 KCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVE 205
           K K   EL  EL + R      +  LK  D++ +        VE
Sbjct: 718 KEKFE-ELEAELAAARETAAESNGSLKNDDDKEEDSKEDGSRVE 760


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 28.2 bits (63), Expect = 3.6
 Identities = 18/97 (18%), Positives = 45/97 (46%)

Query: 168 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQA 227
           EL++EL  +  +   L+SEL+   E+++ L     ++E++ + L +   EL   +  ++ 
Sbjct: 90  ELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELAKELRELRQ 149

Query: 228 QLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREI 264
            L   +     +   +     +L   + EL ++  ++
Sbjct: 150 DLREEVEELREELERLQENLQRLQEAIQELQSLLEQL 186



 Score = 27.0 bits (60), Expect = 8.2
 Identities = 19/97 (19%), Positives = 46/97 (47%), Gaps = 2/97 (2%)

Query: 123 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNET 182
           EL++EL  +  +   L+SEL+   E+++ L     ++E++ ++  E  KEL       E 
Sbjct: 90  ELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELAKELR--EL 147

Query: 183 LDSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQ 219
                +  +E  ++L   +  ++   + + +  + L+
Sbjct: 148 RQDLREEVEELREELERLQENLQRLQEAIQELQSLLE 184


>gnl|CDD|236076 PRK07710, PRK07710, acetolactate synthase catalytic subunit;
           Reviewed.
          Length = 571

 Score = 28.6 bits (64), Expect = 4.0
 Identities = 16/53 (30%), Positives = 23/53 (43%), Gaps = 13/53 (24%)

Query: 81  QESTANVEEWKKQLQA--------YKEENQKLKTKY-IEL--EATKGVSEAAV 122
           +    N  EW   L+         YK  ++ +K +  IE+  E TKG  EA V
Sbjct: 344 EGKKENHHEWLSLLKNWKEKYPLSYKRNSESIKPQKAIEMLYEITKG--EAIV 394


>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
           Members of this protein family are homologs of ClpB, an
           ATPase associated with chaperone-related functions.
           These ClpB homologs, designated ClpV1, are a key
           component of the bacterial pathogenicity-associated type
           VI secretion system [Protein fate, Protein and peptide
           secretion and trafficking, Cellular processes,
           Pathogenesis].
          Length = 852

 Score = 28.4 bits (64), Expect = 4.7
 Identities = 30/162 (18%), Positives = 56/162 (34%), Gaps = 32/162 (19%)

Query: 124 LRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETL 183
           L +E       +E L  EL+    E+  L     A+E + +   EL + + ++R++ E  
Sbjct: 438 LEREAALGADHDERLA-ELR---AELAALEAELAALEARWQQEKELVEAILALRAELE-- 491

Query: 184 DSELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQAAISLVQAQLD------------- 230
                    +         A+  +   L    A  Q    LV  ++D             
Sbjct: 492 --ADADAPAD------DDAALRAQLAELEAALASAQGEEPLVFPEVDAQAVAEVVADWTG 543

Query: 231 ----TLLASQ-DAQRRIIDTLNHQLAGRVAELATIHREINTA 267
                ++  + +A   + D L  ++ G+   L  I   I TA
Sbjct: 544 IPVGRMVRDEIEAVLSLPDRLAERVIGQDHALEAIAERIRTA 585


>gnl|CDD|224483 COG1567, COG1567, CRISPR system related protein, RAMP superfamily
           [Defense    mechanisms].
          Length = 313

 Score = 28.2 bits (63), Expect = 4.8
 Identities = 18/124 (14%), Positives = 40/124 (32%), Gaps = 24/124 (19%)

Query: 60  KKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEEN-------QKLKTKYIELE 112
           ++ +  + N   + LR++S L     +    K    A   E        +  K  +I + 
Sbjct: 46  REIDDLIENFAKSPLRISSLLPYIGNDYLVPKPLHNALILEGNDEIKKKKLKKISFIPI- 104

Query: 113 ATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVE-LRK 171
                         +L S        D + + ++   ++      ++  K K A+   R 
Sbjct: 105 -------------DQLESFLD--GEADEKYQVKELPKRQTEIGVDSLSKKAKKAIHNGRI 149

Query: 172 ELTS 175
           +  S
Sbjct: 150 DADS 153


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
           Provisional.
          Length = 2722

 Score = 28.6 bits (64), Expect = 5.0
 Identities = 28/152 (18%), Positives = 64/152 (42%), Gaps = 12/152 (7%)

Query: 54  ADSANAKKWEIELANLKSNNLR--LTSALQESTANVEEWKKQLQAYKEENQKLKTKYIEL 111
            +  N    +  +   + NN+   ++++L + T    + K  ++ YKEE +KL+    ++
Sbjct: 695 KELQNLLSLKENIIKKQLNNIEQDISNSLNQYTIKYNDLKSSIEEYKEEEEKLEVYKHQI 754

Query: 112 EATKGVSEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRK 171
              K  +E  + L +    +     T +  L+ +D     + N ++ + +      E +K
Sbjct: 755 INRK--NEFILHLYENDKDLPDGKNTYEEFLQYKDT----ILNKENKISNDINILKENKK 808

Query: 172 ELTSVRSKNETLDSELKC----RDEEVKKLTN 199
               + +    L  +L+      DEE+K+L  
Sbjct: 809 NNQDLLNSYNILIQKLEAHTEKNDEELKQLLQ 840


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 28.3 bits (63), Expect = 5.5
 Identities = 34/214 (15%), Positives = 86/214 (40%), Gaps = 27/214 (12%)

Query: 15  DIRANTVSLVQAQLDTLLASQDAQRR-IIDTLNHQLAGRVADSANAKKWEIELANLKSNN 73
           D+  N++ + Q ++D+L++   AQR+ I+D +                  +E+ +L+ N 
Sbjct: 127 DVFLNSIFVGQGEMDSLISGDPAQRKKILDEI------------------LEINSLERNY 168

Query: 74  LRLTSALQESTANVEEW---KKQLQAYKEENQKLKTKYIELEATKGVSEAAVE-LRKELT 129
            +L   +    A +      +++L++   E + +K +  + E +  ++   +E L  E  
Sbjct: 169 DKLKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYN 228

Query: 130 SVRSKNETLDSELK---CRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSK-NETLDS 185
           +       L S L      ++   +  +     E      +E       +  +  + ++ 
Sbjct: 229 NAMDDYNNLKSALNELSSLEDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIIND 288

Query: 186 ELKCRDEEVKKLTNAKHAVEDKCKLLSQENAELQ 219
            +      +      K+ +E+K ++LS  +AE+ 
Sbjct: 289 PVYKNRNYINDYFKYKNDIENKKQILSNIDAEIN 322


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 28.1 bits (63), Expect = 5.5
 Identities = 31/137 (22%), Positives = 59/137 (43%), Gaps = 11/137 (8%)

Query: 123 ELRKELTSVRSKNETLDSELKCRDEEVKKLTNA---KHAVEDKCKAAVELRKELTSVRSK 179
           EL+KEL  +  K E L++E++ ++EE++K  N    K   +   K    L + L  +  K
Sbjct: 99  ELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYK 158

Query: 180 NETLDSELKCRDEEVKKLTNAKH-----AVEDKCKLLSQENAELQAAISLVQAQLDTLLA 234
               +   K   E    + NA        ++ K K L   N    A ++L     D +  
Sbjct: 159 KNFKE---KLLKELKSVILNASSLLSLEELKAKIKTLFSSNKPELALLTLSVIDFDEIEQ 215

Query: 235 SQDAQRRIIDTLNHQLA 251
           ++  ++ II + +  ++
Sbjct: 216 AEILEKSIIGSSDVPIS 232



 Score = 28.1 bits (63), Expect = 6.3
 Identities = 38/169 (22%), Positives = 67/169 (39%), Gaps = 26/169 (15%)

Query: 60  KKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEE----NQKLKTKY------I 109
           +    +      + L       E   +VEE K  L+A +E      QKL+ K       I
Sbjct: 295 ESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQKLEEKLKDPSTSI 354

Query: 110 ELEATKGVSEAAVELRKEL-TSVRSKNE---TLDSEL-KCRD--------EEVKKLTNAK 156
           ELE+   + E+  ++   +   +R  NE    L  E  K +         E  + +   +
Sbjct: 355 ELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQ 414

Query: 157 HAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVE 205
              +   KA   L KE+  + ++ + L+ E+K   E  K+LTN +   +
Sbjct: 415 KEKKGLEKAINSLEKEIKQLEAEIKALEKEIK---ELEKQLTNIEPTAD 460


>gnl|CDD|235399 PRK05303, flgI, flagellar basal body P-ring protein; Provisional.
          Length = 367

 Score = 27.8 bits (63), Expect = 5.6
 Identities = 7/24 (29%), Positives = 12/24 (50%)

Query: 245 TLNHQLAGRVAELATIHREINTAL 268
           T+N    GR+   A + RE+ +  
Sbjct: 160 TVNVPTVGRIPNGAIVEREVPSDF 183


>gnl|CDD|233513 TIGR01657, P-ATPase-V, P-type ATPase of unknown pump specificity
           (type V).  These P-type ATPases form a distinct clade
           but the substrate of their pumping activity has yet to
           be determined. This clade has been designated type V in.
          Length = 1054

 Score = 28.1 bits (63), Expect = 5.9
 Identities = 17/77 (22%), Positives = 31/77 (40%), Gaps = 3/77 (3%)

Query: 89  EWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVEL--RKELTSVRSKNETLDSELKCRD 146
           EWK +L+     N+  +T  I        S+  VEL  +     ++++N     E +   
Sbjct: 39  EWKVKLRYVPVSNEDAETVVIVDPTPNSGSDYIVELSNKSLSNDLQTENAVEGGE-EPIY 97

Query: 147 EEVKKLTNAKHAVEDKC 163
            + +K   + H  E K 
Sbjct: 98  FDFRKQRFSYHEKELKI 114


>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242). 
          Length = 401

 Score = 27.9 bits (62), Expect = 6.7
 Identities = 24/103 (23%), Positives = 51/103 (49%), Gaps = 1/103 (0%)

Query: 148 EVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDK 207
           +++KLT     + +  ++  E +KE   ++ +N+ L ++L        K       +E+ 
Sbjct: 76  KIRKLTEQDEGLRNIRESIEERQKESDQLKLQNQKLVNQLGHARGVFIKTKGRYQKLEEL 135

Query: 208 CKLLSQENAELQAAI-SLVQAQLDTLLASQDAQRRIIDTLNHQ 249
            + L +EN  LQ  + +LVQ   + +  +Q+  R + +TL +Q
Sbjct: 136 SRHLKEENQCLQIQLDALVQECNEKIEENQELNRELAETLAYQ 178


>gnl|CDD|187543 cd05232, UDP_G4E_4_SDR_e, UDP-glucose 4 epimerase, subgroup 4,
           extended (e) SDRs.  UDP-glucose 4 epimerase (aka
           UDP-galactose-4-epimerase), is a homodimeric extended
           SDR. It catalyzes the NAD-dependent conversion of
           UDP-galactose to UDP-glucose, the final step in Leloir
           galactose synthesis. This subgroup is comprised of
           bacterial proteins, and includes the Staphylococcus
           aureus capsular polysaccharide Cap5N, which may have a
           role in the synthesis of UDP-N-acetyl-d-fucosamine. This
           subgroup has the characteristic active site tetrad and
           NAD-binding motif of the extended SDRs. Extended SDRs
           are distinct from classical SDRs. In addition to the
           Rossmann fold (alpha/beta folding pattern with a central
           beta-sheet) core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids. Extended SDRs are a
           diverse collection of proteins, and include isomerases,
           epimerases, oxidoreductases, and lyases; they typically
           have a TGXXGXXG cofactor binding motif. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 303

 Score = 27.7 bits (62), Expect = 6.8
 Identities = 14/74 (18%), Positives = 29/74 (39%), Gaps = 13/74 (17%)

Query: 208 CKLLSQENAELQAAI--------SLVQAQLDTLLASQDAQRRIID-----TLNHQLAGRV 254
              L     E++ A+        S+V A+L  + +  D    +          H +  + 
Sbjct: 16  VDKLLSRGEEVRIAVRNAENAEPSVVLAELPDIDSFTDLFLGVDAVVHLAARVHVMNDQG 75

Query: 255 AELATIHREINTAL 268
           A+  + +R++NT L
Sbjct: 76  ADPLSDYRKVNTEL 89


>gnl|CDD|237746 PRK14529, PRK14529, adenylate kinase; Provisional.
          Length = 223

 Score = 27.4 bits (61), Expect = 6.9
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 97  YKEENQKLKTKYIELEATKGVSEAAVELRKEL 128
           +K+   K  TKYIEL+    + E    L K+L
Sbjct: 191 FKDLAAKGSTKYIELDGEGSIDEIKETLLKQL 222


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 27.3 bits (61), Expect = 7.0
 Identities = 29/108 (26%), Positives = 50/108 (46%), Gaps = 4/108 (3%)

Query: 35  QDAQRRIIDTLNHQLAGRVADSANAKKWEIELANLKSNNLRLTSALQESTANVEEWKKQL 94
           Q+ + RI        A +      A   EI++A  + N+L     L E    +E+ +K++
Sbjct: 69  QEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLE--DELAELMEEIEKLEKEI 126

Query: 95  QAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKNETLDSEL 142
           +  KE  ++L+    E EA   + E   E+R+E   + SK E L  +L
Sbjct: 127 EDLKERLERLEKNLAEAEAR--LEEEVAEIREEGQELSSKREELKEKL 172


>gnl|CDD|223020 PHA03246, PHA03246, large tegument protein UL36; Provisional.
          Length = 3095

 Score = 28.0 bits (62), Expect = 7.0
 Identities = 14/49 (28%), Positives = 24/49 (48%)

Query: 147  EEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRDEEVK 195
              + KLT      +     + ++R+++   RS+ ETL + L  R  EVK
Sbjct: 1032 NSIVKLTKGLDGAKLASSLSSDIREKIRQKRSETETLIARLSARYAEVK 1080


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 28.0 bits (63), Expect = 7.1
 Identities = 19/60 (31%), Positives = 25/60 (41%), Gaps = 4/60 (6%)

Query: 207 KCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELATIHREINT 266
           K KLL  E+  +Q    L Q     LL   D Q+   + L  QLA   A+L     E+  
Sbjct: 50  KQKLLEAEDKLVQQ--DLEQTLA--LLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEA 105


>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20.  This
           family consists of several phage minor structural
           protein GP20 sequences of around 180 residues in length.
           The function of this family is unknown.
          Length = 156

 Score = 26.9 bits (60), Expect = 7.9
 Identities = 15/70 (21%), Positives = 34/70 (48%), Gaps = 7/70 (10%)

Query: 118 SEAAVELRKELTSVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVR 177
            +     + +L     + ++L  +LK RD++++ L       + K K   EL+K++  ++
Sbjct: 12  GKDIQNPKSKLDKANEERDSLKKQLKNRDKQIEDL-------KKKVKDNEELQKKIEKLK 64

Query: 178 SKNETLDSEL 187
            +N+T   E 
Sbjct: 65  QQNKTAKEEY 74


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 27.6 bits (61), Expect = 8.0
 Identities = 31/152 (20%), Positives = 65/152 (42%), Gaps = 11/152 (7%)

Query: 75  RLTSALQESTANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSK 134
                L+   +  E    +L   K++ ++ K K  E E      +   EL+ EL + +  
Sbjct: 591 STLEELETLKSEKENLDGELSKCKDDLEESKNKLQETE------KKLEELKSELDASQES 644

Query: 135 NETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVE----LRKELTSVRSKNETLDSELKCR 190
           N   +++LKC  E  + L      +E + K+  E    L  EL   R  ++ L ++ +  
Sbjct: 645 NSLAETQLKCMVESYESLELRAKDLEAEHKSLQEKISSLEDELEKERQNHQELIAKCREL 704

Query: 191 DEEVKKLTNAK-HAVEDKCKLLSQENAELQAA 221
           +E++++    +     D+ +   ++  E+ AA
Sbjct: 705 EEKIERAEQEENMQKLDEEEQKIKQEKEIAAA 736


>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation,
           ribosomal structure and biogenesis].
          Length = 363

 Score = 27.6 bits (62), Expect = 8.1
 Identities = 23/100 (23%), Positives = 44/100 (44%), Gaps = 4/100 (4%)

Query: 141 ELKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNETLDSELKCRD--EEVKKLT 198
            L  R EE++ L +    + D      +L KE   +    E      K ++  E+ K++ 
Sbjct: 11  SLLERYEELEALLSDPEVISDP-DEYRKLSKEYAELEPIVEKYREYKKAQEDLEDAKEML 69

Query: 199 NAKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDA 238
             +   E + ++  +E  EL+A I  ++ +L  LL  +D 
Sbjct: 70  AEEKDPEMR-EMAEEEIKELEAKIEELEEELKILLLPKDP 108


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
           biogenesis [Translation, ribosomal structure and
           biogenesis].
          Length = 1077

 Score = 27.8 bits (61), Expect = 8.5
 Identities = 34/154 (22%), Positives = 61/154 (39%), Gaps = 9/154 (5%)

Query: 59  AKKWEIELANLKSNNLRLTSALQESTANVEEWKKQLQAYKEEN--QKLKTKYIELEATKG 116
            KKW   LA LKS       A  +S    EE  +  +    E+   +  +   E+E ++G
Sbjct: 567 KKKWS-SLAQLKSR--FQKDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRG 623

Query: 117 VSEAA--VELRKELT--SVRSKNETLDSELKCRDEEVKKLTNAKHAVEDKCKAAVELRKE 172
            S  A   E   E+   + R +N     EL+   E  ++    K  V+   +   ++ ++
Sbjct: 624 SSVTAENEESADEVDYETEREENARKKEELRGNFELEERGDPEKKDVDWYTEEKRKIEEQ 683

Query: 173 LTSVRSKNETLDSELKCRDEEVKKLTNAKHAVED 206
           L   RS+ ET+  E +   E  +     +  +  
Sbjct: 684 LKINRSEFETMVPESRVVIEGYRAGRYVRIVLSH 717


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 27.4 bits (61), Expect = 8.5
 Identities = 15/65 (23%), Positives = 27/65 (41%)

Query: 200 AKHAVEDKCKLLSQENAELQAAISLVQAQLDTLLASQDAQRRIIDTLNHQLAGRVAELAT 259
           +  A + + K + +E A L+  I   Q Q   L     +    I +L  QL     +L  
Sbjct: 32  SAAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKK 91

Query: 260 IHREI 264
           + ++I
Sbjct: 92  LRKQI 96


>gnl|CDD|217511 pfam03358, FMN_red, NADPH-dependent FMN reductase. 
          Length = 151

 Score = 26.8 bits (60), Expect = 8.8
 Identities = 12/54 (22%), Positives = 22/54 (40%), Gaps = 3/54 (5%)

Query: 46 NHQLAGRVADSANAKKWEIELANLKSNNLRLTSA---LQESTANVEEWKKQLQA 96
            +LA   A+       E+EL +L    L L            +V+E ++++ A
Sbjct: 16 TRKLAEWAAELLEEAGAEVELIDLADLPLPLYDEDLEACGDPDDVQELREKIAA 69


>gnl|CDD|216884 pfam02119, FlgI, Flagellar P-ring protein. 
          Length = 342

 Score = 27.5 bits (62), Expect = 8.8
 Identities = 7/26 (26%), Positives = 11/26 (42%)

Query: 245 TLNHQLAGRVAELATIHREINTALQT 270
           T+N    GR+   A + RE+      
Sbjct: 135 TVNVPTVGRIPNGAIVEREVPFDFGQ 160


>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 879

 Score = 27.6 bits (62), Expect = 10.0
 Identities = 24/100 (24%), Positives = 38/100 (38%), Gaps = 5/100 (5%)

Query: 85  ANVEEWKKQLQAYKEENQKLKTKYIELEATKGVSEAAVELRKELTSVRSKN---ETLDSE 141
           A +E  ++Q    KE    LK     L   + + E   EL KEL  ++ K    E  D  
Sbjct: 705 AALEYLQEQEDLLKEIASLLKVPPELLPKVERLLEELKELEKELERLKKKLAAAELADLL 764

Query: 142 LKCRDEEVKKLTNAKHAVEDKCKAAVELRKELTSVRSKNE 181
              + EE+  +      V+       ELR+    ++ K  
Sbjct: 765 SNAKAEEIGGVKVLAKEVDG--ADMKELREIADDLKKKLG 802


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.309    0.121    0.317 

Gapped
Lambda     K      H
   0.267   0.0722    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,471,420
Number of extensions: 1155802
Number of successful extensions: 2479
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2034
Number of HSP's successfully gapped: 514
Length of query: 270
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 175
Effective length of database: 6,723,972
Effective search space: 1176695100
Effective search space used: 1176695100
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 58 (26.1 bits)