RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8596
         (237 letters)



>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 59.3 bits (144), Expect = 3e-10
 Identities = 56/231 (24%), Positives = 105/231 (45%), Gaps = 19/231 (8%)

Query: 7   SSEGALSSLEEAIGDKEKQMNQLREQRDRADQER---QEERDLHERDIAEYKLKLHAYES 63
             E  ++SLE +I +KE+++    E+  + + E      E +  ER+I E + +      
Sbjct: 298 ELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTE 357

Query: 64  EVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT 123
           E  EL     K EL+    E+     ++   R  L     + E+LK E+   +       
Sbjct: 358 EYAEL-----KEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKR------ 406

Query: 124 LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLK 183
                ELDRLQE+  +   +L    A +  ++A    +  E++    +++K + ++ +L 
Sbjct: 407 -----ELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLA 461

Query: 184 AKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQAT 234
           A L   + E+  +KEEY+R +  LS+L  E  +A  +   S+E +   +A 
Sbjct: 462 ADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAV 512



 Score = 44.7 bits (106), Expect = 2e-05
 Identities = 43/222 (19%), Positives = 95/222 (42%), Gaps = 11/222 (4%)

Query: 19  IGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEEL-----ELGKS 73
           I +K +Q+ +LR +R++A++ +   ++  E    E    L   E+   +      +L   
Sbjct: 193 IDEKRQQLERLRREREKAERYQALLKEKREY---EGYELLKEKEALERQKEAIERQLASL 249

Query: 74  KAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRL 133
           + EL+K   E+       E   Q L  L   N+++K   E  Q           AE+  L
Sbjct: 250 EEELEKLTEEISELEKRLEEIEQLLEEL---NKKIKDLGEEEQLRVKEKIGELEAEIASL 306

Query: 134 QEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEM 193
           +    +   +L  A+  L  ++A+ +++ +E + ++ ++E+ +    +L  +    + E+
Sbjct: 307 ERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL 366

Query: 194 ESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
           E ++ E E      +    E      +++K K E+   +  L
Sbjct: 367 EDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKREL 408



 Score = 42.4 bits (100), Expect = 1e-04
 Identities = 45/230 (19%), Positives = 106/230 (46%), Gaps = 14/230 (6%)

Query: 11  ALSSLEEAIGDKEKQMNQLREQRDRADQ---ERQEERDLHERDIAEYKLKLHAYESEVEE 67
            LS     IG+ EK++ QL ++ ++  +   E +E+    E++I   K +L   E+ +EE
Sbjct: 710 ELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEE 769

Query: 68  LELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTS 127
           LE    K E      E   S +     +  LS+LE E  R++  L   +    R TL   
Sbjct: 770 LEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKE 829

Query: 128 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLE 187
                +QE  ++        + +++ ++ + E +  +++ ++E++E+ +  +  L+++L 
Sbjct: 830 YLEKEIQELQEQ----RIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLG 885

Query: 188 NTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
           + + E + ++ +    +        + ++   +++K ++ L   +A L  
Sbjct: 886 DLKKERDELEAQLRELE-------RKIEELEAQIEKKRKRLSELKAKLEA 928



 Score = 41.2 bits (97), Expect = 3e-04
 Identities = 55/239 (23%), Positives = 105/239 (43%), Gaps = 27/239 (11%)

Query: 9   EGALSSLEEAIGDKEKQMNQLREQRDRADQERQEE----RDLHERDIAEYKLKLHAYESE 64
           E  L+SLEE +    +++++L ++ +  +Q  +E     +DL E +    K K+   E+E
Sbjct: 243 ERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAE 302

Query: 65  VEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTL 124
           +  LE       + +   E+  +          + +L  E E L+ E+E  +        
Sbjct: 303 IASLE-----RSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEER-------- 349

Query: 125 STSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEK-------SQG 177
                 D+L E+Y +   +L   +AEL  V  +    R E K  +EK+EK        + 
Sbjct: 350 ---KRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKR 406

Query: 178 EVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLG 236
           E+ RL+ +L+    E+  +       +  ++ L  E++   LE+ K + +LE+  A L 
Sbjct: 407 ELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLS 465



 Score = 38.9 bits (91), Expect = 0.002
 Identities = 41/205 (20%), Positives = 93/205 (45%), Gaps = 14/205 (6%)

Query: 9   EGALSSLEEAIGDKEKQMNQLREQRDRADQERQE---ERDLHERDIAEYKLKLHAYESEV 65
           E     L+E + + E+ ++ L ++ +    E +E     +  E D+ + +  L+  E+ +
Sbjct: 729 EQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARL 788

Query: 66  EELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLS 125
               + + +AEL K   EV R  A      Q+L+RL LE E L+ E++  Q         
Sbjct: 789 SHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQ--------- 839

Query: 126 TSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAK 185
              +   L+E+      ++     +   ++ + E + +  + ++ ++   + E   L+A+
Sbjct: 840 --EQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQ 897

Query: 186 LENTQGEMESMKEEYERTQGTLSRL 210
           L   + ++E ++ + E+ +  LS L
Sbjct: 898 LRELERKIEELEAQIEKKRKRLSEL 922



 Score = 34.3 bits (79), Expect = 0.054
 Identities = 40/208 (19%), Positives = 85/208 (40%), Gaps = 8/208 (3%)

Query: 30  REQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDKAANEVGRSGA 89
           +E  +R     +EE +    +I+E + +L   E  +EEL   K K   ++    V     
Sbjct: 239 KEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNK-KIKDLGEEEQLRVKEKIG 297

Query: 90  DWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQA 149
           + EA    L R   E ER   + E              AE+D+L  + ++   ++   + 
Sbjct: 298 ELEAEIASLERSIAEKERELEDAEERLA-------KLEAEIDKLLAEIEELEREIEEERK 350

Query: 150 ELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSR 209
               +  +   ++ E + ++ ++E+   E    + +L++ + ++E +K E    +  L R
Sbjct: 351 RRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDR 410

Query: 210 LLSERDKAVLEMDKSKEELERSQATLGE 237
           L  E  +   E+      +   +A + E
Sbjct: 411 LQEELQRLSEELADLNAAIAGIEAKINE 438



 Score = 33.1 bits (76), Expect = 0.14
 Identities = 41/180 (22%), Positives = 81/180 (45%), Gaps = 15/180 (8%)

Query: 69  ELGKSKAELDKAANEVGRSGADWEAARQRLSRLELEN---ERLKHELERSQTTFGRTTLS 125
           +  K+  EL++    + R     +  RQ+L RL  E    ER +  L+  +   G   L 
Sbjct: 171 KKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLK 230

Query: 126 TSAELDRLQEKYDKTCADLRRAQAELRV-VQADNERVRSEEKTMQEKVEK----SQGEVY 180
               L+R +E  ++  A L     +L   +    +R+   E+ ++E  +K     + E  
Sbjct: 231 EKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQL 290

Query: 181 RLKAKLENTQGEMESM-------KEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQA 233
           R+K K+   + E+ S+       + E E  +  L++L +E DK + E+++ + E+E  + 
Sbjct: 291 RVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERK 350



 Score = 31.6 bits (72), Expect = 0.38
 Identities = 41/192 (21%), Positives = 86/192 (44%), Gaps = 13/192 (6%)

Query: 14  SLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEEL--ELG 71
            L E   + ++++  LR + +  D+E  E RD       +Y+ KL   + E+ EL  EL 
Sbjct: 354 KLTEEYAELKEELEDLRAELEEVDKEFAETRDEL----KDYREKLEKLKREINELKRELD 409

Query: 72  KSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELD 131
           + + EL + + E+    A       +++ LE E E    E+++ +          +A+L 
Sbjct: 410 RLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLE----QLAADLS 465

Query: 132 RLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQG 191
           + +++      +  R + EL  +Q +     ++ +  +E+V   +     LKA ++   G
Sbjct: 466 KYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVLKASIQGVHG 525

Query: 192 ---EMESMKEEY 200
              ++ S+ E Y
Sbjct: 526 TVAQLGSVGERY 537



 Score = 28.9 bits (65), Expect = 3.3
 Identities = 49/251 (19%), Positives = 99/251 (39%), Gaps = 23/251 (9%)

Query: 10  GALSSLEEAIGDKEKQMNQLREQRDRADQ-ERQEERDLHERDIAEYKLKLHAYESEVEEL 68
           G   +    I     +  +L+  R+R +  +R+      E    E +L   + E      
Sbjct: 657 GGSRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASR 716

Query: 69  ELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSA 128
           ++G+ + E+++   E  +     E   + LS LE E E +K EL+  +            
Sbjct: 717 KIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHK 776

Query: 129 ----------------------ELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEK 166
                                 EL +L+E+  +  A LR  + +L  +  + E +  E +
Sbjct: 777 LEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQ 836

Query: 167 TMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKE 226
            +QE+    + ++  ++ ++EN  G+ E ++EE E  +  L  L S       E D+ + 
Sbjct: 837 ELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEA 896

Query: 227 ELERSQATLGE 237
           +L   +  + E
Sbjct: 897 QLRELERKIEE 907


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 53.9 bits (130), Expect = 2e-08
 Identities = 51/239 (21%), Positives = 103/239 (43%), Gaps = 18/239 (7%)

Query: 7   SSEGALSSLEEAIGDKEKQMNQLREQRDRADQERQE---ERDLHERDIAEYKLKLHAYES 63
                +S+L + +   E ++ QL E+  +  +E  E   E +  E  + E + +L   E+
Sbjct: 723 ELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782

Query: 64  EVEELE---------LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELER 114
           E+EELE         L   +  LD+   E+     +    R+RL  LE      +  LE 
Sbjct: 783 EIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLED 842

Query: 115 SQTTFGRTTL---STSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEK 171
            +      +    S +AE++ L+E  ++  ++L     E   ++     +RSE + + E+
Sbjct: 843 LEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEE 902

Query: 172 VEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRL---LSERDKAVLEMDKSKEE 227
           + + + +   L+ +LE  + ++  ++   E  +  +  L   LSE     LE  ++ E 
Sbjct: 903 LRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALEN 961



 Score = 48.9 bits (117), Expect = 9e-07
 Identities = 59/219 (26%), Positives = 111/219 (50%), Gaps = 11/219 (5%)

Query: 24  KQMNQLREQRDRADQER---QEERDLHERDIAEYKLKLHAYESEVEELE--LGKSKAELD 78
            ++ +LRE+ +   +E    +EE +    ++ E + KL     EV ELE  + + + EL 
Sbjct: 232 LRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELY 291

Query: 79  KAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYD 138
             ANE+ R     +  R+RL+ LE + E L+ +LE  +          + EL  L+EK +
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELE----SKLDELAEELAELEEKLE 347

Query: 139 KTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKE 198
           +   +L   +AEL  ++A+ E + S  + ++E++E  + +V +L+ ++ +   E+E ++ 
Sbjct: 348 ELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEA 407

Query: 199 EYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
             ER +    RL  E ++  L     + EL+  QA L E
Sbjct: 408 RLERLEDRRERLQQEIEE--LLKKLEEAELKELQAELEE 444



 Score = 45.4 bits (108), Expect = 1e-05
 Identities = 56/236 (23%), Positives = 106/236 (44%), Gaps = 16/236 (6%)

Query: 7   SSEGALSSLEEAIGDKEKQMNQLREQRDRA-------DQERQEERDLHERDIAEYKLKLH 59
            +   L  LE+ + + E+Q+  L  Q ++A        + R+ E  L    + E + +L 
Sbjct: 183 RTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELE 242

Query: 60  AYESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTF 119
             + E++E E      EL++   E+       E  R  +S LE E E L+ EL       
Sbjct: 243 ELQEELKEAEE-----ELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEI 297

Query: 120 GRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEV 179
            R       +   L+E+       L   +A+L  +++  + +  E   ++EK+E+ + E+
Sbjct: 298 SRLE----QQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353

Query: 180 YRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
             L+A+LE  + E+E ++   E  +  L  L S+  +  L++     E+ER +A L
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARL 409



 Score = 43.9 bits (104), Expect = 4e-05
 Identities = 44/227 (19%), Positives = 96/227 (42%), Gaps = 6/227 (2%)

Query: 11  ALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELEL 70
           AL+ L + + + E+++ QLR++ +   ++    R    R   E +++           EL
Sbjct: 699 ALAELRKELEELEEELEQLRKELEELSRQISALRKDLAR--LEAEVEQLEERIAQLSKEL 756

Query: 71  GKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAEL 130
            + +AE+++    +  +  +   A   +  LE + E+LK EL+              AEL
Sbjct: 757 TELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK----ALREALDELRAEL 812

Query: 131 DRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQ 190
             L E+       L   +  +   +   E +  + + + E +E    E+  L+  +E  +
Sbjct: 813 TLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELE 872

Query: 191 GEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
            E+E++  E    +  L+ L SE ++   E+ + + +    +  L E
Sbjct: 873 SELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEE 919



 Score = 43.9 bits (104), Expect = 4e-05
 Identities = 53/226 (23%), Positives = 87/226 (38%), Gaps = 12/226 (5%)

Query: 15  LEEAIGDKEKQMNQLREQRDRADQERQEERDL---HERDIAEYKLKLHAYESEVEELELG 71
           LEE I + EK + +LR++ +  ++E ++ R       R I+  +  L   E+EVE+LE  
Sbjct: 689 LEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEER 748

Query: 72  KSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELD 131
            ++   +    E      +           E E E    ELE           +    LD
Sbjct: 749 IAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE--ELEAQIEQLKEELKALREALD 806

Query: 132 RLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQG 191
            L+ +       L    A LR      ER  +  +   E +E+       L   +E+   
Sbjct: 807 ELRAELT----LLNEEAANLRERLESLERRIAATERRLEDLEEQIE---ELSEDIESLAA 859

Query: 192 EMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
           E+E ++E  E  +  L  LL+ER      +   + ELE     L E
Sbjct: 860 EIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRE 905



 Score = 41.2 bits (97), Expect = 3e-04
 Identities = 51/217 (23%), Positives = 99/217 (45%), Gaps = 20/217 (9%)

Query: 9   EGALSSLEEAIGDKEKQMNQLREQRDRADQ---ERQEERDLHERDIAEYKLKLHAYESEV 65
           +  L +L   I   E+Q   LRE+    ++   E + + +  E  + E   +L   E ++
Sbjct: 287 QKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKL 346

Query: 66  EEL--ELGKSKAELDKAANEVGRSGADWEAARQRLSRL-----ELEN---------ERLK 109
           EEL  EL   +AEL++   E+    +  E   ++L  L     +LE          ERL+
Sbjct: 347 EELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLE 406

Query: 110 HELERSQTTFGRTTLSTSAELDRLQE-KYDKTCADLRRAQAELRVVQADNERVRSEEKTM 168
             LER +    R        L +L+E +  +  A+L   + EL  +Q + ER+    + +
Sbjct: 407 ARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEEL 466

Query: 169 QEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQG 205
           +E++E+++  +   + +L   Q  ++S++   E  +G
Sbjct: 467 REELEEAEQALDAAERELAQLQARLDSLERLQENLEG 503



 Score = 41.2 bits (97), Expect = 4e-04
 Identities = 44/218 (20%), Positives = 91/218 (41%), Gaps = 11/218 (5%)

Query: 20  GDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDK 79
           G  +   + L  +R R  +E +E+ +  E  IAE +  L     E+EELE      EL++
Sbjct: 664 GSAKTNSSIL--ERRREIEELEEKIEELEEKIAELEKALAELRKELEELE-----EELEQ 716

Query: 80  AANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDK 139
              E+        A R+ L+RLE E E+L+  + +                +RL+E  ++
Sbjct: 717 LRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEE 776

Query: 140 TCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEE 199
                   +     ++   E +++  + + E     + E+  L  +  N +  +ES++  
Sbjct: 777 LAEAEAEIEELEAQIEQLKEELKALREALDEL----RAELTLLNEEAANLRERLESLERR 832

Query: 200 YERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
              T+  L  L  + ++   +++    E+E  +  + E
Sbjct: 833 IAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEE 870



 Score = 32.7 bits (75), Expect = 0.16
 Identities = 29/151 (19%), Positives = 63/151 (41%), Gaps = 3/151 (1%)

Query: 17  EAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAE 76
           E++  + +++    E+ +   +E +E+ +     +A+ +L++ +  +E+E LE    + E
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLE 413

Query: 77  LDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT---LSTSAELDRL 133
             +   +        +     L  L+ E E L+ ELE  Q    R          EL+  
Sbjct: 414 DRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEA 473

Query: 134 QEKYDKTCADLRRAQAELRVVQADNERVRSE 164
           ++  D    +L + QA L  ++   E +   
Sbjct: 474 EQALDAAERELAQLQARLDSLERLQENLEGF 504


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 47.4 bits (113), Expect = 3e-06
 Identities = 51/229 (22%), Positives = 97/229 (42%), Gaps = 5/229 (2%)

Query: 12  LSSLEEAIGDKEKQMNQLREQRDRAD--QERQEERDLHERDIAEYKLKLHAYESEVEELE 69
           L  LE+ + + EKQ+ +L  Q ++A+  QE + E    E  +   KLK    E E  E E
Sbjct: 188 LERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEE 247

Query: 70  LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTS-- 127
           L + + EL++   E+  +  + E  +  L  L  E E L+ EL   +          S  
Sbjct: 248 LSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLL 307

Query: 128 -AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKL 186
              L+ L+ + ++    L   + ++  ++ + E   +  + +++ + + +     L+ KL
Sbjct: 308 RERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKL 367

Query: 187 ENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
                E+E + E        L   L+E    + E+ +  E LE     L
Sbjct: 368 SALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERL 416



 Score = 47.0 bits (112), Expect = 5e-06
 Identities = 46/232 (19%), Positives = 100/232 (43%), Gaps = 5/232 (2%)

Query: 9   EGALSSLEEAIGDKEKQMNQLREQRDRADQER---QEERDLHERDIAEYKLKLHAYESEV 65
           +  L  L E + + ++++ +L+E+ +  + E    +E  +  E ++ E + +L   + ++
Sbjct: 273 KSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKI 332

Query: 66  EEL--ELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT 123
           E L  EL + +  L++    +       E   ++LS L  E E L   L           
Sbjct: 333 EALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAEL 392

Query: 124 LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLK 183
                EL+ L+ + +     L R    L  ++ + + + +E + +Q ++E+   E+  L+
Sbjct: 393 AEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELE 452

Query: 184 AKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
            +LE  +  ++ ++ E    Q  L RL  E       +D+ + E   SQ   
Sbjct: 453 EQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVR 504



 Score = 45.9 bits (109), Expect = 9e-06
 Identities = 46/227 (20%), Positives = 102/227 (44%), Gaps = 5/227 (2%)

Query: 7   SSEGALSSLEEAIGDKEKQMNQLREQRDRADQER---QEERDLHERDIAEYKLKLHAYES 63
           S E  L  L   + + E+Q+ +L+ +    ++E    Q   +  E ++ E + +L   + 
Sbjct: 699 SLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQE 758

Query: 64  EVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT 123
            +EELE      E   A  +      + +    +    ELE E  + E         R  
Sbjct: 759 RLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDA--LEREL 816

Query: 124 LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLK 183
            S     +RL+++ ++   ++   + +L  ++ + E +  E + ++E++E+ + E   L+
Sbjct: 817 ESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELE 876

Query: 184 AKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELER 230
            +L+  + E E ++EE    +  L+ L  E +K    +++ + +LER
Sbjct: 877 DELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLER 923



 Score = 44.7 bits (106), Expect = 2e-05
 Identities = 59/234 (25%), Positives = 113/234 (48%), Gaps = 9/234 (3%)

Query: 9   EGALSSLEEAIGDKEKQMNQLREQRDRADQERQEER---DLHERDIAEYKLKLHAYESEV 65
           E  L+ LE  +   E+++  L+ +    +   +E R   +  ER + E K +L A E E+
Sbjct: 673 EEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEEL 732

Query: 66  EELE--LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT 123
           E+L+  L + + EL++   E+       E   + L  LE    +LK E+E  +       
Sbjct: 733 EQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQ 792

Query: 124 LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLK 183
                EL+ L+E+ ++    L   + EL  ++   ER+  E + ++E++E+ + ++  L+
Sbjct: 793 ----EELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELE 848

Query: 184 AKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
            +LE  + E+E +KEE E  +     L  E  +   E ++ +EEL   ++ L E
Sbjct: 849 EELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAE 902



 Score = 42.8 bits (101), Expect = 1e-04
 Identities = 53/241 (21%), Positives = 112/241 (46%), Gaps = 12/241 (4%)

Query: 9   EGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDL----------HERDIAEYKLKL 58
           E  L  L+E + + EK++ +L+ + +   +E +E ++            E +I+  + +L
Sbjct: 252 EEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERL 311

Query: 59  HAYESEVEELE--LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQ 116
              E+E+EELE  L + K +++    E+       E   Q L+ LE   E L+ +L    
Sbjct: 312 EELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALL 371

Query: 117 TTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQ 176
                   +   EL  L+ +  +   +L   + E+  ++   ER+    + ++E++++ +
Sbjct: 372 EELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELE 431

Query: 177 GEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLG 236
            E+  L+ +LE    E+E ++E+ E  +  L  L  E  +   E+ + ++EL   +A L 
Sbjct: 432 AELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLD 491

Query: 237 E 237
            
Sbjct: 492 R 492



 Score = 42.0 bits (99), Expect = 2e-04
 Identities = 49/232 (21%), Positives = 106/232 (45%), Gaps = 11/232 (4%)

Query: 7   SSEGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVE 66
                L  LEE +   E+++ +L+E+ + A++E +E     + ++ E + +L   + E+ 
Sbjct: 236 ELRKELEELEEELSRLEEELEELQEELEEAEKEIEE----LKSELEELREELEELQEELL 291

Query: 67  ELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLST 126
           EL+    + E + +           E     L  LE   E LK ++E  +        + 
Sbjct: 292 ELKEEIEELEGEISLLRERL-----EELENELEELEERLEELKEKIEALKEELEER-ETL 345

Query: 127 SAELDRLQEKYDKTCADLRRAQAELRV-VQADNERVRSEEKTMQEKVEKSQGEVYRLKAK 185
             EL++L  + ++   +L    + L   ++   E +R E   ++ ++ + + E+  LK +
Sbjct: 346 LEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKRE 405

Query: 186 LENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
           +E+ +  +E + E  E  +  L  L +E ++   E+++  EELE  +  L E
Sbjct: 406 IESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEE 457



 Score = 40.5 bits (95), Expect = 5e-04
 Identities = 53/227 (23%), Positives = 117/227 (51%), Gaps = 16/227 (7%)

Query: 9   EGALSSLEEAIGDKEKQMNQLREQRDRADQER---QEERDLHERDIAEYKLKLHAYESEV 65
           +  L++LEE +   + ++ +L E+ +  ++E    QE  +  E ++   +  L   + E+
Sbjct: 722 KRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEI 781

Query: 66  EELE--LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT 123
           EELE      + EL++   E+  +    +A  + L  LE   ERL+ E+E  +       
Sbjct: 782 EELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELE------- 834

Query: 124 LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLK 183
                E++ L+EK D+   +L   + EL  ++ + E + +E++ +++++++ + E   L+
Sbjct: 835 ----EEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELE 890

Query: 184 AKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELER 230
            +L   + E+  +KEE E+ +  L  L ++ ++  +E+ + +EELE 
Sbjct: 891 EELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEE 937



 Score = 39.7 bits (93), Expect = 9e-04
 Identities = 51/226 (22%), Positives = 95/226 (42%), Gaps = 24/226 (10%)

Query: 7    SSEGALSSLEEAIGDKEKQMNQLREQRDRADQE---RQEERDLHERDIAEYKLKLHAYES 63
              E  L   E  +   E+++  L ++R+R +QE    +EE +  E  + E + +L   E 
Sbjct: 797  ELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEK 856

Query: 64   EVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTT 123
            E+EEL     K EL++   E      + +   +    LE E   L+ EL   +       
Sbjct: 857  ELEEL-----KEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKE------ 905

Query: 124  LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLK 183
                 E+++L+E+ ++  A L R + EL  ++ + E    +  T++ ++E+    +    
Sbjct: 906  -----EIEKLRERLEELEAKLERLEVELPELEEELEEEYED--TLETELEREIERLEEEI 958

Query: 184  AKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELE 229
              L           EEYE  +     L S+R+      +K  E +E
Sbjct: 959  EALGPVN---LRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIE 1001



 Score = 38.2 bits (89), Expect = 0.003
 Identities = 49/191 (25%), Positives = 91/191 (47%), Gaps = 12/191 (6%)

Query: 55  KLKLHAYESEVEELE--LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHEL 112
           K +L   E E+ ELE  L K + EL    NE+       E  R++L  LE + E LK EL
Sbjct: 666 KRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKREL 725

Query: 113 ---ERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQ-------ADNERVR 162
              E               EL+ L+E+ ++    L   + EL  ++        + E + 
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELE 785

Query: 163 SEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMD 222
            + + +QE++E+ + E+   + +L+  + E+ES+++  ER +  +  L  E ++   ++D
Sbjct: 786 EKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLD 845

Query: 223 KSKEELERSQA 233
           + +EELE  + 
Sbjct: 846 ELEEELEELEK 856



 Score = 35.8 bits (83), Expect = 0.017
 Identities = 37/146 (25%), Positives = 69/146 (47%), Gaps = 4/146 (2%)

Query: 92  EAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAEL 151
             A +     EL+ E  + EL                EL+ L+E+  +   +L   Q EL
Sbjct: 207 RQAEKAERYQELKAELRELEL----ALLLAKLKELRKELEELEEELSRLEEELEELQEEL 262

Query: 152 RVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLL 211
              + + E ++SE + ++E++E+ Q E+  LK ++E  +GE+  ++E  E  +  L  L 
Sbjct: 263 EEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELE 322

Query: 212 SERDKAVLEMDKSKEELERSQATLGE 237
              ++   +++  KEELE  +  L E
Sbjct: 323 ERLEELKEKIEALKEELEERETLLEE 348



 Score = 33.1 bits (76), Expect = 0.13
 Identities = 29/124 (23%), Positives = 61/124 (49%)

Query: 114 RSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVE 173
           RS     R       EL  L+ + +K   +L+  + ELR ++   E +R + + ++ ++E
Sbjct: 660 RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLE 719

Query: 174 KSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQA 233
           + + E+  L+ +LE  Q  +E ++EE E  +  L  L    ++   E++  +E L + + 
Sbjct: 720 ELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKE 779

Query: 234 TLGE 237
            + E
Sbjct: 780 EIEE 783



 Score = 29.3 bits (66), Expect = 2.0
 Identities = 35/162 (21%), Positives = 74/162 (45%), Gaps = 11/162 (6%)

Query: 74  KAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRL 133
           K EL +   E+    A  E   + L  L+ E   L+  LE  +             L+ L
Sbjct: 666 KRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQ-----------LEEL 714

Query: 134 QEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEM 193
           + + ++   +L   + EL  +Q+  E +  E + ++E++E+ Q  +  L+ +LE+ +  +
Sbjct: 715 ERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEAL 774

Query: 194 ESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
             +KEE E  +     L  E ++   E+++++  L+  +  L
Sbjct: 775 AKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALEREL 816



 Score = 28.9 bits (65), Expect = 3.2
 Identities = 31/150 (20%), Positives = 66/150 (44%), Gaps = 1/150 (0%)

Query: 87  SGADWEAARQRLSRLELENERLKHELERSQTTFGRTTL-STSAELDRLQEKYDKTCADLR 145
           +G           + EL+    +     +Q       L S   EL  L++  ++    L 
Sbjct: 653 TGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLE 712

Query: 146 RAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQG 205
             + +L  ++ +   +  E + +Q ++E+ + E+  L+ +LE  Q  +E ++EE E  + 
Sbjct: 713 ELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEE 772

Query: 206 TLSRLLSERDKAVLEMDKSKEELERSQATL 235
            L++L  E ++   +    +EELE  +  L
Sbjct: 773 ALAKLKEEIEELEEKRQALQEELEELEEEL 802


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 40.4 bits (94), Expect = 6e-04
 Identities = 45/144 (31%), Positives = 79/144 (54%), Gaps = 6/144 (4%)

Query: 97  RLSRLELENERLKHELERSQTTFGRTTLS----TSAELDRLQEKYDKTCADLRRAQAELR 152
           +L   + ENE LK ELER Q+  G +  S     S EL R +       A +   + +LR
Sbjct: 4   QLENAQRENELLKKELERKQSKLGSSMNSIKTFWSPELKRERALRKDEAARITVWKEQLR 63

Query: 153 VVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLS 212
           V Q +N+  +   + +QE++ KSQ E+ RL+ KLE +  E   ++ + E+++G  SR+ +
Sbjct: 64  VTQEENQHAQLTVQALQEEL-KSQRELNRLQQKLEKSGNETGELQ-QLEKSEGQFSRIQA 121

Query: 213 ERDKAVLEMDKSKEELERSQATLG 236
           ER++  +E+   ++ LE+ Q  + 
Sbjct: 122 ERERQAMELFLLRDTLEKLQLRME 145



 Score = 30.8 bits (69), Expect = 0.78
 Identities = 44/176 (25%), Positives = 79/176 (44%), Gaps = 6/176 (3%)

Query: 61  YESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFG 120
           YE E +   L K  A L  A  E      + E  R R  + E +   L+ ++E  Q TF 
Sbjct: 350 YELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQETFR 409

Query: 121 RTTLSTSAELDRLQEKYDKTCADLRRAQAEL------RVVQADNERVRSEEKTMQEKVEK 174
           R       E +RL+     T  D    + E       R+++   E+   +E+  QE+ E 
Sbjct: 410 RKERRLKEEKERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQRDRDERYEQEEFET 469

Query: 175 SQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELER 230
            + E   LK +++N Q ++   + + E  +  +S+L S + K   +++++  ELE+
Sbjct: 470 YKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKLASNQLKQRSDLERAHIELEK 525



 Score = 30.0 bits (67), Expect = 1.2
 Identities = 48/212 (22%), Positives = 96/212 (45%), Gaps = 3/212 (1%)

Query: 11  ALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELEL 70
           AL  LE+A+ +KE+ + +L+EQRDR ++  QEE + ++++  + K ++   + ++ E EL
Sbjct: 434 ALEKLEKALAEKERIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQLKLSEREL 493

Query: 71  GKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAEL 130
              + EL K       S    + +    + +ELE  R KHE    +    R    ++   
Sbjct: 494 ---QLELLKEEVSKLASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLRANPESADRG 550

Query: 131 DRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQ 190
             +     ++ AD   A+ E+  +    E+   E    + +  +   E+ + +  L   Q
Sbjct: 551 SAVDAGTSRSRADSAGARNEVDRLLDRLEKAEQERDDTEMEAGRLAKELEKAQRHLTKQQ 610

Query: 191 GEMESMKEEYERTQGTLSRLLSERDKAVLEMD 222
            + E+ + E+ER    L       +K+  E +
Sbjct: 611 EKTEATRIEFERKSAELLEEAERLEKSEAEEE 642


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 39.7 bits (93), Expect = 8e-04
 Identities = 43/196 (21%), Positives = 78/196 (39%), Gaps = 4/196 (2%)

Query: 7   SSEGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVE 66
             E  L+ +EE I    + +N+L E  ++  +E     +L+E    +     H Y     
Sbjct: 108 DIEQQLTLIEEDIEQILEDLNELVESEEKNSEEIDHVLELYEELRRDVLANRHQYGEAAP 167

Query: 67  ELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLST 126
           ELE      E + +      S  D+  AR+ L   E     L+  +ER  +         
Sbjct: 168 ELEKKLENIEEELSQFVELTSSGDYIEAREVLEEAEEHMIALRSIMERIPSLLAELQTEL 227

Query: 127 SAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKL 186
             +L  L+  Y     D++     L  V  D+   R +E+ ++     +Q E+   + +L
Sbjct: 228 PGQLQDLKAGYR----DMKEEGYHLEHVNIDSRLERLKEQLVENSELLTQLELDEAEEEL 283

Query: 187 ENTQGEMESMKEEYER 202
              Q ++ES+ +  ER
Sbjct: 284 GLIQEKIESLYDLLER 299


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 38.3 bits (89), Expect = 0.002
 Identities = 61/243 (25%), Positives = 111/243 (45%), Gaps = 39/243 (16%)

Query: 8   SEGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEE 67
           S+  L  L   I + EK +  L + R +A ++   E+ L E++  + K+        + E
Sbjct: 126 SDFQLEDLVGMIQNAEKNILLLNQARLQALEDL--EKILTEKEALQGKIN-------ILE 176

Query: 68  LELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTS 127
           + L ++ A +  AA E           +  +  LE + E+L++EL     T G    S S
Sbjct: 177 MRLSETDARIKLAAQE-----------KIHVEILEEQLEKLRNELLIRGATEGLCVHSLS 225

Query: 128 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKS--QGEVYRLKAK 185
            ELD L+E+      D++  +AEL  V    ERV   EK      E+S     +  L++K
Sbjct: 226 KELDVLKEENMLLKDDIQFLKAELIEVAETEERVFKLEK------ERSLLDASLRELESK 279

Query: 186 LENTQGEM--------ESMKEEYERTQGTLSRLLSERDKAVLEMDKSKE---ELERSQAT 234
               Q ++        +   E+ E  Q  L R  ++ +KA L +D++++   ++++ +A+
Sbjct: 280 FIVAQEDVSKLSPLQYDCWWEKVENLQDLLDRATNQVEKAALVLDQNQDLRDKVDKLEAS 339

Query: 235 LGE 237
           L E
Sbjct: 340 LKE 342


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 37.5 bits (87), Expect = 0.005
 Identities = 37/239 (15%), Positives = 86/239 (35%), Gaps = 19/239 (7%)

Query: 11  ALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELEL 70
            ++ L+  +   + ++ +L  +       ++   D      ++  L+     +   E + 
Sbjct: 743 RIAELDARLAAVDDELAELARELRALGARQRALADELAGAPSDRSLRAAHRRAAEAERQA 802

Query: 71  GKSKAELDKAANEVGRSGADWEAARQRLSR-------------LELENERLKHELERSQT 117
             ++ EL +AA +   + A W+ AR+ L R             LE     LK   +   T
Sbjct: 803 ESAERELARAARKAAAAAAAWKQARRELERDAADLDLPTDPDALEAVGLALKRFGDHLHT 862

Query: 118 TFGRT--TLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQE----K 171
                      +      + +  +  +D R A  +    +A+ E      +T++E     
Sbjct: 863 LEVAVRELRHAATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAM 922

Query: 172 VEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELER 230
           V++ +  +   +A L +   E+  + E     +    R   +R +A   +D+  E  + 
Sbjct: 923 VDEIRARLAETRAALASGGRELPRLAEALATAEEARGRAEEKRAEADATLDERAEARDH 981



 Score = 27.5 bits (61), Expect = 8.1
 Identities = 29/132 (21%), Positives = 54/132 (40%), Gaps = 8/132 (6%)

Query: 83  EVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCA 142
           +V  +    +  R  L RLE     L++ L+R +          +  L   Q +YD+   
Sbjct: 224 DVADALEQLDEYRDELERLEALERALRNFLQRYRRYARTMLRRRATRLRSAQTQYDQLSR 283

Query: 143 DLRR-------AQAELRVVQADNERVRSEEKTMQEKVEKSQG-EVYRLKAKLENTQGEME 194
           DL R       A+ E R + A  E +  E   ++ ++E  QG   Y+   +LE  + + E
Sbjct: 284 DLGRARDELETAREEERELDARTEALEREADALRTRLEALQGSPAYQDAEELERARADAE 343

Query: 195 SMKEEYERTQGT 206
           +++      +  
Sbjct: 344 ALQAAAADARQA 355


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 36.9 bits (86), Expect = 0.007
 Identities = 61/242 (25%), Positives = 101/242 (41%), Gaps = 33/242 (13%)

Query: 9   EGALSSLEEAIGDKEKQMNQLREQRDRAD------QERQEERDLHERDIAEYKLKLHAYE 62
           E  L+ L+E I   E+Q  Q RE RD AD      +ER+EE +  E +I + +  +   E
Sbjct: 212 ESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETE 271

Query: 63  SEVEEL---------ELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELE 113
            E EEL          L + + E D    E G   AD EA   R   LE  +E L+  LE
Sbjct: 272 REREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLE 331

Query: 114 RSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVE 173
             +            E + L+E  D               ++   E +R E   ++ ++E
Sbjct: 332 ECRVAAQAHN----EEAESLREDADD--------------LEERAEELREEAAELESELE 373

Query: 174 KSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQA 233
           +++  V   + ++E  + E+E ++E +      L       ++   E D+ +E     +A
Sbjct: 374 EAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEA 433

Query: 234 TL 235
           TL
Sbjct: 434 TL 435


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 36.5 bits (84), Expect = 0.009
 Identities = 24/117 (20%), Positives = 50/117 (42%)

Query: 118 TFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQG 177
              R   S   +LD ++ +      +L  AQ E R  + + E  RSE +  +++ E  + 
Sbjct: 64  LLNRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQ 123

Query: 178 EVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQAT 234
           E+   +  L   Q E+  + ++ +  Q  L  L  +R +   +    +   ++ QA+
Sbjct: 124 ELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQAS 180



 Score = 30.0 bits (67), Expect = 1.2
 Identities = 36/188 (19%), Positives = 69/188 (36%), Gaps = 5/188 (2%)

Query: 21  DKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDKA 80
           D   Q+  LR +   A  E++      ER+ A  +L+    E E    EL  ++  L KA
Sbjct: 78  DIRPQLRALRTELGTAQGEKRAAET--EREAARSELQKARQEREAVRQELAAARQNLAKA 135

Query: 81  ANEVGRSGADWEAARQRLSRLELENERL---KHELERSQTTFGRTTLSTSAELDRLQEKY 137
             E+ R     +  + RL  L  +  +L      L+ SQ     +     +++  L+ + 
Sbjct: 136 QQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRS 195

Query: 138 DKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMK 197
            +   + +         QA  E +       Q+  +  Q    ++  K +      E ++
Sbjct: 196 AQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIR 255

Query: 198 EEYERTQG 205
           E   + Q 
Sbjct: 256 ERERQLQR 263


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 35.8 bits (83), Expect = 0.018
 Identities = 47/247 (19%), Positives = 85/247 (34%), Gaps = 48/247 (19%)

Query: 9   EGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESE---- 64
           E  L SLE+ I + E+ +N L E++   D+  +E  +L   DI ++K  L   E      
Sbjct: 87  EENLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNLDI-DFK-YLRGAEGLKLGF 144

Query: 65  ----VEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFG 120
               +   +L   + EL             W A R  + + E+E E L+           
Sbjct: 145 VAGVINREKLEAFEREL-------------WRACRGYIRQAEIE-EPLEDP--------K 182

Query: 121 RTTLSTSAELDRLQEKYDKTCADLR------RAQAELRVVQADNERVRSEEKTMQEKVEK 174
           +T            +K  K                  R         R EE      +E+
Sbjct: 183 KTVFIIFFVGKEDLDKVKKILDSFGFELYDVPETEGERSELISKVNKRIEELQRV--LEQ 240

Query: 175 SQGEVYRLKAKLENT----QGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELER 230
           ++  + ++  K+ +       ++   K  YE    TL+    +    + E     ++LE+
Sbjct: 241 TESHLEKVLVKIADELLAWDEQVSKEKAVYE----TLNLFNYDTKTLIAEGWVPAKDLEK 296

Query: 231 SQATLGE 237
            +A L  
Sbjct: 297 LKAALEN 303


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 34.6 bits (80), Expect = 0.026
 Identities = 23/94 (24%), Positives = 43/94 (45%), Gaps = 1/94 (1%)

Query: 67  ELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLST 126
           + EL + + EL +   ++     + +  +Q LS LE E ERL+ EL R +       +  
Sbjct: 72  QQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANA-IEL 130

Query: 127 SAELDRLQEKYDKTCADLRRAQAELRVVQADNER 160
             E   L+E+  +   +    +AE   +Q + +R
Sbjct: 131 DEENRELREELAELKQENEALEAENERLQENEQR 164



 Score = 31.1 bits (71), Expect = 0.34
 Identities = 23/103 (22%), Positives = 46/103 (44%), Gaps = 10/103 (9%)

Query: 144 LRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMK------ 197
           L   Q EL  +Q +   ++ +   +Q++ ++ + E+  L+A+LE  Q E+  +K      
Sbjct: 68  LPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANA 127

Query: 198 ----EEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLG 236
               EE    +  L+ L  E +    E ++ +E  +R     G
Sbjct: 128 IELDEENRELREELAELKQENEALEAENERLQENEQRRWFLYG 170



 Score = 29.2 bits (66), Expect = 1.6
 Identities = 23/96 (23%), Positives = 46/96 (47%), Gaps = 6/96 (6%)

Query: 23  EKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDKAAN 82
           ++++ +L+E+     ++  E +     +  E K +L   E+E+E L+  K  A + + + 
Sbjct: 72  QQELAELQEELAELQEQLAELQQ----ENQELKQELSTLEAELERLQ--KELARIKQLSA 125

Query: 83  EVGRSGADWEAARQRLSRLELENERLKHELERSQTT 118
                  +    R+ L+ L+ ENE L+ E ER Q  
Sbjct: 126 NAIELDEENRELREELAELKQENEALEAENERLQEN 161



 Score = 28.1 bits (63), Expect = 2.9
 Identities = 25/107 (23%), Positives = 44/107 (41%), Gaps = 14/107 (13%)

Query: 92  EAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAEL 151
            +AR+RL  L+ E   L+ EL   Q            E   L+++     A+L R Q EL
Sbjct: 62  PSARERLPELQQELAELQEELAELQEQLAEL----QQENQELKQELSTLEAELERLQKEL 117

Query: 152 RVVQ----------ADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 188
             ++           +N  +R E   ++++ E  + E  RL+   + 
Sbjct: 118 ARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQENEQR 164



 Score = 26.9 bits (60), Expect = 9.1
 Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 6/109 (5%)

Query: 12  LSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELG 71
           L  L++ + + ++++ +L+EQ     QE QE +           L+      + E   + 
Sbjct: 68  LPELQQELAELQEELAELQEQLAELQQENQELKQELS------TLEAELERLQKELARIK 121

Query: 72  KSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFG 120
           +  A   +   E      +    +Q    LE ENERL+   +R    +G
Sbjct: 122 QLSANAIELDEENRELREELAELKQENEALEAENERLQENEQRRWFLYG 170


>gnl|CDD|153339 cd07655, F-BAR_PACSIN, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Protein kinase C and
           Casein kinase Substrate in Neurons (PACSIN) proteins.
           F-BAR domains are dimerization modules that bind and
           bend membranes and are found in proteins involved in
           membrane dynamics and actin reorganization. Protein
           kinase C and Casein kinase Substrate in Neurons (PACSIN)
           proteins, also called Synaptic dynamin-associated
           proteins (Syndapins), act as regulators of cytoskeletal
           and membrane dynamics. They bind both dynamin and
           Wiskott-Aldrich syndrome protein (WASP), and may provide
           direct links between the actin cytoskeletal machinery
           through WASP and dynamin-dependent endocytosis.
           Vetebrates harbor three isoforms with distinct
           expression patterns and specific functions. PACSINs
           contain an N-terminal F-BAR domain and a C-terminal SH3
           domain. F-BAR domains form banana-shaped dimers with a
           positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 258

 Score = 34.6 bits (80), Expect = 0.027
 Identities = 20/85 (23%), Positives = 41/85 (48%), Gaps = 8/85 (9%)

Query: 128 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLE 187
            ++++ ++ Y   C   + AQ +    ++D      + K +Q+KVEK + EV + K K E
Sbjct: 133 KKVEKAKKAYHAACKAEKSAQKQENNAKSDTSLSPDQVKKLQDKVEKCKQEVSKTKDKYE 192

Query: 188 NTQGE--------MESMKEEYERTQ 204
               +        ME M++ +++ Q
Sbjct: 193 KALEDLNKYNPRYMEDMEQVFDKCQ 217


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 35.1 bits (81), Expect = 0.028
 Identities = 32/210 (15%), Positives = 82/210 (39%), Gaps = 20/210 (9%)

Query: 36  ADQERQEERDLHERDIAEYKLKLHAYESEVEELE--LGKSKAELDKAANEVGRSGADWEA 93
           +     ++    +++IA  + K+   + +  +LE  L   + E+     ++  +  D + 
Sbjct: 32  SAAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKK 91

Query: 94  ARQRLSRLELENERLKHELERSQ-------TTFGRTTLSTSAELDRLQEKYDKTCADLRR 146
            R++++ L      L+ +    +           R+  +    L    E   ++   +R 
Sbjct: 92  LRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRS---VRL 148

Query: 147 AQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGT 206
           A     +  A  ER+ + + T          ++  ++A++   Q E+ ++  E    Q  
Sbjct: 149 AIYYGALNPARAERIDALKAT--------LKQLAAVRAEIAAEQAELTTLLSEQRAQQAK 200

Query: 207 LSRLLSERDKAVLEMDKSKEELERSQATLG 236
           L++LL ER K + +++      ++    L 
Sbjct: 201 LAQLLEERKKTLAQLNSELSADQKKLEELR 230



 Score = 29.7 bits (67), Expect = 1.3
 Identities = 42/222 (18%), Positives = 81/222 (36%), Gaps = 23/222 (10%)

Query: 12  LSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELG 71
            + LE+ +   E ++  L  Q      + ++ R    + IA+   +L+A   EV+E E  
Sbjct: 61  RAKLEKQLKSLETEIASLEAQLIETADDLKKLR----KQIADLNARLNA--LEVQEREQR 114

Query: 72  KSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELD 131
           +  AE   A    GR+          +S  + +         R    +G           
Sbjct: 115 RRLAEQLAALQRSGRN----PPPALLVSPEDAQRS------VRLAIYYGALN----PARA 160

Query: 132 RLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQG 191
              +    T   L   +AE+   QA+   + SE++  Q K+ +   E  +  A+L +   
Sbjct: 161 ERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELS 220

Query: 192 EMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQA 233
             +   EE    +  L   ++    A     K++E    ++A
Sbjct: 221 ADQKKLEELRANESRLKNEIA---SAEAAAAKAREAAAAAEA 259


>gnl|CDD|224482 COG1566, EmrA, Multidrug resistance efflux pump [Defense
           mechanisms].
          Length = 352

 Score = 34.6 bits (80), Expect = 0.038
 Identities = 20/97 (20%), Positives = 38/97 (39%), Gaps = 20/97 (20%)

Query: 142 ADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 201
           A L +A+A L   +A    +R++  + Q  + +++ +                    E E
Sbjct: 91  AALEQAEAALAAAEAQLRNLRAQLASAQALIAQAEAQDLDQAQN-------------ELE 137

Query: 202 RTQ-----GTLSRLLSERDKAVLEMDKSKEELERSQA 233
           R       G +SR   E D+A   +  ++  L  +QA
Sbjct: 138 RRAELAQRGVVSR--EELDRARAALQAAEAALAAAQA 172



 Score = 28.4 bits (64), Expect = 3.2
 Identities = 33/141 (23%), Positives = 54/141 (38%), Gaps = 15/141 (10%)

Query: 50  DIAEYKLKLHAYESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLK 109
           D  +Y+  L   E+ +   E     A+L     ++  + A    A         + ++ +
Sbjct: 85  DPRDYRAALEQAEAALAAAE-----AQLRNLRAQLASAQALIAQAE------AQDLDQAQ 133

Query: 110 HELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQ 169
           +ELER      R  +S   ELDR +       A L  AQA  +   A  E   S     Q
Sbjct: 134 NELERRAELAQRGVVSRE-ELDRARAALQAAEAALAAAQAAQKQNLALLESEVSGA---Q 189

Query: 170 EKVEKSQGEVYRLKAKLENTQ 190
            +V  ++  + + K  LE T 
Sbjct: 190 AQVASAEAALDQAKLDLERTV 210


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 33.7 bits (78), Expect = 0.068
 Identities = 35/203 (17%), Positives = 65/203 (32%), Gaps = 32/203 (15%)

Query: 49  RDIAEYKLKLHAYESEVEELELGK----SKAELDKAANEVGRSGADWEAARQRLSRLELE 104
           R        L   + +V    L +     + EL+K   E+     +       +  LE E
Sbjct: 63  RSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQE 122

Query: 105 NERLK------------HELERSQTTFGRTT----LSTSAELDRLQEKYDKT-------- 140
            ERL+               +      G            E D    +Y  T        
Sbjct: 123 IERLEPWGNFDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYVYVV 182

Query: 141 CADLRRAQAELRVV--QADNERVR-SEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMK 197
              L+    E+     +   ER+   EE T  E + + + E+  ++ + E+   E++ + 
Sbjct: 183 VVVLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELA 242

Query: 198 EEYERTQGTLSRLLS-ERDKAVL 219
           ++Y      L   L  E ++A  
Sbjct: 243 KKYLEELLALYEYLEIELERAEA 265



 Score = 27.6 bits (62), Expect = 8.0
 Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 22/90 (24%)

Query: 11  ALSSLEEAIGDKEKQMNQLREQ-RDRADQERQEERDLHE-----RDIAEYKLK------- 57
            +  ++E + + EK+   L E+ ++ A +  +E   L+E      + AE   K       
Sbjct: 216 LIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEALSKFLKTDKT 275

Query: 58  --LHAY--ESEVEELELGKSKAELDKAANE 83
             +  +  E  V++L     K  +DKA   
Sbjct: 276 FAIEGWVPEDRVKKL-----KELIDKATGG 300


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 33.8 bits (78), Expect = 0.073
 Identities = 24/113 (21%), Positives = 45/113 (39%), Gaps = 31/113 (27%)

Query: 95  RQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVV 154
           R+RL++L  ENE LK                  AE +RLQ             + E  + 
Sbjct: 72  RKRLAKLISENEALK------------------AENERLQ-------------KREQSID 100

Query: 155 QADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTL 207
           Q   + V+SE + + +++E+ + E  +L+  ++  Q  +  +          L
Sbjct: 101 QQIQQAVQSETQELTKEIEQLKSERQQLQGLIDQLQRRLAGVLTGPSGGGSDL 153



 Score = 30.0 bits (68), Expect = 1.1
 Identities = 15/73 (20%), Positives = 42/73 (57%), Gaps = 7/73 (9%)

Query: 142 ADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEM----ESMK 197
           A+++  +  L  + ++NE +++E + +Q++ E+S     +++  +++   E+    E +K
Sbjct: 66  AEVKELRKRLAKLISENEALKAENERLQKR-EQSID--QQIQQAVQSETQELTKEIEQLK 122

Query: 198 EEYERTQGTLSRL 210
            E ++ QG + +L
Sbjct: 123 SERQQLQGLIDQL 135



 Score = 28.1 bits (63), Expect = 5.0
 Identities = 18/65 (27%), Positives = 29/65 (44%), Gaps = 4/65 (6%)

Query: 177 GEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVL----EMDKSKEELERSQ 232
            EV  L+ +L     E E++K E ER Q     +  +  +AV     E+ K  E+L+  +
Sbjct: 66  AEVKELRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSER 125

Query: 233 ATLGE 237
             L  
Sbjct: 126 QQLQG 130


>gnl|CDD|213754 TIGR02971, heterocyst_DevB, ABC exporter membrane fusion protein,
           DevB family.  Members of this protein family are found
           mostly in the Cyanobacteria, but also in the
           Planctomycetes. DevB from Anabaena sp. strain PCC 7120
           is partially characterized as a membrane fusion protein
           of the DevBCA ABC exporter, probably a glycolipid
           exporter, required for heterocyst formation. Most
           Cyanobacteria have one member only, but Nostoc sp. PCC
           7120 has seven members.
          Length = 327

 Score = 32.9 bits (75), Expect = 0.13
 Identities = 29/110 (26%), Positives = 54/110 (49%), Gaps = 4/110 (3%)

Query: 128 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLE 187
           AELD  + + D+  A L + +A  +  +   +R       + + V   Q  + RL+A+LE
Sbjct: 55  AELDVARTQLDEAKARLAQVRAGAKKGEIAAQRAARAAAKLFKDVAAQQATLNRLEAELE 114

Query: 188 NTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
             Q E++  +  +    G +S   S+ D   L++  ++EELE + A+  E
Sbjct: 115 TAQREVDRYRSLF--RDGAVSA--SDLDSKALKLRTAEEELEEALASRSE 160


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 32.2 bits (73), Expect = 0.22
 Identities = 30/117 (25%), Positives = 53/117 (45%), Gaps = 5/117 (4%)

Query: 88  GADWEAARQRLSRLELENERLKHELE-----RSQTTFGRTTLSTSAELDRLQEKYDKTCA 142
           G D E     L R EL+N  ++  L+                    +L+ LQ++ ++   
Sbjct: 97  GHDVEHIDVELVRKELKNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLK 156

Query: 143 DLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEE 199
           +L   +AE   VQ   +R+  E   ++E ++K  GEVY LK + +  +  +E  +EE
Sbjct: 157 ELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPEEE 213



 Score = 29.5 bits (66), Expect = 1.4
 Identities = 24/99 (24%), Positives = 51/99 (51%)

Query: 103 LENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVR 162
           ++ E ++ EL+ +    G  TL    E   L+E Y++    L   Q E   +  + E + 
Sbjct: 103 IDVELVRKELKNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELE 162

Query: 163 SEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 201
           +E + +QE++++ + E  RL+  L+   GE+  +K+ ++
Sbjct: 163 AEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWD 201


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 32.3 bits (73), Expect = 0.22
 Identities = 44/225 (19%), Positives = 97/225 (43%), Gaps = 12/225 (5%)

Query: 10  GALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELE 69
           G L  L+  I  KE+++  L+   D   ++      L ++ I+  + +L   E E    E
Sbjct: 330 GKLEKLKSEIELKEEEIKALQSNIDELHKQ------LRKQGISTEQFELMNQEREKLTRE 383

Query: 70  LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAE 129
           L K   + DK    V     + +   + L +   + + L   + RS++  G     +S +
Sbjct: 384 LDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLK 443

Query: 130 LDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEK---TMQEKVEKSQGEVYRLKAKL 186
           ++  Q     +  +    ++ L +     ER+++EE    T++E ++  + ++  L   L
Sbjct: 444 INIEQLFPKGSGINESIKKSILELNDEIQERIKTEENKSITLEEDIKNLKHDINELTQIL 503

Query: 187 ENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEELERS 231
           E  + E+     ++E ++    R L  +    +E++K ++EL   
Sbjct: 504 EKLELELSEANSKFELSKEENERELVAQR---IEIEKLEKELNDL 545


>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.
           This model represents the MFP (membrane fusion protein)
           component of the RND family of transporters. RND refers
           to Resistance, Nodulation, and cell Division. It is, in
           part, a subfamily of pfam00529 (Pfam release 7.5) but
           hits substantial numbers of proteins missed by that
           model. The related HlyD secretion protein, for which
           pfam00529 is named, is outside the scope of this model.
           Attributed functions imply outward transport. These
           functions include nodulation, acriflavin resistance,
           heavy metal efflux, and multidrug resistance proteins.
           Most members of this family are found in Gram-negative
           bacteria. The proposed function of MFP proteins is to
           bring the inner and outer membranes together and enable
           transport to the outside of the outer membrane. Note,
           however, that a few members of this family are found in
           Gram-positive bacteria, where there is no outer membrane
           [Transport and binding proteins, Unknown substrate].
          Length = 322

 Score = 31.9 bits (73), Expect = 0.24
 Identities = 20/79 (25%), Positives = 32/79 (40%), Gaps = 4/79 (5%)

Query: 131 DRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQ 190
           D  Q       A L  A+A+L + Q   ER       + ++   SQ ++   KA +E  Q
Sbjct: 60  DDYQLALQAALAQLAAAEAQLELAQRSFERAE----RLVKRNAVSQADLDDAKAAVEAAQ 115

Query: 191 GEMESMKEEYERTQGTLSR 209
            ++E+ K      Q  L  
Sbjct: 116 ADLEAAKASLASAQLNLRY 134


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 31.2 bits (71), Expect = 0.41
 Identities = 13/53 (24%), Positives = 26/53 (49%)

Query: 12  LSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESE 64
           L  L  AI     + ++L E+   A++ R+E R    ++I++ K K+   +  
Sbjct: 237 LQELTIAIEALTNKKSELLEEIAEAEKIREECRGWSAKEISKLKAKVSLLQKL 289


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 31.1 bits (71), Expect = 0.41
 Identities = 19/64 (29%), Positives = 32/64 (50%), Gaps = 9/64 (14%)

Query: 41  QEERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSR 100
           +E R+ +E ++ EY+ +   +E+E + LE         K A +  + G D EA  + L  
Sbjct: 68  EELREEYEEELKEYEAEKEIWEAEKKGLE---------KKAKKAIKKGKDEEALAEELLE 118

Query: 101 LELE 104
           LE E
Sbjct: 119 LEAE 122


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 30.3 bits (69), Expect = 0.47
 Identities = 19/89 (21%), Positives = 46/89 (51%), Gaps = 2/89 (2%)

Query: 147 AQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGT 206
           A A+ ++   D ++V SE    +   ++ + E  +L+A+L+  + E++  +++ ++   T
Sbjct: 12  AAAQAKIGVVDVQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAAT 71

Query: 207 LSRLLSERDKAVLEMDKSKEELERSQATL 235
           LS     R     E+ + ++EL++ Q   
Sbjct: 72  LSE--EARKAKQQELQQKQQELQQKQQAA 98


>gnl|CDD|220640 pfam10234, Cluap1, Clusterin-associated protein-1.  This protein is
           conserved from worms to humans. The protein of 413 amino
           acids contains a central coiled-coil domain, possibly
           the region that binds to clusterin. Cluap1 expression is
           highest in the nucleus and gradually increases during
           late S to G2/M phases of the cell cycle and returns to
           the basal level in the G0/G1 phases. In addition, it is
           upregulated in colon cancer tissues compared to
           corresponding non-cancerous mucosa. It thus plays a
           crucial role in the life of the cell.
          Length = 269

 Score = 30.7 bits (70), Expect = 0.51
 Identities = 22/70 (31%), Positives = 39/70 (55%)

Query: 160 RVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVL 219
            ++   K +  ++++ Q E+  LK+   N + ++E  K+E ERTQ  L  L S R   + 
Sbjct: 168 ALKEAIKNLAARLQQLQAELDNLKSDEANLEAKIERKKQELERTQKRLQALQSVRPAFME 227

Query: 220 EMDKSKEELE 229
           E +K +EEL+
Sbjct: 228 EYEKLEEELQ 237



 Score = 28.1 bits (63), Expect = 3.5
 Identities = 18/64 (28%), Positives = 38/64 (59%)

Query: 147 AQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGT 206
             A L+ +QA+ + ++S+E  ++ K+E+ + E+ R + +L+  Q    +  EEYE+ +  
Sbjct: 176 LAARLQQLQAELDNLKSDEANLEAKIERKKQELERTQKRLQALQSVRPAFMEEYEKLEEE 235

Query: 207 LSRL 210
           L +L
Sbjct: 236 LQKL 239


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 31.1 bits (71), Expect = 0.52
 Identities = 21/93 (22%), Positives = 36/93 (38%), Gaps = 7/93 (7%)

Query: 11  ALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELEL 70
           AL      +  + +   + + Q     + +Q+E    E   AE + K    E+++E+L  
Sbjct: 146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQL-- 203

Query: 71  GKSKAELDKAANEVGRSGADWEAARQRLSRLEL 103
            + KA       +  R     E   Q   RLEL
Sbjct: 204 -QEKAAETSQERKQKRK----EITDQAAKRLEL 231


>gnl|CDD|226654 COG4191, COG4191, Signal transduction histidine kinase regulating
           C4-dicarboxylate transport system [Signal transduction
           mechanisms].
          Length = 603

 Score = 31.1 bits (71), Expect = 0.52
 Identities = 22/70 (31%), Positives = 29/70 (41%), Gaps = 2/70 (2%)

Query: 87  SGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRR 146
             A W   R+R      E +  + ELER            +A L     + ++  A LRR
Sbjct: 311 LLALWLRRRRRARLRLAELQEARAELERRVEERTADLTRANARLQAEIAEREQAEAALRR 370

Query: 147 AQAELRVVQA 156
           AQ EL  VQA
Sbjct: 371 AQDEL--VQA 378


>gnl|CDD|224455 COG1538, TolC, Outer membrane protein [Cell envelope biogenesis,
           outer membrane / Intracellular trafficking and
           secretion].
          Length = 457

 Score = 29.8 bits (67), Expect = 1.2
 Identities = 13/80 (16%), Positives = 39/80 (48%), Gaps = 3/80 (3%)

Query: 143 DLRRAQAELRVVQADNERVRSEEKTMQEKVEK---SQGEVYRLKAKLENTQGEMESMKEE 199
           DL  AQ +L + +        + +  +++ +    ++ +V + +A+L + + ++ + + +
Sbjct: 162 DLLAAQEQLALAEETLAAAEEQLELAEKRYDAGLATRLDVLQAEAQLASARAQLAAAQAQ 221

Query: 200 YERTQGTLSRLLSERDKAVL 219
             + +  L+RLL      +L
Sbjct: 222 LAQARNALARLLGLEPGELL 241


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
          [Posttranslational modification, protein turnover,
          chaperones].
          Length = 881

 Score = 29.9 bits (67), Expect = 1.2
 Identities = 9/41 (21%), Positives = 23/41 (56%)

Query: 13 SSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAE 53
          S  ++   D++ Q++  ++  ++ D++++EE  L E D   
Sbjct: 5  SDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEEDAML 45


>gnl|CDD|233228 TIGR00998, 8a0101, efflux pump membrane protein (multidrug
           resistance protein A).  [Transport and binding proteins,
           Other].
          Length = 334

 Score = 29.8 bits (67), Expect = 1.3
 Identities = 23/96 (23%), Positives = 43/96 (44%), Gaps = 6/96 (6%)

Query: 142 ADLRRAQAELRVVQADNERVRSEEKTMQ--EKVEKSQGEVYRLKAKLENTQGEMESMKEE 199
             L    AEL + +A+        +T Q    V++ Q +V  LK KLE  + ++   + +
Sbjct: 71  VRLDPTNAELALAKAEANLAALVRQTKQLEITVQQLQAKVESLKIKLEQAREKLLQAELD 130

Query: 200 YERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
             R      + L  R+    E+D +++ L  ++A L
Sbjct: 131 LRRRVPLFKKGLISRE----ELDHARKALLSAKAAL 162


>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
           Provisional.
          Length = 276

 Score = 29.6 bits (67), Expect = 1.3
 Identities = 18/93 (19%), Positives = 28/93 (30%), Gaps = 29/93 (31%)

Query: 70  LGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAE 129
           +G   + + +  N      +    +   L  L  ENE LK EL                 
Sbjct: 43  VGDVVSPVQRVVNAPREFVSGVFESLASLFDLREENEELKKELLE--------------- 87

Query: 130 LDRLQEKYDKTCADLRRAQAELRVVQADNERVR 162
                         L     EL  ++A+N R+R
Sbjct: 88  --------------LESRLQELEQLEAENARLR 106


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 29.3 bits (66), Expect = 1.3
 Identities = 13/83 (15%), Positives = 40/83 (48%)

Query: 150 ELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSR 209
            ++ +  + +R+    K +++ ++K++ E+  L   LE  + E+E ++ +  + +  +  
Sbjct: 11  AIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQE 70

Query: 210 LLSERDKAVLEMDKSKEELERSQ 232
           +     +A  ++   K+E E   
Sbjct: 71  IRERIKRAEEKLSAVKDERELRA 93


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 29.6 bits (67), Expect = 1.5
 Identities = 13/48 (27%), Positives = 27/48 (56%)

Query: 158 NERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQG 205
            E++R +EK ++E  +    E   L +KLE  Q ++E +++  E+ + 
Sbjct: 323 TEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLKS 370


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 29.2 bits (66), Expect = 1.7
 Identities = 12/53 (22%), Positives = 26/53 (49%)

Query: 12  LSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESE 64
           L  LE  I D   + ++L  +   A+++ ++ R    ++I + K +L   +S 
Sbjct: 234 LQELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTFKEIEKLKEQLKLLQSL 286


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 29.5 bits (67), Expect = 1.7
 Identities = 17/90 (18%), Positives = 41/90 (45%), Gaps = 5/90 (5%)

Query: 121 RTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEE-----KTMQEKVEKS 175
           +T +    +   L+E+  +   +L+  + EL  ++   +++  +         +E++EK 
Sbjct: 320 KTEIEVGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKL 379

Query: 176 QGEVYRLKAKLENTQGEMESMKEEYERTQG 205
                +L  +LE  + E++ +KEE E    
Sbjct: 380 LETKEKLSEELEELEEELKELKEELESLYS 409



 Score = 27.2 bits (61), Expect = 9.7
 Identities = 18/75 (24%), Positives = 36/75 (48%), Gaps = 2/75 (2%)

Query: 159 ERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAV 218
           + +  E K ++E++EK +  + +L  K           +E+ E+   T  +L  E ++  
Sbjct: 337 KELEEELKELEEELEKIKKLLKKLPKKARGQLPPE--KREQLEKLLETKEKLSEELEELE 394

Query: 219 LEMDKSKEELERSQA 233
            E+ + KEELE   +
Sbjct: 395 EELKELKEELESLYS 409


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 29.4 bits (67), Expect = 1.8
 Identities = 18/58 (31%), Positives = 22/58 (37%), Gaps = 12/58 (20%)

Query: 57  KLHAYESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELER 114
           +L A E    ELE    + EL                      +LE E ERLK ELE+
Sbjct: 2   RLEALEERNSELEEQIRQLELKL------------RDLEAENEKLERELERLKSELEK 47


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 29.4 bits (67), Expect = 1.9
 Identities = 16/69 (23%), Positives = 29/69 (42%), Gaps = 9/69 (13%)

Query: 15  LEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSK 74
           L E I   E+   +L ++ + A+   +E   L E    E + K    + E ++L      
Sbjct: 518 LNELIASLEELERELEQKAEEAEALLKEAEKLKE----ELEEKKEKLQEEEDKL-----L 568

Query: 75  AELDKAANE 83
            E +K A +
Sbjct: 569 EEAEKEAQQ 577


>gnl|CDD|163494 TIGR03782, Bac_Flav_CT_J, Bacteroides conjugative transposon TraJ
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage. This family is related conjugation system
           proteins in the Proteobacteria, including TrbL of
           Agrobacterium Ti plasmids and VirB6 [Cellular processes,
           DNA transformation].
          Length = 322

 Score = 28.9 bits (65), Expect = 2.3
 Identities = 18/75 (24%), Positives = 34/75 (45%), Gaps = 17/75 (22%)

Query: 26  MNQLREQRDRADQE---RQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAE------ 76
           MN+ REQ+D+ + E   R  E          Y +    ++ +++EL    S  +      
Sbjct: 101 MNRYREQKDKLEYEAMVRDPET--------AYLVSDEEFDRKLDELGWSPSDMKTMAGMY 152

Query: 77  LDKAANEVGRSGADW 91
           +++AA  + +S  DW
Sbjct: 153 MERAAYNMKKSVRDW 167


>gnl|CDD|179310 PRK01622, PRK01622, OxaA-like protein precursor; Validated.
          Length = 256

 Score = 28.6 bits (64), Expect = 2.3
 Identities = 13/40 (32%), Positives = 23/40 (57%), Gaps = 7/40 (17%)

Query: 165 EKTMQEKVEKSQGEVYRLKAKLENT-----QGE--MESMK 197
           ++ MQEK+   + E+ +++AKL+ T     Q E   E M+
Sbjct: 86  QRGMQEKMAVMKPELDKIQAKLKVTKDLEKQKEYQKEMME 125


>gnl|CDD|153363 cd07679, F-BAR_PACSIN2, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Protein kinase C and
           Casein kinase Substrate in Neurons 2 (PACSIN2).  F-BAR
           domains are dimerization modules that bind and bend
           membranes and are found in proteins involved in membrane
           dynamics and actin reorganization. Protein kinase C and
           Casein kinase Substrate in Neurons (PACSIN) proteins,
           also called Synaptic dynamin-associated proteins
           (Syndapins), act as regulators of cytoskeletal and
           membrane dynamics. Vetebrates harbor three isoforms with
           distinct expression patterns and specific functions.
           PACSIN 2 or Syndapin II is expressed ubiquitously and is
           involved in the regulation of tubulin polymerization. It
           associates with Golgi membranes and forms a complex with
           dynamin II which is crucial in promoting vesicle
           formation from the trans-Golgi network. PACSIN 2
           contains an N-terminal F-BAR domain and a C-terminal SH3
           domain. F-BAR domains form banana-shaped dimers with a
           positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 258

 Score = 28.9 bits (64), Expect = 2.4
 Identities = 26/103 (25%), Positives = 50/103 (48%), Gaps = 8/103 (7%)

Query: 129 ELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 188
           E++  ++ Y   C + + A +     +AD      + K +Q+KVEK + +V + K K E 
Sbjct: 134 EVEAAKKAYHTACKEEKLATSREANSKADPALNPEQLKKLQDKVEKCKQDVLKTKEKYEK 193

Query: 189 TQGE--------MESMKEEYERTQGTLSRLLSERDKAVLEMDK 223
           +  E        ME+M++ +E+ Q    + L    + +LE+ K
Sbjct: 194 SLKELDQTTPQYMENMEQVFEQCQQFEEKRLRFFREVLLEVQK 236


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 29.3 bits (65), Expect = 2.5
 Identities = 35/217 (16%), Positives = 92/217 (42%), Gaps = 4/217 (1%)

Query: 21   DKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELG--KSKAELD 78
            ++ K+ ++ +++ + A ++  E +   E      + K      + +E +      KA+  
Sbjct: 1480 EEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEA 1539

Query: 79   KAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYD 138
            K A E  ++    +A   + +  + + E  K   E       +   +  AE  R++E   
Sbjct: 1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMK 1599

Query: 139  KTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKE 198
                + +    E +  +A+  ++++EE    E+ +K   ++ + +A+ +    E++  +E
Sbjct: 1600 LYEEEKKMKAEEAK--KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEE 1657

Query: 199  EYERTQGTLSRLLSERDKAVLEMDKSKEELERSQATL 235
            E +      ++   E  K   E  K++E+ +++   L
Sbjct: 1658 ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEAL 1694



 Score = 28.6 bits (63), Expect = 4.4
 Identities = 32/200 (16%), Positives = 85/200 (42%), Gaps = 9/200 (4%)

Query: 2    QAHHCSSEGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAY 61
            +A     E  +   EE    K ++  +  E + +A++ ++ E +  + +  + K      
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK-----E 1642

Query: 62   ESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGR 121
              E ++ E  K   E +K         A+ +  +   ++   E+E+   E  + +    +
Sbjct: 1643 AEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702

Query: 122  TTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYR 181
                 + EL + + +  K   +L++A+ E ++   + ++   E+K   E+ +K + E  +
Sbjct: 1703 ----KAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKK 1758

Query: 182  LKAKLENTQGEMESMKEEYE 201
            +    +  + + E +++E E
Sbjct: 1759 IAHLKKEEEKKAEEIRKEKE 1778


>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
           Members of this protein family are homologs of ClpB, an
           ATPase associated with chaperone-related functions.
           These ClpB homologs, designated ClpV1, are a key
           component of the bacterial pathogenicity-associated type
           VI secretion system [Protein fate, Protein and peptide
           secretion and trafficking, Cellular processes,
           Pathogenesis].
          Length = 852

 Score = 28.8 bits (65), Expect = 2.7
 Identities = 22/102 (21%), Positives = 43/102 (42%), Gaps = 1/102 (0%)

Query: 92  EAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAEL 151
           E  R+R++ LELE + L+ E         R      AEL  L+ +     A  ++ +  +
Sbjct: 422 EDLRRRIAALELELDALEREAALGADHDERLAELR-AELAALEAELAALEARWQQEKELV 480

Query: 152 RVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEM 193
             + A    + ++     +     + ++  L+A L + QGE 
Sbjct: 481 EAILALRAELEADADAPADDDAALRAQLAELEAALASAQGEE 522


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 28.6 bits (65), Expect = 2.9
 Identities = 28/107 (26%), Positives = 49/107 (45%), Gaps = 17/107 (15%)

Query: 145 RRAQAELRVVQADNERVRSEEKTMQ---------EKVEKSQGEVYRLKAKLENTQGEMES 195
            R + E  + +  NE  + E++ +Q         E +EK + E+ + + +LE  Q E+E 
Sbjct: 69  LRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEK 128

Query: 196 MKEEYERTQGT-------LSRLLSERDKAVLEMDKSKEELERSQATL 235
            +EE E            +S L +E  K +L ++K +EE     A L
Sbjct: 129 KEEELEELIEEQLQELERISGLTAEEAKEIL-LEKVEEEARHEAAVL 174


>gnl|CDD|222629 pfam14257, DUF4349, Domain of unknown function (DUF4349).  This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 282 and
           353 amino acids in length. There is a single completely
           conserved residue D that may be functionally important.
          Length = 262

 Score = 28.3 bits (64), Expect = 3.0
 Identities = 14/50 (28%), Positives = 26/50 (52%), Gaps = 2/50 (4%)

Query: 128 AELDRLQEKYDK--TCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKS 175
           A  +RL    ++  T  DL   + EL  VQA+ E + ++ + + ++V  S
Sbjct: 146 ASEERLLALLERAGTVEDLLEVERELSRVQAEIESLEAQLRYLDDRVAYS 195


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 28.6 bits (64), Expect = 3.0
 Identities = 14/60 (23%), Positives = 29/60 (48%)

Query: 24  KQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDKAANE 83
           K   +++   D +  +RQE    + + + +Y  KL   + E +  E  ++KA+ +K   E
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQE 410


>gnl|CDD|221380 pfam12017, Tnp_P_element, Transposase protein.  Protein in this
           family are transposases found in insects. This region is
           about 230 amino acids in length and is found associated
           with pfam05485.
          Length = 236

 Score = 28.3 bits (63), Expect = 3.0
 Identities = 12/47 (25%), Positives = 27/47 (57%)

Query: 158 NERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQ 204
           N     E K++++K+   + E++ L+ +LE +Q   +S+ + +  TQ
Sbjct: 13  NHTTLMENKSLRQKIRALEKEMHSLRQQLEESQQLEKSLGKIFTETQ 59


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 27.9 bits (63), Expect = 3.2
 Identities = 18/62 (29%), Positives = 24/62 (38%), Gaps = 7/62 (11%)

Query: 55  KLKLHAYESEVE--ELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHEL 112
            +K H  E   +  ELE    +AEL        +     E AR R    ELE +  + E 
Sbjct: 48  SIKQHLAELAADLLELEAAAPRAELQAKIARYKK-----EKARYRSEAKELEAKAKEAEA 102

Query: 113 ER 114
           E 
Sbjct: 103 ES 104



 Score = 27.2 bits (61), Expect = 5.9
 Identities = 15/48 (31%), Positives = 20/48 (41%), Gaps = 1/48 (2%)

Query: 22 KEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELE 69
            KQ +      D  + E    R   +  IA YK +   Y SE +ELE
Sbjct: 48 SIKQ-HLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELE 94


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 28.2 bits (63), Expect = 3.4
 Identities = 23/104 (22%), Positives = 49/104 (47%), Gaps = 4/104 (3%)

Query: 125 STSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKA 184
               +L  LQE+ +     +   ++EL  ++ D + +R   K+++E++E  +  +  L  
Sbjct: 83  ELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELAK 142

Query: 185 KLENTQG----EMESMKEEYERTQGTLSRLLSERDKAVLEMDKS 224
           +L   +     E+E ++EE ER Q  L RL     +    +++ 
Sbjct: 143 ELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLLEQL 186



 Score = 27.8 bits (62), Expect = 3.8
 Identities = 24/117 (20%), Positives = 51/117 (43%), Gaps = 8/117 (6%)

Query: 90  DWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQA 149
             E  +  LS L+ +   L+ ELE              +EL+ L+E        L+  + 
Sbjct: 73  KLEELQGELSELKQQLSELQEELED----LEERIAELESELEDLKEDLQLLRELLKSLEE 128

Query: 150 ELRVVQADNERVRSE----EKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 202
            L  ++   + +  E     + ++E+VE+ + E+ RL+  L+  Q  ++ ++   E+
Sbjct: 129 RLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLLEQ 185



 Score = 27.0 bits (60), Expect = 7.6
 Identities = 21/97 (21%), Positives = 43/97 (44%), Gaps = 5/97 (5%)

Query: 12  LSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELG 71
           L  LEE I + E ++  L+E        R+  + L ER +   +  +     E+ EL   
Sbjct: 95  LEDLEERIAELESELEDLKEDL---QLLRELLKSLEER-LESLEESIKELAKELRELRQ- 149

Query: 72  KSKAELDKAANEVGRSGADWEAARQRLSRLELENERL 108
             + E+++   E+ R   + +  ++ +  L+   E+L
Sbjct: 150 DLREEVEELREELERLQENLQRLQEAIQELQSLLEQL 186


>gnl|CDD|237110 PRK12472, PRK12472, hypothetical protein; Provisional.
          Length = 508

 Score = 28.3 bits (63), Expect = 3.9
 Identities = 22/100 (22%), Positives = 43/100 (43%), Gaps = 5/100 (5%)

Query: 96  QRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQ 155
           + L+R   +  R   E + +     R      A L +L+    +  A+L+RA   L   +
Sbjct: 193 ETLAREAEDAARAADEAKTAAAAAAREAAPLKASLRKLERAKARADAELKRADKALAAAK 252

Query: 156 ADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMES 195
            D  + R+EE+  Q+K  +   E      +L+  + + E+
Sbjct: 253 TDEAKARAEER--QQKAAQQAAEA---ATQLDTAKADAEA 287


>gnl|CDD|177920 PLN02281, PLN02281, chlorophyllide a oxygenase.
          Length = 536

 Score = 28.2 bits (62), Expect = 4.0
 Identities = 19/76 (25%), Positives = 37/76 (48%), Gaps = 1/76 (1%)

Query: 161 VRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLE 220
           V+ E   +QE++ K+  +V+  +A++     ++  M EE    +    R+++E DK    
Sbjct: 126 VKKELAGLQEELSKAHQQVHISEARVSTALDKLAHM-EELVNDRLLPGRVVTELDKPSSS 184

Query: 221 MDKSKEELERSQATLG 236
              S  EL+R +   G
Sbjct: 185 TTASAVELDREKTNTG 200


>gnl|CDD|240301 PTZ00166, PTZ00166, DNA polymerase delta catalytic subunit;
           Provisional.
          Length = 1054

 Score = 28.5 bits (64), Expect = 4.1
 Identities = 8/25 (32%), Positives = 13/25 (52%)

Query: 207 LSRLLSERDKAVLEMDKSKEELERS 231
           +  L++ R KA  EM   K+ L + 
Sbjct: 617 VEELIAARKKAKKEMKDEKDPLLKK 641


>gnl|CDD|223914 COG0845, AcrA, Membrane-fusion protein [Cell envelope biogenesis,
           outer membrane].
          Length = 372

 Score = 28.1 bits (62), Expect = 4.1
 Identities = 17/73 (23%), Positives = 27/73 (36%), Gaps = 1/73 (1%)

Query: 89  ADWEAARQRLSRLELE-NERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRA 147
           A  + A  +L+R +         +L+R           + AELD  Q       A +  A
Sbjct: 105 AALDQAEAQLARAQALLAPAELGDLQREAKLAAEKAAVSQAELDAAQALLRAAEALVEAA 164

Query: 148 QAELRVVQADNER 160
           QA L   + + E 
Sbjct: 165 QAALASAKLNLEY 177


>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
          Length = 3164

 Score = 28.4 bits (63), Expect = 4.4
 Identities = 11/64 (17%), Positives = 29/64 (45%)

Query: 168  MQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDKSKEE 227
            ++E+ ++  GE + ++ +L   +G    +     R +   S  +      + +MD   ++
Sbjct: 2309 LREEEKRIDGEAFLVEDRLTLGKGLSSDLMTFKLRRRSYYSLDILRVHGKIADMDTVHKD 2368

Query: 228  LERS 231
            + RS
Sbjct: 2369 VLRS 2372


>gnl|CDD|200547 cd10921, CE4_MJ0505_like, Putative catalytic domain of
           uncharacterized protein MJ0505 from Methanocaldococcus
           jannaschii and similar proteins.  This family contains
           an uncharacterized protein MJ0505 from
           Methanocaldococcus jannaschii and its prokaryotic
           homologs. Although their biochemical properties remain
           to be determined, members in this family is composed of
           a seven-stranded barrel with a detectable sequence
           similarity to the six-stranded barrel rhizobial
           NodB-like proteins, which remove N-linked or O-linked
           acetyl groups of cell wall polysaccharides and belong to
           a larger carbohydrate esterase 4 (CE4) superfamily.
          Length = 206

 Score = 27.7 bits (62), Expect = 4.9
 Identities = 24/79 (30%), Positives = 34/79 (43%), Gaps = 9/79 (11%)

Query: 46  LHERDIAEYKLKLHAYESEVEELELGKSKAE--LDKAANEVGRSGADWEAARQRL---SR 100
           LHE +   YK++LH Y+    E    K+ AE  L+KA   +   G D       L    R
Sbjct: 59  LHELEKEGYKVELHGYDHIYHEFNCNKTTAEEKLEKALEILSALGFDNI----TLFLPPR 114

Query: 101 LELENERLKHELERSQTTF 119
             +  + LK  LE + T  
Sbjct: 115 YAISEDSLKVILEHNFTVI 133


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 28.0 bits (63), Expect = 5.1
 Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 14/76 (18%)

Query: 127 SAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKL 186
           +AEL RL+++ +K   ++ R + +L      NE   ++     E VEK        K KL
Sbjct: 813 AAELARLEKELEKLEKEIDRIEKKL-----SNEGFVAKAPE--EVVEKE-------KEKL 858

Query: 187 ENTQGEMESMKEEYER 202
              Q ++  ++E    
Sbjct: 859 AEYQVKLAKLEERLAV 874


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 28.1 bits (63), Expect = 5.3
 Identities = 36/178 (20%), Positives = 65/178 (36%), Gaps = 30/178 (16%)

Query: 60  AYESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTF 119
           A     EE    + +   ++   E+       +   + + RLE EN  LK ELE      
Sbjct: 393 ALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELE------ 446

Query: 120 GRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEV 179
                    EL R  EK +      RR              VR + +  +E +      +
Sbjct: 447 ---------ELKREIEKLESELERFRR-------------EVRDKVRKDRE-IRARDRRI 483

Query: 180 YRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERDKAVLEMDK-SKEELERSQATLG 236
            RL+ +LE  +  +E ++ +    +      LS +   V  ++K + E +E ++   G
Sbjct: 484 ERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLTLEAIEEAEEEYG 541


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 28.1 bits (63), Expect = 5.5
 Identities = 29/160 (18%), Positives = 64/160 (40%), Gaps = 12/160 (7%)

Query: 2   QAHHCSSEGALSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAY 61
           ++     E  L +L     D +++  +L+++     +   E+ +     I E K +  A 
Sbjct: 358 RSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAA 417

Query: 62  ESEVEELELGKSKAELDKAANEVGRSGADWEAARQRLSRLELENERLKHELERSQTTFGR 121
             E  +    + + +L+    E            +    LEL   RLK  L+ +  T   
Sbjct: 418 IEEDLQALESQLRQQLEAGKLE----------FNEEEYELELRLGRLKQRLDSATATPEE 467

Query: 122 TTLSTSAE--LDRLQEKYDKTCADLRRAQAELRVVQADNE 159
                  +  L++ QE+ ++  A++ + Q+ELR ++   +
Sbjct: 468 LEQLEINDEALEKAQEEQEQAEANVEQLQSELRQLRKRRD 507


>gnl|CDD|222790 PHA00430, PHA00430, tail fiber protein.
          Length = 568

 Score = 27.9 bits (62), Expect = 5.7
 Identities = 19/138 (13%), Positives = 45/138 (32%), Gaps = 5/138 (3%)

Query: 86  RSGADWEAARQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLR 145
            + + W A R   +R   E +R +++ ER     G +  +T            +      
Sbjct: 157 WNQSAWNA-RNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSEA----N 211

Query: 146 RAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQG 205
           R +     + +  E  + + ++  ++   +     +  A          +          
Sbjct: 212 RFKGYADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHASEVNAANSATAAAT 271

Query: 206 TLSRLLSERDKAVLEMDK 223
           + +R   + D+A  E DK
Sbjct: 272 SANRAKQQADRAKTEADK 289



 Score = 27.9 bits (62), Expect = 5.7
 Identities = 17/91 (18%), Positives = 30/91 (32%), Gaps = 1/91 (1%)

Query: 28  QLREQRDRADQERQE-ERDLHERDIAEYKLKLHAYESEVEELELGKSKAELDKAANEVGR 86
             ++ R  AD    E  R     D     ++    ++E    E   +     KAA     
Sbjct: 195 NTKQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASA 254

Query: 87  SGADWEAARQRLSRLELENERLKHELERSQT 117
           + A    A    +       R K + +R++T
Sbjct: 255 AHASEVNAANSATAAATSANRAKQQADRAKT 285


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 27.6 bits (62), Expect = 5.8
 Identities = 32/149 (21%), Positives = 69/149 (46%), Gaps = 8/149 (5%)

Query: 93  AARQRLSRLELENERLKHELERSQTTFGRT-TLSTSAELDRLQEKYDKTCADLRR--AQA 149
            A ++L   E   +R+  E ++   T  +   L    E+ +L+ + ++   + R    + 
Sbjct: 22  IAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRL 81

Query: 150 ELRVVQAD---NERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGT 206
           E R++Q +   + ++ S +K  +E +EK + E+   +  L+  + E+E +  E       
Sbjct: 82  ERRLLQREETLDRKMESLDKK-EENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140

Query: 207 LSRLLSERDKAVLEMDKSKEELERSQATL 235
           +S L  E  K +L +++ +EE     A L
Sbjct: 141 ISGLTQEEAKEIL-LEEVEEEARHEAAKL 168


>gnl|CDD|215558 PLN03073, PLN03073, ABC transporter F family; Provisional.
          Length = 718

 Score = 27.9 bits (62), Expect = 6.2
 Identities = 23/82 (28%), Positives = 37/82 (45%), Gaps = 6/82 (7%)

Query: 156 ADNERVRSEEKTMQEK-----VEKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRL 210
           +D+   R     + EK     + K +  V  L A +  + G M+  +   ++ +     L
Sbjct: 63  SDDAECRLVCSKLAEKFGKHGLVKPKPSVRSLAAPVRMSDG-MDDSEVAKKKPEPDDGPL 121

Query: 211 LSERDKAVLEMDKSKEELERSQ 232
           LSERD A +E  K KEE +R  
Sbjct: 122 LSERDLAKIERRKRKEERQREV 143


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 27.2 bits (61), Expect = 7.0
 Identities = 22/90 (24%), Positives = 39/90 (43%), Gaps = 11/90 (12%)

Query: 128 AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRS--------EEKTMQEKVEKSQGEV 179
           AE+  LQ + D+  A++  A  E    Q + E              K  +EK++ +Q  +
Sbjct: 142 AEIAALQAEIDEAQAEVNAAYQE---AQCEAEGTGGTGVAGKGPVYKEKREKLDAAQARL 198

Query: 180 YRLKAKLENTQGEMESMKEEYERTQGTLSR 209
             LKA+L+    ++E+ K   ER +     
Sbjct: 199 ETLKARLDAAIAQLEAQKAALERNRQAAVA 228


>gnl|CDD|233594 TIGR01844, type_I_sec_TolC, type I secretion outer membrane
           protein, TolC family.  Members of This model are outer
           membrane proteins from the TolC subfamily within the RND
           (Resistance-Nodulation-cell Division) efflux systems.
           These proteins, unlike the NodT subfamily, appear not to
           be lipoproteins. All are believed to participate in type
           I protein secretion, an ABC transporter system for
           protein secretion without cleavage of a signal sequence,
           although they may, like TolC, participate also in the
           efflux of smaller molecules as well. This family
           includes the well-documented examples TolC (E. coli),
           PrtF (Erwinia), and AprF (Pseudomonas aeruginosa)
           [Protein fate, Protein and peptide secretion and
           trafficking, Transport and binding proteins, Porins].
          Length = 415

 Score = 27.3 bits (61), Expect = 7.3
 Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 5/87 (5%)

Query: 120 GRTTLSTSAELDRLQEKYD--KTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQG 177
                S S+ L   Q  +D   T   +R A+A      A  E +R+  + +  +  ++  
Sbjct: 64  NTDLNSGSSTLTLSQPLFDGGSTWNAVRAAEAA---ALAARETLRATAQDLILRTAEAYM 120

Query: 178 EVYRLKAKLENTQGEMESMKEEYERTQ 204
           EV R +  L   +  + ++KE+ +  +
Sbjct: 121 EVLRAQEILALAEANLAALKEQLDLAR 147


>gnl|CDD|215974 pfam00529, HlyD, HlyD family secretion protein. 
          Length = 304

 Score = 27.5 bits (61), Expect = 7.4
 Identities = 22/97 (22%), Positives = 34/97 (35%), Gaps = 1/97 (1%)

Query: 142 ADLRRAQAELRVVQADNE-RVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEY 200
           A L RA+AEL   QAD + R+   E+   + +          +A                
Sbjct: 40  AALDRAEAELAAAQADAQARLARAERDRLQALVAQAAIQSEQEADAARAALREALAASRG 99

Query: 201 ERTQGTLSRLLSERDKAVLEMDKSKEELERSQATLGE 237
             ++  L         A+  +  S   +  SQATLG 
Sbjct: 100 AASKAQLEAARDNLRSALALLPISLGRIGASQATLGA 136


>gnl|CDD|201956 pfam01756, ACOX, Acyl-CoA oxidase.  This is a family of Acyl-CoA
           oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA
           into trans-2- enoyl-CoA.
          Length = 186

 Score = 26.8 bits (60), Expect = 7.6
 Identities = 9/37 (24%), Positives = 19/37 (51%)

Query: 110 HELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRR 146
             LE+    F R    +  ++D+++E+  K  A++R 
Sbjct: 90  WILEKHSGDFLRFGYLSPDQIDQVREQILKLLAEVRP 126


>gnl|CDD|227320 COG4987, CydC, ABC-type transport system involved in cytochrome bd
           biosynthesis, fused ATPase and permease components
           [Energy production and conversion / Posttranslational
           modification, protein turnover, chaperones].
          Length = 573

 Score = 27.3 bits (61), Expect = 7.7
 Identities = 12/52 (23%), Positives = 17/52 (32%), Gaps = 5/52 (9%)

Query: 105 NERLKHELE--RSQTT---FGRTTLSTSAELDRLQEKYDKTCADLRRAQAEL 151
              L       RSQ T    G+  L      D  +   + T A   +AQ + 
Sbjct: 187 GAHLAQGRAALRSQFTDWVQGQAELLIFGAEDAYRTALEATEASWLKAQRKQ 238


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 27.3 bits (61), Expect = 8.0
 Identities = 32/223 (14%), Positives = 88/223 (39%), Gaps = 28/223 (12%)

Query: 12  LSSLEEAIGDKEKQMNQLREQ--------RDRADQERQEERDLHERDIAEYKLKLHAYES 63
           +  +++ I    K + + R++        +++ D+  +E + +   +I E   +L     
Sbjct: 190 IDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKA-EIEELTDELLNLVM 248

Query: 64  EVEELE--LGKSKAELDKAANEVG---------RSGADWEAARQRLSRLELENERLKHEL 112
           ++E+    L K      K  +++            G       Q++S       ++K +L
Sbjct: 249 DIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKL 308

Query: 113 ERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKV 172
           +  Q +          +LD   ++ ++   +      +L  ++      +    T+ +K 
Sbjct: 309 KELQHSL--------EKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKA 360

Query: 173 EKSQGEVYRLKAKLENTQGEMESMKEEYERTQGTLSRLLSERD 215
           +K +  +  L+A+  +   E+  +++E ++   T S L+ E+ 
Sbjct: 361 KKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVKEKY 403


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 27.3 bits (61), Expect = 8.8
 Identities = 17/79 (21%), Positives = 30/79 (37%), Gaps = 2/79 (2%)

Query: 30  REQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEEL--ELGKSKAELDKAANEVGRS 87
           R ++         E +  E ++ E   K      E+  L  EL +++  L+    +    
Sbjct: 197 RRKKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSE 256

Query: 88  GADWEAARQRLSRLELENE 106
           G D    R++L R   E E
Sbjct: 257 GGDLFEEREQLERQLKEIE 275


>gnl|CDD|118668 pfam10140, YukC, WXG100 protein secretion system (Wss), protein
           YukC.  Members of this family of proteins include
           predicted membrane proteins homologous to YukC in B.
           subtilis. The YukC protein family would participate to
           the formation of a translocon required for the secretion
           of WXG100 proteins (pfam06013) in monoderm bacteria, the
           WXG100 protein secretion system (Wss). This family
           includes EssB in Staphylococcus aureus.
          Length = 359

 Score = 27.2 bits (61), Expect = 8.8
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 24  KQMNQLREQRDRADQERQEERDLHERDIAEYKLKL 58
           K   Q++   D +  ERQE+ D  E ++ EY  + 
Sbjct: 325 KYREQVKNDDDLSGDERQEKLDELEDELDEYWKER 359


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 27.0 bits (60), Expect = 9.1
 Identities = 13/105 (12%), Positives = 51/105 (48%), Gaps = 1/105 (0%)

Query: 95  RQRLSRLELENERLKHELERSQTTFGRTTLSTSAELDRLQEKYDKTCADLRRAQAELRVV 154
            +  +RL+ +  + + +L+    +  + +       + + E+ ++  A+L +   + +++
Sbjct: 76  NELKARLQQQLLQSREQLQLLIESLAQLSSEFQELANEIFEELNRRLAELNQQNLK-QLL 134

Query: 155 QADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEE 199
           +   E +    + +++++ +S  E   L  +++   GE++ + +E
Sbjct: 135 KPLREVLEKFREQLEQRIHESAEERSTLLEEIDRLLGEIQQLAQE 179


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 26.6 bits (59), Expect = 9.7
 Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 3/93 (3%)

Query: 12  LSSLEEAIGDKEKQMNQLREQRDRADQERQEERDLHERDIAEYKLKLHAYESEVEELELG 71
            S L + +  K+ Q  +L E++    +  ++E    E D    +L+    E + +   L 
Sbjct: 92  QSRLNKLLSLKDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLK 151

Query: 72  KSKAELDKAANEVGRSGADWEAARQRLSRLELE 104
           + K   DK   E+ R  ++    +Q L   ELE
Sbjct: 152 EKKEAEDK---EIARLKSEASKIKQELVDAELE 181


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.305    0.122    0.311 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,407,066
Number of extensions: 1088175
Number of successful extensions: 3280
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2393
Number of HSP's successfully gapped: 786
Length of query: 237
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 143
Effective length of database: 6,768,326
Effective search space: 967870618
Effective search space used: 967870618
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 57 (25.6 bits)