RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16118
         (1070 letters)



>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
            chromosome partitioning].
          Length = 1163

 Score =  251 bits (644), Expect = 5e-69
 Identities = 224/1074 (20%), Positives = 482/1074 (44%), Gaps = 154/1074 (14%)

Query: 36   GSGALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAK 95
            G    K+  E  + ++ + EE         + +  + ++ + + ++AE+YQ ++ E+   
Sbjct: 166  GVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELREL 225

Query: 96   EVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAK 155
            E+   L KL     +++ELE+EL + + E+E+++   E+AE  + E K E   L  EL +
Sbjct: 226  ELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEE 285

Query: 156  VDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQ 215
            + +E+ E+  EI +    +   +ER+  ++ +L   ++ L E+++  EA  +++ + ET 
Sbjct: 286  LQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETL 345

Query: 216  LADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLV 275
            L ++ +  AE E                              E    +++KL++  + L 
Sbjct: 346  LEELEQLLAELE------------------------------EAKEELEEKLSALLEELE 375

Query: 276  EVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKR 335
            E              L  +LA++    AE   +                           
Sbjct: 376  E----------LFEALREELAELEAELAEIRNE--------------------------- 398

Query: 336  AGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQN 395
                   L+ + RE +  +++L+    +    + E+K+   E+EE Q  +++L + + + 
Sbjct: 399  -------LEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEEL 451

Query: 396  EASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVE 455
            E  L++ +   +EL  ++   +  +Q L+KEL  +   L   + ++      +    ++E
Sbjct: 452  EEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVR---AVLE 508

Query: 456  NFKKAYSGVYDRMINMCHPVHKRYNVAITKVLGKYMEAIVVDSEKTARLCIQYLKDHQLD 515
              +    GVY  +  +   V ++Y  A+   LG  ++A+VV++E+ A+  I++LK+++  
Sbjct: 509  ALESGLPGVYGPVAELIK-VKEKYETALEAALGNRLQAVVVENEEVAKKAIEFLKENKAG 567

Query: 516  PETFLPIDYLQTKPLKERLRNIRDPKNVKLLYDVLKYQPEDIKRVVLFATNNALVCETPE 575
              TFLP+D ++       L++   P  + L  D++ + P+  +  V F   + LV +  E
Sbjct: 568  RATFLPLDRIKPLR---SLKSDAAPGFLGLASDLIDFDPK-YEPAVRFVLGDTLVVDDLE 623

Query: 576  DAMKVAYDIEPQHRYDAVALDGTFYQKSGIMSGG-----SLDLARKAKRWDDKEMGNLKA 630
             A ++A   + + +Y  V LDG   + SG ++GG     S    ++  +  ++E+  L+A
Sbjct: 624  QARRLAR--KLRIKYRIVTLDGDLVEPSGSITGGSRNKRSSLAQKRELKELEEELAELEA 681

Query: 631  QKEKLSEELREAMKKSRK-ESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDA 689
            Q EKL EEL+    + R  E  L  ++  ++ LE         L+  K ++A LE E++ 
Sbjct: 682  QLEKLEEELKSLKNELRSLEDLLEELRRQLEELER-------QLEELKRELAALEEELEQ 734

Query: 690  LNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAE 749
            L +R +  E +++ +E  +    + +   +EE+ S+E+ + +   +   +   RQ  + E
Sbjct: 735  LQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEE 794

Query: 750  LRSQQE---------RQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAE 800
            L   +E              + +  ++   R E+ + + EEE+   +   ++L  E+   
Sbjct: 795  LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEEL 854

Query: 801  ADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSER 860
              +LE ++      +   + +++E+ +   E   + ++++  +     L+ ++E  +   
Sbjct: 855  EKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERL 914

Query: 861  HDILMNCKMNDIVLPMLRVQKYD-------RKLAKSIQEMTSRLQTIQAPNLRAMEKLEH 913
             ++    +  ++ LP L  +  +        +L + I+ +   ++ +   NLRA+E+ E 
Sbjct: 915  EELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEE 974

Query: 914  AKENLMKTNEEFENARKRAKKAKANFDRIKKERYDKFTRCFEHVS-------NEIDGAGS 966
             +E   +   + E+  +  +K     + + KE+ ++F   F+ ++        E+ G G+
Sbjct: 975  VEERYEELKSQREDLEEAKEKLLEVIEELDKEKRERFKETFDKINENFSEIFKELFGGGT 1034

Query: 967  ESVLPRPFLGPENPEEPLTYRV----------------------STTIVS-----HRYHP 999
                    L    P++PLT  +                      S T ++      +Y P
Sbjct: 1035 AE------LELTEPDDPLTAGIEISARPPGKKLQSLSLLSGGEKSLTALALLFAIQKYRP 1088

Query: 1000 APFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGI 1053
            APF+VLDE+DAALD+ N+ +VA  I   ++++ Q IVI+ ++     AD LVG+
Sbjct: 1089 APFYVLDEVDAALDDANVERVARLIKEMSKET-QFIVITHRKGTMEAADRLVGV 1141



 Score = 40.5 bits (95), Expect = 0.005
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query: 8  IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
          IE+  FKS+     I     FTA++GPNGSG
Sbjct: 6  IELKGFKSFADPTEINFSPGFTAIVGPNGSG 36


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
            archaeal type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. It is found
            in a single copy and is homodimeric in prokaryotes, but
            six paralogs (excluded from this family) are found in
            eukarotes, where SMC proteins are heterodimeric. This
            family represents the SMC protein of archaea and a few
            bacteria (Aquifex, Synechocystis, etc); the SMC of other
            bacteria is described by TIGR02168. The N- and C-terminal
            domains of this protein are well conserved, but the
            central hinge region is skewed in composition and highly
            divergent [Cellular processes, Cell division, DNA
            metabolism, Chromosome-associated proteins].
          Length = 1164

 Score =  181 bits (461), Expect = 2e-46
 Identities = 205/1062 (19%), Positives = 433/1062 (40%), Gaps = 145/1062 (13%)

Query: 83   EKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREK 142
             K ++  EE+  +EVE  + +L   +  I E   +L++ + E EK ER +     +L+EK
Sbjct: 170  RKKEKALEEL--EEVEENIERL---DLIIDEKRQQLERLRREREKAERYQA----LLKEK 220

Query: 143  KK-EQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQA 201
            ++ E   L +E   ++++   ++ ++      L K  E +S ++K+L   ++ L E+ + 
Sbjct: 221  REYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKK 280

Query: 202  NEAHN--------KDIADLETQLADVRKRKAEYERQSIPGRDINLESAQ-DVEINKKRPS 252
             +           + I +LE ++A + +  AE ER+     D     A+ + EI+K    
Sbjct: 281  IKDLGEEEQLRVKEKIGELEAEIASLERSIAEKEREL---EDAEERLAKLEAEIDKLLAE 337

Query: 253  LIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPG 312
            + + +  +   +K+     +   E+++  E    ++ +++ + A+ R    +Y       
Sbjct: 338  IEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYRE----- 392

Query: 313  RDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIK 372
                            K E  KR      +++ + RE    Q++L     +       I 
Sbjct: 393  ----------------KLEKLKR------EINELKRELDRLQEELQRLSEELADLNAAIA 430

Query: 373  KKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIE 432
                ++ E ++  +     I++ E  L+       +   ++   K     ++KEL ++  
Sbjct: 431  GIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQR 490

Query: 433  ELGDAKTDKHEDTRRKKKQELVEN-FKKAYSGVYDRMINMCHPVHKRYNVAITKVLGKYM 491
            EL +A+        R +    VE   K +  GV+   +     V +RY  AI    G  +
Sbjct: 491  ELAEAEAQARASEERVRGGRAVEEVLKASIQGVHG-TVAQLGSVGERYATAIEVAAGNRL 549

Query: 492  EAIVVDSEKTARLCIQYLKDHQLDPETFLPIDYLQTKPLKERLRNIRDPKNVKLLYDVLK 551
              +VV+ +  A+  I+ LK  +    TFLP++ ++ +     L  + +   +    D+++
Sbjct: 550  NNVVVEDDAVAKEAIELLKRRKAGRATFLPLNKMRDERRD--LSILSEDGVIGFAVDLVE 607

Query: 552  YQPEDIKRVVLFATNNALVCETPEDAMKVAYDIEPQHRYDAVALDGTFYQKSGIMSGGS- 610
            + P+  +    +   + LV E  E A ++        +Y  V L+G  ++KSG M+GGS 
Sbjct: 608  FDPK-YEPAFKYVFGDTLVVEDIEAARRLM------GKYRMVTLEGELFEKSGAMTGGSR 660

Query: 611  -----------------------------LDLARKAKRWDDKEMGNLKAQKEKLSEELRE 641
                                         L   +   R  +  +  L  +    S ++ E
Sbjct: 661  APRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGE 720

Query: 642  AMKK-SRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADATEPK 700
              K+  + E E   ++  ++ LE  L+   Q+++N KS++ +LEA I+ L       E  
Sbjct: 721  IEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEA 780

Query: 701  IKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQQERQKIC 760
            +  +EA +      I   + E++ +E+ V R        + +R+ E+   R   E++ + 
Sbjct: 781  LNDLEARL--SHSRIPEIQAELSKLEEEVSRI------EARLREIEQKLNRLTLEKEYLE 832

Query: 761  QDKDTKKN-VARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVD 819
            ++    +      +  +   E+E+    G +E+L  E+      L ++ +     K+  D
Sbjct: 833  KEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERD 892

Query: 820  AMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDILMNCKMNDIVLPMLRV 879
             ++ ++ +  R++  +   I+  +K    L++KLE  + E  +I      ++ +      
Sbjct: 893  ELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELS 952

Query: 880  QKYDRKLAKSIQEMTSRLQTIQAPNLRAMEKLEHAKE---NLMKTNEEFENARKRAKKAK 936
                  +   +Q +   ++ ++  N+ A+++ E   +    L +   + E  RK   +  
Sbjct: 953  L---EDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERI 1009

Query: 937  ANFDRIKK----ERYDKFTRCFEHVSNEIDGAGSESVLPRPFLGPENPEEPLTYRV---- 988
              +++ K+    E ++     F  +  E+ G   E +L       ENP++P    +    
Sbjct: 1010 EEYEKKKREVFMEAFEAINENFNEIFAELSGGTGELIL-------ENPDDPFAGGLELSA 1062

Query: 989  ------------------STTIVS-----HRYHPAPFFVLDEIDAALDNTNIGKVASYIV 1025
                              S T +S      RY P+PF+  DE+D  LD  N+ +VA  I 
Sbjct: 1063 KPKGKPVQRLEAMSGGEKSLTALSFIFAIQRYKPSPFYAFDEVDMFLDGVNVERVAKLIR 1122

Query: 1026 TKTQDSLQTIVISLKEEFFSHADSLVGICPGSVTISSICFGH 1067
             K  ++ Q IV+SL+     +AD  +G+       S +    
Sbjct: 1123 EKAGEA-QFIVVSLRSPMIEYADRAIGVTMRRNGESQVFGLK 1163



 Score = 52.0 bits (125), Expect = 2e-06
 Identities = 51/240 (21%), Positives = 115/240 (47%), Gaps = 8/240 (3%)

Query: 38   GALKDDYERLKAEMIQAE-EETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKE 96
              L++    L+A +  +   E      K +  V+  +    E ++      + +E + KE
Sbjct: 775  HKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKE 834

Query: 97   VEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKV 156
            ++    +    +  IK +E E++   G+ E++E   E+ E  LR+ +   G L +E  ++
Sbjct: 835  IQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDEL 894

Query: 157  DQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQL 216
            + ++RE++ +I +    + K ++R+S ++ KL + ++ L E+ +  +  +++I + E  L
Sbjct: 895  EAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEI-EDPKGEDEEIPEEELSL 953

Query: 217  ADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVE 276
             DV+      E +      +N+ + Q+ E   KR   +K K      + KL   +K+++E
Sbjct: 954  EDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEK------RAKLEEERKAILE 1007



 Score = 49.7 bits (119), Expect = 8e-06
 Identities = 38/196 (19%), Positives = 93/196 (47%), Gaps = 11/196 (5%)

Query: 40  LKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEH 99
           L+ + ++L AE+ + E E      ++  +  E  E K      E+ + +R E+   + E 
Sbjct: 327 LEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELK------EELEDLRAELEEVDKEF 380

Query: 100 QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQE 159
              +        K+  ++L+K K E+ +++R  ++ +  L+   +E   LN  +A ++ +
Sbjct: 381 AETRDEL-----KDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAK 435

Query: 160 IREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADV 219
           I E++ E   K   + K + ++  +   L+  ++ L ++++  +   K+++ L+ +LA+ 
Sbjct: 436 INELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEA 495

Query: 220 RKRKAEYERQSIPGRD 235
             +    E +   GR 
Sbjct: 496 EAQARASEERVRGGRA 511


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
            bacterial type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. This family
            represents the SMC protein of most bacteria. The smc gene
            is often associated with scpB (TIGR00281) and scpA genes,
            where scp stands for segregation and condensation
            protein. SMC was shown (in Caulobacter crescentus) to be
            induced early in S phase but present and bound to DNA
            throughout the cell cycle [Cellular processes, Cell
            division, DNA metabolism, Chromosome-associated
            proteins].
          Length = 1179

 Score =  170 bits (433), Expect = 5e-43
 Identities = 193/1039 (18%), Positives = 446/1039 (42%), Gaps = 142/1039 (13%)

Query: 103  KLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQG------------ALN 150
            KL     ++  LED L++ + +++ +ER+ EKAE   +E K E               L 
Sbjct: 180  KLERTRENLDRLEDILNELERQLKSLERQAEKAE-RYKELKAELRELELALLVLRLEELR 238

Query: 151  RELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIA 210
             EL ++ +E++E + E+ +    L + +E++  ++ +++  ++ + E+++   A   +I+
Sbjct: 239  EELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEIS 298

Query: 211  DLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASA 270
             LE Q   +R+R A  ERQ             + ++ +    L +  E ++ +++KL   
Sbjct: 299  RLEQQKQILRERLANLERQ---------LEELEAQLEELESKLDELAEELAELEEKLEEL 349

Query: 271  KKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKA 330
            K+ L  +    E    ++ +LE++L ++ +   + E                ++   L+ 
Sbjct: 350  KEELESLEAELEELEAELEELESRLEELEE---QLETLR-------------SKVAQLEL 393

Query: 331  EATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKR--IDKL 388
            +      +I +    + R  +  +++L  E+ + ++   E + K  + E  +    +++L
Sbjct: 394  QIASLNNEIERLEARLER-LEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEEL 452

Query: 389  EDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRK 448
            ++ + + E +L++ ++  EE    + +++  + +LQ  L+  +E L +      E  +  
Sbjct: 453  QEELERLEEALEELREELEEAEQALDAAERELAQLQARLD-SLERLQENLEGFSEGVKAL 511

Query: 449  KKQELVENFKKAYSGVYDRMINMCHPVHKRYNVAITKVLGKYMEAIVVDSEKTARLCIQY 508
             K       +   SG+   +  +   V + Y  AI   LG  ++A+VV++   A+  I +
Sbjct: 512  LKN------QSGLSGILGVLSELIS-VDEGYEAAIEAALGGRLQAVVVENLNAAKKAIAF 564

Query: 509  LKDHQLDPETFLPIDYLQ-TKPLKERLRNIRDPKNVKLLYDVLKYQPEDIKRVVLFATNN 567
            LK ++L   TFLP+D ++ T+        +++ +    +   L      +++ + +    
Sbjct: 565  LKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGG 624

Query: 568  ALVCETPEDAMKVAYDIEPQHRYDAVALDGTFYQKSGIMSGGSLD-----LARKAKRWDD 622
             LV +  ++A+++A   + +  Y  V LDG   +  G+++GGS       L R+      
Sbjct: 625  VLVVDDLDNALELA--KKLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILERR------ 676

Query: 623  KEMGNLKAQKEKLSEELREAMKK-SRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIA 681
            +E+  L+ + E+L E++ E  K  +    EL  ++  ++ L   L    + +   +  +A
Sbjct: 677  REIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLA 736

Query: 682  KLEAEIDALNARADATEPKIKAIEASMTARGD-------TISRKKEEMNSVEDIVFRDFC 734
            +LEAE++ L  R      ++  +EA +    +        ++  + E+  +E  + +   
Sbjct: 737  RLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQL-- 794

Query: 735  KSIGVSTIRQYEEAELRSQQERQKICQD----KDTKKNVARWERAVSDDEEELARAQGAE 790
            K    +     +E         ++        +  ++ +A  ER + D EE++       
Sbjct: 795  KEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI 854

Query: 791  EKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLE 850
            E LA E+    + +E + +          +++E +   R E+  ++++++  +     L 
Sbjct: 855  ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELR 914

Query: 851  SKLEMKKSERHDILMNCK--------------------MNDIVLPMLRVQKYDRKLAKSI 890
             +LE  + +   + +  +                    + +      +++  + +  + +
Sbjct: 915  RELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRL 974

Query: 891  QEMTSRLQTIQAPNLRAMEKLEHAKE---NLMKTNEEFENARKRAKKAKANFDRIKKERY 947
            + + ++++ +   NL A+E+ E  KE    L    E+   A++  ++A    DR  +ER 
Sbjct: 975  KRLENKIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEEIDREARER- 1033

Query: 948  DKFTRCFEHVSNEIDGAGSESVLPRPFLGPE------NPEEPLTYRV------------- 988
              F   F+ V+          V P+ F G E      +PE+ L   +             
Sbjct: 1034 --FKDTFDQVNENFQR-----VFPKLFGGGEAELRLTDPEDLLEAGIEIFAQPPGKKNQN 1086

Query: 989  ---------STTIVS-----HRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQT 1034
                     + T ++      +  PAPF +LDE+DA LD+ N+ + A+ +   ++++ Q 
Sbjct: 1087 LSLLSGGEKALTALALLFAIFKVKPAPFCILDEVDAPLDDANVERFANLLKEFSKNT-QF 1145

Query: 1035 IVISLKEEFFSHADSLVGI 1053
            IVI+  +     AD L G+
Sbjct: 1146 IVITHNKGTMEVADQLYGV 1164



 Score = 55.1 bits (133), Expect = 2e-07
 Identities = 71/425 (16%), Positives = 178/425 (41%), Gaps = 65/425 (15%)

Query: 39   ALKDDYERLKAEMIQAEEETN--MSYLKKKGVVAERKEAKIEKDEAEKYQRIREEI--VA 94
             L++    L+  + +  +E       L++     E    +I     +   R+  E+  + 
Sbjct: 688  ELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQIS-ALRKDLARLEAEVEQLE 746

Query: 95   KEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELA 154
            + +     +L   E +I+ELE+ L++ + E+ + E   E+ E  + + K+E  AL   L 
Sbjct: 747  ERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALD 806

Query: 155  KVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLET 214
            ++  E+  ++ E    R       ER+  +++++A+ ++ L ++ +  E  ++DI  L  
Sbjct: 807  ELRAELTLLNEEAANLR-------ERLESLERRIAATERRLEDLEEQIEELSEDIESLAA 859

Query: 215  QLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSL 274
            ++ ++ +   E E +                       L       + +++ LA  +  L
Sbjct: 860  EIEELEELIEELESE-----------------------LEALLNERASLEEALALLRSEL 896

Query: 275  VEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATK 334
             E+ +         ++L  +L ++R++ A+ E +        LE  ++            
Sbjct: 897  EELSEELRELESKRSELRRELEELREKLAQLELR--------LEGLEV------------ 936

Query: 335  RAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQ 394
               +I    + ++ E       L+     + + +++ ++ R  ++  + +I +L      
Sbjct: 937  ---RIDNLQERLSEEY---SLTLEEAEALENKIEDDEEEARRRLKRLENKIKELG---PV 987

Query: 395  NEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELV 454
            N A++++ ++LKE  +  + + K  + E ++ LE+ IEE+     ++ +DT  +  +   
Sbjct: 988  NLAAIEEYEELKERYD-FLTAQKEDLTEAKETLEEAIEEIDREARERFKDTFDQVNENFQ 1046

Query: 455  ENFKK 459
              F K
Sbjct: 1047 RVFPK 1051



 Score = 47.0 bits (112), Expect = 6e-05
 Identities = 46/246 (18%), Positives = 111/246 (45%), Gaps = 32/246 (13%)

Query: 220 RKRKAEYERQSIPGRDINLESAQDV--EINKKRPSLIKSKERVSHIQKKLASAKKSL-VE 276
           ++R+ E ER+ +     NL+  +D+  E+ ++  SL          Q + A   K L  E
Sbjct: 171 KERRKETERK-LERTRENLDRLEDILNELERQLKSL--------ERQAEKAERYKELKAE 221

Query: 277 VRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRA 336
           +R+          +L   +  + + + E E              ++ E      E T   
Sbjct: 222 LREL---------ELALLVLRLEELREELEELQ----------EELKEAEEELEELTAEL 262

Query: 337 GKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNE 396
            ++ ++L+ +  E    +++++   ++     NEI +   + +  ++R+  LE  + + E
Sbjct: 263 QELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELE 322

Query: 397 ASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHE-DTRRKKKQELVE 455
           A L++ +   +EL  ++   + +++EL++ELE +  EL + + +  E ++R ++ +E +E
Sbjct: 323 AQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLE 382

Query: 456 NFKKAY 461
             +   
Sbjct: 383 TLRSKV 388



 Score = 35.8 bits (83), Expect = 0.17
 Identities = 33/199 (16%), Positives = 83/199 (41%), Gaps = 22/199 (11%)

Query: 50  EMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNE- 108
           ++++         L++     E  E+K+++   E    + E++   + E +  +    E 
Sbjct: 305 QILRERLANLERQLEELEAQLEELESKLDELAEEL-AELEEKLEELKEELESLEAELEEL 363

Query: 109 -TDIKELED-------ELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEI 160
             +++ELE        +L+  + +V ++E +     N +   +     L     ++ QEI
Sbjct: 364 EAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEI 423

Query: 161 R------------EMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKD 208
                        E+  E+ +    L + +E +  +++ L   ++ L E  QA +A  ++
Sbjct: 424 EELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERE 483

Query: 209 IADLETQLADVRKRKAEYE 227
           +A L+ +L  + + +   E
Sbjct: 484 LAQLQARLDSLERLQENLE 502



 Score = 30.4 bits (69), Expect = 7.0
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 5  LQYIEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
          L+ +E+  FKS+    +I   K  T ++GPNG G
Sbjct: 2  LKKLELAGFKSFADPTTINFDKGITGIVGPNGCG 35


>gnl|CDD|213242 cd03275, ABC_SMC1_euk, ATP-binding cassette domain of eukaryotic SMC1
            proteins.  The structural maintenance of chromosomes
            (SMC) proteins are large (approximately 110 to 170 kDa),
            and each is arranged into five recognizable domains.
            Amino-acid sequence homology of SMC proteins between
            species is largely confined to the amino- and
            carboxy-terminal globular domains. The amino-terminal
            domain contains a 'Walker A' nucleotide-binding domain
            (GxxGxGKS/T, in the single-letter amino-acid code), which
            by mutational studies has been shown to be essential in
            several proteins. The carboxy-terminal domain contains a
            sequence (the DA-box) that resembles a 'Walker B' motif,
            and a motif with homology to the signature sequence of
            the ATP-binding cassette (ABC) family of ATPases. The
            sequence homology within the carboxy-terminal domain is
            relatively high within the SMC1-SMC4 group, whereas SMC5
            and SMC6 show some divergence in both of these sequences.
            In eukaryotic cells, the proteins are found as
            heterodimers of SMC1 paired with SMC3, SMC2 with SMC4,
            and SMC5 with SMC6 (formerly known as Rad18).
          Length = 247

 Score =  126 bits (319), Expect = 2e-32
 Identities = 43/61 (70%), Positives = 49/61 (80%)

Query: 995  HRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGIC 1054
            H Y PAPFFVLDE+DAALDNTN+GKVASYI  +   + Q IVISLKEEFFS AD+LVG+ 
Sbjct: 173  HSYQPAPFFVLDEVDAALDNTNVGKVASYIREQAGPNFQFIVISLKEEFFSKADALVGVY 232

Query: 1055 P 1055
             
Sbjct: 233  R 233



 Score = 66.4 bits (163), Expect = 6e-12
 Identities = 21/34 (61%), Positives = 28/34 (82%)

Query: 5  LQYIEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
          L+ +E++NFKSYKG+  IGP  +FT +IGPNGSG
Sbjct: 1  LKRLELENFKSYKGRHVIGPFDRFTCIIGPNGSG 34


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
            found at the N terminus of SMC proteins. The SMC
            (structural maintenance of chromosomes) superfamily
            proteins have ATP-binding domains at the N- and
            C-termini, and two extended coiled-coil domains separated
            by a hinge in the middle. The eukaryotic SMC proteins
            form two kind of heterodimers: the SMC1/SMC3 and the
            SMC2/SMC4 types. These heterodimers constitute an
            essential part of higher order complexes, which are
            involved in chromatin and DNA dynamics. This family also
            includes the RecF and RecN proteins that are involved in
            DNA metabolism and recombination.
          Length = 1162

 Score =  130 bits (327), Expect = 2e-30
 Identities = 154/994 (15%), Positives = 360/994 (36%), Gaps = 70/994 (7%)

Query: 121  KKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKER 180
            K+ + E++++  E+ EN+       +    +EL   +Q  + ++    K++  L +    
Sbjct: 169  KRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLL 228

Query: 181  VSHIQKKLASAKKSLVEVRQANEAHN-KDIADLETQLADVRKRKAEYERQSIPGRDINLE 239
                 K        L E+ +  +        +LE +   + +   E + +    +    E
Sbjct: 229  YLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEE 288

Query: 240  SAQ-DVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADV 298
                  E  + +  L+K + R    ++KL  ++K L ++ +  +   ++I +LE +L ++
Sbjct: 289  LKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKEL 348

Query: 299  RKRKAEYERQ-----SIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGD 353
              ++   E +      +  +   LE   + +        +  A    ++L+  N E+K  
Sbjct: 349  EIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEA 408

Query: 354  QDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDV 413
            +  L+   +++   + E K++   +EE ++ ++  +  + + +  L+       +   ++
Sbjct: 409  KLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLEL 468

Query: 414  GSSKNRVQELQKELEQVIEELGDAKTDKHEDT-RRKKKQELVENFKKAYSGVYDRMINMC 472
              S++ ++E +        EL   +    E + +  K +E +              I   
Sbjct: 469  KKSEDLLKETKLVKLLEQLELLLLRQKLEEASQKESKAREGLAVLLALIKDGVGLRIISA 528

Query: 473  HPVHKRYNVAITKVLGKYMEAIVVDSEKTARLCIQYLKDHQLDPETFLPIDYLQTKPLKE 532
                    VA+         A++V+    A    +  K  +   E  LP           
Sbjct: 529  AGRGGDLGVAVANYKVAISVAVIVEVSAVADEVDERQKLVRALTELPLPARTFVLLDPLL 588

Query: 533  RLRNIRDPKNVKLLYDVLKYQPEDIKRVVLFATNNALVCETPEDAMKVAYDIEPQHRYDA 592
            +L               L  Q              A V E      ++   +E      A
Sbjct: 589  KLPLKSIAVLEIDPALNL-AQLNKATLEADEDDKRAKVVEGILKDTELTKLLE-----SA 642

Query: 593  VALDGTFYQKSGIMSGGSLDLARKAKRWDDKEMGNLKAQKEKLSEELREAMKKSRKESEL 652
             A +    +   +  G +     KA   +  +    + + ++ +E      +  R++ E+
Sbjct: 643  KAKESGLRKGVSLEEGLAEKSELKASLSELTKELLAEQELQEKAESELAKNEILRRQEEI 702

Query: 653  NTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADATEPKIKAIEASMTARG 712
               +  IK    +L   +++L   K Q A+ +   +         E + +  ++ +    
Sbjct: 703  KKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKKEE 762

Query: 713  DTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQQERQKICQDKDTKKNVARW 772
            +   + +  +   E     +  K+  +    + EE     ++E + + ++   +  +   
Sbjct: 763  EEEEKSELSLKEKELAEEEE--KTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEE 820

Query: 773  ERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREV 832
            E+ + + EE++   +  E  L  +   + +KL      RL ++   + + +E+     E+
Sbjct: 821  EQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEEL 880

Query: 833  GSIAKDIQAAQKSCVNLESKLEMKKSERHDILMNCKMNDIVLPMLR----VQKYDRKLAK 888
                   +   K     E K E+++  + D L+  K N+I   +      + KY+ +  +
Sbjct: 881  EEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEE 940

Query: 889  S--------IQEMTSRLQTIQ-----------APNLRAMEKLEHAKENLMKTNEEFENAR 929
                      +E  ++ +  +             N+  M   E  ++      +E +  R
Sbjct: 941  LLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKER 1000

Query: 930  KRAKKAKANFDRIKKERYDKFTRCFEHVSNEID---GAGSESVLPRPFLGPENPEEPLTY 986
               +K +   + I++            VS             +     L  E+ ++P + 
Sbjct: 1001 LEEEKKELLREIIEETCQRFKEFLELFVSINRGLNKVFFYLELGGSAELRLEDSDDPFSG 1060

Query: 987  RVSTTIVS---------------------------HRYHPAPFFVLDEIDAALDNTNIGK 1019
             +  +                               +Y PAPF++LDEIDAALD+ N+ +
Sbjct: 1061 GIEISARPPGKGVKNLDNLSGGEKTLVALALIFAIQKYRPAPFYLLDEIDAALDDQNVSR 1120

Query: 1020 VASYIVTKTQDSLQTIVISLKEEFFSHADSLVGI 1053
            VA+Y+   ++++ Q IVISL+EE    AD LVG+
Sbjct: 1121 VANYLKELSKNA-QFIVISLREEMLEKADRLVGV 1153



 Score = 35.3 bits (81), Expect = 0.24
 Identities = 15/31 (48%), Positives = 20/31 (64%)

Query: 8  IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
          IE++ FKSY     +     FTA++GPNGSG
Sbjct: 5  IEIEGFKSYAKTVILPFSPGFTAIVGPNGSG 35


>gnl|CDD|214944 smart00968, SMC_hinge, SMC proteins Flexible Hinge Domain.  This
           entry represents the hinge region of the SMC (Structural
           Maintenance of Chromosomes) family of proteins. The
           hinge region is responsible for formation of the DNA
           interacting dimer. It is also possible that the precise
           structure of it is an essential determinant of the
           specificity of the DNA-protein interaction.
          Length = 120

 Score = 98.1 bits (245), Expect = 4e-24
 Identities = 37/122 (30%), Positives = 68/122 (55%), Gaps = 5/122 (4%)

Query: 462 SGVYDRMINMCHPVHKRYNVAITKVLGKYMEAIVVDSEKTARLCIQYLKDHQLDPETFLP 521
            GV  R+ ++   V  +Y  A+   LG  ++A+VVD+E+TA+  I++LK ++L   TFLP
Sbjct: 1   PGVLGRVADLIS-VDPKYETALEAALGGRLQAVVVDTEETAKKAIEFLKKNRLGRATFLP 59

Query: 522 IDYLQTKPL---KERLRNIRDPKNVKLLYDVLKYQPEDIKRVVLFATNNALVCETPEDAM 578
           +D ++ +     K R   + +P  V    D+++Y PE ++  + +   N LV +  E A 
Sbjct: 60  LDKIKPRSPAGSKLREALLPEPGFVGPAIDLVEYDPE-LRPALEYLLGNTLVVDDLETAR 118

Query: 579 KV 580
           ++
Sbjct: 119 RL 120


>gnl|CDD|219044 pfam06470, SMC_hinge, SMC proteins Flexible Hinge Domain.  This
           family represents the hinge region of the SMC
           (Structural Maintenance of Chromosomes) family of
           proteins. The hinge region is responsible for formation
           of the DNA interacting dimer. It is also possible that
           the precise structure of it is an essential determinant
           of the specificity of the DNA-protein interaction.
          Length = 117

 Score = 96.6 bits (241), Expect = 1e-23
 Identities = 39/119 (32%), Positives = 70/119 (58%), Gaps = 3/119 (2%)

Query: 462 SGVYDRMINMCHPVHKRYNVAITKVLGKYMEAIVVDSEKTARLCIQYLKDHQLDPETFLP 521
            GVY R+ ++   V + Y  A+   LG  ++AIVVD E+TA+  I++LK + L   TFLP
Sbjct: 2   KGVYGRLADLIE-VDEGYEKAVEAALGGRLQAIVVDDEETAKAAIEFLKKNNLGRATFLP 60

Query: 522 IDYLQTKPLKERLRNIRDPKNVKLLYDVLKYQPEDIKRVVLFATNNALVCETPEDAMKV 580
           +D ++ +PL +  R I  P     L D+++Y  E +++ + +   N LV +  ++A+++
Sbjct: 61  LDRIKPRPLDDSPREI--PGGATPLIDLVEYPDEKLRKALRYLLGNTLVVDDLDEALEL 117


>gnl|CDD|213241 cd03274, ABC_SMC4_euk, ATP-binding cassette domain of eukaryotic SMC4
            proteins.  The structural maintenance of chromosomes
            (SMC) proteins are large (approximately 110 to 170 kDa),
            and each is arranged into five recognizable domains.
            Amino-acid sequence homology of SMC proteins between
            species is largely confined to the amino- and
            carboxy-terminal globular domains. The amino-terminal
            domain contains a 'Walker A' nucleotide-binding domain
            (GxxGxGKS/T, in the single-letter amino-acid code), which
            by mutational studies has been shown to be essential in
            several proteins. The carboxy-terminal domain contains a
            sequence (the DA-box) that resembles a 'Walker B' motif,
            and a motif with homology to the signature sequence of
            the ATP-binding cassette (ABC) family of ATPases. The
            sequence homology within the carboxy-terminal domain is
            relatively high within the SMC1-SMC4 group, whereas SMC5
            and SMC6 show some divergence in both of these sequences.
            In eukaryotic cells, the proteins are found as
            heterodimers of SMC1 paired with SMC3, SMC2 with SMC4,
            and SMC5 with SMC6 (formerly known as Rad18).
          Length = 212

 Score = 73.1 bits (180), Expect = 2e-14
 Identities = 32/59 (54%), Positives = 41/59 (69%), Gaps = 1/59 (1%)

Query: 995  HRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGI 1053
            H Y P P +V+DEIDAALD  N+  VA+YI  +T+++ Q IVISL+   F  AD LVGI
Sbjct: 145  HHYKPTPLYVMDEIDAALDFRNVSIVANYIKERTKNA-QFIVISLRNNMFELADRLVGI 202



 Score = 55.0 bits (133), Expect = 3e-08
 Identities = 19/36 (52%), Positives = 27/36 (75%), Gaps = 1/36 (2%)

Query: 4  ILQYIEVDNFKSYKGKFSIGPL-KKFTAVIGPNGSG 38
          I+  + ++NFKSY G+  IGP  K F+A++GPNGSG
Sbjct: 2  IITKLVLENFKSYAGEQVIGPFHKSFSAIVGPNGSG 37


>gnl|CDD|213206 cd03239, ABC_SMC_head, The SMC head domain belongs to the ATP-binding
            cassette superfamily.  The structural maintenance of
            chromosomes (SMC) proteins are essential for successful
            chromosome transmission during replication and
            segregation of the genome in all organisms. SMCs are
            generally present as single proteins in bacteria, and as
            at least six distinct proteins in eukaryotes. The
            proteins range in size from approximately 110 to 170 kDa,
            and each has five distinct domains: amino- and
            carboxy-terminal globular domains, which contain
            sequences characteristic of ATPases, two coiled-coil
            regions separating the terminal domains , and a central
            flexible hinge. SMC proteins function together with other
            proteins in a range of chromosomal transactions,
            including chromosome condensation, sister-chromatid
            cohesion, recombination, DNA repair, and epigenetic
            silencing of gene expression.
          Length = 178

 Score = 66.9 bits (164), Expect = 1e-12
 Identities = 29/59 (49%), Positives = 40/59 (67%)

Query: 995  HRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGI 1053
                P+PF+VLDEIDAALD TN  +V+  I    + + Q IVI+LK+E F +AD L+G+
Sbjct: 112  QEIKPSPFYVLDEIDAALDPTNRRRVSDMIKEMAKHTSQFIVITLKKEMFENADKLIGV 170



 Score = 40.8 bits (96), Expect = 0.001
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 8  IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
          I + NFKSY+ +  +G    F A++GPNGSG
Sbjct: 4  ITLKNFKSYRDETVVGGSNSFNAIVGPNGSG 34


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 71.3 bits (174), Expect = 3e-12
 Identities = 115/483 (23%), Positives = 205/483 (42%), Gaps = 31/483 (6%)

Query: 41   KDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQ 100
            K D  + KAE  +   E   +  +     AE  E K E  E +K +  ++   AK+   +
Sbjct: 1330 KADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389

Query: 101  LFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEI 160
              K      + K+  +E  KK  E++K    K+KA+    +KK E+     E  K  +E 
Sbjct: 1390 KKK----ADEAKKKAEEDKKKADELKKAAAAKKKADEA--KKKAEEKKKADEAKKKAEEA 1443

Query: 161  REMDVEINKKRPSLIKSKE--RVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD 218
            ++ D E  KK     K++E  + +   KK   AKK   E ++A+EA  K  A+   + AD
Sbjct: 1444 KKAD-EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK--AEEAKKKAD 1500

Query: 219  VRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVR 278
              K+ AE ++++   +    E A+  +  KK     K+ E     +KK A   K   E++
Sbjct: 1501 EAKKAAEAKKKADEAK--KAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELK 1558

Query: 279  QANEAHNKDIADLETQLADVRKRKAEYERQSIPGR------------DINLESAQMTEYT 326
            +A E    + A    +  ++  RKAE  +++   R             +  E A+  E  
Sbjct: 1559 KAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618

Query: 327  NLKAEATKRA---GKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQK 383
             +KAE  K+A    K ++QL     E+K   ++L     +      E  KK  E ++  +
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678

Query: 384  RIDKLEDHIRQNEASLK---DNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTD 440
               K E+  ++   +LK   +  K  EEL       K + +EL+K  E+   +  +AK +
Sbjct: 1679 EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKE 1738

Query: 441  KHEDTRRKKKQELVENFKKAYSGVYDRMINMCHPVHKRYNVAITKVLGKYMEAIVVDSEK 500
              ED ++ ++ +  E  KK  + +          + K     I + L +  E   ++ +K
Sbjct: 1739 AEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDK 1798

Query: 501  TAR 503
              +
Sbjct: 1799 KIK 1801



 Score = 63.6 bits (154), Expect = 5e-10
 Identities = 94/447 (21%), Positives = 193/447 (43%), Gaps = 27/447 (6%)

Query: 40   LKDDYERLKAEMIQ--AEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEV 97
             K +  + KA+  +  AEE+      KKK    ++K  +++K  A K +   +E   K  
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA--DEAKKKAE 1428

Query: 98   EHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILR----EKKKEQGALNREL 153
            E +       + +  +  DE  KK  E +K E  K+KAE   +    +KK E+     E 
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEA 1488

Query: 154  AKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANE---AHNKDIA 210
             K  +E ++   E  K   +  K+ E     + K A   K   E ++A+E   A  K  A
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKA 1548

Query: 211  DLETQLADVRK----RKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKK 266
            D   +  +++K    +KAE  +++   +++ L  A++ +  ++       K      + K
Sbjct: 1549 DELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMK 1608

Query: 267  LASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYT 326
               AKK+     +A E    +  + + ++  ++K++AE ++++   +    E+       
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAE--EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666

Query: 327  NLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRID 386
              KAE  K+  +  ++ +   ++      K   E ++  + + +  +++ + EE +K  +
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEE 1726

Query: 387  KLEDHIRQNEASLKDNKKLKEELNSDVGSSK----------NRVQELQKELEQVIEELGD 436
            + +    + +   +++KK  EE   D    K           + +E++KE E VIEE  D
Sbjct: 1727 ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786

Query: 437  AKTDKHEDTRRKKKQELVENFKKAYSG 463
             + +K      KK +++ +NF     G
Sbjct: 1787 EEDEKRRMEVDKKIKDIFDNFANIIEG 1813



 Score = 61.3 bits (148), Expect = 3e-09
 Identities = 95/422 (22%), Positives = 180/422 (42%), Gaps = 26/422 (6%)

Query: 39   ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIRE--EIVAKE 96
            A+K +  R   E+ +AEE+      KK     +  EAK + +EA+K    ++  E   K+
Sbjct: 1271 AIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKK 1330

Query: 97   VEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNR--ELA 154
             +    K    +   +  + E +    E E  E + E AE    E KK+  A  +  E  
Sbjct: 1331 ADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEK 1390

Query: 155  KVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLET 214
            K   E ++   E  KK   L K+       +KK   AKK   E ++A+EA  K     E 
Sbjct: 1391 KKADEAKKKAEEDKKKADELKKAAAA----KKKADEAKKKAEEKKKADEAKKK---AEEA 1443

Query: 215  QLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSL 274
            + AD  K+KAE  +++   +    E+ +  E  KK     +  ++    +KK   AKK  
Sbjct: 1444 KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK----AEEAKKADEAKKKAEEAKKKA 1499

Query: 275  VEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATK 334
             E ++A EA  K     +        +KA+  +++   +    + A+  E    KA+  K
Sbjct: 1500 DEAKKAAEAKKK----ADEAKKAEEAKKADEAKKAEEAK--KADEAKKAEEKK-KADELK 1552

Query: 335  RAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHE---MEEAQKRIDKLEDH 391
            +A ++ +  +    E+    ++  N   ++ +   + ++ R E       +++  K E+ 
Sbjct: 1553 KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA 1612

Query: 392  IRQNEASLK-DNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKK 450
             +  EA +K +  K  EE    V   K +  E +K+ E++ +   + K    E+ ++ ++
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE 1672

Query: 451  QE 452
             +
Sbjct: 1673 DK 1674



 Score = 55.9 bits (134), Expect = 1e-07
 Identities = 92/408 (22%), Positives = 176/408 (43%), Gaps = 32/408 (7%)

Query: 48   KAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHN 107
            + E+ +  E+   +   +K   A++ EA  + +E  K + +R+   A++ E    +    
Sbjct: 1153 RVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAA--RKAEE 1210

Query: 108  ETDIKELEDELDKKKGE-VEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
            E   +E     D KK E V+K E  K+ AE    E KK +   N E  +  +E R     
Sbjct: 1211 ERKAEEARKAEDAKKAEAVKKAEEAKKDAE----EAKKAEEERNNEEIRKFEEARMAHFA 1266

Query: 167  INKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEY 226
               +R + IK++E       + A   K   E ++A+EA   +    E + AD  K+KAE 
Sbjct: 1267 ---RRQAAIKAEE------ARKADELKKAEEKKKADEAKKAE----EKKKADEAKKKAEE 1313

Query: 227  ERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNK 286
             +++   +    E+ +  +  KK+    K     +  + +  +A        +  EA  K
Sbjct: 1314 AKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAA--KAEAEAAADEAEAAEEKAEAAEK 1371

Query: 287  DIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTI 346
               +   + AD  K+KAE ++++   +    E  +  +     A A K+A +  ++ +  
Sbjct: 1372 KKEE-AKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK 1430

Query: 347  NREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLK-DNKKL 405
               +K D+ K   E  ++     E KKK  E ++A++   K E+  + +EA  K +  K 
Sbjct: 1431 ---KKADEAKKKAEEAKKAD---EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484

Query: 406  KEELNSDVGSSKNRVQELQK--ELEQVIEELGDAKTDKHEDTRRKKKQ 451
             +E       +K +  E +K  E ++  +E   A+  K  D  +K ++
Sbjct: 1485 ADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE 1532



 Score = 54.4 bits (130), Expect = 4e-07
 Identities = 113/444 (25%), Positives = 187/444 (42%), Gaps = 51/444 (11%)

Query: 39   ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKI--EKDEAEKYQRIREEIVAKE 96
            A KD  E  KAE  +  EE       +    A R+ A    E  +A++ ++  E+  A E
Sbjct: 1235 AKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADE 1294

Query: 97   VEHQLFKLYHNETDIKELE----DELDKKKGEVEK-IERRKEKAENILREKKKEQGALNR 151
             +    K   +E   K  E    DE  KK  E +K  +  K+KAE    E KK       
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAE----EAKKAA----- 1345

Query: 152  ELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIAD 211
            E AK + E    + E  +++    + K+     +KK  +AKK   E ++A+EA  K    
Sbjct: 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKK--EEAKKKADAAKKKAEEKKKADEAKKK---- 1399

Query: 212  LETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAK 271
                 A+  K+KA+  +++   +    E+ +  E  KK     K  E      KK   AK
Sbjct: 1400 -----AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEE----AKKADEAK 1450

Query: 272  KSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAE 331
            K   E ++A EA  K     E + AD  K+KAE  +++   +    E+ +  +     AE
Sbjct: 1451 KKAEEAKKAEEAKKK---AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAE 1507

Query: 332  ATKRA--------GKILQQLDTINREQKGDQDKLDNELRQ--QVQTQNEIKKKRHEMEEA 381
            A K+A         K   +       +K D+ K   E ++  +++   E+KK   + +  
Sbjct: 1508 AKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAE 1567

Query: 382  QKRI---DKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVI--EELGD 436
            + +    DK     +  EA   +  +++E +       K + +E +K  E  I  EEL  
Sbjct: 1568 EAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK 1627

Query: 437  AKTDKHEDTRRKKKQELVENFKKA 460
            A+ +K +  + KKK+   E  KKA
Sbjct: 1628 AEEEKKKVEQLKKKEA--EEKKKA 1649



 Score = 51.3 bits (122), Expect = 3e-06
 Identities = 92/409 (22%), Positives = 166/409 (40%), Gaps = 35/409 (8%)

Query: 45   ERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKL 104
            E  K E  +AEE       KKK   A + E   + ++A K +  R+   AK VE  + + 
Sbjct: 1102 EAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVE--IARK 1159

Query: 105  YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMD 164
              +    +E     D KK E  +      KAE    E +K + A   E A+  +E R+ +
Sbjct: 1160 AEDARKAEEARKAEDAKKAEAARKAEEVRKAE----ELRKAEDARKAEAARKAEEERKAE 1215

Query: 165  VEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKA 224
                 +     ++ ++    +K    AKK+  E R   E    + A +          KA
Sbjct: 1216 EARKAEDAKKAEAVKKAEEAKKDAEEAKKA-EEERNNEEIRKFEEARMAHFARRQAAIKA 1274

Query: 225  EYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAH 284
            E  R++        +  +  E  KK     K++E+     KK   AKK   E ++A+EA 
Sbjct: 1275 EEARKA--------DELKKAEEKKKADEAKKAEEK-----KKADEAKKKAEEAKKADEAK 1321

Query: 285  NKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLD 344
             K  A+   + AD  K+KAE             E+ +  E    +AEA     +  ++  
Sbjct: 1322 KK--AEEAKKKADAAKKKAE-------------EAKKAAEAAKAEAEAAADEAEAAEEKA 1366

Query: 345  TINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKK 404
                ++K +  K  +  +++ + + +  + + + EE +K+ D+L+      + + +  KK
Sbjct: 1367 EAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKK 1426

Query: 405  LKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQEL 453
             +E+  +D    K    +   E ++  EE   A+  K +    KK  E 
Sbjct: 1427 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEA 1475



 Score = 39.4 bits (91), Expect = 0.012
 Identities = 86/433 (19%), Positives = 181/433 (41%), Gaps = 30/433 (6%)

Query: 40   LKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEH 99
             K   E  KA+  +  EE   +   KK    ++ E K + +EA+K +  +   + K  E 
Sbjct: 1527 AKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEA 1586

Query: 100  QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQE 159
            +  +    E  +K  E+E   K  E +K E  K KAE + + +         E  K  ++
Sbjct: 1587 KKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAE---------EEKKKVEQ 1637

Query: 160  IREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADV 219
            +++ + E  KK   L K++E   +  K    AKK+  + ++A EA   +  + E + A+ 
Sbjct: 1638 LKKKEAEEKKKAEELKKAEE--ENKIKAAEEAKKAEEDKKKAEEAKKAE--EDEKKAAEA 1693

Query: 220  RKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQ 279
             K++AE  +++   +    E  +  E  KK     K K      +K+    KK   E ++
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIK--AEEAKKEAEEDKKKAEEAKK 1751

Query: 280  ANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKI 339
              E   K IA L+ +  + +  +   E++++   +++ E  +     + K +        
Sbjct: 1752 DEEEKKK-IAHLKKE-EEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFAN 1809

Query: 340  LQQLDTINREQKGDQDKL-DNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEAS 398
            + +          D  ++ D+ +++   ++N   ++    E+ +   +         EA 
Sbjct: 1810 IIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEAD 1869

Query: 399  LKDNKKLKEELNSDVGSSKNRVQELQKELEQVIE---------ELGDAKTDKHEDTRR-- 447
                K LKE+   ++  +    +  + ++E+ I          ++ D K DK E  +R  
Sbjct: 1870 FNKEKDLKEDDEEEIEEADEIEKIDKDDIEREIPNNNMAGKNNDIIDDKLDKDEYIKRDA 1929

Query: 448  -KKKQELVENFKK 459
             + ++E+++  KK
Sbjct: 1930 EETREEIIKISKK 1942



 Score = 38.6 bits (89), Expect = 0.024
 Identities = 88/392 (22%), Positives = 170/392 (43%), Gaps = 56/392 (14%)

Query: 39   ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVE 98
            A K +  +   E  +AEE        KK   A++ E K + DE +K + +++    K+ E
Sbjct: 1515 AKKAEEAKKADEAKKAEE-------AKKADEAKKAEEKKKADELKKAEELKKAEEKKKAE 1567

Query: 99   HQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQ 158
                         +E ++   +K  E +K E  + +    L E++K+  A   E AK  +
Sbjct: 1568 EA--------KKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA---EEAKKAE 1616

Query: 159  EIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD 218
            E +    E+ K      + K++V  ++KK A  KK   E+++A E  NK  A  E + A+
Sbjct: 1617 EAKIKAEELKKAE----EEKKKVEQLKKKEAEEKKKAEELKKAEE-ENKIKAAEEAKKAE 1671

Query: 219  VRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVR 278
              K+KAE  +++        +  +  E  KK     +  ++   ++KK A  KK   E++
Sbjct: 1672 EDKKKAEEAKKA------EEDEKKAAEALKKEA---EEAKKAEELKKKEAEEKKKAEELK 1722

Query: 279  QANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGK 338
            +A E +     + + + A+  K+KAE         +   +  +  +  +LK E  K+A +
Sbjct: 1723 KAEEENKIKAEEAKKE-AEEDKKKAE---------EAKKDEEEKKKIAHLKKEEEKKAEE 1772

Query: 339  ILQQLDTINREQKGDQDKLDNELRQQV-QTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEA 397
            I ++ + +  E+    D+ D + R +V +   +I      + E  K          +   
Sbjct: 1773 IRKEKEAVIEEE---LDEEDEKRRMEVDKKIKDIFDNFANIIEGGK----------EGNL 1819

Query: 398  SLKDNKKLKEELNSDVGSSKNRVQELQKELEQ 429
             + D+K++++    +V  SKN   E     E+
Sbjct: 1820 VINDSKEMEDSAIKEVADSKNMQLEEADAFEK 1851



 Score = 37.8 bits (87), Expect = 0.038
 Identities = 84/371 (22%), Positives = 147/371 (39%), Gaps = 33/371 (8%)

Query: 89   REEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGA 148
            +E+  A E   + F         +  + E  +K  E +K      KAE    E +K + A
Sbjct: 1084 KEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAE----EARKAEDA 1139

Query: 149  LNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKD 208
               E A+  ++ +   VEI +K     K++E       K A A +   EVR+A E    +
Sbjct: 1140 RKAEEARKAEDAKR--VEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAE 1197

Query: 209  IADLETQLADVRK----RKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQ 264
                     D RK    RKAE ER++   R    E A+  E  KK     K  E     +
Sbjct: 1198 ---------DARKAEAARKAEEERKAEEAR--KAEDAKKAEAVKKAEEAKKDAE-----E 1241

Query: 265  KKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTE 324
             K A  +++  E+R+  EA     A  +  +     RKA+  +++   +    + A+  E
Sbjct: 1242 AKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKK--KADEAKKAE 1299

Query: 325  YTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKR 384
                KA+  K+  +  ++ D   ++ +  + K D   ++  + +   +  + E E A   
Sbjct: 1300 EKK-KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358

Query: 385  IDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQV---IEELGDAKTDK 441
             +  E+     E   ++ KK  +         K +  E +K+ E+     +EL  A   K
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKK-AEEKKKADEAKKKAEEDKKKADELKKAAAAK 1417

Query: 442  HEDTRRKKKQE 452
             +    KKK E
Sbjct: 1418 KKADEAKKKAE 1428



 Score = 34.0 bits (77), Expect = 0.52
 Identities = 43/222 (19%), Positives = 91/222 (40%), Gaps = 12/222 (5%)

Query: 629  KAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEID 688
             A+ +K ++E ++A +  +K  E    +   K  E +     +  +  K+   K   E+ 
Sbjct: 1505 AAEAKKKADEAKKAEEA-KKADEAKKAEEAKKADEAK-----KAEEKKKADELKKAEELK 1558

Query: 689  ALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEA 748
                +  A E K    + +M  R    ++K EE    E +   +  K +     ++ EEA
Sbjct: 1559 KAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618

Query: 749  ELRSQQ------ERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEAD 802
            ++++++      E++K+ Q K  +    +    +   EEE       E K A E + +A+
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678

Query: 803  KLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQK 844
            + +          +A+    EE  KA       A++ + A++
Sbjct: 1679 EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEE 1720


>gnl|CDD|213245 cd03278, ABC_SMC_barmotin, ATP-binding cassette domain of barmotin, a
            member of the SMC protein family.  Barmotin is a tight
            junction-associated protein expressed in rat epithelial
            cells which is thought to have an important regulatory
            role in tight junction barrier function. Barmotin belongs
            to the SMC protein family. SMC proteins are large
            (approximately 110 to 170 kDa), and each is arranged into
            five recognizable domains. Amino-acid sequence homology
            of SMC proteins between species is largely confined to
            the amino- and carboxy-terminal globular domains. The
            amino-terminal domain contains a 'Walker A'
            nucleotide-binding domain (GxxGxGKS/T, in the
            single-letter amino-acid code), which by mutational
            studies has been shown to be essential in several
            proteins. The carboxy-terminal domain contains a sequence
            (the DA-box) that resembles a 'Walker B' motif, and a
            motif with homology to the signature sequence of the
            ATP-binding cassette (ABC) family of ATPases. The
            sequence homology within the carboxy-terminal domain is
            relatively high within the SMC1-SMC4 group, whereas SMC5
            and SMC6 show some divergence in both of these sequences.
            In eukaryotic cells, the proteins are found as
            heterodimers of SMC1 paired with SMC3, SMC2 with SMC4,
            and SMC5 with SMC6 (formerly known as Rad18).
          Length = 197

 Score = 65.2 bits (160), Expect = 8e-12
 Identities = 23/59 (38%), Positives = 37/59 (62%), Gaps = 1/59 (1%)

Query: 996  RYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGIC 1054
            R  P+PF VLDE+DAALD+ N+ + A  +   ++++ Q IVI+ ++     AD L G+ 
Sbjct: 132  RVRPSPFCVLDEVDAALDDANVERFARLLKEFSKET-QFIVITHRKGTMEAADRLYGVT 189



 Score = 43.6 bits (104), Expect = 1e-04
 Identities = 14/31 (45%), Positives = 20/31 (64%)

Query: 8  IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
          +E+  FKS+  K +I      TA++GPNGSG
Sbjct: 4  LELKGFKSFADKTTIPFPPGLTAIVGPNGSG 34


>gnl|CDD|213240 cd03273, ABC_SMC2_euk, ATP-binding cassette domain of eukaryotic SMC2
            proteins.  The structural maintenance of chromosomes
            (SMC) proteins are large (approximately 110 to 170 kDa),
            and each is arranged into five recognizable domains.
            Amino-acid sequence homology of SMC proteins between
            species is largely confined to the amino- and
            carboxy-terminal globular domains. The amino-terminal
            domain contains a 'Walker A' nucleotide-binding domain
            (GxxGxGKS/T, in the single-letter amino-acid code), which
            by mutational studies has been shown to be essential in
            several proteins. The carboxy-terminal domain contains a
            sequence (the DA-box) that resembles a 'Walker B' motif,
            and a motif with homology to the signature sequence of
            the ATP-binding cassette (ABC) family of ATPases. The
            sequence homology within the carboxy-terminal domain is
            relatively high within the SMC1-SMC4 group, whereas SMC5
            and SMC6 show some divergence in both of these sequences.
            In eukaryotic cells, the proteins are found as
            heterodimers of SMC1 paired with SMC3, SMC2 with SMC4,
            and SMC5 with SMC6 (formerly known as Rad18).
          Length = 251

 Score = 60.0 bits (146), Expect = 1e-09
 Identities = 28/58 (48%), Positives = 40/58 (68%), Gaps = 7/58 (12%)

Query: 996  RYHPAPFFVLDEIDAALD--NT-NIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSL 1050
             + PAP ++LDE+DAALD  +T NIG++   I T  + S Q IV+SLKE  F++A+ L
Sbjct: 185  LFKPAPMYILDEVDAALDLSHTQNIGRM---IKTHFKGS-QFIVVSLKEGMFNNANVL 238



 Score = 34.6 bits (80), Expect = 0.18
 Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 1/32 (3%)

Query: 8  IEVDNFKSYKGKFSIGPL-KKFTAVIGPNGSG 38
          I +D FKSY  +  I     +F A+ G NGSG
Sbjct: 6  IILDGFKSYATRTVISGFDPQFNAITGLNGSG 37


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 61.2 bits (149), Expect = 2e-09
 Identities = 92/431 (21%), Positives = 207/431 (48%), Gaps = 54/431 (12%)

Query: 40  LKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEH 99
           L++  ERL+ E+ +  +E      + +      +EAK +K+E E+ ++    +  +++E 
Sbjct: 333 LEEKEERLE-ELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEK 391

Query: 100 QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQE 159
           +L +L   + +I+E   ++  + GE++K  +  +KA   L++ K +     REL +  ++
Sbjct: 392 ELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRK 451

Query: 160 --IREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA 217
             + E   E+ +    L + +E+   ++K+L   +K L +  +  +   K++A+   QL 
Sbjct: 452 ELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIK--LKELAE---QLK 506

Query: 218 DVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEV 277
           ++ ++  +Y          NLE     E+ KK     K KE++  ++ ++ S KK L ++
Sbjct: 507 ELEEKLKKY----------NLE-----ELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKL 551

Query: 278 RQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAG 337
               E   K +A+LE +L ++ +  AE  +                E   L  E+ +   
Sbjct: 552 ----EELKKKLAELEKKLDELEEELAELLK----------------ELEELGFESVEELE 591

Query: 338 KILQQLDTINR---EQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQ 394
           + L++L+       E K  + +L+ E ++  + + E+ K   E+ E +KR+++L   + +
Sbjct: 592 ERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEE 651

Query: 395 -----NEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKK 449
                +E   ++ ++   EL+ ++   +  ++EL+K  E++ + L   K +  E   R+K
Sbjct: 652 LEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEE---REK 708

Query: 450 KQELVENFKKA 460
            ++ +E  +KA
Sbjct: 709 AKKELEKLEKA 719



 Score = 54.3 bits (131), Expect = 3e-07
 Identities = 58/347 (16%), Positives = 164/347 (47%), Gaps = 8/347 (2%)

Query: 112 KELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKR 171
            E+  E+ ++   +EK  +R E  E +++EK+KE   + RE+ ++  E+ E+  E+ K  
Sbjct: 168 GEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLE 227

Query: 172 PSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSI 231
               K  + +  +++++   +K L  +  +     + I +LE ++ +++K   E E +  
Sbjct: 228 ----KEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVK 283

Query: 232 PGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADL 291
             +++  ++ + +++++     +     +     +L      + E  +  E   + + +L
Sbjct: 284 ELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEEL 343

Query: 292 ETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQK 351
           + +L ++ KR  E E +     +   +  ++          T    K+ ++L+ + + ++
Sbjct: 344 KKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTP--EKLEKELEELEKAKE 401

Query: 352 GDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
             ++++     +  + + EIK+ +  +EE +K   K +  +   E + +  K+L EE  +
Sbjct: 402 EIEEEISKITARIGELKKEIKELKKAIEELKKA--KGKCPVCGRELTEEHRKELLEEYTA 459

Query: 412 DVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFK 458
           ++   +  ++E++++  ++ +EL + +    +++   K +EL E  K
Sbjct: 460 ELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLK 506



 Score = 54.3 bits (131), Expect = 3e-07
 Identities = 84/409 (20%), Positives = 184/409 (44%), Gaps = 42/409 (10%)

Query: 72  RKEAKIEKDEAEKYQRIREEI--VAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIE 129
            KE K   +  EK+ +  E I  + KE E +L ++     +I     EL ++  ++EK  
Sbjct: 171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEV 230

Query: 130 RRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLA 189
           +  E+ +  + E +KE  +L     K++++IRE++  I + +  + + +E+V  + K+L 
Sbjct: 231 KELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKEL-KELK 289

Query: 190 SAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKK 249
              +  +++ +  E +  ++ ++E +L+ + +     E +    +++  +  +  E+ KK
Sbjct: 290 EKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERI---KELEEKEERLEELKKK 346

Query: 250 RPSLIKSKERVSHIQKKLASAKKSLVEVRQ-ANEAHNKDIADLETQLADVRKRKAEYERQ 308
              L K  E +    +    AK    E+ +            LE +L ++ K K E E +
Sbjct: 347 LKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEE 406

Query: 309 --SIPGRDINLES-----------------------AQMTEY--TNLKAEAT---KRAGK 338
              I  R   L+                         ++TE     L  E T   KR  K
Sbjct: 407 ISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEK 466

Query: 339 ILQQLDTINREQKGDQDKLDNELRQQ--VQTQNEIKKKRHEMEEAQKRID--KLEDHIRQ 394
            L++++   R+ + +  +L+  L+++  +    E+ ++  E+EE  K+ +  +LE    +
Sbjct: 467 ELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEE 526

Query: 395 NEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHE 443
            E   +   KLK E+ S +     +++EL+K+L ++ ++L + + +  E
Sbjct: 527 YEKLKEKLIKLKGEIKS-LKKELEKLEELKKKLAELEKKLDELEEELAE 574



 Score = 50.8 bits (122), Expect = 4e-06
 Identities = 81/435 (18%), Positives = 204/435 (46%), Gaps = 33/435 (7%)

Query: 40  LKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEH 99
           L+    +L+ ++ + EE       + + +  + KE K  K++AE+Y ++ E    +E   
Sbjct: 250 LEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFY--EEYLD 307

Query: 100 QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQE 159
           +L ++    + ++E  + ++++  E+E+ E R E+ +  L+E +K    L  E  ++ +E
Sbjct: 308 ELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEEL-EERHELYEE 366

Query: 160 IREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADV 219
            +    E+ + +  L  +      ++K+L   +K+  E+ +        I +L+ ++ ++
Sbjct: 367 AKAKKEELERLKKRL--TGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKEL 424

Query: 220 RKRKAEYERQS----IPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLV 275
           +K   E ++      + GR++  E  +++ + +    L + ++ +  I++K    +K L 
Sbjct: 425 KKAIEELKKAKGKCPVCGRELTEEHRKEL-LEEYTAELKRIEKELKEIEEKERKLRKELR 483

Query: 276 EVRQANEAHNKDIADLET--QLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEAT 333
           E+ +  +  ++ I   E   QL ++ ++  +Y  + +          +  EY  LK +  
Sbjct: 484 ELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEEL--------EKKAEEYEKLKEKLI 535

Query: 334 KRAGKILQQLDTINREQ--KGDQDKLDNELRQQVQTQNEIKKKRHEM-----EEAQKRID 386
           K  G+I      + + +  K    +L+ +L +  +   E+ K+  E+     EE ++R+ 
Sbjct: 536 KLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLK 595

Query: 387 KLED------HIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTD 440
           +LE        ++  E  L+  +K  ++L  ++  +   + E +K LE++ +EL + +  
Sbjct: 596 ELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKK 655

Query: 441 KHEDTRRKKKQELVE 455
             E+   + ++E +E
Sbjct: 656 YSEEEYEELREEYLE 670



 Score = 49.7 bits (119), Expect = 8e-06
 Identities = 88/426 (20%), Positives = 194/426 (45%), Gaps = 64/426 (15%)

Query: 41  KDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEK--YQRIREEIVAKEVE 98
            ++ E L  E  +  EE  +  + +          ++EK E E    + ++EEI   E E
Sbjct: 188 TENIEELIKEKEKELEEV-LREINEISSELPELREELEKLEKEVKELEELKEEIEELEKE 246

Query: 99  HQLFK--LYHNETDIKELED----------ELDKKKGEVEKIERRKEKAENILREKKKEQ 146
            +  +      E  I+ELE+          EL++K  E+++++ + E+   +    ++  
Sbjct: 247 LESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYL 306

Query: 147 GALNR---ELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANE 203
             L      L+++++EI  ++  I +    L + +ER+  ++KKL   +K L E+ + +E
Sbjct: 307 DELREIEKRLSRLEEEINGIEERIKE----LEEKEERLEELKKKLKELEKRLEELEERHE 362

Query: 204 AHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHI 263
            +  +    + +L  ++KR      + +      LE A++ EI ++   +     R+  +
Sbjct: 363 LYE-EAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKE-EIEEEISKI---TARIGEL 417

Query: 264 QKKLASAKKSLVEVRQA-----------NEAHNKDI-ADLETQLADVRKRKAEYERQSIP 311
           +K++   KK++ E+++A            E H K++  +   +L  + K   E E +   
Sbjct: 418 KKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKE-- 475

Query: 312 GRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQN-- 369
                           L+ E  +    + ++ + I  ++  +Q K   EL ++++  N  
Sbjct: 476 --------------RKLRKELRELEKVLKKESELIKLKELAEQLK---ELEEKLKKYNLE 518

Query: 370 EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQ 429
           E++KK  E E+ ++++ KL+  I+    SLK   +  EEL   +   + ++ EL++EL +
Sbjct: 519 ELEKKAEEYEKLKEKLIKLKGEIK----SLKKELEKLEELKKKLAELEKKLDELEEELAE 574

Query: 430 VIEELG 435
           +++EL 
Sbjct: 575 LLKELE 580



 Score = 48.5 bits (116), Expect = 2e-05
 Identities = 73/322 (22%), Positives = 148/322 (45%), Gaps = 23/322 (7%)

Query: 121 KKGEVEKIERRKEKAENILREKKKEQGALNRE-LAKVDQEIREMDVEINKKRPSLIKSKE 179
           ++GE++ I    E  E ++R+       L  +      + + E+  EI ++   L K  +
Sbjct: 133 RQGEIDAILESDESREKVVRQ------ILGLDDYENAYKNLGEVIKEIKRRIERLEKFIK 186

Query: 180 RVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLE 239
           R  +I++ +   +K L EV +     + ++ +L  +L  + K   E E            
Sbjct: 187 RTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELK------EEI 240

Query: 240 SAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVR 299
              + E+     S  K +E++  +++++   KK + E+    E   K++ +L+ +  +  
Sbjct: 241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEEL----EEKVKELKELKEKAEEYI 296

Query: 300 KRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDN 359
           K    YE      R+I    +++ E  N   E  K   +  ++L+ + ++ K  + +L  
Sbjct: 297 KLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRL-E 355

Query: 360 ELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNR 419
           EL ++ +   E K K+ E+E  +KR+  L     + E  L++ +K KEE+  ++     R
Sbjct: 356 ELEERHELYEEAKAKKEELERLKKRLTGLT--PEKLEKELEELEKAKEEIEEEISKITAR 413

Query: 420 VQELQ---KELEQVIEELGDAK 438
           + EL+   KEL++ IEEL  AK
Sbjct: 414 IGELKKEIKELKKAIEELKKAK 435



 Score = 47.4 bits (113), Expect = 4e-05
 Identities = 82/405 (20%), Positives = 184/405 (45%), Gaps = 52/405 (12%)

Query: 45  ERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKL 104
           E+ K E+ +   +      + K  + E K+A  E  +A+    +    + +E   +L + 
Sbjct: 397 EKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEE 456

Query: 105 YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKK--EQGALNRELAKVDQEIRE 162
           Y  E  +K +E EL + + +  K+ +   + E +L+++ +  +   L  +L ++++++++
Sbjct: 457 YTAE--LKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKK 514

Query: 163 MDVE-INKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRK 221
            ++E + KK     K KE++  ++ ++ S KK L ++    E   K +A+LE +L ++ +
Sbjct: 515 YNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKL----EELKKKLAELEKKLDELEE 570

Query: 222 RKAEYERQSIPGRDINLESAQDVE--------INKKRPSLIKSKERVSHIQKKLASAKKS 273
             AE  ++     ++  ES +++E           +   L  +++ +   +K+L   ++ 
Sbjct: 571 ELAELLKEL---EELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEE 627

Query: 274 LVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEAT 333
           L +  +      K + +L  +L ++ K+ +E E + +    + L      E   L+AE  
Sbjct: 628 LDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSR----ELAGLRAE-- 681

Query: 334 KRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIR 393
                 L++L+    E K   +KL  EL               E E+A+K ++KLE  + 
Sbjct: 682 ------LEELEKRREEIKKTLEKLKEELE--------------EREKAKKELEKLEKALE 721

Query: 394 QNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAK 438
           + E   +  KK K  L       +  + ++ +   ++ EEL + K
Sbjct: 722 RVEELREKVKKYKALL------KERALSKVGEIASEIFEELTEGK 760



 Score = 44.7 bits (106), Expect = 3e-04
 Identities = 43/222 (19%), Positives = 100/222 (45%), Gaps = 14/222 (6%)

Query: 245 EINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 304
           EI ++   L K  +R  +I++ +   +K L EV +     + ++ +L  +L  + K   E
Sbjct: 173 EIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKE 232

Query: 305 YERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQ 364
            E              ++ E             K+ +++  +    +  + +++ EL ++
Sbjct: 233 LEELK----------EEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIE-ELEEK 281

Query: 365 VQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNE---ASLKDNKKLKEELNSDVGSSKNRVQ 421
           V+   E+K+K  E  +  +  ++  D +R+ E   + L++     EE   ++   + R++
Sbjct: 282 VKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLE 341

Query: 422 ELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAYSG 463
           EL+K+L+++ + L + +         K K+E +E  KK  +G
Sbjct: 342 ELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTG 383



 Score = 33.9 bits (78), Expect = 0.64
 Identities = 39/180 (21%), Positives = 88/180 (48%), Gaps = 23/180 (12%)

Query: 628 LKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEI 687
           +K ++++L E LRE  + S +  EL   +  ++ LE  +    ++L+  K +I +LE E+
Sbjct: 195 IKEKEKELEEVLREINEISSELPEL---REELEKLEKEV----KELEELKEEIEELEKEL 247

Query: 688 DALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEE 747
           ++L       E KI+ +E       + I   K+E+  +E+ V         +  +++  E
Sbjct: 248 ESLEGSKRKLEEKIRELE-------ERIEELKKEIEELEEKVKE-------LKELKEKAE 293

Query: 748 AELRSQQERQKICQDK-DTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLEN 806
             ++  +  ++   +  + +K ++R E  ++  EE +   +  EE+L  E++ +  +LE 
Sbjct: 294 EYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERL-EELKKKLKELEK 352



 Score = 32.3 bits (74), Expect = 1.6
 Identities = 71/362 (19%), Positives = 154/362 (42%), Gaps = 28/362 (7%)

Query: 623 KEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAK 682
           KE+   + + E+L ++L+E  K+  +  E + +    K  +  L   ++ L        K
Sbjct: 331 KELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTG--LTPEK 388

Query: 683 LEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVED------IVFRDFCKS 736
           LE E++ L    +  E +I  I A +      I   K+ +  ++       +  R+  + 
Sbjct: 389 LEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEE 448

Query: 737 IGVSTIRQYEEAELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGE 796
                + +Y     R ++E ++I + +   +   R    V   E EL +     ++LA +
Sbjct: 449 HRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIK----LKELAEQ 504

Query: 797 MRAEADKLENMRATRLTKK-QAVDAMDEEIGKARREVGSIAKDI---QAAQKSCVNLESK 852
           ++   +KL+      L KK +  + + E++ K + E+ S+ K++   +  +K    LE K
Sbjct: 505 LKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKK 564

Query: 853 L---EMKKSERHDILMNCKMNDIVLPMLRVQK----YDRKLAKSIQEMTSRLQTIQAPNL 905
           L   E + +E    L       +     R+++    Y+  L   +++    L+  +    
Sbjct: 565 LDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYL--ELKDAEKELEREEKELK 622

Query: 906 RAMEKLEHAKENLMKTNEEFENARKRAKKAKANFDRIKKERYDKFTRCFEHVSNEIDGAG 965
           +  E+L+ A E L +T +  E  RK  ++ +  +    +E Y++    +  +S E+ G  
Sbjct: 623 KLEEELDKAFEELAETEKRLEELRKELEELEKKYS---EEEYEELREEYLELSRELAGLR 679

Query: 966 SE 967
           +E
Sbjct: 680 AE 681



 Score = 32.0 bits (73), Expect = 1.9
 Identities = 37/141 (26%), Positives = 70/141 (49%), Gaps = 16/141 (11%)

Query: 39  ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVE 98
             + + ++L+ E+ +A EE   +  + + +  E +E + +  E E+Y+ +REE +    E
Sbjct: 616 REEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE-EEYEELREEYLELSRE 674

Query: 99  H------------QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQ 146
                        +  ++      +KE  +E +K K E+EK+E+  E+ E  LREK K+ 
Sbjct: 675 LAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEE-LREKVKKY 733

Query: 147 GALNRELAKVDQEIREMDVEI 167
            AL +E A    ++ E+  EI
Sbjct: 734 KALLKERAL--SKVGEIASEI 752



 Score = 30.8 bits (70), Expect = 4.4
 Identities = 45/234 (19%), Positives = 96/234 (41%), Gaps = 18/234 (7%)

Query: 623 KEMGNLKAQKEKLSEELREAMKKSRKESELNTVQST---IKGLEIRLN--------YSRQ 671
           KE+  ++ ++ KL +ELRE  K  +KESEL  ++     +K LE +L            +
Sbjct: 466 KELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAE 525

Query: 672 DLQNTKSQIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFR 731
           + +  K ++ KL+ EI +L    +  E   K +        +      E +  +E++ F 
Sbjct: 526 EYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFE 585

Query: 732 DFCKSIGVSTIRQYEEAELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEE 791
              +      +++ E       + +    + +  +K + + E  +    EELA  +   E
Sbjct: 586 SVEEL--EERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLE 643

Query: 792 KLAGEMRA-----EADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQ 840
           +L  E+         ++ E +R   L   + +  +  E+ +  +    I K ++
Sbjct: 644 ELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLE 697



 Score = 30.0 bits (68), Expect = 8.0
 Identities = 26/137 (18%), Positives = 62/137 (45%), Gaps = 17/137 (12%)

Query: 343 LDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASL--- 399
           L  + +E K   ++L+  +++    +  IK+K  E+EE  + I+++   + +    L   
Sbjct: 167 LGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKL 226

Query: 400 ----KDNKKLKEELNS----------DVGSSKNRVQELQKELEQVIEELGDAKTDKHEDT 445
               K+ ++LKEE+                 + +++EL++ +E++ +E+ + +    E  
Sbjct: 227 EKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELK 286

Query: 446 RRKKKQELVENFKKAYS 462
             K+K E      + Y 
Sbjct: 287 ELKEKAEEYIKLSEFYE 303


>gnl|CDD|213194 cd03227, ABC_Class2, ATP-binding cassette domain of non-transporter
            proteins.  ABC-type Class 2 contains systems involved in
            cellular processes other than transport. These families
            are characterized by the fact that the ABC subunit is
            made up of duplicated, fused ABC modules (ABC2). No known
            transmembrane proteins or domains are associated with
            these proteins.
          Length = 162

 Score = 54.7 bits (132), Expect = 2e-08
 Identities = 20/59 (33%), Positives = 27/59 (45%)

Query: 995  HRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGI 1053
                P P ++LDEID  LD  +   +A  I+       Q IVI+   E    AD L+ I
Sbjct: 95   ASLKPRPLYILDEIDRGLDPRDGQALAEAILEHLVKGAQVIVITHLPELAELADKLIHI 153



 Score = 31.6 bits (72), Expect = 1.1
 Identities = 11/32 (34%), Positives = 14/32 (43%), Gaps = 1/32 (3%)

Query: 8  IEVDNFKSYKGKFSI-GPLKKFTAVIGPNGSG 38
          I +  F SY     +       T + GPNGSG
Sbjct: 2  IVLGRFPSYFVPNDVTFGEGSLTIITGPNGSG 33


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 55.8 bits (135), Expect = 1e-07
 Identities = 75/413 (18%), Positives = 175/413 (42%), Gaps = 48/413 (11%)

Query: 77  IEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAE 136
           +  D+     +++ +I  KE +    +L   E+++ EL++E+++ + + E+    +++A+
Sbjct: 181 VLSDQRGSLDQLKAQIEEKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEAD 240

Query: 137 NIL---REKKKEQGALNRELAKVDQEIRE-------MDVEINKKRPSLIKSKERVSHI-- 184
            +L    E+++E   L  E+  + + I E       +  E+   R  L + +E    +  
Sbjct: 241 EVLEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLA 300

Query: 185 -------------------QKKLASAKKSLVEVRQANEAHNK-------DIADLETQLAD 218
                              + +    +  L E R A +AHN+       D  DLE +  +
Sbjct: 301 EAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEE 360

Query: 219 VRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVR 278
           +R+  AE E +    R+   +  +++E  ++   + + +ER       L +A+  L E+R
Sbjct: 361 LREEAAELESELEEAREAVEDRREEIEELEEE--IEELRERFGDAPVDLGNAEDFLEELR 418

Query: 279 QANEAHNKDIADLETQLADVRKRKAEYERQSIPGR----DINLESAQMTEYTNLKAEATK 334
           +  +   +  A+LE  L   R+R  E E     G+       +E +   E      E  +
Sbjct: 419 EERDELREREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVETI---EEDRE 475

Query: 335 RAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQ 394
           R  ++  +L+ +  E+  + ++        V+ ++ I++     E+ ++ I +  + I +
Sbjct: 476 RVEELEAELEDL-EEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEE 534

Query: 395 NEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRR 447
                ++ ++   EL ++    +    E ++E E+  EE+ +  +   E   R
Sbjct: 535 KRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKER 587



 Score = 54.7 bits (132), Expect = 2e-07
 Identities = 78/380 (20%), Positives = 178/380 (46%), Gaps = 37/380 (9%)

Query: 72  RKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERR 131
           R++A   ++ AE+ +    E+   E+E     +     +I+ELE+E+++ +         
Sbjct: 348 REDADDLEERAEELREEAAEL-ESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVD 406

Query: 132 KEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASA 191
              AE+ L E ++E+  L    A+++  +R     + +    L   K        + +  
Sbjct: 407 LGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPH 466

Query: 192 KKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRP 251
            +++ E R+  E    ++ DLE ++ +V +R               LE A+D+   + R 
Sbjct: 467 VETIEEDRERVEELEAELEDLEEEVEEVEER---------------LERAEDLVEAEDR- 510

Query: 252 SLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIP 311
            + + +ER   +++ +A  ++++ E R+  E   +  A+LE +  + R+  AE E ++  
Sbjct: 511 -IERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEE 569

Query: 312 GRDINLESAQMTEYTNLKAEATKRAGKILQQLDTIN---------REQKGDQDKLDNELR 362
            R+   E A++        E  +   +I   L  I          RE++    +L++E R
Sbjct: 570 ARE---EVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKREALAELNDERR 626

Query: 363 QQVQTQNEIKKKRHE------MEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSS 416
           +++  + E K++         +EEA++  ++ E+++ Q E  L + ++ +++L +++G+ 
Sbjct: 627 ERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAV 686

Query: 417 KNRVQELQKELEQVIEELGD 436
           +N ++EL +EL +  E L +
Sbjct: 687 ENELEEL-EELRERREALEN 705



 Score = 52.3 bits (126), Expect = 1e-06
 Identities = 85/448 (18%), Positives = 194/448 (43%), Gaps = 48/448 (10%)

Query: 42  DDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQL 101
           +++E  + E+   E E             ER+E     +E    +   EE+  +E +  L
Sbjct: 244 EEHEERREELETLEAEIEDLRETIAETEREREELA---EEVRDLRERLEEL-EEERDDLL 299

Query: 102 FKLYHNETDIKELE---DELDKKKGEVE---------------KIERRKEKAENI---LR 140
            +   ++ D + +E   +EL+ +  E+                + E  +E A+++     
Sbjct: 300 AEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAE 359

Query: 141 EKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQ 200
           E ++E   L  EL +  + + +   EI +    + + +ER       L +A+  L E+R+
Sbjct: 360 ELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELRE 419

Query: 201 ANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERV 260
             +   +  A+LE  L   R+R  E E     G+    E  Q VE +    ++ + +ERV
Sbjct: 420 ERDELREREAELEATLRTARERVEEAEALLEAGK--CPECGQPVEGSPHVETIEEDRERV 477

Query: 261 SHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYE------RQSIPGRD 314
             ++ +L   ++ + EV +  E   +D+ + E ++  + +R+ + E      R++I  + 
Sbjct: 478 EELEAELEDLEEEVEEVEERLER-AEDLVEAEDRIERLEERREDLEELIAERRETIEEKR 536

Query: 315 INLES---------AQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQV 365
              E          A+  E     AEA + A +  +++  +N +    ++++++ L +  
Sbjct: 537 ERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIES-LERIR 595

Query: 366 QTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQK 425
                I     E+E  +++ + L +   +    L + ++ K EL ++    + R++E ++
Sbjct: 596 TLLAAIADAEDEIERLREKREALAELNDERRERLAEKRERKRELEAEF--DEARIEEARE 653

Query: 426 ELEQVIEELGDAKTDKHEDTRRKKKQEL 453
           + E+  E L   + ++  D  R+++ +L
Sbjct: 654 DKERAEEYL--EQVEEKLDELREERDDL 679



 Score = 35.8 bits (83), Expect = 0.16
 Identities = 51/227 (22%), Positives = 90/227 (39%), Gaps = 49/227 (21%)

Query: 637 EELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADA 696
           EE RE +     E+E+  ++ TI   E       +++++ + ++ +LE E D L A A  
Sbjct: 247 EERREEL--ETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGL 304

Query: 697 TEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQQER 756
            +   +A+E    AR + +  +                           EE   R ++ R
Sbjct: 305 DDADAEAVE----ARREELEDRD--------------------------EELRDRLEECR 334

Query: 757 QKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQ 816
                        A   R  +DD EE  RA+   E+ A E+ +E   LE  R     +++
Sbjct: 335 VAA----QAHNEEAESLREDADDLEE--RAEELREE-AAELESE---LEEAREAVEDRRE 384

Query: 817 AVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDI 863
            ++ ++EEI + R   G    D+        N E  LE  + ER ++
Sbjct: 385 EIEELEEEIEELRERFGDAPVDLG-------NAEDFLEELREERDEL 424



 Score = 32.3 bits (74), Expect = 1.8
 Identities = 47/232 (20%), Positives = 93/232 (40%), Gaps = 26/232 (11%)

Query: 637 EELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADA 696
           EE RE  +    E+EL  ++  ++ +E RL             + + E  I+ L  R + 
Sbjct: 471 EEDRE--RVEELEAELEDLEEEVEEVEERLE--------RAEDLVEAEDRIERLEERRED 520

Query: 697 TEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQQER 756
            E  I     ++  + +     +E    +E         +          EAE  +++ R
Sbjct: 521 LEELIAERRETIEEKRERAEELRERAAELE---------AEAEEKREAAAEAEEEAEEAR 571

Query: 757 QKICQ-DKDTKKNVARWER--AVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRL- 812
           +++ + +    +   R E    +      +A A+   E+L  E R    +L + R  RL 
Sbjct: 572 EEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLR-EKREALAELNDERRERLA 630

Query: 813 TKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDIL 864
            K++    ++ E  +AR E     +D + A++    +E KL+  + ER D+ 
Sbjct: 631 EKRERKRELEAEFDEARIE--EAREDKERAEEYLEQVEEKLDELREERDDLQ 680



 Score = 31.9 bits (73), Expect = 2.2
 Identities = 20/112 (17%), Positives = 47/112 (41%), Gaps = 8/112 (7%)

Query: 624 EMGNLKAQKEKLSEELREAMKKSRKES--------ELNTVQSTIKGLEIRLNYSRQDLQN 675
           E+ +L+ + E+L EE  + + ++  +             ++   + L  RL   R   Q 
Sbjct: 280 EVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQA 339

Query: 676 TKSQIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVED 727
              +   L  + D L  RA+    +   +E+ +    + +  ++EE+  +E+
Sbjct: 340 HNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEE 391



 Score = 30.8 bits (70), Expect = 4.6
 Identities = 42/237 (17%), Positives = 98/237 (41%), Gaps = 29/237 (12%)

Query: 622 DKEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIA 681
           + E+ +L+ + E++ E L  A      E  +  ++   + LE  +   R+ ++  + +  
Sbjct: 481 EAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAE 540

Query: 682 KLEAEIDALNARADATEPKI-KAIEASMTARG--DTISRK----KEEMNSVEDIVFRDFC 734
           +L      L A A+       +A E +  AR     ++ K    KE + S+E I      
Sbjct: 541 ELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIRTL--- 597

Query: 735 KSIGVSTIRQYEEAELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLA 794
               ++ I   E+ E+   +E+++   + + ++             E LA  +  + +L 
Sbjct: 598 ----LAAIADAED-EIERLREKREALAELNDER------------RERLAEKRERKRELE 640

Query: 795 GEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLES 851
            E   +  ++E  R  +   ++ ++ ++E++ + R E   +  +I A +     LE 
Sbjct: 641 AEF--DEARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEE 695


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 54.0 bits (130), Expect = 4e-07
 Identities = 84/435 (19%), Positives = 196/435 (45%), Gaps = 41/435 (9%)

Query: 40  LKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEH 99
           L++  E+LK+   + E+       K + + +E +E   EK+E  K    R + + + +E 
Sbjct: 317 LEELLEKLKSLEERLEKLEE----KLEKLESELEELAEEKNELAKLLEERLKELEERLEE 372

Query: 100 QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQE 159
              +L      +K+LE+ + + K E+ ++    E+ +  L E +KE   L REL ++++E
Sbjct: 373 LEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEE 432

Query: 160 IREMDVEINKKRPSLIKSKERVSHI--------------QKKLASAKKSLVEVRQANEAH 205
           I++++ +IN+     +   E                   +K+L    +  +E  +   + 
Sbjct: 433 IKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSR 492

Query: 206 NKDIADLETQLADVRKRKAEYERQSIPG-----RDINLESAQDVEINKKRPSLIKSKERV 260
            K+ A+L  ++ ++ K   E E + I               +  ++      L + KE++
Sbjct: 493 EKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKL 552

Query: 261 S--HIQKKLASAKKSLVEVRQANEAHNK------DIADLETQLADVRKRKAEYERQSIPG 312
               ++++L   +  L E+++  E          ++ +L  +L +++K+  E E + +  
Sbjct: 553 QLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEER-LSQ 611

Query: 313 RDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDN------ELRQQVQ 366
            +  L+S +++E  N   EA +     L++L+     ++  Q  L+       EL  +++
Sbjct: 612 LEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIR 671

Query: 367 TQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQE---L 423
            + +  +   ++EE  + +++LE+ + Q    L++  K   E+   +   ++R  E   L
Sbjct: 672 RELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEEL 731

Query: 424 QKELEQVIEELGDAK 438
           +KELE++ + L   +
Sbjct: 732 KKELEKLEKALELLE 746



 Score = 50.5 bits (121), Expect = 5e-06
 Identities = 73/438 (16%), Positives = 198/438 (45%), Gaps = 26/438 (5%)

Query: 39  ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVE 98
             +++   L+  + + EE+       ++ +    +E +  +   E+ + + E++  K +E
Sbjct: 271 IREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKL--KSLE 328

Query: 99  HQLFKLYHNETDIKELEDELDKKKGEVEK-IERRKEKAENILREKKKEQGALNRELAKVD 157
            +L KL      ++   +EL ++K E+ K +E R ++ E  L E +KE       L +++
Sbjct: 329 ERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLE 388

Query: 158 QEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA 217
           + I+E+  E+ +   +L + +E +  ++K+L   ++ L E+ +  +   + I  LE++  
Sbjct: 389 EAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKEL 448

Query: 218 DVRKRKAEYERQSIPGRDINLESAQDV------------EINKKRPSLIKSKERVSHIQK 265
            + +     E+  + G+++  E  +++            E   +     + +E +  ++K
Sbjct: 449 MIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEK 508

Query: 266 KLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEY 325
           +L   ++ L+E+ +  EA      +LE +L  +     E E        + L+  +  E 
Sbjct: 509 ELRELEEELIELLELEEA---LKEELEEKLEKLENLLEELEELK---EKLQLQQLK--EE 560

Query: 326 TNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRI 385
                +  +   ++L++L  + R +K + ++L   L++  +   E++++  ++EE  + +
Sbjct: 561 LRQLEDRLQELKELLEEL-RLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSL 619

Query: 386 DKLEDH--IRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHE 443
           +  E    + + E  L+   +            +  ++EL++++E++  E+         
Sbjct: 620 ELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIEN 679

Query: 444 DTRRKKKQELVENFKKAY 461
           + + ++K E +E  ++  
Sbjct: 680 EEQLEEKLEELEQLEEEL 697



 Score = 49.0 bits (117), Expect = 1e-05
 Identities = 55/290 (18%), Positives = 136/290 (46%), Gaps = 7/290 (2%)

Query: 178 KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDIN 237
           KE +   + K+   +  L E+ +  E   + + +   +L  + + + E E + +  ++I 
Sbjct: 177 KEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELE-QEIE 235

Query: 238 LESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD 297
               +  E+ +++  L + K R+  I+     A K   E  +  E   +++ +   +L +
Sbjct: 236 ALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEE 295

Query: 298 VRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKL 357
           + +   E E +   G    LE  +      LK+   +R  K+ ++L+ +  E + +  + 
Sbjct: 296 LEREIEELEEELE-GLRALLEELEE-LLEKLKSL-EERLEKLEEKLEKLESELE-ELAEE 351

Query: 358 DNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSK 417
            NEL + ++    +K+    +EE +K ++K  + ++Q E ++++ K+   EL++ +   +
Sbjct: 352 KNELAKLLEE--RLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQ 409

Query: 418 NRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAYSGVYDR 467
             ++EL+KELE++  EL + + +  +   +  + E  E      +G  ++
Sbjct: 410 EELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEK 459



 Score = 40.1 bits (94), Expect = 0.006
 Identities = 69/416 (16%), Positives = 168/416 (40%), Gaps = 50/416 (12%)

Query: 39  ALKDDYERLKAEMIQAEE-----ETNMSYLKKKGVVAERKEAKIE------KDEAEKYQR 87
            L++  E L+ E+ +A E     E  +  LK++         +I+      + E E+ +R
Sbjct: 365 ELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELER 424

Query: 88  IREEI--VAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKE 145
             EE+    K++E Q+ +L   E  I EL    +K    V   E  +E  + +L   + E
Sbjct: 425 ELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCP--VCGQELPEEHEKELLELYELE 482

Query: 146 QGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAH 205
              L  EL++ ++E  E+  EI +    L + +E +  + +   + K+ L    +  E  
Sbjct: 483 LEELEEELSR-EKEEAELREEIEELEKELRELEEELIELLELEEALKEEL---EEKLEKL 538

Query: 206 NKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDV--EINKKRPSLIKSKERVSHI 263
              + +LE     ++ ++ + E + +  R   L+   +    +  ++  L + +ER+  +
Sbjct: 539 ENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKEL 598

Query: 264 QKKLASAKKSLVEVRQANEAHNKDIADLETQ--------------------------LAD 297
           +KKL   ++ L ++ +  ++     A+ E +                          L +
Sbjct: 599 KKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEE 658

Query: 298 VRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKG---DQ 354
           + ++  E E +            Q+ E      +  +   ++ ++L+ + ++        
Sbjct: 659 LEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLI 718

Query: 355 DKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELN 410
           ++L++   +  + + E++K    +E  ++  +KL     + +       +++ E N
Sbjct: 719 EELESRKAELEELKKELEKLEKALELLEELREKLGKAGLRADILRNLLAQIEAEAN 774



 Score = 36.3 bits (84), Expect = 0.12
 Identities = 35/206 (16%), Positives = 84/206 (40%), Gaps = 10/206 (4%)

Query: 41  KDDYERLKAEMIQAEEE--TNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVE 98
             +    + E+ +AEEE  + +  L  +  + E  +A +E+ E +  +   E     +  
Sbjct: 618 SLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRI 677

Query: 99  HQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENI---LREKKKEQGALNRELAK 155
               +L     ++++LE+EL++ + E+E++ ++  + E +   L  +K E   L +EL K
Sbjct: 678 ENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEK 737

Query: 156 VDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQ 215
           +++ +  ++    K   + ++     + I + L +  ++      +  + N+      T 
Sbjct: 738 LEKALELLEELREKLGKAGLR-----ADILRNLLAQIEAEANEILSKLSLNRYDLRRLTI 792

Query: 216 LADVRKRKAEYERQSIPGRDINLESA 241
             D               R I   S 
Sbjct: 793 RKDGNGGLVVVVYDGGEVRPIKTLSG 818



 Score = 30.1 bits (68), Expect = 7.9
 Identities = 36/242 (14%), Positives = 94/242 (38%), Gaps = 10/242 (4%)

Query: 627 NLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAE 686
             KA+ E+L  +L E ++      E    +   +  ++      Q+ +  + +I  LE  
Sbjct: 182 EAKAKIEELEGQLSELLEDIEDLLEALEEE-LKELKKLEEIQEEQEEEELEQEIEALEER 240

Query: 687 IDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYE 746
           +  L    +  E     +    +   + +  ++EE+  +E +             I + E
Sbjct: 241 LAELEEEKERLEELKARLLEIESLELEALKIREEELRELERL------LEELEEKIERLE 294

Query: 747 EAELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLEN 806
           E E   ++  +++   +   + +      +   EE L + +   EKL  E+  E  + +N
Sbjct: 295 ELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESEL-EELAEEKN 353

Query: 807 MRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDILMN 866
             A  L ++     ++E + +  +E+    + ++  +++   L+ +L    +   +I   
Sbjct: 354 ELAKLLEERLK--ELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEE 411

Query: 867 CK 868
            +
Sbjct: 412 LE 413


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 52.0 bits (125), Expect = 1e-06
 Identities = 66/312 (21%), Positives = 127/312 (40%), Gaps = 51/312 (16%)

Query: 13  FKSYKGKFSIGPLKKFTAVIGPNGSGALKDDYERLKAEMIQAEEETNMSYLKKKGVVAER 72
           + +YK K     L+K    +   G   +  D ER+KA +I+               +++ 
Sbjct: 354 YLAYKPK-----LEKVERKLPELG---IWKDVERIKALVIRGY--------PLAEALSKV 397

Query: 73  KEAKIEKDEAEKYQRIREEIVA--KEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIER 130
           KE +  +++    +  R EI    K ++     +   E +  EL+ EL++ K E+EK+E 
Sbjct: 398 KEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLES 457

Query: 131 RKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLAS 190
             E+    +R+K ++           D+EIR  D  I +    L + K+RV  +++KLA 
Sbjct: 458 ELERFRREVRDKVRK-----------DREIRARDRRIERLEKELEEKKKRVEELERKLAE 506

Query: 191 AKK--------SLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQ 242
            +K            V+   +   + I + E +   +++          P        A+
Sbjct: 507 LRKMRKLELSGKGTPVKVVEKLTLEAIEEAEEEYG-IKEGDVILVED--PSG----GGAR 559

Query: 243 DVEIN-KKRPSLIKSKERVSHI------QKKLASAKKSLVEVRQANEAHNKDIADLETQL 295
             E    K+P  I   E +SH       + ++    +  V++ + +E    D  +L   +
Sbjct: 560 TAEELIDKKPRAIIRGEEMSHAAAEEFFKNEIPVLPEGDVQIIRLDEFAVVDSEELRRAI 619

Query: 296 ADVRKRKAEYER 307
            + +KR  E ER
Sbjct: 620 EEWKKRFEERER 631



 Score = 32.0 bits (73), Expect = 1.9
 Identities = 15/87 (17%), Positives = 40/87 (45%), Gaps = 1/87 (1%)

Query: 370 EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQ 429
           ++K++    E+     ++    I   E  +K  ++  E L  +    K  ++EL++E+E+
Sbjct: 396 KVKEEERPREKEGT-EEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEK 454

Query: 430 VIEELGDAKTDKHEDTRRKKKQELVEN 456
           +  EL   + +  +  R+ ++    + 
Sbjct: 455 LESELERFRREVRDKVRKDREIRARDR 481


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 51.1 bits (122), Expect = 3e-06
 Identities = 67/421 (15%), Positives = 171/421 (40%), Gaps = 39/421 (9%)

Query: 70  AERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETD----IKELEDELDKKKGEV 125
             +K + ++KD  +  ++        ++ +Q+ +L   E D    +K +E    K +   
Sbjct: 327 IIKKLSVLQKDYNDYIKKKSRY---DDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYS 383

Query: 126 EKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEI---NKKRPSLIKSKERVS 182
           + IER       IL+ ++ +  A+ +EL +++ +++++  ++   N++  +L ++ + +S
Sbjct: 384 KNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELS 443

Query: 183 HIQKKLASAKKSLV--------EVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGR 234
              + L       V        +       +N+  + LE ++     R+ E E + I  +
Sbjct: 444 RNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKI-----REIEIEVKDIDEK 498

Query: 235 DINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLET- 293
            ++L+  ++   +++    I    ++   +  L   K  + E++  ++ + +     ++ 
Sbjct: 499 IVDLKKRKEYLESEEINKSINEYNKIESARADLEDIKIKINELKDKHDKYEEIKNRYKSL 558

Query: 294 QLADVRKRKAEYER--QSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQK 351
           +L D+  ++  +      I   DI    ++  E      +   R  +I            
Sbjct: 559 KLEDLDSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYID 618

Query: 352 GDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
               +++NE        NEI++ +  +E+ + +ID  +  I + ++ + D K++   +N 
Sbjct: 619 KSIREIENEANNLNNKYNEIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRIND 678

Query: 412 -------------DVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFK 458
                        D  +++ R++   + L   I EL D   D +E     KK +      
Sbjct: 679 IEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETLESMKKIKKAIGDL 738

Query: 459 K 459
           K
Sbjct: 739 K 739



 Score = 34.1 bits (78), Expect = 0.45
 Identities = 22/87 (25%), Positives = 39/87 (44%), Gaps = 4/87 (4%)

Query: 623 KEMGNLKAQKEKLSEEL----REAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKS 678
            E+   K   EKL  ++    ++  +      +L  + S I  +E  L  SR+ L + K+
Sbjct: 636 NEIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLKKSRKALDDAKA 695

Query: 679 QIAKLEAEIDALNARADATEPKIKAIE 705
             A+LE+ I+ L  R +    +I  I 
Sbjct: 696 NRARLESTIEILRTRINELSDRINDIN 722



 Score = 33.3 bits (76), Expect = 0.87
 Identities = 16/94 (17%), Positives = 40/94 (42%)

Query: 630 AQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDA 689
           AQ++K+ +E+ E     R   +L  V   ++     ++Y  + L+++  ++  ++ +I  
Sbjct: 149 AQRKKILDEILEINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIAD 208

Query: 690 LNARADATEPKIKAIEASMTARGDTISRKKEEMN 723
                  T  +I+ +        D  +  K  +N
Sbjct: 209 DEKSHSITLKEIERLSIEYNNAMDDYNNLKSALN 242



 Score = 31.4 bits (71), Expect = 2.9
 Identities = 52/334 (15%), Positives = 120/334 (35%), Gaps = 43/334 (12%)

Query: 70  AERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIE 129
           A+RK+   E  E    +R  +    K+V   +  L    ++I  LE++L     E+E I+
Sbjct: 149 AQRKKILDEILEINSLERNYD--KLKDV---IDMLRAEISNIDYLEEKLKSSNLELENIK 203

Query: 130 RRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLA 189
           ++    E       K              EI  + +E N         K  ++ +     
Sbjct: 204 KQIADDEKSHSITLK--------------EIERLSIEYNNAMDDYNNLKSALNELSSLED 249

Query: 190 SAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKK 249
              +           +  +I   E+ L+   ++   Y  + +  R + + +    +    
Sbjct: 250 MKNR-----------YESEIKTAESDLSMELEKNNYY--KELEERHMKIINDPVYKNRNY 296

Query: 250 RPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQS 309
                K K  + + ++ L++    +        A  K ++ L+    D  K+K+ Y+   
Sbjct: 297 INDYFKYKNDIENKKQILSNIDAEI----NKYHAIIKKLSVLQKDYNDYIKKKSRYD--- 349

Query: 310 IPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQN 369
               D+N +  ++  Y        K    + ++++  ++  +     +   L+ Q    +
Sbjct: 350 ----DLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEILKIQEIDPD 405

Query: 370 EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNK 403
            IKK+ +E+    + I      + Q   +L++N 
Sbjct: 406 AIKKELNEINVKLQDISSKVSSLNQRIRALRENL 439


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
            functions are known are involvedin recombination,
            recombinational repair, and/or non-homologous end
            joining.They are components of an exonuclease complex
            with MRE11 homologs. This family is distantly related to
            the SbcC family of bacterial proteins.This family is
            based on the phylogenomic analysis of JA Eisen (1999,
            Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 47.3 bits (112), Expect = 4e-05
 Identities = 64/372 (17%), Positives = 156/372 (41%), Gaps = 15/372 (4%)

Query: 109  TDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREM--DVE 166
              +K  E EL KK+   +++       ++I+  K+KE   L  +L KV+++I+ +  D+E
Sbjct: 709  DKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIE 768

Query: 167  INKKRPSLIKSKERVSHIQKKLASAKKSL------VEVRQANEAHNKDIADLETQLADVR 220
              +     I  +E  + +     +  +        VE + A +A     +DL+  +  V 
Sbjct: 769  EQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVN 828

Query: 221  KRKAEY-ERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQ 279
            + K E           I L      +  ++   L      +   + ++ +  +   +  +
Sbjct: 829  QEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEE 888

Query: 280  ANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKI 339
                 + ++  L  ++ D +++ +  E      +    E     E +N KA+   +   I
Sbjct: 889  QLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQ--DKVNDI 946

Query: 340  LQQLDTINREQKGDQDKL-DNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEAS 398
             +++  I+   K  ++K+ D +     Q + E+     ++EE +K  +K+ + +R     
Sbjct: 947  KEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQD 1006

Query: 399  LKDNKKLKEELNSDVGSSK--NRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVEN 456
            +   K  +  L  ++   K  N ++E+++EL+Q ++E+G  +  + +   +K ++ + + 
Sbjct: 1007 IDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENI-DL 1065

Query: 457  FKKAYSGVYDRM 468
             K+ +     R 
Sbjct: 1066 IKRNHVLALGRQ 1077



 Score = 42.0 bits (98), Expect = 0.002
 Identities = 73/394 (18%), Positives = 155/394 (39%), Gaps = 39/394 (9%)

Query: 83  EKYQRIREEIVAKEVEHQLFKLYHNETDIK--ELEDELDKKKGEVE---KIERRKEKAEN 137
           E  +++R+    K  EHQ+   Y  +   K  E+ D++  K+ ++E   +I +  E   +
Sbjct: 189 ETLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELD 248

Query: 138 ILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVE 197
            L+ + KE      ++ K+D EI+ +      ++  + K    +    +K+        +
Sbjct: 249 PLKNRLKEIEHNLSKIMKLDNEIKALK----SRKKQMEKDNSELELKMEKVFQGTDE--Q 302

Query: 198 VRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSK 257
           +      H + + + E +L D ++   E E+ +   R +N E  + + + + R  L   +
Sbjct: 303 LNDLYHNHQRTVREKERELVDCQR---ELEKLNKERRLLNQEKTE-LLVEQGRLQL---Q 355

Query: 258 ERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINL 317
                   +   +    +  R   +   +     E Q+ +      E  RQ    +    
Sbjct: 356 ADRHQEHIRARDSLIQSLATRLELDGFERG-PFSERQIKNFHTLVIE--RQEDEAKTAAQ 412

Query: 318 ESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHE 377
             A +     LK E          Q D I  E+KG    ++ +     + Q E+K    E
Sbjct: 413 LCADLQSKERLKQE----------QADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKE 462

Query: 378 MEEAQ---KRIDKLEDHIRQNEA--SLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIE 432
           +++ +    RI +L+  +R+ E   S  +   L E L  +V S +N   +L ++L ++ +
Sbjct: 463 LQQLEGSSDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQ 522

Query: 433 ELGDAKTDKHEDTRRKKKQELVENFKKAYSGVYD 466
           E+       H  T R + + L ++       +  
Sbjct: 523 EMEQL---NHHTTTRTQMEMLTKDKMDKDEQIRK 553



 Score = 35.8 bits (82), Expect = 0.15
 Identities = 50/303 (16%), Positives = 113/303 (37%), Gaps = 21/303 (6%)

Query: 637 EELREAMKKSRKESELNTV-----QSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALN 691
           +     +KK  K  +         QS I   E  +   R  LQ     I +L+ +I+   
Sbjct: 712 KSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQE 771

Query: 692 ARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELR 751
                  P+ ++ +  +T     + R + E+  VE  + +   K  G    R  ++    
Sbjct: 772 TLLGTIMPEEESAKVCLTDVT-IMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVNQE 830

Query: 752 SQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATR 811
            Q+++ ++                V    E   +    +++    ++++ ++L++ +   
Sbjct: 831 KQEKQHEL--------------DTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQI 876

Query: 812 LTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDILMNCKMND 871
            T  Q     +E++ +   EV S+ ++I+ A++    LE+ LE K  +  + L++ K   
Sbjct: 877 GTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLE-KDQQEKEELISSKETS 935

Query: 872 IVLPMLRVQKYDRKLAKSIQEMTSRLQTIQAPNLRAMEKLEHAKENLMKTNEEFENARKR 931
                 +V     K+      M      IQ      +++ E     +    EE E  +++
Sbjct: 936 NKKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEK 995

Query: 932 AKK 934
             +
Sbjct: 996 INE 998



 Score = 33.1 bits (75), Expect = 0.94
 Identities = 74/450 (16%), Positives = 172/450 (38%), Gaps = 62/450 (13%)

Query: 49  AEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNE 108
            E+ +AE+ +    LKK+    + ++A +++   +  Q + +         Q+  L  ++
Sbjct: 485 RELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDK 544

Query: 109 TDIKEL--------EDELDKKKG------EVEKIERRKEKAENILREKKKEQGALNRELA 154
            D  E          DEL    G      ++E     K K  N  R++  +   LN+ELA
Sbjct: 545 MDKDEQIRKIKSRHSDELTSLLGYFPNKKQLEDWLHSKSKEINQTRDRLAK---LNKELA 601

Query: 155 KVDQEIREMDVEINKKRPSLIKSKERV-------------SHIQKKLASAKKSLVEVRQA 201
            ++Q    ++ E+  K   L   ++++               +++++  + K    +  A
Sbjct: 602 SLEQNKNHINNELESKEEQLSSYEDKLFDVCGSQDEESDLERLKEEIEKSSKQRAMLAGA 661

Query: 202 NEAHNKDIADL-----------------ETQLADVRKRKAEYERQSIPGRDINLESAQDV 244
              +++ I  L                 E +L +      + + +  P +  + ES    
Sbjct: 662 TAVYSQFITQLTDENQSCCPVCQRVFQTEAELQEFIS-DLQSKLRLAPDKLKSTESELKK 720

Query: 245 EINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 304
           +  ++   L  +  R S I  K    +K + E+R   +  N+DI  L+  + +       
Sbjct: 721 KEKRRDEMLGLAPGRQSIIDLK----EKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGT 776

Query: 305 YERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQ 364
              +    +    +   M  +     +  ++  +   +L         D D+   ++ Q+
Sbjct: 777 IMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGS------DLDRTVQQVNQE 830

Query: 365 VQ-TQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDV---GSSKNRV 420
            Q  Q+E+     ++E  +K I   ++ I+  ++   + K  K ++ +++      + ++
Sbjct: 831 KQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQL 890

Query: 421 QELQKELEQVIEELGDAKTDKHEDTRRKKK 450
            EL  E++ +I E+ DAK          +K
Sbjct: 891 VELSTEVQSLIREIKDAKEQDSPLETFLEK 920


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 46.6 bits (111), Expect = 5e-05
 Identities = 48/293 (16%), Positives = 99/293 (33%), Gaps = 74/293 (25%)

Query: 172 PSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSI 231
            S     +++  IQK++A+ +K + E +       K +  LET++A +            
Sbjct: 31  FSAAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEA---------- 80

Query: 232 PGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADL 291
                                LI++ + +  ++K++A                N  +  L
Sbjct: 81  --------------------QLIETADDLKKLRKQIADL--------------NARLNAL 106

Query: 292 ETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTN-----LKAEATKRAGKILQQLDTI 346
           E Q  + R+R AE            L + Q +         +  E  +R+ ++      +
Sbjct: 107 EVQEREQRRRLAEQ-----------LAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGAL 155

Query: 347 NREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLK----DN 402
           N  +          +     T  ++   R E+   Q  +  L    R  +A L     + 
Sbjct: 156 NPARAE-------RIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEER 208

Query: 403 KKLKEELNSDVGSSKNRVQELQK---ELEQVIEELGDAKTDKHEDTRRKKKQE 452
           KK   +LNS++ + + +++EL+     L+  I     A     E     +   
Sbjct: 209 KKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAA 261



 Score = 45.5 bits (108), Expect = 1e-04
 Identities = 36/220 (16%), Positives = 85/220 (38%), Gaps = 20/220 (9%)

Query: 107 NETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
            + +I  LE ++ +++ +  K+E++ +  E  +   + +      +L K+ ++I +++  
Sbjct: 43  IQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNAR 102

Query: 167 INKKRPSLIKSKERVSHI------------------QKKLASAKKSLVEVRQANEAHNKD 208
           +N       + + R++                     +    + +  +     N A  + 
Sbjct: 103 LNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAER 162

Query: 209 IADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLA 268
           I  L+  L  +   +AE   +        L S Q  +  K    L + K+ ++ +  +L+
Sbjct: 163 IDALKATLKQLAAVRAEIAAEQAELT--TLLSEQRAQQAKLAQLLEERKKTLAQLNSELS 220

Query: 269 SAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 308
           + +K L E+R        +IA  E   A  R+  A  E  
Sbjct: 221 ADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAA 260



 Score = 40.9 bits (96), Expect = 0.003
 Identities = 48/288 (16%), Positives = 99/288 (34%), Gaps = 64/288 (22%)

Query: 110 DIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINK 169
                + +L + + E+  +E++       +RE++ ++  L ++L  ++ EI         
Sbjct: 32  SAAADDKQLKQIQKEIAALEKK-------IREQQDQRAKLEKQLKSLETEIAS------- 77

Query: 170 KRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 229
                         ++ +L      L ++R+     N  +  LE Q  + R+R AE    
Sbjct: 78  --------------LEAQLIETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQ--- 120

Query: 230 SIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIA 289
                   L + Q    N   P        VS   +    + +  +     N A  + I 
Sbjct: 121 --------LAALQRSGRN---PPP---ALLVS--PEDAQRSVRLAIYYGALNPARAERID 164

Query: 290 DLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINRE 349
            L+  L  +   +AE   +               E T L +E   +  K+ Q L+    E
Sbjct: 165 ALKATLKQLAAVRAEIAAEQ-------------AELTTLLSEQRAQQAKLAQLLE----E 207

Query: 350 QKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEA 397
           +K    +L++EL    +   E++     ++      +      R+  A
Sbjct: 208 RKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAA 255



 Score = 39.7 bits (93), Expect = 0.008
 Identities = 48/301 (15%), Positives = 92/301 (30%), Gaps = 63/301 (20%)

Query: 649 ESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADATEPKIKAIEASM 708
           + +L  +Q  I  LE ++   +      + Q+  LE EI +L A+   T   +K +   +
Sbjct: 37  DKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQI 96

Query: 709 TARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQQER--QKICQDKDTK 766
                 ++                              E + R Q+ R  +++   + + 
Sbjct: 97  ADLNARLNA----------------------------LEVQEREQRRRLAEQLAALQRSG 128

Query: 767 KNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIG 826
           +N         +D +   R       L        D L+         +  + A   E+ 
Sbjct: 129 RNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELT 188

Query: 827 KARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDILMNCKMNDIVLPMLRVQKYDRKL 886
               E        Q AQ++      KL     ER                   +K   +L
Sbjct: 189 TLLSE--------QRAQQA------KLAQLLEER-------------------KKTLAQL 215

Query: 887 AKSIQEMTSRLQTIQAPNLRAMEKLEHAKENLMKTNEEFENARKRAKKAKANFDRIKKER 946
              +     +L+ ++A   R   ++  A+    K  E    A   A +A+A   +   E 
Sbjct: 216 NSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAAARARAAEAKRTGET 275

Query: 947 Y 947
           Y
Sbjct: 276 Y 276



 Score = 35.9 bits (83), Expect = 0.12
 Identities = 29/244 (11%), Positives = 87/244 (35%), Gaps = 25/244 (10%)

Query: 609 GSLDLARKAKRWDDKEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNY 668
            ++  A  +   DDK++  ++ +   L +++RE   +  K      ++  +K LE     
Sbjct: 24  AAVLAAAFSAAADDKQLKQIQKEIAALEKKIREQQDQRAK------LEKQLKSLE----- 72

Query: 669 SRQDLQNTKSQIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDI 728
              ++ + ++Q+ +   ++  L  +      ++ A+E     +   ++            
Sbjct: 73  --TEIASLEAQLIETADDLKKLRKQIADLNARLNALEVQEREQRRRLAE-------QLAA 123

Query: 729 VFRDFCKSIGVSTIRQYE-EAELRSQQERQKICQ-DKDTKKNVARWERAVSDDEEELARA 786
           + R          +   + +  +R       +     +    +    + ++    E+A  
Sbjct: 124 LQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAE 183

Query: 787 QGAEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSC 846
           Q     L  E RA+  KL         +K+ +  ++ E+   ++++  +  +    +   
Sbjct: 184 QAELTTLLSEQRAQQAKLA---QLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEI 240

Query: 847 VNLE 850
            + E
Sbjct: 241 ASAE 244


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 46.7 bits (111), Expect = 8e-05
 Identities = 75/367 (20%), Positives = 142/367 (38%), Gaps = 63/367 (17%)

Query: 42  DDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQL 101
             ++ L   + +AEEE  ++ L+K     E ++ + EK   E+ +R+   +  +      
Sbjct: 174 RTWKDLVKALDEAEEE--LANLRK-----ELRQLEKEKQRLERLRRLLPLLAER------ 220

Query: 102 FKLYHNETDIKELEDELDKKKGEV----EKIERRKEKAENILREKKKEQGALNRELAKVD 157
                     K LE +L    GEV         R E+A   LR  ++    L   L  + 
Sbjct: 221 ----------KALEQQLAAL-GEVIDLPPDAVERYEEARAELRAARRNLELLTERLEALQ 269

Query: 158 QEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA 217
            E+ E+ ++           +E ++      A+A ++L + R       +D+ D E ++A
Sbjct: 270 AELDEISLD-----------EELLAQ-----AAAIEALHQQRGEYRNAEQDLPDREGEIA 313

Query: 218 DVRKRKAEYERQSIPGRDI-NLESAQDVEINKKRPS-LIKSKERVSHIQKKLASAKKSLV 275
           + R+  A    Q  P  D   +ES +     K+  + L K KE    + + L SA+ +L 
Sbjct: 314 NAREAAAALLAQIGPDADEEAVESLRPSLAAKETVTELEKRKEA---LDQALKSARDALE 370

Query: 276 EVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKR 335
           E  +  +     +A L T       R A  +      R  ++++A         A A + 
Sbjct: 371 ERERELKQVRAQLAALPTVTVPEPLRAALADAL----RAGDIDAALAALEQE-VAVAKRE 425

Query: 336 AGKILQQLDTINREQKGDQDKLDNELRQQVQT--------QNEIKKKRHEMEEAQKRIDK 387
             + L +L     + +     L     +QVQ             ++KR  + E ++ ++ 
Sbjct: 426 LAQALSRLGLWRGDLEELV-ALPVPSAEQVQAFLKEAEEIMQAKRQKRDRLLELEEDLET 484

Query: 388 LEDHIRQ 394
           LE  +R 
Sbjct: 485 LELQLRH 491



 Score = 35.1 bits (81), Expect = 0.24
 Identities = 56/338 (16%), Positives = 118/338 (34%), Gaps = 26/338 (7%)

Query: 72  RKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERR 131
           R+   + +  A  Y+   E   A ++  +L +      +++ L  + +  +  +E++E+ 
Sbjct: 519 RRGYALLEPTASAYEEAVES--ADQLADRLLREAQLVGELQSLRQQEEAARRRLEQLEKE 576

Query: 132 KEKAENILREKKKEQGALNRELA--KVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLA 189
            E  E  L   ++   A            E+ +   E    R  +    +  + +   L 
Sbjct: 577 LEVLELALAALREAWQAQWAAAGLPLTPAEMEDWLAERATAREQVRAYFKARAELDALLD 636

Query: 190 SAKKSLVEVRQA--NEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEIN 247
              +    +R A    A      +L   L   R+   E E+Q+   +    E  +D E  
Sbjct: 637 RRARLRAALRAALKAVAIVLPGEELAELLELARQLLEEAEKQA-ARKASLDERLRDAERA 695

Query: 248 KKRPSLIKSKERVSHIQKKLASA----KKSLVEVRQANEAHNKDIADLETQLADVRKRKA 303
                L +++ER    Q  L +       +L+E      A      D    L +++++  
Sbjct: 696 -----LEEAEERHDEAQSALEAWQEQWYDALLEAGLGGRASPAGALDALELLQNIKEKLQ 750

Query: 304 EYERQSIPGRDINLESAQMT-EYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELR 362
                     D+    A M  +    + E    A  +  ++     ++     K    L+
Sbjct: 751 AA-------DDLRQRIAAMERDLARFEEEVEALAEAVAPEMLGTPADETARALKQR--LK 801

Query: 363 QQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLK 400
           +   T    +K   E+EEA+K + +    + + EA L 
Sbjct: 802 RARDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEARLT 839



 Score = 34.7 bits (80), Expect = 0.28
 Identities = 75/390 (19%), Positives = 147/390 (37%), Gaps = 41/390 (10%)

Query: 86  QRIREEIVAK--EVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKK 143
            R  EEI+    E+   LF        +  + DEL+K+  E+ K            R +K
Sbjct: 101 VRGGEEILEAQGELGQYLFSAAAGVGSLGSVRDELEKEADELWK-----------PRGRK 149

Query: 144 KEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANE 203
            E     +EL +++ EIRE+ ++      +     + +   +++LA+ +K      +  E
Sbjct: 150 PEINVALKELKELEAEIREVQLKTR----TWKDLVKALDEAEEELANLRKE----LRQLE 201

Query: 204 AHNKDIADLETQLADVRKRKAEYERQSIPGRDINL-ESAQDV------EINKKRPSLIKS 256
              + +  L   L  + +RKA  ++ +  G  I+L   A +       E+   R +L   
Sbjct: 202 KEKQRLERLRRLLPLLAERKALEQQLAALGEVIDLPPDAVERYEEARAELRAARRNLELL 261

Query: 257 KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDIN 316
            ER+  +Q +L       + + +   A    I  L  Q      R AE +     G   N
Sbjct: 262 TERLEALQAELDE-----ISLDEELLAQAAAIEALHQQR--GEYRNAEQDLPDREGEIAN 314

Query: 317 LESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQ-VQTQNEIKKKR 375
              A       +  +A + A     +     +E   + +K    L Q     ++ ++++ 
Sbjct: 315 AREAAAALLAQIGPDADEEA-VESLRPSLAAKETVTELEKRKEALDQALKSARDALEERE 373

Query: 376 HEMEEAQKRIDKLEDHI--RQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEE 433
            E+++ + ++  L          A+L D  +  +   +   + +  V   ++EL Q +  
Sbjct: 374 RELKQVRAQLAALPTVTVPEPLRAALADALRAGDIDAALA-ALEQEVAVAKRELAQALSR 432

Query: 434 LGDAKTDKHED-TRRKKKQELVENFKKAYS 462
           LG  + D  E         E V+ F K   
Sbjct: 433 LGLWRGDLEELVALPVPSAEQVQAFLKEAE 462



 Score = 32.0 bits (73), Expect = 2.2
 Identities = 34/280 (12%), Positives = 87/280 (31%), Gaps = 26/280 (9%)

Query: 35  NGSGALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVA 94
             +         L   +  AE     +  +     +  +  + +  +A     +      
Sbjct: 673 EEAEKQAARKASLDERLRDAERALEEAEERHDEAQSALEAWQEQWYDALLEAGLGGRASP 732

Query: 95  KEVEHQLFKLYHNETDIKELEDELDKKKGEVE-KIERRKEKAENILRE-----KKKEQGA 148
                   +L  N  +  +  D+L ++   +E  + R +E+ E +               
Sbjct: 733 AGA-LDALELLQNIKEKLQAADDLRQRIAAMERDLARFEEEVEALAEAVAPEMLGTPADE 791

Query: 149 LNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKD 208
             R L +  +  R+      K    + ++++ VS     L  A+  L  + +A      +
Sbjct: 792 TARALKQRLKRARDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEARLTALLRAARCTTIE 851

Query: 209 IADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLA 268
                 + +D  +   +                    I     +L+++   +S       
Sbjct: 852 ELLAAVERSDTYRELRK-------------------RIAALERTLVRAGGGLSLEALVAE 892

Query: 269 SAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 308
           +A     E+    E   +DI +LE +L ++ +     +++
Sbjct: 893 AAALDPDELPARLEELARDIEELEEELNELAQEVGAAKQE 932



 Score = 30.9 bits (70), Expect = 4.3
 Identities = 73/470 (15%), Positives = 161/470 (34%), Gaps = 53/470 (11%)

Query: 42  DDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIRE-EIVAKEVEHQ 100
           + YE  +AE+  A     +   + + + AE  E  ++++   +   I        E  + 
Sbjct: 242 ERYEEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRNA 301

Query: 101 LFKLYHNETDIKELEDELD------KKKGEVEKIERRK--EKAENILREKKKEQGALNRE 152
              L   E +I    +             + E +E  +    A+  + E +K + AL++ 
Sbjct: 302 EQDLPDREGEIANAREAAAALLAQIGPDADEEAVESLRPSLAAKETVTELEKRKEALDQA 361

Query: 153 LAKVDQEIREMDVEINKKRPSL--IKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIA 210
           L      + E + E+ + R  L  + +      ++  LA A +   ++  A  A  +++A
Sbjct: 362 LKSARDALEERERELKQVRAQLAALPTVTVPEPLRAALADALR-AGDIDAALAALEQEVA 420

Query: 211 DLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASA 270
             + +LA    R   +         + + SA+ V+   K    I   +R    + +L   
Sbjct: 421 VAKRELAQALSRLGLWRGDLEELVALPVPSAEQVQAFLKEAEEIMQAKR--QKRDRLLEL 478

Query: 271 ----KKSLVEVRQANEAH----NKDIADLET---QLADVRKRKAEYERQSIPGRDINLES 319
               +   +++R  + A      +++A        L    +R       +    +  +ES
Sbjct: 479 EEDLETLELQLRHLDAAGAVPTEEEVAAARARRDALWQDIRRGYALLEPTASAYEEAVES 538

Query: 320 A-QMTEYTNLKAEATKRAGKILQQLDTINR--EQ-KGDQDKLDNELR------QQVQTQN 369
           A Q+ +    +A+       + QQ +   R  EQ + + + L+  L       Q      
Sbjct: 539 ADQLADRLLREAQLVGELQSLRQQEEAARRRLEQLEKELEVLELALAALREAWQAQWAAA 598

Query: 370 EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEE------------LNSDVGSSK 417
            +     EME+         + +R    +  +   L +                 +    
Sbjct: 599 GLPLTPAEMEDWLAERATAREQVRAYFKARAELDALLDRRARLRAALRAALKAVAIVLPG 658

Query: 418 NRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAYSGVYDR 467
             + EL +   Q++EE            R+    E + + ++A     +R
Sbjct: 659 EELAELLELARQLLEEAEKQA------ARKASLDERLRDAERALEEAEER 702


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 46.2 bits (110), Expect = 9e-05
 Identities = 67/376 (17%), Positives = 134/376 (35%), Gaps = 76/376 (20%)

Query: 106 HNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDV 165
             + ++K+LE+++++ + E+EK E   EKA+N   +K     A  +   K D  + E   
Sbjct: 99  ELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK-----AWKKLAKKYDSNLSEALK 153

Query: 166 EINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 225
            +N K+    K  + +  +    AS+  SL E++   +            L        E
Sbjct: 154 GLNYKKNFKEKLLKELKSVILN-ASSLLSLEELKAKIKTLFSSNKPELALLTLSVIDFDE 212

Query: 226 YERQSIPGRDINLESAQDVEINKKRPSLIK------------------------------ 255
            E+  I  + I   S   +                                         
Sbjct: 213 IEQAEILEKSIIGSSDVPISELINNLGNSDWVKEGLEYHEEGDTCPFCQQTITEERKAEL 272

Query: 256 ----------SKERVSHIQKKLASAKKSLVE-------VRQANEAHNKDIADLETQLADV 298
                       E++  +  K  S  +  +E         + N     D+ +L+  L  +
Sbjct: 273 EAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEAL 332

Query: 299 RK-----RKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGD 353
            +      +   E+   P   I LES             T     I   +D IN   +  
Sbjct: 333 EEILEKNLQKLEEKLKDPSTSIELES------------ITDLIESINDIIDAINELIREH 380

Query: 354 QDKLDNELRQQVQTQNEIKKK--RHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
            +K+DN  ++    +N+ KKK   H + E ++ ID  +   +  E ++   +K  ++L +
Sbjct: 381 NEKIDNLKKE----KNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEA 436

Query: 412 DVGSSKNRVQELQKEL 427
           ++ + +  ++EL+K+L
Sbjct: 437 EIKALEKEIKELEKQL 452



 Score = 45.8 bits (109), Expect = 1e-04
 Identities = 33/199 (16%), Positives = 82/199 (41%), Gaps = 28/199 (14%)

Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEI 167
           +  I++LE+ +DK +  +EK     E+ E+IL  +K+       ++ ++   +  ++  +
Sbjct: 281 QELIEQLEELIDKYESHIEKA---LEELESILDTEKENS-EFKLDVEELKALLEALEEIL 336

Query: 168 NKKRPSLIKSKERVSH------IQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRK 221
            K    L +  +  S       I   + S    +  + +    HN+ I +L+ +    +K
Sbjct: 337 EKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAKK 396

Query: 222 RKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQAN 281
           +   +    +     ++++ Q                    ++K + S +K + ++    
Sbjct: 397 KLWLHLVAEL---KEDIDAYQKE---------------KKGLEKAINSLEKEIKQLEAEI 438

Query: 282 EAHNKDIADLETQLADVRK 300
           +A  K+I +LE QL ++  
Sbjct: 439 KALEKEIKELEKQLTNIEP 457



 Score = 37.3 bits (87), Expect = 0.049
 Identities = 35/186 (18%), Positives = 74/186 (39%), Gaps = 20/186 (10%)

Query: 39  ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDE-AEKYQRIREEIVAKEV 97
            L D YE    + ++  E    +  +      + +E K   +   E  ++  +++  K  
Sbjct: 289 ELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQKLEEKLK 348

Query: 98  E-HQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNR-ELAK 155
           +     +L      I+ + D +D      E I    EK +N+ +EK K +  L    +A+
Sbjct: 349 DPSTSIELESITDLIESINDIIDA---INELIREHNEKIDNLKKEKNKAKKKLWLHLVAE 405

Query: 156 VDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQ 215
           + ++I     E                 ++K + S +K + ++    +A  K+I +LE Q
Sbjct: 406 LKEDIDAYQKEKKG--------------LEKAINSLEKEIKQLEAEIKALEKEIKELEKQ 451

Query: 216 LADVRK 221
           L ++  
Sbjct: 452 LTNIEP 457



 Score = 34.6 bits (80), Expect = 0.31
 Identities = 18/96 (18%), Positives = 41/96 (42%), Gaps = 12/96 (12%)

Query: 75  AKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEK 134
            KI+  + EK  + ++++        L  +   + DI   + E    +  +  +E+  ++
Sbjct: 382 EKIDNLKKEK-NKAKKKLW-------LHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQ 433

Query: 135 AENILREKKKEQGALNRELAKVDQEIREMDVEINKK 170
            E  ++  +KE   L ++L  ++        EINK 
Sbjct: 434 LEAEIKALEKEIKELEKQLTNIEPTAD----EINKL 465



 Score = 31.1 bits (71), Expect = 3.5
 Identities = 23/90 (25%), Positives = 43/90 (47%), Gaps = 6/90 (6%)

Query: 106 HNETDIKELEDELDKKKGEVEKIERRKEKAE--NILREKKKEQGALN---RELAKVDQEI 160
           HNE  I  L+ E +K K ++      + K +     +EKK  + A+N   +E+ +++ EI
Sbjct: 380 HNEK-IDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEI 438

Query: 161 REMDVEINKKRPSLIKSKERVSHIQKKLAS 190
           + ++ EI +    L   +     I K L +
Sbjct: 439 KALEKEIKELEKQLTNIEPTADEINKLLKA 468


>gnl|CDD|227318 COG4985, COG4985, ABC-type phosphate transport system, auxiliary
           component [Inorganic ion transport and metabolism].
          Length = 289

 Score = 43.8 bits (103), Expect = 3e-04
 Identities = 28/110 (25%), Positives = 52/110 (47%), Gaps = 7/110 (6%)

Query: 337 GKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKR-IDKLEDHIRQN 395
           GK+L+        Q+ D D L+ EL++++     ++ +  +M E Q R I+   + +R  
Sbjct: 145 GKLLRFDSN---GQELDGDPLERELQKRLLEVETLRDQVDKMVEQQVRVINSQLERLRLE 201

Query: 396 EASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDT 445
           +  L+ N +L +E      + K+   ELQK L Q+  EL   + +     
Sbjct: 202 KRRLQLNGQLDDEFQQHYVAEKS---ELQKRLAQLQTELDALRAELERQF 248


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 44.4 bits (106), Expect = 3e-04
 Identities = 28/119 (23%), Positives = 58/119 (48%), Gaps = 4/119 (3%)

Query: 48  KAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHN 107
           +A+ I  E +     +KK+ ++  ++E    ++E EK  R R      E++    +L   
Sbjct: 39  EAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERR----NELQKLEKRLLQK 94

Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
           E ++    + L+K++ E+EK E+  E+ +  L +K++E   L  E  +  + I  +  E
Sbjct: 95  EENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAE 153



 Score = 40.5 bits (96), Expect = 0.005
 Identities = 30/126 (23%), Positives = 63/126 (50%), Gaps = 18/126 (14%)

Query: 64  KKKGVVAERKEAKIEKDEAEKYQRIREEIV--AKEVEHQLFKLYHNETDIKELEDELDKK 121
           + K   AE +  +I ++  ++ + I++E +  AKE  H+L           E E EL ++
Sbjct: 30  EAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKL---------RNEFEKELRER 80

Query: 122 KGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERV 181
           + E++K+E+R  + E  L  K          L K ++E+ + + E+ +K+  L K +E +
Sbjct: 81  RNELQKLEKRLLQKEENLDRKL-------ELLEKREEELEKKEKELEQKQQELEKKEEEL 133

Query: 182 SHIQKK 187
             + ++
Sbjct: 134 EELIEE 139



 Score = 39.0 bits (92), Expect = 0.014
 Identities = 26/107 (24%), Positives = 52/107 (48%), Gaps = 13/107 (12%)

Query: 365 VQTQNEIKKKRHEME----EAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRV 420
           ++ + EI K R+E E    E +  + KLE  + Q E +L    +L E+   ++   +  +
Sbjct: 60  LEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKEL 119

Query: 421 QELQKELEQVIEELGDAKTDKHED--------TRRKKKQELVENFKK 459
           ++ Q+ELE+  EEL +   ++           T  + K+ L+E  ++
Sbjct: 120 EQKQQELEKKEEEL-EELIEEQLQELERISGLTAEEAKEILLEKVEE 165



 Score = 37.5 bits (88), Expect = 0.037
 Identities = 23/108 (21%), Positives = 64/108 (59%), Gaps = 6/108 (5%)

Query: 330 AEATKRAGKILQQ----LDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRI 385
            EA + A +IL++     + I +E   +  +  ++LR + +   E++++R+E+++ +KR+
Sbjct: 34  KEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFE--KELRERRNELQKLEKRL 91

Query: 386 DKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEE 433
            + E+++ +    L+  ++  E+   ++   +  +++ ++ELE++IEE
Sbjct: 92  LQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139



 Score = 32.8 bits (76), Expect = 1.1
 Identities = 25/126 (19%), Positives = 55/126 (43%), Gaps = 6/126 (4%)

Query: 112 KELEDELDKKKGEV-EKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKK 170
           KE E    +   E  E+I + + + E  LRE++ E   L + L + ++ +      + K+
Sbjct: 49  KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108

Query: 171 RPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLET-----QLADVRKRKAE 225
              L K ++ +   Q++L   ++ L E+ +      + I+ L        L +  + +A 
Sbjct: 109 EEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEAR 168

Query: 226 YERQSI 231
           +E   +
Sbjct: 169 HEAAVL 174



 Score = 29.7 bits (68), Expect = 9.1
 Identities = 48/235 (20%), Positives = 92/235 (39%), Gaps = 76/235 (32%)

Query: 120 KKKGEVEKIERRKEKAENILREKKKEQGALNREL------------AKVDQEIREMDVEI 167
           KK  E  KI+  +E+A+ IL E KKE  A+ +E              + ++E+RE   E+
Sbjct: 26  KKIAE-AKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNEL 84

Query: 168 NKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYE 227
            K    L++ +E +    + L   ++ L +  +          +LE +  ++ K++ E E
Sbjct: 85  QKLEKRLLQKEENLDRKLELLEKREEELEKKEK----------ELEQKQQELEKKEEELE 134

Query: 228 RQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKD 287
                               +     ++  ER+S +      AK+ L+E           
Sbjct: 135 --------------------ELIEEQLQELERISGLTA--EEAKEILLE----------- 161

Query: 288 IADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQ 342
                 ++    + +A +E   +  ++I  E+         K EA K+A +IL Q
Sbjct: 162 ------KV----EEEARHEAAVLI-KEIEEEA---------KEEADKKAKEILAQ 196


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 44.2 bits (105), Expect = 3e-04
 Identities = 28/136 (20%), Positives = 59/136 (43%), Gaps = 19/136 (13%)

Query: 327 NLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKR-- 384
           NLK +         ++LD ++++    + + + EL      +  +K+KR E+    +   
Sbjct: 156 NLKDDLESLIASAKEELDQLSKKLAELKAEEEEEL------ERALKEKREELLSKLEEEL 209

Query: 385 IDKLEDHIRQNEASLK-DNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHE 443
           + +LE      E  L+ + ++ KEEL           ++ +++L Q +E   +A   K +
Sbjct: 210 LARLESKEAALEKQLRLEFEREKEEL----------RKKYEEKLRQELERQAEAHEQKLK 259

Query: 444 DTRRKKKQELVENFKK 459
           +    +  EL   F K
Sbjct: 260 NELALQAIELQREFNK 275



 Score = 37.7 bits (88), Expect = 0.031
 Identities = 66/292 (22%), Positives = 111/292 (38%), Gaps = 43/292 (14%)

Query: 170 KRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKR---KAEY 226
           K         R S + +++A A+     V        K I +L   L ++ +    +   
Sbjct: 76  KDEPKQVKIPRQSGLSEEVAEAEAKATSVAAEA-TTPKSIQELVEALEELLEELLKETAS 134

Query: 227 ER--QSIPGRDINL-ESAQDVEINKKRPSLIKS-KERVSHIQKKLASAKKSLVE--VRQA 280
           +   Q +     +L +S ++  +     SLI S KE +  + KKLA  K    E   R  
Sbjct: 135 DPVVQELVSIFNDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERAL 194

Query: 281 NEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKIL 340
            E   + ++ LE +L      + E +          LE     E+   K E  K+  + L
Sbjct: 195 KEKREELLSKLEEEL----LARLESKE-------AALEKQLRLEFEREKEELRKKYEEKL 243

Query: 341 QQLDTINREQKGDQDKLDNELRQQVQTQN-----EIKKKRHE--------MEEAQKRIDK 387
           +Q   + R+ +  + KL NEL  Q          EIK+K  E        + E   R+  
Sbjct: 244 RQ--ELERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKG 301

Query: 388 LEDHIRQNEASLKDNKKLK------EELNSDV-GSSKNRVQELQKELEQVIE 432
           LE  +     +  +N K++      E L S +   S    + L KEL+ + E
Sbjct: 302 LEKALDSRSEAEDENHKVQQLWLAVEALKSALKSGSAGSPRPLVKELDALKE 353


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 43.7 bits (104), Expect = 5e-04
 Identities = 49/260 (18%), Positives = 91/260 (35%), Gaps = 49/260 (18%)

Query: 210 ADLETQLADVRKRKAEYERQSIPGRDIN-----LESAQDV--EINKKRPSLIKSKERVSH 262
           AD++ QL  + K+K       +  +D+      L+       E  + +  L ++  ++  
Sbjct: 39  ADVQAQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQ 98

Query: 263 IQKKLASAKKSLVEVRQANEAHNKDIADLE-------TQLADVRKRKAEYERQSIPGRDI 315
            Q +L + K    E  +   +    +  LE        QL + +   AEY  Q +  +  
Sbjct: 99  AQAELEALKDDNDEETRETLS-TLSLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQ 157

Query: 316 NLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKR 375
             E AQ   Y N +     R  +I   L       K     L    R  +Q +       
Sbjct: 158 -PERAQAALYANSQ-----RLQQIRNLLKGGKVGGK----ALRPSQRVLLQAE------- 200

Query: 376 HEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELG 435
                       L         SL+ N +L++ L         R+Q L+ +L Q+++E  
Sbjct: 201 ---------QALLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQL-QLLQEAI 250

Query: 436 DAKTDKHEDTRRKKKQELVE 455
           ++K       R    ++ V+
Sbjct: 251 NSK-------RLTLSEKTVQ 263



 Score = 38.7 bits (91), Expect = 0.020
 Identities = 59/329 (17%), Positives = 119/329 (36%), Gaps = 93/329 (28%)

Query: 94  AKEVEHQLFKLY---HNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALN 150
             +V+ QL  L      E + K ++ +L++    ++KI+R+KE+ E            L 
Sbjct: 38  EADVQAQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETE-----------QLK 86

Query: 151 RELAKVDQEIREMDVEINK-KRPSLIKSKER-----VSHIQKKLASAKKSLVEVRQANEA 204
           ++LA+   ++R+   E+   K  +  +++E      +  ++ +LA     L   +     
Sbjct: 87  QQLAQAPAKLRQAQAELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAE 146

Query: 205 HNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQ 264
           +N  +  L+TQ    R + A Y          N +  Q  +I           + +   Q
Sbjct: 147 YNSQLVSLQTQPE--RAQAALYA---------NSQRLQ--QIRNLLKGGKVGGKALRPSQ 193

Query: 265 KKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRD--INLESAQM 322
           + L  A+++L                L  Q         + +R+S+ G     +L   Q 
Sbjct: 194 RVLLQAEQAL----------------LNAQN--------DLQRKSLEGNTQLQDLLQKQR 229

Query: 323 TEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQT-QNEIKKKRHEMEEA 381
            +Y   +                I R            L  Q+Q  Q  I  KR  +  +
Sbjct: 230 -DYLTAR----------------IQR------------LEHQLQLLQEAINSKR--LTLS 258

Query: 382 QKRIDKLEDHIRQNEASLKDNKKLKEELN 410
           +K + + +   +   A ++ N  + +EL 
Sbjct: 259 EKTVQEAQS--QDEAARIQANPLVAQELE 285


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 43.1 bits (102), Expect = 8e-04
 Identities = 44/237 (18%), Positives = 91/237 (38%), Gaps = 47/237 (19%)

Query: 252 SLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE-YERQSI 310
            L K K R  + Q +    K   ++  Q  + +NK+I +   +  +   RK   Y+    
Sbjct: 170 KLNKDKIRELNQQIQTLDMKIDHIQ--QQIKTYNKNIEEQRKKNGENIARKQNKYDELVE 227

Query: 311 PGRDINLESAQMTE-----------YTNLKAEATKRAGKILQQLDTINRE----QKGD-- 353
             + I  E  ++T+            +    +    A KI  +++   +     +KG   
Sbjct: 228 EAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVC 287

Query: 354 -----------------QDKLDN---ELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIR 393
                            +DKL      L +     +E+++   E  E  K++ +L++ I 
Sbjct: 288 PTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKIS 347

Query: 394 QNEASL-------KDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHE 443
            N+ SL       K  K   EEL ++   +   + +LQ EL+++++   +   +K+ 
Sbjct: 348 TNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVKEKYH 404



 Score = 38.5 bits (90), Expect = 0.023
 Identities = 33/215 (15%), Positives = 79/215 (36%), Gaps = 31/215 (14%)

Query: 112 KELEDELDKKKGEV-----EKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
            +  +E  KK GE       K +   E+A+ I    K E   L  EL  +  +I +    
Sbjct: 201 NKNIEEQRKKNGENIARKQNKYDELVEEAKTI----KAEIEELTDELLNLVMDIEDPSAA 256

Query: 167 INKKRPSLIKSKERVSHIQK----------------KLASAKKSLVEVRQANEAHNKDIA 210
           +NK   +  K K ++   QK                +++     + +++   +     + 
Sbjct: 257 LNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLE 316

Query: 211 DLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASA 270
            L+T + ++ +   E+  QS    ++        +I+  + SLI   ++   ++  +   
Sbjct: 317 KLDTAIDELEEIMDEFNEQSKKLLELK------NKISTNKQSLITLVDKAKKVKAAIEEL 370

Query: 271 KKSLVEVRQANEAHNKDIADLETQLADVRKRKAEY 305
           +   V+  +       ++  +    +++ K K   
Sbjct: 371 QAEFVDNAEELAKLQDELDKIVKTKSELVKEKYHR 405



 Score = 35.8 bits (83), Expect = 0.12
 Identities = 18/105 (17%), Positives = 40/105 (38%), Gaps = 6/105 (5%)

Query: 622 DKEMGNLKAQKEKLSEELREAMKK----SRKESELNTVQSTIKGLEIRLNYSRQDLQNTK 677
              +  +K + ++L   L +             E N     +  L+ +++ ++Q L    
Sbjct: 298 PDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLV 357

Query: 678 SQIAKLEAEIDALNA-RADATEPKIKAIEASMTARGDTISRKKEE 721
            +  K++A I+ L A   D  E  +  ++  +     T S   +E
Sbjct: 358 DKAKKVKAAIEELQAEFVDNAEE-LAKLQDELDKIVKTKSELVKE 401


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 43.1 bits (102), Expect = 9e-04
 Identities = 41/263 (15%), Positives = 97/263 (36%), Gaps = 28/263 (10%)

Query: 178 KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDIN 237
           +ER+   ++ L SA    V  ++  E              + +KR     R ++    ++
Sbjct: 606 RERLQQAEEALQSA----VAKQKQAEEQLVQAN----AELEEQKRAEAEARTALKQARLD 657

Query: 238 LESAQDVEINKKRPSLIKSKERVSHIQK---KLASAKKSLVEVRQANEAHNKDIADLETQ 294
           L+  Q+ + + K    +   ER    +    +L +  K L+E +QA     KD       
Sbjct: 658 LQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKD------D 711

Query: 295 LADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQ 354
             ++R  +    +      D  L          L A       +   +L  + ++   + 
Sbjct: 712 FRELRTERLAKWQVVEGELDNQL--------AQLSAAIEAARTQAKARLKELKKQYDREL 763

Query: 355 DKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKL---EDHIRQNEASLKDNKKLKEELNS 411
             LD +     + + +I++    +E    R  ++      +++        ++ +  L  
Sbjct: 764 ASLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAI 823

Query: 412 DVGSSKNRVQELQKELEQVIEEL 434
            +   ++  +ELQ+EL ++I++ 
Sbjct: 824 QLRELESSAEELQQELTRLIKDT 846



 Score = 42.4 bits (100), Expect = 0.001
 Identities = 72/426 (16%), Positives = 152/426 (35%), Gaps = 57/426 (13%)

Query: 70  AERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIE 129
           AE +E + E     + Q    E   KE   +L +      ++     +L   + E+E +E
Sbjct: 280 AEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQ------ELSAANAKLAADRSELELLE 333

Query: 130 RRKEKAENILREK-KKEQGAL---NRELAKVDQEIREMDVEINK------KRPSLIKS-- 177
            +K   E+   E+ + +   L     EL +V+  +  +  +         +    IK   
Sbjct: 334 DQKGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQL 393

Query: 178 KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVR---KRKAEYERQSIPGR 234
           +  +    ++LA+ ++     + A     +D+  LE+QL       K +   E   +  R
Sbjct: 394 ERDLEKNNERLAAIREEKDRQKAA---IEEDLQALESQLRQQLEAGKLEFNEEEYELELR 450

Query: 235 DINLESAQ----------------DVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVR 278
              L+                   D  + K +    +++  V  +Q +L   +K      
Sbjct: 451 LGRLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQLQSELRQLRK---RRD 507

Query: 279 QANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGK 338
           +A EA  +     E +L  +R+   E E Q  P     L   +     N      +  GK
Sbjct: 508 EALEALQRA----ERRLLQLRQALDELELQLSPQAGSLLHFLR-----NEAPGWEESIGK 558

Query: 339 IL---QQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEA-QKRIDKLEDHIRQ 394
           ++       T    Q  +    D      +  Q          E   ++R+ + E+ ++ 
Sbjct: 559 VISPELLERTDLDPQLVEGSDSDTLYGVSLDLQRLDVPDYAANETELRERLQQAEEALQS 618

Query: 395 NEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELV 454
             A  K  ++   + N+++   K    E +  L+Q   +L   + ++ +  + K +  + 
Sbjct: 619 AVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQ-NEQQSLKDKLELAIA 677

Query: 455 ENFKKA 460
           E  ++A
Sbjct: 678 ERKQQA 683



 Score = 42.0 bits (99), Expect = 0.002
 Identities = 77/431 (17%), Positives = 158/431 (36%), Gaps = 59/431 (13%)

Query: 62  YLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKK 121
           Y + K  + E+ E  +EK+  E+   IREE   ++           E D++ LE +L ++
Sbjct: 382 YERLKQKIKEQLERDLEKNN-ERLAAIREEKDRQKAAI--------EEDLQALESQLRQQ 432

Query: 122 KGEVEKIERRKEKAENILREKKKEQGALNRELAKVD--QEIREMDVEINKKRPSLIKSKE 179
                      E+   +     + +  L+   A  +  +++   D  + K +    +++ 
Sbjct: 433 LEA--GKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEA 490

Query: 180 RVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLE 239
            V  +Q +L   +K      +A EA  +     E +L  +R+   E E Q  P     L 
Sbjct: 491 NVEQLQSELRQLRK---RRDEALEALQRA----ERRLLQLRQALDELELQLSPQAGSLLH 543

Query: 240 --SAQDVEINKKRPSLIKSK--ERVS-HIQKKLASAKKSL------VEVRQANEAHNKD- 287
               +     +    +I  +  ER     Q    S   +L      ++     +    + 
Sbjct: 544 FLRNEAPGWEESIGKVISPELLERTDLDPQLVEGSDSDTLYGVSLDLQRLDVPDYAANET 603

Query: 288 -----IADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQ 342
                +   E  L     ++ + E Q +     N E  +          A K+A   LQ+
Sbjct: 604 ELRERLQQAEEALQSAVAKQKQAEEQLV---QANAELEEQKRAEAEARTALKQARLDLQR 660

Query: 343 L--------DTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQK------RIDKL 388
           L        D +       + + + +LRQ      ++ +++    EA K      R ++L
Sbjct: 661 LQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERL 720

Query: 389 EDHIRQNEASLKDNK--KLKEELNSDVGSSKNRVQELQKELEQVIEELG-DAKTDKHEDT 445
                Q      DN+  +L   + +    +K R++EL+K+ ++ +  L  D  T K    
Sbjct: 721 AK--WQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKR 778

Query: 446 RRKKKQELVEN 456
           + ++ +  +E 
Sbjct: 779 QIEELETTIER 789



 Score = 36.6 bits (85), Expect = 0.078
 Identities = 45/246 (18%), Positives = 97/246 (39%), Gaps = 38/246 (15%)

Query: 626 GNLKAQKEKLSEELRE-AMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLE 684
           G L A +E+L+EE  E    K+R   +L T++  +K     LN   Q+L    +++A   
Sbjct: 270 GELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELN---QELSAANAKLAADR 326

Query: 685 AEIDALNARADATE----PKIKA-------IEASMTARGDTISRKKEEMNSVEDIVFRDF 733
           +E++ L  +  A E     +++A       I + +      +     +   V+    R +
Sbjct: 327 SELELLEDQKGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQ----RKY 382

Query: 734 CKSIGV--------STIRQYEEAELRSQQERQKICQDKDTKKNVARWE-------RAVSD 778
            +                    A +R +++RQK   ++D +   ++            ++
Sbjct: 383 ERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNE 442

Query: 779 DEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKD 838
           +E EL    G  ++      A  ++LE +        +A++   EE  +A   V  +  +
Sbjct: 443 EEYELELRLGRLKQRLDSATATPEELEQLEI----NDEALEKAQEEQEQAEANVEQLQSE 498

Query: 839 IQAAQK 844
           ++  +K
Sbjct: 499 LRQLRK 504



 Score = 36.2 bits (84), Expect = 0.13
 Identities = 50/316 (15%), Positives = 125/316 (39%), Gaps = 33/316 (10%)

Query: 95  KEVEHQLFKLYHNETDIKELEDELDKKKGEVEK-IERRKEKAENILREKKKEQGALNREL 153
           +        L     D++ L++E    K ++E  I  RK++AE  LR+   +   L  + 
Sbjct: 642 RAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQ 701

Query: 154 AKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLE 213
               + +++   E+  +R  L K +     +  +LA    ++   R   +A    + +L+
Sbjct: 702 QAFLEALKDDFRELRTER--LAKWQVVEGELDNQLAQLSAAIEAARTQAKAR---LKELK 756

Query: 214 TQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKS 273
            Q             + +   D++  +    E+ ++   L  + ER++  + ++   +  
Sbjct: 757 KQY-----------DRELASLDVDPNT--VKELKRQIEELETTIERIAVRRPEVREYRAF 803

Query: 274 LVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEAT 333
           + E     ++  ++  +L  QL ++     E +++          +  + +    + +  
Sbjct: 804 MQETWLHRDSLREERPNLAIQLRELESSAEELQQE---------LTRLIKDTKLRRKKLE 854

Query: 334 KRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIR 393
           +    + +QLD ++   +G +D++     +Q+    E            +R+D+LE+  R
Sbjct: 855 QERKALEKQLDQLDELLRGLRDEM-----RQLAELKEPANANQAEGSISERLDQLEEFKR 909

Query: 394 QNEASLKDNKKLKEEL 409
           + +    D KK  E  
Sbjct: 910 KRKRLSGDLKKFLERF 925



 Score = 32.0 bits (73), Expect = 2.1
 Identities = 49/296 (16%), Positives = 103/296 (34%), Gaps = 52/296 (17%)

Query: 662 LEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADATEPKIKAIEASMTARG--DTISRKK 719
           L  RL  + + LQ+  ++  + E ++   NA  +    K    EA    +     + R +
Sbjct: 605 LRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQ--KRAEAEARTALKQARLDLQRLQ 662

Query: 720 EEMNSVED---IVFRDFCKSIGVSTIRQYEEAELRSQQERQKICQDKDTKKN-----VAR 771
            E  S++D   +   +  +       +   + +   +Q++  +   KD  +      +A+
Sbjct: 663 NEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAK 722

Query: 772 WERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARRE 831
           W+    + + +LA+   A E    + +A   +L+      L        +D         
Sbjct: 723 WQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASL----DVDPNT------ 772

Query: 832 VGSIAKDIQAAQKSCVNLESKLEMKKSERHDIL-----MN---CKMNDIVL--PMLRVQK 881
           V  + + I+        LE+ +E     R ++      M       + +    P L +Q 
Sbjct: 773 VKELKRQIE-------ELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAIQL 825

Query: 882 YDRKLAKSIQEMTSRLQTIQAP----------NLRAMEKLEH-AKENLMKTNEEFE 926
             R+L  S +E+   L  +               +A+EK      E L    +E  
Sbjct: 826 --RELESSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDEMR 879


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 41.9 bits (99), Expect = 0.001
 Identities = 29/117 (24%), Positives = 56/117 (47%), Gaps = 4/117 (3%)

Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELA---KVDQEIREMDV-EIN 168
            LE+ L+  K + E + +      +I  + +K+  AL  E+A   ++  E+   D  E+ 
Sbjct: 151 GLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELE 210

Query: 169 KKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 225
           K R  L     ++S  +K+L   ++ L E+  A EA     ++L  ++A+  K + E
Sbjct: 211 KARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREE 267



 Score = 33.1 bits (76), Expect = 0.84
 Identities = 35/189 (18%), Positives = 78/189 (41%), Gaps = 12/189 (6%)

Query: 127 KIERRKEKAENILREKKKEQGALNRELAKVDQEIRE-MDVEI-NKKRPSLIKSKE----- 179
            I   ++    I  E   E   L RE      +I+  MD +    K  + ++SK+     
Sbjct: 81  YISEGRDLFREIEEETLAENPPLFREYFSASPDIKLLMDNQFQLVKTHARLQSKKTWYEW 140

Query: 180 RVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLE 239
           R+  ++      +++L  +++  E  NKD+  + +    +RK+    + +       +L 
Sbjct: 141 RMQLLEGLKEGLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIA-----SLR 195

Query: 240 SAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVR 299
              D         L K+++ +  +  K++  +K L E++Q  +     I  L  + +++ 
Sbjct: 196 QLADELNLCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELL 255

Query: 300 KRKAEYERQ 308
           +  AE E+ 
Sbjct: 256 EEIAEAEKI 264


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 41.6 bits (98), Expect = 0.003
 Identities = 80/415 (19%), Positives = 169/415 (40%), Gaps = 65/415 (15%)

Query: 66  KGVVAERKEAKIEKDEAEKYQ---RIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKK 122
                   E ++ + E+E  Q   ++ +E     V     K+   E  I+ELE+EL+ ++
Sbjct: 2   LERQKRELENQLYRKESELSQLSSKLEDEQAL--VAQLQKKIKELEARIRELEEELEAER 59

Query: 123 GEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMD------VEINKKRPSLIK 176
               K E+ +                L+REL ++ + + E        +E+NKKR     
Sbjct: 60  AARAKAEKARAD--------------LSRELEELSERLEEAGGATAAQIELNKKR----- 100

Query: 177 SKERVSHIQKKLASAKKSLVEVRQANEA-HNKDIADLETQLADVRKRK--AEYERQSIPG 233
            +  ++ ++K L  A     E        H   I +L  Q+  ++K+K  AE E+  +  
Sbjct: 101 -EAELAKLRKDLEEANLQHEEALATLRKKHQDAINELSEQIEQLQKQKAKAEKEKSQLQA 159

Query: 234 RDINLESAQDVEINKKRPSLIKSK---ERVSHIQKKLASAKKSLVEVRQA-------NEA 283
              +L +  D     K  +  K+K    ++S +Q KL   ++ L ++          N  
Sbjct: 160 EVDDLLAQLDQIAKAKLNAEKKAKQLESQLSELQVKLDELQRQLNDLTSQKSRLQSENSD 219

Query: 284 HNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQL 343
             + + + E Q++++ K K++ E             +Q+ E      E ++    +  QL
Sbjct: 220 LTRQLEEAEAQVSNLSKLKSQLE-------------SQLEEAKRSLEEESRERANLQAQL 266

Query: 344 DTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQK--------RIDKLEDHIRQN 395
             +  +    +++L+ E   + + + ++ K   E+++ +         R ++LE+  ++ 
Sbjct: 267 RQLEHDLDSLREQLEEESEAKAELERQLSKANAEIQQWRSKFESEGALRAEELEELKKKL 326

Query: 396 EASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKK 450
              + + ++  E  N+   S +     LQ ELE +  EL  A     E  +++K 
Sbjct: 327 NQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAASELEKKQKN 381



 Score = 37.7 bits (88), Expect = 0.040
 Identities = 76/359 (21%), Positives = 155/359 (43%), Gaps = 57/359 (15%)

Query: 108 ETDIKELEDELD-KKKGEVEKIER--------RKEKAENILREKKKEQGALN-------- 150
           E  + E E+E +  +K     IE          K KAE   R KKK +G +N        
Sbjct: 506 ERRLAEKEEEFENTRKNHQRAIESLQATLEAEAKGKAE-ASRLKKKLEGDINELEIALDH 564

Query: 151 ---------RELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQA 201
                    + + K  Q+++E+  ++ +++ +   ++E+++  +++  + +  L E+R A
Sbjct: 565 ANKANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQLAVAERRATALEAELEELRSA 624

Query: 202 NEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVS 261
            E             A+  +++AE E      R   L +     I +KR    K +  ++
Sbjct: 625 LEQ------------AERARKQAETELAEASERVNELTAQNSSLIAQKR----KLEGELA 668

Query: 262 HIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQ 321
            +Q  L  A     E++ A E   K  AD      ++R+ +   +      + +  +  +
Sbjct: 669 ALQSDLDEAVN---ELKAAEERAKKAQADAARLAEELRQEQEHSQHLERLRKQLESQVKE 725

Query: 322 MTE-YTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEM-- 378
           +       +A A K   K++Q+L+   RE + +   LD E R+  +TQ  ++K    +  
Sbjct: 726 LQVRLDEAEAAALKGGKKMIQKLEARVRELEAE---LDGEQRRHAETQKNLRKMERRVKE 782

Query: 379 -----EEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIE 432
                EE +K +++L+D + + +A +K  K+  EE       + ++ ++ Q+ELE   E
Sbjct: 783 LQFQVEEDKKNLERLQDLVDKLQAKIKTYKRQLEEAEEVAQINLSKYRKAQRELEDAEE 841



 Score = 33.5 bits (77), Expect = 0.76
 Identities = 63/364 (17%), Positives = 155/364 (42%), Gaps = 37/364 (10%)

Query: 71  ERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIER 130
           E+K+   +K  AE  +++ E     +   +  +     T++  L++EL++ K +VE + R
Sbjct: 376 EKKQKNFDKILAEWKRKVDELQAELDTAQREARNL--STELFRLKNELEELKDQVEALRR 433

Query: 131 RKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLAS 190
             +  ++ + +   + G   R + ++++  R ++ E ++ + +L +++  +   + K   
Sbjct: 434 ENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEAEAALELEESK--- 490

Query: 191 AKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKR 250
             ++ VE+ Q      + +A+ E +  + RK               N + A +  +    
Sbjct: 491 VLRAQVELSQIRSEIERRLAEKEEEFENTRK---------------NHQRAIE-SLQATL 534

Query: 251 PSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSI 310
            +  K K   S ++KKL      + E+  A +  NK  A+ +  +   +++  E + Q  
Sbjct: 535 EAEAKGKAEASRLKKKLEG---DINELEIALDHANKANAEAQKNVKKYQQQVKELQTQ-- 589

Query: 311 PGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNE 370
               +  E     +     A A +RA  +  +L+ +       +  L+   R + Q + E
Sbjct: 590 ----VEEEQRAREDAREQLAVAERRATALEAELEEL-------RSALEQAERARKQAETE 638

Query: 371 IKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQV 430
           + +    + E   +   L    R+ E  L   +   +E  +++ +++ R ++ Q +  ++
Sbjct: 639 LAEASERVNELTAQNSSLIAQKRKLEGELAALQSDLDEAVNELKAAEERAKKAQADAARL 698

Query: 431 IEEL 434
            EEL
Sbjct: 699 AEEL 702



 Score = 32.7 bits (75), Expect = 1.3
 Identities = 66/347 (19%), Positives = 145/347 (41%), Gaps = 61/347 (17%)

Query: 109 TDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEIN 168
           +++++L+ EL++      ++E++++  + IL E K++   L  EL    +E R +  E+ 
Sbjct: 356 SELEDLQIELERANAAASELEKKQKNFDKILAEWKRKVDELQAELDTAQREARNLSTELF 415

Query: 169 KKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYER 228
           + +  L + K++V                +R+ N+    +I DL  QL +  +   E E+
Sbjct: 416 RLKNELEELKDQVE--------------ALRRENKNLQDEIHDLTDQLGEGGRNVHELEK 461

Query: 229 QSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDI 288
                         + E ++ + +L +++  +   + K     ++ VE+ Q      + +
Sbjct: 462 ---------ARRRLEAEKDELQAALEEAEAALELEESK---VLRAQVELSQIRSEIERRL 509

Query: 289 ADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINR 348
           A+ E +  + RK            R I    A +      KAEA++   K+   +     
Sbjct: 510 AEKEEEFENTRKNHQ---------RAIESLQATLEAEAKGKAEASRLKKKLEGDI----- 555

Query: 349 EQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEE 408
                     NEL   +   N+         EAQK + K +  +++ +  +++ ++ +E+
Sbjct: 556 ----------NELEIALDHANKAN------AEAQKNVKKYQQQVKELQTQVEEEQRARED 599

Query: 409 LNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVE 455
               +  ++ R   L+ ELE++   L  A     E  R++ + EL E
Sbjct: 600 AREQLAVAERRATALEAELEELRSALEQA-----ERARKQAETELAE 641


>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
           Provisional.
          Length = 1109

 Score = 41.6 bits (98), Expect = 0.003
 Identities = 73/344 (21%), Positives = 148/344 (43%), Gaps = 63/344 (18%)

Query: 128 IERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKR--PSLIKSKERVSHIQ 185
           +E RK   E      K+ Q  ++    K+  E+R+   ++N +R  P  +        ++
Sbjct: 57  LEERKGSLERA----KQYQQVID-NFPKLSAELRQ---QLNNERDEPRSVPPNMSTDALE 108

Query: 186 KKLASAKKSLVEV-RQANEAHNK--DIADLETQL----ADVRKRKAEYER--QSIPGRDI 236
           +++      L+E  RQA +  ++  +I+D  +QL     + R++  E ER  Q++   + 
Sbjct: 109 QEILQVSSQLLEKSRQAQQEQDRAREISDSLSQLPQQQTEARRQLNEIERRLQTLGTPNT 168

Query: 237 NLESAQDVEINKKRPSLIKSKERVSHIQKKLASA--KKSL----VEVRQANEAH-NKDIA 289
            L  AQ   +  +  +L   K  V  ++    SA  ++ L     E+ +      +  + 
Sbjct: 169 PLAQAQLTALQAESAAL---KALVDELELAQLSANNRQELARLRSELAKKRSQQLDAYLQ 225

Query: 290 DLETQLADVRKRKAE--------YERQS--IP-------------GRDINLESAQMTEYT 326
            L  QL   R+R+AE           QS  +P              + +N ++ +M    
Sbjct: 226 ALRNQLNSQRQREAERALESTELLAEQSGDLPKSIVAQFKINRELSQALNQQAQRMDLIA 285

Query: 327 NLKAEATKRAGKILQQLDTINREQK---GDQDKLDNELRQQVQTQNEIKK-KRHEMEEAQ 382
           + + +A  +  ++ Q L+T+ REQ    G  + L   LR QV    E+ K ++ + E AQ
Sbjct: 286 SQQRQAASQTLQVRQALNTL-REQSQWLGVSNALGEALRAQVARLPEMPKPQQLDTEMAQ 344

Query: 383 KRIDKL--EDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQ 424
            R+ +L  ED + +        +++++     + + +NR+ + Q
Sbjct: 345 LRVQRLRYEDLLNK----QPQLRQIRQADGQPLTAEQNRILDAQ 384


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 41.1 bits (97), Expect = 0.003
 Identities = 48/258 (18%), Positives = 96/258 (37%), Gaps = 53/258 (20%)

Query: 78  EKDEAEKYQRIREEIVAKEVEHQLFKL--YHNETDIKELEDELDKKKGEVEKIERRKEKA 135
           E+   E+ +++R      ++   L KL  Y  + +    E +    K   E I+  +E+ 
Sbjct: 38  EELSNERLRKLRSL--LTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEEL 95

Query: 136 ENILREKKKEQGALNRELAKVDQEIREMDVEINKKRP--------SLIKSKE-------R 180
           E I +E K+    L  E+++++ EI+E++ EI +  P        SL+   +        
Sbjct: 96  EKIEKEIKE----LEEEISELENEIKELEQEIERLEPWGNFDLDLSLLLGFKYVSVFVGT 151

Query: 181 VSHIQKKLASAKKSLVEVRQANEAHNKD---IADLETQLADVRK--RKAEYERQSIPGRD 235
           V   + +    +  +  V   +         +  L+    +V +  +K  +ER  +    
Sbjct: 152 VPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELE--- 208

Query: 236 INLESAQDVEINKKRPS--LIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLET 293
                       +  PS  + + KE +  I+K+  S  + L E+ +            E 
Sbjct: 209 -----------EEGTPSELIREIKEELEEIEKERESLLEELKELAKKYL---------EE 248

Query: 294 QLADVRKRKAEYERQSIP 311
            LA     + E ER    
Sbjct: 249 LLALYEYLEIELERAEAL 266



 Score = 38.0 bits (89), Expect = 0.026
 Identities = 45/291 (15%), Positives = 105/291 (36%), Gaps = 52/291 (17%)

Query: 188 LASAKKSLVEV-RQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEI 246
           L S K  ++E   +    H   I DL+ +L++ R RK       +      L S      
Sbjct: 14  LKSYKDEVLEALHELGVVH---IEDLKEELSNERLRKLRSLLTKLSEALDKLRSYLPKL- 69

Query: 247 NKKRPSLIKSKERVSHIQKKLASAKKSLVEV-RQANEAHNKDIADLETQLADVRKRKAEY 305
                   K     S +++ +   ++ L ++ ++  E   + I++LE ++ ++ +     
Sbjct: 70  -NPLREEKKKVSVKS-LEELIKDVEEELEKIEKEIKELEEE-ISELENEIKELEQEIERL 126

Query: 306 ERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQV 365
           E    P  + +L+ + +  +  +        G +              +DKL+    +  
Sbjct: 127 E----PWGNFDLDLSLLLGFKYVSV----FVGTV-------------PEDKLEELKLESD 165

Query: 366 QTQNEIKKKRHEME-----EAQKRIDKLEDHIRQNEAS---LKDNKKLKEELNSDVGSSK 417
               E                ++  D++E+ +++       L++     E +       K
Sbjct: 166 VENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSELIRE----IK 221

Query: 418 NRVQELQKELEQVIEELGDAKTDKHEDTRR---------KKKQELVENFKK 459
             ++E++KE E ++EEL +    K+ +             ++ E +  F K
Sbjct: 222 EELEEIEKERESLLEELKE-LAKKYLEELLALYEYLEIELERAEALSKFLK 271



 Score = 31.1 bits (71), Expect = 3.5
 Identities = 42/263 (15%), Positives = 99/263 (37%), Gaps = 49/263 (18%)

Query: 721 EMNSVEDIVFRDFCKSIGVSTIRQYEE---------AELRSQQERQKICQDKDTKKNVAR 771
           E+  V      D  + +    +R+             +LRS   +    +++  K +V  
Sbjct: 27  ELGVVH---IEDLKEELSNERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKS 83

Query: 772 WERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVD---AMDEEIGKA 828
            E  + D EEEL +     EK   E+  E  +LEN       + + ++     D ++   
Sbjct: 84  LEELIKDVEEELEKI----EKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLSLL 139

Query: 829 RRE------VGSIAKDIQAAQKSCVNLESKLEMKKSERHDILMNCKMNDIVLPMLRVQKY 882
                    VG++ +D     K   ++E+   +   + +  ++      +V         
Sbjct: 140 LGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYVYVV------VV--------V 185

Query: 883 DRKLAKSIQEMTSRL--QTIQAPNLR-AMEKLEHAKENLMKTNEEFENARKRAKKAKANF 939
            ++L+  ++E   +L  + ++        E +   KE L +  +E E+  +  K+     
Sbjct: 186 LKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKE----- 240

Query: 940 DRIKKERYDKFTRCFEHVSNEID 962
             + K+  ++    +E++  E++
Sbjct: 241 --LAKKYLEELLALYEYLEIELE 261



 Score = 30.3 bits (69), Expect = 6.5
 Identities = 17/77 (22%), Positives = 26/77 (33%), Gaps = 21/77 (27%)

Query: 630 AQKEKLSEELREAMKKSR-------KESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAK 682
             K     E ++ +            E EL  ++  IK LE               +I++
Sbjct: 66  LPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELE--------------EEISE 111

Query: 683 LEAEIDALNARADATEP 699
           LE EI  L    +  EP
Sbjct: 112 LENEIKELEQEIERLEP 128



 Score = 30.3 bits (69), Expect = 7.0
 Identities = 20/120 (16%), Positives = 43/120 (35%), Gaps = 15/120 (12%)

Query: 52  IQAEEETNMSYLKKKGVVAERKEAKIEK-----DEAEKYQRIREEIVAKEVEHQLFKLYH 106
           +  ++   +        V      K           E    + EE+  K++  +  +L  
Sbjct: 152 VPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEEL--KKLGFERLELE- 208

Query: 107 NETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
              +     + + + K E+E+IE+ +E     L+E  K+      E      E  E+++E
Sbjct: 209 ---EEGTPSELIREIKEELEEIEKERESLLEELKELAKK----YLEELLALYEYLEIELE 261


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 40.4 bits (95), Expect = 0.003
 Identities = 23/147 (15%), Positives = 59/147 (40%), Gaps = 26/147 (17%)

Query: 323 TEYTNLKAE---ATKRAGKILQQLDTINREQKGDQDKLD--------NELRQQVQTQNEI 371
            +   L  E     ++  + L+     + +   D  KL+        N++R ++    E 
Sbjct: 27  LDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKE- 85

Query: 372 KKKRHEMEEAQKRIDKLEDHIRQNEASLKD-NKKLKEELNSDVGSSKNRVQELQKELEQV 430
                E+E+ ++RI++L+  + Q  + L   + +L++   S +   ++ ++  + +L  +
Sbjct: 86  -----EIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKLNAL 140

Query: 431 IEELGDAKTDKHEDTRRKKKQELVENF 457
                        + R    +EL + F
Sbjct: 141 --------HSLLAEKRSFLCRELAKLF 159


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 41.2 bits (97), Expect = 0.003
 Identities = 59/318 (18%), Positives = 128/318 (40%), Gaps = 17/318 (5%)

Query: 116 DELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLI 175
           D +D+  G++  + RR+ K   +      E  AL  EL +  ++  ++  EI   R  L 
Sbjct: 182 DLIDRLAGDLTNVLRRR-KKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELE 240

Query: 176 KSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIP-GR 234
           +++  +  ++KK  S    L E R+  E   + + ++E      R +  E     +P   
Sbjct: 241 EAQRSLESLEKKFRSEGGDLFEEREQLE---RQLKEIEAARKANRAQLRELAADPLPLLL 297

Query: 235 DINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQ 294
             NL  +   ++ K+  S  +++     ++++     +SL ++      H K+IA    +
Sbjct: 298 IPNLLDSTKAQLQKEEQSQ-QNQLTQEELEERDKELLESLPKL-ALPAEHVKEIAAELAE 355

Query: 295 LADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQ 354
           +       +E   +            ++T+   L  +  +       QL    RE + + 
Sbjct: 356 IDKPATTDSEIPHRLSGS--------ELTQLEVLIQQVKRELQDAKSQLLKELRELEEEL 407

Query: 355 DKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVG 414
            ++D ++   + ++ +I +   E+ EAQ  + + E  I +    L+  K+  E L   + 
Sbjct: 408 AEVDKKI-STIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLD 466

Query: 415 SSKNRVQELQKELEQVIE 432
               +      ELE+ I 
Sbjct: 467 EKTKQKIN-AFELERAIT 483



 Score = 35.8 bits (83), Expect = 0.15
 Identities = 50/270 (18%), Positives = 107/270 (39%), Gaps = 13/270 (4%)

Query: 49  AEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNE 108
           +E+   E E      K + +  E    + E +EA++     E+    E      +    E
Sbjct: 209 SEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSEGGDLFEEREQLE 268

Query: 109 TDIKELEDELDKKKGEVEKIERR-------KEKAENILREKKKEQGALNRELAKVDQEIR 161
             +KE+E      + ++ ++              ++   + +KE+ +   +L +   E+ 
Sbjct: 269 RQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQNQLTQE--ELE 326

Query: 162 EMDVEINKKRPSLIKSKERVSHIQKKLA-SAKKSLVEVRQANEAHNKDIADLETQLADVR 220
           E D E+ +  P L    E V  I  +LA   K +  +    +     ++  LE  +  V+
Sbjct: 327 ERDKELLESLPKLALPAEHVKEIAAELAEIDKPATTDSEIPHRLSGSELTQLEVLIQQVK 386

Query: 221 KRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQA 280
           +   + + Q +       E   +V+   K+ S I S+E+++ + ++L  A+  L      
Sbjct: 387 RELQDAKSQLLKELRELEEELAEVD---KKISTIPSEEQIAQLLEELGEAQNELFRSEAE 443

Query: 281 NEAHNKDIADLETQLADVRKRKAEYERQSI 310
            E   + +  L+  +  +RK   E  +Q I
Sbjct: 444 IEELLRQLETLKEAIEALRKTLDEKTKQKI 473


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 41.2 bits (97), Expect = 0.003
 Identities = 43/247 (17%), Positives = 89/247 (36%), Gaps = 33/247 (13%)

Query: 78  EKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAEN 137
            K+  +       +I    +  ++         +K LE  L     EVEK+E   +  E 
Sbjct: 43  LKELEKLKVAEVAQISLSSLLSEVLDYL---RSVKGLEGRLFILPEEVEKLEAELKSLEE 99

Query: 138 ILREKKKEQGALNRELAKVDQEIREMDVEINKKRP---------------SLIKSKE--- 179
           +++  +K    +     K+++ + E+D E+                    SL++  +   
Sbjct: 100 VIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLEELEPLAYLDFDLSLLRGLKFLL 159

Query: 180 -RVSHIQKKLASAKKSLVEVRQANEAHNKD------IADLETQLADVRK--RKAEYERQS 230
            R+  ++++   A   ++E   A    N +      +A     L  V K   +  +E   
Sbjct: 160 VRLGLVRREKLEALVGVIEDEVALYGENVEASVVIVVAHGAEDLDKVSKILNELGFELYE 219

Query: 231 IPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKS-LVEVRQANEAHNKDIA 289
           +P  D         E+ +    +    E +    + LA      L+ VR+  E   K + 
Sbjct: 220 VPEFDGGPSELIS-ELEEVIAEIQDELESLRSELEALAEKIAEELLAVREILEI-EKALG 277

Query: 290 DLETQLA 296
           D+ ++LA
Sbjct: 278 DVLSKLA 284



 Score = 33.9 bits (78), Expect = 0.48
 Identities = 41/287 (14%), Positives = 98/287 (34%), Gaps = 39/287 (13%)

Query: 192 KKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRP 251
              L E+      H +D+ + E  L ++ K K     Q      ++   ++ ++  +   
Sbjct: 19  DPVLAELHDFGLVHLEDLEEGEKGLKELEKLKVAEVAQ----ISLSSLLSEVLDYLRSVK 74

Query: 252 SLIKS----KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYER 307
            L        E V  ++ +L S ++ +    + +    +    LE +L+++ +   + E 
Sbjct: 75  GLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLED 134

Query: 308 QSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQT 367
                    LE  +   Y +      +    +L +L  + RE+          +  +V  
Sbjct: 135 L--------LEELEPLAYLDFDLSLLRGLKFLLVRLGLVRREKLEALVG---VIEDEVAL 183

Query: 368 QNEIKKKRHEM-----EEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRV-Q 421
             E  +    +      E   ++ K+ + +      + +     +   S++ S    V  
Sbjct: 184 YGENVEASVVIVVAHGAEDLDKVSKILNELGFELYEVPE----FDGGPSELISELEEVIA 239

Query: 422 ELQKELEQVIEELGDAKTDKHED---------TRRKKKQELVENFKK 459
           E+Q ELE +  EL     +K  +            K   +++    +
Sbjct: 240 EIQDELESLRSELEALA-EKIAEELLAVREILEIEKALGDVLSKLAR 285


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 40.7 bits (95), Expect = 0.005
 Identities = 58/269 (21%), Positives = 106/269 (39%), Gaps = 53/269 (19%)

Query: 200 QANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQ-DVEINKKRPSLIKSK- 257
            A E   +DI  L+ +L   R+ ++E  R  I        S + D+   KK   ++++K 
Sbjct: 406 SAIERLEQDIKKLQAELQQARQNESEL-RNQISLLTSLERSLKSDLGQLKKENDMLQTKL 464

Query: 258 -ERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDIN 316
              VS  QK     K+S+  + +  ++      + E QLA+ +KRK E E          
Sbjct: 465 NSMVSAKQKD----KQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEE---------- 510

Query: 317 LESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQT-QNEIKKKR 375
                         E   RA          +RE+  +       L+Q  Q  + EIKK  
Sbjct: 511 --------------ETAARAAAQAAA----SREECAES------LKQAKQDLEMEIKKLE 546

Query: 376 HEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELE------- 428
           H+++  ++    LE   ++     ++++K  E L S + + +++   L+  L        
Sbjct: 547 HDLKLKEEECRMLEKEAQELR-KYQESEKETEVLMSALQAMQDKNLMLENSLSAETRLKL 605

Query: 429 QVIEELGDAKTDKH--EDTRRKKKQELVE 455
            +   LGD K           K++ E+V+
Sbjct: 606 DLFSALGDVKRQLEIAHGAIYKREAEIVD 634



 Score = 30.3 bits (68), Expect = 6.1
 Identities = 45/221 (20%), Positives = 94/221 (42%), Gaps = 17/221 (7%)

Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEI 167
           E DIK+L+ EL + +    ++  +     ++ R  K + G L +E   +  ++  M    
Sbjct: 412 EQDIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAK 471

Query: 168 NKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYE 227
            K + S+   ++R+        +A+K L E ++  +   +  A    Q A  R+  AE  
Sbjct: 472 QKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESL 531

Query: 228 RQSIPGRDINLESAQ-DVEINKKRPSLIKSKERVSHIQKKLASAKKS------LVEVRQA 280
           +Q+   +D+ +E  + + ++  K       ++    ++K   S K++      L  ++  
Sbjct: 532 KQAK--QDLEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLMSALQAMQDK 589

Query: 281 NEAHNKDIA-------DLETQLADVRKRKAEYERQSIPGRD 314
           N      ++       DL + L DV KR+ E    +I  R+
Sbjct: 590 NLMLENSLSAETRLKLDLFSALGDV-KRQLEIAHGAIYKRE 629


>gnl|CDD|213239 cd03272, ABC_SMC3_euk, ATP-binding cassette domain of eukaryotic SMC3
            proteins.  The structural maintenance of chromosomes
            (SMC) proteins are large (approximately 110 to 170 kDa),
            and each is arranged into five recognizable domains.
            Amino-acid sequence homology of SMC proteins between
            species is largely confined to the amino- and
            carboxy-terminal globular domains. The amino-terminal
            domain contains a 'Walker A' nucleotide-binding domain
            (GxxGxGKS/T, in the single-letter amino-acid code), which
            by mutational studies has been shown to be essential in
            several proteins. The carboxy-terminal domain contains a
            sequence (the DA-box) that resembles a 'Walker B' motif,
            and a motif with homology to the signature sequence of
            the ATP-binding cassette (ABC) family of ATPases. The
            sequence homology within the carboxy-terminal domain is
            relatively high within the SMC1-SMC4 group, whereas SMC5
            and SMC6 show some divergence in both of these sequences.
            In eukaryotic cells, the proteins are found as
            heterodimers of SMC1 paired with SMC3, SMC2 with SMC4,
            and SMC5 with SMC6 (formerly known as Rad18).
          Length = 243

 Score = 39.6 bits (93), Expect = 0.005
 Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 1/59 (1%)

Query: 995  HRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGI 1053
             +  PAPF++ DEIDAALD      VA+ I  +  D  Q I  + + E    AD   G+
Sbjct: 176  QKCDPAPFYLFDEIDAALDAQYRTAVANMI-KELSDGAQFITTTFRPELLEVADKFYGV 233



 Score = 33.4 bits (77), Expect = 0.47
 Identities = 21/53 (39%), Positives = 25/53 (47%), Gaps = 11/53 (20%)

Query: 13 FKSYKGKFSIGPL-KKFTAVIGPNGSG----------ALKDDYERLKAEMIQA 54
          FKSYK +  I P   K   V+G NGSG           L D+Y  L+ E  QA
Sbjct: 9  FKSYKDQTVIEPFSPKHNVVVGRNGSGKSNFFAAIRFVLSDEYTHLREEQRQA 61


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 38.7 bits (90), Expect = 0.008
 Identities = 43/208 (20%), Positives = 85/208 (40%), Gaps = 26/208 (12%)

Query: 8   IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG----------ALKDDYERLKAEMIQAEEE 57
           + + NF S+K    I   K  T + GPNGSG          AL     RLK    +   +
Sbjct: 2   LTIKNFGSFKD-LEIDFSKGLTLIYGPNGSGKTTILDAIRWALYGKTSRLKKSKGRGIVK 60

Query: 58  TNMSYLKKKGVVAERKEAKIE--KDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELE 115
            ++   K +G      E   E  K + +         + K+   ++ K      +I EL+
Sbjct: 61  GDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQ 120

Query: 116 DELDKKKGEVEKIER--------RKEKAENILREKKKEQGALNRELAKVDQEIREMDVEI 167
           + +D+     ++I          R E+ +   +EKK+    L +EL +++ E   ++  +
Sbjct: 121 EFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLL 180

Query: 168 NKKRPSLIKSKERVSHIQKKLASAKKSL 195
            +K     + K+ +  ++++L    + L
Sbjct: 181 EEK-----EKKKELEELKEELEELLEEL 203



 Score = 32.1 bits (73), Expect = 0.95
 Identities = 30/157 (19%), Positives = 65/157 (41%), Gaps = 8/157 (5%)

Query: 278 RQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAG 337
            +     + +I   E +     +   E  +  +  R I        +      ++     
Sbjct: 55  GRGIVKGDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKKGKKVKKSILEIV 114

Query: 338 KILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEA 397
           +I +  + I+   K D++ L   L    +   E+K KR E +E  + ++K  + +   + 
Sbjct: 115 EIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKD 174

Query: 398 SLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEEL 434
            L+   + KE+        K  ++EL++ELE+++EEL
Sbjct: 175 LLEKLLEEKEK--------KKELEELKEELEELLEEL 203


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 38.5 bits (90), Expect = 0.008
 Identities = 20/117 (17%), Positives = 49/117 (41%), Gaps = 15/117 (12%)

Query: 341 QQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLK 400
           ++L  + +E    Q++L        + Q ++ + + E +E ++ +  LE  + + +  L 
Sbjct: 66  ERLPELQQELAELQEELA-------ELQEQLAELQQENQELKQELSTLEAELERLQKELA 118

Query: 401 DNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENF 457
             K+L               +EL++EL ++ +E    + +      R ++ E    F
Sbjct: 119 RIKQLSANAIE----LDEENRELREELAELKQENEALEAEN----ERLQENEQRRWF 167



 Score = 32.3 bits (74), Expect = 0.95
 Identities = 14/82 (17%), Positives = 43/82 (52%)

Query: 375 RHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEEL 434
           R  + E Q+ + +L++ + + +  L + ++  +EL  ++ + +  ++ LQKEL ++ +  
Sbjct: 65  RERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLS 124

Query: 435 GDAKTDKHEDTRRKKKQELVEN 456
            +A     E+   +++   ++ 
Sbjct: 125 ANAIELDEENRELREELAELKQ 146


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 39.7 bits (93), Expect = 0.009
 Identities = 34/239 (14%), Positives = 83/239 (34%), Gaps = 45/239 (18%)

Query: 258 ERVSHIQKKLASAKKSLVEVRQAN-EAHNKD-IADLETQLADVRKRKAEYERQSIPGRDI 315
            RV      L S    L   ++   E +++      + +   + ++ AEY RQ+      
Sbjct: 152 CRVVVAPSLLESNSSRLSFAQEWRLEWNSQGLEDYRKRKRQRLDQKAAEYLRQAAQAGTT 211

Query: 316 NLESAQ----------------------MTEYTNLKAEATKRAGKILQQLDTINREQKGD 353
              SA                         +       A +     +   +   +E++ +
Sbjct: 212 GSSSADDLADLLSKFRLDSWDKGSRFTRTEKVEQAATAAAEVPPAEMDTEEDRTKEREAE 271

Query: 354 QDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKE------ 407
            + L  ++ +   +  E+  +   +    K++++    +RQ    L++  K+K+      
Sbjct: 272 LEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELL 331

Query: 408 --------ELNSDVGSSKNRVQELQKELEQV-------IEELGDAKTDKHEDTRRKKKQ 451
                   +L + V +S  R+ EL ++ E            L +   +K ++T+R+  +
Sbjct: 332 PDAENNVAKLQALVVASSERLLELAQQWEAHRTPLIDEYRSLKEKNRNKEDETQRQLDE 390



 Score = 32.4 bits (74), Expect = 1.6
 Identities = 27/168 (16%), Positives = 67/168 (39%), Gaps = 6/168 (3%)

Query: 69  VAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFK--LYHNETDIKELEDELDKKKGEVE 126
                E    K+   + + ++E+I   E   +     +    + IK++ +EL   + E E
Sbjct: 256 AEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENE 315

Query: 127 KIE---RRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSH 183
           ++E   + K++   +L + +     L   +    + + E+  +    R  LI     +  
Sbjct: 316 ELEEEYKIKKRTVELLPDAENNVAKLQALVVASSERLLELAQQWEAHRTPLIDEYRSLKE 375

Query: 184 IQK-KLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQS 230
             + K    ++ L E+++      +  ++L+T+    ++   EYE   
Sbjct: 376 KNRNKEDETQRQLDEIKKLRNKIEELESELQTKEQLYKQLLDEYENAP 423


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 39.7 bits (93), Expect = 0.009
 Identities = 32/220 (14%), Positives = 75/220 (34%), Gaps = 23/220 (10%)

Query: 52  IQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDI 111
           ++   E   +  ++   + +  +A++EKD   KY  + + +              +  DI
Sbjct: 742 LKKLYEFAYTLEEEIQYLEDLLKARLEKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDI 801

Query: 112 KELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE----- 166
               + + K++  ++ I         +L+    + GA       +++ I+E   +     
Sbjct: 802 DFELELIGKQEINIDYI-------LELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKK 854

Query: 167 ------INKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVR 220
                 I        K  +    +++    AKK   E      A  + + + E    D  
Sbjct: 855 KLIERLIEAINQYRAKKLDTAEKLEELYILAKK---EEEFKQFAEEEGLNEEELAFYDDL 911

Query: 221 KRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERV 260
                       G ++  + A++  + +K     K+KE V
Sbjct: 912 ALNGGKL--PENGTELVEKLAKEKSLREKNKDDWKAKEEV 949



 Score = 35.5 bits (82), Expect = 0.21
 Identities = 50/310 (16%), Positives = 99/310 (31%), Gaps = 19/310 (6%)

Query: 102 FKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIR 161
            KLY NE +    E          E IE   EK E++L E  KEQ      L    + +R
Sbjct: 660 LKLYSNEGEEDLKEALE-------EAIEELVEKLEDVLNE-FKEQSEDIGYLEDAGELLR 711

Query: 162 EMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRK 221
           +   E+  K     +  + ++ +  +   A  +L ++ +      ++I  LE  L    +
Sbjct: 712 DFAPELEIKVE---RGNKILAKLFGRFLKAFNALKKLYEFAYTLEEEIQYLEDLLKARLE 768

Query: 222 RKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQAN 281
           +  + +   +     +    +D E   K   L      +  I K+  +    L  ++  N
Sbjct: 769 KDYQSKYLDLIDNLNDAREIKDKES--KAIDLDDIDFELELIGKQEINIDYILELLQTFN 826

Query: 282 EAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQ 341
           + +    +  E     + +   E+       + +     +       K   T    + L 
Sbjct: 827 DKNGAYESLKE----LIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELY 882

Query: 342 QL--DTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASL 399
            L       +Q  +++ L+ E             K  E           E  +R+     
Sbjct: 883 ILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELVEKLAKEKSLREKNKDD 942

Query: 400 KDNKKLKEEL 409
              K+  E  
Sbjct: 943 WKAKEEVEAK 952


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 39.6 bits (92), Expect = 0.010
 Identities = 71/432 (16%), Positives = 169/432 (39%), Gaps = 44/432 (10%)

Query: 43  DYERLKAEMIQAEEETN--MSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQ 100
           + E LK E+ + + +    M+ +K       ++E  + KDEA +    +E++   + E+Q
Sbjct: 11  ENELLKKELERKQSKLGSSMNSIKTFWSPELKRERALRKDEAARITVWKEQLRVTQEENQ 70

Query: 101 LFKL--------YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRE 152
             +L          ++ ++  L+ +L+K   E  ++ ++ EK+E      + E+     E
Sbjct: 71  HAQLTVQALQEELKSQRELNRLQQKLEKSGNETGEL-QQLEKSEGQFSRIQAERERQAME 129

Query: 153 LAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADL 212
           L  +   + ++ + + + + +L    E +  + + L S   S     + +E     I + 
Sbjct: 130 LFLLRDTLEKLQLRMERLQQTLDARDEEIKKLLEMLQSKGLSAKAEEEDSER-LGRIREA 188

Query: 213 ETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKK 272
           E Q + +     + E++ +  R+   +  Q    + K  +  K  +       K+   +K
Sbjct: 189 EDQQSHLEVLLEQKEKEHMMLREEIHQKLQMERDDAKTEASQKLIDE---KDTKIKEFEK 245

Query: 273 SLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEA 332
            L +            A+ E      R   +E +R        N    ++    +  A  
Sbjct: 246 MLEK------------AENEIYRLQSRCDTSEADR--------NRLDKEVEAERSALAAM 285

Query: 333 TKRAGKILQQLDTINREQKGDQDKL------DNELRQQVQ-TQNEIKKKRHEMEEAQKRI 385
             +  +  Q+L     E  G Q +L      D+++RQ +   + ++ +   E    Q  +
Sbjct: 286 KAKCDRAAQELSRKKTELLGLQTELETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEV 345

Query: 386 DKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELG--DAKTDKHE 443
           D L   + +   +L       +    +  +    +++++   E+   +L     K +  +
Sbjct: 346 DALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQ 405

Query: 444 DTRRKKKQELVE 455
           +T R+K++ L E
Sbjct: 406 ETFRRKERRLKE 417



 Score = 35.8 bits (82), Expect = 0.16
 Identities = 74/376 (19%), Positives = 141/376 (37%), Gaps = 33/376 (8%)

Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEI 167
           + D+   E E    + EV+ +    E+  N L +K     A   E A    EI +M    
Sbjct: 328 KEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRY 387

Query: 168 NKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYE 227
            K    L   ++++ ++Q+     ++ L E ++   +   D    +T L  + K  AE E
Sbjct: 388 EKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQTDTNT-DTALEKLEKALAEKE 446

Query: 228 R--QSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQ------ 279
           R  + +  +    E  +  E    +      KE V ++Q KL+  +  L  +++      
Sbjct: 447 RIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKLA 506

Query: 280 ---------------ANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTE 324
                            E   +    LE +L  +R      +R S      +   A    
Sbjct: 507 SNQLKQRSDLERAHIELEKIREKHEKLEKELKRLRANPESADRGSAVDAGTSRSRADSAG 566

Query: 325 YTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKR 384
             N       R  K  Q+ D    E      +L+   R   + Q + +  R E E  +K 
Sbjct: 567 ARNEVDRLLDRLEKAEQERDDTEMEAGRLAKELEKAQRHLTKQQEKTEATRIEFE--RKS 624

Query: 385 IDKLEDHIRQN-----EASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGD--A 437
            + LE+  R       E +L+ + ++     +     ++ VQ+L+ +L+Q+  E     A
Sbjct: 625 AELLEEAERLEKSEAEEETLRQSTQIGHAQAAAHNHIEHHVQKLESDLKQLRAEREQLVA 684

Query: 438 KTDKHEDTRRKKKQEL 453
           + +K + +  + +QEL
Sbjct: 685 QLEKSQQSLMEFQQEL 700


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 39.1 bits (92), Expect = 0.013
 Identities = 34/123 (27%), Positives = 62/123 (50%), Gaps = 21/123 (17%)

Query: 66  KGVVAE-RKEAKIEKDEA-----EKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELD 119
           K ++ E +KEA+  K EA     E+  ++R E+  +E++ +  +L   E  + + E+ LD
Sbjct: 35  KRIIEEAKKEAETLKKEALLEAKEEVHKLRAEL-ERELKERRNELQRLERRLLQREETLD 93

Query: 120 KKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKE 179
           +K   ++K E   EK E  L  K+K           +D++  E++  I ++R  L    E
Sbjct: 94  RKMESLDKKEENLEKKEKELSNKEKN----------LDEKEEELEELIAEQREEL----E 139

Query: 180 RVS 182
           R+S
Sbjct: 140 RIS 142



 Score = 34.5 bits (80), Expect = 0.30
 Identities = 29/136 (21%), Positives = 59/136 (43%), Gaps = 14/136 (10%)

Query: 322 MTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEME-- 379
           +  Y   K  A K+ G   +    I  E K + + L  E    ++ + E+ K R E+E  
Sbjct: 13  IIGYLLRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEA--LLEAKEEVHKLRAELERE 70

Query: 380 --EAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDA 437
             E +  + +LE  + Q E +L    +  ++   ++   +  +   +K L++  EEL   
Sbjct: 71  LKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEEL--- 127

Query: 438 KTDKHEDTRRKKKQEL 453
                E+   ++++EL
Sbjct: 128 -----EELIAEQREEL 138



 Score = 32.2 bits (74), Expect = 1.9
 Identities = 26/123 (21%), Positives = 60/123 (48%), Gaps = 13/123 (10%)

Query: 121 KKGEVEKIERRKEKAENILREKKKEQGALNREL------------AKVDQEIREMDVEIN 168
           K+   +K+   +E A+ I+ E KKE   L +E             A++++E++E   E+ 
Sbjct: 20  KRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQ 79

Query: 169 KKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD-VRKRKAEYE 227
           +    L++ +E +    + L   +++L +  +      K++ + E +L + + +++ E E
Sbjct: 80  RLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELE 139

Query: 228 RQS 230
           R S
Sbjct: 140 RIS 142


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 39.3 bits (92), Expect = 0.014
 Identities = 21/146 (14%), Positives = 51/146 (34%), Gaps = 6/146 (4%)

Query: 312 GRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQ----KGDQDKLDNELRQQVQT 367
              I + +  +++  N       R  +++ Q+     E+    +  ++++     +    
Sbjct: 781 AEKILIITEILSDGINNSD-INDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELI 839

Query: 368 QNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKEL 427
             E K  + E++     I + E+ I+    +  +  + K +    +         L K +
Sbjct: 840 NEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKND-PKKLNKLIIAKDVLIKLV 898

Query: 428 EQVIEELGDAKTDKHEDTRRKKKQEL 453
               E   D KT K +    +K  E 
Sbjct: 899 ISSDEIKQDEKTTKKKKKDLEKTDEE 924


>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5.  Members of this
           family of proteins are involved in maintaining Golgi
           structure. They stimulate the formation of Golgi stacks
           and ribbons, and are involved in intra-Golgi retrograde
           transport. Two main interactions have been
           characterized: one with RAB1A that has been activated by
           GTP-binding and another with isoform CASP of CUTL1.
          Length = 509

 Score = 38.3 bits (89), Expect = 0.021
 Identities = 57/280 (20%), Positives = 103/280 (36%), Gaps = 35/280 (12%)

Query: 169 KKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYER 228
            +  S    K+  S     L+S    L     +     +  +DLE +LA ++ R  E   
Sbjct: 70  SRVSSPSSKKDGTSR---SLSSQVDDLASAVSS-----QSSSDLEDELAALKIRLQEA-A 120

Query: 229 QSIPGRDINLESAQ-----DVEINKKRPSLIKSKERVSHIQKKLASAKKSLV----EVRQ 279
           Q +      LE  +     D+   +    L +   R   +Q KL     ++      V  
Sbjct: 121 QELRELKSELEDLRSERSRDLSDEESIKRLQRGAVRS--LQDKLLLLDAAVKRHDGNVIT 178

Query: 280 ANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKI 339
           A     K    LE    ++ +R+    R          E  Q+ +    + E+ K+    
Sbjct: 179 AVLIFLKRTLKLEILFQELAERQTA-LRHLSKHLQEEGELQQLLKLLRAEGESEKQE--- 234

Query: 340 LQQLDTINREQKGDQDKLDNELRQQV--------QTQNEIKKKRHEMEEAQKRIDKLEDH 391
           LQQ           ++K  N L++            Q E+++ +HE E  Q+ I KLE  
Sbjct: 235 LQQYRQKAHRILQSKEKRINFLKEGCLFEGLDSSTAQIELEELKHESEHVQEEITKLEGQ 294

Query: 392 IRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVI 431
           I Q  +  +D    + E + +  S + + +EL +++   I
Sbjct: 295 IIQLRSEAQD---REAEASGEAESFRKQPRELSQQIAPQI 331


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 38.5 bits (90), Expect = 0.022
 Identities = 42/205 (20%), Positives = 73/205 (35%), Gaps = 38/205 (18%)

Query: 76  KIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKA 135
           K   D  E+   +  EI  KEVE  L  L     +++E  + LD++K  +++      + 
Sbjct: 67  KEFLDLEEEILDLEAEI--KEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSEL 124

Query: 136 ENILREKKKEQGA-------------------LNRELAKV-DQEIREMDVEI----NKKR 171
            N+  + K  +GA                     REL +     IR+ ++E      KK 
Sbjct: 125 SNLDIDFKYLRGAEGLKLGFVAGVINREKLEAFERELWRACRGYIRQAEIEEPLEDPKKT 184

Query: 172 PSLI--KSKERVSHIQKKLASAKKSLVEV-------RQANEAHNKDIADLETQLADVRKR 222
             +I    KE +  ++K L S    L +V        +     NK I +L+  L      
Sbjct: 185 VFIIFFVGKEDLDKVKKILDSFGFELYDVPETEGERSELISKVNKRIEELQRVLEQTESH 244

Query: 223 KAEY---ERQSIPGRDINLESAQDV 244
             +        +   D  +   + V
Sbjct: 245 LEKVLVKIADELLAWDEQVSKEKAV 269



 Score = 34.7 bits (80), Expect = 0.32
 Identities = 48/265 (18%), Positives = 99/265 (37%), Gaps = 43/265 (16%)

Query: 192 KKSLVEVRQANEAHNKDIADLE-TQLADVRKRKAEYERQ----SIPGRDINLESAQDVEI 246
           +  LV++R  NE    D++  +   + +VR R  E ER+        + + +        
Sbjct: 6   ELGLVQIRDLNE----DVSAFQRKFVNEVR-RCDEVERKLRKLESKIKKLGIPLKDTGGK 60

Query: 247 NKKRPSLI--KSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 304
               PS      +E +  ++ ++   +++L  +        K+I +LE  L  + + K+ 
Sbjct: 61  PDVPPSKEFLDLEEEILDLEAEIKEVEENLESLE-------KEINELEEWLNVLDEEKSF 113

Query: 305 YER-----QSIPGRDINLESAQMTEYTNLKAEA----TKRAGKILQQLDTINREQKGDQD 355
            +        +   DI+ +  +  E   L   A     ++     ++L    R     Q 
Sbjct: 114 LDENLEELSELSNLDIDFKYLRGAEGLKLGFVAGVINREKLEAFERELWRACRGYIR-QA 172

Query: 356 KLDNELRQQVQTQNEIKKKRH----EMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
           +++  L        + KK         +E   ++ K+ D        + + +  + EL S
Sbjct: 173 EIEEPL-------EDPKKTVFIIFFVGKEDLDKVKKILDSFGFELYDVPETEGERSELIS 225

Query: 412 DVGSSKNRVQELQKELEQVIEELGD 436
            V     R++ELQ+ LEQ    L  
Sbjct: 226 KV---NKRIEELQRVLEQTESHLEK 247



 Score = 29.7 bits (67), Expect = 9.7
 Identities = 23/138 (16%), Positives = 53/138 (38%), Gaps = 18/138 (13%)

Query: 50  EMIQAEEETNMSYLKKKGVVAERKEAKIEK------DEAEKYQRIREEI--VAKEVEHQL 101
           + ++  E   + +    GV+   K    E+          +   I E +    K V    
Sbjct: 132 KYLRGAEGLKLGF--VAGVINREKLEAFERELWRACRGYIRQAEIEEPLEDPKKTVFIIF 189

Query: 102 FKLYHNETDIKELEDELDKKKGEV--------EKIERRKEKAENILREKKKEQGALNREL 153
           F    +   +K++ D    +  +V        E I +  ++ E + R  ++ +  L + L
Sbjct: 190 FVGKEDLDKVKKILDSFGFELYDVPETEGERSELISKVNKRIEELQRVLEQTESHLEKVL 249

Query: 154 AKVDQEIREMDVEINKKR 171
            K+  E+   D +++K++
Sbjct: 250 VKIADELLAWDEQVSKEK 267


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 37.3 bits (87), Expect = 0.022
 Identities = 26/130 (20%), Positives = 57/130 (43%), Gaps = 10/130 (7%)

Query: 341 QQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLK 400
            ++  I    K  ++KL       V+ + E++    E++ A++RI+ LED + +    ++
Sbjct: 66  SEIQEIRERIKRAEEKLSA-----VKDERELRALNIEIQIAKERINSLEDELAELMEEIE 120

Query: 401 DNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKA 460
             +K  E+L   +   +  + E +  LE+ + E+      +       K++EL E     
Sbjct: 121 KLEKEIEDLKERLERLEKNLAEAEARLEEEVAEI-----REEGQELSSKREELKEKLDPE 175

Query: 461 YSGVYDRMIN 470
               Y+R+  
Sbjct: 176 LLSEYERIRK 185



 Score = 35.8 bits (83), Expect = 0.070
 Identities = 23/108 (21%), Positives = 59/108 (54%), Gaps = 2/108 (1%)

Query: 124 EVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSH 183
           E +++E R ++    L++ K E  ALN+ L  ++ E+ +++ ++++    + + +ER+  
Sbjct: 18  EKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKR 77

Query: 184 IQKKLASAKKS--LVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 229
            ++KL++ K    L  +    +   + I  LE +LA++ +   + E++
Sbjct: 78  AEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKE 125



 Score = 34.6 bits (80), Expect = 0.18
 Identities = 40/208 (19%), Positives = 94/208 (45%), Gaps = 35/208 (16%)

Query: 243 DVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRK 302
           D+E ++  P + + ++ +   + +L +  K+L  +    E     ++ LE+++ ++R+R 
Sbjct: 16  DLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERI 75

Query: 303 AEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELR 362
                                     +AE    A K  ++L  +N E +  ++++++   
Sbjct: 76  K-------------------------RAEEKLSAVKDERELRALNIEIQIAKERINSLED 110

Query: 363 QQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQE 422
           +  +   EI+K   E+E+ ++R+++LE ++ + EA L++      E            QE
Sbjct: 111 ELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEIRE----------EGQE 160

Query: 423 LQKELEQVIEELGDAKTDKHEDTRRKKK 450
           L  + E++ E+L      ++E  R+ KK
Sbjct: 161 LSSKREELKEKLDPELLSEYERIRKNKK 188



 Score = 32.7 bits (75), Expect = 0.89
 Identities = 38/205 (18%), Positives = 74/205 (36%), Gaps = 29/205 (14%)

Query: 679 QIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIG 738
            I KL+ E D L  R       +K  +A + A    +   + E+  +E+ V +       
Sbjct: 11  AIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQL------ 64

Query: 739 VSTIRQYEEAELRSQQERQKICQDKDTKKNVAR--------WERAVSDDEEELARAQGAE 790
                + E  E+R + +R +        +   R         +  ++  E+ELA      
Sbjct: 65  -----ESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEI 119

Query: 791 EKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLE 850
           EKL  E+    ++LE +       +  ++           EV  I ++ Q        L+
Sbjct: 120 EKLEKEIEDLKERLERLEKNLAEAEARLEE----------EVAEIREEGQELSSKREELK 169

Query: 851 SKLEMKKSERHDILMNCKMNDIVLP 875
            KL+ +    ++ +   K    V+P
Sbjct: 170 EKLDPELLSEYERIRKNKKGVGVVP 194



 Score = 30.8 bits (70), Expect = 3.2
 Identities = 30/127 (23%), Positives = 54/127 (42%), Gaps = 10/127 (7%)

Query: 611 LDLARKAKRWDDKEMGNLKAQKEKLSEEL----------REAMKKSRKESELNTVQSTIK 660
           L+   KA    + E+ +L+ Q  +L  E+           E +   + E EL  +   I+
Sbjct: 40  LEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQ 99

Query: 661 GLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKE 720
             + R+N    +L     +I KLE EI+ L  R +  E  +   EA +      I  + +
Sbjct: 100 IAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEIREEGQ 159

Query: 721 EMNSVED 727
           E++S  +
Sbjct: 160 ELSSKRE 166


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 36.8 bits (85), Expect = 0.039
 Identities = 27/104 (25%), Positives = 47/104 (45%)

Query: 110 DIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINK 169
           +I+   DEL+K K E++  E   E  E  L+E  KE   L  E  +  +E+R++  +IN 
Sbjct: 5   EIRNKTDELEKLKDEIKAEEEALESEEKHLKEYDKEMEELLEEKMQHVEELRQIHADIND 64

Query: 170 KRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLE 213
               + +SK  +     K+A        ++++     K    LE
Sbjct: 65  METEIKQSKSELERRMGKIARMHGEYNPLKESINEMRKLELGLE 108


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 37.6 bits (87), Expect = 0.040
 Identities = 64/453 (14%), Positives = 155/453 (34%), Gaps = 54/453 (11%)

Query: 48  KAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRI---REEIVAKEVEH----- 99
           +   +   E      L  K        +++              R++++ KE++H     
Sbjct: 177 QYTQLALMEFAKKKSLHGK-AELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREAL 235

Query: 100 -QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRE------ 152
            Q  + +   T  +E ++E  KK+  ++++  R E+        ++ Q  +NR       
Sbjct: 236 QQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAAPL 295

Query: 153 ------LAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLAS--AKKSLVEVRQANEA 204
                 + +++Q+ + +  E+  K  S  K   + +   K+ +S   ++ L++   + E 
Sbjct: 296 AAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEI 355

Query: 205 HNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKE------ 258
           H +D  ++ T + ++  ++     Q I        +      +  +   I  +E      
Sbjct: 356 HIRDAHEVATSIREISCQQHT-LTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDT 414

Query: 259 ---RVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDI 315
                  +Q +LA AKK   E++Q              Q   + K   +   QS+  R+ 
Sbjct: 415 RTSAFRDLQGQLAHAKKQ-QELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQ 473

Query: 316 NLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKG-----------------DQDKLD 358
            L      +    +          L +L        G                    ++ 
Sbjct: 474 QL--QTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQ 531

Query: 359 NELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKN 418
              +   Q +   +   H++   +K+   L++ +++ + S     +       D+ + +N
Sbjct: 532 RGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQN 591

Query: 419 RVQELQKELEQVIEELGDAKTDKHEDTRRKKKQ 451
               LQ   E++ E       ++H   R+ + +
Sbjct: 592 ITVRLQDLTEKLSEAEDMLACEQHALLRKLQPE 624



 Score = 31.5 bits (71), Expect = 3.5
 Identities = 53/402 (13%), Positives = 126/402 (31%), Gaps = 28/402 (6%)

Query: 46  RLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLY 105
             +A+ I  E ++ M    K       K A   K ++   ++ R        E  +   +
Sbjct: 306 EQQAQRIHTELQSKMRSRAK----LLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAH 361

Query: 106 HNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDV 165
              T I+E+  +       +  ++++K      L+   KE   L RE A +D        
Sbjct: 362 EVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRD 421

Query: 166 EINKK-------RPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD 218
              +              ++   + I       K   + ++++ ++      + E QL  
Sbjct: 422 LQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLK----EREQQLQT 477

Query: 219 VRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVR 278
             +   +  R               +   ++ P  +       +  ++       L    
Sbjct: 478 KEQIHLQETR-------KKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRM 530

Query: 279 QANE----AHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATK 334
           Q  E           D+  QL   RK++A  + Q    +       Q    +       +
Sbjct: 531 QRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQ 590

Query: 335 RAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQ 394
                LQ L     E +       + L +++Q + +++  R  +++  + +      +  
Sbjct: 591 NITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHA 650

Query: 395 NEASLKDNKKLKEELNSDV--GSSKNRVQELQKELEQVIEEL 434
            + +L   +  +  L+  V         Q   ++++   E+L
Sbjct: 651 LQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQL 692


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 37.7 bits (87), Expect = 0.040
 Identities = 66/357 (18%), Positives = 148/357 (41%), Gaps = 50/357 (14%)

Query: 136 ENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKK---LASAK 192
           +N L   ++E+  L ++   ++ ++ +M   I +   +L  S E+   ++++     S+ 
Sbjct: 142 QNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSS 201

Query: 193 KSLVEVRQANEA----HNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQD----- 243
           +S+   R A  A    + + I +LE  +  + ++K E +R     +DI  E  +      
Sbjct: 202 ESMSAERNALLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELK 261

Query: 244 -------VEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA 296
                  ++   +    I  +     + + L + ++ L   +Q      +++A     L 
Sbjct: 262 QRLKKMTIQRRDEETERIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELA----SLG 317

Query: 297 DVRKRKAEYERQSIPGRDINLESAQMT-EYTNLKAEATKRAGKILQQLDTINREQKGDQD 355
            +R        QS       LESAQM+ + + L     +   +  Q+ +T+ +  + D+D
Sbjct: 318 SLRDHTIAELHQS------RLESAQMSLQLSQLNLALKEGQSQWAQERETLRQSAEADKD 371

Query: 356 KLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGS 415
           ++     + ++ +  ++++R + E+ +  +   +D    N   L +N++   EL S +  
Sbjct: 372 RIQKLSAELLKLEEWLQEERSQREKLEVELGIEKD---CNRVQLSENRRELSELRSAL-- 426

Query: 416 SKNRVQELQKELEQVIEELG---------DAKTDKHEDTRRKKKQELVENFKKAYSG 463
                + LQKE EQ+ EE           + + DK  D +  +     E   K+Y  
Sbjct: 427 -----RVLQKEKEQLQEEKQELLDYIRVLELRLDKEADEKWAEDAATCE-DAKSYLH 477



 Score = 35.0 bits (80), Expect = 0.22
 Identities = 41/249 (16%), Positives = 107/249 (42%), Gaps = 22/249 (8%)

Query: 46  RLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRI--REEIVAKEVEHQLFK 103
            L  E   ++  + +  L+     +  K  ++E    E+++      E ++ E    L +
Sbjct: 159 FLALEGDVSDMRSRIEQLETALRHSTEKTEELE----EQHKEAQSSSESMSAERNALLAQ 214

Query: 104 LYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREM 163
              N+  I ELE        +++ + ++K++ + +L   +  +  L R   ++ Q +++M
Sbjct: 215 RAENQQRILELEQ-------DIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKM 267

Query: 164 DVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLET----QLADV 219
            ++   +    I  +     + + L + ++ L   +Q      +++A L +     +A++
Sbjct: 268 TIQRRDEETERIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAEL 327

Query: 220 RKRKAEYERQSIPGRDINLESAQDV-EINKKRPSLIKS----KERVSHIQKKLASAKKSL 274
            + + E  + S+    +NL   +   +  ++R +L +S    K+R+  +  +L   ++ L
Sbjct: 328 HQSRLESAQMSLQLSQLNLALKEGQSQWAQERETLRQSAEADKDRIQKLSAELLKLEEWL 387

Query: 275 VEVRQANEA 283
            E R   E 
Sbjct: 388 QEERSQREK 396



 Score = 33.8 bits (77), Expect = 0.54
 Identities = 39/186 (20%), Positives = 82/186 (44%), Gaps = 21/186 (11%)

Query: 263 IQKKLASAKKSLVEVRQANEAHNKDIAD-------LETQLADVRKRKAEYERQSIPGRDI 315
           +Q +L   ++   ++ +   A   D++D       LET L    ++  E E Q    ++ 
Sbjct: 141 LQNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQH---KEA 197

Query: 316 NLESAQMTEYTNL----KAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEI 371
              S  M+   N     +AE  +R  ++ Q + T+  ++K + D++     Q ++ + E 
Sbjct: 198 QSSSESMSAERNALLAQRAENQQRILELEQDIQTLT-QKKQENDRVLEGT-QDIEAELER 255

Query: 372 KKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVI 431
            K      E ++R+ K+    R  E    D +   E+L+ D+ + + R++  Q++   + 
Sbjct: 256 MKG-----ELKQRLKKMTIQRRDEETERIDLQLENEQLHEDLRTLQERLESSQQKAGLLG 310

Query: 432 EELGDA 437
           EEL   
Sbjct: 311 EELASL 316


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 37.4 bits (87), Expect = 0.044
 Identities = 33/174 (18%), Positives = 72/174 (41%), Gaps = 24/174 (13%)

Query: 74  EAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELE--DELDKKKGE------- 124
           + K E+D+ ++     E+ + KE++    KL   E +++ELE   E  ++KGE       
Sbjct: 272 DEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQ 331

Query: 125 -VEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSH 183
            +E+  +    A+    E+ K +   ++  ++  Q   +   ++   + +L +    +  
Sbjct: 332 LIEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKE 391

Query: 184 IQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD---------VRKRKAEYER 228
                 SAK +L           K I ++  +L +          RK+K  +E+
Sbjct: 392 AIAYYESAKTAL-----EKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 37.7 bits (87), Expect = 0.045
 Identities = 78/420 (18%), Positives = 192/420 (45%), Gaps = 43/420 (10%)

Query: 110  DIKELEDELDKKKGEVEKIERRKEKAENILR------EKKKEQGALNRELAKVDQEIREM 163
            DIK L+ ++D     +E+I+++ E   + ++      E   ++   N +  +++++I  +
Sbjct: 1119 DIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKAISNDDPEEIEKKIENI 1178

Query: 164  DVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRK 223
              +I+KK+    + K+ ++ I  ++   K SL EV+  N ++ K++  L  +  D  K+K
Sbjct: 1179 VTKIDKKKNIYDEIKKLLNEIA-EIEKDKTSLEEVKGINLSYGKNLGKLFLEKIDEEKKK 1237

Query: 224  AEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQ---A 280
            +E+  +++     +L+     EI +K P +         I+ ++ +   S  + +     
Sbjct: 1238 SEHMIKAMEAYIEDLD-----EIKEKSPEIENEMGIEMDIKAEMETFNISHDDDKDHHII 1292

Query: 281  NEAHNKDIADL-------------ETQLADVRKRKAEYERQSIP-GRDINLESAQMTE-Y 325
            ++ H+++I+D+             E+ + D++K   +    +     DINL   ++   Y
Sbjct: 1293 SKKHDENISDIREKSLKIIEDFSEESDINDIKKELQKNLLDAQKHNSDINLYLNEIANIY 1352

Query: 326  TNLKAEATKR-AGKILQQLDTINREQKGDQDKLDN--ELRQQVQTQNEIKKKRHEME--- 379
              LK    K+   ++ +    I    K  +D+LD   +L ++++    +++ + ++E   
Sbjct: 1353 NILKLNKIKKIIDEVKEYTKEIEENNKNIKDELDKSEKLIKKIKDDINLEECKSKIESTL 1412

Query: 380  ------EAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEE 433
                  E  K+I +L++HI   E+++    K  +E N +V      ++    + + +++ 
Sbjct: 1413 DDKDIDECIKKIKELKNHILSEESNIDTYFKNADENNENVLLLFKNIEMADNKSQHILKI 1472

Query: 434  LGDAKTDKHEDTRRKKKQELVENFK-KAYSGVYDRMINMCHPVHKRYNVAITKVLGKYME 492
              D  T+ H+    + K+ + ++   K  +    + I     + ++Y   +T++L KY  
Sbjct: 1473 KKDNATNDHDFNINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVTELLNKYSA 1532


>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
           This model describes a conserved domain found in surface
           proteins of a number of Firmutes. Many members have
           LPXTG C-terminal anchoring motifs and a substantial
           number have the KxYKxGKxW putative sorting signal at the
           N-terminus. The tetracycline resistance plasmid pCF10 in
           Enterococcus faecalis promotes conjugal plasmid transfer
           in response to sex pheromones, but PgrA/Sec10 encoded by
           that plasmid, a member of this family, specifically
           inhibits the ability of cells to receive homologous
           plasmids. The phenomenon is called surface exclusion.
          Length = 356

 Score = 36.6 bits (85), Expect = 0.063
 Identities = 22/113 (19%), Positives = 47/113 (41%), Gaps = 15/113 (13%)

Query: 184 IQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQD 243
           +Q  LA+A+  L   + A       +   +T  A  +   A  +++        L +AQ 
Sbjct: 259 LQAALATAQADLAAAQTALNTAQAALTTAQTAYAAAQAALATAQKE--------LANAQA 310

Query: 244 VEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA 296
             +   + +L  ++  +++ + +LA AK++L          N D+A  +  L 
Sbjct: 311 QALQTAQNNLATAQAALANAEARLAKAKEAL-------ANLNADLAKKQAALD 356



 Score = 30.8 bits (70), Expect = 3.4
 Identities = 23/107 (21%), Positives = 44/107 (41%), Gaps = 12/107 (11%)

Query: 252 SLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIP 311
           +L  ++  ++  Q  L +A+ +L   + A  A    +A  + +LA+ + +  +  +    
Sbjct: 262 ALATAQADLAAAQTALNTAQAALTTAQTAYAAAQAALATAQKELANAQAQALQTAQN--- 318

Query: 312 GRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLD 358
               NL +AQ        A A  R  K  + L  +N +    Q  LD
Sbjct: 319 ----NLATAQ-----AALANAEARLAKAKEALANLNADLAKKQAALD 356


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 36.5 bits (84), Expect = 0.067
 Identities = 39/254 (15%), Positives = 87/254 (34%), Gaps = 34/254 (13%)

Query: 157 DQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQL 216
           ++ +R    +++  RP L   +  +   Q +  +A+      R   +   ++   +  +L
Sbjct: 66  NRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQEL 125

Query: 217 ADVRKRKA----EYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKK 272
           A  R+  A    E  R +   +D+        E   +R  L    + +   QK+L ++  
Sbjct: 126 AAARQNLAKAQQELARLTKQAQDLQTRLKTLAE---QRRQLEAQAQSLQASQKQLQASAT 182

Query: 273 SLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEA 332
            L       +  +  I            +       +   R          E     A A
Sbjct: 183 QLKSQVLDLKLRSAQIEQ--------EAQNLATRANAAQART--------EELARRAAAA 226

Query: 333 TKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHI 392
            + A  I QQ D           ++  + +Q      +I+++  +++  +    +LE  +
Sbjct: 227 QQTAQAI-QQRDA----------QISQKAQQIAARAEQIRERERQLQRLETAQARLEQEV 275

Query: 393 RQNEASLKDNKKLK 406
            Q EA  +   +L+
Sbjct: 276 AQLEAYYQAYVRLR 289



 Score = 35.8 bits (82), Expect = 0.13
 Identities = 31/178 (17%), Positives = 65/178 (36%), Gaps = 23/178 (12%)

Query: 280 ANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKI 339
            N      +  L+     +R  + E            L +AQ       K  A       
Sbjct: 65  LNRNLRSGVFQLDDIRPQLRALRTE------------LGTAQGE-----KRAAETEREAA 107

Query: 340 LQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASL 399
             +L    +E++  + +L    +   + Q E+ +   + ++ Q R+  L +  RQ EA  
Sbjct: 108 RSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQA 167

Query: 400 KDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENF 457
           +  +  +++L +     K++V +L+    Q+ +E       ++  TR    Q   E  
Sbjct: 168 QSLQASQKQLQASATQLKSQVLDLKLRSAQIEQE------AQNLATRANAAQARTEEL 219



 Score = 35.4 bits (81), Expect = 0.16
 Identities = 39/224 (17%), Positives = 87/224 (38%), Gaps = 21/224 (9%)

Query: 630 AQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDA 689
            Q + +  +LR         +EL T Q   +  E     +R +LQ  + +   +  E+ A
Sbjct: 74  FQLDDIRPQLRA------LRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAA 127

Query: 690 LN---ARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYE 746
                A+A     ++      +  R  T++ ++ ++ +    +     +    +T  + +
Sbjct: 128 ARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQ 187

Query: 747 EAELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLEN 806
             +L+ +  +  I Q+    +N+A    A     EELAR   A ++ A  ++    ++  
Sbjct: 188 VLDLKLRSAQ--IEQEA---QNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQ 242

Query: 807 MRATRLTKKQAVDAMDEEIGKARREVGSIAKDIQAAQKSCVNLE 850
                  K Q + A  E+I +  R++  +       ++    LE
Sbjct: 243 -------KAQQIAARAEQIRERERQLQRLETAQARLEQEVAQLE 279



 Score = 33.1 bits (75), Expect = 0.90
 Identities = 34/224 (15%), Positives = 77/224 (34%), Gaps = 34/224 (15%)

Query: 39  ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEA---------------E 83
           AL+ +    + E   AE E   +  + +    ER+  + E   A               +
Sbjct: 85  ALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTK 144

Query: 84  KYQRIREEIVA-----KEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENI 138
           + Q ++  +       +++E Q   L  ++  ++    +L  +  +++    + E+    
Sbjct: 145 QAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQN 204

Query: 139 LREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEV 198
           L  +     A   ELA+     ++    I  +R + I         QK    A ++    
Sbjct: 205 LATRANAAQARTEELARRAAAAQQTAQAIQ-QRDAQIS--------QKAQQIAARAEQIR 255

Query: 199 RQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQ 242
            +  +        LET  A + +  A+ E        +  ++A 
Sbjct: 256 ERERQLQ-----RLETAQARLEQEVAQLEAYYQAYVRLRQQAAA 294



 Score = 33.1 bits (75), Expect = 1.0
 Identities = 32/225 (14%), Positives = 86/225 (38%), Gaps = 22/225 (9%)

Query: 245 EINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 304
           +++  RP L   +  +   Q +  +A+      R   +   ++   +  +LA  R+  A+
Sbjct: 75  QLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAK 134

Query: 305 YERQSIPGRDINLESAQMTEY-----TNLKAEATKRAGKILQQLDTINREQK---GDQDK 356
            ++          E A++T+      T LK  A +R  ++  Q  ++   QK       +
Sbjct: 135 AQQ----------ELARLTKQAQDLQTRLKTLAEQRR-QLEAQAQSLQASQKQLQASATQ 183

Query: 357 LDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSS 416
           L +++        +I+++   +           + + +  A+    ++  + +       
Sbjct: 184 LKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAA---QQTAQAIQQRDAQI 240

Query: 417 KNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAY 461
             + Q++    EQ+ E     +  +    R +++   +E + +AY
Sbjct: 241 SQKAQQIAARAEQIRERERQLQRLETAQARLEQEVAQLEAYYQAY 285



 Score = 30.8 bits (69), Expect = 4.9
 Identities = 14/157 (8%), Positives = 64/157 (40%), Gaps = 1/157 (0%)

Query: 37  SGALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKE 96
             A + +  + + E+ +  ++      + K +  +R++ + +   + +  + + +  A +
Sbjct: 125 LAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQ-SLQASQKQLQASATQ 183

Query: 97  VEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKV 156
           ++ Q+  L      I++    L  +    +       +     ++  +     + ++++ 
Sbjct: 184 LKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQK 243

Query: 157 DQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKK 193
            Q+I     +I ++   L + +   + +++++A  + 
Sbjct: 244 AQQIAARAEQIRERERQLQRLETAQARLEQEVAQLEA 280



 Score = 30.0 bits (67), Expect = 7.9
 Identities = 24/200 (12%), Positives = 71/200 (35%), Gaps = 31/200 (15%)

Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEI 167
           + + +  E E +  + E++K  + +E     L   ++      +ELA++ ++ +++   +
Sbjct: 94  QGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQTRL 153

Query: 168 NK---KRPSLIKSKE---------------------RVSHIQKKLASAKKSLVEVRQANE 203
                +R  L    +                      +     ++    ++L     A +
Sbjct: 154 KTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQ 213

Query: 204 AHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKE----R 259
           A  +++A         ++     +++         + A   E  ++R   ++  E    R
Sbjct: 214 ARTEELARRAAAA---QQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQAR 270

Query: 260 VSHIQKKLASAKKSLVEVRQ 279
           +     +L +  ++ V +RQ
Sbjct: 271 LEQEVAQLEAYYQAYVRLRQ 290


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 35.5 bits (82), Expect = 0.072
 Identities = 19/103 (18%), Positives = 55/103 (53%), Gaps = 2/103 (1%)

Query: 360 ELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNR 419
             ++  + Q E+ + + ++ E Q+ ++ LE+ I + E+ L+D K+  + L   + S + R
Sbjct: 70  RPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEER 129

Query: 420 VQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAYS 462
           ++ L++ ++++ +EL +    +      ++ +E +E  ++   
Sbjct: 130 LESLEESIKELAKELRE--LRQDLREEVEELREELERLQENLQ 170


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 36.5 bits (85), Expect = 0.074
 Identities = 32/207 (15%), Positives = 80/207 (38%), Gaps = 25/207 (12%)

Query: 748 AELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENM 807
           AEL SQ  R +          VAR  RA +D +  +             +       +++
Sbjct: 84  AELESQVLRLEA--------EVARL-RAEADSQAAIEFPDDLLSAEDPAVPELIKGQQSL 134

Query: 808 RATRLTKKQA-VDAMDEEIGKARREVGSIAKDIQAAQKSCVNLESKLEMKKSERHDILMN 866
             +R +  +A ++ +  +I +   E+  +   +QA ++    +  +LE ++  +   L+ 
Sbjct: 135 FESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLV- 193

Query: 867 CKMNDIVLPMLRVQKYDRKLAKSIQEMTSRLQTIQAPNLRAMEKLEHAKENLMKTNEEF- 925
                     L + + +R+ A++  E+      ++        +++  +    +  + F 
Sbjct: 194 --------SRLELLELERERAEAQGELGRLEAELEV----LKRQIDELQLERQQIEQTFR 241

Query: 926 ENARKRAKKAKANFDRIKKERYDKFTR 952
           E   +   +A+A    + +ER +K   
Sbjct: 242 EEVLEELTEAQARLAEL-RERLNKARD 267


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 36.8 bits (85), Expect = 0.074
 Identities = 79/383 (20%), Positives = 153/383 (39%), Gaps = 58/383 (15%)

Query: 55  EEETNMSYLKKKGVVA-ERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKE 113
           +E   +     + V+   +K    + D      +  E I A + E Q        T+ K+
Sbjct: 70  DENGQLENTSLRTVMELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQ--------TNSKD 121

Query: 114 LEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPS 173
            E   D +  ++  + +  EK  NIL   +    AL   L K+  E   +  +IN     
Sbjct: 122 GEQLSDFQLEDLVGMIQNAEK--NILLLNQARLQALED-LEKILTEKEALQGKINILEMR 178

Query: 174 LIKSKERVSHIQKKLASAKKSLVEV--RQANEAHNKDIADLETQLADVRKRKAEYERQSI 231
           L ++  R+     KLA+ +K  VE+   Q  +  N+ +    T+   V     E +    
Sbjct: 179 LSETDARI-----KLAAQEKIHVEILEEQLEKLRNELLIRGATEGLCVHSLSKELDVL-- 231

Query: 232 PGRDINLESAQDVEINKKRPSLIK-SKERVSHIQKKLASAKKSLVEVRQANEAHNKDIAD 290
             ++ N+    D++  K     +  ++ERV  ++K+ +    SL E+        +D++ 
Sbjct: 232 --KEENMLLKDDIQFLKAELIEVAETEERVFKLEKERSLLDASLRELESKFIVAQEDVSK 289

Query: 291 LETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQ 350
           L     D    K E           NL+        +L   AT +  K    LD  N++ 
Sbjct: 290 LSPLQYDCWWEKVE-----------NLQ--------DLLDRATNQVEKAALVLDQ-NQDL 329

Query: 351 KGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELN 410
           +   DKL+  L++     N  K   +++E  Q+++  LE+ ++ ++           E++
Sbjct: 330 RDKVDKLEASLKEA----NVSKFSSYKVELLQQKLKLLEERLQASD----------HEIH 375

Query: 411 SDVGSSKNRVQELQKELEQVIEE 433
           S +   +  ++E Q  L ++ EE
Sbjct: 376 SYIQLYQESIKEFQDTLSKLKEE 398


>gnl|CDD|213179 cd00267, ABC_ATPase, ATP-binding cassette transporter
            nucleotide-binding domain.  ABC transporters are a large
            family of proteins involved in the transport of a wide
            variety of different compounds, like sugars, ions,
            peptides, and more complex organic molecules. The
            nucleotide-binding domain shows the highest similarity
            between all members of the family. ABC transporters are a
            subset of nucleotide hydrolases that contain a signature
            motif, Q-loop, and H-loop/switch region, in addition to,
            the Walker A motif/P-loop and Walker B motif commonly
            found in a number of ATP- and GTP-binding and hydrolyzing
            proteins.
          Length = 157

 Score = 34.5 bits (80), Expect = 0.10
 Identities = 10/60 (16%), Positives = 24/60 (40%), Gaps = 1/60 (1%)

Query: 998  HPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHA-DSLVGICPG 1056
                  +LDE  + LD  +  ++   +    ++    I+++   E    A D ++ +  G
Sbjct: 97   LNPDLLLLDEPTSGLDPASRERLLELLRELAEEGRTVIIVTHDPELAELAADRVIVLKDG 156


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 35.7 bits (83), Expect = 0.12
 Identities = 23/123 (18%), Positives = 47/123 (38%), Gaps = 16/123 (13%)

Query: 339 ILQQLDTINREQKGD-QDKLDNELRQQVQT-QNEIKKKRHEMEEAQKRIDKLEDHIRQNE 396
           I   L  I +E++   Q ++    R ++     EI   + E++EAQ  ++       Q E
Sbjct: 110 IDTVLLEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQ-EAQCE 168

Query: 397 ASLKDN----------KKLKEELN---SDVGSSKNRVQELQKELEQVIEELGDAKTDKHE 443
           A               K+ +E+L+   + + + K R+     +LE     L   +     
Sbjct: 169 AEGTGGTGVAGKGPVYKEKREKLDAAQARLETLKARLDAAIAQLEAQKAALERNRQAAVA 228

Query: 444 DTR 446
           + +
Sbjct: 229 EKQ 231



 Score = 34.5 bits (80), Expect = 0.25
 Identities = 25/148 (16%), Positives = 52/148 (35%), Gaps = 22/148 (14%)

Query: 92  IVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNR 151
           ++++ +E ++F+        KE++  L + +   E+    + +     R K     AL  
Sbjct: 96  VISEPLELKIFE--------KEIDTVLLEIQ--QEEQAAAQAQVAAGFRPKIA---ALTA 142

Query: 152 ELAKVDQEIREMDVEINKKRPSLIKSKERVSHI---------QKKLASAKKSLVEVRQAN 202
           E+A +  EI E   E+N          E              ++K      +   +    
Sbjct: 143 EIAALQAEIDEAQAEVNAAYQEAQCEAEGTGGTGVAGKGPVYKEKREKLDAAQARLETLK 202

Query: 203 EAHNKDIADLETQLADVRKRKAEYERQS 230
              +  IA LE Q A + + +     + 
Sbjct: 203 ARLDAAIAQLEAQKAALERNRQAAVAEK 230


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 36.2 bits (84), Expect = 0.12
 Identities = 34/270 (12%), Positives = 89/270 (32%), Gaps = 18/270 (6%)

Query: 111  IKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKK 170
            +++L  EL+KK+ E+EK++      +++  E       L++    ++++    + EI K+
Sbjct: 1104 VEKLNAELEKKEKELEKLKNT--TPKDMWLED------LDKFEEALEEQEEVEEKEIAKE 1155

Query: 171  RPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQS 230
            +    + K +      KL    K   + ++  ++               R    E  +  
Sbjct: 1156 Q----RLKSKTKGKASKLRK-PKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLD 1210

Query: 231  IPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIAD 290
                +    S+   + + +       K  V  ++ K  ++ KS  +  + +         
Sbjct: 1211 DKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGK 1270

Query: 291  LETQLADVRKRKA---EYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQL--DT 345
             +     V   +       ++     +   + +  T+    K      A    ++     
Sbjct: 1271 PKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKK 1330

Query: 346  INREQKGDQDKLDNELRQQVQTQNEIKKKR 375
              R++K           Q  +     +KK+
Sbjct: 1331 TARKKKSKTRVKQASASQSSRLLRRPRKKK 1360


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 36.0 bits (84), Expect = 0.12
 Identities = 27/112 (24%), Positives = 57/112 (50%), Gaps = 6/112 (5%)

Query: 352 GDQDKLDNELRQQVQTQN-EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNK-KLKEEL 409
            D++KL NEL   ++    E+++K  E E   K  +KL++ + + +  L++ + KL EE 
Sbjct: 513 EDKEKL-NELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEA 571

Query: 410 NSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAY 461
             +    +  ++E +KE +++I+EL   +   +   +  +  E  +   KA 
Sbjct: 572 EKEA---QQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKAN 620



 Score = 34.4 bits (80), Expect = 0.43
 Identities = 19/98 (19%), Positives = 43/98 (43%), Gaps = 13/98 (13%)

Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQ--------GALNRELAKVDQE 159
           E   +E E  L + +   E++E +KEK +    +  +E             +E  ++ +E
Sbjct: 533 EQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKE 592

Query: 160 IREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVE 197
           +R++     K   + +K+ E +   +K+L  A +   +
Sbjct: 593 LRQL----QKGGYASVKAHE-LIEARKRLNKANEKKEK 625



 Score = 32.1 bits (74), Expect = 2.1
 Identities = 34/125 (27%), Positives = 57/125 (45%), Gaps = 23/125 (18%)

Query: 39  ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVE 98
           AL  + E+LK E+ + +E+             E ++  +E+ E E  Q I+E   AK+  
Sbjct: 541 ALLKEAEKLKEELEEKKEKLQ-----------EEEDKLLEEAEKEAQQAIKE---AKKEA 586

Query: 99  HQLFKLYHNETDIKELEDELDKKKGEVEKIERRK--EKAENILREKKKEQGALNRELAKV 156
            ++ K      ++++L+          E IE RK   KA     +KKK+Q     EL KV
Sbjct: 587 DEIIK------ELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEEL-KV 639

Query: 157 DQEIR 161
             E++
Sbjct: 640 GDEVK 644



 Score = 31.7 bits (73), Expect = 2.3
 Identities = 25/95 (26%), Positives = 49/95 (51%), Gaps = 5/95 (5%)

Query: 379 EEAQKRIDKLEDHIRQNE--ASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGD 436
           EEA+K I   ED  + NE  ASL++ ++  E+   +  +     ++L++ELE+  E+L  
Sbjct: 505 EEAKKLIG--EDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKL-Q 561

Query: 437 AKTDKHEDTRRKKKQELVENFKKAYSGVYDRMINM 471
            + DK  +   K+ Q+ ++  KK    +   +  +
Sbjct: 562 EEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQL 596


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 35.4 bits (82), Expect = 0.17
 Identities = 23/95 (24%), Positives = 47/95 (49%), Gaps = 7/95 (7%)

Query: 44  YERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFK 103
           YE  + E +   + +    LK+        EA++ ++E E  ++  E+I  KE   +  +
Sbjct: 284 YENYRTEKLSGLKNSGEPSLKEI------HEARLNEEERELKKKFTEKIREKEKRLEELE 337

Query: 104 LYHNETDIKELEDELDKKKGEVEKIERRKEKAENI 138
               E   KEL  +L++ + ++E +E+R EK ++ 
Sbjct: 338 QNLIEE-RKELNSKLEEIQKKLEDLEKRLEKLKSN 371



 Score = 31.1 bits (71), Expect = 3.5
 Identities = 18/83 (21%), Positives = 41/83 (49%), Gaps = 4/83 (4%)

Query: 317 LESAQMTEYTNLKAEATKRAGKILQQLDTINREQ-KGDQDKLDNELRQQVQTQNEIKKKR 375
           L   + +   +LK     R  +  ++L     E+ +  + +L+ EL Q +    E K+  
Sbjct: 292 LSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLE-ELEQNLIE--ERKELN 348

Query: 376 HEMEEAQKRIDKLEDHIRQNEAS 398
            ++EE QK+++ LE  + + +++
Sbjct: 349 SKLEEIQKKLEDLEKRLEKLKSN 371


>gnl|CDD|205733 pfam13555, AAA_29, P-loop containing region of AAA domain. 
          Length = 60

 Score = 31.8 bits (73), Expect = 0.17
 Identities = 12/35 (34%), Positives = 20/35 (57%), Gaps = 3/35 (8%)

Query: 5  LQYIEVDNFKSYKGK-FSIGPLKKFTAVIGPNGSG 38
          L  +++ N+ ++ G    I P  K T + GP+GSG
Sbjct: 1  LTRLQLVNWGTFDGHTIPIDP--KGTLLTGPSGSG 33


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 35.2 bits (81), Expect = 0.18
 Identities = 29/116 (25%), Positives = 55/116 (47%), Gaps = 6/116 (5%)

Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRP 172
           E + E  KK  + E    +KE       E +K +  + +E     QE+++++  + ++  
Sbjct: 52  ERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERAT 111

Query: 173 SLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYER 228
           SL +  E +S  +K L S ++SL +  +  +   + +  LE Q      +KAE ER
Sbjct: 112 SLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQ------KKAELER 161



 Score = 32.1 bits (73), Expect = 2.0
 Identities = 27/94 (28%), Positives = 47/94 (50%), Gaps = 5/94 (5%)

Query: 70  AERKEAKIE-KDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKI 128
           A +KE  +E K+EA KY   REEI  +E + +  +L   E+ + E    LD+K   +   
Sbjct: 68  ALKKELLLEAKEEARKY---REEI-EQEFKSERQELKQIESRLTERATSLDRKDENLSSK 123

Query: 129 ERRKEKAENILREKKKEQGALNRELAKVDQEIRE 162
           E+  E  E  L +K K       ++ K++++ + 
Sbjct: 124 EKTLESKEQSLTDKSKHIDEREEQVEKLEEQKKA 157


>gnl|CDD|223047 PHA03332, PHA03332, membrane glycoprotein; Provisional.
          Length = 1328

 Score = 35.7 bits (82), Expect = 0.19
 Identities = 21/108 (19%), Positives = 41/108 (37%), Gaps = 12/108 (11%)

Query: 323 TEYTNLKAEATKRAGKILQQLDTINREQKGDQD---KLDNELRQQVQT-QNEIKKKRHEM 378
           T Y N   +AT    ++  ++  +N       D   KL + + +   T  N I+      
Sbjct: 880 TLYVNQLLQATAATAEMASKIGGLNARVDKTSDVITKLGDTIAKISATLDNNIR------ 933

Query: 379 EEAQKRIDKLEDHIRQNEASLKDN-KKLKEELNSDVGSSKNRVQELQK 425
                R+  LED +     ++  N   L  +L     ++  R++E+  
Sbjct: 934 -AVNGRVSDLEDQVNLRFLAVATNFNTLATQLKELGTTTNERIEEVMA 980



 Score = 31.5 bits (71), Expect = 3.5
 Identities = 12/67 (17%), Positives = 30/67 (44%), Gaps = 3/67 (4%)

Query: 363 QQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQE 422
           Q +Q      +   ++     R+DK  D I +   ++    K+   L++++ +   RV +
Sbjct: 885 QLLQATAATAEMASKIGGLNARVDKTSDVITKLGDTI---AKISATLDNNIRAVNGRVSD 941

Query: 423 LQKELEQ 429
           L+ ++  
Sbjct: 942 LEDQVNL 948


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 35.6 bits (82), Expect = 0.19
 Identities = 26/115 (22%), Positives = 57/115 (49%), Gaps = 6/115 (5%)

Query: 338 KILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEA 397
            ++++L  + +E +   + L+  L++Q + + E++++  E++E ++   KLE      EA
Sbjct: 515 VLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNK-KLELEKEAQEA 573

Query: 398 SLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQE 452
                K LK+E+ S +   K +     KE  + IE+L   K  K +  ++    +
Sbjct: 574 L----KALKKEVESIIRELKEKKIHKAKE-IKSIEDLVKLKETKQKIPQKPTNFQ 623



 Score = 32.9 bits (75), Expect = 1.2
 Identities = 25/135 (18%), Positives = 63/135 (46%), Gaps = 5/135 (3%)

Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKK-R 171
           E+   ++K     +++E++ E  E +L+E++K +  L +E+ ++ +  R   +E+ K+ +
Sbjct: 512 EINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQ 571

Query: 172 PSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSI 231
            +L   K+ V  I ++L   K    +  ++ E    D+  L+     + ++   ++   I
Sbjct: 572 EALKALKKEVESIIRELKEKKIHKAKEIKSIE----DLVKLKETKQKIPQKPTNFQADKI 627

Query: 232 PGRDINLESAQDVEI 246
             +       Q  +I
Sbjct: 628 GDKVRIRYFGQKGKI 642


>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
          Length = 621

 Score = 35.1 bits (81), Expect = 0.22
 Identities = 28/133 (21%), Positives = 64/133 (48%), Gaps = 17/133 (12%)

Query: 334 KRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEME----------EAQK 383
           K  GK  +Q  TI       +D+++  +++  +   E K+KR +++          +A+K
Sbjct: 486 KGTGK--EQSITIQGASTLPKDEVERMVKEAEKNAAEDKEKREKIDLKNQAESLCYQAEK 543

Query: 384 RIDKLEDHIRQNE-ASLKDN-KKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDK 441
           ++ +L+D I + +   +++  KKL++ L +D   +   ++ L +EL++ + E+G      
Sbjct: 544 QLKELKDKISEEKKEKIENLIKKLRQALQND---NYESIKSLLEELQKALMEIGKEVYSS 600

Query: 442 HEDTRRKKKQELV 454
              T      + V
Sbjct: 601 TSTTDPASNDDDV 613


>gnl|CDD|216249 pfam01025, GrpE, GrpE. 
          Length = 165

 Score = 33.7 bits (78), Expect = 0.23
 Identities = 16/63 (25%), Positives = 32/63 (50%), Gaps = 3/63 (4%)

Query: 888 KSIQEMTSRLQTIQAPNLRAMEKLEHAKENLMKTNEEFENARKRAKKAKANFDRIKKERY 947
           +  +E+    + ++       E++E  K+ L++   EFEN RKR ++ +   +  KK   
Sbjct: 4   EEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTERER---EEAKKYAI 60

Query: 948 DKF 950
           +KF
Sbjct: 61  EKF 63


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 34.9 bits (81), Expect = 0.24
 Identities = 47/238 (19%), Positives = 83/238 (34%), Gaps = 28/238 (11%)

Query: 124 EVEKIERR---KEKAENILREKKKE--QGALNRELAKVDQEIREMDVEINKKRPSLIKSK 178
           E EK  R    K+ AE     K K+    AL R  AK           I  K  +   + 
Sbjct: 463 EREKAAREARHKKAAEA-RAAKDKDAVAAALARVKAKKAAA----TQPIVIKAGARPDNS 517

Query: 179 ERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINL 238
             ++  + + A A+    E + A  A  K  A +   +A  + +KA              
Sbjct: 518 AVIAAREARKAQARARQAEKQAAAAADPKK-AAVAAAIARAKAKKAA------------Q 564

Query: 239 ESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADV 298
           ++A      +  P        ++  + K A+ + +  E  +     +   A +   +A  
Sbjct: 565 QAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAAAIARA 624

Query: 299 RKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDK 356
           + +KAE +  + P   ++   A +       A A  +A K  QQ      E+  D  K
Sbjct: 625 KAKKAEQQANAEPEEPVDPRKAAVA-----AAIARAKARKAAQQQANAEPEEAEDPKK 677


>gnl|CDD|221957 pfam13175, AAA_15, AAA ATPase domain.  This family of domains
           contain a P-loop motif that is characteristic of the AAA
           superfamily.
          Length = 320

 Score = 34.7 bits (80), Expect = 0.25
 Identities = 36/148 (24%), Positives = 63/148 (42%), Gaps = 9/148 (6%)

Query: 5   LQYIEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG------ALKDDYERLKAEMIQAEEET 58
           L+ I++ NFKS K    I   +    +IG NGSG      AL +  E+ K  +       
Sbjct: 1   LKKIKIKNFKSIKNLEEISLNEGINVIIGENGSGKTTILEALDNLKEKKKIYIS---ISD 57

Query: 59  NMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDEL 118
           N + + K+ +  + K+ KI          I+   +   + + L  + ++  D  EL    
Sbjct: 58  NKADIDKENLNKKSKKIKISIIIELSSNEIKSFSIKFYLNYYLEIINNDVEDEIELLLFS 117

Query: 119 DKKKGEVEKIERRKEKAENILREKKKEQ 146
              +  ++KI  + EK + I   KKK+ 
Sbjct: 118 KNLEYILKKILDKIEKFDEIKGAKKKDF 145


>gnl|CDD|237110 PRK12472, PRK12472, hypothetical protein; Provisional.
          Length = 508

 Score = 34.8 bits (80), Expect = 0.26
 Identities = 37/148 (25%), Positives = 64/148 (43%), Gaps = 13/148 (8%)

Query: 699 PKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQ---QE 755
           PK +A+ A+  AR +T++R+ E+     D       ++   +     E A L++     E
Sbjct: 180 PKAEAL-AAAPARAETLAREAEDAARAAD-------EAKTAAAAAAREAAPLKASLRKLE 231

Query: 756 RQKICQDKDTKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKK 815
           R K   D + K+  A    A +  +E  ARA+  ++K A +    A +L+  +A    K+
Sbjct: 232 RAKARADAELKR--ADKALAAAKTDEAKARAEERQQKAAQQAAEAATQLDTAKADAEAKR 289

Query: 816 QAVDAMDEEIGKARREVGSIAKDIQAAQ 843
            A  A  E    A  +    AK    A+
Sbjct: 290 AAAAATKEAAKAAAAKKAETAKAATDAK 317


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 34.9 bits (80), Expect = 0.27
 Identities = 69/362 (19%), Positives = 152/362 (41%), Gaps = 49/362 (13%)

Query: 103 KLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIRE 162
           +L  +++ + +LE+EL +K+ E ++ +   E+  + L E  + +   N EL    + + E
Sbjct: 62  ELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEE 121

Query: 163 MDVEINKKRPSLIKSKERVSHIQKKL-ASAKKSLVEVRQANEAHNKDIADLETQLADVRK 221
           ++ +   +     +++E    ++ KL A + K   E     +   + I+ ++  L++++ 
Sbjct: 122 LEKKAENE---AAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQC 178

Query: 222 RKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKK---SLVEVR 278
           R                    D E+      L + +E++   QK+LA A+K   SL   +
Sbjct: 179 RAQN----------------ADTELKLLESELEELREQLEECQKELAEAEKKLQSLTSEQ 222

Query: 279 QANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGK 338
            ++  ++  I  LE +L         YE+ +   + +  +  Q+ E     A   +   K
Sbjct: 223 ASSADNSVKIKHLEEEL-------KRYEQDAEVVKSMKEQLLQIPELERELAALREENRK 275

Query: 339 ILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEAS 398
           +        R  K D + L  EL        +++ +    E+ ++++  LE    + E  
Sbjct: 276 L--------RSMKEDNELLKEEL-------EDLQSRLERFEKMREKLADLELEKEKLENE 320

Query: 399 LKDNKKLKEELNSDVGSS---KNRVQELQKELEQVIEELGDAKTD-KHEDTRRKKKQELV 454
           LK  K L +++  ++ +      R+  LQ E  Q+ E+ G   +  K  +T  ++ Q   
Sbjct: 321 LKSWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSAKQLETTLQQLQLER 380

Query: 455 EN 456
           + 
Sbjct: 381 QK 382



 Score = 34.9 bits (80), Expect = 0.27
 Identities = 71/409 (17%), Positives = 170/409 (41%), Gaps = 36/409 (8%)

Query: 53  QAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIK 112
           +A E    + L K  + AE  + + EK++  K  +     +  ++     +  + +T++K
Sbjct: 129 EAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELK 188

Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKV---DQEIR--EMDVEI 167
            LE EL++ + ++E+ ++   +AE  L+    EQ +      K+   ++E++  E D E+
Sbjct: 189 LLESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEV 248

Query: 168 NKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA---------- 217
            K     +     +      L    + L  +++ NE   +++ DL+++L           
Sbjct: 249 VKSMKEQLLQIPELERELAALREENRKLRSMKEDNELLKEELEDLQSRLERFEKMREKLA 308

Query: 218 --DVRKRKAEYERQS----IPGRDINLESAQDV--EINKKRPSLIKSKERVSHIQKKLAS 269
             ++ K K E E +S    +    +NL +  D+   I   +   ++ KE+   I      
Sbjct: 309 DLELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSAKQ 368

Query: 270 AKKSLVEVRQANEAHNKDIADLETQL----ADVRK--RKAEYERQSIPGRDINLES---- 319
            + +L +++   +    +I +L+ +L    A VR+  R+     +   G    L S    
Sbjct: 369 LETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNSYDKE 428

Query: 320 AQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEME 379
              T  +    +  + A  ++Q++ +   + +    +L+ ++ QQ    N ++    E++
Sbjct: 429 LTETSVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVGQQKDRNNTLET---EIK 485

Query: 380 EAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELE 428
             ++++   E  +   + +    +   E L  +    +     L+ +LE
Sbjct: 486 LLKEQLVSNERLLSFVKEATNALRLKIETLERERDRLRQEKSLLEMKLE 534



 Score = 33.0 bits (75), Expect = 1.2
 Identities = 80/397 (20%), Positives = 154/397 (38%), Gaps = 51/397 (12%)

Query: 40  LKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEH 99
            K    +L+ E++Q E E   + ++      ERK + + ++   +  R  E  V  +   
Sbjct: 66  SKSKLIQLENELMQKELEHKRAQIE-----LERKASTLAENYERELDRNLELEVRLKALE 120

Query: 100 QLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQE 159
           +L K   NE    E E +L K K + E ++ + EK + +   K+     +  +L+++   
Sbjct: 121 ELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESIS-RIKNDLSEMQCR 179

Query: 160 IREMDVEINKKRPSLIKSKERVSHIQKKLASAKK---SLVEVRQANEAHNKDIADLETQL 216
            +  D E+      L + +E++   QK+LA A+K   SL   + ++  ++  I  LE +L
Sbjct: 180 AQNADTELKLLESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEEL 239

Query: 217 ADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVE 276
                                    QD E+ K        KE++  I +          E
Sbjct: 240 KRY---------------------EQDAEVVK------SMKEQLLQIPELERELAALREE 272

Query: 277 VRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRA 336
            R+      +D   L+ +L D++ R   +E+      D+ LE  ++           +  
Sbjct: 273 NRKLRSMK-EDNELLKEELEDLQSRLERFEKMREKLADLELEKEKLENELKSWKSLLQDI 331

Query: 337 GKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNE 396
           G  L+  D ++              R+ V  QNE  + + +         +LE  ++Q +
Sbjct: 332 GLNLRTPDDLS--------------RRIVVLQNEELQLKEKNGSISSSAKQLETTLQQLQ 377

Query: 397 ASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEE 433
              +       EL   + + K  V+ LQ+ L  V +E
Sbjct: 378 LERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKE 414


>gnl|CDD|214710 smart00533, MUTSd, DNA-binding domain of DNA mismatch repair MUTS
           family. 
          Length = 308

 Score = 34.6 bits (80), Expect = 0.27
 Identities = 28/123 (22%), Positives = 58/123 (47%), Gaps = 22/123 (17%)

Query: 622 DKEMGNLKAQKEKLSEELREAMKKSRKESELNTVQ---STIKGLEIRLNYSRQDLQNTKS 678
           D E+  L+ + E+L EEL E +KK R+E  +++++   + + G  I +         TKS
Sbjct: 147 DPELDELREKLEELEEELEELLKKEREELGIDSLKLGYNKVHGYYIEV---------TKS 197

Query: 679 QIAKLEAEIDALNARADA---TEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCK 735
           +  K+  +    ++  +    T P++K +E  +          KEE+  +E  + R+  +
Sbjct: 198 EAKKVPKDFIRRSSLKNTERFTTPELKELENELLEA-------KEEIERLEKEILRELLE 250

Query: 736 SIG 738
            + 
Sbjct: 251 KVL 253


>gnl|CDD|234013 TIGR02785, addA_Gpos, helicase-exonuclease AddAB, AddA subunit,
           Firmicutes type.  AddAB, also called RexAB, substitutes
           for RecBCD in several bacterial lineages. These DNA
           recombination proteins act before synapse and are
           particularly important for DNA repair of double-stranded
           breaks by homologous recombination. The term AddAB is
           used broadly, with AddA homologous between the
           Firmicutes (as modeled here) and the
           alphaproteobacteria, while the partner AddB proteins
           show no strong homology across the two groups of species
           [DNA metabolism, DNA replication, recombination, and
           repair].
          Length = 1230

 Score = 35.1 bits (81), Expect = 0.29
 Identities = 31/140 (22%), Positives = 52/140 (37%), Gaps = 16/140 (11%)

Query: 317 LESAQMT--EYTNLKAEATKRAGK------ILQQLDTINREQKGDQDKLDNELRQQVQTQ 368
           LE  Q        L       A +       LQ +D + +E     D   NELR+ V   
Sbjct: 219 LEGLQEKLQRALELFMAEDGLAPRLENFQLDLQNIDELIQESLAQADW--NELRKAVAAF 276

Query: 369 NEI----KKKRHEMEEAQKRIDKLEDHIRQNEASLKDN--KKLKEELNSDVGSSKNRVQE 422
                   K   E  +  +  DKL +  ++    LK +   + +E+    +   K  V+ 
Sbjct: 277 KFKNLKAAKGDEEDADLLEEADKLREEAKKQLEKLKTDYFTRSEEDHLRIMQEMKPVVKT 336

Query: 423 LQKELEQVIEELGDAKTDKH 442
           L + ++  IE  G  K +K+
Sbjct: 337 LVQLVKDFIERFGAEKREKN 356


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 32.7 bits (75), Expect = 0.40
 Identities = 22/111 (19%), Positives = 52/111 (46%), Gaps = 4/111 (3%)

Query: 320 AQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEME 379
           A++      K +A ++  K+ + L+         Q K + EL +  +   E++  R ++ 
Sbjct: 10  AELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQLN 69

Query: 380 EAQKRIDKLEDHIRQNEASLKDNKKL----KEELNSDVGSSKNRVQELQKE 426
           E +K I +L+      +A L + ++     K+ L  ++   + R++EL ++
Sbjct: 70  ELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRIEELNEQ 120


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 33.9 bits (78), Expect = 0.41
 Identities = 52/302 (17%), Positives = 123/302 (40%), Gaps = 22/302 (7%)

Query: 109 TDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEIN 168
             + +  DEL+ K+ ++++     ++  + LR++  E      EL    +E+RE   E+ 
Sbjct: 2   LAMLDKLDELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELR 61

Query: 169 KKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYER 228
           ++R  + +                    EV++  E  ++  A L+    + R+ K +   
Sbjct: 62  EERDEINE--------------------EVQELKEKRDEINAKLQELRKEYRELKEKRNE 101

Query: 229 QSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDI 288
            ++ GR I     +   + KK+ + + + E    + +K+   +K L + ++A E + K  
Sbjct: 102 FNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLK 161

Query: 289 ADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINR 348
                     +K +  +E+      +      +M +      E  K A ++ ++   +++
Sbjct: 162 ELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSK 221

Query: 349 EQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEA--SLKDNKKLK 406
           +     ++  N   +  + + +IK  R + + A++R  + E   R  E     K  +KL 
Sbjct: 222 KIDELHEEFRNLQNELRELEKKIKALRAKEKAAKRREKREELKERAEEIYEKFKRGEKLT 281

Query: 407 EE 408
            E
Sbjct: 282 TE 283



 Score = 32.3 bits (74), Expect = 1.1
 Identities = 42/228 (18%), Positives = 92/228 (40%), Gaps = 23/228 (10%)

Query: 255 KSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRD 314
           K +E     Q+      +   EV++  E  ++  A L+    + R+ K +    ++ GR 
Sbjct: 49  KVRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRS 108

Query: 315 IN--------LESAQMTEYTNL---------------KAEATKRAGKILQQLDTINREQK 351
           I         LE  Q T                    + E  K+A +  ++L  +  E  
Sbjct: 109 IKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEID 168

Query: 352 GDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
             + K      +  +  NE ++   EM +  +  D+L     +      +  K  +EL+ 
Sbjct: 169 ELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHE 228

Query: 412 DVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKK 459
           +  + +N ++EL+K+++ +  +   AK  +  +  +++ +E+ E FK+
Sbjct: 229 EFRNLQNELRELEKKIKALRAKEKAAKRREKREELKERAEEIYEKFKR 276


>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
          Length = 329

 Score = 33.3 bits (77), Expect = 0.60
 Identities = 23/94 (24%), Positives = 43/94 (45%), Gaps = 9/94 (9%)

Query: 342 QLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKD 401
            +  I +E +  +DK   E  +    + E+ KK  E+ E    I KL   I+ + A LK+
Sbjct: 5   YISEILKEYEKRRDK---EELELKNRKEEVYKKIPEIAEIDDEIAKLG--IKLSRAILKN 59

Query: 402 NKKLKEELNSDVGSSKNRVQELQKELEQVIEELG 435
             K +E L       K ++ +L+ +  +++   G
Sbjct: 60  PDKKEETLK----ELKEKITDLRVKKAELLVSNG 89



 Score = 32.2 bits (74), Expect = 1.3
 Identities = 18/78 (23%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 124 EVEKI-ERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVS 182
           E+ K  E+R++K E  L+ +K+E      E+A++D EI ++ +++++   +++K+ ++  
Sbjct: 8   EILKEYEKRRDKEELELKNRKEEVYKKIPEIAEIDDEIAKLGIKLSR---AILKNPDKKE 64

Query: 183 HIQKKLASAKKSLVEVRQ 200
              K+L      L   + 
Sbjct: 65  ETLKELKEKITDLRVKKA 82


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
           protein 2.  THO and TREX form a eukaryotic complex which
           functions in messenger ribonucleoprotein metabolism and
           plays a role in preventing the transcription-associated
           genetic instability. Tho2, along with four other
           subunits forms THO.
          Length = 296

 Score = 33.1 bits (76), Expect = 0.71
 Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 1/65 (1%)

Query: 355 DKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKL-EDHIRQNEASLKDNKKLKEELNSDV 413
           ++L+ ++++   + + I KK+ E +  +  I KL E+  +  E + K  K+L EE +S  
Sbjct: 34  ERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEKSSWF 93

Query: 414 GSSKN 418
            S   
Sbjct: 94  PSKDP 98


>gnl|CDD|224045 COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport
          systems, ATPase components [Inorganic ion transport and
          metabolism / Coenzyme metabolism].
          Length = 258

 Score = 32.9 bits (76), Expect = 0.75
 Identities = 17/39 (43%), Positives = 22/39 (56%), Gaps = 9/39 (23%)

Query: 8  IEVDNFK-SYKGK-------FSIGPLKKFTAVIGPNGSG 38
          +EV+N    Y GK       FSI P  + T ++GPNGSG
Sbjct: 3  LEVENLSFGYGGKPILDDLSFSI-PKGEITGILGPNGSG 40


>gnl|CDD|181442 PRK08476, PRK08476, F0F1 ATP synthase subunit B'; Validated.
          Length = 141

 Score = 32.0 bits (73), Expect = 0.75
 Identities = 19/60 (31%), Positives = 32/60 (53%), Gaps = 5/60 (8%)

Query: 670 RQDLQNTK---SQIAKLEAEIDAL--NARADATEPKIKAIEASMTARGDTISRKKEEMNS 724
           + DL+  K   S ++++E EI+ +  NAR +A + + KAI  +       I  KK E+ S
Sbjct: 44  KNDLEKVKTNSSDVSEIEHEIETILKNAREEANKIRQKAIAKAKEEAEKKIEAKKAELES 103


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 33.1 bits (76), Expect = 0.76
 Identities = 25/97 (25%), Positives = 35/97 (36%), Gaps = 4/97 (4%)

Query: 636 SEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARAD 695
            +ELREA+ +  KE    T    +   E      + +    K +I KL+AEID L    D
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEE----KAEYDALKEEIDKLDAEIDRLEELLD 56

Query: 696 ATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRD 732
             E K  A         +    K E       +   D
Sbjct: 57  ELEAKPAASGEGGGGEEEEEEAKAEAAEFRAYLRGGD 93



 Score = 31.2 bits (71), Expect = 2.8
 Identities = 17/92 (18%), Positives = 39/92 (42%), Gaps = 3/92 (3%)

Query: 378 MEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDA 437
           ++E ++ + +L   +R+      +     EE  ++  + K  + +L  E+++ +EEL D 
Sbjct: 1   LKELREALAELAKELRKLTE--DEKLAEAEEEKAEYDALKEEIDKLDAEIDR-LEELLDE 57

Query: 438 KTDKHEDTRRKKKQELVENFKKAYSGVYDRMI 469
              K   +      E  E   KA +  +   +
Sbjct: 58  LEAKPAASGEGGGGEEEEEEAKAEAAEFRAYL 89



 Score = 29.6 bits (67), Expect = 9.0
 Identities = 21/116 (18%), Positives = 39/116 (33%), Gaps = 20/116 (17%)

Query: 289 ADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINR 348
             L     ++RK   +          +     +  EY  LK E  K   +I         
Sbjct: 6   EALAELAKELRKLTED--------EKLAEAEEEKAEYDALKEEIDKLDAEI--------- 48

Query: 349 EQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKK 404
                 ++L +EL  +     E      E EEA+    +   ++R  + +L + +K
Sbjct: 49  ---DRLEELLDELEAKPAASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAEERK 101


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 33.1 bits (76), Expect = 0.97
 Identities = 19/59 (32%), Positives = 32/59 (54%), Gaps = 3/59 (5%)

Query: 380 EAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAK 438
           E +KR+ KL   I +NEA   +N++L++   S     +  VQ   +EL + IE+L   +
Sbjct: 70  ELRKRLAKL---ISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSER 125


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 33.0 bits (76), Expect = 1.0
 Identities = 28/157 (17%), Positives = 63/157 (40%), Gaps = 25/157 (15%)

Query: 15  SYKGKFSIGPLKKFTAVIGPNGSGALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKE 74
           + + + ++ P      V+  +     ++  +RL+A         N   +K    + E + 
Sbjct: 736 NLRAEMNLSPSAPLKVVLVGS-----EELEDRLEA---------NEDDIKGLANLEELEI 781

Query: 75  AKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERR--- 131
              + +E              E+   L  L     ++  LE EL+K + E+++IE++   
Sbjct: 782 LSPDPEEPPVEVTAVVGGA--ELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKLSN 839

Query: 132 -----KEKAENILREKKKEQGALNRELAKVDQEIREM 163
                K   E + +EK+K       +LAK+++ +  +
Sbjct: 840 EGFVAKAPEEVVEKEKEKLAEYQ-VKLAKLEERLAVL 875


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 32.6 bits (75), Expect = 1.0
 Identities = 39/166 (23%), Positives = 63/166 (37%), Gaps = 30/166 (18%)

Query: 635 LSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARA 694
           LS E+      S K+S L+ + S I  L   L+  RQ  Q+ +  +A L A + A  A  
Sbjct: 44  LSREI------SGKDSALDRLNSQIAELADLLSLERQGNQDLQDSVANLRASLSAAEAER 97

Query: 695 DATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCKSIGVSTIRQYEEAELRSQ- 753
              +  +  +  +  A         +E++S + +  R   +   V  + Q   A LR Q 
Sbjct: 98  SRLQALLAELAGAGAAAEGRAGELAQELDSEKQVSARALAQ---VELLNQQIAA-LRRQL 153

Query: 754 --------------QERQKICQDKDTKKNVARWERAVSDDEEELAR 785
                         +E Q    D   + NVA  +R      +EL R
Sbjct: 154 AALEAALDASEKRDRESQAKIADLGRRLNVALAQRV-----QELNR 194


>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
           Synaptonemal complex protein 1 (SCP-1) is the major
           component of the transverse filaments of the
           synaptonemal complex. Synaptonemal complexes are
           structures that are formed between homologous
           chromosomes during meiotic prophase.
          Length = 787

 Score = 32.8 bits (74), Expect = 1.1
 Identities = 77/402 (19%), Positives = 176/402 (43%), Gaps = 50/402 (12%)

Query: 79  KDEAEKYQRIREEIVA------KEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRK 132
           K++ EK Q + EE         K+V   L ++   E  +K+L   L++ + +  ++E + 
Sbjct: 218 KEDHEKIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKT 277

Query: 133 EKAENILREKKKEQGALNRELAKVD---------QEIREMDVEINKKRPSLIKSKERVSH 183
           +  +  L+E  +++  L +EL  +          Q+  E D++I  K    +  ++    
Sbjct: 278 KLQDENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQM 337

Query: 184 IQKKLASAKKSLVE-------------VRQANEAHNKDIADLETQLADVRKRKAEYERQS 230
            +   A A  S V              +R   +   K+   L+    +++K+ +E E  +
Sbjct: 338 EELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSSELEEMT 397

Query: 231 IPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIAD 290
               +  +E  +  +I  +   L+  K++   I ++L   ++ L+ + QA E   K+I D
Sbjct: 398 KFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEELKGKEQELIFLLQARE---KEIHD 454

Query: 291 LETQLA-----------DVRKRKAEYERQSIPGRDINLESAQMT-EYTNLKAEATKRAGK 338
           LE QL            +V   K E E++ +   ++     ++  E   L  EA+    +
Sbjct: 455 LEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMTLE 514

Query: 339 ILQQLDTINREQKGDQ---DKLDNELRQQVQTQNEIKKKRHEM----EEAQKRIDKLEDH 391
           + +  + I   +K ++    +++N   +++  ++E++  R E     +E + ++DK E++
Sbjct: 515 LKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEEN 574

Query: 392 IRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEE 433
            R  E  +   +K  + L +   + K +++   K +E++ +E
Sbjct: 575 ARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQE 616


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 32.9 bits (75), Expect = 1.1
 Identities = 43/268 (16%), Positives = 89/268 (33%), Gaps = 34/268 (12%)

Query: 148 ALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNK 207
            +N+ L K+ QE  E+D    K+  +  K  E       +LA  +  L   R    A   
Sbjct: 182 QINQLLEKLKQERNEIDEA-EKEYATYHKLLESRRAEHARLAELRSELRADRDHIRALRD 240

Query: 208 DIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKL 267
            +           +   + E++    R+      +D         +++ ++R +H+QK  
Sbjct: 241 AVELW-----PRLQEWKQLEQELTRRREELATFPRD--------GVLRLEKREAHLQKTE 287

Query: 268 ASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTN 327
           A     LV + +  +  ++ I   E  L  + +   +              S        
Sbjct: 288 AEIDALLVRLAELKDLASQLIPAKEAVLQALVRLHQQ-------------LSEIKASAFE 334

Query: 328 LKAEATKRAGKILQQLDT----INREQKGDQDKLDNELRQ---QVQTQNEIKKKRHEMEE 380
           L          +  + +        E+  D   L+  LR    Q + +         ++E
Sbjct: 335 LTETLAGIEADLRDKEEAAGNGFEAERVHDLRSLECMLRYQSSQRELKQTEAAYCKRLDE 394

Query: 381 AQKRIDKLEDHIRQNEASLKDNKKLKEE 408
            +   D+ E+  RQ  A  ++  +  +E
Sbjct: 395 KRLFEDEAEEEARQRLADDEEEVRAGDE 422


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 32.8 bits (75), Expect = 1.1
 Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 15/83 (18%)

Query: 356 KLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGS 415
            LD E RQ          ++ E++  +K  +KL   +    A LK  K+   EL ++   
Sbjct: 37  ALDKEWRQ----------RQFELDSLRKEFNKLNKEV----AKLKIAKEDATELIAETKE 82

Query: 416 SKNRVQELQKELEQVIEELGDAK 438
            K  + E + E+++    L DAK
Sbjct: 83  LKKEITEKEAEVQEAKAAL-DAK 104


>gnl|CDD|147765 pfam05791, Bacillus_HBL, Bacillus haemolytic enterotoxin (HBL).
           This family consists of several Bacillus haemolytic
           enterotoxins (HblC, HblD, HblA, NheA, and NheB) which
           can cause food poisoning in humans.
          Length = 181

 Score = 31.9 bits (73), Expect = 1.1
 Identities = 17/46 (36%), Positives = 27/46 (58%)

Query: 402 NKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRR 447
           ++K KE L   +   +  +Q  QKE++Q++EEL D K    +DTR 
Sbjct: 98  DQKDKETLKEGLTKLQGEIQTNQKEVDQLLEELKDFKNKLAKDTRS 143


>gnl|CDD|178768 PLN03229, PLN03229, acetyl-coenzyme A carboxylase carboxyl
           transferase subunit alpha; Provisional.
          Length = 762

 Score = 32.9 bits (75), Expect = 1.1
 Identities = 65/361 (18%), Positives = 156/361 (43%), Gaps = 53/361 (14%)

Query: 107 NETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
            +T ++ELE E++K K ++ K +    K   +         ALN  + K+ +EI   D+E
Sbjct: 427 VKTPVRELEGEVEKLKEQILKAKESSSKPSEL---------ALNEMIEKLKKEI---DLE 474

Query: 167 INKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEY 226
             +   ++         +Q++L + ++   +    ++  +  + +       + K K E+
Sbjct: 475 YTEAVIAM--------GLQERLENLREEFSKANSQDQLMHPVLME------KIEKLKDEF 520

Query: 227 -ERQSIPGRDINLESAQDV--EINKKRPSLIKSKERVSHIQKKLASAKKSLV---EVRQA 280
            +R S     ++L+   D+  E ++ + +L + K +   ++ ++    K ++   E+++ 
Sbjct: 521 NKRLSRAPNYLSLKYKLDMLNEFSRAK-ALSEKKSKAEKLKAEINKKFKEVMDRPEIKEK 579

Query: 281 NEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKIL 340
            EA   ++A       D        E+     ++I LE A + +   L+     +  K  
Sbjct: 580 MEALKAEVASSGASSGDELD-DDLKEKVEKMKKEIELELAGVLKSMGLEVIGVTKKNKDT 638

Query: 341 QQLDTINREQKGDQDKLDNELRQQVQT-------QNEIKKKRHEMEEAQK--------RI 385
            +  T     +   + L+ E+ ++++        +++I+  + E+ +A K        +I
Sbjct: 639 AE-QTPPPNLQEKIESLNEEINKKIERVIRSSDLKSKIELLKLEVAKASKTPDVTEKEKI 697

Query: 386 DKLEDHIRQNEASLKDNKKLK---EELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKH 442
           + LE  I+Q  A   ++ +LK   EEL +++ +++    E    L+   ++  D+K D  
Sbjct: 698 EALEQQIKQKIAEALNSSELKEKFEELEAELAAARETAAESNGSLKNDDDKEEDSKEDGS 757

Query: 443 E 443
            
Sbjct: 758 R 758


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 32.6 bits (75), Expect = 1.2
 Identities = 22/98 (22%), Positives = 47/98 (47%), Gaps = 4/98 (4%)

Query: 368 QNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDN-KKLKEELNSDVGSSKNRVQELQKE 426
             E+K++  E+EE    + +LE+ + + +  LK   KK + +L  +      ++ E +++
Sbjct: 329 FPELKEELKELEE---ELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEK 385

Query: 427 LEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAYSGV 464
           L + +EEL +   +  E+      +  +   K  Y GV
Sbjct: 386 LSEELEELEEELKELKEELESLYSEGKISVNKTIYPGV 423



 Score = 31.5 bits (72), Expect = 2.6
 Identities = 16/86 (18%), Positives = 41/86 (47%), Gaps = 6/86 (6%)

Query: 148 ALNRELAKVDQEIREMDVEINK--KRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAH 205
            L  EL ++++E++E++ E+ K  K    +  K R     +K    +K    + +  E  
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEK----LLETKEKL 386

Query: 206 NKDIADLETQLADVRKRKAEYERQSI 231
           ++++ +LE +L ++++       +  
Sbjct: 387 SEELEELEEELKELKEELESLYSEGK 412



 Score = 30.7 bits (70), Expect = 5.0
 Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 3/93 (3%)

Query: 78  EKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAEN 137
             +  E+ + + EE+   E E +  K    +   K       +K+ ++EK+   KEK   
Sbjct: 329 FPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSE 388

Query: 138 ILREKKKEQGALNRELAKVDQEIREMDVEINKK 170
            L E ++E   L  EL  +  E +   + +NK 
Sbjct: 389 ELEELEEELKELKEELESLYSEGK---ISVNKT 418


>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
           protein, PEP-CTERM locus subfamily.  Members of this
           protein family belong to the family of polysaccharide
           chain length determinant proteins (pfam02706). All are
           found in species that encode the PEP-CTERM/exosortase
           system predicted to act in protein sorting in a number
           of Gram-negative bacteria, and are found near the epsH
           homolog that is the putative exosortase gene [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 498

 Score = 32.7 bits (75), Expect = 1.2
 Identities = 16/81 (19%), Positives = 32/81 (39%), Gaps = 13/81 (16%)

Query: 628 LKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEI 687
            K +  +L E+  E         E   + + +              Q  + ++A+ EAEI
Sbjct: 280 TKREIAQLEEQKEEEGSAKNGGPERGEIANPV-------------YQQLQIELAEAEAEI 326

Query: 688 DALNARADATEPKIKAIEASM 708
            +L AR      +I+ +E+ +
Sbjct: 327 ASLEARVAELTARIERLESLL 347


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
           This family consists of several plant proteins of
           unknown function. Several sequences in this family are
           described as being "myosin heavy chain-like".
          Length = 484

 Score = 32.7 bits (74), Expect = 1.2
 Identities = 67/345 (19%), Positives = 134/345 (38%), Gaps = 19/345 (5%)

Query: 49  AEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQ---RIREEIVAKEVEHQLFKLY 105
           A + +AEE    S + +K V    KE    K+  E+        EE        +     
Sbjct: 146 AALKRAEEAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAALEKDQDRE 205

Query: 106 HNETDIKELEDELDKKKGEVE-KIERRKEKAENILREKKKEQGALNRELAKVDQEIREMD 164
             E ++KE E EL++ K +++ + +  K    +   E  +++ ++   +A V +E+ E  
Sbjct: 206 TYEKELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVASVLKELEEAK 265

Query: 165 VEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKA 224
             + K        +  V  ++++L   KK L E+R+      +  + LE +L   +  K 
Sbjct: 266 ANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKEGEAEEAASSLEAELNRTKSEKE 325

Query: 225 EYERQSIPGRDINLESAQDVEINKK------RPSLIKSKERVSHIQKKLASAKKSLVEVR 278
              R+ +    + L+ A       +      R  L K KE     +  L +A+  L    
Sbjct: 326 SKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAALETAELRLEAAL 385

Query: 279 QANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLE--SAQMTEYTNLKAEATKRA 336
           +  EA     A    ++  +++ +   +  S     ++LE       +    +  A K+ 
Sbjct: 386 KEAEAAKAAEALALAEIKALQESEESAKADSPRKITLSLEEYEELSKKAEEAEELAEKKV 445

Query: 337 GKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEA 381
              L Q++     +     KL+       +   EI++K+  +EEA
Sbjct: 446 AAALAQVEEAKESENESLKKLE-------EASKEIEEKKAALEEA 483


>gnl|CDD|221514 pfam12297, EVC2_like, Ellis van Creveld protein 2 like protein.
           This family of proteins is found in eukaryotes. Proteins
           in this family are typically between 571 and 1310 amino
           acids in length. There are two conserved sequence
           motifs: LPA and ELH. EVC2 is implicated in Ellis van
           Creveld chondrodysplastic dwarfism in humans. Mutations
           in this protein can give rise to this congenital
           condition. LIMBIN is a protein which shares around 80%
           sequence homology with EVC2 and it is implicated in a
           similar condition in bovine chondrodysplastic dwarfism.
          Length = 429

 Score = 32.5 bits (74), Expect = 1.2
 Identities = 35/186 (18%), Positives = 68/186 (36%), Gaps = 46/186 (24%)

Query: 44  YERLKAEMIQAEEETN----------MSYLKK-KGVVAERKEAKIEKDEAEKYQRIREEI 92
           ++R  AEM QAEE              S L    G+  E  +  +   + E + +   ++
Sbjct: 238 HQREMAEMEQAEELLKRAPEREAVECSSLLDTLHGLEQEHLQRSLLLQQEEDFAKAHRQL 297

Query: 93  VAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRE 152
              +   +L  ++  +  +K              K E + E+A+++L+E  K Q  +   
Sbjct: 298 AVFQ-RVELHSIFFTQ--LKSA----------TFKGELKPEEAKSLLQEYSKIQETIE-- 342

Query: 153 LAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKD-IAD 211
                     MD     +R           H+ ++ A  ++ LV+  Q+ E+     +  
Sbjct: 343 --------ELMDFFQASQR----------YHLSERFAQ-REYLVKALQSQESRVSGLLNT 383

Query: 212 LETQLA 217
             TQL 
Sbjct: 384 AATQLT 389


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 32.9 bits (75), Expect = 1.2
 Identities = 18/97 (18%), Positives = 35/97 (36%), Gaps = 4/97 (4%)

Query: 109 TDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEIN 168
                 E +  +   +  +     E+A   LR  ++  GA+  E+     E R       
Sbjct: 882 ARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAMVDEIRARLAETRAALASGG 941

Query: 169 KKRPSLIKSKERVSH----IQKKLASAKKSLVEVRQA 201
           ++ P L ++           ++K A A  +L E  +A
Sbjct: 942 RELPRLAEALATAEEARGRAEEKRAEADATLDERAEA 978



 Score = 32.5 bits (74), Expect = 1.5
 Identities = 16/107 (14%), Positives = 37/107 (34%), Gaps = 10/107 (9%)

Query: 123 GEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVS 182
               +   R  +AE+  RE  ++      E  +    +R ++  +             V 
Sbjct: 875 TRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVG----------AMVD 924

Query: 183 HIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 229
            I+ +LA  + +L    +      + +A  E       +++AE +  
Sbjct: 925 EIRARLAETRAALASGGRELPRLAEALATAEEARGRAEEKRAEADAT 971


>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
           exopolysaccharide biosynthesis [Cell envelope
           biogenesis, outer membrane].
          Length = 458

 Score = 32.4 bits (74), Expect = 1.3
 Identities = 52/277 (18%), Positives = 95/277 (34%), Gaps = 69/277 (24%)

Query: 173 SLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIAD------LETQLADVRKRKAEY 226
           S +      S+  K  A    +L +   A++   +  A       L+ +L ++R R  E 
Sbjct: 155 SRVIELSYTSNDPKLAAKLANALAQAYLADQLEAQLEAFRRASDSLDERLEELRARLQEA 214

Query: 227 ERQ----------SIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVE 276
           E Q          +   R   L   Q   +N +   L  ++ R++  + +LAS  + L  
Sbjct: 215 EAQVEDFRAQHGLTDAARGQLLSEQQLSALNTQ---LQSARARLAQAEARLASLLQLLPL 271

Query: 277 VRQAN----EAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEA 332
            R+A        +  I DL  Q A VR++ A+                            
Sbjct: 272 GREAAALREVLESPTIQDLRQQYAQVRQQIADLST------------------------- 306

Query: 333 TKRAGKILQQLDTINREQKGDQDKLDNELRQ-QVQTQNEIKKKRHEMEEAQKRIDKLEDH 391
               G    QL  +  +    + ++  ELRQ      NE+                    
Sbjct: 307 --ELGAKHPQLVALEAQLAELRQQIAAELRQILASLPNELAL------------------ 346

Query: 392 IRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELE 428
           + Q EA+L+      +   S +   + +++EL++E E
Sbjct: 347 LEQQEAALEKELAQLKGRLSKLPKLQVQLRELEREAE 383


>gnl|CDD|224055 COG1132, MdlB, ABC-type multidrug transport system, ATPase and
            permease components [Defense mechanisms].
          Length = 567

 Score = 32.4 bits (74), Expect = 1.3
 Identities = 13/69 (18%), Positives = 27/69 (39%), Gaps = 1/69 (1%)

Query: 991  TIVSHRYHPAPFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSL 1050
             I        P  +LDE  +ALD      +   +    +    T++I+ +     +AD +
Sbjct: 475  AIARALLRNPPILILDEATSALDTETEALIQDALKKLLKGR-TTLIIAHRLSTIKNADRI 533

Query: 1051 VGICPGSVT 1059
            + +  G + 
Sbjct: 534  IVLDNGRIV 542



 Score = 30.1 bits (68), Expect = 7.4
 Identities = 14/77 (18%), Positives = 29/77 (37%), Gaps = 6/77 (7%)

Query: 872 IVLPMLRVQKYDRKLAKSIQEMTSRLQTIQAPNLRAMEKLEHAKENLMKTNEEFENARKR 931
           + L +  + +  RKL++ ++E    L      N R +E L   +       E+ E  R  
Sbjct: 165 LALVLSLLARKSRKLSRRVREALGEL------NARLLESLSGIRVIKAFGAEDRELKRFE 218

Query: 932 AKKAKANFDRIKKERYD 948
               +     ++  R +
Sbjct: 219 EANEELRRANLRASRLE 235


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 32.5 bits (74), Expect = 1.4
 Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 9/91 (9%)

Query: 75  AKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEK 134
            ++EKD  E      E+ + KE      +L      + + E+ L K+    E ++  ++ 
Sbjct: 364 ERLEKDVEEG-----EKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQD 418

Query: 135 AENILREKKKEQGA----LNRELAKVDQEIR 161
            EN L+E  KE+      L  E ++V++   
Sbjct: 419 EENFLKELSKEEKELLEKLKMEASEVEKLFG 449


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 32.3 bits (74), Expect = 1.4
 Identities = 23/103 (22%), Positives = 43/103 (41%), Gaps = 10/103 (9%)

Query: 338 KILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEA 397
            +    D +    K     +D +L + +   +E KK   E+EE Q          ++NE 
Sbjct: 6   LLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQA---------KRNEL 56

Query: 398 SLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTD 440
           S K   K K +    +   K  ++EL++EL ++   L   + +
Sbjct: 57  S-KQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKALEAE 98


>gnl|CDD|132062 TIGR03017, EpsF, chain length determinant protein EpsF.  Sequences
           in this family of proteins are members of the chain
           length determinant family (pfam02706) which includes the
           wzc protein from E.coli. This family of proteins are
           homologous to the EpsF protein of the methanolan
           biosynthesis operon of Methylobacillus species strain
           12S. The distribution of this protein appears to be
           restricted to a subset of exopolysaccharide operons
           containing a syntenic grouping of genes including a
           variant of the EpsH exosortase protein. Exosortase has
           been proposed to be involved in the targetting and
           processing of proteins containing the PEP-CTERM domain
           to the exopolysaccharide layer.
          Length = 444

 Score = 32.5 bits (74), Expect = 1.4
 Identities = 24/87 (27%), Positives = 39/87 (44%), Gaps = 12/87 (13%)

Query: 673 LQNTKSQIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRD 732
           +QN K+ IA+ E+++  L+ R     P+ K  +A +             + S  +   + 
Sbjct: 256 IQNLKTDIARAESKLAELSQRLGPNHPQYKRAQAEI-----------NSLKSQLNAEIKK 304

Query: 733 FCKSIG-VSTIRQYEEAELRSQQERQK 758
              S+G  S I +  EAELR   E QK
Sbjct: 305 VTSSVGTNSRILKQREAELREALENQK 331


>gnl|CDD|181567 PRK08861, PRK08861, cystathionine gamma-synthase; Provisional.
          Length = 388

 Score = 32.2 bits (73), Expect = 1.5
 Identities = 27/88 (30%), Positives = 43/88 (48%), Gaps = 9/88 (10%)

Query: 978  ENPEEPLTYRVSTTIVSHRYHPA-PFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIV 1036
            E+ ++ L Y  + ++V   YHP+ P     EI A    +  G + S+    + + L+  V
Sbjct: 258  ESAQQILAYLQTQSLVGTIYHPSLPEHPGHEI-AKKQQSGFGSMLSFEFAGSFEQLKVFV 316

Query: 1037 ISLKEEFFSHADSLVG----IC-PGSVT 1059
             +L    FS A+SL G    IC P S+T
Sbjct: 317  KAL--ALFSLAESLGGVESLICHPASMT 342


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 32.3 bits (73), Expect = 1.6
 Identities = 51/396 (12%), Positives = 141/396 (35%), Gaps = 25/396 (6%)

Query: 80  DEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENIL 139
              + Y+   +E+     E  +  +  +  ++K   D L +K  E  KI ++ +      
Sbjct: 246 KLEDNYEPSEQELKLG-FEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIKTLREKW 304

Query: 140 REKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVR 199
           R  K +       +  + Q+ +E   ++ K +  +   +E +  +Q  +    K L +  
Sbjct: 305 RALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQG 364

Query: 200 QANEAHNKDIADLETQLADVRKRKAEYE--RQSIPGRDINLESAQDVEINKKRPSLIKSK 257
            + E       + E    ++ K   + +   +S+  R +  +      + K         
Sbjct: 365 ISTEQFELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFK-SLEKTLRQYDSLI 423

Query: 258 ERVSHIQKKLASAKK--SLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDI 315
           + ++  + ++       SL    +        I +   +       + +   ++   + I
Sbjct: 424 QNITRSRSQIGHNVNDSSLKINIEQLFPKGSGINESIKKSILELNDEIQERIKTEENKSI 483

Query: 316 NLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKR 375
            LE   +    +   E T+   K+  +L   N              ++  + + E+  +R
Sbjct: 484 TLEED-IKNLKHDINELTQILEKLELELSEAN--------SKFELSKE--ENERELVAQR 532

Query: 376 HEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELG 435
            E+E+ +K ++ L      ++ S+ D ++L +     +   K  +   + ++ + +  + 
Sbjct: 533 IEIEKLEKELNDLN---LLSKTSILDAEQLVQSTEIKLDELKVDLNRKRYKIHKQVIHVI 589

Query: 436 DAKTDKHEDTRRKKKQELVENFKKAYSGVYDRMINM 471
           D  +    +      Q  +E+ +     V + + N+
Sbjct: 590 DITSKFKINI-----QSSLEDLENELGKVIEELRNL 620



 Score = 31.9 bits (72), Expect = 2.5
 Identities = 24/84 (28%), Positives = 42/84 (50%), Gaps = 11/84 (13%)

Query: 623 KEMGNLKAQKEKLSEELREAMKKSRK-----------ESELNTVQSTIKGLEIRLNYSRQ 671
            ++ NLK Q + L E+++EAMK S+K           +S+ N  ++ +  ++ +      
Sbjct: 271 TDIANLKTQNDNLYEKIQEAMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPG 330

Query: 672 DLQNTKSQIAKLEAEIDALNARAD 695
            L+  KS+I   E EI AL +  D
Sbjct: 331 KLEKLKSEIELKEEEIKALQSNID 354


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 32.6 bits (75), Expect = 1.6
 Identities = 71/419 (16%), Positives = 159/419 (37%), Gaps = 74/419 (17%)

Query: 84   KYQRIREEIV-AKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKA------- 135
            +Y R  E  +  +    +  +    E +  EL +       +V+K++R  +         
Sbjct: 769  RYSRFPEVPLFGRAAREKRIEQLRAERE--ELAERYATLSFDVQKLQRLHQAFSRFIGSH 826

Query: 136  ---------ENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQK 186
                     E  LR+  + +  L R LA  + + ++        R  L ++KE +S + +
Sbjct: 827  LAVAFEADPEAELRQLNRRRVELERALADHESQEQQQ-------RSQLEQAKEGLSALNR 879

Query: 187  KLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEI 246
             L     +L+      E     + ++  QL +  + K          R +        ++
Sbjct: 880  LLPRL--NLLAD----ETLADRVEEIREQLDEAEEAK----------RFVQQHGNALAQL 923

Query: 247  NKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE-- 304
                  L    E+   +++    A+++  + +Q   A           L +V +R+A   
Sbjct: 924  EPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAFA-----------LTEVVQRRAHFS 972

Query: 305  YERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQ-KGDQDKLDNELRQ 363
            YE           ++A+M    +   E  ++  +  +Q  T  REQ +  Q +L    + 
Sbjct: 973  YE-----------DAAEMLAKNSDLNEKLRQRLEQAEQERTRAREQLRQAQAQLAQYNQV 1021

Query: 364  QVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQEL 423
                ++    KR  ++E ++ +  L   +  +  + +  +  ++EL++ + ++++R  +L
Sbjct: 1022 LASLKSSYDAKRQMLQELKQELQDLG--VPADSGAEERARARRDELHARLSANRSRRNQL 1079

Query: 424  QKELEQVIEELGDAKTDKHEDTRRKKKQ--ELVENFKKAYSGVYDRMIN--MCHPVHKR 478
            +K+L    E   D  T K     R   +  E V N K  +  V   + +  +   +H+R
Sbjct: 1080 EKQL-TFCEAEMDNLTKKLRKLERDYHEMREQVVNAKAGWCAVLRLVKDNGVERRLHRR 1137


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 32.3 bits (73), Expect = 1.6
 Identities = 28/140 (20%), Positives = 64/140 (45%), Gaps = 5/140 (3%)

Query: 109 TDIKELEDELDKKKGEVEKIE--RRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE 166
           TD+KE E + D K+ +  K E  +++  A+   ++    Q   +++  +V Q+ +E    
Sbjct: 201 TDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNL 260

Query: 167 INKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA-DVRKRKAE 225
                 S  K  ++V+  QK+    +K+ +E+++ +E   K        L  + +  + E
Sbjct: 261 PKPADTSSPKEDKQVAENQKR--EIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKE 318

Query: 226 YERQSIPGRDINLESAQDVE 245
            E + +  +      A+D++
Sbjct: 319 AEDKELEAQKKREPVAEDLQ 338



 Score = 31.9 bits (72), Expect = 2.0
 Identities = 29/133 (21%), Positives = 50/133 (37%), Gaps = 11/133 (8%)

Query: 343 LDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDN 402
            D   RE + D  +      +  + Q +  K + + + AQ   DK  D +RQ     ++ 
Sbjct: 201 TDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQ---KQQEA 257

Query: 403 KKLKEELNSDVGSSKNRVQE--------LQKELEQVIEELGDAKTDKHEDTRRKKKQELV 454
           K L +  ++       +V E         Q E+++  EE   AK  K  D +++ K    
Sbjct: 258 KNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEK 317

Query: 455 ENFKKAYSGVYDR 467
           E   K       R
Sbjct: 318 EAEDKELEAQKKR 330



 Score = 31.5 bits (71), Expect = 2.8
 Identities = 28/145 (19%), Positives = 58/145 (40%), Gaps = 11/145 (7%)

Query: 64  KKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKG 123
            +KGV   R    +++ E+++  + R + + +E++ +       +      +D  DK++ 
Sbjct: 190 NEKGVNFRRDMTDLKERESQEDAK-RAQQLKEELDKKQIDADKAQQKADFAQDNADKQRD 248

Query: 124 EVEKIERRKEKAENILR-----EKKKEQGALNRELAKVDQEIREMDVEINKKRPSLI--- 175
           EV + ++  +            E K+      RE+ K   EI++ D E  K +       
Sbjct: 249 EVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDL 308

Query: 176 --KSKERVSHIQKKLASAKKSLVEV 198
             +SK      + K   A+K    V
Sbjct: 309 KQESKASEKEAEDKELEAQKKREPV 333


>gnl|CDD|235840 PRK06599, PRK06599, DNA topoisomerase I; Validated.
          Length = 675

 Score = 32.5 bits (75), Expect = 1.6
 Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 17/80 (21%)

Query: 69  VAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLF--KLYH------------NETDIKEL 114
           + ER E +IE    ++Y  I  ++       + F  KL              NE   K L
Sbjct: 178 ICER-EDEIEAFIPQEYWTIEADLATSN--GEPFTAKLVEVNGKKLEKFSITNEEQAKAL 234

Query: 115 EDELDKKKGEVEKIERRKEK 134
              L+ +   V+KIE+++ K
Sbjct: 235 VKALEGQAYTVDKIEKKERK 254


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 32.2 bits (73), Expect = 1.6
 Identities = 29/123 (23%), Positives = 59/123 (47%), Gaps = 12/123 (9%)

Query: 45  ERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKL 104
           ERL +E+  AE+E +      K  +++ +   +E    E      +E+  KE E  L +L
Sbjct: 32  ERLDSELRDAEKERDTY----KQYLSKLESQNVEISNYEALDSELDEL-KKEEERLLDEL 86

Query: 105 YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMD 164
                  +ELE E D   GE+ +++  KE+ EN   +  +E    +R   +++  ++ ++
Sbjct: 87  -------EELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQSLE 139

Query: 165 VEI 167
           ++ 
Sbjct: 140 LQY 142


>gnl|CDD|144876 pfam01442, Apolipoprotein, Apolipoprotein A1/A4/E domain.  These
           proteins contain several 22 residue repeats which form a
           pair of alpha helices. This family includes:
           Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein
           E.
          Length = 191

 Score = 31.4 bits (72), Expect = 1.7
 Identities = 28/149 (18%), Positives = 61/149 (40%), Gaps = 5/149 (3%)

Query: 322 MTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQ-NEIKKK-RHEME 379
             +  +++A       ++  ++     E +        ELR+++     E+++K     E
Sbjct: 44  QKDLEDVRARLQPYLDELKAKVGQNLEELRQRLAPYAEELRKRLNRDAEELRRKLAPYAE 103

Query: 380 EAQKRIDKLEDHIRQNEASLKDN--KKLKEELNSDVGSSKNRVQELQKELEQVIEELGDA 437
           E + R+ +  + +R       +   +KL E L     S     +E + +L + +EEL + 
Sbjct: 104 ELRDRLRQNVEALRARLGPYVEELRQKLAERLEELKESVGPYAEEYKAQLSEQVEELREK 163

Query: 438 KTDKHEDTRRKKKQELVENFKKAYSGVYD 466
                ED R K +    E+ KK  + + +
Sbjct: 164 LEPYAEDLREKLEP-YAEDLKKQLAQLLE 191


>gnl|CDD|218839 pfam05983, Med7, MED7 protein.  This family consists of several
           eukaryotic proteins which are homologues of the yeast
           MED7 protein. Activation of gene transcription in
           metazoans is a multi-step process that is triggered by
           factors that recognise transcriptional enhancer sites in
           DNA. These factors work with co-activators such as MED7
           to direct transcriptional initiation by the RNA
           polymerase II apparatus.
          Length = 161

 Score = 31.1 bits (71), Expect = 1.8
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILR 140
           E+ I  +E++LD+K+ E+E I +  E+   +L 
Sbjct: 129 ESLIMLMEEQLDEKREEIEAIRKVLEEVREVLE 161


>gnl|CDD|226844 COG4420, COG4420, Predicted membrane protein [Function unknown].
          Length = 191

 Score = 31.2 bits (71), Expect = 1.9
 Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 9/87 (10%)

Query: 150 NRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDI 209
           NR+ A+ D+   E+D ++N      +K+++ V+ + +KL   +  L  VR   +   + +
Sbjct: 112 NRQ-AERDRLRAELDYQVN------LKAEQEVAALHEKLDELRLDLGYVRDELDDLRELL 164

Query: 210 ADLETQLADVRKRKA--EYERQSIPGR 234
           A++E +LAD    +      + S P R
Sbjct: 165 AEIEPELADEEALRRAVRGGQHSKPER 191



 Score = 30.5 bits (69), Expect = 3.1
 Identities = 18/78 (23%), Positives = 37/78 (47%), Gaps = 8/78 (10%)

Query: 238 LESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLAD 297
           L +  D ++N      +K+++ V+ + +KL   +  L  VR   +   + +A++E +LAD
Sbjct: 120 LRAELDYQVN------LKAEQEVAALHEKLDELRLDLGYVRDELDDLRELLAEIEPELAD 173

Query: 298 VRKRKA--EYERQSIPGR 313
               +      + S P R
Sbjct: 174 EEALRRAVRGGQHSKPER 191


>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 1104

 Score = 31.9 bits (72), Expect = 2.1
 Identities = 43/254 (16%), Positives = 88/254 (34%), Gaps = 44/254 (17%)

Query: 215 QLADVRKRKAEYERQSIPGRD-INLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKS 273
           Q  D   R  + +R+ +  R    L S  D ++   R ++     R      K+   +  
Sbjct: 586 QTGDPTTRWEKDDRRKLGDRSTYRLGSTNDAKVETLRETVKAMLSREDFYMIKIMRQQGE 645

Query: 274 LVEVRQANEAHNKDIA------DLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTN 327
            +++++   A     A      DL +    + + +A  ER +    DI +  A +     
Sbjct: 646 YIKLQEQANALAHIQALNFASIDLPSAQRQIAELQARLERLTHTQSDIAIAKAALDAAQT 705

Query: 328 L--------KAEAT---------KRAGKILQQLDTINREQKGDQDK-------LDNELRQ 363
                    + E T         KRA  + +++ +I ++      +             Q
Sbjct: 706 RQKVLERQYQQEVTECAGLKKDLKRAAMLSRKVHSIAKQGMTGALQALGAAHFPQVAPEQ 765

Query: 364 QVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQ-NEASLKDNKKLKEELNS----------D 412
                 +I++  H   + QKRID +   +R+  E  +      K+E  +          D
Sbjct: 766 HDDIV-DIERIEHR-RQLQKRIDAVNARLRRLREEIIGRMSDAKKEDTAALSEVGAELDD 823

Query: 413 VGSSKNRVQELQKE 426
           +     R+Q L ++
Sbjct: 824 IPEYLARLQTLTED 837


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 31.9 bits (73), Expect = 2.2
 Identities = 34/143 (23%), Positives = 69/143 (48%), Gaps = 16/143 (11%)

Query: 117 ELDKKKGEVEKIERRKEKAENILREK-KKEQGALNRE-LAKVDQEIREMDVEINKKRPSL 174
           E+D K  E+++++RR  + E I RE  KKE+   ++E L  +++E+ E++ E        
Sbjct: 399 EIDSKPEELDELDRRIIQLE-IEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQW 457

Query: 175 IKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGR 234
              K  +  IQ+     K+ + +VR   E   ++  DL         + AE +   +P  
Sbjct: 458 KAEKAAIQGIQQ----IKEEIEQVRLELEQAERE-GDLA--------KAAELQYGKLPEL 504

Query: 235 DINLESAQDVEINKKRPSLIKSK 257
           +  L++A+     + +P L++ +
Sbjct: 505 EKRLQAAEAKLGEETKPRLLREE 527


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 31.1 bits (71), Expect = 2.2
 Identities = 22/91 (24%), Positives = 51/91 (56%), Gaps = 3/91 (3%)

Query: 370 EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQ 429
            + K +  +E+ +K +++L+  I + +A ++  KK +EE   +       +++L+KEL++
Sbjct: 63  ALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREE-TEERTELLEELKQLEKELKK 121

Query: 430 VIEELGDAKTDKHEDTRRKKKQELVENFKKA 460
           +  EL   +  K++  R +K +E  +  K+A
Sbjct: 122 LKAELEKYE--KNDPERIEKLKEETKVAKEA 150


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 31.5 bits (72), Expect = 2.2
 Identities = 17/57 (29%), Positives = 27/57 (47%), Gaps = 3/57 (5%)

Query: 628 LKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLE 684
           L   KEKL + L+E M K +K  EL      ++ LE ++           ++IA+ E
Sbjct: 206 LDRAKEKLKKLLQEIMIKVKKLEELEE---ELQELESKIEDLTNKKSELNTEIAEAE 259



 Score = 31.1 bits (71), Expect = 2.9
 Identities = 27/124 (21%), Positives = 60/124 (48%), Gaps = 10/124 (8%)

Query: 310 IPGRDINLESAQMTEYTNLKAEATK---RAGKILQQLDTINREQKGDQDKLDNELRQQVQ 366
             G D NLE  +  +Y  L  E         K+  + D +  E      +L++EL     
Sbjct: 146 KEGLDENLEGLK-EDYKLLMKELELLNSIKPKLRDRKDALEEEL-RQLKQLEDELEDCDP 203

Query: 367 T-----QNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQ 421
           T     + ++KK   E+    K++++LE+ +++ E+ ++D    K ELN+++  ++ +++
Sbjct: 204 TELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLE 263

Query: 422 ELQK 425
           + + 
Sbjct: 264 QCRG 267



 Score = 30.8 bits (70), Expect = 3.6
 Identities = 26/136 (19%), Positives = 63/136 (46%), Gaps = 7/136 (5%)

Query: 315 INLESAQMTEYTNLKAEAT------KRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQ 368
           ++ +   +  +  L+A+        K    + + LD      K D   L  EL      +
Sbjct: 115 MDKQFQLVKTFARLEAKKMWYEWRMKLLEGLKEGLDENLEGLKEDYKLLMKELELLNSIK 174

Query: 369 NEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLK-EELNSDVGSSKNRVQELQKEL 427
            +++ ++  +EE  +++ +LED +   + +  D  K K ++L  ++     +++EL++EL
Sbjct: 175 PKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEEL 234

Query: 428 EQVIEELGDAKTDKHE 443
           +++  ++ D    K E
Sbjct: 235 QELESKIEDLTNKKSE 250



 Score = 30.4 bits (69), Expect = 5.1
 Identities = 16/107 (14%), Positives = 52/107 (48%), Gaps = 9/107 (8%)

Query: 110 DIKELEDELDKKKGEVEKIERRKE--------KAENILREKKKEQGALNRELAKVDQEIR 161
            +++ +D L+++  +++++E   E        +A+  L++  +E     ++L ++++E++
Sbjct: 176 KLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQ 235

Query: 162 EMDVEINKKRPSLIKSKERVSHIQKKLASAKK-SLVEVRQANEAHNK 207
           E++ +I        +    ++  +KKL   +  +  E+ +  E    
Sbjct: 236 ELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTFKEIEKLKEQLKL 282


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 30.3 bits (69), Expect = 2.3
 Identities = 26/105 (24%), Positives = 52/105 (49%), Gaps = 7/105 (6%)

Query: 204 AHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHI 263
             ++D+   E+    +RK +AE ER       I     Q +E  ++  +L+++KER   +
Sbjct: 44  QRDRDLEFRESLEETLRKLEAEIERLQN---TIERLKTQ-LEDLERELALLQAKER--QL 97

Query: 264 QKKLASAKKSLVEVRQANEAHNKDIADLETQLA-DVRKRKAEYER 307
           +KKL + ++ L   ++  +     I   +TQ   +++KR  E E+
Sbjct: 98  EKKLKTLEQKLKNEKEEVQRLKNIIQQRKTQYNHELKKRDREIEK 142



 Score = 30.3 bits (69), Expect = 2.3
 Identities = 27/110 (24%), Positives = 54/110 (49%), Gaps = 9/110 (8%)

Query: 283 AHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKA---EATKRAGKI 339
             ++D+   E+    +RK +AE ER       I     Q+ +     A      ++  K 
Sbjct: 44  QRDRDLEFRESLEETLRKLEAEIERLQN---TIERLKTQLEDLERELALLQAKERQLEKK 100

Query: 340 LQQLDTINREQKGDQDKLDNELRQQVQTQ--NEIKKKRHEMEEAQKRIDK 387
           L+ L+   + +K +  +L N + QQ +TQ  +E+KK+  E+E+ ++R+ +
Sbjct: 101 LKTLEQKLKNEKEEVQRLKNII-QQRKTQYNHELKKRDREIEKLKERLLQ 149


>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
           Provisional.
          Length = 475

 Score = 31.7 bits (72), Expect = 2.3
 Identities = 10/59 (16%), Positives = 22/59 (37%), Gaps = 7/59 (11%)

Query: 649 ESELNTVQSTIKGLEIRLNYSRQDLQ-------NTKSQIAKLEAEIDALNARADATEPK 700
           +     +Q T   ++ +    R++L        + + +I KL  +  AL  +  A    
Sbjct: 68  QHATTEMQVTAAQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKALGAN 126


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 31.9 bits (72), Expect = 2.4
 Identities = 76/399 (19%), Positives = 159/399 (39%), Gaps = 44/399 (11%)

Query: 55   EEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIRE--EIVAKEVEHQLFKLYHNETDIK 112
            E E++M+ + K  VV+E + +  E++  +  + + +  E ++  +  +L+    NE D+ 
Sbjct: 3859 ELESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWD-EPNEEDLL 3917

Query: 113  ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRP 172
            E E + +++     + +   ++ +N   E K  Q        K D+E    DV I+ +  
Sbjct: 3918 ETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQ-------EKEDEEEMSDDVGIDDE-- 3968

Query: 173  SLIKSKERVSHIQKKLASAKKSLVEVRQANEAH-----NKDIADLETQLADVRKRKAEYE 227
              I+   + ++ Q    +    L E  + +E       + D+ D++ + AD  K +A+ E
Sbjct: 3969 --IQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAE 4026

Query: 228  RQSIPGRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKD 287
            +   P +D   E    +E N      I+  +     +      +    E  Q NE   +D
Sbjct: 4027 KDE-PMQD---EDP--LEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTED 4080

Query: 288  IADLETQLADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRA---GKILQQLD 344
                + +L    ++    E Q+I             + T   AEA +     G + +  +
Sbjct: 4081 GVKSDEEL----EQGEVPEDQAIDN-----HPKMDAKSTFASAEADEENTDKGIVGENEE 4131

Query: 345  TINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIR--QNEASLKDN 402
                +        D E  Q    Q +    +  M EA ++   L DH+R  Q    + + 
Sbjct: 4132 LGEEDGVRGNGTADGEFEQ---VQEDTSTPKEAMSEADRQYQSLGDHLREWQQANRIHEW 4188

Query: 403  KKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDK 441
            + L E          +    ++++ E+ ++ LG+A+ D+
Sbjct: 4189 EDLTES--QSQAFDDSEFMHVKEDEEEDLQALGNAEKDQ 4225


>gnl|CDD|224034 COG1109, {ManB}, Phosphomannomutase [Carbohydrate transport and
           metabolism].
          Length = 464

 Score = 31.5 bits (72), Expect = 2.6
 Identities = 17/113 (15%), Positives = 35/113 (30%), Gaps = 24/113 (21%)

Query: 623 KEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAK 682
           K   +++       E   E ++K  +E        TI G+++ L                
Sbjct: 374 KYPQSVEINVRVTDEGKAEVLEKLLEELREAKKVDTIDGVKVEL---------------- 417

Query: 683 LEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEEMNSVEDIVFRDFCK 735
              +   +  R   TEP I+     +          +E    + ++V R+   
Sbjct: 418 --EDGGRVLVRPSGTEPLIR-----VYVEAKDEELAEELAEEIAELV-REALG 462


>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
          Length = 1213

 Score = 31.8 bits (72), Expect = 2.8
 Identities = 36/228 (15%), Positives = 79/228 (34%), Gaps = 23/228 (10%)

Query: 77  IEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAE 136
            +    +    ++  I       ++      E   K  +D L   KG+ E +    EK +
Sbjct: 16  NDTIAVKNINVLKSSIKDSTQFWKM-----LEKQQKLTKDGLSASKGKYEGLSEAMEKQK 70

Query: 137 NILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLV 196
               + K+E   +NR      Q  ++   E N           + +  + KL S      
Sbjct: 71  KAYEDLKQEVKEVNRAT----QASKKAYQEYNA----------QYTQAENKLRSLSGQFG 116

Query: 197 EVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKS 256
              +      K+I  L+  ++ + K  A   +  +  +  N     D ++   R S  + 
Sbjct: 117 VASEQLMLQQKEIQRLQYAISTLNKSMA--AQARLLEQTGNKFGTADAKVVGLRESFGRQ 174

Query: 257 KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 304
            E ++  ++   + K +       +EA  K    L  +L  +++ + +
Sbjct: 175 TEALN--KQLERTKKVADALTYVLDEAQQKLSQALSARLERLQESRTQ 220


>gnl|CDD|224046 COG1121, ZnuC, ABC-type Mn/Zn transport systems, ATPase component
          [Inorganic ion transport and metabolism].
          Length = 254

 Score = 31.0 bits (71), Expect = 2.9
 Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 9/40 (22%)

Query: 7  YIEVDNF-KSYKGK-------FSIGPLKKFTAVIGPNGSG 38
           IEV+N   SY  +        S+    + TA+IGPNG+G
Sbjct: 4  MIEVENLTVSYGNRPVLEDISLSVEK-GEITALIGPNGAG 42


>gnl|CDD|217011 pfam02388, FemAB, FemAB family.  The femAB operon codes for two
           nearly identical approximately 50-kDa proteins involved
           in the formation of the Staphylococcal pentaglycine
           interpeptide bridge in peptidoglycan. These proteins are
           also considered as a factor influencing the level of
           methicillin resistance.
          Length = 408

 Score = 31.1 bits (71), Expect = 3.0
 Identities = 17/71 (23%), Positives = 38/71 (53%), Gaps = 2/71 (2%)

Query: 105 YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMD 164
            + +  + EL+++L+K + ++ K+E + EK  N  ++ K  +  L ++LA +++ I E  
Sbjct: 238 LNFKEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAE--LEQQLASLEKRIDEAK 295

Query: 165 VEINKKRPSLI 175
             I K    + 
Sbjct: 296 ELIAKYGNEVP 306


>gnl|CDD|217963 pfam04203, Sortase, Sortase family.  The founder member of this
            family is S.aureus sortase, a transpeptidase that
            attaches surface proteins by the threonine of an LPXTG
            motif to the cell wall.
          Length = 124

 Score = 29.9 bits (68), Expect = 3.1
 Identities = 14/43 (32%), Positives = 18/43 (41%), Gaps = 3/43 (6%)

Query: 990  TTIVSHRYHPAPFFVLDEI---DAALDNTNIGKVASYIVTKTQ 1029
            T I  HR     F  LD++   D     T  GK  +Y VT  +
Sbjct: 41   TVIAGHRGGGTMFSDLDKLKKGDKIYVTTLDGKTYTYRVTSVE 83


>gnl|CDD|234806 PRK00635, PRK00635, excinuclease ABC subunit A; Provisional.
          Length = 1809

 Score = 31.3 bits (71), Expect = 3.4
 Identities = 21/77 (27%), Positives = 31/77 (40%), Gaps = 3/77 (3%)

Query: 984  LTYRVSTTIVSHRYHPA---PFFVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLK 1040
            L+ +++  I    Y P      F+LDEI  +LDN     +   + T        I I   
Sbjct: 1702 LSEKIAIKIAKFLYLPPKHPTLFLLDEIATSLDNQQKSALLVQLRTLVSLGHSVIYIDHD 1761

Query: 1041 EEFFSHADSLVGICPGS 1057
                  AD L+ + PGS
Sbjct: 1762 PALLKQADYLIEMGPGS 1778


>gnl|CDD|224300 COG1382, GimC, Prefoldin, chaperonin cofactor [Posttranslational
           modification, protein turnover, chaperones].
          Length = 119

 Score = 29.6 bits (67), Expect = 3.5
 Identities = 29/112 (25%), Positives = 55/112 (49%), Gaps = 8/112 (7%)

Query: 45  ERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKL 104
           +   A++ Q +++     L+K+ + A+ KE +   +E EK     +  V K+V + L K 
Sbjct: 9   QAQLAQLQQLQQQLQKVILQKQQLEAQLKEIEKALEELEKLDE--DAPVYKKVGNLLVK- 65

Query: 105 YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKE-QGALNRELAK 155
                  +E  DEL+++K  +E   +  EK E  L+E+ +E Q  + + L  
Sbjct: 66  ----VSKEEAVDELEERKETLELRIKTLEKQEEKLQERLEELQSEIQKALGD 113


>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component.  Nup88 can be divided
           into two structural domains; the N-terminal two-thirds
           of the protein has no obvious structural motifs but is
           the region for binding to Nup98, one of the components
           of the nuclear pore. the C-terminal end is a predicted
           coiled-coil domain. Nup88 is overexpressed in tumour
           cells.
          Length = 717

 Score = 31.4 bits (71), Expect = 3.5
 Identities = 33/192 (17%), Positives = 73/192 (38%), Gaps = 29/192 (15%)

Query: 53  QAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIK 112
           +A +     YL K  +  E  + +++  + +K +++ +          +         + 
Sbjct: 543 RATQVFREQYLLKHDLAREEFQRRVKLLQLQKEKQLED----------IQDCREERKSLS 592

Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVE---INK 169
           E  ++L       EK E  K   E ++   K+   + N +L  +    R+M  E   INK
Sbjct: 593 ERAEKL------AEKFEEAKYNQELLVNRCKRLLQSANSQLPVLSDSERDMSKELQRINK 646

Query: 170 KRPSLI----KSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAE 225
           +   L     + K++ ++ +  +AS +          E  +K I ++  +L +       
Sbjct: 647 QLQHLANGIKQVKKKKNYQRYHMASQESPKKSSYTLPEKQHKTITEILKELGE------H 700

Query: 226 YERQSIPGRDIN 237
            +R     + IN
Sbjct: 701 IDRMIKQIKRIN 712


>gnl|CDD|236729 PRK10636, PRK10636, putative ABC transporter ATP-binding protein;
           Provisional.
          Length = 638

 Score = 31.3 bits (71), Expect = 3.5
 Identities = 33/129 (25%), Positives = 61/129 (47%), Gaps = 17/129 (13%)

Query: 181 VSHIQKKLASAKKSLVEVRQAN-EAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLE 239
           VSH +  L S    L  V     E  + D+ D +  L+DV+K++    +     ++ N  
Sbjct: 483 VSHDRHLLRSTTDDLYLVHDGKVEPFDGDLEDYQQWLSDVQKQEN---QTDEAPKENNAN 539

Query: 240 SAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADV- 298
           SAQ  +  K+R + ++++ +   ++K++A  +K +       E  N  +A  E +L D  
Sbjct: 540 SAQARKDQKRREAELRTQTQ--PLRKEIARLEKEM-------EKLNAQLAQAEEKLGDSE 590

Query: 299 ---RKRKAE 304
              + RKAE
Sbjct: 591 LYDQSRKAE 599



 Score = 30.5 bits (69), Expect = 6.3
 Identities = 45/189 (23%), Positives = 74/189 (39%), Gaps = 52/189 (27%)

Query: 260 VSHIQKKLASAKKSLVEVRQAN-EAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLE 318
           VSH +  L S    L  V     E  + D+ D +  L+DV+K++    +     ++ N  
Sbjct: 483 VSHDRHLLRSTTDDLYLVHDGKVEPFDGDLEDYQQWLSDVQKQEN---QTDEAPKENNAN 539

Query: 319 SAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEM 378
           SAQ                             + DQ + + ELR Q Q        R E+
Sbjct: 540 SAQ----------------------------ARKDQKRREAELRTQTQPL------RKEI 565

Query: 379 EEAQKRIDKLEDHIRQNEASLKDN----KKLKEELN---SDVGSSKNRVQE-------LQ 424
              +K ++KL   + Q E  L D+    +  K EL        S+K+ ++E        Q
Sbjct: 566 ARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQ 625

Query: 425 KELEQVIEE 433
           ++LEQ++ E
Sbjct: 626 EQLEQMLLE 634


>gnl|CDD|224031 COG1106, COG1106, Predicted ATPases [General function prediction
          only].
          Length = 371

 Score = 31.0 bits (70), Expect = 3.6
 Identities = 9/34 (26%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 5  LQYIEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
          ++  ++ NFKS++ +  +    K   + G NG+G
Sbjct: 2  IKSFKIKNFKSFR-ELELEDFGKINIIYGANGAG 34


>gnl|CDD|222036 pfam13304, AAA_21, AAA domain. 
          Length = 256

 Score = 30.5 bits (68), Expect = 3.7
 Identities = 28/185 (15%), Positives = 56/185 (30%), Gaps = 12/185 (6%)

Query: 28  FTAVIGPNGSG------------ALKDDYERLKAEMIQAEEETNMSYLKKKGVVAERKEA 75
              +IGPNGSG             L      L   +    +       + +  +    E 
Sbjct: 1   LNVIIGPNGSGKSNLLKALALLLLLLSLGLTLDRGLNVGIKLLPFLLDENEIEIPLEFEI 60

Query: 76  KIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKA 135
           +    +  +Y+   E      +E  L++    E  + E E   +  +   EK    +   
Sbjct: 61  EEFLIDGIRYRYGFELDKEDILEELLYEYRKGEELLFERERSKESFEKSPEKKRELRGLR 120

Query: 136 ENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSL 195
           E +L         L     ++   I      I     +L   + ++  + K+L      L
Sbjct: 121 EVLLLLNLSLSSFLLLASLEILLSILLPFSFILGNLRNLRNIELKLLDLVKRLFLESDLL 180

Query: 196 VEVRQ 200
             ++ 
Sbjct: 181 RLLKL 185


>gnl|CDD|213205 cd03238, ABC_UvrA, ATP-binding cassette domain of the excision repair
            protein UvrA.  Nucleotide excision repair in eubacteria
            is a process that repairs DNA damage by the removal of a
            12-13-mer oligonucleotide containing the lesion.
            Recognition and cleavage of the damaged DNA is a
            multistep ATP-dependent reaction that requires the UvrA,
            UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases,
            with UvrA having two ATP binding sites, which have the
            characteristic signature of the family of ABC proteins,
            and UvrB having one ATP binding site that is structurally
            related to that of helicases.
          Length = 176

 Score = 30.0 bits (68), Expect = 4.0
 Identities = 15/55 (27%), Positives = 23/55 (41%)

Query: 1003 FVLDEIDAALDNTNIGKVASYIVTKTQDSLQTIVISLKEEFFSHADSLVGICPGS 1057
            F+LDE    L   +I ++   I          I+I    +  S AD ++   PGS
Sbjct: 111  FILDEPSTGLHQQDINQLLEVIKGLIDLGNTVILIEHNLDVLSSADWIIDFGPGS 165


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 31.0 bits (70), Expect = 4.0
 Identities = 25/124 (20%), Positives = 53/124 (42%), Gaps = 13/124 (10%)

Query: 346 INREQKGDQD-KLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKK 404
           I   +      +L+++++  +   +  KK   ++++  K+ +K +D     +  LK  KK
Sbjct: 23  IKNLKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKK 82

Query: 405 LKEELNSD-----VGSSKNRVQELQKELEQVIE-------ELGDAKTDKHEDTRRKKKQE 452
           +K++L+ D        SKN  ++    L   +        E    K   ++ T +KKK+ 
Sbjct: 83  IKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142

Query: 453 LVEN 456
           L   
Sbjct: 143 LSSK 146



 Score = 30.2 bits (68), Expect = 6.8
 Identities = 19/93 (20%), Positives = 38/93 (40%), Gaps = 3/93 (3%)

Query: 104 LYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNREL--AKVDQEIR 161
           L + + + K   ++LDKK  +  K +  K+  +  L++KKK +  L+ +           
Sbjct: 43  LLNLDINKKL-HEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKN 101

Query: 162 EMDVEINKKRPSLIKSKERVSHIQKKLASAKKS 194
                      SL++  +  +   KK  +  KS
Sbjct: 102 NSKQFAGPLAISLMRKPKPKTEKLKKKITVNKS 134


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 30.8 bits (70), Expect = 4.1
 Identities = 22/95 (23%), Positives = 42/95 (44%), Gaps = 11/95 (11%)

Query: 317 LESAQMTEYTNLKAEATKRAG--------KILQQLDTINREQKGDQDKLDNELRQQVQTQ 368
           L + Q  E    + EA K+          KI ++ D + +E K D+ K+    +  +Q  
Sbjct: 187 LPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKI-KSAKNDLQND 245

Query: 369 -NEIKKKRHEMEEA-QKRIDKLEDHIRQNEASLKD 401
             ++K    E+++A Q  + +LE+        LK+
Sbjct: 246 KKQLKADLAELKKAPQNDLKRLENKYAIKSGDLKN 280



 Score = 30.8 bits (70), Expect = 5.0
 Identities = 24/113 (21%), Positives = 47/113 (41%), Gaps = 8/113 (7%)

Query: 111 IKELEDELDKKKGEVEKIERRKEKAENILREKKK--EQGALNRELAKVDQEIREMDVEIN 168
              L+ E  K     E+IE+  +  +   +++KK         E  K  + I++ D+   
Sbjct: 152 RALLKGEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDI--- 208

Query: 169 KKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRK 221
           K    L K KE    ++K+  + K+ +   +        D   L+  LA+++K
Sbjct: 209 KNPLELQKIKEEFDKLKKEGKADKQKIKSAKND---LQNDKKQLKADLAELKK 258


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 31.1 bits (70), Expect = 4.1
 Identities = 51/307 (16%), Positives = 113/307 (36%), Gaps = 18/307 (5%)

Query: 128 IERRKEKAENILREKKKEQGALNRELAKVD-QEIREMDVEINKKRPSLIKSKERVSHIQK 186
            +    +   IL + + E  +L R     D  E   M   + ++  ++I+  + +     
Sbjct: 448 EKHIISEISEILEDIEAELDSLERSSNGDDENEEVAMVGSLVEESSAVIERSQELEGAVS 507

Query: 187 KLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEI 246
           +++     L       +      ADLE  + +          Q    +D+++E++   EI
Sbjct: 508 RISEFTSVLEHEVTVCQDLLDGKADLEKFIQEFSLTLEWVVNQEKSLQDVSVEAS---EI 564

Query: 247 NKKRPSL-IKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEY 305
            K    L    KE  S  + K A    +L E+            +L     D+ + K + 
Sbjct: 565 KKNFLGLKSSEKEINSPDEVKGAVCISTLEELETLKSEKENLDGELSKCKDDLEESKNKL 624

Query: 306 ERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQ-------DKLD 358
           +       ++  E     E  +L     K   +  + L+   ++ + +          L+
Sbjct: 625 QETEKKLEELKSELDASQESNSLAETQLKCMVESYESLELRAKDLEAEHKSLQEKISSLE 684

Query: 359 NELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKN 418
           +EL ++ Q   E+  K  E+EE      K+E   ++      D ++ K +   ++ ++  
Sbjct: 685 DELEKERQNHQELIAKCRELEE------KIERAEQEENMQKLDEEEQKIKQEKEIAAAAE 738

Query: 419 RVQELQK 425
           ++ E Q+
Sbjct: 739 KLAECQE 745


>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
           Provisional.
          Length = 276

 Score = 30.7 bits (70), Expect = 4.3
 Identities = 13/43 (30%), Positives = 23/43 (53%)

Query: 369 NEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
             +   R E EE +K + +LE  +++ E    +N +L+E LN 
Sbjct: 69  ASLFDLREENEELKKELLELESRLQELEQLEAENARLRELLNL 111


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 30.2 bits (69), Expect = 4.3
 Identities = 18/73 (24%), Positives = 49/73 (67%)

Query: 361 LRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRV 420
            + + + + E+K++R+E++  +KR+ + E+ + + + SL+  ++  EE   ++ + + ++
Sbjct: 63  HKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEESLEEKEKELAARQQQL 122

Query: 421 QELQKELEQVIEE 433
           +E ++ELE++IEE
Sbjct: 123 EEKEEELEELIEE 135


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 30.5 bits (69), Expect = 4.4
 Identities = 30/221 (13%), Positives = 83/221 (37%), Gaps = 28/221 (12%)

Query: 240 SAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLA--- 296
             QD ++++ +      +  +  +  ++   +  + E+++  +    +I  L+ ++A   
Sbjct: 34  QNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELK 93

Query: 297 -DVRKRKAEYERQSIPGRDINLESAQM----TEYTNLKAEATKRAGKILQQLDTINREQK 351
            ++ +R+   +++          + Q+    T Y ++   + K    ++ ++  I+    
Sbjct: 94  ENIVERQELLKKR--------ARAMQVNGTATSYIDVILNS-KSFSDLISRVTAISVIVD 144

Query: 352 GDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNS 411
            D+      L QQ + +  +++K+  +E+  +    L     + E  L      K E N+
Sbjct: 145 ADK----KILEQQKEDKKSLEEKQAALEDKLE---TLVALQNELETQLNSLNSQKAEKNA 197

Query: 412 DVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQE 452
            + +   +      E       L + K           KQE
Sbjct: 198 LIAALAAKEASALGE----KAALEEQKALAEAAAAEAAKQE 234


>gnl|CDD|233070 TIGR00644, recJ, single-stranded-DNA-specific exonuclease RecJ.
           All proteins in this family are 5'-3' single-strand DNA
           exonucleases. These proteins are used in some aspects of
           mismatch repair, recombination, and recombinational
           repair [DNA metabolism, DNA replication, recombination,
           and repair].
          Length = 539

 Score = 30.7 bits (70), Expect = 4.4
 Identities = 9/47 (19%), Positives = 22/47 (46%)

Query: 328 LKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKK 374
           L  +  K+A ++ ++LD +N E+K  + ++  E  +  +        
Sbjct: 279 LLTDDPKQARQLAEELDELNEERKKIEQEVLAEAEKIAEANLLPNDS 325


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 29.1 bits (66), Expect = 4.5
 Identities = 20/101 (19%), Positives = 49/101 (48%), Gaps = 3/101 (2%)

Query: 129 ERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKL 188
            R  E+ +  L  K++E       L + ++E+ + + E+ +   SLIK  + +   + K 
Sbjct: 6   RREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQE---SLIKFDKFLKENEAKR 62

Query: 189 ASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 229
             A+K   E ++  +   ++I +L+ +L +++    + E +
Sbjct: 63  RRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEK 103


>gnl|CDD|234240 TIGR03514, GldB_lipo, gliding motility-associated lipoprotein GldB.
            RN RM PMID:10648514 RT Cloning and characterization of
           the Flavobacterium johnsoniae gliding-motility genes
           gldB and gldC. RA Hunnicutt DW, McBride MJ RL J
           Bacteriol. 2000 Feb;182(4):911-8.
          Length = 319

 Score = 30.4 bits (69), Expect = 4.6
 Identities = 11/33 (33%), Positives = 18/33 (54%)

Query: 103 KLYHNETDIKELEDELDKKKGEVEKIERRKEKA 135
           KL + +  + EL  E++ K  + EKIE+    A
Sbjct: 73  KLVYKDPALNELYTEVEAKFPDFEKIEKDLTAA 105


>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
           (DUF1750).  This is a fungal domain of unknown function.
          Length = 669

 Score = 30.8 bits (69), Expect = 4.7
 Identities = 18/66 (27%), Positives = 33/66 (50%), Gaps = 7/66 (10%)

Query: 623 KEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAK 682
           K++ ++ A+ EKL +     M+K  +        S +K  E RL  +  D  +T S+I +
Sbjct: 371 KKVADMTAEIEKLKKRHARRMEKFNR-------TSALKDAEQRLRDAAADPADTGSEIWR 423

Query: 683 LEAEID 688
           LE  ++
Sbjct: 424 LEGRLE 429


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 30.8 bits (70), Expect = 4.7
 Identities = 30/151 (19%), Positives = 57/151 (37%), Gaps = 1/151 (0%)

Query: 333 TKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHI 392
            K A +ILQ+      E+         EL  + + Q   ++ R E EE Q+  ++L    
Sbjct: 32  AKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQR-QEARREREELQREEERLVQKE 90

Query: 393 RQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEELGDAKTDKHEDTRRKKKQE 452
            Q +A  +    L+ +L     +   R  EL++  +Q+  EL        E  R+   + 
Sbjct: 91  EQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKL 150

Query: 453 LVENFKKAYSGVYDRMINMCHPVHKRYNVAI 483
           L    ++  +    ++        +R    I
Sbjct: 151 LDAELEEEKAQRVKKIEEEADLEAERKAQNI 181



 Score = 30.4 bits (69), Expect = 5.6
 Identities = 21/106 (19%), Positives = 47/106 (44%), Gaps = 4/106 (3%)

Query: 616 KAKRWDDKEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQDLQN 675
           K ++   KE   +  + +K +EE  EA     KE     +              R++LQ 
Sbjct: 26  KKRQRLAKEAERILQEAQKEAEEKLEAALLEAKEL----LLRERNQQRQEARREREELQR 81

Query: 676 TKSQIAKLEAEIDALNARADATEPKIKAIEASMTARGDTISRKKEE 721
            + ++ + E ++DA   + D  E +++  E +++AR   +   +++
Sbjct: 82  EEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQ 127



 Score = 30.4 bits (69), Expect = 6.1
 Identities = 19/136 (13%), Positives = 49/136 (36%)

Query: 296 ADVRKRKAEYERQSIPGRDINLESAQMTEYTNLKAEATKRAGKILQQLDTINREQKGDQD 355
           A+ +   A  E + +  R+ N +  +            +R  +  +QLD    +    ++
Sbjct: 46  AEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLEN 105

Query: 356 KLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGS 415
           +L+   +     + E+++   +++    R+  L     +       + +L+EE    V  
Sbjct: 106 QLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQRVKK 165

Query: 416 SKNRVQELQKELEQVI 431
            +       +   Q I
Sbjct: 166 IEEEADLEAERKAQNI 181



 Score = 29.7 bits (67), Expect = 9.3
 Identities = 23/144 (15%), Positives = 61/144 (42%), Gaps = 6/144 (4%)

Query: 729 VFRDFCKSIGVSTIRQYEEAELRSQQERQKICQDKDTKKNVARWERAVSDDEEELARAQG 788
           + +    +     I Q  + E   + E   +   +   +   +  +    + EEL R + 
Sbjct: 25  LKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEE 84

Query: 789 AEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEEIGKARREVGSIAKDI------QAA 842
              +   ++ A A+KL+N+      +++A+ A + E+ +  +++ +    +      QA 
Sbjct: 85  RLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQAR 144

Query: 843 QKSCVNLESKLEMKKSERHDILMN 866
           +     L+++LE +K++R   +  
Sbjct: 145 KLLLKLLDAELEEEKAQRVKKIEE 168


>gnl|CDD|235239 PRK04172, pheS, phenylalanyl-tRNA synthetase subunit alpha;
           Provisional.
          Length = 489

 Score = 30.6 bits (70), Expect = 4.8
 Identities = 26/125 (20%), Positives = 46/125 (36%), Gaps = 19/125 (15%)

Query: 60  MSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELD 119
           +  L +KG     K   I K +A +     E+ +    E          ++      +  
Sbjct: 108 LGNLARKGWAKIEKGKVILKPDAYE--DPEEKALKALAE----GDKEELSEEDLKVLKEL 161

Query: 120 KKKGEVEKIERR------KEKAENILR---EKKKEQGALNRELAKV----DQEIREMDVE 166
           KK+  VE+ ER        +    +L+   E K+E   L  EL K     ++E R  +V+
Sbjct: 162 KKRKLVEEKERTERSVELTDAGLELLKEGIELKEEITQLTPELLKSGEWKEKEFRPYNVK 221

Query: 167 INKKR 171
               +
Sbjct: 222 APPPK 226


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 30.4 bits (70), Expect = 4.9
 Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 7/60 (11%)

Query: 380 EAQKRIDKLEDHIRQNEAS-----LKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEEL 434
           E Q  +++L+    +N  S      K   +  E L ++V   K  ++ L+ EL+++  EL
Sbjct: 39  ELQTELEELQ--AERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAEL 96



 Score = 30.4 bits (70), Expect = 5.1
 Identities = 21/87 (24%), Positives = 37/87 (42%), Gaps = 20/87 (22%)

Query: 610 SLDLARKAKRWDDKEMGNLKAQKEKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYS 669
            LD  R+  +    E+  L+A++  LS+E+ +A +K      L                 
Sbjct: 32  ELDEERRELQ---TELEELQAERNALSKEIGQAKRKGEDAEAL----------------- 71

Query: 670 RQDLQNTKSQIAKLEAEIDALNARADA 696
             +++  K +I  LEAE+D L A  + 
Sbjct: 72  IAEVKELKEEIKALEAELDELEAELEE 98


>gnl|CDD|238252 cd00446, GrpE, GrpE is the adenine nucleotide exchange factor of
           DnaK (Hsp70)-type ATPases. The GrpE dimer binds to the
           ATPase domain of Hsp70 catalyzing the dissociation of
           ADP, which enables rebinding of ATP, one step in the
           Hsp70 reaction cycle in protein folding. In eukaryotes,
           only the mitochondrial Hsp70, not the cytosolic form, is
           GrpE dependent.
          Length = 137

 Score = 29.4 bits (67), Expect = 5.0
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 911 LEHAKENLMKTNEEFENARKRAKKAKAN 938
           LE  K+ L++   EFEN RKR ++ +  
Sbjct: 1   LEELKDKLLRALAEFENYRKRTEREREE 28


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 30.5 bits (69), Expect = 5.2
 Identities = 76/375 (20%), Positives = 151/375 (40%), Gaps = 59/375 (15%)

Query: 113 ELEDELDKKKGEVEKIERRKEKAENILREKKKEQGAL---NRELAKVDQEIREMDVEINK 169
            L+ +L++ + E  ++E  ++       E +KE   L   N EL  +  E + +  EI+ 
Sbjct: 243 LLQSQLEQLQEENFRLEAARDDYRIKCEELEKELAELQHRNDELTSLAAESQALKDEIDV 302

Query: 170 KRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQ 229
            R    +S ++   ++ ++ + KK L           +D+ DL  Q+  + +R A Y + 
Sbjct: 303 LR----ESSDKAKKLEAQVETYKKKL-----------EDLNDLRRQVKLLEERNAMYMQN 347

Query: 230 SIPGRDINLESAQDV-EINKKRPSLIKSKERVSHIQKKLAS----AKKSLVEVRQANEAH 284
                   ++  +++ + N  R  L   K +V  +  KL+     A K   E ++  E  
Sbjct: 348 -------TVQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKL 400

Query: 285 NKDIADLETQLA--DVRKRKAEYER--QSIPGRDINLESAQMTEYTNLKAE----ATKRA 336
                + E  LA  D  +   E  R  Q+   +    ++       NL AE      +  
Sbjct: 401 EALQKEKERLLAERDSLRETNEELRCGQAQQDQLTQADAGTSPSGDNLAAELLPSEYREK 460

Query: 337 GKILQQLDTINREQKGDQDK---------LD------NELRQQVQTQNE-IKKKRHEMEE 380
              LQ  + + R  +   +          L+      NEL  Q++  N+ I + + ++E+
Sbjct: 461 LIRLQHENKMLRLGQEGSENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVED 520

Query: 381 AQKRIDKLEDHIRQNEASLKDNKKLKEELN--SDVGSSKNRVQELQKELEQVIEELGDAK 438
            QK    L++   ++E S     KL+E L    +      + +E  +ELE   ++    K
Sbjct: 521 LQK---ALQEQGSKSEDSSLLKSKLEEHLEQLHEANEELQKKREQIEELEPDQDQNLSRK 577

Query: 439 TDKHEDTRRKKKQEL 453
             + E   +KK +++
Sbjct: 578 IAELEAALQKKDEDM 592


>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
           subunit.  CENP-Q is one of the components that assembles
           onto the CENP-A-nucleosome distal (CAD) centromere. The
           centromere, which is the basic element of chromosome
           inheritance, is epigenetically determined in mammals.
           CENP-A, the centromere-specific histone H3 variant,
           assembles an array of nucleosomes and it is this that
           seems to be the prime candidate for specifying
           centromere identity. CENP-A nucleosomes directly recruit
           a proximal CENP-A nucleosome associated complex (NAC)
           comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
           CENP-C and CENP-H. Assembly of the CENP-A NAC at
           centromeres is dependent on CENP-M, CENP-N and CENP-T.
           Additionally, there are seven other subunits which make
           up the CENP-A-nucleosome distal (CAD) centromere,
           CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
           CENP-S, also assembling on the CENP-A NAC. Fta7 is the
           equivalent component of the fission yeast Sim4 complex.
          Length = 159

 Score = 29.6 bits (67), Expect = 5.2
 Identities = 19/87 (21%), Positives = 37/87 (42%), Gaps = 1/87 (1%)

Query: 357 LDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKD-NKKLKEELNSDVGS 415
           L  EL+Q    + EI+++  E+E+  + + +LE + +  E   ++ +K L   L      
Sbjct: 36  LAAELKQLELLEEEIRREEAELEKDLEYLQELEKNAKALEREREEESKNLHPVLRLLESE 95

Query: 416 SKNRVQELQKELEQVIEELGDAKTDKH 442
                + LQ  L ++ E       D  
Sbjct: 96  VLEENELLQDSLLELSERNFSPNLDPE 122



 Score = 29.2 bits (66), Expect = 6.7
 Identities = 23/112 (20%), Positives = 48/112 (42%), Gaps = 5/112 (4%)

Query: 89  REEIVAKEVEHQLFKLYHN--ETDIKELEDELDKKKGEVEKI---ERRKEKAENILREKK 143
            EEI  +E E +    Y    E + K LE E +++   +  +      +   EN L +  
Sbjct: 47  EEEIRREEAELEKDLEYLQELEKNAKALEREREEESKNLHPVLRLLESEVLEENELLQDS 106

Query: 144 KEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSL 195
             + +       +D E+ ++  ++NK   SL  + E+++ + + L     +L
Sbjct: 107 LLELSERNFSPNLDPELLDLLKQLNKHLESLQGNLEQLAGLVEALERLYAAL 158


>gnl|CDD|214699 smart00503, SynN, Syntaxin N-terminal domain.  Three-helix domain
           that (in Sso1p) slows the rate of its reaction with the
           SNAP-25 homologue Sec9p.
          Length = 117

 Score = 28.9 bits (65), Expect = 5.4
 Identities = 13/77 (16%), Positives = 37/77 (48%)

Query: 375 RHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEEL 434
           R  +++  + + +L+    +       +K+L+E+L   +   K   +E++ +L+++ +E 
Sbjct: 14  RANIQKISQNVAELQKLHEELLTPPDADKELREKLERLIDDIKRLAKEIRAKLKELEKEN 73

Query: 435 GDAKTDKHEDTRRKKKQ 451
            + +       R +K Q
Sbjct: 74  LENRASGSASDRTRKAQ 90


>gnl|CDD|213244 cd03277, ABC_SMC5_euk, ATP-binding cassette domain of eukaryotic SMC5
            proteins.  The structural maintenance of chromosomes
            (SMC) proteins are large (approximately 110 to 170 kDa),
            and each is arranged into five recognizable domains.
            Amino-acid sequence homology of SMC proteins between
            species is largely confined to the amino- and
            carboxy-terminal globular domains. The amino-terminal
            domain contains a 'Walker A' nucleotide-binding domain
            (GxxGxGKS/T, in the single-letter amino-acid code), which
            by mutational studies has been shown to be essential in
            several proteins. The carboxy-terminal domain contains a
            sequence (the DA-box) that resembles a 'Walker B' motif,
            and a motif with homology to the signature sequence of
            the ATP-binding cassette (ABC) family of ATPases. The
            sequence homology within the carboxy-terminal domain is
            relatively high within the SMC1-SMC4 group, whereas SMC5
            and SMC6 show some divergence in both of these sequences.
            In eukaryotic cells, the proteins are found as
            heterodimers of SMC1 paired with SMC3, SMC2 with SMC4,
            and SMC5 with SMC6 (formerly known as Rad18).
          Length = 213

 Score = 29.9 bits (68), Expect = 5.4
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 999  PAPFFVLDEIDAALDNTNIGKVASYIV 1025
              PF V+DEI+  +D TN  KV   +V
Sbjct: 148  RCPFRVVDEINQGMDPTNERKVFDMLV 174



 Score = 29.1 bits (66), Expect = 8.9
 Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 1/31 (3%)

Query: 8  IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
          I+++NF +Y  +    P      +IGPNGSG
Sbjct: 6  IKLENFVTY-DETEFRPGPSLNMIIGPNGSG 35


>gnl|CDD|203795 pfam07889, DUF1664, Protein of unknown function (DUF1664).  The
           members of this family are hypothetical plant proteins
           of unknown function. The region featured in this family
           is approximately 100 amino acids long.
          Length = 125

 Score = 28.9 bits (65), Expect = 5.9
 Identities = 18/90 (20%), Positives = 49/90 (54%), Gaps = 6/90 (6%)

Query: 765 TKKNVARWERAVSDDEEELARAQGAEEKLAGEMRAEADKLENMRATRLTKKQAVDAMDEE 824
           TK+N++    AV+   ++L   +   E +A   +  + +++N+      +K+  ++  +E
Sbjct: 36  TKRNMSD---AVASVGKQL---EKVSESIAAAKKHLSQRIDNLDDKLDEQKEISESTRDE 89

Query: 825 IGKARREVGSIAKDIQAAQKSCVNLESKLE 854
           + + R ++ +I +D+++  ++   LE KL+
Sbjct: 90  VTEIREDLSNIGEDVKSVHQAVEGLEGKLD 119


>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
            partitioning [Cell division and chromosome partitioning].
          Length = 1480

 Score = 30.6 bits (69), Expect = 5.9
 Identities = 43/260 (16%), Positives = 107/260 (41%), Gaps = 43/260 (16%)

Query: 210  ADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKS------------- 256
            AD E ++  +  R+ E ER      + N +     +  K+  + +               
Sbjct: 832  ADPEAEIRQLNSRRNELERALSNHENDNQQQRIQFDQAKEGVTALNRLIPQLNLLADESL 891

Query: 257  KERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDIN 316
             +RV  I+++L  A+++   ++Q    H   ++ LE   + ++    ++E+         
Sbjct: 892  ADRVEEIRERLDEAQEAARFIQQ----HGNTLSKLEPIASVLQSDPEQFEQLK------- 940

Query: 317  LESAQMTEYTNLKAEATKRA---GKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKK 373
                   +   ++ +A ++A    +++Q+    +     +    +++L +++        
Sbjct: 941  ---EDYAQAQQMQRQARQQAFALTEVVQRRAHFSYSDSAEMLSENSDLNEKL-------- 989

Query: 374  KRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQVIEE 433
             R  +E+A+    +  + +RQ++A L    ++   L S   + K  + ELQ+EL    ++
Sbjct: 990  -RQRLEQAEAERTRAREQLRQHQAQLSQYNQVLASLKSSYDTKKELLNELQQEL----QD 1044

Query: 434  LGDAKTDKHEDTRRKKKQEL 453
            +G       E+  R ++ EL
Sbjct: 1045 IGVRADSGAEERARIRRDEL 1064


>gnl|CDD|232969 TIGR00422, valS, valyl-tRNA synthetase.  The valyl-tRNA synthetase
           (ValS) is a class I amino acyl-tRNA ligase and is
           particularly closely related to the isoleucyl tRNA
           synthetase [Protein synthesis, tRNA aminoacylation].
          Length = 861

 Score = 30.4 bits (69), Expect = 6.0
 Identities = 28/144 (19%), Positives = 52/144 (36%), Gaps = 25/144 (17%)

Query: 8   IEVDNFKSYKGKFSIGPLKKFTAVIGPNGSGALKDDYERLKAEMIQAEEETNMSYLKKKG 67
           + + N K+        PLK                              + N   +K   
Sbjct: 734 VSIRNLKAESNIPPNAPLKV----------------LLIYTEAETAERLKLNAVDIKG-- 775

Query: 68  VVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEK 127
                 E ++  ++ E  + + E +   E+   +  L +   ++  L+ +LDK+K EV +
Sbjct: 776 -AINFSEVEVVIEKPEVTEAVVELVPGFEIIIPVKGLINKAKELARLQKQLDKEKKEVIR 834

Query: 128 IERRKE------KAENILREKKKE 145
           IE + E      KA   + EK+KE
Sbjct: 835 IEGKLENEGFVKKAPKEVIEKEKE 858


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 30.2 bits (68), Expect = 6.2
 Identities = 42/184 (22%), Positives = 69/184 (37%), Gaps = 19/184 (10%)

Query: 129 ERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKL 188
           +++    +   R+KK EQ A   E  +  ++ R+ ++E         K  E+      K 
Sbjct: 58  QKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQ----AAKQ 113

Query: 189 ASAKKSLVE---VRQANEAHNKDIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVE 245
           A  K+   E    +QA EA  K  A+ E +  +  K++AE E +           A+   
Sbjct: 114 AEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAK-----------AKAAA 162

Query: 246 INKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANE-AHNKDIADLETQLADVRKRKAE 304
             KK+ +  K K       K  A AK    E +   E A  K  A+   +        A 
Sbjct: 163 EAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAA 222

Query: 305 YERQ 308
            E +
Sbjct: 223 AEAE 226


>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
           coiled-coil domain [Function unknown].
          Length = 231

 Score = 29.8 bits (67), Expect = 6.2
 Identities = 24/95 (25%), Positives = 48/95 (50%), Gaps = 4/95 (4%)

Query: 362 RQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQ 421
            QQ+ T+ ++++ R E E+ QK +   +  +R+ +A+ +D K L+     +  ++K  + 
Sbjct: 39  WQQLATKQDVEELRKETEQRQKELADEKLEVRKQKATKEDLKLLQRFQEEEFRATKEDI- 97

Query: 422 ELQKELEQVIEELGDAKTDKHEDTRRKKKQELVEN 456
              K LE +I  LG       ED  R+  +E++  
Sbjct: 98  ---KRLETIITGLGARWGILSEDAFRQGVREILRE 129


>gnl|CDD|222290 pfam13654, AAA_32, AAA domain.  This family includes a wide variety
           of AAA domains including some that have lost essential
           nucleotide binding residues in the P-loop.
          Length = 509

 Score = 30.5 bits (70), Expect = 6.3
 Identities = 36/175 (20%), Positives = 72/175 (41%), Gaps = 36/175 (20%)

Query: 71  ERKEAKIEKDEAEKYQRIREEIVAKEVEHQLFKLYHNETDI--------KELEDE----L 118
           E ++ +IE++  EK +   EE+  +E + + F L               + L +E    L
Sbjct: 114 EARKEEIEEEFQEKREEAFEEL-EEEAKEKGFALVRTPGGFVFAPLKDGEPLTEEEFEAL 172

Query: 119 DKKKGEV--EKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIK 176
            +++ E   EKI+  +E+ + ILR+ ++ +     +L ++D+E+                
Sbjct: 173 PEEEREELEEKIDELEEELQEILRQLRELEREAREKLRELDREV---------------- 216

Query: 177 SKERVSHIQKKLASAKKSLVEVRQANEAHNKDIAD-----LETQLADVRKRKAEY 226
           +   V  +  +L      L +V    +A  +DI +     LE   A+     AE 
Sbjct: 217 ALFAVGPLLDELKEKYADLPKVLAYLDAVQEDILENLDLFLEEDEAEAAASGAED 271


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 30.7 bits (70), Expect = 6.3
 Identities = 18/87 (20%), Positives = 38/87 (43%), Gaps = 9/87 (10%)

Query: 366 QTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQK 425
             Q E+   + ++E   +   + +      +  L   + L  EL       + + QEL+ 
Sbjct: 146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAEL-------EEKQQELEA 198

Query: 426 ELEQVIEELGDAKTDKHEDTRRKKKQE 452
           +LEQ+ E+   A+T +    +RK+  +
Sbjct: 199 QLEQLQEK--AAETSQERKQKRKEITD 223


>gnl|CDD|217686 pfam03713, DUF305, Domain of unknown function (DUF305).  Domain
           found in small family of bacterial secreted proteins
           with no known function. Also found in Paramecium
           bursaria chlorella virus 1. This domain is short and
           found in one or two copies. The domain has a conserved
           HH motif that may be functionally important. This domain
           belongs to the ferritin superfamily. It contains two
           sequence similar repeats each of which is composed of
           two alpha helices.
          Length = 150

 Score = 29.2 bits (66), Expect = 6.4
 Identities = 11/30 (36%), Positives = 21/30 (70%), Gaps = 1/30 (3%)

Query: 816 QAVDAMDEEIGKARR-EVGSIAKDIQAAQK 844
           QAV+  +  + +++  EV ++A+DI AAQ+
Sbjct: 14  QAVEMAELALERSQDPEVRALAQDIIAAQQ 43


>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
           protein turnover, chaperones].
          Length = 579

 Score = 30.4 bits (69), Expect = 6.6
 Identities = 17/87 (19%), Positives = 42/87 (48%), Gaps = 8/87 (9%)

Query: 356 KLDNELRQQVQTQNEIKKKRHEMEEAQKRIDK-LEDHIRQNEASLKDNKKLKEELNSDVG 414
            LD + R+ V+ +N       E E     ++K L++ ++ +E   +  ++   +L   + 
Sbjct: 500 ALDKKFRELVEARN-------EAESLIYSLEKALKEIVKVSEEEKEKIEEAITDLEEALE 552

Query: 415 SSKNRVQELQKELEQVIEELGDAKTDK 441
             K  ++   +EL++V ++L + K  +
Sbjct: 553 GEKEEIKAKIEELQEVTQKLAEKKYQQ 579



 Score = 30.0 bits (68), Expect = 9.1
 Identities = 17/85 (20%), Positives = 37/85 (43%), Gaps = 3/85 (3%)

Query: 111 IKELEDELDKKKGEVEKIERR--KEKAENILREKKKEQGALNRE-LAKVDQEIREMDVEI 167
           +++ E      K   E +E R   E     L +  KE   ++ E   K+++ I +++  +
Sbjct: 492 VEDAEANAALDKKFRELVEARNEAESLIYSLEKALKEIVKVSEEEKEKIEEAITDLEEAL 551

Query: 168 NKKRPSLIKSKERVSHIQKKLASAK 192
             ++  +    E +  + +KLA  K
Sbjct: 552 EGEKEEIKAKIEELQEVTQKLAEKK 576


>gnl|CDD|214017 cd12924, iSH2_PIK3R1, Inter-Src homology 2 (iSH2) helical domain of
           Class IA Phosphoinositide 3-kinase Regulatory subunit 1,
           PIK3R1, also called p85alpha.  PI3Ks catalyze the
           transfer of the gamma-phosphoryl group from ATP to the
           3-hydroxyl of the inositol ring of
           D-myo-phosphatidylinositol (PtdIns) or its derivatives.
           They play an important role in a variety of fundamental
           cellular processes, including cell motility, the Ras
           pathway, vesicle trafficking and secretion, immune cell
           activation and apoptosis. They are classified according
           to their substrate specificity, regulation, and domain
           structure. Class IA PI3Ks are heterodimers of a p110
           catalytic (C) subunit and a p85-related regulatory (R)
           subunit. The R subunit down-regulates PI3K basal
           activity, stabilizes the C subunit, and plays a role in
           the activation downstream of tyrosine kinases. All R
           subunits contain two SH2 domains that flank an
           intervening helical domain (iSH2), which binds to the
           N-terminal adaptor-binding domain (ABD) of the catalytic
           subunit. In addition, p85alpha, also called PIK3R1,
           contains N-terminal SH3 and GAP domains. p85alpha carry
           functions independent of its PI3K regulatory role. It
           can independently stimulate signaling pathways involved
           in cytoskeletal rearrangements. Insulin-sensitive
           tissues express splice variants of the PIK3R1 gene,
           p50alpha and p55alpha, which may play important roles in
           insulin signaling during lipid and glucose metabolism.
           Mice deficient with PIK3R1 die perinatally, indicating
           its importance in development.
          Length = 161

 Score = 29.3 bits (65), Expect = 6.7
 Identities = 36/146 (24%), Positives = 66/146 (45%), Gaps = 13/146 (8%)

Query: 43  DYERLKAEMIQAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAKEVEHQLF 102
           +Y+RL  E  +  +E  M   K+  + A  +  KI +++ +  +R  +E + K      F
Sbjct: 23  EYDRLYEEYTRTSQEIQM---KRTAIEAFNETIKIFEEQCQTQERYSKEYIEK------F 73

Query: 103 KLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIRE 162
           K   NE +I+ +    +K K  + +I   + +    L E  K+Q A  RE+ K    I+ 
Sbjct: 74  KREGNEKEIQRIMHNYEKLKSRISEIVDSRRR----LEEDLKKQAAEYREIDKRMNSIKP 129

Query: 163 MDVEINKKRPSLIKSKERVSHIQKKL 188
             +++ K R   +    +    QKKL
Sbjct: 130 DLIQLRKTRDQYLMWLTQKGVRQKKL 155


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 30.4 bits (69), Expect = 6.7
 Identities = 32/191 (16%), Positives = 81/191 (42%), Gaps = 17/191 (8%)

Query: 83  EKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAENILREK 142
            + +R++E++V         +L   E ++  ++++++     +  +  R+ +A+N++ E 
Sbjct: 255 SRLERLKEQLVENSELLTQLELDEAEEELGLIQEKIE----SLYDLLEREVEAKNVVEEN 310

Query: 143 KKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSH---IQKKLASAKKSLVEVR 199
                 L   L K  +    +  EI + + S   ++  +      +K+L   +  L E+ 
Sbjct: 311 LPI---LPDYLEKAKENNEHLKEEIERVKESYRLAETELGSVRKFEKELKELESVLDEIL 367

Query: 200 QANEAHNK-------DIADLETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPS 252
           +  EA          ++ ++E  L D+   + + +      R   LE+ +++E  K +  
Sbjct: 368 ENIEAQEVAYSELQDNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEARENLERLKSKLH 427

Query: 253 LIKSKERVSHI 263
            IK     S++
Sbjct: 428 EIKRYMEKSNL 438


>gnl|CDD|217892 pfam04100, Vps53_N, Vps53-like, N-terminal.  Vps53 complexes with
           Vps52 and Vps54 to form a multi- subunit complex
           involved in regulating membrane trafficking events.
          Length = 375

 Score = 30.1 bits (68), Expect = 6.9
 Identities = 17/75 (22%), Positives = 45/75 (60%), Gaps = 1/75 (1%)

Query: 340 LQQLDTINREQKGDQDKLDNELRQQVQTQ-NEIKKKRHEMEEAQKRIDKLEDHIRQNEAS 398
           L  +D++ ++ + +  +LD  ++  V+ Q N  +K +  ++EAQK I++L   I++ ++ 
Sbjct: 20  LTNIDSLIQKIQEEIRRLDANIKTVVRGQANVGEKGKEALDEAQKAIEELFQKIKEIKSK 79

Query: 399 LKDNKKLKEELNSDV 413
            +  +++ +E+  D+
Sbjct: 80  AESTEEMVKEITRDI 94


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 30.3 bits (69), Expect = 7.1
 Identities = 19/65 (29%), Positives = 36/65 (55%), Gaps = 3/65 (4%)

Query: 370 EIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQ 429
           E +K   E+EE Q   ++L    ++   +LK  +   EEL ++V   K +++EL+  L++
Sbjct: 37  ERRKLLRELEELQAERNEL---SKEIGRALKRGEDDAEELIAEVKELKEKLKELEAALDE 93

Query: 430 VIEEL 434
           +  EL
Sbjct: 94  LEAEL 98


>gnl|CDD|213181 cd03214, ABC_Iron-Siderophores_B12_Hemin, ATP-binding component
          of iron-siderophores, vitamin B12 and hemin
          transporters and related proteins.  ABC transporters,
          involved in the uptake of siderophores, heme, and
          vitamin B12, are widely conserved in bacteria and
          archaea. Only very few species lack representatives of
          the siderophore family transporters. The E. coli BtuCD
          protein is an ABC transporter mediating vitamin B12
          uptake. The two ATP-binding cassettes (BtuD) are in
          close contact with each other, as are the two
          membrane-spanning subunits (BtuC); this arrangement is
          distinct from that observed for the E. coli lipid
          flippase MsbA. The BtuC subunits provide 20
          transmembrane helices grouped around a translocation
          pathway that is closed to the cytoplasm by a gate
          region, whereas the dimer arrangement of the BtuD
          subunits resembles the ATP-bound form of the Rad50 DNA
          repair enzyme. A prominent cytoplasmic loop of BtuC
          forms the contact region with the ATP-binding cassette
          and represent a conserved motif among the ABC
          transporters.
          Length = 180

 Score = 29.3 bits (67), Expect = 7.1
 Identities = 8/30 (26%), Positives = 14/30 (46%), Gaps = 6/30 (20%)

Query: 15 SYKGKFSIGPLK------KFTAVIGPNGSG 38
           Y G+  +  L       +   ++GPNG+G
Sbjct: 8  GYGGRTVLDDLSLSIEAGEIVGILGPNGAG 37


>gnl|CDD|213207 cd03240, ABC_Rad50, ATP-binding cassette domain of Rad50.  The
            catalytic domains of Rad50 are similar to the ATP-binding
            cassette of ABC transporters, but are not associated with
            membrane-spanning domains. The conserved ATP-binding
            motifs common to Rad50 and the ABC transporter family
            include the Walker A and Walker B motifs, the Q loop, a
            histidine residue in the switch region, a D-loop, and a
            conserved LSGG sequence. This conserved sequence, LSGG,
            is the most specific and characteristic motif of this
            family and is thus known as the ABC signature sequence.
          Length = 204

 Score = 29.5 bits (67), Expect = 7.1
 Identities = 18/51 (35%), Positives = 24/51 (47%), Gaps = 2/51 (3%)

Query: 1001 PFFVLDEIDAALDNTNIGKVASYIVT--KTQDSLQTIVISLKEEFFSHADS 1049
                LDE    LD  NI +  + I+   K+Q + Q IVI+  EE    AD 
Sbjct: 141  GILALDEPTTNLDEENIEESLAEIIEERKSQKNFQLIVITHDEELVDAADH 191


>gnl|CDD|213186 cd03219, ABC_Mj1267_LivG_branched, ATP-binding cassette component
          of branched chain amino acids transport system.  The
          Mj1267/LivG ABC transporter subfamily is involved in
          the transport of the hydrophobic amino acids leucine,
          isoleucine and valine. MJ1267 is a branched-chain amino
          acid transporter with 29% similarity to both the LivF
          and LivG components of the E. coli branched-chain amino
          acid transporter. MJ1267 contains an insertion from
          residues 114 to 123 characteristic of LivG
          (Leucine-Isoleucine-Valine) homologs. The
          branched-chain amino acid transporter from E. coli
          comprises a heterodimer of ABCs (LivF and LivG), a
          heterodimer of six-helix TM domains (LivM and LivH),
          and one of two alternative soluble periplasmic
          substrate binding proteins (LivK or LivJ).
          Length = 236

 Score = 29.7 bits (68), Expect = 7.6
 Identities = 9/19 (47%), Positives = 13/19 (68%), Gaps = 1/19 (5%)

Query: 20 FSIGPLKKFTAVIGPNGSG 38
          FS+ P  +   +IGPNG+G
Sbjct: 21 FSVRP-GEIHGLIGPNGAG 38


>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
          Length = 3164

 Score = 30.3 bits (68), Expect = 7.7
 Identities = 22/111 (19%), Positives = 44/111 (39%), Gaps = 3/111 (2%)

Query: 331  EATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLED 390
            E  +  G++ +QL          +   D+ L      + E + K   +E  +  +  LE+
Sbjct: 2106 EYGRGMGELKEQLSNTVVILGVKEKNADDALSGTPGERLEREVKSVFVEAPRDMLFLLEE 2165

Query: 391  HIRQNEASLKDNKKLKEE--LNSDVGSSKNRVQELQKELEQVIEELGDAKT 439
             +R+ + S+   K  K+   L + +   K +   L +E+   I   GD   
Sbjct: 2166 EVRKRKGSVMKFKSSKKPAVLEAVLFVYKIKKASL-REIRSFIRPPGDLCI 2215


>gnl|CDD|218703 pfam05700, BCAS2, Breast carcinoma amplified sequence 2 (BCAS2).
           This family consists of several eukaryotic sequences of
           unknown function. The mammalian members of this family
           are annotated as breast carcinoma amplified sequence 2
           (BCAS2) proteins. BCAS2 is a putative spliceosome
           associated protein.
          Length = 221

 Score = 29.4 bits (66), Expect = 8.2
 Identities = 16/76 (21%), Positives = 39/76 (51%)

Query: 326 TNLKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRI 385
             L     K   ++ ++++ +NR++K  Q++   +LR   Q   ++  K  E+E A   +
Sbjct: 142 EALLKRLEKELAELKEEIEEVNRQRKYSQEEAGEKLRSLEQRWVDLVSKNLEIEVACAEL 201

Query: 386 DKLEDHIRQNEASLKD 401
           ++  + +++ +A LK 
Sbjct: 202 EQEIEQLKRKKAELKW 217


>gnl|CDD|213202 cd03235, ABC_Metallic_Cations, ATP-binding cassette domain of the
          metal-type transporters.  This family includes
          transporters involved in the uptake of various metallic
          cations such as iron, manganese, and zinc. The ATPases
          of this group of transporters are very similar to
          members of iron-siderophore uptake family suggesting
          that they share a common ancestor. The best
          characterized metal-type ABC transporters are the
          YfeABCD system of Y. pestis, the SitABCD system of
          Salmonella enterica serovar Typhimurium, and the
          SitABCD transporter of Shigella flexneri. Moreover
          other uncharacterized homologs of these metal-type
          transporters are mainly found in pathogens like
          Haemophilus or enteroinvasive E. coli isolates.
          Length = 213

 Score = 29.4 bits (67), Expect = 8.2
 Identities = 11/36 (30%), Positives = 20/36 (55%), Gaps = 11/36 (30%)

Query: 3  PILQYIEVDNFKSYKGKFSIGPLKKFTAVIGPNGSG 38
          P+L+ +   +F+   G+F         A++GPNG+G
Sbjct: 13 PVLEDV---SFEVKPGEF--------LAIVGPNGAG 37


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 29.7 bits (67), Expect = 8.2
 Identities = 26/126 (20%), Positives = 60/126 (47%), Gaps = 6/126 (4%)

Query: 83  EKYQRIREEIVAKEVEHQ------LFKLYHNETDIKELEDELDKKKGEVEKIERRKEKAE 136
           E+ +R ++E+  +  + +        +L   E    ELE++L +++ E + +E++ ++ E
Sbjct: 1   EEAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELE 60

Query: 137 NILREKKKEQGALNRELAKVDQEIREMDVEINKKRPSLIKSKERVSHIQKKLASAKKSLV 196
              R  ++E  A   E  +++ E+ E   E+ K      K +     +Q++L  A+++  
Sbjct: 61  EENRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHE 120

Query: 197 EVRQAN 202
             RQ  
Sbjct: 121 RARQEL 126


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 29.1 bits (66), Expect = 8.3
 Identities = 25/81 (30%), Positives = 47/81 (58%), Gaps = 9/81 (11%)

Query: 351 KGDQDKLDNELRQQVQTQNEIKKKRHEMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELN 410
           + +Q K  +EL Q      EIKK   E EE +KR+ +LE  +   E   ++ ++++E+ +
Sbjct: 117 QAEQGK--SELEQ------EIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRH 168

Query: 411 SD-VGSSKNRVQELQKELEQV 430
           +D +   K + Q+L+ +LEQ+
Sbjct: 169 ADEIAFLKKQNQQLKSQLEQI 189


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
           binding.  This is the C-terminal conserved coiled coil
           region of a family of TATA element modulatory factor 1
           proteins conserved in eukaryotes. The proteins bind to
           the TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMF1_TATA_bd is the most conserved
           part of the TMFs. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells. The Rab6-binding
           domain appears to be the same region as this C-terminal
           family.
          Length = 121

 Score = 28.3 bits (64), Expect = 8.4
 Identities = 22/75 (29%), Positives = 36/75 (48%), Gaps = 16/75 (21%)

Query: 633 EKLSEELREAMKKSRKESELNTVQSTIKGLEIRLNYSRQ----------DLQNTKSQIAK 682
           E+LS  LR      R E EL +++  +  LE   + +RQ          +L+  K +I +
Sbjct: 20  ERLSSTLR------RLEGELASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEE 73

Query: 683 LEAEIDALNARADAT 697
           LE E++ L  R + T
Sbjct: 74  LEKELEDLEQRYETT 88


>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein.  A number of members of this
           family are annotated as being transmembrane proteins
           induced by tumour necrosis factor alpha, but no
           literature was found to support this.
          Length = 330

 Score = 29.7 bits (67), Expect = 8.4
 Identities = 22/98 (22%), Positives = 50/98 (51%), Gaps = 5/98 (5%)

Query: 377 EMEEAQKRIDKLEDHIRQNEASLKDNKKLKEELNSDVGSSKNRVQELQKELEQ----VIE 432
           E +E +K   +L++  R  +  L++ +KL+E+  S +   + R+++L   L++    +  
Sbjct: 5   EWKELEKEFQQLQETHRLYKQKLEEVEKLQEQCTSAIARQRKRLKQLIVSLKKLKKSLTP 64

Query: 433 ELGDAKTDKHEDTRRKKKQEL-VENFKKAYSGVYDRMI 469
           E  +      E  + +K Q   +E+F    +G+Y R++
Sbjct: 65  EDSELVEQLEEQIKERKNQFFDMESFLPKKNGLYLRLV 102


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 30.1 bits (68), Expect = 8.5
 Identities = 20/78 (25%), Positives = 36/78 (46%), Gaps = 5/78 (6%)

Query: 64   KKKGVVAERKEAKIE-----KDEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKELEDEL 118
            KKKG +   ++  IE     ++E E   +   E   ++  +    L + E +I+E   E 
Sbjct: 1505 KKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564

Query: 119  DKKKGEVEKIERRKEKAE 136
            D KK + +K  +   +AE
Sbjct: 1565 DIKKRKNKKQYKSNTEAE 1582


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 29.8 bits (67), Expect = 8.6
 Identities = 40/192 (20%), Positives = 80/192 (41%), Gaps = 11/192 (5%)

Query: 119 DKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMDVEINKK-----RPS 173
            ++K      E+RK+K +    E +++Q A    L ++++E R    E  K+     + +
Sbjct: 69  QQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKE-RLAAQEQKKQAEEAAKQA 127

Query: 174 LIKSKERVSHIQKKLASAK-KSLVEVRQANEAHNKDIADLETQLADVRKRKAEYERQSIP 232
            +K K+      K  A+AK K+  E ++A  A  K  A+ + +      +KA  E +   
Sbjct: 128 ALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAK--- 184

Query: 233 GRDINLESAQDVEINKKRPSLIKSKERVSHIQKKLASAKKSLVEVRQANEAHNKDIADLE 292
            +    E+A       K+ +  ++K++ +   KK A+A+      + A EA         
Sbjct: 185 -KKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAA 243

Query: 293 TQLADVRKRKAE 304
            + A+       
Sbjct: 244 AKAAEKAAAAKA 255


>gnl|CDD|221270 pfam11855, DUF3375, Protein of unknown function (DUF3375).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria. Proteins in this
           family are typically between 479 to 499 amino acids in
           length.
          Length = 471

 Score = 30.0 bits (68), Expect = 8.6
 Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 6/50 (12%)

Query: 649 ESELNTVQSTIKGLEIRLNYSRQDLQNTKSQIAKLEAEIDALNARADATE 698
           ES L TV   ++ L            + ++++A+LE EID L+A  D  E
Sbjct: 118 ESRLLTVFELLRQLAEGTE------PDPEARLAELEREIDELDAEIDRLE 161


>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
          Length = 874

 Score = 30.1 bits (69), Expect = 8.7
 Identities = 15/61 (24%), Positives = 31/61 (50%), Gaps = 9/61 (14%)

Query: 111 IKELEDELDKKKGEVEKIERR--------KEKAENILREKKKEQGALNRELAKVDQEIRE 162
           +  LE EL K + E+E++E++        K   E + +E++K       +LAK+ + +  
Sbjct: 813 LARLEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKEREKLAEYEE-KLAKLKERLAR 871

Query: 163 M 163
           +
Sbjct: 872 L 872


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 30.1 bits (69), Expect = 8.7
 Identities = 19/67 (28%), Positives = 31/67 (46%), Gaps = 9/67 (13%)

Query: 384 RI-----DKLEDHIRQNEASLKD----NKKLKEELNSDVGSSKNRVQELQKELEQVIEEL 434
           RI     +   +++ + EA LK+     K    EL   V +    ++EL+KELEQ+  +L
Sbjct: 683 RIEAVTGEAALEYLNEQEALLKELAALLKAKPSELPERVEALLEELKELEKELEQLKAKL 742

Query: 435 GDAKTDK 441
             A    
Sbjct: 743 AAAAAGD 749


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 28.8 bits (65), Expect = 8.7
 Identities = 23/89 (25%), Positives = 50/89 (56%), Gaps = 3/89 (3%)

Query: 108 ETDIKELEDELDKKKGEVEKIERRKEKAENILRE--KKKEQGALNRELAKVDQEIREMDV 165
           E + K+L+ EL KK+ E++K E++ +K    L E  +K +Q  L ++  ++ Q+ +    
Sbjct: 41  EKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQ 100

Query: 166 EINKKRPSLIKS-KERVSHIQKKLASAKK 193
           E+ +K+  L++   +++    K++A  K 
Sbjct: 101 ELQQKQQELLQPIYDKIDKAIKEVAKEKG 129


>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein.  This family of proteins
            is found in bacteria. Proteins in this family are
            typically between 397 and 1543 amino acids in length.
            This family is the SidE protein in the Dot/Icm pathway of
            Legionella pneumophila bacteria. There is little
            literature describing the family.
          Length = 1443

 Score = 29.8 bits (67), Expect = 9.0
 Identities = 64/290 (22%), Positives = 117/290 (40%), Gaps = 51/290 (17%)

Query: 213  ETQLADVRKRKAEYERQSIPGRDINLESAQDVEINKKRPSLIKSKERVS--HIQKKLASA 270
            +  + +  KR  E E+  +     NL+      + K   +L   ++ V+    +K     
Sbjct: 1106 DALIENTSKRITELEKAKLA----NLDV-----VKKAISNLNNLQQEVTLLRNEKIRMHT 1156

Query: 271  KKSLVEVRQANEAHNKDIADLETQLAD-----VRKRKAEYERQSIPGRDINLES--AQMT 323
                V+     E   K +  ++T+LAD     V K+ +  E++  P     L+S  A+  
Sbjct: 1157 GTDKVDFSDI-EKLEKQLQVIDTKLADAYLLEVTKQISALEKE-KPKNQSELKSKIAKFF 1214

Query: 324  EYTN----LKAEATKRAGKILQQLDTINREQKGDQDKLDNELRQQVQ------------T 367
            + T     L+ E  K+ G     LD        D DKL   L+   Q            +
Sbjct: 1215 DTTADIEVLRNERIKKHGSSKDPLDL------SDLDKLSGNLQGVNQSLVSILITTIRSS 1268

Query: 368  QNEIKKKRHEMEEAQ-----KRIDKLEDHIRQNEASLK---DNKKLKEELNSDVGSSKNR 419
             N++K K  EM+E +     + + KLE  + +++ + K   D  KLK+ L +   +    
Sbjct: 1269 LNQMKPKTFEMQEKEIQQNFELLAKLEKTLDKSDTAEKLREDIPKLKDLLIAKQKAYPEM 1328

Query: 420  VQELQKELEQVIEELGDAKTDKHEDTRRKKKQELVENFKKAYSGVYDRMI 469
            VQ LQ + E +I +L +     H+D  + +   L E  ++   G    ++
Sbjct: 1329 VQ-LQYKSEALITQLRELCQAHHDDLAKTRTARLQELDRQDREGGITGIV 1377


>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242). 
          Length = 401

 Score = 29.9 bits (67), Expect = 9.0
 Identities = 29/101 (28%), Positives = 45/101 (44%), Gaps = 20/101 (19%)

Query: 74  EAKIEK--DEAEKYQRIREEIVAKEVEHQLFKLYHNETDIKEL---EDELDKKKGEVEKI 128
           + KI K  ++ E  + IRE I  ++ E    KL  N+  + +L        K KG  +K+
Sbjct: 74  QTKIRKLTEQDEGLRNIRESIEERQKESDQLKL-QNQKLVNQLGHARGVFIKTKGRYQKL 132

Query: 129 E----RRKEKAENI----------LREKKKEQGALNRELAK 155
           E      KE+ + +            EK +E   LNRELA+
Sbjct: 133 EELSRHLKEENQCLQIQLDALVQECNEKIEENQELNRELAE 173


>gnl|CDD|221016 pfam11180, DUF2968, Protein of unknown function (DUF2968).  This
           family of proteins has no known function.
          Length = 192

 Score = 29.3 bits (66), Expect = 9.1
 Identities = 24/100 (24%), Positives = 49/100 (49%), Gaps = 10/100 (10%)

Query: 612 DLARKAKRWDDKEM--GNLKAQKEKLSEELREAMKK-SRKESELNTVQSTIKGLEIRLNY 668
           D AR+  +  D E+    L+AQK +L  ++  +  +  R +++L   Q+  + +  R   
Sbjct: 92  DFARQTAQLADVEIRRAQLEAQKAQLERQIAASEARAERLQADLQLAQAQEQQVAARQQQ 151

Query: 669 SRQDL-------QNTKSQIAKLEAEIDALNARADATEPKI 701
           +RQ+        Q  ++Q+ KL+ +I  L A+ +   P +
Sbjct: 152 ARQEAVALEAQRQAAQAQLRKLQRQIRQLQAQQNEPIPHL 191


>gnl|CDD|238140 cd00228, eu-GS, Eukaryotic Glutathione Synthetase (eu-GS);
           catalyses the production of glutathione from
           gamma-glutamylcysteine and glycine in an ATP-dependent
           manner. Belongs to the ATP-grasp superfamily.
          Length = 471

 Score = 29.6 bits (67), Expect = 9.2
 Identities = 26/112 (23%), Positives = 44/112 (39%), Gaps = 30/112 (26%)

Query: 574 PEDAMKVAYDIEPQHR--YDAVALDGTFYQKSGIMSGGSLDLARKAKRWDDKEMGNL--- 628
           PE   + A +++P      D ++ DG F Q+S         L+   K   D+    L   
Sbjct: 53  PEALFEQAVEVQPDFNELVDRISQDGKFLQQS---------LSSTKKV--DEFTSRLLDI 101

Query: 629 --KAQKEKLSEELR------------EAMKKSRKESELNTVQSTIKGLEIRL 666
             K  +E   + +R            +   K  K+ E+NT+ S+  GL  R+
Sbjct: 102 HKKVLEENKKQPVRLGIFRSDYMFDEKDGTKGLKQIEINTISSSFGGLSSRV 153


>gnl|CDD|235878 PRK06851, PRK06851, hypothetical protein; Provisional.
          Length = 367

 Score = 29.5 bits (67), Expect = 9.2
 Identities = 25/106 (23%), Positives = 45/106 (42%), Gaps = 20/106 (18%)

Query: 909 EKLEHAKENLMKTNEE----FENARKRAKKAKA--------NFDRIKKERYDKFTRCFEH 956
           +KL   KE ++K NEE    F+ A +   +A A          + +   + ++ T   + 
Sbjct: 115 DKLRKHKEEILKINEEISRCFQRAYEYLNEALAIHDEWEKIYIENMDFAKANELT---DE 171

Query: 957 VSNEI-DGAGSESVLPRP---FLGPENPEEPLTYRVS-TTIVSHRY 997
           +  E+  GA  +    +    FLG   P+  + +  S T  V +RY
Sbjct: 172 LIQELFKGAPGKISKGKVRHLFLGAITPKGAVDFVPSLTEGVKNRY 217


>gnl|CDD|219955 pfam08657, DASH_Spc34, DASH complex subunit Spc34.  The DASH
           complex is a ~10 subunit microtubule-binding complex
           that is transferred to the kinetochore prior to mitosis.
           In Saccharomyces cerevisiae DASH forms both rings and
           spiral structures on microtubules in vitro. Components
           of the DASH complex, including Dam1, Duo1, Spc34, Dad1
           and Ask1, are essential and connect the centromere to
           the plus end of spindle microtubules.
          Length = 255

 Score = 29.4 bits (66), Expect = 9.7
 Identities = 23/127 (18%), Positives = 54/127 (42%), Gaps = 8/127 (6%)

Query: 53  QAEEETNMSYLKKKGVVAERKEAKIEKDEAEKYQRIREEIVAK--------EVEHQLFKL 104
           +A+ E   S +         +E  +E+   E   R  EE+           ++E    + 
Sbjct: 129 KADSEIKQSAVALVLGSVLTEEITVEEFNVEVLLRGAEELCDVYPLPGAREKIETLRDRY 188

Query: 105 YHNETDIKELEDELDKKKGEVEKIERRKEKAENILREKKKEQGALNRELAKVDQEIREMD 164
                 I ELE E+ ++K ++E +        +   E ++++  ++  + K ++EIRE++
Sbjct: 189 NELSESIAELEAEIAEQKRQLEIMNISSSSPSSDTSESEEDEIDIDELIRKEEEEIRELE 248

Query: 165 VEINKKR 171
            ++ +  
Sbjct: 249 RQLRQLE 255


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.128    0.341 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 53,777,412
Number of extensions: 5548088
Number of successful extensions: 14130
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10875
Number of HSP's successfully gapped: 1697
Length of query: 1070
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 963
Effective length of database: 6,191,724
Effective search space: 5962630212
Effective search space used: 5962630212
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 64 (28.5 bits)