RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy588
         (903 letters)



>gnl|CDD|215951 pfam00498, FHA, FHA domain.  The FHA (Forkhead-associated) domain
           is a phosphopeptide binding motif.
          Length = 67

 Score = 71.9 bits (177), Expect = 1e-15
 Identities = 34/79 (43%), Positives = 41/79 (51%), Gaps = 14/79 (17%)

Query: 33  VKIGRCIARAKVAPSNGI-FDCKVLSRNHALIWYYNG-KFYLQDTKSSNGTFVNNQRLSK 90
           V IGR       +P   I  D   +SR HA I Y  G +FYL+D  S+NGTFVN QRL  
Sbjct: 1   VTIGR-------SPDCDIVLDDPSVSRRHAEIRYDGGGRFYLEDLGSTNGTFVNGQRLGP 53

Query: 91  TNEESAPHELCSGDVVQFG 109
                 P  L  GDV++ G
Sbjct: 54  E-----PVRLRDGDVIRLG 67


>gnl|CDD|238017 cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic
           and prokaryotic proteins. Putative nuclear signalling
           domain. FHA domains may bind phosphothreonine,
           phosphoserine and sometimes phosphotyrosine. In
           eukaryotes, many FHA domain-containing proteins localize
           to the nucleus, where they participate in establishing
           or maintaining cell cycle checkpoints, DNA repair, or
           transcriptional regulation. Members of the FHA family
           include: Dun1, Rad53,  Cds1, Mek1,
           KAPP(kinase-associated protein phosphatase),and Ki-67 (a
           human nuclear protein related to cell proliferation).
          Length = 102

 Score = 66.6 bits (163), Expect = 2e-13
 Identities = 30/82 (36%), Positives = 38/82 (46%), Gaps = 10/82 (12%)

Query: 33  VKIGRCIARAKVAPSNGIFDCKVLSRNHALIWYYN-GKFYLQDTKSSNGTFVNNQRLSKT 91
             IGR          + + D   +SR HA+I Y   G   L D  S+NGTFVN QR+S  
Sbjct: 24  YTIGRDSDNC-----DIVLDDPSVSRRHAVIRYDGDGGVVLIDLGSTNGTFVNGQRVSPG 78

Query: 92  NEESAPHELCSGDVVQFGVDVI 113
                P  L  GDV++ G   I
Sbjct: 79  E----PVRLRDGDVIRLGNTSI 96


>gnl|CDD|214578 smart00240, FHA, Forkhead associated domain.  Found in eukaryotic
          and prokaryotic proteins. Putative nuclear signalling
          domain.
          Length = 52

 Score = 51.8 bits (125), Expect = 9e-09
 Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 6/57 (10%)

Query: 33 VKIGRCIARAKVAPSNGIFDCKVLSRNHALIWY-YNGKFYLQDTKSSNGTFVNNQRL 88
          V IGR      +       D   +SR HA+I Y   G+FYL D  S+NGTFVN +R+
Sbjct: 1  VTIGRSSEDCDI-----QLDGPSISRRHAVIVYDGGGRFYLIDLGSTNGTFVNGKRI 52


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 58.6 bits (142), Expect = 1e-08
 Identities = 107/639 (16%), Positives = 256/639 (40%), Gaps = 66/639 (10%)

Query: 151 SMPLEELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRV- 209
           ++ L +L +L   ++E  +  + L+ +L +LQ  + + +K  +     L +    L  + 
Sbjct: 228 ALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQ 287

Query: 210 -EFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTE 268
            E LE +            LR+ ++ L  +    E +L+E+ +K+ A K E+ ++    E
Sbjct: 288 EELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLE 347

Query: 269 RKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLE 328
                +       +EL ++    LEE+ + ++    +  E E+ + ++ N  +E    +E
Sbjct: 348 ELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIE 407

Query: 329 KLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNI-- 386
            L+    +  ++L+    +   +  +L+        + E++  ++ +L       + +  
Sbjct: 408 SLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELER 467

Query: 387 -LVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLRE---------QLHTA 436
            L +  +++ ++  +      R +  E+    S      +               +L   
Sbjct: 468 ELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGLPGVYGPVAELIKV 527

Query: 437 QEEYT---------SLQEIVNRNEEQ----LAFLEQSK-----YLHLDKIGGLELQLEKL 478
           +E+Y           LQ +V  NEE     + FL+++K     +L LD+I  L       
Sbjct: 528 KEKYETALEAALGNRLQAVVVENEEVAKKAIEFLKENKAGRATFLPLDRIKPLRSLKSDA 587

Query: 479 KQTYNSDCYENEKFGKAY-------NCDTLLIN----EEKTVAPNQKDVNIIDVSNQY-- 525
              +     +   F   Y         DTL+++      +     +    I+ +      
Sbjct: 588 APGFLGLASDLIDFDPKYEPAVRFVLGDTLVVDDLEQARRLARKLRIKYRIVTLDGDLVE 647

Query: 526 ---------NSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINL 576
                     +++  ++  +E++E   ++ ++ A++ K   EL   KN           L
Sbjct: 648 PSGSITGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKN----------EL 697

Query: 577 QEQFSLLEKSKETWVSLKLNSAESQS-SASLQSQLNDALDNIKKYKEEHSKMTMEIESLK 635
           +    LLE+ +     L+    E +   A+L+ +L      +++ +EE  ++  E+E L+
Sbjct: 698 RSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQ 757

Query: 636 EKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEE-EITQQ 694
           E+L+  + +   L E   KLK  I++ +    ++ E +   +++ +++E+ ++  E   +
Sbjct: 758 ERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELE 817

Query: 695 SLNPLSSELDETIHKNVESVTSNMENDKSNEDDNSSVQS 733
           SL      L++ I +  E +    E     E++   ++ 
Sbjct: 818 SLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEK 856



 Score = 57.0 bits (138), Expect = 4e-08
 Identities = 68/401 (16%), Positives = 165/401 (41%), Gaps = 58/401 (14%)

Query: 269  RKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLE 328
            R  +  + E   L+E L     +LE++ ++ +    + +  E  +++L    +E    LE
Sbjct: 660  RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLE 719

Query: 329  KLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILV 388
            +L+ E    +++L+    +   + ++L+        ++E++  ++ EL    +     L 
Sbjct: 720  ELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELE-SLEEALAKLK 778

Query: 389  KYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVN 448
            +  +++ +      E               +E++E+  ++   +L   + E  SL++   
Sbjct: 779  EEIEELEEKRQALQE--------------ELEELEEELEEAERRLDALERELESLEQRRE 824

Query: 449  RNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKT 508
            R E+++  LE+      +K+  LE +LE+L++                    L    E+ 
Sbjct: 825  RLEQEIEELEEEIEELEEKLDELEEELEELEKELEE----------------LKEELEEL 868

Query: 509  VAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVK 568
                               E +   +  E++E   + +++  E+ +  +EL + K    +
Sbjct: 869  -------------------EAEKEELEDELKELEEEKEELEEELRELESELAELKE---E 906

Query: 569  NEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMT 628
             EK    L+E  + LE+ +   +       E +   +L+++L   ++ +++  E    + 
Sbjct: 907  IEKLRERLEELEAKLERLEVE-LPELEEELEEEYEDTLETELEREIERLEEEIEALGPVN 965

Query: 629  M----EIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNV 665
            +    E E ++E+ +  +S+  DL E  EKL  VI++    
Sbjct: 966  LRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKE 1006



 Score = 50.1 bits (120), Expect = 5e-06
 Identities = 52/326 (15%), Positives = 131/326 (40%), Gaps = 29/326 (8%)

Query: 417 NSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLE 476
            S    ++  K+L E+L   + +   L+E +   + +L  LE        ++  LE QLE
Sbjct: 660 RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLE 719

Query: 477 KLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINK 536
           +LK+   +   E E+                     Q  +  ++   +   E+    + +
Sbjct: 720 ELKRELAALEEELEQL--------------------QSRLEELEEELEELEEEL-EELQE 758

Query: 537 EIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLN 596
            +EE   +++ +   + K   E+ + +      ++ +  L+E+    E+  +       +
Sbjct: 759 RLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELES 818

Query: 597 SAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLK 656
               Q    L+ ++ +  + I++ +E+  ++  E+E L+++L+  + +  +L  E E+L+
Sbjct: 819 --LEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELE 876

Query: 657 LVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVESVTS 716
             +K+ +     + E +   + +  + ++ IE+   +        E  E     +E    
Sbjct: 877 DELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELE---- 932

Query: 717 NMENDKSNEDDNSSVQSTDEVKDTEE 742
             E  +   +D    +   E++  EE
Sbjct: 933 --EELEEEYEDTLETELEREIERLEE 956



 Score = 50.1 bits (120), Expect = 5e-06
 Identities = 67/409 (16%), Positives = 161/409 (39%), Gaps = 81/409 (19%)

Query: 275 ENECSNLQELLDRNRDELEEI---IKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQ 331
           E +    +E L+R  D LEE+   ++K + Q  K + +           QE    L +L+
Sbjct: 178 ERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERY-----------QELKAELRELE 226

Query: 332 YETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYF 391
              L A+ +                        +++++  ++ EL               
Sbjct: 227 LALLLAKLK-----------------------ELRKELEELEEELS-------------- 249

Query: 392 DDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNE 451
               ++  + +E      E+E      IE+++   ++LRE+L   QEE   L+E +   E
Sbjct: 250 ----RLEEELEELQEELEEAE----KEIEELKSELEELREELEELQEELLELKEEIEELE 301

Query: 452 EQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLL------INE 505
            +++ L +      +++  LE +LE+LK+   +     E+  +       L      + E
Sbjct: 302 GEISLLRERLEELENELEELEERLEELKEKIEAL---KEELEERETLLEELEQLLAELEE 358

Query: 506 EKTVAPNQKDVNIIDVSNQY-NSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKN 564
            K     +    + ++   +    ++   +  E+ E   +++++  EI      L     
Sbjct: 359 AKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSE 418

Query: 565 CLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEH 624
            L   ++ +  L+ +   L+   E           ++    L+ QL +  D +K+ + E 
Sbjct: 419 RLEDLKEELKELEAELEELQTELEE---------LNEELEELEEQLEELRDRLKELEREL 469

Query: 625 SKMTMEIESLKEKLKFSQSKENDL---LEEYEKLKLVIKDYQNVLPSIV 670
           +++  E++ L+++L   +++ + L       + ++ V++  ++ LP + 
Sbjct: 470 AELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGLPGVY 518



 Score = 47.8 bits (114), Expect = 3e-05
 Identities = 46/281 (16%), Positives = 120/281 (42%), Gaps = 26/281 (9%)

Query: 420 EDIEKVTKDL-REQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKL 478
            ++ ++   L   +L   ++E   L+E ++R EE+L  L++       +I  L+ +LE+L
Sbjct: 220 AELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEEL 279

Query: 479 KQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQY-NSEKDFVSINKE 537
           ++       E  +  +                  + +  I  +  +    E +   + + 
Sbjct: 280 REELEELQEELLELKEEIE---------------ELEGEISLLRERLEELENELEELEER 324

Query: 538 IEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSL---- 593
           +EE   KI+ +  E+ +    L + +  L + E+    L+E+ S L +  E         
Sbjct: 325 LEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREE 384

Query: 594 --KLNSAESQSSA---SLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDL 648
             +L +  ++       L+ ++    + +++  E    +  E++ L+ +L+  Q++  +L
Sbjct: 385 LAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEEL 444

Query: 649 LEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEE 689
            EE E+L+  +++ ++ L  +   +   +++ Q  E+ +  
Sbjct: 445 NEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSS 485



 Score = 47.0 bits (112), Expect = 4e-05
 Identities = 64/318 (20%), Positives = 148/318 (46%), Gaps = 26/318 (8%)

Query: 154  LEELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLE 213
             EEL  L   ++        L+ +L +L+  + +LK+          + ++L SR+E LE
Sbjct: 687  EEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELA---ALEEELEQLQSRLEELE 743

Query: 214  SQLAAYSKSFTD-----DKLRDEIKLLHRDKSTYENQLKEV--MKKMLAEKV-EVMQKCQ 265
             +L    +   +     ++L +E++ L    +  + +++E+   ++ L E++ E+ ++ +
Sbjct: 744  EELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELE 803

Query: 266  DTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSN 325
            + ER+L  +E E  +L++  +R   E+EE+ ++ ++   K  E E  +++L    +E   
Sbjct: 804  EAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKE 863

Query: 326  TLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNEL-GLETKVDQ 384
             LE+L+ E  + + +L     +   + ++L+        +KE++  ++  L  LE K+++
Sbjct: 864  ELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLER 923

Query: 385  ------NILVKYFDDVHQILTQFDEYFLRFNESESSGL-----NSIEDIEKVTK---DLR 430
                   +  +  ++    L    E  +   E E   L      +IE+ E+V +   +L+
Sbjct: 924  LEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYEELK 983

Query: 431  EQLHTAQEEYTSLQEIVN 448
             Q    +E    L E++ 
Sbjct: 984  SQREDLEEAKEKLLEVIE 1001



 Score = 47.0 bits (112), Expect = 5e-05
 Identities = 63/405 (15%), Positives = 164/405 (40%), Gaps = 60/405 (14%)

Query: 226 DKLRDEIKLLHRDKSTYENQLKEVMK-KMLAEKVEVMQKCQDTERKLVNIENECSNLQEL 284
           ++L D ++ L +     E Q ++  + + L  ++  ++       KL  +  E   L+E 
Sbjct: 189 ERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELEL-ALLLAKLKELRKELEELEEE 247

Query: 285 LDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAF 344
           L R  +ELEE+ ++ +       E E  I++L +  +E    LE+LQ E L+ +++++  
Sbjct: 248 LSRLEEELEELQEELE-------EAEKEIEELKSELEELREELEELQEELLELKEEIEEL 300

Query: 345 TLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEY 404
             + +L+ ++L+        ++E++  +K ++    +  +  L +    + ++     E 
Sbjct: 301 EGEISLLRERLEELENELEELEERLEELKEKI----EALKEELEERETLLEELEQLLAEL 356

Query: 405 FLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLH 464
                E E      +E++E++ + LRE+L   + E   ++  +   + ++  LE+     
Sbjct: 357 EEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERL 416

Query: 465 LDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQ 524
            +++  L+ +L++L+                         EE      + +  +      
Sbjct: 417 SERLEDLKEELKELEAEL----------------------EELQTELEELNEEL------ 448

Query: 525 YNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLE 584
                    + +++EE   ++ ++  E+ +   EL   +  L   E  +  L+ +    +
Sbjct: 449 -------EELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQ 501

Query: 585 KSKETWVSLK------------LNSAESQSSASLQSQLNDALDNI 617
             +    +L+            L   + +   +L++ L + L  +
Sbjct: 502 GVRAVLEALESGLPGVYGPVAELIKVKEKYETALEAALGNRLQAV 546



 Score = 40.5 bits (95), Expect = 0.005
 Identities = 46/307 (14%), Positives = 117/307 (38%), Gaps = 27/307 (8%)

Query: 419 IEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSK-----YLHLD------- 466
           +   ++  ++   +L   +E    L++++   E+QL  LE+       Y  L        
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELE 226

Query: 467 ------KIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIID 520
                 K+  L  +LE+L++  +    E E+  +        I E K+     ++     
Sbjct: 227 LALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEEL 286

Query: 521 VSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQF 580
                  +++   +  EI     +++++  E+ +    L + K  +   ++ +   +   
Sbjct: 287 QEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLL 346

Query: 581 SLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDN-IKKYKEEHSKMTMEIESLKEKLK 639
             LE+               +      S L + L+   +  +EE +++  E+  ++ +L+
Sbjct: 347 EELEQLL------AELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELE 400

Query: 640 FSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPL 699
             + +   L E  E+L   ++D +  L  +   +   + + ++  + +EE   ++ L  L
Sbjct: 401 ELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEE--LEEQLEEL 458

Query: 700 SSELDET 706
              L E 
Sbjct: 459 RDRLKEL 465


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 57.8 bits (140), Expect = 2e-08
 Identities = 86/519 (16%), Positives = 205/519 (39%), Gaps = 48/519 (9%)

Query: 153 PLEELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDR-LLSRVEF 211
            L EL +L   ++E ++R   L+ ++ +L+  +  L+   +   + L+++ + L  R+E 
Sbjct: 275 ELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLE-ELEELLEKLKSLEERLEK 333

Query: 212 LESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKL 271
           LE +L          KL  E++ L  +K+     L+E +K+               E +L
Sbjct: 334 LEEKLE---------KLESELEELAEEKNELAKLLEERLKE--------------LEERL 370

Query: 272 VNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQ 331
             +E E     E L +  + ++E+ ++  +     +E +  +++L    +E    LE+L+
Sbjct: 371 EELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELE 430

Query: 332 YETLQAQKQLDAFTLKDNLMH----DKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNIL 387
            E  + ++Q++    K+ ++        K       + +E    +     LE +  +   
Sbjct: 431 EEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEE-E 489

Query: 388 VKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIV 447
           +    +  ++  + +E      E E   +  +E  E + ++L E+L   +     L+E+ 
Sbjct: 490 LSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELK 549

Query: 448 NRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEK 507
            + +       Q     L ++     +L++L +         E+  +       L  + K
Sbjct: 550 EKLQ------LQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLK 603

Query: 508 TVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITK--CTNELVDAKNC 565
                     + +  +Q       + +++   E     +++ +E+ K     EL +    
Sbjct: 604 E---------LEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQA 654

Query: 566 LVKN-EKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEH 624
            ++  E+ V  L+ +     +  E    L+    E +       QL + L+ + K   E 
Sbjct: 655 ALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEI 714

Query: 625 SKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQ 663
            ++  E+ES K +L+  + +   L +  E L+ + +   
Sbjct: 715 EQLIEELESRKAELEELKKELEKLEKALELLEELREKLG 753



 Score = 55.5 bits (134), Expect = 1e-07
 Identities = 84/503 (16%), Positives = 198/503 (39%), Gaps = 19/503 (3%)

Query: 225 DDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQEL 284
           + +L + ++ +       E +LKE+ K    ++ +  ++ +     L     E    +E 
Sbjct: 191 EGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKER 250

Query: 285 LDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAF 344
           L+  +  L EI     + +   +E    +++LL   +EK   LE+L+ E  + +++L+  
Sbjct: 251 LEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGL 310

Query: 345 TLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEY 404
                 + + L+    +     EK+     +L  E +       +    + + L + +E 
Sbjct: 311 RALLEELEELLEKLK-SLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEER 369

Query: 405 FLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLH 464
                +     L  ++ +E+  ++L+E+L         +QE +   E++L  LE+     
Sbjct: 370 LEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEEL 429

Query: 465 LDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQ 524
            ++I  LE Q+ +L+              K   C   L  E +       ++ + ++  +
Sbjct: 430 EEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEE 489

Query: 525 YNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLE 584
            + EK+   + +EIEE  +++ ++  E+ +        K  L +  + + NL E+   L+
Sbjct: 490 LSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELK 549

Query: 585 KSKETWVSLKLNSAESQSSASLQSQLNDALDN---IKKYKEEHSKMTMEIESLKEKLKFS 641
           +  +          E +       +L + L+    ++  KEE  ++   ++ LK+KLK  
Sbjct: 550 EKLQLQQ----LKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKEL 605

Query: 642 QSKENDL---------LEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIE--EE 690
           + + + L          E   +L+   ++ ++ L  +       +      E+  E  EE
Sbjct: 606 EERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEE 665

Query: 691 ITQQSLNPLSSELDETIHKNVES 713
           +  +    L    +E   +    
Sbjct: 666 LEAEIRRELQRIENEEQLEEKLE 688



 Score = 55.2 bits (133), Expect = 1e-07
 Identities = 77/511 (15%), Positives = 200/511 (39%), Gaps = 38/511 (7%)

Query: 168 LQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTD-D 226
           L+ E++    L   +  + +L++  +   + + + + L   +E LE +L        + +
Sbjct: 259 LEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELE 318

Query: 227 KLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLD 286
           +L +++K L       E +L+++  ++     E  +  +  E +L  +E     L++ L+
Sbjct: 319 ELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELE 378

Query: 287 RNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTL 346
           +  + L+++ +  Q+   +  E  + ++++    +E    LE+L+ E  + ++++     
Sbjct: 379 KALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEE 438

Query: 347 KDNLMHDKLKNYYM-----------NFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVH 395
           + N +  K                    + +E    +     LE +  +   +    +  
Sbjct: 439 QINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEE-ELSREKEEA 497

Query: 396 QILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLA 455
           ++  + +E      E E   +  +E  E + ++L E+L   +     L+E+  + + Q  
Sbjct: 498 ELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQ-- 555

Query: 456 FLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKD 515
                    L ++     +L++L +         E+  +       L  + K        
Sbjct: 556 ----QLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKE------- 604

Query: 516 VNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITK--CTNELVDAKNCLVKN-EKW 572
             + +  +Q       + +++   E     +++ +E+ K     EL +     ++  E+ 
Sbjct: 605 --LEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEK 662

Query: 573 VINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIE 632
           V  L+ +     +  E    L+    E +       QL + L+ + K   E       IE
Sbjct: 663 VEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGE-------IE 715

Query: 633 SLKEKLKFSQSKENDLLEEYEKLKLVIKDYQ 663
            L E+L+  +++  +L +E EKL+  ++  +
Sbjct: 716 QLIEELESRKAELEELKKELEKLEKALELLE 746



 Score = 34.4 bits (79), Expect = 0.31
 Identities = 45/271 (16%), Positives = 110/271 (40%), Gaps = 33/271 (12%)

Query: 419 IEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKL 478
           +E  EK+++ L+E +  A+ +   L+  ++   E +  L ++    L ++  LE   E+ 
Sbjct: 166 LEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQ 225

Query: 479 KQTYNSDCYE--NEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSE-KDFVSIN 535
           ++       E   E+  +       L   +  +   +     ++       E ++   + 
Sbjct: 226 EEEELEQEIEALEERLAELEEEKERLEELKARLLEIES--LELEALKIREEELRELERLL 283

Query: 536 KEIEESSRKIDDINAEITKCTNELVDAKNCLVKNE--------------------KWVIN 575
           +E+EE   +++++  EI +   EL   +  L + E                    + + +
Sbjct: 284 EELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLES 343

Query: 576 LQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLK 635
             E+ +  +      +  +L   E +    L+ +L  AL+ +K+ +E        I+ LK
Sbjct: 344 ELEELAEEKNELAKLLEERLKELEERLEE-LEKELEKALERLKQLEE-------AIQELK 395

Query: 636 EKLKFSQSKENDLLEEYEKLKLVIKDYQNVL 666
           E+L    +   ++ EE E+L+  +++ +  L
Sbjct: 396 EELAELSAALEEIQEELEELEKELEELEREL 426



 Score = 33.6 bits (77), Expect = 0.67
 Identities = 41/195 (21%), Positives = 79/195 (40%), Gaps = 23/195 (11%)

Query: 138 KEAKASPTSMLISSMPLEELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWK 197
           +E K     + +     EEL +L   ++E  ++   L+ +LS+L+               
Sbjct: 569 QELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLE--------------- 613

Query: 198 ALIQEDRLLSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEK 257
            L+Q   L      LE                +++ L    +   +  L+E+ +K+   +
Sbjct: 614 ELLQSLELSEAENELEEAEEELE------SELEKLNLQAELEELLQAALEELEEKVEELE 667

Query: 258 VEVMQKCQDTER--KLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQ 315
            E+ ++ Q  E   +L     E   L+E L++ R+ELEE++KK  +     +E ES   +
Sbjct: 668 AEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAE 727

Query: 316 LLNSNQEKSNTLEKL 330
           L    +E     + L
Sbjct: 728 LEELKKELEKLEKAL 742


>gnl|CDD|224630 COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms].
          Length = 191

 Score = 54.2 bits (130), Expect = 3e-08
 Identities = 24/91 (26%), Positives = 40/91 (43%), Gaps = 12/91 (13%)

Query: 28  NKNAAVKIGRCIARAKVAPSNGIFDCKVLSRNHALIWYYNGKFYLQDTKSSNGTFVNNQR 87
                  IGR         ++ + D  V+SR HA +     + +L+D  S+NGT+VN ++
Sbjct: 86  LGEPVTTIGR------DPDNDIVLDDDVVSRRHAELRREGNEVFLEDLGSTNGTYVNGEK 139

Query: 88  LSKTNEESAPHELCSGDVVQFGVDVIETNRN 118
           +           L  GDV++ G  + E  R 
Sbjct: 140 V------RQRVLLQDGDVIRLGGTLAERLRI 164


>gnl|CDD|234178 TIGR03354, VI_FHA, type VI secretion system FHA domain protein.
           Members of this protein family are FHA
           (forkhead-associated) domain-containing proteins that
           are part of type VI secretion loci in a considerable
           number of bacteria, most of which are known pathogens.
           Species include Pseudomonas aeruginosa PAO1, Aeromonas
           hydrophila, Yersinia pestis, Burkholderia mallei, etc
           [Protein fate, Protein and peptide secretion and
           trafficking, Cellular processes, Pathogenesis].
          Length = 396

 Score = 50.8 bits (122), Expect = 2e-06
 Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 7/58 (12%)

Query: 54  KVLSRNHALIWYYNGKFYLQDTKSSNGTFVNN--QRLSKTNEESAPHELCSGDVVQFG 109
           + +S  HA I Y +G + L D  S+NG F+N     L + N    P  L  GD ++ G
Sbjct: 43  RHVSGRHARIRYRDGAYLLTDL-STNGVFLNGSGSPLGRGN----PVRLEQGDRLRLG 95


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 50.1 bits (120), Expect = 6e-06
 Identities = 45/217 (20%), Positives = 102/217 (47%), Gaps = 6/217 (2%)

Query: 528 EKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSK 587
           E+   +I +++     +++ +  EI++    L + +  L +  K + +L E+  L  K K
Sbjct: 236 ERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEK 295

Query: 588 ETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKEND 647
              +  ++ S E   +   + +L DA + + K + E  K+  EIE L+ +++  + + + 
Sbjct: 296 IGELEAEIASLERSIAEKER-ELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDK 354

Query: 648 LLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETI 707
           L EEY +LK  ++D +  L  + +    T+D+ +D  + +E+   ++ +N L  ELD   
Sbjct: 355 LTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEK--LKREINELKRELDRLQ 412

Query: 708 H--KNVESVTSNMEND-KSNEDDNSSVQSTDEVKDTE 741
              + +    +++       E   + ++   E K  E
Sbjct: 413 EELQRLSEELADLNAAIAGIEAKINELEEEKEDKALE 449



 Score = 43.1 bits (102), Expect = 8e-04
 Identities = 66/394 (16%), Positives = 154/394 (39%), Gaps = 84/394 (21%)

Query: 281 LQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQ 340
               L R R+ LE + ++     ++ +  E+ +D+L     + S  + +++ E  Q +++
Sbjct: 672 EPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQE 731

Query: 341 LDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQ 400
            +          ++L+    +   +++++ N+K+EL    +++  I              
Sbjct: 732 EEKLK-------ERLEELEEDLSSLEQEIENVKSEL---KELEARI-------------- 767

Query: 401 FDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQS 460
            +E     ++ E + LN +E      +    ++   Q E + L+E V+R E +L  +EQ 
Sbjct: 768 -EELEEDLHKLEEA-LNDLEA-----RLSHSRIPEIQAELSKLEEEVSRIEARLREIEQK 820

Query: 461 KYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIID 520
               L+++   +  LEK  Q                                      ID
Sbjct: 821 ----LNRLTLEKEYLEKEIQE--------------------------------LQEQRID 844

Query: 521 VSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQF 580
           +  Q  S +      KEIE  + K +++  E+ +    L D ++ L   +K    L+ Q 
Sbjct: 845 LKEQIKSIE------KEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQL 898

Query: 581 SLLEKSKETWVS----LKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKE 636
             LE+  E   +     +   +E ++      +    +++ K   EE  +  + +E ++ 
Sbjct: 899 RELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQA 958

Query: 637 KLKFSQSKENDL-------LEEYEKLKLVIKDYQ 663
           +L+  + +   L       ++EYE++   + + +
Sbjct: 959 ELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELK 992



 Score = 42.7 bits (101), Expect = 9e-04
 Identities = 56/327 (17%), Positives = 140/327 (42%), Gaps = 23/327 (7%)

Query: 226 DKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELL 285
           D+ R +++ L R++   E     + +K   E  E++++ +  ER+   IE + ++L+E L
Sbjct: 194 DEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEEL 253

Query: 286 DRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQ-EKSNTLEKLQYETLQAQKQLDAF 344
           ++  +E+ E+ K+ ++     +E    I  L    Q      + +L+ E    ++ +   
Sbjct: 254 EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEK 313

Query: 345 TLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEY 404
             +     ++L         +  ++  ++ E+  E +  ++ L + + ++       +E 
Sbjct: 314 ERELEDAEERLAKLEAEIDKLLAEIEELEREIE-EERKRRDKLTEEYAELK------EEL 366

Query: 405 FLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLH 464
                E E       E  ++  KD RE+L   + E   L+  ++R +E+L  L +     
Sbjct: 367 EDLRAELEEVDKEFAETRDE-LKDYREKLEKLKREINELKRELDRLQEELQRLSEELADL 425

Query: 465 LDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQ 524
              I G+E ++ +L++       E +K             +E  +     D++  +    
Sbjct: 426 NAAIAGIEAKINELEEEKEDKALEIKK-------------QEWKLEQLAADLSKYE-QEL 471

Query: 525 YNSEKDFVSINKEIEESSRKIDDINAE 551
           Y+ ++++  + KE+ +  R++ +  A+
Sbjct: 472 YDLKEEYDRVEKELSKLQRELAEAEAQ 498



 Score = 41.2 bits (97), Expect = 0.003
 Identities = 46/281 (16%), Positives = 107/281 (38%), Gaps = 30/281 (10%)

Query: 466 DKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIID----- 520
           +++ GL+ +L  L+        EN     +      L +  + +   +K++  ++     
Sbjct: 681 ERLEGLKRELSSLQSELRR--IENRLDELS----QELSDASRKIGEIEKEIEQLEQEEEK 734

Query: 521 -VSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVK-----NEKWVI 574
                   E+D  S+ +EIE    ++ ++ A I +   +L   +  L       +   + 
Sbjct: 735 LKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIP 794

Query: 575 NLQEQFSLLEKSKETWVSL------KLNSAE------SQSSASLQSQLNDALDNIKKYKE 622
            +Q + S LE+      +       KLN          +    LQ Q  D  + IK  ++
Sbjct: 795 EIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEK 854

Query: 623 EHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQD 682
           E   +  + E L+E+L+  ++   DL      LK    + +  L  +   +   + + + 
Sbjct: 855 EIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEK 914

Query: 683 SEQSIEE-EITQQSLNPLSSELDETIHKNVESVTSNMENDK 722
             + + E +   ++L    SE+++   ++ E     +  + 
Sbjct: 915 KRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLED 955



 Score = 38.1 bits (89), Expect = 0.023
 Identities = 56/279 (20%), Positives = 119/279 (42%), Gaps = 18/279 (6%)

Query: 418 SIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSK-YLHLDKIGGLELQLE 476
           +IE ++ +  + R+QL   + E    +E   R +  L    + + Y  L +   LE Q E
Sbjct: 185 NIERLDLIIDEKRQQLERLRRE----REKAERYQALLKEKREYEGYELLKEKEALERQKE 240

Query: 477 KLKQTYNSDCYENEKFGKAYNCDTLLINE-EKTVAPNQKDVNIIDVSNQYNSEKDFVSIN 535
            +++   S   E EK        T  I+E EK +   ++ +  ++   +   E++ + + 
Sbjct: 241 AIERQLASLEEELEKL-------TEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVK 293

Query: 536 KEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKL 595
           ++I E   +I  +   I +   EL DA+  L K E  +  L  +   LE+  E     + 
Sbjct: 294 EKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRD 353

Query: 596 N-SAESQSSAS----LQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLE 650
             + E          L+++L +      + ++E      ++E LK ++   + + + L E
Sbjct: 354 KLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQE 413

Query: 651 EYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEE 689
           E ++L   + D    +  I   +N  +++ +D    I++
Sbjct: 414 ELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKK 452



 Score = 37.4 bits (87), Expect = 0.037
 Identities = 51/258 (19%), Positives = 113/258 (43%), Gaps = 32/258 (12%)

Query: 228 LRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDR 287
           LR+ ++ L R+ S+ +++L+ +  ++     E+ Q+  D  RK+  IE E   L++  ++
Sbjct: 679 LRERLEGLKRELSSLQSELRRIENRLD----ELSQELSDASRKIGEIEKEIEQLEQEEEK 734

Query: 288 NRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLK 347
            ++ LEE+ +       + +  +S + +L    +E    L KL       ++ L+   L+
Sbjct: 735 LKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKL-------EEALND--LE 785

Query: 348 DNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLR 407
             L H ++         I+ +++ ++ E+        + +     ++ Q L +       
Sbjct: 786 ARLSHSRIPE-------IQAELSKLEEEV--------SRIEARLREIEQKLNRLT-LEKE 829

Query: 408 FNESESSGL-NSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLD 466
           + E E   L     D+++  K + +++     +   L+E +   E  L  LE S+   L 
Sbjct: 830 YLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLE-SRLGDLK 888

Query: 467 K-IGGLELQLEKLKQTYN 483
           K    LE QL +L++   
Sbjct: 889 KERDELEAQLRELERKIE 906



 Score = 35.8 bits (83), Expect = 0.11
 Identities = 45/261 (17%), Positives = 114/261 (43%), Gaps = 27/261 (10%)

Query: 200 IQEDRLLSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVE 259
            +   L S +  +E++L         D+L  E+    R     E +++++ ++    K  
Sbjct: 688 RELSSLQSELRRIENRL---------DELSQELSDASRKIGEIEKEIEQLEQEEEKLKER 738

Query: 260 VMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVT--KNKEFESTIDQLL 317
           + +  +D       IEN  S L+EL  R  +ELEE + K ++ +   + +   S I ++ 
Sbjct: 739 LEELEEDLSSLEQEIENVKSELKELEAR-IEELEEDLHKLEEALNDLEARLSHSRIPEIQ 797

Query: 318 NSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELG 377
               +    + +++    + +++L+  TL+   +  +++        ++E+  ++K ++ 
Sbjct: 798 AELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQE-------LQEQRIDLKEQIK 850

Query: 378 LETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQ 437
              K  +N+  K      ++  + +E      + ES     + D++K   +L  QL   +
Sbjct: 851 SIEKEIENLNGK----KEELEEELEELEAALRDLES----RLGDLKKERDELEAQLRELE 902

Query: 438 EEYTSLQEIVNRNEEQLAFLE 458
            +   L+  + +  ++L+ L+
Sbjct: 903 RKIEELEAQIEKKRKRLSELK 923



 Score = 31.2 bits (71), Expect = 3.4
 Identities = 60/315 (19%), Positives = 116/315 (36%), Gaps = 35/315 (11%)

Query: 166 EALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTD 225
           E L+RE     +   L     + +      ++ L +++ L  + E +E QLA+  +    
Sbjct: 201 ERLRREREKAERYQALLKEKREYE-----GYELLKEKEALERQKEAIERQLASLEEELEK 255

Query: 226 -DKLRDEI-KLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTE---RKLVNIENECSN 280
             +   E+ K L   +   E   K++      E++ V +K  + E     L     E   
Sbjct: 256 LTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKER 315

Query: 281 LQELLDRNRDELEEIIKKYQDQVTKNK----EFESTIDQLLNSNQEKSNTLEKLQYETLQ 336
             E  +    +LE  I K   ++ + +    E     D+L     E    LE L+ E  +
Sbjct: 316 ELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEE 375

Query: 337 AQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQ 396
             K+      +D L   + K          EK+    NEL  E    Q  L +  +++  
Sbjct: 376 VDKEFA--ETRDELKDYREK---------LEKLKREINELKRELDRLQEELQRLSEELAD 424

Query: 397 ILTQFDEYFLRFNESESSGLNSIEDIEKVT----------KDLREQLHTAQEEYTSLQEI 446
           +         + NE E    +   +I+K                ++L+  +EEY  +++ 
Sbjct: 425 LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKE 484

Query: 447 VNRNEEQLAFLEQSK 461
           +++ + +LA  E   
Sbjct: 485 LSKLQRELAEAEAQA 499



 Score = 30.8 bits (70), Expect = 3.9
 Identities = 62/375 (16%), Positives = 148/375 (39%), Gaps = 43/375 (11%)

Query: 269 RKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKN-KEFESTIDQLLNSNQEKSNTL 327
           RK+++   E + + E   +    LEE+     ++V +N +  +  ID       EK   L
Sbjct: 156 RKIID---EIAGVAEFDRKKEKALEEL-----EEVEENIERLDLIID-------EKRQQL 200

Query: 328 EKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNIL 387
           E+L+ E  +A++      L       +          ++ +   I+ +L    +  + + 
Sbjct: 201 ERLRREREKAERYQA---LLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLT 257

Query: 388 VKYFD---DVHQILTQFDEYFLRFNESESSGLNSI-EDIEKVTKDLREQLHTAQEEYTSL 443
            +  +    + +I    +E   +  +        + E I ++  +    L  +  E    
Sbjct: 258 EEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAE-IASLERSIAEKERE 316

Query: 444 QEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLI 503
            E     EE+LA LE        +I  L  ++E+L++    +    +K  + Y       
Sbjct: 317 LE---DAEERLAKLEA-------EIDKLLAEIEELEREIEEERKRRDKLTEEY------A 360

Query: 504 NEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAK 563
             ++ +   + ++  +D       ++      +++E+  R+I+++  E+ +   EL    
Sbjct: 361 ELKEELEDLRAELEEVDKEFAETRDE-LKDYREKLEKLKREINELKRELDRLQEELQRLS 419

Query: 564 NCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEE 623
             L      +  ++ + + LE+ KE   +L++   E +    L + L+     +   KEE
Sbjct: 420 EELADLNAAIAGIEAKINELEEEKED-KALEIKKQEWKLE-QLAADLSKYEQELYDLKEE 477

Query: 624 HSKMTMEIESLKEKL 638
           + ++  E+  L+ +L
Sbjct: 478 YDRVEKELSKLQREL 492


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
            bacterial type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. This family
            represents the SMC protein of most bacteria. The smc gene
            is often associated with scpB (TIGR00281) and scpA genes,
            where scp stands for segregation and condensation
            protein. SMC was shown (in Caulobacter crescentus) to be
            induced early in S phase but present and bound to DNA
            throughout the cell cycle [Cellular processes, Cell
            division, DNA metabolism, Chromosome-associated
            proteins].
          Length = 1179

 Score = 46.2 bits (110), Expect = 7e-05
 Identities = 63/350 (18%), Positives = 142/350 (40%), Gaps = 29/350 (8%)

Query: 307  KEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIK 366
            +E E  I++L     E    L +L+ E  + +++L+    +   +  ++     +   ++
Sbjct: 680  EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLE 739

Query: 367  EKMTNIK---NELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLN-SIEDI 422
             ++  ++    +L  E    +  + +  + + +   +  E      E+E   L   IE +
Sbjct: 740  AEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAE-----AEAEIEELEAQIEQL 794

Query: 423  EKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTY 482
            ++  K LRE L   + E T L E      E+L  LE+       ++  LE Q+E+L +  
Sbjct: 795  KEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI 854

Query: 483  NSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESS 542
             S     E        + L    E     N++      ++       +   +++E+ E  
Sbjct: 855  ES--LAAEIEELEELIEEL--ESELEALLNERASLEEALALL---RSELEELSEELRELE 907

Query: 543  RKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFS-----LLEKSKETWVSLKLNS 597
             K  ++  E+ +   +L   +  L   E  + NLQE+ S      LE+++     ++ + 
Sbjct: 908  SKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDE 967

Query: 598  AESQSS-ASLQSQLN-------DALDNIKKYKEEHSKMTMEIESLKEKLK 639
             E++     L++++         A++  ++ KE +  +T + E L E  +
Sbjct: 968  EEARRRLKRLENKIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKE 1017



 Score = 45.4 bits (108), Expect = 1e-04
 Identities = 47/264 (17%), Positives = 104/264 (39%), Gaps = 19/264 (7%)

Query: 399 TQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLE 458
            + +E      E E      +E++     +L E++   Q+E  +L   ++R E+Q   L 
Sbjct: 253 EELEELTAELQELEEK----LEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILR 308

Query: 459 QSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNI 518
           +       ++  LE QLE+L+     D    E        + L    E   A  ++    
Sbjct: 309 ERLANLERQLEELEAQLEELESK--LDELAEELAELEEKLEELKEELESLEAELEELEAE 366

Query: 519 IDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCL----VKNEKWVI 574
           ++       E +     +++E    K+  +  +I    NE+   +  L     + E+   
Sbjct: 367 LEELESRLEELE-----EQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ 421

Query: 575 NLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESL 634
            ++E    LE+++   +  +L   E +    LQ +L    + +++ +EE  +    +++ 
Sbjct: 422 EIEELLKKLEEAELKELQAELEELE-EELEELQEELERLEEALEELREELEEAEQALDAA 480

Query: 635 KEKLKFSQSKEN---DLLEEYEKL 655
           + +L   Q++ +    L E  E  
Sbjct: 481 ERELAQLQARLDSLERLQENLEGF 504



 Score = 45.0 bits (107), Expect = 2e-04
 Identities = 47/285 (16%), Positives = 117/285 (41%), Gaps = 33/285 (11%)

Query: 203 DRLLSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQ 262
           + L  ++E LE ++A          L   +  L ++    E +L+++      E  E+ +
Sbjct: 680 EELEEKIEELEEKIAE---------LEKALAELRKELEELEEELEQL----RKELEELSR 726

Query: 263 KCQDTERKLVNIENECSNLQELLDRNRDELEEI---IKKYQDQVTKNKEFESTIDQLLNS 319
           +     + L  +E E   L+E + +   EL E+   I++ ++++ + +E  +  +  +  
Sbjct: 727 QISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEE 786

Query: 320 NQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLE 379
            + +   +E+L+ E    ++ LD           +L         ++E++ +++  +   
Sbjct: 787 LEAQ---IEQLKEELKALREALDEL-------RAELTLLNEEAANLRERLESLERRIAAT 836

Query: 380 TKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKD---LREQLHTA 436
            +  ++ L +  +++ + +           E E        ++E +  +   L E L   
Sbjct: 837 ERRLED-LEEQIEELSEDIE---SLAAEIEELEELIEELESELEALLNERASLEEALALL 892

Query: 437 QEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQT 481
           + E   L E +   E + + L +      +K+  LEL+LE L+  
Sbjct: 893 RSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVR 937



 Score = 38.1 bits (89), Expect = 0.024
 Identities = 45/259 (17%), Positives = 102/259 (39%), Gaps = 20/259 (7%)

Query: 227 KLRDEIKLLHRDKSTYE-NQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELL 285
           +L+ E++ L          +L+E ++++  E  E  ++ ++   +L         L+E L
Sbjct: 217 ELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAEL-------QELEEKL 269

Query: 286 DRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFT 345
           +  R E+ E+ ++ ++   +     + I +L    Q     L  L+ +  + + QL+   
Sbjct: 270 EELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEEL- 328

Query: 346 LKDNLMHDKLKNYYMNFHIIKEKMTNIKNEL-GLETKVDQNILVKYFDDVHQILTQFDEY 404
                   KL         ++EK+  +K EL  LE ++++  L    +++   L + +E 
Sbjct: 329 ------ESKLDELAEELAELEEKLEELKEELESLEAELEE--LEAELEELESRLEELEEQ 380

Query: 405 FLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLH 464
                   +     I  +    + L  +L   ++    LQ+     E      E      
Sbjct: 381 LETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ--EIEELLKKLEEAELKEL 438

Query: 465 LDKIGGLELQLEKLKQTYN 483
             ++  LE +LE+L++   
Sbjct: 439 QAELEELEEELEELQEELE 457



 Score = 35.8 bits (83), Expect = 0.12
 Identities = 106/616 (17%), Positives = 248/616 (40%), Gaps = 90/616 (14%)

Query: 155 EELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLES 214
           +ELY L   I    Q++ +L+ +L+ L+     L++      +   + D L   +  LE 
Sbjct: 288 KELYALANEISRLEQQKQILRERLANLE---RQLEELEAQLEELESKLDELAEELAELEE 344

Query: 215 QLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTER---KL 271
           +L    +      L  E++ L  +    E++L+E+ +++   + +V Q          ++
Sbjct: 345 KLEELKEELES--LEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEI 402

Query: 272 VNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQ 331
             +E     L++  +R + E+EE++KK ++   + KE ++ +++L    +E    LE+L+
Sbjct: 403 ERLEARLERLEDRRERLQQEIEELLKKLEEA--ELKELQAELEELEEELEELQEELERLE 460

Query: 332 YETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYF 391
               + +++L+      +    +L         ++    N++                + 
Sbjct: 461 EALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEG---------------FS 505

Query: 392 DDVHQILTQFDEYFLRFNESESSG-LNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRN 450
           + V  +L          N+S  SG L  + ++  V +     +  A      LQ +V  N
Sbjct: 506 EGVKALLK---------NQSGLSGILGVLSELISVDEGYEAAIEAALGG--RLQAVVVEN 554

Query: 451 EEQ----LAFLEQSK-----YLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTL 501
                  +AFL+Q++     +L LD I G E+Q    +   N + +     G A +    
Sbjct: 555 LNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGF----LGVAKDLVKF 610

Query: 502 LINEEKTVAPNQKDVNIID-------VSNQYNSEKDFVSINKE----------------- 537
                K ++     V ++D       ++ +       V+++ +                 
Sbjct: 611 DPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTNS 670

Query: 538 -IEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEK---SKETWVSL 593
            I E  R+I+++  +I +   ++ + +  L +  K +  L+E+   L K        +S 
Sbjct: 671 SILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISA 730

Query: 594 KLNSAE---------SQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSK 644
                           +  A L  +L +    I++ +E   +   E+   + +++  +++
Sbjct: 731 LRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQ 790

Query: 645 ENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELD 704
              L EE + L+  + + +  L  + E   N +++ +  E+ I    T++ L  L  ++ 
Sbjct: 791 IEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAA--TERRLEDLEEQI- 847

Query: 705 ETIHKNVESVTSNMEN 720
           E + +++ES+ + +E 
Sbjct: 848 EELSEDIESLAAEIEE 863



 Score = 33.1 bits (76), Expect = 0.84
 Identities = 48/287 (16%), Positives = 117/287 (40%), Gaps = 44/287 (15%)

Query: 427 KDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDC 486
           ++L E++   +E+   L++ +    ++L  LE+              +LE+L++      
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKELEELEE--------------ELEQLRKE----- 720

Query: 487 YENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKID 546
              E   +       L   E  V                  E+    ++KE+ E   +I+
Sbjct: 721 -LEELSRQISALRKDLARLEAEVE---------------QLEERIAQLSKELTELEAEIE 764

Query: 547 DINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLN---SAESQSS 603
           ++   + +   EL +A+  + + E  +  L+E+   L ++ +     +L    +  ++ +
Sbjct: 765 ELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALD-----ELRAELTLLNEEA 819

Query: 604 ASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQ 663
           A+L+ +L      I   +     +  +IE L E ++   ++  +L E  E+L+  ++   
Sbjct: 820 ANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALL 879

Query: 664 NVLPSIVESMNNTKDKPQDSEQSIEE-EITQQSLNPLSSELDETIHK 709
           N   S+ E++   + + ++  + + E E  +  L     EL E + +
Sbjct: 880 NERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQ 926



 Score = 31.2 bits (71), Expect = 3.0
 Identities = 26/154 (16%), Positives = 65/154 (42%), Gaps = 24/154 (15%)

Query: 575 NLQEQFSLLEKSKETWVSLKLNSAESQSS--------ASLQSQLNDALDNIKKYKEEHSK 626
            L+E    L++++E    L     E +          + L+ ++ +    +     E S+
Sbjct: 240 ELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISR 299

Query: 627 MTMEIESLKEKLKFSQ--------------SKENDLLEEYEKLKLVIKDYQNVLPSIVES 672
           +  + + L+E+L   +              SK ++L EE  +L+  +++ +  L S+   
Sbjct: 300 LEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAE 359

Query: 673 MNNTKDKPQDSEQSIEEEITQQSLNPLSSELDET 706
           +   + + ++ E  +EE   ++ L  L S++ + 
Sbjct: 360 LEELEAELEELESRLEE--LEEQLETLRSKVAQL 391


>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN.  All proteins in this
           family for which functions are known are ATP binding
           proteins involved in the initiation of recombination and
           recombinational repair [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 563

 Score = 45.9 bits (109), Expect = 8e-05
 Identities = 44/200 (22%), Positives = 89/200 (44%), Gaps = 23/200 (11%)

Query: 156 ELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFL-ES 214
           +L       QE  QR + L+ +L +L+   +   +      +AL  E + LS +E L E 
Sbjct: 176 QLKDRQQKEQELAQRLDFLQFQLEELEEADLQPGE-----DEALEAEQQRLSNLEKLREL 230

Query: 215 QLAAYSKSFTDDKLRDEIKLLHRDK-------STYENQLKEVMKKMLAEKVEVMQKCQDT 267
              A +    D  +  E  LL           S  +  L+E+ +++     EV +  ++ 
Sbjct: 231 SQNALAALRGDVDV-QEGSLLEGLGEAQLALASVIDGSLRELAEQVGNALTEVEEATREL 289

Query: 268 ERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTK----NKEFESTIDQLLNSNQEK 323
           +  L  +E +   L E+ +R   +++ + +KY   V +     ++ +  +DQL +S++  
Sbjct: 290 QNYLDELEFDPERLNEIEER-LAQIKRLKRKYGASVEEVLEYAEKIKEELDQLDDSDES- 347

Query: 324 SNTLEKLQYETLQAQKQLDA 343
              LE L+ E  + +++LD 
Sbjct: 348 ---LEALEEEVDKLEEELDK 364


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 557

 Score = 44.1 bits (105), Expect = 3e-04
 Identities = 44/196 (22%), Positives = 87/196 (44%), Gaps = 15/196 (7%)

Query: 156 ELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQ 215
           EL  L    +E  QR ++L+ +L +L+ +  +L+       + L +E + LS  E L   
Sbjct: 172 ELEDLQEKERERAQRADLLQFQLEELEEL--NLQ---PGEDEELEEERKRLSNSEKLAEA 226

Query: 216 LA-AYSKSFTDDKLRDEIKLLHR------DKSTYENQLKEVMKKMLAEKVEVMQKCQDT- 267
           +  A      +D     + LL R      D S Y+ +L E+  ++L E +  +++  +  
Sbjct: 227 IQNALELLSGEDDTVSALSLLGRALEALEDLSEYDGKLSEL-AELLEEALYELEEASEEL 285

Query: 268 ERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTL 327
              L  +E + + L+E+ +R    L+ + +KY   +    E+   I + L        +L
Sbjct: 286 RAYLDELEFDPNRLEEVEER-LFALKSLARKYGVTIEDLLEYLDKIKEELAQLDNSEESL 344

Query: 328 EKLQYETLQAQKQLDA 343
           E L+ E  + + +L  
Sbjct: 345 EALEKEVKKLKAELLE 360


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 42.7 bits (101), Expect = 3e-04
 Identities = 43/204 (21%), Positives = 81/204 (39%), Gaps = 16/204 (7%)

Query: 536 KEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKL 595
             I++   + D +   I +    L  AK  L    K +  L+ +   LE           
Sbjct: 10  LAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQV------SQ 63

Query: 596 NSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKL 655
             +E Q       +  + L  +K  +E    + +EI+  KE++   + +  +L+EE EKL
Sbjct: 64  LESEIQEIRERIKRAEEKLSAVKDERE-LRALNIEIQIAKERINSLEDELAELMEEIEKL 122

Query: 656 KLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEI--TQQSLNPLSSELDETIHKNVES 713
           +  I+D +       E +   +    ++E  +EEE+   ++    LSS+ +E   K    
Sbjct: 123 EKEIEDLK-------ERLERLEKNLAEAEARLEEEVAEIREEGQELSSKREELKEKLDPE 175

Query: 714 VTSNMENDKSNEDDNSSVQSTDEV 737
           + S  E  + N+     V     V
Sbjct: 176 LLSEYERIRKNKKGVGVVPLEGRV 199



 Score = 30.0 bits (68), Expect = 5.1
 Identities = 18/145 (12%), Positives = 60/145 (41%), Gaps = 9/145 (6%)

Query: 268 ERKLVNIENECSNLQELLDRNRDELEEI---IKKYQDQVTKNKEFESTIDQLLNSNQEK- 323
           E ++  I       +  L+     LE +   ++  ++QV++ +     I + +   +EK 
Sbjct: 23  EPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKL 82

Query: 324 -----SNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGL 378
                   L  L  E   A++++++   +   + ++++        +KE++  ++  L  
Sbjct: 83  SAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAE 142

Query: 379 ETKVDQNILVKYFDDVHQILTQFDE 403
                +  + +  ++  ++ ++ +E
Sbjct: 143 AEARLEEEVAEIREEGQELSSKREE 167


>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
           protein, PEP-CTERM locus subfamily.  Members of this
           protein family belong to the family of polysaccharide
           chain length determinant proteins (pfam02706). All are
           found in species that encode the PEP-CTERM/exosortase
           system predicted to act in protein sorting in a number
           of Gram-negative bacteria, and are found near the epsH
           homolog that is the putative exosortase gene [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 498

 Score = 40.4 bits (95), Expect = 0.004
 Identities = 42/194 (21%), Positives = 73/194 (37%), Gaps = 35/194 (18%)

Query: 155 EELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLES 214
           EEL      + EA+ + + LK +L   + +++     A+           L  R+E LE 
Sbjct: 211 EELEAARLELNEAIAQRDALKRQLGGEEPVLLAGSSVAN---------SELDGRIEALEK 261

Query: 215 QLAAYSKSFTDD-----KLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQ---- 265
           QL A    +TD        + EI  L  ++   E   K    +       V Q+ Q    
Sbjct: 262 QLDALRLRYTDKHPDVIATKREIAQL-EEQKEEEGSAKNGGPERGEIANPVYQQLQIELA 320

Query: 266 DTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSN 325
           + E ++ ++E   + L   ++R    L  I            E E+ + QL   N++   
Sbjct: 321 EAEAEIASLEARVAELTARIERLESLLRTI-----------PEVEAELTQL---NRDYEV 366

Query: 326 TLEKLQYETLQAQK 339
              K  YE L  ++
Sbjct: 367 --NKSNYEQLLTRR 378


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 40.8 bits (95), Expect = 0.004
 Identities = 100/487 (20%), Positives = 210/487 (43%), Gaps = 50/487 (10%)

Query: 226  DKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTER---------KLVNIEN 276
            +K++D+IK L +    +   L+E+ KK      E+  +  D E              IE 
Sbjct: 1114 NKIKDDIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKAISNDDPEEIEK 1173

Query: 277  ECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQ 336
            +  N+   +D+ ++  +EI KK  +++ + ++ +++++++   N      L KL  E + 
Sbjct: 1174 KIENIVTKIDKKKNIYDEI-KKLLNEIAEIEKDKTSLEEVKGINLSYGKNLGKLFLEKID 1232

Query: 337  AQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQ-----NILVKYF 391
             +K+     +K       ++ Y  +   IKEK   I+NE+G+E  +       NI     
Sbjct: 1233 EEKKKSEHMIK------AMEAYIEDLDEIKEKSPEIENEMGIEMDIKAEMETFNISHDDD 1286

Query: 392  DDVHQILTQFDEYFLRFNESE---SSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVN 448
             D H I  + DE      E         +   DI  + K+L++ L  AQ+  + +   +N
Sbjct: 1287 KDHHIISKKHDENISDIREKSLKIIEDFSEESDINDIKKELQKNLLDAQKHNSDINLYLN 1346

Query: 449  RNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKT 508
                     E +   ++ K+  ++  ++++K+ Y  +  EN K     N    L   EK 
Sbjct: 1347 ---------EIANIYNILKLNKIKKIIDEVKE-YTKEIEENNK-----NIKDELDKSEKL 1391

Query: 509  VAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVK 568
            +   + D+N+ +  ++  S  D   I++ I++     + I +E +       +A     +
Sbjct: 1392 IKKIKDDINLEECKSKIESTLDDKDIDECIKKIKELKNHILSEESNIDTYFKNAD----E 1447

Query: 569  NEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMT 628
            N + V+ L +   + +   +  + +K ++A +    ++ ++L + +D  K  K+E  K  
Sbjct: 1448 NNENVLLLFKNIEMADNKSQHILKIKKDNATNDHDFNI-NELKEHIDKSKGCKDEADKNA 1506

Query: 629  MEIESLKEKLKFSQSKENDLLEEYEKLKL------VIKDYQNVLPSIVESMNNTKDKPQD 682
              IE  KE  +  +    +LL +Y  L +        KD + ++  I ++      + + 
Sbjct: 1507 KAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTKKDSEIIIKEIKDAHKKFILEAEK 1566

Query: 683  SEQSIEE 689
            SEQ I+E
Sbjct: 1567 SEQKIKE 1573



 Score = 30.8 bits (69), Expect = 4.2
 Identities = 36/200 (18%), Positives = 92/200 (46%), Gaps = 12/200 (6%)

Query: 259 EVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLN 318
           ++++  ++ E++L N  N+ +  ++ L++ + ++ EI   Y DQ+  +   +    Q  +
Sbjct: 755 KILEDFKNKEKELSNKINDYAKEKDELNKYKSKISEIKNHYNDQINIDNIKDEDAKQNYD 814

Query: 319 SNQEKSNTLEKLQYETLQAQKQL----DAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKN 374
            ++E   T+   + E  +   ++    D F  K +   +   N         E+   + N
Sbjct: 815 KSKEYIKTISIKEDEIFKIINEMKFMKDDFLNKVDKFINFENNCKEKIDSEHEQFAELTN 874

Query: 375 ELGLETKVDQ-NILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIE---KVTKDLR 430
           ++  E   D+ N   K F+D   ++ + ++      E E   +N+++ ++   K+ ++ +
Sbjct: 875 KIKAEISDDKLNDYEKKFNDSKSLINEINKSI----EEEYQNINTLKKVDEYIKICENTK 930

Query: 431 EQLHTAQEEYTSLQEIVNRN 450
           E +     +   L+EI+N+N
Sbjct: 931 ESIEKFHNKQNILKEILNKN 950


>gnl|CDD|178515 PLN02927, PLN02927, antheraxanthin epoxidase/zeaxanthin epoxidase.
          Length = 668

 Score = 40.1 bits (93), Expect = 0.005
 Identities = 22/71 (30%), Positives = 38/71 (53%), Gaps = 6/71 (8%)

Query: 56  LSRNHALIWYYNGKFYLQDTKSSNGTFV-NNQRLSKTNEESAPHELCSGDVVQFGVDV-- 112
           +S+ HA + Y +G F+L D +S +GT+V +N+        + P    S D+++FG D   
Sbjct: 581 VSKMHARVIYKDGAFFLMDLRSEHGTYVTDNEGRRYRATPNFPARFRSSDIIEFGSDKKA 640

Query: 113 ---IETNRNTP 120
              ++  R TP
Sbjct: 641 AFRVKVIRKTP 651


>gnl|CDD|225987 COG3456, COG3456, Predicted component of the type VI protein
           secretion system, contains a FHA domain [Intracellular
           trafficking, secretion, and vesicular    transport;
           Signal transduction mechanisms].
          Length = 430

 Score = 39.8 bits (93), Expect = 0.005
 Identities = 21/59 (35%), Positives = 27/59 (45%), Gaps = 4/59 (6%)

Query: 56  LSRNHALIWYYNGKFYLQDTKSSNGTFVNNQRLSKTNEESAPHELCSGDVVQFGVDVIE 114
           +S+ H  I Y +G F L DT S+ G  V N       E SA   L  GD +  G  +I 
Sbjct: 47  VSKQHCTISYRDGGFCLTDT-SNGGLLV-NGSDLPLGEGSAR--LQQGDEILIGRYIIR 101


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 39.7 bits (93), Expect = 0.007
 Identities = 111/560 (19%), Positives = 228/560 (40%), Gaps = 71/560 (12%)

Query: 155 EELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLES 214
           +EL ++   I E       L+ +L KL+  V +L++  +   +   + + L      LE 
Sbjct: 200 KELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEE 259

Query: 215 QLAAYSKSFTDDKLR--------DEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQD 266
           ++    +   + K           E+K L      Y  +L E  ++ L E  E+ ++   
Sbjct: 260 KIRELEERIEELKKEIEELEEKVKELKELKEKAEEYI-KLSEFYEEYLDELREIEKRLSR 318

Query: 267 TERKLVNIE---NECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLN----- 318
            E ++  IE    E    +E L+  + +L+E+ K+ ++   +++ +E             
Sbjct: 319 LEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYE-EAKAKKEELERL 377

Query: 319 SNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELG- 377
             +    T EKL+ E  + +K  +    + + +  ++         IKE    I  EL  
Sbjct: 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELK---KEIKELKKAI-EELKK 433

Query: 378 -----------LETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKV- 425
                      L  +  + +L +Y  ++ +I  +  E      E E      + ++EKV 
Sbjct: 434 AKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKE----IEEKERKLRKELRELEKVL 489

Query: 426 --------TKDLREQLHTAQEEYTS--LQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQL 475
                    K+L EQL   +E+     L+E+  + EE     E+   L   +I  L+ +L
Sbjct: 490 KKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLK-GEIKSLKKEL 548

Query: 476 EKLKQTYNSDCYENEKFGKAYNCDTLLINE-EKTVAPNQKDVN--IIDVSNQYNSEKDFV 532
           EKL++         +K  +       L+ E E+    + +++   + ++   YN   +  
Sbjct: 549 EKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELK 608

Query: 533 SINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFS----------L 582
              KE+E   +++  +  E+ K   EL + +  L +  K +  L++++S           
Sbjct: 609 DAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEY 668

Query: 583 LEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQ 642
           LE S+E    L    AE +     + ++   L+ +K+  EE  K   E+E L++ L+  +
Sbjct: 669 LELSRE----LAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVE 724

Query: 643 SKENDLLEEYEKLKLVIKDY 662
               +L E+ +K K ++K+ 
Sbjct: 725 ----ELREKVKKYKALLKER 740



 Score = 33.1 bits (76), Expect = 0.85
 Identities = 75/430 (17%), Positives = 172/430 (40%), Gaps = 25/430 (5%)

Query: 244 NQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQV 303
            +  E ++K +     + +  ++ E++L  +  E + +   L   R+ELE++ K+ ++  
Sbjct: 175 KRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELE 234

Query: 304 TKNKEFESTIDQLLNSNQEKSNTLEKL-QYETLQAQKQLDAFTLKDNLMH-DKLKNYYMN 361
              +E E    +L +    K    EK+ + E    + + +   L++ +    +LK     
Sbjct: 235 ELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEE 294

Query: 362 FHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIED 421
           +  + E      +EL    K     L +  ++++ I  +  E      E +   L  ++ 
Sbjct: 295 YIKLSEFYEEYLDELREIEKR----LSRLEEEINGIEERIKEL-----EEKEERLEELKK 345

Query: 422 IEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQT 481
             K  +   E+L    E Y   +      E      ++   L  +K+     +LEK K+ 
Sbjct: 346 KLKELEKRLEELEERHELYEEAKAKKEELER---LKKRLTGLTPEKLEKELEELEKAKEE 402

Query: 482 YNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEES 541
              +   ++   +       +   +K +   +K      V  +  +E+    +   +EE 
Sbjct: 403 IEEE--ISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKEL---LEEY 457

Query: 542 SRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQ 601
           + ++  I  E+ +   +    +  L + EK V+  + +   L++  E     +L   E +
Sbjct: 458 TAELKRIEKELKEIEEKERKLRKELRELEK-VLKKESELIKLKELAE-----QLKELEEK 511

Query: 602 SSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKD 661
                  +L    +  +K KE+  K+  EI+SLK++L+  +  +  L E  +KL  + ++
Sbjct: 512 LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEE 571

Query: 662 YQNVLPSIVE 671
              +L  + E
Sbjct: 572 LAELLKELEE 581


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 39.2 bits (91), Expect = 0.010
 Identities = 78/448 (17%), Positives = 168/448 (37%), Gaps = 36/448 (8%)

Query: 227 KLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVM---QKCQDTERKLVNIENECSNLQ- 282
           K  +EI  L     T E     + K++ AE+  +     KC    ++L   + E   LQ 
Sbjct: 249 KAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAAQELSRKKTELLGLQT 308

Query: 283 --ELLDRNRDELEEIIKKYQDQVT----KNKEFESTIDQLLNSNQEKSNTLEKLQYETLQ 336
             E L     ++ + + K ++ +T    +    ++ +D L    + K NTL K       
Sbjct: 309 ELETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTASLQA 368

Query: 337 AQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETK--VDQNILVKYFDDV 394
           AQ++   +  +   M D+ +       ++++K+ N++     + +   ++   ++     
Sbjct: 369 AQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQTD 428

Query: 395 HQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQL 454
               T  ++      E E       E  ++  +  +E+  T ++E+  L+E V   + +L
Sbjct: 429 TNTDTALEKLEKALAEKERIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQLKL 488

Query: 455 AFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQK 514
           +  E+   L L K    +L   +LKQ  + +    E        + L    ++  A  + 
Sbjct: 489 S--ERELQLELLKEEVSKLASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLRANPES 546

Query: 515 DVNIIDVSNQYNSEKDFVSINKEIEESSRKI--------DDINAEITKCTNELVDAK--- 563
                 V    +  +   +  +   +             DD   E  +   EL  A+   
Sbjct: 547 ADRGSAVDAGTSRSRADSAGARNEVDRLLDRLEKAEQERDDTEMEAGRLAKELEKAQRHL 606

Query: 564 --------NCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALD 615
                      ++ E+    L E+   LEKS+    +L+ ++   Q   +  +  N    
Sbjct: 607 TKQQEKTEATRIEFERKSAELLEEAERLEKSEAEEETLRQST---QIGHAQAAAHNHIEH 663

Query: 616 NIKKYKEEHSKMTMEIESLKEKLKFSQS 643
           +++K + +  ++  E E L  +L+ SQ 
Sbjct: 664 HVQKLESDLKQLRAEREQLVAQLEKSQQ 691


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 39.0 bits (91), Expect = 0.014
 Identities = 88/523 (16%), Positives = 194/523 (37%), Gaps = 81/523 (15%)

Query: 200  IQEDRLLSRVEFLESQLAAYSKSFTDDKLRDEI------KLLHRDKSTYENQLKEVMK-- 251
             +E  + ++++ + ++L+  +K   +D+  DEI      + L   ++++  +  E +   
Sbjct: 1951 NKESAISNKIDNVSNKLSELNKITCNDESYDEILEKEEYEELKDLRNSFNQEKAETLNNL 2010

Query: 252  --KMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEF 309
                + E     +   D   K V       N++++++  +  ++ I    +D   + +  
Sbjct: 2011 KLNKIKEDFNSYKNLLDELEKSVKTLKASENIKKIVENKKTSIDAINTNIEDIEKEIESI 2070

Query: 310  ESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYM-NFHIIKEK 368
              ++D+LL    +   +      + +Q +   D+  + D     ++   Y+ N +   +K
Sbjct: 2071 NPSLDELLKKGHKIEISRYTSIIDNVQTKISNDSKNINDIEKKAQIYLAYIKNNYNSIKK 2130

Query: 369  MTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKD 428
              +  NE   E +V            + ILT  D+     +E   +  NS E IE + K 
Sbjct: 2131 DISTLNEYFDEKQVS-----------NYILTNIDKANKLSSELSEAVTNSEEIIENIKK- 2178

Query: 429  LREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYE 488
                            EI+  NE               ++  LE   +KLK+ Y +    
Sbjct: 2179 ----------------EIIEINEN-------------TEMNTLENTADKLKELYEN---- 2205

Query: 489  NEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDI 548
                         L  ++  +    K +N I +    NS + +  I+K        ++  
Sbjct: 2206 -------------LKKKKNIINNIYKKINFIKLQEIENSSEKYNDISKLFNNV---VETQ 2249

Query: 549  NAEITKCTNELVDAKNCLVKNEKWVINLQEQFSL--LEKSKETWVSLKLNSAESQSSASL 606
              ++    N++ + K+ +   EK +IN+   F+L  ++   E +  +K N  +      L
Sbjct: 2250 KKKLLDNKNKINNIKDKINDKEKELINVDSSFTLESIKTFNEIYDDIKSNIGDLY---KL 2306

Query: 607  QSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVL 666
            +   ND L  +K Y E  + +   I +L   L   Q +     +  E       +  + +
Sbjct: 2307 EDTNNDELKKVKLYIENITHLLNRINTLINDLDNYQDENYGKDKNIELN----NENNSYI 2362

Query: 667  PSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHK 709
                E +NN K++     ++I+   T  + N +   +      
Sbjct: 2363 IKTKEKINNLKEEFSKLLKNIKRNNTLCNNNNIKDFISNIGKS 2405



 Score = 34.4 bits (79), Expect = 0.39
 Identities = 89/519 (17%), Positives = 200/519 (38%), Gaps = 58/519 (11%)

Query: 250 MKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEF 309
           +KK   EK+  ++K  +  + L++I     +L      + D  E+      +   K  E 
Sbjct: 444 IKKKYDEKINELKKSINQLKTLISIMKSFYDLIISEKDSMDSKEKKESSDSNYQEKVDEL 503

Query: 310 ESTIDQLLNSNQEKSN----------TLEKLQYETLQAQKQLDAF--TLKDNLMHDKLKN 357
              I+ +   N   +N          T+E L+ E     + +  +  +++  +  +KLK 
Sbjct: 504 LQIINSIKEKNNIVNNNFKNIEDYYITIEGLKNEIEGLIELIKYYLQSIETLIKDEKLKR 563

Query: 358 YYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLN 417
              N   IK K+  I+  +     +   I +   D++  I+ Q +E         +  L 
Sbjct: 564 SMKN--DIKNKIKYIEENV---DHIKDIISLN--DEIDNIIQQIEELI-------NEALF 609

Query: 418 SIEDIEKVTKDLREQLHTAQEEY--TSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQL 475
           + E       DL+E++     ++    LQE+++   E   FL+  KYL+ +     +LQ 
Sbjct: 610 NKEKFINEKNDLQEKVKYILNKFYKGDLQELLD---ELSHFLDDHKYLYHEAKSKEDLQ- 665

Query: 476 EKLKQTYNSDCYENEKFGKAYNCDTL-LINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSI 534
             L  T  ++  + E        + +  + +E     + K+ NII         K   +I
Sbjct: 666 -TLLNTSKNEYEKLEFMKSDNIDNIIKNLKKELQNLLSLKE-NIIK--------KQLNNI 715

Query: 535 NKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLK 594
            ++I  S  +      ++     E  + +  L   +  +IN + +F L     +  +   
Sbjct: 716 EQDISNSLNQYTIKYNDLKSSIEEYKEEEEKLEVYKHQIINRKNEFILHLYENDKDLPDG 775

Query: 595 LNSAES---------QSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKE 645
            N+ E               + + +N   +N K  ++  +   + I+ L+   + +  + 
Sbjct: 776 KNTYEEFLQYKDTILNKENKISNDINILKENKKNNQDLLNSYNILIQKLEAHTEKNDEEL 835

Query: 646 NDLLE--EYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSEL 703
             LL+    E   L +K+ +       + ++N     ++  ++I      ++LN ++   
Sbjct: 836 KQLLQKFPTEDENLNLKELEKEFNENNQIVDNIIKDIENMNKNIN---IIKTLN-IAINR 891

Query: 704 DETIHKNVESVTSNMENDKSNEDDNSSVQSTDEVKDTEE 742
             +  + VE + +N  + K+  + +  + +TD +    E
Sbjct: 892 SNSNKQLVEHLLNNKIDLKNKLEQHMKIINTDNIIQKNE 930


>gnl|CDD|224428 COG1511, COG1511, Predicted membrane protein [Function unknown].
          Length = 780

 Score = 38.6 bits (90), Expect = 0.015
 Identities = 51/413 (12%), Positives = 124/413 (30%), Gaps = 23/413 (5%)

Query: 308 EFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKE 367
           + +   D+  N N++ S+ L  L   +      L+A T     + D L         +  
Sbjct: 189 KLKDGTDEASNGNKKLSDLLNTLNNSSATFSDGLNALTSGLTTLTDGLNQLDSGLGTLAA 248

Query: 368 KMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTK 427
            +  +K       ++++ I    F      L    +         + G++++     +  
Sbjct: 249 GIGELKQGAE---QLNEGIGE--FSSGLSELNSGVQDLAAGVPQLNQGISALAAGLSLPD 303

Query: 428 DLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQ-SKYLHLDKIGGLELQLEKLKQTYNSDC 486
            L +Q  + QE  T + + + +         Q S       +   +      +       
Sbjct: 304 SLGDQFSSLQEALTQIAQGLKQKTSSSLEAAQGSLSSLQSMLALSKSLDLTAEGATVDAL 363

Query: 487 YENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNS--EKDFVSINKEIEESSRK 544
              +          L  +++     ++     ID  ++     E+    + K + +    
Sbjct: 364 GAPDGVQ------WLDESQKTLATLSELLSTGIDGVSEGLDALEQASAQLAKSLAKLKTA 417

Query: 545 IDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSA 604
           +  I A I +      +    L    K +  L  Q +         +   L+ A     +
Sbjct: 418 VAQIAASIAQLLPGASEVLKTL--KSKGLDKLLNQLNGALAKGSNALVQGLSDANDSFRS 475

Query: 605 SLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQN 664
              +QL   L+ +     + S +   +  L +  K      ++L              ++
Sbjct: 476 ITSAQLKAGLNTLADGSNDLSSLGPGLGQLADGSKLLADGLSEL-------NTGSAQLRD 528

Query: 665 VLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVESVTSN 717
            L  + + +    D  QD+   +             +   E   K+++ V + 
Sbjct: 529 GLGELSDGLTELADSLQDAADQLSLANDSDKQASFIANPVELKEKDIDPVPNY 581


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 37.6 bits (87), Expect = 0.030
 Identities = 97/519 (18%), Positives = 185/519 (35%), Gaps = 53/519 (10%)

Query: 168 LQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSF---- 223
           +Q EN L  K  + +   I+L++ A    +   +E      +E     L    K      
Sbjct: 71  IQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKAENEA 130

Query: 224 ---------TDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNI 274
                      DKL  E   L  +K     + KE + ++  +  E+  + Q+ + +L  +
Sbjct: 131 AEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELKLL 190

Query: 275 ENECSNLQELLDRNRDELEEIIKKYQ----------DQVTKNKEFESTIDQLLNSNQ-EK 323
           E+E   L+E L+  + EL E  KK Q          D   K K  E  + +     +  K
Sbjct: 191 ESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVK 250

Query: 324 SNTLEKLQYETLQAQKQ-------------LDAFTLKDNLMHDKLKNYYMN-----FHII 365
           S   + LQ   L+ +                D   LK+ L   + +             +
Sbjct: 251 SMKEQLLQIPELERELAALREENRKLRSMKEDNELLKEELEDLQSRLERFEKMREKLADL 310

Query: 366 KEKMTNIKNELGLETKVDQNIL--VKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIE 423
           + +   ++NEL     + Q+I   ++  DD+ + +       L+  E   S  +S + +E
Sbjct: 311 ELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSAKQLE 370

Query: 424 KVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYN 483
              + L+ +   A  E   L++ +   +  +  L++   L   +  GL   L    +   
Sbjct: 371 TTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNSYDKELT 430

Query: 484 SDCYENEKFGKAYNCDTLL--INEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEES 541
                 +   +    + L+  +        NQ      DV  Q +      ++  EI+  
Sbjct: 431 ETSVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVGQQKDRNN---TLETEIKLL 487

Query: 542 SRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQ 601
             ++      ++         +  +   E+    L+++ SLLE   E    L L    + 
Sbjct: 488 KEQLVSNERLLSFVKEATNALRLKIETLERERDRLRQEKSLLEMKLE---HLCLQGDYNA 544

Query: 602 SSAS-LQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLK 639
           S    L   LN A +  +  K+    +  E E LKE+L+
Sbjct: 545 SRTKVLHMSLNPASEAEQIAKQTIEALQAECEKLKERLQ 583



 Score = 34.9 bits (80), Expect = 0.21
 Identities = 85/468 (18%), Positives = 177/468 (37%), Gaps = 71/468 (15%)

Query: 258 VEVMQKCQDTER-----KLVNIENECSNLQELLDRNRDELEEII----KKYQDQVTKNKE 308
            E M   Q  E      KL+ +ENE    +    R + ELE       + Y+ ++ +N E
Sbjct: 52  EESMTLLQRAELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLE 111

Query: 309 FESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNL-MHDKLKNYYMNFHIIKE 367
            E  +  L    ++  N   + + E    + +LDA +LK      D+LK    +   IK 
Sbjct: 112 LEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKN 171

Query: 368 KMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESES--SGLNSIEDIEKV 425
            ++ ++         +  +L    +++ +   Q +E      E+E     L S +     
Sbjct: 172 DLSEMQCR-AQNADTELKLLESELEELRE---QLEECQKELAEAEKKLQSLTSEQASSAD 227

Query: 426 TKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYL---------HLDKIGGLELQLE 476
                + L    + Y    E+V   +EQL  + + +            L  +      L+
Sbjct: 228 NSVKIKHLEEELKRYEQDAEVVKSMKEQLLQIPELERELAALREENRKLRSMKEDNELLK 287

Query: 477 KLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNII-----------DVSNQY 525
           +  +   S     EK  +      L + +EK     +   +++           D+S + 
Sbjct: 288 EELEDLQSRLERFEKMREKLA--DLELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRI 345

Query: 526 ----NSEKDFVSINKEIEESSRK----IDDINAEITKCTNELVDAKNCLVKNEKWVINLQ 577
               N E      N  I  S+++    +  +  E  K  +E+++ K  L   +  V  LQ
Sbjct: 346 VVLQNEELQLKEKNGSISSSAKQLETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQ 405

Query: 578 EQFSLLEKSKETWVSL--KLNSAESQSSASLQ-----------------------SQLND 612
            + +L+ K ++   ++    +   +++S S Q                       +QL++
Sbjct: 406 RRLTLVTKERDGLRAILNSYDKELTETSVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSE 465

Query: 613 ALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIK 660
             +++ + K+ ++ +  EI+ LKE+L  ++   + + E    L+L I+
Sbjct: 466 LEEDVGQQKDRNNTLETEIKLLKEQLVSNERLLSFVKEATNALRLKIE 513


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 37.8 bits (88), Expect = 0.031
 Identities = 102/606 (16%), Positives = 218/606 (35%), Gaps = 70/606 (11%)

Query: 165  QEALQRENMLKNKLSKLQHIVIDLKKAAD--------VSWKALIQEDRLLSRVEFLESQL 216
            +  ++  + LK    +L+  +I+LKK+             + + +  +LL+ ++  E   
Sbjct: 888  KIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPS 947

Query: 217  AAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIEN 276
              Y K    +KL +    L      YE+ LK+    ++ E  +   + ++ +++L  +  
Sbjct: 948  IEYVKLPELNKLHEVESKLKETSEEYEDLLKK-STILVREGNKANSELKNFKKELAELSK 1006

Query: 277  ECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQ 336
            +   LQE   + ++   E+ +      +K    EST   +L   Q+    L  L+   LQ
Sbjct: 1007 QYGALQESTKQLKELPVEVAELQSA--SKIISSESTELSILKPLQKLKGLLL-LENNQLQ 1063

Query: 337  AQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELG-----------LETKVDQN 385
            A+        ++N + D  + Y +       K  N+K+              L+  V Q 
Sbjct: 1064 AR-YKALKLRRENSLLDDKQLYQLESTENLLKTINVKDLEVTNRNLVKPANVLQFIVAQM 1122

Query: 386  ILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQE 445
            I +    ++ + L+Q     L     + S L    D      +L            S + 
Sbjct: 1123 IKLNLLQEISKFLSQ-LVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKR 1181

Query: 446  IVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINE 505
            +             S  ++      L+ +L  L     S  +      K    +  +  E
Sbjct: 1182 LYQSALYDEKSKLSSSEVND-----LKNELIALFSKIFSG-WPRGDKLKKLISEGWVPTE 1235

Query: 506  EKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNC 565
              T      ++N    +    S +  +S+   I+ +      +  E+             
Sbjct: 1236 YSTSLKGFNNLNKKFDTPASMSNEKLLSLLNSID-NLLSSYKLEEEVL----------PA 1284

Query: 566  LVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHS 625
             + +    IN    F+ L ++K + +  K  +  +          N    +    + E S
Sbjct: 1285 TINSLLQYIN-VGLFNAL-RTKASSLRWKSATEVNY---------NSEELDDWCREFEIS 1333

Query: 626  KMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQ 685
             +  E+E L + +K  Q  ++DL     KL  ++    ++ P+ ++++ + +  P D E 
Sbjct: 1334 DVDEELEELIQAVKVLQLLKDDL----NKLDELLDACYSLNPAEIQNLKS-RYDPADKEN 1388

Query: 686  SIEEEI--------TQQSLNPLSSELDETIHKNVESVTSNMENDKSNEDDNSSVQSTDEV 737
            ++ +EI         +Q L       DET   ++  + S  ++  S   D +S+   +EV
Sbjct: 1389 NLPKEILKKIEALLIKQELQLSLEGKDET-EVHLSEIFSEEKSLIS--LDRNSIY-KEEV 1444

Query: 738  KDTEEG 743
              +   
Sbjct: 1445 LSSLSA 1450



 Score = 30.4 bits (69), Expect = 5.8
 Identities = 54/317 (17%), Positives = 108/317 (34%), Gaps = 48/317 (15%)

Query: 155  EELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLES 214
            +E      CI + LQ+    + KL + + +   LK         LIQ+     R    + 
Sbjct: 810  KEYRSYLACIIK-LQKTIKREKKLRETEEVEFSLKAEV------LIQK---FGRSLKAKK 859

Query: 215  QLAAYSKSFTDDKLRDEIKLLHRDKSTYE------NQLKEVMKKMLAEKVEVMQKCQDTE 268
            + +   K     +    ++L  R     +      + LK V  ++ +E +E+ +      
Sbjct: 860  RFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDL 919

Query: 269  RKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNK--EFESTIDQLLNSNQEKSNT 326
             + +  + E     + L  N D  E    +Y      NK  E ES + +   +++E  + 
Sbjct: 920  IENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKE---TSEEYEDL 976

Query: 327  LEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNI 386
            L+K      +  K           + +  K  Y       +++           K     
Sbjct: 977  LKKSTILVREGNKANSELKNFKKELAELSKQ-YGALQESTKQL-----------KELPVE 1024

Query: 387  LVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEI 446
            + +       I             SES+ L+ ++ ++K+   L  + +  Q  Y +L+  
Sbjct: 1025 VAELQSASKII------------SSESTELSILKPLQKLKGLLLLENNQLQARYKALK-- 1070

Query: 447  VNRNEEQLAFLEQSKYL 463
              R E  L   +Q   L
Sbjct: 1071 -LRRENSLLDDKQLYQL 1086


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 36.6 bits (85), Expect = 0.065
 Identities = 38/177 (21%), Positives = 77/177 (43%), Gaps = 21/177 (11%)

Query: 167 ALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTDD 226
           A +     K KL K++  + +L    DV        +R+ + V        A SK   ++
Sbjct: 350 AYKAYLAYKPKLEKVERKLPELGIWKDV--------ERIKALVIRGYPLAEALSKVKEEE 401

Query: 227 KLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQ-KCQDTERKLVNIENECSNLQELL 285
           + R++      ++       K + K  L E VE ++ +  + +R+L  ++ E   L    
Sbjct: 402 RPREKEGTEEEERREITVYEKRIKK--LEETVERLEEENSELKRELEELKREIEKL---- 455

Query: 286 DRNRDELEEIIKKYQDQVTKNKEFE---STIDQLLNSNQEKSNTLEKLQYETLQAQK 339
                ELE   ++ +D+V K++E       I++L    +EK   +E+L+ +  + +K
Sbjct: 456 ---ESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRK 509


>gnl|CDD|225437 COG2882, FliJ, Flagellar biosynthesis chaperone [Cell motility and
           secretion / Intracellular trafficking and secretion /
           Posttranslational modification, protein turnover,
           chaperones].
          Length = 148

 Score = 34.6 bits (80), Expect = 0.074
 Identities = 29/134 (21%), Positives = 53/134 (39%), Gaps = 15/134 (11%)

Query: 221 KSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSN 280
               +++   E+  +  +K   E QLK     +   + E  Q   +   K      +  N
Sbjct: 14  AKKEEEEAAIELSKIRSEKENAEEQLKM----LSGYRNEYEQNLNEK-LKSGVSAAQWQN 68

Query: 281 LQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQ 340
            Q+ + +    LE  I + Q Q++K       ++Q     QEK   L+    E L+ ++Q
Sbjct: 69  YQQFISQ----LEVAIDQQQSQLSK---LRKQVEQKREIWQEKQIELKA--LEKLK-ERQ 118

Query: 341 LDAFTLKDNLMHDK 354
              F L++N    K
Sbjct: 119 KTEFLLEENRREQK 132


>gnl|CDD|219890 pfam08537, NBP1, Fungal Nap binding protein NBP1.  NBP1 is a
           nuclear protein which has been shown in Saccharomyces
           cerevisiae to be essential for the G2/M transition of
           the cell cycle.
          Length = 313

 Score = 35.2 bits (81), Expect = 0.12
 Identities = 27/123 (21%), Positives = 52/123 (42%), Gaps = 22/123 (17%)

Query: 617 IKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNT 676
           I K +E+ +++  E++S ++KL+F + K N LLE        + D  N+    V+S  + 
Sbjct: 175 IDKLEEKLAELEQELQSTQKKLQFVKEK-NHLLES-------LLDDANIDDEYVKSRRDI 226

Query: 677 KDKPQDSEQSIEEEIT---QQSLNPLSSELDETIHKNVESVTSNMENDKSNEDDNSSVQS 733
           K+   D+ +    +++    + +NPL             S      N +  E     +Q 
Sbjct: 227 KNLQNDNLKPELRDLSPSPVRPVNPL-----------FTSSPIRKSNRQGRERPLQPLQK 275

Query: 734 TDE 736
            D 
Sbjct: 276 GDN 278


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 35.5 bits (82), Expect = 0.17
 Identities = 83/483 (17%), Positives = 175/483 (36%), Gaps = 56/483 (11%)

Query: 166 EALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTD 225
           EA++  + L ++L +   +V +L+        A  + ++L   +E LE  LAA  +++  
Sbjct: 534 EAVESADQLADRLLREAQLVGELQSLRQQEEAARRRLEQLEKELEVLELALAALREAWQA 593

Query: 226 -------DKLRDEIKLLHRDKSTYENQLKEVMKK--MLAEKVEVMQKCQDTERKL---VN 273
                       E++    +++T   Q++   K    L   ++   + +   R     V 
Sbjct: 594 QWAAAGLPLTPAEMEDWLAERATAREQVRAYFKARAELDALLDRRARLRAALRAALKAVA 653

Query: 274 IENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYE 333
           I      L ELL+  R  LEE  K    Q  +    +  +         +   LE+ +  
Sbjct: 654 IVLPGEELAELLELARQLLEEAEK----QAARKASLDERL------RDAERA-LEEAEER 702

Query: 334 TLQAQKQLDAF--TLKDNLMHDKL--KNYYMNFHIIKEKMTNIKNELGLETKVDQNI--L 387
             +AQ  L+A+     D L+   L  +          E + NIK +L     + Q I  +
Sbjct: 703 HDEAQSALEAWQEQWYDALLEAGLGGRASPAGALDALELLQNIKEKLQAADDLRQRIAAM 762

Query: 388 VKYFDDVHQILTQF-DEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEI 446
            +      + +    +        + +         ++  + L+++L  A++        
Sbjct: 763 ERDLARFEEEVEALAEAVAPEMLGTPA---------DETARALKQRLKRARDT----AAA 809

Query: 447 VNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEE 506
             +  E++   E+        +   E +L  L +       E E        DT     +
Sbjct: 810 AEKLAEEIEEAEKEVSEAAAALDEAEARLTALLRAARCTTIE-ELLAAVERSDTYRELRK 868

Query: 507 KTVAPNQKDVNIIDV-----SNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVD 561
           +  A  +  V               +  D   +   +EE +R I+++  E+ +   E+  
Sbjct: 869 RIAALERTLVRAGGGLSLEALVAEAAALDPDELPARLEELARDIEELEEELNELAQEVGA 928

Query: 562 AKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYK 621
           AK  L +     ++     + LE  +E+ ++   + AE     +L S+L      I++Y+
Sbjct: 929 AKQELAR-----MDGGSTAAELEAERESLLAQLRDLAERYLELALASRL--LRKAIERYR 981

Query: 622 EEH 624
           E+ 
Sbjct: 982 EQR 984


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 35.4 bits (82), Expect = 0.17
 Identities = 43/294 (14%), Positives = 102/294 (34%), Gaps = 8/294 (2%)

Query: 538  IEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKE--TWVSLKL 595
            ++E  +       +I K  +E + A+      E    + ++    L  +      +S+ +
Sbjct: 1037 VKELKKLGYVRFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPI 1096

Query: 596  NSAESQSSASLQSQLNDALDNIKKYKE-EHSKMTME-IESLKEKLKFSQSKENDLLEEYE 653
             S   +    L ++L      ++K K      M +E ++  +E L+  +  E   + + +
Sbjct: 1097 WSLTKEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQ 1156

Query: 654  KLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVES 713
            +LK   K   + L          K K   +++S +  +   S    S E  +   K    
Sbjct: 1157 RLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNK 1216

Query: 714  VTSNMENDKSNEDDNSSVQSTDEVKDTEEGPHTPSLVDKSEDTISVTTLVNVNTESSEGD 773
             +++  +D+ ++++  +      VK  +   +  S    SED    ++         +  
Sbjct: 1217 KSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSK--SSEDNDEFSSDDLSKEGKPKNA 1274

Query: 774  LSPIDPPESVIALINDDDTSDSSSEMTDSSTLIRIRKPKPVSVPTDASLVKPQS 827
               +   +      +     D  S      +    +K K     + A+L K + 
Sbjct: 1275 PKRVSAVQYSPPPPS--KRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKK 1326


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 35.0 bits (81), Expect = 0.18
 Identities = 47/209 (22%), Positives = 88/209 (42%), Gaps = 29/209 (13%)

Query: 175 KNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTDDKLRDEIKL 234
              +++ Q+   +L + A  + KA  + + L   +  L   +   S +    KL      
Sbjct: 212 GENIARKQNKYDELVEEAK-TIKA--EIEELTDELLNLVMDIEDPSAALN--KLNTAAAK 266

Query: 235 LHRDKSTYENQLKEVMKKMLAEKVEV---MQKCQDTERKLVNIENECSNLQ---ELLDRN 288
           +   KS  E   K +  KM  +        Q+  +   ++  I+++   LQ   E LD  
Sbjct: 267 I---KSKIEQFQKVI--KMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTA 321

Query: 289 RDELEEIIKKYQDQVTKNKEFESTIDQ-------LLNSNQEKSNTLEKLQYETLQAQKQL 341
            DELEEI+ ++ +Q  K  E ++ I         L++  ++    +E+LQ E +   ++L
Sbjct: 322 IDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEEL 381

Query: 342 DAFTLKDNLMHDKLKNYYMNFHIIKEKMT 370
               L+D L  DK+        ++KEK  
Sbjct: 382 A--KLQDEL--DKIVKTKSE--LVKEKYH 404



 Score = 32.3 bits (74), Expect = 1.3
 Identities = 35/245 (14%), Positives = 95/245 (38%), Gaps = 29/245 (11%)

Query: 250 MKKMLAEKV-EVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKE 308
           M K+  +K+ E+ Q+ Q  + K+ +I+ +     + ++  R +  E I + Q++  +  E
Sbjct: 168 MDKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVE 227

Query: 309 FESTIDQLLNSNQEK-----------SNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKN 357
              TI   +    ++           S  L KL     + + +++ F         K+  
Sbjct: 228 EAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQ--------KVIK 279

Query: 358 YYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLN 417
            Y    +       I            + + K  D + ++    ++     +E     ++
Sbjct: 280 MYEKGGVCPTCTQQISEG--------PDRITKIKDKLKELQHSLEKLDTAIDE-LEEIMD 330

Query: 418 SIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEK 477
              +  K   +L+ ++ T ++   +L +   + +  +  L+     + +++  L+ +L+K
Sbjct: 331 EFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDK 390

Query: 478 LKQTY 482
           + +T 
Sbjct: 391 IVKTK 395


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 35.3 bits (82), Expect = 0.19
 Identities = 35/199 (17%), Positives = 74/199 (37%), Gaps = 18/199 (9%)

Query: 405 FLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNE------EQLAFLE 458
                E        +E IEK  K+L E++   + E   L++ + R E        L+ L 
Sbjct: 81  VKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLSLLL 140

Query: 459 QSKYLHLDKIGGLEL-QLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVN 517
             KY+ +  +G +   +LE+LK   + +  E     K Y    +++     +     +V 
Sbjct: 141 GFKYVSV-FVGTVPEDKLEELKLESDVENVEYISTDKGY--VYVVVV---VLKELSDEVE 194

Query: 518 II--DVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVIN 575
                +  +    ++  + ++ I E   ++++I  E      EL +      K  + ++ 
Sbjct: 195 EELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAK---KYLEELLA 251

Query: 576 LQEQFSLLEKSKETWVSLK 594
           L E   +  +  E      
Sbjct: 252 LYEYLEIELERAEALSKFL 270



 Score = 33.7 bits (78), Expect = 0.52
 Identities = 14/56 (25%), Positives = 26/56 (46%)

Query: 617 IKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVES 672
           I++ KEE  ++  E ESL E+LK    K  + L    +   +  +    L   +++
Sbjct: 217 IREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEALSKFLKT 272



 Score = 30.7 bits (70), Expect = 4.0
 Identities = 29/181 (16%), Positives = 75/181 (41%), Gaps = 27/181 (14%)

Query: 164 IQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSF 223
           I+E  +  + L+N++ +L+  +  L+      W     +  LL   +++   +    +  
Sbjct: 102 IKELEEEISELENEIKELEQEIERLEP-----WGNFDLDLSLLLGFKYVSVFVGTVPEDK 156

Query: 224 TDDKLRDEIK----LLHRDKST----------YENQLKEVMKKMLAEKVEVMQKCQDTER 269
            ++   +        +  DK              ++++E +KK+  E++E+ ++   +E 
Sbjct: 157 LEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSEL 216

Query: 270 KLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEK 329
            +  I+ E   +++  +   +EL+E+ KKY +           + + L    E++  L K
Sbjct: 217 -IREIKEELEEIEKERESLLEELKELAKKYLE-------ELLALYEYLEIELERAEALSK 268

Query: 330 L 330
            
Sbjct: 269 F 269


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 35.1 bits (81), Expect = 0.24
 Identities = 47/253 (18%), Positives = 97/253 (38%), Gaps = 30/253 (11%)

Query: 245 QLKEVMKKMLAEKVEVMQKCQDTER-----KLVNIENECSNLQELLDRNRDELEEIIKKY 299
            L+ + K  +A ++E +Q+    +      +L ++  E    +E L   ++E +E   + 
Sbjct: 237 ALRAIQK--VAPEIEKLQE-DFEQLLSLELRLQHLHGELVADEERLAEEQEERQEAKNRL 293

Query: 300 QDQV-TKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNY 358
           + Q+ T   + +   D+L       +  L   + E    + Q  AF    ++  ++L+  
Sbjct: 294 RQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAF-EDADI--EQLQAD 350

Query: 359 YMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNS 418
                 I+ ++  ++  L   T   Q++  KY     +I  Q         E +    N 
Sbjct: 351 LDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQ--------LERDLEKNNE 402

Query: 419 IEDIEKVTKDLREQLHTAQEEYTSL-QEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEK 477
                +  KD   Q    +E+  +L  ++  + E       + +Y        LEL+L +
Sbjct: 403 RLAAIREEKD--RQKAAIEEDLQALESQLRQQLEAGKLEFNEEEY-------ELELRLGR 453

Query: 478 LKQTYNSDCYENE 490
           LKQ  +S     E
Sbjct: 454 LKQRLDSATATPE 466


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 34.6 bits (80), Expect = 0.28
 Identities = 44/260 (16%), Positives = 91/260 (35%), Gaps = 51/260 (19%)

Query: 414 SGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLEL 473
             L S++ +E     L E++   + E  SL+E++   E+  + +E+      +++  L+ 
Sbjct: 68  DYLRSVKGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDE 127

Query: 474 QLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVS 533
           +LE L                          +          ++  D+S     +   V 
Sbjct: 128 ELEDL-------------------------EDLLEELEPLAYLD-FDLSLLRGLKFLLVR 161

Query: 534 INKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSL 593
           +     E    +  +  +      E V+A   +V     V +  E    L+K  +    L
Sbjct: 162 LGLVRREKLEALVGVIEDEVALYGENVEASVVIV-----VAHGAE---DLDKVSKILNEL 213

Query: 594 KLNSAESQSS----ASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKL-KFSQSKENDL 648
                E        + L S+L + +  I+           E+ESL+ +L   ++    +L
Sbjct: 214 GFELYEVPEFDGGPSELISELEEVIAEIQD----------ELESLRSELEALAEKIAEEL 263

Query: 649 LEEYEKLKL--VIKDYQNVL 666
           L   E L++   + D  + L
Sbjct: 264 LAVREILEIEKALGDVLSKL 283



 Score = 33.9 bits (78), Expect = 0.46
 Identities = 29/173 (16%), Positives = 65/173 (37%), Gaps = 16/173 (9%)

Query: 164 IQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKS- 222
           + E L+    L  +L  L ++  DL   + +     +     L R E LE+ +       
Sbjct: 125 LDEELEDLEDLLEELEPLAYLDFDL---SLLRGLKFLLVRLGLVRREKLEALVGVIEDEV 181

Query: 223 ---FTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECS 279
                + +    I + H  +   +  + +++ ++  E  EV +        +  +E   +
Sbjct: 182 ALYGENVEASVVIVVAHGAEDLDK--VSKILNELGFELYEVPEFDGGPSELISELEEVIA 239

Query: 280 NLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQY 332
            +Q+ L+  R ELE + +K  +           + ++L   +   + L KL  
Sbjct: 240 EIQDELESLRSELEALAEKIAE-------ELLAVREILEIEKALGDVLSKLAR 285


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 34.4 bits (79), Expect = 0.29
 Identities = 19/88 (21%), Positives = 44/88 (50%), Gaps = 7/88 (7%)

Query: 240 STYENQLK---EVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEII 296
              + QLK   E +K+   EK E+ Q+C + +++++ ++ E ++LQ+  ++ ++ L ++ 
Sbjct: 168 GNLDQQLKKALEDLKEAQEEKDELAQRCHELDKQVLLLQEEKNSLQQENEKLQERLAQLE 227

Query: 297 KKYQDQVT----KNKEFESTIDQLLNSN 320
                       K    +S ++QL   N
Sbjct: 228 GSSLGPNQLGSKKYNLLQSQLEQLQEEN 255


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 34.3 bits (79), Expect = 0.38
 Identities = 54/289 (18%), Positives = 105/289 (36%), Gaps = 38/289 (13%)

Query: 206 LSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKK--MLAEKVEVMQK 263
           L  ++ L   L    +     +L   I     +    E +LKE  +K   LA+++  ++ 
Sbjct: 181 LDLIDRLAGDLTNVLRRRKKSELPSSI---LSEIEALEAELKEQSEKYEDLAQEIAHLRN 237

Query: 264 -CQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQ-----LL 317
             ++ +R L ++E +  +    L   R++LE  +K+ +     N+     +       LL
Sbjct: 238 ELEEAQRSLESLEKKFRSEGGDLFEEREQLERQLKEIEAARKANRAQLRELAADPLPLLL 297

Query: 318 NSNQEKSNTLEKLQYE--TLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNE 375
             N   S T  +LQ E  + Q Q   +    +D  + + L    +    +KE    I  E
Sbjct: 298 IPNLLDS-TKAQLQKEEQSQQNQLTQEELEERDKELLESLPKLALPAEHVKE----IAAE 352

Query: 376 LGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHT 435
           L         I      D             +           I+ +++  +D + QL  
Sbjct: 353 L-------AEIDKPATTDSEIPHRLSGSELTQLE-------VLIQQVKRELQDAKSQLLK 398

Query: 436 AQEEYTSLQEIVNRN------EEQLAFLEQSKYLHLDKIGGLELQLEKL 478
              E       V++       EEQ+A L +      +++   E ++E+L
Sbjct: 399 ELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNELFRSEAEIEEL 447


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 33.8 bits (77), Expect = 0.39
 Identities = 17/87 (19%), Positives = 32/87 (36%), Gaps = 8/87 (9%)

Query: 405 FLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSL--------QEIVNRNEEQLAF 456
               N +  SG+  ++DI    + LR +L TAQ E  +          E+    +E+ A 
Sbjct: 62  LFLLNRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAV 121

Query: 457 LEQSKYLHLDKIGGLELQLEKLKQTYN 483
            ++      +     +      KQ  +
Sbjct: 122 RQELAAARQNLAKAQQELARLTKQAQD 148


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 33.1 bits (76), Expect = 0.41
 Identities = 17/84 (20%), Positives = 41/84 (48%), Gaps = 3/84 (3%)

Query: 584 EKSKETWVSLK-LNSAESQSS--ASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKF 640
            K +E WV  + L+   S       LQ +L +  + + + +E+ +++  E + LK++L  
Sbjct: 46  PKGREGWVLSRYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELST 105

Query: 641 SQSKENDLLEEYEKLKLVIKDYQN 664
            +++   L +E  ++K +  +   
Sbjct: 106 LEAELERLQKELARIKQLSANAIE 129


>gnl|CDD|176245 cd08285, NADP_ADH, NADP(H)-dependent alcohol dehydrogenases.  This
           group is predominated by atypical alcohol
           dehydrogenases; they exist as tetramers and exhibit
           specificity for NADP(H) as a cofactor in the
           interconversion of alcohols and aldehydes, or ketones.
           Like other zinc-dependent alcohol dehydrogenases (ADH)
           of the medium chain alcohol dehydrogenase/reductase
           family (MDR), tetrameric ADHs have a catalytic zinc that
           resides between the catalytic and NAD(H)binding domains;
           however, they do not have and a structural zinc in a
           lobe of the catalytic domain.  The medium chain alcohol
           dehydrogenase family (MDR) has a NAD(P)(H)-binding
           domain in a Rossmann fold of a beta-alpha form. The
           N-terminal region typically has an all-beta catalytic
           domain. These proteins typically form dimers (typically
           higher plants, mammals) or tetramers (yeast, bacteria),
           and have 2 tightly bound zinc atoms per subunit.
          Length = 351

 Score = 33.4 bits (77), Expect = 0.47
 Identities = 14/28 (50%), Positives = 17/28 (60%), Gaps = 2/28 (7%)

Query: 808 IRKPKPVSVPTDASLVKPQSTTGCQTSD 835
           I KP PV  P DA +V+P +   C TSD
Sbjct: 15  IEKPIPVCGPNDA-IVRPTAVAPC-TSD 40


>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
           subunit.  CENP-Q is one of the components that assembles
           onto the CENP-A-nucleosome distal (CAD) centromere. The
           centromere, which is the basic element of chromosome
           inheritance, is epigenetically determined in mammals.
           CENP-A, the centromere-specific histone H3 variant,
           assembles an array of nucleosomes and it is this that
           seems to be the prime candidate for specifying
           centromere identity. CENP-A nucleosomes directly recruit
           a proximal CENP-A nucleosome associated complex (NAC)
           comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
           CENP-C and CENP-H. Assembly of the CENP-A NAC at
           centromeres is dependent on CENP-M, CENP-N and CENP-T.
           Additionally, there are seven other subunits which make
           up the CENP-A-nucleosome distal (CAD) centromere,
           CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
           CENP-S, also assembling on the CENP-A NAC. Fta7 is the
           equivalent component of the fission yeast Sim4 complex.
          Length = 159

 Score = 32.3 bits (74), Expect = 0.53
 Identities = 28/120 (23%), Positives = 54/120 (45%), Gaps = 3/120 (2%)

Query: 600 SQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVI 659
            +   +L+ QL   L  ++  +EE  +   E+E   E L+  +     L  E E+     
Sbjct: 26  LRRKRALERQLAAELKQLELLEEEIRREEAELEKDLEYLQELEKNAKALEREREEE---S 82

Query: 660 KDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVESVTSNME 719
           K+   VL  +   +    +  QDS   + E     +L+P   +L + ++K++ES+  N+E
Sbjct: 83  KNLHPVLRLLESEVLEENELLQDSLLELSERNFSPNLDPELLDLLKQLNKHLESLQGNLE 142


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
           This family consists of several plant proteins of
           unknown function. Several sequences in this family are
           described as being "myosin heavy chain-like".
          Length = 484

 Score = 33.4 bits (76), Expect = 0.57
 Identities = 37/147 (25%), Positives = 64/147 (43%), Gaps = 9/147 (6%)

Query: 592 SLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEE 651
           SL+   +   S AS+  +L +A  N++K  EE   +   +ESLK++L+  + KE + L E
Sbjct: 243 SLQKEISIMASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQELEEEK-KELEELRE 301

Query: 652 YEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNV 711
            E              S+   +N TK + +   +    EI  + L   SSE +E   K  
Sbjct: 302 KEGEAEEA------ASSLEAELNRTKSEKESKAREKMVEIPLK-LQQASSEAEEA-RKEA 353

Query: 712 ESVTSNMENDKSNEDDNSSVQSTDEVK 738
           E+    +   K   +   +   T E++
Sbjct: 354 EAAREELRKLKEEAEQTKAALETAELR 380


>gnl|CDD|214710 smart00533, MUTSd, DNA-binding domain of DNA mismatch repair MUTS
           family. 
          Length = 308

 Score = 33.0 bits (76), Expect = 0.58
 Identities = 41/162 (25%), Positives = 70/162 (43%), Gaps = 30/162 (18%)

Query: 153 PLEELYQLNTCIQ--EALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVE 210
           PL +L ++N  +   E L     L+ KL +L      LK+  D+        +RLLSR+E
Sbjct: 27  PLLDLKEINERLDAVEELVENPELRQKLRQL------LKRIPDL--------ERLLSRIE 72

Query: 211 FLE---SQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQ-D 266
                   L     S   + L++  +LL          L +V+ + L E +E++ +   D
Sbjct: 73  RGRASPRDLLRLYDSL--EGLKEIRQLLESLDGPLLGLLLKVILEPLLELLELLLELLND 130

Query: 267 TERKLVNIENECSN--------LQELLDRNRDELEEIIKKYQ 300
            +   VN      +        L+E L+   +ELEE++KK +
Sbjct: 131 DDPLEVNDGGLIKDGFDPELDELREKLEELEEELEELLKKER 172


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 33.4 bits (76), Expect = 0.68
 Identities = 51/363 (14%), Positives = 127/363 (34%), Gaps = 32/363 (8%)

Query: 251 KKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFE 310
           +K      +++++ ++    ++++E       +L ++ +  LE        +  + +E  
Sbjct: 170 RKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALE---YYQLKEKLELEEEN 226

Query: 311 STIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMT 370
                 L  N+E+ + L++L  +  +  +       K                  +E + 
Sbjct: 227 LLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEK-----------------EEEILA 269

Query: 371 NIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLR 430
            +  E   E K  +    +       +     E     +E        ++D EK+ +   
Sbjct: 270 QVLKENKEEEKEKKLQEEE-------LKLLAKEEEELKSELLKLERRKVDDEEKLKESE- 321

Query: 431 EQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENE 490
           ++L   ++E    +E +   E++L  LE  +    ++   LE   EKL+Q       + +
Sbjct: 322 KELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKK 381

Query: 491 KFGKAYNCDTLLINEEKTVAPN--QKDVNIIDVSNQYNSEKDFV--SINKEIEESSRKID 546
              +  +    L  EE  +     ++   ++++S Q             K +EE    ++
Sbjct: 382 LESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLE 441

Query: 547 DINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASL 606
               ++T+   EL      L+K++  +   ++     +  K       L   +    AS 
Sbjct: 442 TKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKLLEQLELLLLRQKLEEASQ 501

Query: 607 QSQ 609
           +  
Sbjct: 502 KES 504



 Score = 31.9 bits (72), Expect = 1.8
 Identities = 35/209 (16%), Positives = 68/209 (32%), Gaps = 14/209 (6%)

Query: 535 NKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLK 594
            +E+    ++  + + +  +   E++       K E+    LQE+   L   +E  +  +
Sbjct: 243 LQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSE 302

Query: 595 LNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEK 654
           L   E +     +       +  K  KE   +     E  KE  +    +E +  EE + 
Sbjct: 303 LLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQL 362

Query: 655 LKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVESV 714
            KL  K  Q       E     K K +    S   ++ ++ L   + E  E         
Sbjct: 363 EKLQEKLEQL------EEELLAKKKLESERLSSAAKLKEEELELKNEEEKE--------A 408

Query: 715 TSNMENDKSNEDDNSSVQSTDEVKDTEEG 743
              +E  +  ED     +  +     E  
Sbjct: 409 KLLLELSEQEEDLLKEEKKEELKIVEELE 437



 Score = 31.9 bits (72), Expect = 2.1
 Identities = 54/415 (13%), Positives = 136/415 (32%), Gaps = 35/415 (8%)

Query: 250  MKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEF 309
                  E  ++++  +  E  L    +    L E  +      E   +   +Q  + K  
Sbjct: 628  GILKDTELTKLLESAKAKESGLRKGVSLEEGLAEKSELKASLSELTKELLAEQELQEKA- 686

Query: 310  ESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKM 369
            ES + +     +++    ++ + +    + +L+   L  + + +          ++++K+
Sbjct: 687  ESELAKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELKLLEQKI 746

Query: 370  TNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDL 429
               + E   ++++ +    +   ++     +  E      E         E ++   ++L
Sbjct: 747  KEKEEEE-EKSRLKKEEEEEEKSELSLKEKELAEE-EEKTEKLKVEEEKEEKLKAQEEEL 804

Query: 430  REQLHTAQ-EEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYE 488
            R      + E     +E +   +E+    E+ + L L+     +L+    ++    +   
Sbjct: 805  RALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEI 864

Query: 489  NEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDI 548
             ++             EE+      KD        +           KE+EE S+K + +
Sbjct: 865  TKEELLQELLLKEEELEEQ----KLKDELESKEEKEKE-------EKKELEEESQKDNLL 913

Query: 549  NAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQS 608
              +  +                      +E   LL+   E    L   + E +     + 
Sbjct: 914  EEKENEIEE----------------RIAEEAIILLKYESEPEELLLEEADEKEKEEDNKE 957

Query: 609  QLNDALDNIKKYKEEHSKMT----MEIESLKEKLKFSQSKENDLLEEYEKLKLVI 659
            +  +    +   KEE   +      E E  +E+    + K+  L EE ++L   I
Sbjct: 958  EEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREI 1012


>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
           The small mechanosensitive channel, MscS, is a part of
           the turgor-driven solute efflux system that protects
           bacteria from lysis in the event of osmotic shock. The
           MscS protein alone is sufficient to form a functional
           mechanosensitive channel gated directly by tension in
           the lipid bilayer. The MscS proteins are heptamers of
           three transmembrane subunits with seven converging M3
           domains, and this MscS_porin is towards the N-terminal
           of the molecules. The high concentration of negative
           charges at the extracellular entrance of the pore helps
           select the cations for efflux.
          Length = 239

 Score = 32.2 bits (74), Expect = 0.80
 Identities = 26/144 (18%), Positives = 54/144 (37%), Gaps = 14/144 (9%)

Query: 209 VEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQ------ 262
           ++ LE  L    +    DK + +     +       +L+E+ +++ A K           
Sbjct: 20  IQDLEQALNFLDEI---DKSKQKADQYQKQIDDAPKELRELRQELEALKKTDAPVFPELA 76

Query: 263 --KCQDTERKLVNIENECSNLQELLDRNRDELEEI---IKKYQDQVTKNKEFESTIDQLL 317
                  E++L    ++   LQE L +   +L E+    ++ Q Q+++ +     I   L
Sbjct: 77  NLSLSQLEQRLAQTLSQLQELQEQLQQENSQLIELQTRPERAQQQLSEARRRLQEIRNRL 136

Query: 318 NSNQEKSNTLEKLQYETLQAQKQL 341
            +       L + Q   LQA+   
Sbjct: 137 QALSPGGTPLAQAQRTLLQAELAA 160


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 33.1 bits (77), Expect = 0.81
 Identities = 18/75 (24%), Positives = 40/75 (53%), Gaps = 7/75 (9%)

Query: 600 SQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVI 659
            + +A L+++ ++  + ++   EE  ++  E+E LK KL  + +   DLL + +++  V 
Sbjct: 704 KELAALLKAKPSELPERVEALLEELKELEKELEQLKAKL--AAAAAGDLLAQAKEVNGV- 760

Query: 660 KDYQNVLPSIVESMN 674
                VL + VE ++
Sbjct: 761 ----KVLAAQVEGVD 771


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 32.7 bits (75), Expect = 0.83
 Identities = 23/119 (19%), Positives = 59/119 (49%), Gaps = 6/119 (5%)

Query: 226 DKLRDEIKLLHRDKS---TYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQ 282
           + L + ++ L  D          L  +  K L ++ + +++     ++L +   +C    
Sbjct: 147 EGLDENLEGLKEDYKLLMKELELLNSIKPK-LRDRKDALEEELRQLKQLEDELEDCD--P 203

Query: 283 ELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQL 341
             LDR +++L++++++   +V K +E E  + +L +  ++ +N   +L  E  +A+K+L
Sbjct: 204 TELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKL 262



 Score = 31.1 bits (71), Expect = 2.3
 Identities = 32/138 (23%), Positives = 58/138 (42%), Gaps = 9/138 (6%)

Query: 512 NQKDVNIIDVSNQYNSEKDFVS-INKEIEESSRKIDDINAEITKC-TNELVDAKNCLVKN 569
            +    ++      NS K  +      +EE  R++  +  E+  C   EL  AK  L K 
Sbjct: 157 KEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKL 216

Query: 570 EKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTM 629
            + ++   ++   LE+  +         ++ +   + +S+LN  +   +K  E+    T 
Sbjct: 217 LQEIMIKVKKLEELEEELQE------LESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTF 270

Query: 630 -EIESLKEKLKFSQSKEN 646
            EIE LKE+LK  QS   
Sbjct: 271 KEIEKLKEQLKLLQSLTG 288


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 32.7 bits (75), Expect = 0.84
 Identities = 17/77 (22%), Positives = 38/77 (49%), Gaps = 3/77 (3%)

Query: 404 YFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYL 463
           YF   N++      +I ++++  ++L  Q    +E+ +  + ++   + Q   LEQ++  
Sbjct: 160 YFGYLNQARQ---ETIAELKQTREELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNE 216

Query: 464 HLDKIGGLELQLEKLKQ 480
               + GLE  L+K +Q
Sbjct: 217 RKKTLTGLESSLQKDQQ 233


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 32.8 bits (76), Expect = 0.84
 Identities = 23/101 (22%), Positives = 47/101 (46%), Gaps = 10/101 (9%)

Query: 243 ENQLKEVMKKMLAE-KVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQD 301
           + + + + K+ L E K E+ +   + E++L    NE   LQ+L  R   + EE + +  +
Sbjct: 48  KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNE---LQKLEKRLLQK-EENLDRKLE 103

Query: 302 QVTKNKEFESTIDQLLNSNQEKSNTLEKLQ--YETLQAQKQ 340
            + K    E  +++     ++K   LEK +   E L  ++ 
Sbjct: 104 LLEKR---EEELEKKEKELEQKQQELEKKEEELEELIEEQL 141



 Score = 32.1 bits (74), Expect = 1.5
 Identities = 17/67 (25%), Positives = 35/67 (52%)

Query: 229 RDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRN 288
           R+E++ L +     E  L   ++ +   + E+ +K ++ E+K   +E +   L+EL++  
Sbjct: 81  RNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQ 140

Query: 289 RDELEEI 295
             ELE I
Sbjct: 141 LQELERI 147



 Score = 31.3 bits (72), Expect = 2.6
 Identities = 21/112 (18%), Positives = 51/112 (45%), Gaps = 27/112 (24%)

Query: 201 QEDRLLSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEV 260
            E RLL + E L+ +L    K   +++L  + K L + +   E + +E+ ++++ E+++ 
Sbjct: 87  LEKRLLQKEENLDRKLELLEKR--EEELEKKEKELEQKQQELEKKEEEL-EELIEEQLQE 143

Query: 261 MQKCQDTERKLVNIENECSNL------QELLDRNRDELEE----IIKKYQDQ 302
           ++                S L      + LL++  +E       +IK+ +++
Sbjct: 144 LE--------------RISGLTAEEAKEILLEKVEEEARHEAAVLIKEIEEE 181


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 32.8 bits (75), Expect = 0.91
 Identities = 34/221 (15%), Positives = 74/221 (33%), Gaps = 26/221 (11%)

Query: 242 YENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDR---------NRDEL 292
            E+ LK  ++K    K   +    +  R++ + E++  +L ++            N D +
Sbjct: 759 LEDLLKARLEKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDIDFELELIGKQEINIDYI 818

Query: 293 EEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMH 352
            E+++ + D+    +  +  I++++    E     +KL    ++A  Q  A         
Sbjct: 819 LELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKK------L 872

Query: 353 DKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESE 412
           D  +     +  I  K      +   E  +++  L  Y D             L  N +E
Sbjct: 873 DTAEKLEELY--ILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGK------LPENGTE 924

Query: 413 SSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQ 453
                + E   +       +     EE  +  +   R   Q
Sbjct: 925 LVEKLAKEKSLREKNKDDWKAK---EEVEAKLKAFFRRFIQ 962


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 32.7 bits (74), Expect = 0.92
 Identities = 32/118 (27%), Positives = 55/118 (46%), Gaps = 13/118 (11%)

Query: 664 NVLPSIVESMNNTKDKPQD-SE----QSIEEEITQQSLNPLSSELDETIHKNVESVTSNM 718
            V  S V S +++K  P   SE    +S +E ++Q+S    ++E ++    +     S  
Sbjct: 61  KVWKSAVSSSDDSKTVPTPVSEPNITRSFQEPVSQESEVQDNTEQNQDTKGSKTD--SEE 118

Query: 719 ENDKSNEDDNSSVQSTDEVKDTEEGPHTPSLVDKSEDTISVTTLVNVNTESSEGDLSP 776
           ++D S E+DN S  S    KD +    T   V  S  + S T+  ++NT+ S+  L  
Sbjct: 119 DDDDSEEEDNKSTSS----KDGKGSKKTQPGVSTS--SGSTTSGTDLNTKQSQTGLGA 170


>gnl|CDD|226655 COG4192, COG4192, Signal transduction histidine kinase regulating
           phosphoglycerate transport system [Signal transduction
           mechanisms].
          Length = 673

 Score = 32.6 bits (74), Expect = 1.0
 Identities = 36/200 (18%), Positives = 84/200 (42%), Gaps = 11/200 (5%)

Query: 166 EALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTD 225
              Q+   L+  L+ +  ++  L +    + +  +QE R L +    +      ++   D
Sbjct: 82  TLNQQYQRLQQDLTAIHALLASLPQNEQAALQQGLQELRQLLQ----QINDLVSARIDVD 137

Query: 226 DKLR---DEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQ-DTERKLVNIENECSNL 281
            +L    ++I  LH+D     + L++ ++      ++ + + Q D  ++++        L
Sbjct: 138 RRLALFAEQIDWLHQDFGMELSPLRQELQWQQETLLDGITQAQVDNGQRVLQYRLYQQEL 197

Query: 282 QELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQL 341
           +E+ +  R E  +I +  +D+V + ++  STI QL   +    + L       +Q     
Sbjct: 198 EEVYNVLRLE-GQIQQSLRDRVVETQKLNSTI-QLDQQHTAYLDLLANADALFIQV-LGQ 254

Query: 342 DAFTLKDNLMHDKLKNYYMN 361
            A T+  + + D+L     N
Sbjct: 255 TASTVTLHQLSDELDAIGHN 274


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 32.4 bits (74), Expect = 1.1
 Identities = 34/188 (18%), Positives = 65/188 (34%), Gaps = 21/188 (11%)

Query: 155 EELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQ-EDRLLSRVEFLE 213
           ++  +L   ++        L+ +L +    +  L+K        L   E +   +   L 
Sbjct: 59  DQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEVQEREQRRRLA 118

Query: 214 SQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVN 273
            QLAA  +S  +        L+  + +    +L      +   + E +   + T ++L  
Sbjct: 119 EQLAALQRSGRNPPP---ALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAA 175

Query: 274 IENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYE 333
           +  E +  Q  L     E      K              + QLL   +E+  TL +L  E
Sbjct: 176 VRAEIAAEQAELTTLLSEQRAQQAK--------------LAQLL---EERKKTLAQLNSE 218

Query: 334 TLQAQKQL 341
               QK+L
Sbjct: 219 LSADQKKL 226



 Score = 32.0 bits (73), Expect = 1.6
 Identities = 27/190 (14%), Positives = 58/190 (30%), Gaps = 31/190 (16%)

Query: 523 NQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSL 582
                +K+  ++ K+I E   +   +  ++     E+   +  L++    +  L++Q + 
Sbjct: 39  QLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIAD 98

Query: 583 LEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQ 642
           L           L   E +    L  QL               + +         +    
Sbjct: 99  LNARLN-----ALEVQEREQRRRLAEQLAAL------------QRSGRNPPPALLVSPED 141

Query: 643 SKENDLLEEYEKLKLVIKDYQNVLPSI---VESMNNTKDKPQDSEQSIEEEITQQSLNPL 699
           ++ +  L  Y         Y  + P+    ++++  T  +       I  E  Q  L  L
Sbjct: 142 AQRSVRLAIY---------YGALNPARAERIDALKATLKQLAAVRAEIAAE--QAELTTL 190

Query: 700 SSELDETIHK 709
            SE      K
Sbjct: 191 LSEQRAQQAK 200


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 32.7 bits (74), Expect = 1.1
 Identities = 43/193 (22%), Positives = 75/193 (38%), Gaps = 20/193 (10%)

Query: 622  EEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQ 681
            E+ +    E + L+E      +     L+E E  +L   D   V    V S N   D  +
Sbjct: 3825 EDITNTLNEDDDLEELANEEDTANQSDLDESEARELE-SDMNGVTKDSVVSENENSDS-E 3882

Query: 682  DSEQSIEEEITQ------QSLN------PLSSELDETIHKNVESVTSNMENDKSN-EDDN 728
            +  Q ++EE+         SLN      P   +L ET  K+ E   +N E+D  + EDDN
Sbjct: 3883 EENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDN 3942

Query: 729  SSVQSTDEVKDTEEGPHTPSLVDKSEDTISVTTLVNVNTESSEGDLSPIDPPESVIALIN 788
             +++  D  +  +E   +  +    E    +   +  N      +   +D PE  + L  
Sbjct: 3943 KALEDKDRQEKEDEEEMSDDVGIDDE----IQPDIQENNSQPPPENEDLDLPED-LKLDE 3997

Query: 789  DDDTSDSSSEMTD 801
             +      S++ D
Sbjct: 3998 KEGDVSKDSDLED 4010


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 32.7 bits (75), Expect = 1.1
 Identities = 42/261 (16%), Positives = 95/261 (36%), Gaps = 35/261 (13%)

Query: 254 LAEKVEVMQ--------KCQDTERKLVNIENECSNLQELL-----------DRNRDELEE 294
           L E V   Q        +C + ERKL  +E++   L   L            +   +LEE
Sbjct: 15  LNEDVSAFQRKFVNEVRRCDEVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLEE 74

Query: 295 IIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQL-DAFTLKDNLMHD 353
            I   + ++ + +E   ++++ +N  +E  N L++ +    +  ++L +   L  +  + 
Sbjct: 75  EILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNLDIDFKYL 134

Query: 354 KLKNYYMNFHII----KEKMTNIKNEL-----GLETKVDQNILVKY-FDDVHQILTQFDE 403
           +         +     +EK+   + EL     G   + +    ++     V  I     E
Sbjct: 135 RGAEGLKLGFVAGVINREKLEAFERELWRACRGYIRQAEIEEPLEDPKKTVFIIFFVGKE 194

Query: 404 YFLRFNESESSGLN-SIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKY 462
              +  +         + D+ +   +  E +         ++E+    E+  + LE+   
Sbjct: 195 DLDKV-KKILDSFGFELYDVPETEGERSELISKVN---KRIEELQRVLEQTESHLEKVLV 250

Query: 463 LHLDKIGGLELQLEKLKQTYN 483
              D++   + Q+ K K  Y 
Sbjct: 251 KIADELLAWDEQVSKEKAVYE 271



 Score = 30.4 bits (69), Expect = 4.8
 Identities = 34/209 (16%), Positives = 80/209 (38%), Gaps = 38/209 (18%)

Query: 155 EELYQLNTCIQEALQRE--------NMLKNKLSKLQHIVIDLKKAA--DVSWKALIQEDR 204
            E+ ++     E+L++E        N+L  + S L   + +L + +  D+ +K L   + 
Sbjct: 81  AEIKEVEE-NLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNLDIDFKYLRGAEG 139

Query: 205 L----------LSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTY---------ENQ 245
           L            ++E  E +L    + +       E  L    K+ +          ++
Sbjct: 140 LKLGFVAGVINREKLEAFERELWRACRGYIRQ-AEIEEPLEDPKKTVFIIFFVGKEDLDK 198

Query: 246 LKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTK 305
           +K+++     E  +V +   +    +  +      LQ +L++    LE+++ K  D++  
Sbjct: 199 VKKILDSFGFELYDVPETEGERSELISKVNKRIEELQRVLEQTESHLEKVLVKIADEL-- 256

Query: 306 NKEFESTIDQLLNSNQEKSNTLEKLQYET 334
                   D+ ++  +    TL    Y+T
Sbjct: 257 -----LAWDEQVSKEKAVYETLNLFNYDT 280


>gnl|CDD|178768 PLN03229, PLN03229, acetyl-coenzyme A carboxylase carboxyl
           transferase subunit alpha; Provisional.
          Length = 762

 Score = 32.5 bits (74), Expect = 1.4
 Identities = 50/226 (22%), Positives = 94/226 (41%), Gaps = 50/226 (22%)

Query: 174 LKNKLSKLQHIVI------------DLKKAADVSWKALIQEDRLLSRVEFLESQLA---A 218
           LK KL  L                  LK   +  +K ++    +  ++E L++++A   A
Sbjct: 533 LKYKLDMLNEFSRAKALSEKKSKAEKLKAEINKKFKEVMDRPEIKEKMEALKAEVASSGA 592

Query: 219 YSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENEC 278
            S    DD L+++++   + K   E +L  V+K M  E + V +K +DT  +        
Sbjct: 593 SSGDELDDDLKEKVE---KMKKEIELELAGVLKSMGLEVIGVTKKNKDTAEQTPP----- 644

Query: 279 SNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQ 338
                        L+E I+        N+E    I++++ S+  K + +E L+ E  +A 
Sbjct: 645 -----------PNLQEKIESL------NEEINKKIERVIRSSDLK-SKIELLKLEVAKAS 686

Query: 339 KQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQ 384
           K  D  T K+ +  + L+        IK+K+    N   L+ K ++
Sbjct: 687 KTPDV-TEKEKI--EALEQQ------IKQKIAEALNSSELKEKFEE 723


>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family.  In a
           subset of endospore-forming members of the Firmcutes,
           members of this protein family are found, several to a
           genome. Two very strongly conserved sequences regions
           are separated by a highly variable linker region. Much
           of the linker region was excised from the seed alignment
           for this model. A characterized member is the
           prespore-specific transcription RsfA from Bacillus
           subtilis, previously called YwfN, which is controlled by
           sigma factor F and seems to fine-tune expression of some
           genes in the sigma-F regulon. A paralog in Bacillus
           subtilis is designated YlbO [Regulatory functions, DNA
           interactions, Cellular processes, Sporulation and
           germination].
          Length = 161

 Score = 31.3 bits (71), Expect = 1.4
 Identities = 15/41 (36%), Positives = 28/41 (68%), Gaps = 1/41 (2%)

Query: 419 IEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQ 459
            E++EK  + LR++L T +E+Y +L +I++R   +LA +E 
Sbjct: 120 NEELEKELEKLRQRLSTIEEDYQTLIDIMDR-ARKLAVVED 159


>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904).  This
           family consists of several bacterial and archaeal
           hypothetical proteins of unknown function.
          Length = 72

 Score = 29.2 bits (66), Expect = 1.6
 Identities = 19/71 (26%), Positives = 37/71 (52%), Gaps = 7/71 (9%)

Query: 606 LQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNV 665
           L+S++  A++ I   +       MEIE LKE+ +    +  +L EE +KLK     +Q+ 
Sbjct: 9   LESKIQQAVETIALLQ-------MEIEELKEENEQLSEENEELEEENKKLKEERNAWQSR 61

Query: 666 LPSIVESMNNT 676
           L +++  ++  
Sbjct: 62  LRALLGKLDEV 72


>gnl|CDD|185283 PRK15385, PRK15385, magnesium transport protein MgtC; Provisional.
          Length = 225

 Score = 31.4 bits (71), Expect = 1.7
 Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 297 KKYQDQVTKNKEFESTIDQ-LLNSNQEKSNTLEKLQYETLQAQKQLD 342
           K+Y  +VT NKE ES + Q LLN  +E +  L+ L     Q Q   +
Sbjct: 141 KRYILKVTCNKEDESAVRQWLLNIVKEAAICLQGLGSVPAQEQGYKE 187


>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase.  The model describes
           alanine--tRNA ligase. This enzyme catalyzes the reaction
           (tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
           pyrophosphate + AMP) [Protein synthesis, tRNA
           aminoacylation].
          Length = 851

 Score = 32.0 bits (73), Expect = 1.8
 Identities = 16/85 (18%), Positives = 37/85 (43%), Gaps = 5/85 (5%)

Query: 600 SQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKE-NDLLEEYEKLKLV 658
            + +  L+   N+    I+++ EE   +  E+ESLK K+   +  E  D  E    +K++
Sbjct: 711 KELADILKVTPNELPKKIERFFEEWKALQKELESLKLKIADQKIDELKDKAETINGVKVL 770

Query: 659 IKD----YQNVLPSIVESMNNTKDK 679
           ++         L ++ + + +    
Sbjct: 771 VEVVDAKDMKSLKTMADRLKSKLGS 795


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 31.8 bits (73), Expect = 2.0
 Identities = 44/187 (23%), Positives = 78/187 (41%), Gaps = 50/187 (26%)

Query: 272 VNIENECSNLQELLDRNRDELEEI-IKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKL 330
           ++IE E  +L+E +D+N   LEE+ + + ++   +N+E E  ID L              
Sbjct: 248 LDIEKELQDLKEQIDQNLALLEELDLDEAEE---ENEEIEERIDTL-------------- 290

Query: 331 QYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETK-VDQNILVK 389
            Y+ L+ + +   F  K+    DKL ++        E       +L LE   + Q+    
Sbjct: 291 -YDILEKEVKAKKFVEKN---IDKLTDFL-------EHAREQNKQLLLELDRLQQS---- 335

Query: 390 YFDDVHQILTQFDEYFLRFNESESS-GLNS-IEDIEKVTKDLREQLHTAQEEYTSLQEIV 447
                         Y L  +E E+   L   +E++E     L E++   +  Y+ LQE +
Sbjct: 336 --------------YTLNEDELETVRELEKQLEELEAQYDQLVERIAEKKVAYSELQEEL 381

Query: 448 NRNEEQL 454
              E+QL
Sbjct: 382 EEIEKQL 388


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 31.5 bits (72), Expect = 2.2
 Identities = 36/200 (18%), Positives = 72/200 (36%), Gaps = 27/200 (13%)

Query: 387 LVKYFDDVHQ----ILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTS 442
           L  +FD+ +Q     L +  + +    E     L SI D EK   + +  +   +    +
Sbjct: 272 LEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEA 331

Query: 443 LQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLL 502
           L+EI+ +N ++L           +K+      +E       S     E      +    L
Sbjct: 332 LEEILEKNLQKLE----------EKLKDPSTSIE-----LESITDLIESINDIIDAINEL 376

Query: 503 INEEKTVAPNQKDVNIIDVSNQYNS--EKDFV-SINKEIEESSRKIDDINAEITKCTNEL 559
           I E      N+K  N+    N+         V  + ++I+   ++   +   I     E+
Sbjct: 377 IREH-----NEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEI 431

Query: 560 VDAKNCLVKNEKWVINLQEQ 579
              +  +   EK +  L++Q
Sbjct: 432 KQLEAEIKALEKEIKELEKQ 451



 Score = 30.7 bits (70), Expect = 4.4
 Identities = 29/201 (14%), Positives = 84/201 (41%), Gaps = 28/201 (13%)

Query: 474 QLEKLKQTYNSDC-YENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFV 532
           QLE+L   Y S      E+     + +      +  V   +  +  ++   + N +K   
Sbjct: 286 QLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQK--- 342

Query: 533 SINKEIEE--SSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETW 590
            + +++++  +S +++ I   I    + +      + ++ + + NL+++ +  +K     
Sbjct: 343 -LEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAKKKLWLH 401

Query: 591 VSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLE 650
           +                  + +  ++I  Y++E   +   I SL++++K  +++   L +
Sbjct: 402 L------------------VAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEK 443

Query: 651 EYEKLKLVIKDYQNVLPSIVE 671
           E ++L+  +    N+ P+  E
Sbjct: 444 EIKELEKQLT---NIEPTADE 461


>gnl|CDD|234953 PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed.
          Length = 476

 Score = 31.6 bits (73), Expect = 2.3
 Identities = 30/155 (19%), Positives = 55/155 (35%), Gaps = 49/155 (31%)

Query: 149 ISSMPLEELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSR 208
           I  +  EEL +L   +   L++E +       L+ +V             L++E     R
Sbjct: 325 IRELDPEELAEL--LLPWHLEQEGLDTEDGPYLEKVV------------PLLKE-----R 365

Query: 209 VEFLESQLAAYSKS-FTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDT 267
            + L  +LA  ++  F D    DE           +  LKEV +++L   +E        
Sbjct: 366 AKTL-KELAELARFFFEDFPEYDEDAA--------KKHLKEVAREVLEALLE-------- 408

Query: 268 ERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQ 302
                        L+ L +   + +E  IK   ++
Sbjct: 409 ------------KLEALEEWTAEAIEAAIKAVAEE 431


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score = 31.5 bits (72), Expect = 2.4
 Identities = 16/68 (23%), Positives = 32/68 (47%), Gaps = 3/68 (4%)

Query: 229 RDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLV-NIENECSNLQELLDR 287
            D +  +  D  +YE Q      K+   K+++++K +         +E E   L+  +DR
Sbjct: 3   LDALDEILGDLESYEPQEYL--NKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDR 60

Query: 288 NRDELEEI 295
            R+E+E +
Sbjct: 61  LREEIERL 68


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 31.3 bits (71), Expect = 2.5
 Identities = 28/175 (16%), Positives = 67/175 (38%), Gaps = 22/175 (12%)

Query: 671 ESMNNTKDKPQDSEQSIEEEITQ-QSLNPLSSELDETIHKNVESVTSNMENDKSNEDDNS 729
           +  +   +    ++ + +++ +   S    S+ + + I+KN+     N    K+  DDN 
Sbjct: 58  KDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNY 117

Query: 730 SVQST--------DEVKDTEEGPHTPSLVDKSEDTISVTTLVNVNTESSEGDLSPIDPPE 781
           S+ +          ++ D E+  ++    + S      +   + +T+SS+ D +      
Sbjct: 118 SLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKA------ 171

Query: 782 SVIALINDDDTSDSSSEMTDSSTLIRIRKPKPVSVPTDASLVKPQSTTGCQTSDE 836
                  D+  + SS+    S++  +   PKP       S      T   ++S +
Sbjct: 172 -------DNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSK 219



 Score = 29.4 bits (66), Expect = 9.9
 Identities = 32/245 (13%), Positives = 78/245 (31%), Gaps = 16/245 (6%)

Query: 512 NQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNE----LVDAKNCLV 567
                N     +  +S     +   +    +    +IN  +TK   +    L      L 
Sbjct: 68  TSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLF 127

Query: 568 KNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKM 627
                + + ++  +  + + ++  +   +      +   QS   D  DN K     ++K 
Sbjct: 128 NLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDT---QSSKQDKADNQKAPSSNNTKP 184

Query: 628 TMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQN------VLPSIV-ESMNNTKDKP 680
           +   +          ++ N      +          N       L SI+ +   + K   
Sbjct: 185 STSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQ 244

Query: 681 QDSEQSIE--EEITQQSLNPLSSELDETIHKNVESVTSNMENDKSNEDDNSSVQSTDEVK 738
           +D     +  +  T  + NP     DE  HK+  + +   + ++SN    S  ++   + 
Sbjct: 245 KDYASQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFENDVNQSNTRSTSLFETGPSLS 304

Query: 739 DTEEG 743
           + ++ 
Sbjct: 305 NNDDS 309


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 30.7 bits (70), Expect = 2.6
 Identities = 24/141 (17%), Positives = 54/141 (38%), Gaps = 21/141 (14%)

Query: 228 LRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQ----------DTERKLVNIENE 277
           L++  KL  + K      +KEV++ ++ + +   +K                KL   +  
Sbjct: 14  LKELEKLAPKVKGISSMSVKEVLQSLVDDGLVRTEKIGTSNYYWSFPSQALNKL---KTR 70

Query: 278 CSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQA 337
              L++ L+  +  + E+  + +      +E      +LL   ++    L+KL+      
Sbjct: 71  LEKLKKELEELKQRIAELQAQIEKLKKGREE-TEERTELLEELKQLEKELKKLK------ 123

Query: 338 QKQLDAFTLKDNLMHDKLKNY 358
             +L+ +   D    +KLK  
Sbjct: 124 -AELEKYEKNDPERIEKLKEE 143


>gnl|CDD|144876 pfam01442, Apolipoprotein, Apolipoprotein A1/A4/E domain.  These
           proteins contain several 22 residue repeats which form a
           pair of alpha helices. This family includes:
           Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein
           E.
          Length = 191

 Score = 30.7 bits (70), Expect = 2.7
 Identities = 28/106 (26%), Positives = 52/106 (49%), Gaps = 17/106 (16%)

Query: 609 QLNDALDNIKKYKEEHSKMTMEIESLKEKL-KFSQSKENDLLEEYEKLKLVI-KDYQNVL 666
            L+D+LD +  Y EE          L+E+L  ++Q     L +E E L+  + KD ++V 
Sbjct: 2   LLSDSLDELNSYAEE----------LQEQLGPYAQEFWARLSKETEALREELQKDLEDVR 51

Query: 667 PSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVE 712
             +   ++  K       Q++EE   +Q L P + EL + ++++ E
Sbjct: 52  ARLQPYLDELK---AKVGQNLEE--LRQRLAPYAEELRKRLNRDAE 92


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 31.4 bits (71), Expect = 3.1
 Identities = 59/278 (21%), Positives = 113/278 (40%), Gaps = 73/278 (26%)

Query: 147 MLISSMPLEELYQLNTCI--QEALQRE-NMLKNKLSKLQHIVIDLKKAADVSWKALIQED 203
           +L++   L+ L  L   +  +EALQ + N+L+ +LS+       +K AA       I E+
Sbjct: 145 LLLNQARLQALEDLEKILTEKEALQGKINILEMRLSETDA---RIKLAAQEKIHVEILEE 201

Query: 204 RL-------LSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAE 256
           +L       L R       + + SK      L++E  LL  D    + +L EV       
Sbjct: 202 QLEKLRNELLIRGATEGLCVHSLSKELDV--LKEENMLLKDDIQFLKAELIEV------- 252

Query: 257 KVEVMQKCQDTERKLVNIENECS------------------------------------N 280
                    +TE ++  +E E S                                    N
Sbjct: 253 --------AETEERVFKLEKERSLLDASLRELESKFIVAQEDVSKLSPLQYDCWWEKVEN 304

Query: 281 LQELLDRNRDELEE---IIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQA 337
           LQ+LLDR  +++E+   ++ + QD   K  + E+++ +  N ++  S  +E LQ +    
Sbjct: 305 LQDLLDRATNQVEKAALVLDQNQDLRDKVDKLEASLKE-ANVSKFSSYKVELLQQKLKLL 363

Query: 338 QKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNE 375
           +++L A    D+ +H  ++ Y  +    ++ ++ +K E
Sbjct: 364 EERLQA---SDHEIHSYIQLYQESIKEFQDTLSKLKEE 398



 Score = 30.6 bits (69), Expect = 4.8
 Identities = 59/328 (17%), Positives = 139/328 (42%), Gaps = 26/328 (7%)

Query: 363 HIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDI 422
               E +  I NE    +K  + +     +D+  ++   ++  L  N++    L ++ED+
Sbjct: 102 MQRDEAIAAIDNEQQTNSKDGEQLSDFQLEDLVGMIQNAEKNILLLNQAR---LQALEDL 158

Query: 423 EKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTY 482
           EK+  + +E L   Q +   L+  ++  + ++    Q K +H++    LE QLEKL+   
Sbjct: 159 EKILTE-KEAL---QGKINILEMRLSETDARIKLAAQEK-IHVEI---LEEQLEKLRNEL 210

Query: 483 NSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESS 542
                       + + +  ++ EE  +  +       ++     +E+    + KE     
Sbjct: 211 LIRGATEGLCVHSLSKELDVLKEENMLLKDDIQFLKAELIEVAETEERVFKLEKERSLLD 270

Query: 543 RKIDDINAEITKCTNELVDAKNCLVKNEKW---VINLQEQFSLLEKSKETWVSLKLNSAE 599
             + ++ ++      ++  +K   ++ + W   V NLQ+   LL+++        L   +
Sbjct: 271 ASLRELESKFIVAQEDV--SKLSPLQYDCWWEKVENLQD---LLDRATNQVEKAALVLDQ 325

Query: 600 SQSSASLQSQLNDALDNIKKYKEEHSKMTM---EIESLKEKLKFSQSKENDLLEEYEKLK 656
           +Q       +L  +L      K    K+ +   +++ L+E+L+ S  + +  ++ Y++  
Sbjct: 326 NQDLRDKVDKLEASLKEANVSKFSSYKVELLQQKLKLLEERLQASDHEIHSYIQLYQES- 384

Query: 657 LVIKDYQNVLPSIV-ESMNNTKDKPQDS 683
             IK++Q+ L  +  ES   + + P D 
Sbjct: 385 --IKEFQDTLSKLKEESKKRSLEHPADD 410


>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090).  This
           family consists of several bacterial proteins of unknown
           function and is known as YqjC in E. coli.
          Length = 115

 Score = 29.2 bits (66), Expect = 3.3
 Identities = 20/69 (28%), Positives = 32/69 (46%), Gaps = 5/69 (7%)

Query: 208 RVEFLESQLAAYSKSFTDDKLRDE----IKLLHRDKSTYENQLKEVMKKMLAEKVEVMQ- 262
           R+  LE  L       TD  LR E    ++   ++ +  E +LKE   K  A+K+E  Q 
Sbjct: 44  RIAGLEKALKEVQAHCTDAGLRAERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQR 103

Query: 263 KCQDTERKL 271
           K  + + +L
Sbjct: 104 KLAEAQAEL 112


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 31.0 bits (70), Expect = 3.4
 Identities = 79/472 (16%), Positives = 170/472 (36%), Gaps = 49/472 (10%)

Query: 283 ELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLD 342
             L+RN D+L+++I   + +++     E  +       +     +   +       K+++
Sbjct: 162 NSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIE 221

Query: 343 AFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFD 402
             +++ N   D   N     + +   + ++KN    E K  ++ L    +  +      +
Sbjct: 222 RLSIEYNNAMDDYNNLKSALNELSS-LEDMKNRYESEIKTAESDLSMELEKNNYYKELEE 280

Query: 403 EYFLRFNESESSGLNSIEDIEKVTKDL---REQLHTAQEEYTSLQEIVNRNEEQLAFLEQ 459
            +    N+      N I D  K   D+   ++ L     E      I+ +          
Sbjct: 281 RHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAEINKYHAIIKKLSVLQKDYND 340

Query: 460 --SKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVN 517
              K    D +    L+LE  +  YNS     E   K            K    N + ++
Sbjct: 341 YIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKKKIE------EYSK----NIERMS 390

Query: 518 IIDVSNQYNSEKDFVSINKEIEESSRKIDDINA-----------------EITKCTNELV 560
                     E D  +I KE+ E + K+ DI++                 E+++    L 
Sbjct: 391 AFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLN 450

Query: 561 DAKNCLV--------KNEKWVINLQEQFSLLE-KSKETWVSLKLNSAESQSSASLQSQLN 611
               C V        K+   + +  E+ S LE K +E  + +K    +       +  L 
Sbjct: 451 GQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLE 510

Query: 612 DALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVE 671
              + I K   E++K+    ES +  L+  + K N+L ++++K + +   Y+++    ++
Sbjct: 511 S--EEINKSINEYNKI----ESARADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLD 564

Query: 672 SMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVESVTSNMENDKS 723
           S   +          I+ E  +   N +  +L++ +   ++ +     +DKS
Sbjct: 565 SKRTSWLNALAVISLIDIETNRSRSNEIKKQLND-LESRLQEIEIGFPDDKS 615


>gnl|CDD|227612 COG5293, COG5293, Predicted ATPase [General function prediction
           only].
          Length = 591

 Score = 31.0 bits (70), Expect = 3.4
 Identities = 51/335 (15%), Positives = 104/335 (31%), Gaps = 62/335 (18%)

Query: 180 KLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDK 239
           +L H +I  +   D +++ L+           +      +SK F           L    
Sbjct: 131 QLLHDLIGWRMKKDWTFRKLLGY--------LIRDSQENFSKVF----------ELSHSV 172

Query: 240 STYENQLKEV-MKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIK- 297
             +        +  +L E  +   +  D  ++L + +     L++    + D LEEI   
Sbjct: 173 LKHAKWKNYPFLLHLLGEDGKCAAEYYDKIQELESKKKLAELLRKTWIGSLDSLEEIETT 232

Query: 298 --KYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKL 355
             + QD+V K +   +T D       E    +  +        +++     +   M    
Sbjct: 233 ELRKQDEVNKKQATLNTFDFHAQDYAETEELVNTVD-------ERIAELNNRRISMQSHW 285

Query: 356 KNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSG 415
           K    +                    + + + V +   V +       +     E     
Sbjct: 286 KRVKTSLKEQILF------CPDEIQVLYEEVGVLFPGQVKKDFEHVIAFNRAITEERHDY 339

Query: 416 LNSIEDIEKVTKDLRE-------------------QLHTAQEEYTSLQEIVNRNEEQLAF 456
           L   E+I ++  DL+E                   +     E+Y +L E +     +LA 
Sbjct: 340 LQ--EEIAEIEGDLKEVNAELDDLGKRRAEGLAFLKNRGVFEKYQTLCEEIIALRGELAE 397

Query: 457 LEQ-----SKYLHL-DKIGGLELQLEKLKQTYNSD 485
           LE       K   L   IG L+ +   L++   ++
Sbjct: 398 LEYRIEPLRKLHALDQYIGTLKHECLDLEERIYTE 432


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins
           in this family for which functions are known are flap
           endonucleases that generate the 3' incision next to DNA
           damage as part of nucleotide excision repair. This
           family is related to many other flap endonuclease
           families including the fen1 family. This family is based
           on the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1034

 Score = 31.0 bits (70), Expect = 3.5
 Identities = 18/79 (22%), Positives = 38/79 (48%), Gaps = 7/79 (8%)

Query: 265 QDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKS 324
           ++ E K+ +IE E     E L  +  E + I+   +++    K+ +   ++  + + E+ 
Sbjct: 686 ENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEE----KDADDFKNEWQDISLEEL 741

Query: 325 NTLE---KLQYETLQAQKQ 340
             LE     +  +L+AQKQ
Sbjct: 742 EALEANLLAEQNSLKAQKQ 760


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 29.1 bits (66), Expect = 3.8
 Identities = 15/63 (23%), Positives = 31/63 (49%), Gaps = 4/63 (6%)

Query: 609 QLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPS 668
           +   A    ++ K+   +   EI+ LK +L+  +++   L E+ E+ +     Y+  L S
Sbjct: 61  KRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEKLEEYQP----YEEFLES 116

Query: 669 IVE 671
           +VE
Sbjct: 117 VVE 119


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 31.1 bits (70), Expect = 3.8
 Identities = 32/150 (21%), Positives = 70/150 (46%), Gaps = 12/150 (8%)

Query: 536 KEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKE-TWVSLK 594
           +E+E    + ++++ E++KC ++L ++KN L + EK +  L+ +    ++S       LK
Sbjct: 594 EELETLKSEKENLDGELSKCKDDLEESKNKLQETEKKLEELKSELDASQESNSLAETQLK 653

Query: 595 LNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEK 654
                 +S  SL+ +  D        + EH  +  +I SL+++L+  +    +L+ +  +
Sbjct: 654 ---CMVESYESLELRAKDL-------EAEHKSLQEKISSLEDELEKERQNHQELIAKCRE 703

Query: 655 LKLVIKDYQNVLPSIVESMNNTKDKPQDSE 684
           L+  I+  +             K K Q+ E
Sbjct: 704 LEEKIERAEQEENMQKLDEEEQKIK-QEKE 732



 Score = 30.7 bits (69), Expect = 3.9
 Identities = 57/302 (18%), Positives = 106/302 (35%), Gaps = 34/302 (11%)

Query: 473 LQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIID-VSNQYNSEKDF 531
           L++EKL         +  K       D+ +         +  +  + D +     S    
Sbjct: 386 LEMEKLAMLPPESSADGSKVSSPKCSDSSVALTGPVEHEDNLESKLSDRLPEVLQSVLKL 445

Query: 532 VSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWV 591
           V     I E S  ++DI AE+                     + ++E  +++E+S+E   
Sbjct: 446 VMEKHIISEISEILEDIEAELDSLERSSNGDDENEEVAMVGSL-VEESSAVIERSQELEG 504

Query: 592 SLKLNSAESQSSASLQSQLNDALDNIKKYKE--------------------EHSKMTMEI 631
           ++   S  +       +   D LD     ++                    + S    EI
Sbjct: 505 AVSRISEFTSVLEHEVTVCQDLLDGKADLEKFIQEFSLTLEWVVNQEKSLQDVSVEASEI 564

Query: 632 ESLKEKLKFSQSKENDL-----------LEEYEKLKLVIKDYQNVLPSIVESMNNTKDKP 680
           +     LK S+ + N             LEE E LK   ++    L    + +  +K+K 
Sbjct: 565 KKNFLGLKSSEKEINSPDEVKGAVCISTLEELETLKSEKENLDGELSKCKDDLEESKNKL 624

Query: 681 QDSEQSIEEEITQQSLNPLSSELDET-IHKNVESVTSNMENDKSNEDDNSSVQSTDEVKD 739
           Q++E+ +EE  ++   +  S+ L ET +   VES  S     K  E ++ S+Q      +
Sbjct: 625 QETEKKLEELKSELDASQESNSLAETQLKCMVESYESLELRAKDLEAEHKSLQEKISSLE 684

Query: 740 TE 741
            E
Sbjct: 685 DE 686


>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 725

 Score = 30.9 bits (70), Expect = 3.8
 Identities = 48/271 (17%), Positives = 87/271 (32%), Gaps = 28/271 (10%)

Query: 409 NESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKI 468
               SS +N  E+I   +    E L    E       I N  E +             +I
Sbjct: 411 KNKNSSFINKTENILTNSPLKDELLEKTTEIIN----IENPQEFEFG-----------QI 455

Query: 469 GGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSE 528
           G   +  E  +   N +  +  +F    N      N       N+ D NI +  +     
Sbjct: 456 GNDIISTEIAQLDENQNLIDTGEFDLENN----FSNSFNPENGNKIDENINETFDTSTIS 511

Query: 529 KDFVSINKEIEESSRKIDD--INAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKS 586
            +         +S    D   IN+EI     +     N   KN K     ++  S+ E  
Sbjct: 512 ANLSENKTNFAQSFNNKDTNLINSEIPIDLIKDTITINNSQKNVK-KNGNKDYLSVEEVI 570

Query: 587 KETWVSLKLNSAESQSSASLQSQLNDALDNIKK----YKEEHSKMTMEIESLKEKLKFSQ 642
               +++K +S        L    N  L   +      +  H    ++I +  +      
Sbjct: 571 NLIMLAIKFHSQNQVEYKKLVQNWNKNLPLFEYDVEFMEIAHFLKDLKILASSDNFILFS 630

Query: 643 SKENDLLEEYEKLKLVIKDYQNVLPSIVESM 673
           SK +++ E    +KL   +Y+    + +E +
Sbjct: 631 SKRDEIDELI--IKLNKNNYKVNFQNFLEKI 659



 Score = 29.8 bits (67), Expect = 8.0
 Identities = 40/268 (14%), Positives = 91/268 (33%), Gaps = 29/268 (10%)

Query: 511 PNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNE 570
              KD+ I D  N        + I K +  S  + +D N EI     E     N  +K +
Sbjct: 330 DLLKDLKISDNPNDTLE----ILIIKLLALSELEEEDEN-EIKFKKIEENSIDNLDIKEK 384

Query: 571 KWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTME 630
           K       +     K+ E              +   +++ +  ++      E     +  
Sbjct: 385 KIENENDIEGKSDTKNLEE----------GFETKDNKNKNSSFINKT----ENILTNSPL 430

Query: 631 IESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEE 690
            + L EK     + EN    E+ ++        +++ + +  ++  ++     E  +E  
Sbjct: 431 KDELLEKTTEIINIENPQEFEFGQIG------NDIISTEIAQLDENQNLIDTGEFDLENN 484

Query: 691 ITQQSLNPLSSELDETIHKNVESVTSNMENDKSNEDDNSSVQSTDEVKDTEEGPHTPSLV 750
            +        +++DE I++  +  TS +  + S   + ++   +   KDT        + 
Sbjct: 485 FSNSFNPENGNKIDENINETFD--TSTISANLSE--NKTNFAQSFNNKDTNLINSEIPID 540

Query: 751 DKSEDTISVTTLVNVNTESSEGDLSPID 778
              +      +  NV    ++  LS  +
Sbjct: 541 LIKDTITINNSQKNVKKNGNKDYLSVEE 568


>gnl|CDD|235591 PRK05755, PRK05755, DNA polymerase I; Provisional.
          Length = 880

 Score = 30.8 bits (71), Expect = 4.1
 Identities = 28/111 (25%), Positives = 52/111 (46%), Gaps = 24/111 (21%)

Query: 242 YEN--QLKEVMKKMLAEKVEVMQKCQDTERKL------VNIENECSNLQELLDRNRDELE 293
           YEN  ++K   K+ L E  E         RKL      V +E +  +L EL   +R++L 
Sbjct: 214 YENLDEIKGKKKEKLRENKEQAFLS----RKLATIKTDVPLEVDLEDL-ELQPPDREKLI 268

Query: 294 EIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAF 344
            + K+         EF+S + +       ++  L++  YET+  +++L+A+
Sbjct: 269 ALFKEL--------EFKSLLRRA---AAAEAAPLDEEDYETILDEEELEAW 308


>gnl|CDD|203570 pfam07058, Myosin_HC-like, Myosin II heavy chain-like.  This family
           represents a conserved region within a number of myosin
           II heavy chain-like proteins that seem to be specific to
           Arabidopsis thaliana.
          Length = 351

 Score = 30.4 bits (68), Expect = 4.3
 Identities = 34/162 (20%), Positives = 65/162 (40%), Gaps = 16/162 (9%)

Query: 611 NDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVL--PS 668
           ND +  +K++ EE   +  E++ L++KL  S        +  EK KL +K  +  L  P+
Sbjct: 104 NDKVMPVKQWLEERRFLQGEMQQLRDKLAISDRAAKSEAQLKEKFKLRLKVLEEGLKGPN 163

Query: 669 IVESMNNTKDKPQDSEQSIEEEITQQSLNPL------------SSELDETIHKNVESVTS 716
                  T  + + +  +  + +     +P             SS+L  ++   + +V  
Sbjct: 164 SSFVRPTTVGRSESNGPTRRQSLGGAETSPKFTSNGGLSKKRPSSQLRGSLTGRISTVLK 223

Query: 717 NMENDKSNEDDNSSVQSTDEVKDTEEGPHTPSLVDKSEDTIS 758
           + +    + D  +  +S D  K    GP    L DK E+  S
Sbjct: 224 HAKGTSISFDGGT--RSMDRSKILANGPSNFPLNDKHEEGTS 263


>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter.  The Monovalent
           Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
           CPA1 family is a large family of proteins derived from
           Gram-positive and Gram-negative bacteria, blue green
           bacteria, yeast, plants and animals. Transporters from
           eukaryotes have been functionally characterized, and all
           of these catalyze Na+:H+ exchange. Their primary
           physiological functions may be in (1) cytoplasmic pH
           regulation, extruding the H+ generated during
           metabolism, and (2) salt tolerance (in plants), due to
           Na+ uptake into vacuoles. This model is specific for the
           fungal members of this family [Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 810

 Score = 30.7 bits (69), Expect = 4.4
 Identities = 25/143 (17%), Positives = 45/143 (31%), Gaps = 21/143 (14%)

Query: 680 PQDSEQSIEEEITQQSLNPLSSELDETIHKNVESVTSNME-NDKSNEDDNSSV------- 731
            Q   Q +      +    ++    E + K  E+  +  + +  +     +         
Sbjct: 509 RQRWRQKLRNNKEMEDDIEMNDLGRERLQKEKEAHAATFDLSTTTTTQLGTENGRGGGLE 568

Query: 732 --QSTDEVKDTEEGPHTPSLVDKSEDTISVTTLVNVNTESSEGDLSPIDPPESVIALIND 789
               T+  + +E       L DK         +  V T   +G  S ID          D
Sbjct: 569 ERSKTNIKERSENVNTIYGL-DKLARDTENRDVTYVPTSRYDGIESEID----------D 617

Query: 790 DDTSDSSSEMTDSSTLIRIRKPK 812
             T ++ SE   SS   RI+K +
Sbjct: 618 VYTYENDSESIASSERRRIKKLR 640


>gnl|CDD|234204 TIGR03423, pbp2_mrdA, penicillin-binding protein 2.  Members of
           this protein family are penicillin-binding protein 2
           (PBP-2), a protein whose gene (designated pbpA or mrdA)
           generally is found next to the gene for RodA, a protein
           required for rod (bacillus) shape in many bacteria.
           PBP-2 acts as a transpeptidase for cell elongation
           (hence, rod-shape) [Cell envelope, Biosynthesis and
           degradation of murein sacculus and peptidoglycan].
          Length = 592

 Score = 30.6 bits (70), Expect = 5.1
 Identities = 14/35 (40%), Positives = 16/35 (45%), Gaps = 3/35 (8%)

Query: 447 VNR-NEEQLAFLEQSKYLHLDKIG--GLELQLEKL 478
           V   NEE L  LE + Y   D IG  G+E   E  
Sbjct: 161 VGEINEEDLERLEPANYAGGDYIGKSGIEKYYEDE 195


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 30.7 bits (70), Expect = 5.3
 Identities = 43/218 (19%), Positives = 81/218 (37%), Gaps = 54/218 (24%)

Query: 270  KLVNIENECSNLQELLDR----NRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSN 325
            +L   +   S L  LL R      + L + +++ ++Q+ + +E +  + Q         N
Sbjct: 866  QLEQAKEGLSALNRLLPRLNLLADETLADRVEEIREQLDEAEEAKRFVQQ-------HGN 918

Query: 326  TLEKL--QYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVD 383
             L +L      LQ+          D    ++LK  Y      ++   + K +    T+V 
Sbjct: 919  ALAQLEPIVSVLQS----------DPEQFEQLKQDYQ---QAQQTQRDAKQQAFALTEVV 965

Query: 384  QNILVKYFDDVHQILTQFDEYFLRFNESESSGLNS-----IEDIEKVTKDLREQLHTAQE 438
            Q      ++D  ++L            +++S LN      +E  E+     REQL  AQ 
Sbjct: 966  QRRAHFSYEDAAEML------------AKNSDLNEKLRQRLEQAEQERTRAREQLRQAQA 1013

Query: 439  EYT-------SLQEIVNRNEEQLAFLEQSKYLHLDKIG 469
            +         SL+   +   + L  L+Q     L  +G
Sbjct: 1014 QLAQYNQVLASLKSSYDAKRQMLQELKQE----LQDLG 1047


>gnl|CDD|220625 pfam10191, COG7, Golgi complex component 7 (COG7).  COG7 is a
           component of the conserved oligomeric Golgi complex
           which is required for normal Golgi morphology and
           localisation. Mutation in COG7 causes a congenital
           disorder of glycosylation.
          Length = 768

 Score = 30.5 bits (69), Expect = 5.4
 Identities = 29/130 (22%), Positives = 54/130 (41%), Gaps = 16/130 (12%)

Query: 583 LEKSKETWVSL--KLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKF 640
           LEK      SL  KL     Q + +L+     AL N+ +   +   +  E   LKE++  
Sbjct: 29  LEKGDAHASSLVMKLQLYVQQVNYALEESSGQALQNMPRVLRDVEALRQEASFLKEQM-- 86

Query: 641 SQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLS 700
                  + ++ +K +        VL    E ++  K + Q +++S++E         LS
Sbjct: 87  -----ILVKQDIKKFEQDTGQSMQVL----ERLDTVKQRMQAAKESLQE---ADGWGTLS 134

Query: 701 SELDETIHKN 710
           +EL++T    
Sbjct: 135 AELEDTFETQ 144


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 29.9 bits (68), Expect = 6.0
 Identities = 23/91 (25%), Positives = 39/91 (42%), Gaps = 7/91 (7%)

Query: 170 REN--MLKNKLSKLQHIVIDLKK--AADVSWKALIQE-DRLLSRVEFLESQLAAYSKSFT 224
           REN   ++ KL K     +D+ K    D   + L++E + L +    L  ++    K   
Sbjct: 8   RENPDAVREKLKKRGGDALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGE 67

Query: 225 DD--KLRDEIKLLHRDKSTYENQLKEVMKKM 253
           DD  +L  E+K L       E  L E+  ++
Sbjct: 68  DDAEELIAEVKELKEKLKELEAALDELEAEL 98


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 29.6 bits (67), Expect = 6.4
 Identities = 40/215 (18%), Positives = 88/215 (40%), Gaps = 13/215 (6%)

Query: 227 KLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLD 286
           +LR++ + L  ++     +++E+ K+   E    +Q+ +   R+L    NE +     + 
Sbjct: 52  ELREKAQELREERDEINEEVQEL-KEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIK 110

Query: 287 RNRDELEEIIKKYQDQVT---KNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDA 343
               E+E + KK Q  V    + +E    I + L    E +    +   +  + + ++D 
Sbjct: 111 SLEREIERLEKKQQTSVLTPEEERELVQKIKE-LRKELEDAKKALEENEKLKELKAEIDE 169

Query: 344 FTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDE 403
              K   +H+K++          E+M  +  E         + L K  D++H+   +  +
Sbjct: 170 LKKKAREIHEKIQELANEAQEYHEEMIKLFEEA--------DELRKEADELHEEFVELSK 221

Query: 404 YFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQE 438
                +E   +  N + ++EK  K LR +   A+ 
Sbjct: 222 KIDELHEEFRNLQNELRELEKKIKALRAKEKAAKR 256


>gnl|CDD|237853 PRK14897, PRK14897, unknown domain/DNA-directed RNA polymerase
           subunit A'' fusion protein; Provisional.
          Length = 509

 Score = 29.8 bits (67), Expect = 6.9
 Identities = 27/176 (15%), Positives = 59/176 (33%), Gaps = 10/176 (5%)

Query: 526 NSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEK 585
           N    ++++ K+       I  I        N ++  K       K V+ ++ +  +   
Sbjct: 32  NITDGYITLKKKEFTYHVIISKIAPYSNS--NGIIKKKK---PVLKTVLEIESEEKIEAI 86

Query: 586 SKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKE 645
               +  L     +   S S    L        +       + +  +  K   +   S  
Sbjct: 87  DLMEFKRLFGRILDENMSFSTGELLTAEEKEYYEENSNEDVLKVIDDVKKLGFRLPPSVI 146

Query: 646 NDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQD-----SEQSIEEEITQQSL 696
            ++ +  +K +L   +Y+ +L  I E     +  P +     + QSI E  TQ ++
Sbjct: 147 EEIAKAMKKKELSDDEYEEILRRIREEYERARVDPYEAVGIVAAQSIGEPGTQMTM 202


>gnl|CDD|204415 pfam10212, TTKRSYEDQ, Predicted coiled-coil domain-containing
           protein.  This is the C-terminal 500 amino acids of a
           family of proteins with a predicted coiled-coil domain
           conserved from nematodes to humans. It carries a
           characteristic TTKRSYEDQ sequence-motif. The function is
           not known.
          Length = 518

 Score = 29.9 bits (67), Expect = 6.9
 Identities = 27/114 (23%), Positives = 49/114 (42%), Gaps = 5/114 (4%)

Query: 600 SQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVI 659
           ++S   L  Q+  +L+ I K ++E     +E +  K KL+    +  DL +   K +L  
Sbjct: 301 TESREGLAQQVQQSLEKISKLEQEKEHWMLEAQLAKIKLEKENQRIADLEKGTLKGQLAE 360

Query: 660 KDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVES 713
           K ++N   +        +   + S +S  E     SL  + +  DET   + ES
Sbjct: 361 KLHENATLT-----QAHEQPEEFSSKSEREPTKSTSLIGMLTVTDETEAPDEES 409


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 28.4 bits (64), Expect = 7.4
 Identities = 21/81 (25%), Positives = 38/81 (46%), Gaps = 7/81 (8%)

Query: 213 ESQLAAYSKSFTDDKLRDEI------KLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQD 266
              L +  K   +D+  D I      ++  R K   E + KE ++K   E  ++ Q+  D
Sbjct: 47  NEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKEKEEIEKYREENPKIQQQFAD 106

Query: 267 TERKLVNI-ENECSNLQELLD 286
            +R L  + E+E +N+ E+ D
Sbjct: 107 LKRNLATVTEDEWANIPEVGD 127


>gnl|CDD|191851 pfam07798, DUF1640, Protein of unknown function (DUF1640).  This
           family consists of sequences derived from hypothetical
           eukaryotic proteins. A region approximately 100 residues
           in length is featured.
          Length = 177

 Score = 29.1 bits (66), Expect = 7.6
 Identities = 26/113 (23%), Positives = 51/113 (45%), Gaps = 16/113 (14%)

Query: 226 DKLRDEIKLLHRDKSTYENQLKEVMKKM--LAEKVEVMQKCQDTERKLVNIENECSNLQE 283
           D + D I+ + +D  T E Q K   ++    A+    +Q  + +E   +  ENE      
Sbjct: 30  DVINDSIENVSKDMVTKEQQEKTEYQQKVDFAKLKSELQSLEKSEFANLQAENE------ 83

Query: 284 LLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQ 336
              + +++LE++  + +D++TK      T    L+ N EK    E+   + L+
Sbjct: 84  ---KLKNDLEKLKNRLRDEITK-----VTAGVRLDLNLEKGRIREESAEQELK 128


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 29.6 bits (67), Expect = 7.6
 Identities = 20/119 (16%), Positives = 51/119 (42%), Gaps = 18/119 (15%)

Query: 604 ASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKEN----------------- 646
           ++L++QL   L  IK+ + E + +  ++++L+++L+    +                   
Sbjct: 140 STLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLVSRLELL 199

Query: 647 DLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDE 705
           +L  E  + +  +   +  L  +   ++  + + Q  EQ+  EE+  + L    + L E
Sbjct: 200 ELERERAEAQGELGRLEAELEVLKRQIDELQLERQQIEQTFREEV-LEELTEAQARLAE 257


>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
           (DUF3340).  This presumed domain is found at the
           C-terminus of tail specific proteases. Its function is
           unknown. This family is found in bacteria and
           eukaryotes. This presumed domain is typically between 88
           to 187 amino acids in length.
          Length = 139

 Score = 28.7 bits (65), Expect = 7.8
 Identities = 11/64 (17%), Positives = 29/64 (45%), Gaps = 1/64 (1%)

Query: 285 LDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQA-QKQLDA 343
                  L E+ +K++ ++ K+ EF+   + +    +E+ +    L     +A +++ + 
Sbjct: 30  YGDISPFLPELRQKHKARIAKDPEFQYLNEDIAWLKEERDDKTVSLNEAERKAEREEDEK 89

Query: 344 FTLK 347
             LK
Sbjct: 90  KRLK 93


>gnl|CDD|206307 pfam14138, COX16, Cytochrome c oxidase assembly protein COX16.
           This family represents homologues of COX16 which has
           been shown to be involved in assembly of cytochrome
           oxidase. Protein in this family are typically between
           106 and 134 amino acids in length.
          Length = 79

 Score = 27.5 bits (62), Expect = 8.1
 Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 2/38 (5%)

Query: 630 EIESLKEKLKFSQSKEN-DLLEEYEKLK-LVIKDYQNV 665
           +    +E LK  + +   DL EEY +L+ L I D++NV
Sbjct: 34  QQLEEEEALKLLKKRRKVDLEEEYYRLQGLDIDDWENV 71


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 29.5 bits (67), Expect = 8.2
 Identities = 18/67 (26%), Positives = 35/67 (52%)

Query: 229 RDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRN 288
           R+E++ L R     E  L   M+ +  ++  + +K ++   K  N++ +   L+EL+   
Sbjct: 75  RNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQ 134

Query: 289 RDELEEI 295
           R+ELE I
Sbjct: 135 REELERI 141


>gnl|CDD|234038 TIGR02865, spore_II_E, stage II sporulation protein E.  Stage II
           sporulation protein E (SpoIIE) is a multiple membrane
           spanning protein with two separable functions. It plays
           a role in the switch to polar cell division during
           sporulation. By means of it protein phosphatase
           activity, located in the C-terminal region, it activates
           sigma-F. All proteins that score above the trusted
           cutoff to this model are found in endospore-forming
           Gram-positive bacteria. Surprisingly, a sequence from
           the Cyanobacterium-like (and presumably
           non-spore-forming) photosynthesizer Heliobacillus
           mobilis is homologous, and scores between the trusted
           and noise cutoffs [Cellular processes, Sporulation and
           germination].
          Length = 764

 Score = 29.7 bits (67), Expect = 8.4
 Identities = 23/124 (18%), Positives = 46/124 (37%), Gaps = 23/124 (18%)

Query: 216 LAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKE------VMKKM------------LAEK 257
               S +F +    +E   + R  S     L E       MK              + E 
Sbjct: 323 FRELSNTFVEALASNEKLTMKRKSSYLLENLAERVCQSCNMKHRCWKREFDYTYSAMEEL 382

Query: 258 VEVMQKCQDTERKLVN-IENECSNLQELLDRNRDEL--EEIIKKYQDQVTKNKEFESTID 314
           +E +++ +D   KL +  E +C   +EL++   D L    I + ++ ++ + +      +
Sbjct: 383 IENLEEKKDPNSKLPDEFERKCIKRKELINTTEDILNNYIINEMWRKRLEEGRRL--VAE 440

Query: 315 QLLN 318
           QL  
Sbjct: 441 QLKG 444


>gnl|CDD|222986 PHA03102, PHA03102, Small T antigen; Reviewed.
          Length = 153

 Score = 28.5 bits (64), Expect = 8.5
 Identities = 14/80 (17%), Positives = 31/80 (38%), Gaps = 7/80 (8%)

Query: 226 DKLRDEIKLLHRD------KSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECS 279
            K R+ +K L          S  E+     +     ++V  +  C+D +  L N+ + C 
Sbjct: 54  KKFRESVKSLRDLDGEEDSSSEEEDVPSGYVGATFGDRVNALY-CKDWDTCLKNLSSLCK 112

Query: 280 NLQELLDRNRDELEEIIKKY 299
            +  LL +     +++  + 
Sbjct: 113 CIFCLLRKQHRLKQKLRPEP 132


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 29.9 bits (68), Expect = 8.7
 Identities = 24/115 (20%), Positives = 45/115 (39%), Gaps = 15/115 (13%)

Query: 594 KLNSAESQSS--ASLQSQLNDAL---DNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDL 648
            LN  +   +    +Q  L   L   D I + KE       E E LK++L  + +K    
Sbjct: 47  ALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKE-------ETEQLKQQLAQAPAKLRQA 99

Query: 649 LEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSEL 703
             E E LK    D        + +++  + + + ++   + +  Q  L   +S+L
Sbjct: 100 QAELEALK---DDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAEYNSQL 151


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 29.7 bits (67), Expect = 9.1
 Identities = 31/155 (20%), Positives = 60/155 (38%), Gaps = 28/155 (18%)

Query: 528 EKDFVSINKEIEESSRKIDDINAEITKC------TNELVDAKNCLVKNEKWVINLQEQFS 581
           E +  ++ ++I+E    I+++ +EI          NE +       +  +    ++++  
Sbjct: 269 EAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTV 328

Query: 582 LLEKSKETWVSLKLNS---AESQSSASLQSQLNDA----LDNIKKYKE-------EHSKM 627
            L    E  V  KL +   A S+    L  Q        +D  +  KE       E  + 
Sbjct: 329 ELLPDAENNV-AKLQALVVASSERLLELAQQWEAHRTPLIDEYRSLKEKNRNKEDETQRQ 387

Query: 628 TMEIESLKEKLKFSQSK-------ENDLLEEYEKL 655
             EI+ L+ K++  +S+          LL+EYE  
Sbjct: 388 LDEIKKLRNKIEELESELQTKEQLYKQLLDEYENA 422


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 29.5 bits (66), Expect = 9.9
 Identities = 25/122 (20%), Positives = 48/122 (39%), Gaps = 2/122 (1%)

Query: 234 LLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELE 293
           LL R  S   +  KE  +    + +  ++           +++E   L++  +R  DELE
Sbjct: 30  LLERLDSELRDAEKE--RDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELE 87

Query: 294 EIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHD 353
           E+ K+  D   +  E +   +QL N   +            LQ +  L +  L+     +
Sbjct: 88  ELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQSLELQYEYSLN 147

Query: 354 KL 355
           +L
Sbjct: 148 QL 149


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.127    0.344 

Gapped
Lambda     K      H
   0.267   0.0724    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 43,643,482
Number of extensions: 4293140
Number of successful extensions: 7107
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6165
Number of HSP's successfully gapped: 712
Length of query: 903
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 797
Effective length of database: 6,236,078
Effective search space: 4970154166
Effective search space used: 4970154166
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 64 (28.4 bits)