RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2772
         (746 letters)



>gnl|CDD|99996 cd04300, GT1_Glycogen_Phosphorylase, This is a family of
           oligosaccharide phosphorylases. It includes yeast and
           mammalian glycogen phosphorylases, plant starch/glucan
           phosphorylase, as well as the maltodextrin
           phosphorylases of bacteria. The members of this family
           catalyze the breakdown of oligosaccharides into
           glucose-1-phosphate units. They are important allosteric
           enzymes in carbohydrate metabolism. The allosteric
           control mechanisms of yeast and mammalian members of
           this family are different from that of bacterial
           members. The members of this family belong to the GT-B
           structural superfamily of glycoslytransferases, which
           have characteristic N- and C-terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology.  The large cleft that separates the two
           domains includes the catalytic center and permits a high
           degree of flexibility.
          Length = 797

 Score =  108 bits (272), Expect = 3e-24
 Identities = 37/65 (56%), Positives = 45/65 (69%)

Query: 253 LKFVNDGDYIQAVLDRNLAENISRVLYPNDNNFGGKELRLKQEYFMCAATLQVISLPFKD 312
           L   N GDYI+AV ++N AENIS+VLYPND+   GKELRLKQ+YF  +A+LQ I   FK 
Sbjct: 231 LDAFNRGDYIRAVEEKNRAENISKVLYPNDSTEEGKELRLKQQYFFVSASLQDIIRRFKK 290

Query: 313 QGRAL 317
               L
Sbjct: 291 THGPL 295


>gnl|CDD|215868 pfam00343, Phosphorylase, Carbohydrate phosphorylase.  The members
           of this family catalyze the formation of glucose
           1-phosphate from one of the following polyglucoses;
           glycogen, starch, glucan or maltodextrin.
          Length = 712

 Score =  101 bits (254), Expect = 3e-22
 Identities = 43/69 (62%), Positives = 51/69 (73%)

Query: 249 EKIPLKFVNDGDYIQAVLDRNLAENISRVLYPNDNNFGGKELRLKQEYFMCAATLQVISL 308
           E+  L   NDGDY+ AV D+N AENISRVLYPNDN F GKELRLKQ+YF+ +ATLQ I  
Sbjct: 141 EEFNLADFNDGDYLAAVEDKNRAENISRVLYPNDNTFEGKELRLKQQYFLVSATLQDIIR 200

Query: 309 PFKDQGRAL 317
            FK   ++L
Sbjct: 201 RFKKSHKSL 209


>gnl|CDD|233722 TIGR02093, P_ylase, glycogen/starch/alpha-glucan phosphorylases.
           This family consists of phosphorylases. Members use
           phosphate to break alpha 1,4 linkages between pairs of
           glucose residues at the end of long glucose polymers,
           releasing alpha-D-glucose 1-phosphate. The nomenclature
           convention is to preface the name according to the
           natural substrate, as in glycogen phosphorylase, starch
           phosphorylase, maltodextrin phosphorylase, etc. Name
           differences among these substrates reflect differences
           in patterns of branching with alpha 1,6 linkages.
           Members include allosterically regulated and unregulated
           forms. A related family, TIGR02094, contains examples
           known to act well on particularly small alpha 1,4
           glucans, as may be found after import from exogenous
           sources [Energy metabolism, Biosynthesis and degradation
           of polysaccharides].
          Length = 794

 Score = 96.6 bits (241), Expect = 1e-20
 Identities = 36/69 (52%), Positives = 46/69 (66%)

Query: 249 EKIPLKFVNDGDYIQAVLDRNLAENISRVLYPNDNNFGGKELRLKQEYFMCAATLQVISL 308
           E+  L   N GDY +AV ++N AENISRVLYPND+ + GKELRLKQ+YF  +A+LQ I  
Sbjct: 224 EEFDLDAFNAGDYYEAVEEKNRAENISRVLYPNDSTYEGKELRLKQQYFFVSASLQDIIR 283

Query: 309 PFKDQGRAL 317
              +    L
Sbjct: 284 RHLETHPDL 292


>gnl|CDD|184948 PRK14986, PRK14986, glycogen phosphorylase; Provisional.
          Length = 815

 Score = 78.3 bits (193), Expect = 6e-15
 Identities = 32/53 (60%), Positives = 43/53 (81%), Gaps = 1/53 (1%)

Query: 254 KFVNDGDYIQAVLDRNLAENISRVLYPNDNNFGGKELRLKQEYFMCAATLQVI 306
           KF N GDY  AV D+N +EN+SRVLYP+D+ + G+ELRL+QEYF+ +AT+Q I
Sbjct: 246 KF-NQGDYFAAVEDKNHSENVSRVLYPDDSTYSGRELRLRQEYFLVSATVQDI 297


>gnl|CDD|223136 COG0058, GlgP, Glucan phosphorylase [Carbohydrate transport and
           metabolism].
          Length = 750

 Score = 62.3 bits (152), Expect = 7e-10
 Identities = 27/54 (50%), Positives = 35/54 (64%), Gaps = 11/54 (20%)

Query: 253 LKFVNDGDYIQAVLDRNLAENISRVLYPNDNNFGGKELRLKQEYFMCAATLQVI 306
           L F N G+      ++N A NI+RVLYP D+    KELRLKQEYF+ +A +Q I
Sbjct: 210 LDF-NVGE------NKNDARNITRVLYPGDS----KELRLKQEYFLGSAGVQDI 252


>gnl|CDD|237881 PRK14985, PRK14985, maltodextrin phosphorylase; Provisional.
          Length = 798

 Score = 62.0 bits (151), Expect = 7e-10
 Identities = 29/59 (49%), Positives = 39/59 (66%), Gaps = 9/59 (15%)

Query: 242 FNLTERNEKIPLKFVNDGDYIQAVLDRNLAENISRVLYPNDNNFGGKELRLKQEYFMCA 300
           F+LT        KF NDGD+++A      AE +++VLYPNDN+  GK+LRL Q+YF CA
Sbjct: 231 FDLT--------KF-NDGDFLRAEQQGIDAEKLTKVLYPNDNHTAGKKLRLMQQYFQCA 280


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 46.2 bits (110), Expect = 7e-05
 Identities = 25/115 (21%), Positives = 51/115 (44%), Gaps = 6/115 (5%)

Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
           AEVE+   +   L +EL+ + + +    ++LEE     A  E+E+  L  +++  + E+ 
Sbjct: 740 AEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK 799

Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
               +L        L AEL  +     +L+ + + L  ++     +   L +QI 
Sbjct: 800 ALREALD------ELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIE 848



 Score = 42.4 bits (100), Expect = 0.001
 Identities = 28/115 (24%), Positives = 54/115 (46%), Gaps = 1/115 (0%)

Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
            + +  R +   LER+L  + + L     KL+E     A LE +L  L+++L+    E+ 
Sbjct: 302 QQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELE 361

Query: 529 NGTASLSNGPS-VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
              A L    S +  LE +L  +++ V  L+ Q   L+ ++++L  +   L  + 
Sbjct: 362 ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRR 416



 Score = 40.8 bits (96), Expect = 0.003
 Identities = 26/122 (21%), Positives = 54/122 (44%), Gaps = 8/122 (6%)

Query: 470 EVERYRRQQRLLERELSRVRSI-------LAHNSKKLEETVAANARLESELVVLRQKLQW 522
           E+E    + + LE +L  +R         +    K+L       +RLE +  +LR++L  
Sbjct: 254 ELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLAN 313

Query: 523 SRREVCNGTASLSNGPSVA-ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQ 581
             R++    A L    S    L  EL  ++  + +L+ + + L A++++L  +   L  +
Sbjct: 314 LERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESR 373

Query: 582 IR 583
           + 
Sbjct: 374 LE 375



 Score = 40.4 bits (95), Expect = 0.004
 Identities = 27/109 (24%), Positives = 54/109 (49%), Gaps = 8/109 (7%)

Query: 476 RQQRL--LERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTAS 533
           R++ +  LE ++  +   +A   K L E       LE EL  LR++L+   R++      
Sbjct: 675 RRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKD 734

Query: 534 LSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
           L      A LEAE+ +++  +  L ++  EL A++++L E+     +++
Sbjct: 735 L------ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEEL 777



 Score = 40.0 bits (94), Expect = 0.006
 Identities = 31/138 (22%), Positives = 60/138 (43%), Gaps = 9/138 (6%)

Query: 444 GQAKEMLENALGSLRHKMHGVHASPAEVERYR----RQQRLLERELSRVRSILAHNSKKL 499
                 LE     LR ++  +     E+E        +   L  EL+ +   L    ++L
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353

Query: 500 EETVAANARLESELVVLRQKLQWSRREV---CNGTASLSNGPSVAALEAELRRVQALVGD 556
           E   A    LE+EL  L  +L+    ++    +  A L     +A+L  E+ R++A +  
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLEL--QIASLNNEIERLEARLER 411

Query: 557 LQRQRQELSAQVKQLTEK 574
           L+ +R+ L  ++++L +K
Sbjct: 412 LEDRRERLQQEIEELLKK 429



 Score = 38.9 bits (91), Expect = 0.012
 Identities = 32/164 (19%), Positives = 70/164 (42%), Gaps = 27/164 (16%)

Query: 424 SRDQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMH----GVHASPAEVERYRRQQR 479
             D++A +L      +  ++ +  E LE  L  L  ++      +     ++E  R +  
Sbjct: 331 KLDELAEELAELEEKLEELKEE-LESLEAELEELEAELEELESRLEELEEQLETLRSKVA 389

Query: 480 LLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPS 539
            LE +++ + + +     +LE       RL+ E+  L +KL+                  
Sbjct: 390 QLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLE------------------ 431

Query: 540 VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
               EAEL+ +QA + +L+ + +EL  ++++L E    L +++ 
Sbjct: 432 ----EAELKELQAELEELEEELEELQEELERLEEALEELREELE 471



 Score = 38.1 bits (89), Expect = 0.022
 Identities = 28/147 (19%), Positives = 56/147 (38%), Gaps = 10/147 (6%)

Query: 440 CSVQGQAKEMLENALGSLRHKMH----GVHASPAEVERYRRQQRLLERELSRVRSILAHN 495
                +A + L   L  L  +       + +    +    R+   LE ++  +   +   
Sbjct: 798 LKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESL 857

Query: 496 SKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVG 555
           + ++EE       LESEL  L  +       +    + L        L  ELR +++   
Sbjct: 858 AAEIEELEELIEELESELEALLNERASLEEALALLRSELEE------LSEELRELESKRS 911

Query: 556 DLQRQRQELSAQVKQLTEKSNSLSQQI 582
           +L+R+ +EL  ++ QL  +   L  +I
Sbjct: 912 ELRRELEELREKLAQLELRLEGLEVRI 938



 Score = 36.6 bits (85), Expect = 0.052
 Identities = 20/110 (18%), Positives = 41/110 (37%), Gaps = 13/110 (11%)

Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
            ++E    +   LE  +  + S L     +      A A L SEL  L ++L+    +  
Sbjct: 852 EDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKR- 910

Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSL 578
                       + L  EL  ++  +  L+ + + L  ++  L E+ +  
Sbjct: 911 ------------SELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEE 948


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 45.8 bits (109), Expect = 9e-05
 Identities = 41/148 (27%), Positives = 65/148 (43%), Gaps = 14/148 (9%)

Query: 447 KEMLENALGSLRHKMHGVHASPAE--VERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
            E LE  L  L   ++ + A  +   +   + +   LE E+SR+ + L    +KL     
Sbjct: 767 IEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTL 826

Query: 505 ANARLESE---LVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALV------- 554
               LE E   L   R  L+   + +     +L NG     LE EL  ++A +       
Sbjct: 827 EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENL-NG-KKEELEEELEELEAALRDLESRL 884

Query: 555 GDLQRQRQELSAQVKQLTEKSNSLSQQI 582
           GDL+++R EL AQ+++L  K   L  QI
Sbjct: 885 GDLKKERDELEAQLRELERKIEELEAQI 912



 Score = 43.1 bits (102), Expect = 6e-04
 Identities = 29/123 (23%), Positives = 48/123 (39%), Gaps = 6/123 (4%)

Query: 466 ASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRR 525
                +E        LE+E+  V+S L     ++EE      +LE  L  L  +L  SR 
Sbjct: 734 KLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRI 793

Query: 526 EVCNGTAS------LSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLS 579
                  S            +  +E +L R+      L+++ QEL  Q   L E+  S+ 
Sbjct: 794 PEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIE 853

Query: 580 QQI 582
           ++I
Sbjct: 854 KEI 856



 Score = 39.7 bits (93), Expect = 0.007
 Identities = 30/170 (17%), Positives = 71/170 (41%), Gaps = 17/170 (10%)

Query: 425 RDQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPA-----------EVER 473
             ++  +L    + +  +  + K++ E     ++ K+  + A  A           E+E 
Sbjct: 260 ISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELED 319

Query: 474 YRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTAS 533
              +   LE E+ ++ + +    +++EE      +L  E   L+++L+  R E+      
Sbjct: 320 AEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKE 379

Query: 534 LSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
                  A    EL+  +  +  L+R+  EL  ++ +L E+   LS+++ 
Sbjct: 380 F------AETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELA 423



 Score = 33.1 bits (76), Expect = 0.63
 Identities = 22/113 (19%), Positives = 52/113 (46%), Gaps = 13/113 (11%)

Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
           E++  + Q+  L+ ++  +   + + + K EE       LE+ L  L  +L   ++E   
Sbjct: 834 EIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDE 893

Query: 530 GTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
                        LEA+LR ++  + +L+ Q ++   ++ +L  K  +L +++
Sbjct: 894 -------------LEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEEL 933



 Score = 32.7 bits (75), Expect = 0.85
 Identities = 26/107 (24%), Positives = 45/107 (42%), Gaps = 1/107 (0%)

Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
           E+   +R+   L+ EL R+   LA  +  +    A    LE E      +++    ++  
Sbjct: 400 EINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ 459

Query: 530 GTASLSN-GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKS 575
             A LS     +  L+ E  RV+  +  LQR+  E  AQ +   E+ 
Sbjct: 460 LAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERV 506



 Score = 32.0 bits (73), Expect = 1.5
 Identities = 23/117 (19%), Positives = 49/117 (41%), Gaps = 13/117 (11%)

Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESE-LVVLR-QKLQWSRRE 526
            + E+   +   +E  + R+  I+    ++LE       RL  E     R Q L   +RE
Sbjct: 170 RKKEKALEELEEVEENIERLDLIIDEKRQQLE-------RLRREREKAERYQALLKEKRE 222

Query: 527 VCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
              G   L       ALE +   ++  +  L+ + ++L+ ++ +L ++   + Q + 
Sbjct: 223 Y-EGYELLKE---KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLE 275



 Score = 31.6 bits (72), Expect = 2.1
 Identities = 26/112 (23%), Positives = 54/112 (48%), Gaps = 8/112 (7%)

Query: 468 PAEVERYRRQQRLLERELSRVRSILAH-------NSKKLEETVAANARLESELVVLRQKL 520
           PAE++R R +   L+RELS ++S L          S++L +       +E E+  L Q+ 
Sbjct: 673 PAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEE 732

Query: 521 QWSRREVCNGTASLSN-GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQL 571
           +  +  +      LS+    +  +++EL+ ++A + +L+    +L   +  L
Sbjct: 733 EKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDL 784



 Score = 31.2 bits (71), Expect = 2.5
 Identities = 27/140 (19%), Positives = 60/140 (42%), Gaps = 6/140 (4%)

Query: 447 KEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAAN 506
           K  L +    LR   + +     E+    R+   +E+E+ ++        ++LEE     
Sbjct: 687 KRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDL 746

Query: 507 ARLESELVVLRQKLQWSRREVCNGTASLSN-GPSVAALEAELRR-----VQALVGDLQRQ 560
           + LE E+  ++ +L+     +      L     ++  LEA L       +QA +  L+ +
Sbjct: 747 SSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEE 806

Query: 561 RQELSAQVKQLTEKSNSLSQ 580
              + A+++++ +K N L+ 
Sbjct: 807 VSRIEARLREIEQKLNRLTL 826



 Score = 30.8 bits (70), Expect = 3.6
 Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 9/78 (11%)

Query: 2   ELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALV-------GDLQRQRQEL 54
           EL   R  L+   + +     +L NG     LE EL  ++A +       GDL+++R EL
Sbjct: 837 ELQEQRIDLKEQIKSIEKEIENL-NG-KKEELEEELEELEAALRDLESRLGDLKKERDEL 894

Query: 55  SAQVKQLTEKSNSLSQQI 72
            AQ+++L  K   L  QI
Sbjct: 895 EAQLRELERKIEELEAQI 912



 Score = 30.4 bits (69), Expect = 4.2
 Identities = 27/145 (18%), Positives = 60/145 (41%), Gaps = 10/145 (6%)

Query: 426 DQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRL----L 481
           +++  +++   R +  +Q + +  L   L  L   + G+ A   E+E  +  + L     
Sbjct: 395 EKLKREINELKRELDRLQEELQR-LSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQ 453

Query: 482 ERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVA 541
           E +L ++ + L+   ++L +      R+E EL  L+++L  +  E     +        A
Sbjct: 454 EWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQREL--AEAEAQARASEERVRGGRA 511

Query: 542 A---LEAELRRVQALVGDLQRQRQE 563
               L+A ++ V   V  L    + 
Sbjct: 512 VEEVLKASIQGVHGTVAQLGSVGER 536



 Score = 30.0 bits (68), Expect = 5.2
 Identities = 35/144 (24%), Positives = 66/144 (45%), Gaps = 6/144 (4%)

Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
           ++E+ +R+   L+REL R++  L   S++L +  AA A +E+++  L ++ +    E+  
Sbjct: 393 KLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKK 452

Query: 530 GTASL-SNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGPTG 588
               L      ++  E EL  ++    +  R  +ELS   ++L E              G
Sbjct: 453 QEWKLEQLAADLSKYEQELYDLKE---EYDRVEKELSKLQRELAEAEAQARASEERVRGG 509

Query: 589 VAVAGKSVNRPGPTGVAGKVSKSG 612
            AV  + V +    GV G V++ G
Sbjct: 510 RAV--EEVLKASIQGVHGTVAQLG 531


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 43.1 bits (102), Expect = 1e-04
 Identities = 22/107 (20%), Positives = 50/107 (46%), Gaps = 6/107 (5%)

Query: 480 LLERELSR---VRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSN 536
           +L R LS     R  L    ++L E     A L+ +L  L+Q+ Q  ++E+   +   + 
Sbjct: 53  VLSRYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQEL---STLEAE 109

Query: 537 GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
              +    A ++++ A   +L  + +EL  ++ +L +++ +L  +  
Sbjct: 110 LERLQKELARIKQLSANAIELDEENRELREELAELKQENEALEAENE 156



 Score = 39.6 bits (93), Expect = 0.003
 Identities = 24/102 (23%), Positives = 51/102 (50%), Gaps = 3/102 (2%)

Query: 475 RRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASL 534
           R +   L++EL+ ++  LA   ++L E    N  L+ EL  L  +L+  ++E+       
Sbjct: 65  RERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLS 124

Query: 535 SNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSN 576
           +N      L+ E R ++  + +L+++ + L A+ ++L E   
Sbjct: 125 AN---AIELDEENRELREELAELKQENEALEAENERLQENEQ 163



 Score = 30.7 bits (70), Expect = 1.8
 Identities = 15/54 (27%), Positives = 30/54 (55%), Gaps = 4/54 (7%)

Query: 24  LSNGPS----VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
           LS+ PS    +  L+ EL  +Q  + +LQ Q  EL  + ++L ++ ++L  ++ 
Sbjct: 58  LSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELE 111



 Score = 30.0 bits (68), Expect = 3.4
 Identities = 12/73 (16%), Positives = 36/73 (49%), Gaps = 3/73 (4%)

Query: 1   SELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQ 60
            +L  L+Q+ Q  ++E+   +   +    +    A ++++ A   +L  + +EL  ++ +
Sbjct: 87  EQLAELQQENQELKQEL---STLEAELERLQKELARIKQLSANAIELDEENRELREELAE 143

Query: 61  LTEKSNSLSQQIR 73
           L +++ +L  +  
Sbjct: 144 LKQENEALEAENE 156



 Score = 28.8 bits (65), Expect = 7.7
 Identities = 8/35 (22%), Positives = 20/35 (57%)

Query: 32  ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSN 66
            L+ E R ++  + +L+++ + L A+ ++L E   
Sbjct: 129 ELDEENRELREELAELKQENEALEAENERLQENEQ 163


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 44.3 bits (105), Expect = 2e-04
 Identities = 30/114 (26%), Positives = 61/114 (53%), Gaps = 6/114 (5%)

Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
           A+++  R++   LE ELSR+   L    ++LEE       L+SEL  LR++L+  + E+ 
Sbjct: 232 AKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELL 291

Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
                +        LE E+  ++  + +L+ + +EL  ++++L EK  +L +++
Sbjct: 292 ELKEEI------EELEGEISLLRERLEELENELEELEERLEELKEKIEALKEEL 339



 Score = 43.2 bits (102), Expect = 6e-04
 Identities = 28/114 (24%), Positives = 53/114 (46%), Gaps = 13/114 (11%)

Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
            +E  RRQ   LER+L  ++  LA   ++LE+  +    LE EL  L ++L+  +     
Sbjct: 703 LLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQER--- 759

Query: 530 GTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
                     +  LE EL  ++  +  L+ + +EL  + + L E+   L +++ 
Sbjct: 760 ----------LEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELE 803



 Score = 40.5 bits (95), Expect = 0.003
 Identities = 28/123 (22%), Positives = 61/123 (49%), Gaps = 6/123 (4%)

Query: 461 MHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKL 520
           + G   +       +R+ + LE EL+ + + L    ++L+        LE  L  LR++L
Sbjct: 652 ITGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQL 711

Query: 521 QWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQ 580
           +   R++            +AALE EL ++Q+ + +L+ + +EL  ++++L E+   L +
Sbjct: 712 EELERQL------EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEE 765

Query: 581 QIR 583
           ++ 
Sbjct: 766 ELE 768



 Score = 40.5 bits (95), Expect = 0.004
 Identities = 25/116 (21%), Positives = 50/116 (43%), Gaps = 6/116 (5%)

Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
            E+E+ + +   LE EL  +   L    ++LEE       LE  L  L+++++    +  
Sbjct: 730 EELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQ 789

Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRP 584
                L        LE EL   +  +  L+R+ + L  + ++L ++   L ++I  
Sbjct: 790 ALQEEL------EELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEE 839



 Score = 40.1 bits (94), Expect = 0.004
 Identities = 29/168 (17%), Positives = 65/168 (38%), Gaps = 12/168 (7%)

Query: 426 DQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLEREL 485
           +++  +L    + I  ++ + +E L   L  L+ ++  +     E+E      R    EL
Sbjct: 256 EELQEELEEAEKEIEELKSELEE-LREELEELQEELLELKEEIEELEGEISLLRERLEEL 314

Query: 486 SRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN----------GTASLS 535
                 L    ++L+E + A      E   L ++L+                        
Sbjct: 315 ENELEELEERLEELKEKIEALKEELEERETLLEELE-QLLAELEEAKEELEEKLSALLEE 373

Query: 536 NGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
                 AL  EL  ++A + +++ + +EL  +++ L E+   LS+++ 
Sbjct: 374 LEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLE 421



 Score = 37.4 bits (87), Expect = 0.032
 Identities = 32/130 (24%), Positives = 58/130 (44%), Gaps = 4/130 (3%)

Query: 443 QGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEET 502
                E LE    +LR ++  + A   E+   R +   L+RE+  +   L   S++LE+ 
Sbjct: 367 LSALLEELEELFEALREELAELEA---ELAEIRNELEELKREIESLEERLERLSERLEDL 423

Query: 503 VAANARLESELVVLRQKLQWSRREVCNGTASLSNGPS-VAALEAELRRVQALVGDLQRQR 561
                 LE+EL  L+ +L+    E+      L      +  LE EL  +Q  +  L+++ 
Sbjct: 424 KEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKEL 483

Query: 562 QELSAQVKQL 571
             L A++ +L
Sbjct: 484 SSLEARLDRL 493



 Score = 36.6 bits (85), Expect = 0.054
 Identities = 30/139 (21%), Positives = 55/139 (39%), Gaps = 30/139 (21%)

Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
           Q +E LE  +  L  ++          E    +   LE EL  +   L    ++LEE  A
Sbjct: 821 QRRERLEQEIEELEEEI----------EELEEKLDELEEELEELEKELEELKEELEELEA 870

Query: 505 ANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQEL 564
               LE EL  L ++                       LE ELR +++ + +L+ + ++L
Sbjct: 871 EKEELEDELKELEEEK--------------------EELEEELRELESELAELKEEIEKL 910

Query: 565 SAQVKQLTEKSNSLSQQIR 583
             ++++L  K   L  ++ 
Sbjct: 911 RERLEELEAKLERLEVELP 929



 Score = 33.5 bits (77), Expect = 0.54
 Identities = 35/153 (22%), Positives = 65/153 (42%), Gaps = 35/153 (22%)

Query: 447 KEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLER---ELSRVRSILAHNSKKLE--E 501
           ++++E A G  ++K           ER    +R LER    L R+  +L    K+LE  E
Sbjct: 158 RKLIEEAAGVSKYK-----------ERKEEAERKLERTEENLERLEDLLEELEKQLEKLE 206

Query: 502 TVAANA----RLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDL 557
             A  A     L++EL  L   L  ++ +             +  LE EL R++  + +L
Sbjct: 207 RQAEKAERYQELKAELRELELALLLAKLKELRK--------ELEELEEELSRLEEELEEL 258

Query: 558 QRQRQELSAQV-------KQLTEKSNSLSQQIR 583
           Q + +E   ++       ++L E+   L +++ 
Sbjct: 259 QEELEEAEKEIEELKSELEELREELEELQEELL 291



 Score = 30.1 bits (68), Expect = 6.0
 Identities = 22/105 (20%), Positives = 42/105 (40%), Gaps = 3/105 (2%)

Query: 426 DQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLEREL 485
           ++   +L  R  ++     + +  LE     L      +     ++E  R + + LEREL
Sbjct: 410 EERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELEREL 469

Query: 486 SRVRSILAHNSKKLEETVAANARLESELV---VLRQKLQWSRREV 527
           + ++  L    K+L    A   RLE+E      +R  L+     +
Sbjct: 470 AELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGL 514


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 43.5 bits (103), Expect = 4e-04
 Identities = 34/133 (25%), Positives = 66/133 (49%), Gaps = 7/133 (5%)

Query: 451 ENALGSLRHKMHGVHASPAE-VERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARL 509
           E AL +LR K        +E +E+ ++Q+   E+E S++++ +     +L++   A A+L
Sbjct: 119 EEALATLRKKHQDAINELSEQIEQLQKQKAKAEKEKSQLQAEVDDLLAQLDQ--IAKAKL 176

Query: 510 ESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVK 569
            +E     ++L+    E+      L     +  L ++  R+Q+   DL RQ +E  AQV 
Sbjct: 177 NAEKK--AKQLESQLSELQVKLDELQR--QLNDLTSQKSRLQSENSDLTRQLEEAEAQVS 232

Query: 570 QLTEKSNSLSQQI 582
            L++  + L  Q+
Sbjct: 233 NLSKLKSQLESQL 245



 Score = 39.3 bits (92), Expect = 0.008
 Identities = 32/137 (23%), Positives = 64/137 (46%), Gaps = 9/137 (6%)

Query: 446 AKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAA 505
            K  L+  +  L  ++  +  +    E+  +Q   LE +LS ++  L    ++L +  + 
Sbjct: 153 EKSQLQAEVDDLLAQLDQIAKAKLNAEKKAKQ---LESQLSELQVKLDELQRQLNDLTSQ 209

Query: 506 NARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELS 565
            +RL+SE   L ++L+    E      S       + LE++L   +  + +  R+R  L 
Sbjct: 210 KSRLQSENSDLTRQLE--EAEAQVSNLSKLK----SQLESQLEEAKRSLEEESRERANLQ 263

Query: 566 AQVKQLTEKSNSLSQQI 582
           AQ++QL    +SL +Q+
Sbjct: 264 AQLRQLEHDLDSLREQL 280



 Score = 38.1 bits (89), Expect = 0.021
 Identities = 32/145 (22%), Positives = 64/145 (44%), Gaps = 18/145 (12%)

Query: 450 LENALGSLRHKMHGVHASPAEVERYRRQ--QRLLER--ELSRVRSILAHNSKKLEETVAA 505
           LE A  +L  +   V  +  E+ + R +  +RL E+  E    R       + L+ T+ A
Sbjct: 477 LEEAEAALELEESKVLRAQVELSQIRSEIERRLAEKEEEFENTRKNHQRAIESLQATLEA 536

Query: 506 NARLESELVVLRQKL--QWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQ----- 558
            A+ ++E   L++KL    +  E+    A+ +N    A  +  +++ Q  V +LQ     
Sbjct: 537 EAKGKAEASRLKKKLEGDINELEIALDHANKAN----AEAQKNVKKYQQQVKELQTQVEE 592

Query: 559 --RQRQELSAQVKQLTEKSNSLSQQ 581
             R R++   Q+  + E+  +  + 
Sbjct: 593 EQRAREDAREQL-AVAERRATALEA 616



 Score = 35.8 bits (83), Expect = 0.11
 Identities = 14/44 (31%), Positives = 24/44 (54%)

Query: 29  SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
            +  L ++  R+Q+   DL RQ +E  AQV  L++  + L  Q+
Sbjct: 202 QLNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLSKLKSQLESQL 245



 Score = 33.1 bits (76), Expect = 0.68
 Identities = 22/92 (23%), Positives = 44/92 (47%), Gaps = 11/92 (11%)

Query: 484 ELSRVRSILAHNSKKLEETVAA-NARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAA 542
           EL R +  L +   + E  ++  +++LE E   L  +LQ   +E+            +  
Sbjct: 1   ELERQKRELENQLYRKESELSQLSSKLEDEQA-LVAQLQKKIKELEA---------RIRE 50

Query: 543 LEAELRRVQALVGDLQRQRQELSAQVKQLTEK 574
           LE EL   +A     ++ R +LS ++++L+E+
Sbjct: 51  LEEELEAERAARAKAEKARADLSRELEELSER 82



 Score = 33.1 bits (76), Expect = 0.73
 Identities = 13/41 (31%), Positives = 28/41 (68%)

Query: 32  ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
            LE++L  +Q  + +LQRQ  +L++Q  +L  +++ L++Q+
Sbjct: 184 QLESQLSELQVKLDELQRQLNDLTSQKSRLQSENSDLTRQL 224



 Score = 31.6 bits (72), Expect = 2.2
 Identities = 32/150 (21%), Positives = 59/150 (39%), Gaps = 21/150 (14%)

Query: 426 DQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAE-VERYRRQQRLLERE 484
           D+I ++   +   + +    A+    N    L  ++        + VE  RR+ + L+ E
Sbjct: 383 DKILAEWKRKVDELQAELDTAQREARNLSTEL-FRLKNELEELKDQVEALRRENKNLQDE 441

Query: 485 LSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALE 544
           +  +   L    + + E   A  RLE+E   L+  L+ +                 AALE
Sbjct: 442 IHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEAE----------------AALE 485

Query: 545 AELRRVQALVGDLQRQRQELSAQVKQLTEK 574
            E  +V     +L + R E+    ++L EK
Sbjct: 486 LEESKVLRAQVELSQIRSEIE---RRLAEK 512



 Score = 30.8 bits (70), Expect = 3.6
 Identities = 30/139 (21%), Positives = 58/139 (41%), Gaps = 8/139 (5%)

Query: 444 GQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETV 503
            + +  L+  L  L H +  +      +E     +  LER+LS+  + +     K E   
Sbjct: 256 SRERANLQAQLRQLEHDLDSLREQ---LEEESEAKAELERQLSKANAEIQQWRSKFESEG 312

Query: 504 AANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQE 563
           A  A    EL  L++KL     E+     + +      +LE    R+Q+ + DLQ + + 
Sbjct: 313 ALRAE---ELEELKKKLNQKISELEEAAEAANA--KCDSLEKTKSRLQSELEDLQIELER 367

Query: 564 LSAQVKQLTEKSNSLSQQI 582
            +A   +L +K  +  + +
Sbjct: 368 ANAAASELEKKQKNFDKIL 386


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 41.3 bits (97), Expect = 5e-04
 Identities = 22/127 (17%), Positives = 46/127 (36%), Gaps = 6/127 (4%)

Query: 456 SLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVV 515
               +   +     E+   ++Q   L+ EL  +   +A    +LE+       L   L  
Sbjct: 66  VQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKS 125

Query: 516 LRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKS 575
           L ++L+     +      L         +     V+ L  +L+R ++ L    + + E  
Sbjct: 126 LEERLESLEESIKELAKELREL-----RQDLREEVEELREELERLQENLQRLQEAIQE-L 179

Query: 576 NSLSQQI 582
            SL +Q+
Sbjct: 180 QSLLEQL 186


>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
           Members of this protein family are homologs of ClpB, an
           ATPase associated with chaperone-related functions.
           These ClpB homologs, designated ClpV1, are a key
           component of the bacterial pathogenicity-associated type
           VI secretion system [Protein fate, Protein and peptide
           secretion and trafficking, Cellular processes,
           Pathogenesis].
          Length = 852

 Score = 39.9 bits (94), Expect = 0.005
 Identities = 30/119 (25%), Positives = 45/119 (37%), Gaps = 24/119 (20%)

Query: 465 HASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSR 524
           +A+PA +E  RR+   LE EL      L   +    +     A L +EL  L  +L    
Sbjct: 415 NATPAALEDLRRRIAALELEL----DALEREAALGADHDERLAELRAELAALEAEL---- 466

Query: 525 REVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
                           AALEA  ++ + LV  +   R EL A      +   +L  Q+ 
Sbjct: 467 ----------------AALEARWQQEKELVEAILALRAELEADADAPADDDAALRAQLA 509


>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family.  This family includes PspA
           a protein that suppresses sigma54-dependent
           transcription. The PspA protein, a negative regulator of
           the Escherichia coli phage shock psp operon, is produced
           when virulence factors are exported through secretins in
           many Gram-negative pathogenic bacteria and its homologue
           in plants, VIPP1, plays a critical role in thylakoid
           biogenesis, essential for photosynthesis. Activation of
           transcription by the enhancer-dependent bacterial
           sigma(54) containing RNA polymerase occurs through ATP
           hydrolysis-driven protein conformational changes enabled
           by activator proteins that belong to the large AAA(+)
           mechanochemical protein family. It has been shown that
           PspA directly and specifically acts upon and binds to
           the AAA(+) domain of the PspF transcription activator.
          Length = 220

 Score = 38.1 bits (89), Expect = 0.009
 Identities = 27/108 (25%), Positives = 54/108 (50%), Gaps = 5/108 (4%)

Query: 478 QRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNG 537
           +++LE+ +  ++S L    + L + +A   +LE +L    QK Q  + E     A+L+ G
Sbjct: 25  EKMLEQAIRDMQSELGKARQALAQVIARQKQLERKLE--EQKEQAKKLEN-KARAALTKG 81

Query: 538 PSVAALEA--ELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
               A EA  E+  ++     L+ Q  +  + V+QL ++  +L  +I+
Sbjct: 82  NEELAREALAEIATLEKQAEALETQLTQQRSAVEQLRKQLAALETKIQ 129


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 36.2 bits (84), Expect = 0.063
 Identities = 25/117 (21%), Positives = 50/117 (42%), Gaps = 18/117 (15%)

Query: 467 SPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQW--SR 524
           S +E+ +     + ++REL   +S L    ++LEE          EL  + +K+    S 
Sbjct: 371 SGSELTQLEVLIQQVKRELQDAKSQLLKELRELEE----------ELAEVDKKISTIPSE 420

Query: 525 REVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQ 581
            ++      L         + EL R +A + +L RQ + L   ++ L +  +  ++Q
Sbjct: 421 EQIAQLLEEL------GEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTKQ 471



 Score = 32.0 bits (73), Expect = 1.6
 Identities = 19/107 (17%), Positives = 40/107 (37%), Gaps = 16/107 (14%)

Query: 477 QQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSN 536
           ++R      S           ++E   A       +   L Q++   R E+     SL +
Sbjct: 196 RRRKKSELPS-------SILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLES 248

Query: 537 GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
                      ++ ++  GDL  +R++L  Q+K++     +   Q+R
Sbjct: 249 L---------EKKFRSEGGDLFEEREQLERQLKEIEAARKANRAQLR 286


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 36.2 bits (84), Expect = 0.068
 Identities = 27/109 (24%), Positives = 50/109 (45%), Gaps = 10/109 (9%)

Query: 467 SPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSR-R 525
           +  E+ER  ++ + LE EL +    LA   K+LEE       L  EL  L +K       
Sbjct: 610 AEKELEREEKELKKLEEELDKAFEELAETEKRLEE-------LRKELEELEKKYSEEEYE 662

Query: 526 EVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEK 574
           E+      LS    +A L AEL  ++    ++++  ++L  ++++  + 
Sbjct: 663 ELREEYLELSR--ELAGLRAELEELEKRREEIKKTLEKLKEELEEREKA 709



 Score = 34.7 bits (80), Expect = 0.21
 Identities = 28/114 (24%), Positives = 56/114 (49%), Gaps = 8/114 (7%)

Query: 469 AEVERY-RRQQRLLERELSRVRSI---LAHNSKKLEETVAANARLESELVVLRQKLQWSR 524
            EV +  +R+   LE+ + R  +I   +    K+LEE +     + SEL  LR++L+   
Sbjct: 168 GEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLE 227

Query: 525 REV--CNGTASLSNGPS--VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEK 574
           +EV                + +LE   R+++  + +L+ + +EL  ++++L EK
Sbjct: 228 KEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK 281



 Score = 31.6 bits (72), Expect = 1.7
 Identities = 30/165 (18%), Positives = 68/165 (41%), Gaps = 31/165 (18%)

Query: 430 SKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVR 489
            +L  + + +  ++ +A+E ++  L                 E Y  + R +E+ LSR+ 
Sbjct: 276 EELEEKVKELKELKEKAEEYIK--LSEFY-------------EEYLDELREIEKRLSRLE 320

Query: 490 SILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRR 549
             +    ++++E      RLE EL    ++L+  R E       L       A + EL R
Sbjct: 321 EEINGIEERIKELEEKEERLE-ELKKKLKELE-KRLEELEERHELYE--EAKAKKEELER 376

Query: 550 VQALVGD------------LQRQRQELSAQVKQLTEKSNSLSQQI 582
           ++  +              L++ ++E+  ++ ++T +   L ++I
Sbjct: 377 LKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEI 421



 Score = 30.8 bits (70), Expect = 3.0
 Identities = 22/80 (27%), Positives = 33/80 (41%), Gaps = 9/80 (11%)

Query: 448 EMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANA 507
           E L   L  L  K      S  E E  R +   L REL+ +R+ L    K+ EE      
Sbjct: 643 EELRKELEELEKKY-----SEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLE 697

Query: 508 RLESELVVLRQKLQWSRREV 527
           +L+ EL    +     ++E+
Sbjct: 698 KLKEELEEREKA----KKEL 713



 Score = 30.8 bits (70), Expect = 3.6
 Identities = 26/135 (19%), Positives = 52/135 (38%), Gaps = 12/135 (8%)

Query: 450 LENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARL 509
           + + L  LR ++  +     E+E  + +   LE+EL  +        +K+ E       L
Sbjct: 212 ISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEEL 271

Query: 510 ESELVVLRQK------LQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQE 563
           + E+  L +K      L+    E    +              ELR ++  +  L+ +   
Sbjct: 272 KKEIEELEEKVKELKELKEKAEEYIKLSEFYEE------YLDELREIEKRLSRLEEEING 325

Query: 564 LSAQVKQLTEKSNSL 578
           +  ++K+L EK   L
Sbjct: 326 IEERIKELEEKEERL 340



 Score = 29.6 bits (67), Expect = 6.7
 Identities = 24/134 (17%), Positives = 48/134 (35%), Gaps = 26/134 (19%)

Query: 448 EMLENALGSLRHKMHGVHASPAEVERYRRQQRLLEREL-SRVRSILAHNSKKLEETVAAN 506
           E LE  L SL      +     E+E    + +    EL  +V+ +     K+L+E     
Sbjct: 241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKEL-----KELKEKAEEY 295

Query: 507 ARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSA 566
            +L         +L+                     +E  L R++  +  ++ + +EL  
Sbjct: 296 IKLSEFYEEYLDELR--------------------EIEKRLSRLEEEINGIEERIKELEE 335

Query: 567 QVKQLTEKSNSLSQ 580
           + ++L E    L +
Sbjct: 336 KEERLEELKKKLKE 349


>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I.  coiled-coil structure. It
           has been suggested that the domains I and II from
           laminin A, B1 and B2 may come together to form a triple
           helical coiled-coil structure.
          Length = 263

 Score = 35.4 bits (82), Expect = 0.082
 Identities = 37/153 (24%), Positives = 56/153 (36%), Gaps = 22/153 (14%)

Query: 427 QIASKLHSRGRNICSVQGQA-KEMLENA---LGSLRHKMHGVHASPAEVERYRRQQRLLE 482
           +I  KL   G N  ++ G A   ML  A   LG +R +        AE E  +  + LL 
Sbjct: 104 EILEKLAGLGENAFALPGSALSRMLAEAQRMLGEIRERDFQKQLQNAEAE-LKAAEELLN 162

Query: 483 RELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAA 542
           R  +  ++    N    E         E++L  LR+ L     E    T           
Sbjct: 163 RVQTWFQTPQEENEGLFEALRDDLNEYEAKLSDLRELLD----EAQAKTR---------- 208

Query: 543 LEAE--LRRVQALVGDLQRQRQELSAQVKQLTE 573
            +AE      Q  + + Q ++QE+S    Q  E
Sbjct: 209 -DAERLNLANQKRLREFQEKKQEVSELKNQAEE 240


>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
           exopolysaccharide biosynthesis [Cell envelope
           biogenesis, outer membrane].
          Length = 458

 Score = 35.5 bits (82), Expect = 0.092
 Identities = 32/146 (21%), Positives = 56/146 (38%), Gaps = 15/146 (10%)

Query: 444 GQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRL--LERELSRVRSILAHNSKKL-- 499
            +A + L+  L  LR +   +  + A+VE +R Q  L    R        L+  + +L  
Sbjct: 194 RRASDSLDERLEELRAR---LQEAEAQVEDFRAQHGLTDAARGQLLSEQQLSALNTQLQS 250

Query: 500 --EETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDL 557
                  A ARL S L +L    + +          +   P++  L  +  +V+  + DL
Sbjct: 251 ARARLAQAEARLASLLQLLPLGREAAA------LREVLESPTIQDLRQQYAQVRQQIADL 304

Query: 558 QRQRQELSAQVKQLTEKSNSLSQQIR 583
             +      Q+  L  +   L QQI 
Sbjct: 305 STELGAKHPQLVALEAQLAELRQQIA 330


>gnl|CDD|147975 pfam06103, DUF948, Bacterial protein of unknown function (DUF948). 
           This family consists of bacterial sequences several of
           which are thought to be general stress proteins.
          Length = 90

 Score = 32.5 bits (75), Expect = 0.11
 Identities = 15/58 (25%), Positives = 26/58 (44%), Gaps = 7/58 (12%)

Query: 547 LRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI-----RPGPTGVAVA--GKSVN 597
           L  V   +  L +Q   ++ + + L  K+N L + +        P   AVA  G+SV+
Sbjct: 28  LDEVNKTLAGLTKQVDGITTETEDLLAKTNVLLEDVNGKVETLDPVFQAVADLGESVS 85



 Score = 32.1 bits (74), Expect = 0.16
 Identities = 8/36 (22%), Positives = 17/36 (47%)

Query: 37 LRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
          L  V   +  L +Q   ++ + + L  K+N L + +
Sbjct: 28 LDEVNKTLAGLTKQVDGITTETEDLLAKTNVLLEDV 63


>gnl|CDD|132062 TIGR03017, EpsF, chain length determinant protein EpsF.  Sequences
           in this family of proteins are members of the chain
           length determinant family (pfam02706) which includes the
           wzc protein from E.coli. This family of proteins are
           homologous to the EpsF protein of the methanolan
           biosynthesis operon of Methylobacillus species strain
           12S. The distribution of this protein appears to be
           restricted to a subset of exopolysaccharide operons
           containing a syntenic grouping of genes including a
           variant of the EpsH exosortase protein. Exosortase has
           been proposed to be involved in the targetting and
           processing of proteins containing the PEP-CTERM domain
           to the exopolysaccharide layer.
          Length = 444

 Score = 35.2 bits (81), Expect = 0.11
 Identities = 27/90 (30%), Positives = 45/90 (50%), Gaps = 8/90 (8%)

Query: 481 LERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSV 540
           L+ +++R  S LA  S++L        R ++E+  L+ +L     E+   T+S+     +
Sbjct: 259 LKTDIARAESKLAELSQRLGPNHPQYKRAQAEINSLKSQL---NAEIKKVTSSVGTNSRI 315

Query: 541 A-ALEAELR----RVQALVGDLQRQRQELS 565
               EAELR      +A V +L RQR E+S
Sbjct: 316 LKQREAELREALENQKAKVLELNRQRDEMS 345


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
          [Function unknown].
          Length = 265

 Score = 34.7 bits (80), Expect = 0.13
 Identities = 7/44 (15%), Positives = 20/44 (45%)

Query: 29 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
              ++ E+  +   + ++Q +  EL  ++ Q   +   L ++I
Sbjct: 46 EKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEI 89



 Score = 34.7 bits (80), Expect = 0.13
 Identities = 7/44 (15%), Positives = 20/44 (45%)

Query: 539 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
               ++ E+  +   + ++Q +  EL  ++ Q   +   L ++I
Sbjct: 46  EKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEI 89


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 35.0 bits (81), Expect = 0.14
 Identities = 24/111 (21%), Positives = 45/111 (40%), Gaps = 20/111 (18%)

Query: 481 LERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPS- 539
           LE  L  ++      +K L         + S    LR+KLQ  + E+    ASL      
Sbjct: 152 LEENLEGMKRDEELLNKDLNL-------INSIKPKLRKKLQALKEEI----ASLRQLADE 200

Query: 540 --------VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
                   +     ELR +   + + ++Q +EL  ++++LT    +L+ + 
Sbjct: 201 LNLCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKK 251


>gnl|CDD|205229 pfam13048, DUF3908, Protein of unknown function (DUF3908).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are approximately 140 amino
           acids in length. There is a single completely conserved
           residue Y that may be functionally important.
          Length = 125

 Score = 33.1 bits (76), Expect = 0.16
 Identities = 28/107 (26%), Positives = 41/107 (38%), Gaps = 13/107 (12%)

Query: 197 VLSAYIRYV-SARLADLETPKRMEHLQKLEELKRHLVELEKQVSITFNLTERNEKIPLKF 255
                 ++V  +   D E  K  E L+K++EL   L E   +V    NL   N+K+ L  
Sbjct: 1   NYEEVDQWVEGSNARDEEDRKLRELLKKVKEL---LDEKNIKVIYPKNLFNDNKKLELII 57

Query: 256 V-NDGDYI--------QAVLDRNLAENISRVLYPNDNNFGGKELRLK 293
              D   I           ++    E+I +      NN G K L LK
Sbjct: 58  FLEDNKLIVIIENEEETITIEILKLEDIKKSELETKNNNGNKNLTLK 104


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 33.4 bits (77), Expect = 0.16
 Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 6/91 (6%)

Query: 497 KKLEETVAANARLESELVVLRQKLQWSRREVCNGTASL---SNGPSVAALEAELRRVQAL 553
            +     AA  +LE E   L+ +LQ   +E+      L   +   S  A +A+ + +Q  
Sbjct: 28  SESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQK 87

Query: 554 VGDLQRQRQELSAQVKQLTEKSNSLSQQIRP 584
             +LQ+++Q      ++L +K   L Q I  
Sbjct: 88  QQELQQKQQAAQ---QELQQKQQELLQPIYD 115



 Score = 30.7 bits (70), Expect = 1.6
 Identities = 14/51 (27%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 24 LSNGPSVAALEAELR-RVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
          LS  P+  A + +L    + L  +LQ++ +EL  + ++L +++ +LS++ R
Sbjct: 27 LSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEAR 77



 Score = 30.7 bits (70), Expect = 1.6
 Identities = 14/51 (27%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 534 LSNGPSVAALEAELR-RVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
           LS  P+  A + +L    + L  +LQ++ +EL  + ++L +++ +LS++ R
Sbjct: 27  LSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEAR 77



 Score = 28.4 bits (64), Expect = 9.3
 Identities = 12/46 (26%), Positives = 23/46 (50%), Gaps = 3/46 (6%)

Query: 29  SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRP 74
           S  A +A+ + +Q    +LQ+++Q      ++L +K   L Q I  
Sbjct: 73  SEEARKAKQQELQQKQQELQQKQQAAQ---QELQQKQQELLQPIYD 115


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 34.6 bits (79), Expect = 0.21
 Identities = 25/137 (18%), Positives = 60/137 (43%), Gaps = 13/137 (9%)

Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
             +  L  A G  R       A+ +E+++ R+++  + +EL+  R  LA   ++L     
Sbjct: 85  ALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTK 144

Query: 505 ANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQEL 564
               L++ L  L ++ +    +               +L+A  +++QA    L+ Q  +L
Sbjct: 145 QAQDLQTRLKTLAEQRRQLEAQA-------------QSLQASQKQLQASATQLKSQVLDL 191

Query: 565 SAQVKQLTEKSNSLSQQ 581
             +  Q+ +++ +L+ +
Sbjct: 192 KLRSAQIEQEAQNLATR 208


>gnl|CDD|115928 pfam07303, Occludin_ELL, Occludin homology domain.  This domain
          represents a conserved region approximately 100
          residues long within eukaryotic occludin proteins and
          the RNA polymerase II elongation factor ELL. Occludin
          is an integral membrane protein that localises to tight
          junctions, while ELL is an elongation factor that can
          increase the catalytic rate of RNA polymerase II
          transcription by suppressing transient pausing by
          polymerase at multiple sites along the DNA. This shared
          domain is thought to mediate protein interactions.
          Length = 101

 Score = 31.9 bits (73), Expect = 0.24
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 35 AELRRVQALVGDLQRQRQELSAQVKQLTEKS 65
           E + +QA V  + ++ Q+L A++  L E S
Sbjct: 22 QEYKELQAEVDAINKRLQQLDAELDSLPEGS 52



 Score = 31.9 bits (73), Expect = 0.24
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 545 AELRRVQALVGDLQRQRQELSAQVKQLTEKS 575
            E + +QA V  + ++ Q+L A++  L E S
Sbjct: 22  QEYKELQAEVDAINKRLQQLDAELDSLPEGS 52


>gnl|CDD|191973 pfam08232, Striatin, Striatin family.  Striatin is an
          intracellular protein which has a caveolin-binding
          motif, a coiled-coil structure, a calmodulin-binding
          site, and a WD (pfam00400) repeat domain. It acts as a
          scaffold protein and is involved in signalling
          pathways.
          Length = 133

 Score = 32.7 bits (75), Expect = 0.24
 Identities = 14/59 (23%), Positives = 23/59 (38%)

Query: 30 VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGPTGVAGKSAQGST 88
          +A LE E R ++ L  DL+R+ + L   +KQ   K   L             +  +   
Sbjct: 34 IAQLEGERRGLENLKEDLKRRIKMLEYALKQERAKLKKLKYGTDSPQKPSKEEEEEELE 92



 Score = 31.6 bits (72), Expect = 0.52
 Identities = 14/47 (29%), Positives = 21/47 (44%)

Query: 540 VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGP 586
           +A LE E R ++ L  DL+R+ + L   +KQ   K   L        
Sbjct: 34  IAQLEGERRGLENLKEDLKRRIKMLEYALKQERAKLKKLKYGTDSPQ 80


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 34.4 bits (79), Expect = 0.25
 Identities = 24/117 (20%), Positives = 44/117 (37%), Gaps = 12/117 (10%)

Query: 470 EVERYRRQQRLLE---RELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRRE 526
           E+E  R     LE    +L  +   L    +KLE+  +    L  E   L + L+    E
Sbjct: 306 ELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLE----E 361

Query: 527 VCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
                        +  LE EL +    +  L+   QEL  ++ +L+     + +++ 
Sbjct: 362 RLK-----ELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELE 413



 Score = 34.4 bits (79), Expect = 0.27
 Identities = 27/143 (18%), Positives = 53/143 (37%), Gaps = 16/143 (11%)

Query: 447 KEMLENALGSLRHKMHGVHASPAEVE-RYRRQQRLL--------ERELSRVRSILAHNSK 497
               +  L  LR ++  +     E+E R  + + LL        E EL      L    +
Sbjct: 581 LRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELE 640

Query: 498 KLEETVAANARLESELVVLRQKLQWSRREV-------CNGTASLSNGPSVAALEAELRRV 550
           KL         L++ L  L +K++    E+        N          +  LE EL ++
Sbjct: 641 KLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQL 700

Query: 551 QALVGDLQRQRQELSAQVKQLTE 573
           +  + +L ++  E+   +++L  
Sbjct: 701 REELEELLKKLGEIEQLIEELES 723



 Score = 31.3 bits (71), Expect = 2.3
 Identities = 29/113 (25%), Positives = 44/113 (38%), Gaps = 8/113 (7%)

Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
           E+    R  R  + EL  +R  L    KKL+E         S+L  L Q L+ S  E   
Sbjct: 573 ELLEELRLLRTRKEELEELRERLKELKKKLKEL----EERLSQLEELLQSLELSEAENEL 628

Query: 530 GTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
             A       + +   +L     L   LQ   +EL  +V++L  +     Q+I
Sbjct: 629 EEAEE----ELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRI 677


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 34.3 bits (78), Expect = 0.30
 Identities = 57/281 (20%), Positives = 89/281 (31%), Gaps = 23/281 (8%)

Query: 61  LTEKSNSLSQQIRPGPTGVAGK-SAQGSTSPSGRSTAAETGTERTGQTADGEIQLFES-- 117
           L  K++     I    T   G  S +G      RS +A    E      D E +   +  
Sbjct: 70  LENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSKDDMEFKANPNEA 129

Query: 118 -------ADQARTHWSSGPGSTGVAGKSAQGSTSPSGRSTAAETGTERTGQTADGEIQLF 170
                   +Q     SS  G +  + KS   S S  G S +++     +G+ A   +   
Sbjct: 130 GKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESATDSGK-ASASVAGI 188

Query: 171 ESADQARTHWSNPGLMCILSSYVYYCVLSAYIRYVSARLADLETPKRMEHLQKLEELKRH 230
             AD+         L  +   Y     L A    +     +++  K        EE    
Sbjct: 189 VGADEEAPPAPKNTLTPLEELYETNVNLFALKHPLEKLEEEIDILKNDGDKVAEEEEFEL 248

Query: 231 LVELEKQVSITFNLTERNEKI-------PLKFVNDGDYIQ-AVLDRNLAENISRVLYPND 282
             E E+      +  E  EKI         KF  +  +I+  V DRN+A   S       
Sbjct: 249 DEEHEE---AEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKDRNIAGGFSEFF-SKL 304

Query: 283 NNFGGKELRLKQEYFMCAATLQVISLPFKDQGRALLGYSHS 323
           N F   E   K+E      +     +  K++    LG    
Sbjct: 305 NPFKKDEKIEKKEISKITFSGIGDKITDKEKIAKDLGIGAK 345


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 32.2 bits (74), Expect = 0.35
 Identities = 25/107 (23%), Positives = 45/107 (42%), Gaps = 11/107 (10%)

Query: 479 RLLE--RELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSN 536
           RLLE  RE+  V+  L    ++ E       + E EL    ++LQ S  +          
Sbjct: 1   RLLEKRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDK------- 53

Query: 537 GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
              +   EA+ RR +    + ++ R+E   ++K+L  +   L  +I 
Sbjct: 54  --FLKENEAKRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIE 98



 Score = 31.8 bits (73), Expect = 0.48
 Identities = 14/103 (13%), Positives = 37/103 (35%), Gaps = 27/103 (26%)

Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
              E   +++  L+  L +    L  N  K      A  + E E    + + +       
Sbjct: 32  QREEELEKKEEELQESLIKFDKFLKENEAKRRR---AEKKAEEEK---KLRKE------- 78

Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQL 571
                          E E++ ++A + +L+ + ++L  ++++ 
Sbjct: 79  --------------KEEEIKELKAELEELKAEIEKLEEKLEEY 107


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 33.4 bits (77), Expect = 0.45
 Identities = 21/76 (27%), Positives = 32/76 (42%), Gaps = 6/76 (7%)

Query: 507 ARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSA 566
            RL S++  L   L   R+    G   L +  SVA L A L   +A    LQ    EL+ 
Sbjct: 56  DRLNSQIAELADLLSLERQ----GNQDLQD--SVANLRASLSAAEAERSRLQALLAELAG 109

Query: 567 QVKQLTEKSNSLSQQI 582
                  ++  L+Q++
Sbjct: 110 AGAAAEGRAGELAQEL 125



 Score = 31.1 bits (71), Expect = 2.0
 Identities = 13/44 (29%), Positives = 20/44 (45%)

Query: 29  SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
           SVA L A L   +A    LQ    EL+        ++  L+Q++
Sbjct: 82  SVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQEL 125


>gnl|CDD|221084 pfam11336, DUF3138, Protein of unknown function (DUF3138).  This
           family of proteins with unknown function appear to be
           restricted to Proteobacteria.
          Length = 514

 Score = 33.4 bits (76), Expect = 0.46
 Identities = 36/141 (25%), Positives = 60/141 (42%), Gaps = 28/141 (19%)

Query: 14  RREVCNGTASLSNGPSVAALEA-ELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
           ++ +C   A    G + AA +A +++ +QA +  LQ+Q  EL A +              
Sbjct: 3   KKLICALIAGALPGAAAAASDASQIKALQAQLTALQQQVNELRAALAA------------ 50

Query: 73  RPGPTGVAGK----SAQGSTSPSGRSTAAETG---TERTGQTADGEIQLFESADQARTHW 125
           +P   G   K    +A  + +PS  + AA T    T+   Q A+  +++    D A T  
Sbjct: 51  KPAAAGGGAKIQSAAAAAAAAPSSDAAAALTNDDVTQMREQIANQSLKVDSLTDAATT-- 108

Query: 126 SSGPGSTGVAGKSAQGSTSPS 146
             GP    +AG S  G   P+
Sbjct: 109 --GP----IAGLSVTGYLDPT 123


>gnl|CDD|113664 pfam04899, MbeD_MobD, MbeD/MobD like.  The MbeD and MobD proteins
          are plasmid encoded, and are involved in the plasmids
          mobilisation and transfer in the presence of
          conjugative plasmids.
          Length = 70

 Score = 30.2 bits (68), Expect = 0.50
 Identities = 14/33 (42%), Positives = 21/33 (63%)

Query: 41 QALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
          Q + G  QR+   LS QV  L+++  SLS+Q+R
Sbjct: 34 QRMFGLTQRENAALSEQVTGLSQQVQSLSEQVR 66



 Score = 30.2 bits (68), Expect = 0.50
 Identities = 14/33 (42%), Positives = 21/33 (63%)

Query: 551 QALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
           Q + G  QR+   LS QV  L+++  SLS+Q+R
Sbjct: 34  QRMFGLTQRENAALSEQVTGLSQQVQSLSEQVR 66


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 33.5 bits (77), Expect = 0.50
 Identities = 32/139 (23%), Positives = 55/139 (39%), Gaps = 34/139 (24%)

Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
           + +  LE  LG L+ ++    A+P E+E    Q  + +  L + +        + E+  A
Sbjct: 442 EEEYELELRLGRLKQRLDSATATPEELE----QLEINDEALEKAQE-------EQEQAEA 490

Query: 505 ANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQEL 564
              +L+SEL  LR++                +    A   AE R        L + RQ L
Sbjct: 491 NVEQLQSELRQLRKR---------------RDEALEALQRAERR--------LLQLRQAL 527

Query: 565 SAQVKQLTEKSNSLSQQIR 583
                QL+ ++ SL   +R
Sbjct: 528 DELELQLSPQAGSLLHFLR 546



 Score = 32.4 bits (74), Expect = 1.1
 Identities = 19/104 (18%), Positives = 43/104 (41%), Gaps = 9/104 (8%)

Query: 489 RSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSN-----GPSVAAL 543
           R  L    + L+  VA   + E +LV    +L+  +R       +L              
Sbjct: 606 RERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQ 665

Query: 544 EAELRRVQALVGD----LQRQRQELSAQVKQLTEKSNSLSQQIR 583
           ++   +++  + +     + Q ++L AQ+KQL E+  +  + ++
Sbjct: 666 QSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALK 709



 Score = 30.4 bits (69), Expect = 5.0
 Identities = 26/161 (16%), Positives = 54/161 (33%), Gaps = 24/161 (14%)

Query: 430 SKLHSRGRNICSVQGQAKEMLENAL----GSLRHKMHGVHASPAEVERYRRQQRLLEREL 485
            K + R   I   + + K  +E  L      LR ++            +  ++  LE  L
Sbjct: 398 EKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGK------LEFNEEEYELELRL 451

Query: 486 SRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEA 545
            R             +    +A    E +  + ++     E        +   +V  L++
Sbjct: 452 GR------------LKQRLDSATATPEEL-EQLEINDEALEKAQEEQEQAEA-NVEQLQS 497

Query: 546 ELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGP 586
           ELR+++    +     Q    ++ QL +  + L  Q+ P  
Sbjct: 498 ELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLSPQA 538



 Score = 29.3 bits (66), Expect = 9.0
 Identities = 16/47 (34%), Positives = 22/47 (46%), Gaps = 1/47 (2%)

Query: 28  PSVAALEAELR-RVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
           P  AA E ELR R+Q     LQ    +     +QL + +  L +Q R
Sbjct: 596 PDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKR 642


>gnl|CDD|221757 pfam12761, End3, Actin cytoskeleton-regulatory complex protein
           END3.  Endocytosis is accomplished through the
           sequential recruitment at endocytic sites of proteins
           that drive cargo sorting, membrane invagination and
           vesicle release. End3p is part of the coat module
           protein complex Pan1, along with Pan1p, Sla1p, and
           Sla2p. The proteins in this complex are regulated by
           phosphorylation events. End3p also regulates the
           cortical actin cytoskeleton. The subunits of the Pan1
           complex are homologous to mammalian intersectin.
          Length = 193

 Score = 32.3 bits (74), Expect = 0.59
 Identities = 23/104 (22%), Positives = 44/104 (42%), Gaps = 11/104 (10%)

Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVV--LRQKLQWSRREV 527
           E  R RR+   LE +LS+ +          E     +A+  S LV   L Q L++   ++
Sbjct: 97  EEVRLRRELAELESKLSKAQQ---------ESESRNDAKPNSALVKRELEQLLKYKENQL 147

Query: 528 CNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQL 571
                  SNG ++ ++  ++  ++  V  L+        ++ QL
Sbjct: 148 RELGGRSSNGVNLGSVREDIDSIEEQVTGLEEHLASRKEELAQL 191


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 32.8 bits (75), Expect = 0.63
 Identities = 33/163 (20%), Positives = 54/163 (33%), Gaps = 29/163 (17%)

Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
           +  + L+  L  L        A  AE+   + +   L  E    ++ LA   + LEE   
Sbjct: 161 ERIDALKATLKQLA-------AVRAEIAAEQAELTTLLSEQRAQQAKLA---QLLEERKK 210

Query: 505 ANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQEL 564
             A+L SEL   ++KL+  R       + L N   +A+ EA   + +           E 
Sbjct: 211 TLAQLNSELSADQKKLEELRAN----ESRLKN--EIASAEAAAAKARE-----AAAAAEA 259

Query: 565 SAQVKQLTEKSNSLSQQIRPGPTGVAVAGKSVNRPGPTGVAGK 607
           +A   +  E   +        P               TG  G 
Sbjct: 260 AAARARAAEAKRTGETYKPTAP--------EKMLISSTGGFGA 294



 Score = 31.2 bits (71), Expect = 2.1
 Identities = 11/40 (27%), Positives = 21/40 (52%)

Query: 34 EAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
          + E+  ++  + + Q QR +L  Q+K L  +  SL  Q+ 
Sbjct: 44 QKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLI 83



 Score = 31.2 bits (71), Expect = 2.1
 Identities = 11/40 (27%), Positives = 21/40 (52%)

Query: 544 EAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
           + E+  ++  + + Q QR +L  Q+K L  +  SL  Q+ 
Sbjct: 44  QKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLI 83


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
          synthetase class II domain (pfam00587) and represents
          the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 31.0 bits (71), Expect = 0.64
 Identities = 12/53 (22%), Positives = 24/53 (45%), Gaps = 10/53 (18%)

Query: 30 VAALEAELRRVQALVGDLQRQRQELSAQVKQ----------LTEKSNSLSQQI 72
          +  L+ E R +Q  + +LQ +R ELS ++ +          L  +   L  ++
Sbjct: 31 LLELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAEALIAEVKELKDEL 83



 Score = 31.0 bits (71), Expect = 0.64
 Identities = 12/53 (22%), Positives = 24/53 (45%), Gaps = 10/53 (18%)

Query: 540 VAALEAELRRVQALVGDLQRQRQELSAQVKQ----------LTEKSNSLSQQI 582
           +  L+ E R +Q  + +LQ +R ELS ++ +          L  +   L  ++
Sbjct: 31  LLELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAEALIAEVKELKDEL 83


>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
           in cytoskeletal structure; family members include
           spectrin, alpha-actinin and dystrophin; the spectrin
           repeat forms a three helix bundle with the second helix
           interrupted by proline in some sequences; the repeats
           are independent folding units; tandem repeats are found
           in differing numbers and arrange in an antiparallel
           manner to form dimers; the repeats are defined by a
           characteristic tryptophan (W) residue in helix A and a
           leucine (L) at the carboxyl end of helix C and separated
           by a linker of 5 residues; two copies of the repeat are
           present here.
          Length = 213

 Score = 32.4 bits (74), Expect = 0.64
 Identities = 27/159 (16%), Positives = 61/159 (38%), Gaps = 23/159 (14%)

Query: 426 DQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLEREL 485
           + + ++L +    + ++    ++++E             H    E++   R + L +R  
Sbjct: 43  EALEAELAAHEERVEALNELGEQLIEEG-----------HPDAEEIQE--RLEELNQRW- 88

Query: 486 SRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEA 545
             +R +     ++LEE +     L+          QW   +     AS   G  + ++E 
Sbjct: 89  EELRELAEERRQRLEEAL----DLQQFFRDADDLEQWLEEKEAAL-ASEDLGKDLESVEE 143

Query: 546 ELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRP 584
            L++ + L  +L+     L    K L E +  L ++  P
Sbjct: 144 LLKKHKELEEELEAHEPRL----KSLNELAEELLEEGHP 178


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 32.3 bits (74), Expect = 0.66
 Identities = 27/127 (21%), Positives = 48/127 (37%), Gaps = 34/127 (26%)

Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
           E++  + +   LE EL+ +   +    K++E+      RLE  L                
Sbjct: 97  EIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEA----------- 145

Query: 530 GTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSN----SLSQQIRPG 585
                        LE E+         ++ + QELS++ ++L EK +    S  ++IR  
Sbjct: 146 ------------RLEEEVAE-------IREEGQELSSKREELKEKLDPELLSEYERIRKN 186

Query: 586 PTGVAVA 592
             GV V 
Sbjct: 187 KKGVGVV 193



 Score = 32.0 bits (73), Expect = 0.89
 Identities = 22/115 (19%), Positives = 46/115 (40%), Gaps = 20/115 (17%)

Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQ-WSRREVC 528
           E+E    Q   LE E+  +R  +    +KL     +  + E EL  L  ++Q    R   
Sbjct: 53  ELEDLENQVSQLESEIQEIRERIKRAEEKL-----SAVKDERELRALNIEIQIAKERIN- 106

Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
                        +LE EL  +   +  L+++ ++L  ++++L +       ++ 
Sbjct: 107 -------------SLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLE 148


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 33.0 bits (75), Expect = 0.72
 Identities = 21/103 (20%), Positives = 39/103 (37%)

Query: 478 QRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNG 537
            + L+ E  +  S +    KKLE   A   RL+  L ++ ++    R  + +    L+  
Sbjct: 373 LQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNSYDKELTET 432

Query: 538 PSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQ 580
                L   L   + LV  +Q    ++  Q+ +L E       
Sbjct: 433 SVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVGQQKD 475



 Score = 30.3 bits (68), Expect = 4.6
 Identities = 23/115 (20%), Positives = 47/115 (40%), Gaps = 6/115 (5%)

Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
            E++  + Q    ER LS V+        K+E       RL  E  +L  KL+    +  
Sbjct: 482 TEIKLLKEQLVSNERLLSFVKEATNALRLKIETLERERDRLRQEKSLLEMKLEHLCLQGD 541

Query: 529 NGTAS-----LSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSL 578
              +      +S  P+  A +   + ++AL  + ++ ++ L A +++   +   L
Sbjct: 542 YNASRTKVLHMSLNPASEAEQIAKQTIEALQAECEKLKERLQA-LEEGKSQPGDL 595


>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin. 
          Length = 237

 Score = 32.2 bits (74), Expect = 0.77
 Identities = 25/68 (36%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
           +A+E LE A   L         + AEV    R+ +LLE +L R    LA   +KLEE   
Sbjct: 12  KAEERLEEAQEKLEEAEKRAEKAEAEVASLNRRIQLLEEDLERSEERLATALEKLEEAEK 71

Query: 505 ANARLESE 512
           A    ESE
Sbjct: 72  AAD--ESE 77


>gnl|CDD|191851 pfam07798, DUF1640, Protein of unknown function (DUF1640).  This
           family consists of sequences derived from hypothetical
           eukaryotic proteins. A region approximately 100 residues
           in length is featured.
          Length = 177

 Score = 31.8 bits (73), Expect = 0.79
 Identities = 14/53 (26%), Positives = 28/53 (52%), Gaps = 4/53 (7%)

Query: 470 EVERYRRQQRL-LERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQ 521
           E+ +     RL L  E  R+R   A    K++ET   + ++++E+  L+ +L+
Sbjct: 99  EITKVTAGVRLDLNLEKGRIREESAEQELKIKET---DTKIDTEIAGLKTQLE 148



 Score = 28.7 bits (65), Expect = 6.9
 Identities = 22/124 (17%), Positives = 50/124 (40%), Gaps = 22/124 (17%)

Query: 460 KMHGVHASPAEVERYRRQQRLLERELSRVRS-ILAHNSKKLEETVAANARLESELVVLRQ 518
            +     +  + E+   QQ   + + ++++S + +    +     A N +L+++L  L+ 
Sbjct: 38  NVSKDMVTKEQQEKTEYQQ---KVDFAKLKSELQSLEKSEFANLQAENEKLKNDLEKLKN 94

Query: 519 KLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSL 578
           +L   R E+   TA          L+  L +         R R+E + Q  ++ E    +
Sbjct: 95  RL---RDEITKVTAG-------VRLDLNLEK--------GRIREESAEQELKIKETDTKI 136

Query: 579 SQQI 582
             +I
Sbjct: 137 DTEI 140


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 31.1 bits (71), Expect = 0.85
 Identities = 23/114 (20%), Positives = 46/114 (40%), Gaps = 17/114 (14%)

Query: 472 ERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGT 531
           E    + + L+ +L     I     +K E  +  +A    EL  LR++L           
Sbjct: 20  EDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQLN---------- 69

Query: 532 ASLSNGPSVAALEAELRRVQALVGDLQR----QRQELSAQVKQLTEKSNSLSQQ 581
                   +A L+AE    QA + + +     Q++ L  ++ +L ++   L++Q
Sbjct: 70  ---ELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRIEELNEQ 120


>gnl|CDD|117396 pfam08826, DMPK_coil, DMPK coiled coil domain like.  This domain is
           found in the myotonic dystrophy protein kinase (DMPK)
           and adopts a coiled coil structure. It plays a role in
           dimerisation.
          Length = 61

 Score = 29.4 bits (66), Expect = 0.88
 Identities = 15/51 (29%), Positives = 30/51 (58%)

Query: 476 RQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRRE 526
           R ++ L+ EL +V++   +   KL+E  A N  LE+E+  L+++++  R  
Sbjct: 11  RAKQSLQEELEKVKAANINFESKLQEAEAKNRELEAEVRQLKKRMEELRAR 61


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 32.6 bits (75), Expect = 1.1
 Identities = 21/107 (19%), Positives = 44/107 (41%), Gaps = 27/107 (25%)

Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
           +E+E+  RQQ+  ER       +LA   K+L + +     LE     L  +L+       
Sbjct: 523 SELEQRLRQQQRAER-------LLAEFCKRLGKNLDDEDELEQLQEELEARLE------- 568

Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKS 575
                        +L   +   +     L++Q ++L A++++L  ++
Sbjct: 569 -------------SLSESVSEARERRMALRQQLEQLQARIQRLAARA 602



 Score = 30.7 bits (70), Expect = 3.9
 Identities = 38/177 (21%), Positives = 65/177 (36%), Gaps = 37/177 (20%)

Query: 426  DQIASKLHSRGRNICSVQG---QAKEMLEN------ALGSLRHKMHGVHASPAE------ 470
            + I S L S       ++    QA++   +      AL  +  +    H S  +      
Sbjct: 924  EPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAFALTEVVQRRA--HFSYEDAAEMLA 981

Query: 471  -----VERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRR 525
                  E+ R++    E+E +R R  L     +L +     A L+S     RQ LQ  ++
Sbjct: 982  KNSDLNEKLRQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQ 1041

Query: 526  EVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
            E       L +    A   AE R         + +R EL A++     + N L +Q+
Sbjct: 1042 E-------LQDLGVPADSGAEER--------ARARRDELHARLSANRSRRNQLEKQL 1083



 Score = 29.5 bits (67), Expect = 8.4
 Identities = 12/36 (33%), Positives = 17/36 (47%)

Query: 466 ASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEE 501
               +V  Y +Q    EREL+  R  LA   + L+E
Sbjct: 624 EDSQDVTEYMQQLLERERELTVERDELAARKQALDE 659


>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator.  DivIC from B.
          subtilis is necessary for both vegetative and
          sporulation septum formation. These proteins are mainly
          composed of an amino terminal coiled-coil.
          Length = 80

 Score = 29.5 bits (67), Expect = 1.2
 Identities = 12/42 (28%), Positives = 23/42 (54%)

Query: 32 ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
           L  E+  +QA +  L+ + +EL A+VK L    + + ++ R
Sbjct: 21 QLNQEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEERAR 62



 Score = 29.5 bits (67), Expect = 1.2
 Identities = 12/42 (28%), Positives = 23/42 (54%)

Query: 542 ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
            L  E+  +QA +  L+ + +EL A+VK L    + + ++ R
Sbjct: 21  QLNQEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEERAR 62


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 31.6 bits (72), Expect = 1.3
 Identities = 27/111 (24%), Positives = 48/111 (43%), Gaps = 8/111 (7%)

Query: 472 ERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVL---RQKLQWSRREVC 528
           E   R+Q+ LE  + ++   +    K+LEE       LE +L       Q L+    E+ 
Sbjct: 1   EEAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELE 60

Query: 529 NGTASLSNGPSVAA-----LEAELRRVQALVGDLQRQRQELSAQVKQLTEK 574
                L    + +      LEAE+    A V  L+ +R++  A+ +QL ++
Sbjct: 61  EENRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQE 111


>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16).  Autophagy is a
           ubiquitous intracellular degradation system for
           eukaryotic cells. During autophagy, cytoplasmic
           components are enclosed in autophagosomes and delivered
           to lysosomes/vacuoles. ATG16 (also known as Apg16) has
           been shown to be bind to Apg5 and is required for the
           function of the Apg12p-Apg5p conjugate in the yeast
           autophagy pathway.
          Length = 194

 Score = 31.3 bits (71), Expect = 1.3
 Identities = 17/89 (19%), Positives = 38/89 (42%), Gaps = 9/89 (10%)

Query: 495 NSKKLEETVAANARLE---SELVVLRQKLQWSRREVCNGTASLS------NGPSVAALEA 545
             K   E      +L      L V    LQ  + E  +  +S        +  ++A +E 
Sbjct: 15  REKAFSELFEQYRQLADRTRLLKVENDALQAEKYEQQSSHSSSPSADGPGSDAAIAEMEQ 74

Query: 546 ELRRVQALVGDLQRQRQELSAQVKQLTEK 574
           +L +++  + +L ++R EL+ ++  L ++
Sbjct: 75  KLAKLREELTELHKKRGELAQRLLLLNDE 103


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 32.3 bits (73), Expect = 1.3
 Identities = 23/118 (19%), Positives = 48/118 (40%), Gaps = 14/118 (11%)

Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
           E++  + + + +E  LS++         KL+  + A    + ++     +L+    +V  
Sbjct: 246 ELDPLKNRLKEIEHNLSKIM--------KLDNEIKALKSRKKQMEKDNSELELKMEKVFQ 297

Query: 530 GTASLSN------GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQ 581
           GT    N        +V   E EL   Q  +  L ++R+ L+ +  +L  +   L  Q
Sbjct: 298 GTDEQLNDLYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQ 355


>gnl|CDD|236989 PRK11809, putA, trifunctional transcriptional regulator/proline
            dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
            Reviewed.
          Length = 1318

 Score = 32.3 bits (74), Expect = 1.4
 Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 5/62 (8%)

Query: 22   ASLSNGPSVAALEAELRRV-----QALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGP 76
             +L+   +   ++A+LR        AL      +  EL+A   Q  E + + + ++ PGP
Sbjct: 1117 VTLARQDAEYPVDAQLRAALLAPLTALREWAAEREPELAALCDQYAELAQAGTTRLLPGP 1176

Query: 77   TG 78
            TG
Sbjct: 1177 TG 1178



 Score = 32.3 bits (74), Expect = 1.4
 Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 5/62 (8%)

Query: 532  ASLSNGPSVAALEAELRRV-----QALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGP 586
             +L+   +   ++A+LR        AL      +  EL+A   Q  E + + + ++ PGP
Sbjct: 1117 VTLARQDAEYPVDAQLRAALLAPLTALREWAAEREPELAALCDQYAELAQAGTTRLLPGP 1176

Query: 587  TG 588
            TG
Sbjct: 1177 TG 1178


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 31.0 bits (71), Expect = 1.4
 Identities = 32/125 (25%), Positives = 55/125 (44%), Gaps = 26/125 (20%)

Query: 479 RLLERELSRVRSILAHNSKKLEETVAANARL-------ESELVVLRQKLQWSRREVCNGT 531
           + L+ E++ ++    HN K + E    N RL       E E+  LR+KL+   ++     
Sbjct: 30  KSLKEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDK---- 85

Query: 532 ASLSNGPS-VAALEAELRRVQ-------ALVGDLQRQRQELSAQ-------VKQLTEKSN 576
            SL N  + +  LE EL+ ++            ++R+R EL  +       V+Q T   N
Sbjct: 86  QSLKNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKFEAAIQDVQQKTGLKN 145

Query: 577 SLSQQ 581
            L +Q
Sbjct: 146 LLLEQ 150


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 31.9 bits (73), Expect = 1.4
 Identities = 30/124 (24%), Positives = 52/124 (41%), Gaps = 18/124 (14%)

Query: 466 ASPAEVERYRRQQRLLERELSRVRSILA---HNSKKLEETVAANARLESELVVLRQKLQW 522
           A P  ++    QQ L E   S +R+ L       K+LE  +A    L+++L  LRQ+L+ 
Sbjct: 123 AVPELIKG---QQSLFESRKSTLRAQLELILAQIKQLEAELA---GLQAQLQALRQQLEV 176

Query: 523 SRREVCNGTASLSN---GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLS 579
              E+      L        +  LE E  R      + Q +   L A+++ L  + + L 
Sbjct: 177 ISEEL-EARRKLKEKGLVSRLELLELERERA-----EAQGELGRLEAELEVLKRQIDELQ 230

Query: 580 QQIR 583
            + +
Sbjct: 231 LERQ 234


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 32.2 bits (74), Expect = 1.4
 Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 17/125 (13%)

Query: 467 SPAEVERYRRQQRLLERELSRVRSILAHNS-------KKLEETVAANARLESELVVLRQK 519
            P++    + +Q LL  +    R  L  N+       K+ +   A   RLE +L +L++ 
Sbjct: 190 RPSQRVLLQAEQALLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQLQLLQEA 249

Query: 520 LQWSRREVCNGTASLSNGPSVAALEAELRRVQA---LVGDLQRQRQELSAQVKQLTEKSN 576
           +   R  +   T   +     AA      R+QA   +  +L+   Q LS ++ + TEK N
Sbjct: 250 INSKRLTLSEKTVQEAQSQDEAA------RIQANPLVAQELEINLQ-LSQRLLKATEKLN 302

Query: 577 SLSQQ 581
           +L+QQ
Sbjct: 303 TLTQQ 307


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 31.8 bits (73), Expect = 1.5
 Identities = 25/82 (30%), Positives = 43/82 (52%), Gaps = 6/82 (7%)

Query: 500 EETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQR 559
           E+       L+ E++ L+Q+L+   RE     A         A + EL  ++ L  +L+ 
Sbjct: 138 EDPENLLHALQQEVLTLKQQLELQAREKAQSQALA------EAQQQELVALEGLAAELEE 191

Query: 560 QRQELSAQVKQLTEKSNSLSQQ 581
           ++QEL AQ++QL EK+   SQ+
Sbjct: 192 KQQELEAQLEQLQEKAAETSQE 213



 Score = 29.5 bits (67), Expect = 9.1
 Identities = 23/71 (32%), Positives = 39/71 (54%), Gaps = 6/71 (8%)

Query: 1   SELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQ 60
            E++ L+Q+L+   RE     A         A + EL  ++ L  +L+ ++QEL AQ++Q
Sbjct: 149 QEVLTLKQQLELQAREKAQSQALA------EAQQQELVALEGLAAELEEKQQELEAQLEQ 202

Query: 61  LTEKSNSLSQQ 71
           L EK+   SQ+
Sbjct: 203 LQEKAAETSQE 213


>gnl|CDD|165553 PHA03296, PHA03296, envelope glycoprotein H; Provisional.
          Length = 814

 Score = 31.9 bits (72), Expect = 1.6
 Identities = 17/65 (26%), Positives = 27/65 (41%), Gaps = 9/65 (13%)

Query: 664 DLDMNSTLDVAIESPCISYPSSPQ-HMTSSPLPSSPTSTSIP-----LYVNTDSGTDVKL 717
           D+ +N   +VA  S CI+ P+SP   M ++P  +    +        L    D    V +
Sbjct: 203 DIKINIGKNVA--SLCIASPASPPIEMIATPRSAKLKESRFKQWPRMLKA-EDQELSVLI 259

Query: 718 NGKTG 722
           N   G
Sbjct: 260 NVAAG 264


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 31.7 bits (72), Expect = 1.6
 Identities = 39/156 (25%), Positives = 64/156 (41%), Gaps = 47/156 (30%)

Query: 448 EMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANA 507
           E LE  L  L+H+   + +  AE        + L+ E+     +L  +S K         
Sbjct: 270 EELEKELAELQHRNDELTSLAAES-------QALKDEID----VLRESSDKA-------K 311

Query: 508 RLESELVVLRQKLQWS---RREV-----CNGTASLSNGPSVAALEAELRRVQALVGDLQ- 558
           +LE+++   ++KL+     RR+V      N     +       LE EL++  A  G L+ 
Sbjct: 312 KLEAQVETYKKKLEDLNDLRRQVKLLEERNAMYMQNT----VQLEEELKKANAARGQLET 367

Query: 559 --RQRQELSAQV--------------KQLTEKSNSL 578
             RQ QEL A++              K+L EK  +L
Sbjct: 368 YKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEAL 403


>gnl|CDD|219737 pfam08172, CASP_C, CASP C terminal.  This domain is the C-terminal
           region of the CASP family of proteins. It is a Golgi
           membrane protein which is thought to have a role in
           vesicle transport.
          Length = 245

 Score = 31.2 bits (71), Expect = 1.7
 Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 2/72 (2%)

Query: 33  LEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGPTGVAGKSAQGSTSPSG 92
           LE ELR+    +  L+R+ + L A   +L EK   L  Q   G +  +  +   S   S 
Sbjct: 98  LEEELRKQNQTISSLRRELESLKADNIKLYEKIRYL--QSYQGNSPPSDSAVSNSPGRSS 155

Query: 93  RSTAAETGTERT 104
                 +    +
Sbjct: 156 LQPGPGSIDVES 167



 Score = 30.1 bits (68), Expect = 3.7
 Identities = 16/59 (27%), Positives = 23/59 (38%)

Query: 543 LEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGPTGVAVAGKSVNRPGP 601
           LE ELR+    +  L+R+ + L A   +L EK   L       P   +    S  R   
Sbjct: 98  LEEELRKQNQTISSLRRELESLKADNIKLYEKIRYLQSYQGNSPPSDSAVSNSPGRSSL 156


>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450).  This
          family of proteins are functionally uncharacterized.
          This protein is found in bacteria and eukaryotes.
          Proteins in this family are about 260 amino acids in
          length.
          Length = 250

 Score = 31.0 bits (71), Expect = 1.9
 Identities = 12/42 (28%), Positives = 21/42 (50%)

Query: 31 AALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
            L AE+R++   + +L+    +L   V    ++  SL QQI
Sbjct: 53 QELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEIASLQQQI 94



 Score = 31.0 bits (71), Expect = 1.9
 Identities = 12/42 (28%), Positives = 21/42 (50%)

Query: 541 AALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
             L AE+R++   + +L+    +L   V    ++  SL QQI
Sbjct: 53  QELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEIASLQQQI 94


>gnl|CDD|197544 smart00150, SPEC, Spectrin repeats. 
          Length = 101

 Score = 29.2 bits (66), Expect = 2.0
 Identities = 13/55 (23%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 22 ASLSNGPSVAALEAELRRVQALVGDLQRQR---QELSAQVKQLTEKSNSLSQQIR 73
          AS   G  + ++EA L++ +A   +L+      + L+   +QL E+ +  +++I 
Sbjct: 22 ASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEEGHPDAEEIE 76



 Score = 29.2 bits (66), Expect = 2.0
 Identities = 13/55 (23%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 532 ASLSNGPSVAALEAELRRVQALVGDLQRQR---QELSAQVKQLTEKSNSLSQQIR 583
           AS   G  + ++EA L++ +A   +L+      + L+   +QL E+ +  +++I 
Sbjct: 22  ASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEEGHPDAEEIE 76


>gnl|CDD|182745 PRK10803, PRK10803, tol-pal system protein YbgF; Provisional.
          Length = 263

 Score = 30.9 bits (70), Expect = 2.1
 Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 2/78 (2%)

Query: 33  LEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGPTGVAGKSAQGSTSPSG 92
           L+ +L   Q+ +  L+ Q QE   Q+ Q+ E+   +  QI    +G  G +AQ ++    
Sbjct: 59  LQQQLSDNQSDIDSLRGQIQENQYQLNQVVERQKQIYLQIDSLSSG--GAAAQSTSGDQS 116

Query: 93  RSTAAETGTERTGQTADG 110
            + A+ T     G    G
Sbjct: 117 GAAASATPAADAGTANAG 134


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 31.3 bits (71), Expect = 2.2
 Identities = 30/157 (19%), Positives = 58/157 (36%), Gaps = 16/157 (10%)

Query: 442 VQGQAKEMLENALGSLRHKMHGVHASP----AEVERYRRQQRLLERELSRVRSILAHNSK 497
            +  A EML          +           A  E+   +    E+E+S   + L     
Sbjct: 777 AEAVAPEMLGTPADETARALKQRLKRARDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEA 836

Query: 498 KLEETV-AANARLESELVVLRQKLQWS---RREVCNGTASLSN---GPSVAALEAE---- 546
           +L   + AA      EL+   ++       R+ +     +L     G S+ AL AE    
Sbjct: 837 RLTALLRAARCTTIEELLAAVERSDTYRELRKRIAALERTLVRAGGGLSLEALVAEAAAL 896

Query: 547 -LRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
               + A + +L R  +EL  ++ +L ++  +  Q++
Sbjct: 897 DPDELPARLEELARDIEELEEELNELAQEVGAAKQEL 933



 Score = 30.5 bits (69), Expect = 4.5
 Identities = 26/137 (18%), Positives = 47/137 (34%), Gaps = 14/137 (10%)

Query: 457 LRHKMHGVHASPAEVERYRRQ------QRLLEREL-SRVRSILAHNSKKLEETVAANARL 509
           +R     +  + +  E           + L E +L   ++S+      + E       +L
Sbjct: 518 IRRGYALLEPTASAYEEAVESADQLADRLLREAQLVGELQSLRQ----QEEAARRRLEQL 573

Query: 510 ESELVVLRQKLQWSRREVCNGTASLSNGPSVAALE---AELRRVQALVGDLQRQRQELSA 566
           E EL VL   L   R       A+     + A +E   AE    +  V    + R EL A
Sbjct: 574 EKELEVLELALAALREAWQAQWAAAGLPLTPAEMEDWLAERATAREQVRAYFKARAELDA 633

Query: 567 QVKQLTEKSNSLSQQIR 583
            + +      +L   ++
Sbjct: 634 LLDRRARLRAALRAALK 650



 Score = 30.5 bits (69), Expect = 4.8
 Identities = 34/146 (23%), Positives = 57/146 (39%), Gaps = 18/146 (12%)

Query: 445 QAKEMLENALGSLRH--------KMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNS 496
            A +  E  L +L          ++             R++   LER L R     A   
Sbjct: 829 AALDEAEARLTALLRAARCTTIEELLAAVERSDTYRELRKRIAALERTLVR-----AGGG 883

Query: 497 KKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSN-GPSVAALEAELRRV--QAL 553
             LE  VA  A L+     L  +L+   R++      L+     V A + EL R+   + 
Sbjct: 884 LSLEALVAEAAALD--PDELPARLEELARDIEELEEELNELAQEVGAAKQELARMDGGST 941

Query: 554 VGDLQRQRQELSAQVKQLTEKSNSLS 579
             +L+ +R+ L AQ++ L E+   L+
Sbjct: 942 AAELEAERESLLAQLRDLAERYLELA 967


>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
           (DUF2317).  Members of this family of hypothetical
           bacterial proteins have no known function.
          Length = 541

 Score = 31.0 bits (71), Expect = 2.3
 Identities = 17/78 (21%), Positives = 34/78 (43%), Gaps = 1/78 (1%)

Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
             +++  + L  L+ K+    A+      +   +  LE +  R+R   A     LE  + 
Sbjct: 398 SIEDVFLHGLELLKEKVLEEKANSEIDIDFEEAKAELEAQFKRLREEAAKIDPSLEGALE 457

Query: 505 AN-ARLESELVVLRQKLQ 521
           AN A++  +L  L ++L 
Sbjct: 458 ANEAKILKQLEFLEKRLL 475


>gnl|CDD|217914 pfam04129, Vps52, Vps52 / Sac2 family.  Vps52 complexes with
          Vps53 and Vps54 to form a multi- subunit complex
          involved in regulating membrane trafficking events.
          Length = 511

 Score = 31.2 bits (71), Expect = 2.4
 Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 7/67 (10%)

Query: 7  RQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSN 66
          +  +  S        ASL N   +AA ++ L R++ ++   Q     +S  +K L EKSN
Sbjct: 7  QDVIDES-----ENLASLHN--QIAACDSVLERMEDMLTSFQSDLSSISQDIKFLQEKSN 59

Query: 67 SLSQQIR 73
           +  ++ 
Sbjct: 60 EMQLRLE 66



 Score = 31.2 bits (71), Expect = 2.4
 Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 7/67 (10%)

Query: 517 RQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSN 576
           +  +  S        ASL N   +AA ++ L R++ ++   Q     +S  +K L EKSN
Sbjct: 7   QDVIDES-----ENLASLHN--QIAACDSVLERMEDMLTSFQSDLSSISQDIKFLQEKSN 59

Query: 577 SLSQQIR 583
            +  ++ 
Sbjct: 60  EMQLRLE 66


>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat.  Spectrin repeat-domains
          are found in several proteins involved in cytoskeletal
          structure. These include spectrin, alpha-actinin and
          dystrophin. The sequence repeat used in this family is
          taken from the structural repeat in reference. The
          spectrin domain- repeat forms a three helix bundle. The
          second helix is interrupted by proline in some
          sequences. The repeats are defined by a characteristic
          tryptophan (W) residue at position 17 in helix A and a
          leucine (L) at 2 residues from the carboxyl end of
          helix C. Although the domain occurs in ultiple repeats
          along sequences, the domains are actually stable on
          their own - ie they act, biophysically, like domains
          rather than repeats that along function when
          aggregated.
          Length = 105

 Score = 29.2 bits (66), Expect = 2.5
 Identities = 11/50 (22%), Positives = 29/50 (58%), Gaps = 3/50 (6%)

Query: 27 GPSVAALEAELRRVQALVGDL---QRQRQELSAQVKQLTEKSNSLSQQIR 73
          G  + +++A L++ +AL  +L   Q + + L+   ++L  + +  S++I+
Sbjct: 30 GKDLESVQALLKKHKALEAELAAHQDRVEALNELAEKLIAEGHYASEEIQ 79



 Score = 29.2 bits (66), Expect = 2.5
 Identities = 11/50 (22%), Positives = 29/50 (58%), Gaps = 3/50 (6%)

Query: 537 GPSVAALEAELRRVQALVGDL---QRQRQELSAQVKQLTEKSNSLSQQIR 583
           G  + +++A L++ +AL  +L   Q + + L+   ++L  + +  S++I+
Sbjct: 30  GKDLESVQALLKKHKALEAELAAHQDRVEALNELAEKLIAEGHYASEEIQ 79


>gnl|CDD|240354 PTZ00310, PTZ00310, AMP deaminase; Provisional.
          Length = 1453

 Score = 31.3 bits (71), Expect = 2.5
 Identities = 36/149 (24%), Positives = 59/149 (39%), Gaps = 12/149 (8%)

Query: 439  ICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKK 498
            + S   QA E++   + +   +   VH   A+V+  RRQQR     + R+        ++
Sbjct: 1316 LLSTHDQAMEVMLREMETTNER--IVH-LRAQVDSLRRQQR---SLVERLTEEGIR-RQE 1368

Query: 499  LEETVAANARLESELV-VLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDL 557
            + E VA      +  V  LR++LQ     V  G   L    +    EA   +    V  L
Sbjct: 1369 VSEQVAGENNSIALRVNRLRERLQSEGLTVGGG---LYGRSNEEESEASSTQQGETVKRL 1425

Query: 558  QRQRQELSAQVKQLTEKSNSLSQQIRPGP 586
             R +      ++ +T  S   S+  RP P
Sbjct: 1426 LRWKPMPPDVMRAVTVASVPGSRG-RPLP 1453


>gnl|CDD|216560 pfam01544, CorA, CorA-like Mg2+ transporter protein.  The CorA
           transport system is the primary Mg2+ influx system of
           Salmonella typhimurium and Escherichia coli. CorA is
           virtually ubiquitous in the Bacteria and Archaea. There
           are also eukaryotic relatives of this protein. The
           family includes the MRS2 protein from yeast that is
           thought to be an RNA splicing protein. However its
           membership of this family suggests that its effect on
           splicing is due to altered magnesium levels in the cell.
          Length = 291

 Score = 30.7 bits (70), Expect = 2.7
 Identities = 18/87 (20%), Positives = 36/87 (41%), Gaps = 8/87 (9%)

Query: 188 ILSSYVYYCVLSAYIRYVSARLADLE----TPKRMEHLQKLEELKRHLVELEKQVSITFN 243
           +L S V        +  +   L +LE         E L++L  L+R LV L + ++    
Sbjct: 115 LLDSIVDR--YFEILEKLEEELDELEDELEDSTTNELLRELLRLRRSLVRLRRSLAPLRE 172

Query: 244 LTERNEKIPLKFVNDGD--YIQAVLDR 268
           +  R        +++ +  Y++ +LD 
Sbjct: 173 VLNRLLSDDGPLIDEEEKEYLRDLLDE 199


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 30.7 bits (70), Expect = 2.8
 Identities = 21/111 (18%), Positives = 33/111 (29%), Gaps = 26/111 (23%)

Query: 478 QRLLERELSRVRSILAHNSKKLEETVAAN-----ARLESELVVLRQ----------KLQW 522
             LLE +     +  A         VAA      A L +E+  L+               
Sbjct: 112 TVLLEIQQEEQAAAQA--------QVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQ 163

Query: 523 SRREVCNGTAS---LSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQ 570
             +    GT        GP       +L   QA +  L+ +     AQ++ 
Sbjct: 164 EAQCEAEGTGGTGVAGKGPVYKEKREKLDAAQARLETLKARLDAAIAQLEA 214


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 30.8 bits (71), Expect = 2.9
 Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 10/51 (19%)

Query: 32 ALEAELRRVQALVGDLQRQRQELSAQVKQ----------LTEKSNSLSQQI 72
           L+ E R +Q  + +LQ +R  LS ++ Q          L  +   L ++I
Sbjct: 32 ELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEI 82



 Score = 30.8 bits (71), Expect = 2.9
 Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 10/51 (19%)

Query: 542 ALEAELRRVQALVGDLQRQRQELSAQVKQ----------LTEKSNSLSQQI 582
            L+ E R +Q  + +LQ +R  LS ++ Q          L  +   L ++I
Sbjct: 32  ELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEI 82


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 30.4 bits (69), Expect = 3.0
 Identities = 23/112 (20%), Positives = 47/112 (41%), Gaps = 25/112 (22%)

Query: 480 LLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPS 539
           LL +EL  + SI      + +        LE EL  L+Q       +           P+
Sbjct: 162 LLMKELELLNSIKPKLRDRKDA-------LEEELRQLKQL-----EDELE-----DCDPT 204

Query: 540 -VAALEAELRRVQA-------LVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
            +   + +L+++          + +L+ + QEL ++++ LT K + L+ +I 
Sbjct: 205 ELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIA 256


>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
           The small mechanosensitive channel, MscS, is a part of
           the turgor-driven solute efflux system that protects
           bacteria from lysis in the event of osmotic shock. The
           MscS protein alone is sufficient to form a functional
           mechanosensitive channel gated directly by tension in
           the lipid bilayer. The MscS proteins are heptamers of
           three transmembrane subunits with seven converging M3
           domains, and this MscS_porin is towards the N-terminal
           of the molecules. The high concentration of negative
           charges at the extracellular entrance of the pore helps
           select the cations for efflux.
          Length = 239

 Score = 30.3 bits (69), Expect = 3.1
 Identities = 12/44 (27%), Positives = 25/44 (56%)

Query: 29  SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
           S++ LE  L +  + + +LQ Q Q+ ++Q+ +L  +     QQ+
Sbjct: 79  SLSQLEQRLAQTLSQLQELQEQLQQENSQLIELQTRPERAQQQL 122



 Score = 30.3 bits (69), Expect = 3.1
 Identities = 12/44 (27%), Positives = 25/44 (56%)

Query: 539 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
           S++ LE  L +  + + +LQ Q Q+ ++Q+ +L  +     QQ+
Sbjct: 79  SLSQLEQRLAQTLSQLQELQEQLQQENSQLIELQTRPERAQQQL 122


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 30.0 bits (68), Expect = 3.7
 Identities = 27/108 (25%), Positives = 52/108 (48%), Gaps = 7/108 (6%)

Query: 478 QRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNG 537
           +++LE+ +  + S LA   + L + +A   +LE +L     + +  + E      +L  G
Sbjct: 26  EKMLEQAIRDMESELAKARQALAQAIARQKQLERKLE--EAQARAEKLEE-KAELALQAG 82

Query: 538 PSVAALEAELRRVQALVGD---LQRQRQELSAQVKQLTEKSNSLSQQI 582
               A EA L   Q+L      L+ + Q+   QV++L ++  +L Q+I
Sbjct: 83  NEDLAREA-LEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKI 129


>gnl|CDD|234592 PRK00045, hemA, glutamyl-tRNA reductase; Reviewed.
          Length = 423

 Score = 30.5 bits (70), Expect = 3.7
 Identities = 20/96 (20%), Positives = 34/96 (35%), Gaps = 20/96 (20%)

Query: 488 VRSILAHNSKK-LEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAAL--- 543
           ++ I+  N  +  E    A A +E E+    +       EV          P++ AL   
Sbjct: 303 LQEIVEENLAQRQEAAEKAEAIVEEEVAEFME--WLRSLEV---------VPTIRALREQ 351

Query: 544 -----EAELRRVQALVGDLQRQRQELSAQVKQLTEK 574
                E EL R    +G  + + + L    + L  K
Sbjct: 352 AEEIREEELERALKKLGPGEDEEEVLEKLARSLVNK 387


>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein. 
          Length = 312

 Score = 30.2 bits (69), Expect = 3.9
 Identities = 24/95 (25%), Positives = 39/95 (41%), Gaps = 28/95 (29%)

Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVA-ANARLESELVVLRQKLQWSRREVC 528
           E+   RRQ + LE EL  ++   A     LE  +A    R E EL   +  +        
Sbjct: 217 EITELRRQIQSLEIELQSLKKQKA----SLERQLAELEERYELELADYQDTI-------- 264

Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQE 563
                       + LE EL++++A   ++ RQ +E
Sbjct: 265 ------------SELEEELQQLKA---EMARQLRE 284


>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR).  This
           family consists of several mammalian alpha helical
           coiled-coil rod HCR proteins. The function of HCR is
           unknown but it has been implicated in psoriasis in
           humans and is thought to affect keratinocyte
           proliferation.
          Length = 739

 Score = 30.4 bits (68), Expect = 4.0
 Identities = 33/132 (25%), Positives = 47/132 (35%), Gaps = 13/132 (9%)

Query: 448 EMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANA 507
           +    AL SLR K  G+  S   +E  R  +             L     K +E + A  
Sbjct: 158 QAHLEALSSLRSKAEGLEKSLQSLETRRAGEAKALAAAQAEADTLREQLSKTQEELEAQV 217

Query: 508 RLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQ 567
            L  +L          R+ V  G   ++   S  A E E   +   V  LQ  R  L A 
Sbjct: 218 TLVEQL----------RKYV--GEQVVAESRS-QAWELERGELLQTVKHLQEDRAALQAT 264

Query: 568 VKQLTEKSNSLS 579
           V+ L  +  SL+
Sbjct: 265 VELLQVRVQSLT 276


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 835

 Score = 30.4 bits (69), Expect = 4.2
 Identities = 17/73 (23%), Positives = 26/73 (35%), Gaps = 5/73 (6%)

Query: 509 LESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQV 568
           L +E   L   LQ   +E  N        P+    E        L+        +LS  +
Sbjct: 20  LTAESAQLEAALQ-LLQEAVNSKRQEEAEPAAEEAEL----QAELIQQELAINDQLSQAL 74

Query: 569 KQLTEKSNSLSQQ 581
            Q TE+ N+L+  
Sbjct: 75  NQQTERLNALASD 87



 Score = 30.0 bits (68), Expect = 5.8
 Identities = 16/70 (22%), Positives = 24/70 (34%), Gaps = 5/70 (7%)

Query: 2  ELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQL 61
          E   L   LQ   +E  N        P+    E        L+        +LS  + Q 
Sbjct: 23 ESAQLEAALQ-LLQEAVNSKRQEEAEPAAEEAEL----QAELIQQELAINDQLSQALNQQ 77

Query: 62 TEKSNSLSQQ 71
          TE+ N+L+  
Sbjct: 78 TERLNALASD 87


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 30.4 bits (69), Expect = 4.4
 Identities = 18/94 (19%), Positives = 31/94 (32%), Gaps = 3/94 (3%)

Query: 447 KEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLEREL---SRVRSILAHNSKKLEETV 503
            E LE+ L   R ++        E+    R+   LE+EL    +    L     +L +  
Sbjct: 452 IEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMR 511

Query: 504 AANARLESELVVLRQKLQWSRREVCNGTASLSNG 537
                 +   V + +KL     E       +  G
Sbjct: 512 KLELSGKGTPVKVVEKLTLEAIEEAEEEYGIKEG 545



 Score = 29.3 bits (66), Expect = 8.7
 Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 5/103 (4%)

Query: 471 VERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNG 530
           VER   +   L+REL  ++  +     +LE        +  + V   ++++   R +   
Sbjct: 431 VERLEEENSELKRELEELKREIEKLESELER---FRREVRDK-VRKDREIRARDRRIERL 486

Query: 531 TASLSNGPS-VAALEAELRRVQALVGDLQRQRQELSAQVKQLT 572
              L      V  LE +L  ++ +       +      V++LT
Sbjct: 487 EKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLT 529


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 30.1 bits (68), Expect = 4.4
 Identities = 23/133 (17%), Positives = 51/133 (38%), Gaps = 17/133 (12%)

Query: 464 VHASPAEVERYRRQQRLLER-----ELSRVRSILAHNSKKLEETVAANARLESELVVLRQ 518
            ++   E  R R++QRL ++       +        +S      + +  RL+S     R 
Sbjct: 178 WNSQGLEDYRKRKRQRLDQKAAEYLRQAAQAGTTGSSSADDLADLLSKFRLDSWDKGSRF 237

Query: 519 KLQWSRREVCNGTASLSNGPSVA---------ALEAELRRVQALVGDLQRQRQELSAQVK 569
                  +V     + +  P              EAEL  +Q  + +L+   +E+ +++K
Sbjct: 238 T---RTEKVEQAATAAAEVPPAEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIK 294

Query: 570 QLTEKSNSLSQQI 582
            L  K   +++++
Sbjct: 295 ALASKIKQVNEEL 307


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
          structure and biogenesis].
          Length = 429

 Score = 30.3 bits (69), Expect = 4.7
 Identities = 11/52 (21%), Positives = 24/52 (46%), Gaps = 11/52 (21%)

Query: 32 ALEAELRRVQALVGDLQRQRQELSAQVKQ-----------LTEKSNSLSQQI 72
           L+ E R++   + +LQ +R ELS ++ +           L  +   L +++
Sbjct: 33 ELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKL 84



 Score = 30.3 bits (69), Expect = 4.7
 Identities = 11/52 (21%), Positives = 24/52 (46%), Gaps = 11/52 (21%)

Query: 542 ALEAELRRVQALVGDLQRQRQELSAQVKQ-----------LTEKSNSLSQQI 582
            L+ E R++   + +LQ +R ELS ++ +           L  +   L +++
Sbjct: 33  ELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKL 84


>gnl|CDD|220180 pfam09325, Vps5, Vps5 C terminal like.  Vps5 is a sorting nexin
          that functions in membrane trafficking. This is the C
          terminal dimerisation domain.
          Length = 236

 Score = 29.5 bits (67), Expect = 5.2
 Identities = 9/40 (22%), Positives = 23/40 (57%)

Query: 30 VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLS 69
          + +LE++L+++   +  L  QR+EL++   +  +    L+
Sbjct: 33 IDSLESQLKKLYKALELLVNQRKELASATGEFAKSLAMLA 72



 Score = 29.5 bits (67), Expect = 5.2
 Identities = 9/40 (22%), Positives = 23/40 (57%)

Query: 540 VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLS 579
           + +LE++L+++   +  L  QR+EL++   +  +    L+
Sbjct: 33  IDSLESQLKKLYKALELLVNQRKELASATGEFAKSLAMLA 72


>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
           unknown].
          Length = 833

 Score = 30.0 bits (67), Expect = 5.8
 Identities = 22/164 (13%), Positives = 44/164 (26%)

Query: 19  NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGPTG 78
            G         +A L A        +GD+ +     ++   +  +K+     ++   P  
Sbjct: 210 AGITQHQAAEVLAQLVAAGVATAEQLGDVAQAAARFASASGESIDKTIKAFSKLTDDPVN 269

Query: 79  VAGKSAQGSTSPSGRSTAAETGTERTGQTADGEIQLFESADQARTHWSSGPGSTGVAGKS 138
            A    +     +          +R G TA       E+A       +    + G    +
Sbjct: 270 GAKALNRQFHYLTAAQLEQIAALQRAGDTAAAAAAAAEAAAAMDDRTARVKENMGTLETA 329

Query: 139 AQGSTSPSGRSTAAETGTERTGQTADGEIQLFESADQARTHWSN 182
                  + +   A  G  R  + A        +   AR     
Sbjct: 330 WDALADAAKKMWDAVLGIGREDKQAALLAAKLAAEKLARVTAQG 373


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 29.9 bits (67), Expect = 6.2
 Identities = 31/144 (21%), Positives = 60/144 (41%), Gaps = 12/144 (8%)

Query: 423 ISRDQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLE 482
             R++   K+            + K  LE  L  L+ ++    A    + R  +++  + 
Sbjct: 63  QVREEQEQKIQDAATEKTKEWEKVKAELEKKLLELQKELARAAAENDALSRSLQEKSKMI 122

Query: 483 RELSRVRSILAHNSKKLEETVAA----NARLESELVVLRQKLQW--SRREVCNGTAS--- 533
            ELS  +S      + L+  + +    N+ L+ E+ VL ++L+     R +   +A    
Sbjct: 123 MELSEEKSRAESEIEDLKARLESAEKENSSLKYEVHVLSKELEIRNEERNMSLQSADAAS 182

Query: 534 ---LSNGPSVAALEAELRRVQALV 554
              L     +A LEAE +R++ LV
Sbjct: 183 KQHLEGVKKIAKLEAECQRLRGLV 206


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 28.8 bits (65), Expect = 6.5
 Identities = 20/85 (23%), Positives = 36/85 (42%), Gaps = 1/85 (1%)

Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
           +  E  E+   +LR     +      +ER + Q   LEREL+ +++      KKL+    
Sbjct: 47  RDLEFRESLEETLRKLEAEIERLQNTIERLKTQLEDLERELALLQAKERQLEKKLKTLEQ 106

Query: 505 ANARLESELVVLRQKLQWSRREVCN 529
                + E+  L+  +Q  R+   N
Sbjct: 107 KLKNEKEEVQRLKNIIQ-QRKTQYN 130



 Score = 28.4 bits (64), Expect = 6.9
 Identities = 19/99 (19%), Positives = 43/99 (43%), Gaps = 9/99 (9%)

Query: 472 ERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGT 531
           +R    +  LE  L ++ + +      +E        LE EL +L+ K +   +++    
Sbjct: 46  DRDLEFRESLEETLRKLEAEIERLQNTIERLKTQLEDLERELALLQAKERQLEKKL---- 101

Query: 532 ASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQ 570
            +L        L+ E   VQ L   +Q+++ + + ++K+
Sbjct: 102 KTLEQ-----KLKNEKEEVQRLKNIIQQRKTQYNHELKK 135


>gnl|CDD|184543 PRK14157, PRK14157, heat shock protein GrpE; Provisional.
          Length = 227

 Score = 29.2 bits (65), Expect = 6.5
 Identities = 14/82 (17%), Positives = 27/82 (32%), Gaps = 2/82 (2%)

Query: 74  PGPTGVAGKSAQGSTSPSGRSTAAETGTERTGQTADGEIQLFESADQARTHWSSGPGSTG 133
           P    +  +      S    ++A     + +   AD       +   A     +    TG
Sbjct: 14  PDADDLEAQGQAAQASSGADASAESGEQQDSAAQADANAGADAAPAAAEGEAKAAAEKTG 73

Query: 134 VAGKSAQGSTSPSGRST--AAE 153
            A   +  + +P G++   AAE
Sbjct: 74  EAQSDSDDTLTPLGQAKKEAAE 95


>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
          subunit.  CENP-Q is one of the components that
          assembles onto the CENP-A-nucleosome distal (CAD)
          centromere. The centromere, which is the basic element
          of chromosome inheritance, is epigenetically determined
          in mammals. CENP-A, the centromere-specific histone H3
          variant, assembles an array of nucleosomes and it is
          this that seems to be the prime candidate for
          specifying centromere identity. CENP-A nucleosomes
          directly recruit a proximal CENP-A nucleosome
          associated complex (NAC) comprised of CENP-M, CENP-N
          and CENP-T, CENP-U(50), CENP-C and CENP-H. Assembly of
          the CENP-A NAC at centromeres is dependent on CENP-M,
          CENP-N and CENP-T. Additionally, there are seven other
          subunits which make up the CENP-A-nucleosome distal
          (CAD) centromere, CENP-K, CENP-L, CENP-O, CENP-P,
          CENP-Q, CENP-R and CENP-S, also assembling on the
          CENP-A NAC. Fta7 is the equivalent component of the
          fission yeast Sim4 complex.
          Length = 159

 Score = 28.9 bits (65), Expect = 6.7
 Identities = 13/46 (28%), Positives = 20/46 (43%)

Query: 29 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRP 74
           +  LE E+RR +A +       QEL    K L  +    S+ + P
Sbjct: 42 QLELLEEEIRREEAELEKDLEYLQELEKNAKALEREREEESKNLHP 87



 Score = 28.9 bits (65), Expect = 6.7
 Identities = 13/46 (28%), Positives = 20/46 (43%)

Query: 539 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRP 584
            +  LE E+RR +A +       QEL    K L  +    S+ + P
Sbjct: 42  QLELLEEEIRREEAELEKDLEYLQELEKNAKALEREREEESKNLHP 87


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
           binding.  This is the middle region of a family of TATA
           element modulatory factor 1 proteins conserved in
           eukaryotes that contains at its N-terminal section a
           number of leucine zippers that could potentially form
           coiled coil structures. The whole proteins bind to the
           TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells.
          Length = 74

 Score = 27.3 bits (61), Expect = 6.7
 Identities = 11/52 (21%), Positives = 24/52 (46%)

Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKL 520
            E E+  +++      + ++R+      K++ E      +LE EL  L ++L
Sbjct: 19  EEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENLEERL 70


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This
          family includes outer membrane proteins such as OmpH
          among others. Skp (OmpH) has been characterized as a
          molecular chaperone that interacts with unfolded
          proteins as they emerge in the periplasm from the Sec
          translocation machinery.
          Length = 140

 Score = 28.3 bits (64), Expect = 6.8
 Identities = 12/43 (27%), Positives = 24/43 (55%), Gaps = 3/43 (6%)

Query: 31 AALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
            LE E ++ QA   +L++  +EL    ++L + + +LS+  R
Sbjct: 21 KQLEKEFKKRQA---ELEKLEKELQKLKEKLQKDAATLSEAAR 60



 Score = 28.3 bits (64), Expect = 6.8
 Identities = 12/43 (27%), Positives = 24/43 (55%), Gaps = 3/43 (6%)

Query: 541 AALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
             LE E ++ QA   +L++  +EL    ++L + + +LS+  R
Sbjct: 21  KQLEKEFKKRQA---ELEKLEKELQKLKEKLQKDAATLSEAAR 60


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
          binding.  This is the C-terminal conserved coiled coil
          region of a family of TATA element modulatory factor 1
          proteins conserved in eukaryotes. The proteins bind to
          the TATA element of some RNA polymerase II promoters
          and repress their activity. by competing with the
          binding of TATA binding protein. TMF1_TATA_bd is the
          most conserved part of the TMFs. TMFs are
          evolutionarily conserved golgins that bind Rab6, a
          ubiquitous ras-like GTP-binding Golgi protein, and
          contribute to Golgi organisation in animal and plant
          cells. The Rab6-binding domain appears to be the same
          region as this C-terminal family.
          Length = 121

 Score = 28.3 bits (64), Expect = 7.1
 Identities = 15/68 (22%), Positives = 31/68 (45%), Gaps = 13/68 (19%)

Query: 19 NGTASLSNGPSVAALE---AELRR-------VQALVGDLQRQRQELSAQVKQLT---EKS 65
             ++   GP+V  +E   + LRR       ++  +  L+ +R E   ++ +LT   E+ 
Sbjct: 5  VSVSTSGAGPNVQLVERLSSTLRRLEGELASLKDELARLEAERDEARQEIVKLTEENEEL 64

Query: 66 NSLSQQIR 73
            L ++I 
Sbjct: 65 RELKKEIE 72



 Score = 28.3 bits (64), Expect = 7.1
 Identities = 15/68 (22%), Positives = 31/68 (45%), Gaps = 13/68 (19%)

Query: 529 NGTASLSNGPSVAALE---AELRR-------VQALVGDLQRQRQELSAQVKQLT---EKS 575
              ++   GP+V  +E   + LRR       ++  +  L+ +R E   ++ +LT   E+ 
Sbjct: 5   VSVSTSGAGPNVQLVERLSSTLRRLEGELASLKDELARLEAERDEARQEIVKLTEENEEL 64

Query: 576 NSLSQQIR 583
             L ++I 
Sbjct: 65  RELKKEIE 72


>gnl|CDD|223914 COG0845, AcrA, Membrane-fusion protein [Cell envelope biogenesis,
           outer membrane].
          Length = 372

 Score = 29.3 bits (65), Expect = 7.7
 Identities = 19/132 (14%), Positives = 41/132 (31%), Gaps = 14/132 (10%)

Query: 466 ASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRR 525
            +PAE+   +R+ +L   + +  ++ L      L    A     ++ L   +  L+++R 
Sbjct: 121 LAPAELGDLQREAKLAAEKAAVSQAELDAAQALLRAAEALVEAAQAALASAKLNLEYTRI 180

Query: 526 EVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPG 585
                          A +   +      VG L    Q L+         +  +   +   
Sbjct: 181 --------------TAPISGVIGARLVRVGQLVSAGQALATIADLDPAAALWVLAAVLER 226

Query: 586 PTGVAVAGKSVN 597
                  G+ V 
Sbjct: 227 DLLAVKVGQKVT 238


>gnl|CDD|222849 PHA02109, PHA02109, hypothetical protein.
          Length = 233

 Score = 28.9 bits (64), Expect = 8.2
 Identities = 17/66 (25%), Positives = 29/66 (43%), Gaps = 3/66 (4%)

Query: 465 HASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLES---ELVVLRQKLQ 521
            A P  VE +   +R+ + E S     L   + KL++      +LE+   E   ++ K+ 
Sbjct: 158 PAKPKAVEIHASTERIDQVERSHTGENLEGLTDKLKQISELTIKLEALSDEACQVKHKIL 217

Query: 522 WSRREV 527
             R EV
Sbjct: 218 NLRAEV 223


>gnl|CDD|182972 PRK11113, PRK11113, D-alanyl-D-alanine
           carboxypeptidase/endopeptidase; Provisional.
          Length = 477

 Score = 29.2 bits (66), Expect = 8.3
 Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 2/31 (6%)

Query: 244 LTERNEKIPLKF-VNDGD-YIQAVLDRNLAE 272
           L +R++ +PL F V DG  Y  A+L   L +
Sbjct: 235 LPQRSKPLPLAFAVQDGASYAGAILKDELKQ 265


>gnl|CDD|218237 pfam04738, Lant_dehyd_C, Lantibiotic dehydratase, C terminus.
           Lantibiotics are ribosomally synthesised antimicrobial
           agents derived from ribosomally synthesised peptides.
           They are produced by bacteria of the Firmicutes phylum,
           and include mutacin, subtilin, and nisin. Lantibiotic
           peptides contain thioether bridges termed lanthionines
           that are thought to be generated by dehydration of
           serine and threonine residues followed by addition of
           cysteine residues. This family constitutes the
           C-terminus of the enzyme proposed to catalyze the
           dehydration step.
          Length = 500

 Score = 29.3 bits (66), Expect = 8.6
 Identities = 25/115 (21%), Positives = 47/115 (40%), Gaps = 19/115 (16%)

Query: 201 YIRYVSARLADLETPKRMEHLQKLEELKRHL-----VELEKQVSITFNLTERNEKI---- 251
            + Y+  +L  L+ P+  E L  L E+++ +     + + +   +   L E+  ++    
Sbjct: 48  PLDYLIEKLEALDVPEANELLAALREIQKLIAEYAELPIGEGRKLLQELEEKMSQLTPSR 107

Query: 252 --PLK--FVNDGDYIQAVLDRNLAENISRVLYP----NDNNFGGKELRLKQEYFM 298
             PL      D + +   L  N+AE I+R        + N  G   LR   E F+
Sbjct: 108 RGPLYVDLRLDAEEVT--LGENVAEEIARAAELLLRLSPNPAGRPYLRDYHERFL 160


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 29.4 bits (65), Expect = 8.7
 Identities = 23/113 (20%), Positives = 37/113 (32%), Gaps = 6/113 (5%)

Query: 61  LTEKSNSLSQQIRPGPTGVAGKSAQGSTSPSGRSTAAE-TGTERTGQTADGEIQLFESAD 119
            T  + S +      PTG    +    TSP+ R+T+A    T  T            +  
Sbjct: 459 STGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQK 518

Query: 120 QARTHWSSGP-----GSTGVAGKSAQGSTSPSGRSTAAETGTERTGQTADGEI 167
            + T  ++ P     G T  A     G+TS    ++   T       T    +
Sbjct: 519 TSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVV 571


>gnl|CDD|226396 COG3879, COG3879, Uncharacterized protein conserved in bacteria
          [Function unknown].
          Length = 247

 Score = 28.9 bits (65), Expect = 9.3
 Identities = 9/38 (23%), Positives = 18/38 (47%)

Query: 32 ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLS 69
             A    +   +  LQ++   L+A+V+ L  K +S+ 
Sbjct: 47 VRRARDLDLVKELRSLQKKVNTLAAEVEDLENKLDSVR 84



 Score = 28.9 bits (65), Expect = 9.3
 Identities = 9/38 (23%), Positives = 18/38 (47%)

Query: 542 ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLS 579
              A    +   +  LQ++   L+A+V+ L  K +S+ 
Sbjct: 47  VRRARDLDLVKELRSLQKKVNTLAAEVEDLENKLDSVR 84


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 29.1 bits (65), Expect = 9.4
 Identities = 20/93 (21%), Positives = 35/93 (37%), Gaps = 6/93 (6%)

Query: 483 RELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAA 542
            E   ++       +KLEE       L  EL  L  + +  +  +       S       
Sbjct: 128 PEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSR------ 181

Query: 543 LEAELRRVQALVGDLQRQRQELSAQVKQLTEKS 575
           LE  L+++   V DL+++  EL   V+   E+ 
Sbjct: 182 LEEMLKKLPGEVYDLKKRWDELEPGVELPEEEL 214


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 29.2 bits (66), Expect = 9.4
 Identities = 27/139 (19%), Positives = 57/139 (41%), Gaps = 23/139 (16%)

Query: 447 KEMLENALGSLRHKMHGVHASPAEVERYRRQQRL--LERELSRVRSILAHNSKKLEETVA 504
           + +L +  GSL      + A   E E     +RL  LE EL+ +   +    ++ E+   
Sbjct: 179 ERVLSDQRGSLD----QLKAQIEEKEEKDLHERLNGLESELAELDEEIERYEEQREQARE 234

Query: 505 ANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQEL 564
                +  L    ++ +                  +  LEAE+  ++  + + +R+R+EL
Sbjct: 235 TRDEADEVLEEHEERRE-----------------ELETLEAEIEDLRETIAETEREREEL 277

Query: 565 SAQVKQLTEKSNSLSQQIR 583
           + +V+ L E+   L ++  
Sbjct: 278 AEEVRDLRERLEELEEERD 296


>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
           Provisional.
          Length = 475

 Score = 29.0 bits (65), Expect = 10.0
 Identities = 17/97 (17%), Positives = 41/97 (42%), Gaps = 2/97 (2%)

Query: 541 AALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR-PGPTGVAVAGKSVNRP 599
           A ++ +   ++  +  L +QR +   ++++L + + +L++Q++  G   V   G+ V + 
Sbjct: 79  AQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKALGANPVTATGEPVPQM 138

Query: 600 GPTGVAGKVSKSGLDPTGVAVDTRIPNPCSLSLGLYP 636
            P    G   +     T V+   +           YP
Sbjct: 139 -PASPPGPEGEPQPGNTPVSFPPQGSVAVPPPTAFYP 174


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.128    0.366 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,887,066
Number of extensions: 3617179
Number of successful extensions: 4704
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4535
Number of HSP's successfully gapped: 329
Length of query: 746
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 642
Effective length of database: 6,324,786
Effective search space: 4060512612
Effective search space used: 4060512612
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 63 (27.9 bits)