RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8682
         (1956 letters)



>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains 34
            amino acids
            [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
            X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in
            a variety of organisms including bacteria, cyanobacteria,
            yeast, fungi, plants, and humans in various subcellular
            locations; involved in a variety of functions including
            protein-protein interactions, but common features in the
            interaction partners have not been defined; involved in
            chaperone, cell-cycle, transciption, and protein
            transport complexes; the number of TPR motifs varies
            among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats
            generate a right-handed helical structure with an
            amphipathic channel that is thought to accomodate an
            alpha-helix of a target protein; it has been proposed
            that TPR proteins preferably interact with WD-40 repeat
            proteins, but in many instances several TPR-proteins seem
            to aggregate to multi-protein complexes; examples of
            TPR-proteins include, Cdc16p, Cdc23p and Cdc27p
            components of the cyclosome/APC, the Pex5p/Pas10p
            receptor for peroxisomal targeting signals, the Tom70p
            co-receptor for mitochondrial targeting signals, Ser/Thr
            phosphatase 5C and the p110 subunit of O-GlcNAc
            transferase; three copies of the repeat are present here.
          Length = 100

 Score = 83.2 bits (206), Expect = 8e-19
 Identities = 32/93 (34%), Positives = 52/93 (55%)

Query: 1767 NGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQL 1826
              LG L+    +YD+A++ +  AL++ PD++  +  L A+     + EEA+E Y  AL+L
Sbjct: 4    LNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL 63

Query: 1827 SPGFVRARYNLGITCVHLGANTQAVEHFLTALN 1859
             P   +A YNLG+    LG   +A+E +  AL 
Sbjct: 64   DPDNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96



 Score = 66.6 bits (163), Expect = 5e-13
 Identities = 24/93 (25%), Positives = 44/93 (47%), Gaps = 3/93 (3%)

Query: 1736 LSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPD 1795
                  + + L  Y  A    P    + D    L   +    +Y++A++ +  AL++ PD
Sbjct: 10   YYKLGDYDEALEYYEKALELDPD---NADAYYNLAAAYYKLGKYEEALEDYEKALELDPD 66

Query: 1796 DSRLWNRLGASLANGNRPEEAVEAYHTALQLSP 1828
            +++ +  LG +     + EEA+EAY  AL+L P
Sbjct: 67   NAKAYYNLGLAYYKLGKYEEALEAYEKALELDP 99



 Score = 64.7 bits (158), Expect = 2e-12
 Identities = 21/89 (23%), Positives = 42/89 (47%)

Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSI 1668
              LG    + GD   A+ Y E A + +PDNA+ + +L  +  +  +  +A+    K L +
Sbjct: 4    LNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL 63

Query: 1669 EPKNLEALMAISICFTNEACLHDALDTLK 1697
            +P N +A   + + +       +AL+  +
Sbjct: 64   DPDNAKAYYNLGLAYYKLGKYEEALEAYE 92



 Score = 48.5 bits (116), Expect = 1e-06
 Identities = 20/62 (32%), Positives = 26/62 (41%)

Query: 1799 LWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGANTQAVEHFLTAL 1858
                LG         +EA+E Y  AL+L P    A YNL      LG   +A+E +  AL
Sbjct: 2    ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61

Query: 1859 NQ 1860
              
Sbjct: 62   EL 63



 Score = 44.7 bits (106), Expect = 2e-05
 Identities = 16/63 (25%), Positives = 32/63 (50%)

Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSI 1668
            + L     + G    A+   E A + +PDNA+ + +LG++  +  +  +A+ A  K L +
Sbjct: 38   YNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97

Query: 1669 EPK 1671
            +P 
Sbjct: 98   DPN 100



 Score = 41.6 bits (98), Expect = 2e-04
 Identities = 24/109 (22%), Positives = 45/109 (41%), Gaps = 15/109 (13%)

Query: 736 NGLGVLFNLSDEYDKAVDCFRSALQVRPDFTELLVYLFSSLDFQPPKQKISDPDE-LRDY 794
             LG L+    +YD+A++ +  AL++ PD  +    L ++        K+   +E L DY
Sbjct: 4   LNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAY------YKLGKYEEALEDY 57

Query: 795 QHRKRKAFEDNIRKNRMVISNWIKYAQWEESQKQVDRARSIYERALDVD 843
                KA    +  +      +           + + A   YE+AL++D
Sbjct: 58  ----EKA----LELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELD 98



 Score = 40.8 bits (96), Expect = 5e-04
 Identities = 13/72 (18%), Positives = 27/72 (37%), Gaps = 3/72 (4%)

Query: 705 LSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPD 764
                 + + L  Y  A    P    + D    L   +    +Y++A++ +  AL++ PD
Sbjct: 10  YYKLGDYDEALEYYEKALELDPD---NADAYYNLAAAYYKLGKYEEALEDYEKALELDPD 66

Query: 765 FTELLVYLFSSL 776
             +    L  + 
Sbjct: 67  NAKAYYNLGLAY 78



 Score = 34.3 bits (79), Expect = 0.11
 Identities = 22/156 (14%), Positives = 46/156 (29%), Gaps = 56/156 (35%)

Query: 1640 EVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALMAISICFTNEACLHDALDTLKDK 1699
            E  L+LG    +     +A+    K L ++P N +A   ++  +       +AL+     
Sbjct: 1    EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALE----- 55

Query: 1700 IRPGQESNPRPSAYKADALPSKLTRLANHTLTFRSPLSSREIHQQVLSLYLNAARQCPSQ 1759
                                                             Y  A    P  
Sbjct: 56   ------------------------------------------------DYEKALELDPD- 66

Query: 1760 SIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPD 1795
              +      LG+ +    +Y++A++ +  AL++ P+
Sbjct: 67   --NAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100



 Score = 32.7 bits (75), Expect = 0.32
 Identities = 13/83 (15%), Positives = 27/83 (32%)

Query: 815 NWIKYAQWEESQKQVDRARSIYERALDVDHRNITLWLKYTELEMRNRQVNHARNLWDRAV 874
             +            D A   YE+AL++D  N   +        +  +   A   +++A+
Sbjct: 2   ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61

Query: 875 TILPRANQFWYKYTYMEEMLENV 897
            + P   + +Y        L   
Sbjct: 62  ELDPDNAKAYYNLGLAYYKLGKY 84



 Score = 31.2 bits (71), Expect = 1.2
 Identities = 11/60 (18%), Positives = 26/60 (43%), Gaps = 3/60 (5%)

Query: 705 LSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPD 764
                 +++ L  Y  A    P    +      LG+ +    +Y++A++ +  AL++ P+
Sbjct: 44  YYKLGKYEEALEDYEKALELDPD---NAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 62.7 bits (153), Expect = 5e-12
 Identities = 19/68 (27%), Positives = 34/68 (50%), Gaps = 1/68 (1%)

Query: 1762 DPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLAN-GNRPEEAVEAY 1820
            + +    LG       +YD+A++ +  AL++ PD++  +  L  +    G   EEA+E  
Sbjct: 2    NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61

Query: 1821 HTALQLSP 1828
              AL+L P
Sbjct: 62   EKALELDP 69



 Score = 55.4 bits (134), Expect = 2e-09
 Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 1/69 (1%)

Query: 1603 QNETNAFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAE-NEQDPQAIAA 1661
             N      LG    + GD   AI   E A + +PDNAE + +L ++  +  +   +A+  
Sbjct: 1    DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALED 60

Query: 1662 LSKCLSIEP 1670
            L K L ++P
Sbjct: 61   LEKALELDP 69



 Score = 50.0 bits (120), Expect = 1e-07
 Identities = 23/66 (34%), Positives = 34/66 (51%), Gaps = 1/66 (1%)

Query: 1795 DDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGAN-TQAVEH 1853
            D++     LG +L      +EA+EAY  AL+L P    A YNL +  + LG +  +A+E 
Sbjct: 1    DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALED 60

Query: 1854 FLTALN 1859
               AL 
Sbjct: 61   LEKALE 66



 Score = 40.0 bits (94), Expect = 5e-04
 Identities = 16/47 (34%), Positives = 26/47 (55%)

Query: 1637 DNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALMAISICF 1683
            DNAE   +LG +L +     +AI A  K L ++P N EA   +++ +
Sbjct: 1    DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAY 47



 Score = 33.8 bits (78), Expect = 0.074
 Identities = 11/42 (26%), Positives = 20/42 (47%)

Query: 731 DPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDFTELLVYL 772
           + +    LG       +YD+A++ +  AL++ PD  E    L
Sbjct: 2   NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNL 43



 Score = 33.4 bits (77), Expect = 0.097
 Identities = 10/54 (18%), Positives = 22/54 (40%), Gaps = 4/54 (7%)

Query: 711 HQQVLSLYLNAARQCPSQSIDPDVQNGLG-VLFNLSDEYDKAVDCFRSALQVRP 763
           + + +  Y  A    P    + +    L      L  +Y++A++    AL++ P
Sbjct: 19  YDEAIEAYEKALELDPD---NAEAYYNLALAYLKLGKDYEEALEDLEKALELDP 69



 Score = 32.7 bits (75), Expect = 0.18
 Identities = 11/64 (17%), Positives = 20/64 (31%), Gaps = 1/64 (1%)

Query: 816 WIKYAQWEESQKQVDRARSIYERALDVDHRNITLWLKYTE-LEMRNRQVNHARNLWDRAV 874
                         D A   YE+AL++D  N   +           +    A    ++A+
Sbjct: 6   LKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKAL 65

Query: 875 TILP 878
            + P
Sbjct: 66  ELDP 69



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 14/50 (28%), Positives = 29/50 (58%), Gaps = 4/50 (8%)

Query: 1111 LRHLEAAAKQEPDNAEVWLSLGISLAENEQLLGEFWEHNLGNMQQAFRMD 1160
            +   E A + +PDNAE + +L ++  +    LG+ +E  L ++++A  +D
Sbjct: 23   IEAYEKALELDPDNAEAYYNLALAYLK----LGKDYEEALEDLEKALELD 68


>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
          Length = 291

 Score = 68.3 bits (165), Expect = 7e-12
 Identities = 59/254 (23%), Positives = 103/254 (40%), Gaps = 21/254 (8%)

Query: 1578 FSDFSRNQRSSVSMHEYTFAEDNPMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQE-- 1635
              +       ++ + E                 L    L+ G L  A+  LE A + E  
Sbjct: 32   LLELLGELAEALELLEEALELLPNSDLAGLLLLLALALLKLGRLEEALELLEKALELELL 91

Query: 1636 PDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALMAISI-CFTNEACLHDALD 1694
            P+ AE  L+LG+ L    +  +A+  L K L+++P    A   +++          +AL+
Sbjct: 92   PNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALE 151

Query: 1695 TLKDKIRPGQESNPRPSAYKADALPSKLTRLANHTLTFRSPLSSREIHQQVLSLYLNAAR 1754
              +                KA  L  +L  LA   L   + L +   +++ L L   A +
Sbjct: 152  LYE----------------KALELDPELNELAEALLALGALLEALGRYEEALELLEKALK 195

Query: 1755 QCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPE 1814
              P    D +    LG+L+    +Y++A++ +  AL++ PD++     L   L    R E
Sbjct: 196  LNPD--DDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYNLALLLLELGRYE 253

Query: 1815 EAVEAYHTALQLSP 1828
            EA+EA   AL+L P
Sbjct: 254  EALEALEKALELDP 267



 Score = 56.8 bits (135), Expect = 3e-08
 Identities = 30/178 (16%), Positives = 61/178 (34%), Gaps = 12/178 (6%)

Query: 718 YLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDFTELLVYLFSSLD 777
            L  A          +    LG L+ L  +Y++A++ +  AL++ P+  EL   L +   
Sbjct: 117 LLEKALALDPDPDLAEALLALGALYEL-GDYEEALELYEKALELDPELNELAEALLALGA 175

Query: 778 FQPPKQKISDPDELRDYQHRKRKAFEDNIRKNRMVISNWIKYAQWEESQKQVDRARSIYE 837
                + +   +E  +      KA + N   +       +          + + A   YE
Sbjct: 176 LL---EALGRYEEALEL---LEKALKLNPDDDAEA---LLNLGLLYLKLGKYEEALEYYE 226

Query: 838 RALDVDHRNITLWLKYTELEMRNRQVNHARNLWDRAVTILPRANQF--WYKYTYMEEM 893
           +AL++D  N         L +   +   A    ++A+ + P              E +
Sbjct: 227 KALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPDLYNLGLALLLLLAEAL 284



 Score = 45.2 bits (105), Expect = 2e-04
 Identities = 25/86 (29%), Positives = 41/86 (47%)

Query: 1598 EDNPMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQ 1657
            + NP  +      LG   L+ G    A+ Y E A + +PDNAE   +L + L E  +  +
Sbjct: 195  KLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEE 254

Query: 1658 AIAALSKCLSIEPKNLEALMAISICF 1683
            A+ AL K L ++P      +A+ +  
Sbjct: 255  ALEALEKALELDPDLYNLGLALLLLL 280


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
            lipoprotein.  This protein family occurs in strictly
            within a subset of Gram-negative bacterial species with
            the proposed PEP-CTERM/exosortase system, analogous to
            the LPXTG/sortase system common in Gram-positive
            bacteria. This protein occurs in a species if and only if
            a transmembrane histidine kinase (TIGR02916) and a
            DNA-binding response regulator (TIGR02915) also occur.
            The present of tetratricopeptide repeats (TPR) suggests
            protein-protein interaction, possibly for the regulation
            of PEP-CTERM protein expression, since many PEP-CTERM
            proteins in these genomes are preceded by a proposed DNA
            binding site for the response regulator.
          Length = 899

 Score = 65.1 bits (159), Expect = 3e-10
 Identities = 69/299 (23%), Positives = 109/299 (36%), Gaps = 50/299 (16%)

Query: 1584 NQRSSVSMHEYTFAEDNPMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWL 1643
            N+  +V+  E     +   Q    A AL Q  L +G L  A+  L  AA   PD+ E WL
Sbjct: 548  NEEEAVAWLEKAAELNP--QEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAWL 605

Query: 1644 SLGISLAENEQDPQAIAALSKCLSIEPKNLEALMAISICFTNEACLHDALDTLKDKIRPG 1703
             LG +        +A+++  K L+++P +  AL            L DA   +K+     
Sbjct: 606  MLGRAQLAAGDLNKAVSSFKKLLALQPDSALAL----------LLLADAYAVMKN----- 650

Query: 1704 QESNPRPSAYKADALPSKLTRLANHTLTFRSPLSSREIHQQVL-SLYLNAARQCPSQSID 1762
                    A    +L   L    ++T             Q  L  L L A R   ++ I 
Sbjct: 651  -------YAKAITSLKRALELKPDNTEA-----------QIGLAQLLLAAKRTESAKKIA 692

Query: 1763 PDVQNG----------LGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNR 1812
              +Q             G L+    +Y  A+  +R AL+  P          A LA+GN 
Sbjct: 693  KSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPSSQNAIKLHRALLASGNT 752

Query: 1813 PEEAVEAYHTALQLSPGFVRARYNLGITCVHLGANTQAVEHFLTALNQQ---AATHDGL 1868
              EAV+     L+  P     R  L    +      +A++H+ T + +    A   + L
Sbjct: 753  A-EAVKTLEAWLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNNL 810



 Score = 62.8 bits (153), Expect = 2e-09
 Identities = 59/273 (21%), Positives = 92/273 (33%), Gaps = 43/273 (15%)

Query: 1603 QNETNAFAL---GQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGIS-LAENEQDPQA 1658
             +  +  AL   G+  L  GD   A  YL  A + +P+NA     LGIS L++ +   +A
Sbjct: 358  LDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAARTQLGISKLSQGDPS-EA 416

Query: 1659 IAALSKCLSIEPKNLEALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAYKADAL 1718
            IA L     ++P+   A + + + +        AL   K K+   Q  N           
Sbjct: 417  IADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAK-KLEKKQPDNASLHNLLGAIY 475

Query: 1719 PSKLTRLANHTLTFRSPLSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLS-- 1776
                             L   ++ +   +     A      SI+PD         NL+  
Sbjct: 476  -----------------LGKGDLAKAREA--FEKAL-----SIEPD---FFPAAANLARI 508

Query: 1777 ----DEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVR 1832
                   D A+  F   L + P + R    L          EEAV     A +L+P  + 
Sbjct: 509  DIQEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIE 568

Query: 1833 ARYNLGITCVHLGANTQAVEHFLTALNQQAATH 1865
                L    +  G   +A    L  LN+ A   
Sbjct: 569  PALALAQYYLGKGQLKKA----LAILNEAADAA 597



 Score = 57.0 bits (138), Expect = 8e-08
 Identities = 56/250 (22%), Positives = 91/250 (36%), Gaps = 28/250 (11%)

Query: 1608 AFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLS 1667
             +       ++ +   A   L+ A K  P+     L  G S  +     QA   L++ L 
Sbjct: 264  HYLKALVDFQKKNYEDARETLQDALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILK 323

Query: 1668 IEPKNLEALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAY-------------- 1713
              P + +A   ++        + +A+ TL   +      +P   A               
Sbjct: 324  YAPNSHQARRLLASIQLRLGRVDEAIATLSPAL----GLDPDDPAALSLLGEAYLALGDF 379

Query: 1714 -KADALPSKLTRL-----ANHTLTFRSPLSSREIHQQVLSLYLNAARQCPSQSIDPDVQN 1767
             KA    +K T L     A  T    S LS  +  + +  L    A Q   +    D+  
Sbjct: 380  EKAAEYLAKATELDPENAAARTQLGISKLSQGDPSEAIADL--ETAAQLDPELGRADLLL 437

Query: 1768 GLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLS 1827
             L  L   S ++DKA+   +   + +PD++ L N LGA         +A EA+  AL + 
Sbjct: 438  ILSYL--RSGQFDKALAAAKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIE 495

Query: 1828 PGFVRARYNL 1837
            P F  A  NL
Sbjct: 496  PDFFPAAANL 505



 Score = 56.2 bits (136), Expect = 2e-07
 Identities = 53/259 (20%), Positives = 88/259 (33%), Gaps = 32/259 (12%)

Query: 1612 GQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSI--E 1669
             +  L++    +AI+ L+ A +++P++AE    LG           A   L K LS+   
Sbjct: 29   AKSYLQKNKYKAAIIQLKNALQKDPNDAEARFLLGKIYLALGDYAAAEKELRKALSLGYP 88

Query: 1670 PKNLEALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAYKADALPSKLTRLANHT 1729
               +  L+A +     +      LD L     PG+       A +       L       
Sbjct: 89   KNQVLPLLARAYLLQGKF--QQVLDEL-----PGKTLLDDEGAAEL------LALRGLAY 135

Query: 1730 LTFRSPLSSREIHQQVLSLYLNAARQCPSQSIDP---DVQNGLGVLFNLSDEYDKAVDCF 1786
            L       ++         Y  A       +IDP     + GL  L    + +D+A    
Sbjct: 136  LGLGQLELAQ-------KSYEQA------LAIDPRSLYAKLGLAQLALAENRFDEARALI 182

Query: 1787 RSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGA 1846
               L   P +       G  L +    E A+ AY  A+ L P  +     L    +  G 
Sbjct: 183  DEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEAGE 242

Query: 1847 NTQAVEHFLTALNQQAATH 1865
              +A +H   AL ++A   
Sbjct: 243  FEEAEKH-ADALLKKAPNS 260



 Score = 53.9 bits (130), Expect = 8e-07
 Identities = 61/274 (22%), Positives = 101/274 (36%), Gaps = 36/274 (13%)

Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIE 1669
             LG+ +L  GDL  A+   +     +PD+A   L L  + A  +   +AI +L + L ++
Sbjct: 606  MLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELK 665

Query: 1670 PKNLEALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAYKADALPSKLTRL---- 1725
            P N EA + ++           A    K      Q+ +P     KA         L    
Sbjct: 666  PDNTEAQIGLAQLLLAAKRTESAKKIAKSL----QKQHP-----KAALGFELEGDLYLRQ 716

Query: 1726 --------ANHTLTFRSPLSSREIHQQVLSLYLNAARQCP--------SQSIDPD---VQ 1766
                    A      R+P S   I    L   L A+                 P+   ++
Sbjct: 717  KDYPAAIQAYRKALKRAPSSQNAIK---LHRALLASGNTAEAVKTLEAWLKTHPNDAVLR 773

Query: 1767 NGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQL 1826
              L  L+    +YDKA+  +++ ++  PD++ + N L         P  A+E    AL+L
Sbjct: 774  TALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNNLAWLYLELKDP-RALEYAERALKL 832

Query: 1827 SPGFVRARYNLGITCVHLGANTQAVEHFLTALNQ 1860
            +P        LG   V  G   +A+     A+N 
Sbjct: 833  APNIPAILDTLGWLLVEKGEADRALPLLRKAVNI 866



 Score = 52.4 bits (126), Expect = 2e-06
 Identities = 65/277 (23%), Positives = 95/277 (34%), Gaps = 40/277 (14%)

Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSI 1668
              L Q  L +     A   ++     +P N +  L  G  L        A+AA  K +++
Sbjct: 163  LGLAQLALAENRFDEARALIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIAL 222

Query: 1669 EPKN-----LEALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAYKADALPS--- 1720
             P N       A + I      EA  H   D L  K  P       P A+   AL     
Sbjct: 223  RPNNIAVLLALATILIEAGEFEEAEKH--ADALL-KKAPNS-----PLAHYLKALVDFQK 274

Query: 1721 -----------KLTRLANHTLTFRSPLSSREIHQQVLSL---YLNAARQCPSQSIDPDVQ 1766
                          + A   L       + E     L     YLN   +       P+  
Sbjct: 275  KNYEDARETLQDALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILK-----YAPNSH 329

Query: 1767 NGLGVLFNL---SDEYDKAVDCFRSALQVRPDDSRLWNRLG-ASLANGNRPEEAVEAYHT 1822
                +L ++       D+A+     AL + PDD    + LG A LA G+  E+A E    
Sbjct: 330  QARRLLASIQLRLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGD-FEKAAEYLAK 388

Query: 1823 ALQLSPGFVRARYNLGITCVHLGANTQAVEHFLTALN 1859
            A +L P    AR  LGI+ +  G  ++A+    TA  
Sbjct: 389  ATELDPENAAARTQLGISKLSQGDPSEAIADLETAAQ 425



 Score = 50.5 bits (121), Expect = 1e-05
 Identities = 56/218 (25%), Positives = 86/218 (39%), Gaps = 32/218 (14%)

Query: 1616 LRQGDLPSAILYLEAAAKQEPDNAEVWLSLG-ISLAENEQDPQAIAALSKCLSIEPKNLE 1674
            LR G    A+   +   K++PDNA +   LG I L + +   +A  A  K LSIEP    
Sbjct: 442  LRSGQFDKALAAAKKLEKKQPDNASLHNLLGAIYLGKGDLA-KAREAFEKALSIEPDFFP 500

Query: 1675 ALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAYKADALPSKLTRLANH--TLTF 1732
            A         N A +              QE NP  +  + + +   LT    +   +  
Sbjct: 501  AA-------ANLARIDI------------QEGNPDDAIQRFEKV---LTIDPKNLRAILA 538

Query: 1733 RSPLSSREIH-QQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQ 1791
             + L  R  + ++ ++    AA   P Q I+P +   L   +    +  KA+     A  
Sbjct: 539  LAGLYLRTGNEEEAVAWLEKAAELNP-QEIEPALA--LAQYYLGKGQLKKALAILNEAAD 595

Query: 1792 VRPDDSRLWNRLG-ASLANGNRPEEAVEAYHTALQLSP 1828
              PD    W  LG A LA G+   +AV ++   L L P
Sbjct: 596  AAPDSPEAWLMLGRAQLAAGDL-NKAVSSFKKLLALQP 632



 Score = 47.8 bits (114), Expect = 7e-05
 Identities = 51/225 (22%), Positives = 84/225 (37%), Gaps = 34/225 (15%)

Query: 1615 KLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLE 1674
            +LR G +  AI  L  A   +PD+      LG +        +A   L+K   ++P+N  
Sbjct: 339  QLRLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAA 398

Query: 1675 ALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAYKADALPSKLTRLANHTLTFRS 1734
            A   + I                 K+  G  S        A  L  +L R     L   S
Sbjct: 399  ARTQLGIS----------------KLSQGDPSEAIADLETAAQLDPELGRA--DLLLILS 440

Query: 1735 PLSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRP 1794
             L S +   + L+    A +    Q  +  + N LG ++    +  KA + F  AL + P
Sbjct: 441  YLRSGQF-DKALAA---AKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIEP 496

Query: 1795 DD--SRLWNRLGASLAN----GNRPEEAVEAYHTALQLSPGFVRA 1833
            D   +       A+LA        P++A++ +   L + P  +RA
Sbjct: 497  DFFPAA------ANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRA 535



 Score = 41.6 bits (98), Expect = 0.005
 Identities = 17/68 (25%), Positives = 28/68 (41%)

Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIE 1669
              G   L  G L  A    E A   +P +    L L        +  +A A + + L+ +
Sbjct: 130  LRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVLTAD 189

Query: 1670 PKNLEALM 1677
            P N++AL+
Sbjct: 190  PGNVDALL 197



 Score = 39.7 bits (93), Expect = 0.018
 Identities = 16/59 (27%), Positives = 27/59 (45%)

Query: 1616 LRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLE 1674
              +   P A+ Y E A K  P+   +  +LG  L E  +  +A+  L K ++I P+   
Sbjct: 814  YLELKDPRALEYAERALKLAPNIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAAA 872



 Score = 32.4 bits (74), Expect = 3.2
 Identities = 31/164 (18%), Positives = 61/164 (37%), Gaps = 13/164 (7%)

Query: 721 AARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDFTELLVYLFSSLDFQP 780
           A +    Q  +  + N LG ++    +  KA + F  AL + PDF      L + +D Q 
Sbjct: 454 AKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANL-ARIDIQE 512

Query: 781 PKQKISDPDELRDYQHRKRKAFEDNIRKNRMVISNWIKYAQWEESQKQVDRARSIYERAL 840
                 +PD+           FE  +  +   +   +  A         + A +  E+A 
Sbjct: 513 -----GNPDDAIQR-------FEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAA 560

Query: 841 DVDHRNITLWLKYTELEMRNRQVNHARNLWDRAVTILPRANQFW 884
           +++ + I   L   +  +   Q+  A  + + A    P + + W
Sbjct: 561 ELNPQEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAW 604


>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat. 
          Length = 65

 Score = 54.2 bits (131), Expect = 4e-09
 Identities = 23/64 (35%), Positives = 32/64 (50%)

Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSI 1668
             AL +  LR GD   A+  LEAA  + P  AE  L LG +L    +  +A A L   L+ 
Sbjct: 1    LALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAA 60

Query: 1669 EPKN 1672
            +P +
Sbjct: 61   DPDD 64



 Score = 47.3 bits (113), Expect = 1e-06
 Identities = 15/52 (28%), Positives = 21/52 (40%)

Query: 1778 EYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPG 1829
            +YD+A+    +AL   P  +     LG +L    R  EA      AL   P 
Sbjct: 12   DYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPD 63



 Score = 43.8 bits (104), Expect = 2e-05
 Identities = 14/57 (24%), Positives = 19/57 (33%)

Query: 1802 RLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGANTQAVEHFLTAL 1858
             L  +       +EA+ A   AL   P    A   LG   +  G   +A      AL
Sbjct: 2    ALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAAL 58



 Score = 33.4 bits (77), Expect = 0.10
 Identities = 12/55 (21%), Positives = 17/55 (30%)

Query: 826 QKQVDRARSIYERALDVDHRNITLWLKYTELEMRNRQVNHARNLWDRAVTILPRA 880
               D A +  E AL          L   E  +R  ++  A  L   A+   P  
Sbjct: 10  AGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPDD 64


>gnl|CDD|227343 COG5010, TadD, Flp pilus assembly protein TadD, contains TPR repeats
            [Intracellular trafficking and secretion].
          Length = 257

 Score = 55.1 bits (133), Expect = 8e-08
 Identities = 26/79 (32%), Positives = 38/79 (48%), Gaps = 2/79 (2%)

Query: 1762 DPDVQNGLGV-LFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAY 1820
            D ++    G       + + +AV   R A ++ P D   WN LGA+L    R +EA  AY
Sbjct: 99   DRELLAAQGKNQIRNGN-FGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAY 157

Query: 1821 HTALQLSPGFVRARYNLGI 1839
              AL+L+P       NLG+
Sbjct: 158  RQALELAPNEPSIANNLGM 176



 Score = 42.8 bits (101), Expect = 0.001
 Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 3/90 (3%)

Query: 1593 EYTFAEDNPMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAEN 1652
                A   P   E    A G+ ++R G+   A+  L  AA+  P + E W  LG +L + 
Sbjct: 89   LQKSAIAYPKDREL-LAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQL 147

Query: 1653 EQDPQAIAALSKCLSIEPKNLEAL--MAIS 1680
             +  +A  A  + L + P        + +S
Sbjct: 148  GRFDEARRAYRQALELAPNEPSIANNLGMS 177



 Score = 37.8 bits (88), Expect = 0.034
 Identities = 22/97 (22%), Positives = 34/97 (35%), Gaps = 16/97 (16%)

Query: 1602 MQNETNAFALGQEKLRQGDLPSAI-----LYLEAAAKQ-----------EPDNAEVWLSL 1645
             Q +  A ALG   LR  +  S       LYL   A              P + E+  + 
Sbjct: 47   RQTQGAAAALGAAVLRNPEDLSIAKLATALYLRGDADSSLAVLQKSAIAYPKDRELLAAQ 106

Query: 1646 GISLAENEQDPQAIAALSKCLSIEPKNLEALMAISIC 1682
            G +   N    +A++ L K   + P + EA   +   
Sbjct: 107  GKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAA 143



 Score = 34.3 bits (79), Expect = 0.56
 Identities = 23/105 (21%), Positives = 32/105 (30%), Gaps = 12/105 (11%)

Query: 1787 RSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGA 1846
            + +    P D  L    G +        EAV     A +L+P    A   LG     LG 
Sbjct: 90   QKSAIAYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGR 149

Query: 1847 NTQAVEHFLTALNQQAATHDGLTPHGLEPRAVKEMSDSIWYSLRG 1891
              +A   +  AL               EP     +   +   LRG
Sbjct: 150  FDEARRAYRQALELAPN----------EPSIANNL--GMSLLLRG 182


>gnl|CDD|214642 smart00386, HAT, HAT (Half-A-TPR) repeats.  Present in several
           RNA-binding proteins. Structurally and sequentially
           thought to be similar to TPRs.
          Length = 33

 Score = 46.0 bits (110), Expect = 2e-06
 Identities = 17/33 (51%), Positives = 23/33 (69%)

Query: 827 KQVDRARSIYERALDVDHRNITLWLKYTELEMR 859
             ++RAR IYERAL+   +++ LWLKY E E R
Sbjct: 1   GDIERARKIYERALEKFPKSVELWLKYAEFEER 33



 Score = 36.8 bits (86), Expect = 0.002
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 794 YQHRKRKAFEDNIRKNRMVISNWIKYAQWEES 825
              R RK +E  + K    +  W+KYA++EE 
Sbjct: 2   DIERARKIYERALEKFPKSVELWLKYAEFEER 33



 Score = 34.4 bits (80), Expect = 0.021
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 864 NHARNLWDRAVTILPRANQFWYKYTYMEE 892
             AR +++RA+   P++ + W KY   EE
Sbjct: 4   ERARKIYERALEKFPKSVELWLKYAEFEE 32


>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat. 
          Length = 78

 Score = 43.9 bits (104), Expect = 2e-05
 Identities = 22/73 (30%), Positives = 30/73 (41%), Gaps = 7/73 (9%)

Query: 1793 RPDDSRLWNRLGASLANGNRPEEAVEAYHTALQL-------SPGFVRARYNLGITCVHLG 1845
             PD +   N L   L      +EA+E    AL+L        P   RA  NL    + LG
Sbjct: 1    HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60

Query: 1846 ANTQAVEHFLTAL 1858
               +A+E+   AL
Sbjct: 61   DYDEALEYLEKAL 73



 Score = 38.9 bits (91), Expect = 0.002
 Identities = 17/72 (23%), Positives = 29/72 (40%), Gaps = 7/72 (9%)

Query: 1764 DVQNGLGVLFNLSDEYDKAVDCFRSALQVR-------PDDSRLWNRLGASLANGNRPEEA 1816
               N L ++     +YD+A++    AL++        P+ +R  N L          +EA
Sbjct: 6    AALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEA 65

Query: 1817 VEAYHTALQLSP 1828
            +E    AL L  
Sbjct: 66   LEYLEKALALRE 77



 Score = 33.1 bits (76), Expect = 0.16
 Identities = 15/70 (21%), Positives = 23/70 (32%), Gaps = 7/70 (10%)

Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQE-------PDNAEVWLSLGISLAENEQDPQAIAA 1661
              L     R GD   A+  LE A +         P+ A    +L           +A+  
Sbjct: 9    NNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEALEY 68

Query: 1662 LSKCLSIEPK 1671
            L K L++   
Sbjct: 69   LEKALALREA 78


>gnl|CDD|226687 COG4235, COG4235, Cytochrome c biogenesis factor [Posttranslational
            modification, protein turnover, chaperones].
          Length = 287

 Score = 44.3 bits (105), Expect = 4e-04
 Identities = 21/70 (30%), Positives = 34/70 (48%), Gaps = 3/70 (4%)

Query: 1611 LGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISL---AENEQDPQAIAALSKCLS 1667
            LG+  +  G    A+L    A +   DN E+ L L  +L   A  +   +A A L + L+
Sbjct: 162  LGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQMTAKARALLRQALA 221

Query: 1668 IEPKNLEALM 1677
            ++P N+ AL 
Sbjct: 222  LDPANIRALS 231



 Score = 40.1 bits (94), Expect = 0.008
 Identities = 20/95 (21%), Positives = 34/95 (35%), Gaps = 11/95 (11%)

Query: 1734 SPLSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVR 1793
                   ++Q V +    A R     +  P              E +  +    + LQ  
Sbjct: 104  LSYLLLGLYQAVGAPEQPADRLADPLAQPPA-----------EQEMEALIARLETHLQQN 152

Query: 1794 PDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSP 1828
            P D+  W+ LG +     R  +A+ AY  AL+L+ 
Sbjct: 153  PGDAEGWDLLGRAYMALGRASDALLAYRNALRLAG 187



 Score = 36.6 bits (85), Expect = 0.095
 Identities = 18/80 (22%), Positives = 31/80 (38%), Gaps = 8/80 (10%)

Query: 1601 PMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLG-ISLAENEQDPQAI 1659
             + +   A    ++++        I  LE   +Q P +AE W  LG   +A       A+
Sbjct: 124  RLADP-LAQPPAEQEMEA-----LIARLETHLQQNPGDAEGWDLLGRAYMALGRAS-DAL 176

Query: 1660 AALSKCLSIEPKNLEALMAI 1679
             A    L +   N E L+ +
Sbjct: 177  LAYRNALRLAGDNPEILLGL 196



 Score = 34.7 bits (80), Expect = 0.41
 Identities = 16/65 (24%), Positives = 26/65 (40%), Gaps = 3/65 (4%)

Query: 1778 EYDKAVDCFRSALQVRPDDSRLWNRLGASL---ANGNRPEEAVEAYHTALQLSPGFVRAR 1834
                A+  +R+AL++  D+  +   L  +L   A      +A      AL L P  +RA 
Sbjct: 171  RASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQMTAKARALLRQALALDPANIRAL 230

Query: 1835 YNLGI 1839
              L  
Sbjct: 231  SLLAF 235



 Score = 30.8 bits (70), Expect = 6.9
 Identities = 14/62 (22%), Positives = 19/62 (30%)

Query: 1618 QGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALM 1677
            Q     A   L  A   +P N      L  +  E     +A AA    L + P +     
Sbjct: 206  QQMTAKARALLRQALALDPANIRALSLLAFAAFEQGDYAEAAAAWQMLLDLLPADDPRRS 265

Query: 1678 AI 1679
             I
Sbjct: 266  LI 267


>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
            PilW.  Members of this family are designated PilF in ref
            (PMID:8973346) and PilW in ref (PMID:15612916). This
            outer membrane protein is required both for pilus
            stability and for pilus function such as adherence to
            human cells. Members of this family contain copies of the
            TPR (tetratricopeptide repeat) domain.
          Length = 234

 Score = 43.5 bits (103), Expect = 5e-04
 Identities = 20/87 (22%), Positives = 32/87 (36%)

Query: 1779 YDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLG 1838
             + A +    AL+  PDD   +  L          E+A +++  AL L+P       N G
Sbjct: 47   LEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYG 106

Query: 1839 ITCVHLGANTQAVEHFLTALNQQAATH 1865
                  G   QA++ F  A+       
Sbjct: 107  TFLCQQGKYEQAMQQFEQAIEDPLYPQ 133



 Score = 41.9 bits (99), Expect = 0.001
 Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 4/104 (3%)

Query: 1762 DPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYH 1821
            D      L + +    E +KA D FR AL + P++  + N  G  L    + E+A     
Sbjct: 64   DYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYEQA-MQQF 122

Query: 1822 TALQLSPGFV---RARYNLGITCVHLGANTQAVEHFLTALNQQA 1862
                  P +    R+  N G+  +  G   +A ++   AL    
Sbjct: 123  EQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDP 166



 Score = 40.0 bits (94), Expect = 0.006
 Identities = 18/67 (26%), Positives = 33/67 (49%)

Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIE 1669
             L    L QGDL  A   L+ A + +PD+   +L+L +   +  +  +A  +  + L++ 
Sbjct: 36   QLALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLN 95

Query: 1670 PKNLEAL 1676
            P N + L
Sbjct: 96   PNNGDVL 102



 Score = 38.9 bits (91), Expect = 0.016
 Identities = 33/150 (22%), Positives = 51/150 (34%), Gaps = 27/150 (18%)

Query: 731 DPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDFTELL----VYLFSSLDFQPPKQK-- 784
           D      L + +    E +KA D FR AL + P+  ++L     +L     ++   Q+  
Sbjct: 64  DYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYEQAMQQFE 123

Query: 785 --ISDPDELRDYQHRKRKAFEDNIRKNRMVISNWIKYAQWEESQKQVDRARSIYERALDV 842
             I DP     Y    R + E+                         D+A     RAL +
Sbjct: 124 QAIEDP----LYPQPAR-SLENAGLCAL--------------KAGDFDKAEKYLTRALQI 164

Query: 843 DHRNITLWLKYTELEMRNRQVNHARNLWDR 872
           D +     L+  EL     Q   AR   +R
Sbjct: 165 DPQRPESLLELAELYYLRGQYKDARAYLER 194



 Score = 30.8 bits (70), Expect = 6.8
 Identities = 17/61 (27%), Positives = 23/61 (37%)

Query: 1616 LRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEA 1675
            L+ GD   A  YL  A + +P   E  L L        Q   A A L +      +  E+
Sbjct: 146  LKAGDFDKAEKYLTRALQIDPQRPESLLELAELYYLRGQYKDARAYLERYQQTYNQTAES 205

Query: 1676 L 1676
            L
Sbjct: 206  L 206



 Score = 30.0 bits (68), Expect = 9.5
 Identities = 15/62 (24%), Positives = 27/62 (43%), Gaps = 2/62 (3%)

Query: 1617 RQGDLPSAILYLEAAAKQE--PDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLE 1674
            +QG    A+   E A +    P  A    + G+   +     +A   L++ L I+P+  E
Sbjct: 111  QQGKYEQAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQRPE 170

Query: 1675 AL 1676
            +L
Sbjct: 171  SL 172


>gnl|CDD|111114 pfam02184, HAT, HAT (Half-A-TPR) repeat.  The HAT (Half A TPR)
           repeat is found in several RNA processing proteins.
          Length = 32

 Score = 38.7 bits (90), Expect = 5e-04
 Identities = 14/33 (42%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 827 KQVDRARSIYERALDVDHRNITLWLKYTELEMR 859
           K++DRAR IYER + V H  +  W+K+   E  
Sbjct: 1   KEIDRARGIYERFVHV-HPEVQNWIKWARFEEE 32


>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat. 
          Length = 34

 Score = 38.2 bits (90), Expect = 9e-04
 Identities = 8/34 (23%), Positives = 17/34 (50%)

Query: 732 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDF 765
                 LG  +    +YD+A++ +  AL++ P+ 
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34



 Score = 37.8 bits (89), Expect = 0.001
 Identities = 8/34 (23%), Positives = 18/34 (52%)

Query: 1763 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDD 1796
                  LG  +    +YD+A++ +  AL++ P++
Sbjct: 1    AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34



 Score = 37.4 bits (88), Expect = 0.001
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 1797 SRLWNRLGASLANGNRPEEAVEAYHTALQLSPGF 1830
            ++    LG +     + +EA+E Y  AL+L+P  
Sbjct: 1    AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34



 Score = 33.2 bits (77), Expect = 0.045
 Identities = 8/34 (23%), Positives = 16/34 (47%)

Query: 1639 AEVWLSLGISLAENEQDPQAIAALSKCLSIEPKN 1672
            A+   +LG +  +  +  +A+    K L + P N
Sbjct: 1    AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34


>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat.  This Pfam entry includes
            outlying Tetratricopeptide-like repeats (TPR) that are
            not matched by pfam00515.
          Length = 34

 Score = 36.3 bits (85), Expect = 0.004
 Identities = 12/33 (36%), Positives = 15/33 (45%)

Query: 1798 RLWNRLGASLANGNRPEEAVEAYHTALQLSPGF 1830
                 LG +       EEA+EAY  AL+L P  
Sbjct: 2    EALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34



 Score = 32.1 bits (74), Expect = 0.14
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 1639 AEVWLSLGISLAENEQDPQAIAALSKCLSIEPKN 1672
            AE   +LG++  +     +A+ A  K L ++P N
Sbjct: 1    AEALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34



 Score = 29.4 bits (67), Expect = 1.0
 Identities = 9/30 (30%), Positives = 15/30 (50%)

Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQEPDN 1638
            + LG    + GD   A+   E A + +P+N
Sbjct: 5    YNLGLAYYKLGDYEEALEAYEKALELDPNN 34


>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats.  Repeats present in 4 or
            more copies in proteins. Contain a minimum of 34 amino
            acids each and self-associate via a "knobs and holes"
            mechanism.
          Length = 34

 Score = 36.3 bits (85), Expect = 0.004
 Identities = 8/34 (23%), Positives = 19/34 (55%)

Query: 1763 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDD 1796
             +    LG  +    +YD+A++ +  AL++ P++
Sbjct: 1    AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34



 Score = 36.3 bits (85), Expect = 0.004
 Identities = 8/34 (23%), Positives = 18/34 (52%)

Query: 732 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDF 765
            +    LG  +    +YD+A++ +  AL++ P+ 
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34



 Score = 34.3 bits (80), Expect = 0.018
 Identities = 10/34 (29%), Positives = 15/34 (44%)

Query: 1797 SRLWNRLGASLANGNRPEEAVEAYHTALQLSPGF 1830
            +     LG +       +EA+E Y  AL+L P  
Sbjct: 1    AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34



 Score = 32.8 bits (76), Expect = 0.067
 Identities = 9/34 (26%), Positives = 16/34 (47%)

Query: 1639 AEVWLSLGISLAENEQDPQAIAALSKCLSIEPKN 1672
            AE   +LG +  +     +A+    K L ++P N
Sbjct: 1    AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34


>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat. 
          Length = 44

 Score = 36.0 bits (83), Expect = 0.007
 Identities = 14/37 (37%), Positives = 18/37 (48%)

Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLG 1646
            AL +  L  GDL  A+  L  A   +PD+ E  L L 
Sbjct: 6    ALARALLALGDLDEALALLRRALALDPDDPEALLLLA 42



 Score = 34.8 bits (80), Expect = 0.016
 Identities = 9/41 (21%), Positives = 17/41 (41%)

Query: 816 WIKYAQWEESQKQVDRARSIYERALDVDHRNITLWLKYTEL 856
            +  A+   +   +D A ++  RAL +D  +    L    L
Sbjct: 4   LLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44



 Score = 34.1 bits (78), Expect = 0.029
 Identities = 10/41 (24%), Positives = 13/41 (31%)

Query: 1798 RLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLG 1838
                 L  +L      +EA+     AL L P    A   L 
Sbjct: 2    AALLALARALLALGDLDEALALLRRALALDPDDPEALLLLA 42



 Score = 33.3 bits (76), Expect = 0.070
 Identities = 11/42 (26%), Positives = 15/42 (35%)

Query: 1763 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLG 1804
            P     L        + D+A+   R AL + PDD      L 
Sbjct: 1    PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLA 42



 Score = 31.0 bits (70), Expect = 0.44
 Identities = 11/40 (27%), Positives = 21/40 (52%)

Query: 1639 AEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALMA 1678
                L+L  +L       +A+A L + L+++P + EAL+ 
Sbjct: 1    PAALLALARALLALGDLDEALALLRRALALDPDDPEALLL 40


>gnl|CDD|227518 COG5191, COG5191, Uncharacterized conserved protein, contains HAT
           (Half-A-TPR) repeat [General function prediction only].
          Length = 435

 Score = 40.7 bits (95), Expect = 0.008
 Identities = 25/117 (21%), Positives = 49/117 (41%), Gaps = 7/117 (5%)

Query: 782 KQKISDPDELRDYQHRKRKAFE----DNIRKNRMVISNWIKYAQWEESQKQVDRARSIYE 837
           ++K+       D    ++K FE     N   N   I  W +YA +   +K     ++I+ 
Sbjct: 74  RKKVGKKASFSDMSIPQKKIFELYRSTNKFFNDPKI--WSQYAAYVIKKKMYGEMKNIFA 131

Query: 838 RALDVDHRNITLWLKYTELEMR-NRQVNHARNLWDRAVTILPRANQFWYKYTYMEEM 893
             L     N+ LW+     E+     +  +R ++ + + +  R+ + W +Y  ME M
Sbjct: 132 ECLTKHPLNVDLWIYCCAFELFEIANIESSRAMFLKGLRMNSRSPRIWIEYFRMELM 188


>gnl|CDD|236875 PRK11189, PRK11189, lipoprotein NlpI; Provisional.
          Length = 296

 Score = 39.9 bits (94), Expect = 0.011
 Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 2/57 (3%)

Query: 1763 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPD-DSRLWNRLGASLANGNRPEEAVE 1818
             D  N LG+    +  +D A + F S L++ P  +    NR G +L  G R E A +
Sbjct: 98   ADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNR-GIALYYGGRYELAQD 153



 Score = 34.9 bits (81), Expect = 0.35
 Identities = 11/39 (28%), Positives = 19/39 (48%), Gaps = 2/39 (5%)

Query: 732 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRP--DFTEL 768
            D  N LG+    +  +D A + F S L++ P  ++  L
Sbjct: 98  ADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYL 136



 Score = 32.9 bits (76), Expect = 1.4
 Identities = 23/70 (32%), Positives = 33/70 (47%)

Query: 1786 FRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLG 1845
            F  AL +RPD +  +N LG  L      + A EA+ + L+L P +  A  N GI   + G
Sbjct: 87   FSQALALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNRGIALYYGG 146

Query: 1846 ANTQAVEHFL 1855
                A +  L
Sbjct: 147  RYELAQDDLL 156


>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat. 
          Length = 34

 Score = 34.5 bits (80), Expect = 0.016
 Identities = 8/34 (23%), Positives = 19/34 (55%)

Query: 1785 CFRSALQVRPDDSRLWNRLGASLANGNRPEEAVE 1818
             +  AL++ P+++  +  L   L N  + +EA++
Sbjct: 1    LYEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34



 Score = 32.9 bits (76), Expect = 0.065
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 1627 YLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIA 1660
              E A + +P+NAE + +L + L    Q  +A+ 
Sbjct: 1    LYEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34



 Score = 29.9 bits (68), Expect = 0.79
 Identities = 9/29 (31%), Positives = 15/29 (51%)

Query: 1113 HLEAAAKQEPDNAEVWLSLGISLAENEQL 1141
              E A + +P+NAE + +L + L    Q 
Sbjct: 1    LYEKALELDPNNAEAYYNLALLLLNLGQY 29


>gnl|CDD|198111 smart01043, BTAD, Bacterial transcriptional activator domain.  Found
            in the DNRI/REDD/AFSR family of regulators. This region
            of AFSR along with the C terminal region is capable of
            independently directing actinorhodin production. This
            family contains TPR repeats.
          Length = 145

 Score = 36.5 bits (85), Expect = 0.042
 Identities = 13/49 (26%), Positives = 21/49 (42%)

Query: 1778 EYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQL 1826
             +++A+      L + P   RL   L  +L    R  EA+ AY    +L
Sbjct: 76   RHEEALALLERLLALDPLRERLHRLLMRALYRAGRRAEALRAYRRLRRL 124


>gnl|CDD|233924 TIGR02552, LcrH_SycD, type III secretion low calcium response
            chaperone LcrH/SycD.  Genes in this family are found in
            type III secretion operons. LcrH, from Yersinia is
            believed to have a regulatory function in the low-calcium
            response of the secretion system. The same protein is
            also known as SycD (SYC = Specific Yop Chaperone) for its
            chaperone role. In Pseudomonas, where the homolog is
            known as PcrH, the chaperone role has been demonstrated
            and the regulatory role appears to be absent. ScyD/LcrH
            contains three central tetratricopeptide-like repeats
            that are predicted to fold into an all-alpha-helical
            array.
          Length = 135

 Score = 36.1 bits (84), Expect = 0.044
 Identities = 23/81 (28%), Positives = 35/81 (43%)

Query: 1778 EYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNL 1837
             YD+A+  F+      P +SR W  L A        EEA++AY  A  L P   R  ++ 
Sbjct: 32   RYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHA 91

Query: 1838 GITCVHLGANTQAVEHFLTAL 1858
                + LG    A++    A+
Sbjct: 92   AECLLALGEPESALKALDLAI 112



 Score = 36.1 bits (84), Expect = 0.049
 Identities = 20/84 (23%), Positives = 31/84 (36%), Gaps = 1/84 (1%)

Query: 1635 EPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALMAISICFTNEACLHDALD 1694
            +P N+  WL L       ++  +AI A +   +++P +       + C         AL 
Sbjct: 47   DPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHAAECLLALGEPESALK 106

Query: 1695 TLKDKIRPGQESNPRPSAYKADAL 1718
             L   I    E NP  S  K  A 
Sbjct: 107  ALDLAIEICGE-NPEYSELKERAE 129



 Score = 34.2 bits (79), Expect = 0.18
 Identities = 17/62 (27%), Positives = 29/62 (46%)

Query: 1768 GLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLS 1827
            GL     +  EY++A+D +  A  + PDD R +      L     PE A++A   A+++ 
Sbjct: 56   GLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHAAECLLALGEPESALKALDLAIEIC 115

Query: 1828 PG 1829
              
Sbjct: 116  GE 117


>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
            secretion / Intracellular trafficking and secretion].
          Length = 250

 Score = 37.4 bits (87), Expect = 0.049
 Identities = 30/110 (27%), Positives = 44/110 (40%), Gaps = 5/110 (4%)

Query: 1751 NAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANG 1810
             A    PS              +    E D A + +R AL + P++  + N  GA L   
Sbjct: 60   KALEHDPS---YYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQ 116

Query: 1811 NRPEEAVEAYHTALQLSPGFVRARY--NLGITCVHLGANTQAVEHFLTAL 1858
             RPEEA++ +  AL        +    NLG+  +  G   QA E+   AL
Sbjct: 117  GRPEEAMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQAEEYLKRAL 166



 Score = 33.2 bits (76), Expect = 1.1
 Identities = 27/157 (17%), Positives = 47/157 (29%), Gaps = 22/157 (14%)

Query: 720 NAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDFTELL----VYLFSS 775
            A    PS              +    E D A + +R AL + P+  ++L     +L + 
Sbjct: 60  KALEHDPS---YYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQ 116

Query: 776 LDFQPPKQKISDPDELRDYQHRKRKAFEDNIRKNRMVISNWIKYAQWEESQKQVDRARSI 835
              +   Q+         Y        E+                       Q D+A   
Sbjct: 117 GRPEEAMQQFERALADPAYGEPSD-TLEN---------LGLCAL-----KAGQFDQAEEY 161

Query: 836 YERALDVDHRNITLWLKYTELEMRNRQVNHARNLWDR 872
            +RAL++D +     L+   L  +      AR   +R
Sbjct: 162 LKRALELDPQFPPALLELARLHYKAGDYAPARLYLER 198



 Score = 30.9 bits (70), Expect = 6.7
 Identities = 21/70 (30%), Positives = 28/70 (40%), Gaps = 6/70 (8%)

Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGI---SLAENEQDPQAIAALSKCL 1666
             L    L+QGD   A   LE A + +P      L        L EN+    A  +  K L
Sbjct: 40   QLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGEND---LADESYRKAL 96

Query: 1667 SIEPKNLEAL 1676
            S+ P N + L
Sbjct: 97   SLAPNNGDVL 106


>gnl|CDD|188247 TIGR02795, tol_pal_ybgF, tol-pal system protein YbgF.  Members of
            this protein family are the product of one of seven genes
            regularly clustered in operons to encode the proteins of
            the tol-pal system, which is critical for maintaining the
            integrity of the bacterial outer membrane. The gene for
            this periplasmic protein has been designated orf2 and
            ybgF. All members of the seed alignment were from unique
            tol-pal gene regions from completed bacterial genomes.
            The architecture of this protein is a signal sequence, a
            low-complexity region usually rich in Asn and Gln, a
            well-conserved region with tandem repeats that resemble
            the tetratricopeptide (TPR) repeat, involved in
            protein-protein interaction.
          Length = 117

 Score = 35.3 bits (82), Expect = 0.057
 Identities = 23/76 (30%), Positives = 34/76 (44%), Gaps = 4/76 (5%)

Query: 1607 NA-FALGQEKLRQGDLPSAILYLEAAAKQEPDNA---EVWLSLGISLAENEQDPQAIAAL 1662
            NA + LG+    QGD   A     A  K+ P +    +  L LG+SL E     +A A L
Sbjct: 38   NAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQELGDKEKAKATL 97

Query: 1663 SKCLSIEPKNLEALMA 1678
             + +   P +  A +A
Sbjct: 98   QQVIKRYPGSSAAKLA 113


>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
          Length = 389

 Score = 37.1 bits (87), Expect = 0.091
 Identities = 16/52 (30%), Positives = 22/52 (42%)

Query: 1611 LGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAAL 1662
            L Q+ L +GDL +A   L+ A   +P      + LG           AI AL
Sbjct: 186  LAQQALARGDLDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEAL 237



 Score = 35.6 bits (83), Expect = 0.25
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 739 GVLFNLSDEYDKAVDCFRSALQVRPDFTEL 768
           G+ F L+++ DKA+D F   L+V P+  EL
Sbjct: 42  GLNFLLNEQPDKAIDLFIEMLKVDPETVEL 71



 Score = 33.2 bits (77), Expect = 1.2
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 1770 GVLFNLSDEYDKAVDCFRSALQVRPD 1795
            G+ F L+++ DKA+D F   L+V P+
Sbjct: 42   GLNFLLNEQPDKAIDLFIEMLKVDPE 67


>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
            repeats [General function prediction only].
          Length = 484

 Score = 37.0 bits (86), Expect = 0.10
 Identities = 19/74 (25%), Positives = 33/74 (44%)

Query: 1778 EYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNL 1837
            +YD+A+   +  +  +PD+       G  L   N+ +EA+E    AL L P     + NL
Sbjct: 321  QYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDPNSPLLQLNL 380

Query: 1838 GITCVHLGANTQAV 1851
                +  G   +A+
Sbjct: 381  AQALLKGGKPQEAI 394



 Score = 34.7 bits (80), Expect = 0.50
 Identities = 19/74 (25%), Positives = 34/74 (45%), Gaps = 4/74 (5%)

Query: 1599 DNPMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQA 1658
            DNP   E      G   L       AI  L+ A   +P++  + L+L  +L +  +  +A
Sbjct: 338  DNPYYLE----LAGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEA 393

Query: 1659 IAALSKCLSIEPKN 1672
            I  L++ L  +P++
Sbjct: 394  IRILNRYLFNDPED 407



 Score = 32.4 bits (74), Expect = 3.2
 Identities = 14/51 (27%), Positives = 21/51 (41%)

Query: 1611 LGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAA 1661
            L Q  L+ G    AI  L      +P++   W  L  + AE     +A+ A
Sbjct: 380  LAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRAEALLA 430



 Score = 30.8 bits (70), Expect = 8.8
 Identities = 15/62 (24%), Positives = 25/62 (40%)

Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSI 1668
            +    +    G    A+  L+     +PDN       G  L E  +  +AI  L K L++
Sbjct: 310  YGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALAL 369

Query: 1669 EP 1670
            +P
Sbjct: 370  DP 371


>gnl|CDD|227124 COG4785, NlpI, Lipoprotein NlpI, contains TPR repeats [General
            function prediction only].
          Length = 297

 Score = 36.7 bits (85), Expect = 0.11
 Identities = 27/86 (31%), Positives = 40/86 (46%)

Query: 1770 GVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPG 1829
            GVL++       A + F  AL +RPD   ++N LG  L      + A EA+ + L+L P 
Sbjct: 72   GVLYDSLGLRALARNDFSQALAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPT 131

Query: 1830 FVRARYNLGITCVHLGANTQAVEHFL 1855
            +  A  N GI   + G    A +  L
Sbjct: 132  YNYAHLNRGIALYYGGRYKLAQDDLL 157



 Score = 32.1 bits (73), Expect = 2.4
 Identities = 19/58 (32%), Positives = 31/58 (53%), Gaps = 4/58 (6%)

Query: 1763 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRP--DDSRLWNRLGASLANGNRPEEAVE 1818
            P+V N LG+    +  +D A + F S L++ P  + + L NR G +L  G R + A +
Sbjct: 99   PEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAHL-NR-GIALYYGGRYKLAQD 154


>gnl|CDD|234406 TIGR03939, PGA_TPR_OMP, poly-beta-1,6 N-acetyl-D-glucosamine export
            porin PgaA.  Members of this protein family are the
            poly-beta-1,6 N-acetyl-D-glucosamine (PGA) export porin
            PgaA of Gram-negative bacteria. There is no counterpart
            in the poly-beta-1,6 N-acetyl-D-glucosamine biosynthesis
            systems of Gram-positive bacteria such as Staphylococcus
            epidermidis. The PGA polysaccharide adhesin is a critical
            determinant of biofilm formation. The conserved
            C-terminal domain of this outer membrane protein is
            preceded by a variable number of TPR repeats.
          Length = 800

 Score = 36.6 bits (85), Expect = 0.16
 Identities = 14/69 (20%), Positives = 32/69 (46%)

Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIE 1669
            AL +    +     A+       +++P+N +  L L ++LA+  +D +A+  L + ++  
Sbjct: 91   ALAKAYRNEKQWDKALELYRKLLQRDPNNPDGLLGLALTLADAGKDAEALKYLKEYVARF 150

Query: 1670 PKNLEALMA 1678
            P +     A
Sbjct: 151  PTDAARYEA 159



 Score = 30.8 bits (70), Expect = 9.9
 Identities = 16/84 (19%), Positives = 27/84 (32%), Gaps = 4/84 (4%)

Query: 825 SQKQVDRARSIYERALDVDHRNITLWLKY--TELEMRNRQVNHARNLWDRAVTILPRANQ 882
           ++KQ D+A  +Y + L  D  N    L    T  +        A       V   P    
Sbjct: 98  NEKQWDKALELYRKLLQRDPNNPDGLLGLALTLADAGKDA--EALKYLKEYVARFPTDAA 155

Query: 883 FWYKYTYMEEMLENVAGKLFVFHR 906
            +    Y+    E+    L  + +
Sbjct: 156 RYEALAYVLRAAEDHLDALQAWQQ 179


>gnl|CDD|235419 PRK05339, PRK05339, PEP synthetase regulatory protein; Provisional.
          Length = 269

 Score = 35.8 bits (84), Expect = 0.17
 Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 14/72 (19%)

Query: 788 PDELRDYQHRKRKAF-----ED---NIRKNRMVISNWIKYAQWEESQKQVDRARSIYERA 839
           P+EL  +    +K F      +    IRK R+      +YA  E+ ++++  A  ++ R 
Sbjct: 182 PEEL--FPIDPKKIFGLTIDPERLIEIRKERLPNLGLSRYASLEQCREELAEAERLFRRE 239

Query: 840 ----LDVDHRNI 847
               +DV +++I
Sbjct: 240 GIPVIDVTNKSI 251


>gnl|CDD|205550 pfam13371, TPR_9, Tetratricopeptide repeat. 
          Length = 73

 Score = 31.8 bits (73), Expect = 0.38
 Identities = 15/63 (23%), Positives = 24/63 (38%)

Query: 1616 LRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEA 1675
            LR+ D   A+  +E      PD+       G+  A+      A+A L   L + P   +A
Sbjct: 6    LREEDFERALAVVERLLLLAPDDPYERRDRGLLYAQLGCFQAALADLEYFLELCPDAPDA 65

Query: 1676 LMA 1678
               
Sbjct: 66   ERI 68



 Score = 31.0 bits (71), Expect = 0.72
 Identities = 11/53 (20%), Positives = 21/53 (39%)

Query: 1776 SDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSP 1828
             +++++A+      L + PDD       G   A     + A+      L+L P
Sbjct: 8    EEDFERALAVVERLLLLAPDDPYERRDRGLLYAQLGCFQAALADLEYFLELCP 60


>gnl|CDD|224643 COG1729, COG1729, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 262

 Score = 34.3 bits (79), Expect = 0.49
 Identities = 21/76 (27%), Positives = 30/76 (39%), Gaps = 4/76 (5%)

Query: 1607 NA-FALGQEKLRQGDLPSAILYLEAAAKQEPDNA---EVWLSLGISLAENEQDPQAIAAL 1662
            NA + LG+    QGD   A        K  P +    +  L LG+SL       +A A L
Sbjct: 179  NAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGNTDEACATL 238

Query: 1663 SKCLSIEPKNLEALMA 1678
             + +   P    A +A
Sbjct: 239  QQVIKRYPGTDAAKLA 254


>gnl|CDD|203781 pfam07849, DUF1641, Protein of unknown function (DUF1641).
           Archaeal and bacterial hypothetical proteins are found
           in this family, with the region in question being
           approximately 40 residues long.
          Length = 42

 Score = 30.5 bits (70), Expect = 0.55
 Identities = 10/14 (71%), Positives = 11/14 (78%)

Query: 731 DPDVQNGLGVLFNL 744
           DPDVQ GLG L +L
Sbjct: 22  DPDVQRGLGFLLSL 35



 Score = 30.5 bits (70), Expect = 0.55
 Identities = 10/14 (71%), Positives = 11/14 (78%)

Query: 1762 DPDVQNGLGVLFNL 1775
            DPDVQ GLG L +L
Sbjct: 22   DPDVQRGLGFLLSL 35


>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat. 
          Length = 34

 Score = 30.1 bits (69), Expect = 0.55
 Identities = 8/34 (23%), Positives = 13/34 (38%)

Query: 1639 AEVWLSLGISLAENEQDPQAIAALSKCLSIEPKN 1672
            A  +  LG    +     +A     K L ++P N
Sbjct: 1    ARAYYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34


>gnl|CDD|185096 PRK15174, PRK15174, Vi polysaccharide export protein VexE;
            Provisional.
          Length = 656

 Score = 34.3 bits (78), Expect = 0.73
 Identities = 27/242 (11%), Positives = 61/242 (25%), Gaps = 34/242 (14%)

Query: 1618 QGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALM 1677
                 + +  +           E  L +   L +++Q         +       N   + 
Sbjct: 89   SSQPDAVLQVVNKLLAVNVCQPEDVLLVASVLLKSKQYATVADLAEQAWLAFSGNS-QIF 147

Query: 1678 AISICFTNEAC-LHDALDTLKD----KIRPGQE----------------SNPRPSAYKAD 1716
            A+ +           A+   +         G                   +   +     
Sbjct: 148  ALHLRTLVLMDKELQAISLARTQAQEVPPRGDMIATCLSFLNKSRLPEDHDLARALLPFF 207

Query: 1717 ALPSKLTRLANHTLTFRSPLSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVL-FNL 1775
            AL  +    A   +     +   +   Q     L       +      ++  LG+  +  
Sbjct: 208  ALERQ--ESAGLAVDTLCAVGKYQEAIQTGESALARGLDGAA------LRRSLGLAYYQS 259

Query: 1776 SDEYDKAVDC---FRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVR 1832
                +  +     +R ALQ   D+ R+      +L    + E+A+     +L   P    
Sbjct: 260  GRSREAKLQAAEHWRHALQFNSDNVRIVTLYADALIRTGQNEKAIPLLQQSLATHPDLPY 319

Query: 1833 AR 1834
             R
Sbjct: 320  VR 321


>gnl|CDD|220793 pfam10516, SHNi-TPR, SHNi-TPR.  SHNi-TPR family members contain a
            reiterated sequence motif that is an interrupted form of
            TPR repeat.
          Length = 38

 Score = 30.1 bits (69), Expect = 0.77
 Identities = 17/38 (44%), Positives = 22/38 (57%), Gaps = 2/38 (5%)

Query: 1639 AEVWLSLG-ISLAENEQDPQAIAALSKCLSIEPKNLEA 1675
            A+V+  LG ISL ENE  PQA+  L K L +  + L  
Sbjct: 1    ADVYDLLGEISL-ENENFPQAVEDLRKALELREQLLPP 37


>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis [Coenzyme
            metabolism].
          Length = 400

 Score = 33.5 bits (77), Expect = 1.2
 Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 1/65 (1%)

Query: 1762 DPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYH 1821
            DP + + LG L   +  + KA +   +AL++RP  S  +  L  +L     PEEA +   
Sbjct: 327  DPLLLSTLGRLALKNKLWGKASEALEAALKLRPSASD-YAELADALDQLGEPEEAEQVRR 385

Query: 1822 TALQL 1826
             AL L
Sbjct: 386  EALLL 390


>gnl|CDD|227438 COG5107, RNA14, Pre-mRNA 3'-end processing (cleavage and
           polyadenylation) factor [RNA processing and
           modification].
          Length = 660

 Score = 33.8 bits (77), Expect = 1.2
 Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 2/43 (4%)

Query: 816 WIKYAQWEESQKQVDRARSIYERALDVDHRNIT-LWLKYTELE 857
             +  +WEE Q+ +D+ R+ Y RAL     N+  LW  Y   E
Sbjct: 165 IEELGKWEEQQR-IDKIRNGYMRALQTPMGNLEKLWKDYENFE 206


>gnl|CDD|183140 PRK11447, PRK11447, cellulose synthase subunit BcsC; Provisional.
          Length = 1157

 Score = 33.9 bits (78), Expect = 1.3
 Identities = 15/38 (39%), Positives = 19/38 (50%), Gaps = 2/38 (5%)

Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPD--NAEVWLSL 1645
            ALGQ   +QGD   A+   E A   +P   N + W SL
Sbjct: 308  ALGQAYSQQGDRARAVAQFEKALALDPHSSNRDKWESL 345



 Score = 32.7 bits (75), Expect = 2.6
 Identities = 17/70 (24%), Positives = 25/70 (35%)

Query: 1804 GASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGANTQAVEHFLTALNQQAA 1863
              +L N  +  +A E     L L PG V   Y L       G  +QA         Q+  
Sbjct: 468  AEALENQGKWAQAAELQRQRLALDPGSVWLTYRLAQDLRQAGQRSQADALMRRLAQQKPN 527

Query: 1864 THDGLTPHGL 1873
              + +  +GL
Sbjct: 528  DPEQVYAYGL 537



 Score = 32.0 bits (73), Expect = 4.3
 Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 3/66 (4%)

Query: 1608 AFALGQEKLRQ---GDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSK 1664
            AF    + L     G    AI  L+ A +  P ++E   +LG + ++     +A+A   K
Sbjct: 269  AFRARAQGLAAVDSGQGGKAIPELQQAVRANPKDSEALGALGQAYSQQGDRARAVAQFEK 328

Query: 1665 CLSIEP 1670
             L+++P
Sbjct: 329  ALALDP 334


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
            receptor (72 kDa mitochondrial outermembrane protein)
            (mitochondrial import receptor for the ADP/ATP carrier)
            (translocase of outermembrane tom70).  [Transport and
            binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 33.4 bits (76), Expect = 1.3
 Identities = 32/162 (19%), Positives = 58/162 (35%), Gaps = 24/162 (14%)

Query: 1714 KADALPSKLTRLANHTLTFRSPLS------SREIHQQVLSLYLNAARQCPSQSIDPDVQN 1767
            K + LPS +T + N+  +FR          S E+ ++  +  L    + P    D     
Sbjct: 254  KPENLPS-VTFVGNYLQSFRPKPRPAGLEDSNELDEETGNGQLQLGLKSPESKADE---- 308

Query: 1768 GLGVLFNLSDEYDKAVDCFRSAL---QVRPDDSRLWNRLGASLANGNRPEEAVEAYHTAL 1824
                       Y++A   F  AL   ++   ++   N  G       +  EA+     ++
Sbjct: 309  ----------SYEEAARAFEKALDLGKLGEKEAIALNLRGTFKCLKGKHLEALADLSKSI 358

Query: 1825 QLSPGFVRARYNLGITCVHLGANTQAVEHFLTALNQQAATHD 1866
            +L P   ++        + LG   +A E F  AL   +   D
Sbjct: 359  ELDPRVTQSYIKRASMNLELGDPDKAEEDFDKALKLNSEDPD 400


>gnl|CDD|217642 pfam03618, Kinase-PPPase, Kinase/pyrophosphorylase.  This family of
           regulatory proteins has ADP-dependent kinase and
           inorganic phosphate-dependent pyrophosphorylase
           activity.
          Length = 255

 Score = 32.8 bits (76), Expect = 1.3
 Identities = 17/74 (22%), Positives = 36/74 (48%), Gaps = 16/74 (21%)

Query: 788 PDELRDYQHRKRKAF------E--DNIRKNRMVISNWI--KYAQWEESQKQVDRARSIYE 837
           P+EL  ++  ++K F      E    IR+ R+        +YA  E+ ++++  A  ++ 
Sbjct: 176 PEEL--FEVDRKKVFGLTIDPERLHEIRRERLPSLGLDESRYASLEQCREELAYAERLFR 233

Query: 838 RA----LDVDHRNI 847
           R     +DV +++I
Sbjct: 234 RLGIPVIDVTNKSI 247


>gnl|CDD|214817 smart00777, Mad3_BUB1_I, Mad3/BUB1 hoMad3/BUB1 homology region 1.
           Proteins containing this domain are checkpoint proteins
           involved in cell division. This region has been shown to
           be essential for the binding of the binding of BUB1 and
           MAD3 to CDC20p.
          Length = 124

 Score = 31.0 bits (71), Expect = 1.8
 Identities = 21/89 (23%), Positives = 39/89 (43%), Gaps = 22/89 (24%)

Query: 798 KRKAFEDNIR---KNRMVISNWIKYAQW-EESQKQVDRA---RSIYERALDV-------- 842
           +R+AFE  ++   +    +  W++Y +W EE+  Q  +     ++ ER +          
Sbjct: 3   QRQAFEAELQDLYEGDDPLDLWLRYIKWTEENYPQGGKESGLLTLLERCIRYFEDDERYK 62

Query: 843 -DHRNITLWLKYTELEMRNRQVNHARNLW 870
            D R + +WLKY E        +  R L+
Sbjct: 63  NDPRYLKIWLKYAE------YCDEPRELF 85


>gnl|CDD|179448 PRK02603, PRK02603, photosystem I assembly protein Ycf3; Provisional.
          Length = 172

 Score = 32.0 bits (73), Expect = 1.9
 Identities = 21/76 (27%), Positives = 38/76 (50%), Gaps = 5/76 (6%)

Query: 1778 EYDKAVDCFRSALQVRPDDSR----LWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRA 1833
            EY +A++ +  AL++  D +     L+N      +NG   ++A+E YH AL+L+P    A
Sbjct: 50   EYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEH-DKALEYYHQALELNPKQPSA 108

Query: 1834 RYNLGITCVHLGANTQ 1849
              N+ +     G   +
Sbjct: 109  LNNIAVIYHKRGEKAE 124


>gnl|CDD|182792 PRK10866, PRK10866, outer membrane biogenesis protein BamD;
            Provisional.
          Length = 243

 Score = 32.1 bits (73), Expect = 2.7
 Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 6/75 (8%)

Query: 1599 DNPMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQEP---DNAEVWLSLGISLAENEQD 1655
            DNP  +E   +A  Q+KL+ G+   AI  LEA   + P    + +V L L  +  +N   
Sbjct: 29   DNP-PSEI--YATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYKNADL 85

Query: 1656 PQAIAALSKCLSIEP 1670
            P A AA+ + + + P
Sbjct: 86   PLAQAAIDRFIRLNP 100


>gnl|CDD|176974 CHL00033, ycf3, photosystem I assembly protein Ycf3.
          Length = 168

 Score = 31.1 bits (71), Expect = 2.9
 Identities = 24/95 (25%), Positives = 43/95 (45%), Gaps = 9/95 (9%)

Query: 1778 EYDKAVDCFRSALQVRPDD---SRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRAR 1834
            EY +A+  +  A+++  D    S +   +G    +     +A+E Y  AL+ +P   +A 
Sbjct: 50   EYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFLPQAL 109

Query: 1835 YNLGITCVHLGANTQAVE--HFLTA--LNQQAATH 1865
             N+ + C + G   QA+E      A     QAA +
Sbjct: 110  NNMAVICHYRG--EQAIEQGDSEIAEAWFDQAAEY 142


>gnl|CDD|225463 COG2911, COG2911, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 1278

 Score = 32.3 bits (74), Expect = 3.1
 Identities = 18/80 (22%), Positives = 28/80 (35%), Gaps = 21/80 (26%)

Query: 1490 GPLFSNGVASINQQPVPNTNGNLHSLDRTLAVSK---LFKGPII-----------GTPST 1535
            G L  NG  ++         G   +  + L + K   LF GP+               +T
Sbjct: 1082 GNLGLNGQINL-------RRGRYRAYGQDLTIRKGTLLFSGPLDQPYLNIEAIRNPDSTT 1134

Query: 1536 TSGVSTSGTVDKPEPALWNE 1555
               V  +GT D P    ++E
Sbjct: 1135 AGAVRVTGTADSPRITFFSE 1154


>gnl|CDD|184854 PRK14852, PRK14852, hypothetical protein; Provisional.
          Length = 989

 Score = 32.4 bits (73), Expect = 3.1
 Identities = 20/70 (28%), Positives = 28/70 (40%)

Query: 1544 TVDKPEPALWNELNSHWKEMTESLGSGESLPHQWFSDFSRNQRSSVSMHEYTFAEDNPMQ 1603
            TVD P  AL  +L+    E+ E  G  E      F+ F+R         E     D P++
Sbjct: 187  TVDAPAVALRIDLHEAMDELQEKYGDEEDFETNLFAFFARMNSDETETSENPVRRDRPLE 246

Query: 1604 NETNAFALGQ 1613
              T  + L Q
Sbjct: 247  PYTAYYLLQQ 256


>gnl|CDD|215568 PLN03088, PLN03088, SGT1,  suppressor of G2 allele of SKP1;
            Provisional.
          Length = 356

 Score = 32.1 bits (73), Expect = 3.4
 Identities = 20/72 (27%), Positives = 39/72 (54%), Gaps = 2/72 (2%)

Query: 1775 LSDEYDKAVDCFRSALQVRPDDSRLW-NRLGASLANGNRPEEAVEAYHTALQLSPGFVRA 1833
            + D++  AVD +  A+ + P+++ L+ +R  A++  GN  E   +A + A++L P   +A
Sbjct: 14   VDDDFALAVDLYTQAIDLDPNNAELYADRAQANIKLGNFTEAVADA-NKAIELDPSLAKA 72

Query: 1834 RYNLGITCVHLG 1845
                G  C+ L 
Sbjct: 73   YLRKGTACMKLE 84


>gnl|CDD|218234 pfam04733, Coatomer_E, Coatomer epsilon subunit.  This family
            represents the epsilon subunit of the coatomer complex,
            which is involved in the regulation of intracellular
            protein trafficking between the endoplasmic reticulum and
            the Golgi complex.
          Length = 288

 Score = 31.7 bits (72), Expect = 3.7
 Identities = 15/55 (27%), Positives = 19/55 (34%)

Query: 1812 RPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGANTQAVEHFLTALNQQAATHD 1866
            R EEA      AL           NL +  +HLG   +     L+ L     TH 
Sbjct: 214  RYEEAESLLKEALDKDAKDPETLINLVVCALHLGKPAEVSNRNLSQLKLSHPTHP 268


>gnl|CDD|234074 TIGR02966, phoR_proteo, phosphate regulon sensor kinase PhoR.
            Members of this protein family are the regulatory
            histidine kinase PhoR associated with the phosphate ABC
            transporter in most Proteobacteria. Related proteins from
            Gram-positive organisms are not included in this model.
            The phoR gene usually is adjacent to the response
            regulator phoB gene (TIGR02154) [Signal transduction,
            Two-component systems].
          Length = 333

 Score = 31.4 bits (72), Expect = 4.6
 Identities = 22/97 (22%), Positives = 40/97 (41%), Gaps = 13/97 (13%)

Query: 1135 LAENEQLLGEFWEHNLGNM-QQAFRMDNLLAEMRD---IEAASQRPKPQQSAPISLLAR- 1189
            LA+      E W   L  M +Q+ RM +L+ ++     +E+A+   + +     +LL   
Sbjct: 139  LADGPDEDPEEWNRALEIMLEQSQRMQSLVEDLLTLSRLESAASPLEDEPVDMPALLDHL 198

Query: 1190 QDDAREWANQFLEAGSHFEHTPPETIWDNAPIMRGSE 1226
            +D+A        EA S  ++       D    + G E
Sbjct: 199  RDEA--------EALSQGKNHQITFEIDGGVDVLGDE 227


>gnl|CDD|234531 TIGR04282, glyco_like_cofC, transferase 1,
            rSAM/selenodomain-associated.  Members of this protein
            family show strongly correlated phylogenetic
            distribution, and in most cases co-clustering, with an
            unusual radical SAM enzyme (TIGR04167) whose C-terminal
            pfam12345 domain often contains a selenocysteine residue.
            Other members of the conserved gene neighborhood include
            another putative glycosyltransferase, an
            alkylhydroperoxidase family protein (TIGR04169), and a
            phosphoesterase family protein (TIGR04168). The cassette
            is likely to be biosynthetic but its exact function is
            unknown [Unknown function, Enzymes of unknown
            specificity].
          Length = 189

 Score = 30.9 bits (71), Expect = 4.9
 Identities = 17/60 (28%), Positives = 23/60 (38%), Gaps = 8/60 (13%)

Query: 1107 QSFLLRHLEAAAKQ-EPDNAEVWLSLGISLAENEQLLGEFWEH------NLGN-MQQAFR 1158
               LL H  A AK     + E+  S     A  +  LG   E       +LG  M +AF+
Sbjct: 30   YRRLLEHTLAVAKSLAGVDVELAYSPDGPHAAFDDWLGPGVELRLQGGGDLGERMARAFQ 89


>gnl|CDD|236624 PRK09782, PRK09782, bacteriophage N4 receptor, outer membrane
            subunit; Provisional.
          Length = 987

 Score = 31.8 bits (72), Expect = 5.2
 Identities = 13/48 (27%), Positives = 28/48 (58%)

Query: 1617 RQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSK 1664
            ++ ++P+A+  L AA + EP+N+    +LG +L ++    Q+   L +
Sbjct: 621  QRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDSGDIAQSREMLER 668


>gnl|CDD|213348 cd12788, RasGAP_plexin_D1, Ras-GTPase Activating Domain of
           plexin-D1.  Plexins form a conserved family of
           transmembrane receptors for semaphorins and may be the
           ancestors of semaphorins. Plexins are divided into four
           types (A-D) according to sequence similarity. Plexin-D1
           has been identified as the receptor of Sema3E. It binds
           to Sema3E directly with high affinity. Sema3E is
           implicated in axonal path finding and inhibition of
           developmental and postischemic angiogenesis. Plexin-D1
           is broadly expressed on tumor vessels and tumor cells in
           a number of different types of human tumors. The
           Plexin-D1 and Sema3E interaction inhibits tumor growth
           but promotes invasiveness and metastasis. Plexins
           contain a C-terminal RasGAP domain, which functions as
           an enhancer of the hydrolysis of GTP that is bound to
           Ras-GTPases. Plexins display GAP activity towards the
           Ras homolog Rap. Although the Rho (Ras homolog) GTPases
           are most closely related to members of the Ras family,
           RhoGAP and RasGAP show no sequence homology at their
           amino acid level. RasGTPases function as molecular
           switches in a large number of of signaling pathways.
           When bound to GTP they are in the on state and when
           bound to GDP they are in the off state. The RasGAP
           domain speeds up the hydrolysis of GTP in Ras-like
           proteins acting as a negative regulator.
          Length = 419

 Score = 31.5 bits (71), Expect = 5.4
 Identities = 16/40 (40%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 753 DCFRSALQVRPDFTELLV-YLFSSLDFQPPKQKISDPDEL 791
           D F++ L +  D   L V Y F  L+ Q  K+ I+DPD L
Sbjct: 215 DLFQAILSIPEDRPPLAVKYFFDFLEEQAEKRGITDPDTL 254


>gnl|CDD|234243 TIGR03523, GldN, gliding motility associated protien GldN.  Members
           of this protein family are exclusive to the
           Bacteroidetes phylum (previously
           Cytophaga-Flavobacteria-Bacteroides). GldN is a protein
           linked to a type of rapid surface gliding motility found
           in certain Bacteroidetes, such as Flavobacterium
           johnsoniae and Cytophaga hutchinsonii. Knockouts of GldN
           abolish the gliding phenotype. Gliding motility appears
           closely linked to chitin utilization in the model
           species Flavobacterium johnsoniae. Bacteroidetes with
           members of this protein also include those which are not
           believed to express the gliding phenotype, such as
           Prevotella intermedia and Porphyromonas gingivales.
          Length = 280

 Score = 31.3 bits (71), Expect = 5.5
 Identities = 16/76 (21%), Positives = 26/76 (34%), Gaps = 2/76 (2%)

Query: 579 NLPLFVNFPKHVPNARRSVNKQFYITANAENRYKVYDDEYLCRVLEATVFTPAKFIQEET 638
           NLPL+  +P      RRS+    +      +  +VY D+      +    T  +F     
Sbjct: 73  NLPLY--YPTDPVGDRRSLFDYIFKAIVEGSIIEVYADDGFEIFTDTEPITLQEFQDRFY 130

Query: 639 LESPHFEYFDITGIDP 654
           L     +  D   I  
Sbjct: 131 LYYEPSDRLDGDSIFS 146


>gnl|CDD|233834 TIGR02358, thia_cytX, putative hydroxymethylpyrimidine transporter
            CytX.  On the basis of a phylogenomic study of thiamine
            biosythetic, salvage, and transporter genes and a highly
            conserved RNA element THI, this protein family has been
            identified as a probable transporter of
            hydroxymethylpyrimidine (HMP), the phosphorylated (by
            ThiD) form of which gets joined (by ThiE) to
            hydroxyethylthiazole phosphate to make thiamine phosphate
            [Transport and binding proteins, Nucleosides, purines and
            pyrimidines, Biosynthesis of cofactors, prosthetic
            groups, and carriers, Thiamine].
          Length = 386

 Score = 31.3 bits (71), Expect = 5.5
 Identities = 24/111 (21%), Positives = 41/111 (36%), Gaps = 19/111 (17%)

Query: 910  SQVTLWLGALTLSLEGLGSNLQSWVRFLARDVGECCCGQVVSTVDSEVGGLSSKGLIEGT 969
                +WL   ++ L  + +     V F   D+ E              G +S    +E  
Sbjct: 131  PLAFVWLNNWSVWLLLIATLWLLVVVFSKADLSEL----WNRA-----GDMSFAVAVELV 181

Query: 970  VTTP-SYLPEIQEKSRQDSSRKPVVPRT-------NLIC--LGLSSSRATG 1010
            +  P S+LP I + +R   + + V   T       +     LGL+ + ATG
Sbjct: 182  IAMPLSWLPLIADYTRFARNPRHVFLGTVLGYFIGSCWMYFLGLAVTLATG 232


>gnl|CDD|225504 COG2956, COG2956, Predicted N-acetylglucosaminyl transferase
            [Carbohydrate transport and metabolism].
          Length = 389

 Score = 31.2 bits (71), Expect = 5.5
 Identities = 23/100 (23%), Positives = 38/100 (38%), Gaps = 5/100 (5%)

Query: 1769 LGVLFNLSDEYDKAVDCFRSALQVRPDDSRL-----WNRLGASLANGNRPEEAVEAYHTA 1823
            L  ++  + E++KA+D     +++     R+     +  L       +  + A E    A
Sbjct: 147  LLNIYQATREWEKAIDVAERLVKLGGQTYRVEIAQFYCELAQQALASSDVDRARELLKKA 206

Query: 1824 LQLSPGFVRARYNLGITCVHLGANTQAVEHFLTALNQQAA 1863
            LQ     VRA   LG   +  G   +AVE     L Q   
Sbjct: 207  LQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQNPE 246


>gnl|CDD|227310 COG4976, COG4976, Predicted methyltransferase (contains TPR repeat)
            [General function prediction only].
          Length = 287

 Score = 30.9 bits (70), Expect = 6.2
 Identities = 14/51 (27%), Positives = 24/51 (47%)

Query: 1778 EYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSP 1828
            + + A + +  AL++ P+ +  W RLG         + A  AY   L+L P
Sbjct: 10   DAEAAAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDP 60


>gnl|CDD|221954 pfam13170, DUF4003, Protein of unknown function (DUF4003).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 327 and 345 amino
           acids in length.
          Length = 297

 Score = 30.7 bits (70), Expect = 6.9
 Identities = 15/83 (18%), Positives = 37/83 (44%), Gaps = 8/83 (9%)

Query: 768 LLVYLFSSLDFQPPKQKISDPDELRDYQHRKRKAFEDNIRKNRMVISNWI--KYAQWEES 825
             +     + F+ P++ I +  ++  Y   K + F+   R   + ++  I  +Y + E+ 
Sbjct: 61  YSLASLLDIHFEDPEEAIQEILDI--YDKLKEEGFK---RSEYLYLAALIILEYREKEDY 115

Query: 826 QKQVDRARSIYERALDVDHRNIT 848
            + ++R + IY+      H  +T
Sbjct: 116 DEIINRTKEIYKLMKK-KHPFLT 137


>gnl|CDD|190425 pfam02786, CPSase_L_D2, Carbamoyl-phosphate synthase L chain, ATP
            binding domain.  Carbamoyl-phosphate synthase catalyzes
            the ATP-dependent synthesis of carbamyl-phosphate from
            glutamine or ammonia and bicarbonate. This important
            enzyme initiates both the urea cycle and the biosynthesis
            of arginine and/or pyrimidines. The carbamoyl-phosphate
            synthase (CPS) enzyme in prokaryotes is a heterodimer of
            a small and large chain. The small chain promotes the
            hydrolysis of glutamine to ammonia, which is used by the
            large chain to synthesise carbamoyl phosphate. See
            pfam00988. The small chain has a GATase domain in the
            carboxyl terminus. See pfam00117. The ATP binding domain
            (this one) has an ATP-grasp fold.
          Length = 211

 Score = 30.3 bits (69), Expect = 7.6
 Identities = 14/58 (24%), Positives = 21/58 (36%)

Query: 1492 LFSNGVASINQQPVPNTNGNLHSLDRTLAVSKLFKGPIIGTPSTTSGVSTSGTVDKPE 1549
            LF   +       VP T G + + +  LA +K    P+I   +   G    G     E
Sbjct: 4    LFKAAMKEAGVPTVPGTAGPVETEEEALAAAKEIGYPVIIKAAFGGGGLGMGIARNEE 61


>gnl|CDD|177845 PLN02194, PLN02194, cytochrome-c oxidase.
          Length = 265

 Score = 30.4 bits (68), Expect = 8.8
 Identities = 16/35 (45%), Positives = 20/35 (57%), Gaps = 2/35 (5%)

Query: 1408 WFSDFSRNQRSSVSMHVSLVFDLRPRYGPILFCVS 1442
            W+ D  R   S++  H + V  L PRYG ILF VS
Sbjct: 60   WWRDVLR--ESTLEGHHTKVVQLGPRYGSILFIVS 92



 Score = 30.4 bits (68), Expect = 9.8
 Identities = 14/26 (53%), Positives = 15/26 (57%)

Query: 269 ESAFLGQNASVFDLPPRYGPILFCVS 294
           ES   G +  V  L PRYG ILF VS
Sbjct: 67  ESTLEGHHTKVVQLGPRYGSILFIVS 92


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.132    0.399 

Gapped
Lambda     K      H
   0.267   0.0733    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 97,779,808
Number of extensions: 9517912
Number of successful extensions: 7776
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7716
Number of HSP's successfully gapped: 166
Length of query: 1956
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1845
Effective length of database: 6,014,308
Effective search space: 11096398260
Effective search space used: 11096398260
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 67 (29.6 bits)