RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17306
         (904 letters)



>gnl|CDD|193551 cd05676, M20_dipept_like_CNDP, M20 cytosolic nonspecific
           dipeptidases including anserinase and serum carnosinase.
            Peptidase M20 family, CNDP (cytosolic nonspecific
           dipeptidase) subfamily including anserinase
           (Xaa-methyl-His dipeptidase, EC 3.4.13.5), 'serum'
           carnosinase (beta-alanyl-L-histidine dipeptidase; EC
           3.4.13.20), and some uncharacterized proteins. Two
           genes, CN1 and CN2, coding for proteins that degrade
           carnosine (beta-alanyl-L-histidine) and homocarnosine
           (gamma-aminobutyric acid-L-histidine), two naturally
           occurring dipeptides with potential neuroprotective and
           neurotransmitter functions, have been identified. CN1
           encodes for serum carnosinase and has narrow substrate
           specificity for Xaa-His dipeptides, where Xaa can be
           beta-alanine (carnosine), N-methyl beta-alanine,
           alanine, glycine and gamma-aminobutyric acid
           (homocarnosine). CN2 corresponds to the cytosolic
           nonspecific dipeptidase (CNDP; EC 3.4.13.18) and is not
           limited to Xaa-His dipeptides. CNDP requires Mn(2+) for
           full activity and does not hydrolyze homocarnosine.
           Anserinase is a dipeptidase that mainly catalyzes the
           hydrolysis of N-alpha-acetylhistidine.
          Length = 466

 Score =  375 bits (964), Expect = e-121
 Identities = 155/322 (48%), Positives = 189/322 (58%), Gaps = 62/322 (19%)

Query: 302 KKEDGWNTEPFVLDFERRKALWRRGALVMMKRPVL------------------------- 336
           K EDGW+T+PF L  E+   L+ RG+    K PVL                         
Sbjct: 99  KLEDGWDTDPFTLT-EKDGKLYGRGS-TDDKGPVLGWLNAIEAYQKLGQDLPVNLKFCFE 156

Query: 337 --------GLASAIES-------------ISAN----QEK----------CAVNIKIECA 361
                   GL   IE+             IS N     +K          C   I++E A
Sbjct: 157 GMEESGSEGLDELIEARKDTFFSDVDYVCISDNYWLGTKKPCLTYGLRGICYFFIEVEGA 216

Query: 362 SKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEEQFYEKIDFD 421
            KDLHSG+FGGSVHE MTDLI +M  LV+ +GKILIP IY  V PL + E + YEKIDFD
Sbjct: 217 DKDLHSGVFGGSVHEPMTDLIALMSSLVDSDGKILIPGIYDLVAPLTEEEWELYEKIDFD 276

Query: 422 TEDFRTAIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIPGKVVGKFSIR 481
            E++R  I   +L   +K ++LM RWRYPSLS+HGIEGAFS PG KTVIP KV+GKFSIR
Sbjct: 277 MEEYREDIGVRRLLYDNKEELLMHRWRYPSLSIHGIEGAFSEPGAKTVIPAKVIGKFSIR 336

Query: 482 IVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSGKSWRTNPEHPNYVAAARATKY 541
           +VPN  P+ VEK V DYL +++    SPNK K Y+   GK W  +P+HPNY A  +ATK 
Sbjct: 337 LVPNMDPEQVEKLVTDYLEKVFAELGSPNKLKVYMGHGGKPWVADPDHPNYKAGRKATKR 396

Query: 542 VYNVEPDLTREGGSIPITLTFE 563
           V+ VEPDLTREGGSIPITLTF+
Sbjct: 397 VFGVEPDLTREGGSIPITLTFQ 418


>gnl|CDD|193555 cd05680, M20_dipept_like_3, Uncharacterized M20 Dipeptidases.
           Peptidase M20 family, unknown dipeptidase-like subfamily
           3 (inferred by homology to be dipeptidases). M20
           dipeptidases include a large variety of bacterial
           enzymes including cytosolic nonspecific dipeptidase
           (CNDP), Xaa-methyl-His dipeptidase (anserinase),and
           canosinase. These dipeptidases have been shown to act on
           a wide range of dipeptides, but not larger peptides. For
           example, anserinase mainly catalyzes the hydrolysis of
           N-alpha-acetylhistidine while carnosinase degrades
           beta-alanyl-L-histidine.
          Length = 437

 Score =  176 bits (449), Expect = 7e-48
 Identities = 72/217 (33%), Positives = 105/217 (48%), Gaps = 13/217 (5%)

Query: 352 CAVNIKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTE 411
             + I +  A +DLHSGL+GG+V   +  L  ++  L + +G+I IP  Y DV PL   E
Sbjct: 184 AYLEITVRGADRDLHSGLYGGAVPNPIHALARLLASLHDEDGRITIPGFYDDVRPLSAEE 243

Query: 412 EQFYEKIDFDTEDFRTAIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIP 471
              +  + FD E F   +    L        L   W  P+L ++GI G ++G G KTVIP
Sbjct: 244 RAAWAALPFDEEAFLAELGVSALAGEAGYTTLERLWARPTLEVNGIWGGYTGEGSKTVIP 303

Query: 472 GKVVGKFSIRIVPNQTPQ----CVEKYVLDYLNELWKARNSPN-KFKAYLLDSGKSWRTN 526
            +   K S+R+VP+Q P      +E +V        +A   P    +   L  G+ +   
Sbjct: 304 AEAHAKISMRLVPDQDPDKILELLEAHV--------RAHTPPGVTVEVTRLHGGRPYLVP 355

Query: 527 PEHPNYVAAARATKYVYNVEPDLTREGGSIPITLTFE 563
           P+HP   AA RA +  +   P   REGGSIPI   F+
Sbjct: 356 PDHPALQAARRALEEAFGKPPVFIREGGSIPIVGLFK 392


>gnl|CDD|193552 cd05677, M20_dipept_like_DUG2_type, M20 Defective in Utilization of
           Glutathione-type peptidases containing WD repeats.
           Peptidase M20 family, Defective in Utilization of
           Glutathione (DUG2) subfamily. DUG2-type proteins are
           metallopeptidases containing WD repeats at the
           N-terminus. DUG2 proteins are involved in the
           alternative pathway of glutathione (GSH) degradation.
           GSH, the major low-molecular-weight thiol compound in
           most eukaryotic cells, is normally degraded through the
           gamma-glutamyl cycle initiated by gamma-glutamyl
           transpeptidase. However, a novel pathway for the
           degradation of GSH has been characterized; it requires
           the participation of three genes identified in
           Saccharomyces cerevisiae as "defective in utilization of
           glutathione" genes including DUG1, DUG2, and DUG3. DUG1
           encodes a probable di- or tri-peptidase identified as
           M20 metallopeptidase, DUG2 gene encodes a protein with a
           metallopeptidase domain and a large N-terminal WD40
           repeat region, while DUG3 encodes a protein with a
           glutamine amidotransferase domain. Although dipeptides
           and tripeptides with a normal peptide bond, such as
           cys-gly or glu-cys-gly, can be hydrolyzed by the DUG1
           protein, the presence of an unusual peptide bond, like
           in GSH, requires the participation of the DUG2 and DUG3
           proteins as well. These three proteins form a GSH
           degradosomal complex.
          Length = 434

 Score =  161 bits (410), Expect = 7e-43
 Identities = 71/203 (34%), Positives = 109/203 (53%), Gaps = 12/203 (5%)

Query: 355 NIKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEEQF 414
            I +     DLHSG+ GG++ E   DLI ++G+L +P+G+ILIP  Y  V+P+ + EE  
Sbjct: 193 TISVSSDEPDLHSGVDGGALREPTADLILLLGKLQDPDGRILIPGFYDPVKPITEEEEAR 252

Query: 415 YEKIDFDTEDFRTAIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIPGKV 474
           +  I         A+     T       L+++WR PSL++H +    SGPG  TVIP + 
Sbjct: 253 FRDI------ASIALIPLPTTVDS----LIAKWRKPSLTIHSVN--VSGPGNSTVIPRRA 300

Query: 475 VGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSGKSWRTNPEHPNYVA 534
               SIR+VP+Q  + +++ + DYL E +    SPN     +L+  + W  +P++P +  
Sbjct: 301 SASVSIRLVPDQDLETIKQDLKDYLRECFAELKSPNHLDIEVLNEAEPWLGDPDNPAFQT 360

Query: 535 AARATKYVYNVEPDLTREGGSIP 557
              A    + VEP   REGGSIP
Sbjct: 361 LREAVTDAWGVEPLYIREGGSIP 383


>gnl|CDD|220112 pfam09110, HAND, HAND.  The HAND domain adopts a secondary
           structure consisting of four alpha helices, three of
           which (H2, H3, H4) form an L-like configuration. Helix
           H2 runs antiparallel to helices H3 and H4, packing
           closely against helix H4, whilst helix H1 reposes in the
           concave surface formed by these three helices and runs
           perpendicular to them. The domain confers DNA and
           nucleosome binding properties to the protein.
          Length = 109

 Score =  145 bits (368), Expect = 6e-41
 Identities = 53/108 (49%), Positives = 69/108 (63%), Gaps = 9/108 (8%)

Query: 805 AVDAYFKEALRTSEPKA---PKAPRPPKQPIVQDFQFFPPRLFEILDQEIYYFRKTVGYK 861
           +VD Y+K+ L T   KA   PKAPR PKQ  VQD QFFPPRL E+ ++E  Y++K +GYK
Sbjct: 1   SVDNYYKDVLNTGGRKASKKPKAPRAPKQVNVQDHQFFPPRLKELQEKEQLYYKKKIGYK 60

Query: 862 VPKN------PELGSDATKAQKEEQKKIDESEPLTEEELAEKEELLTQ 903
           VP +       E   +    +K EQ+KID +EPLTEEE  EK+ LL +
Sbjct: 61  VPLDDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLEE 108


>gnl|CDD|193513 cd03893, M20_Dipept_like, M20 Dipeptidases.  Peptidase M20 family,
           dipeptidase-like subfamily. This group contains a large
           variety of enzymes, including cytosolic nonspecific
           dipeptidase (CNDP), Xaa-methyl-His dipeptidase
           (anserinase), canosinase, DUG2 type proteins, as well as
           many proteins inferred by homology to be dipeptidases.
           These enzymes have been shown to act on a wide range of
           dipeptides, but not larger peptides. For example,
           anserinase mainly catalyzes the hydrolysis of
           N-alpha-acetylhistidine while carnosinase degrades
           beta-alanyl-L-histidine. Substrates of CNDP are varied
           and not limited to Xaa-His dipeptides. DUG2 proteins
           contain a metallopeptidase domain and a large N-terminal
           WD40 repeat region, and are involved in the alternative
           pathway of glutathione degradation.
          Length = 428

 Score =  155 bits (394), Expect = 9e-41
 Identities = 56/215 (26%), Positives = 97/215 (45%), Gaps = 18/215 (8%)

Query: 352 CAVNIKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTE 411
             V + +      +HSG+FGG+  +A+T L+ ++  L + +G++ +P +Y     L    
Sbjct: 184 ANVTVTVRTLDHAVHSGMFGGAAPDALTALVRLLATLHDEDGRVAVPGLY---SDLPWEG 240

Query: 412 EQFYEKIDFDTEDFRT---AIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKT 468
             + E      E+FR     +D  +L  A    +    W  P+L++ GI+ A    G   
Sbjct: 241 VDYPE------EEFRAEAGVLDGVEL--AGTGSIADRLWAKPALTVIGID-APPVAGASN 291

Query: 469 VIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSGKSWRTNPE 528
            +P     K S+R+VP Q P+  ++ +  +L           K      D+G+ +R + +
Sbjct: 292 AVPPSARAKLSLRLVPGQDPEEAQEALEAHLE---AHAPWGAKVTVTREDAGEPFRADTD 348

Query: 529 HPNYVAAARATKYVYNVEPDLTREGGSIPITLTFE 563
            P Y AA  A    + VEP L   GGSIP+    +
Sbjct: 349 GPAYQAARAALATAWGVEPVLIGSGGSIPLISDLQ 383


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score =  140 bits (355), Expect = 5e-34
 Identities = 67/180 (37%), Positives = 101/180 (56%), Gaps = 17/180 (9%)

Query: 729 ITDEDIDVILERCEAKTEELNKKFEQLGESSLRDFTLDAPTQSVYKFEGEDYREKQKI-- 786
           ITDEDID I+ + E  T EL+ K ++  E +++ F +D     +Y F+ ED +++ K+  
Sbjct: 655 ITDEDIDRIIAKGEEATAELDAKMKKFTEDAIK-FKMDDTA-ELYDFDDEDDKDENKLDF 712

Query: 787 --VPIGHWIEPPKRERKANYAVDAYFKEALRTSEPKAPKAPRPPKQPIVQDFQFF-PPRL 843
             +   +WI+PPKRERK NY+   YFK+A+R   P  PK PR P+ P + DFQFF   RL
Sbjct: 713 KKIVSDNWIDPPKRERKRNYSESEYFKQAMRQGAPAKPKEPRIPRMPQLHDFQFFNVQRL 772

Query: 844 FEILDQEIYYFRKTVGYKVPKNPELGSDATKAQKEEQKKIDESEPLTEEELAEKEELLTQ 903
            E+ ++E       V Y +  + +     T    E +   +  +PLT EE  EKE+LL +
Sbjct: 773 TELYEKE-------VRYLMQAHQKGQLKDTIDVAEPE---EPGDPLTAEEQEEKEQLLEE 822


>gnl|CDD|169276 PRK08201, PRK08201, hypothetical protein; Provisional.
          Length = 456

 Score =  131 bits (330), Expect = 2e-32
 Identities = 71/212 (33%), Positives = 106/212 (50%), Gaps = 5/212 (2%)

Query: 352 CAVNIKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTE 411
            A+ I +  A  DLHSGL+GG+V  A+  L+ ++  L + +G + +   Y  V PL   E
Sbjct: 200 AALEIDVRGAKGDLHSGLYGGAVPNALHALVQLLASLHDEHGTVAVEGFYDGVRPLTPEE 259

Query: 412 EQFYEKIDFDTEDFRTAIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIP 471
            + +  + FD E  +  +   +L   +    L   W  P+L L+G+ G F G G KTVIP
Sbjct: 260 REEFAALGFDEEKLKRELGVDELFGEEGYTALERTWARPTLELNGVYGGFQGEGTKTVIP 319

Query: 472 GKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPN-KFKAYLLDSGKSWRTNPEHP 530
            +   K + R+VP+Q PQ     +LD +    +A      +      D G ++    +HP
Sbjct: 320 AEAHAKITCRLVPDQDPQ----EILDLIEAHLQAHTPAGVRVTIRRFDKGPAFVAPIDHP 375

Query: 531 NYVAAARATKYVYNVEPDLTREGGSIPITLTF 562
              AAARA + VY  E   TR GGSIP+  TF
Sbjct: 376 AIQAAARAYEAVYGTEAAFTRMGGSIPVVETF 407


>gnl|CDD|235802 PRK06446, PRK06446, hypothetical protein; Provisional.
          Length = 436

 Score =  116 bits (293), Expect = 1e-27
 Identities = 64/248 (25%), Positives = 106/248 (42%), Gaps = 27/248 (10%)

Query: 308 NTEPFVLDF-ERRKA---LWRRGALVMMKRP--VLGLASAIESISANQEKCAVNIKIECA 361
           N E F+     + KA   +     L    RP  VLG+   +           V + +   
Sbjct: 142 NLEDFIEKNKNKLKADSVIMEGAGLDPKGRPQIVLGVKGLL----------YVELVLRTG 191

Query: 362 SKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEEQFYEKIDFD 421
           +KDLHS      V     DL+ ++  LV+  G++LIP  Y DV  L + E +  +K D D
Sbjct: 192 TKDLHS-SNAPIVRNPAWDLVKLLSTLVDGEGRVLIPGFYDDVRELTEEERELLKKYDID 250

Query: 422 TEDFRTAIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIPGKVVGKFSIR 481
            E+ R A+   +L  +D+ ++  +    P+ ++ G    ++G G KT++P +   K   R
Sbjct: 251 VEELRKALGFKELKYSDREKIAEALLTEPTCNIDGFYSGYTGKGSKTIVPSRAFAKLDFR 310

Query: 482 IVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSG--KSWRTNPEHPNYVAAARAT 539
           +VPNQ P  + + +  +L            F   ++  G     RT+       A   + 
Sbjct: 311 LVPNQDPYKIFELLKKHL--------QKVGFNGEIIVHGFEYPVRTSVNSKVVKAMIESA 362

Query: 540 KYVYNVEP 547
           K VY  EP
Sbjct: 363 KRVYGTEP 370


>gnl|CDD|193556 cd05681, M20_dipept_like_4, Uncharacterized M20 Dipeptidases.
           Peptidase M20 family, unknown dipeptidase-like subfamily
           3 (inferred by homology to be dipeptidases). M20
           dipeptidases include a large variety of bacterial
           enzymes including cytosolic nonspecific dipeptidase
           (CNDP), Xaa-methyl-His dipeptidase (anserinase),and
           canosinase. These dipeptidases have been shown to act on
           a wide range of dipeptides, but not larger peptides. For
           example, anserinase mainly catalyzes the hydrolysis of
           N-alpha-acetylhistidine while carnosinase degrades
           beta-alanyl-L-histidine.
          Length = 430

 Score =  115 bits (291), Expect = 2e-27
 Identities = 54/200 (27%), Positives = 92/200 (46%), Gaps = 9/200 (4%)

Query: 352 CAVNIKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTE 411
             V + ++ AS DLHS      V      L+  +  L +  G+ILIP  Y DV P  + E
Sbjct: 180 LYVELSVKGASADLHSSY-AAVVDNPAWRLVQALASLRDEGGRILIPGFYDDVRPPTEEE 238

Query: 412 EQFYEKIDFDTEDFRTA--IDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTV 469
           ++  +K+ FD E  + A  +  P L        L +    P+ +++G+   ++G G KT+
Sbjct: 239 KELIDKLPFDEEALKKAFGLKRP-LDGLTGRDALEALVFEPTCNINGLYSGYTGEGSKTI 297

Query: 470 IPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSGKSWRTNPEH 529
           +P +   K   R+VP+Q P+ V   +  +L+     +N  +  +   L      RT+ + 
Sbjct: 298 LPAEATAKLDFRLVPDQDPEEVLDLLRAHLD-----KNGFSDIEVTYLLGEPPARTDLDA 352

Query: 530 PNYVAAARATKYVYNVEPDL 549
           P   AA    K VY  +P +
Sbjct: 353 PLVQAALDTAKKVYGTDPVV 372


>gnl|CDD|236379 PRK09104, PRK09104, hypothetical protein; Validated.
          Length = 464

 Score = 97.3 bits (243), Expect = 4e-21
 Identities = 60/208 (28%), Positives = 92/208 (44%), Gaps = 13/208 (6%)

Query: 356 IKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEEQFY 415
           + I  A +DLHSGLFGG+    +  L  I+  L +  G++ +P  Y  VE L       +
Sbjct: 212 VTITAADRDLHSGLFGGAAANPIRVLTRILAGLHDETGRVTLPGFYDGVEELPPEILAQW 271

Query: 416 EKIDFDTEDFRTAID--HPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIPGK 473
           + + F  E F   +    P   K  ++ VL   W  P+  ++GI G ++G G KTVIP +
Sbjct: 272 KALGFTAEAFLGPVGLSIPAGEK-GRS-VLEQIWSRPTCEINGIWGGYTGEGFKTVIPAE 329

Query: 474 VVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPN---KFKAYLLDSGKSWRTNPEHP 530
              K S R+V  Q P  + +    Y+    +AR   +   +F  +      +     + P
Sbjct: 330 ASAKVSFRLVGGQDPAKIREAFRAYV----RARLPADCSVEFHDH--GGSPAIALPYDSP 383

Query: 531 NYVAAARATKYVYNVEPDLTREGGSIPI 558
              AA  A    +     L   GGSIPI
Sbjct: 384 ALAAAKAALSDEWGKPAVLIGSGGSIPI 411


>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
           34 amino acids
           [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
           X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
           in a variety of organisms including bacteria,
           cyanobacteria, yeast, fungi, plants, and humans in
           various subcellular locations; involved in a variety of
           functions including protein-protein interactions, but
           common features in the interaction partners have not
           been defined; involved in chaperone, cell-cycle,
           transciption, and protein transport complexes; the
           number of TPR motifs varies among proteins (1,3-11,13
           15,16,19); 5-6 tandem repeats generate a right-handed
           helical structure with an amphipathic channel that is
           thought to accomodate an alpha-helix of a target
           protein; it has been proposed that TPR proteins
           preferably interact with WD-40 repeat proteins, but in
           many instances several TPR-proteins seem to aggregate to
           multi-protein complexes; examples of TPR-proteins
           include, Cdc16p, Cdc23p and Cdc27p components of the
           cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
           targeting signals, the Tom70p co-receptor for
           mitochondrial targeting signals, Ser/Thr phosphatase 5C
           and the p110 subunit of O-GlcNAc transferase; three
           copies of the repeat are present here.
          Length = 100

 Score = 83.6 bits (207), Expect = 2e-19
 Identities = 47/111 (42%), Positives = 60/111 (54%), Gaps = 11/111 (9%)

Query: 586 EAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTA 645
           EA  NLGN+Y + G   EALE Y  A+ L PD  D Y NLAAA    G  E+A++     
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALE----- 55

Query: 646 LQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPN 696
                  Y KA+E  PD A A+ NLG  +   G+   A+  +EKA+ LDPN
Sbjct: 56  ------DYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100



 Score = 82.0 bits (203), Expect = 7e-19
 Identities = 39/91 (42%), Positives = 51/91 (56%)

Query: 77  EAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTA 136
           EA  NLGN+Y + G   EALE Y  A+ L PD  D Y NLAAA    G  E+A++ Y  A
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKA 60

Query: 137 LQYNPDLYCVRSDLGNLLKALGRLDEAKDLY 167
           L+ +PD      +LG     LG+ +EA + Y
Sbjct: 61  LELDPDNAKAYYNLGLAYYKLGKYEEALEAY 91



 Score = 70.1 bits (172), Expect = 1e-14
 Identities = 34/99 (34%), Positives = 57/99 (57%)

Query: 44  VLLLLSSIHFQCRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAV 103
            LL L +++++    D++  +   A++ +P  A+AY NL   Y + G+ +EALE+Y  A+
Sbjct: 2   ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61

Query: 104 RLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPD 142
            L PD    Y NL  A    G  E+A++AY  AL+ +P+
Sbjct: 62  ELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100



 Score = 67.4 bits (165), Expect = 1e-13
 Identities = 31/87 (35%), Positives = 49/87 (56%)

Query: 565 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 624
              D++  +   A++ +P  A+AY NL   Y + G+ +EALE+Y  A+ L PD    Y N
Sbjct: 14  GDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYN 73

Query: 625 LAAALVAAGDMEQAVQAYVTALQYNPS 651
           L  A    G  E+A++AY  AL+ +P+
Sbjct: 74  LGLAYYKLGKYEEALEAYEKALELDPN 100



 Score = 65.1 bits (159), Expect = 7e-13
 Identities = 30/74 (40%), Positives = 44/74 (59%)

Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 285
           A++ +P  A+AY NL   Y + G+ +EALE+Y  A+ L PD    Y NL  A    G  E
Sbjct: 26  ALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYE 85

Query: 286 QAVQAYVTALQYNP 299
           +A++AY  AL+ +P
Sbjct: 86  EALEAYEKALELDP 99



 Score = 62.4 bits (152), Expect = 6e-12
 Identities = 29/96 (30%), Positives = 45/96 (46%), Gaps = 11/96 (11%)

Query: 622 YINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIW 681
            +NL       GD ++A++            Y KA+E  PD A A+ NL   +   G+  
Sbjct: 3   LLNLGNLYYKLGDYDEALE-----------YYEKALELDPDNADAYYNLAAAYYKLGKYE 51

Query: 682 LAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDRA 717
            A+  +EKA+ LDP+   AY NLG    +   ++ A
Sbjct: 52  EALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEA 87



 Score = 62.4 bits (152), Expect = 6e-12
 Identities = 31/65 (47%), Positives = 39/65 (60%)

Query: 235 EAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTA 294
           EA  NLGN+Y + G   EALE Y  A+ L PD  D Y NLAAA    G  E+A++ Y  A
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKA 60

Query: 295 LQYNP 299
           L+ +P
Sbjct: 61  LELDP 65



 Score = 48.9 bits (117), Expect = 4e-07
 Identities = 18/52 (34%), Positives = 29/52 (55%)

Query: 666 AWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDRA 717
           A  NLG ++   G+   A+ ++EKA+ LDP+  DAY NL     +   ++ A
Sbjct: 2   ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEA 53



 Score = 45.1 bits (107), Expect = 8e-06
 Identities = 35/154 (22%), Positives = 56/154 (36%), Gaps = 56/154 (36%)

Query: 113 YINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSD 172
            +NL       GD ++A++ Y  AL+ +PD                      D Y    +
Sbjct: 3   LLNLGNLYYKLGDYDEALEYYEKALELDPD--------------------NADAYY---N 39

Query: 173 LGNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPL 232
           L      LG+ +EA   + +                                 A++ +P 
Sbjct: 40  LAAAYYKLGKYEEALEDYEK---------------------------------ALELDPD 66

Query: 233 LAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 266
            A+AY NLG  Y + G+ +EALE Y  A+ L P+
Sbjct: 67  NAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 76.7 bits (189), Expect = 3e-14
 Identities = 69/287 (24%), Positives = 115/287 (40%), Gaps = 40/287 (13%)

Query: 11  LLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIK 70
           LL LA       +Y  A     +    + +NT   + L+ +    ++ + +   +    K
Sbjct: 638 LLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKSLQK 697

Query: 71  QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAV 130
           Q+P  A  +   G++Y  +     A++ YR A++  P      I L  AL+A+G+  +AV
Sbjct: 698 QHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPS-SQNAIKLHRALLASGNTAEAV 756

Query: 131 QAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEAKNLH 190
           +     L+ +P+   +R+ L  L  A    D+A                       K+  
Sbjct: 757 KTLEAWLKTHPNDAVLRTALAELYLAQKDYDKA----------------------IKHYQ 794

Query: 191 TENIKPVTMKVQ--NAIVCN-----YGGRK-PTTLESAHFSTLAIKQNPLLAEAYSNLGN 242
           T       +K    NA+V N     Y   K P  LE A     A+K  P +      LG 
Sbjct: 795 T------VVKKAPDNAVVLNNLAWLYLELKDPRALEYA---ERALKLAPNIPAILDTLGW 845

Query: 243 VYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 289
           +  E+G+   AL   R AV + P+      +LA AL+A G   +A +
Sbjct: 846 LLVEKGEADRALPLLRKAVNIAPEAAAIRYHLALALLATGRKAEARK 892



 Score = 72.8 bits (179), Expect = 5e-13
 Identities = 64/330 (19%), Positives = 114/330 (34%), Gaps = 58/330 (17%)

Query: 22  GDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIKQNPLLAEAYSN 81
           G  + A          + ++   L LL   +      +K+A +   A + +P  A A + 
Sbjct: 343 GRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAARTQ 402

Query: 82  LGNVYKERGQLQEALENYRHAVRLKPDFID------------------------------ 111
           LG     +G   EA+ +   A +L P+                                 
Sbjct: 403 LGISKLSQGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQP 462

Query: 112 ----GYINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLY 167
                +  L A  +  GD+ +A +A+  AL   PD +   ++L  +    G  D+A   +
Sbjct: 463 DNASLHNLLGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRF 522

Query: 168 --CVRSDLGNL--LKALGRLDEAKN--------------LHTENIKPVTMKVQNAIVCNY 209
              +  D  NL  + AL  L                   L+ + I+P     Q  +    
Sbjct: 523 EKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQ 582

Query: 210 GGRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID 269
             +    L  A     A   +P   EA+  LG      G L +A+ +++  + L+PD   
Sbjct: 583 LKKALAILNEA---ADAAPDSP---EAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSAL 636

Query: 270 GYINLAAALVAAGDMEQAVQAYVTALQYNP 299
             + LA A     +  +A+ +   AL+  P
Sbjct: 637 ALLLLADAYAVMKNYAKAITSLKRALELKP 666



 Score = 71.7 bits (176), Expect = 1e-12
 Identities = 68/300 (22%), Positives = 120/300 (40%), Gaps = 45/300 (15%)

Query: 11  LLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIK 70
              LA  + Q G+ + A +   ++   +  N   +L L+ ++ +    +++  +   A +
Sbjct: 502 AANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAE 561

Query: 71  QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAV 130
            NP   E    L   Y  +GQL++AL     A    PD  + ++ L  A +AAGD+ +AV
Sbjct: 562 LNPQEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNKAV 621

Query: 131 QAYVTALQYNPD----------LYCVRSDLGNLLKALGR-LDEAKDLYCVRSDLGNLLKA 179
            ++   L   PD           Y V  +    + +L R L+   D    +  L  LL A
Sbjct: 622 SSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLA 681

Query: 180 LGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEAYSN 239
             R + AK +                           L+         KQ+P  A  +  
Sbjct: 682 AKRTESAKKIAKS------------------------LQ---------KQHPKAALGFEL 708

Query: 240 LGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
            G++Y  +     A++ YR A++  P      I L  AL+A+G+  +AV+     L+ +P
Sbjct: 709 EGDLYLRQKDYPAAIQAYRKALKRAPS-SQNAIKLHRALLASGNTAEAVKTLEAWLKTHP 767



 Score = 65.9 bits (161), Expect = 8e-11
 Identities = 57/244 (23%), Positives = 90/244 (36%), Gaps = 54/244 (22%)

Query: 68  AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 127
           A++++P  AEA   LG +Y   G    A +  R A+ L          LA A +  G  +
Sbjct: 48  ALQKDPNDAEARFLLGKIYLALGDYAAAEKELRKALSLGYPKNQVLPLLARAYLLQGKFQ 107

Query: 128 QAVQAYVTALQYNPD-----LYCVRSDLGNLLKALGRLDEAKDLYC-----------VRS 171
           Q +   +       D     L  +R   G     LG+L+ A+  Y             + 
Sbjct: 108 QVL-DELPGKTLLDDEGAAELLALR---GLAYLGLGQLELAQKSYEQALAIDPRSLYAKL 163

Query: 172 DLGNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNP 231
            L  L  A  R DEA+ L  E                                  +  +P
Sbjct: 164 GLAQLALAENRFDEARALIDE---------------------------------VLTADP 190

Query: 232 LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAY 291
              +A    G++    G ++ AL  YR A+ L+P+ I   + LA  L+ AG+ E+A + +
Sbjct: 191 GNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEAGEFEEA-EKH 249

Query: 292 VTAL 295
             AL
Sbjct: 250 ADAL 253



 Score = 64.3 bits (157), Expect = 2e-10
 Identities = 74/325 (22%), Positives = 127/325 (39%), Gaps = 46/325 (14%)

Query: 4   IAQLGIGLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAH 63
           +A LG+G LELA +      YE A          +  +    L L+ +     + D++  
Sbjct: 133 LAYLGLGQLELAQK-----SYEQALA-------IDPRSLYAKLGLAQLALAENRFDEARA 180

Query: 64  FSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAA 123
                +  +P   +A    G++    G ++ AL  YR A+ L+P+ I   + LA  L+ A
Sbjct: 181 LIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEA 240

Query: 124 GDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKAL-----GRLDEAKDLY--CVRSD---- 172
           G+ E+A +     L+  P+     S L + LKAL        ++A++     ++S     
Sbjct: 241 GEFEEAEKHADALLKKAPN-----SPLAHYLKALVDFQKKNYEDARETLQDALKSAPEYL 295

Query: 173 ------------LGNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESA 220
                       LGNL +A   L++       + +   +     +          TL   
Sbjct: 296 PALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARRLLASIQLRLGRVDEAIATLSP- 354

Query: 221 HFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVA 280
                A+  +P    A S LG  Y   G  ++A E    A  L P+       L  + ++
Sbjct: 355 -----ALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAARTQLGISKLS 409

Query: 281 AGDMEQAVQAYVTALQYNPLLKKED 305
            GD  +A+    TA Q +P L + D
Sbjct: 410 QGDPSEAIADLETAAQLDPELGRAD 434



 Score = 61.3 bits (149), Expect = 2e-09
 Identities = 48/210 (22%), Positives = 79/210 (37%), Gaps = 22/210 (10%)

Query: 503 WKARNSPNKFKAYLLDSGKSWRTNPEHPNYVAAARATKYVYNVEPDLTREGGSIPITLTF 562
             A   P+   A LL +           NY  A  + K    ++PD T    +       
Sbjct: 627 LLALQ-PDSALALLLLADAYAVMK----NYAKAITSLKRALELKPDNTE---AQIGLAQL 678

Query: 563 ECRKLDKSAHFSTLAI--KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID 620
                   +         KQ+P  A  +   G++Y  +     A++ YR A++  P    
Sbjct: 679 LLAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPS-SQ 737

Query: 621 GYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEI 680
             I L  AL+A+G+  +AV+            +LK   T P+ AV  + L  ++ AQ + 
Sbjct: 738 NAIKLHRALLASGNTAEAVKTLEA--------WLK---THPNDAVLRTALAELYLAQKDY 786

Query: 681 WLAIHHFEKAVSLDPNFLDAYINLGNVLKE 710
             AI H++  V   P+      NL  +  E
Sbjct: 787 DKAIKHYQTVVKKAPDNAVVLNNLAWLYLE 816



 Score = 61.3 bits (149), Expect = 2e-09
 Identities = 69/317 (21%), Positives = 125/317 (39%), Gaps = 29/317 (9%)

Query: 2   NDIAQLGIGLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKS 61
           N  A+  +G+ +L+      GD   A        + +       LLL   + +  + DK+
Sbjct: 396 NAAARTQLGISKLS-----QGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKA 450

Query: 62  AHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALV 121
              +    K+ P  A  ++ LG +Y  +G L +A E +  A+ ++PDF     NLA   +
Sbjct: 451 LAAAKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDI 510

Query: 122 AAGDMEQAVQAYVTALQYNP----------DLYCVR---SDLGNLLKALGRLD---EAKD 165
             G+ + A+Q +   L  +P           LY       +    L+    L+       
Sbjct: 511 QEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPA 570

Query: 166 LYCVRSDL--GNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFS 223
           L   +  L  G L KAL  L+EA +   ++ +   M  +  +      +  ++ +     
Sbjct: 571 LALAQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKL--- 627

Query: 224 TLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD 283
              +   P  A A   L + Y       +A+ + + A+ LKPD  +  I LA  L+AA  
Sbjct: 628 ---LALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKR 684

Query: 284 MEQAVQAYVTALQYNPL 300
            E A +   +  + +P 
Sbjct: 685 TESAKKIAKSLQKQHPK 701



 Score = 60.9 bits (148), Expect = 3e-09
 Identities = 34/137 (24%), Positives = 57/137 (41%), Gaps = 11/137 (8%)

Query: 581 NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 640
           +   AE  +  G  Y   GQL+ A ++Y  A+ + P  +   + LA   +A    ++A  
Sbjct: 121 DEGAAELLALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARA 180

Query: 641 AYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDA 700
                L  +P               A    G +  + G I LA+  + KA++L PN +  
Sbjct: 181 LIDEVLTADP-----------GNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAV 229

Query: 701 YINLGNVLKEARIFDRA 717
            + L  +L EA  F+ A
Sbjct: 230 LLALATILIEAGEFEEA 246



 Score = 59.7 bits (145), Expect = 5e-09
 Identities = 39/194 (20%), Positives = 70/194 (36%), Gaps = 30/194 (15%)

Query: 565 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 624
            + D++       +  +P   +A    G++    G ++ AL  YR A+ L+P+ I   + 
Sbjct: 173 NRFDEARALIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLA 232

Query: 625 LAAALVAAGDMEQAVQAYVTALQYNPS----CYLKAIE--TRPDFAVAWSNL-------- 670
           LA  L+ AG+ E+A +     L+  P+     YLKA+    + ++  A   L        
Sbjct: 233 LATILIEAGEFEEAEKHADALLKKAPNSPLAHYLKALVDFQKKNYEDARETLQDALKSAP 292

Query: 671 ---------GCVFNAQGEIWLAIHHFEKAVSLDPNFLDA-------YINLGNVLKEARIF 714
                    G      G +  A  +  + +   PN   A        + LG V +     
Sbjct: 293 EYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARRLLASIQLRLGRVDEAIATL 352

Query: 715 DRANTLFYVSSYFM 728
             A  L       +
Sbjct: 353 SPALGLDPDDPAAL 366



 Score = 58.9 bits (143), Expect = 9e-09
 Identities = 58/281 (20%), Positives = 104/281 (37%), Gaps = 21/281 (7%)

Query: 11  LLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIK 70
           +L LA    + G+ E A     +            L L+  +    +L K+      A  
Sbjct: 536 ILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAILNEAAD 595

Query: 71  QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAV 130
             P   EA+  LG      G L +A+ +++  + L+PD     + LA A     +  +A+
Sbjct: 596 AAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAI 655

Query: 131 QAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLY-----------CVRSDLGNLLKA 179
            +   AL+  PD    +  L  LL A  R + AK +                  G+L   
Sbjct: 656 TSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLR 715

Query: 180 LGRLDEAKNLHTENIK-----PVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLA 234
                 A   + + +K        +K+  A++ +  G      E+       +K +P  A
Sbjct: 716 QKDYPAAIQAYRKALKRAPSSQNAIKLHRALLAS--GNTA---EAVKTLEAWLKTHPNDA 770

Query: 235 EAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLA 275
              + L  +Y  +    +A+++Y+  V+  PD      NLA
Sbjct: 771 VLRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNNLA 811



 Score = 55.5 bits (134), Expect = 1e-07
 Identities = 58/280 (20%), Positives = 101/280 (36%), Gaps = 23/280 (8%)

Query: 21  AGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKS-AHFSTLAIKQNPLLAEAY 79
            GDY +AE+   +          VL LL+  +    K  +         +  +   AE  
Sbjct: 69  LGDYAAAEKELRKALSLGYPKNQVLPLLARAYLLQGKFQQVLDELPGKTLLDDEGAAELL 128

Query: 80  SNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQY 139
           +  G  Y   GQL+ A ++Y  A+ + P  +   + LA   +A    ++A       L  
Sbjct: 129 ALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVLTA 188

Query: 140 NPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEAKNLHTENIKPVTM 199
           +P         G+LL +LG ++ A   Y                 +A  L   NI  +  
Sbjct: 189 DPGNVDALLLKGDLLLSLGNIELALAAY----------------RKAIALRPNNIAVLLA 232

Query: 200 KVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRH 259
                I            ++     L    N  LA     L  V  ++   ++A E  + 
Sbjct: 233 LATILIEAGEFEEAEKHADAL----LKKAPNSPLAHYLKAL--VDFQKKNYEDARETLQD 286

Query: 260 AVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
           A++  P+++   +   A+    G++EQA Q     L+Y P
Sbjct: 287 ALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILKYAP 326



 Score = 51.2 bits (123), Expect = 2e-06
 Identities = 47/203 (23%), Positives = 79/203 (38%), Gaps = 30/203 (14%)

Query: 562 FECRKLDKSAHFSTL--AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFI 619
              R  ++    + L  A + NP   E    L   Y  +GQL++AL     A    PD  
Sbjct: 542 LYLRTGNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSP 601

Query: 620 DGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGE 679
           + ++ L  A +AAGD+ +AV           S + K +  +PD A+A   L   +     
Sbjct: 602 EAWLMLGRAQLAAGDLNKAV-----------SSFKKLLALQPDSALALLLLADAYAVMKN 650

Query: 680 IWLAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDRANTLFYVSSYFMITDEDIDVILE 739
              AI   ++A+ L P+  +A I L  +L  A+  + A  +                  +
Sbjct: 651 YAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIA-----------------K 693

Query: 740 RCEAKTEELNKKFEQLGESSLRD 762
             + +  +    FE  G+  LR 
Sbjct: 694 SLQKQHPKAALGFELEGDLYLRQ 716



 Score = 48.9 bits (117), Expect = 1e-05
 Identities = 39/153 (25%), Positives = 72/153 (47%), Gaps = 14/153 (9%)

Query: 565 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 624
              +K+A +   A + +P  A A + LG     +G   EA+ +   A +L P+     + 
Sbjct: 377 GDFEKAAEYLAKATELDPENAAARTQLGISKLSQGDPSEAIADLETAAQLDPELGRADLL 436

Query: 625 LAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETR-PDFAVAWSNLGCVFNAQGEIWLA 683
           L  + + +G  ++A+ A             K +E + PD A   + LG ++  +G++  A
Sbjct: 437 LILSYLRSGQFDKALAA------------AKKLEKKQPDNASLHNLLGAIYLGKGDLAKA 484

Query: 684 IHHFEKAVSLDPNFLDAYINLGNV-LKEARIFD 715
              FEKA+S++P+F  A  NL  + ++E    D
Sbjct: 485 REAFEKALSIEPDFFPAAANLARIDIQEGNPDD 517



 Score = 47.0 bits (112), Expect = 5e-05
 Identities = 50/250 (20%), Positives = 90/250 (36%), Gaps = 34/250 (13%)

Query: 44  VLLLLSSIHFQCRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAV 103
           +L L    +    +L+ +      A+  +P    A   L  +     +  EA       +
Sbjct: 127 LLALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVL 186

Query: 104 RLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEA 163
              P  +D  +     L++ G++E A+ AY  A+   P+   V   L  +L   G  +EA
Sbjct: 187 TADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEAGEFEEA 246

Query: 164 -KDLYCVR-----SDLGNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTL 217
            K    +      S L + LKAL  +D  K              ++A           TL
Sbjct: 247 EKHADALLKKAPNSPLAHYLKAL--VDFQKK-----------NYEDAR---------ETL 284

Query: 218 ESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAA 277
           +       A+K  P    A    G    + G L++A +     ++  P+       LA+ 
Sbjct: 285 QD------ALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARRLLASI 338

Query: 278 LVAAGDMEQA 287
            +  G +++A
Sbjct: 339 QLRLGRVDEA 348



 Score = 41.2 bits (97), Expect = 0.003
 Identities = 31/142 (21%), Positives = 53/142 (37%), Gaps = 12/142 (8%)

Query: 579 KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQA 638
           K +P  A   + L  +Y  +    +A+++Y+  V+  PD      NLA   +   D    
Sbjct: 764 KTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNNLAWLYLELKDPR-- 821

Query: 639 VQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFL 698
                 AL+Y      +A++  P+       LG +   +GE   A+    KAV++ P   
Sbjct: 822 ------ALEYA----ERALKLAPNIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAA 871

Query: 699 DAYINLGNVLKEARIFDRANTL 720
               +L   L        A   
Sbjct: 872 AIRYHLALALLATGRKAEARKE 893



 Score = 41.2 bits (97), Expect = 0.003
 Identities = 35/153 (22%), Positives = 61/153 (39%), Gaps = 1/153 (0%)

Query: 14  LAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIKQNP 73
           L      +G+   A +      +   N+  +   L+ ++   +  DK+       +K+ P
Sbjct: 742 LHRALLASGNTAEAVKTLEAWLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAP 801

Query: 74  LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAY 133
             A   +NL  +Y E      ALE    A++L P+       L   LV  G+ ++A+   
Sbjct: 802 DNAVVLNNLAWLYLELKD-PRALEYAERALKLAPNIPAILDTLGWLLVEKGEADRALPLL 860

Query: 134 VTALQYNPDLYCVRSDLGNLLKALGRLDEAKDL 166
             A+   P+   +R  L   L A GR  EA+  
Sbjct: 861 RKAVNIAPEAAAIRYHLALALLATGRKAEARKE 893



 Score = 39.7 bits (93), Expect = 0.007
 Identities = 39/162 (24%), Positives = 71/162 (43%), Gaps = 12/162 (7%)

Query: 544 NVEPDLTREGGSIPITLTFECRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQE 603
            ++P+L R    + ++      + DK+   +    K+ P  A  ++ LG +Y  +G L +
Sbjct: 425 QLDPELGRADLLLILSY-LRSGQFDKALAAAKKLEKKQPDNASLHNLLGAIYLGKGDLAK 483

Query: 604 ALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDF 663
           A E +  A+ ++PDF     NLA   +  G+ + A+Q +   L  +P   L+AI      
Sbjct: 484 AREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDPK-NLRAILALAGL 542

Query: 664 AVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLG 705
            +   N       +   WL     EKA  L+P  ++  + L 
Sbjct: 543 YLRTGN-----EEEAVAWL-----EKAAELNPQEIEPALALA 574



 Score = 38.1 bits (89), Expect = 0.026
 Identities = 32/150 (21%), Positives = 56/150 (37%), Gaps = 24/150 (16%)

Query: 582 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQA 641
           P   EA+  LG      G L +A+ +++  + L+PD     + LA A     +  +A+ +
Sbjct: 598 PDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITS 657

Query: 642 YVTALQYNPSCYLKAI-----------------------ETRPDFAVAWSNLGCVFNAQG 678
              AL+  P      I                       +  P  A+ +   G ++  Q 
Sbjct: 658 LKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLRQK 717

Query: 679 EIWLAIHHFEKAVSLDPNFLDAYINLGNVL 708
           +   AI  + KA+   P+  +A I L   L
Sbjct: 718 DYPAAIQAYRKALKRAPSSQNA-IKLHRAL 746



 Score = 37.4 bits (87), Expect = 0.039
 Identities = 20/64 (31%), Positives = 31/64 (48%)

Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 636
           A+K  P +      LG +  E+G+   AL   R AV + P+      +LA AL+A G   
Sbjct: 829 ALKLAPNIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAAAIRYHLALALLATGRKA 888

Query: 637 QAVQ 640
           +A +
Sbjct: 889 EARK 892



 Score = 37.0 bits (86), Expect = 0.055
 Identities = 18/79 (22%), Positives = 31/79 (39%), Gaps = 2/79 (2%)

Query: 230 NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 289
           +   AE  +  G  Y   GQL+ A ++Y  A+ + P  +   + LA   +A    ++A  
Sbjct: 121 DEGAAELLALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARA 180

Query: 290 AYVTALQYNPLLKKEDGWN 308
                L  +P     D   
Sbjct: 181 LIDEVLTADP--GNVDALL 197



 Score = 36.6 bits (85), Expect = 0.072
 Identities = 33/131 (25%), Positives = 52/131 (39%), Gaps = 23/131 (17%)

Query: 598 RGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP------- 650
           +   ++A E  + A++  P+++   +   A+    G++EQA Q     L+Y P       
Sbjct: 274 KKNYEDARETLQDALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARR 333

Query: 651 ---SCYLK-------------AIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLD 694
              S  L+             A+   PD   A S LG  + A G+   A  +  KA  LD
Sbjct: 334 LLASIQLRLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELD 393

Query: 695 PNFLDAYINLG 705
           P    A   LG
Sbjct: 394 PENAAARTQLG 404



 Score = 35.1 bits (81), Expect = 0.22
 Identities = 27/132 (20%), Positives = 46/132 (34%), Gaps = 11/132 (8%)

Query: 565 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 624
            +L+ +      A+  +P    A   L  +     +  EA       +   P  +D  + 
Sbjct: 139 GQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVLTADPGNVDALLL 198

Query: 625 LAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAI 684
               L++ G++E A+ AY            KAI  RP+       L  +    GE   A 
Sbjct: 199 KGDLLLSLGNIELALAAYR-----------KAIALRPNNIAVLLALATILIEAGEFEEAE 247

Query: 685 HHFEKAVSLDPN 696
            H +  +   PN
Sbjct: 248 KHADALLKKAPN 259



 Score = 34.7 bits (80), Expect = 0.25
 Identities = 30/160 (18%), Positives = 57/160 (35%), Gaps = 24/160 (15%)

Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 644
            E        Y ++ + + A+   ++A++  P+  +    L    +A GD   A +    
Sbjct: 22  PEELIEAAKSYLQKNKYKAAIIQLKNALQKDPNDAEARFLLGKIYLALGDYAAAEKELRK 81

Query: 645 ALQYNPS----------CYL------KAIET--------RPDFAVAWSNLGCVFNAQGEI 680
           AL                YL      + ++             A   +  G  +   G++
Sbjct: 82  ALSLGYPKNQVLPLLARAYLLQGKFQQVLDELPGKTLLDDEGAAELLALRGLAYLGLGQL 141

Query: 681 WLAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDRANTL 720
            LA   +E+A+++DP  L A + L  +      FD A  L
Sbjct: 142 ELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARAL 181


>gnl|CDD|193553 cd05678, M20_dipept_like_1, Uncharacterized M20 Dipeptidases.
           Peptidase M20 family, unknown dipeptidase-like subfamily
           1 (inferred by homology to be dipeptidases). M20
           dipeptidases include a large variety of bacterial
           enzymes including cytosolic nonspecific dipeptidase
           (CNDP), Xaa-methyl-His dipeptidase (anserinase),and
           canosinase. These dipeptidases have been shown to act on
           a wide range of dipeptides, but not larger peptides. For
           example, anserinase mainly catalyzes the hydrolysis of
           N-alpha-acetylhistidine while carnosinase degrades
           beta-alanyl-L-histidine.
          Length = 466

 Score = 75.0 bits (185), Expect = 5e-14
 Identities = 52/226 (23%), Positives = 93/226 (41%), Gaps = 31/226 (13%)

Query: 354 VNIKIECASKDLHSGLFGGSV-HEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEE 412
           + + +  A   LHSG +G    + A   L  ++  + + +G++LIP  Y D  PL + E+
Sbjct: 205 ITLTVYGAKVPLHSGHYGNYAPNPAFR-LARLLASMKDDDGRVLIPGYY-DGIPLSEEEK 262

Query: 413 QFYEKIDFDTEDFRTAIDHPKLTKADKT----QVLMSRWRYPSLSLHGIEGAFSGPGGKT 468
           +    +  D E  R       L + +K     Q  +   +YPSL++ G++  ++G   +T
Sbjct: 263 KALAAVPDDEESLR---KRLGLAQTEKVGRNYQEAL---QYPSLNIRGLQSGWTGDKART 316

Query: 469 VIPGKVVGKFSIRIVPNQTPQCVEKYVLDYL--------------NELWKARNSPNKFKA 514
           +IP     +  IR+VP   P  +   +  ++               E       P     
Sbjct: 317 IIPSTATAEIDIRLVPETDPDRLLDLLKKHIEAQGYHVVDGEPTAEER---LAHPKIATF 373

Query: 515 YLLDSGKSWRTNPEHPNYVAAARATKYVYNVEPDLTRE-GGSIPIT 559
               S  ++RT+ + P       A    +  EP   R  GGS+PI+
Sbjct: 374 TRTGSYNAFRTDLDSPIGKWLRAALTKAFGEEPIQIRTMGGSVPIS 419


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 61.9 bits (151), Expect = 4e-12
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 75  LAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDM-EQAVQAY 133
            AEA  NLGN   + G   EA+E Y  A+ L PD  + Y NLA A +  G   E+A++  
Sbjct: 2   NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61

Query: 134 VTALQYNP 141
             AL+ +P
Sbjct: 62  EKALELDP 69



 Score = 61.9 bits (151), Expect = 4e-12
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 233 LAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDM-EQAVQAY 291
            AEA  NLGN   + G   EA+E Y  A+ L PD  + Y NLA A +  G   E+A++  
Sbjct: 2   NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61

Query: 292 VTALQYNP 299
             AL+ +P
Sbjct: 62  EKALELDP 69



 Score = 61.9 bits (151), Expect = 4e-12
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 584 LAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDM-EQAVQAY 642
            AEA  NLGN   + G   EA+E Y  A+ L PD  + Y NLA A +  G   E+A++  
Sbjct: 2   NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61

Query: 643 VTALQYNP 650
             AL+ +P
Sbjct: 62  EKALELDP 69



 Score = 50.4 bits (121), Expect = 4e-08
 Identities = 25/75 (33%), Positives = 36/75 (48%), Gaps = 12/75 (16%)

Query: 622 YINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIW 681
             NL  AL   GD ++A++            Y KA+E  PD A A+ NL   +   G+ +
Sbjct: 6   LKNLGNALFKLGDYDEAIE-----------AYEKALELDPDNAEAYYNLALAYLKLGKDY 54

Query: 682 -LAIHHFEKAVSLDP 695
             A+   EKA+ LDP
Sbjct: 55  EEALEDLEKALELDP 69



 Score = 45.0 bits (107), Expect = 4e-06
 Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 1/65 (1%)

Query: 44  VLLLLSSIHFQCRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERG-QLQEALENYRHA 102
            L  L +  F+    D++      A++ +P  AEAY NL   Y + G   +EALE+   A
Sbjct: 5   ALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKA 64

Query: 103 VRLKP 107
           + L P
Sbjct: 65  LELDP 69



 Score = 42.3 bits (100), Expect = 3e-05
 Identities = 19/56 (33%), Positives = 29/56 (51%), Gaps = 1/56 (1%)

Query: 562 FECRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERG-QLQEALENYRHAVRLKP 616
           F+    D++      A++ +P  AEAY NL   Y + G   +EALE+   A+ L P
Sbjct: 14  FKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALELDP 69



 Score = 41.1 bits (97), Expect = 9e-05
 Identities = 18/47 (38%), Positives = 24/47 (51%)

Query: 664 AVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGNVLKE 710
           A A  NLG      G+   AI  +EKA+ LDP+  +AY NL     +
Sbjct: 3   AEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLK 49



 Score = 38.4 bits (90), Expect = 8e-04
 Identities = 17/43 (39%), Positives = 24/43 (55%), Gaps = 1/43 (2%)

Query: 224 TLAIKQNPLLAEAYSNLGNVYKERG-QLQEALENYRHAVRLKP 265
             A++ +P  AEAY NL   Y + G   +EALE+   A+ L P
Sbjct: 27  EKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALELDP 69



 Score = 31.9 bits (73), Expect = 0.14
 Identities = 18/56 (32%), Positives = 27/56 (48%), Gaps = 1/56 (1%)

Query: 113 YINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALG-RLDEAKDLY 167
             NL  AL   GD ++A++AY  AL+ +PD      +L      LG   +EA +  
Sbjct: 6   LKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61


>gnl|CDD|236127 PRK07907, PRK07907, hypothetical protein; Provisional.
          Length = 449

 Score = 66.9 bits (164), Expect = 2e-11
 Identities = 46/208 (22%), Positives = 85/208 (40%), Gaps = 32/208 (15%)

Query: 365 LHSGLFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEEQFYEKIDFDTED 424
           +HSG FGG+  +A+T L+ ++  L + +G + +          G    + +  +D+D E 
Sbjct: 214 VHSGQFGGAAPDALTALVRLLATLHDEDGNVAVD---------GLDATEPWLGVDYDEER 264

Query: 425 FRTA---IDHPKLTKADKTQVLMSRWRYPSLSLHGI-----EGAFSGPGGKTVIPGKVVG 476
           FR     +D  +L         +  W  P++++ GI      GA +       +P     
Sbjct: 265 FRADAGVLDGVELIGTGSVADRL--WAKPAITVIGIDAPPVAGASN------ALPPSARA 316

Query: 477 KFSIRIVPNQTPQCVEKYVLDYL--NELWKARNSPNKFKAYLLDSGKSWRTNPEHPNYVA 534
           + S+R+ P Q     +  ++ +L  +  W A            D+G+ +  +   P Y A
Sbjct: 317 RLSLRVAPGQDAAEAQDALVAHLEAHAPWGAH-----VTVERGDAGQPFAADASGPAYDA 371

Query: 535 AARATKYVYNVEPDLTREGGSIPITLTF 562
           A  A +  +  +P     GGSIP     
Sbjct: 372 ARAAMREAWGKDPVDMGMGGSIPFIAEL 399


>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
          Length = 291

 Score = 57.9 bits (138), Expect = 7e-09
 Identities = 68/243 (27%), Positives = 104/243 (42%), Gaps = 35/243 (14%)

Query: 67  LAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVR--LKPDFIDGYINLAAALVAAG 124
           L +  N  LA     L     + G+L+EALE    A+   L P+  +  +NL   L A G
Sbjct: 50  LELLPNSDLAGLLLLLALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALG 109

Query: 125 DMEQAVQAYVTALQYNPDLYCVRS--------DLGNLLKALGRLDEAKDLYCVRSDLGNL 176
             E+A++    AL  +PD     +        +LG+  +AL   ++A +L    ++L   
Sbjct: 110 KYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEA 169

Query: 177 LKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEA 236
           L ALG L EA   + E ++                                      AEA
Sbjct: 170 LLALGALLEALGRYEEALE-------------------------LLEKALKLNPDDDAEA 204

Query: 237 YSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 296
             NLG +Y + G+ +EALE Y  A+ L PD  +   NLA  L+  G  E+A++A   AL+
Sbjct: 205 LLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALE 264

Query: 297 YNP 299
            +P
Sbjct: 265 LDP 267



 Score = 56.0 bits (133), Expect = 3e-08
 Identities = 50/184 (27%), Positives = 78/184 (42%), Gaps = 5/184 (2%)

Query: 5   AQLGIGLLELAHREYQAGDYESAERHCMQ-LWRQETNNTGVLLLLSSIHFQCRKLDKSAH 63
             L   LL L       G YE A     + L      +    LL     ++    +++  
Sbjct: 92  PNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALE 151

Query: 64  FSTLAIKQNP---LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD-FIDGYINLAAA 119
               A++ +P    LAEA   LG + +  G+ +EALE    A++L PD   +  +NL   
Sbjct: 152 LYEKALELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLL 211

Query: 120 LVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKA 179
            +  G  E+A++ Y  AL+ +PD      +L  LL  LGR +EA +      +L   L  
Sbjct: 212 YLKLGKYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPDLYN 271

Query: 180 LGRL 183
           LG  
Sbjct: 272 LGLA 275



 Score = 51.0 bits (120), Expect = 1e-06
 Identities = 47/144 (32%), Positives = 71/144 (49%), Gaps = 10/144 (6%)

Query: 576 LAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLA-AALVAAGD 634
           L ++  P LAEA  NLG + +  G+ +EALE    A+ L PD       LA  AL   GD
Sbjct: 86  LELELLPNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGD 145

Query: 635 MEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLD 694
            E+A++ Y  AL+ +P           + A A   LG +  A G    A+   EKA+ L+
Sbjct: 146 YEEALELYEKALELDPELN--------ELAEALLALGALLEALGRYEEALELLEKALKLN 197

Query: 695 PN-FLDAYINLGNVLKEARIFDRA 717
           P+   +A +NLG +  +   ++ A
Sbjct: 198 PDDDAEALLNLGLLYLKLGKYEEA 221



 Score = 49.1 bits (115), Expect = 4e-06
 Identities = 50/145 (34%), Positives = 73/145 (50%), Gaps = 14/145 (9%)

Query: 576 LAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKP---DFIDGYINLAAALVAA 632
           LA+  +P LAEA   LG    E G  +EALE Y  A+ L P   +  +  + L A L A 
Sbjct: 122 LALDPDPDLAEALLALG-ALYELGDYEEALELYEKALELDPELNELAEALLALGALLEAL 180

Query: 633 GDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVS 692
           G  E+A++    AL+ NP           D A A  NLG ++   G+   A+ ++EKA+ 
Sbjct: 181 GRYEEALELLEKALKLNPD----------DDAEALLNLGLLYLKLGKYEEALEYYEKALE 230

Query: 693 LDPNFLDAYINLGNVLKEARIFDRA 717
           LDP+  +A  NL  +L E   ++ A
Sbjct: 231 LDPDNAEALYNLALLLLELGRYEEA 255



 Score = 48.3 bits (113), Expect = 9e-06
 Identities = 49/187 (26%), Positives = 81/187 (43%), Gaps = 15/187 (8%)

Query: 531 NYVAAARATKYVYNVEPDLTREGGSIPITLTFECRKLDKSAHFSTLAIKQNP---LLAEA 587
            Y  A    +    ++PD       + +   +E    +++      A++ +P    LAEA
Sbjct: 110 KYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEA 169

Query: 588 YSNLGNVYKERGQLQEALENYRHAVRLKPD-FIDGYINLAAALVAAGDMEQAVQAYVTAL 646
              LG + +  G+ +EALE    A++L PD   +  +NL    +  G  E+A++      
Sbjct: 170 LLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALE------ 223

Query: 647 QYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGN 706
                 Y KA+E  PD A A  NL  +    G    A+   EKA+ LDP+  +  + L  
Sbjct: 224 -----YYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPDLYNLGLALLL 278

Query: 707 VLKEARI 713
           +L EA  
Sbjct: 279 LLAEALE 285



 Score = 36.4 bits (82), Expect = 0.051
 Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 1/82 (1%)

Query: 225 LAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLA-AALVAAGD 283
           L ++  P LAEA  NLG + +  G+ +EALE    A+ L PD       LA  AL   GD
Sbjct: 86  LELELLPNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGD 145

Query: 284 MEQAVQAYVTALQYNPLLKKED 305
            E+A++ Y  AL+ +P L +  
Sbjct: 146 YEEALELYEKALELDPELNELA 167



 Score = 30.2 bits (66), Expect = 5.3
 Identities = 46/160 (28%), Positives = 70/160 (43%), Gaps = 13/160 (8%)

Query: 566 KLDKSAHFSTLAIKQNPLLAEAYSNLGN-VYKERGQLQEALENYRHAVRLKP--DFIDGY 622
            L  +      A+ +    A A    G  + +  G+L EALE    A+ L P  D     
Sbjct: 3   DLLLALAILLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALELLPNSDLAGLL 62

Query: 623 INLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWL 682
           + LA AL+  G +E+A++    AL+          E  P+ A A  NLG +  A G+   
Sbjct: 63  LLLALALLKLGRLEEALELLEKALEL---------ELLPNLAEALLNLGLLLEALGKYEE 113

Query: 683 AIHHFEKAVSLDPNFLDAYINLG-NVLKEARIFDRANTLF 721
           A+   EKA++LDP+   A   L    L E   ++ A  L+
Sbjct: 114 ALELLEKALALDPDPDLAEALLALGALYELGDYEEALELY 153


>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
           PilW.  Members of this family are designated PilF in ref
           (PMID:8973346) and PilW in ref (PMID:15612916). This
           outer membrane protein is required both for pilus
           stability and for pilus function such as adherence to
           human cells. Members of this family contain copies of
           the TPR (tetratricopeptide repeat) domain.
          Length = 234

 Score = 55.8 bits (135), Expect = 2e-08
 Identities = 49/221 (22%), Positives = 88/221 (39%), Gaps = 21/221 (9%)

Query: 76  AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 135
           A+    L   Y E+G L+ A EN   A+   PD    Y+ LA      G++E+A  ++  
Sbjct: 31  AKIRVQLALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRR 90

Query: 136 ALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEAKNLHTENIK 195
           AL  NP+   V ++ G  L   G+ ++A         +    +A+        L+ +  +
Sbjct: 91  ALTLNPNNGDVLNNYGTFLCQQGKYEQA---------MQQFEQAI-----EDPLYPQPAR 136

Query: 196 PVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALE 255
            +     NA +C    +     ++  + T A++ +P   E+   L  +Y  RGQ ++A  
Sbjct: 137 SLE----NAGLCAL--KAGDFDKAEKYLTRALQIDPQRPESLLELAELYYLRGQYKDARA 190

Query: 256 NYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 296
                 +      +  + L   +  A     A Q Y   LQ
Sbjct: 191 YLERYQQTYNQTAES-LWLGIRIARALGDVAAAQRYGAQLQ 230



 Score = 55.4 bits (134), Expect = 2e-08
 Identities = 38/140 (27%), Positives = 58/140 (41%), Gaps = 15/140 (10%)

Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 644
           A+    L   Y E+G L+ A EN   A+   PD    Y+ LA      G++E+A  ++  
Sbjct: 31  AKIRVQLALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRR 90

Query: 645 ALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNF---LDAY 701
           AL  N           P+     +N G     QG+   A+  FE+A+  DP +     + 
Sbjct: 91  ALTLN-----------PNNGDVLNNYGTFLCQQGKYEQAMQQFEQAIE-DPLYPQPARSL 138

Query: 702 INLGNVLKEARIFDRANTLF 721
            N G    +A  FD+A    
Sbjct: 139 ENAGLCALKAGDFDKAEKYL 158



 Score = 52.3 bits (126), Expect = 2e-07
 Identities = 34/131 (25%), Positives = 60/131 (45%), Gaps = 15/131 (11%)

Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 636
           A++ +P    AY  L   Y++ G+L++A +++R A+ L P+  D   N    L   G  E
Sbjct: 57  ALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYE 116

Query: 637 QAVQAYVTALQYNPSCYLKAIETRPDF---AVAWSNLGCVFNAQGEIWLAIHHFEKAVSL 693
           QA+Q            + +AIE  P +   A +  N G      G+   A  +  +A+ +
Sbjct: 117 QAMQQ-----------FEQAIED-PLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQI 164

Query: 694 DPNFLDAYINL 704
           DP   ++ + L
Sbjct: 165 DPQRPESLLEL 175



 Score = 44.6 bits (106), Expect = 1e-04
 Identities = 22/66 (33%), Positives = 32/66 (48%)

Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 293
           A+    L   Y E+G L+ A EN   A+   PD    Y+ LA      G++E+A  ++  
Sbjct: 31  AKIRVQLALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRR 90

Query: 294 ALQYNP 299
           AL  NP
Sbjct: 91  ALTLNP 96



 Score = 42.3 bits (100), Expect = 5e-04
 Identities = 27/98 (27%), Positives = 45/98 (45%), Gaps = 2/98 (2%)

Query: 68  AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 127
           A++ +P    AY  L   Y++ G+L++A +++R A+ L P+  D   N    L   G  E
Sbjct: 57  ALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYE 116

Query: 128 QAVQAYVTAL--QYNPDLYCVRSDLGNLLKALGRLDEA 163
           QA+Q +  A+     P       + G      G  D+A
Sbjct: 117 QAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKA 154



 Score = 37.7 bits (88), Expect = 0.014
 Identities = 23/110 (20%), Positives = 42/110 (38%), Gaps = 36/110 (32%)

Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD----------FI------- 268
           A++ +P    AY  L   Y++ G+L++A +++R A+ L P+          F+       
Sbjct: 57  ALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYE 116

Query: 269 -------------------DGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
                                  N     + AGD ++A +    ALQ +P
Sbjct: 117 QAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDP 166


>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 250

 Score = 55.1 bits (133), Expect = 3e-08
 Identities = 38/145 (26%), Positives = 64/145 (44%), Gaps = 9/145 (6%)

Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 636
           A++ +P    A+    + Y++ G+   A E+YR A+ L P+  D   N  A L A G  E
Sbjct: 61  ALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQGRPE 120

Query: 637 QAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPN 696
           +A+Q +  AL  +P+    +            NLG      G+   A  + ++A+ LDP 
Sbjct: 121 EAMQQFERALA-DPAYGEPSD--------TLENLGLCALKAGQFDQAEEYLKRALELDPQ 171

Query: 697 FLDAYINLGNVLKEARIFDRANTLF 721
           F  A + L  +  +A  +  A    
Sbjct: 172 FPPALLELARLHYKAGDYAPARLYL 196



 Score = 45.5 bits (108), Expect = 5e-05
 Identities = 48/224 (21%), Positives = 84/224 (37%), Gaps = 27/224 (12%)

Query: 76  AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 135
           A+A   L   Y ++G   +A +N   A+   P +   ++  A      G+ + A ++Y  
Sbjct: 35  AKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRK 94

Query: 136 ALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEAKNLHTENIK 195
           AL   P+   V ++ G  L A GR +E                A+ + + A         
Sbjct: 95  ALSLAPNNGDVLNNYGAFLCAQGRPEE----------------AMQQFERALADPAYGEP 138

Query: 196 PVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEA-- 253
             T+  +N  +C     +    E   +   A++ +P    A   L  ++ + G    A  
Sbjct: 139 SDTL--ENLGLCALKAGQFDQAE--EYLKRALELDPQFPPALLELARLHYKAGDYAPARL 194

Query: 254 -LENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 296
            LE Y+     + + +   I +A  L   GD   A Q Y   LQ
Sbjct: 195 YLERYQQRGGAQAESLLLGIRIAKRL---GD-RAAAQRYQAQLQ 234



 Score = 39.3 bits (92), Expect = 0.006
 Identities = 16/66 (24%), Positives = 29/66 (43%)

Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 293
           A+A   L   Y ++G   +A +N   A+   P +   ++  A      G+ + A ++Y  
Sbjct: 35  AKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRK 94

Query: 294 ALQYNP 299
           AL   P
Sbjct: 95  ALSLAP 100



 Score = 38.2 bits (89), Expect = 0.012
 Identities = 33/104 (31%), Positives = 52/104 (50%), Gaps = 6/104 (5%)

Query: 68  AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 127
           A++ +P    A+    + Y++ G+   A E+YR A+ L P+  D   N  A L A G  E
Sbjct: 61  ALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQGRPE 120

Query: 128 QAVQAYVTALQYNPDLYCVRSDLGNL----LKALGRLDEAKDLY 167
           +A+Q +  AL  +P        L NL    LKA G+ D+A++  
Sbjct: 121 EAMQQFERALA-DPAYGEPSDTLENLGLCALKA-GQFDQAEEYL 162



 Score = 35.5 bits (82), Expect = 0.080
 Identities = 35/177 (19%), Positives = 67/177 (37%), Gaps = 4/177 (2%)

Query: 3   DIAQLGIGLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSA 62
           D  +     L+LA    Q GDY  A+++  +    + +     L+ +  + +  + D + 
Sbjct: 30  DRNEAAKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLAD 89

Query: 63  HFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF---IDGYINLAAA 119
                A+   P   +  +N G     +G+ +EA++ +  A+   P +    D   NL   
Sbjct: 90  ESYRKALSLAPNNGDVLNNYGAFLCAQGRPEEAMQQFERAL-ADPAYGEPSDTLENLGLC 148

Query: 120 LVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNL 176
            + AG  +QA +    AL+ +P       +L  L    G    A+         G  
Sbjct: 149 ALKAGQFDQAEEYLKRALELDPQFPPALLELARLHYKAGDYAPARLYLERYQQRGGA 205


>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat. 
          Length = 78

 Score = 50.1 bits (120), Expect = 6e-08
 Identities = 24/74 (32%), Positives = 34/74 (45%), Gaps = 7/74 (9%)

Query: 72  NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLK-------PDFIDGYINLAAALVAAG 124
           +P LA A +NL  V +  G   EALE    A+ L        P+      NLA   +A G
Sbjct: 1   HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60

Query: 125 DMEQAVQAYVTALQ 138
           D ++A++    AL 
Sbjct: 61  DYDEALEYLEKALA 74



 Score = 50.1 bits (120), Expect = 6e-08
 Identities = 24/74 (32%), Positives = 34/74 (45%), Gaps = 7/74 (9%)

Query: 230 NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLK-------PDFIDGYINLAAALVAAG 282
           +P LA A +NL  V +  G   EALE    A+ L        P+      NLA   +A G
Sbjct: 1   HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60

Query: 283 DMEQAVQAYVTALQ 296
           D ++A++    AL 
Sbjct: 61  DYDEALEYLEKALA 74



 Score = 50.1 bits (120), Expect = 6e-08
 Identities = 24/74 (32%), Positives = 34/74 (45%), Gaps = 7/74 (9%)

Query: 581 NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLK-------PDFIDGYINLAAALVAAG 633
           +P LA A +NL  V +  G   EALE    A+ L        P+      NLA   +A G
Sbjct: 1   HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60

Query: 634 DMEQAVQAYVTALQ 647
           D ++A++    AL 
Sbjct: 61  DYDEALEYLEKALA 74



 Score = 44.7 bits (106), Expect = 6e-06
 Identities = 24/81 (29%), Positives = 39/81 (48%), Gaps = 4/81 (4%)

Query: 616 PDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFN 675
           PD      NLA  L   GD ++A++    AL+       +  E  P+ A A +NL  ++ 
Sbjct: 2   PDLAAALNNLALVLRRLGDYDEALELLEKALEL----ARELGEDHPETARALNNLARLYL 57

Query: 676 AQGEIWLAIHHFEKAVSLDPN 696
           A G+   A+ + EKA++L   
Sbjct: 58  ALGDYDEALEYLEKALALREA 78



 Score = 38.1 bits (89), Expect = 0.001
 Identities = 17/72 (23%), Positives = 29/72 (40%), Gaps = 7/72 (9%)

Query: 44  VLLLLSSIHFQCRKLDKSAHFSTLAIKQ-------NPLLAEAYSNLGNVYKERGQLQEAL 96
            L  L+ +  +    D++      A++        +P  A A +NL  +Y   G   EAL
Sbjct: 7   ALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEAL 66

Query: 97  ENYRHAVRLKPD 108
           E    A+ L+  
Sbjct: 67  EYLEKALALREA 78



 Score = 37.3 bits (87), Expect = 0.002
 Identities = 24/98 (24%), Positives = 35/98 (35%), Gaps = 26/98 (26%)

Query: 169 VRSDLGNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIK 228
             ++L  +L+ LG  DEA  L  +                        LE A       +
Sbjct: 7   ALNNLALVLRRLGDYDEALELLEK-----------------------ALELAR---ELGE 40

Query: 229 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 266
            +P  A A +NL  +Y   G   EALE    A+ L+  
Sbjct: 41  DHPETARALNNLARLYLALGDYDEALEYLEKALALREA 78



 Score = 37.0 bits (86), Expect = 0.003
 Identities = 13/36 (36%), Positives = 17/36 (47%)

Query: 582 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 617
           P  A A +NL  +Y   G   EALE    A+ L+  
Sbjct: 43  PETARALNNLARLYLALGDYDEALEYLEKALALREA 78



 Score = 32.7 bits (75), Expect = 0.096
 Identities = 19/65 (29%), Positives = 25/65 (38%), Gaps = 7/65 (10%)

Query: 660 RPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSL-------DPNFLDAYINLGNVLKEAR 712
            PD A A +NL  V    G+   A+   EKA+ L        P    A  NL  +     
Sbjct: 1   HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60

Query: 713 IFDRA 717
            +D A
Sbjct: 61  DYDEA 65


>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
           repeats [General function prediction only].
          Length = 484

 Score = 55.9 bits (135), Expect = 7e-08
 Identities = 33/153 (21%), Positives = 55/153 (35%), Gaps = 10/153 (6%)

Query: 15  AHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIKQNPL 74
           A + Y AG Y+ A +    L   + +N   L L   I  +  K  ++      A+  +P 
Sbjct: 313 ALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDPN 372

Query: 75  LAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYV 134
                 NL     + G+ QEA+      +   P+  +G+  LA A    G+  +A+ A  
Sbjct: 373 SPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRAEALLARA 432

Query: 135 TALQYNPDLYCVRSDLGNLLKALGRLDEAKDLY 167
                           G L +A+  L  A    
Sbjct: 433 EGYALA----------GRLEQAIIFLMRASQQV 455



 Score = 42.4 bits (100), Expect = 0.001
 Identities = 26/104 (25%), Positives = 42/104 (40%), Gaps = 14/104 (13%)

Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 636
           A   NP   E     G++  E  + +EA+E  + A+ L P+     +NLA AL+  G  +
Sbjct: 335 AQPDNPYYLEL---AGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQ 391

Query: 637 QAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEI 680
           +A++        N   YL      P+    W  L   +   G  
Sbjct: 392 EAIR------ILNR--YLFND---PEDPNGWDLLAQAYAELGNR 424



 Score = 40.1 bits (94), Expect = 0.004
 Identities = 23/88 (26%), Positives = 40/88 (45%), Gaps = 13/88 (14%)

Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 285
           A   NP   E     G++  E  + +EA+E  + A+ L P+     +NLA AL+  G  +
Sbjct: 335 AQPDNPYYLEL---AGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQ 391

Query: 286 QAVQAYVTALQYNPLLKKE----DGWNT 309
           +A++        N  L  +    +GW+ 
Sbjct: 392 EAIR------ILNRYLFNDPEDPNGWDL 413



 Score = 37.8 bits (88), Expect = 0.023
 Identities = 16/73 (21%), Positives = 28/73 (38%)

Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 285
           A+  +P       NL     + G+ QEA+      +   P+  +G+  LA A    G+  
Sbjct: 366 ALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRA 425

Query: 286 QAVQAYVTALQYN 298
           +A+ A        
Sbjct: 426 EALLARAEGYALA 438



 Score = 37.8 bits (88), Expect = 0.023
 Identities = 16/73 (21%), Positives = 28/73 (38%)

Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 636
           A+  +P       NL     + G+ QEA+      +   P+  +G+  LA A    G+  
Sbjct: 366 ALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRA 425

Query: 637 QAVQAYVTALQYN 649
           +A+ A        
Sbjct: 426 EALLARAEGYALA 438


>gnl|CDD|227343 COG5010, TadD, Flp pilus assembly protein TadD, contains TPR
           repeats [Intracellular trafficking and secretion].
          Length = 257

 Score = 49.0 bits (117), Expect = 3e-06
 Identities = 41/191 (21%), Positives = 70/191 (36%), Gaps = 13/191 (6%)

Query: 12  LELAHREYQAGDYESAERHCMQLWRQETNNTGVLLL--LSSIHFQCRKLDKSAHFSTLAI 69
            +       +      +     L      N   L +  L++  +     D S      + 
Sbjct: 34  AKATGAPESSLAMRQTQGAAAALGAAVLRNPEDLSIAKLATALYLRGDADSSLAVLQKSA 93

Query: 70  KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQA 129
              P   E  +  G      G   EA+   R A RL P   + +  L AAL   G  ++A
Sbjct: 94  IAYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEA 153

Query: 130 VQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLY-----------CVRSDLGNLLK 178
            +AY  AL+  P+   + ++LG  L   G L++A+ L             VR +L  ++ 
Sbjct: 154 RRAYRQALELAPNEPSIANNLGMSLLLRGDLEDAETLLLPAYLSPAADSRVRQNLALVVG 213

Query: 179 ALGRLDEAKNL 189
             G   EA+++
Sbjct: 214 LQGDFREAEDI 224



 Score = 40.5 bits (95), Expect = 0.003
 Identities = 42/211 (19%), Positives = 63/211 (29%), Gaps = 45/211 (21%)

Query: 88  ERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVR 147
              Q Q A      AV   P+ +     LA AL   GD + ++     +    P    + 
Sbjct: 45  AMRQTQGAAAALGAAVLRNPEDLSIA-KLATALYLRGDADSSLAVLQKSAIAYPKDRELL 103

Query: 148 SDLGNLLKALGRLDEAKDLYC-----------VRSDLGNLLKALGRLDEAKNLHTENIKP 196
           +  G      G   EA  +               + LG  L  LGR DEA+  + + ++ 
Sbjct: 104 AAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALE- 162

Query: 197 VTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALEN 256
                                       LA  +        +NLG     RG L++A   
Sbjct: 163 ----------------------------LAPNE----PSIANNLGMSLLLRGDLEDAETL 190

Query: 257 YRHAVRLKPDFIDGYINLAAALVAAGDMEQA 287
              A            NLA  +   GD  +A
Sbjct: 191 LLPAYLSPAADSRVRQNLALVVGLQGDFREA 221



 Score = 38.6 bits (90), Expect = 0.009
 Identities = 31/123 (25%), Positives = 44/123 (35%), Gaps = 11/123 (8%)

Query: 598 RGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAI 657
            G   EA+   R A RL P   + +  L AAL   G  ++A +AY  AL+      L   
Sbjct: 113 NGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALE------LAPN 166

Query: 658 ETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDRA 717
           E         +NLG     +G++  A      A            NL  V+     F  A
Sbjct: 167 E-----PSIANNLGMSLLLRGDLEDAETLLLPAYLSPAADSRVRQNLALVVGLQGDFREA 221

Query: 718 NTL 720
             +
Sbjct: 222 EDI 224



 Score = 38.6 bits (90), Expect = 0.009
 Identities = 19/53 (35%), Positives = 26/53 (49%)

Query: 247 RGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
            G   EA+   R A RL P   + +  L AAL   G  ++A +AY  AL+  P
Sbjct: 113 NGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALELAP 165



 Score = 37.8 bits (88), Expect = 0.016
 Identities = 28/125 (22%), Positives = 41/125 (32%), Gaps = 12/125 (9%)

Query: 597 ERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKA 656
              Q Q A      AV   P+ +     LA AL   GD + ++     +    P      
Sbjct: 45  AMRQTQGAAAALGAAVLRNPEDLSIA-KLATALYLRGDADSSLAVLQKSAIAYPK----- 98

Query: 657 IETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDR 716
                D  +  +  G      G    A+    KA  L P   +A+  LG  L +   FD 
Sbjct: 99  -----DRELL-AAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDE 152

Query: 717 ANTLF 721
           A   +
Sbjct: 153 ARRAY 157



 Score = 37.4 bits (87), Expect = 0.021
 Identities = 31/156 (19%), Positives = 52/156 (33%), Gaps = 34/156 (21%)

Query: 11  LLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIK 70
           L      + + G++  A     +  R    +                             
Sbjct: 103 LAAQGKNQIRNGNFGEAVSVLRKAARLAPTD----------------------------- 133

Query: 71  QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAV 130
                 EA++ LG    + G+  EA   YR A+ L P+      NL  +L+  GD+E A 
Sbjct: 134 -----WEAWNLLGAALDQLGRFDEARRAYRQALELAPNEPSIANNLGMSLLLRGDLEDAE 188

Query: 131 QAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDL 166
              + A         VR +L  ++   G   EA+D+
Sbjct: 189 TLLLPAYLSPAADSRVRQNLALVVGLQGDFREAEDI 224



 Score = 32.4 bits (74), Expect = 0.93
 Identities = 19/55 (34%), Positives = 28/55 (50%)

Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAV 639
            EA++ LG    + G+  EA   YR A+ L P+      NL  +L+  GD+E A 
Sbjct: 134 WEAWNLLGAALDQLGRFDEARRAYRQALELAPNEPSIANNLGMSLLLRGDLEDAE 188


>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat. 
          Length = 34

 Score = 43.9 bits (105), Expect = 4e-06
 Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 76  AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 109
           A+A  NLGN Y + G+  EALE Y  A+ L P+ 
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34



 Score = 43.9 bits (105), Expect = 4e-06
 Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 267
           A+A  NLGN Y + G+  EALE Y  A+ L P+ 
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34



 Score = 43.9 bits (105), Expect = 4e-06
 Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 618
           A+A  NLGN Y + G+  EALE Y  A+ L P+ 
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34



 Score = 40.5 bits (96), Expect = 7e-05
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 664 AVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNF 697
           A A  NLG  +   G+   A+ ++EKA+ L+PN 
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34



 Score = 32.8 bits (76), Expect = 0.031
 Identities = 10/33 (30%), Positives = 17/33 (51%)

Query: 110 IDGYINLAAALVAAGDMEQAVQAYVTALQYNPD 142
                NL  A +  G  ++A++ Y  AL+ NP+
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33



 Score = 32.4 bits (75), Expect = 0.045
 Identities = 10/34 (29%), Positives = 17/34 (50%)

Query: 619 IDGYINLAAALVAAGDMEQAVQAYVTALQYNPSC 652
                NL  A +  G  ++A++ Y  AL+ NP+ 
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34



 Score = 31.6 bits (73), Expect = 0.082
 Identities = 10/32 (31%), Positives = 16/32 (50%)

Query: 268 IDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
                NL  A +  G  ++A++ Y  AL+ NP
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNP 32


>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats.  Repeats present in 4
           or more copies in proteins. Contain a minimum of 34
           amino acids each and self-associate via a "knobs and
           holes" mechanism.
          Length = 34

 Score = 42.8 bits (102), Expect = 1e-05
 Identities = 17/34 (50%), Positives = 20/34 (58%)

Query: 76  AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 109
           AEA  NLGN Y + G   EALE Y  A+ L P+ 
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34



 Score = 42.8 bits (102), Expect = 1e-05
 Identities = 17/34 (50%), Positives = 20/34 (58%)

Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 267
           AEA  NLGN Y + G   EALE Y  A+ L P+ 
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34



 Score = 42.8 bits (102), Expect = 1e-05
 Identities = 17/34 (50%), Positives = 20/34 (58%)

Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 618
           AEA  NLGN Y + G   EALE Y  A+ L P+ 
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34



 Score = 39.0 bits (92), Expect = 2e-04
 Identities = 14/34 (41%), Positives = 20/34 (58%)

Query: 664 AVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNF 697
           A A  NLG  +   G+   A+ ++EKA+ LDPN 
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34



 Score = 31.3 bits (72), Expect = 0.096
 Identities = 10/33 (30%), Positives = 19/33 (57%)

Query: 110 IDGYINLAAALVAAGDMEQAVQAYVTALQYNPD 142
            +   NL  A +  GD ++A++ Y  AL+ +P+
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33



 Score = 31.3 bits (72), Expect = 0.097
 Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 619 IDGYINLAAALVAAGDMEQAVQAYVTALQYNPSC 652
            +   NL  A +  GD ++A++ Y  AL+ +P+ 
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34



 Score = 30.5 bits (70), Expect = 0.19
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 268 IDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
            +   NL  A +  GD ++A++ Y  AL+ +P
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDP 32


>gnl|CDD|193559 cd08011, M20_ArgE_DapE_like6, M20 Peptidases with similarity to
           acetylornithine deacetylases and
           succinyl-diaminopimelate desuccinylases.  Peptidase M20
           family, Uncharacterized protein subfamily with
           similarity to acetylornithine
           deacetylase/succinyl-diaminopimelate desuccinylase
           (ArgE/DapE) subfamily. This group includes the
           hypothetical protein ygeY from Escherichia coli, a
           putative deacetylase, but many in this subfamily are
           classified as unassigned peptidases. ArgE/DapE enzymes
           catalyze analogous reactions and share a common
           activator, the metal ion (usually Co2+ or Zn2+). ArgE
           catalyzes a broad range of substrates, including
           N-acetylornithine, alpha-N-acetylmethionine and
           alpha-N-formylmethionine, while DapE catalyzes the
           hydrolysis of N-succinyl-L,L-diaminopimelate (L,L-SDAP)
           to L,L-diaminopimelate and succinate. Proteins in this
           subfamily are mostly archaeal, and have been inferred by
           homology as being related to both ArgE and DapE.
          Length = 390

 Score = 46.6 bits (111), Expect = 4e-05
 Identities = 61/266 (22%), Positives = 105/266 (39%), Gaps = 34/266 (12%)

Query: 305 DGWNTEPFVLDFERRKALWRRGALVMMKRPVLGLASAIESISA-NQEKCAVNIKIECA-S 362
           DGW+ +PF     +   ++ RGA   MK    G+A+AI ++ A  +     N  +E A  
Sbjct: 87  DGWSVDPFKP-VVKDGRIYGRGA-SDMKG---GIAAAIAALEALAEAGVPPNGPVEAAFV 141

Query: 363 KDLHSGLFGGSVHEAMTDLIYIMGQLV---EPNGK--ILIPH---IYKDVEPLGDTE--- 411
            D  SG   G+ +  + +   I    V   EP+G   I I H   ++ +V   G      
Sbjct: 142 PDEESGGETGTGY--LVEEGGIRPDYVIIAEPSGLDNIWIGHRGIVWGEVRVKGKQAHGS 199

Query: 412 -----EQFYEKIDFDTEDFRTAIDHPKLTKADKTQVLMSRWR--YPSLSLHGIEGAFSGP 464
                   +EK      + + A+       + K++      R   P+++L G      G 
Sbjct: 200 TPWLGINAFEKASVIALELQEAL--KPRLSSRKSKYEYEDERTANPTVTLGGEAE---GT 254

Query: 465 GGKTVIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSGKSWR 524
           G   ++PG        R++P +  + V K ++D L    +     ++F+   L    +  
Sbjct: 255 GKTNIVPGYFRFSIDRRLIPEENLEEVRKEIVDLLERAEEGI--KHRFEVKSLMIVSAEF 312

Query: 525 TNPEHPNYVAAARATKYVYNVEPDLT 550
           T P+     A   A + V  VEP  T
Sbjct: 313 TPPDSSVVEALREAIREVLGVEPRKT 338


>gnl|CDD|179448 PRK02603, PRK02603, photosystem I assembly protein Ycf3;
           Provisional.
          Length = 172

 Score = 43.9 bits (104), Expect = 8e-05
 Identities = 39/130 (30%), Positives = 55/130 (42%), Gaps = 35/130 (26%)

Query: 599 GQLQEALENYRHAVRLKPDFID-GYI--NLAAALVAAGDMEQAVQAYVTALQYN---PSC 652
           G+  EALENY  A++L+ D  D  YI  N+     + G+ ++A++ Y  AL+ N   PS 
Sbjct: 49  GEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNPKQPSA 108

Query: 653 YLKAI-------------ETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLD 699
            L  I                 D A A       F+   E W      ++A+ L PN   
Sbjct: 109 -LNNIAVIYHKRGEKAEEAGDQDEAEAL------FDKAAEYW------KQAIRLAPN--- 152

Query: 700 AYINLGNVLK 709
            YI   N LK
Sbjct: 153 NYIEAQNWLK 162



 Score = 41.6 bits (98), Expect = 4e-04
 Identities = 26/88 (29%), Positives = 44/88 (50%), Gaps = 10/88 (11%)

Query: 90  GQLQEALENYRHAVRLKPDFID-GYI--NLAAALVAAGDMEQAVQAYVTALQYNPDLYCV 146
           G+  EALENY  A++L+ D  D  YI  N+     + G+ ++A++ Y  AL+ NP     
Sbjct: 49  GEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNPKQPSA 108

Query: 147 RSDLGNLLKALGR-------LDEAKDLY 167
            +++  +    G         DEA+ L+
Sbjct: 109 LNNIAVIYHKRGEKAEEAGDQDEAEALF 136



 Score = 40.8 bits (96), Expect = 0.001
 Identities = 21/55 (38%), Positives = 33/55 (60%), Gaps = 3/55 (5%)

Query: 248 GQLQEALENYRHAVRLKPDFID-GYI--NLAAALVAAGDMEQAVQAYVTALQYNP 299
           G+  EALENY  A++L+ D  D  YI  N+     + G+ ++A++ Y  AL+ NP
Sbjct: 49  GEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNP 103


>gnl|CDD|183140 PRK11447, PRK11447, cellulose synthase subunit BcsC; Provisional.
          Length = 1157

 Score = 46.2 bits (110), Expect = 9e-05
 Identities = 55/253 (21%), Positives = 92/253 (36%), Gaps = 42/253 (16%)

Query: 68  AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDG-----------Y--- 113
           A++ NP  +EA   LG  Y ++G    A+  +  A+ L P   +            Y   
Sbjct: 295 AVRANPKDSEALGALGQAYSQQGDRARAVAQFEKALALDPHSSNRDKWESLLKVNRYWLL 354

Query: 114 INLAAALVAAGDMEQAVQAYVTALQYNP-DLYCVRSDLGNLLKALGRLDEAKDLY--CVR 170
           I    A + A ++ QA + Y  A Q +  D Y V   LG++  A      A+  Y   +R
Sbjct: 355 IQQGDAALKANNLAQAERLYQQARQVDNTDSYAVLG-LGDVAMARKDYAAAERYYQQALR 413

Query: 171 SDLGN--LLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIK 228
            D GN   ++ L  L   ++           K    I          +L ++   ++   
Sbjct: 414 MDPGNTNAVRGLANLYRQQS---------PEKALAFI---------ASLSASQRRSIDDI 455

Query: 229 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN--LAAALVAAGDMEQ 286
           +  L  +  +      + +G+  +A E  R  + L P     ++   LA  L  AG   Q
Sbjct: 456 ERSLQNDRLAQQAEALENQGKWAQAAELQRQRLALDPG--SVWLTYRLAQDLRQAGQRSQ 513

Query: 287 AVQAYVTALQYNP 299
           A        Q  P
Sbjct: 514 ADALMRRLAQQKP 526



 Score = 44.7 bits (106), Expect = 2e-04
 Identities = 64/297 (21%), Positives = 110/297 (37%), Gaps = 35/297 (11%)

Query: 5   AQLGIGLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDK-SAH 63
           A LG+G + +A       DY +AER+  Q  R +  NT  +  L+++ ++ +  +K  A 
Sbjct: 387 AVLGLGDVAMAR-----KDYAAAERYYQQALRMDPGNTNAVRGLANL-YRQQSPEKALAF 440

Query: 64  FSTLAIKQ--------NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 115
            ++L+  Q          L  +  +      + +G+  +A E  R  + L P     ++ 
Sbjct: 441 IASLSASQRRSIDDIERSLQNDRLAQQAEALENQGKWAQAAELQRQRLALDPG--SVWLT 498

Query: 116 --LAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDL 173
             LA  L  AG   QA        Q  P+        G  L    R   A          
Sbjct: 499 YRLAQDLRQAGQRSQADALMRRLAQQKPNDPEQVYAYGLYLSGSDRDRAA---------- 548

Query: 174 GNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLL 233
              L  L  L  A+     NI+ +  ++Q+  V     R   + + A    L ++Q P  
Sbjct: 549 ---LAHLNTLPRAQ--WNSNIQELAQRLQSDQVLETANRLRDSGKEAEAEAL-LRQQPPS 602

Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQA 290
                 L +  ++RG    A   Y+  +  +P   D  + L    +A GD+  A   
Sbjct: 603 TRIDLTLADWAQQRGDYAAARAAYQRVLTREPGNADARLGLIEVDIAQGDLAAARAQ 659



 Score = 34.3 bits (79), Expect = 0.34
 Identities = 30/126 (23%), Positives = 53/126 (42%), Gaps = 11/126 (8%)

Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF--IDGYINLAA-----AL 629
           A++ NP  +EA   LG  Y ++G    A+  +  A+ L P     D + +L        L
Sbjct: 295 AVRANPKDSEALGALGQAYSQQGDRARAVAQFEKALALDPHSSNRDKWESLLKVNRYWLL 354

Query: 630 VAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEK 689
           +  GD          A +     Y +A +     + A   LG V  A+ +   A  ++++
Sbjct: 355 IQQGDAALKANNLAQAERL----YQQARQVDNTDSYAVLGLGDVAMARKDYAAAERYYQQ 410

Query: 690 AVSLDP 695
           A+ +DP
Sbjct: 411 ALRMDP 416



 Score = 33.9 bits (78), Expect = 0.51
 Identities = 18/81 (22%), Positives = 28/81 (34%), Gaps = 2/81 (2%)

Query: 88  ERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQY-NPDLYCV 146
           +RG    A   Y+  +  +P   D  + L    +A GD+  A           N  L   
Sbjct: 615 QRGDYAAARAAYQRVLTREPGNADARLGLIEVDIAQGDLAAARAQLAKLPATANDSLNTQ 674

Query: 147 RSDLGNLLKALGRLDEAKDLY 167
           R  +     ALG    A+  +
Sbjct: 675 RR-VALAWAALGDTAAAQRTF 694



 Score = 31.2 bits (71), Expect = 3.3
 Identities = 15/56 (26%), Positives = 21/56 (37%), Gaps = 4/56 (7%)

Query: 597 ERGQLQEALENYRHAVRLKPDFIDGYIN--LAAALVAAGDMEQAVQAYVTALQYNP 650
            +G+  +A E  R  + L P     ++   LA  L  AG   QA        Q  P
Sbjct: 473 NQGKWAQAAELQRQRLALDPG--SVWLTYRLAQDLRQAGQRSQADALMRRLAQQKP 526



 Score = 30.8 bits (70), Expect = 4.4
 Identities = 19/75 (25%), Positives = 32/75 (42%), Gaps = 4/75 (5%)

Query: 597 ERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKA 656
           +RG    A   Y+  +  +P   D  + L    +A GD+  A       L   P+    +
Sbjct: 615 QRGDYAAARAAYQRVLTREPGNADARLGLIEVDIAQGDLAAARA----QLAKLPATANDS 670

Query: 657 IETRPDFAVAWSNLG 671
           + T+   A+AW+ LG
Sbjct: 671 LNTQRRVALAWAALG 685


>gnl|CDD|223697 COG0624, ArgE, Acetylornithine deacetylase/Succinyl-diaminopimelate
           desuccinylase and related deacylases [Amino acid
           transport and metabolism].
          Length = 409

 Score = 44.8 bits (106), Expect = 1e-04
 Identities = 20/150 (13%), Positives = 39/150 (26%), Gaps = 43/150 (28%)

Query: 354 VNIKIECASKDLHSG-LFGGSVHEAMTDLIYIMGQLVEPNGKILIPHIYKDVEPLGDTEE 412
           + +++    K  H+           +   I  + +L+E  G +                 
Sbjct: 198 LWLEVTVKGKAGHASTTPPDLGRNPIHAAIEALAELIEELGDLAGEGFD----------- 246

Query: 413 QFYEKIDFDTEDFRTAIDHPKLTKADKTQVLMSRWRYPSLSLHGIEGAFSGPGGKTVIPG 472
                                L                 L L G   + +G     VIPG
Sbjct: 247 -----------------GPLGLN--------------VGLILAGPGASVNGGDKVNVIPG 275

Query: 473 KVVGKFSIRIVPNQTPQCVEKYVLDYLNEL 502
           +      IR++P +    V + +   L  +
Sbjct: 276 EAEATVDIRLLPGEDLDDVLEELEAELRAI 305


>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat.  This Pfam entry
           includes outlying Tetratricopeptide-like repeats (TPR)
           that are not matched by pfam00515.
          Length = 34

 Score = 39.0 bits (92), Expect = 2e-04
 Identities = 16/34 (47%), Positives = 20/34 (58%)

Query: 76  AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 109
           AEA  NLG  Y + G  +EALE Y  A+ L P+ 
Sbjct: 1   AEALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34



 Score = 39.0 bits (92), Expect = 2e-04
 Identities = 16/34 (47%), Positives = 20/34 (58%)

Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 267
           AEA  NLG  Y + G  +EALE Y  A+ L P+ 
Sbjct: 1   AEALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34



 Score = 39.0 bits (92), Expect = 2e-04
 Identities = 16/34 (47%), Positives = 20/34 (58%)

Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 618
           AEA  NLG  Y + G  +EALE Y  A+ L P+ 
Sbjct: 1   AEALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34



 Score = 30.2 bits (69), Expect = 0.30
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 111 DGYINLAAALVAAGDMEQAVQAYVTALQYNPD 142
           +   NL  A    GD E+A++AY  AL+ +P+
Sbjct: 2   EALYNLGLAYYKLGDYEEALEAYEKALELDPN 33



 Score = 29.0 bits (66), Expect = 0.64
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 269 DGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
           +   NL  A    GD E+A++AY  AL+ +P
Sbjct: 2   EALYNLGLAYYKLGDYEEALEAYEKALELDP 32



 Score = 29.0 bits (66), Expect = 0.64
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 620 DGYINLAAALVAAGDMEQAVQAYVTALQYNP 650
           +   NL  A    GD E+A++AY  AL+ +P
Sbjct: 2   EALYNLGLAYYKLGDYEEALEAYEKALELDP 32


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 44.6 bits (105), Expect = 2e-04
 Identities = 49/230 (21%), Positives = 93/230 (40%), Gaps = 28/230 (12%)

Query: 70  KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQA 129
           K     A A +  G     +G+  EAL +   ++ L P     YI  A+  +  GD ++A
Sbjct: 325 KLGEKEAIALNLRGTFKCLKGKHLEALADLSKSIELDPRVTQSYIKRASMNLELGDPDKA 384

Query: 130 VQAYVTALQYN---PDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEA 186
            + +  AL+ N   PD+Y  R+ L + +K  G   +A   Y                 ++
Sbjct: 385 EEDFDKALKLNSEDPDIYYHRAQL-HFIK--GEFAQAGKDY----------------QKS 425

Query: 187 KNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKE 246
            +L  + I      +Q  +     G   +++  A F     K  P   + Y+  G +  +
Sbjct: 426 IDLDPDFIFS---HIQLGVTQYKEGSIASSM--ATFRR-CKKNFPEAPDVYNYYGELLLD 479

Query: 247 RGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 296
           + +  EA+E +  A+ L+ +    Y+N+   +  A  + Q  Q ++ A  
Sbjct: 480 QNKFDEAIEKFDTAIELEKETKPMYMNVLPLINKALALFQWKQDFIEAEN 529



 Score = 41.5 bits (97), Expect = 0.002
 Identities = 34/143 (23%), Positives = 60/143 (41%), Gaps = 11/143 (7%)

Query: 579 KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQA 638
           K     A A +  G     +G+  EAL +   ++ L P     YI  A+  +  GD ++A
Sbjct: 325 KLGEKEAIALNLRGTFKCLKGKHLEALADLSKSIELDPRVTQSYIKRASMNLELGDPDKA 384

Query: 639 VQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFL 698
            + +  AL+ N           PD     + L  +   +GE   A   ++K++ LDP+F+
Sbjct: 385 EEDFDKALKLNSE--------DPDIYYHRAQLHFI---KGEFAQAGKDYQKSIDLDPDFI 433

Query: 699 DAYINLGNVLKEARIFDRANTLF 721
            ++I LG    +      +   F
Sbjct: 434 FSHIQLGVTQYKEGSIASSMATF 456



 Score = 40.7 bits (95), Expect = 0.003
 Identities = 26/88 (29%), Positives = 38/88 (43%), Gaps = 1/88 (1%)

Query: 76  AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 135
           A      GN         +A++ Y  A+  KPD +  Y N AA   A GD E+ V+    
Sbjct: 127 AAKLKEKGNKAYRNKDFNKAIKLYSKAIECKPDPV-YYSNRAACHNALGDWEKVVEDTTA 185

Query: 136 ALQYNPDLYCVRSDLGNLLKALGRLDEA 163
           AL+ +PD     +   N    LG+  +A
Sbjct: 186 ALELDPDYSKALNRRANAYDGLGKYADA 213



 Score = 40.0 bits (93), Expect = 0.005
 Identities = 29/87 (33%), Positives = 40/87 (45%), Gaps = 5/87 (5%)

Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 644
           A      GN         +A++ Y  A+  KPD +  Y N AA   A GD E+ V+    
Sbjct: 127 AAKLKEKGNKAYRNKDFNKAIKLYSKAIECKPDPV-YYSNRAACHNALGDWEKVVEDTTA 185

Query: 645 ALQYNPSCYLKAIETRPDFAVAWSNLG 671
           AL+ +P  Y KA+  R   A A+  LG
Sbjct: 186 ALELDPD-YSKALNRR---ANAYDGLG 208



 Score = 39.6 bits (92), Expect = 0.008
 Identities = 45/190 (23%), Positives = 81/190 (42%), Gaps = 13/190 (6%)

Query: 530 PNYVAAARATKYVYNVEPDLTREGGSIPITLTFECRKLDKSAHFSTL--AIKQNPLLAEA 587
            +Y  AARA +   ++     +E  ++ +  TF+C K       + L  +I+ +P + ++
Sbjct: 308 ESYEEAARAFEKALDLGKLGEKEAIALNLRGTFKCLKGKHLEALADLSKSIELDPRVTQS 367

Query: 588 YSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 647
           Y    ++  E G   +A E++  A++L  +  D Y + A      G+  QA + Y     
Sbjct: 368 YIKRASMNLELGDPDKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQ---- 423

Query: 648 YNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGNV 707
                  K+I+  PDF  +   LG     +G I  ++  F +     P   D Y   G +
Sbjct: 424 -------KSIDLDPDFIFSHIQLGVTQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGEL 476

Query: 708 LKEARIFDRA 717
           L +   FD A
Sbjct: 477 LLDQNKFDEA 486



 Score = 36.9 bits (85), Expect = 0.056
 Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 1/66 (1%)

Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT 293
           A      GN         +A++ Y  A+  KPD +  Y N AA   A GD E+ V+    
Sbjct: 127 AAKLKEKGNKAYRNKDFNKAIKLYSKAIECKPDPV-YYSNRAACHNALGDWEKVVEDTTA 185

Query: 294 ALQYNP 299
           AL+ +P
Sbjct: 186 ALELDP 191



 Score = 35.0 bits (80), Expect = 0.18
 Identities = 27/122 (22%), Positives = 54/122 (44%)

Query: 59  DKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAA 118
           DK+      A+K N    + Y +   ++  +G+  +A ++Y+ ++ L PDFI  +I L  
Sbjct: 382 DKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSHIQLGV 441

Query: 119 ALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLK 178
                G +  ++  +    +  P+   V +  G LL    + DEA + +    +L    K
Sbjct: 442 TQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGELLLDQNKFDEAIEKFDTAIELEKETK 501

Query: 179 AL 180
            +
Sbjct: 502 PM 503



 Score = 35.0 bits (80), Expect = 0.20
 Identities = 27/134 (20%), Positives = 57/134 (42%), Gaps = 4/134 (2%)

Query: 560 LTFECRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFI 619
             ++   +  S        K  P   + Y+  G +  ++ +  EA+E +  A+ L+ +  
Sbjct: 442 TQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGELLLDQNKFDEAIEKFDTAIELEKETK 501

Query: 620 DGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGE 679
             Y+N+   +  A  + Q  Q ++ A         KA+   P+  +A + +  +   QG+
Sbjct: 502 PMYMNVLPLINKALALFQWKQDFIEAENLCE----KALIIDPECDIAVATMAQLLLQQGD 557

Query: 680 IWLAIHHFEKAVSL 693
           +  A+  FE+A  L
Sbjct: 558 VDEALKLFERAAEL 571



 Score = 33.0 bits (75), Expect = 0.75
 Identities = 30/151 (19%), Positives = 63/151 (41%), Gaps = 11/151 (7%)

Query: 48  LSSIHFQCRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKP 107
           L    ++   +  S        K  P   + Y+  G +  ++ +  EA+E +  A+ L+ 
Sbjct: 439 LGVTQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGELLLDQNKFDEAIEKFDTAIELEK 498

Query: 108 DFIDGYINLAAALVAAGDMEQAVQAYV-------TALQYNPDLYCVRSDLGNLLKALGRL 160
           +    Y+N+   +  A  + Q  Q ++        AL  +P+     + +  LL   G +
Sbjct: 499 ETKPMYMNVLPLINKALALFQWKQDFIEAENLCEKALIIDPECDIAVATMAQLLLQQGDV 558

Query: 161 DEAKDLYCVRSDL----GNLLKALGRLDEAK 187
           DEA  L+   ++L    G L++A+   +  +
Sbjct: 559 DEALKLFERAAELARTEGELVQAISYAEATR 589



 Score = 33.0 bits (75), Expect = 0.81
 Identities = 54/267 (20%), Positives = 101/267 (37%), Gaps = 28/267 (10%)

Query: 53  FQCRKLDKSAHFSTLAI--KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFI 110
           ++ +  +K+    + AI  K +P+    YSN    +   G  ++ +E+   A+ L PD+ 
Sbjct: 138 YRNKDFNKAIKLYSKAIECKPDPVY---YSNRAACHNALGDWEKVVEDTTAALELDPDYS 194

Query: 111 DGYINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSD--LGNLLKALGRLDEAKDLYC 168
                 A A    G    A+   +TA          +S   +  LLK      +AK++  
Sbjct: 195 KALNRRANAYDGLGKYADALLD-LTASCIIDGFRNEQSAQAVERLLKKFA-ESKAKEILE 252

Query: 169 VR-------SDLGNLLKAL------GRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPT 215
            +       + +GN L++         L+++  L  E           +          +
Sbjct: 253 TKPENLPSVTFVGNYLQSFRPKPRPAGLEDSNELDEETGNGQLQLGLKSPESKAD---ES 309

Query: 216 TLESAHFSTLAIKQNPL---LAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYI 272
             E+A     A+    L    A A +  G     +G+  EAL +   ++ L P     YI
Sbjct: 310 YEEAARAFEKALDLGKLGEKEAIALNLRGTFKCLKGKHLEALADLSKSIELDPRVTQSYI 369

Query: 273 NLAAALVAAGDMEQAVQAYVTALQYNP 299
             A+  +  GD ++A + +  AL+ N 
Sbjct: 370 KRASMNLELGDPDKAEEDFDKALKLNS 396



 Score = 32.7 bits (74), Expect = 1.2
 Identities = 27/124 (21%), Positives = 53/124 (42%), Gaps = 18/124 (14%)

Query: 568 DKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAA 627
           DK+      A+K N    + Y +   ++  +G+  +A ++Y+ ++ L PDFI  +I L  
Sbjct: 382 DKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSHIQLGV 441

Query: 628 ALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHF 687
                G +  ++  +    +  P          PD          V+N  GE+ L  + F
Sbjct: 442 TQYKEGSIASSMATFRRCKKNFPEA--------PD----------VYNYYGELLLDQNKF 483

Query: 688 EKAV 691
           ++A+
Sbjct: 484 DEAI 487



 Score = 31.9 bits (72), Expect = 1.8
 Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 5/75 (6%)

Query: 626 AAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIH 685
           AA L   G+     + +  A++     Y KAIE +PD  V +SN     NA G+    + 
Sbjct: 127 AAKLKEKGNKAYRNKDFNKAIKL----YSKAIECKPD-PVYYSNRAACHNALGDWEKVVE 181

Query: 686 HFEKAVSLDPNFLDA 700
               A+ LDP++  A
Sbjct: 182 DTTAALELDPDYSKA 196


>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat. 
          Length = 34

 Score = 37.8 bits (89), Expect = 5e-04
 Identities = 14/34 (41%), Positives = 19/34 (55%)

Query: 76  AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 109
           A AY  LG +Y + G  +EA E Y  A+ L P+ 
Sbjct: 1   ARAYYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34



 Score = 37.8 bits (89), Expect = 5e-04
 Identities = 14/34 (41%), Positives = 19/34 (55%)

Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 267
           A AY  LG +Y + G  +EA E Y  A+ L P+ 
Sbjct: 1   ARAYYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34



 Score = 37.8 bits (89), Expect = 5e-04
 Identities = 14/34 (41%), Positives = 19/34 (55%)

Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDF 618
           A AY  LG +Y + G  +EA E Y  A+ L P+ 
Sbjct: 1   ARAYYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34



 Score = 27.1 bits (61), Expect = 3.4
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 666 AWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNF 697
           A+  LG ++   G+   A  ++EKA+ LDPN 
Sbjct: 3   AYYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34



 Score = 26.7 bits (60), Expect = 4.8
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 113 YINLAAALVAAGDMEQAVQAYVTALQYNPD 142
           Y  L    +  GD E+A + Y  AL+ +P+
Sbjct: 4   YYLLGQIYLQLGDYEEAKEYYEKALELDPN 33


>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat. 
          Length = 65

 Score = 38.0 bits (89), Expect = 9e-04
 Identities = 16/41 (39%), Positives = 21/41 (51%)

Query: 68  AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 108
           A+ + PL AEA   LG     +G+L EA    R A+   PD
Sbjct: 23  ALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPD 63



 Score = 38.0 bits (89), Expect = 9e-04
 Identities = 16/41 (39%), Positives = 21/41 (51%)

Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 266
           A+ + PL AEA   LG     +G+L EA    R A+   PD
Sbjct: 23  ALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPD 63



 Score = 38.0 bits (89), Expect = 9e-04
 Identities = 16/41 (39%), Positives = 21/41 (51%)

Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 617
           A+ + PL AEA   LG     +G+L EA    R A+   PD
Sbjct: 23  ALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPD 63



 Score = 36.9 bits (86), Expect = 0.002
 Identities = 16/61 (26%), Positives = 23/61 (37%)

Query: 82  LGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 141
           L       G   EAL     A+   P   +  + L  AL+  G + +A      AL  +P
Sbjct: 3   LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADP 62

Query: 142 D 142
           D
Sbjct: 63  D 63



 Score = 36.9 bits (86), Expect = 0.003
 Identities = 14/57 (24%), Positives = 20/57 (35%)

Query: 240 LGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 296
           L       G   EAL     A+   P   +  + L  AL+  G + +A      AL 
Sbjct: 3   LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALA 59



 Score = 36.9 bits (86), Expect = 0.003
 Identities = 14/57 (24%), Positives = 20/57 (35%)

Query: 591 LGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQ 647
           L       G   EAL     A+   P   +  + L  AL+  G + +A      AL 
Sbjct: 3   LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALA 59



 Score = 36.1 bits (84), Expect = 0.004
 Identities = 20/52 (38%), Positives = 24/52 (46%)

Query: 116 LAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLY 167
           LA A + AGD ++A+ A   AL   P        LG  L   GRL EA  L 
Sbjct: 3   LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALL 54



 Score = 36.1 bits (84), Expect = 0.004
 Identities = 21/72 (29%), Positives = 29/72 (40%), Gaps = 11/72 (15%)

Query: 625 LAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAI 684
           LA A + AGD ++A+ A   AL   P             A A   LG     QG +  A 
Sbjct: 3   LARAALRAGDYDEALAALEAALARYP-----------LAAEALLLLGEALLRQGRLAEAA 51

Query: 685 HHFEKAVSLDPN 696
                A++ DP+
Sbjct: 52  ALLRAALAADPD 63



 Score = 27.3 bits (61), Expect = 7.1
 Identities = 10/39 (25%), Positives = 15/39 (38%)

Query: 683 AIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDRANTLF 721
           A+   E A++  P   +A + LG  L        A  L 
Sbjct: 16  ALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALL 54


>gnl|CDD|219515 pfam07687, M20_dimer, Peptidase dimerisation domain.  This domain
           consists of 4 beta strands and two alpha helices which
           make up the dimerisation surface of members of the M20
           family of peptidases. This family includes a range of
           zinc metallopeptidases belonging to several families in
           the peptidase classification. Family M20 are Glutamate
           carboxypeptidases. Peptidase family M25 contains X-His
           dipeptidases.
          Length = 107

 Score = 38.9 bits (91), Expect = 0.001
 Identities = 15/56 (26%), Positives = 30/56 (53%), Gaps = 4/56 (7%)

Query: 447 WRYPSLSLHGIEGAFSGPGGKTVIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNEL 502
           +   +L++ GIEG  +    + VIP +   KF IR++P +  + + K + + L + 
Sbjct: 51  FPRTTLNITGIEGGTA----RNVIPAEAEAKFDIRLLPGEDLEELLKEIEEILEKE 102


>gnl|CDD|193557 cd05682, M20_dipept_like_5, Uncharacterized M20 Dipeptidases.
           Peptidase M20 family, unknown dipeptidase-like subfamily
           3 (inferred by homology to be dipeptidases). M20
           dipeptidases include a large variety of bacterial
           enzymes including cytosolic nonspecific dipeptidase
           (CNDP), Xaa-methyl-His dipeptidase (anserinase),and
           canosinase. These dipeptidases have been shown to act on
           a wide range of dipeptides, but not larger peptides. For
           example, anserinase mainly catalyzes the hydrolysis of
           N-alpha-acetylhistidine while carnosinase degrades
           beta-alanyl-L-histidine.
          Length = 451

 Score = 41.5 bits (98), Expect = 0.002
 Identities = 53/228 (23%), Positives = 91/228 (39%), Gaps = 44/228 (19%)

Query: 352 CAVNIKIECASKDLHSGLFGGSVHEAMTDLIYIMGQLVE-----PNGKIL-------IP- 398
              ++ ++   + +HSG   G V  +      I+ QL++       G+IL       IP 
Sbjct: 194 LGGDLTVQVLDEGVHSGDASGIVPSSFR----ILRQLLDRIEDPETGEILPDELHCEIPA 249

Query: 399 HIYKD----VEPLGDTEEQFYEKIDFDTEDFRTAIDHPKLTKADKTQVLMSR-WRYPSLS 453
           H Y+      E LG+     +++  F        +   +    D  ++L++R WR P+LS
Sbjct: 250 HRYEQAKKAAEILGEP---VFDRFPF--------LSGVRPVTTDPVELLLNRTWR-PALS 297

Query: 454 LHGIEGAFSGPGGKTVIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPN--K 511
           + G +G  S      V+  +   K S+R+ P       E      L +L    + P   K
Sbjct: 298 VTGADGLPSVATAGNVLRPETTLKLSLRLPPT---VDAEA-ASAALKKLLTT-DPPYGAK 352

Query: 512 FKAYLLDSGKSWRTNPEHPNYVAAA--RATKYVYNVEPDLTREGGSIP 557
                  +G  W      P ++A A  +A++  +   P    EGGSIP
Sbjct: 353 VTFEPDGAGSGWNAPALAP-WLAKALNKASQDFFGKPPAYMGEGGSIP 399


>gnl|CDD|176974 CHL00033, ycf3, photosystem I assembly protein Ycf3.
          Length = 168

 Score = 39.6 bits (93), Expect = 0.002
 Identities = 36/127 (28%), Positives = 57/127 (44%), Gaps = 27/127 (21%)

Query: 598 RGQLQEALENYRHAVRLKPDFID-GYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKA 656
            G+  EAL+NY  A+RL+ D  D  YI     L+   + E     +  AL+Y    Y +A
Sbjct: 48  EGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGE-----HTKALEY----YFQA 98

Query: 657 IETRPDFAVAWSNLGCVFNAQGE------------IWL--AIHHFEKAVSLDPNFLDAYI 702
           +E  P    A +N+  + + +GE             W   A  ++++A++L P     YI
Sbjct: 99  LERNPFLPQALNNMAVICHYRGEQAIEQGDSEIAEAWFDQAAEYWKQAIALAPG---NYI 155

Query: 703 NLGNVLK 709
              N LK
Sbjct: 156 EAQNWLK 162



 Score = 37.3 bits (87), Expect = 0.013
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 3/58 (5%)

Query: 247 RGQLQEALENYRHAVRLKPDFID-GYI--NLAAALVAAGDMEQAVQAYVTALQYNPLL 301
            G+  EAL+NY  A+RL+ D  D  YI  N+     + G+  +A++ Y  AL+ NP L
Sbjct: 48  EGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFL 105



 Score = 36.5 bits (85), Expect = 0.020
 Identities = 22/58 (37%), Positives = 33/58 (56%), Gaps = 3/58 (5%)

Query: 89  RGQLQEALENYRHAVRLKPDFID-GYI--NLAAALVAAGDMEQAVQAYVTALQYNPDL 143
            G+  EAL+NY  A+RL+ D  D  YI  N+     + G+  +A++ Y  AL+ NP L
Sbjct: 48  EGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFL 105


>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat. 
          Length = 34

 Score = 36.0 bits (84), Expect = 0.002
 Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 68 AIKQNPLLAEAYSNLGNVYKERGQLQEALE 97
          A++ +P  AEAY NL  +    GQ  EAL+
Sbjct: 5  ALELDPNNAEAYYNLALLLLNLGQYDEALQ 34



 Score = 36.0 bits (84), Expect = 0.002
 Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALE 255
           A++ +P  AEAY NL  +    GQ  EAL+
Sbjct: 5   ALELDPNNAEAYYNLALLLLNLGQYDEALQ 34



 Score = 36.0 bits (84), Expect = 0.002
 Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALE 606
           A++ +P  AEAY NL  +    GQ  EAL+
Sbjct: 5   ALELDPNNAEAYYNLALLLLNLGQYDEALQ 34



 Score = 32.6 bits (75), Expect = 0.046
 Identities = 11/33 (33%), Positives = 17/33 (51%)

Query: 652 CYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAI 684
            Y KA+E  P+ A A+ NL  +    G+   A+
Sbjct: 1   LYEKALELDPNNAEAYYNLALLLLNLGQYDEAL 33



 Score = 32.2 bits (74), Expect = 0.049
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 99  YRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 131
           Y  A+ L P+  + Y NLA  L+  G  ++A+Q
Sbjct: 2   YEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34



 Score = 32.2 bits (74), Expect = 0.049
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 257 YRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 289
           Y  A+ L P+  + Y NLA  L+  G  ++A+Q
Sbjct: 2   YEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34



 Score = 32.2 bits (74), Expect = 0.049
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 608 YRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 640
           Y  A+ L P+  + Y NLA  L+  G  ++A+Q
Sbjct: 2   YEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34



 Score = 31.8 bits (73), Expect = 0.079
 Identities = 14/32 (43%), Positives = 19/32 (59%)

Query: 686 HFEKAVSLDPNFLDAYINLGNVLKEARIFDRA 717
            +EKA+ LDPN  +AY NL  +L     +D A
Sbjct: 1   LYEKALELDPNNAEAYYNLALLLLNLGQYDEA 32



 Score = 29.9 bits (68), Expect = 0.32
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 132 AYVTALQYNPDLYCVRSDLGNLLKALGRLDEA 163
            Y  AL+ +P+      +L  LL  LG+ DEA
Sbjct: 1   LYEKALELDPNNAEAYYNLALLLLNLGQYDEA 32


>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
          Length = 389

 Score = 40.2 bits (95), Expect = 0.005
 Identities = 25/89 (28%), Positives = 34/89 (38%), Gaps = 19/89 (21%)

Query: 625 LAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAI 684
           LA   +A GD++ A      AL  +P C             A   LG +  AQG+   AI
Sbjct: 186 LAQQALARGDLDAARALLKKALAADPQC-----------VRASILLGDLALAQGDYAAAI 234

Query: 685 HHFEKAVSLDPNFL--------DAYINLG 705
              E+    DP +L        + Y  LG
Sbjct: 235 EALERVEEQDPEYLSEVLPKLMECYQALG 263



 Score = 36.3 bits (85), Expect = 0.058
 Identities = 42/166 (25%), Positives = 61/166 (36%), Gaps = 33/166 (19%)

Query: 4   IAQLGIGLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAH 63
             Q  + L EL     +AG  + AE   +QL  +     G L  L  I+ Q +   K   
Sbjct: 103 REQRLLALQELGQDYLKAGLLDRAEELFLQLVDEGDFAEGALQQLLEIYQQEKDWQK--- 159

Query: 64  FSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAA 123
               AI     +AE    LG        L+  + ++             Y  LA   +A 
Sbjct: 160 ----AID----VAERLEKLG-----GDSLRVEIAHF-------------YCELAQQALAR 193

Query: 124 GDMEQAVQAYVTALQYNPDLYCVRS--DLGNLLKALGRLDEAKDLY 167
           GD++ A      AL  +P   CVR+   LG+L  A G    A +  
Sbjct: 194 GDLDAARALLKKALAADPQ--CVRASILLGDLALAQGDYAAAIEAL 237


>gnl|CDD|205553 pfam13374, TPR_10, Tetratricopeptide repeat. 
          Length = 42

 Score = 35.2 bits (82), Expect = 0.005
 Identities = 8/31 (25%), Positives = 16/31 (51%)

Query: 75  LAEAYSNLGNVYKERGQLQEALENYRHAVRL 105
            A + +NL    + +G+ +EA E    A+ +
Sbjct: 1   TAASLNNLARALRAQGRYEEAEELLEEALAI 31



 Score = 35.2 bits (82), Expect = 0.005
 Identities = 8/31 (25%), Positives = 16/31 (51%)

Query: 233 LAEAYSNLGNVYKERGQLQEALENYRHAVRL 263
            A + +NL    + +G+ +EA E    A+ +
Sbjct: 1   TAASLNNLARALRAQGRYEEAEELLEEALAI 31



 Score = 35.2 bits (82), Expect = 0.005
 Identities = 8/31 (25%), Positives = 16/31 (51%)

Query: 584 LAEAYSNLGNVYKERGQLQEALENYRHAVRL 614
            A + +NL    + +G+ +EA E    A+ +
Sbjct: 1   TAASLNNLARALRAQGRYEEAEELLEEALAI 31



 Score = 27.2 bits (61), Expect = 4.3
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 170 RSDLGNLLKALGRLDEAKNLHTE 192
            ++L   L+A GR +EA+ L  E
Sbjct: 5   LNNLARALRAQGRYEEAEELLEE 27


>gnl|CDD|236624 PRK09782, PRK09782, bacteriophage N4 receptor, outer membrane
           subunit; Provisional.
          Length = 987

 Score = 39.5 bits (92), Expect = 0.009
 Identities = 31/165 (18%), Positives = 65/165 (39%), Gaps = 7/165 (4%)

Query: 9   IGLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLA 68
             LL  A+    AG+  + +R   Q  ++   +  +   L +  +   + + + +  T +
Sbjct: 543 EDLLAAANTAQAAGNGAARDRWLQQAEQRGLGDNALYWWLHAQRYIPGQPELALNDLTRS 602

Query: 69  IKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQ 128
           +   P  A AY     +Y++R  +  A+ + R A+ L+P+  +    L  AL  +GD+ Q
Sbjct: 603 LNIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDSGDIAQ 661

Query: 129 AVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDL 173
           + +    A +  PD   +   L              D+   +   
Sbjct: 662 SREMLERAHKGLPDDPALIRQLAY------VNQRLDDMAATQHYA 700



 Score = 34.5 bits (79), Expect = 0.32
 Identities = 20/79 (25%), Positives = 37/79 (46%), Gaps = 4/79 (5%)

Query: 211 GRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDG 270
           G+    L      T ++   P  A AY     +Y++R  +  A+ + R A+ L+P+  + 
Sbjct: 590 GQPELALN---DLTRSLNIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNY 645

Query: 271 YINLAAALVAAGDMEQAVQ 289
              L  AL  +GD+ Q+ +
Sbjct: 646 QAALGYALWDSGDIAQSRE 664



 Score = 34.1 bits (78), Expect = 0.44
 Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 1/68 (1%)

Query: 573 FSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAA 632
             T ++   P  A AY     +Y++R  +  A+ + R A+ L+P+  +    L  AL  +
Sbjct: 598 DLTRSLNIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDS 656

Query: 633 GDMEQAVQ 640
           GD+ Q+ +
Sbjct: 657 GDIAQSRE 664


>gnl|CDD|234406 TIGR03939, PGA_TPR_OMP, poly-beta-1,6 N-acetyl-D-glucosamine export
           porin PgaA.  Members of this protein family are the
           poly-beta-1,6 N-acetyl-D-glucosamine (PGA) export porin
           PgaA of Gram-negative bacteria. There is no counterpart
           in the poly-beta-1,6 N-acetyl-D-glucosamine biosynthesis
           systems of Gram-positive bacteria such as Staphylococcus
           epidermidis. The PGA polysaccharide adhesin is a
           critical determinant of biofilm formation. The conserved
           C-terminal domain of this outer membrane protein is
           preceded by a variable number of TPR repeats.
          Length = 800

 Score = 39.3 bits (92), Expect = 0.010
 Identities = 33/125 (26%), Positives = 53/125 (42%), Gaps = 13/125 (10%)

Query: 581 NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 640
             L A     L   Y+   Q  +ALE YR  ++  P+  DG + LA  L  AG   +A++
Sbjct: 82  VGLPARVLEALAKAYRNEKQWDKALELYRKLLQRDPNNPDGLLGLALTLADAGKDAEALK 141

Query: 641 AYVTALQYNPSCYLKAIETRPDFAVAW-SNLGCVFNAQGEIWLAIHHFEKAVSLDPNFLD 699
                       YLK    R     A    L  V  A  +   A+  +++A++L+P+  +
Sbjct: 142 ------------YLKEYVARFPTDAARYEALAYVLRAAEDHLDALQAWQQALTLEPDNPE 189

Query: 700 AYINL 704
           A + L
Sbjct: 190 AALEL 194



 Score = 34.6 bits (80), Expect = 0.29
 Identities = 31/121 (25%), Positives = 45/121 (37%), Gaps = 8/121 (6%)

Query: 72  NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQ 131
             L A     L   Y+   Q  +ALE YR  ++  P+  DG + LA  L  AG   +A++
Sbjct: 82  VGLPARVLEALAKAYRNEKQWDKALELYRKLLQRDPNNPDGLLGLALTLADAGKDAEALK 141

Query: 132 A-------YVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLG-NLLKALGRL 183
                   + T       L  V     + L AL    +A  L     +    L + L RL
Sbjct: 142 YLKEYVARFPTDAARYEALAYVLRAAEDHLDALQAWQQALTLEPDNPEAALELYRILTRL 201

Query: 184 D 184
            
Sbjct: 202 G 202


>gnl|CDD|226687 COG4235, COG4235, Cytochrome c biogenesis factor [Posttranslational
           modification, protein turnover, chaperones].
          Length = 287

 Score = 38.5 bits (90), Expect = 0.011
 Identities = 30/98 (30%), Positives = 42/98 (42%), Gaps = 3/98 (3%)

Query: 69  IKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME- 127
           ++QNP  AE +  LG  Y   G+  +AL  YR+A+RL  D  +  + LA AL      + 
Sbjct: 149 LQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQM 208

Query: 128 --QAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEA 163
             +A      AL  +P      S L       G   EA
Sbjct: 209 TAKARALLRQALALDPANIRALSLLAFAAFEQGDYAEA 246



 Score = 38.1 bits (89), Expect = 0.013
 Identities = 34/121 (28%), Positives = 51/121 (42%), Gaps = 12/121 (9%)

Query: 578 IKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD--M 635
           ++QNP  AE +  LG  Y   G+  +AL  YR+A+RL  D  +  + LA AL       M
Sbjct: 149 LQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQM 208

Query: 636 EQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDP 695
               +A    L+       +A+   P    A S L      QG+   A   ++  + L P
Sbjct: 209 TAKARA---LLR-------QALALDPANIRALSLLAFAAFEQGDYAEAAAAWQMLLDLLP 258

Query: 696 N 696
            
Sbjct: 259 A 259



 Score = 37.7 bits (88), Expect = 0.020
 Identities = 21/57 (36%), Positives = 30/57 (52%)

Query: 227 IKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD 283
           ++QNP  AE +  LG  Y   G+  +AL  YR+A+RL  D  +  + LA AL     
Sbjct: 149 LQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAG 205



 Score = 33.1 bits (76), Expect = 0.49
 Identities = 23/121 (19%), Positives = 39/121 (32%), Gaps = 14/121 (11%)

Query: 600 QLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIET 659
           +++  +      ++  P   +G+  L  A +A G    A+ AY  AL+            
Sbjct: 137 EMEALIARLETHLQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALR-----------L 185

Query: 660 RPDFAVAWSNLGCVFNAQGE---IWLAIHHFEKAVSLDPNFLDAYINLGNVLKEARIFDR 716
             D       L      Q        A     +A++LDP  + A   L     E   +  
Sbjct: 186 AGDNPEILLGLAEALYYQAGQQMTAKARALLRQALALDPANIRALSLLAFAAFEQGDYAE 245

Query: 717 A 717
           A
Sbjct: 246 A 246



 Score = 30.4 bits (69), Expect = 3.4
 Identities = 12/52 (23%), Positives = 23/52 (44%)

Query: 91  QLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPD 142
           +++  +      ++  P   +G+  L  A +A G    A+ AY  AL+   D
Sbjct: 137 EMEALIARLETHLQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGD 188


>gnl|CDD|225504 COG2956, COG2956, Predicted N-acetylglucosaminyl transferase
           [Carbohydrate transport and metabolism].
          Length = 389

 Score = 38.5 bits (90), Expect = 0.013
 Identities = 39/163 (23%), Positives = 67/163 (41%), Gaps = 14/163 (8%)

Query: 11  LLELAHREYQ-AGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAI 69
           L +L  R+Y  AG  + AE    QL  +     G L  L +I+   R+ +K+   +   +
Sbjct: 110 LQQLG-RDYMAAGLLDRAEDIFNQLVDEGEFAEGALQQLLNIYQATREWEKAIDVAERLV 168

Query: 70  K-----QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAG 124
           K         +A+ Y  L         +  A E  + A++     +   I L    +A G
Sbjct: 169 KLGGQTYRVEIAQFYCELAQQALASSDVDRARELLKKALQADKKCVRASIILGRVELAKG 228

Query: 125 DMEQAVQAYVTALQYNPDLYCVRSDLGNLLK----ALGRLDEA 163
           D ++AV+A    L+ NP+     S++  +L      LG+  E 
Sbjct: 229 DYQKAVEALERVLEQNPEYL---SEVLEMLYECYAQLGKPAEG 268



 Score = 35.8 bits (83), Expect = 0.082
 Identities = 48/219 (21%), Positives = 81/219 (36%), Gaps = 44/219 (20%)

Query: 586 EAYSNLGNVYKERGQLQEALENYRHAVRLKPD--------FIDGYINLAAALVAAGDMEQ 637
            A   L N+Y+   + ++A++     V+L           F   Y  LA   +A+ D+++
Sbjct: 142 GALQQLLNIYQATREWEKAIDVAERLVKLGGQTYRVEIAQF---YCELAQQALASSDVDR 198

Query: 638 AVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSLDPNF 697
           A +    ALQ +  C   +I            LG V  A+G+   A+   E+ +  +P +
Sbjct: 199 ARELLKKALQADKKCVRASII-----------LGRVELAKGDYQKAVEALERVLEQNPEY 247

Query: 698 L--------DAYINLGNVLKEARIFDRANTLFY--VSSYFMITD--EDIDVILERCEAKT 745
           L        + Y  LG    E   F R          +  M+ D  E  + I       T
Sbjct: 248 LSEVLEMLYECYAQLGK-PAEGLNFLRRAMETNTGADAELMLADLIELQEGIDAAQAYLT 306

Query: 746 EELNKKFEQLGESSLRDFTLDAPTQSVYKFEGEDYREKQ 784
            +L +K    G   L D+ L          + E+ R K+
Sbjct: 307 RQLRRKPTMRGFHRLMDYHLA---------DAEEGRAKE 336



 Score = 32.4 bits (74), Expect = 1.0
 Identities = 62/284 (21%), Positives = 98/284 (34%), Gaps = 69/284 (24%)

Query: 27  AERHCMQLWRQETNN------TGVLLLLSSIHFQCRKLDKSA-HFSTLAIKQNPLLAEAY 79
             R   Q  + + N        G+  LLS       + DK+   F  +  +++P   EA+
Sbjct: 20  GRRSAQQDKQDQANRLSRDYVKGLNFLLS------NQPDKAVDLFLEML-QEDPETFEAH 72

Query: 80  SNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQY 139
             LGN+++ RG+                  +D  I +   L+ + D+        T  Q 
Sbjct: 73  LTLGNLFRSRGE------------------VDRAIRIHQTLLESPDL--------TFEQR 106

Query: 140 NPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEAKNLHTENIKPVTM 199
              L      LG    A G LD A+D++       N L   G   E       NI   T 
Sbjct: 107 LLALQ----QLGRDYMAAGLLDRAEDIF-------NQLVDEGEFAEGALQQLLNIYQATR 155

Query: 200 KVQNAIVCNYG----GRKPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALE 255
           + + AI         G +   +E A F              Y  L         +  A E
Sbjct: 156 EWEKAIDVAERLVKLGGQTYRVEIAQF--------------YCELAQQALASSDVDRARE 201

Query: 256 NYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
             + A++     +   I L    +A GD ++AV+A    L+ NP
Sbjct: 202 LLKKALQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQNP 245



 Score = 31.6 bits (72), Expect = 1.9
 Identities = 17/72 (23%), Positives = 29/72 (40%)

Query: 579 KQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQA 638
                +A+ Y  L         +  A E  + A++     +   I L    +A GD ++A
Sbjct: 174 TYRVEIAQFYCELAQQALASSDVDRARELLKKALQADKKCVRASIILGRVELAKGDYQKA 233

Query: 639 VQAYVTALQYNP 650
           V+A    L+ NP
Sbjct: 234 VEALERVLEQNP 245


>gnl|CDD|227310 COG4976, COG4976, Predicted methyltransferase (contains TPR repeat)
           [General function prediction only].
          Length = 287

 Score = 35.9 bits (83), Expect = 0.081
 Identities = 15/48 (31%), Positives = 24/48 (50%)

Query: 95  ALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPD 142
           A E Y  A+ L P++  G+  L      AG+ + A  AY   L+ +P+
Sbjct: 14  AAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDPE 61



 Score = 34.4 bits (79), Expect = 0.23
 Identities = 15/47 (31%), Positives = 23/47 (48%)

Query: 253 ALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
           A E Y  A+ L P++  G+  L      AG+ + A  AY   L+ +P
Sbjct: 14  AAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDP 60



 Score = 34.4 bits (79), Expect = 0.23
 Identities = 15/47 (31%), Positives = 23/47 (48%)

Query: 604 ALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 650
           A E Y  A+ L P++  G+  L      AG+ + A  AY   L+ +P
Sbjct: 14  AAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDP 60



 Score = 29.0 bits (65), Expect = 9.8
 Identities = 21/70 (30%), Positives = 30/70 (42%), Gaps = 11/70 (15%)

Query: 626 AAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIH 685
           A  L  +GD E A + Y            +A+E  P++A  W  LG      GE   A  
Sbjct: 2   AYMLAESGDAEAAAELY-----------NQALELAPEWAAGWFRLGEYTEKAGEFDAAAA 50

Query: 686 HFEKAVSLDP 695
            +E+ + LDP
Sbjct: 51  AYEEVLELDP 60


>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat. 
          Length = 44

 Score = 32.1 bits (73), Expect = 0.085
 Identities = 14/44 (31%), Positives = 17/44 (38%)

Query: 76  AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAA 119
             A   L       G L EAL   R A+ L PD  +  + LA  
Sbjct: 1   PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44



 Score = 32.1 bits (73), Expect = 0.085
 Identities = 14/44 (31%), Positives = 17/44 (38%)

Query: 234 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAA 277
             A   L       G L EAL   R A+ L PD  +  + LA  
Sbjct: 1   PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44



 Score = 32.1 bits (73), Expect = 0.085
 Identities = 14/44 (31%), Positives = 17/44 (38%)

Query: 585 AEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAA 628
             A   L       G L EAL   R A+ L PD  +  + LA  
Sbjct: 1   PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44


>gnl|CDD|221958 pfam13176, TPR_7, Tetratricopeptide repeat. 
          Length = 36

 Score = 30.7 bits (70), Expect = 0.19
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 78  AYSNLGNVYKERGQLQEALENYRHAVRLKPD 108
           A SNLG +Y++ G  ++A+  Y  A+ L  D
Sbjct: 1   ALSNLGRLYRKLGDYEKAISLYERALALAKD 31



 Score = 30.7 bits (70), Expect = 0.19
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 236 AYSNLGNVYKERGQLQEALENYRHAVRLKPD 266
           A SNLG +Y++ G  ++A+  Y  A+ L  D
Sbjct: 1   ALSNLGRLYRKLGDYEKAISLYERALALAKD 31



 Score = 30.7 bits (70), Expect = 0.19
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 587 AYSNLGNVYKERGQLQEALENYRHAVRLKPD 617
           A SNLG +Y++ G  ++A+  Y  A+ L  D
Sbjct: 1   ALSNLGRLYRKLGDYEKAISLYERALALAKD 31



 Score = 26.4 bits (59), Expect = 7.1
 Identities = 9/26 (34%), Positives = 11/26 (42%)

Query: 113 YINLAAALVAAGDMEQAVQAYVTALQ 138
             NL       GD E+A+  Y  AL 
Sbjct: 2   LSNLGRLYRKLGDYEKAISLYERALA 27



 Score = 26.4 bits (59), Expect = 7.1
 Identities = 9/26 (34%), Positives = 11/26 (42%)

Query: 271 YINLAAALVAAGDMEQAVQAYVTALQ 296
             NL       GD E+A+  Y  AL 
Sbjct: 2   LSNLGRLYRKLGDYEKAISLYERALA 27



 Score = 26.4 bits (59), Expect = 7.1
 Identities = 9/26 (34%), Positives = 11/26 (42%)

Query: 622 YINLAAALVAAGDMEQAVQAYVTALQ 647
             NL       GD E+A+  Y  AL 
Sbjct: 2   LSNLGRLYRKLGDYEKAISLYERALA 27


>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
           [Coenzyme metabolism].
          Length = 400

 Score = 34.7 bits (80), Expect = 0.19
 Identities = 32/124 (25%), Positives = 52/124 (41%), Gaps = 4/124 (3%)

Query: 14  LAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIKQNP 73
            A R  + GD++ A+       +++ +     L+          L K+A      +KQ+P
Sbjct: 269 YAERLIRLGDHDEAQEIIEDALKRQWDPRLCRLIPRLRPGDPEPLIKAAEK---WLKQHP 325

Query: 74  LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAY 133
                 S LG +  +     +A E    A++L+P   D Y  LA AL   G+ E+A Q  
Sbjct: 326 EDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSASD-YAELADALDQLGEPEEAEQVR 384

Query: 134 VTAL 137
             AL
Sbjct: 385 REAL 388



 Score = 34.7 bits (80), Expect = 0.21
 Identities = 63/296 (21%), Positives = 98/296 (33%), Gaps = 35/296 (11%)

Query: 9   IGLLELAHREYQAGDYESAERHCMQLWRQETN-NTGVLLLLSSIHFQCRKLDKSAHFSTL 67
           +  L  A    Q GD + A R+  +      +    V L  + +    R    +      
Sbjct: 119 LAYLLAAEAAQQRGDEDRANRYLAEAAELAGDDTLAVELTRARLLLNRRDYPAARENVDQ 178

Query: 68  AIKQNPLLAEAYSNLGNVYKERGQLQEA---LENYRHAVRLKPDFIDGYINLAAALVAAG 124
            ++  P   E        Y   G  Q     L   R A  L                 A 
Sbjct: 179 LLEMTPRHPEVLRLALRAYIRLGAWQALLAILPKLRKAGLLSD-------------EEAA 225

Query: 125 DMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRL-DEAKDLYCVRSDLGNLLKALGRL 183
            +EQ  QA+   LQ   D     +    L         + ++   +       L  LG  
Sbjct: 226 RLEQ--QAWEGLLQQARD----DNGSEGLKTWWKNQPRKLRNDPELVVAYAERLIRLGDH 279

Query: 184 DEAKNLHTENIKPVTMKVQNAIVCNYGGR----KPTTLESAHFSTLAIKQNPLLAEAYSN 239
           DEA+ +  + +K    +  +  +C    R     P  L  A  +   +KQ+P      S 
Sbjct: 280 DEAQEIIEDALK----RQWDPRLCRLIPRLRPGDPEPLIKA--AEKWLKQHPEDPLLLST 333

Query: 240 LGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTAL 295
           LG +  +     +A E    A++L+P   D Y  LA AL   G+ E+A Q    AL
Sbjct: 334 LGRLALKNKLWGKASEALEAALKLRPSASD-YAELADALDQLGEPEEAEQVRREAL 388



 Score = 32.3 bits (74), Expect = 1.2
 Identities = 26/82 (31%), Positives = 37/82 (45%), Gaps = 4/82 (4%)

Query: 565 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 624
             L K+A      +KQ+P      S LG +  +     +A E    A++L+P   D Y  
Sbjct: 311 EPLIKAAEK---WLKQHPEDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSASD-YAE 366

Query: 625 LAAALVAAGDMEQAVQAYVTAL 646
           LA AL   G+ E+A Q    AL
Sbjct: 367 LADALDQLGEPEEAEQVRREAL 388



 Score = 30.0 bits (68), Expect = 5.8
 Identities = 60/277 (21%), Positives = 90/277 (32%), Gaps = 66/277 (23%)

Query: 11  LLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQ-------CRKLDKSAH 63
            L  A       DY +A  +  QL      +  VL L    + +          L K   
Sbjct: 156 ELTRARLLLNRRDYPAARENVDQLLEMTPRHPEVLRLALRAYIRLGAWQALLAILPKLR- 214

Query: 64  FSTLAIKQNPLLAEAYSNLGNVYKERGQLQEA-----LENYRHAVRLKPDFIDGYINLAA 118
                 K   L  E  + L       G LQ+A      E  +   + +P  +     L  
Sbjct: 215 ------KAGLLSDEEAARLEQ-QAWEGLLQQARDDNGSEGLKTWWKNQPRKLRNDPELVV 267

Query: 119 ALVA----AGDMEQAVQAYVTAL--QYNPDLY----CVR-SDLGNLLKAL-GRLDEAKDL 166
           A        GD ++A +    AL  Q++P L      +R  D   L+KA    L +  + 
Sbjct: 268 AYAERLIRLGDHDEAQEIIEDALKRQWDPRLCRLIPRLRPGDPEPLIKAAEKWLKQHPED 327

Query: 167 YCVRSDLGNLLKALGRLDEAKNLHTENIKPVTMKVQNAIVCNYGGRKPTTLESAHFSTLA 226
                    LL  LGRL     L                     G+    LE+      A
Sbjct: 328 PL-------LLSTLGRLALKNKLW--------------------GKASEALEA------A 354

Query: 227 IKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRL 263
           +K  P  A  Y+ L +   + G+ +EA +  R A+ L
Sbjct: 355 LKLRP-SASDYAELADALDQLGEPEEAEQVRREALLL 390


>gnl|CDD|236323 PRK08651, PRK08651, succinyl-diaminopimelate desuccinylase;
           Reviewed.
          Length = 394

 Score = 34.2 bits (79), Expect = 0.26
 Identities = 25/94 (26%), Positives = 43/94 (45%), Gaps = 7/94 (7%)

Query: 464 PGGKT-VIPGKVVGKFSI--RIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSG 520
            G KT ++PG     FSI  R++P +T + V   +   L+E+  A     + +  +    
Sbjct: 250 GGTKTNIVPGYC--AFSIDRRLIPEETAEEVRDELEALLDEV--APELGIEVEFEITPFS 305

Query: 521 KSWRTNPEHPNYVAAARATKYVYNVEPDLTREGG 554
           +++ T+P+     A   A + V  VEP  T   G
Sbjct: 306 EAFVTDPDSELVKALREAIREVLGVEPKKTISLG 339


>gnl|CDD|215573 PLN03098, LPA1, LOW PSII ACCUMULATION1; Provisional.
          Length = 453

 Score = 34.5 bits (79), Expect = 0.28
 Identities = 26/89 (29%), Positives = 38/89 (42%), Gaps = 10/89 (11%)

Query: 582 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID---GYINLAAALVAAGDMEQA 638
              AE   NLG     +G++++AL  +  A+ L P+  +    Y N A       + ++A
Sbjct: 72  VKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGKKA 131

Query: 639 VQAYVTAL-QYNPSCYLK--AIETRPDFA 664
                TAL  YN    LK   I   PD A
Sbjct: 132 ADCLRTALRDYN----LKFSTILNDPDLA 156



 Score = 34.1 bits (78), Expect = 0.35
 Identities = 24/89 (26%), Positives = 37/89 (41%), Gaps = 13/89 (14%)

Query: 73  PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID---GYINLAAALVAAGDMEQA 129
              AE   NLG     +G++++AL  +  A+ L P+  +    Y N A       + ++A
Sbjct: 72  VKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGKKA 131

Query: 130 VQAYVTAL-QYN---------PDLYCVRS 148
                TAL  YN         PDL   R+
Sbjct: 132 ADCLRTALRDYNLKFSTILNDPDLAPFRA 160



 Score = 34.1 bits (78), Expect = 0.36
 Identities = 20/72 (27%), Positives = 32/72 (44%), Gaps = 4/72 (5%)

Query: 231 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID---GYINLAAALVAAGDMEQA 287
              AE   NLG     +G++++AL  +  A+ L P+  +    Y N A       + ++A
Sbjct: 72  VKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGKKA 131

Query: 288 VQAYVTAL-QYN 298
                TAL  YN
Sbjct: 132 ADCLRTALRDYN 143



 Score = 31.8 bits (72), Expect = 1.9
 Identities = 34/123 (27%), Positives = 48/123 (39%), Gaps = 16/123 (13%)

Query: 669 NLGCVFNAQGEIWLAIHHFEKAVSLDPNFLDAYINLGNVL----------KEARIFDRAN 718
           NLG    ++G +  A+  FE A+ L+PN  +A     N            K A     A 
Sbjct: 80  NLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGKKAADCLRTAL 139

Query: 719 TLFYVSSYFMITDEDIDVILERCEAKT--EELNKKFEQLGESSLRDFTL----DAPTQSV 772
             + +    ++ D D+       E K   EE  K  E +G S  RD  L     AP + V
Sbjct: 140 RDYNLKFSTILNDPDLAPFRASPEFKELQEEARKGGEDIGSSFRRDLKLISEVQAPFRGV 199

Query: 773 YKF 775
            KF
Sbjct: 200 RKF 202


>gnl|CDD|226428 COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase,
           SPINDLY family [Posttranslational modification, protein
           turnover, chaperones].
          Length = 620

 Score = 34.4 bits (79), Expect = 0.32
 Identities = 32/160 (20%), Positives = 50/160 (31%), Gaps = 11/160 (6%)

Query: 2   NDIAQ-LGIGLLELAHREYQAGDYESAERHCMQLWRQ-ETNNTGVLL--LLSSIHFQCRK 57
           N +A  L +  L +      A   ++   + + L       N  +LL   LS +      
Sbjct: 23  NALAGHLSLLDLAIVLEWLNAEGLQALAIYALLLGIAINDVNPELLLAAFLSILLAPLAD 82

Query: 58  LDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRH-AVRLKPDFIDG---- 112
              +     + +  NP    A  NL    +  G    AL +    A  L PD  +     
Sbjct: 83  STLAFLAKRIPLSVNPENCPAVQNLAAALELDGLQFLALADISEIAEWLSPDNAEFLGHL 142

Query: 113 --YINLAAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDL 150
             +  L   L   G   +A  A   A+   P    V   L
Sbjct: 143 IRFYQLGRYLKLLGRTAEAELALERAVDLLPKYPRVLGAL 182



 Score = 34.0 bits (78), Expect = 0.45
 Identities = 26/115 (22%), Positives = 33/115 (28%), Gaps = 18/115 (15%)

Query: 567 LDKSAHFSTLA-------IKQNPLLAEAYSNLGNVYKERGQLQEALENYRH-AVRLKPDF 618
           L      STLA       +  NP    A  NL    +  G    AL +    A  L PD 
Sbjct: 76  LLAPLADSTLAFLAKRIPLSVNPENCPAVQNLAAALELDGLQFLALADISEIAEWLSPDN 135

Query: 619 IDG------YINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAW 667
            +       +  L   L   G   +A  A   A+   P    K          A 
Sbjct: 136 AEFLGHLIRFYQLGRYLKLLGRTAEAELALERAVDLLP----KYPRVLGALMTAR 186



 Score = 31.3 bits (71), Expect = 2.7
 Identities = 19/82 (23%), Positives = 27/82 (32%), Gaps = 7/82 (8%)

Query: 225 LAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRH-AVRLKPDFIDG------YINLAAA 277
           + +  NP    A  NL    +  G    AL +    A  L PD  +       +  L   
Sbjct: 92  IPLSVNPENCPAVQNLAAALELDGLQFLALADISEIAEWLSPDNAEFLGHLIRFYQLGRY 151

Query: 278 LVAAGDMEQAVQAYVTALQYNP 299
           L   G   +A  A   A+   P
Sbjct: 152 LKLLGRTAEAELALERAVDLLP 173


>gnl|CDD|225660 COG3118, COG3118, Thioredoxin domain-containing protein
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 304

 Score = 33.9 bits (78), Expect = 0.33
 Identities = 14/51 (27%), Positives = 22/51 (43%)

Query: 117 AAALVAAGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLKALGRLDEAKDLY 167
           A  L+ A D  +A      ALQ  P+    +  L   L A G ++ A+ + 
Sbjct: 141 AKELIEAEDFGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDVEAAQAIL 191



 Score = 32.7 bits (75), Expect = 0.71
 Identities = 25/110 (22%), Positives = 39/110 (35%), Gaps = 11/110 (10%)

Query: 88  ERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT-----------A 136
           E     EA    + A++  P+  +  + LA  L+AAGD+E A                  
Sbjct: 146 EAEDFGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDVEAAQAILAALPLQAQDKAAHG 205

Query: 137 LQYNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRLDEA 186
           LQ   +L    +    +     RL    D       L + L  +GR + A
Sbjct: 206 LQAQIELLEQAAATPEIQDLQRRLAADPDDVEAALALADQLHLVGRNEAA 255


>gnl|CDD|188247 TIGR02795, tol_pal_ybgF, tol-pal system protein YbgF.  Members of
           this protein family are the product of one of seven
           genes regularly clustered in operons to encode the
           proteins of the tol-pal system, which is critical for
           maintaining the integrity of the bacterial outer
           membrane. The gene for this periplasmic protein has been
           designated orf2 and ybgF. All members of the seed
           alignment were from unique tol-pal gene regions from
           completed bacterial genomes. The architecture of this
           protein is a signal sequence, a low-complexity region
           usually rich in Asn and Gln, a well-conserved region
           with tandem repeats that resemble the tetratricopeptide
           (TPR) repeat, involved in protein-protein interaction.
          Length = 117

 Score = 32.3 bits (74), Expect = 0.34
 Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 6/71 (8%)

Query: 69  IKQNP---LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFI---DGYINLAAALVA 122
           +K+ P       A+  LG  Y  +G   +A + +   V+  P      D  + L  +L  
Sbjct: 27  LKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQE 86

Query: 123 AGDMEQAVQAY 133
            GD E+A    
Sbjct: 87  LGDKEKAKATL 97



 Score = 32.3 bits (74), Expect = 0.34
 Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 6/71 (8%)

Query: 227 IKQNP---LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFI---DGYINLAAALVA 280
           +K+ P       A+  LG  Y  +G   +A + +   V+  P      D  + L  +L  
Sbjct: 27  LKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQE 86

Query: 281 AGDMEQAVQAY 291
            GD E+A    
Sbjct: 87  LGDKEKAKATL 97



 Score = 32.3 bits (74), Expect = 0.34
 Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 6/71 (8%)

Query: 578 IKQNP---LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFI---DGYINLAAALVA 631
           +K+ P       A+  LG  Y  +G   +A + +   V+  P      D  + L  +L  
Sbjct: 27  LKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQE 86

Query: 632 AGDMEQAVQAY 642
            GD E+A    
Sbjct: 87  LGDKEKAKATL 97


>gnl|CDD|182415 PRK10370, PRK10370, formate-dependent nitrite reductase complex
           subunit NrfG; Provisional.
          Length = 198

 Score = 33.0 bits (75), Expect = 0.49
 Identities = 18/53 (33%), Positives = 27/53 (50%)

Query: 68  AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAAL 120
            I+ NP  +E ++ LG  Y  R     AL  YR A++L+ +  + Y  LA  L
Sbjct: 65  KIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALATVL 117



 Score = 33.0 bits (75), Expect = 0.49
 Identities = 18/53 (33%), Positives = 27/53 (50%)

Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAAL 278
            I+ NP  +E ++ LG  Y  R     AL  YR A++L+ +  + Y  LA  L
Sbjct: 65  KIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALATVL 117



 Score = 33.0 bits (75), Expect = 0.49
 Identities = 18/53 (33%), Positives = 27/53 (50%)

Query: 577 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAAL 629
            I+ NP  +E ++ LG  Y  R     AL  YR A++L+ +  + Y  LA  L
Sbjct: 65  KIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALATVL 117


>gnl|CDD|233013 TIGR00540, TPR_hemY_coli, heme biosynthesis-associated TPR protein.
            Members of this protein family are uncharacterized
           tetratricopeptide repeat (TPR) proteins invariably found
           in heme biosynthesis gene clusters. The absence of any
           invariant residues other than Ala argues against this
           protein serving as an enzyme per se. The gene symbol
           hemY assigned in E. coli is unfortunate in that an
           unrelated protein, protoporphyrinogen oxidase (HemG in
           E. coli) is designated HemY in Bacillus subtilis
           [Unknown function, General].
          Length = 367

 Score = 33.4 bits (77), Expect = 0.57
 Identities = 44/180 (24%), Positives = 69/180 (38%), Gaps = 24/180 (13%)

Query: 20  QAGDYESAERHCMQLWRQETNNTGVLLLLSSIHFQ----------CRKLDKSAHFSTLAI 69
              DYE+A      L  Q   +T VL L    + +             L K+   S    
Sbjct: 146 DQRDYEAALAALDSLQAQAPRHTAVLRLALRAYQRSGNWDALLKLLPALRKAKALSPEEA 205

Query: 70  KQNPLLAEAYSNLGNVYKERGQ--LQEALENYRHAVRLKPDFIDGYINLAAALVAAGDME 127
            +  L  +AY  L +  +E     L+   +    A R +P+        A AL+  GD +
Sbjct: 206 AR--LEQQAYIGLLDEAREEDADALKTWWKQLPRAERQEPELAVAA---AEALIQLGDHD 260

Query: 128 QAVQAYVTAL--QYNPDLY--CVRSDLGNLLKALGRLDEAKDLYCVRSDLGNLLKALGRL 183
           +A +    AL  +++P+L     R   G+    + R ++   L     D   LL ALGRL
Sbjct: 261 EAEKLIEEALKKEWDPELLRLYGRLQPGDPSPLIKRAEKW--LKKHPDD-ALLLLALGRL 317


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 33.4 bits (77), Expect = 0.60
 Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 4/78 (5%)

Query: 56  RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 115
              DK+  +   AI+  P L E Y     + K  G LQ+A E    A  L  D  D YIN
Sbjct: 207 GSRDKALEYIDKAIEHTPTLVELYMTKARILKHAGDLQKAAEAMEEARSL--DLADRYIN 264

Query: 116 LAAA--LVAAGDMEQAVQ 131
              A  ++ A + E+A +
Sbjct: 265 SKCAKYMLRANENEEAEK 282



 Score = 33.4 bits (77), Expect = 0.60
 Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 4/78 (5%)

Query: 565 RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 624
              DK+  +   AI+  P L E Y     + K  G LQ+A E    A  L  D  D YIN
Sbjct: 207 GSRDKALEYIDKAIEHTPTLVELYMTKARILKHAGDLQKAAEAMEEARSL--DLADRYIN 264

Query: 625 LAAA--LVAAGDMEQAVQ 640
              A  ++ A + E+A +
Sbjct: 265 SKCAKYMLRANENEEAEK 282



 Score = 30.7 bits (70), Expect = 4.5
 Identities = 23/66 (34%), Positives = 31/66 (46%), Gaps = 4/66 (6%)

Query: 226 AIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAA--LVAAGD 283
           AI+  P L E Y     + K  G LQ+A E    A  L  D  D YIN   A  ++ A +
Sbjct: 219 AIEHTPTLVELYMTKARILKHAGDLQKAAEAMEEARSL--DLADRYINSKCAKYMLRANE 276

Query: 284 MEQAVQ 289
            E+A +
Sbjct: 277 NEEAEK 282


>gnl|CDD|185096 PRK15174, PRK15174, Vi polysaccharide export protein VexE;
           Provisional.
          Length = 656

 Score = 33.2 bits (75), Expect = 0.74
 Identities = 27/137 (19%), Positives = 47/137 (34%), Gaps = 9/137 (6%)

Query: 56  RKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYIN 115
            KL  + H+   A++ N       +   +     GQ ++A+   + ++   PD       
Sbjct: 265 AKLQAAEHWRH-ALQFNSDNVRIVTLYADALIRTGQNEKAIPLLQQSLATHPDLPYVRAM 323

Query: 116 LAAALVAAGDMEQAVQAYVTALQYNPD-LYCVRSDLGNLLKALGRLDEAKDL---YCVRS 171
            A AL   G    A   +V   +         R     LL+A G+  EA+ +   Y    
Sbjct: 324 YARALRQVGQYTAASDEFVQLAREKGVTSKWNRYAAAALLQA-GKTSEAESVFEHYIQAR 382

Query: 172 DL---GNLLKALGRLDE 185
                 +  + L  LD 
Sbjct: 383 ASHLPQSFEEGLLALDG 399


>gnl|CDD|198111 smart01043, BTAD, Bacterial transcriptional activator domain.
           Found in the DNRI/REDD/AFSR family of regulators. This
           region of AFSR along with the C terminal region is
           capable of independently directing actinorhodin
           production. This family contains TPR repeats.
          Length = 145

 Score = 31.5 bits (72), Expect = 0.90
 Identities = 22/75 (29%), Positives = 30/75 (40%), Gaps = 3/75 (4%)

Query: 232 LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAY 291
           L  EA   L       G+ +EAL      + L P     +  L  AL  AG   +A++AY
Sbjct: 59  LRLEALEALAEALLALGRHEEALALLERLLALDPLRERLHRLLMRALYRAGRRAEALRAY 118

Query: 292 VTALQYNPLLKKEDG 306
               +   LL  E G
Sbjct: 119 RRLRR---LLADELG 130



 Score = 30.3 bits (69), Expect = 2.3
 Identities = 18/60 (30%), Positives = 25/60 (41%)

Query: 74  LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAY 133
           L  EA   L       G+ +EAL      + L P     +  L  AL  AG   +A++AY
Sbjct: 59  LRLEALEALAEALLALGRHEEALALLERLLALDPLRERLHRLLMRALYRAGRRAEALRAY 118



 Score = 30.3 bits (69), Expect = 2.3
 Identities = 18/60 (30%), Positives = 25/60 (41%)

Query: 583 LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAY 642
           L  EA   L       G+ +EAL      + L P     +  L  AL  AG   +A++AY
Sbjct: 59  LRLEALEALAEALLALGRHEEALALLERLLALDPLRERLHRLLMRALYRAGRRAEALRAY 118



 Score = 29.2 bits (66), Expect = 5.4
 Identities = 25/94 (26%), Positives = 34/94 (36%), Gaps = 21/94 (22%)

Query: 92  LQEALENYR---------------HAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTA 136
           L+ AL  YR                  RL+   ++    LA AL+A G  E+A+      
Sbjct: 28  LEAALALYRGPLLADVPDEDWAEAERERLRELRLEALEALAEALLALGRHEEALALLERL 87

Query: 137 LQYNP---DLYCVRSDLGNLLKALGRLDEAKDLY 167
           L  +P    L+     L   L   GR  EA   Y
Sbjct: 88  LALDPLRERLHR---LLMRALYRAGRRAEALRAY 118


>gnl|CDD|182695 PRK10747, PRK10747, putative protoheme IX biogenesis protein;
           Provisional.
          Length = 398

 Score = 32.8 bits (75), Expect = 0.91
 Identities = 29/81 (35%), Positives = 39/81 (48%), Gaps = 11/81 (13%)

Query: 69  IKQ---NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD 125
           IKQ    PLL   +S LG +  + G+ QEA   +R A++ +PD  D Y  LA AL     
Sbjct: 321 IKQHGDTPLL---WSTLGQLLMKHGEWQEASLAFRAALKQRPDAYD-YAWLADALDRLHK 376

Query: 126 MEQAVQ----AYVTALQYNPD 142
            E+A        +  LQ NP 
Sbjct: 377 PEEAAAMRRDGLMLTLQNNPP 397



 Score = 32.0 bits (73), Expect = 1.7
 Identities = 23/55 (41%), Positives = 31/55 (56%), Gaps = 7/55 (12%)

Query: 227 IKQ---NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAAL 278
           IKQ    PLL   +S LG +  + G+ QEA   +R A++ +PD  D Y  LA AL
Sbjct: 321 IKQHGDTPLL---WSTLGQLLMKHGEWQEASLAFRAALKQRPDAYD-YAWLADAL 371



 Score = 32.0 bits (73), Expect = 1.7
 Identities = 23/55 (41%), Positives = 31/55 (56%), Gaps = 7/55 (12%)

Query: 578 IKQ---NPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAAL 629
           IKQ    PLL   +S LG +  + G+ QEA   +R A++ +PD  D Y  LA AL
Sbjct: 321 IKQHGDTPLL---WSTLGQLLMKHGEWQEASLAFRAALKQRPDAYD-YAWLADAL 371


>gnl|CDD|236644 PRK10049, pgaA, outer membrane protein PgaA; Provisional.
          Length = 765

 Score = 32.8 bits (75), Expect = 0.93
 Identities = 26/142 (18%), Positives = 47/142 (33%), Gaps = 18/142 (12%)

Query: 582 PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQA 641
            L A  Y+ +   Y+   Q Q +L  ++ A+ L+P   D    L   L  AG  ++A+  
Sbjct: 46  QLPARGYAAVAVAYRNLKQWQNSLTLWQKALSLEPQNDDYQRGLILTLADAGQYDEALVK 105

Query: 642 YVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFNAQGEIWLAIHHFEKAVSL---DPNFL 698
                        K + +          L  V+   G  W  +    +A+        + 
Sbjct: 106 ------------AKQLVSGAPDKANLLALAYVYKRAGRHWDELRAMTQALPRAPQTQQYP 153

Query: 699 DAYINLGNVLKEARIFDRANTL 720
             Y+     L+  R+   A   
Sbjct: 154 TEYV---QALRNNRLSAPALGA 172



 Score = 32.0 bits (73), Expect = 1.6
 Identities = 34/162 (20%), Positives = 53/162 (32%), Gaps = 42/162 (25%)

Query: 73  PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID---GYI-------------NL 116
            L A  Y+ +   Y+   Q Q +L  ++ A+ L+P   D   G I               
Sbjct: 46  QLPARGYAAVAVAYRNLKQWQNSLTLWQKALSLEPQNDDYQRGLILTLADAGQYDEALVK 105

Query: 117 AAALVA-----------------AGDMEQAVQAYVTALQYNPDLYCVRSDLGNLLK---- 155
           A  LV+                 AG     ++A   AL   P      ++    L+    
Sbjct: 106 AKQLVSGAPDKANLLALAYVYKRAGRHWDELRAMTQALPRAPQTQQYPTEYVQALRNNRL 165

Query: 156 ---ALGRLDEAKDLYCVRSDL-GNLLKALGRLDEAKNLHTEN 193
              ALG +D+A      + DL  +    L RL       +E 
Sbjct: 166 SAPALGAIDDANLTPAEKRDLEADAAAELVRLSFMPT-RSEK 206



 Score = 30.1 bits (68), Expect = 6.2
 Identities = 16/52 (30%), Positives = 21/52 (40%), Gaps = 1/52 (1%)

Query: 248 GQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNP 299
           GQ  EAL   +  V   PD  +  + LA     AG     ++A   AL   P
Sbjct: 97  GQYDEALVKAKQLVSGAPDKAN-LLALAYVYKRAGRHWDELRAMTQALPRAP 147


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 31.6 bits (72), Expect = 1.2
 Identities = 26/113 (23%), Positives = 39/113 (34%), Gaps = 27/113 (23%)

Query: 796 PKRERKANYAVDAYFKEALRTSE----PKAPKAPRPPKQPIVQ----DFQFFPPRLFEIL 847
           P  +                 SE    P        P+ P+ Q      +FFP       
Sbjct: 63  PSEKGGKYLLAALTIDRFYHISESVKIPAIYDKVVVPRPPVPQPKGLKMRFFP------- 115

Query: 848 DQEIYYFRKTVGYKVPKNP--ELGSDATKAQKEEQKKIDESEPLTEEELAEKE 898
                      GY  P  P  ELGS++  ++KE   K+++   + EEE  EK+
Sbjct: 116 ----------TGYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKK 158


>gnl|CDD|164812 PHA00450, PHA00450, host dGTPase inhibitor.
          Length = 85

 Score = 29.4 bits (66), Expect = 1.9
 Identities = 16/44 (36%), Positives = 22/44 (50%), Gaps = 6/44 (13%)

Query: 712 RIFDRANTLFYVSSYFMITDEDIDVILERCEAKTEELNKKFEQL 755
           R+ DR+  L   S  F   DED+         +TE LNK ++QL
Sbjct: 44  RVEDRSGHLI-ASRTFEHHDEDV-----LYNMQTEWLNKMYDQL 81


>gnl|CDD|216562 pfam01546, Peptidase_M20, Peptidase family M20/M25/M40.  This
           family includes a range of zinc metallopeptidases
           belonging to several families in the peptidase
           classification. Family M20 are Glutamate
           carboxypeptidases. Peptidase family M25 contains X-His
           dipeptidases.
          Length = 310

 Score = 31.4 bits (71), Expect = 2.0
 Identities = 15/108 (13%), Positives = 28/108 (25%), Gaps = 6/108 (5%)

Query: 456 GIEGAFSGPGGKTVIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAY 515
           GI     G G    +  +       R   ++  + + +   + +     A          
Sbjct: 164 GIGTVGGGGGSNNNVIPEAAFLRGRRRTLDEELRALVEEEEEAIAAGAAAAG----VVEE 219

Query: 516 LLDSGKSWRTNPEHPNYVAAAR--ATKYVYNVEPDLTREGGSIPITLT 561
             D    +      P  VAA    A +     EP+ +  G        
Sbjct: 220 EEDYRPPYPVTVNDPALVAALEEAAKELGLGPEPEPSGGGEDAAFFAE 267


>gnl|CDD|233446 TIGR01513, NAPRTase_put, putative nicotinate
           phosphoribosyltransferase.  A deep split separates two
           related families of proteins, one of which includes
           experimentally characterized examples of nicotinate
           phosphoribosyltransferase, an the first enzyme of NAD
           salvage biosynthesis. This model represents the other
           family. Members have a different (longer) spacing of
           several key motifs and have an additional C-terminal
           domain of up to 100 residues. One argument suggesting
           that this family represents the same enzyme is that no
           species has a member of both families. Another is that
           the gene encoding this protein is located near other NAD
           salvage biosynthesis genes in Nostoc and in at least
           four different Gram-positive bacteria. NAD and NADP are
           ubiquitous in life. Most members of this family are
           Gram-positive bacteria. An additional set of mutually
           closely related archaeal sequences score between the
           trusted and noise cutoffs [Biosynthesis of cofactors,
           prosthetic groups, and carriers, Pyridine nucleotides].
          Length = 443

 Score = 31.6 bits (72), Expect = 2.2
 Identities = 23/81 (28%), Positives = 34/81 (41%), Gaps = 10/81 (12%)

Query: 116 LAAALVAA-GDMEQAVQAYVTALQYNP----DLY-CVRSDLGNLLKALGRLDEAKDLYCV 169
           +A + V +  +   A +AY            D Y  +RS L N +     L E   +  V
Sbjct: 193 MAHSFVMSFDNELAAFRAYAKLYPKATVLLVDTYDTLRSGLPNAIAVAKELGEQGKVVGV 252

Query: 170 RSDLGNLL----KALGRLDEA 186
           R D G+LL    +A  +LD A
Sbjct: 253 RIDSGDLLYLSKQARKQLDAA 273


>gnl|CDD|193570 cd08659, M20_ArgE_DapE_like, Peptidase M20 acetylornithine
           deacetylase/succinyl-diaminopimelate desuccinylase
           (ArgE/DapE)-like.  Peptidase M20 acetylornithine
           deacetylase/succinyl-diaminopimelate desuccinylase
           (ArgE/DapE) like family of enzymes catalyze analogous
           reactions and share a common activator, the metal ion
           (usually Co2+ or Zn2+). ArgE catalyzes a broad range of
           substrates, including N-acetylornithine,
           alpha-N-acetylmethionine and alpha-N-formylmethionine,
           while DapE catalyzes the hydrolysis of
           N-succinyl-L,L-diaminopimelate (L,L-SDAP) to
           L,L-diaminopimelate and succinate. Proteins in this
           family are mostly bacterial and have been inferred by
           homology as being related to both, ArgE and DapE. This
           family also includes N-acetyl-L-citrulline deacetylase
           (ACDase; acetylcitrulline deacetylase), a unique, novel
           enzyme found in Xanthomonas campestris, a plant
           pathogen, in which N-acetyl-L-ornithine is the substrate
           for transcarbamoylation reaction, and the product is
           N-acetyl-L-citrulline. Thus, in the arginine
           biosynthesis pathway, ACDase subsequently catalyzes the
           hydrolysis of N-acetyl-L-citrulline to acetate and
           L-citrulline.
          Length = 365

 Score = 31.0 bits (71), Expect = 2.6
 Identities = 28/108 (25%), Positives = 43/108 (39%), Gaps = 12/108 (11%)

Query: 450 PSLSLHGIEGAFSGPGGKT-VIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNS 508
           P+L++  I+G     G +  V+P +   +  IR+VP + P      VL  L  L      
Sbjct: 221 PTLNVGVIKG-----GEQVNVVPDEATLELDIRLVPGEDPD----EVLAELEALLAQVPP 271

Query: 509 PNKFKAYLLDSGKSWRTNPEHPNYV-AAARATKYVYNVEPDLTREGGS 555
           P   +  L        T+P+ P  V A A A   V    P +    G+
Sbjct: 272 PADVEVDLSVPPPPVVTDPDSP-LVQALAAAIADVTGRPPKVRGVPGA 318


>gnl|CDD|225523 COG2976, COG2976, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 207

 Score = 30.5 bits (69), Expect = 3.0
 Identities = 26/103 (25%), Positives = 40/103 (38%), Gaps = 6/103 (5%)

Query: 10  GLLELAHREYQAGDYESAERHCMQLWRQETNNTGVLLL---LSSIHFQCRKLDKSAHFST 66
             LELA  E +A + + AE    Q   Q  +     L    L+ +  Q +K D +    T
Sbjct: 91  AALELAKAEVEANNLDKAEAQLKQALAQTKDENLKALAALRLARVQLQQKKADAA--LKT 148

Query: 67  LAIKQNPLLAEAYSNL-GNVYKERGQLQEALENYRHAVRLKPD 108
           L   +    A   + L G++   +G  QEA   Y  A+     
Sbjct: 149 LDTIKEESWAAIVAELRGDILLAKGDKQEARAAYEKALESDAS 191


>gnl|CDD|222667 pfam14308, DnaJ-X, X-domain of DnaJ-containing.  IN certain plant
           and yeast proteins, the DnaJ-1 proteins have a
           three-domain structure. The x-domain lies between the
           N-terminal DnaJ and the C-terminal Z domains. The exact
           function is not known.
          Length = 204

 Score = 30.6 bits (70), Expect = 3.0
 Identities = 15/34 (44%), Positives = 20/34 (58%)

Query: 869 GSDATKAQKEEQKKIDESEPLTEEELAEKEELLT 902
             DA    +E +K  ++   LTEEELAE EE +T
Sbjct: 98  ALDAQSTMEEMKKLEEKGGELTEEELAEMEEKVT 131


>gnl|CDD|226414 COG3898, COG3898, Uncharacterized membrane-bound protein [Function
           unknown].
          Length = 531

 Score = 31.0 bits (70), Expect = 3.1
 Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 2/80 (2%)

Query: 95  ALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTA--LQYNPDLYCVRSDLGN 152
           A ++   A +L PD +   +  A AL   G++ +  +   TA   + +PD+  +     +
Sbjct: 248 ARDDALEANKLAPDLVPAAVVAARALFRDGNLRKGSKILETAWKAEPHPDIALLYVRARS 307

Query: 153 LLKALGRLDEAKDLYCVRSD 172
              AL RL  AK L  ++ +
Sbjct: 308 GDTALDRLKRAKKLESLKPN 327


>gnl|CDD|215341 PLN02635, PLN02635, disproportionating enzyme.
          Length = 538

 Score = 30.9 bits (70), Expect = 3.2
 Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 9/71 (12%)

Query: 213 KPTTLESAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYI 272
           +P  +    FS +A  ++PL+A+A   L       G+L+E LE++R      P+ I  ++
Sbjct: 117 EPVPVGKVDFSAVAELKDPLIAKAAERL---LLSDGELKEELEDFRK----DPE-ISSWL 168

Query: 273 NLAAALVAAGD 283
              AAL AA D
Sbjct: 169 E-DAALFAAID 178


>gnl|CDD|224643 COG1729, COG1729, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 262

 Score = 30.5 bits (69), Expect = 3.2
 Identities = 13/62 (20%), Positives = 24/62 (38%), Gaps = 3/62 (4%)

Query: 71  QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD---FIDGYINLAAALVAAGDME 127
            +     AY  LG     +G  ++A   +   V+  P      D  + L  +L   G+ +
Sbjct: 173 NSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGNTD 232

Query: 128 QA 129
           +A
Sbjct: 233 EA 234



 Score = 30.5 bits (69), Expect = 3.2
 Identities = 13/62 (20%), Positives = 24/62 (38%), Gaps = 3/62 (4%)

Query: 229 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD---FIDGYINLAAALVAAGDME 285
            +     AY  LG     +G  ++A   +   V+  P      D  + L  +L   G+ +
Sbjct: 173 NSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGNTD 232

Query: 286 QA 287
           +A
Sbjct: 233 EA 234



 Score = 30.5 bits (69), Expect = 3.2
 Identities = 13/62 (20%), Positives = 24/62 (38%), Gaps = 3/62 (4%)

Query: 580 QNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD---FIDGYINLAAALVAAGDME 636
            +     AY  LG     +G  ++A   +   V+  P      D  + L  +L   G+ +
Sbjct: 173 NSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGNTD 232

Query: 637 QA 638
           +A
Sbjct: 233 EA 234


>gnl|CDD|236875 PRK11189, PRK11189, lipoprotein NlpI; Provisional.
          Length = 296

 Score = 30.6 bits (70), Expect = 3.5
 Identities = 19/66 (28%), Positives = 29/66 (43%)

Query: 66  TLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD 125
           + A+   P +A+AY+ LG    + G    A E +   + L P +   Y+N   AL   G 
Sbjct: 88  SQALALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNRGIALYYGGR 147

Query: 126 MEQAVQ 131
            E A  
Sbjct: 148 YELAQD 153



 Score = 30.6 bits (70), Expect = 3.5
 Identities = 19/66 (28%), Positives = 29/66 (43%)

Query: 224 TLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD 283
           + A+   P +A+AY+ LG    + G    A E +   + L P +   Y+N   AL   G 
Sbjct: 88  SQALALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNRGIALYYGGR 147

Query: 284 MEQAVQ 289
            E A  
Sbjct: 148 YELAQD 153



 Score = 30.6 bits (70), Expect = 3.5
 Identities = 19/66 (28%), Positives = 29/66 (43%)

Query: 575 TLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGD 634
           + A+   P +A+AY+ LG    + G    A E +   + L P +   Y+N   AL   G 
Sbjct: 88  SQALALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNRGIALYYGGR 147

Query: 635 MEQAVQ 640
            E A  
Sbjct: 148 YELAQD 153


>gnl|CDD|233632 TIGR01910, DapE-ArgE, acetylornithine deacetylase or
           succinyl-diaminopimelate desuccinylase.  This group of
           sequences contains annotations for both acetylornithine
           deacetylase and succinyl-diaminopimelate desuccinylase,
           but does not contain any members with experimental
           characterization. Bacillus, Staphylococcus and
           Sulfolobus species contain multiple hits to this
           subfamily and each may have a separate activity.
           Determining which is which must await further laboratory
           research [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 375

 Score = 30.8 bits (70), Expect = 3.7
 Identities = 16/79 (20%), Positives = 31/79 (39%), Gaps = 1/79 (1%)

Query: 469 VIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLDSGKSWRTNPE 528
            +P        +RI+P +    V++ + D +  L K+     + +  +  SG +    P+
Sbjct: 248 SVPDYCEFSIDVRIIPEENLDEVKQIIEDVVKALSKSDGWLYENEPVVKWSGPNETP-PD 306

Query: 529 HPNYVAAARATKYVYNVEP 547
                A     K V  +EP
Sbjct: 307 SRLVKALEAIIKKVRGIEP 325


>gnl|CDD|204480 pfam10414, CysG_dimeriser, Sirohaem synthase dimerisation region.
           Bacterial sulfur metabolism depends on the
           iron-containing porphinoid sirohaem. CysG,
           S-adenosyl-L-methionine (SAM)-dependent
           bis-methyltransferase, dehydrogenase and ferrochelatase,
           synthesises sirohaem from uroporphyrinogen III via
           reactions which encompass two branchpoint intermediates
           in tetrapyrrole biosynthesis, diverting flux first from
           protoporphyrin IX biosynthesis and then from cobalamin
           (vitamin B12) biosynthesis. CysG is a dimer of two
           structurally similar protomers held together
           asymmetrically through a number of salt-bridges across
           complementary residues in the CysG_dimeriser region to
           produce a series of active sites, accounting for CysG's
           multifunctionality, catalyzing four diverse reactions:
           two SAM-dependent methylations, NAD+-dependent
           tetrapyrrole dehydrogenation and metal chelation. The
           CysG_dimeriser region holding the two protomers together
           is of 74 residues.
          Length = 60

 Score = 27.5 bits (62), Expect = 4.0
 Identities = 20/60 (33%), Positives = 24/60 (40%), Gaps = 14/60 (23%)

Query: 90  GQLQEALENYRHAV--RLKP---------DFIDGYINLAAALVAAGDMEQAVQAYVTALQ 138
           G+L      +R  V  RL            F DG     A LV AGD E+A  A   AL+
Sbjct: 4   GRLAALAGRFRDRVKQRLPDVAERRRFWERFFDGPF---AELVLAGDEEEAEAALEQALE 60



 Score = 27.5 bits (62), Expect = 4.0
 Identities = 20/60 (33%), Positives = 24/60 (40%), Gaps = 14/60 (23%)

Query: 248 GQLQEALENYRHAV--RLKP---------DFIDGYINLAAALVAAGDMEQAVQAYVTALQ 296
           G+L      +R  V  RL            F DG     A LV AGD E+A  A   AL+
Sbjct: 4   GRLAALAGRFRDRVKQRLPDVAERRRFWERFFDGPF---AELVLAGDEEEAEAALEQALE 60



 Score = 27.5 bits (62), Expect = 4.0
 Identities = 20/60 (33%), Positives = 24/60 (40%), Gaps = 14/60 (23%)

Query: 599 GQLQEALENYRHAV--RLKP---------DFIDGYINLAAALVAAGDMEQAVQAYVTALQ 647
           G+L      +R  V  RL            F DG     A LV AGD E+A  A   AL+
Sbjct: 4   GRLAALAGRFRDRVKQRLPDVAERRRFWERFFDGPF---AELVLAGDEEEAEAALEQALE 60


>gnl|CDD|237265 PRK13009, PRK13009, succinyl-diaminopimelate desuccinylase;
           Reviewed.
          Length = 375

 Score = 30.4 bits (70), Expect = 4.5
 Identities = 23/90 (25%), Positives = 40/90 (44%), Gaps = 15/90 (16%)

Query: 469 VIPGKVVGKFSIRIVPNQTPQCVEKYVLDYLNELWKARNSPNKFKAYLLD---SGKSWRT 525
           VIPG++  +F+ R     T + ++  V   L      ++  +    Y L+   SG+ + T
Sbjct: 245 VIPGELEAQFNFRFSTEHTAESLKARVEAIL-----DKHGLD----YTLEWTLSGEPFLT 295

Query: 526 NPEHPNYVAA-ARATKYVYNVEPDLTREGG 554
            P     V A   A + V  + P+L+  GG
Sbjct: 296 PPG--KLVDAVVAAIEAVTGITPELSTSGG 323


>gnl|CDD|237851 PRK14894, PRK14894, glycyl-tRNA synthetase; Provisional.
          Length = 539

 Score = 30.4 bits (68), Expect = 4.6
 Identities = 28/98 (28%), Positives = 40/98 (40%), Gaps = 15/98 (15%)

Query: 87  KERGQLQEAL-ENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVT-------ALQ 138
           K RGQ Q  + E+YR  +RLKP        LA   VA   +++  +  V         LQ
Sbjct: 409 KLRGQAQPLIDEHYRTVLRLKP-------RLAPIKVAVFPLKRNHEGLVATAKAVRRQLQ 461

Query: 139 YNPDLYCVRSDLGNLLKALGRLDEAKDLYCVRSDLGNL 176
               +  V  D G + K   R DE    +C+  D   +
Sbjct: 462 VGGRMRTVYDDTGAIGKLYRRQDEIGTPFCITVDFDTI 499


>gnl|CDD|226036 COG3505, VirD4, Type IV secretory pathway, VirD4 components
           [Intracellular trafficking and secretion].
          Length = 596

 Score = 30.4 bits (69), Expect = 4.9
 Identities = 13/47 (27%), Positives = 17/47 (36%), Gaps = 1/47 (2%)

Query: 795 PPKRERKANYAVDAYFKEALRTSEPKAPKAPRPPKQPIVQDFQFFPP 841
           PP +  K  Y  D  FK+ L+  EP+    P     P        P 
Sbjct: 540 PPIKCGKLRYFKDPKFKDRLKPVEPELEAGP-KLGAPAKASTDAAPS 585


>gnl|CDD|233924 TIGR02552, LcrH_SycD, type III secretion low calcium response
           chaperone LcrH/SycD.  Genes in this family are found in
           type III secretion operons. LcrH, from Yersinia is
           believed to have a regulatory function in the
           low-calcium response of the secretion system. The same
           protein is also known as SycD (SYC = Specific Yop
           Chaperone) for its chaperone role. In Pseudomonas, where
           the homolog is known as PcrH, the chaperone role has
           been demonstrated and the regulatory role appears to be
           absent. ScyD/LcrH contains three central
           tetratricopeptide-like repeats that are predicted to
           fold into an all-alpha-helical array.
          Length = 135

 Score = 28.8 bits (65), Expect = 6.3
 Identities = 21/87 (24%), Positives = 40/87 (45%)

Query: 52  HFQCRKLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFID 111
            +Q  + D++     L    +P  +  +  L    +   + +EA++ Y  A  L PD   
Sbjct: 27  LYQQGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPR 86

Query: 112 GYINLAAALVAAGDMEQAVQAYVTALQ 138
            Y + A  L+A G+ E A++A   A++
Sbjct: 87  PYFHAAECLLALGEPESALKALDLAIE 113



 Score = 28.8 bits (65), Expect = 7.0
 Identities = 20/96 (20%), Positives = 39/96 (40%), Gaps = 2/96 (2%)

Query: 14  LAHREYQAGDYESAERHCMQLWRQ-ETNNTGVLLLLSSIHFQCRKLDKSAHFSTLAIKQN 72
           LA+  YQ G Y+ A     QL    +  N+   L L++     ++ +++     LA   +
Sbjct: 23  LAYNLYQQGRYDEA-LKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALD 81

Query: 73  PLLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPD 108
           P     Y +        G+ + AL+    A+ +  +
Sbjct: 82  PDDPRPYFHAAECLLALGEPESALKALDLAIEICGE 117


>gnl|CDD|233451 TIGR01531, glyc_debranch, glycogen debranching enzymye.  glycogen
           debranching enzyme possesses two different catalytic
           activities; oligo-1,4-->1,4-glucantransferase (EC
           2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33). Site
           directed mutagenesis studies in S. cerevisiae indicate
           that the transferase and glucosidase activities are
           independent and located in different regions of the
           polypeptide chain. Proteins in this model belong to the
           larger alpha-amylase family. The model covers eukaryotic
           proteins with a seed composed of human, nematode and
           yeast sequences. Yeast seed sequence is well
           characterized. The model is quite rigorous; either query
           sequence yields large bit score or it fails to hit the
           model altogether. There doesn't appear to be any middle
           ground [Energy metabolism, Biosynthesis and degradation
           of polysaccharides].
          Length = 1464

 Score = 30.2 bits (68), Expect = 6.3
 Identities = 20/111 (18%), Positives = 34/111 (30%), Gaps = 8/111 (7%)

Query: 18  EYQAGDYESAERHCMQLWRQETNNTGVLLLLSSIHF----QCRKLDKSAHFSTLAIKQNP 73
           E+   D + A       W QE++     +   S       + R+   + +F T     N 
Sbjct: 286 EFYQVDVQKAVNDFKAHWTQESSYVTNNIKDQSSDIIQDPEYRRFGVTVNFETALRIFNR 345

Query: 74  LLAEAYSNLGNVYKERGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAG 124
              +         K    L  AL      +RL        I++A   +  G
Sbjct: 346 HNGDLKLEEDRGEKCSSSLATALNILNENLRL----YRYDIDVALEQLLGG 392


>gnl|CDD|236148 PRK08074, PRK08074, bifunctional ATP-dependent DNA helicase/DNA
           polymerase III subunit epsilon; Validated.
          Length = 928

 Score = 30.3 bits (69), Expect = 6.7
 Identities = 24/107 (22%), Positives = 39/107 (36%), Gaps = 20/107 (18%)

Query: 685 HHFEKAVS------LDPNFLDAYINLGNVLKEARIFDRANTLFYVSSYFM-ITDEDIDVI 737
           HHFE+A S               ++    L+E  +  +   LF  S      +  D+D  
Sbjct: 465 HHFEEAASRHLGEQFSYMSFQLLLSRLGTLEEDGLLSKLAKLFKKSDQASRSSFRDLDES 524

Query: 738 LERCEAKTEELNKKFEQLGESSLRDFTLDAPTQS-----VYKFEGED 779
           L+  + + +EL   F       LR F L    Q      +Y++  E 
Sbjct: 525 LKELKFEADEL---F-----QMLRSFVLKRKKQEQNGRLIYRYNTES 563


>gnl|CDD|226862 COG4455, ImpE, Protein of avirulence locus involved in
           temperature-dependent protein secretion [General
           function prediction only].
          Length = 273

 Score = 29.4 bits (66), Expect = 7.7
 Identities = 17/61 (27%), Positives = 23/61 (37%), Gaps = 1/61 (1%)

Query: 599 GQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIE 658
             LQ+A+   R  V+ KP    G   L   L  AGD E+A+     A   +P        
Sbjct: 15  NSLQDAIGLARDQVKAKPTDAGGRHFLFQLLCVAGDWEKALAQLNLAATLSPQ-DTVGAS 73

Query: 659 T 659
            
Sbjct: 74  L 74


>gnl|CDD|215883 pfam00374, NiFeSe_Hases, Nickel-dependent hydrogenase. 
          Length = 502

 Score = 29.8 bits (67), Expect = 8.0
 Identities = 16/63 (25%), Positives = 25/63 (39%), Gaps = 5/63 (7%)

Query: 616 PDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPSCYLKAIETRPDFAVAWSNLGCVFN 675
            D++D    L A    A ++ Q+V  +  +       Y KA++ R    V    LG   N
Sbjct: 76  LDWVDVTSALKADPKKASELAQSVSPWPKS-----PGYFKAVQNRLKKFVESGQLGIFTN 130

Query: 676 AQG 678
           A  
Sbjct: 131 AYW 133


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.135    0.395 

Gapped
Lambda     K      H
   0.267   0.0733    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 47,250,961
Number of extensions: 4783156
Number of successful extensions: 6385
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6130
Number of HSP's successfully gapped: 329
Length of query: 904
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 798
Effective length of database: 6,236,078
Effective search space: 4976390244
Effective search space used: 4976390244
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.4 bits)