RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12642
         (201 letters)



>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 47.4 bits (113), Expect = 2e-06
 Identities = 43/169 (25%), Positives = 66/169 (39%), Gaps = 5/169 (2%)

Query: 3   PIVYIIGCFHFFR--YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEY 60
           P + + G   +     E+A ++L  I+K    +  AR+    I     ++ EAI  L+  
Sbjct: 296 PALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARRLLASIQLRLGRVDEAIATLSPA 355

Query: 61  LKKFMTDQETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLEN 120
           L     D      L + YL+ GD+ KA   + +     P N     +   I     G  +
Sbjct: 356 LGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAARTQLG-ISKLSQG--D 412

Query: 121 IELAISHYLMAINLNEKNIRALYGLALSCHQVLTSAKCSAAKKKEISKQ 169
              AI+    A  L+ +  RA   L LS  +     K  AA KK   KQ
Sbjct: 413 PSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQ 461



 Score = 37.4 bits (87), Expect = 0.004
 Identities = 33/135 (24%), Positives = 57/135 (42%), Gaps = 10/135 (7%)

Query: 16  YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELC 75
           Y+ A+  L+  ++ D  +  AR     I  A      A KEL + L       +    L 
Sbjct: 38  YKAAIIQLKNALQKDPNDAEARFLLGKIYLALGDYAAAEKELRKALSLGYPKNQVLPLLA 97

Query: 76  DLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQG----GLENIELAISHYLMA 131
             YL +G + + +  +    L      L  +  A++L  +G    GL  +ELA   Y  A
Sbjct: 98  RAYLLQGKFQQVLDELPGKTL------LDDEGAAELLALRGLAYLGLGQLELAQKSYEQA 151

Query: 132 INLNEKNIRALYGLA 146
           + ++ +++ A  GLA
Sbjct: 152 LAIDPRSLYAKLGLA 166



 Score = 37.4 bits (87), Expect = 0.004
 Identities = 33/132 (25%), Positives = 49/132 (37%), Gaps = 3/132 (2%)

Query: 15  RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL 74
             EEA+  LE   +++                K ++ +A+  L E         E W  L
Sbjct: 548 NEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAWLML 607

Query: 75  CDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINL 134
               L+ GD  KAV   ++L    P + L     AD       ++N   AI+    A+ L
Sbjct: 608 GRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADA---YAVMKNYAKAITSLKRALEL 664

Query: 135 NEKNIRALYGLA 146
              N  A  GLA
Sbjct: 665 KPDNTEAQIGLA 676



 Score = 35.8 bits (83), Expect = 0.012
 Identities = 33/130 (25%), Positives = 52/130 (40%), Gaps = 3/130 (2%)

Query: 17  EEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELCD 76
            EA+  LET  ++D     A    I       +  +A+    +  KK   +      L  
Sbjct: 414 SEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQPDNASLHNLLGA 473

Query: 77  LYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINLNE 136
           +YL +GD AKA    E+     P         A I   +G   N + AI  +   + ++ 
Sbjct: 474 IYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEG---NPDDAIQRFEKVLTIDP 530

Query: 137 KNIRALYGLA 146
           KN+RA+  LA
Sbjct: 531 KNLRAILALA 540



 Score = 33.9 bits (78), Expect = 0.043
 Identities = 27/110 (24%), Positives = 46/110 (41%), Gaps = 1/110 (0%)

Query: 15  RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL 74
            Y  A++     +K   ++  A K    +L + N   EA+K L  +LK    D      L
Sbjct: 718 DYPAAIQAYRKALKRAPSSQNAIKLHRALLASGNT-AEAVKTLEAWLKTHPNDAVLRTAL 776

Query: 75  CDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELA 124
            +LYL++ DY KA+   + +    P N ++    A +         +E A
Sbjct: 777 AELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNNLAWLYLELKDPRALEYA 826



 Score = 32.0 bits (73), Expect = 0.19
 Identities = 27/134 (20%), Positives = 40/134 (29%), Gaps = 5/134 (3%)

Query: 16  YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELC 75
            E AL      I +   N A       IL    +  EA K     LKK            
Sbjct: 209 IELALAAYRKAIALRPNNIAVLLALATILIEAGEFEEAEKHADALLKKAPNSPLAHYLKA 268

Query: 76  DLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADI-LYTQGGLENIELAISHYLMAINL 134
            +   + +Y  A   +++     P  +L     A    Y  G LE     ++  L     
Sbjct: 269 LVDFQKKNYEDARETLQDALKSAP-EYLPALLLAGASEYQLGNLEQAYQYLNQILKYA-- 325

Query: 135 NEKNIRALYGLALS 148
              + +A   LA  
Sbjct: 326 -PNSHQARRLLASI 338



 Score = 31.6 bits (72), Expect = 0.24
 Identities = 25/103 (24%), Positives = 40/103 (38%)

Query: 17  EEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELCD 76
           E A +  E  + ID  +  A+     +  A+N+  EA   + E L     + +      D
Sbjct: 142 ELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVLTADPGNVDALLLKGD 201

Query: 77  LYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLE 119
           L LS G+   A+    +     P+N  +    A IL   G  E
Sbjct: 202 LLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEAGEFE 244



 Score = 31.2 bits (71), Expect = 0.42
 Identities = 18/43 (41%), Positives = 22/43 (51%)

Query: 46 AKNKIPEAIKELTEYLKKFMTDQETWQELCDLYLSEGDYAKAV 88
           KNK   AI +L   L+K   D E    L  +YL+ GDYA A 
Sbjct: 34 QKNKYKAAIIQLKNALQKDPNDAEARFLLGKIYLALGDYAAAE 76



 Score = 30.8 bits (70), Expect = 0.44
 Identities = 18/86 (20%), Positives = 31/86 (36%)

Query: 16  YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELC 75
             +A E  E  + I+     A      I   +    +AI+   + L     +      L 
Sbjct: 481 LAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRAILALA 540

Query: 76  DLYLSEGDYAKAVFCMEELFLHHPHN 101
            LYL  G+  +AV  +E+    +P  
Sbjct: 541 GLYLRTGNEEEAVAWLEKAAELNPQE 566



 Score = 30.1 bits (68), Expect = 0.96
 Identities = 24/130 (18%), Positives = 50/130 (38%), Gaps = 3/130 (2%)

Query: 14  FRYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQE 73
            R++EA   ++ ++  D  N  A   K  +L +   I  A+    + +     +      
Sbjct: 173 NRFDEARALIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLA 232

Query: 74  LCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAIN 133
           L  + +  G++ +A    + L    P++ L H   A + + +    N E A      A+ 
Sbjct: 233 LATILIEAGEFEEAEKHADALLKKAPNSPLAHYLKALVDFQKK---NYEDARETLQDALK 289

Query: 134 LNEKNIRALY 143
              + + AL 
Sbjct: 290 SAPEYLPALL 299



 Score = 29.7 bits (67), Expect = 1.1
 Identities = 33/164 (20%), Positives = 55/164 (33%), Gaps = 36/164 (21%)

Query: 16  YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELC 75
           Y +A+  L+  +++   NT A+     +L A  +   A K      K+       ++   
Sbjct: 651 YAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKSLQKQHPKAALGFELEG 710

Query: 76  DLYLSEGDYAKAVFCMEELF-----------LH----------------------HPHNH 102
           DLYL + DY  A+    +             LH                      HP++ 
Sbjct: 711 DLYLRQKDYPAAIQAYRKALKRAPSSQNAIKLHRALLASGNTAEAVKTLEAWLKTHPNDA 770

Query: 103 LLHQRYADILYTQGGLENIELAISHYLMAINLNEKNIRALYGLA 146
           +L    A++   Q   +    AI HY   +     N   L  LA
Sbjct: 771 VLRTALAELYLAQKDYDK---AIKHYQTVVKKAPDNAVVLNNLA 811



 Score = 28.5 bits (64), Expect = 2.8
 Identities = 34/191 (17%), Positives = 62/191 (32%), Gaps = 12/191 (6%)

Query: 15  RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL 74
             ++A++  E ++ ID  N  A      +        EA+  L +  +    + E    L
Sbjct: 514 NPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPALAL 573

Query: 75  CDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINL 134
              YL +G   KA+  + E     P +              G L     A+S +   + L
Sbjct: 574 AQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNK---AVSSFKKLLAL 630

Query: 135 NEKNIRALYGLALSCHQVLTSAKCSAAKKKEISKQM-----MWVSKHLARQYEEQQGNTE 189
              +  AL  LA          K  A     + + +        ++    Q       TE
Sbjct: 631 QPDSALALLLLA----DAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTE 686

Query: 190 TLTELMSALQV 200
           +  ++  +LQ 
Sbjct: 687 SAKKIAKSLQK 697



 Score = 28.5 bits (64), Expect = 3.1
 Identities = 24/72 (33%), Positives = 34/72 (47%), Gaps = 3/72 (4%)

Query: 77  LYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINLNE 136
           L L+E  + +A   ++E+    P N        D+L + G   NIELA++ Y  AI L  
Sbjct: 168 LALAENRFDEARALIDEVLTADPGNVDALLLKGDLLLSLG---NIELALAAYRKAIALRP 224

Query: 137 KNIRALYGLALS 148
            NI  L  LA  
Sbjct: 225 NNIAVLLALATI 236



 Score = 27.4 bits (61), Expect = 7.4
 Identities = 21/118 (17%), Positives = 38/118 (32%), Gaps = 3/118 (2%)

Query: 17  EEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELCD 76
            +A+   + ++ +   +  A               +AI  L   L+    + E    L  
Sbjct: 618 NKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQ 677

Query: 77  LYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINL 134
           L L+      A    + L   HP   L  +   D+   Q   ++   AI  Y  A+  
Sbjct: 678 LLLAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLRQ---KDYPAAIQAYRKALKR 732


>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
           34 amino acids
           [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
           X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
           in a variety of organisms including bacteria,
           cyanobacteria, yeast, fungi, plants, and humans in
           various subcellular locations; involved in a variety of
           functions including protein-protein interactions, but
           common features in the interaction partners have not
           been defined; involved in chaperone, cell-cycle,
           transciption, and protein transport complexes; the
           number of TPR motifs varies among proteins (1,3-11,13
           15,16,19); 5-6 tandem repeats generate a right-handed
           helical structure with an amphipathic channel that is
           thought to accomodate an alpha-helix of a target
           protein; it has been proposed that TPR proteins
           preferably interact with WD-40 repeat proteins, but in
           many instances several TPR-proteins seem to aggregate to
           multi-protein complexes; examples of TPR-proteins
           include, Cdc16p, Cdc23p and Cdc27p components of the
           cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
           targeting signals, the Tom70p co-receptor for
           mitochondrial targeting signals, Ser/Thr phosphatase 5C
           and the p110 subunit of O-GlcNAc transferase; three
           copies of the repeat are present here.
          Length = 100

 Score = 42.4 bits (100), Expect = 1e-05
 Identities = 18/100 (18%), Positives = 33/100 (33%), Gaps = 3/100 (3%)

Query: 38  KRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELCDLYLSEGDYAKAVFCMEELFLH 97
                +        EA++   + L+    + + +  L   Y   G Y +A+   E+    
Sbjct: 4   LNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL 63

Query: 98  HPHNHLLHQRYADILYTQGGLENIELAISHYLMAINLNEK 137
            P N   +       Y  G     E A+  Y  A+ L+  
Sbjct: 64  DPDNAKAYYNLGLAYYKLG---KYEEALEAYEKALELDPN 100



 Score = 42.4 bits (100), Expect = 1e-05
 Identities = 24/83 (28%), Positives = 36/83 (43%), Gaps = 3/83 (3%)

Query: 69  ETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHY 128
           E    L +LY   GDY +A+   E+     P N   +   A   Y  G     E A+  Y
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLG---KYEEALEDY 57

Query: 129 LMAINLNEKNIRALYGLALSCHQ 151
             A+ L+  N +A Y L L+ ++
Sbjct: 58  EKALELDPDNAKAYYNLGLAYYK 80



 Score = 34.7 bits (80), Expect = 0.005
 Identities = 19/79 (24%), Positives = 35/79 (44%)

Query: 15 RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL 74
           Y+EALE+ E  +++D  N  A            K  EA+++  + L+    + + +  L
Sbjct: 15 DYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNL 74

Query: 75 CDLYLSEGDYAKAVFCMEE 93
             Y   G Y +A+   E+
Sbjct: 75 GLAYYKLGKYEEALEAYEK 93



 Score = 33.9 bits (78), Expect = 0.012
 Identities = 14/48 (29%), Positives = 22/48 (45%)

Query: 15 RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLK 62
          +YEEALE  E  +++D  N  A            K  EA++   + L+
Sbjct: 49 KYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 38.1 bits (89), Expect = 2e-04
 Identities = 15/68 (22%), Positives = 26/68 (38%), Gaps = 2/68 (2%)

Query: 69  ETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHY 128
           E  + L +     GDY +A+   E+     P N   +   A      G  ++ E A+   
Sbjct: 4   EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLG--KDYEEALEDL 61

Query: 129 LMAINLNE 136
             A+ L+ 
Sbjct: 62  EKALELDP 69



 Score = 27.7 bits (62), Expect = 0.97
 Identities = 13/45 (28%), Positives = 20/45 (44%), Gaps = 3/45 (6%)

Query: 103 LLHQRYADILYTQGGLENIELAISHYLMAINLNEKNIRALYGLAL 147
              +   + L+  G   + + AI  Y  A+ L+  N  A Y LAL
Sbjct: 4   EALKNLGNALFKLG---DYDEAIEAYEKALELDPDNAEAYYNLAL 45



 Score = 26.9 bits (60), Expect = 1.8
 Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 1/49 (2%)

Query: 15 RYEEALEHLETIIKIDETNTAAR-KRKICILKAKNKIPEAIKELTEYLK 62
           Y+EA+E  E  +++D  N  A     +  LK      EA+++L + L+
Sbjct: 18 DYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALE 66



 Score = 26.1 bits (58), Expect = 3.7
 Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 1/61 (1%)

Query: 33 NTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELCDLYLSEG-DYAKAVFCM 91
          N  A K     L       EAI+   + L+    + E +  L   YL  G DY +A+  +
Sbjct: 2  NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61

Query: 92 E 92
          E
Sbjct: 62 E 62


>gnl|CDD|234406 TIGR03939, PGA_TPR_OMP, poly-beta-1,6 N-acetyl-D-glucosamine export
           porin PgaA.  Members of this protein family are the
           poly-beta-1,6 N-acetyl-D-glucosamine (PGA) export porin
           PgaA of Gram-negative bacteria. There is no counterpart
           in the poly-beta-1,6 N-acetyl-D-glucosamine biosynthesis
           systems of Gram-positive bacteria such as Staphylococcus
           epidermidis. The PGA polysaccharide adhesin is a
           critical determinant of biofilm formation. The conserved
           C-terminal domain of this outer membrane protein is
           preceded by a variable number of TPR repeats.
          Length = 800

 Score = 37.3 bits (87), Expect = 0.003
 Identities = 23/98 (23%), Positives = 41/98 (41%)

Query: 15  RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL 74
           ++++ALE    +++ D  N          L    K  EA+K L EY+ +F TD   ++ L
Sbjct: 101 QWDKALELYRKLLQRDPNNPDGLLGLALTLADAGKDAEALKYLKEYVARFPTDAARYEAL 160

Query: 75  CDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADIL 112
             +  +  D+  A+   ++     P N         IL
Sbjct: 161 AYVLRAAEDHLDALQAWQQALTLEPDNPEAALELYRIL 198


>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat. 
          Length = 44

 Score = 32.1 bits (73), Expect = 0.017
 Identities = 7/43 (16%), Positives = 13/43 (30%)

Query: 69  ETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADI 111
                L    L+ GD  +A+  +       P +       A +
Sbjct: 2   AALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44


>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat. 
          Length = 65

 Score = 32.3 bits (74), Expect = 0.025
 Identities = 13/66 (19%), Positives = 23/66 (34%), Gaps = 3/66 (4%)

Query: 73  ELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAI 132
            L    L  GDY +A+  +E     +P          + L  QG    +  A +    A+
Sbjct: 2   ALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQG---RLAEAAALLRAAL 58

Query: 133 NLNEKN 138
             +  +
Sbjct: 59  AADPDD 64



 Score = 28.4 bits (64), Expect = 0.52
 Identities = 13/53 (24%), Positives = 18/53 (33%)

Query: 15 RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTD 67
           Y+EAL  LE  +        A       L  + ++ EA   L   L     D
Sbjct: 12 DYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPDD 64


>gnl|CDD|225094 COG2183, Tex, Transcriptional accessory protein [Transcription].
          Length = 780

 Score = 33.0 bits (76), Expect = 0.10
 Identities = 44/162 (27%), Positives = 65/162 (40%), Gaps = 24/162 (14%)

Query: 23  LETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELCDLYLSEG 82
           L  + +  E      +RK  ILK+   I E  K LT+ LK+ +   E   EL DLYL   
Sbjct: 57  LRDLEERLEYLRELEERKESILKS---IEEQGK-LTDELKEQIEAAEELTELEDLYL--- 109

Query: 83  DYA-----KAVFCME-------ELFLHHPHNHLLHQRYADILYTQGGLENIELAIS---H 127
            Y      +A    E       +L L  P    L +  AD +  + G+ + +LA+     
Sbjct: 110 PYKEKRRTRATIAREKGLEPLADLILSKPSLDPLLESAADFISIEEGVSSSKLALDGARD 169

Query: 128 YLM-AINLNEKNIRALYGLALSCHQVLTSAKCSAAKKKEISK 168
            L   I+ + +    L    L  H VLTS      +  E +K
Sbjct: 170 ILAERISEDAELREKLRDY-LRKHGVLTSKVVKGKEDDEGAK 210


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 32.0 bits (73), Expect = 0.17
 Identities = 9/24 (37%), Positives = 13/24 (54%), Gaps = 2/24 (8%)

Query: 94  LFLHHP--HNHLLHQRYADILYTQ 115
             LHH   H + LH R +D+L + 
Sbjct: 130 HHLHHSSKHRNRLHARRSDVLPSL 153


>gnl|CDD|224661 COG1747, COG1747, Uncharacterized N-terminal domain of the
           transcription elongation factor GreA [Function unknown].
          Length = 711

 Score = 32.2 bits (73), Expect = 0.19
 Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 7/83 (8%)

Query: 16  YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELC 75
           + EA+  L+ I++ DE +  ARK  I  L+ K +      +L EYLK     Q + +   
Sbjct: 221 WTEAIRILKHILEHDEKDVWARKEIIENLRDKYR---GHSQLEEYLKISNISQ-SGRNFF 276

Query: 76  DLYLSEGDYAKAVFCMEELFLHH 98
           +      D+ K +   E  F+ H
Sbjct: 277 EAL---NDFEKLMHFDEGNFVFH 296


>gnl|CDD|226156 COG3629, DnrI, DNA-binding transcriptional activator of the SARP
           family [Signal transduction mechanisms].
          Length = 280

 Score = 31.6 bits (72), Expect = 0.20
 Identities = 12/53 (22%), Positives = 23/53 (43%)

Query: 15  RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTD 67
           R +  +EHLE +I++D  +  A  R +       +   AI+   +  K    +
Sbjct: 168 RADAVIEHLERLIELDPYDEPAYLRLMEAYLVNGRQSAAIRAYRQLKKTLAEE 220


>gnl|CDD|223966 COG1035, FrhB, Coenzyme F420-reducing hydrogenase, beta subunit
           [Energy production and conversion].
          Length = 332

 Score = 31.6 bits (72), Expect = 0.21
 Identities = 21/63 (33%), Positives = 28/63 (44%), Gaps = 6/63 (9%)

Query: 4   IVYIIG--CFHFFRYEEALEHLETIIKIDETNTAA---RKRKICI-LKAKNKIPEAIKEL 57
           IVY+IG  C   F YE   + LE  + +D  +      RK K  + LK        +KE 
Sbjct: 174 IVYVIGLFCMENFSYEGLKKFLEEDLGVDPEDVEKMDIRKGKFVVELKDGEVKEIPLKET 233

Query: 58  TEY 60
            EY
Sbjct: 234 EEY 236


>gnl|CDD|200938 pfam00025, Arf, ADP-ribosylation factor family.  Pfam combines a
           number of different Prosite families together.
          Length = 174

 Score = 31.0 bits (71), Expect = 0.24
 Identities = 16/60 (26%), Positives = 28/60 (46%), Gaps = 6/60 (10%)

Query: 4   IVYIIGCFHFFRYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIK--ELTEYL 61
           +++++      R EEA E L  ++  +E   A     + IL  K  +P A+   E+ E L
Sbjct: 85  VIFVVDSADRDRIEEAKEELHALLNEEELADAP----LLILANKQDLPGAMSEAEIRELL 140


>gnl|CDD|226559 COG4073, COG4073, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 198

 Score = 31.3 bits (71), Expect = 0.25
 Identities = 10/47 (21%), Positives = 22/47 (46%)

Query: 21  EHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTD 67
             L+ + +I      +R R+  I + + + PE  ++  EY + F  +
Sbjct: 151 RSLDVVSEILGLIAESRTREEFIEELRRRFPEQTEKALEYFRSFYGE 197


>gnl|CDD|225660 COG3118, COG3118, Thioredoxin domain-containing protein
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 304

 Score = 31.2 bits (71), Expect = 0.30
 Identities = 12/34 (35%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 15  RYEEALEHLETIIKIDET--NTAARKRKICILKA 46
           R E ALEHL  +++ D    +  ARK  + + +A
Sbjct: 251 RNEAALEHLLALLRRDRGFEDGEARKTLLELFEA 284


>gnl|CDD|131219 TIGR02164, torA, trimethylamine-N-oxide reductase TorA.  This very
           narrowly defined family represents TorA, part of a
           family of related molybdoenzymes that include biotin
           sulfoxide reductases, dimethyl sulfoxide reductases, and
           at least two different subfamilies of
           trimethylamine-N-oxide reductases. A single enzyme from
           the larger family may have more than one activity. TorA
           typically is located in the periplasm, has a Tat
           (twin-arginine translocation)-dependent signal sequence,
           and is encoded in a torCAD operon.
          Length = 822

 Score = 31.0 bits (70), Expect = 0.42
 Identities = 26/107 (24%), Positives = 47/107 (43%), Gaps = 21/107 (19%)

Query: 7   IIGCFHFFRYE------EALEHLETIIKIDETNTAARKRKICILKA-----KNKIPE--- 52
             GC  +  ++      +A + LET++ ID + TA  +    +L A     +N I     
Sbjct: 470 FSGCNPWHHHQDRNRMKQAFQKLETVVTIDVSWTATCRFSDIVLPACTQFERNDIDVYGS 529

Query: 53  -------AIKELTEYLKKFMTDQETWQELCDLYLSEGDYAKAVFCME 92
                  A+++L + L    +D E + ELC  +  E +Y + +  ME
Sbjct: 530 YSNRGIIAMQKLVDPLFDSRSDFEIFTELCRRFGKEKEYTRNMDEME 576


>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
           repeats [General function prediction only].
          Length = 484

 Score = 30.4 bits (69), Expect = 0.53
 Identities = 25/113 (22%), Positives = 38/113 (33%), Gaps = 18/113 (15%)

Query: 15  RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL 74
           + +EA+E L+  + +D  +   +      L    K  EAI+ L  YL     D   W  L
Sbjct: 355 KAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLL 414

Query: 75  CDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLEN-IELAIS 126
              Y   G+ A+A                     A+     G LE  I   + 
Sbjct: 415 AQAYAELGNRAEA-----------------LLARAEGYALAGRLEQAIIFLMR 450


>gnl|CDD|233199 TIGR00939, 2a57, Equilibrative Nucleoside Transporter (ENT).
           [Transport and binding proteins, Nucleosides, purines
           and pyrimidines].
          Length = 437

 Score = 30.1 bits (68), Expect = 0.79
 Identities = 14/74 (18%), Positives = 21/74 (28%), Gaps = 1/74 (1%)

Query: 48  NKIPEAIKELTEYLKKFMTDQETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQR 107
           N    A +   EY K      E       ++     Y   +       L +  N  L  R
Sbjct: 13  NAFITAPQYFIEYYKYAQNIPEAIPSSSKMWKHFNTYY-TLASQLPSLLFNSLNLFLIFR 71

Query: 108 YADILYTQGGLENI 121
               +   GGL  +
Sbjct: 72  IPVTVRLLGGLVIL 85


>gnl|CDD|198111 smart01043, BTAD, Bacterial transcriptional activator domain.
           Found in the DNRI/REDD/AFSR family of regulators. This
           region of AFSR along with the C terminal region is
           capable of independently directing actinorhodin
           production. This family contains TPR repeats.
          Length = 145

 Score = 29.2 bits (66), Expect = 0.80
 Identities = 15/60 (25%), Positives = 24/60 (40%), Gaps = 3/60 (5%)

Query: 69  ETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHY 128
           E  + L +  L+ G + +A+  +E L    P    LH+     LY  G       A+  Y
Sbjct: 62  EALEALAEALLALGRHEEALALLERLLALDPLRERLHRLLMRALYRAG---RRAEALRAY 118


>gnl|CDD|199209 cd00463, Ribosomal_L31e, Eukaryotic/archaeal ribosomal protein
          L31.  Ribosomal protein L31e, which is present in
          archaea and eukaryotes, binds the 23S rRNA and is one
          of six protein components encircling the polypeptide
          exit tunnel. It is a component of the eukaryotic 60S
          (large) ribosomal subunit, and the archaeal 50S (large)
          ribosomal subunit.
          Length = 83

 Score = 28.3 bits (64), Expect = 0.82
 Identities = 10/33 (30%), Positives = 17/33 (51%)

Query: 39 RKICILKAKNKIPEAIKELTEYLKKFMTDQETW 71
          RK   +  K + P AIKE+ ++  K M  ++  
Sbjct: 14 RKRKHVPRKKRAPRAIKEIRKFAAKHMKTEDVR 46


>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat.  This Pfam entry
           includes outlying Tetratricopeptide-like repeats (TPR)
           that are not matched by pfam00515.
          Length = 34

 Score = 26.7 bits (60), Expect = 0.94
 Identities = 10/33 (30%), Positives = 14/33 (42%)

Query: 69  ETWQELCDLYLSEGDYAKAVFCMEELFLHHPHN 101
           E    L   Y   GDY +A+   E+     P+N
Sbjct: 2   EALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34


>gnl|CDD|220459 pfam09892, DUF2119, Uncharacterized protein conserved in archaea
           (DUF2119).  This domain, found in various hypothetical
           archaeal proteins, has no known function.
          Length = 193

 Score = 29.3 bits (66), Expect = 1.1
 Identities = 15/44 (34%), Positives = 22/44 (50%)

Query: 21  EHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKF 64
           E LE + +I E    +R R+  I   K + PE  K+  E  +KF
Sbjct: 147 ESLEFVAEILEVGKESRSREEFIEYLKKRYPEQAKKAIEDARKF 190


>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 250

 Score = 29.3 bits (66), Expect = 1.2
 Identities = 26/76 (34%), Positives = 34/76 (44%), Gaps = 6/76 (7%)

Query: 78  YLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINLNEK 137
           YL +GDYA+A   +E+   H P  +L H   A      G     +LA   Y  A++L   
Sbjct: 45  YLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLG---ENDLADESYRKALSLAPN 101

Query: 138 NIRAL--YGLALSCHQ 151
           N   L  YG  L C Q
Sbjct: 102 NGDVLNNYGAFL-CAQ 116


>gnl|CDD|234418 TIGR03964, mycofact_creat, mycofactocin system creatininase family
           protein.  Members of this protein family are
           uncharacterized Actinobacterial proteins, with homology
           to creatinine amidohydrolase from Pseudomonas. Members
           occur only in the context of the mycofactocin system
           [Unknown function, Enzymes of unknown specificity].
          Length = 228

 Score = 28.9 bits (65), Expect = 1.6
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query: 170 MMWVSKHLARQYEEQQGNTETLTELMSAL 198
           ++ +S    R    + GNT  L ELM  L
Sbjct: 149 LLHLSPDRVRMDRAEAGNTAPLAELMPRL 177


>gnl|CDD|115863 pfam07234, DUF1426, Protein of unknown function (DUF1426).  This
          family consists of several Banana bunchy top virus
          proteins of around 120 residues in length. One member
          is annotated a movement protein whereas most other
          family members are hypothetical. The function of this
          family is unknown.
          Length = 117

 Score = 28.0 bits (62), Expect = 1.7
 Identities = 11/35 (31%), Positives = 20/35 (57%)

Query: 49 KIPEAIKELTEYLKKFMTDQETWQELCDLYLSEGD 83
          ++P+ IK L  YL +++T +  W +   L  + GD
Sbjct: 37 EVPKYIKGLVRYLVEYLTRRRVWMQRTQLTEATGD 71


>gnl|CDD|236644 PRK10049, pgaA, outer membrane protein PgaA; Provisional.
          Length = 765

 Score = 28.9 bits (65), Expect = 1.9
 Identities = 9/57 (15%), Positives = 15/57 (26%)

Query: 56  ELTEYLKKFMTDQETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADIL 112
              + L     D+     L  +Y   G +   +  M +     P        Y   L
Sbjct: 104 VKAKQLVSGAPDKANLLALAYVYKRAGRHWDELRAMTQALPRAPQTQQYPTEYVQAL 160


>gnl|CDD|173359 PTZ00064, PTZ00064, histone acetyltransferase; Provisional.
          Length = 552

 Score = 28.4 bits (63), Expect = 2.5
 Identities = 25/99 (25%), Positives = 38/99 (38%), Gaps = 10/99 (10%)

Query: 10  CFHFFRYE-EALEHLE-TIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTD 67
           C  FF +E E + HL    ++    N   RK  I + +    +     E   YL K   D
Sbjct: 286 CLDFFCFEDELIRHLSRCQLRHPPGNEIYRKDNISVFEIDGALTRGYAENLCYLAKLFLD 345

Query: 68  QETWQELCDLYL--------SEGDYAKAVFCMEELFLHH 98
            +T Q   + +L         EG +    F  E++ L H
Sbjct: 346 HKTLQYDVEPFLFYIVTEVDEEGCHIVGYFSKEKVSLLH 384


>gnl|CDD|206645 cd00879, Sar1, Sar1 is an essential component of COPII vesicle
           coats.  Sar1 is an essential component of COPII vesicle
           coats involved in export of cargo from the ER. The
           GTPase activity of Sar1 functions as a molecular switch
           to control protein-protein and protein-lipid
           interactions that direct vesicle budding from the ER.
           Activation of the GDP to the GTP-bound form of Sar1
           involves the membrane-associated guanine nucleotide
           exchange factor (GEF) Sec12. Sar1 is unlike all Ras
           superfamily GTPases that use either myristoyl or prenyl
           groups to direct membrane association and function, in
           that Sar1 lacks such modification. Instead, Sar1
           contains a unique nine-amino-acid N-terminal extension.
           This extension contains an evolutionarily conserved
           cluster of bulky hydrophobic amino acids, referred to as
           the Sar1-N-terminal activation recruitment (STAR) motif.
           The STAR motif mediates the recruitment of Sar1 to ER
           membranes and facilitates its interaction with mammalian
           Sec12 GEF leading to activation.
          Length = 191

 Score = 28.0 bits (63), Expect = 2.6
 Identities = 14/53 (26%), Positives = 26/53 (49%), Gaps = 4/53 (7%)

Query: 4   IVYIIGCFHFFRYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKE 56
           IV+++      R++E+ E L++++  +E         I IL  K   P A+ E
Sbjct: 90  IVFLVDAADPERFQESKEELDSLLNDEELAN----VPILILGNKIDKPGAVSE 138


>gnl|CDD|221596 pfam12481, DUF3700, Aluminium induced protein.  This domain family
           is found in eukaryotes, and is approximately 120 amino
           acids in length. There are two conserved sequence
           motifs: YGL and LRDR. This family is related to GATase
           enzyme domains.
          Length = 228

 Score = 27.9 bits (63), Expect = 2.6
 Identities = 14/48 (29%), Positives = 19/48 (39%), Gaps = 7/48 (14%)

Query: 98  HPHNHLLHQR----YADI--LYTQGGLENIELAISHYLMAINLNEKNI 139
           H     L  R      DI  L+ QG LEN+      Y ++   NE  +
Sbjct: 60  HSKQSPLLPRLFAVVDDIFCLF-QGHLENLASLRQQYGLSKTANEAML 106


>gnl|CDD|184851 PRK14849, PRK14849, putative lipoprotein/autotransporter
            domain-containing protein; Provisional.
          Length = 1806

 Score = 28.5 bits (63), Expect = 2.9
 Identities = 12/35 (34%), Positives = 17/35 (48%)

Query: 82   GDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQG 116
            G Y   +     LF H  H+ L   +Y D L++QG
Sbjct: 1486 GSYISNIAAANSLFSHRLHDRLGEPQYIDSLHSQG 1520


>gnl|CDD|238649 cd01328, FSL_SPARC, Follistatin-like SPARC (secreted protein,
          acidic, and rich in cysteines) domain;
          SPARC/BM-40/osteonectin is a multifunctional
          glycoprotein which modulates cellular interaction with
          the extracellular matrix by its binding to structural
          matrix proteins such as collagen and vitronectin. The
          protein it composed of an N-terminal acidic region, a
          follistatin (FS) domain and an EF-hand calcium binding
          domain. The FS domain consists of an N-terminal beta
          hairpin (FOLN/EGF-like domain) and a small hydrophobic
          core of alpha/beta structure (Kazal domain) and has
          five disulfide bonds and a conserved N-glycosylation
          site. The FSL_SPARC domain is a member of the
          superfamily of kazal-like proteinase inhibitors and
          follistatin-like proteins.
          Length = 86

 Score = 26.7 bits (59), Expect = 2.9
 Identities = 12/44 (27%), Positives = 20/44 (45%), Gaps = 5/44 (11%)

Query: 40 KICILKAKNK-----IPEAIKELTEYLKKFMTDQETWQELCDLY 78
          K+C +  +N      I    +E+ +  K    D ET+   C+LY
Sbjct: 11 KVCEVDDENTPKCVCIDPCPEEVDDRRKVCTNDNETFDSDCELY 54


>gnl|CDD|182415 PRK10370, PRK10370, formate-dependent nitrite reductase complex
           subunit NrfG; Provisional.
          Length = 198

 Score = 27.9 bits (62), Expect = 3.2
 Identities = 19/68 (27%), Positives = 30/68 (44%), Gaps = 1/68 (1%)

Query: 51  PEA-IKELTEYLKKFMTDQETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYA 109
           PEA ++ L + ++    + E W  L + YL   DY  A+    +       N  L+   A
Sbjct: 55  PEAQLQALQDKIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALA 114

Query: 110 DILYTQGG 117
            +LY Q G
Sbjct: 115 TVLYYQAG 122


>gnl|CDD|165653 PLN00078, PLN00078, photosystem I reaction center subunit N
          (PsaN); Provisional.
          Length = 122

 Score = 27.0 bits (59), Expect = 4.2
 Identities = 18/59 (30%), Positives = 28/59 (47%), Gaps = 5/59 (8%)

Query: 16 YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKE-----LTEYLKKFMTDQE 69
          +  A +HL     + + N  +R+  +  L +   IPEA  E     L EYLKK   ++E
Sbjct: 15 HRSAAKHLHPAAMLAQRNGISRRCLLTFLTSTAAIPEAGSESRKALLQEYLKKSEENKE 73


>gnl|CDD|184804 PRK14720, PRK14720, transcript cleavage factor/unknown domain
           fusion protein; Provisional.
          Length = 906

 Score = 27.8 bits (62), Expect = 4.4
 Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 3/48 (6%)

Query: 16  YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKK 63
           ++E +  L+ I++ D  N  AR+  I   K K K       L +YLK 
Sbjct: 239 WDEVIYILKKILEHDNKNNKAREELIRFYKEKYK---DHSLLEDYLKM 283


>gnl|CDD|220821 pfam10602, RPN7, 26S proteasome subunit RPN7.  RPN7 (known as the
          non ATPase regulatory subunit 6 in higher eukaryotes)
          is one of the lid subunits of the 26S proteasome and
          has been shown in Saccharomyces cerevisiae to be
          required for structural integrity. The 26S proteasome
          is is involved in the ATP-dependent degradation of
          ubiquitinated proteins.
          Length = 174

 Score = 27.1 bits (61), Expect = 4.5
 Identities = 13/58 (22%), Positives = 22/58 (37%), Gaps = 8/58 (13%)

Query: 44 LKAKNKIPEAIKELTEYLKKFMTD------QETWQELCDLYLSEGDYAKAVFCMEELF 95
          L+ KN+  E ++EL   LK    +      +    +L + Y   GD   A+       
Sbjct: 8  LRKKNE--EELEELDAELKDAKENLGKEEIRRALLDLAEHYAKIGDLENALKAYSRAR 63


>gnl|CDD|177876 PLN02231, PLN02231, alanine transaminase.
          Length = 534

 Score = 28.0 bits (62), Expect = 4.5
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 43  ILKAKNKIPEAIKELTEYLKKFM 65
           IL  ++KIP  +  LTE+ K FM
Sbjct: 507 ILPQEDKIPAIVSRLTEFHKSFM 529


>gnl|CDD|226862 COG4455, ImpE, Protein of avirulence locus involved in
          temperature-dependent protein secretion [General
          function prediction only].
          Length = 273

 Score = 27.5 bits (61), Expect = 4.7
 Identities = 12/40 (30%), Positives = 17/40 (42%)

Query: 48 NKIPEAIKELTEYLKKFMTDQETWQELCDLYLSEGDYAKA 87
          N + +AI    + +K   TD      L  L    GD+ KA
Sbjct: 15 NSLQDAIGLARDQVKAKPTDAGGRHFLFQLLCVAGDWEKA 54


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 27.2 bits (61), Expect = 6.1
 Identities = 19/84 (22%), Positives = 36/84 (42%), Gaps = 2/84 (2%)

Query: 40  KICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL-CDLYLSEGDYAKAVFCMEELFLHH 98
           K  +++    + EA++ L E  K+ + D+    E+  DL L  G   +A      L   +
Sbjct: 10  KNSLIEESGDLEEALEHLEEKEKQ-IVDRLAVMEMRADLLLKLGRKEEAEATYRALLDRN 68

Query: 99  PHNHLLHQRYADILYTQGGLENIE 122
           P N+  ++     L  +    +IE
Sbjct: 69  PENYDYYEGLQKALGLEISSGDIE 92


>gnl|CDD|234164 TIGR03302, OM_YfiO, outer membrane assembly lipoprotein YfiO.
           Members of this protein family include YfiO, a
           near-essential protein of the outer membrane, part of a
           complex involved in protein insertion into the bacterial
           outer membrane. Many proteins in this family are
           annotated as ComL, based on the involvement of this
           protein in natural transformation with exogenous DNA in
           Neisseria gonorrhoeae. This protein family shows
           sequence similarity to, but is distinct from, the
           tol-pal system protein YbgF (TIGR02795) [Protein fate,
           Protein and peptide secretion and trafficking].
          Length = 235

 Score = 26.7 bits (60), Expect = 6.8
 Identities = 20/74 (27%), Positives = 28/74 (37%), Gaps = 3/74 (4%)

Query: 78  YLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINL--N 135
            L  GDY +A+   E L   +P +    Q   D+ Y      +   AI+     I L  N
Sbjct: 43  ALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRFIRLHPN 102

Query: 136 EKNI-RALYGLALS 148
             +   A Y   LS
Sbjct: 103 HPDADYAYYLRGLS 116


>gnl|CDD|192936 pfam12095, DUF3571, Protein of unknown function (DUF3571).  This
          family of proteins is functionally uncharacterized.
          This protein is found in bacteria and eukaryotes.
          Proteins in this family are typically between 85 to 97
          amino acids in length.
          Length = 83

 Score = 25.7 bits (57), Expect = 7.0
 Identities = 8/37 (21%), Positives = 12/37 (32%), Gaps = 4/37 (10%)

Query: 52 EAIKELTEYLKKFMTDQETWQEL----CDLYLSEGDY 84
            +  L   L K  +     Q L    C+L +  G  
Sbjct: 38 TRLDSLPADLAKLPSLDAQAQRLLDTACELEIGPGLT 74


>gnl|CDD|235288 PRK04346, PRK04346, tryptophan synthase subunit beta; Validated.
          Length = 397

 Score = 27.0 bits (61), Expect = 7.7
 Identities = 14/40 (35%), Positives = 19/40 (47%), Gaps = 7/40 (17%)

Query: 50 IPE----AIKELTEYLKKFMTDQETWQELCDLYLSEGDYA 85
          +PE    A++EL E  +K   D E   EL D  L   +Y 
Sbjct: 19 VPETLMPALEELEEAYEKAKNDPEFQAEL-DYLLK--NYV 55


>gnl|CDD|233999 TIGR02757, TIGR02757, TIGR02757 family protein.  Members of this
           uncharacterized protein family are found sporadically,
           so far only among spirochetes, epsilon and delta
           proteobacteria, and Bacteroides. The function is unknown
           and its gene neighborhoods show little conservation
           [Hypothetical proteins, Conserved].
          Length = 229

 Score = 26.6 bits (59), Expect = 8.5
 Identities = 12/41 (29%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 24  ETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKF 64
           + I+ +D T+     +K+ +LK K+   +A  E+TE L++ 
Sbjct: 171 DLILPLD-THVFRIAKKLKLLKRKSYDLKAAIEITEALREL 210


>gnl|CDD|225008 COG2097, RPL31A, Ribosomal protein L31E [Translation, ribosomal
          structure and biogenesis].
          Length = 89

 Score = 25.4 bits (56), Expect = 8.9
 Identities = 6/33 (18%), Positives = 16/33 (48%)

Query: 39 RKICILKAKNKIPEAIKELTEYLKKFMTDQETW 71
          R+   +    + P A+K + +++ + M  +E  
Sbjct: 16 RRAKKVPRTKRAPRAVKIIRKFVARHMKAEEVR 48


>gnl|CDD|237080 PRK12370, PRK12370, invasion protein regulator; Provisional.
          Length = 553

 Score = 26.7 bits (59), Expect = 9.1
 Identities = 31/162 (19%), Positives = 61/162 (37%), Gaps = 18/162 (11%)

Query: 2   APIVYIIGCFHFF---RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELT 58
           +  +     ++ F   + EEAL+ +   +K+D T  AA   K+ I      I +AI+   
Sbjct: 371 SADIKYYYGWNLFMAGQLEEALQTINECLKLDPTRAAAGITKLWITYYHTGIDDAIRLGD 430

Query: 59  EYLKKFMTDQETWQELCDLYLS-EGDYAKAVFCMEELFLHHPHNHLLHQRYADILYT--- 114
           E   + + D      +  ++LS +G +  A     +L        +      ++LY    
Sbjct: 431 ELRSQHLQDNPILLSMQVMFLSLKGKHELA----RKLTKEISTQEITGLIAVNLLYAEYC 486

Query: 115 QGGLENIELAISHYLMAINLNEKN------IRALYGLALSCH 150
           Q   E     I  +L +    + N      +   +G A++  
Sbjct: 487 QNS-ERALPTIREFLESEQRIDNNPGLLPLVLVAHGEAIAEK 527


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.133    0.391 

Gapped
Lambda     K      H
   0.267   0.0733    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,190,980
Number of extensions: 948496
Number of successful extensions: 1390
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1373
Number of HSP's successfully gapped: 131
Length of query: 201
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 109
Effective length of database: 6,857,034
Effective search space: 747416706
Effective search space used: 747416706
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 56 (25.3 bits)