RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13481
         (231 letters)



>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
           34 amino acids
           [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
           X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
           in a variety of organisms including bacteria,
           cyanobacteria, yeast, fungi, plants, and humans in
           various subcellular locations; involved in a variety of
           functions including protein-protein interactions, but
           common features in the interaction partners have not
           been defined; involved in chaperone, cell-cycle,
           transciption, and protein transport complexes; the
           number of TPR motifs varies among proteins (1,3-11,13
           15,16,19); 5-6 tandem repeats generate a right-handed
           helical structure with an amphipathic channel that is
           thought to accomodate an alpha-helix of a target
           protein; it has been proposed that TPR proteins
           preferably interact with WD-40 repeat proteins, but in
           many instances several TPR-proteins seem to aggregate to
           multi-protein complexes; examples of TPR-proteins
           include, Cdc16p, Cdc23p and Cdc27p components of the
           cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
           targeting signals, the Tom70p co-receptor for
           mitochondrial targeting signals, Ser/Thr phosphatase 5C
           and the p110 subunit of O-GlcNAc transferase; three
           copies of the repeat are present here.
          Length = 100

 Score = 75.5 bits (186), Expect = 8e-18
 Identities = 31/95 (32%), Positives = 50/95 (52%)

Query: 117 KREAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKI 176
                  Y  G +++A+  Y +A++L+P +A  Y     +Y +L K    + D   AL++
Sbjct: 4   LNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL 63

Query: 177 NPDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
           +PD+A AY   G AY  LGK+EEA      A ++D
Sbjct: 64  DPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELD 98



 Score = 49.3 bits (118), Expect = 4e-08
 Identities = 19/60 (31%), Positives = 34/60 (56%)

Query: 120 AIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPD 179
           A   Y  GK+E+A+  Y +A++L+P +A  Y   G +Y +L K    +     AL+++P+
Sbjct: 41  AAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100



 Score = 42.4 bits (100), Expect = 1e-05
 Identities = 21/63 (33%), Positives = 31/63 (49%)

Query: 149 LYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNAC 208
                G  Y +L   +  +     AL+++PD+A AY     AY  LGK+EEA  D   A 
Sbjct: 2   ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61

Query: 209 KID 211
           ++D
Sbjct: 62  ELD 64


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 61.6 bits (150), Expect = 5e-13
 Identities = 18/69 (26%), Positives = 35/69 (50%), Gaps = 1/69 (1%)

Query: 111 DKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQL-SKPNACIRD 169
           D +   K      +  G +++A+ AY +A++L+P +A  Y     +YL+L       + D
Sbjct: 1   DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALED 60

Query: 170 CSVALKINP 178
              AL+++P
Sbjct: 61  LEKALELDP 69



 Score = 48.1 bits (115), Expect = 6e-08
 Identities = 23/67 (34%), Positives = 34/67 (50%), Gaps = 1/67 (1%)

Query: 146 SALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGK-WEEAAVDL 204
           +A      G +  +L   +  I     AL+++PD+A AY     AY  LGK +EEA  DL
Sbjct: 2   NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61

Query: 205 RNACKID 211
             A ++D
Sbjct: 62  EKALELD 68



 Score = 31.1 bits (71), Expect = 0.062
 Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 179 DSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFD 213
           D+A A K  G A   LG ++EA      A ++D D
Sbjct: 1   DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPD 35


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 62.3 bits (151), Expect = 2e-11
 Identities = 46/156 (29%), Positives = 74/156 (47%), Gaps = 3/156 (1%)

Query: 57  KPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEK 116
           K E+          E ++EE          + EP +E P  D ++   +SEE+  K   K
Sbjct: 71  KKERRKRKQAEKETEGKTEEKKSTAPKNAPV-EPADELPEIDESSVANLSEEERKKYAAK 129

Query: 117 KREAIH-AYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALK 175
            +E  + AY +  F  A+  YS+AI+  P   + Y+ R   +  L      + D + AL+
Sbjct: 130 LKEKGNKAYRNKDFNKAIKLYSKAIECKPDP-VYYSNRAACHNALGDWEKVVEDTTAALE 188

Query: 176 INPDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
           ++PD + A   R  AY  LGK+ +A +DL  +C ID
Sbjct: 189 LDPDYSKALNRRANAYDGLGKYADALLDLTASCIID 224



 Score = 42.7 bits (100), Expect = 8e-05
 Identities = 42/160 (26%), Positives = 70/160 (43%), Gaps = 12/160 (7%)

Query: 65  PEPS-APEEESEESDPELDN--TGVISEPDEEEPIPDVATDIEVSEEDIDKSNEKKREAI 121
           P+P  A  E+S E D E  N    +  +  E +            E+ +D     ++EAI
Sbjct: 273 PKPRPAGLEDSNELDEETGNGQLQLGLKSPESKADESYEEAARAFEKALDLGKLGEKEAI 332

Query: 122 HAYN--------DGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVA 173
            A N         GK  +A+   S++I+L+P     Y KR    L+L  P+    D   A
Sbjct: 333 -ALNLRGTFKCLKGKHLEALADLSKSIELDPRVTQSYIKRASMNLELGDPDKAEEDFDKA 391

Query: 174 LKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFD 213
           LK+N +    Y  R + + + G++ +A  D + +  +D D
Sbjct: 392 LKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPD 431


>gnl|CDD|215568 PLN03088, PLN03088, SGT1,  suppressor of G2 allele of SKP1;
           Provisional.
          Length = 356

 Score = 51.3 bits (123), Expect = 9e-08
 Identities = 28/87 (32%), Positives = 51/87 (58%)

Query: 114 NEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVA 173
            + + +A  A+ D  F  AV+ Y++AI L+P++A LYA R Q+ ++L      + D + A
Sbjct: 3   KDLEDKAKEAFVDDDFALAVDLYTQAIDLDPNNAELYADRAQANIKLGNFTEAVADANKA 62

Query: 174 LKINPDSAAAYKFRGRAYRLLGKWEEA 200
           ++++P  A AY  +G A   L +++ A
Sbjct: 63  IELDPSLAKAYLRKGTACMKLEEYQTA 89


>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat. 
          Length = 65

 Score = 43.8 bits (104), Expect = 2e-06
 Identities = 14/64 (21%), Positives = 25/64 (39%)

Query: 118 REAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKIN 177
             A  A   G +++A+ A   A+   P +A      G++ L+  +          AL  +
Sbjct: 2   ALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAAD 61

Query: 178 PDSA 181
           PD  
Sbjct: 62  PDDP 65



 Score = 37.3 bits (87), Expect = 5e-04
 Identities = 16/60 (26%), Positives = 23/60 (38%)

Query: 152 KRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
              ++ L+    +  +     AL   P +A A    G A    G+  EAA  LR A   D
Sbjct: 2   ALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAAD 61


>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
          Length = 291

 Score = 46.0 bits (107), Expect = 6e-06
 Identities = 31/119 (26%), Positives = 51/119 (42%), Gaps = 1/119 (0%)

Query: 103 IEVSEEDIDKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNPS-SALLYAKRGQSYLQLS 161
           +E+  E  + +             G++E+A+    +A+KLNP   A      G  YL+L 
Sbjct: 157 LELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLG 216

Query: 162 KPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADEWL 220
           K    +     AL+++PD+A A          LG++EEA   L  A ++D D       
Sbjct: 217 KYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPDLYNLGLA 275



 Score = 36.4 bits (82), Expect = 0.008
 Identities = 29/102 (28%), Positives = 40/102 (39%), Gaps = 4/102 (3%)

Query: 110 IDKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSS---ALLYAKRGQSYLQLSKPNAC 166
                E        Y  G +E+A+  Y +A++L+P     A      G     L +    
Sbjct: 127 DPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEALLALGALLEALGRYEEA 186

Query: 167 IRDCSVALKINPDSAA-AYKFRGRAYRLLGKWEEAAVDLRNA 207
           +     ALK+NPD  A A    G  Y  LGK+EEA      A
Sbjct: 187 LELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKA 228


>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
           repeats [General function prediction only].
          Length = 484

 Score = 45.1 bits (107), Expect = 1e-05
 Identities = 22/74 (29%), Positives = 36/74 (48%)

Query: 127 GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKF 186
            K ++A+    +A+ L+P+S LL     Q+ L+  KP   IR  +  L  +P+    +  
Sbjct: 354 NKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDL 413

Query: 187 RGRAYRLLGKWEEA 200
             +AY  LG   EA
Sbjct: 414 LAQAYAELGNRAEA 427



 Score = 36.6 bits (85), Expect = 0.007
 Identities = 23/101 (22%), Positives = 38/101 (37%), Gaps = 1/101 (0%)

Query: 119 EAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINP 178
            A+  Y  G++++A+      I   P +       G   L+ +K    I     AL ++P
Sbjct: 312 RALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDP 371

Query: 179 DSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADEW 219
           +S        +A    GK +EA   L      D  E  + W
Sbjct: 372 NSPLLQLNLAQALLKGGKPQEAIRILNRYLFND-PEDPNGW 411


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 42.8 bits (101), Expect = 9e-05
 Identities = 27/100 (27%), Positives = 42/100 (42%), Gaps = 5/100 (5%)

Query: 122 HAY-NDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLS--KPNACIRDCSVALKINP 178
            AY   G+ E A  +Y +A+ ++P S  LYAK G + L L+  + +         L  +P
Sbjct: 133 LAYLGLGQLELAQKSYEQALAIDPRS--LYAKLGLAQLALAENRFDEARALIDEVLTADP 190

Query: 179 DSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADE 218
            +  A   +G     LG  E A    R A  +  +  A  
Sbjct: 191 GNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVL 230



 Score = 38.9 bits (91), Expect = 0.002
 Identities = 23/89 (25%), Positives = 42/89 (47%)

Query: 125 NDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAY 184
             G+ ++A+   S A+ L+P      +  G++YL L          + A +++P++AAA 
Sbjct: 341 RLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAAR 400

Query: 185 KFRGRAYRLLGKWEEAAVDLRNACKIDFD 213
              G +    G   EA  DL  A ++D +
Sbjct: 401 TQLGISKLSQGDPSEAIADLETAAQLDPE 429



 Score = 35.1 bits (81), Expect = 0.025
 Identities = 16/85 (18%), Positives = 31/85 (36%)

Query: 120 AIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPD 179
           A  A  + +F++A     E +  +P +      +G   L L      +     A+ + P+
Sbjct: 166 AQLALAENRFDEARALIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPN 225

Query: 180 SAAAYKFRGRAYRLLGKWEEAAVDL 204
           + A            G++EEA    
Sbjct: 226 NIAVLLALATILIEAGEFEEAEKHA 250



 Score = 35.1 bits (81), Expect = 0.029
 Identities = 16/76 (21%), Positives = 29/76 (38%), Gaps = 1/76 (1%)

Query: 126 DGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYK 185
              +  A+ AY +A+K  PSS     K  ++ L        ++     LK +P+ A    
Sbjct: 716 QKDYPAAIQAYRKALKRAPSSQNA-IKLHRALLASGNTAEAVKTLEAWLKTHPNDAVLRT 774

Query: 186 FRGRAYRLLGKWEEAA 201
                Y     +++A 
Sbjct: 775 ALAELYLAQKDYDKAI 790



 Score = 33.9 bits (78), Expect = 0.059
 Identities = 19/76 (25%), Positives = 31/76 (40%)

Query: 138 EAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKW 197
           EA    P S   +   G++ L     N  +      L + PDSA A      AY ++  +
Sbjct: 592 EAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNY 651

Query: 198 EEAAVDLRNACKIDFD 213
            +A   L+ A ++  D
Sbjct: 652 AKAITSLKRALELKPD 667



 Score = 32.7 bits (75), Expect = 0.14
 Identities = 20/55 (36%), Positives = 28/55 (50%)

Query: 153 RGQSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNA 207
             +SYLQ +K  A I     AL+ +P+ A A    G+ Y  LG +  A  +LR A
Sbjct: 28  AAKSYLQKNKYKAAIIQLKNALQKDPNDAEARFLLGKIYLALGDYAAAEKELRKA 82



 Score = 32.0 bits (73), Expect = 0.27
 Identities = 17/74 (22%), Positives = 29/74 (39%)

Query: 127 GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKF 186
           G    AV+++ + + L P SAL       +Y  +      I     AL++ PD+  A   
Sbjct: 615 GDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIG 674

Query: 187 RGRAYRLLGKWEEA 200
             +      + E A
Sbjct: 675 LAQLLLAAKRTESA 688



 Score = 31.2 bits (71), Expect = 0.49
 Identities = 23/77 (29%), Positives = 29/77 (37%), Gaps = 1/77 (1%)

Query: 125 NDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAY 184
              + E A        K +P +AL +   G  YL+     A I+    ALK  P S  A 
Sbjct: 681 AAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPSSQNAI 740

Query: 185 KFRGRAYRLLGKWEEAA 201
           K   RA    G   EA 
Sbjct: 741 KL-HRALLASGNTAEAV 756



 Score = 31.2 bits (71), Expect = 0.51
 Identities = 21/74 (28%), Positives = 29/74 (39%)

Query: 127 GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKF 186
           G  E+AV    +A +LNP          Q YL   +    +   + A    PDS  A+  
Sbjct: 547 GNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAWLM 606

Query: 187 RGRAYRLLGKWEEA 200
            GRA    G   +A
Sbjct: 607 LGRAQLAAGDLNKA 620



 Score = 31.2 bits (71), Expect = 0.53
 Identities = 27/98 (27%), Positives = 39/98 (39%), Gaps = 8/98 (8%)

Query: 127 GKFEDAVNAYSEAIKL--NPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAY 184
           GKF+  +        L  +  +A L A RG +YL L +     +    AL I+P S  A 
Sbjct: 104 GKFQQVL-DELPGKTLLDDEGAAELLALRGLAYLGLGQLELAQKSYEQALAIDPRSLYA- 161

Query: 185 KFRGRAYRLL--GKWEEAAVDLRNACKIDFDEQADEWL 220
           K  G A   L   +++EA   +      D     D  L
Sbjct: 162 KL-GLAQLALAENRFDEARALIDEVLTAD-PGNVDALL 197



 Score = 30.8 bits (70), Expect = 0.57
 Identities = 17/57 (29%), Positives = 28/57 (49%)

Query: 127 GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAA 183
           G FE A    ++A +L+P +A    + G S L    P+  I D   A +++P+   A
Sbjct: 377 GDFEKAAEYLAKATELDPENAAARTQLGISKLSQGDPSEAIADLETAAQLDPELGRA 433



 Score = 30.1 bits (68), Expect = 1.2
 Identities = 30/91 (32%), Positives = 44/91 (48%), Gaps = 4/91 (4%)

Query: 123 AYNDGKFEDAVNAYSEAIKLNPSSALLYAKR--GQSYLQLSKPNACIRDCSVALKINPDS 180
            Y  G  E A    ++ +K  P+S    A+R      L+L + +  I   S AL ++PD 
Sbjct: 305 EYQLGNLEQAYQYLNQILKYAPNS--HQARRLLASIQLRLGRVDEAIATLSPALGLDPDD 362

Query: 181 AAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
            AA    G AY  LG +E+AA  L  A ++D
Sbjct: 363 PAALSLLGEAYLALGDFEKAAEYLAKATELD 393



 Score = 28.9 bits (65), Expect = 3.0
 Identities = 19/87 (21%), Positives = 36/87 (41%)

Query: 127 GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKF 186
           G  E A+ AY +AI L P++  +        ++  +     +     LK  P+S  A+  
Sbjct: 207 GNIELALAAYRKAIALRPNNIAVLLALATILIEAGEFEEAEKHADALLKKAPNSPLAHYL 266

Query: 187 RGRAYRLLGKWEEAAVDLRNACKIDFD 213
           +         +E+A   L++A K   +
Sbjct: 267 KALVDFQKKNYEDARETLQDALKSAPE 293


>gnl|CDD|233924 TIGR02552, LcrH_SycD, type III secretion low calcium response
           chaperone LcrH/SycD.  Genes in this family are found in
           type III secretion operons. LcrH, from Yersinia is
           believed to have a regulatory function in the
           low-calcium response of the secretion system. The same
           protein is also known as SycD (SYC = Specific Yop
           Chaperone) for its chaperone role. In Pseudomonas, where
           the homolog is known as PcrH, the chaperone role has
           been demonstrated and the regulatory role appears to be
           absent. ScyD/LcrH contains three central
           tetratricopeptide-like repeats that are predicted to
           fold into an all-alpha-helical array.
          Length = 135

 Score = 38.8 bits (91), Expect = 5e-04
 Identities = 21/105 (20%), Positives = 40/105 (38%)

Query: 118 REAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKIN 177
             A + Y  G++++A+  +      +P ++  +         L +    I   ++A  ++
Sbjct: 22  ALAYNLYQQGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALD 81

Query: 178 PDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADEWLKE 222
           PD    Y         LG+ E A   L  A +I  +      LKE
Sbjct: 82  PDDPRPYFHAAECLLALGEPESALKALDLAIEICGENPEYSELKE 126



 Score = 30.0 bits (68), Expect = 0.45
 Identities = 16/69 (23%), Positives = 36/69 (52%)

Query: 127 GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKF 186
            ++E+A++AY+ A  L+P     Y    +  L L +P + ++   +A++I  ++    + 
Sbjct: 65  KEYEEAIDAYALAAALDPDDPRPYFHAAECLLALGEPESALKALDLAIEICGENPEYSEL 124

Query: 187 RGRAYRLLG 195
           + RA  +L 
Sbjct: 125 KERAEAMLE 133


>gnl|CDD|205550 pfam13371, TPR_9, Tetratricopeptide repeat. 
          Length = 73

 Score = 36.0 bits (84), Expect = 0.001
 Identities = 13/51 (25%), Positives = 19/51 (37%)

Query: 155 QSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLR 205
             YL+       +      L + PD     + RG  Y  LG ++ A  DL 
Sbjct: 3   AIYLREEDFERALAVVERLLLLAPDDPYERRDRGLLYAQLGCFQAALADLE 53



 Score = 25.6 bits (57), Expect = 5.9
 Identities = 18/64 (28%), Positives = 26/64 (40%), Gaps = 1/64 (1%)

Query: 120 AIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPD 179
           AI+   +  FE A+      + L P        RG  Y QL    A + D    L++ PD
Sbjct: 3   AIYL-REEDFERALAVVERLLLLAPDDPYERRDRGLLYAQLGCFQAALADLEYFLELCPD 61

Query: 180 SAAA 183
           +  A
Sbjct: 62  APDA 65


>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats.  Repeats present in 4
           or more copies in proteins. Contain a minimum of 34
           amino acids each and self-associate via a "knobs and
           holes" mechanism.
          Length = 34

 Score = 34.0 bits (79), Expect = 0.004
 Identities = 11/31 (35%), Positives = 15/31 (48%)

Query: 181 AAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
           A A    G AY  LG ++EA      A ++D
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELD 31



 Score = 29.7 bits (68), Expect = 0.11
 Identities = 6/19 (31%), Positives = 15/19 (78%)

Query: 127 GKFEDAVNAYSEAIKLNPS 145
           G +++A+  Y +A++L+P+
Sbjct: 15  GDYDEALEYYEKALELDPN 33



 Score = 26.3 bits (59), Expect = 1.6
 Identities = 8/33 (24%), Positives = 16/33 (48%)

Query: 147 ALLYAKRGQSYLQLSKPNACIRDCSVALKINPD 179
           A      G +YL+L   +  +     AL+++P+
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33


>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat. 
          Length = 34

 Score = 32.8 bits (76), Expect = 0.009
 Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 181 AAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
           A A    G AY  LGK++EA      A +++
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELN 31



 Score = 29.7 bits (68), Expect = 0.092
 Identities = 10/33 (30%), Positives = 17/33 (51%)

Query: 147 ALLYAKRGQSYLQLSKPNACIRDCSVALKINPD 179
           A      G +YL+L K +  +     AL++NP+
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33



 Score = 29.7 bits (68), Expect = 0.11
 Identities = 8/19 (42%), Positives = 16/19 (84%)

Query: 127 GKFEDAVNAYSEAIKLNPS 145
           GK+++A+  Y +A++LNP+
Sbjct: 15  GKYDEALEYYEKALELNPN 33


>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat. 
          Length = 34

 Score = 32.6 bits (75), Expect = 0.010
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 173 ALKINPDSAAAYKFRGRAYRLLGKWEEA 200
           AL+++P++A AY         LG+++EA
Sbjct: 5   ALELDPNNAEAYYNLALLLLNLGQYDEA 32



 Score = 29.1 bits (66), Expect = 0.18
 Identities = 8/34 (23%), Positives = 18/34 (52%)

Query: 135 AYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIR 168
            Y +A++L+P++A  Y       L L + +  ++
Sbjct: 1   LYEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34


>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat.  This Pfam entry
           includes outlying Tetratricopeptide-like repeats (TPR)
           that are not matched by pfam00515.
          Length = 34

 Score = 31.7 bits (73), Expect = 0.019
 Identities = 12/31 (38%), Positives = 15/31 (48%)

Query: 181 AAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
           A A    G AY  LG +EEA      A ++D
Sbjct: 1   AEALYNLGLAYYKLGDYEEALEAYEKALELD 31



 Score = 30.2 bits (69), Expect = 0.069
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 118 REAIHAYNDGKFEDAVNAYSEAIKLNP 144
              +  Y  G +E+A+ AY +A++L+P
Sbjct: 6   NLGLAYYKLGDYEEALEAYEKALELDP 32



 Score = 25.6 bits (57), Expect = 3.4
 Identities = 7/33 (21%), Positives = 14/33 (42%)

Query: 147 ALLYAKRGQSYLQLSKPNACIRDCSVALKINPD 179
           A      G +Y +L      +     AL+++P+
Sbjct: 1   AEALYNLGLAYYKLGDYEEALEAYEKALELDPN 33


>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat. 
          Length = 78

 Score = 32.7 bits (75), Expect = 0.026
 Identities = 16/76 (21%), Positives = 28/76 (36%), Gaps = 7/76 (9%)

Query: 143 NPSSALLYAKRGQSYLQLSKPNACIRDCSVALKI-------NPDSAAAYKFRGRAYRLLG 195
           +P  A           +L   +  +     AL++       +P++A A     R Y  LG
Sbjct: 1   HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60

Query: 196 KWEEAAVDLRNACKID 211
            ++EA   L  A  + 
Sbjct: 61  DYDEALEYLEKALALR 76


>gnl|CDD|222122 pfam13429, TPR_15, Tetratricopeptide repeat. 
          Length = 279

 Score = 33.8 bits (78), Expect = 0.055
 Identities = 23/97 (23%), Positives = 36/97 (37%), Gaps = 15/97 (15%)

Query: 119 EAIHAYND----GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVAL 174
            A++ Y       +  D + + +       S+  L A R + Y Q  +P+A +RD   AL
Sbjct: 114 SALYLYTRAGDYKRIVDLLASLTPEEAAEQSAGFLSA-RAEYYRQTGQPDAALRDYRRAL 172

Query: 175 KINPDSAAA----------YKFRGRAYRLLGKWEEAA 201
            + P S+               R    RLL  W   A
Sbjct: 173 ALPPASSDLRAALLWLLIDLGRRAELKRLLAAWRGRA 209


>gnl|CDD|188247 TIGR02795, tol_pal_ybgF, tol-pal system protein YbgF.  Members of
           this protein family are the product of one of seven
           genes regularly clustered in operons to encode the
           proteins of the tol-pal system, which is critical for
           maintaining the integrity of the bacterial outer
           membrane. The gene for this periplasmic protein has been
           designated orf2 and ybgF. All members of the seed
           alignment were from unique tol-pal gene regions from
           completed bacterial genomes. The architecture of this
           protein is a signal sequence, a low-complexity region
           usually rich in Asn and Gln, a well-conserved region
           with tandem repeats that resemble the tetratricopeptide
           (TPR) repeat, involved in protein-protein interaction.
          Length = 117

 Score = 32.3 bits (74), Expect = 0.061
 Identities = 19/82 (23%), Positives = 31/82 (37%), Gaps = 8/82 (9%)

Query: 126 DGKFEDAVNAYSEAIKLNPSSALL---YAKRGQSYLQLSKPNACIRDCSVALKINPDSAA 182
            G + DA+ A+   +K  P S      +   G++Y          +     +K  P S  
Sbjct: 13  AGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPK 72

Query: 183 A----YKFRGRAYRLLGKWEEA 200
           A     K  G + + LG  E+A
Sbjct: 73  APDALLKL-GMSLQELGDKEKA 93


>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
           component of pyruvate dehydrogenase complex.
          Length = 539

 Score = 33.3 bits (76), Expect = 0.084
 Identities = 14/49 (28%), Positives = 21/49 (42%), Gaps = 4/49 (8%)

Query: 30  ELKFFKDFVESYGGTIPPPSKAQSDAPKP----EKPFTTPEPSAPEEES 74
           ++  FKD+  S       P    S  P      EKP ++PEP A +  +
Sbjct: 193 DIGKFKDYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSA 241


>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
           dihydrolipoamide succinyltransferase (E2 component).
           This model describes the TCA cycle 2-oxoglutarate system
           E2 component, dihydrolipoamide succinyltransferase. It
           is closely related to the pyruvate dehydrogenase E2
           component, dihydrolipoamide acetyltransferase. The seed
           for this model includes mitochondrial and Gram-negative
           bacterial forms. Mycobacterial candidates are highly
           derived, differ in having and extra copy of the
           lipoyl-binding domain at the N-terminus. They score
           below the trusted cutoff, but above the noise cutoff and
           above all examples of dihydrolipoamide acetyltransferase
           [Energy metabolism, TCA cycle].
          Length = 403

 Score = 32.8 bits (75), Expect = 0.12
 Identities = 17/80 (21%), Positives = 24/80 (30%), Gaps = 2/80 (2%)

Query: 42  GGTIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEES-DPELDNTGVISEPDEEEPIPDVA 100
           G        A+S   K E P  +   +     +  S  P           D    +P   
Sbjct: 77  GNDATAAPPAKSGEEKEETPAASAAAAPTAAANRPSLSPAARRLAKEHGIDLSA-VPGTG 135

Query: 101 TDIEVSEEDIDKSNEKKREA 120
               V++EDI K  E    A
Sbjct: 136 VTGRVTKEDIIKKTEAPASA 155


>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat. 
          Length = 34

 Score = 29.0 bits (66), Expect = 0.19
 Identities = 12/31 (38%), Positives = 16/31 (51%)

Query: 181 AAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
           A AY   G+ Y  LG +EEA      A ++D
Sbjct: 1   ARAYYLLGQIYLQLGDYEEAKEYYEKALELD 31


>gnl|CDD|226456 COG3947, COG3947, Response regulator containing CheY-like receiver
           and SARP domains [Signal transduction mechanisms].
          Length = 361

 Score = 32.1 bits (73), Expect = 0.22
 Identities = 30/129 (23%), Positives = 49/129 (37%), Gaps = 19/129 (14%)

Query: 108 EDIDKSNEKKREAIHAYNDGKFEDA-------VNAYSEAIKLNPSSALLYAKRGQSYLQL 160
           E I+ + ++ +E +  Y      +A            E + +      L  K  ++YL+ 
Sbjct: 238 EAINLTIDELKELVGQYKGDYLPEADYPWAEDERERLEQLYMK-----LLGKVARAYLEA 292

Query: 161 SKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLG-------KWEEAAVDLRNACKIDFD 213
            KPN  I+    AL ++P S    K    +   LG        +E  A  L     ID D
Sbjct: 293 GKPNEAIQLHQRALTLDPLSEQDNKGLMASLATLGDEISAIKHYERYAEVLEAELGIDVD 352

Query: 214 EQADEWLKE 222
           +  +EW K 
Sbjct: 353 DSIEEWYKS 361


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 32.0 bits (73), Expect = 0.22
 Identities = 12/70 (17%), Positives = 21/70 (30%)

Query: 43  GTIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATD 102
              P   K ++     E+     E    EE+ +       +     E  EEE    + + 
Sbjct: 404 QKEPLEEKPENKDESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDKEAIPST 463

Query: 103 IEVSEEDIDK 112
           I   E  +  
Sbjct: 464 ILEEEPTVGG 473


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 31.3 bits (71), Expect = 0.46
 Identities = 13/56 (23%), Positives = 19/56 (33%), Gaps = 4/56 (7%)

Query: 46  PPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISE---PDEEEPIPD 98
           P P       P P +P   PE    E +  + D   +   +  E      E  +PD
Sbjct: 267 PEPEPQPPPPPPPPEP-PEPEEEPDEPDQTDPDDGEETDQIPEELMFDAVEADLPD 321



 Score = 28.6 bits (64), Expect = 3.1
 Identities = 15/50 (30%), Positives = 20/50 (40%), Gaps = 6/50 (12%)

Query: 46  PPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEP 95
           P  ++     P+P  P   PEP  PEEE +E D          +  EE  
Sbjct: 261 PRATRLPEPEPQPPPPPPPPEPPEPEEEPDEPDQT------DPDDGEETD 304


>gnl|CDD|238138 cd00226, PRCH, Photosynthetic reaction center (RC) complex, subunit
           H;  RC is an integral membrane protein-pigment complex
           which catalyzes light-induced reduction of ubiquinone to
           ubiquinol, generating a transmembrane electrochemical
           gradient of protons used to produce ATP by ATP synthase.
           Subunit H is positioned mainly in the cytoplasm with one
           transmembrane alpha helix. Provides proton transfer
           pathway (water channels) connecting the terminal quinone
           electron acceptor of RC, to the aqueous phase. Found in
           photosynthetic bacteria: alpha, beta, and gamma
           proteobacteria.
          Length = 246

 Score = 30.9 bits (70), Expect = 0.48
 Identities = 29/109 (26%), Positives = 40/109 (36%), Gaps = 20/109 (18%)

Query: 16  LGAVKKNPAIFYLPELKFFKDFVESYGGTIPPPSKAQSDAPKPEKPFTTPEPSAPEEESE 75
            G   +N  +F LP  K FK     +GGT+  P              T P   AP   + 
Sbjct: 46  DGGPSENGGVFPLPYPKTFK---LPHGGTVTVPRVEAETRALALAR-TAPFEGAPLVPT- 100

Query: 76  ESDPELDNTGVISEPDEEEPIPD-------------VATDIEVSEEDID 111
             +P LD  G  S  +  + +PD             VAT   V+  D+D
Sbjct: 101 -GNPMLDGVGPASWAERRD-LPDLDVHGHPKIVPMRVATGFSVAAGDVD 147


>gnl|CDD|179448 PRK02603, PRK02603, photosystem I assembly protein Ycf3;
           Provisional.
          Length = 172

 Score = 30.0 bits (68), Expect = 0.58
 Identities = 30/116 (25%), Positives = 52/116 (44%), Gaps = 18/116 (15%)

Query: 118 REAIHAYNDGKFEDAVNAYSEAIKL--NPSS--------ALLYAKRGQSYLQLSKPNACI 167
           R+ + A  DG++ +A+  Y EA+KL  +P+          ++YA  G+    L   +   
Sbjct: 40  RDGMSAQADGEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQ-- 97

Query: 168 RDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADEWLKEV 223
                AL++NP   +A       Y   G+  E A D   A +  FD+ A+ W + +
Sbjct: 98  -----ALELNPKQPSALNNIAVIYHKRGEKAEEAGDQDEA-EALFDKAAEYWKQAI 147



 Score = 27.3 bits (61), Expect = 5.3
 Identities = 16/56 (28%), Positives = 30/56 (53%), Gaps = 7/56 (12%)

Query: 107 EEDIDKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNP-------SSALLYAKRGQ 155
           E+  D+S       I   ++G+ + A+  Y +A++LNP       + A++Y KRG+
Sbjct: 66  EDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNPKQPSALNNIAVIYHKRGE 121


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 30.9 bits (70), Expect = 0.59
 Identities = 11/58 (18%), Positives = 21/58 (36%)

Query: 66  EPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEKKREAIHA 123
           E    +   EE + EL+     +  D ++       + E  E+D +  +  K E    
Sbjct: 3   EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60


>gnl|CDD|226687 COG4235, COG4235, Cytochrome c biogenesis factor [Posttranslational
           modification, protein turnover, chaperones].
          Length = 287

 Score = 30.0 bits (68), Expect = 0.80
 Identities = 24/93 (25%), Positives = 38/93 (40%), Gaps = 14/93 (15%)

Query: 122 HAY-NDGKFEDAVNAYSEAIKLNPSSA--------LLYAKRGQSYLQLSKPNACIRDCSV 172
            AY   G+  DA+ AY  A++L   +          LY + GQ     ++  A +R    
Sbjct: 164 RAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQMTAKAR--ALLRQ--- 218

Query: 173 ALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLR 205
           AL ++P +  A      A    G + EAA   +
Sbjct: 219 ALALDPANIRALSLLAFAAFEQGDYAEAAAAWQ 251


>gnl|CDD|236875 PRK11189, PRK11189, lipoprotein NlpI; Provisional.
          Length = 296

 Score = 29.9 bits (68), Expect = 0.94
 Identities = 12/32 (37%), Positives = 16/32 (50%)

Query: 169 DCSVALKINPDSAAAYKFRGRAYRLLGKWEEA 200
           D S AL + PD A AY + G      G ++ A
Sbjct: 86  DFSQALALRPDMADAYNYLGIYLTQAGNFDAA 117


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 30.0 bits (68), Expect = 0.99
 Identities = 11/50 (22%), Positives = 19/50 (38%), Gaps = 7/50 (14%)

Query: 44  TIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEE 93
            +P P +   + P P      P P  PE++ +  + E +         EE
Sbjct: 257 RLPAPPQPPEEEPPPP-----PPP--PEDDDDPPEDEEEQDDAEDRALEE 299



 Score = 30.0 bits (68), Expect = 0.99
 Identities = 7/45 (15%), Positives = 14/45 (31%)

Query: 58  PEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATD 102
           P  P    E   P     E D +        +  E+  + ++  +
Sbjct: 259 PAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEIVLE 303


>gnl|CDD|236512 PRK09430, djlA, Dna-J like membrane chaperone protein; Provisional.
          Length = 267

 Score = 29.8 bits (68), Expect = 1.1
 Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 10/34 (29%)

Query: 105 VSEEDI-------DK---SNEKKREAIHAYNDGK 128
           V+E DI       D+     E +R A  A+ +GK
Sbjct: 72  VTEADIRIASQLMDRMNLHGEARRAAQQAFREGK 105


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 29.9 bits (67), Expect = 1.1
 Identities = 13/61 (21%), Positives = 24/61 (39%), Gaps = 7/61 (11%)

Query: 47  PPSKAQSDAPKPEKPFTTP-------EPSAPEEESEESDPELDNTGVISEPDEEEPIPDV 99
               A++ +  PE+P           EP  PE+ +   DP   +   ++ P ++   PD 
Sbjct: 387 HQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPGDDPAEPDG 446

Query: 100 A 100
            
Sbjct: 447 I 447



 Score = 27.3 bits (60), Expect = 7.9
 Identities = 15/61 (24%), Positives = 20/61 (32%), Gaps = 9/61 (14%)

Query: 48  PSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSE 107
           P+  Q DA         P   A E   E              P++  PIPD A   E++ 
Sbjct: 384 PAAHQVDAEAASAAPEEPAALASEAHDETEP---------EVPEKAAPIPDPAKPDELAV 434

Query: 108 E 108
            
Sbjct: 435 A 435


>gnl|CDD|143585 cd07316, terB_like_DjlA, N-terminal tellurium resistance protein
           terB-like domain of heat shock DnaJ-like proteins.
           Tellurium resistance terB-like domain of the DnaJ-like
           DjlA proteins. This family represents the terB-like
           domain of DjlA-like proteins, a subgroup of heat shock
           DnaJ-like proteins.  Escherichia coli DjlA is a type III
           membrane protein with a small N-terminal transmembrane
           region and DnaJ-like domain on the extreme C-terminus.
           Overproduction has been shown to activate the RcsC
           pathway, which regulates the production of the capsular
           exopolysaccharide colanic acid.  The specific function
           of this domain is unknown.
          Length = 106

 Score = 28.3 bits (64), Expect = 1.2
 Identities = 16/59 (27%), Positives = 22/59 (37%), Gaps = 15/59 (25%)

Query: 105 VSEEDID----------KSNEKKREAIHAYNDGK-----FEDAVNAYSEAIKLNPSSAL 148
           VSE +I              E +REAI  +N+GK      E+    +  A    P   L
Sbjct: 16  VSEAEIQAARALMDQMGLDAEARREAIRLFNEGKESDFGLEEYARQFRRACGGRPELLL 74


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 29.6 bits (67), Expect = 1.3
 Identities = 13/53 (24%), Positives = 20/53 (37%), Gaps = 8/53 (15%)

Query: 26  FYLPELKFFKDFVESYGGTIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESD 78
           F  P  +  K+ + +       PS A + A            +AP EE EE +
Sbjct: 257 FTFPLAEKIKEALAN-------PSAAAA-AAAAAAAAAAAAAAAPAEEEEEEE 301


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 30.0 bits (68), Expect = 1.3
 Identities = 16/82 (19%), Positives = 23/82 (28%), Gaps = 8/82 (9%)

Query: 46  PPPSKAQSDAPKPEKP------FTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDV 99
            P   A    P P +P         P    P      +             +EEE   D 
Sbjct: 730 APSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDD 789

Query: 100 ATDIEVSEEDIDKSNEKKREAI 121
           A    + +ED   + E   E +
Sbjct: 790 AP--SMDDEDRRDAEEVAMELL 809


>gnl|CDD|153411 cd07957, Anticodon_Ia_Met, Anticodon-binding domain of methionyl
           tRNA synthetases.  This domain is found in methionyl
           tRNA synthetases (MetRS), which belong to the class Ia
           aminoacyl tRNA synthetases. It lies C-terminal to the
           catalytic core domain, and recognizes and specifically
           binds to the tRNA anticodon (CAU). MetRS catalyzes the
           transfer of methionine to the 3'-end of its tRNA.
          Length = 129

 Score = 28.6 bits (65), Expect = 1.4
 Identities = 14/74 (18%), Positives = 30/74 (40%), Gaps = 11/74 (14%)

Query: 96  IPDVATDIEVSEEDIDKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQ 155
           +P+     E  EE ++++ E   E   A  + +F  A+    E ++L        A+   
Sbjct: 28  VPEFGGLTEEDEELLEEAEELLEEVAEAMEELEFRKALE---EIMEL--------ARAAN 76

Query: 156 SYLQLSKPNACIRD 169
            Y+  + P    ++
Sbjct: 77  KYIDETAPWKLAKE 90


>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
           Members of this family are the S-layer-forming
           halobacterial major cell surface glycoprotein. The
           highest scores below model cutoffs are fragmentary
           paralogs to actual members of the family. Modifications
           include at N-linked and O-linked glycosylation, a
           C-terminal diphytanylglyceryl modification, and probable
           cleavage of the PGF-CTERM tail.
          Length = 782

 Score = 29.5 bits (66), Expect = 1.5
 Identities = 10/43 (23%), Positives = 15/43 (34%)

Query: 55  APKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIP 97
             +     TTP  + PEE +E ++P         E       P
Sbjct: 718 TSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETTTGSSTP 760


>gnl|CDD|129652 TIGR00561, pntA, NAD(P) transhydrogenase, alpha subunit.  This
           integral membrane protein is the alpha subunit of alpha
           2 beta 2 tetramer that couples the proton transport
           across the membrane to the reversible transfer of
           hydride ion equivalents between NAD and NADP. An
           alternate name is pyridine nucleotide transhydrogenase
           alpha subunit. The N-terminal region is homologous to
           alanine dehydrogenase. In some species, such as
           Rhodospirillum rubrum, the alpha chain is replaced by
           two shorter chains, both with some homology to the
           full-length alpha chain modeled here. These score below
           the trusted cutoff [Energy metabolism, Electron
           transport].
          Length = 511

 Score = 29.5 bits (66), Expect = 1.7
 Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 6/36 (16%)

Query: 44  TIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDP 79
           T PPP    S  P+P K       +AP  ESEE   
Sbjct: 366 TWPPPKIQVSAQPQPAK------KAAPVVESEEKSA 395


>gnl|CDD|239121 cd02656, MIT, MIT: domain contained within Microtubule Interacting
           and Trafficking molecules. The MIT domain is found in
           sorting nexins, the nuclear thiol protease PalBH, the
           AAA protein spastin and archaebacterial proteins with
           similar domain architecture, vacuolar sorting proteins
           and others. The molecular function of the MIT domain is
           unclear.
          Length = 75

 Score = 27.3 bits (61), Expect = 1.7
 Identities = 10/33 (30%), Positives = 20/33 (60%)

Query: 108 EDIDKSNEKKREAIHAYNDGKFEDAVNAYSEAI 140
           E + ++ E  ++A+    DG +E+A+  Y EA+
Sbjct: 1   ELLQQAKELIKQAVKEDEDGNYEEALELYKEAL 33


>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 944

 Score = 29.3 bits (66), Expect = 1.7
 Identities = 14/64 (21%), Positives = 23/64 (35%), Gaps = 4/64 (6%)

Query: 49  SKAQSDAPKPE---KPFTTPEPSAPEEESEE-SDPELDNTGVISEPDEEEPIPDVATDIE 104
           S  + D  K     KP T P  + P   S+  S P+   T    + D +  +    +  E
Sbjct: 630 SPKEGDGKKSSADRKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVPE 689

Query: 105 VSEE 108
            +  
Sbjct: 690 AALA 693


>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
           partitioning].
          Length = 324

 Score = 29.1 bits (65), Expect = 1.8
 Identities = 13/75 (17%), Positives = 20/75 (26%)

Query: 45  IPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIE 104
           IP P   Q   P      T P     +   E   P L       +P            ++
Sbjct: 101 IPVPQPPQISDPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQPVHSAAPQPAVQTVQ 160

Query: 105 VSEEDIDKSNEKKRE 119
            +  +     E+  E
Sbjct: 161 PAVPEQQVQPEEVVE 175


>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive. 
            Pullulan is an unusual, industrially important
            polysaccharide in which short alpha-1,4 chains
            (maltotriose) are connected in alpha-1,6 linkages.
            Enzymes that cleave alpha-1,6 linkages in pullulan and
            release maltotriose are called pullulanases although
            pullulan itself may not be the natural substrate. In
            contrast, a glycogen debranching enzyme such GlgX,
            homologous to this family, can release glucose at
            alpha,1-6 linkages from glycogen first subjected to limit
            degradation by phosphorylase. Characterized members of
            this family include a surface-located pullulanase from
            Streptococcus pneumoniae (PMID:11083842) and an
            extracellular bifunctional amylase/pullulanase with
            C-terminal pullulanase activity (PMID:8798645).
          Length = 1111

 Score = 29.4 bits (66), Expect = 1.9
 Identities = 23/80 (28%), Positives = 33/80 (41%), Gaps = 5/80 (6%)

Query: 46   PPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEV 105
            PPP + +  APKP  P   P+ S P+++ +  D +   T   S+ DE    P    + E 
Sbjct: 1014 PPPPEHEPQAPKP--PTQDPDGSKPKDKVDPKDNKDPLTPPGSD-DENGETPK--GNEEK 1068

Query: 106  SEEDIDKSNEKKREAIHAYN 125
             EE  DK            N
Sbjct: 1069 KEEQPDKGANLPNTGTKNSN 1088


>gnl|CDD|223708 COG0635, HemN, Coproporphyrinogen III oxidase and related Fe-S
           oxidoreductases [Coenzyme metabolism].
          Length = 416

 Score = 29.2 bits (66), Expect = 1.9
 Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 11/76 (14%)

Query: 114 NEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSSA---LLYAKRGQSYLQLSKPNACIRDC 170
           N++  +A+   +D   E+A  A   A K   +S    L+Y   GQ+        +   D 
Sbjct: 159 NDEVLKALGRIHD--EEEAKEAVELARKAGFTSINIDLIYGLPGQT------LESLKEDL 210

Query: 171 SVALKINPDSAAAYKF 186
             AL++ PD  + Y  
Sbjct: 211 EQALELGPDHLSLYSL 226


>gnl|CDD|234164 TIGR03302, OM_YfiO, outer membrane assembly lipoprotein YfiO.
           Members of this protein family include YfiO, a
           near-essential protein of the outer membrane, part of a
           complex involved in protein insertion into the bacterial
           outer membrane. Many proteins in this family are
           annotated as ComL, based on the involvement of this
           protein in natural transformation with exogenous DNA in
           Neisseria gonorrhoeae. This protein family shows
           sequence similarity to, but is distinct from, the
           tol-pal system protein YbgF (TIGR02795) [Protein fate,
           Protein and peptide secretion and trafficking].
          Length = 235

 Score = 28.7 bits (65), Expect = 1.9
 Identities = 12/40 (30%), Positives = 18/40 (45%), Gaps = 3/40 (7%)

Query: 124 YNDGKFEDAVNAYSEAIKLNPSSALL-YA--KRGQSYLQL 160
           Y  G + +A+ A    I+L+P+     YA   RG S    
Sbjct: 81  YKSGDYAEAIAAADRFIRLHPNHPDADYAYYLRGLSNYNQ 120


>gnl|CDD|235285 PRK04335, PRK04335, cell division protein ZipA; Provisional.
          Length = 313

 Score = 29.0 bits (65), Expect = 1.9
 Identities = 17/62 (27%), Positives = 29/62 (46%)

Query: 37  FVESYGGTIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPI 96
            V++     P   + Q + P+ E+    PEP A +    E  PE +    + EP+E +P 
Sbjct: 113 PVQAQSQPQPEKVEPQVEEPRDEEVLEEPEPVAAKVPMAEVQPEEETEIEVDEPEEPKPE 172

Query: 97  PD 98
           P+
Sbjct: 173 PE 174


>gnl|CDD|236624 PRK09782, PRK09782, bacteriophage N4 receptor, outer membrane
           subunit; Provisional.
          Length = 987

 Score = 29.5 bits (66), Expect = 2.0
 Identities = 21/72 (29%), Positives = 29/72 (40%), Gaps = 4/72 (5%)

Query: 149 LYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNAC 208
           LY           +P   + D + +L I P SA AY  R   YR       A  DLR A 
Sbjct: 578 LYWWLHAQRYIPGQPELALNDLTRSLNIAP-SANAYVARATIYRQRHNVPAAVSDLRAAL 636

Query: 209 KIDFDE---QAD 217
           +++ +    QA 
Sbjct: 637 ELEPNNSNYQAA 648


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 29.6 bits (66), Expect = 2.0
 Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 8/73 (10%)

Query: 69   APEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEKKREAIHAYNDGK 128
            A +E  EE+D E D      +P EE    D   +    ++  D + + ++      N+  
Sbjct: 4015 AADENKEEADAEKDEPMQDEDPLEENNTLD---EDIQQDDFSDLAEDDEKM-----NEDG 4066

Query: 129  FEDAVNAYSEAIK 141
            FE+ V    E+ +
Sbjct: 4067 FEENVQENEESTE 4079



 Score = 28.0 bits (62), Expect = 6.0
 Identities = 10/70 (14%), Positives = 29/70 (41%), Gaps = 2/70 (2%)

Query: 49   SKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEE 108
            + A+ D P  ++       +  E+  ++   +L       + +E+    +V  + E +E+
Sbjct: 4023 ADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDD--EKMNEDGFEENVQENEESTED 4080

Query: 109  DIDKSNEKKR 118
             +    E ++
Sbjct: 4081 GVKSDEELEQ 4090


>gnl|CDD|225504 COG2956, COG2956, Predicted N-acetylglucosaminyl transferase
           [Carbohydrate transport and metabolism].
          Length = 389

 Score = 28.9 bits (65), Expect = 2.0
 Identities = 23/101 (22%), Positives = 35/101 (34%), Gaps = 10/101 (9%)

Query: 130 EDAVNAYSEAIKLNPSS-----ALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAY 184
           E A++     +KL   +     A  Y +  Q  L  S  +        AL+ +     A 
Sbjct: 158 EKAIDVAERLVKLGGQTYRVEIAQFYCELAQQALASSDVDRARELLKKALQADKKCVRAS 217

Query: 185 KFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADEWLKEVTP 225
              GR     G +++A   L         EQ  E+L EV  
Sbjct: 218 IILGRVELAKGDYQKAVEALERV-----LEQNPEYLSEVLE 253


>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat. 
          Length = 44

 Score = 26.4 bits (58), Expect = 2.1
 Identities = 6/36 (16%), Positives = 12/36 (33%)

Query: 120 AIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQ 155
           A      G  ++A+     A+ L+P          +
Sbjct: 8   ARALLALGDLDEALALLRRALALDPDDPEALLLLAR 43


>gnl|CDD|218099 pfam04465, DUF499, Protein of unknown function (DUF499).  Family of
           uncharacterized hypothetical prokaryotic proteins.
          Length = 1025

 Score = 29.0 bits (65), Expect = 2.2
 Identities = 25/86 (29%), Positives = 32/86 (37%), Gaps = 1/86 (1%)

Query: 58  PEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDID-KSNEK 116
           PE  +  P  S  +EE E  D E    G I E  EEE I     D+ V    I  K  + 
Sbjct: 773 PESNYEIPLKSLVDEECEVKDEEAVLWGYIVEKKEEEFIVKGEFDLSVEPSSIKGKPGDP 832

Query: 117 KREAIHAYNDGKFEDAVNAYSEAIKL 142
               + A   G +   V   S   +L
Sbjct: 833 VEVRVKAKPLGDYRFTVELSSNFGEL 858


>gnl|CDD|227343 COG5010, TadD, Flp pilus assembly protein TadD, contains TPR
           repeats [Intracellular trafficking and secretion].
          Length = 257

 Score = 28.6 bits (64), Expect = 2.4
 Identities = 21/89 (23%), Positives = 39/89 (43%)

Query: 119 EAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINP 178
           +  +   +G F +AV+   +A +L P+    +   G +  QL + +   R    AL++ P
Sbjct: 106 QGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALELAP 165

Query: 179 DSAAAYKFRGRAYRLLGKWEEAAVDLRNA 207
           +  +     G +  L G  E+A   L  A
Sbjct: 166 NEPSIANNLGMSLLLRGDLEDAETLLLPA 194


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
           unknown].
          Length = 769

 Score = 28.9 bits (64), Expect = 2.5
 Identities = 22/104 (21%), Positives = 41/104 (39%), Gaps = 7/104 (6%)

Query: 51  AQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVIS-----EPDEEEPIPDVATDIEV 105
            ++  PK    +        EEE  + + E      I      E D EE   D  + I+ 
Sbjct: 377 KKAAVPKGTSFYQAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDD 436

Query: 106 SEEDIDKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSSALL 149
           +E   + S E++   +  + D + ED    + +  +L PS + +
Sbjct: 437 NEGFEELSPEEEERQLREFRDMEKEDR--EFPDEAELQPSESAI 478


>gnl|CDD|222484 pfam13974, YebO, YebO-like protein.  This short protein is
          uncharacterized. It seems likely to be of phage origin
          as it is found in Bacteriophage HK022 Gp20 and
          Bacteriophage HK97 Gp15. The protein is also found in a
          variety of enteric bacteria.
          Length = 80

 Score = 26.9 bits (60), Expect = 3.1
 Identities = 10/27 (37%), Positives = 12/27 (44%)

Query: 63 TTPEPSAPEEESEESDPELDNTGVISE 89
           T E S    +S   D ELD   +I E
Sbjct: 53 NTVEKSGAVNDSVVEDVELDFKRLIPE 79


>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
           Provisional.
          Length = 376

 Score = 28.3 bits (63), Expect = 3.1
 Identities = 18/104 (17%), Positives = 36/104 (34%), Gaps = 4/104 (3%)

Query: 46  PPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEV 105
           P P     D P  ++      P + + E E ++P  + T + +    ++     A   + 
Sbjct: 75  PAPIDVPPDPPAAQEAVQPTAPPSAQSEPERNEPRPEETPIFAYSSGDQGGSKRAGHGDT 134

Query: 106 SEEDIDKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSSALL 149
                D   +    A         E+ ++   +  +L PS A L
Sbjct: 135 DRRQDDNREDNSLPAGEV----SAENDLSIRMKPTELQPSRATL 174


>gnl|CDD|221958 pfam13176, TPR_7, Tetratricopeptide repeat. 
          Length = 36

 Score = 25.7 bits (57), Expect = 3.5
 Identities = 9/19 (47%), Positives = 11/19 (57%)

Query: 183 AYKFRGRAYRLLGKWEEAA 201
           A    GR YR LG +E+A 
Sbjct: 1   ALSNLGRLYRKLGDYEKAI 19


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 28.1 bits (63), Expect = 3.8
 Identities = 20/82 (24%), Positives = 32/82 (39%), Gaps = 1/82 (1%)

Query: 66  EPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEKKREAIHAYN 125
           E     ++ ++ + E +        DE E I DV +D E+   D +   EK   A  A  
Sbjct: 89  EQGLESDDDDDEEEEWEVEEDEDSDDEGEWI-DVESDKEIESSDSEDEEEKDEAAKKAKE 147

Query: 126 DGKFEDAVNAYSEAIKLNPSSA 147
           D   E +     EA +   + A
Sbjct: 148 DSDEELSEEDEEEAAEEEEAEA 169


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 28.2 bits (63), Expect = 4.3
 Identities = 16/64 (25%), Positives = 25/64 (39%), Gaps = 8/64 (12%)

Query: 57  KPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEK 116
            PE P  T + +APE E E +          +E   E       T  E +E+  ++ N K
Sbjct: 197 PPEPPEPTVDEAAPETEVEATPA--------AESPAEPAEETAETTPEETEDAPEEENNK 248

Query: 117 KREA 120
             + 
Sbjct: 249 PVKP 252


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 28.2 bits (63), Expect = 4.7
 Identities = 12/47 (25%), Positives = 18/47 (38%)

Query: 71  EEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEKK 117
           E E E  D E +      + +EE+         E  E++  K   KK
Sbjct: 30  EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 28.1 bits (63), Expect = 4.9
 Identities = 13/31 (41%), Positives = 16/31 (51%), Gaps = 5/31 (16%)

Query: 65  PEPSAPEEESEESDPELDNTGVISEPDEEEP 95
           P PS PE E EE + E      + E +EEE 
Sbjct: 298 PPPSPPEPEEEEEEEE-----EVPEEEEEEE 323



 Score = 26.9 bits (60), Expect = 9.3
 Identities = 12/35 (34%), Positives = 16/35 (45%), Gaps = 5/35 (14%)

Query: 46  PPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPE 80
           PPPS  + +  + E      E    EEE EE + E
Sbjct: 298 PPPSPPEPEEEEEE-----EEEVPEEEEEEEEEEE 327


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 27.5 bits (62), Expect = 5.4
 Identities = 21/92 (22%), Positives = 37/92 (40%), Gaps = 10/92 (10%)

Query: 63  TTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDI---EVSE--EDIDKSNEKK 117
                +A   ESE  + +L++     E ++E+   +        E  E   ++ +  EK 
Sbjct: 35  AAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEARERFAELREQYEKT 94

Query: 118 REAI--HAYNDGKFEDAVNAYSE---AIKLNP 144
           R+AI  H     K + A  A +E     +L P
Sbjct: 95  RKAIEKHGRGSKKAQKAREALAEEFMQFRLVP 126


>gnl|CDD|193565 cd08018, M20_Acy1_amhX_like, M20 Peptidase Aminoacylase 1 amhX_like
           subfamily.  Peptidase M20 family, Uncharacterized
           subfamily of uncharacterized proteins predicted as
           putative amidohydrolases, including the amhX gene
           product from Bacillus subtilis. These are a class of
           zinc binding homodimeric enzymes involved in hydrolysis
           of N-acetylated proteins. N-terminal acetylation of
           proteins is a widespread and highly conserved process
           that is involved in protection and stability of
           proteins. Several types of aminoacylases can be
           distinguished on the basis of substrate specificity.
           Aminoacylase 1 (ACY1) breaks down cytosolic aliphatic
           N-acyl-alpha-amino acids (except L-aspartate),
           especially N-acetyl-methionine and acetyl-glutamate into
           L-amino acids and an acyl group. However, ACY1 can also
           catalyze the reverse reaction, the synthesis of
           acetylated amino acids. ACY1 may also play a role in
           xenobiotic bioactivation as well as the inter-organ
           processing of amino acid-conjugated xenobiotic
           derivatives (S-substituted-N-acetyl-L-cysteine).
          Length = 365

 Score = 27.6 bits (62), Expect = 6.2
 Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 6/44 (13%)

Query: 82  DNTGVISEPDEEEPIPDVA--TDIEVSEEDIDKSNEKKREAIHA 123
           D TGV++E    +P P VA   D++   +++D   E K  A H+
Sbjct: 46  DCTGVVAEIGSGKPGPVVALRADMDALWQEVD--GEWK--ANHS 85


>gnl|CDD|216082 pfam00723, Glyco_hydro_15, Glycosyl hydrolases family 15.  In
           higher organisms this family is represented by
           phosphorylase kinase subunits.
          Length = 416

 Score = 27.4 bits (61), Expect = 6.6
 Identities = 17/60 (28%), Positives = 21/60 (35%), Gaps = 8/60 (13%)

Query: 30  ELKFFKDFVESYGGTIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISE 89
              FF+  V S  GT P  S      P   K +      A     EE    L   G++SE
Sbjct: 338 ARGFFEKLVSSTVGT-PYSSSTFISIPLAVKEYADGFVKA----VEEYANSL---GLLSE 389


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
           TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
           homolog and probable assembly protein of the Mycoplasma
           terminal organelle. The terminal organelle is involved
           in both cytadherence and gliding motility [Cellular
           processes, Chemotaxis and motility].
          Length = 871

 Score = 27.5 bits (60), Expect = 6.7
 Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 3/69 (4%)

Query: 56  PKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEE--EPIPDVATD-IEVSEEDIDK 112
           P+P +   T   SAPE   EE+ PE++      +P  E     P   TD ++  E D  +
Sbjct: 294 PQPTQTVETKPTSAPESTVEENLPEINQPTQAVQPTSETISTTPVEPTDQLKPKEVDQIQ 353

Query: 113 SNEKKREAI 121
              KK + I
Sbjct: 354 EELKKTKEI 362


>gnl|CDD|182415 PRK10370, PRK10370, formate-dependent nitrite reductase complex
           subunit NrfG; Provisional.
          Length = 198

 Score = 27.2 bits (60), Expect = 6.8
 Identities = 17/73 (23%), Positives = 36/73 (49%), Gaps = 3/73 (4%)

Query: 115 EKKREA--IHAYNDGKFEDA-VNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCS 171
           E +R A  +H +   +  +A + A  + I+ NP ++  +A  G+ YL  +  +  +    
Sbjct: 38  EYQRLADPLHQFASQQTPEAQLQALQDKIRANPQNSEQWALLGEYYLWRNDYDNALLAYR 97

Query: 172 VALKINPDSAAAY 184
            AL++  ++A  Y
Sbjct: 98  QALQLRGENAELY 110


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 27.1 bits (60), Expect = 7.4
 Identities = 19/80 (23%), Positives = 27/80 (33%), Gaps = 8/80 (10%)

Query: 45  IPPPSKAQSDAPKPEKPFTTP--------EPSAPEEESEESDPELDNTGVISEPDEEEPI 96
           IP    ++     P K  TT         +   PE  +EE     +N     + D E P 
Sbjct: 45  IPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPK 104

Query: 97  PDVATDIEVSEEDIDKSNEK 116
                     E+ ID S+ K
Sbjct: 105 VQNTAAEVNHEDVIDLSDLK 124


>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
           [Coenzyme metabolism].
          Length = 400

 Score = 27.3 bits (61), Expect = 8.4
 Identities = 31/162 (19%), Positives = 51/162 (31%), Gaps = 25/162 (15%)

Query: 79  PELDNTGVISEPD-------EEEPIPDVATDIEVSEEDIDKSNEKKRE-----------A 120
           P+L   G++S+ +         E +   A D   SE        + R+           A
Sbjct: 211 PKLRKAGLLSDEEAARLEQQAWEGLLQQARDDNGSEGLKTWWKNQPRKLRNDPELVVAYA 270

Query: 121 IHAYNDGKFEDAVNAYSEAIKLN--PSSALLYAKRGQSYLQLSKPNACIRDCSVALKINP 178
                 G  ++A     +A+K    P    L  +     L+   P   I+     LK +P
Sbjct: 271 ERLIRLGDHDEAQEIIEDALKRQWDPRLCRLIPR-----LRPGDPEPLIKAAEKWLKQHP 325

Query: 179 DSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADEWL 220
           +        GR       W +A+  L  A K+         L
Sbjct: 326 EDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSASDYAEL 367


>gnl|CDD|233927 TIGR02557, HpaP, type III secretion protein HpaP.  This family of
          genes is always found in type III secretion operons,
          althought its function in the processes of secretion
          and virulence is unclear. Hpa stands for Hrp-associated
          gene, where Hrp stands for hypersensitivity response
          and virulence.
          Length = 201

 Score = 26.8 bits (59), Expect = 8.5
 Identities = 9/34 (26%), Positives = 12/34 (35%)

Query: 46 PPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDP 79
           P      D  +PE    T     P E   ++DP
Sbjct: 41 RPEPPPPCDEDRPEPRADTRASDPPPEAPTDADP 74


>gnl|CDD|198111 smart01043, BTAD, Bacterial transcriptional activator domain.
           Found in the DNRI/REDD/AFSR family of regulators. This
           region of AFSR along with the C terminal region is
           capable of independently directing actinorhodin
           production. This family contains TPR repeats.
          Length = 145

 Score = 26.5 bits (59), Expect = 8.7
 Identities = 11/41 (26%), Positives = 15/41 (36%), Gaps = 1/41 (2%)

Query: 181 AAAYKFRGRAYRLLGKWEEAAVDLRNACKID-FDEQADEWL 220
             A +    A   LG+ EEA   L     +D   E+    L
Sbjct: 61  LEALEALAEALLALGRHEEALALLERLLALDPLRERLHRLL 101


>gnl|CDD|188409 TIGR03894, chp_P_marinus_1, conserved hypothetical protein,
           TIGR03894 family.  This protein family is restricted to
           the Prochlorococcus and Synechococcus lineages of the
           Cyanobacteria, and is sporadic in those lineages.
           Members average 100 amino acids in length, including a
           30-residue, highly polar, low complexity region
           sandwiched between an N-terminal region of about 60
           residues and a C-terminal [KR]VVR[KR]RS motif, both
           well-conserved. The function is unknown [Hypothetical
           proteins, Conserved].
          Length = 95

 Score = 25.9 bits (57), Expect = 9.2
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query: 89  EPDEEEPIPDVATDIEVSEEDIDKSNEKKR 118
           EPD+EE   + +   E SE + + S E  R
Sbjct: 60  EPDDEENEGEDSETKEDSESESESSREGPR 89


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 27.0 bits (60), Expect = 9.2
 Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 10/72 (13%)

Query: 42  GGTIPPPSKAQSDAPKPEKPFTTPEPSAP--EEESEESDPELDNTGVISEPDEEEPIPDV 99
           GG      K      K  K  T  +      ++E + SD + D   + S+ D+       
Sbjct: 219 GGGAMKVLKQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDS------ 272

Query: 100 ATDIEVSEEDID 111
             D +VS+ED +
Sbjct: 273 --DDDVSDEDGE 282


>gnl|CDD|233779 TIGR02208, lipid_A_msbB, lipid A biosynthesis
           (KDO)2-(lauroyl)-lipid IVA acyltransferase.  This family
           consists of MsbB in E. coli and closely related proteins
           in other species. MsbB is homologous to HtrB (TIGR02207)
           and acts immediately after it in the biosynthesis of
           KDO-2 lipid A (also called Re LPS and Re endotoxin).
           These two enzymes act after creation of KDO-2 lipid IV-A
           by addition of the KDO sugars [Cell envelope,
           Biosynthesis and degradation of surface polysaccharides
           and lipopolysaccharides].
          Length = 305

 Score = 26.7 bits (59), Expect = 9.8
 Identities = 10/43 (23%), Positives = 20/43 (46%), Gaps = 6/43 (13%)

Query: 10  NKLQLFLGAVKKNPAIFYLPELKF------FKDFVESYGGTIP 46
             ++  L ++K+  + +YLP+         F  F  +Y  T+P
Sbjct: 180 AGIKALLASLKRGESGYYLPDEDHGPEQSVFVPFFATYKATLP 222


>gnl|CDD|234730 PRK00339, minC, septum formation inhibitor; Reviewed.
          Length = 249

 Score = 26.7 bits (59), Expect = 9.8
 Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 2/44 (4%)

Query: 47  PPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEP 90
           PPS A+    +P      PEP+   E+  E  P +  T VI+ P
Sbjct: 108 PPSGARERPLEPSPAEAEPEPAKAPEKPPE--PAVRPTRVITTP 149


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 26.5 bits (59), Expect = 9.9
 Identities = 14/53 (26%), Positives = 22/53 (41%), Gaps = 1/53 (1%)

Query: 67  PSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEKKRE 119
           P  P  + EE +          E   E+ + D   D E+ EE+ D+  E+ R 
Sbjct: 15  PKPPSPKEEEEEALELAIQEAHENALEKKLLDE-LDEELDEEEDDRFLEEYRR 66


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.131    0.375 

Gapped
Lambda     K      H
   0.267   0.0733    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,257,609
Number of extensions: 1169811
Number of successful extensions: 2141
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1951
Number of HSP's successfully gapped: 252
Length of query: 231
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 137
Effective length of database: 6,768,326
Effective search space: 927260662
Effective search space used: 927260662
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 57 (25.7 bits)