RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9209
         (349 letters)



>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
           34 amino acids
           [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
           X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
           in a variety of organisms including bacteria,
           cyanobacteria, yeast, fungi, plants, and humans in
           various subcellular locations; involved in a variety of
           functions including protein-protein interactions, but
           common features in the interaction partners have not
           been defined; involved in chaperone, cell-cycle,
           transciption, and protein transport complexes; the
           number of TPR motifs varies among proteins (1,3-11,13
           15,16,19); 5-6 tandem repeats generate a right-handed
           helical structure with an amphipathic channel that is
           thought to accomodate an alpha-helix of a target
           protein; it has been proposed that TPR proteins
           preferably interact with WD-40 repeat proteins, but in
           many instances several TPR-proteins seem to aggregate to
           multi-protein complexes; examples of TPR-proteins
           include, Cdc16p, Cdc23p and Cdc27p components of the
           cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
           targeting signals, the Tom70p co-receptor for
           mitochondrial targeting signals, Ser/Thr phosphatase 5C
           and the p110 subunit of O-GlcNAc transferase; three
           copies of the repeat are present here.
          Length = 100

 Score = 57.4 bits (139), Expect = 1e-10
 Identities = 23/84 (27%), Positives = 46/84 (54%)

Query: 93  SQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQ 152
            +A+ Y+++AL+L+P+    +  L   Y ++     A++ Y +A+E++  + +A+Y LG 
Sbjct: 17  DEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGL 76

Query: 153 TYEILRLPYYGLYYYKQAHMVRPN 176
            Y  L      L  Y++A  + PN
Sbjct: 77  AYYKLGKYEEALEAYEKALELDPN 100



 Score = 55.8 bits (135), Expect = 5e-10
 Identities = 25/96 (26%), Positives = 49/96 (51%)

Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYEILRLPYYGLYYYKQAH 171
               LG+ Y ++ + + A++ Y +A+E++  +  A+Y L   Y  L      L  Y++A 
Sbjct: 2   ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61

Query: 172 MVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAI 207
            + P++ +    LG A+ K  K  EA++ Y K+  +
Sbjct: 62  ELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97



 Score = 35.8 bits (83), Expect = 0.004
 Identities = 22/137 (16%), Positives = 47/137 (34%), Gaps = 46/137 (33%)

Query: 4   KLYDKLNETEAAADLFMEFVSKLDTFAAPPDKTCGFFAFKYLANHHLKANNLDTAYKCAQ 63
            LY KL + + A + + + +      A          A+  LA  + K    + A +   
Sbjct: 8   NLYYKLGDYDEALEYYEKALELDPDNAD---------AYYNLAAAYYKLGKYEEALED-- 56

Query: 64  KCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEM 123
                                              +++AL+L+P+    +  LG  Y ++
Sbjct: 57  -----------------------------------YEKALELDPDNAKAYYNLGLAYYKL 81

Query: 124 KNTNAAIQCYRQAIEIN 140
                A++ Y +A+E++
Sbjct: 82  GKYEEALEAYEKALELD 98



 Score = 31.6 bits (72), Expect = 0.13
 Identities = 18/83 (21%), Positives = 33/83 (39%), Gaps = 9/83 (10%)

Query: 213 KALFKLAKLYDKLNETEAAADLFMEFVSKLDTFAAPPDKTCGFFAFKYLANHHLKANNLD 272
           +AL  L  LY KL + + A + + + +      A          A+  LA  + K    +
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALELDPDNAD---------AYYNLAAAYYKLGKYE 51

Query: 273 TAYKCAQKCLQHEETAEEAKSLL 295
            A +  +K L+ +    +A   L
Sbjct: 52  EALEDYEKALELDPDNAKAYYNL 74


>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
          Length = 291

 Score = 44.5 bits (103), Expect = 4e-05
 Identities = 40/204 (19%), Positives = 73/204 (35%), Gaps = 6/204 (2%)

Query: 4   KLYDKLNETEAAADLFMEFVSKLDTFAAPPDKTCGFFAFKYLANHHLKANNLDTAYKCAQ 63
            L   L           E +  L+             A   L          + A +  +
Sbjct: 60  GLLLLLALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALGKYEEALELLE 119

Query: 64  KCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNP---NYLGVWTLLGHEY 120
           K L  +   + A++LL   A     D    +A+  +++AL+L+P           LG   
Sbjct: 120 KALALDPDPDLAEALLALGALYELGD--YEEALELYEKALELDPELNELAEALLALGALL 177

Query: 121 MEMKNTNAAIQCYRQAIEIN-NLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPR 179
             +     A++   +A+++N + D  A   LG  Y  L      L YY++A  + P++  
Sbjct: 178 EALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAE 237

Query: 180 MLTALGEAFEKQEKISEAMKCYNK 203
            L  L     +  +  EA++   K
Sbjct: 238 ALYNLALLLLELGRYEEALEALEK 261



 Score = 29.0 bits (63), Expect = 3.5
 Identities = 17/101 (16%), Positives = 40/101 (39%), Gaps = 2/101 (1%)

Query: 40  FAFKYLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYF 99
            A   L          + A +  +K L+     +    L   +   +       +A+ Y+
Sbjct: 168 EALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGK--YEEALEYY 225

Query: 100 QRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEIN 140
           ++AL+L+P+       L    +E+     A++   +A+E++
Sbjct: 226 EKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELD 266


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 39.2 bits (92), Expect = 2e-04
 Identities = 11/44 (25%), Positives = 23/44 (52%)

Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYE 155
               LG+   ++ + + AI+ Y +A+E++  +  A+Y L   Y 
Sbjct: 5   ALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYL 48



 Score = 37.7 bits (88), Expect = 5e-04
 Identities = 11/48 (22%), Positives = 30/48 (62%), Gaps = 1/48 (2%)

Query: 94  QAVLYFQRALKLNPNYLGVWTLLGHEYMEM-KNTNAAIQCYRQAIEIN 140
           +A+  +++AL+L+P+    +  L   Y+++ K+   A++   +A+E++
Sbjct: 21  EAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALELD 68



 Score = 33.4 bits (77), Expect = 0.016
 Identities = 16/57 (28%), Positives = 26/57 (45%)

Query: 176 NDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAA 232
           ++   L  LG A  K     EA++ Y K+  +   + +A + LA  Y KL +    A
Sbjct: 1   DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEA 57



 Score = 29.2 bits (66), Expect = 0.61
 Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 2/67 (2%)

Query: 41  AFKYLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQ 100
           A K L N   K  + D A +  +K L+ +    EA   L     K   D    +A+   +
Sbjct: 5   ALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYE--EALEDLE 62

Query: 101 RALKLNP 107
           +AL+L+P
Sbjct: 63  KALELDP 69


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 42.8 bits (101), Expect = 2e-04
 Identities = 46/197 (23%), Positives = 77/197 (39%), Gaps = 18/197 (9%)

Query: 41  AFKYLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIA-QKRQPDESSSQAVLYF 99
           A   LA+ +    N   A    ++ L+ +    EA+  L  +    ++ +     A    
Sbjct: 637 ALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTES----AKKIA 692

Query: 100 QRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLD------YRAWYGLGQT 153
           +   K +P     + L G  Y+  K+  AAIQ YR+A++           +RA    G T
Sbjct: 693 KSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPSSQNAIKLHRALLASGNT 752

Query: 154 YEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGK 213
            E ++         +      PND  + TAL E +  Q+   +A+K Y         +  
Sbjct: 753 AEAVKT-------LEAWLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAV 805

Query: 214 ALFKLAKLYDKLNETEA 230
            L  LA LY +L +  A
Sbjct: 806 VLNNLAWLYLELKDPRA 822



 Score = 41.2 bits (97), Expect = 8e-04
 Identities = 48/269 (17%), Positives = 102/269 (37%), Gaps = 12/269 (4%)

Query: 39  FFAFKYLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLY 98
           F A   LA   ++  N D A +  +K L  +     A   L  +  +     +  +AV +
Sbjct: 499 FPAAANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTG---NEEEAVAW 555

Query: 99  FQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYEILR 158
            ++A +LNP  +     L   Y+       A+    +A +       AW  LG+      
Sbjct: 556 LEKAAELNPQEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAG 615

Query: 159 LPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKL 218
                +  +K+   ++P+    L  L +A+   +  ++A+    ++  +   + +A   L
Sbjct: 616 DLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGL 675

Query: 219 AKLYDKLNETEAAADLFMEFVSKLDTFAAPPDKTCGFFAFKYLANHHLKANNLDTAYKCA 278
           A+L      TE+A  +      +    A           F+   + +L+  +   A +  
Sbjct: 676 AQLLLAAKRTESAKKIAKSLQKQHPKAA---------LGFELEGDLYLRQKDYPAAIQAY 726

Query: 279 QKCLQHEETAEEAKSLLRSIAQKRQPDES 307
           +K L+   +++ A  L R++       E+
Sbjct: 727 RKALKRAPSSQNAIKLHRALLASGNTAEA 755



 Score = 38.5 bits (90), Expect = 0.004
 Identities = 39/196 (19%), Positives = 74/196 (37%), Gaps = 5/196 (2%)

Query: 41  AFKYLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIA-QKRQPDESSSQAVLYF 99
           A + LA+  L+   +D A       L  +     A SLL          ++    A  Y 
Sbjct: 331 ARRLLASIQLRLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEK----AAEYL 386

Query: 100 QRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYEILRL 159
            +A +L+P      T LG   +   + + AI     A +++    RA   L  +Y     
Sbjct: 387 AKATELDPENAAARTQLGISKLSQGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQ 446

Query: 160 PYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLA 219
               L   K+    +P++  +   LG  +  +  +++A + + K+ +I      A   LA
Sbjct: 447 FDKALAAAKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLA 506

Query: 220 KLYDKLNETEAAADLF 235
           ++  +    + A   F
Sbjct: 507 RIDIQEGNPDDAIQRF 522



 Score = 37.8 bits (88), Expect = 0.009
 Identities = 33/183 (18%), Positives = 67/183 (36%), Gaps = 9/183 (4%)

Query: 92  SSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLG 151
            S+A+   + A +L+P       LL   Y+     + A+   ++  +    +      LG
Sbjct: 413 PSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQPDNASLHNLLG 472

Query: 152 QTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDAD 211
             Y            +++A  + P+       L     ++    +A++ + K   I   +
Sbjct: 473 AIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDPKN 532

Query: 212 GKALFKLAKLYDKLNETEAAADLFMEFVSKLDTFAAPPDKTCGFFAFKYLANHHLKANNL 271
            +A+  LA LY +    E A   ++E  ++L+     P           LA ++L    L
Sbjct: 533 LRAILALAGLYLRTGNEEEAVA-WLEKAAELNPQEIEPALA--------LAQYYLGKGQL 583

Query: 272 DTA 274
             A
Sbjct: 584 KKA 586



 Score = 35.4 bits (82), Expect = 0.047
 Identities = 61/302 (20%), Positives = 110/302 (36%), Gaps = 15/302 (4%)

Query: 21  EFVSKLDTFAAPPDKTCGFFAFKYLAN-HHLKANNLDTAYKCAQKCLQHEETAEEAKSLL 79
            +    +T          +     LA     +  NL+ AY+   + L++   + +A+ LL
Sbjct: 276 NYEDARETLQDALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARRLL 335

Query: 80  RSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEI 139
            SI Q R      + A L    AL L+P+     +LLG  Y+ + +   A +   +A E+
Sbjct: 336 ASI-QLRLGRVDEAIATL--SPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATEL 392

Query: 140 NNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMK 199
           +  +  A   LG +      P   +   + A  + P   R    L  ++ +  +  +A+ 
Sbjct: 393 DPENAAARTQLGISKLSQGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALA 452

Query: 200 CYNKSRAIGDADGKALFKLAKLYDKLNETEAA-ADLFMEFVSKLDTFAAPPDKTCGFFAF 258
              K          A          L + + A A    E    ++     PD    F A 
Sbjct: 453 AAKKLEK--KQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIE-----PDF---FPAA 502

Query: 259 KYLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQPMECAVVL 318
             LA   ++  N D A +  +K L  +     A   L  +  +   +E +   +E A  L
Sbjct: 503 ANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAEL 562

Query: 319 DP 320
           +P
Sbjct: 563 NP 564



 Score = 32.0 bits (73), Expect = 0.50
 Identities = 21/98 (21%), Positives = 37/98 (37%), Gaps = 5/98 (5%)

Query: 44  YLANHHLKANNLDTAYKCAQKCLQHEETAEEAK-SLLRSIAQKRQPDESSSQAVLYFQRA 102
                +L    L+ A K  ++ L  +  +  AK  L +    + + DE    A       
Sbjct: 130 LRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDE----ARALIDEV 185

Query: 103 LKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEIN 140
           L  +P  +    L G   + + N   A+  YR+AI + 
Sbjct: 186 LTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALR 223



 Score = 31.6 bits (72), Expect = 0.75
 Identities = 26/111 (23%), Positives = 43/111 (38%), Gaps = 4/111 (3%)

Query: 44  YLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRAL 103
            LA  +L   + D A K  Q  ++         + L  +  + +      +A+ Y +RAL
Sbjct: 775 ALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNNLAWLYLELKD----PRALEYAERAL 830

Query: 104 KLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTY 154
           KL PN   +   LG   +E    + A+   R+A+ I        Y L    
Sbjct: 831 KLAPNIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAAAIRYHLALAL 881



 Score = 30.4 bits (69), Expect = 1.7
 Identities = 42/175 (24%), Positives = 71/175 (40%), Gaps = 24/175 (13%)

Query: 69  EETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNA 128
           EE  E AKS L    QK +       A++  + AL+ +PN      LLG  Y+ + +  A
Sbjct: 23  EELIEAAKSYL----QKNKYKA----AIIQLKNALQKDPNDAEARFLLGKIYLALGDYAA 74

Query: 129 AIQCYRQAIE-------INNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPRML 181
           A +  R+A+        +  L  RA+   G+  ++L           +  +       +L
Sbjct: 75  AEKELRKALSLGYPKNQVLPLLARAYLLQGKFQQVLD------ELPGKTLLDDEGAAELL 128

Query: 182 TALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKL---YDKLNETEAAAD 233
              G A+    ++  A K Y ++ AI      A   LA+L    ++ +E  A  D
Sbjct: 129 ALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALID 183



 Score = 29.3 bits (66), Expect = 4.1
 Identities = 27/133 (20%), Positives = 46/133 (34%), Gaps = 9/133 (6%)

Query: 175 PNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAADL 234
           PND      LG+ +      + A K   K+ ++G    + L  LA+ Y    + +   D 
Sbjct: 53  PNDAEARFLLGKIYLALGDYAAAEKELRKALSLGYPKNQVLPLLARAYLLQGKFQQVLDE 112

Query: 235 FMEFVSKLDTFAAPPDKTCGFFAFKYLANHHLKANNLDTAYKCAQKCLQHEETAEEAK-S 293
                   D  AA                 +L    L+ A K  ++ L  +  +  AK  
Sbjct: 113 LPGKTLLDDEGAAE--------LLALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLG 164

Query: 294 LLRSIAQKRQPDE 306
           L +    + + DE
Sbjct: 165 LAQLALAENRFDE 177


>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat. 
          Length = 34

 Score = 37.0 bits (87), Expect = 4e-04
 Identities = 8/29 (27%), Positives = 18/29 (62%)

Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEIN 140
               LG+ Y+++   + A++ Y +A+E+N
Sbjct: 3   ALYNLGNAYLKLGKYDEALEYYEKALELN 31



 Score = 25.8 bits (58), Expect = 3.5
 Identities = 8/16 (50%), Positives = 14/16 (87%)

Query: 94  QAVLYFQRALKLNPNY 109
           +A+ Y+++AL+LNPN 
Sbjct: 19  EALEYYEKALELNPNN 34



 Score = 25.5 bits (57), Expect = 4.5
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query: 178 PRMLTALGEAFEKQEKISEAMKCYNK 203
            + L  LG A+ K  K  EA++ Y K
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEK 26



 Score = 25.1 bits (56), Expect = 6.3
 Identities = 12/34 (35%), Positives = 17/34 (50%)

Query: 144 YRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPND 177
            +A Y LG  Y  L      L YY++A  + PN+
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34


>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats.  Repeats present in 4
           or more copies in proteins. Contain a minimum of 34
           amino acids each and self-associate via a "knobs and
           holes" mechanism.
          Length = 34

 Score = 36.3 bits (85), Expect = 8e-04
 Identities = 7/29 (24%), Positives = 19/29 (65%)

Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEIN 140
               LG+ Y+++ + + A++ Y +A+E++
Sbjct: 3   ALYNLGNAYLKLGDYDEALEYYEKALELD 31



 Score = 26.6 bits (60), Expect = 2.0
 Identities = 9/30 (30%), Positives = 14/30 (46%)

Query: 178 PRMLTALGEAFEKQEKISEAMKCYNKSRAI 207
              L  LG A+ K     EA++ Y K+  +
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALEL 30


>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
           PilW.  Members of this family are designated PilF in ref
           (PMID:8973346) and PilW in ref (PMID:15612916). This
           outer membrane protein is required both for pilus
           stability and for pilus function such as adherence to
           human cells. Members of this family contain copies of
           the TPR (tetratricopeptide repeat) domain.
          Length = 234

 Score = 40.0 bits (94), Expect = 0.001
 Identities = 44/192 (22%), Positives = 75/192 (39%), Gaps = 9/192 (4%)

Query: 45  LANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALK 104
           LA  +L+  +L+ A +   K L+H+     A   L    Q+    E   +A   F+RAL 
Sbjct: 37  LALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELE---KAEDSFRRALT 93

Query: 105 LNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLD--YRAWYGLGQTYEILRLPYY 162
           LNPN   V    G    +      A+Q + QAIE        R+    G     L+   +
Sbjct: 94  LNPNNGDVLNNYGTFLCQQGKYEQAMQQFEQAIEDPLYPQPARSLENAGLCA--LKAGDF 151

Query: 163 GLY--YYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAK 220
                Y  +A  + P  P  L  L E +  + +  +A     + +   +   ++L+   +
Sbjct: 152 DKAEKYLTRALQIDPQRPESLLELAELYYLRGQYKDARAYLERYQQTYNQTAESLWLGIR 211

Query: 221 LYDKLNETEAAA 232
           +   L +  AA 
Sbjct: 212 IARALGDVAAAQ 223



 Score = 39.6 bits (93), Expect = 0.001
 Identities = 26/151 (17%), Positives = 50/151 (33%), Gaps = 18/151 (11%)

Query: 100 QRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLG-------- 151
            +AL+ +P+    +  L   Y ++     A   +R+A+ +N  +       G        
Sbjct: 55  DKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGK 114

Query: 152 --QTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGD 209
             Q  +          Y + A        R L   G    K     +A K   ++  I  
Sbjct: 115 YEQAMQQFEQAIEDPLYPQPA--------RSLENAGLCALKAGDFDKAEKYLTRALQIDP 166

Query: 210 ADGKALFKLAKLYDKLNETEAAADLFMEFVS 240
              ++L +LA+LY    + + A      +  
Sbjct: 167 QRPESLLELAELYYLRGQYKDARAYLERYQQ 197


>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat. 
          Length = 34

 Score = 35.5 bits (83), Expect = 0.001
 Identities = 8/30 (26%), Positives = 18/30 (60%)

Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEINN 141
            + LLG  Y+++ +   A + Y +A+E++ 
Sbjct: 3   AYYLLGQIYLQLGDYEEAKEYYEKALELDP 32


>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat. 
          Length = 65

 Score = 36.5 bits (85), Expect = 0.001
 Identities = 11/63 (17%), Positives = 19/63 (30%)

Query: 116 LGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRP 175
           L    +   + + A+     A+    L   A   LG+               + A    P
Sbjct: 3   LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADP 62

Query: 176 NDP 178
           +DP
Sbjct: 63  DDP 65



 Score = 28.4 bits (64), Expect = 1.1
 Identities = 12/58 (20%), Positives = 20/58 (34%)

Query: 183 ALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAADLFMEFVS 240
           AL  A  +     EA+     + A      +AL  L +   +      AA L    ++
Sbjct: 2   ALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALA 59


>gnl|CDD|226687 COG4235, COG4235, Cytochrome c biogenesis factor [Posttranslational
           modification, protein turnover, chaperones].
          Length = 287

 Score = 39.3 bits (92), Expect = 0.002
 Identities = 32/139 (23%), Positives = 52/139 (37%), Gaps = 29/139 (20%)

Query: 76  KSLLRSIAQKRQP--DESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCY 133
           +   R      QP  ++     +   +  L+ NP     W LLG  YM +   + A+  Y
Sbjct: 120 QPADRLADPLAQPPAEQEMEALIARLETHLQQNPGDAEGWDLLGRAYMALGRASDALLAY 179

Query: 134 RQAIEI--NNLDYRAWYGLGQTYEILRLPYYGLYY-------------YKQAHMVRPNDP 178
           R A+ +  +N +     GL +           LYY              +QA  + P + 
Sbjct: 180 RNALRLAGDNPEI--LLGLAEA----------LYYQAGQQMTAKARALLRQALALDPANI 227

Query: 179 RMLTALGEAFEKQEKISEA 197
           R L+ L  A  +Q   +EA
Sbjct: 228 RALSLLAFAAFEQGDYAEA 246


>gnl|CDD|233013 TIGR00540, TPR_hemY_coli, heme biosynthesis-associated TPR protein.
            Members of this protein family are uncharacterized
           tetratricopeptide repeat (TPR) proteins invariably found
           in heme biosynthesis gene clusters. The absence of any
           invariant residues other than Ala argues against this
           protein serving as an enzyme per se. The gene symbol
           hemY assigned in E. coli is unfortunate in that an
           unrelated protein, protoporphyrinogen oxidase (HemG in
           E. coli) is designated HemY in Bacillus subtilis
           [Unknown function, General].
          Length = 367

 Score = 37.7 bits (88), Expect = 0.007
 Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 174 RPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAAD 233
            P+D  +L ALG    +Q+   +A      S ++     +A  +LA+L+++L +TEAAA 
Sbjct: 304 HPDDALLLLALGRLCLRQQLWGKAQSYLEASLSL-APTEEAHLELAQLFEQLGDTEAAAQ 362

Query: 234 LF 235
            +
Sbjct: 363 HY 364


>gnl|CDD|227343 COG5010, TadD, Flp pilus assembly protein TadD, contains TPR
           repeats [Intracellular trafficking and secretion].
          Length = 257

 Score = 37.0 bits (86), Expect = 0.009
 Identities = 21/81 (25%), Positives = 35/81 (43%), Gaps = 16/81 (19%)

Query: 72  AEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQ 131
           A + K+ +R        + +  +AV   ++A +L P     W LLG    ++   + A +
Sbjct: 104 AAQGKNQIR--------NGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARR 155

Query: 132 CYRQAIE--------INNLDY 144
            YRQA+E         NNL  
Sbjct: 156 AYRQALELAPNEPSIANNLGM 176



 Score = 31.6 bits (72), Expect = 0.51
 Identities = 22/117 (18%), Positives = 41/117 (35%), Gaps = 1/117 (0%)

Query: 121 MEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPRM 180
           + M+ T  A      A+  N  D      L     +       L   +++ +  P D  +
Sbjct: 44  LAMRQTQGAAAALGAAVLRNPEDLSI-AKLATALYLRGDADSSLAVLQKSAIAYPKDREL 102

Query: 181 LTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAADLFME 237
           L A G+   +     EA+    K+  +   D +A   L    D+L   + A   + +
Sbjct: 103 LAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQ 159



 Score = 30.1 bits (68), Expect = 1.7
 Identities = 12/64 (18%), Positives = 22/64 (34%)

Query: 174 RPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAAD 233
            P D      LG A ++  +  EA + Y ++  +   +      L        + E A  
Sbjct: 130 APTDWEAWNLLGAALDQLGRFDEARRAYRQALELAPNEPSIANNLGMSLLLRGDLEDAET 189

Query: 234 LFME 237
           L + 
Sbjct: 190 LLLP 193


>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat. 
          Length = 78

 Score = 33.5 bits (77), Expect = 0.019
 Identities = 14/66 (21%), Positives = 26/66 (39%), Gaps = 7/66 (10%)

Query: 179 RMLTALGEAFEKQEKISEAMKCYNKSRAI----GDADGK---ALFKLAKLYDKLNETEAA 231
             L  L     +     EA++   K+  +    G+   +   AL  LA+LY  L + + A
Sbjct: 6   AALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEA 65

Query: 232 ADLFME 237
            +   +
Sbjct: 66  LEYLEK 71



 Score = 30.0 bits (68), Expect = 0.38
 Identities = 14/78 (17%), Positives = 31/78 (39%), Gaps = 7/78 (8%)

Query: 106 NPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEI------NNLDY-RAWYGLGQTYEILR 158
           +P+       L      + + + A++   +A+E+      ++ +  RA   L + Y  L 
Sbjct: 1   HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60

Query: 159 LPYYGLYYYKQAHMVRPN 176
                L Y ++A  +R  
Sbjct: 61  DYDEALEYLEKALALREA 78



 Score = 27.3 bits (61), Expect = 3.4
 Identities = 9/53 (16%), Positives = 24/53 (45%), Gaps = 7/53 (13%)

Query: 94  QAVLYFQRALKL-------NPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEI 139
           +A+   ++AL+L       +P        L   Y+ + + + A++   +A+ +
Sbjct: 23  EALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEALEYLEKALAL 75


>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
           repeats [General function prediction only].
          Length = 484

 Score = 35.8 bits (83), Expect = 0.033
 Identities = 15/76 (19%), Positives = 29/76 (38%), Gaps = 5/76 (6%)

Query: 82  IAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINN 141
           +   +  +     A+   ++AL L+PN   +   L    ++      AI+   + +  + 
Sbjct: 351 LEANKAKE-----AIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDP 405

Query: 142 LDYRAWYGLGQTYEIL 157
            D   W  L Q Y  L
Sbjct: 406 EDPNGWDLLAQAYAEL 421



 Score = 31.6 bits (72), Expect = 0.66
 Identities = 20/129 (15%), Positives = 41/129 (31%)

Query: 69  EETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNA 128
           + +     +     A +        +A+   Q  +   P+      L G   +E      
Sbjct: 299 KRSKRGGLAAQYGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKE 358

Query: 129 AIQCYRQAIEINNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAF 188
           AI+  ++A+ ++         L Q       P   +    +     P DP     L +A+
Sbjct: 359 AIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAY 418

Query: 189 EKQEKISEA 197
            +    +EA
Sbjct: 419 AELGNRAEA 427


>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
          Length = 389

 Score = 35.6 bits (83), Expect = 0.038
 Identities = 14/63 (22%), Positives = 27/63 (42%), Gaps = 1/63 (1%)

Query: 82  IAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINN 141
           +AQ+         A    ++AL  +P  +    LLG   +   +  AAI+   + +E  +
Sbjct: 186 LAQQALARGDLDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEALER-VEEQD 244

Query: 142 LDY 144
            +Y
Sbjct: 245 PEY 247


>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat. 
          Length = 34

 Score = 31.4 bits (72), Expect = 0.042
 Identities = 8/34 (23%), Positives = 18/34 (52%)

Query: 98  YFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQ 131
            +++AL+L+PN    +  L    + +   + A+Q
Sbjct: 1   LYEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34



 Score = 27.5 bits (62), Expect = 0.95
 Identities = 6/24 (25%), Positives = 12/24 (50%)

Query: 132 CYRQAIEINNLDYRAWYGLGQTYE 155
            Y +A+E++  +  A+Y L     
Sbjct: 1   LYEKALELDPNNAEAYYNLALLLL 24


>gnl|CDD|183140 PRK11447, PRK11447, cellulose synthase subunit BcsC; Provisional.
          Length = 1157

 Score = 35.4 bits (82), Expect = 0.052
 Identities = 30/133 (22%), Positives = 51/133 (38%), Gaps = 28/133 (21%)

Query: 94  QAVLYFQRALKLNPN--YLGVWTLLGHEYMEMKNTN--AAIQCYRQAIEINNLDYRAWYG 149
           +AV  F++AL L+P+      W  L      +K       IQ    A++ NNL       
Sbjct: 321 RAVAQFEKALALDPHSSNRDKWESL------LKVNRYWLLIQQGDAALKANNLA------ 368

Query: 150 LGQTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGD 209
             +              Y+QA  V   D   +  LG+    ++  + A + Y ++  +  
Sbjct: 369 QAERL------------YQQARQVDNTDSYAVLGLGDVAMARKDYAAAERYYQQALRMDP 416

Query: 210 ADGKALFKLAKLY 222
            +  A+  LA LY
Sbjct: 417 GNTNAVRGLANLY 429


>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 250

 Score = 33.6 bits (77), Expect = 0.10
 Identities = 29/149 (19%), Positives = 58/149 (38%), Gaps = 21/149 (14%)

Query: 94  QAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAI-------EINNLDYRA 146
           QA    ++AL+ +P+Y     +  H Y ++   + A + YR+A+       ++ N +Y A
Sbjct: 53  QAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLN-NYGA 111

Query: 147 W-YGLG---QTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYN 202
           +    G   +  +          Y + +          L  LG    K  +  +A +   
Sbjct: 112 FLCAQGRPEEAMQQFERALADPAYGEPS--------DTLENLGLCALKAGQFDQAEEYLK 163

Query: 203 KSRAIGDADGKALFKLAKL-YDKLNETEA 230
           ++  +      AL +LA+L Y   +   A
Sbjct: 164 RALELDPQFPPALLELARLHYKAGDYAPA 192



 Score = 33.2 bits (76), Expect = 0.13
 Identities = 19/84 (22%), Positives = 34/84 (40%)

Query: 116 LGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRP 175
           L   Y++  +   A +   +A+E +   Y A       Y+ L         Y++A  + P
Sbjct: 41  LALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAP 100

Query: 176 NDPRMLTALGEAFEKQEKISEAMK 199
           N+  +L   G     Q +  EAM+
Sbjct: 101 NNGDVLNNYGAFLCAQGRPEEAMQ 124



 Score = 30.5 bits (69), Expect = 1.1
 Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 3/64 (4%)

Query: 45  LANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALK 104
           LA  +L+  +   A K  +K L+H+ +   A  +     QK   ++ + +    +++AL 
Sbjct: 41  LALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADE---SYRKALS 97

Query: 105 LNPN 108
           L PN
Sbjct: 98  LAPN 101


>gnl|CDD|221956 pfam13174, TPR_6, Tetratricopeptide repeat. 
          Length = 33

 Score = 29.8 bits (68), Expect = 0.16
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 213 KALFKLAKLYDKLNETEAAADLFMEFVSK 241
            AL+KLA  Y KL +T+ A +     + +
Sbjct: 1   DALYKLALAYLKLGDTDEAKEALERLLKR 29


>gnl|CDD|236624 PRK09782, PRK09782, bacteriophage N4 receptor, outer membrane
           subunit; Provisional.
          Length = 987

 Score = 33.0 bits (75), Expect = 0.25
 Identities = 49/235 (20%), Positives = 89/235 (37%), Gaps = 41/235 (17%)

Query: 34  DKTCGFFAFKYLANHHLKANNLDTAYKCAQKC----------LQHEETAEEAKSLLRSIA 83
           D      A++ ++ H +   +L  A   AQ             Q E+      +L   + 
Sbjct: 524 DYATALAAWQKISLHDMSNEDLLAAANTAQAAGNGAARDRWLQQAEQRGLGDNALYWWLH 583

Query: 84  QKRQPDESSSQAVLYFQRALKLNPN---YLGVWTLLGHEYMEMKNTNAAIQCYRQAIEI- 139
            +R        A+    R+L + P+   Y+   T+    Y +  N  AA+   R A+E+ 
Sbjct: 584 AQRYIPGQPELALNDLTRSLNIAPSANAYVARATI----YRQRHNVPAAVSDLRAALELE 639

Query: 140 -NNLDYRAWYGL--------GQTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEK 190
            NN +Y+A  G          Q+ E+L          ++AH   P+DP ++  L    ++
Sbjct: 640 PNNSNYQAALGYALWDSGDIAQSREML----------ERAHKGLPDDPALIRQLAYVNQR 689

Query: 191 QEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAADLFMEFVSKLDTF 245
            + ++     +     I D D +AL  +  L  + N+         E V +  TF
Sbjct: 690 LDDMAATQ--HYARLVIDDIDNQAL--ITPLTPEQNQQRFNFRRLHEEVGRRWTF 740


>gnl|CDD|226428 COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase,
           SPINDLY family [Posttranslational modification, protein
           turnover, chaperones].
          Length = 620

 Score = 32.4 bits (74), Expect = 0.32
 Identities = 27/127 (21%), Positives = 46/127 (36%), Gaps = 9/127 (7%)

Query: 77  SLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTN---AAIQCY 133
           +   SI     P   S+ A L  +  L +NP        L              A I   
Sbjct: 70  AAFLSILLA--PLADSTLAFLAKRIPLSVNPENCPAVQNLAAALELDGLQFLALADISEI 127

Query: 134 RQAIEINNLDYRA----WYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFE 189
            + +  +N ++      +Y LG+  ++L          ++A  + P  PR+L AL  A +
Sbjct: 128 AEWLSPDNAEFLGHLIRFYQLGRYLKLLGRTAEAELALERAVDLLPKYPRVLGALMTARQ 187

Query: 190 KQEKISE 196
           +Q    E
Sbjct: 188 EQCSWPE 194


>gnl|CDD|232953 TIGR00392, ileS, isoleucyl-tRNA synthetase.  The isoleucyl tRNA
           synthetase (IleS) is a class I amino acyl-tRNA ligase
           and is particularly closely related to the valyl tRNA
           synthetase. This model may recognize IleS from every
           species, including eukaryotic cytosolic and
           mitochondrial forms [Protein synthesis, tRNA
           aminoacylation].
          Length = 861

 Score = 32.7 bits (75), Expect = 0.36
 Identities = 38/153 (24%), Positives = 49/153 (32%), Gaps = 41/153 (26%)

Query: 73  EEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQC 132
           EE +   R  A K Q +E   Q    FQR        LGVW    + Y  M  +    Q 
Sbjct: 111 EEFREKCREFALK-QIEEQREQ----FQR--------LGVWGDWENPYKTMDPSYEESQ- 156

Query: 133 YRQAIEINNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQE 192
                         W+   + +E       GL Y  +        PR  TAL EA E + 
Sbjct: 157 --------------WWLFKEAHE------KGLLY--RGLKPVYWSPRCRTALAEA-EVEY 193

Query: 193 KISEAMK----CYNKSRAIGDADGKALFKLAKL 221
           K +         Y K     D       KL+ L
Sbjct: 194 KENYKDVKDPSIYVKFPVKKDKKTYLKVKLSSL 226


>gnl|CDD|225504 COG2956, COG2956, Predicted N-acetylglucosaminyl transferase
           [Carbohydrate transport and metabolism].
          Length = 389

 Score = 32.4 bits (74), Expect = 0.37
 Identities = 10/59 (16%), Positives = 25/59 (42%)

Query: 82  IAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEIN 140
           +AQ+        +A    ++AL+ +   +    +LG   +   +   A++   + +E N
Sbjct: 186 LAQQALASSDVDRARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQN 244


>gnl|CDD|239108 cd02430, PTH2, Peptidyl-tRNA hydrolase, type 2 (PTH2).
           Peptidyl-tRNA hydrolase (PTH) activity releases tRNA
           from the premature translation termination product
           peptidyl-tRNA, therefore allowing the tRNA and peptide
           to be reused in protein synthesis. PTH2 is present in
           archaea and eukaryotes.
          Length = 115

 Score = 30.6 bits (70), Expect = 0.38
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query: 128 AAIQCYRQAIEINNLDYRAWYGLGQ 152
           AA+  Y++A++ N    RAW   GQ
Sbjct: 25  AALGAYKKAMKSNPELLRAWEREGQ 49


>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat.  This Pfam entry
           includes outlying Tetratricopeptide-like repeats (TPR)
           that are not matched by pfam00515.
          Length = 34

 Score = 28.6 bits (65), Expect = 0.39
 Identities = 7/29 (24%), Positives = 16/29 (55%)

Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEIN 140
               LG  Y ++ +   A++ Y +A+E++
Sbjct: 3   ALYNLGLAYYKLGDYEEALEAYEKALELD 31


>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
           [Coenzyme metabolism].
          Length = 400

 Score = 31.6 bits (72), Expect = 0.59
 Identities = 16/63 (25%), Positives = 27/63 (42%), Gaps = 1/63 (1%)

Query: 175 PNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAADL 234
           P DP +L+ LG    K +   +A +    +  +  +      +LA   D+L E E A  +
Sbjct: 325 PEDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSA-SDYAELADALDQLGEPEEAEQV 383

Query: 235 FME 237
             E
Sbjct: 384 RRE 386


>gnl|CDD|190182 pfam01981, PTH2, Peptidyl-tRNA hydrolase PTH2.  Peptidyl-tRNA
           hydrolases are enzymes that release tRNAs from
           peptidyl-tRNA during translation.
          Length = 116

 Score = 30.2 bits (69), Expect = 0.60
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 128 AAIQCYRQAIEINNLDYRAWYGLGQTYEILR 158
           AA+  Y++A++ N    +AW   GQ   +L+
Sbjct: 26  AAVGAYKKALKANPELLKAWLREGQKKIVLK 56


>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat. 
          Length = 44

 Score = 28.3 bits (63), Expect = 0.73
 Identities = 7/41 (17%), Positives = 17/41 (41%)

Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQ 152
               L    + + + + A+   R+A+ ++  D  A   L +
Sbjct: 3   ALLALARALLALGDLDEALALLRRALALDPDDPEALLLLAR 43



 Score = 26.0 bits (57), Expect = 4.8
 Identities = 14/44 (31%), Positives = 21/44 (47%)

Query: 178 PRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKL 221
           P  L AL  A      + EA+    ++ A+   D +AL  LA+L
Sbjct: 1   PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44


>gnl|CDD|219851 pfam08456, Vmethyltransf_C, Viral methyltransferase C-terminal.
           This domain is found to the C-terminus of the viral
           methyltransferase domain (pfam01660) in
           single-stranded-RNA positive-strand viruses with no DNA
           stage in the Virgaviridae family.
          Length = 208

 Score = 30.5 bits (69), Expect = 0.94
 Identities = 11/58 (18%), Positives = 18/58 (31%)

Query: 273 TAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQPMECAVVLDPVPPSTRSSKF 330
                 +K   +EE   E K + R I   +          E +  L  +    R + F
Sbjct: 84  DFEDLDEKIRLYEEYERERKRISRKIVSDKLEVSEDGSASESSSDLKELRKFRRRASF 141


>gnl|CDD|226456 COG3947, COG3947, Response regulator containing CheY-like receiver
           and SARP domains [Signal transduction mechanisms].
          Length = 361

 Score = 30.2 bits (68), Expect = 1.6
 Identities = 21/88 (23%), Positives = 36/88 (40%), Gaps = 3/88 (3%)

Query: 71  TAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAI 130
           T +E K L+        P+     A    +R  +L    LG    +   Y+E    N AI
Sbjct: 243 TIDELKELVGQYKGDYLPEADYPWAEDERERLEQLYMKLLG---KVARAYLEAGKPNEAI 299

Query: 131 QCYRQAIEINNLDYRAWYGLGQTYEILR 158
           Q +++A+ ++ L  +   GL  +   L 
Sbjct: 300 QLHQRALTLDPLSEQDNKGLMASLATLG 327


>gnl|CDD|182415 PRK10370, PRK10370, formate-dependent nitrite reductase complex
           subunit NrfG; Provisional.
          Length = 198

 Score = 29.5 bits (66), Expect = 2.2
 Identities = 23/97 (23%), Positives = 38/97 (39%), Gaps = 14/97 (14%)

Query: 79  LRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIE 138
           L   A ++ P+      +   Q  ++ NP     W LLG  Y+   + + A+  YRQA++
Sbjct: 46  LHQFASQQTPEAQ----LQALQDKIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQ 101

Query: 139 INNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRP 175
           +           G+  E+       LYY    HM   
Sbjct: 102 LR----------GENAELYAALATVLYYQAGQHMTPQ 128


>gnl|CDD|216950 pfam02259, FAT, FAT domain.  The FAT domain is named after FRAP,
           ATM and TRRAP.
          Length = 350

 Score = 29.3 bits (66), Expect = 2.9
 Identities = 7/29 (24%), Positives = 15/29 (51%)

Query: 127 NAAIQCYRQAIEINNLDYRAWYGLGQTYE 155
           +  +Q YR A++ ++  Y+AW+       
Sbjct: 274 DEILQAYRTAVQFDDQWYKAWHSWALANF 302


>gnl|CDD|235280 PRK04322, PRK04322, peptidyl-tRNA hydrolase; Provisional.
          Length = 113

 Score = 27.9 bits (63), Expect = 3.1
 Identities = 8/25 (32%), Positives = 11/25 (44%)

Query: 128 AAIQCYRQAIEINNLDYRAWYGLGQ 152
           AA+  Y +A + N      W   GQ
Sbjct: 23  AAVSAYEKADKSNREWLEEWLNEGQ 47


>gnl|CDD|214513 smart00093, SERPIN, SERine Proteinase INhibitors. 
          Length = 359

 Score = 29.1 bits (66), Expect = 3.2
 Identities = 13/61 (21%), Positives = 24/61 (39%), Gaps = 9/61 (14%)

Query: 50  LKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNY 109
           L  N  +T+     +  QH         LL  + +     E  +   L+  ++LKL  ++
Sbjct: 46  LGFNLTETSEADIHQGFQH---------LLHLLNRPDSQLELKTANALFVDKSLKLKDSF 96

Query: 110 L 110
           L
Sbjct: 97  L 97


>gnl|CDD|218970 pfam06278, DUF1032, Protein of unknown function (DUF1032).  This
           family consists of several conserved eukaryotic proteins
           of unknown function.
          Length = 565

 Score = 28.8 bits (64), Expect = 5.7
 Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 3/47 (6%)

Query: 301 KRQPDESSSQPMECAVVLDPVPPSTRSSKFPANPAYPFATSDPDDDA 347
           ++ P  +  QPME +    PVP    S + P  PA   A    +++A
Sbjct: 165 QKDPSVAEEQPMEVSDCGSPVPVLQFSQEEPEGPA---ANGGYEEEA 208


>gnl|CDD|224971 COG2060, KdpA, K+-transporting ATPase, A chain [Inorganic ion
           transport and metabolism].
          Length = 560

 Score = 28.4 bits (64), Expect = 6.3
 Identities = 13/36 (36%), Positives = 17/36 (47%), Gaps = 3/36 (8%)

Query: 95  AVLYFQRALKLNPNYLGV--WTLLGHEYME-MKNTN 127
           A+L  Q  L LNP  L    W L  +  +  + NTN
Sbjct: 82  ALLMLQGWLPLNPQGLPGLSWDLAFNTAVSFVTNTN 117


>gnl|CDD|219284 pfam07064, RIC1, RIC1.  RIC1 has been identified in yeast as a
           Golgi protein involved in retrograde transport to the
           cis-Golgi network. It forms a heterodimer with Rgp1 and
           functions as a guanyl-nucleotide exchange factor.
          Length = 251

 Score = 28.1 bits (63), Expect = 6.6
 Identities = 13/56 (23%), Positives = 19/56 (33%), Gaps = 6/56 (10%)

Query: 63  QKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGH 118
           ++CL+     + A S L  +        SS  AV     AL+        W L   
Sbjct: 185 EECLELGRL-KTAASYLLVLQNLEGSASSSDCAVRLLVLALE-----EKNWELCKE 234


>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Exonuclease-1 (EXO1) is involved
           in multiple, eukaryotic DNA metabolic pathways,
           including DNA replication processes (5' flap DNA
           endonuclease activity and double stranded DNA
           5'-exonuclease activity), DNA repair processes (DNA
           mismatch repair (MMR) and post-replication repair
           (PRR)), recombination, and telomere integrity. EXO1
           functions in the MMS2 error-free branch of the PRR
           pathway in the maintenance and repair of stalled
           replication forks. Studies also suggest that EXO1 plays
           both structural and catalytic roles during MMR-mediated
           mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
           family of structure-specific, 5' nucleases. These
           nucleases contain a PIN (PilT N terminus) domain with a
           helical arch/clamp region (I domain) of variable length
           (approximately 43 residues in EXO1 PIN domains) and a
           H3TH (helix-3-turn-helix) domain, an atypical
           helix-hairpin-helix-2-like region. Both the H3TH domain
           (not included here) and the helical arch/clamp region
           are involved in DNA binding. Nucleases within this group
           also have a carboxylate-rich active site that is
           involved in binding essential divalent metal ion
           cofactors (Mg2+/Mn2+). EXO1 nucleases also have
           C-terminal Mlh1- and Msh2-binding domains which allow
           interaction with MMR and PRR proteins, respectively.
          Length = 210

 Score = 27.5 bits (62), Expect = 8.0
 Identities = 6/25 (24%), Positives = 14/25 (56%)

Query: 116 LGHEYMEMKNTNAAIQCYRQAIEIN 140
              E +   N + A +C+++A++I 
Sbjct: 102 KALELLREGNRSEARECFQRAVDIT 126


>gnl|CDD|152569 pfam12134, PRP8_domainIV, PRP8 domain IV core.  This domain is
          found in eukaryotes, and is about 20 amino acids in
          length. It is found associated with pfam10597,
          pfam10596, pfam10598, pfam08083, pfam08082, pfam01398,
          pfam08084. There is a conserved LILR sequence motif.
          The domain is a selenomethionine domain in a subunit of
          the spliceosome. The function of PRP8 domain IV is
          believed to be interaction with the splicosomal core.
          Length = 231

 Score = 27.7 bits (62), Expect = 9.2
 Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 3/29 (10%)

Query: 63 QKCLQH---EETAEEAKSLLRSIAQKRQP 88
          QK L      +TAEE  +L+RS+ ++ QP
Sbjct: 71 QKRLSQLAKWKTAEEVAALIRSLPKEEQP 99



 Score = 27.7 bits (62), Expect = 9.2
 Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 3/29 (10%)

Query: 279 QKCLQH---EETAEEAKSLLRSIAQKRQP 304
           QK L      +TAEE  +L+RS+ ++ QP
Sbjct: 71  QKRLSQLAKWKTAEEVAALIRSLPKEEQP 99


>gnl|CDD|206687 cd01900, YchF, YchF GTPase.  YchF is a member of the Obg family,
           which includes four other subfamilies of GTPases: Obg,
           DRG, Ygr210, and NOG1. Obg is an essential gene that is
           involved in DNA replication in C. crescentus and
           Streptomyces griseus and is associated with the
           ribosome. Several members of the family, including YchF,
           possess the TGS domain related to the RNA-binding
           proteins. Experimental data and genomic analysis suggest
           that YchF may be part of a nucleoprotein complex and may
           function as a GTP-dependent translational factor.
          Length = 274

 Score = 27.8 bits (63), Expect = 9.3
 Identities = 9/19 (47%), Positives = 12/19 (63%), Gaps = 2/19 (10%)

Query: 325 TRSSKFPANPAYPFATSDP 343
           T+S+   AN  YPF T +P
Sbjct: 19  TKSNAEAAN--YPFCTIEP 35


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.131    0.388 

Gapped
Lambda     K      H
   0.267   0.0733    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,349,684
Number of extensions: 1602918
Number of successful extensions: 1834
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1781
Number of HSP's successfully gapped: 120
Length of query: 349
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 251
Effective length of database: 6,590,910
Effective search space: 1654318410
Effective search space used: 1654318410
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.5 bits)