RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16588
         (341 letters)



>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
           34 amino acids
           [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
           X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
           in a variety of organisms including bacteria,
           cyanobacteria, yeast, fungi, plants, and humans in
           various subcellular locations; involved in a variety of
           functions including protein-protein interactions, but
           common features in the interaction partners have not
           been defined; involved in chaperone, cell-cycle,
           transciption, and protein transport complexes; the
           number of TPR motifs varies among proteins (1,3-11,13
           15,16,19); 5-6 tandem repeats generate a right-handed
           helical structure with an amphipathic channel that is
           thought to accomodate an alpha-helix of a target
           protein; it has been proposed that TPR proteins
           preferably interact with WD-40 repeat proteins, but in
           many instances several TPR-proteins seem to aggregate to
           multi-protein complexes; examples of TPR-proteins
           include, Cdc16p, Cdc23p and Cdc27p components of the
           cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
           targeting signals, the Tom70p co-receptor for
           mitochondrial targeting signals, Ser/Thr phosphatase 5C
           and the p110 subunit of O-GlcNAc transferase; three
           copies of the repeat are present here.
          Length = 100

 Score = 55.1 bits (133), Expect = 8e-10
 Identities = 24/101 (23%), Positives = 42/101 (41%), Gaps = 2/101 (1%)

Query: 188 ELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNISHVAIGISDTRLAIQCLH 247
           E   NL    +    YD  +  +E+AL L  + + AD +YN++     +     A++   
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALEL--DPDNADAYYNLAAAYYKLGKYEEALEDYE 58

Query: 248 LALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSP 288
            AL +D  +  +  NL +   + G  E A    + A    P
Sbjct: 59  KALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDP 99



 Score = 54.3 bits (131), Expect = 1e-09
 Identities = 24/94 (25%), Positives = 48/94 (51%)

Query: 123 LNLAKIFEGLNNMPMSVKYYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQM 182
           LNL  ++  L +   +++YY+  L+ D    +A   +   ++   + E AL  Y + L++
Sbjct: 4   LNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL 63

Query: 183 GLYNAELFNNLALCCFYSQQYDMVVTCFERALSL 216
              NA+ + NL L  +   +Y+  +  +E+AL L
Sbjct: 64  DPDNAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97



 Score = 52.4 bits (126), Expect = 7e-09
 Identities = 22/97 (22%), Positives = 47/97 (48%), Gaps = 2/97 (2%)

Query: 159 IGVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLAL 218
           +G  ++     + AL +Y + L++   NA+ + NLA   +   +Y+  +  +E+AL   L
Sbjct: 6   LGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALE--L 63

Query: 219 NENAADVWYNISHVAIGISDTRLAIQCLHLALSIDSS 255
           + + A  +YN+      +     A++    AL +D +
Sbjct: 64  DPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100



 Score = 48.1 bits (115), Expect = 2e-07
 Identities = 22/99 (22%), Positives = 42/99 (42%)

Query: 224 DVWYNISHVAIGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAA 283
           +   N+ ++   + D   A++    AL +D  +  +  NLA    + G  E A    + A
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKA 60

Query: 284 AASSPYLYETHYNQAVISNLAGDLQESYNIVKKSLDLHP 322
               P   + +YN  +     G  +E+    +K+L+L P
Sbjct: 61  LELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDP 99



 Score = 31.6 bits (72), Expect = 0.13
 Identities = 18/77 (23%), Positives = 32/77 (41%)

Query: 261 NNLAVLEAREGHIERASTYLQAAAASSPYLYETHYNQAVISNLAGDLQESYNIVKKSLDL 320
            NL  L  + G  + A  Y + A    P   + +YN A      G  +E+    +K+L+L
Sbjct: 4   LNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL 63

Query: 321 HPGHSYSWDILRKLEQY 337
            P ++ ++  L      
Sbjct: 64  DPDNAKAYYNLGLAYYK 80


>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
          Length = 291

 Score = 50.2 bits (118), Expect = 5e-07
 Identities = 41/184 (22%), Positives = 72/184 (39%), Gaps = 5/184 (2%)

Query: 116 PFIQVSRLNLAKIFEGLNNMPMSVKYYKLILKRDATCMEAIACIGVN-HFYNDQPEVALL 174
           P +  + LNL  + E L     +++  +  L  D     A A + +   +     E AL 
Sbjct: 92  PNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALE 151

Query: 175 FYRRLLQMG---LYNAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNISH 231
            Y + L++       AE    L        +Y+  +   E+AL L  + + A+   N+  
Sbjct: 152 LYEKALELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPD-DDAEALLNLGL 210

Query: 232 VAIGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSPYLY 291
           + + +     A++    AL +D  +  +  NLA+L    G  E A   L+ A    P LY
Sbjct: 211 LYLKLGKYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPDLY 270

Query: 292 ETHY 295
               
Sbjct: 271 NLGL 274



 Score = 44.8 bits (104), Expect = 3e-05
 Identities = 39/192 (20%), Positives = 69/192 (35%), Gaps = 6/192 (3%)

Query: 141 YYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQM--GLYNAELFNNLALCCF 198
             + + K  A  +  +           +   AL      L++      A L   LAL   
Sbjct: 11  LLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALELLPNSDLAGLLLLLALALL 70

Query: 199 YSQQYDMVVTCFERALSLALNENAADVWYNISHVAIGISDTRLAIQCLHLALSIDSSHGL 258
              + +  +   E+AL L L  N A+   N+  +   +     A++ L  AL++D    L
Sbjct: 71  KLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDL 130

Query: 259 SQNNLA-VLEAREGHIERASTYLQAAAASSPYLYE---THYNQAVISNLAGDLQESYNIV 314
           ++  LA       G  E A    + A    P L E          +    G  +E+  ++
Sbjct: 131 AEALLALGALYELGDYEEALELYEKALELDPELNELAEALLALGALLEALGRYEEALELL 190

Query: 315 KKSLDLHPGHSY 326
           +K+L L+P    
Sbjct: 191 EKALKLNPDDDA 202


>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
           PilW.  Members of this family are designated PilF in ref
           (PMID:8973346) and PilW in ref (PMID:15612916). This
           outer membrane protein is required both for pilus
           stability and for pilus function such as adherence to
           human cells. Members of this family contain copies of
           the TPR (tetratricopeptide repeat) domain.
          Length = 234

 Score = 46.2 bits (110), Expect = 7e-06
 Identities = 35/145 (24%), Positives = 55/145 (37%)

Query: 168 QPEVALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWY 227
           + E A   +RR L +   N ++ NN         +Y+  +  FE+A+   L    A    
Sbjct: 80  ELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYEQAMQQFEQAIEDPLYPQPARSLE 139

Query: 228 NISHVAIGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASS 287
           N    A+   D   A + L  AL ID     S   LA L    G  + A  YL+    + 
Sbjct: 140 NAGLCALKAGDFDKAEKYLTRALQIDPQRPESLLELAELYYLRGQYKDARAYLERYQQTY 199

Query: 288 PYLYETHYNQAVISNLAGDLQESYN 312
               E+ +    I+   GD+  +  
Sbjct: 200 NQTAESLWLGIRIARALGDVAAAQR 224



 Score = 28.5 bits (64), Expect = 4.4
 Identities = 15/59 (25%), Positives = 25/59 (42%)

Query: 238 DTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSPYLYETHYN 296
           D  +A + L  AL  D    L+   LA+   + G +E+A    + A   +P   +   N
Sbjct: 46  DLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNN 104



 Score = 27.7 bits (62), Expect = 7.7
 Identities = 19/114 (16%), Positives = 44/114 (38%), Gaps = 2/114 (1%)

Query: 216 LALNENAADVWYNISHVAIGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIER 275
           L  + +    +  ++     + +   A      AL+++ ++G   NN      ++G  E+
Sbjct: 58  LEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYEQ 117

Query: 276 ASTYLQAAAASSPY--LYETHYNQAVISNLAGDLQESYNIVKKSLDLHPGHSYS 327
           A    + A     Y     +  N  + +  AGD  ++   + ++L + P    S
Sbjct: 118 AMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQRPES 171


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 40.8 bits (96), Expect = 5e-05
 Identities = 21/69 (30%), Positives = 35/69 (50%), Gaps = 3/69 (4%)

Query: 186 NAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNISHVAIGI-SDTRLAIQ 244
           NAE   NL    F    YD  +  +E+A  L L+ + A+ +YN++   + +  D   A++
Sbjct: 2   NAEALKNLGNALFKLGDYDEAIEAYEKA--LELDPDNAEAYYNLALAYLKLGKDYEEALE 59

Query: 245 CLHLALSID 253
            L  AL +D
Sbjct: 60  DLEKALELD 68



 Score = 35.8 bits (83), Expect = 0.003
 Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 1/59 (1%)

Query: 160 GVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCCFYS-QQYDMVVTCFERALSLA 217
           G   F     + A+  Y + L++   NAE + NLAL      + Y+  +   E+AL L 
Sbjct: 10  GNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALELD 68



 Score = 29.2 bits (66), Expect = 0.46
 Identities = 16/69 (23%), Positives = 26/69 (37%), Gaps = 1/69 (1%)

Query: 221 NAADVWYNISHVAIGISDTRLAIQCLHLALSIDSSHGLSQNNLA-VLEAREGHIERASTY 279
           + A+   N+ +    + D   AI+    AL +D  +  +  NLA          E A   
Sbjct: 1   DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALED 60

Query: 280 LQAAAASSP 288
           L+ A    P
Sbjct: 61  LEKALELDP 69



 Score = 27.7 bits (62), Expect = 1.9
 Identities = 17/64 (26%), Positives = 28/64 (43%), Gaps = 1/64 (1%)

Query: 260 QNNLAVLEAREGHIERASTYLQAAAASSPYLYETHYNQAVIS-NLAGDLQESYNIVKKSL 318
             NL     + G  + A    + A    P   E +YN A+    L  D +E+   ++K+L
Sbjct: 6   LKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKAL 65

Query: 319 DLHP 322
           +L P
Sbjct: 66  ELDP 69


>gnl|CDD|233924 TIGR02552, LcrH_SycD, type III secretion low calcium response
           chaperone LcrH/SycD.  Genes in this family are found in
           type III secretion operons. LcrH, from Yersinia is
           believed to have a regulatory function in the
           low-calcium response of the secretion system. The same
           protein is also known as SycD (SYC = Specific Yop
           Chaperone) for its chaperone role. In Pseudomonas, where
           the homolog is known as PcrH, the chaperone role has
           been demonstrated and the regulatory role appears to be
           absent. ScyD/LcrH contains three central
           tetratricopeptide-like repeats that are predicted to
           fold into an all-alpha-helical array.
          Length = 135

 Score = 38.8 bits (91), Expect = 8e-04
 Identities = 29/131 (22%), Positives = 59/131 (45%), Gaps = 6/131 (4%)

Query: 142 YKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCCFYSQ 201
            K +L  D+  +E I  +  N +   + + AL  ++ L     YN+  +  LA CC   +
Sbjct: 6   LKDLLGLDSEQLEQIYALAYNLYQQGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLK 65

Query: 202 QYDMVVTCFERALSLALNENAADVWYNISHVAIGISDTRLAIQCLHLALSI---DSSHG- 257
           +Y+  +  +  A +L  ++           +A+G  +   A++ L LA+ I   +  +  
Sbjct: 66  EYEEAIDAYALAAALDPDDPRPYFHAAECLLALG--EPESALKALDLAIEICGENPEYSE 123

Query: 258 LSQNNLAVLEA 268
           L +   A+LE+
Sbjct: 124 LKERAEAMLES 134


>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat. 
          Length = 34

 Score = 33.9 bits (79), Expect = 0.005
 Identities = 9/30 (30%), Positives = 14/30 (46%)

Query: 187 AELFNNLALCCFYSQQYDMVVTCFERALSL 216
           A+   NL        +YD  +  +E+AL L
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALEL 30


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 38.5 bits (90), Expect = 0.005
 Identities = 46/224 (20%), Positives = 78/224 (34%), Gaps = 25/224 (11%)

Query: 111 SQPDGPFIQVSRLNLAKIFEGLNNMPMSVKYYKLILKRDATCMEAIACIGVNHFYNDQ-- 168
            +PD      +  NLA+I     N   +++ ++ +L  D   + AI  + +   Y     
Sbjct: 494 IEPDFFP---AAANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRAI--LALAGLYLRTGN 548

Query: 169 PEVALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYN 228
            E A+ +  +     L   E+   LAL  +Y  +  +         +LA+   AAD    
Sbjct: 549 EEEAVAWLEKA--AELNPQEIEPALALAQYYLGKGQL-------KKALAILNEAAD-AAP 598

Query: 229 ISHVAIGI--------SDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYL 280
            S  A  +         D   A+      L++     L+   LA   A   +  +A T L
Sbjct: 599 DSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSL 658

Query: 281 QAAAASSPYLYETHYNQAVISNLAGDLQESYNIVKKSLDLHPGH 324
           + A    P   E     A +   A   + +  I K     HP  
Sbjct: 659 KRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKSLQKQHPKA 702



 Score = 34.3 bits (79), Expect = 0.11
 Identities = 27/150 (18%), Positives = 57/150 (38%), Gaps = 1/150 (0%)

Query: 176 YRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNISHVAIG 235
            R+ L +G    ++   LA       ++  V+        L  +E AA++        +G
Sbjct: 79  LRKALSLGYPKNQVLPLLARAYLLQGKFQQVLDELPGKTLLD-DEGAAELLALRGLAYLG 137

Query: 236 ISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSPYLYETHY 295
           +    LA +    AL+ID     ++  LA L   E   + A   +     + P   +   
Sbjct: 138 LGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVLTADPGNVDALL 197

Query: 296 NQAVISNLAGDLQESYNIVKKSLDLHPGHS 325
            +  +    G+++ +    +K++ L P + 
Sbjct: 198 LKGDLLLSLGNIELALAAYRKAIALRPNNI 227



 Score = 33.5 bits (77), Expect = 0.15
 Identities = 18/82 (21%), Positives = 36/82 (43%)

Query: 242 AIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSPYLYETHYNQAVIS 301
           A +    ALSI+     +  NLA ++ +EG+ + A    +      P         A + 
Sbjct: 484 AREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRAILALAGLY 543

Query: 302 NLAGDLQESYNIVKKSLDLHPG 323
              G+ +E+   ++K+ +L+P 
Sbjct: 544 LRTGNEEEAVAWLEKAAELNPQ 565



 Score = 30.4 bits (69), Expect = 1.4
 Identities = 28/171 (16%), Positives = 64/171 (37%), Gaps = 2/171 (1%)

Query: 154 EAIACIGVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERA 213
           E +A  G+ +    Q E+A   Y + L +   +      LA       ++D      +  
Sbjct: 126 ELLALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEV 185

Query: 214 LSLALNENAADVWYNISHVAIGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHI 273
           L+   +    D       + + + +  LA+     A+++  ++      LA +    G  
Sbjct: 186 LTA--DPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEAGEF 243

Query: 274 ERASTYLQAAAASSPYLYETHYNQAVISNLAGDLQESYNIVKKSLDLHPGH 324
           E A  +  A    +P     HY +A++     + +++   ++ +L   P +
Sbjct: 244 EEAEKHADALLKKAPNSPLAHYLKALVDFQKKNYEDARETLQDALKSAPEY 294



 Score = 28.1 bits (63), Expect = 8.7
 Identities = 25/157 (15%), Positives = 56/157 (35%), Gaps = 2/157 (1%)

Query: 166 NDQPEVALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADV 225
            ++ + A      +L     N +               ++ +  + +A+  AL  N   V
Sbjct: 172 ENRFDEARALIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAI--ALRPNNIAV 229

Query: 226 WYNISHVAIGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAA 285
              ++ + I   +   A +     L    +  L+    A+++ ++ + E A   LQ A  
Sbjct: 230 LLALATILIEAGEFEEAEKHADALLKKAPNSPLAHYLKALVDFQKKNYEDARETLQDALK 289

Query: 286 SSPYLYETHYNQAVISNLAGDLQESYNIVKKSLDLHP 322
           S+P                G+L+++Y  + + L   P
Sbjct: 290 SAPEYLPALLLAGASEYQLGNLEQAYQYLNQILKYAP 326


>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats.  Repeats present in 4
           or more copies in proteins. Contain a minimum of 34
           amino acids each and self-associate via a "knobs and
           holes" mechanism.
          Length = 34

 Score = 32.0 bits (74), Expect = 0.020
 Identities = 10/31 (32%), Positives = 13/31 (41%)

Query: 187 AELFNNLALCCFYSQQYDMVVTCFERALSLA 217
           AE   NL         YD  +  +E+AL L 
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELD 31



 Score = 25.1 bits (56), Expect = 7.9
 Identities = 8/34 (23%), Positives = 18/34 (52%)

Query: 223 ADVWYNISHVAIGISDTRLAIQCLHLALSIDSSH 256
           A+  YN+ +  + + D   A++    AL +D ++
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34


>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat. 
          Length = 78

 Score = 33.5 bits (77), Expect = 0.020
 Identities = 20/77 (25%), Positives = 33/77 (42%), Gaps = 5/77 (6%)

Query: 184 LYNAELFNNLALCCFYSQQYDMVVTCFERALSLA-----LNENAADVWYNISHVAIGISD 238
              A   NNLAL       YD  +   E+AL LA      +   A    N++ + + + D
Sbjct: 2   PDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGD 61

Query: 239 TRLAIQCLHLALSIDSS 255
              A++ L  AL++  +
Sbjct: 62  YDEALEYLEKALALREA 78



 Score = 30.4 bits (69), Expect = 0.25
 Identities = 20/74 (27%), Positives = 31/74 (41%), Gaps = 7/74 (9%)

Query: 219 NENAADVWYNISHVAIGISDTRLAIQCLHLALSIDSSHG-------LSQNNLAVLEAREG 271
           + + A    N++ V   + D   A++ L  AL +    G        + NNLA L    G
Sbjct: 1   HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60

Query: 272 HIERASTYLQAAAA 285
             + A  YL+ A A
Sbjct: 61  DYDEALEYLEKALA 74



 Score = 28.5 bits (64), Expect = 1.2
 Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 7/71 (9%)

Query: 259 SQNNLAVLEAREGHIERASTYLQAAAASSPYLYETHYNQAVI-SNLA------GDLQESY 311
           + NNLA++  R G  + A   L+ A   +  L E H   A   +NLA      GD  E+ 
Sbjct: 7   ALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEAL 66

Query: 312 NIVKKSLDLHP 322
             ++K+L L  
Sbjct: 67  EYLEKALALRE 77


>gnl|CDD|237610 PRK14099, PRK14099, glycogen synthase; Provisional.
          Length = 485

 Score = 35.8 bits (83), Expect = 0.031
 Identities = 14/33 (42%), Positives = 21/33 (63%)

Query: 19  GIADSVLDTNTIATAARPGTSLKTAAVTAPALT 51
           G+AD+V+D N +A A    T ++ + VTA AL 
Sbjct: 403 GLADTVVDANEMAIATGVATGVQFSPVTADALA 435


>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 250

 Score = 34.7 bits (80), Expect = 0.042
 Identities = 34/147 (23%), Positives = 57/147 (38%), Gaps = 6/147 (4%)

Query: 168 QPEVALLFYRRLLQMGLYNAELFNNLA--LCCFYSQ-QYDMVVTCFERALSLALNENAAD 224
           + ++A   YR+ L +   N ++ NN    LC   +Q + +  +  FERAL+       +D
Sbjct: 84  ENDLADESYRKALSLAPNNGDVLNNYGAFLC---AQGRPEEAMQQFERALADPAYGEPSD 140

Query: 225 VWYNISHVAIGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAA 284
              N+   A+       A + L  AL +D     +   LA L  + G    A  YL+   
Sbjct: 141 TLENLGLCALKAGQFDQAEEYLKRALELDPQFPPALLELARLHYKAGDYAPARLYLERYQ 200

Query: 285 ASSPYLYETHYNQAVISNLAGDLQESY 311
                  E+      I+   GD   + 
Sbjct: 201 QRGGAQAESLLLGIRIAKRLGDRAAAQ 227



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 13/51 (25%), Positives = 21/51 (41%)

Query: 238 DTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSP 288
           D   A + L  AL  D S+ L+    A    + G  + A    + A + +P
Sbjct: 50  DYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAP 100


>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat.  This Pfam entry
           includes outlying Tetratricopeptide-like repeats (TPR)
           that are not matched by pfam00515.
          Length = 34

 Score = 30.6 bits (70), Expect = 0.082
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query: 187 AELFNNLALCCFYSQQYDMVVTCFERALSLA 217
           AE   NL L  +    Y+  +  +E+AL L 
Sbjct: 1   AEALYNLGLAYYKLGDYEEALEAYEKALELD 31


>gnl|CDD|237080 PRK12370, PRK12370, invasion protein regulator; Provisional.
          Length = 553

 Score = 34.1 bits (78), Expect = 0.12
 Identities = 20/98 (20%), Positives = 38/98 (38%), Gaps = 4/98 (4%)

Query: 229 ISHVAIGISDTRLAI----QCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAA 284
           +S   +GI D + A+    +    A  +D ++  +   L ++          S   + A 
Sbjct: 306 LSMAQMGIFDKQNAMIKAKEHAIKATELDHNNPQALGLLGLINTIHSEYIVGSLLFKQAN 365

Query: 285 ASSPYLYETHYNQAVISNLAGDLQESYNIVKKSLDLHP 322
             SP   +  Y       +AG L+E+   + + L L P
Sbjct: 366 LLSPISADIKYYYGWNLFMAGQLEEALQTINECLKLDP 403


>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
          Length = 171

 Score = 32.1 bits (73), Expect = 0.21
 Identities = 16/56 (28%), Positives = 22/56 (39%), Gaps = 1/56 (1%)

Query: 40  LKTAAVTAPALT-SRPRTESGRPVSGVVRPGTLASRGGTLEQSLKTPRTAKSARPL 94
            K AA + P L     RT   + V+     G+  +   T EQ L  PR   +A   
Sbjct: 55  AKPAAASLPRLAEGNFRTMPVQTVTATTPSGSRPAVTATPEQKLLAPRPLFTAARE 110


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 33.4 bits (76), Expect = 0.23
 Identities = 27/94 (28%), Positives = 32/94 (34%), Gaps = 7/94 (7%)

Query: 27   TNTIATAARPGTSL---KTAAVTAPALTSRPRTESGRPVSGVVRPGTLASRGGTLEQSLK 83
               +  AA P   L    +A  TAP     P      P+ G V PG    R         
Sbjct: 2815 AAALPPAASPAGPLPPPTSAQPTAPPPPPGP-PPPSLPLGGSVAPGGDVRRRPPSRSPAA 2873

Query: 84   TPRTAKSARPLTSQAAR-TIRLGTASMLSQPDGP 116
             P  A  ARP   + AR  +   T S    PD P
Sbjct: 2874 KP--AAPARPPVRRLARPAVSRSTESFALPPDQP 2905


>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat. 
          Length = 65

 Score = 30.0 bits (68), Expect = 0.28
 Identities = 15/63 (23%), Positives = 26/63 (41%)

Query: 227 YNISHVAIGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAAS 286
             ++  A+   D   A+  L  AL+       +   L     R+G +  A+  L+AA A+
Sbjct: 1   LALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAA 60

Query: 287 SPY 289
            P 
Sbjct: 61  DPD 63



 Score = 29.2 bits (66), Expect = 0.42
 Identities = 16/63 (25%), Positives = 23/63 (36%)

Query: 263 LAVLEAREGHIERASTYLQAAAASSPYLYETHYNQAVISNLAGDLQESYNIVKKSLDLHP 322
           LA    R G  + A   L+AA A  P   E            G L E+  +++ +L   P
Sbjct: 3   LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADP 62

Query: 323 GHS 325
              
Sbjct: 63  DDP 65


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 31.8 bits (73), Expect = 0.57
 Identities = 17/49 (34%), Positives = 28/49 (57%), Gaps = 2/49 (4%)

Query: 274 ERASTYLQAAAASSPYLYETHYNQAVISNLAGDLQESYNIVKK--SLDL 320
           ++A  Y+  A   +P L E +  +A I   AGDLQ++   +++  SLDL
Sbjct: 210 DKALEYIDKAIEHTPTLVELYMTKARILKHAGDLQKAAEAMEEARSLDL 258


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 31.6 bits (71), Expect = 0.64
 Identities = 24/107 (22%), Positives = 35/107 (32%), Gaps = 3/107 (2%)

Query: 12  DLEAEEEGIADSVLDTNTIATAARPGTSLKTAAVTAPALTSRPRTESGRPVSGVVRPGTL 71
           DL  +   +   V   + +   A        AAV A A T+  +  +   V+G    G  
Sbjct: 270 DLGGQYLRVGKCVTPPDALLQPATVSAIPAAAAVAAAAATA--KIMAAEAVAGAAVLGPR 327

Query: 72  ASRGGTLEQSLKTPRTAKS-ARPLTSQAARTIRLGTASMLSQPDGPF 117
           A    T   SL T    K+       +A     L  A+      GP 
Sbjct: 328 AQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAVVKPGPM 374


>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
          Length = 463

 Score = 30.1 bits (67), Expect = 1.7
 Identities = 22/86 (25%), Positives = 35/86 (40%), Gaps = 1/86 (1%)

Query: 6   SQLSVDDLEAEEEGIADSVLDTNTIATAARPGTSLKTAAVTAPALTSRPRTESGRPVSGV 65
           S  S D +EA    + +S+ D        +PG  ++     A   T +   +   P SGV
Sbjct: 85  SSESGDTVEAVVPHMGESITDGTLATFLKKPGERVQADEAIAQIETDKVTIDIASPASGV 144

Query: 66  VRPGTLASRGGTLEQSLKTPRTAKSA 91
           ++   L   G T+E   K    +KS 
Sbjct: 145 IQE-FLVKEGDTVEPGTKVAIISKSE 169


>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat. 
          Length = 34

 Score = 26.7 bits (60), Expect = 2.0
 Identities = 7/34 (20%), Positives = 16/34 (47%)

Query: 223 ADVWYNISHVAIGISDTRLAIQCLHLALSIDSSH 256
           A  +Y +  + + + D   A +    AL +D ++
Sbjct: 1   ARAYYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34


>gnl|CDD|233797 TIGR02243, TIGR02243, conserved hypothetical protein, phage
           tail-like region.  This family consists of a large,
           conserved hypothetical protein in phage tail-like
           regions of at least six bacterial genomes: Gloeobacter
           violaceus PCC 7421, Geobacter sulfurreducens PCA,
           Streptomyces coelicolor A3(2), Streptomyces avermitilis
           MA-4680, Mesorhizobium loti, and Myxococcus xanthus. The
           function is unknown [Mobile and extrachromosomal element
           functions, Prophage functions].
          Length = 656

 Score = 29.8 bits (67), Expect = 2.1
 Identities = 23/92 (25%), Positives = 30/92 (32%), Gaps = 5/92 (5%)

Query: 23  SVLDTNTIATAARPGTSLKTAAVTAPALTS-RPRTESGRPVSGVVRPGTLASRGGTLEQS 81
            VL T      ARPG         +P +   R  T  G    G     T++  G      
Sbjct: 304 HVLATACRTLPARPGA---PFYTRSPRIARLRANTIGGTVTVGATHSETVSEEGLGSGAG 360

Query: 82  LKTPRTAKSARPLTSQAARTIRLGTASMLSQP 113
           L   R    A P  + AAR + L   + L   
Sbjct: 361 LPGQRFTLPASPALAGAAR-VALLVQADLGVE 391


>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
           repeats [General function prediction only].
          Length = 484

 Score = 29.7 bits (67), Expect = 2.3
 Identities = 16/66 (24%), Positives = 23/66 (34%)

Query: 266 LEAREGHIERASTYLQAAAASSPYLYETHYNQAVISNLAGDLQESYNIVKKSLDLHPGHS 325
           +       + A   L+ A A  P       N A      G  QE+  I+ + L   P   
Sbjct: 349 ILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDP 408

Query: 326 YSWDIL 331
             WD+L
Sbjct: 409 NGWDLL 414


>gnl|CDD|221851 pfam12916, DUF3834, Protein of unknown function (DUF3834).  This
          family is likely to be related to solute-binding
          lipo-proteins.
          Length = 199

 Score = 28.9 bits (65), Expect = 3.1
 Identities = 24/62 (38%), Positives = 33/62 (53%), Gaps = 7/62 (11%)

Query: 12 DLEAEEEGIADSVLDTNTIATAARPGTSLKTAAVTAPALTS-RPRTESGRPVSGVVRPGT 70
          D+   +EG AD VLD+ T++   R    L+  AVT   L S  P+  SG+   GV R G+
Sbjct: 23 DIIFSKEGNADVVLDS-TVSLVKR---GLRIDAVTIKKLYSIYPKIGSGK--IGVWRKGS 76

Query: 71 LA 72
           A
Sbjct: 77 AA 78


>gnl|CDD|220721 pfam10373, EST1_DNA_bind, Est1 DNA/RNA binding domain.  Est1 is a
           protein which recruits or activates telomerase at the
           site of polymerisation. This is the DNA/RNA binding
           domain of EST1.
          Length = 276

 Score = 28.9 bits (65), Expect = 4.1
 Identities = 24/104 (23%), Positives = 43/104 (41%), Gaps = 11/104 (10%)

Query: 242 AIQCLHLALSIDSSHGLSQNNLAVLEAREG-HIERASTYLQAAAASSPYLYETHYNQAVI 300
           A++   LA  +  S G   N L V+    G H++    YL++ A  +P+           
Sbjct: 1   ALRYYLLASLLLPSSGEPYNQLGVIALYVGNHLDAVYYYLRSLAVRNPF-------PTAK 53

Query: 301 SNLA---GDLQESYNIVKKSLDLHPGHSYSWDILRKLEQYFSYL 341
           +NL      ++   ++V + L          D+L++L  YF  L
Sbjct: 54  NNLELLFEKIKFLADLVSQELLNKGKEESKSDLLKRLVIYFLRL 97


>gnl|CDD|182415 PRK10370, PRK10370, formate-dependent nitrite reductase complex
           subunit NrfG; Provisional.
          Length = 198

 Score = 27.9 bits (62), Expect = 7.0
 Identities = 24/70 (34%), Positives = 35/70 (50%), Gaps = 3/70 (4%)

Query: 157 ACIGVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFE---RA 213
           A +G  + + +  + ALL YR+ LQ+   NAEL+  LA   +Y     M     E   +A
Sbjct: 77  ALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALATVLYYQAGQHMTPQTREMIDKA 136

Query: 214 LSLALNENAA 223
           L+L  NE  A
Sbjct: 137 LALDANEVTA 146


>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
          Length = 389

 Score = 27.9 bits (63), Expect = 8.6
 Identities = 17/79 (21%), Positives = 24/79 (30%), Gaps = 9/79 (11%)

Query: 233 AIGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSPYLYE 292
           A+   D   A   L  AL+ D     +   L  L   +G    A   L+      P    
Sbjct: 190 ALARGDLDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEALERVEEQDP---- 245

Query: 293 THYNQAVISNLAGDLQESY 311
                  +S +   L E Y
Sbjct: 246 -----EYLSEVLPKLMECY 259


>gnl|CDD|236368 PRK09058, PRK09058, coproporphyrinogen III oxidase; Provisional.
          Length = 449

 Score = 28.0 bits (63), Expect = 9.1
 Identities = 19/75 (25%), Positives = 28/75 (37%), Gaps = 15/75 (20%)

Query: 274 ERASTYLQAAAASSPYLYETHYNQAVISNLAGDLQES--YNIVKKS----LDLHPG---- 323
           ERA  Y         +L +  + Q   S+ A   +E   YN++ K     L    G    
Sbjct: 277 ERADMYAYGVE----FLAKAGWRQLSNSHWARTTRERNLYNLLIKQGAECLPFGAGAGGS 332

Query: 324 -HSYSWDILRKLEQY 337
              YS+   R L+ Y
Sbjct: 333 IGGYSYMNHRDLDTY 347


>gnl|CDD|223049 PHA03334, PHA03334, putative DNA polymerase catalytic subunit;
           Provisional.
          Length = 1545

 Score = 27.9 bits (62), Expect = 10.0
 Identities = 19/105 (18%), Positives = 35/105 (33%), Gaps = 10/105 (9%)

Query: 117 FIQVSRLNLAKIFEGLNNMPMSVKYYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFY 176
            + +       +F G     +  + +    K +    +A          +   +V  L +
Sbjct: 95  TLPLPERGPITVFNGDTKEDVDCRLFLFGTKAER---KAFIKALRKVLPSLGADV--LTF 149

Query: 177 RRLLQMGLYNAE-LFNNLALCCFYSQQYDMVVTCFERALSLALNE 220
            R +     NA  + N L  CC+Y  +Y        R   +AL E
Sbjct: 150 GRHVSNMDGNAPAILNRLMCCCYYRTKYK----VASRPDVIALTE 190


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.131    0.372 

Gapped
Lambda     K      H
   0.267   0.0867    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,759,447
Number of extensions: 1583406
Number of successful extensions: 1434
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1414
Number of HSP's successfully gapped: 61
Length of query: 341
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 243
Effective length of database: 6,590,910
Effective search space: 1601591130
Effective search space used: 1601591130
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.3 bits)