RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16607
         (677 letters)



>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
           34 amino acids
           [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
           X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
           in a variety of organisms including bacteria,
           cyanobacteria, yeast, fungi, plants, and humans in
           various subcellular locations; involved in a variety of
           functions including protein-protein interactions, but
           common features in the interaction partners have not
           been defined; involved in chaperone, cell-cycle,
           transciption, and protein transport complexes; the
           number of TPR motifs varies among proteins (1,3-11,13
           15,16,19); 5-6 tandem repeats generate a right-handed
           helical structure with an amphipathic channel that is
           thought to accomodate an alpha-helix of a target
           protein; it has been proposed that TPR proteins
           preferably interact with WD-40 repeat proteins, but in
           many instances several TPR-proteins seem to aggregate to
           multi-protein complexes; examples of TPR-proteins
           include, Cdc16p, Cdc23p and Cdc27p components of the
           cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
           targeting signals, the Tom70p co-receptor for
           mitochondrial targeting signals, Ser/Thr phosphatase 5C
           and the p110 subunit of O-GlcNAc transferase; three
           copies of the repeat are present here.
          Length = 100

 Score = 56.6 bits (137), Expect = 4e-10
 Identities = 28/107 (26%), Positives = 49/107 (45%), Gaps = 10/107 (9%)

Query: 460 ELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNISHVAILNALSTSVYNDQP 519
           E   NL    +    YD  +  +E+AL L  + + AD +YN++               + 
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALEL--DPDNADAYYNLAAAYY--------KLGKY 50

Query: 520 EVALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSL 566
           E AL  Y + L++   NA+ + NL L  +   +Y+  +  +E+AL L
Sbjct: 51  EEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97



 Score = 51.6 bits (124), Expect = 3e-08
 Identities = 21/96 (21%), Positives = 47/96 (48%)

Query: 393 IMTEMARIFEGLNNMPMSVKYYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLL 452
            +  +  ++  L +   +++YY+  L+ D    +A   +   ++   + E AL  Y + L
Sbjct: 2   ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61

Query: 453 QMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSL 488
           ++   NA+ + NL L  +   +Y+  +  +E+AL L
Sbjct: 62  ELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97



 Score = 47.8 bits (114), Expect = 6e-07
 Identities = 25/120 (20%), Positives = 43/120 (35%), Gaps = 21/120 (17%)

Query: 538 ELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNISHVAIITECSPFSFSTHT 597
           E   NL    +    YD  +  +E+AL L  + + AD +YN++                 
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALEL--DPDNADAYYNLAAA--------------- 43

Query: 598 SYLFIQGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSP 657
              +  G      A++    AL +D  +  +  NL +   + G  E A    + A    P
Sbjct: 44  --YYKLG--KYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDP 99



 Score = 44.7 bits (106), Expect = 9e-06
 Identities = 18/71 (25%), Positives = 38/71 (53%), Gaps = 2/71 (2%)

Query: 431 IGVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLAL 490
           +G  ++     + AL +Y + L++   NA+ + NLA   +   +Y+  +  +E+AL   L
Sbjct: 6   LGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALE--L 63

Query: 491 NENAADVWYNI 501
           + + A  +YN+
Sbjct: 64  DPDNAKAYYNL 74



 Score = 42.4 bits (100), Expect = 5e-05
 Identities = 17/58 (29%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 522 ALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNI 579
           AL +Y + L++   NA+ + NLA   +   +Y+  +  +E+AL   L+ + A  +YN+
Sbjct: 19  ALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALE--LDPDNAKAYYNL 74



 Score = 41.2 bits (97), Expect = 1e-04
 Identities = 20/94 (21%), Positives = 40/94 (42%), Gaps = 1/94 (1%)

Query: 329 QLGKCYFSLGLIREAQQQFNSALNQF-TDIEAFIRMIRVYIRLDQPIRAIDIGRNALDCY 387
            LG  Y+ LG   EA + +  AL     + +A+  +   Y +L +   A++    AL+  
Sbjct: 5   NLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELD 64

Query: 388 PNEVTIMTEMARIFEGLNNMPMSVKYYKLILKRD 421
           P+       +   +  L     +++ Y+  L+ D
Sbjct: 65  PDNAKAYYNLGLAYYKLGKYEEALEAYEKALELD 98



 Score = 36.2 bits (84), Expect = 0.006
 Identities = 20/96 (20%), Positives = 43/96 (44%)

Query: 358 EAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEVTIMTEMARIFEGLNNMPMSVKYYKLI 417
           EA + +  +Y +L     A++    AL+  P+       +A  +  L     +++ Y+  
Sbjct: 1   EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKA 60

Query: 418 LKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQ 453
           L+ D    +A   +G+ ++   + E AL  Y + L+
Sbjct: 61  LELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96



 Score = 35.0 bits (81), Expect = 0.020
 Identities = 15/63 (23%), Positives = 25/63 (39%)

Query: 607 DTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSPYLYETHYNQ 666
           D   A++    AL +D  +  +  NLA    + G  E A    + A    P   + +YN 
Sbjct: 15  DYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNL 74

Query: 667 AVI 669
            + 
Sbjct: 75  GLA 77


>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
          Length = 291

 Score = 40.6 bits (93), Expect = 0.002
 Identities = 50/272 (18%), Positives = 86/272 (31%), Gaps = 28/272 (10%)

Query: 413 YYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQM--GLYNAELFNNLALCCF 470
             + + K  A  +  +           +   AL      L++      A L   LAL   
Sbjct: 11  LLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALELLPNSDLAGLLLLLALALL 70

Query: 471 YSQQYDMVVTCFERALSLALNENAADVWYNISHVAILNALSTSVYNDQPEVALLFYRRLL 530
              + +  +   E+AL L L  N A+   N+  +             + E AL    + L
Sbjct: 71  KLGRLEEALELLEKALELELLPNLAEALLNLGLLLE--------ALGKYEEALELLEKAL 122

Query: 531 QMGLYNAELFNNLALCCFYSQ-QYDMVVTCFERALSLA-LNENAADVWYNISHV------ 582
            +          LAL   Y    Y+  +  +E+AL L       A+    +  +      
Sbjct: 123 ALDPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEALLALGALLEALGR 182

Query: 583 -----AIITECSPFSFSTHTSYLFIQGIS-----DTRLAIQCLHLALSIDSSHGLSQNNL 632
                 ++ +    +       L   G+          A++    AL +D  +  +  NL
Sbjct: 183 YEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYNL 242

Query: 633 AVLEAREGHIERASTYLQAAAASSPYLYETHY 664
           A+L    G  E A   L+ A    P LY    
Sbjct: 243 ALLLLELGRYEEALEALEKALELDPDLYNLGL 274



 Score = 38.7 bits (88), Expect = 0.007
 Identities = 29/169 (17%), Positives = 63/169 (37%), Gaps = 5/169 (2%)

Query: 291 AKYLFEYLYHHENDVASAMDLAVESTKACEFRDWWWKVQLGKCYFSLGLIREAQQQFNSA 350
           A      L         A++L  ++       D    +      + LG   EA + +  A
Sbjct: 97  ALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALELYEKA 156

Query: 351 LNQFTD----IEAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEVT-IMTEMARIFEGLN 405
           L    +     EA + +  +   L +   A+++   AL   P++    +  +  ++  L 
Sbjct: 157 LELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLG 216

Query: 406 NMPMSVKYYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQM 454
               +++YY+  L+ D    EA+  + +      + E AL    + L++
Sbjct: 217 KYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALEL 265



 Score = 37.1 bits (84), Expect = 0.020
 Identities = 24/112 (21%), Positives = 41/112 (36%), Gaps = 1/112 (0%)

Query: 279 LNLAKYARDKTVAKYLFEYLYHHENDVASAMDLAVESTKACEFRDWWWKVQLGKCYFSLG 338
           L L     +   A      L         A++L  ++ K     D    + LG  Y  LG
Sbjct: 157 LELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLG 216

Query: 339 LIREAQQQFNSALNQF-TDIEAFIRMIRVYIRLDQPIRAIDIGRNALDCYPN 389
              EA + +  AL     + EA   +  + + L +   A++    AL+  P+
Sbjct: 217 KYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPD 268



 Score = 32.9 bits (73), Expect = 0.45
 Identities = 28/145 (19%), Positives = 55/145 (37%), Gaps = 4/145 (2%)

Query: 281 LAKYARDKTVAKYLFEYLYHHENDVASAMDLAVESTKA--CEFRDWWWKVQLGKCYFSLG 338
           LA           L     +   D   A++L  ++ +            + LG    +LG
Sbjct: 122 LALDPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEALLALGALLEALG 181

Query: 339 LIREAQQQFNSAL--NQFTDIEAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEVTIMTE 396
              EA +    AL  N   D EA + +  +Y++L +   A++    AL+  P+    +  
Sbjct: 182 RYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYN 241

Query: 397 MARIFEGLNNMPMSVKYYKLILKRD 421
           +A +   L     +++  +  L+ D
Sbjct: 242 LALLLLELGRYEEALEALEKALELD 266


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 35.8 bits (83), Expect = 0.005
 Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 1/59 (1%)

Query: 432 GVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCCFYS-QQYDMVVTCFERALSLA 489
           G   F     + A+  Y + L++   NAE + NLAL      + Y+  +   E+AL L 
Sbjct: 10  GNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALELD 68



 Score = 35.8 bits (83), Expect = 0.005
 Identities = 19/74 (25%), Positives = 31/74 (41%), Gaps = 9/74 (12%)

Query: 458 NAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNISHVAILNALSTSVYND 517
           NAE   NL    F    YD  +  +E+A  L L+ + A+ +YN++   +           
Sbjct: 2   NAEALKNLGNALFKLGDYDEAIEAYEKA--LELDPDNAEAYYNLALAYLKL-------GK 52

Query: 518 QPEVALLFYRRLLQ 531
             E AL    + L+
Sbjct: 53  DYEEALEDLEKALE 66



 Score = 35.0 bits (81), Expect = 0.008
 Identities = 21/87 (24%), Positives = 36/87 (41%), Gaps = 20/87 (22%)

Query: 536 NAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNISHVAIITECSPFSFST 595
           NAE   NL    F    YD  +  +E+A  L L+ + A+ +YN++               
Sbjct: 2   NAEALKNLGNALFKLGDYDEAIEAYEKA--LELDPDNAEAYYNLALA------------- 46

Query: 596 HTSYLFIQGISDTRLAIQCLHLALSID 622
                +++   D   A++ L  AL +D
Sbjct: 47  -----YLKLGKDYEEALEDLEKALELD 68



 Score = 32.7 bits (75), Expect = 0.068
 Identities = 15/51 (29%), Positives = 25/51 (49%), Gaps = 1/51 (1%)

Query: 518 QPEVALLFYRRLLQMGLYNAELFNNLALCCFYS-QQYDMVVTCFERALSLA 567
             + A+  Y + L++   NAE + NLAL      + Y+  +   E+AL L 
Sbjct: 18  DYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALELD 68



 Score = 28.4 bits (64), Expect = 2.3
 Identities = 12/67 (17%), Positives = 24/67 (35%), Gaps = 1/67 (1%)

Query: 356 DIEAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEVTIMTEMARIFEGLNNMPMS-VKYY 414
           + EA   +     +L     AI+    AL+  P+       +A  +  L       ++  
Sbjct: 2   NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61

Query: 415 KLILKRD 421
           +  L+ D
Sbjct: 62  EKALELD 68


>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat. 
          Length = 34

 Score = 33.9 bits (79), Expect = 0.010
 Identities = 9/30 (30%), Positives = 14/30 (46%)

Query: 459 AELFNNLALCCFYSQQYDMVVTCFERALSL 488
           A+   NL        +YD  +  +E+AL L
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALEL 30



 Score = 33.9 bits (79), Expect = 0.010
 Identities = 9/30 (30%), Positives = 14/30 (46%)

Query: 537 AELFNNLALCCFYSQQYDMVVTCFERALSL 566
           A+   NL        +YD  +  +E+AL L
Sbjct: 1   AKALYNLGNAYLKLGKYDEALEYYEKALEL 30


>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats.  Repeats present in 4
           or more copies in proteins. Contain a minimum of 34
           amino acids each and self-associate via a "knobs and
           holes" mechanism.
          Length = 34

 Score = 32.0 bits (74), Expect = 0.039
 Identities = 10/31 (32%), Positives = 13/31 (41%)

Query: 459 AELFNNLALCCFYSQQYDMVVTCFERALSLA 489
           AE   NL         YD  +  +E+AL L 
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELD 31



 Score = 32.0 bits (74), Expect = 0.039
 Identities = 10/31 (32%), Positives = 13/31 (41%)

Query: 537 AELFNNLALCCFYSQQYDMVVTCFERALSLA 567
           AE   NL         YD  +  +E+AL L 
Sbjct: 1   AEALYNLGNAYLKLGDYDEALEYYEKALELD 31


>gnl|CDD|237610 PRK14099, PRK14099, glycogen synthase; Provisional.
          Length = 485

 Score = 36.2 bits (84), Expect = 0.050
 Identities = 14/33 (42%), Positives = 21/33 (63%)

Query: 175 GIADSVLDTNTIATAARPGTSLKTAAVTAPALT 207
           G+AD+V+D N +A A    T ++ + VTA AL 
Sbjct: 403 GLADTVVDANEMAIATGVATGVQFSPVTADALA 435


>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat.  This Pfam entry
           includes outlying Tetratricopeptide-like repeats (TPR)
           that are not matched by pfam00515.
          Length = 34

 Score = 30.6 bits (70), Expect = 0.14
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query: 459 AELFNNLALCCFYSQQYDMVVTCFERALSLA 489
           AE   NL L  +    Y+  +  +E+AL L 
Sbjct: 1   AEALYNLGLAYYKLGDYEEALEAYEKALELD 31



 Score = 30.6 bits (70), Expect = 0.14
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query: 537 AELFNNLALCCFYSQQYDMVVTCFERALSLA 567
           AE   NL L  +    Y+  +  +E+AL L 
Sbjct: 1   AEALYNLGLAYYKLGDYEEALEAYEKALELD 31


>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
           PilW.  Members of this family are designated PilF in ref
           (PMID:8973346) and PilW in ref (PMID:15612916). This
           outer membrane protein is required both for pilus
           stability and for pilus function such as adherence to
           human cells. Members of this family contain copies of
           the TPR (tetratricopeptide repeat) domain.
          Length = 234

 Score = 33.5 bits (77), Expect = 0.26
 Identities = 31/133 (23%), Positives = 45/133 (33%), Gaps = 19/133 (14%)

Query: 518 QPEVALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWY 577
           + E A   +RR L +   N ++ NN         +Y+  +  FE+A+   L    A    
Sbjct: 80  ELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYEQAMQQFEQAIEDPLYPQPARSLE 139

Query: 578 NISHVAIITECSPFSFSTHTSYLFIQGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEA 637
           N    A+                      D   A + L  AL ID     S   LA L  
Sbjct: 140 NAGLCAL-------------------KAGDFDKAEKYLTRALQIDPQRPESLLELAELYY 180

Query: 638 REGHIERASTYLQ 650
             G  + A  YL+
Sbjct: 181 LRGQYKDARAYLE 193



 Score = 28.8 bits (65), Expect = 7.6
 Identities = 15/59 (25%), Positives = 25/59 (42%)

Query: 607 DTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSPYLYETHYN 665
           D  +A + L  AL  D    L+   LA+   + G +E+A    + A   +P   +   N
Sbjct: 46  DLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNN 104


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 34.1 bits (78), Expect = 0.33
 Identities = 27/94 (28%), Positives = 32/94 (34%), Gaps = 7/94 (7%)

Query: 183  TNTIATAARPGTSL---KTAAVTAPALTSRPRTESGRPVSGVVRPGTLASRGGTLEQSLK 239
               +  AA P   L    +A  TAP     P      P+ G V PG    R         
Sbjct: 2815 AAALPPAASPAGPLPPPTSAQPTAPPPPPGP-PPPSLPLGGSVAPGGDVRRRPPSRSPAA 2873

Query: 240  TPRTAKSARPLTSQAAR-TIRLGTASMLSQPDGP 272
             P  A  ARP   + AR  +   T S    PD P
Sbjct: 2874 KP--AAPARPPVRRLARPAVSRSTESFALPPDQP 2905


>gnl|CDD|145533 pfam02437, Ski_Sno, SKI/SNO/DAC family.  This family contains a
           presumed domain that is about 100 amino acids long. All
           members of this family contain a conserved CLPQ motif.
           The c-ski proto-oncogene has been shown to influence
           proliferation, morphological transformation and myogenic
           differentiation. Sno, a Ski proto-oncogene homologue, is
           expressed in two isoforms and plays a role in the
           response to proliferation stimuli. Dachshund also
           contains this domain. It is involved in various aspects
           of development.
          Length = 101

 Score = 31.2 bits (71), Expect = 0.47
 Identities = 21/78 (26%), Positives = 29/78 (37%), Gaps = 20/78 (25%)

Query: 345 QQFNSALNQFTDIEAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEVTIMTEMARIFEGL 404
           Q FN  L  F+       + ++  +L             LD Y    T+  E  RI  GL
Sbjct: 34  QIFNLFLKHFS-------LHQINTKLK-----------RLDIYCVVCTV--EQLRILRGL 73

Query: 405 NNMPMSVKYYKLILKRDA 422
             +P  V    LI + DA
Sbjct: 74  GAIPPGVNRCGLITRTDA 91


>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
          Length = 171

 Score = 32.1 bits (73), Expect = 0.59
 Identities = 16/56 (28%), Positives = 22/56 (39%), Gaps = 1/56 (1%)

Query: 196 LKTAAVTAPALT-SRPRTESGRPVSGVVRPGTLASRGGTLEQSLKTPRTAKSARPL 250
            K AA + P L     RT   + V+     G+  +   T EQ L  PR   +A   
Sbjct: 55  AKPAAASLPRLAEGNFRTMPVQTVTATTPSGSRPAVTATPEQKLLAPRPLFTAARE 110


>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat. 
          Length = 65

 Score = 29.6 bits (67), Expect = 0.63
 Identities = 14/52 (26%), Positives = 22/52 (42%)

Query: 607 DTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSPY 658
           D   A+  L  AL+       +   L     R+G +  A+  L+AA A+ P 
Sbjct: 12  DYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPD 63


>gnl|CDD|233924 TIGR02552, LcrH_SycD, type III secretion low calcium response
           chaperone LcrH/SycD.  Genes in this family are found in
           type III secretion operons. LcrH, from Yersinia is
           believed to have a regulatory function in the
           low-calcium response of the secretion system. The same
           protein is also known as SycD (SYC = Specific Yop
           Chaperone) for its chaperone role. In Pseudomonas, where
           the homolog is known as PcrH, the chaperone role has
           been demonstrated and the regulatory role appears to be
           absent. ScyD/LcrH contains three central
           tetratricopeptide-like repeats that are predicted to
           fold into an all-alpha-helical array.
          Length = 135

 Score = 31.1 bits (71), Expect = 0.64
 Identities = 18/79 (22%), Positives = 37/79 (46%)

Query: 414 YKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCCFYSQ 473
            K +L  D+  +E I  +  N +   + + AL  ++ L     YN+  +  LA CC   +
Sbjct: 6   LKDLLGLDSEQLEQIYALAYNLYQQGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLK 65

Query: 474 QYDMVVTCFERALSLALNE 492
           +Y+  +  +  A +L  ++
Sbjct: 66  EYEEAIDAYALAAALDPDD 84


>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
           lipoprotein.  This protein family occurs in strictly
           within a subset of Gram-negative bacterial species with
           the proposed PEP-CTERM/exosortase system, analogous to
           the LPXTG/sortase system common in Gram-positive
           bacteria. This protein occurs in a species if and only
           if a transmembrane histidine kinase (TIGR02916) and a
           DNA-binding response regulator (TIGR02915) also occur.
           The present of tetratricopeptide repeats (TPR) suggests
           protein-protein interaction, possibly for the regulation
           of PEP-CTERM protein expression, since many PEP-CTERM
           proteins in these genomes are preceded by a proposed DNA
           binding site for the response regulator.
          Length = 899

 Score = 32.7 bits (75), Expect = 0.81
 Identities = 29/161 (18%), Positives = 60/161 (37%), Gaps = 1/161 (0%)

Query: 330 LGKCYFSLGLIREAQQQFNSALN-QFTDIEAFIRMIRVYIRLDQPIRAIDIGRNALDCYP 388
           LG  Y   G + +A++ F  AL+ +     A   + R+ I+   P  AI      L   P
Sbjct: 471 LGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDP 530

Query: 389 NEVTIMTEMARIFEGLNNMPMSVKYYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFY 448
             +  +  +A ++    N   +V + +   + +   +E    +   +    Q + AL   
Sbjct: 531 KNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAIL 590

Query: 449 RRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLA 489
                    + E +  L      +   +  V+ F++ L+L 
Sbjct: 591 NEAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQ 631


>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat. 
          Length = 78

 Score = 29.6 bits (67), Expect = 0.84
 Identities = 18/82 (21%), Positives = 29/82 (35%), Gaps = 13/82 (15%)

Query: 456 LYNAELFNNLALCCFYSQQYDMVVTCFERALSLA-----LNENAADVWYNISHVAILNAL 510
              A   NNLAL       YD  +   E+AL LA      +   A    N++ +      
Sbjct: 2   PDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLY----- 56

Query: 511 STSVYNDQPEVALLFYRRLLQM 532
              +     + AL +  + L +
Sbjct: 57  ---LALGDYDEALEYLEKALAL 75



 Score = 28.9 bits (65), Expect = 1.5
 Identities = 14/66 (21%), Positives = 23/66 (34%), Gaps = 7/66 (10%)

Query: 431 IGVNHFYNDQPEVALLFYRRLLQM-------GLYNAELFNNLALCCFYSQQYDMVVTCFE 483
           + +        + AL    + L++           A   NNLA        YD  +   E
Sbjct: 11  LALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEALEYLE 70

Query: 484 RALSLA 489
           +AL+L 
Sbjct: 71  KALALR 76



 Score = 27.3 bits (61), Expect = 6.1
 Identities = 13/34 (38%), Positives = 15/34 (44%)

Query: 534 LYNAELFNNLALCCFYSQQYDMVVTCFERALSLA 567
              A   NNLAL       YD  +   E+AL LA
Sbjct: 2   PDLAAALNNLALVLRRLGDYDEALELLEKALELA 35


>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
          Length = 463

 Score = 32.4 bits (73), Expect = 0.85
 Identities = 33/167 (19%), Positives = 61/167 (36%), Gaps = 4/167 (2%)

Query: 81  IATAARPGTSLKTAANDQQPISTSVMSTPGSAPTATVDSMFYAVDHFHKRNFEKCAAICT 140
           I  A+  G+S        Q  S    S+P     +   ++ +  +H H  +        +
Sbjct: 7   IRRASTRGSSPSLFGKSLQS-SRVAASSPSLLSGSETGALLHRGNHAHSFHNLALPG-NS 64

Query: 141 ELLAKNPYDKAAWTLKMRSLTSQLSVDDLEAEEEGIADSVLDTNTIATAARPGTSLKTAA 200
            +        +     +R  +S+ S D +EA    + +S+ D        +PG  ++   
Sbjct: 65  GISRSASLVSSTLQRWVRPFSSE-SGDTVEAVVPHMGESITDGTLATFLKKPGERVQADE 123

Query: 201 VTAPALTSRPRTESGRPVSGVVRPGTLASRGGTLEQSLKTPRTAKSA 247
             A   T +   +   P SGV++   L   G T+E   K    +KS 
Sbjct: 124 AIAQIETDKVTIDIASPASGVIQE-FLVKEGDTVEPGTKVAIISKSE 169


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 32.0 bits (72), Expect = 1.2
 Identities = 24/107 (22%), Positives = 35/107 (32%), Gaps = 3/107 (2%)

Query: 168 DLEAEEEGIADSVLDTNTIATAARPGTSLKTAAVTAPALTSRPRTESGRPVSGVVRPGTL 227
           DL  +   +   V   + +   A        AAV A A T+  +  +   V+G    G  
Sbjct: 270 DLGGQYLRVGKCVTPPDALLQPATVSAIPAAAAVAAAAATA--KIMAAEAVAGAAVLGPR 327

Query: 228 ASRGGTLEQSLKTPRTAKS-ARPLTSQAARTIRLGTASMLSQPDGPF 273
           A    T   SL T    K+       +A     L  A+      GP 
Sbjct: 328 AQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAVVKPGPM 374


>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat. 
          Length = 44

 Score = 28.3 bits (63), Expect = 1.3
 Identities = 10/43 (23%), Positives = 20/43 (46%)

Query: 358 EAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEVTIMTEMARI 400
            A + + R  + L     A+ + R AL   P++   +  +AR+
Sbjct: 2   AALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44


>gnl|CDD|197279 cd09182, PLDc_FAM83B_N, N-terminal phospholipase D-like domain of
           the uncharacterized protein, Family with sequence
           similarity 83B.  N-terminal phospholipase D (PLD)-like
           domain of the uncharacterized protein, Family with
           sequence similarity 83B (FAM83B). Since the N-terminal
           PLD-like domain of FAM83 proteins shows only trace
           similarity to the PLD catalytic domain and lacks the
           functionally important histidine residue, FAM83 proteins
           may share a similar three-dimensional fold with PLD
           enzymes, but are most unlikely to carry PLD activity.
           The N-terminus of FAM83B shows high homology to other
           FAM83 family members, indicating that FAM83B might have
           arisen early in vertebrate evolution by duplication of a
           gene in the FAM83 family.
          Length = 266

 Score = 30.6 bits (69), Expect = 2.8
 Identities = 16/40 (40%), Positives = 22/40 (55%), Gaps = 7/40 (17%)

Query: 340 IREAQQQFNSALNQFTD-------IEAFIRMIRVYIRLDQ 372
           I+EA+Q    A++ FTD       +EA  R + VYI LD 
Sbjct: 126 IQEARQVIAIAMDVFTDVDIFKEVVEASTRGVAVYILLDH 165


>gnl|CDD|233797 TIGR02243, TIGR02243, conserved hypothetical protein, phage
           tail-like region.  This family consists of a large,
           conserved hypothetical protein in phage tail-like
           regions of at least six bacterial genomes: Gloeobacter
           violaceus PCC 7421, Geobacter sulfurreducens PCA,
           Streptomyces coelicolor A3(2), Streptomyces avermitilis
           MA-4680, Mesorhizobium loti, and Myxococcus xanthus. The
           function is unknown [Mobile and extrachromosomal element
           functions, Prophage functions].
          Length = 656

 Score = 30.6 bits (69), Expect = 3.0
 Identities = 23/92 (25%), Positives = 30/92 (32%), Gaps = 5/92 (5%)

Query: 179 SVLDTNTIATAARPGTSLKTAAVTAPALTS-RPRTESGRPVSGVVRPGTLASRGGTLEQS 237
            VL T      ARPG         +P +   R  T  G    G     T++  G      
Sbjct: 304 HVLATACRTLPARPGA---PFYTRSPRIARLRANTIGGTVTVGATHSETVSEEGLGSGAG 360

Query: 238 LKTPRTAKSARPLTSQAARTIRLGTASMLSQP 269
           L   R    A P  + AAR + L   + L   
Sbjct: 361 LPGQRFTLPASPALAGAAR-VALLVQADLGVE 391


>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 250

 Score = 30.1 bits (68), Expect = 3.5
 Identities = 13/51 (25%), Positives = 21/51 (41%)

Query: 607 DTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSP 657
           D   A + L  AL  D S+ L+    A    + G  + A    + A + +P
Sbjct: 50  DYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAP 100


>gnl|CDD|129992 TIGR00914, 2A0601, heavy metal efflux pump, CzcA family.  This
           model represents a family of H+/heavy metal cation
           antiporters. This family is one of several subfamilies
           within the scope of Pfam model pfam00873 [Cellular
           processes, Detoxification, Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 1051

 Score = 30.5 bits (69), Expect = 3.6
 Identities = 15/43 (34%), Positives = 19/43 (44%), Gaps = 10/43 (23%)

Query: 87  PGTSLKTAANDQQPISTSVMSTP---------GSAPTATVDSM 120
           PGTSL  +   QQ +   + S P         G+A  AT D M
Sbjct: 579 PGTSLAQSVAMQQTLEKLIKSFPEVARVFAKTGTAEIAT-DPM 620


>gnl|CDD|221779 pfam12803, G-7-MTase, mRNA (guanine-7-)methyltransferase
           (G-7-MTase).  The Sendai virus RNA-dependent RNA
           polymerase complex, which consists of L and P proteins,
           participates in the synthesis of viral mRNAs that
           possess a methylated cap structure. The N-terminal of
           the L protein acts as the RNA-dependent RNA polymerase
           part of the molecule, family Paramyx_RNA_pol, pfam00946.
           This domain is the C-terminal part of the L protein and
           it catalyzes cap methylation through its mRNA
           (guanine-7-)methyltransferase (G-7-MTase) activity.
          Length = 320

 Score = 30.3 bits (69), Expect = 3.7
 Identities = 26/130 (20%), Positives = 41/130 (31%), Gaps = 45/130 (34%)

Query: 499 YNISHVA---ILNALSTSVYNDQPEVALLFYRRLLQMGL---YNAELFNNL---ALCCFY 549
             IS      + N LS       P++    +RR   +G+    N           L C  
Sbjct: 26  RRISGSVLKVLANTLS------HPKI----FRRFWDLGIIEPINGPYLATQDFVKLSC-- 73

Query: 550 SQQYDMVVTCFERALSLALNENAADVWYNISHVAIITECSPFSFSTHTSYLFIQGISDTR 609
               D++V  + R L+  LN    ++        +I E                 +SD  
Sbjct: 74  ----DLLVWGYRRYLADLLNGIELEI--------LICESDEDV------------LSDRM 109

Query: 610 LAIQCLHLAL 619
             +Q  HL L
Sbjct: 110 ENLQARHLCL 119


>gnl|CDD|197284 cd09188, PLDc_FAM83H_N, N-terminal phospholipase D-like domain of
           the uncharacterized protein, Family with sequence
           similarity 83H.  N-terminal phospholipase D (PLD)-like
           domain of the protein, Family with sequence similarity
           83H (FAM83H) on chromosome 8q24.3, which localizes in
           the intracellular environment and is associated with
           vesicles, can be regulated by kinases, and plays
           important roles during ameloblast differentiation and
           enamel matrix calcification. The gene encoding protein
           FAM83H is the first gene involved in the etiology of
           amelogenesis imperfecta (AI), that encodes a
           non-secreted protein due to the absence of a signal
           peptide. Defects in gene FAM83H cause autosomal dominant
           hypocalcified amelogenesis imperfecta (ADHCAI). Since
           the N-terminal PLD-like domain of FAM83H shows only
           trace similarity to the PLD catalytic domain and lacks
           the functionally important histidine residue, FAM83H may
           share a similar three-dimensional fold with PLD enzymes,
           but is most unlikely to carry PLD activity.
          Length = 265

 Score = 29.8 bits (67), Expect = 3.7
 Identities = 16/41 (39%), Positives = 22/41 (53%), Gaps = 7/41 (17%)

Query: 339 LIREAQQQFNSALNQFTDI-------EAFIRMIRVYIRLDQ 372
           +IR AQQ     ++ FTD+       EA  R + VYI LD+
Sbjct: 127 MIRSAQQVIAVVMDIFTDVDILSELLEAAARRVPVYILLDE 167


>gnl|CDD|202284 pfam02561, FliS, Flagellar protein FliS.  FliS is coded for by the
           FliD operon and is transcribed in conjunction with FliD
           and FliT, however this protein has no known function.
          Length = 122

 Score = 28.6 bits (64), Expect = 4.5
 Identities = 18/71 (25%), Positives = 27/71 (38%), Gaps = 9/71 (12%)

Query: 475 YDMVVTCFERALSLALNENAADVWYNISHVA-ILNALSTSV-YNDQPEVA-------LLF 525
           YD  +    RA     NE+      NIS    I+  L +++      E+A          
Sbjct: 26  YDKAIKALVRAAEFIENEDIEKKNENISKAQNIITELRSTLDEEKGGEIAQNLLAIYTYM 85

Query: 526 YRRLLQMGLYN 536
            +RL+Q  L N
Sbjct: 86  NKRLVQANLKN 96


>gnl|CDD|107029 PHA01351, PHA01351, putative minor structural protein.
          Length = 1070

 Score = 29.9 bits (67), Expect = 5.1
 Identities = 28/110 (25%), Positives = 48/110 (43%), Gaps = 18/110 (16%)

Query: 332 KCYFSLGLIREAQQQFNSALNQFTDIEAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEV 391
           K + ++G+I+E +  F +  NQ   I AF+ +++  +R  Q    ID         P E 
Sbjct: 600 KQFKAIGIIKEYEDAFINFYNQELQISAFLTILKSQLRQFQ----ID---------PKEA 646

Query: 392 TIMTEMARIFEGLNNMPMSVKYYKL-ILKRDATCMEAIACIGVNHFYNDQ 440
               +   I E L N  +  + Y + I K   + +E IA      +Y+ Q
Sbjct: 647 ETELKKLNINEYLANQ-IIQEEYNINIAKLQLSVLETIAKTL---YYDQQ 692


>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
           [Coenzyme metabolism].
          Length = 400

 Score = 29.6 bits (67), Expect = 5.2
 Identities = 15/74 (20%), Positives = 24/74 (32%), Gaps = 6/74 (8%)

Query: 317 KACEFRDWWWKVQ------LGKCYFSLGLIREAQQQFNSALNQFTDIEAFIRMIRVYIRL 370
           KA E               LG+      L  +A +   +AL        +  +     +L
Sbjct: 315 KAAEKWLKQHPEDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSASDYAELADALDQL 374

Query: 371 DQPIRAIDIGRNAL 384
            +P  A  + R AL
Sbjct: 375 GEPEEAEQVRREAL 388


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 29.8 bits (67), Expect = 5.3
 Identities = 12/18 (66%), Positives = 13/18 (72%)

Query: 328 VQLGKCYFSLGLIREAQQ 345
           VQLG C F  GLI+EA Q
Sbjct: 502 VQLGLCAFRQGLIKEAHQ 519


>gnl|CDD|221851 pfam12916, DUF3834, Protein of unknown function (DUF3834).  This
           family is likely to be related to solute-binding
           lipo-proteins.
          Length = 199

 Score = 28.9 bits (65), Expect = 6.1
 Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 7/62 (11%)

Query: 168 DLEAEEEGIADSVLDTNTIATAARPGTSLKTAAVTAPALTS-RPRTESGRPVSGVVRPGT 226
           D+   +EG AD VLD  +  +  + G  L+  AVT   L S  P+  SG+   GV R G+
Sbjct: 23  DIIFSKEGNADVVLD--STVSLVKRG--LRIDAVTIKKLYSIYPKIGSGK--IGVWRKGS 76

Query: 227 LA 228
            A
Sbjct: 77  AA 78


>gnl|CDD|240440 PTZ00493, PTZ00493, phosphomethylpyrimidine kinase; Provisional.
          Length = 321

 Score = 28.8 bits (64), Expect = 10.0
 Identities = 33/162 (20%), Positives = 67/162 (41%), Gaps = 22/162 (13%)

Query: 410 SVKYYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCC 469
           +++Y K  L  D  C   I+CI   +FY    E  ++      QM L  A   N   LC 
Sbjct: 125 NLEYIKFAL--DLIC--PISCIITPNFY----ECKVILEALDCQMDLSKA---NMTELCK 173

Query: 470 FYSQQYDMVVTCFERALSLALNENAADVWYNISHVAILNALST----SVYNDQPEVALLF 525
             +++ + +  C  ++ ++  N    +  Y + H+ I N  S         D   V  L+
Sbjct: 174 LVTEKLN-INACLFKSCNVGENSAEENEVYAVDHLCIRNVGSYPTGEKQQIDAGGVTYLY 232

Query: 526 ------YRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFE 561
                  +R     ++      + A+ C+ +++++++ +C E
Sbjct: 233 DVYKLRSKRKPGKDIHGTGCTLSTAIACYLAKKHNILQSCIE 274


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.131    0.384 

Gapped
Lambda     K      H
   0.267   0.0733    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 33,181,555
Number of extensions: 3202218
Number of successful extensions: 2898
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2866
Number of HSP's successfully gapped: 73
Length of query: 677
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 574
Effective length of database: 6,369,140
Effective search space: 3655886360
Effective search space used: 3655886360
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 62 (27.7 bits)