RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16607
(677 letters)
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 56.6 bits (137), Expect = 4e-10
Identities = 28/107 (26%), Positives = 49/107 (45%), Gaps = 10/107 (9%)
Query: 460 ELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNISHVAILNALSTSVYNDQP 519
E NL + YD + +E+AL L + + AD +YN++ +
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALEL--DPDNADAYYNLAAAYY--------KLGKY 50
Query: 520 EVALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSL 566
E AL Y + L++ NA+ + NL L + +Y+ + +E+AL L
Sbjct: 51 EEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97
Score = 51.6 bits (124), Expect = 3e-08
Identities = 21/96 (21%), Positives = 47/96 (48%)
Query: 393 IMTEMARIFEGLNNMPMSVKYYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLL 452
+ + ++ L + +++YY+ L+ D +A + ++ + E AL Y + L
Sbjct: 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61
Query: 453 QMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSL 488
++ NA+ + NL L + +Y+ + +E+AL L
Sbjct: 62 ELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97
Score = 47.8 bits (114), Expect = 6e-07
Identities = 25/120 (20%), Positives = 43/120 (35%), Gaps = 21/120 (17%)
Query: 538 ELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNISHVAIITECSPFSFSTHT 597
E NL + YD + +E+AL L + + AD +YN++
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALEL--DPDNADAYYNLAAA--------------- 43
Query: 598 SYLFIQGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSP 657
+ G A++ AL +D + + NL + + G E A + A P
Sbjct: 44 --YYKLG--KYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDP 99
Score = 44.7 bits (106), Expect = 9e-06
Identities = 18/71 (25%), Positives = 38/71 (53%), Gaps = 2/71 (2%)
Query: 431 IGVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLAL 490
+G ++ + AL +Y + L++ NA+ + NLA + +Y+ + +E+AL L
Sbjct: 6 LGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALE--L 63
Query: 491 NENAADVWYNI 501
+ + A +YN+
Sbjct: 64 DPDNAKAYYNL 74
Score = 42.4 bits (100), Expect = 5e-05
Identities = 17/58 (29%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 522 ALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNI 579
AL +Y + L++ NA+ + NLA + +Y+ + +E+AL L+ + A +YN+
Sbjct: 19 ALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALE--LDPDNAKAYYNL 74
Score = 41.2 bits (97), Expect = 1e-04
Identities = 20/94 (21%), Positives = 40/94 (42%), Gaps = 1/94 (1%)
Query: 329 QLGKCYFSLGLIREAQQQFNSALNQF-TDIEAFIRMIRVYIRLDQPIRAIDIGRNALDCY 387
LG Y+ LG EA + + AL + +A+ + Y +L + A++ AL+
Sbjct: 5 NLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELD 64
Query: 388 PNEVTIMTEMARIFEGLNNMPMSVKYYKLILKRD 421
P+ + + L +++ Y+ L+ D
Sbjct: 65 PDNAKAYYNLGLAYYKLGKYEEALEAYEKALELD 98
Score = 36.2 bits (84), Expect = 0.006
Identities = 20/96 (20%), Positives = 43/96 (44%)
Query: 358 EAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEVTIMTEMARIFEGLNNMPMSVKYYKLI 417
EA + + +Y +L A++ AL+ P+ +A + L +++ Y+
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKA 60
Query: 418 LKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQ 453
L+ D +A +G+ ++ + E AL Y + L+
Sbjct: 61 LELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96
Score = 35.0 bits (81), Expect = 0.020
Identities = 15/63 (23%), Positives = 25/63 (39%)
Query: 607 DTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSPYLYETHYNQ 666
D A++ AL +D + + NLA + G E A + A P + +YN
Sbjct: 15 DYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNL 74
Query: 667 AVI 669
+
Sbjct: 75 GLA 77
>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
Length = 291
Score = 40.6 bits (93), Expect = 0.002
Identities = 50/272 (18%), Positives = 86/272 (31%), Gaps = 28/272 (10%)
Query: 413 YYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQM--GLYNAELFNNLALCCF 470
+ + K A + + + AL L++ A L LAL
Sbjct: 11 LLEALAKLLAEALALLEAGLALLELLGELAEALELLEEALELLPNSDLAGLLLLLALALL 70
Query: 471 YSQQYDMVVTCFERALSLALNENAADVWYNISHVAILNALSTSVYNDQPEVALLFYRRLL 530
+ + + E+AL L L N A+ N+ + + E AL + L
Sbjct: 71 KLGRLEEALELLEKALELELLPNLAEALLNLGLLLE--------ALGKYEEALELLEKAL 122
Query: 531 QMGLYNAELFNNLALCCFYSQ-QYDMVVTCFERALSLA-LNENAADVWYNISHV------ 582
+ LAL Y Y+ + +E+AL L A+ + +
Sbjct: 123 ALDPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEALLALGALLEALGR 182
Query: 583 -----AIITECSPFSFSTHTSYLFIQGIS-----DTRLAIQCLHLALSIDSSHGLSQNNL 632
++ + + L G+ A++ AL +D + + NL
Sbjct: 183 YEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYNL 242
Query: 633 AVLEAREGHIERASTYLQAAAASSPYLYETHY 664
A+L G E A L+ A P LY
Sbjct: 243 ALLLLELGRYEEALEALEKALELDPDLYNLGL 274
Score = 38.7 bits (88), Expect = 0.007
Identities = 29/169 (17%), Positives = 63/169 (37%), Gaps = 5/169 (2%)
Query: 291 AKYLFEYLYHHENDVASAMDLAVESTKACEFRDWWWKVQLGKCYFSLGLIREAQQQFNSA 350
A L A++L ++ D + + LG EA + + A
Sbjct: 97 ALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALELYEKA 156
Query: 351 LNQFTD----IEAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEVT-IMTEMARIFEGLN 405
L + EA + + + L + A+++ AL P++ + + ++ L
Sbjct: 157 LELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLG 216
Query: 406 NMPMSVKYYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQM 454
+++YY+ L+ D EA+ + + + E AL + L++
Sbjct: 217 KYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALEL 265
Score = 37.1 bits (84), Expect = 0.020
Identities = 24/112 (21%), Positives = 41/112 (36%), Gaps = 1/112 (0%)
Query: 279 LNLAKYARDKTVAKYLFEYLYHHENDVASAMDLAVESTKACEFRDWWWKVQLGKCYFSLG 338
L L + A L A++L ++ K D + LG Y LG
Sbjct: 157 LELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLG 216
Query: 339 LIREAQQQFNSALNQF-TDIEAFIRMIRVYIRLDQPIRAIDIGRNALDCYPN 389
EA + + AL + EA + + + L + A++ AL+ P+
Sbjct: 217 KYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPD 268
Score = 32.9 bits (73), Expect = 0.45
Identities = 28/145 (19%), Positives = 55/145 (37%), Gaps = 4/145 (2%)
Query: 281 LAKYARDKTVAKYLFEYLYHHENDVASAMDLAVESTKA--CEFRDWWWKVQLGKCYFSLG 338
LA L + D A++L ++ + + LG +LG
Sbjct: 122 LALDPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEALLALGALLEALG 181
Query: 339 LIREAQQQFNSAL--NQFTDIEAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEVTIMTE 396
EA + AL N D EA + + +Y++L + A++ AL+ P+ +
Sbjct: 182 RYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYN 241
Query: 397 MARIFEGLNNMPMSVKYYKLILKRD 421
+A + L +++ + L+ D
Sbjct: 242 LALLLLELGRYEEALEALEKALELD 266
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 35.8 bits (83), Expect = 0.005
Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 1/59 (1%)
Query: 432 GVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCCFYS-QQYDMVVTCFERALSLA 489
G F + A+ Y + L++ NAE + NLAL + Y+ + E+AL L
Sbjct: 10 GNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALELD 68
Score = 35.8 bits (83), Expect = 0.005
Identities = 19/74 (25%), Positives = 31/74 (41%), Gaps = 9/74 (12%)
Query: 458 NAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNISHVAILNALSTSVYND 517
NAE NL F YD + +E+A L L+ + A+ +YN++ +
Sbjct: 2 NAEALKNLGNALFKLGDYDEAIEAYEKA--LELDPDNAEAYYNLALAYLKL-------GK 52
Query: 518 QPEVALLFYRRLLQ 531
E AL + L+
Sbjct: 53 DYEEALEDLEKALE 66
Score = 35.0 bits (81), Expect = 0.008
Identities = 21/87 (24%), Positives = 36/87 (41%), Gaps = 20/87 (22%)
Query: 536 NAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWYNISHVAIITECSPFSFST 595
NAE NL F YD + +E+A L L+ + A+ +YN++
Sbjct: 2 NAEALKNLGNALFKLGDYDEAIEAYEKA--LELDPDNAEAYYNLALA------------- 46
Query: 596 HTSYLFIQGISDTRLAIQCLHLALSID 622
+++ D A++ L AL +D
Sbjct: 47 -----YLKLGKDYEEALEDLEKALELD 68
Score = 32.7 bits (75), Expect = 0.068
Identities = 15/51 (29%), Positives = 25/51 (49%), Gaps = 1/51 (1%)
Query: 518 QPEVALLFYRRLLQMGLYNAELFNNLALCCFYS-QQYDMVVTCFERALSLA 567
+ A+ Y + L++ NAE + NLAL + Y+ + E+AL L
Sbjct: 18 DYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALELD 68
Score = 28.4 bits (64), Expect = 2.3
Identities = 12/67 (17%), Positives = 24/67 (35%), Gaps = 1/67 (1%)
Query: 356 DIEAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEVTIMTEMARIFEGLNNMPMS-VKYY 414
+ EA + +L AI+ AL+ P+ +A + L ++
Sbjct: 2 NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61
Query: 415 KLILKRD 421
+ L+ D
Sbjct: 62 EKALELD 68
>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat.
Length = 34
Score = 33.9 bits (79), Expect = 0.010
Identities = 9/30 (30%), Positives = 14/30 (46%)
Query: 459 AELFNNLALCCFYSQQYDMVVTCFERALSL 488
A+ NL +YD + +E+AL L
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALEL 30
Score = 33.9 bits (79), Expect = 0.010
Identities = 9/30 (30%), Positives = 14/30 (46%)
Query: 537 AELFNNLALCCFYSQQYDMVVTCFERALSL 566
A+ NL +YD + +E+AL L
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALEL 30
>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats. Repeats present in 4
or more copies in proteins. Contain a minimum of 34
amino acids each and self-associate via a "knobs and
holes" mechanism.
Length = 34
Score = 32.0 bits (74), Expect = 0.039
Identities = 10/31 (32%), Positives = 13/31 (41%)
Query: 459 AELFNNLALCCFYSQQYDMVVTCFERALSLA 489
AE NL YD + +E+AL L
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELD 31
Score = 32.0 bits (74), Expect = 0.039
Identities = 10/31 (32%), Positives = 13/31 (41%)
Query: 537 AELFNNLALCCFYSQQYDMVVTCFERALSLA 567
AE NL YD + +E+AL L
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELD 31
>gnl|CDD|237610 PRK14099, PRK14099, glycogen synthase; Provisional.
Length = 485
Score = 36.2 bits (84), Expect = 0.050
Identities = 14/33 (42%), Positives = 21/33 (63%)
Query: 175 GIADSVLDTNTIATAARPGTSLKTAAVTAPALT 207
G+AD+V+D N +A A T ++ + VTA AL
Sbjct: 403 GLADTVVDANEMAIATGVATGVQFSPVTADALA 435
>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat. This Pfam entry
includes outlying Tetratricopeptide-like repeats (TPR)
that are not matched by pfam00515.
Length = 34
Score = 30.6 bits (70), Expect = 0.14
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 459 AELFNNLALCCFYSQQYDMVVTCFERALSLA 489
AE NL L + Y+ + +E+AL L
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELD 31
Score = 30.6 bits (70), Expect = 0.14
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 537 AELFNNLALCCFYSQQYDMVVTCFERALSLA 567
AE NL L + Y+ + +E+AL L
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELD 31
>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
PilW. Members of this family are designated PilF in ref
(PMID:8973346) and PilW in ref (PMID:15612916). This
outer membrane protein is required both for pilus
stability and for pilus function such as adherence to
human cells. Members of this family contain copies of
the TPR (tetratricopeptide repeat) domain.
Length = 234
Score = 33.5 bits (77), Expect = 0.26
Identities = 31/133 (23%), Positives = 45/133 (33%), Gaps = 19/133 (14%)
Query: 518 QPEVALLFYRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLALNENAADVWY 577
+ E A +RR L + N ++ NN +Y+ + FE+A+ L A
Sbjct: 80 ELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYEQAMQQFEQAIEDPLYPQPARSLE 139
Query: 578 NISHVAIITECSPFSFSTHTSYLFIQGISDTRLAIQCLHLALSIDSSHGLSQNNLAVLEA 637
N A+ D A + L AL ID S LA L
Sbjct: 140 NAGLCAL-------------------KAGDFDKAEKYLTRALQIDPQRPESLLELAELYY 180
Query: 638 REGHIERASTYLQ 650
G + A YL+
Sbjct: 181 LRGQYKDARAYLE 193
Score = 28.8 bits (65), Expect = 7.6
Identities = 15/59 (25%), Positives = 25/59 (42%)
Query: 607 DTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSPYLYETHYN 665
D +A + L AL D L+ LA+ + G +E+A + A +P + N
Sbjct: 46 DLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNN 104
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 34.1 bits (78), Expect = 0.33
Identities = 27/94 (28%), Positives = 32/94 (34%), Gaps = 7/94 (7%)
Query: 183 TNTIATAARPGTSL---KTAAVTAPALTSRPRTESGRPVSGVVRPGTLASRGGTLEQSLK 239
+ AA P L +A TAP P P+ G V PG R
Sbjct: 2815 AAALPPAASPAGPLPPPTSAQPTAPPPPPGP-PPPSLPLGGSVAPGGDVRRRPPSRSPAA 2873
Query: 240 TPRTAKSARPLTSQAAR-TIRLGTASMLSQPDGP 272
P A ARP + AR + T S PD P
Sbjct: 2874 KP--AAPARPPVRRLARPAVSRSTESFALPPDQP 2905
>gnl|CDD|145533 pfam02437, Ski_Sno, SKI/SNO/DAC family. This family contains a
presumed domain that is about 100 amino acids long. All
members of this family contain a conserved CLPQ motif.
The c-ski proto-oncogene has been shown to influence
proliferation, morphological transformation and myogenic
differentiation. Sno, a Ski proto-oncogene homologue, is
expressed in two isoforms and plays a role in the
response to proliferation stimuli. Dachshund also
contains this domain. It is involved in various aspects
of development.
Length = 101
Score = 31.2 bits (71), Expect = 0.47
Identities = 21/78 (26%), Positives = 29/78 (37%), Gaps = 20/78 (25%)
Query: 345 QQFNSALNQFTDIEAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEVTIMTEMARIFEGL 404
Q FN L F+ + ++ +L LD Y T+ E RI GL
Sbjct: 34 QIFNLFLKHFS-------LHQINTKLK-----------RLDIYCVVCTV--EQLRILRGL 73
Query: 405 NNMPMSVKYYKLILKRDA 422
+P V LI + DA
Sbjct: 74 GAIPPGVNRCGLITRTDA 91
>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
Length = 171
Score = 32.1 bits (73), Expect = 0.59
Identities = 16/56 (28%), Positives = 22/56 (39%), Gaps = 1/56 (1%)
Query: 196 LKTAAVTAPALT-SRPRTESGRPVSGVVRPGTLASRGGTLEQSLKTPRTAKSARPL 250
K AA + P L RT + V+ G+ + T EQ L PR +A
Sbjct: 55 AKPAAASLPRLAEGNFRTMPVQTVTATTPSGSRPAVTATPEQKLLAPRPLFTAARE 110
>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat.
Length = 65
Score = 29.6 bits (67), Expect = 0.63
Identities = 14/52 (26%), Positives = 22/52 (42%)
Query: 607 DTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSPY 658
D A+ L AL+ + L R+G + A+ L+AA A+ P
Sbjct: 12 DYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPD 63
>gnl|CDD|233924 TIGR02552, LcrH_SycD, type III secretion low calcium response
chaperone LcrH/SycD. Genes in this family are found in
type III secretion operons. LcrH, from Yersinia is
believed to have a regulatory function in the
low-calcium response of the secretion system. The same
protein is also known as SycD (SYC = Specific Yop
Chaperone) for its chaperone role. In Pseudomonas, where
the homolog is known as PcrH, the chaperone role has
been demonstrated and the regulatory role appears to be
absent. ScyD/LcrH contains three central
tetratricopeptide-like repeats that are predicted to
fold into an all-alpha-helical array.
Length = 135
Score = 31.1 bits (71), Expect = 0.64
Identities = 18/79 (22%), Positives = 37/79 (46%)
Query: 414 YKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCCFYSQ 473
K +L D+ +E I + N + + + AL ++ L YN+ + LA CC +
Sbjct: 6 LKDLLGLDSEQLEQIYALAYNLYQQGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLK 65
Query: 474 QYDMVVTCFERALSLALNE 492
+Y+ + + A +L ++
Sbjct: 66 EYEEAIDAYALAAALDPDD 84
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 32.7 bits (75), Expect = 0.81
Identities = 29/161 (18%), Positives = 60/161 (37%), Gaps = 1/161 (0%)
Query: 330 LGKCYFSLGLIREAQQQFNSALN-QFTDIEAFIRMIRVYIRLDQPIRAIDIGRNALDCYP 388
LG Y G + +A++ F AL+ + A + R+ I+ P AI L P
Sbjct: 471 LGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDP 530
Query: 389 NEVTIMTEMARIFEGLNNMPMSVKYYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFY 448
+ + +A ++ N +V + + + + +E + + Q + AL
Sbjct: 531 KNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAIL 590
Query: 449 RRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFERALSLA 489
+ E + L + + V+ F++ L+L
Sbjct: 591 NEAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQ 631
>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat.
Length = 78
Score = 29.6 bits (67), Expect = 0.84
Identities = 18/82 (21%), Positives = 29/82 (35%), Gaps = 13/82 (15%)
Query: 456 LYNAELFNNLALCCFYSQQYDMVVTCFERALSLA-----LNENAADVWYNISHVAILNAL 510
A NNLAL YD + E+AL LA + A N++ +
Sbjct: 2 PDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLY----- 56
Query: 511 STSVYNDQPEVALLFYRRLLQM 532
+ + AL + + L +
Sbjct: 57 ---LALGDYDEALEYLEKALAL 75
Score = 28.9 bits (65), Expect = 1.5
Identities = 14/66 (21%), Positives = 23/66 (34%), Gaps = 7/66 (10%)
Query: 431 IGVNHFYNDQPEVALLFYRRLLQM-------GLYNAELFNNLALCCFYSQQYDMVVTCFE 483
+ + + AL + L++ A NNLA YD + E
Sbjct: 11 LALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEALEYLE 70
Query: 484 RALSLA 489
+AL+L
Sbjct: 71 KALALR 76
Score = 27.3 bits (61), Expect = 6.1
Identities = 13/34 (38%), Positives = 15/34 (44%)
Query: 534 LYNAELFNNLALCCFYSQQYDMVVTCFERALSLA 567
A NNLAL YD + E+AL LA
Sbjct: 2 PDLAAALNNLALVLRRLGDYDEALELLEKALELA 35
>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
Length = 463
Score = 32.4 bits (73), Expect = 0.85
Identities = 33/167 (19%), Positives = 61/167 (36%), Gaps = 4/167 (2%)
Query: 81 IATAARPGTSLKTAANDQQPISTSVMSTPGSAPTATVDSMFYAVDHFHKRNFEKCAAICT 140
I A+ G+S Q S S+P + ++ + +H H + +
Sbjct: 7 IRRASTRGSSPSLFGKSLQS-SRVAASSPSLLSGSETGALLHRGNHAHSFHNLALPG-NS 64
Query: 141 ELLAKNPYDKAAWTLKMRSLTSQLSVDDLEAEEEGIADSVLDTNTIATAARPGTSLKTAA 200
+ + +R +S+ S D +EA + +S+ D +PG ++
Sbjct: 65 GISRSASLVSSTLQRWVRPFSSE-SGDTVEAVVPHMGESITDGTLATFLKKPGERVQADE 123
Query: 201 VTAPALTSRPRTESGRPVSGVVRPGTLASRGGTLEQSLKTPRTAKSA 247
A T + + P SGV++ L G T+E K +KS
Sbjct: 124 AIAQIETDKVTIDIASPASGVIQE-FLVKEGDTVEPGTKVAIISKSE 169
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 32.0 bits (72), Expect = 1.2
Identities = 24/107 (22%), Positives = 35/107 (32%), Gaps = 3/107 (2%)
Query: 168 DLEAEEEGIADSVLDTNTIATAARPGTSLKTAAVTAPALTSRPRTESGRPVSGVVRPGTL 227
DL + + V + + A AAV A A T+ + + V+G G
Sbjct: 270 DLGGQYLRVGKCVTPPDALLQPATVSAIPAAAAVAAAAATA--KIMAAEAVAGAAVLGPR 327
Query: 228 ASRGGTLEQSLKTPRTAKS-ARPLTSQAARTIRLGTASMLSQPDGPF 273
A T SL T K+ +A L A+ GP
Sbjct: 328 AQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAVVKPGPM 374
>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat.
Length = 44
Score = 28.3 bits (63), Expect = 1.3
Identities = 10/43 (23%), Positives = 20/43 (46%)
Query: 358 EAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEVTIMTEMARI 400
A + + R + L A+ + R AL P++ + +AR+
Sbjct: 2 AALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44
>gnl|CDD|197279 cd09182, PLDc_FAM83B_N, N-terminal phospholipase D-like domain of
the uncharacterized protein, Family with sequence
similarity 83B. N-terminal phospholipase D (PLD)-like
domain of the uncharacterized protein, Family with
sequence similarity 83B (FAM83B). Since the N-terminal
PLD-like domain of FAM83 proteins shows only trace
similarity to the PLD catalytic domain and lacks the
functionally important histidine residue, FAM83 proteins
may share a similar three-dimensional fold with PLD
enzymes, but are most unlikely to carry PLD activity.
The N-terminus of FAM83B shows high homology to other
FAM83 family members, indicating that FAM83B might have
arisen early in vertebrate evolution by duplication of a
gene in the FAM83 family.
Length = 266
Score = 30.6 bits (69), Expect = 2.8
Identities = 16/40 (40%), Positives = 22/40 (55%), Gaps = 7/40 (17%)
Query: 340 IREAQQQFNSALNQFTD-------IEAFIRMIRVYIRLDQ 372
I+EA+Q A++ FTD +EA R + VYI LD
Sbjct: 126 IQEARQVIAIAMDVFTDVDIFKEVVEASTRGVAVYILLDH 165
>gnl|CDD|233797 TIGR02243, TIGR02243, conserved hypothetical protein, phage
tail-like region. This family consists of a large,
conserved hypothetical protein in phage tail-like
regions of at least six bacterial genomes: Gloeobacter
violaceus PCC 7421, Geobacter sulfurreducens PCA,
Streptomyces coelicolor A3(2), Streptomyces avermitilis
MA-4680, Mesorhizobium loti, and Myxococcus xanthus. The
function is unknown [Mobile and extrachromosomal element
functions, Prophage functions].
Length = 656
Score = 30.6 bits (69), Expect = 3.0
Identities = 23/92 (25%), Positives = 30/92 (32%), Gaps = 5/92 (5%)
Query: 179 SVLDTNTIATAARPGTSLKTAAVTAPALTS-RPRTESGRPVSGVVRPGTLASRGGTLEQS 237
VL T ARPG +P + R T G G T++ G
Sbjct: 304 HVLATACRTLPARPGA---PFYTRSPRIARLRANTIGGTVTVGATHSETVSEEGLGSGAG 360
Query: 238 LKTPRTAKSARPLTSQAARTIRLGTASMLSQP 269
L R A P + AAR + L + L
Sbjct: 361 LPGQRFTLPASPALAGAAR-VALLVQADLGVE 391
>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 250
Score = 30.1 bits (68), Expect = 3.5
Identities = 13/51 (25%), Positives = 21/51 (41%)
Query: 607 DTRLAIQCLHLALSIDSSHGLSQNNLAVLEAREGHIERASTYLQAAAASSP 657
D A + L AL D S+ L+ A + G + A + A + +P
Sbjct: 50 DYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAP 100
>gnl|CDD|129992 TIGR00914, 2A0601, heavy metal efflux pump, CzcA family. This
model represents a family of H+/heavy metal cation
antiporters. This family is one of several subfamilies
within the scope of Pfam model pfam00873 [Cellular
processes, Detoxification, Transport and binding
proteins, Cations and iron carrying compounds].
Length = 1051
Score = 30.5 bits (69), Expect = 3.6
Identities = 15/43 (34%), Positives = 19/43 (44%), Gaps = 10/43 (23%)
Query: 87 PGTSLKTAANDQQPISTSVMSTP---------GSAPTATVDSM 120
PGTSL + QQ + + S P G+A AT D M
Sbjct: 579 PGTSLAQSVAMQQTLEKLIKSFPEVARVFAKTGTAEIAT-DPM 620
>gnl|CDD|221779 pfam12803, G-7-MTase, mRNA (guanine-7-)methyltransferase
(G-7-MTase). The Sendai virus RNA-dependent RNA
polymerase complex, which consists of L and P proteins,
participates in the synthesis of viral mRNAs that
possess a methylated cap structure. The N-terminal of
the L protein acts as the RNA-dependent RNA polymerase
part of the molecule, family Paramyx_RNA_pol, pfam00946.
This domain is the C-terminal part of the L protein and
it catalyzes cap methylation through its mRNA
(guanine-7-)methyltransferase (G-7-MTase) activity.
Length = 320
Score = 30.3 bits (69), Expect = 3.7
Identities = 26/130 (20%), Positives = 41/130 (31%), Gaps = 45/130 (34%)
Query: 499 YNISHVA---ILNALSTSVYNDQPEVALLFYRRLLQMGL---YNAELFNNL---ALCCFY 549
IS + N LS P++ +RR +G+ N L C
Sbjct: 26 RRISGSVLKVLANTLS------HPKI----FRRFWDLGIIEPINGPYLATQDFVKLSC-- 73
Query: 550 SQQYDMVVTCFERALSLALNENAADVWYNISHVAIITECSPFSFSTHTSYLFIQGISDTR 609
D++V + R L+ LN ++ +I E +SD
Sbjct: 74 ----DLLVWGYRRYLADLLNGIELEI--------LICESDEDV------------LSDRM 109
Query: 610 LAIQCLHLAL 619
+Q HL L
Sbjct: 110 ENLQARHLCL 119
>gnl|CDD|197284 cd09188, PLDc_FAM83H_N, N-terminal phospholipase D-like domain of
the uncharacterized protein, Family with sequence
similarity 83H. N-terminal phospholipase D (PLD)-like
domain of the protein, Family with sequence similarity
83H (FAM83H) on chromosome 8q24.3, which localizes in
the intracellular environment and is associated with
vesicles, can be regulated by kinases, and plays
important roles during ameloblast differentiation and
enamel matrix calcification. The gene encoding protein
FAM83H is the first gene involved in the etiology of
amelogenesis imperfecta (AI), that encodes a
non-secreted protein due to the absence of a signal
peptide. Defects in gene FAM83H cause autosomal dominant
hypocalcified amelogenesis imperfecta (ADHCAI). Since
the N-terminal PLD-like domain of FAM83H shows only
trace similarity to the PLD catalytic domain and lacks
the functionally important histidine residue, FAM83H may
share a similar three-dimensional fold with PLD enzymes,
but is most unlikely to carry PLD activity.
Length = 265
Score = 29.8 bits (67), Expect = 3.7
Identities = 16/41 (39%), Positives = 22/41 (53%), Gaps = 7/41 (17%)
Query: 339 LIREAQQQFNSALNQFTDI-------EAFIRMIRVYIRLDQ 372
+IR AQQ ++ FTD+ EA R + VYI LD+
Sbjct: 127 MIRSAQQVIAVVMDIFTDVDILSELLEAAARRVPVYILLDE 167
>gnl|CDD|202284 pfam02561, FliS, Flagellar protein FliS. FliS is coded for by the
FliD operon and is transcribed in conjunction with FliD
and FliT, however this protein has no known function.
Length = 122
Score = 28.6 bits (64), Expect = 4.5
Identities = 18/71 (25%), Positives = 27/71 (38%), Gaps = 9/71 (12%)
Query: 475 YDMVVTCFERALSLALNENAADVWYNISHVA-ILNALSTSV-YNDQPEVA-------LLF 525
YD + RA NE+ NIS I+ L +++ E+A
Sbjct: 26 YDKAIKALVRAAEFIENEDIEKKNENISKAQNIITELRSTLDEEKGGEIAQNLLAIYTYM 85
Query: 526 YRRLLQMGLYN 536
+RL+Q L N
Sbjct: 86 NKRLVQANLKN 96
>gnl|CDD|107029 PHA01351, PHA01351, putative minor structural protein.
Length = 1070
Score = 29.9 bits (67), Expect = 5.1
Identities = 28/110 (25%), Positives = 48/110 (43%), Gaps = 18/110 (16%)
Query: 332 KCYFSLGLIREAQQQFNSALNQFTDIEAFIRMIRVYIRLDQPIRAIDIGRNALDCYPNEV 391
K + ++G+I+E + F + NQ I AF+ +++ +R Q ID P E
Sbjct: 600 KQFKAIGIIKEYEDAFINFYNQELQISAFLTILKSQLRQFQ----ID---------PKEA 646
Query: 392 TIMTEMARIFEGLNNMPMSVKYYKL-ILKRDATCMEAIACIGVNHFYNDQ 440
+ I E L N + + Y + I K + +E IA +Y+ Q
Sbjct: 647 ETELKKLNINEYLANQ-IIQEEYNINIAKLQLSVLETIAKTL---YYDQQ 692
>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
[Coenzyme metabolism].
Length = 400
Score = 29.6 bits (67), Expect = 5.2
Identities = 15/74 (20%), Positives = 24/74 (32%), Gaps = 6/74 (8%)
Query: 317 KACEFRDWWWKVQ------LGKCYFSLGLIREAQQQFNSALNQFTDIEAFIRMIRVYIRL 370
KA E LG+ L +A + +AL + + +L
Sbjct: 315 KAAEKWLKQHPEDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSASDYAELADALDQL 374
Query: 371 DQPIRAIDIGRNAL 384
+P A + R AL
Sbjct: 375 GEPEEAEQVRREAL 388
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 29.8 bits (67), Expect = 5.3
Identities = 12/18 (66%), Positives = 13/18 (72%)
Query: 328 VQLGKCYFSLGLIREAQQ 345
VQLG C F GLI+EA Q
Sbjct: 502 VQLGLCAFRQGLIKEAHQ 519
>gnl|CDD|221851 pfam12916, DUF3834, Protein of unknown function (DUF3834). This
family is likely to be related to solute-binding
lipo-proteins.
Length = 199
Score = 28.9 bits (65), Expect = 6.1
Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 7/62 (11%)
Query: 168 DLEAEEEGIADSVLDTNTIATAARPGTSLKTAAVTAPALTS-RPRTESGRPVSGVVRPGT 226
D+ +EG AD VLD + + + G L+ AVT L S P+ SG+ GV R G+
Sbjct: 23 DIIFSKEGNADVVLD--STVSLVKRG--LRIDAVTIKKLYSIYPKIGSGK--IGVWRKGS 76
Query: 227 LA 228
A
Sbjct: 77 AA 78
>gnl|CDD|240440 PTZ00493, PTZ00493, phosphomethylpyrimidine kinase; Provisional.
Length = 321
Score = 28.8 bits (64), Expect = 10.0
Identities = 33/162 (20%), Positives = 67/162 (41%), Gaps = 22/162 (13%)
Query: 410 SVKYYKLILKRDATCMEAIACIGVNHFYNDQPEVALLFYRRLLQMGLYNAELFNNLALCC 469
+++Y K L D C I+CI +FY E ++ QM L A N LC
Sbjct: 125 NLEYIKFAL--DLIC--PISCIITPNFY----ECKVILEALDCQMDLSKA---NMTELCK 173
Query: 470 FYSQQYDMVVTCFERALSLALNENAADVWYNISHVAILNALST----SVYNDQPEVALLF 525
+++ + + C ++ ++ N + Y + H+ I N S D V L+
Sbjct: 174 LVTEKLN-INACLFKSCNVGENSAEENEVYAVDHLCIRNVGSYPTGEKQQIDAGGVTYLY 232
Query: 526 ------YRRLLQMGLYNAELFNNLALCCFYSQQYDMVVTCFE 561
+R ++ + A+ C+ +++++++ +C E
Sbjct: 233 DVYKLRSKRKPGKDIHGTGCTLSTAIACYLAKKHNILQSCIE 274
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.131 0.384
Gapped
Lambda K H
0.267 0.0733 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 33,181,555
Number of extensions: 3202218
Number of successful extensions: 2898
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2866
Number of HSP's successfully gapped: 73
Length of query: 677
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 574
Effective length of database: 6,369,140
Effective search space: 3655886360
Effective search space used: 3655886360
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 62 (27.7 bits)