RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1039
(206 letters)
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 70.5 bits (173), Expect = 3e-16
Identities = 35/94 (37%), Positives = 52/94 (55%), Gaps = 4/94 (4%)
Query: 10 NKLKESGNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIED 69
L GN +K GDY+ AL++Y KAL++ + A N AA Y K + ++ +ED
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALEL----DPDNADAYYNLAAAYYKLGKYEEALED 56
Query: 70 CSKSLEIVPDDPKALFRRCQAYEAIGKFEEAYTD 103
K+LE+ PD+ KA + AY +GK+EEA
Sbjct: 57 YEKALELDPDNAKAYYNLGLAYYKLGKYEEALEA 90
Score = 44.3 bits (105), Expect = 2e-06
Identities = 22/66 (33%), Positives = 34/66 (51%)
Query: 49 CLKNRAAVYLKQNQNDKVIEDCSKSLEIVPDDPKALFRRCQAYEAIGKFEEAYTDAKHIH 108
L N +Y K D+ +E K+LE+ PD+ A + AY +GK+EEA D +
Sbjct: 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61
Query: 109 RVEPTN 114
++P N
Sbjct: 62 ELDPDN 67
Score = 37.4 bits (87), Expect = 6e-04
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 4/60 (6%)
Query: 20 FKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLEIVPD 79
+K G YE AL+ Y KAL++ + A N Y K + ++ +E K+LE+ P+
Sbjct: 45 YKLGKYEEALEDYEKALELDPDN----AKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 69.3 bits (169), Expect = 7e-14
Identities = 40/138 (28%), Positives = 65/138 (47%), Gaps = 15/138 (10%)
Query: 11 KLKESGNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDC 70
KLKE GN A++ D+ A+ Y+KA+ E NRAA + +KV+ED
Sbjct: 129 KLKEKGNKAYRNKDFNKAIKLYSKAI-----ECKPDPVYYSNRAACHNALGDWEKVVEDT 183
Query: 71 SKSLEIVPDDPKALFRRCQAYEAIGKFEEAYTDAKHIHRVEP-TNKAIQPVLSRLFAIVT 129
+ +LE+ PD KAL RR AY+ +GK+ +A D ++ N+ + RL
Sbjct: 184 TAALELDPDYSKALNRRANAYDGLGKYADALLDLTASCIIDGFRNEQSAQAVERLL---- 239
Query: 130 KRMQENEQLQNKVHNMFK 147
+ ++K + +
Sbjct: 240 -----KKFAESKAKEILE 252
Score = 32.7 bits (74), Expect = 0.13
Identities = 27/115 (23%), Positives = 49/115 (42%), Gaps = 12/115 (10%)
Query: 8 DYNKLKE--------SGNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLK 59
D KL E G +G + AL +K++++ + RA++ L+
Sbjct: 322 DLGKLGEKEAIALNLRGTFKCLKGKHLEALADLSKSIELDPRVTQSYI----KRASMNLE 377
Query: 60 QNQNDKVIEDCSKSLEIVPDDPKALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTN 114
DK ED K+L++ +DP + R Q + G+F +A D + ++P
Sbjct: 378 LGDPDKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDF 432
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 50.8 bits (122), Expect = 4e-09
Identities = 26/73 (35%), Positives = 42/73 (57%), Gaps = 5/73 (6%)
Query: 7 NDYNKLKESGNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQN-DK 65
++ LK GN+ FK GDY+ A++ Y KAL++ + A N A YLK ++ ++
Sbjct: 1 DNAEALKNLGNALFKLGDYDEAIEAYEKALEL----DPDNAEAYYNLALAYLKLGKDYEE 56
Query: 66 VIEDCSKSLEIVP 78
+ED K+LE+ P
Sbjct: 57 ALEDLEKALELDP 69
Score = 50.4 bits (121), Expect = 7e-09
Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 1/69 (1%)
Query: 45 ERATCLKNRAAVYLKQNQNDKVIEDCSKSLEIVPDDPKALFRRCQAYEAIGK-FEEAYTD 103
+ A LKN K D+ IE K+LE+ PD+ +A + AY +GK +EEA D
Sbjct: 1 DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALED 60
Query: 104 AKHIHRVEP 112
+ ++P
Sbjct: 61 LEKALELDP 69
>gnl|CDD|179448 PRK02603, PRK02603, photosystem I assembly protein Ycf3;
Provisional.
Length = 172
Score = 43.9 bits (104), Expect = 1e-05
Identities = 25/70 (35%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 15 SGNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSL 74
G SA G+Y AL+ Y +ALK+ E+ ++R+ L N +Y ++DK +E ++L
Sbjct: 41 DGMSAQADGEYAEALENYEEALKLE-EDPNDRSYILYNMGIIYASNGEHDKALEYYHQAL 99
Query: 75 EIVPDDPKAL 84
E+ P P AL
Sbjct: 100 ELNPKQPSAL 109
>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat.
Length = 78
Score = 41.6 bits (98), Expect = 1e-05
Identities = 20/64 (31%), Positives = 31/64 (48%), Gaps = 3/64 (4%)
Query: 16 GNSAFKQGDYETALDFYTKALKVTA---EESHERATCLKNRAAVYLKQNQNDKVIEDCSK 72
+ GDY+ AL+ KAL++ E+ E A L N A +YL D+ +E K
Sbjct: 12 ALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEALEYLEK 71
Query: 73 SLEI 76
+L +
Sbjct: 72 ALAL 75
Score = 30.0 bits (68), Expect = 0.19
Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 7/63 (11%)
Query: 45 ERATCLKNRAAVYLKQNQNDKVIEDCSKSLEIV----PDDPK---ALFRRCQAYEAIGKF 97
+ A L N A V + D+ +E K+LE+ D P+ AL + Y A+G +
Sbjct: 3 DLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDY 62
Query: 98 EEA 100
+EA
Sbjct: 63 DEA 65
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 39.3 bits (92), Expect = 0.001
Identities = 25/111 (22%), Positives = 40/111 (36%), Gaps = 4/111 (3%)
Query: 16 GNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLE 75
G + GD A+ + K L + + A L A Y K I ++LE
Sbjct: 608 GRAQLAAGDLNKAVSSFKKLLALQPDS----ALALLLLADAYAVMKNYAKAITSLKRALE 663
Query: 76 IVPDDPKALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTNKAIQPVLSRLFA 126
+ PD+ +A Q A + E A AK + + P + L+
Sbjct: 664 LKPDNTEAQIGLAQLLLAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYL 714
Score = 38.9 bits (91), Expect = 0.001
Identities = 31/96 (32%), Positives = 48/96 (50%), Gaps = 4/96 (4%)
Query: 5 NMNDYNKLKESGNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQND 64
L +G S ++ G+ E A + + LK A SH+ L A++ L+ + D
Sbjct: 291 APEYLPALLLAGASEYQLGNLEQAYQYLNQILKY-APNSHQARRLL---ASIQLRLGRVD 346
Query: 65 KVIEDCSKSLEIVPDDPKALFRRCQAYEAIGKFEEA 100
+ I S +L + PDDP AL +AY A+G FE+A
Sbjct: 347 EAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKA 382
Score = 32.7 bits (75), Expect = 0.13
Identities = 16/99 (16%), Positives = 35/99 (35%), Gaps = 4/99 (4%)
Query: 16 GNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLE 75
G+ G+ E AL Y KA+ + A + ++ + ++ + L+
Sbjct: 200 GDLLLSLGNIELALAAYRKAIALRPNNIAVLL----ALATILIEAGEFEEAEKHADALLK 255
Query: 76 IVPDDPKALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTN 114
P+ P A + + +E+A + + P
Sbjct: 256 KAPNSPLAHYLKALVDFQKKNYEDARETLQDALKSAPEY 294
Score = 32.0 bits (73), Expect = 0.21
Identities = 21/105 (20%), Positives = 37/105 (35%), Gaps = 11/105 (10%)
Query: 16 GNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYL---KQNQNDKVIEDCSK 72
G+ +Q DY A+ Y KALK + L + K +E
Sbjct: 710 GDLYLRQKDYPAAIQAYRKALKRAPS-----SQNAIKLHRALLASGNTAEAVKTLEA--- 761
Query: 73 SLEIVPDDPKALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTNKAI 117
L+ P+D + Y A +++A + + + P N +
Sbjct: 762 WLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAVV 806
Score = 30.8 bits (70), Expect = 0.57
Identities = 17/83 (20%), Positives = 32/83 (38%), Gaps = 4/83 (4%)
Query: 22 QGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLEIVPDDP 81
+ ++ A + L TA+ + A LK + L + + K++ + P++
Sbjct: 172 ENRFDEARALIDEVL--TADPGNVDALLLK--GDLLLSLGNIELALAAYRKAIALRPNNI 227
Query: 82 KALFRRCQAYEAIGKFEEAYTDA 104
L G+FEEA A
Sbjct: 228 AVLLALATILIEAGEFEEAEKHA 250
Score = 29.3 bits (66), Expect = 1.5
Identities = 24/109 (22%), Positives = 43/109 (39%), Gaps = 4/109 (3%)
Query: 18 SAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLEIV 77
S + G ++ AL A K ++ + A+ A+YL + K E K+L I
Sbjct: 440 SYLRSGQFDKAL----AAAKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIE 495
Query: 78 PDDPKALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTNKAIQPVLSRLFA 126
PD A + G ++A + + ++P N L+ L+
Sbjct: 496 PDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYL 544
Score = 28.5 bits (64), Expect = 2.7
Identities = 25/121 (20%), Positives = 43/121 (35%), Gaps = 10/121 (8%)
Query: 16 GNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLE 75
G + G E A Y +AL + A + L +N+ D+ + L
Sbjct: 132 GLAYLGLGQLELAQKSYEQALAIDPR----SLYAKLGLAQLALAENRFDEARALIDEVLT 187
Query: 76 IVPDDPKALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTNKAIQPVLSRLFAIVTKRMQEN 135
P + AL + ++G E A + + P N A+ L A+ T ++
Sbjct: 188 ADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAV------LLALATILIEAG 241
Query: 136 E 136
E
Sbjct: 242 E 242
>gnl|CDD|215568 PLN03088, PLN03088, SGT1, suppressor of G2 allele of SKP1;
Provisional.
Length = 356
Score = 38.6 bits (90), Expect = 0.001
Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 4/91 (4%)
Query: 10 NKLKESGNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIED 69
L++ AF D+ A+D YT+A+ + A +RA +K + + D
Sbjct: 3 KDLEDKAKEAFVDDDFALAVDLYTQAI----DLDPNNAELYADRAQANIKLGNFTEAVAD 58
Query: 70 CSKSLEIVPDDPKALFRRCQAYEAIGKFEEA 100
+K++E+ P KA R+ A + +++ A
Sbjct: 59 ANKAIELDPSLAKAYLRKGTACMKLEEYQTA 89
>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat.
Length = 34
Score = 34.7 bits (81), Expect = 0.001
Identities = 12/34 (35%), Positives = 19/34 (55%)
Query: 47 ATCLKNRAAVYLKQNQNDKVIEDCSKSLEIVPDD 80
A L N YLK + D+ +E K+LE+ P++
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
Score = 32.8 bits (76), Expect = 0.007
Identities = 11/23 (47%), Positives = 17/23 (73%)
Query: 16 GNSAFKQGDYETALDFYTKALKV 38
GN+ K G Y+ AL++Y KAL++
Sbjct: 8 GNAYLKLGKYDEALEYYEKALEL 30
Score = 26.6 bits (60), Expect = 1.2
Identities = 11/34 (32%), Positives = 17/34 (50%)
Query: 81 PKALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTN 114
KAL+ AY +GK++EA + + P N
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats. Repeats present in 4
or more copies in proteins. Contain a minimum of 34
amino acids each and self-associate via a "knobs and
holes" mechanism.
Length = 34
Score = 34.3 bits (80), Expect = 0.002
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 47 ATCLKNRAAVYLKQNQNDKVIEDCSKSLEIVPDD 80
A L N YLK D+ +E K+LE+ P++
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34
Score = 32.4 bits (75), Expect = 0.009
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 16 GNSAFKQGDYETALDFYTKALKV 38
GN+ K GDY+ AL++Y KAL++
Sbjct: 8 GNAYLKLGDYDEALEYYEKALEL 30
>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat.
Length = 65
Score = 35.0 bits (81), Expect = 0.003
Identities = 17/66 (25%), Positives = 24/66 (36%), Gaps = 4/66 (6%)
Query: 16 GNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLE 75
+A + GDY+ AL AL A L L+Q + + +L
Sbjct: 4 ARAALRAGDYDEALAALEAALA----RYPLAAEALLLLGEALLRQGRLAEAAALLRAALA 59
Query: 76 IVPDDP 81
PDDP
Sbjct: 60 ADPDDP 65
Score = 25.7 bits (57), Expect = 5.2
Identities = 11/48 (22%), Positives = 18/48 (37%)
Query: 53 RAAVYLKQNQNDKVIEDCSKSLEIVPDDPKALFRRCQAYEAIGKFEEA 100
A L+ D+ + +L P +AL +A G+ EA
Sbjct: 3 LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEA 50
>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
Length = 291
Score = 36.7 bits (83), Expect = 0.004
Identities = 34/121 (28%), Positives = 57/121 (47%), Gaps = 2/121 (1%)
Query: 7 NDYNKLKESGNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKV 66
D + + + ++ GDYE AL+ Y KAL+ E +E A L A+ + ++
Sbjct: 128 PDLAEALLALGALYELGDYEEALELYEKALE-LDPELNELAEALLALGALLEALGRYEEA 186
Query: 67 IEDCSKSLEIVPD-DPKALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTNKAIQPVLSRLF 125
+E K+L++ PD D +AL Y +GK+EEA + ++P N L+ L
Sbjct: 187 LELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYNLALLL 246
Query: 126 A 126
Sbjct: 247 L 247
Score = 35.6 bits (80), Expect = 0.010
Identities = 30/112 (26%), Positives = 51/112 (45%), Gaps = 3/112 (2%)
Query: 1 MIDNNMNDYNKLKESGNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQ 60
+ L G G YE AL+ KALK+ ++ E L N +YLK
Sbjct: 159 LDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAE---ALLNLGLLYLKL 215
Query: 61 NQNDKVIEDCSKSLEIVPDDPKALFRRCQAYEAIGKFEEAYTDAKHIHRVEP 112
+ ++ +E K+LE+ PD+ +AL+ +G++EEA + ++P
Sbjct: 216 GKYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDP 267
>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
PilW. Members of this family are designated PilF in ref
(PMID:8973346) and PilW in ref (PMID:15612916). This
outer membrane protein is required both for pilus
stability and for pilus function such as adherence to
human cells. Members of this family contain copies of
the TPR (tetratricopeptide repeat) domain.
Length = 234
Score = 33.8 bits (78), Expect = 0.036
Identities = 25/108 (23%), Positives = 49/108 (45%), Gaps = 3/108 (2%)
Query: 7 NDYNKLKESGNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKV 66
N+ + L G +QG YE A+ + +A++ R+ L+N LK DK
Sbjct: 97 NNGDVLNNYGTFLCQQGKYEQAMQQFEQAIEDPLYPQPARS--LENAGLCALKAGDFDKA 154
Query: 67 IEDCSKSLEIVPDDPKALFRRCQAYEAIGKFEEA-YTDAKHIHRVEPT 113
+ +++L+I P P++L + Y G++++A ++ T
Sbjct: 155 EKYLTRALQIDPQRPESLLELAELYYLRGQYKDARAYLERYQQTYNQT 202
Score = 32.7 bits (75), Expect = 0.087
Identities = 19/97 (19%), Positives = 41/97 (42%), Gaps = 11/97 (11%)
Query: 20 FKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLEIVPD 79
+QGD E A + KAL+ ++ A Y + + +K + ++L + P+
Sbjct: 42 LEQGDLEVAKENLDKALEHDPDDYLAYLA----LALYYQQLGELEKAEDSFRRALTLNPN 97
Query: 80 DPKA-------LFRRCQAYEAIGKFEEAYTDAKHIHR 109
+ L ++ + +A+ +FE+A D +
Sbjct: 98 NGDVLNNYGTFLCQQGKYEQAMQQFEQAIEDPLYPQP 134
Score = 29.2 bits (66), Expect = 1.1
Identities = 16/47 (34%), Positives = 24/47 (51%)
Query: 54 AAVYLKQNQNDKVIEDCSKSLEIVPDDPKALFRRCQAYEAIGKFEEA 100
A YL+Q + E+ K+LE PDD A Y+ +G+ E+A
Sbjct: 38 ALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKA 84
>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat. This Pfam entry
includes outlying Tetratricopeptide-like repeats (TPR)
that are not matched by pfam00515.
Length = 34
Score = 29.8 bits (68), Expect = 0.092
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 16 GNSAFKQGDYETALDFYTKALK 37
G + +K GDYE AL+ Y KAL+
Sbjct: 8 GLAYYKLGDYEEALEAYEKALE 29
Score = 27.1 bits (61), Expect = 0.80
Identities = 10/34 (29%), Positives = 17/34 (50%)
Query: 47 ATCLKNRAAVYLKQNQNDKVIEDCSKSLEIVPDD 80
A L N Y K ++ +E K+LE+ P++
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34
Score = 27.1 bits (61), Expect = 0.89
Identities = 10/34 (29%), Positives = 17/34 (50%)
Query: 81 PKALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTN 114
+AL+ AY +G +EEA + ++P N
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34
>gnl|CDD|205550 pfam13371, TPR_9, Tetratricopeptide repeat.
Length = 73
Score = 28.7 bits (65), Expect = 0.43
Identities = 17/75 (22%), Positives = 32/75 (42%), Gaps = 3/75 (4%)
Query: 50 LKNRAAVYLKQNQNDKVIEDCSKSLEIVPDDPKALFRRCQAYEAIGKFEEAYTDAKHIHR 109
LK YL++ ++ + + L + PDDP R Y +G F+ A D ++
Sbjct: 1 LKAI---YLREEDFERALAVVERLLLLAPDDPYERRDRGLLYAQLGCFQAALADLEYFLE 57
Query: 110 VEPTNKAIQPVLSRL 124
+ P + + +L
Sbjct: 58 LCPDAPDAERIREQL 72
>gnl|CDD|223423 COG0346, GloA, Lactoylglutathione lyase and related lyases [Amino
acid transport and metabolism].
Length = 138
Score = 30.1 bits (67), Expect = 0.44
Identities = 12/102 (11%), Positives = 26/102 (25%), Gaps = 9/102 (8%)
Query: 24 DYETALDFYTKALKVT-AEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLEIVPDDPK 82
D E ++DFYT L + +++ A ++L + + +
Sbjct: 12 DLEASIDFYTDVLGLRLVKDTVNEADDGGGYHLLFLDGDGGPGELLA--------FFGFE 63
Query: 83 ALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTNKAIQPVLSRL 124
+ H+ L+ L
Sbjct: 64 GRAGTGFVGDVALGVPGGDLGLGHLAFEVDDEAFGDAALAFL 105
>gnl|CDD|187833 cd09702, Csx1_III-U, CRISPR/Cas system-associated protein Csx1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Protein of this family often fused to HTH domain; Some
proteins could have an additional fusion with
RecB-family nuclease domain; Core domain appears to have
a Rossmann-like fold; loosely associated with CRISPR/Cas
systems; also known as TIGR02710 family.
Length = 378
Score = 30.6 bits (69), Expect = 0.55
Identities = 25/115 (21%), Positives = 42/115 (36%), Gaps = 5/115 (4%)
Query: 19 AFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLEI-V 77
AF GDYETAL L+ E+H + YL ++ + E+ L +
Sbjct: 141 AFNAGDYETALAILENLLRRVLAENHTFYEAMIKLTRAYLHWDRFE--HEEALDYLNSDL 198
Query: 78 PDDP-KALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTNKAIQPVLSRLFAIVTKR 131
+ +A Q + + D + R + +P+L L A +R
Sbjct: 199 VNSLEEAEDVLKQNKSIVHEIASISLDQQAS-RNGARKQKYRPLLPDLLANAERR 252
>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 250
Score = 30.1 bits (68), Expect = 0.63
Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 2/64 (3%)
Query: 21 KQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLEIVPDD 80
QG E A+ + +AL A E + L+N LK Q D+ E ++LE+ P
Sbjct: 115 AQGRPEEAMQQFERALADPA--YGEPSDTLENLGLCALKAGQFDQAEEYLKRALELDPQF 172
Query: 81 PKAL 84
P AL
Sbjct: 173 PPAL 176
Score = 28.5 bits (64), Expect = 2.1
Identities = 26/82 (31%), Positives = 37/82 (45%), Gaps = 4/82 (4%)
Query: 20 FKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLEIVPD 79
+QGDY A KAL+ + S+ A + RA Y K +ND E K+L + P+
Sbjct: 46 LQQGDYAQAKKNLEKALE--HDPSYYLAHLV--RAHYYQKLGENDLADESYRKALSLAPN 101
Query: 80 DPKALFRRCQAYEAIGKFEEAY 101
+ L A G+ EEA
Sbjct: 102 NGDVLNNYGAFLCAQGRPEEAM 123
>gnl|CDD|176974 CHL00033, ycf3, photosystem I assembly protein Ycf3.
Length = 168
Score = 29.6 bits (67), Expect = 0.67
Identities = 20/69 (28%), Positives = 38/69 (55%), Gaps = 1/69 (1%)
Query: 16 GNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLE 75
G SA +G+Y AL Y +A+++ + ++R+ L N ++ ++ K +E ++LE
Sbjct: 42 GMSAQSEGEYAEALQNYYEAMRLE-IDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALE 100
Query: 76 IVPDDPKAL 84
P P+AL
Sbjct: 101 RNPFLPQAL 109
>gnl|CDD|235289 PRK04350, PRK04350, thymidine phosphorylase; Provisional.
Length = 490
Score = 29.8 bits (68), Expect = 0.97
Identities = 14/40 (35%), Positives = 22/40 (55%), Gaps = 3/40 (7%)
Query: 147 KYVFDTSAPMDKRVTAVNN--LVVLAREMSGAEMLLKSGV 184
+ D +AP D VTA++N L +AR +GA +G+
Sbjct: 403 DHTHDVTAPRDGYVTAIDNRRLARIARL-AGAPKDKGAGI 441
>gnl|CDD|221958 pfam13176, TPR_7, Tetratricopeptide repeat.
Length = 36
Score = 26.4 bits (59), Expect = 1.5
Identities = 9/26 (34%), Positives = 13/26 (50%)
Query: 21 KQGDYETALDFYTKALKVTAEESHER 46
K GDYE A+ Y +AL + +
Sbjct: 11 KLGDYEKAISLYERALALAKDPEDRE 36
>gnl|CDD|226687 COG4235, COG4235, Cytochrome c biogenesis factor [Posttranslational
modification, protein turnover, chaperones].
Length = 287
Score = 28.9 bits (65), Expect = 1.7
Identities = 27/128 (21%), Positives = 49/128 (38%), Gaps = 7/128 (5%)
Query: 16 GNSAFKQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQND--KVIEDCSKS 73
G + G AL Y AL++ A ++ E L A+Y + Q K ++
Sbjct: 163 GRAYMALGRASDALLAYRNALRL-AGDNPEILLGLAE--ALYYQAGQQMTAKARALLRQA 219
Query: 74 LEIVPDDPKALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTNKAIQPVLSRLFAIVTKRMQ 133
L + P + +AL A G + EA + + + P + + ++ R +I Q
Sbjct: 220 LALDPANIRALSLLAFAAFEQGDYAEAAAAWQMLLDLLPADDPRRSLIER--SIARALAQ 277
Query: 134 ENEQLQNK 141
+ Q
Sbjct: 278 RSAQGPAA 285
>gnl|CDD|233137 TIGR00811, sit, silicon transporter. Marine diatoms such as
Cylindrotheca fusiformis encode at least six silicon
transport protein homologues which exhibit similar size
and topology. One characterized member of the family
(Sit1) functions in the energy-dependent uptake of
either Silicic acid [Si(OH)4] or Silicate [Si(OH)3O-] by
a Na+ symport mechanism. The system is found in marine
diatoms which make their "glass houses" out of silicon
[Transport and binding proteins, Other].
Length = 545
Score = 28.8 bits (64), Expect = 2.2
Identities = 12/44 (27%), Positives = 20/44 (45%)
Query: 5 NMNDYNKLKESGNSAFKQGDYETALDFYTKALKVTAEESHERAT 48
N+ + ++ + S ++ +E AL KAL E E AT
Sbjct: 490 NIRELREMINNAISDEEKTTFEAALAIEVKALDKLNAEEEEEAT 533
>gnl|CDD|211352 cd07255, Glo_EDI_BRP_like_12, This conserved domain belongs to a
superfamily including the bleomycin resistance protein,
glyoxalase I, and type I ring-cleaving dioxygenases.
This protein family belongs to a conserved domain
superfamily that is found in a variety of structurally
related metalloproteins, including the bleomycin
resistance protein, glyoxalase I, and type I
ring-cleaving dioxygenases. A bound metal ion is
required for protein activities for the members of this
superfamily. A variety of metal ions have been found in
the catalytic centers of these proteins including
Fe(II), Mn(II), Zn(II), Ni(II) and Mg(II). The protein
superfamily contains members with or without domain
swapping.
Length = 125
Score = 27.5 bits (62), Expect = 2.5
Identities = 11/49 (22%), Positives = 19/49 (38%), Gaps = 3/49 (6%)
Query: 16 GNSAFKQGDYETALDFYTKALKVTAEESHERATCL---KNRAAVYLKQN 61
G + D E +L FY L + E + L R + L+++
Sbjct: 4 GAVTLRVADLERSLAFYQDVLGLEVLERTDSTAVLGTGGKRPLLVLEED 52
>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat.
Length = 34
Score = 25.6 bits (57), Expect = 3.1
Identities = 8/31 (25%), Positives = 18/31 (58%)
Query: 70 CSKSLEIVPDDPKALFRRCQAYEAIGKFEEA 100
K+LE+ P++ +A + +G+++EA
Sbjct: 2 YEKALELDPNNAEAYYNLALLLLNLGQYDEA 32
>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat.
Length = 44
Score = 25.6 bits (56), Expect = 3.6
Identities = 12/44 (27%), Positives = 21/44 (47%)
Query: 81 PKALFRRCQAYEAIGKFEEAYTDAKHIHRVEPTNKAIQPVLSRL 124
P AL +A A+G +EA + ++P + +L+RL
Sbjct: 1 PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44
>gnl|CDD|227314 COG4981, COG4981, Enoyl reductase domain of yeast-type FAS1 [Lipid
metabolism].
Length = 717
Score = 28.3 bits (63), Expect = 3.9
Identities = 24/112 (21%), Positives = 42/112 (37%), Gaps = 16/112 (14%)
Query: 70 CSKSL-EIVPDDPKALFRRCQAYEAIGKFEEAY-------TDAKHIHR------VEPTNK 115
+ L E+ D P RR + EA+ K + Y T A+ + R E +
Sbjct: 314 AGRLLDEVAGDLPAVAARRDEIIEALNKTAKPYFGDVADMTYAEWLQRYVELMIGEVGSP 373
Query: 116 AIQPVLSRLFAIVTKRMQENEQLQNKVHNMFKYVFDTSAPMDKRVTAVNNLV 167
I F + +R + +L + H +F + +DK A+ L+
Sbjct: 374 WIDVTWRDRFGDMLRRAE--ARLTEQDHGPITTLFTDYSLLDKPDEAIAKLL 423
>gnl|CDD|211353 cd07263, Glo_EDI_BRP_like_16, This conserved domain belongs to a
superfamily including the bleomycin resistance protein,
glyoxalase I, and type I ring-cleaving dioxygenases.
This protein family belongs to a conserved domain
superfamily that is found in a variety of structurally
related metalloproteins, including the bleomycin
resistance protein, glyoxalase I, and type I
ring-cleaving dioxygenases. A bound metal ion is
required for protein activities for the members of this
superfamily. A variety of metal ions have been found in
the catalytic centers of these proteins including
Fe(II), Mn(II), Zn(II), Ni(II) and Mg(II). The protein
superfamily contains members with or without domain
swapping.
Length = 120
Score = 27.1 bits (61), Expect = 3.9
Identities = 7/13 (53%), Positives = 9/13 (69%)
Query: 24 DYETALDFYTKAL 36
D + AL FYT+ L
Sbjct: 8 DQDKALAFYTEKL 20
>gnl|CDD|211335 cd02568, PseudoU_synth_PUS1_PUS2, Pseudouridine synthase, PUS1/
PUS2 like. This group consists of eukaryotic
pseudouridine synthases similar to Saccharomyces
cerevisiae Pus1p, S. cerevisiae Pus2p, Caenorhabditis
elegans Pus1p and human PUS1. Pseudouridine synthases
catalyze the isomerization of specific uridines in an
RNA molecule to pseudouridines (5-ribosyluracil, psi).
No cofactors are required. S. cerevisiae Pus1p catalyzes
the formation of psi34 and psi36 in the
intron-containing tRNAIle, psi35 in the
intron-containing tRNATyr, psi27 and/or psi28 in several
yeast cytoplasmic tRNAs and, psi44 in U2 small nuclear
RNA (U2 snRNA). The presence of the intron is required
for the formation of psi 34, 35 and 36. In addition S.
cerevisiae PUS1 makes are psi 26, 65 and 67. C. elegans
Pus1p does not modify psi44 in U2 snRNA. Mouse Pus1p
makes psi27/28 in pre- tRNASer , tRNAVal and tRNAIle,
psi 34/36 in tRNAIle and, psi 32 and potentially 67 in
tRNAVal. Psi44 in U2 snRNA and psi32 in tRNAs are
highly phylogenetically conserved. Psi
26,27,28,34,35,36,65 and 67 in tRNAs are less highly
conserved. Mouse Pus1p regulates nuclear receptor
activity through pseudouridylation of Steroid Receptor
RNA Activator. Missense mutation in human PUS1 causes
mitochondrial myopathy and sideroblastic anemia (MLASA).
Length = 245
Score = 27.6 bits (62), Expect = 4.0
Identities = 14/55 (25%), Positives = 23/55 (41%), Gaps = 1/55 (1%)
Query: 139 QNKVHNMFKYVFDTSAPMDKRVTAVNNLVVLAREMSGAEMLLKSGVAKQ-INTLL 192
++ + K F +A DK V A N+V L + E L + +N+ L
Sbjct: 39 ESNAGDPKKIGFSRAARTDKGVHAARNVVSLKVIIDDPEGLGILEDLVEKLNSHL 93
>gnl|CDD|216182 pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance
protein/Dioxygenase superfamily.
Length = 120
Score = 27.0 bits (60), Expect = 4.2
Identities = 9/14 (64%), Positives = 10/14 (71%)
Query: 23 GDYETALDFYTKAL 36
GD E +LDFYT L
Sbjct: 10 GDLEKSLDFYTDVL 23
>gnl|CDD|221837 pfam12895, Apc3, Anaphase-promoting complex, cyclosome, subunit 3.
Apc3, otherwise known as Cdc27, is one of the subunits
of the anaphase-promoting complex or cyclosome. The
anaphase-promoting complex is a multiprotein subunit E3
ubiquitin ligase complex that controls segregation of
chromosomes and exit from mitosis in eukaryotes. The
protein members of this family contain TPR repeats just
as those of Apc7 do, and it appears that these TPR units
bind the C-termini of the APC co-activators CDH1 and
CDC20.
Length = 80
Score = 26.1 bits (58), Expect = 4.3
Identities = 19/80 (23%), Positives = 28/80 (35%), Gaps = 7/80 (8%)
Query: 21 KQGDYETALDFYTKALKVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLEIVPDD 80
QG+YE A+ K L +T Y Q Q + E K
Sbjct: 1 DQGNYENAIFLAEKLLALTPSNEDAYLLAQ-----CYFLQGQYKRAYELLRKLKLDNSSG 55
Query: 81 PKALFRRCQAYEAIGKFEEA 100
+ L +C +GK++EA
Sbjct: 56 CRYLLAQC--LLKLGKYDEA 73
>gnl|CDD|236431 PRK09250, PRK09250, fructose-bisphosphate aldolase; Provisional.
Length = 348
Score = 27.6 bits (62), Expect = 4.5
Identities = 15/33 (45%), Positives = 18/33 (54%), Gaps = 8/33 (24%)
Query: 17 NSAF-KQGDYETALDFYTKALKVTAEESHERAT 48
NSAF K GDY TA D +T + +H AT
Sbjct: 203 NSAFKKDGDYHTAAD-------LTGQANHLAAT 228
>gnl|CDD|197854 smart00745, MIT, Microtubule Interacting and Trafficking molecule
domain.
Length = 77
Score = 26.1 bits (58), Expect = 4.6
Identities = 16/44 (36%), Positives = 24/44 (54%), Gaps = 5/44 (11%)
Query: 21 KQGDYETALDFYTKALK-----VTAEESHERATCLKNRAAVYLK 59
+ G+YE AL+ Y KA++ + E +R LK +AA YL
Sbjct: 20 EAGNYEEALELYKKAIEYLLEGIKVESDSKRREALKAKAAEYLD 63
>gnl|CDD|217619 pfam03571, Peptidase_M49, Peptidase family M49.
Length = 550
Score = 27.6 bits (62), Expect = 5.6
Identities = 7/31 (22%), Positives = 9/31 (29%)
Query: 23 GDYETALDFYTKALKVTAEESHERATCLKNR 53
D E F+ V E R L +
Sbjct: 480 ADVEAGTKFFDDYTTVDDEWLKFRDIVLAKK 510
>gnl|CDD|236330 PRK08665, PRK08665, ribonucleotide-diphosphate reductase subunit
alpha; Validated.
Length = 752
Score = 27.6 bits (62), Expect = 6.2
Identities = 16/56 (28%), Positives = 23/56 (41%), Gaps = 16/56 (28%)
Query: 48 TCLKNRAAVYLKQNQNDKVIEDCSKSLEIVPDDPKALFRRCQAYEAIGKFEEAYTD 103
T L+ R YLK++++ K IE P+ +FRR I E Y
Sbjct: 18 TVLEKR---YLKKDEDGKPIE-----------TPEDMFRRVA--RTIAAAELIYDS 57
>gnl|CDD|222122 pfam13429, TPR_15, Tetratricopeptide repeat.
Length = 279
Score = 27.3 bits (61), Expect = 6.5
Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 4/68 (5%)
Query: 18 SAFKQGDYETALDFYTKAL-KVTAEESHERATCLKNRAAVYLKQNQNDKVIEDCSKSLEI 76
+ GDY+ +D + AE+S A L RA Y + Q D + D ++L +
Sbjct: 118 LYTRAGDYKRIVDLLASLTPEEAAEQS---AGFLSARAEYYRQTGQPDAALRDYRRALAL 174
Query: 77 VPDDPKAL 84
P
Sbjct: 175 PPASSDLR 182
>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Exonuclease-1 (EXO1) is involved
in multiple, eukaryotic DNA metabolic pathways,
including DNA replication processes (5' flap DNA
endonuclease activity and double stranded DNA
5'-exonuclease activity), DNA repair processes (DNA
mismatch repair (MMR) and post-replication repair
(PRR)), recombination, and telomere integrity. EXO1
functions in the MMS2 error-free branch of the PRR
pathway in the maintenance and repair of stalled
replication forks. Studies also suggest that EXO1 plays
both structural and catalytic roles during MMR-mediated
mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
family of structure-specific, 5' nucleases. These
nucleases contain a PIN (PilT N terminus) domain with a
helical arch/clamp region (I domain) of variable length
(approximately 43 residues in EXO1 PIN domains) and a
H3TH (helix-3-turn-helix) domain, an atypical
helix-hairpin-helix-2-like region. Both the H3TH domain
(not included here) and the helical arch/clamp region
are involved in DNA binding. Nucleases within this group
also have a carboxylate-rich active site that is
involved in binding essential divalent metal ion
cofactors (Mg2+/Mn2+). EXO1 nucleases also have
C-terminal Mlh1- and Msh2-binding domains which allow
interaction with MMR and PRR proteins, respectively.
Length = 210
Score = 26.7 bits (60), Expect = 8.4
Identities = 8/36 (22%), Positives = 18/36 (50%)
Query: 10 NKLKESGNSAFKQGDYETALDFYTKALKVTAEESHE 45
+ E ++G+ A + + +A+ +T E +HE
Sbjct: 97 EENLEKALELLREGNRSEARECFQRAVDITPEMAHE 132
>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
repeats [General function prediction only].
Length = 484
Score = 27.0 bits (60), Expect = 8.9
Identities = 18/84 (21%), Positives = 34/84 (40%)
Query: 53 RAAVYLKQNQNDKVIEDCSKSLEIVPDDPKALFRRCQAYEAIGKFEEAYTDAKHIHRVEP 112
+ L+ N+ + IE K+L + P+ P QA GK +EA +P
Sbjct: 346 AGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDP 405
Query: 113 TNKAIQPVLSRLFAIVTKRMQENE 136
+ +L++ +A + R +
Sbjct: 406 EDPNGWDLLAQAYAELGNRAEALL 429
>gnl|CDD|182504 PRK10507, PRK10507, bifunctional glutathionylspermidine
amidase/glutathionylspermidine synthetase; Provisional.
Length = 619
Score = 26.9 bits (60), Expect = 9.6
Identities = 10/14 (71%), Positives = 10/14 (71%)
Query: 112 PTNKAIQPVLSRLF 125
P NKAI PVL LF
Sbjct: 495 PGNKAILPVLWSLF 508
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.129 0.356
Gapped
Lambda K H
0.267 0.0730 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,019,143
Number of extensions: 909047
Number of successful extensions: 1005
Number of sequences better than 10.0: 1
Number of HSP's gapped: 975
Number of HSP's successfully gapped: 95
Length of query: 206
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 114
Effective length of database: 6,857,034
Effective search space: 781701876
Effective search space used: 781701876
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (25.7 bits)