RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy13481
(231 letters)
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 75.5 bits (186), Expect = 8e-18
Identities = 31/95 (32%), Positives = 50/95 (52%)
Query: 117 KREAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKI 176
Y G +++A+ Y +A++L+P +A Y +Y +L K + D AL++
Sbjct: 4 LNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL 63
Query: 177 NPDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
+PD+A AY G AY LGK+EEA A ++D
Sbjct: 64 DPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELD 98
Score = 49.3 bits (118), Expect = 4e-08
Identities = 19/60 (31%), Positives = 34/60 (56%)
Query: 120 AIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPD 179
A Y GK+E+A+ Y +A++L+P +A Y G +Y +L K + AL+++P+
Sbjct: 41 AAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100
Score = 42.4 bits (100), Expect = 1e-05
Identities = 21/63 (33%), Positives = 31/63 (49%)
Query: 149 LYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNAC 208
G Y +L + + AL+++PD+A AY AY LGK+EEA D A
Sbjct: 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61
Query: 209 KID 211
++D
Sbjct: 62 ELD 64
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 61.6 bits (150), Expect = 5e-13
Identities = 18/69 (26%), Positives = 35/69 (50%), Gaps = 1/69 (1%)
Query: 111 DKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQL-SKPNACIRD 169
D + K + G +++A+ AY +A++L+P +A Y +YL+L + D
Sbjct: 1 DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALED 60
Query: 170 CSVALKINP 178
AL+++P
Sbjct: 61 LEKALELDP 69
Score = 48.1 bits (115), Expect = 6e-08
Identities = 23/67 (34%), Positives = 34/67 (50%), Gaps = 1/67 (1%)
Query: 146 SALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGK-WEEAAVDL 204
+A G + +L + I AL+++PD+A AY AY LGK +EEA DL
Sbjct: 2 NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61
Query: 205 RNACKID 211
A ++D
Sbjct: 62 EKALELD 68
Score = 31.1 bits (71), Expect = 0.062
Identities = 13/35 (37%), Positives = 18/35 (51%)
Query: 179 DSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFD 213
D+A A K G A LG ++EA A ++D D
Sbjct: 1 DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPD 35
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 62.3 bits (151), Expect = 2e-11
Identities = 46/156 (29%), Positives = 74/156 (47%), Gaps = 3/156 (1%)
Query: 57 KPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEK 116
K E+ E ++EE + EP +E P D ++ +SEE+ K K
Sbjct: 71 KKERRKRKQAEKETEGKTEEKKSTAPKNAPV-EPADELPEIDESSVANLSEEERKKYAAK 129
Query: 117 KREAIH-AYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALK 175
+E + AY + F A+ YS+AI+ P + Y+ R + L + D + AL+
Sbjct: 130 LKEKGNKAYRNKDFNKAIKLYSKAIECKPDP-VYYSNRAACHNALGDWEKVVEDTTAALE 188
Query: 176 INPDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
++PD + A R AY LGK+ +A +DL +C ID
Sbjct: 189 LDPDYSKALNRRANAYDGLGKYADALLDLTASCIID 224
Score = 42.7 bits (100), Expect = 8e-05
Identities = 42/160 (26%), Positives = 70/160 (43%), Gaps = 12/160 (7%)
Query: 65 PEPS-APEEESEESDPELDN--TGVISEPDEEEPIPDVATDIEVSEEDIDKSNEKKREAI 121
P+P A E+S E D E N + + E + E+ +D ++EAI
Sbjct: 273 PKPRPAGLEDSNELDEETGNGQLQLGLKSPESKADESYEEAARAFEKALDLGKLGEKEAI 332
Query: 122 HAYN--------DGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVA 173
A N GK +A+ S++I+L+P Y KR L+L P+ D A
Sbjct: 333 -ALNLRGTFKCLKGKHLEALADLSKSIELDPRVTQSYIKRASMNLELGDPDKAEEDFDKA 391
Query: 174 LKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFD 213
LK+N + Y R + + + G++ +A D + + +D D
Sbjct: 392 LKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPD 431
>gnl|CDD|215568 PLN03088, PLN03088, SGT1, suppressor of G2 allele of SKP1;
Provisional.
Length = 356
Score = 51.3 bits (123), Expect = 9e-08
Identities = 28/87 (32%), Positives = 51/87 (58%)
Query: 114 NEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVA 173
+ + +A A+ D F AV+ Y++AI L+P++A LYA R Q+ ++L + D + A
Sbjct: 3 KDLEDKAKEAFVDDDFALAVDLYTQAIDLDPNNAELYADRAQANIKLGNFTEAVADANKA 62
Query: 174 LKINPDSAAAYKFRGRAYRLLGKWEEA 200
++++P A AY +G A L +++ A
Sbjct: 63 IELDPSLAKAYLRKGTACMKLEEYQTA 89
>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat.
Length = 65
Score = 43.8 bits (104), Expect = 2e-06
Identities = 14/64 (21%), Positives = 25/64 (39%)
Query: 118 REAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKIN 177
A A G +++A+ A A+ P +A G++ L+ + AL +
Sbjct: 2 ALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAAD 61
Query: 178 PDSA 181
PD
Sbjct: 62 PDDP 65
Score = 37.3 bits (87), Expect = 5e-04
Identities = 16/60 (26%), Positives = 23/60 (38%)
Query: 152 KRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
++ L+ + + AL P +A A G A G+ EAA LR A D
Sbjct: 2 ALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAAD 61
>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
Length = 291
Score = 46.0 bits (107), Expect = 6e-06
Identities = 31/119 (26%), Positives = 51/119 (42%), Gaps = 1/119 (0%)
Query: 103 IEVSEEDIDKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNPS-SALLYAKRGQSYLQLS 161
+E+ E + + G++E+A+ +A+KLNP A G YL+L
Sbjct: 157 LELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLG 216
Query: 162 KPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADEWL 220
K + AL+++PD+A A LG++EEA L A ++D D
Sbjct: 217 KYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPDLYNLGLA 275
Score = 36.4 bits (82), Expect = 0.008
Identities = 29/102 (28%), Positives = 40/102 (39%), Gaps = 4/102 (3%)
Query: 110 IDKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSS---ALLYAKRGQSYLQLSKPNAC 166
E Y G +E+A+ Y +A++L+P A G L +
Sbjct: 127 DPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEALLALGALLEALGRYEEA 186
Query: 167 IRDCSVALKINPDSAA-AYKFRGRAYRLLGKWEEAAVDLRNA 207
+ ALK+NPD A A G Y LGK+EEA A
Sbjct: 187 LELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKA 228
>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
repeats [General function prediction only].
Length = 484
Score = 45.1 bits (107), Expect = 1e-05
Identities = 22/74 (29%), Positives = 36/74 (48%)
Query: 127 GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKF 186
K ++A+ +A+ L+P+S LL Q+ L+ KP IR + L +P+ +
Sbjct: 354 NKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDL 413
Query: 187 RGRAYRLLGKWEEA 200
+AY LG EA
Sbjct: 414 LAQAYAELGNRAEA 427
Score = 36.6 bits (85), Expect = 0.007
Identities = 23/101 (22%), Positives = 38/101 (37%), Gaps = 1/101 (0%)
Query: 119 EAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINP 178
A+ Y G++++A+ I P + G L+ +K I AL ++P
Sbjct: 312 RALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDP 371
Query: 179 DSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADEW 219
+S +A GK +EA L D E + W
Sbjct: 372 NSPLLQLNLAQALLKGGKPQEAIRILNRYLFND-PEDPNGW 411
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 42.8 bits (101), Expect = 9e-05
Identities = 27/100 (27%), Positives = 42/100 (42%), Gaps = 5/100 (5%)
Query: 122 HAY-NDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLS--KPNACIRDCSVALKINP 178
AY G+ E A +Y +A+ ++P S LYAK G + L L+ + + L +P
Sbjct: 133 LAYLGLGQLELAQKSYEQALAIDPRS--LYAKLGLAQLALAENRFDEARALIDEVLTADP 190
Query: 179 DSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADE 218
+ A +G LG E A R A + + A
Sbjct: 191 GNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVL 230
Score = 38.9 bits (91), Expect = 0.002
Identities = 23/89 (25%), Positives = 42/89 (47%)
Query: 125 NDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAY 184
G+ ++A+ S A+ L+P + G++YL L + A +++P++AAA
Sbjct: 341 RLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAAR 400
Query: 185 KFRGRAYRLLGKWEEAAVDLRNACKIDFD 213
G + G EA DL A ++D +
Sbjct: 401 TQLGISKLSQGDPSEAIADLETAAQLDPE 429
Score = 35.1 bits (81), Expect = 0.025
Identities = 16/85 (18%), Positives = 31/85 (36%)
Query: 120 AIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPD 179
A A + +F++A E + +P + +G L L + A+ + P+
Sbjct: 166 AQLALAENRFDEARALIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPN 225
Query: 180 SAAAYKFRGRAYRLLGKWEEAAVDL 204
+ A G++EEA
Sbjct: 226 NIAVLLALATILIEAGEFEEAEKHA 250
Score = 35.1 bits (81), Expect = 0.029
Identities = 16/76 (21%), Positives = 29/76 (38%), Gaps = 1/76 (1%)
Query: 126 DGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYK 185
+ A+ AY +A+K PSS K ++ L ++ LK +P+ A
Sbjct: 716 QKDYPAAIQAYRKALKRAPSSQNA-IKLHRALLASGNTAEAVKTLEAWLKTHPNDAVLRT 774
Query: 186 FRGRAYRLLGKWEEAA 201
Y +++A
Sbjct: 775 ALAELYLAQKDYDKAI 790
Score = 33.9 bits (78), Expect = 0.059
Identities = 19/76 (25%), Positives = 31/76 (40%)
Query: 138 EAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKW 197
EA P S + G++ L N + L + PDSA A AY ++ +
Sbjct: 592 EAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNY 651
Query: 198 EEAAVDLRNACKIDFD 213
+A L+ A ++ D
Sbjct: 652 AKAITSLKRALELKPD 667
Score = 32.7 bits (75), Expect = 0.14
Identities = 20/55 (36%), Positives = 28/55 (50%)
Query: 153 RGQSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNA 207
+SYLQ +K A I AL+ +P+ A A G+ Y LG + A +LR A
Sbjct: 28 AAKSYLQKNKYKAAIIQLKNALQKDPNDAEARFLLGKIYLALGDYAAAEKELRKA 82
Score = 32.0 bits (73), Expect = 0.27
Identities = 17/74 (22%), Positives = 29/74 (39%)
Query: 127 GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKF 186
G AV+++ + + L P SAL +Y + I AL++ PD+ A
Sbjct: 615 GDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIG 674
Query: 187 RGRAYRLLGKWEEA 200
+ + E A
Sbjct: 675 LAQLLLAAKRTESA 688
Score = 31.2 bits (71), Expect = 0.49
Identities = 23/77 (29%), Positives = 29/77 (37%), Gaps = 1/77 (1%)
Query: 125 NDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAY 184
+ E A K +P +AL + G YL+ A I+ ALK P S A
Sbjct: 681 AAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPSSQNAI 740
Query: 185 KFRGRAYRLLGKWEEAA 201
K RA G EA
Sbjct: 741 KL-HRALLASGNTAEAV 756
Score = 31.2 bits (71), Expect = 0.51
Identities = 21/74 (28%), Positives = 29/74 (39%)
Query: 127 GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKF 186
G E+AV +A +LNP Q YL + + + A PDS A+
Sbjct: 547 GNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAWLM 606
Query: 187 RGRAYRLLGKWEEA 200
GRA G +A
Sbjct: 607 LGRAQLAAGDLNKA 620
Score = 31.2 bits (71), Expect = 0.53
Identities = 27/98 (27%), Positives = 39/98 (39%), Gaps = 8/98 (8%)
Query: 127 GKFEDAVNAYSEAIKL--NPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAY 184
GKF+ + L + +A L A RG +YL L + + AL I+P S A
Sbjct: 104 GKFQQVL-DELPGKTLLDDEGAAELLALRGLAYLGLGQLELAQKSYEQALAIDPRSLYA- 161
Query: 185 KFRGRAYRLL--GKWEEAAVDLRNACKIDFDEQADEWL 220
K G A L +++EA + D D L
Sbjct: 162 KL-GLAQLALAENRFDEARALIDEVLTAD-PGNVDALL 197
Score = 30.8 bits (70), Expect = 0.57
Identities = 17/57 (29%), Positives = 28/57 (49%)
Query: 127 GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAA 183
G FE A ++A +L+P +A + G S L P+ I D A +++P+ A
Sbjct: 377 GDFEKAAEYLAKATELDPENAAARTQLGISKLSQGDPSEAIADLETAAQLDPELGRA 433
Score = 30.1 bits (68), Expect = 1.2
Identities = 30/91 (32%), Positives = 44/91 (48%), Gaps = 4/91 (4%)
Query: 123 AYNDGKFEDAVNAYSEAIKLNPSSALLYAKR--GQSYLQLSKPNACIRDCSVALKINPDS 180
Y G E A ++ +K P+S A+R L+L + + I S AL ++PD
Sbjct: 305 EYQLGNLEQAYQYLNQILKYAPNS--HQARRLLASIQLRLGRVDEAIATLSPALGLDPDD 362
Query: 181 AAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
AA G AY LG +E+AA L A ++D
Sbjct: 363 PAALSLLGEAYLALGDFEKAAEYLAKATELD 393
Score = 28.9 bits (65), Expect = 3.0
Identities = 19/87 (21%), Positives = 36/87 (41%)
Query: 127 GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKF 186
G E A+ AY +AI L P++ + ++ + + LK P+S A+
Sbjct: 207 GNIELALAAYRKAIALRPNNIAVLLALATILIEAGEFEEAEKHADALLKKAPNSPLAHYL 266
Query: 187 RGRAYRLLGKWEEAAVDLRNACKIDFD 213
+ +E+A L++A K +
Sbjct: 267 KALVDFQKKNYEDARETLQDALKSAPE 293
>gnl|CDD|233924 TIGR02552, LcrH_SycD, type III secretion low calcium response
chaperone LcrH/SycD. Genes in this family are found in
type III secretion operons. LcrH, from Yersinia is
believed to have a regulatory function in the
low-calcium response of the secretion system. The same
protein is also known as SycD (SYC = Specific Yop
Chaperone) for its chaperone role. In Pseudomonas, where
the homolog is known as PcrH, the chaperone role has
been demonstrated and the regulatory role appears to be
absent. ScyD/LcrH contains three central
tetratricopeptide-like repeats that are predicted to
fold into an all-alpha-helical array.
Length = 135
Score = 38.8 bits (91), Expect = 5e-04
Identities = 21/105 (20%), Positives = 40/105 (38%)
Query: 118 REAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKIN 177
A + Y G++++A+ + +P ++ + L + I ++A ++
Sbjct: 22 ALAYNLYQQGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALD 81
Query: 178 PDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADEWLKE 222
PD Y LG+ E A L A +I + LKE
Sbjct: 82 PDDPRPYFHAAECLLALGEPESALKALDLAIEICGENPEYSELKE 126
Score = 30.0 bits (68), Expect = 0.45
Identities = 16/69 (23%), Positives = 36/69 (52%)
Query: 127 GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKF 186
++E+A++AY+ A L+P Y + L L +P + ++ +A++I ++ +
Sbjct: 65 KEYEEAIDAYALAAALDPDDPRPYFHAAECLLALGEPESALKALDLAIEICGENPEYSEL 124
Query: 187 RGRAYRLLG 195
+ RA +L
Sbjct: 125 KERAEAMLE 133
>gnl|CDD|205550 pfam13371, TPR_9, Tetratricopeptide repeat.
Length = 73
Score = 36.0 bits (84), Expect = 0.001
Identities = 13/51 (25%), Positives = 19/51 (37%)
Query: 155 QSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLR 205
YL+ + L + PD + RG Y LG ++ A DL
Sbjct: 3 AIYLREEDFERALAVVERLLLLAPDDPYERRDRGLLYAQLGCFQAALADLE 53
Score = 25.6 bits (57), Expect = 5.9
Identities = 18/64 (28%), Positives = 26/64 (40%), Gaps = 1/64 (1%)
Query: 120 AIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINPD 179
AI+ + FE A+ + L P RG Y QL A + D L++ PD
Sbjct: 3 AIYL-REEDFERALAVVERLLLLAPDDPYERRDRGLLYAQLGCFQAALADLEYFLELCPD 61
Query: 180 SAAA 183
+ A
Sbjct: 62 APDA 65
>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats. Repeats present in 4
or more copies in proteins. Contain a minimum of 34
amino acids each and self-associate via a "knobs and
holes" mechanism.
Length = 34
Score = 34.0 bits (79), Expect = 0.004
Identities = 11/31 (35%), Positives = 15/31 (48%)
Query: 181 AAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
A A G AY LG ++EA A ++D
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELD 31
Score = 29.7 bits (68), Expect = 0.11
Identities = 6/19 (31%), Positives = 15/19 (78%)
Query: 127 GKFEDAVNAYSEAIKLNPS 145
G +++A+ Y +A++L+P+
Sbjct: 15 GDYDEALEYYEKALELDPN 33
Score = 26.3 bits (59), Expect = 1.6
Identities = 8/33 (24%), Positives = 16/33 (48%)
Query: 147 ALLYAKRGQSYLQLSKPNACIRDCSVALKINPD 179
A G +YL+L + + AL+++P+
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPN 33
>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat.
Length = 34
Score = 32.8 bits (76), Expect = 0.009
Identities = 11/31 (35%), Positives = 16/31 (51%)
Query: 181 AAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
A A G AY LGK++EA A +++
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELN 31
Score = 29.7 bits (68), Expect = 0.092
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 147 ALLYAKRGQSYLQLSKPNACIRDCSVALKINPD 179
A G +YL+L K + + AL++NP+
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPN 33
Score = 29.7 bits (68), Expect = 0.11
Identities = 8/19 (42%), Positives = 16/19 (84%)
Query: 127 GKFEDAVNAYSEAIKLNPS 145
GK+++A+ Y +A++LNP+
Sbjct: 15 GKYDEALEYYEKALELNPN 33
>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat.
Length = 34
Score = 32.6 bits (75), Expect = 0.010
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 173 ALKINPDSAAAYKFRGRAYRLLGKWEEA 200
AL+++P++A AY LG+++EA
Sbjct: 5 ALELDPNNAEAYYNLALLLLNLGQYDEA 32
Score = 29.1 bits (66), Expect = 0.18
Identities = 8/34 (23%), Positives = 18/34 (52%)
Query: 135 AYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIR 168
Y +A++L+P++A Y L L + + ++
Sbjct: 1 LYEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34
>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat. This Pfam entry
includes outlying Tetratricopeptide-like repeats (TPR)
that are not matched by pfam00515.
Length = 34
Score = 31.7 bits (73), Expect = 0.019
Identities = 12/31 (38%), Positives = 15/31 (48%)
Query: 181 AAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
A A G AY LG +EEA A ++D
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELD 31
Score = 30.2 bits (69), Expect = 0.069
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 118 REAIHAYNDGKFEDAVNAYSEAIKLNP 144
+ Y G +E+A+ AY +A++L+P
Sbjct: 6 NLGLAYYKLGDYEEALEAYEKALELDP 32
Score = 25.6 bits (57), Expect = 3.4
Identities = 7/33 (21%), Positives = 14/33 (42%)
Query: 147 ALLYAKRGQSYLQLSKPNACIRDCSVALKINPD 179
A G +Y +L + AL+++P+
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELDPN 33
>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat.
Length = 78
Score = 32.7 bits (75), Expect = 0.026
Identities = 16/76 (21%), Positives = 28/76 (36%), Gaps = 7/76 (9%)
Query: 143 NPSSALLYAKRGQSYLQLSKPNACIRDCSVALKI-------NPDSAAAYKFRGRAYRLLG 195
+P A +L + + AL++ +P++A A R Y LG
Sbjct: 1 HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60
Query: 196 KWEEAAVDLRNACKID 211
++EA L A +
Sbjct: 61 DYDEALEYLEKALALR 76
>gnl|CDD|222122 pfam13429, TPR_15, Tetratricopeptide repeat.
Length = 279
Score = 33.8 bits (78), Expect = 0.055
Identities = 23/97 (23%), Positives = 36/97 (37%), Gaps = 15/97 (15%)
Query: 119 EAIHAYND----GKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVAL 174
A++ Y + D + + + S+ L A R + Y Q +P+A +RD AL
Sbjct: 114 SALYLYTRAGDYKRIVDLLASLTPEEAAEQSAGFLSA-RAEYYRQTGQPDAALRDYRRAL 172
Query: 175 KINPDSAAA----------YKFRGRAYRLLGKWEEAA 201
+ P S+ R RLL W A
Sbjct: 173 ALPPASSDLRAALLWLLIDLGRRAELKRLLAAWRGRA 209
>gnl|CDD|188247 TIGR02795, tol_pal_ybgF, tol-pal system protein YbgF. Members of
this protein family are the product of one of seven
genes regularly clustered in operons to encode the
proteins of the tol-pal system, which is critical for
maintaining the integrity of the bacterial outer
membrane. The gene for this periplasmic protein has been
designated orf2 and ybgF. All members of the seed
alignment were from unique tol-pal gene regions from
completed bacterial genomes. The architecture of this
protein is a signal sequence, a low-complexity region
usually rich in Asn and Gln, a well-conserved region
with tandem repeats that resemble the tetratricopeptide
(TPR) repeat, involved in protein-protein interaction.
Length = 117
Score = 32.3 bits (74), Expect = 0.061
Identities = 19/82 (23%), Positives = 31/82 (37%), Gaps = 8/82 (9%)
Query: 126 DGKFEDAVNAYSEAIKLNPSSALL---YAKRGQSYLQLSKPNACIRDCSVALKINPDSAA 182
G + DA+ A+ +K P S + G++Y + +K P S
Sbjct: 13 AGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPK 72
Query: 183 A----YKFRGRAYRLLGKWEEA 200
A K G + + LG E+A
Sbjct: 73 APDALLKL-GMSLQELGDKEKA 93
>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase complex.
Length = 539
Score = 33.3 bits (76), Expect = 0.084
Identities = 14/49 (28%), Positives = 21/49 (42%), Gaps = 4/49 (8%)
Query: 30 ELKFFKDFVESYGGTIPPPSKAQSDAPKP----EKPFTTPEPSAPEEES 74
++ FKD+ S P S P EKP ++PEP A + +
Sbjct: 193 DIGKFKDYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSA 241
>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
dihydrolipoamide succinyltransferase (E2 component).
This model describes the TCA cycle 2-oxoglutarate system
E2 component, dihydrolipoamide succinyltransferase. It
is closely related to the pyruvate dehydrogenase E2
component, dihydrolipoamide acetyltransferase. The seed
for this model includes mitochondrial and Gram-negative
bacterial forms. Mycobacterial candidates are highly
derived, differ in having and extra copy of the
lipoyl-binding domain at the N-terminus. They score
below the trusted cutoff, but above the noise cutoff and
above all examples of dihydrolipoamide acetyltransferase
[Energy metabolism, TCA cycle].
Length = 403
Score = 32.8 bits (75), Expect = 0.12
Identities = 17/80 (21%), Positives = 24/80 (30%), Gaps = 2/80 (2%)
Query: 42 GGTIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEES-DPELDNTGVISEPDEEEPIPDVA 100
G A+S K E P + + + S P D +P
Sbjct: 77 GNDATAAPPAKSGEEKEETPAASAAAAPTAAANRPSLSPAARRLAKEHGIDLSA-VPGTG 135
Query: 101 TDIEVSEEDIDKSNEKKREA 120
V++EDI K E A
Sbjct: 136 VTGRVTKEDIIKKTEAPASA 155
>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat.
Length = 34
Score = 29.0 bits (66), Expect = 0.19
Identities = 12/31 (38%), Positives = 16/31 (51%)
Query: 181 AAAYKFRGRAYRLLGKWEEAAVDLRNACKID 211
A AY G+ Y LG +EEA A ++D
Sbjct: 1 ARAYYLLGQIYLQLGDYEEAKEYYEKALELD 31
>gnl|CDD|226456 COG3947, COG3947, Response regulator containing CheY-like receiver
and SARP domains [Signal transduction mechanisms].
Length = 361
Score = 32.1 bits (73), Expect = 0.22
Identities = 30/129 (23%), Positives = 49/129 (37%), Gaps = 19/129 (14%)
Query: 108 EDIDKSNEKKREAIHAYNDGKFEDA-------VNAYSEAIKLNPSSALLYAKRGQSYLQL 160
E I+ + ++ +E + Y +A E + + L K ++YL+
Sbjct: 238 EAINLTIDELKELVGQYKGDYLPEADYPWAEDERERLEQLYMK-----LLGKVARAYLEA 292
Query: 161 SKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLG-------KWEEAAVDLRNACKIDFD 213
KPN I+ AL ++P S K + LG +E A L ID D
Sbjct: 293 GKPNEAIQLHQRALTLDPLSEQDNKGLMASLATLGDEISAIKHYERYAEVLEAELGIDVD 352
Query: 214 EQADEWLKE 222
+ +EW K
Sbjct: 353 DSIEEWYKS 361
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 32.0 bits (73), Expect = 0.22
Identities = 12/70 (17%), Positives = 21/70 (30%)
Query: 43 GTIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATD 102
P K ++ E+ E EE+ + + E EEE + +
Sbjct: 404 QKEPLEEKPENKDESVEEISDAEEDDEDEEDEDGDGDVEMSAVDNDEEKEEEDKEAIPST 463
Query: 103 IEVSEEDIDK 112
I E +
Sbjct: 464 ILEEEPTVGG 473
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 31.3 bits (71), Expect = 0.46
Identities = 13/56 (23%), Positives = 19/56 (33%), Gaps = 4/56 (7%)
Query: 46 PPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISE---PDEEEPIPD 98
P P P P +P PE E + + D + + E E +PD
Sbjct: 267 PEPEPQPPPPPPPPEP-PEPEEEPDEPDQTDPDDGEETDQIPEELMFDAVEADLPD 321
Score = 28.6 bits (64), Expect = 3.1
Identities = 15/50 (30%), Positives = 20/50 (40%), Gaps = 6/50 (12%)
Query: 46 PPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEP 95
P ++ P+P P PEP PEEE +E D + EE
Sbjct: 261 PRATRLPEPEPQPPPPPPPPEPPEPEEEPDEPDQT------DPDDGEETD 304
>gnl|CDD|238138 cd00226, PRCH, Photosynthetic reaction center (RC) complex, subunit
H; RC is an integral membrane protein-pigment complex
which catalyzes light-induced reduction of ubiquinone to
ubiquinol, generating a transmembrane electrochemical
gradient of protons used to produce ATP by ATP synthase.
Subunit H is positioned mainly in the cytoplasm with one
transmembrane alpha helix. Provides proton transfer
pathway (water channels) connecting the terminal quinone
electron acceptor of RC, to the aqueous phase. Found in
photosynthetic bacteria: alpha, beta, and gamma
proteobacteria.
Length = 246
Score = 30.9 bits (70), Expect = 0.48
Identities = 29/109 (26%), Positives = 40/109 (36%), Gaps = 20/109 (18%)
Query: 16 LGAVKKNPAIFYLPELKFFKDFVESYGGTIPPPSKAQSDAPKPEKPFTTPEPSAPEEESE 75
G +N +F LP K FK +GGT+ P T P AP +
Sbjct: 46 DGGPSENGGVFPLPYPKTFK---LPHGGTVTVPRVEAETRALALAR-TAPFEGAPLVPT- 100
Query: 76 ESDPELDNTGVISEPDEEEPIPD-------------VATDIEVSEEDID 111
+P LD G S + + +PD VAT V+ D+D
Sbjct: 101 -GNPMLDGVGPASWAERRD-LPDLDVHGHPKIVPMRVATGFSVAAGDVD 147
>gnl|CDD|179448 PRK02603, PRK02603, photosystem I assembly protein Ycf3;
Provisional.
Length = 172
Score = 30.0 bits (68), Expect = 0.58
Identities = 30/116 (25%), Positives = 52/116 (44%), Gaps = 18/116 (15%)
Query: 118 REAIHAYNDGKFEDAVNAYSEAIKL--NPSS--------ALLYAKRGQSYLQLSKPNACI 167
R+ + A DG++ +A+ Y EA+KL +P+ ++YA G+ L +
Sbjct: 40 RDGMSAQADGEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQ-- 97
Query: 168 RDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADEWLKEV 223
AL++NP +A Y G+ E A D A + FD+ A+ W + +
Sbjct: 98 -----ALELNPKQPSALNNIAVIYHKRGEKAEEAGDQDEA-EALFDKAAEYWKQAI 147
Score = 27.3 bits (61), Expect = 5.3
Identities = 16/56 (28%), Positives = 30/56 (53%), Gaps = 7/56 (12%)
Query: 107 EEDIDKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNP-------SSALLYAKRGQ 155
E+ D+S I ++G+ + A+ Y +A++LNP + A++Y KRG+
Sbjct: 66 EDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNPKQPSALNNIAVIYHKRGE 121
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 30.9 bits (70), Expect = 0.59
Identities = 11/58 (18%), Positives = 21/58 (36%)
Query: 66 EPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEKKREAIHA 123
E + EE + EL+ + D ++ + E E+D + + K E
Sbjct: 3 EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60
>gnl|CDD|226687 COG4235, COG4235, Cytochrome c biogenesis factor [Posttranslational
modification, protein turnover, chaperones].
Length = 287
Score = 30.0 bits (68), Expect = 0.80
Identities = 24/93 (25%), Positives = 38/93 (40%), Gaps = 14/93 (15%)
Query: 122 HAY-NDGKFEDAVNAYSEAIKLNPSSA--------LLYAKRGQSYLQLSKPNACIRDCSV 172
AY G+ DA+ AY A++L + LY + GQ ++ A +R
Sbjct: 164 RAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQMTAKAR--ALLRQ--- 218
Query: 173 ALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLR 205
AL ++P + A A G + EAA +
Sbjct: 219 ALALDPANIRALSLLAFAAFEQGDYAEAAAAWQ 251
>gnl|CDD|236875 PRK11189, PRK11189, lipoprotein NlpI; Provisional.
Length = 296
Score = 29.9 bits (68), Expect = 0.94
Identities = 12/32 (37%), Positives = 16/32 (50%)
Query: 169 DCSVALKINPDSAAAYKFRGRAYRLLGKWEEA 200
D S AL + PD A AY + G G ++ A
Sbjct: 86 DFSQALALRPDMADAYNYLGIYLTQAGNFDAA 117
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 30.0 bits (68), Expect = 0.99
Identities = 11/50 (22%), Positives = 19/50 (38%), Gaps = 7/50 (14%)
Query: 44 TIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEE 93
+P P + + P P P P PE++ + + E + EE
Sbjct: 257 RLPAPPQPPEEEPPPP-----PPP--PEDDDDPPEDEEEQDDAEDRALEE 299
Score = 30.0 bits (68), Expect = 0.99
Identities = 7/45 (15%), Positives = 14/45 (31%)
Query: 58 PEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATD 102
P P E P E D + + E+ + ++ +
Sbjct: 259 PAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEIVLE 303
>gnl|CDD|236512 PRK09430, djlA, Dna-J like membrane chaperone protein; Provisional.
Length = 267
Score = 29.8 bits (68), Expect = 1.1
Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 10/34 (29%)
Query: 105 VSEEDI-------DK---SNEKKREAIHAYNDGK 128
V+E DI D+ E +R A A+ +GK
Sbjct: 72 VTEADIRIASQLMDRMNLHGEARRAAQQAFREGK 105
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 29.9 bits (67), Expect = 1.1
Identities = 13/61 (21%), Positives = 24/61 (39%), Gaps = 7/61 (11%)
Query: 47 PPSKAQSDAPKPEKPFTTP-------EPSAPEEESEESDPELDNTGVISEPDEEEPIPDV 99
A++ + PE+P EP PE+ + DP + ++ P ++ PD
Sbjct: 387 HQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPGDDPAEPDG 446
Query: 100 A 100
Sbjct: 447 I 447
Score = 27.3 bits (60), Expect = 7.9
Identities = 15/61 (24%), Positives = 20/61 (32%), Gaps = 9/61 (14%)
Query: 48 PSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSE 107
P+ Q DA P A E E P++ PIPD A E++
Sbjct: 384 PAAHQVDAEAASAAPEEPAALASEAHDETEP---------EVPEKAAPIPDPAKPDELAV 434
Query: 108 E 108
Sbjct: 435 A 435
>gnl|CDD|143585 cd07316, terB_like_DjlA, N-terminal tellurium resistance protein
terB-like domain of heat shock DnaJ-like proteins.
Tellurium resistance terB-like domain of the DnaJ-like
DjlA proteins. This family represents the terB-like
domain of DjlA-like proteins, a subgroup of heat shock
DnaJ-like proteins. Escherichia coli DjlA is a type III
membrane protein with a small N-terminal transmembrane
region and DnaJ-like domain on the extreme C-terminus.
Overproduction has been shown to activate the RcsC
pathway, which regulates the production of the capsular
exopolysaccharide colanic acid. The specific function
of this domain is unknown.
Length = 106
Score = 28.3 bits (64), Expect = 1.2
Identities = 16/59 (27%), Positives = 22/59 (37%), Gaps = 15/59 (25%)
Query: 105 VSEEDID----------KSNEKKREAIHAYNDGK-----FEDAVNAYSEAIKLNPSSAL 148
VSE +I E +REAI +N+GK E+ + A P L
Sbjct: 16 VSEAEIQAARALMDQMGLDAEARREAIRLFNEGKESDFGLEEYARQFRRACGGRPELLL 74
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 29.6 bits (67), Expect = 1.3
Identities = 13/53 (24%), Positives = 20/53 (37%), Gaps = 8/53 (15%)
Query: 26 FYLPELKFFKDFVESYGGTIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESD 78
F P + K+ + + PS A + A +AP EE EE +
Sbjct: 257 FTFPLAEKIKEALAN-------PSAAAA-AAAAAAAAAAAAAAAPAEEEEEEE 301
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 30.0 bits (68), Expect = 1.3
Identities = 16/82 (19%), Positives = 23/82 (28%), Gaps = 8/82 (9%)
Query: 46 PPPSKAQSDAPKPEKP------FTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDV 99
P A P P +P P P + +EEE D
Sbjct: 730 APSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDD 789
Query: 100 ATDIEVSEEDIDKSNEKKREAI 121
A + +ED + E E +
Sbjct: 790 AP--SMDDEDRRDAEEVAMELL 809
>gnl|CDD|153411 cd07957, Anticodon_Ia_Met, Anticodon-binding domain of methionyl
tRNA synthetases. This domain is found in methionyl
tRNA synthetases (MetRS), which belong to the class Ia
aminoacyl tRNA synthetases. It lies C-terminal to the
catalytic core domain, and recognizes and specifically
binds to the tRNA anticodon (CAU). MetRS catalyzes the
transfer of methionine to the 3'-end of its tRNA.
Length = 129
Score = 28.6 bits (65), Expect = 1.4
Identities = 14/74 (18%), Positives = 30/74 (40%), Gaps = 11/74 (14%)
Query: 96 IPDVATDIEVSEEDIDKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQ 155
+P+ E EE ++++ E E A + +F A+ E ++L A+
Sbjct: 28 VPEFGGLTEEDEELLEEAEELLEEVAEAMEELEFRKALE---EIMEL--------ARAAN 76
Query: 156 SYLQLSKPNACIRD 169
Y+ + P ++
Sbjct: 77 KYIDETAPWKLAKE 90
>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
Members of this family are the S-layer-forming
halobacterial major cell surface glycoprotein. The
highest scores below model cutoffs are fragmentary
paralogs to actual members of the family. Modifications
include at N-linked and O-linked glycosylation, a
C-terminal diphytanylglyceryl modification, and probable
cleavage of the PGF-CTERM tail.
Length = 782
Score = 29.5 bits (66), Expect = 1.5
Identities = 10/43 (23%), Positives = 15/43 (34%)
Query: 55 APKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIP 97
+ TTP + PEE +E ++P E P
Sbjct: 718 TSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETTTGSSTP 760
>gnl|CDD|129652 TIGR00561, pntA, NAD(P) transhydrogenase, alpha subunit. This
integral membrane protein is the alpha subunit of alpha
2 beta 2 tetramer that couples the proton transport
across the membrane to the reversible transfer of
hydride ion equivalents between NAD and NADP. An
alternate name is pyridine nucleotide transhydrogenase
alpha subunit. The N-terminal region is homologous to
alanine dehydrogenase. In some species, such as
Rhodospirillum rubrum, the alpha chain is replaced by
two shorter chains, both with some homology to the
full-length alpha chain modeled here. These score below
the trusted cutoff [Energy metabolism, Electron
transport].
Length = 511
Score = 29.5 bits (66), Expect = 1.7
Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 6/36 (16%)
Query: 44 TIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDP 79
T PPP S P+P K +AP ESEE
Sbjct: 366 TWPPPKIQVSAQPQPAK------KAAPVVESEEKSA 395
>gnl|CDD|239121 cd02656, MIT, MIT: domain contained within Microtubule Interacting
and Trafficking molecules. The MIT domain is found in
sorting nexins, the nuclear thiol protease PalBH, the
AAA protein spastin and archaebacterial proteins with
similar domain architecture, vacuolar sorting proteins
and others. The molecular function of the MIT domain is
unclear.
Length = 75
Score = 27.3 bits (61), Expect = 1.7
Identities = 10/33 (30%), Positives = 20/33 (60%)
Query: 108 EDIDKSNEKKREAIHAYNDGKFEDAVNAYSEAI 140
E + ++ E ++A+ DG +E+A+ Y EA+
Sbjct: 1 ELLQQAKELIKQAVKEDEDGNYEEALELYKEAL 33
>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 944
Score = 29.3 bits (66), Expect = 1.7
Identities = 14/64 (21%), Positives = 23/64 (35%), Gaps = 4/64 (6%)
Query: 49 SKAQSDAPKPE---KPFTTPEPSAPEEESEE-SDPELDNTGVISEPDEEEPIPDVATDIE 104
S + D K KP T P + P S+ S P+ T + D + + + E
Sbjct: 630 SPKEGDGKKSSADRKPKTPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVPE 689
Query: 105 VSEE 108
+
Sbjct: 690 AALA 693
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
partitioning].
Length = 324
Score = 29.1 bits (65), Expect = 1.8
Identities = 13/75 (17%), Positives = 20/75 (26%)
Query: 45 IPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIE 104
IP P Q P T P + E P L +P ++
Sbjct: 101 IPVPQPPQISDPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQPVHSAAPQPAVQTVQ 160
Query: 105 VSEEDIDKSNEKKRE 119
+ + E+ E
Sbjct: 161 PAVPEQQVQPEEVVE 175
>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive.
Pullulan is an unusual, industrially important
polysaccharide in which short alpha-1,4 chains
(maltotriose) are connected in alpha-1,6 linkages.
Enzymes that cleave alpha-1,6 linkages in pullulan and
release maltotriose are called pullulanases although
pullulan itself may not be the natural substrate. In
contrast, a glycogen debranching enzyme such GlgX,
homologous to this family, can release glucose at
alpha,1-6 linkages from glycogen first subjected to limit
degradation by phosphorylase. Characterized members of
this family include a surface-located pullulanase from
Streptococcus pneumoniae (PMID:11083842) and an
extracellular bifunctional amylase/pullulanase with
C-terminal pullulanase activity (PMID:8798645).
Length = 1111
Score = 29.4 bits (66), Expect = 1.9
Identities = 23/80 (28%), Positives = 33/80 (41%), Gaps = 5/80 (6%)
Query: 46 PPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEV 105
PPP + + APKP P P+ S P+++ + D + T S+ DE P + E
Sbjct: 1014 PPPPEHEPQAPKP--PTQDPDGSKPKDKVDPKDNKDPLTPPGSD-DENGETPK--GNEEK 1068
Query: 106 SEEDIDKSNEKKREAIHAYN 125
EE DK N
Sbjct: 1069 KEEQPDKGANLPNTGTKNSN 1088
>gnl|CDD|223708 COG0635, HemN, Coproporphyrinogen III oxidase and related Fe-S
oxidoreductases [Coenzyme metabolism].
Length = 416
Score = 29.2 bits (66), Expect = 1.9
Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 11/76 (14%)
Query: 114 NEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSSA---LLYAKRGQSYLQLSKPNACIRDC 170
N++ +A+ +D E+A A A K +S L+Y GQ+ + D
Sbjct: 159 NDEVLKALGRIHD--EEEAKEAVELARKAGFTSINIDLIYGLPGQT------LESLKEDL 210
Query: 171 SVALKINPDSAAAYKF 186
AL++ PD + Y
Sbjct: 211 EQALELGPDHLSLYSL 226
>gnl|CDD|234164 TIGR03302, OM_YfiO, outer membrane assembly lipoprotein YfiO.
Members of this protein family include YfiO, a
near-essential protein of the outer membrane, part of a
complex involved in protein insertion into the bacterial
outer membrane. Many proteins in this family are
annotated as ComL, based on the involvement of this
protein in natural transformation with exogenous DNA in
Neisseria gonorrhoeae. This protein family shows
sequence similarity to, but is distinct from, the
tol-pal system protein YbgF (TIGR02795) [Protein fate,
Protein and peptide secretion and trafficking].
Length = 235
Score = 28.7 bits (65), Expect = 1.9
Identities = 12/40 (30%), Positives = 18/40 (45%), Gaps = 3/40 (7%)
Query: 124 YNDGKFEDAVNAYSEAIKLNPSSALL-YA--KRGQSYLQL 160
Y G + +A+ A I+L+P+ YA RG S
Sbjct: 81 YKSGDYAEAIAAADRFIRLHPNHPDADYAYYLRGLSNYNQ 120
>gnl|CDD|235285 PRK04335, PRK04335, cell division protein ZipA; Provisional.
Length = 313
Score = 29.0 bits (65), Expect = 1.9
Identities = 17/62 (27%), Positives = 29/62 (46%)
Query: 37 FVESYGGTIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPI 96
V++ P + Q + P+ E+ PEP A + E PE + + EP+E +P
Sbjct: 113 PVQAQSQPQPEKVEPQVEEPRDEEVLEEPEPVAAKVPMAEVQPEEETEIEVDEPEEPKPE 172
Query: 97 PD 98
P+
Sbjct: 173 PE 174
>gnl|CDD|236624 PRK09782, PRK09782, bacteriophage N4 receptor, outer membrane
subunit; Provisional.
Length = 987
Score = 29.5 bits (66), Expect = 2.0
Identities = 21/72 (29%), Positives = 29/72 (40%), Gaps = 4/72 (5%)
Query: 149 LYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAYKFRGRAYRLLGKWEEAAVDLRNAC 208
LY +P + D + +L I P SA AY R YR A DLR A
Sbjct: 578 LYWWLHAQRYIPGQPELALNDLTRSLNIAP-SANAYVARATIYRQRHNVPAAVSDLRAAL 636
Query: 209 KIDFDE---QAD 217
+++ + QA
Sbjct: 637 ELEPNNSNYQAA 648
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 29.6 bits (66), Expect = 2.0
Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 8/73 (10%)
Query: 69 APEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEKKREAIHAYNDGK 128
A +E EE+D E D +P EE D + ++ D + + ++ N+
Sbjct: 4015 AADENKEEADAEKDEPMQDEDPLEENNTLD---EDIQQDDFSDLAEDDEKM-----NEDG 4066
Query: 129 FEDAVNAYSEAIK 141
FE+ V E+ +
Sbjct: 4067 FEENVQENEESTE 4079
Score = 28.0 bits (62), Expect = 6.0
Identities = 10/70 (14%), Positives = 29/70 (41%), Gaps = 2/70 (2%)
Query: 49 SKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEE 108
+ A+ D P ++ + E+ ++ +L + +E+ +V + E +E+
Sbjct: 4023 ADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDD--EKMNEDGFEENVQENEESTED 4080
Query: 109 DIDKSNEKKR 118
+ E ++
Sbjct: 4081 GVKSDEELEQ 4090
>gnl|CDD|225504 COG2956, COG2956, Predicted N-acetylglucosaminyl transferase
[Carbohydrate transport and metabolism].
Length = 389
Score = 28.9 bits (65), Expect = 2.0
Identities = 23/101 (22%), Positives = 35/101 (34%), Gaps = 10/101 (9%)
Query: 130 EDAVNAYSEAIKLNPSS-----ALLYAKRGQSYLQLSKPNACIRDCSVALKINPDSAAAY 184
E A++ +KL + A Y + Q L S + AL+ + A
Sbjct: 158 EKAIDVAERLVKLGGQTYRVEIAQFYCELAQQALASSDVDRARELLKKALQADKKCVRAS 217
Query: 185 KFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADEWLKEVTP 225
GR G +++A L EQ E+L EV
Sbjct: 218 IILGRVELAKGDYQKAVEALERV-----LEQNPEYLSEVLE 253
>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat.
Length = 44
Score = 26.4 bits (58), Expect = 2.1
Identities = 6/36 (16%), Positives = 12/36 (33%)
Query: 120 AIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQ 155
A G ++A+ A+ L+P +
Sbjct: 8 ARALLALGDLDEALALLRRALALDPDDPEALLLLAR 43
>gnl|CDD|218099 pfam04465, DUF499, Protein of unknown function (DUF499). Family of
uncharacterized hypothetical prokaryotic proteins.
Length = 1025
Score = 29.0 bits (65), Expect = 2.2
Identities = 25/86 (29%), Positives = 32/86 (37%), Gaps = 1/86 (1%)
Query: 58 PEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDID-KSNEK 116
PE + P S +EE E D E G I E EEE I D+ V I K +
Sbjct: 773 PESNYEIPLKSLVDEECEVKDEEAVLWGYIVEKKEEEFIVKGEFDLSVEPSSIKGKPGDP 832
Query: 117 KREAIHAYNDGKFEDAVNAYSEAIKL 142
+ A G + V S +L
Sbjct: 833 VEVRVKAKPLGDYRFTVELSSNFGEL 858
>gnl|CDD|227343 COG5010, TadD, Flp pilus assembly protein TadD, contains TPR
repeats [Intracellular trafficking and secretion].
Length = 257
Score = 28.6 bits (64), Expect = 2.4
Identities = 21/89 (23%), Positives = 39/89 (43%)
Query: 119 EAIHAYNDGKFEDAVNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCSVALKINP 178
+ + +G F +AV+ +A +L P+ + G + QL + + R AL++ P
Sbjct: 106 QGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALELAP 165
Query: 179 DSAAAYKFRGRAYRLLGKWEEAAVDLRNA 207
+ + G + L G E+A L A
Sbjct: 166 NEPSIANNLGMSLLLRGDLEDAETLLLPA 194
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 28.9 bits (64), Expect = 2.5
Identities = 22/104 (21%), Positives = 41/104 (39%), Gaps = 7/104 (6%)
Query: 51 AQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVIS-----EPDEEEPIPDVATDIEV 105
++ PK + EEE + + E I E D EE D + I+
Sbjct: 377 KKAAVPKGTSFYQAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDD 436
Query: 106 SEEDIDKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSSALL 149
+E + S E++ + + D + ED + + +L PS + +
Sbjct: 437 NEGFEELSPEEEERQLREFRDMEKEDR--EFPDEAELQPSESAI 478
>gnl|CDD|222484 pfam13974, YebO, YebO-like protein. This short protein is
uncharacterized. It seems likely to be of phage origin
as it is found in Bacteriophage HK022 Gp20 and
Bacteriophage HK97 Gp15. The protein is also found in a
variety of enteric bacteria.
Length = 80
Score = 26.9 bits (60), Expect = 3.1
Identities = 10/27 (37%), Positives = 12/27 (44%)
Query: 63 TTPEPSAPEEESEESDPELDNTGVISE 89
T E S +S D ELD +I E
Sbjct: 53 NTVEKSGAVNDSVVEDVELDFKRLIPE 79
>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
Provisional.
Length = 376
Score = 28.3 bits (63), Expect = 3.1
Identities = 18/104 (17%), Positives = 36/104 (34%), Gaps = 4/104 (3%)
Query: 46 PPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEV 105
P P D P ++ P + + E E ++P + T + + ++ A +
Sbjct: 75 PAPIDVPPDPPAAQEAVQPTAPPSAQSEPERNEPRPEETPIFAYSSGDQGGSKRAGHGDT 134
Query: 106 SEEDIDKSNEKKREAIHAYNDGKFEDAVNAYSEAIKLNPSSALL 149
D + A E+ ++ + +L PS A L
Sbjct: 135 DRRQDDNREDNSLPAGEV----SAENDLSIRMKPTELQPSRATL 174
>gnl|CDD|221958 pfam13176, TPR_7, Tetratricopeptide repeat.
Length = 36
Score = 25.7 bits (57), Expect = 3.5
Identities = 9/19 (47%), Positives = 11/19 (57%)
Query: 183 AYKFRGRAYRLLGKWEEAA 201
A GR YR LG +E+A
Sbjct: 1 ALSNLGRLYRKLGDYEKAI 19
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 28.1 bits (63), Expect = 3.8
Identities = 20/82 (24%), Positives = 32/82 (39%), Gaps = 1/82 (1%)
Query: 66 EPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEKKREAIHAYN 125
E ++ ++ + E + DE E I DV +D E+ D + EK A A
Sbjct: 89 EQGLESDDDDDEEEEWEVEEDEDSDDEGEWI-DVESDKEIESSDSEDEEEKDEAAKKAKE 147
Query: 126 DGKFEDAVNAYSEAIKLNPSSA 147
D E + EA + + A
Sbjct: 148 DSDEELSEEDEEEAAEEEEAEA 169
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 28.2 bits (63), Expect = 4.3
Identities = 16/64 (25%), Positives = 25/64 (39%), Gaps = 8/64 (12%)
Query: 57 KPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEK 116
PE P T + +APE E E + +E E T E +E+ ++ N K
Sbjct: 197 PPEPPEPTVDEAAPETEVEATPA--------AESPAEPAEETAETTPEETEDAPEEENNK 248
Query: 117 KREA 120
+
Sbjct: 249 PVKP 252
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 28.2 bits (63), Expect = 4.7
Identities = 12/47 (25%), Positives = 18/47 (38%)
Query: 71 EEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEKK 117
E E E D E + + +EE+ E E++ K KK
Sbjct: 30 EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 28.1 bits (63), Expect = 4.9
Identities = 13/31 (41%), Positives = 16/31 (51%), Gaps = 5/31 (16%)
Query: 65 PEPSAPEEESEESDPELDNTGVISEPDEEEP 95
P PS PE E EE + E + E +EEE
Sbjct: 298 PPPSPPEPEEEEEEEE-----EVPEEEEEEE 323
Score = 26.9 bits (60), Expect = 9.3
Identities = 12/35 (34%), Positives = 16/35 (45%), Gaps = 5/35 (14%)
Query: 46 PPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPE 80
PPPS + + + E E EEE EE + E
Sbjct: 298 PPPSPPEPEEEEEE-----EEEVPEEEEEEEEEEE 327
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 27.5 bits (62), Expect = 5.4
Identities = 21/92 (22%), Positives = 37/92 (40%), Gaps = 10/92 (10%)
Query: 63 TTPEPSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDI---EVSE--EDIDKSNEKK 117
+A ESE + +L++ E ++E+ + E E ++ + EK
Sbjct: 35 AAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEEARERFAELREQYEKT 94
Query: 118 REAI--HAYNDGKFEDAVNAYSE---AIKLNP 144
R+AI H K + A A +E +L P
Sbjct: 95 RKAIEKHGRGSKKAQKAREALAEEFMQFRLVP 126
>gnl|CDD|193565 cd08018, M20_Acy1_amhX_like, M20 Peptidase Aminoacylase 1 amhX_like
subfamily. Peptidase M20 family, Uncharacterized
subfamily of uncharacterized proteins predicted as
putative amidohydrolases, including the amhX gene
product from Bacillus subtilis. These are a class of
zinc binding homodimeric enzymes involved in hydrolysis
of N-acetylated proteins. N-terminal acetylation of
proteins is a widespread and highly conserved process
that is involved in protection and stability of
proteins. Several types of aminoacylases can be
distinguished on the basis of substrate specificity.
Aminoacylase 1 (ACY1) breaks down cytosolic aliphatic
N-acyl-alpha-amino acids (except L-aspartate),
especially N-acetyl-methionine and acetyl-glutamate into
L-amino acids and an acyl group. However, ACY1 can also
catalyze the reverse reaction, the synthesis of
acetylated amino acids. ACY1 may also play a role in
xenobiotic bioactivation as well as the inter-organ
processing of amino acid-conjugated xenobiotic
derivatives (S-substituted-N-acetyl-L-cysteine).
Length = 365
Score = 27.6 bits (62), Expect = 6.2
Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 6/44 (13%)
Query: 82 DNTGVISEPDEEEPIPDVA--TDIEVSEEDIDKSNEKKREAIHA 123
D TGV++E +P P VA D++ +++D E K A H+
Sbjct: 46 DCTGVVAEIGSGKPGPVVALRADMDALWQEVD--GEWK--ANHS 85
>gnl|CDD|216082 pfam00723, Glyco_hydro_15, Glycosyl hydrolases family 15. In
higher organisms this family is represented by
phosphorylase kinase subunits.
Length = 416
Score = 27.4 bits (61), Expect = 6.6
Identities = 17/60 (28%), Positives = 21/60 (35%), Gaps = 8/60 (13%)
Query: 30 ELKFFKDFVESYGGTIPPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISE 89
FF+ V S GT P S P K + A EE L G++SE
Sbjct: 338 ARGFFEKLVSSTVGT-PYSSSTFISIPLAVKEYADGFVKA----VEEYANSL---GLLSE 389
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 27.5 bits (60), Expect = 6.7
Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 3/69 (4%)
Query: 56 PKPEKPFTTPEPSAPEEESEESDPELDNTGVISEPDEE--EPIPDVATD-IEVSEEDIDK 112
P+P + T SAPE EE+ PE++ +P E P TD ++ E D +
Sbjct: 294 PQPTQTVETKPTSAPESTVEENLPEINQPTQAVQPTSETISTTPVEPTDQLKPKEVDQIQ 353
Query: 113 SNEKKREAI 121
KK + I
Sbjct: 354 EELKKTKEI 362
>gnl|CDD|182415 PRK10370, PRK10370, formate-dependent nitrite reductase complex
subunit NrfG; Provisional.
Length = 198
Score = 27.2 bits (60), Expect = 6.8
Identities = 17/73 (23%), Positives = 36/73 (49%), Gaps = 3/73 (4%)
Query: 115 EKKREA--IHAYNDGKFEDA-VNAYSEAIKLNPSSALLYAKRGQSYLQLSKPNACIRDCS 171
E +R A +H + + +A + A + I+ NP ++ +A G+ YL + + +
Sbjct: 38 EYQRLADPLHQFASQQTPEAQLQALQDKIRANPQNSEQWALLGEYYLWRNDYDNALLAYR 97
Query: 172 VALKINPDSAAAY 184
AL++ ++A Y
Sbjct: 98 QALQLRGENAELY 110
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 27.1 bits (60), Expect = 7.4
Identities = 19/80 (23%), Positives = 27/80 (33%), Gaps = 8/80 (10%)
Query: 45 IPPPSKAQSDAPKPEKPFTTP--------EPSAPEEESEESDPELDNTGVISEPDEEEPI 96
IP ++ P K TT + PE +EE +N + D E P
Sbjct: 45 IPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPK 104
Query: 97 PDVATDIEVSEEDIDKSNEK 116
E+ ID S+ K
Sbjct: 105 VQNTAAEVNHEDVIDLSDLK 124
>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
[Coenzyme metabolism].
Length = 400
Score = 27.3 bits (61), Expect = 8.4
Identities = 31/162 (19%), Positives = 51/162 (31%), Gaps = 25/162 (15%)
Query: 79 PELDNTGVISEPD-------EEEPIPDVATDIEVSEEDIDKSNEKKRE-----------A 120
P+L G++S+ + E + A D SE + R+ A
Sbjct: 211 PKLRKAGLLSDEEAARLEQQAWEGLLQQARDDNGSEGLKTWWKNQPRKLRNDPELVVAYA 270
Query: 121 IHAYNDGKFEDAVNAYSEAIKLN--PSSALLYAKRGQSYLQLSKPNACIRDCSVALKINP 178
G ++A +A+K P L + L+ P I+ LK +P
Sbjct: 271 ERLIRLGDHDEAQEIIEDALKRQWDPRLCRLIPR-----LRPGDPEPLIKAAEKWLKQHP 325
Query: 179 DSAAAYKFRGRAYRLLGKWEEAAVDLRNACKIDFDEQADEWL 220
+ GR W +A+ L A K+ L
Sbjct: 326 EDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSASDYAEL 367
>gnl|CDD|233927 TIGR02557, HpaP, type III secretion protein HpaP. This family of
genes is always found in type III secretion operons,
althought its function in the processes of secretion
and virulence is unclear. Hpa stands for Hrp-associated
gene, where Hrp stands for hypersensitivity response
and virulence.
Length = 201
Score = 26.8 bits (59), Expect = 8.5
Identities = 9/34 (26%), Positives = 12/34 (35%)
Query: 46 PPPSKAQSDAPKPEKPFTTPEPSAPEEESEESDP 79
P D +PE T P E ++DP
Sbjct: 41 RPEPPPPCDEDRPEPRADTRASDPPPEAPTDADP 74
>gnl|CDD|198111 smart01043, BTAD, Bacterial transcriptional activator domain.
Found in the DNRI/REDD/AFSR family of regulators. This
region of AFSR along with the C terminal region is
capable of independently directing actinorhodin
production. This family contains TPR repeats.
Length = 145
Score = 26.5 bits (59), Expect = 8.7
Identities = 11/41 (26%), Positives = 15/41 (36%), Gaps = 1/41 (2%)
Query: 181 AAAYKFRGRAYRLLGKWEEAAVDLRNACKID-FDEQADEWL 220
A + A LG+ EEA L +D E+ L
Sbjct: 61 LEALEALAEALLALGRHEEALALLERLLALDPLRERLHRLL 101
>gnl|CDD|188409 TIGR03894, chp_P_marinus_1, conserved hypothetical protein,
TIGR03894 family. This protein family is restricted to
the Prochlorococcus and Synechococcus lineages of the
Cyanobacteria, and is sporadic in those lineages.
Members average 100 amino acids in length, including a
30-residue, highly polar, low complexity region
sandwiched between an N-terminal region of about 60
residues and a C-terminal [KR]VVR[KR]RS motif, both
well-conserved. The function is unknown [Hypothetical
proteins, Conserved].
Length = 95
Score = 25.9 bits (57), Expect = 9.2
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 89 EPDEEEPIPDVATDIEVSEEDIDKSNEKKR 118
EPD+EE + + E SE + + S E R
Sbjct: 60 EPDDEENEGEDSETKEDSESESESSREGPR 89
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 27.0 bits (60), Expect = 9.2
Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 10/72 (13%)
Query: 42 GGTIPPPSKAQSDAPKPEKPFTTPEPSAP--EEESEESDPELDNTGVISEPDEEEPIPDV 99
GG K K K T + ++E + SD + D + S+ D+
Sbjct: 219 GGGAMKVLKQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDS------ 272
Query: 100 ATDIEVSEEDID 111
D +VS+ED +
Sbjct: 273 --DDDVSDEDGE 282
>gnl|CDD|233779 TIGR02208, lipid_A_msbB, lipid A biosynthesis
(KDO)2-(lauroyl)-lipid IVA acyltransferase. This family
consists of MsbB in E. coli and closely related proteins
in other species. MsbB is homologous to HtrB (TIGR02207)
and acts immediately after it in the biosynthesis of
KDO-2 lipid A (also called Re LPS and Re endotoxin).
These two enzymes act after creation of KDO-2 lipid IV-A
by addition of the KDO sugars [Cell envelope,
Biosynthesis and degradation of surface polysaccharides
and lipopolysaccharides].
Length = 305
Score = 26.7 bits (59), Expect = 9.8
Identities = 10/43 (23%), Positives = 20/43 (46%), Gaps = 6/43 (13%)
Query: 10 NKLQLFLGAVKKNPAIFYLPELKF------FKDFVESYGGTIP 46
++ L ++K+ + +YLP+ F F +Y T+P
Sbjct: 180 AGIKALLASLKRGESGYYLPDEDHGPEQSVFVPFFATYKATLP 222
>gnl|CDD|234730 PRK00339, minC, septum formation inhibitor; Reviewed.
Length = 249
Score = 26.7 bits (59), Expect = 9.8
Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 2/44 (4%)
Query: 47 PPSKAQSDAPKPEKPFTTPEPSAPEEESEESDPELDNTGVISEP 90
PPS A+ +P PEP+ E+ E P + T VI+ P
Sbjct: 108 PPSGARERPLEPSPAEAEPEPAKAPEKPPE--PAVRPTRVITTP 149
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 26.5 bits (59), Expect = 9.9
Identities = 14/53 (26%), Positives = 22/53 (41%), Gaps = 1/53 (1%)
Query: 67 PSAPEEESEESDPELDNTGVISEPDEEEPIPDVATDIEVSEEDIDKSNEKKRE 119
P P + EE + E E+ + D D E+ EE+ D+ E+ R
Sbjct: 15 PKPPSPKEEEEEALELAIQEAHENALEKKLLDE-LDEELDEEEDDRFLEEYRR 66
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.131 0.375
Gapped
Lambda K H
0.267 0.0733 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,257,609
Number of extensions: 1169811
Number of successful extensions: 2141
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1951
Number of HSP's successfully gapped: 252
Length of query: 231
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 137
Effective length of database: 6,768,326
Effective search space: 927260662
Effective search space used: 927260662
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 57 (25.7 bits)