RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8160
(324 letters)
>gnl|CDD|225504 COG2956, COG2956, Predicted N-acetylglucosaminyl transferase
[Carbohydrate transport and metabolism].
Length = 389
Score = 43.5 bits (103), Expect = 8e-05
Identities = 49/208 (23%), Positives = 89/208 (42%), Gaps = 28/208 (13%)
Query: 32 KAIKTFQQLLYVDPSYLRANEVHLRLGLMFKVNNEYDSALK-HLTLCLIDASPCTFTKLE 90
KA+ F ++L DP E HL LG +F+ E D A++ H TL +++ TF +
Sbjct: 53 KAVDLFLEMLQEDPETF---EAHLTLGNLFRSRGEVDRAIRIHQTL--LESPDLTFEQRL 107
Query: 91 GLMFKVNNEYDSALKHLTLCLIDASPCTFTKL--------EVRFHIAHLHEVQRKYKTAK 142
+ ++ +Y +A L+D + F +L + ++++ R+++ A
Sbjct: 108 LALQQLGRDYMAA------GLLDRAEDIFNQLVDEGEFAEGALQQLLNIYQATREWEKAI 161
Query: 143 DSYEQLLKEDDLPVHLK-ADICRQLGWMYHCIDTLGEKSHRETLAIHCLQKSIEADPKSG 201
D E+L+K ++ A +L L A L+K+++AD K
Sbjct: 162 DVAERLVKLGGQTYRVEIAQFYCELAQQA-----LASSDVDR--ARELLKKALQADKKCV 214
Query: 202 QSLYLLGRCFAAVGKVHDAFLAYRNSVE 229
++ +LGR A G A A +E
Sbjct: 215 RASIILGRVELAKGDYQKAVEALERVLE 242
>gnl|CDD|226687 COG4235, COG4235, Cytochrome c biogenesis factor [Posttranslational
modification, protein turnover, chaperones].
Length = 287
Score = 40.4 bits (95), Expect = 6e-04
Identities = 16/53 (30%), Positives = 29/53 (54%)
Query: 190 LQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYRNSVEKSEGNADTWCSIG 242
L+ ++ +P + LLGR + A+G+ DA LAYRN++ + N + +
Sbjct: 145 LETHLQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLA 197
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 37.0 bits (86), Expect = 0.002
Identities = 15/58 (25%), Positives = 29/58 (50%)
Query: 186 AIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYRNSVEKSEGNADTWCSIGN 243
A+ +K++E DP + + Y L + +GK +A Y ++E NA + ++G
Sbjct: 19 ALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGL 76
Score = 34.7 bits (80), Expect = 0.011
Identities = 26/104 (25%), Positives = 47/104 (45%), Gaps = 11/104 (10%)
Query: 126 FHIAHLHEVQRKYKTAKDSYEQLLKEDDLPVHLKADICRQLGWMYHCIDTLGEKSHRETL 185
++ +L+ Y A + YE+ L+ D AD L Y+ K +
Sbjct: 4 LNLGNLYYKLGDYDEALEYYEKALELDPDN----ADAYYNLAAAYY-------KLGKYEE 52
Query: 186 AIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYRNSVE 229
A+ +K++E DP + ++ Y LG + +GK +A AY ++E
Sbjct: 53 ALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96
Score = 33.9 bits (78), Expect = 0.019
Identities = 16/81 (19%), Positives = 34/81 (41%), Gaps = 4/81 (4%)
Query: 32 KAIKTFQQLLYVDPSYLRANEVHLRLGLMFKVNNEYDSALKHLTLCL-IDASPCTFTKLE 90
+A++ +++ L +DP + + L + +Y+ AL+ L +D
Sbjct: 18 EALEYYEKALELDPDN---ADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNL 74
Query: 91 GLMFKVNNEYDSALKHLTLCL 111
GL + +Y+ AL+ L
Sbjct: 75 GLAYYKLGKYEEALEAYEKAL 95
Score = 32.4 bits (74), Expect = 0.073
Identities = 20/102 (19%), Positives = 42/102 (41%), Gaps = 9/102 (8%)
Query: 52 EVHLRLGLMFKVNNEYDSALKHLTLCLIDASPC---TFTKLEGLMFKVNNEYDSALKHLT 108
E L LG ++ +YD AL++ ++ P + L + +Y+ AL+
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYE-KALELDPDNADAYYNL-AAAYYKLGKYEEALEDYE 58
Query: 109 LCLIDASPCTFTKLEVRFHIAHLHEVQRKYKTAKDSYEQLLK 150
L + +++ + KY+ A ++YE+ L+
Sbjct: 59 KALELDP----DNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96
Score = 29.3 bits (66), Expect = 0.80
Identities = 26/112 (23%), Positives = 48/112 (42%), Gaps = 15/112 (13%)
Query: 88 KLEGLMFKVNNEYDSALKHLTLCLIDASPCTFTKLEVRFHIAHLHEVQRKYKTAKDSYEQ 147
G ++ +YD AL++ ++ P + +++A + KY+ A + YE+
Sbjct: 4 LNLGNLYYKLGDYDEALEYYE-KALELDPDNA---DAYYNLAAAYYKLGKYEEALEDYEK 59
Query: 148 LLKEDDLPVHLKADICRQLGWMYHCIDTLGEKSHRETLAIHCLQKSIEADPK 199
L+ D P + KA LG Y K + A+ +K++E DP
Sbjct: 60 ALELD--PDNAKA--YYNLGLAY-------YKLGKYEEALEAYEKALELDPN 100
>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
Length = 389
Score = 35.9 bits (84), Expect = 0.019
Identities = 46/211 (21%), Positives = 80/211 (37%), Gaps = 40/211 (18%)
Query: 32 KAIKTFQQLLYVDPSYLRANEVHLRLGLMFKVNNEYDSALK-HLTLCLIDASPC-TFTKL 89
KAI F ++L VDP + E+HL LG +F+ E D A++ H L + P T +
Sbjct: 53 KAIDLFIEMLKVDPETV---ELHLALGNLFRRRGEVDRAIRIHQNL---LSRPDLTREQR 106
Query: 90 E-------------GLMFKVNNEYDSALKHLTLCLIDASPCTFTKLEVRFHIAHLHEVQR 136
GL D A + L+D + +++ ++
Sbjct: 107 LLALQELGQDYLKAGL-------LDRAEELFLQ-LVDEGD---FAEGALQQLLEIYQQEK 155
Query: 137 KYKTAKDSYEQLLKEDDLPVHLKADICRQLGWMYHCIDTLGEKSHRET-LAIHCLQKSIE 195
++ A D E+L K + ++ ++C + + A L+K++
Sbjct: 156 DWQKAIDVAERLEKLGGDSLR------VEIA-HFYCELAQQALARGDLDAARALLKKALA 208
Query: 196 ADPKSGQSLYLLGRCFAAVGKVHDAFLAYRN 226
ADP+ ++ LLG A G A A
Sbjct: 209 ADPQCVRASILLGDLALAQGDYAAAIEALER 239
>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat.
Length = 34
Score = 32.2 bits (74), Expect = 0.022
Identities = 8/32 (25%), Positives = 17/32 (53%)
Query: 189 CLQKSIEADPKSGQSLYLLGRCFAAVGKVHDA 220
+K++E DP + ++ Y L +G+ +A
Sbjct: 1 LYEKALELDPNNAEAYYNLALLLLNLGQYDEA 32
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 32.3 bits (74), Expect = 0.039
Identities = 11/39 (28%), Positives = 21/39 (53%)
Query: 186 AIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAY 224
AI +K++E DP + ++ Y L + +GK ++ L
Sbjct: 22 AIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALED 60
Score = 26.5 bits (59), Expect = 4.6
Identities = 14/62 (22%), Positives = 27/62 (43%), Gaps = 3/62 (4%)
Query: 202 QSLYLLGRCFAAVGKVHDAFLAYRNSVEKSEGNADTWCSIGNKDFSNNSNREDYHQALNK 261
++L LG +G +A AY ++E NA+ + ++ + +DY +AL
Sbjct: 4 EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLAL---AYLKLGKDYEEALED 60
Query: 262 YR 263
Sbjct: 61 LE 62
>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat.
Length = 65
Score = 30.7 bits (70), Expect = 0.12
Identities = 10/50 (20%), Positives = 23/50 (46%)
Query: 186 AIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYRNSVEKSEGNA 235
A+ L+ ++ P + ++L LLG G++ +A R ++ +
Sbjct: 16 ALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPDDP 65
Score = 26.5 bits (59), Expect = 3.7
Identities = 8/45 (17%), Positives = 12/45 (26%), Gaps = 3/45 (6%)
Query: 126 FHIAHLHEVQRKYKTAKDSYEQLLKEDDLPV---HLKADICRQLG 167
+A Y A + E L L L + + G
Sbjct: 1 LALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQG 45
>gnl|CDD|118184 pfam09652, Cas_VVA1548, Putative CRISPR-associated protein
(Cas_VVA1548). This entry represents a conserved region
of about 95 amino acids found exclusively in species
with CRISPRs (Clustered Regularly Interspaced Short
Palindromic Repeats). In all bacterial species that
contain this entry, the genes encoding the proteins are
in the midst of a cluster of cas (CRISPR-associated)
genes.
Length = 93
Score = 29.3 bits (66), Expect = 0.70
Identities = 15/54 (27%), Positives = 21/54 (38%), Gaps = 16/54 (29%)
Query: 122 LEVRFHIAHLHEVQRKYKTAKDSYEQLLKED----DLPVHLKADICRQLGWMYH 171
L+V + HL Q+ D LPVHL AD+C + +H
Sbjct: 18 LDVDRFVDHLD------------PAQVSAGDKVIGTLPVHLAADVCEKGARYFH 59
>gnl|CDD|187874 cd09743, Csx16_III-U, CRISPR/Cas system-associated protein Csx16.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Small protein often seen in proximity to Csx1 (COG1517)
family; also known as VVA1548 family.
Length = 90
Score = 29.3 bits (66), Expect = 0.71
Identities = 16/54 (29%), Positives = 22/54 (40%), Gaps = 16/54 (29%)
Query: 122 LEVRFHIAHLHEVQRKYKTAKDSYEQLLKED----DLPVHLKADICRQLGWMYH 171
L+V + HL EQ+ D LPVHL AD+C + +H
Sbjct: 18 LDVDRFVIHL------------DPEQISAGDKVIGTLPVHLAADVCEKGARYFH 59
>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
Length = 291
Score = 31.0 bits (68), Expect = 0.79
Identities = 28/123 (22%), Positives = 54/123 (43%), Gaps = 6/123 (4%)
Query: 32 KAIKTFQQLLYVDPSYLRANEVHLRLGLMFKVNNEYDSALKHLT--LCLIDASPCTFTKL 89
+A++ +++ L +DP E L LG + + Y+ AL+ L L L
Sbjct: 148 EALELYEKALELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLN 207
Query: 90 EGLMFKVNNEYDSALKHLTLCLIDASPCTFTKLEVRFHIAHLHEVQRKYKTAKDSYEQLL 149
GL++ +Y+ AL++ L E +++A L +Y+ A ++ E+ L
Sbjct: 208 LGLLYLKLGKYEEALEYYEKALELDPD----NAEALYNLALLLLELGRYEEALEALEKAL 263
Query: 150 KED 152
+ D
Sbjct: 264 ELD 266
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 30.8 bits (70), Expect = 1.1
Identities = 24/116 (20%), Positives = 41/116 (35%), Gaps = 16/116 (13%)
Query: 124 VRFHIAHLHEVQRKYKTAKDSYEQLLK--EDDLPVHLKADICRQLGWMYHCIDTLGEKSH 181
+R +A L+ Q+ Y A Y+ ++K D+ V L W+Y E
Sbjct: 772 LRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVL------NNLAWLYL------ELKD 819
Query: 182 RETLAIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYRNSVEKSEGNADT 237
L +++++ P L LG G+ A R +V + A
Sbjct: 820 PRALEY--AERALKLAPNIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAAAI 873
Score = 28.5 bits (64), Expect = 5.6
Identities = 16/80 (20%), Positives = 30/80 (37%), Gaps = 4/80 (5%)
Query: 186 AIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYRNSVEKSEGNADTWCSIGNKD 245
A+ +K + P S +L LL +A + A + + ++E N + IG
Sbjct: 620 AVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEA--QIGLAQ 677
Query: 246 FSNNSNREDYHQALNKYRDL 265
+ R + A + L
Sbjct: 678 LLLAAKRTE--SAKKIAKSL 695
>gnl|CDD|240089 cd04738, DHOD_2_like, Dihydroorotate dehydrogenase (DHOD) class 2.
DHOD catalyzes the oxidation of (S)-dihydroorotate to
orotate. This is the fourth step and the only redox
reaction in the de novo biosynthesis of UMP, the
precursor of all pyrimidine nucleotides. DHOD requires
FMN as co-factor. DHOD divides into class 1 and class 2
based on their amino acid sequences, their cellular
location and their natural electron acceptor used to
reoxidize the flavin group. Members of class 1 are
cytosolic enzymes and multimers, while class 2 enzymes
are membrane associated, monomeric and use respiratory
quinones as their physiological electron acceptors.
Length = 327
Score = 29.8 bits (68), Expect = 1.8
Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 3/35 (8%)
Query: 240 SIG-NKDFSNNSNR-EDYHQALNKYRDLGDFLVIN 272
+IG NKD + + EDY + K D+LV+N
Sbjct: 134 NIGKNKD-TPLEDAVEDYVIGVRKLGPYADYLVVN 167
>gnl|CDD|198111 smart01043, BTAD, Bacterial transcriptional activator domain.
Found in the DNRI/REDD/AFSR family of regulators. This
region of AFSR along with the C terminal region is
capable of independently directing actinorhodin
production. This family contains TPR repeats.
Length = 145
Score = 28.8 bits (65), Expect = 2.2
Identities = 13/44 (29%), Positives = 20/44 (45%)
Query: 182 RETLAIHCLQKSIEADPKSGQSLYLLGRCFAAVGKVHDAFLAYR 225
R A+ L++ + DP + LL R G+ +A AYR
Sbjct: 76 RHEEALALLERLLALDPLRERLHRLLMRALYRAGRRAEALRAYR 119
>gnl|CDD|215560 PLN03076, PLN03076, ARF guanine nucleotide exchange factor
(ARF-GEF); Provisional.
Length = 1780
Score = 28.6 bits (64), Expect = 5.8
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 7/41 (17%)
Query: 186 AIHCLQKSI-------EADPKSGQSLYLLGRCFAAVGKVHD 219
A+ C+QK I EADP G LL + +V K HD
Sbjct: 106 ALDCIQKLIAHGYLRGEADPSGGPEALLLAKLIESVCKCHD 146
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.134 0.402
Gapped
Lambda K H
0.267 0.0733 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,691,002
Number of extensions: 1441575
Number of successful extensions: 1333
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1325
Number of HSP's successfully gapped: 43
Length of query: 324
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 227
Effective length of database: 6,635,264
Effective search space: 1506204928
Effective search space used: 1506204928
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.5 bits)