RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy13998
(319 letters)
>gnl|CDD|225926 COG3391, COG3391, Uncharacterized conserved protein [Function
unknown].
Length = 381
Score = 64.9 bits (158), Expect = 8e-12
Identities = 63/243 (25%), Positives = 100/243 (41%), Gaps = 40/243 (16%)
Query: 55 FTWPRGIAVGPDNSIV-VADSSNHRVQVCFPHFDLKTNCV--FLAFTW-PRGIAVGPDN- 109
+P G+AV P + V V ++ V V D TN V + P G+AV PD
Sbjct: 73 GVYPAGVAVNPAGNKVYVTTGDSNTVSV----IDTATNTVLGSIPVGLGPVGLAVDPDGK 128
Query: 110 SIVVADS--SNHRVQVFQSDGTFVGKFGSMGNKAGQLEHPHYIAVS-NTNRVIVSDSNNH 166
+ VA++ N+ V V + V +GN P +AV + N+V V++S+++
Sbjct: 129 YVYVANAGNGNNTVSVIDAATNKVTATIPVGN------TPTGVAVDPDGNKVYVTNSDDN 182
Query: 167 RVQIFDVNGRVFQSDGTFVGKFGSMGNKAGQLEHPHYIAVS-NTNRVIVSDSNNHRVQIF 225
V + D +G GS+G+ G P IAV + NRV V++ + +
Sbjct: 183 TVSVIDTSGNSV--------VRGSVGSLVGVGTGPAGIAVDPDGNRVYVANDGSGSNNVL 234
Query: 226 DVNGRVITSFGSEGSEEGQLKFPRGVAVDDQG-----------YISVGDSGNNRIQIFTP 274
++ ++ PRGVAVD G +SV D +R+ P
Sbjct: 235 KIDTATGNVTATDLPVGSG--APRGVAVDPAGKAAYVANSQGGTVSVIDGATDRVVKTGP 292
Query: 275 DGQ 277
G
Sbjct: 293 TGN 295
Score = 46.4 bits (110), Expect = 1e-05
Identities = 29/125 (23%), Positives = 54/125 (43%), Gaps = 17/125 (13%)
Query: 200 HPHYIAVSN-TNRVIVSDSNNHRVQIFDVNGRVITSFGSEGSEEGQLKFPRGVAVDDQG- 257
+P +AV+ N+V V+ +++ V + D + G P G+AVD G
Sbjct: 75 YPAGVAVNPAGNKVYVTTGDSNTVSVIDTATNTVLGSIPVGL------GPVGLAVDPDGK 128
Query: 258 --YISVGDSGNNRIQIFTPDGQFLRAFGCWGSGDGEFKGLEGVAVMSNGN-ILVCDRENH 314
Y++ +GNN + + + A G+ GVAV +GN + V + +++
Sbjct: 129 YVYVANAGNGNNTVSVIDAATNKVTATIPVGNTPT------GVAVDPDGNKVYVTNSDDN 182
Query: 315 RIQVF 319
+ V
Sbjct: 183 TVSVI 187
Score = 39.1 bits (91), Expect = 0.002
Identities = 37/142 (26%), Positives = 47/142 (33%), Gaps = 23/142 (16%)
Query: 58 PRGIAVGPD-NSIVVADSSNHRVQVCFPHFDLKTNCVFLA--------FTWPRGIAVGPD 108
P G+AV PD N + V +S ++ V V D N V T P GIAV PD
Sbjct: 162 PTGVAVDPDGNKVYVTNSDDNTVSV----IDTSGNSVVRGSVGSLVGVGTGPAGIAVDPD 217
Query: 109 NSIVVA---DSSNHRVQVFQSDGTFVGKFGSMGNKA---GQLEHPH----YIAVSNTNRV 158
+ V S ++ V + V G P Y+A S V
Sbjct: 218 GNRVYVANDGSGSNNVLKIDTATGNVTATDLPVGSGAPRGVAVDPAGKAAYVANSQGGTV 277
Query: 159 IVSDSNNHRVQIFDVNGRVFQS 180
V D RV G
Sbjct: 278 SVIDGATDRVVKTGPTGNEALG 299
>gnl|CDD|215497 PLN02919, PLN02919, haloacid dehalogenase-like hydrolase family
protein.
Length = 1057
Score = 60.6 bits (147), Expect = 3e-10
Identities = 51/211 (24%), Positives = 85/211 (40%), Gaps = 30/211 (14%)
Query: 96 AFTWPRGIAVGPD-NSIVVADSSNHRVQVFQ-----------SDGTFVGKFGSMGNKAG- 142
+F P GI++ PD + +ADS + ++ D TF G+ G
Sbjct: 738 SFAQPSGISLSPDLKELYIADSESSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGV 797
Query: 143 ----QLEHPHYIAVSNTNRVIVSDSNNHRVQIFD-VNGRVFQSDGTFVGKFGSMGNKAGQ 197
L+HP + + ++ V+DS NH+++ D RV GT F Q
Sbjct: 798 GSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQ 857
Query: 198 LEHPHYIAVSNTNRVIVSDSNNHRVQIFDVN---GRVITSFGSEGSE-----EGQLKFPR 249
L P +A+ R+ V+D+NN ++ D+N I + +G + LK R
Sbjct: 858 LSEPAGLALGENGRLFVADTNNSLIRYLDLNKGEAAEILTLELKGVQPPRPKSKSLKRLR 917
Query: 250 GVAVDDQGYISV----GDSGNNRIQIFTPDG 276
+ D I V G+ +++I P G
Sbjct: 918 RRSSADTQVIKVDGVTSLEGDLQLKISLPPG 948
Score = 48.3 bits (115), Expect = 3e-06
Identities = 50/196 (25%), Positives = 88/196 (44%), Gaps = 25/196 (12%)
Query: 103 IAVGPDNSIVVADSSNHRVQVFQSDGTFVGKFGSMGNKAGQLEHPHYIAVS-NTNRVIVS 161
IA+ + I + S+ +VF DG GS G + P I++S + + ++
Sbjct: 699 IAMAGQHQIWEYNISDGVTRVFSGDGYERNLNGSSG-TSTSFAQPSGISLSPDLKELYIA 757
Query: 162 DSNNHRVQIFDVN---------GRVFQSDGTFVGKFGS---MGNKAGQLEHPHYIAVSNT 209
DS + ++ D+ G SD F KFG +G++ L+HP + +
Sbjct: 758 DSESSSIRALDLKTGGSRLLAGGDPTFSDNLF--KFGDHDGVGSEV-LLQHPLGVLCAKD 814
Query: 210 NRVIVSDSNNHRVQIFD-VNGRVIT--SFGSEGSEEG-----QLKFPRGVAVDDQGYISV 261
++ V+DS NH+++ D RV T G G ++G QL P G+A+ + G + V
Sbjct: 815 GQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFV 874
Query: 262 GDSGNNRIQIFTPDGQ 277
D+ N+ I+ +
Sbjct: 875 ADTNNSLIRYLDLNKG 890
Score = 42.5 bits (100), Expect = 3e-04
Identities = 60/252 (23%), Positives = 104/252 (41%), Gaps = 42/252 (16%)
Query: 108 DNSIVVADSSNHRVQVFQSDGTFVGKFGSMGN---KAGQLE-----HPHYIAV-SNTNRV 158
+N + ++DS+++R+ V DG F+ + GS G + G E P +A + N +
Sbjct: 579 NNRLFISDSNHNRIVVTDLDGNFIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLL 638
Query: 159 IVSDSNNHRVQIFD-VNGRVFQSDGTFVGKFGSMGNKAGQLE----------HPH----Y 203
V+D+ NH ++ D VN V G G K G + P Y
Sbjct: 639 YVADTENHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVY 698
Query: 204 IAVSNTNRVIVSDSNNHRVQIFDVNGRVITSFGSEGSEEGQLKFPRGVAVD---DQGYIS 260
IA++ +++ + ++ ++F +G GS G+ + P G+++ + YI+
Sbjct: 699 IAMAGQHQIWEYNISDGVTRVFSGDGYERNLNGSSGTSTSFAQ-PSGISLSPDLKELYIA 757
Query: 261 VGDSGNNRIQIFTPDGQFLRAFG-------CWGSGDGEFKGLE-------GVAVMSNGNI 306
+S + R G L A G + GD + G E GV +G I
Sbjct: 758 DSESSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQI 817
Query: 307 LVCDRENHRIQV 318
V D NH+I+
Sbjct: 818 YVADSYNHKIKK 829
Score = 37.9 bits (88), Expect = 0.006
Identities = 29/96 (30%), Positives = 40/96 (41%), Gaps = 25/96 (26%)
Query: 43 FKIGSR---GSEPGCFTWPRGIAVGPDNSIVVADSSNHRVQVCFPHFDLKTNCVF-LA-- 96
FK G GSE P G+ D I VADS NH+++ D T V LA
Sbjct: 789 FKFGDHDGVGSE-VLLQHPLGVLCAKDGQIYVADSYNHKIKK----LDPATKRVTTLAGT 843
Query: 97 --------------FTWPRGIAVGPDNSIVVADSSN 118
+ P G+A+G + + VAD++N
Sbjct: 844 GKAGFKDGKALKAQLSEPAGLALGENGRLFVADTNN 879
>gnl|CDD|110440 pfam01436, NHL, NHL repeat. The NHL (NCL-1, HT2A and LIN-41)
repeat is found in multiple tandem copies. It is about
40 residues long and resembles the WD repeat pfam00400.
The repeats have a catalytic activity in bovine PAM
(peptidyl-glycine alpha-amidating monooxygenase),
proteolysis has shown that the
Peptidyl-alpha-hydroxyglycine alpha-amidating lyase
(PAL) activity is localised to the repeats. Human E3
ubiquitin-protein ligase TRIM32 interacts with the
activation domain of Tat. This interaction is me diated
by the NHL repeats.
Length = 28
Score = 45.5 bits (109), Expect = 4e-07
Identities = 17/28 (60%), Positives = 19/28 (67%)
Query: 97 FTWPRGIAVGPDNSIVVADSSNHRVQVF 124
F P G+AV D + VADS NHRVQVF
Sbjct: 1 FNRPHGVAVDSDGDVYVADSENHRVQVF 28
Score = 42.8 bits (102), Expect = 3e-06
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 55 FTWPRGIAVGPDNSIVVADSSNHRVQV 81
F P G+AV D + VADS NHRVQV
Sbjct: 1 FNRPHGVAVDSDGDVYVADSENHRVQV 27
Score = 42.4 bits (101), Expect = 4e-06
Identities = 16/28 (57%), Positives = 20/28 (71%)
Query: 292 FKGLEGVAVMSNGNILVCDRENHRIQVF 319
F GVAV S+G++ V D ENHR+QVF
Sbjct: 1 FNRPHGVAVDSDGDVYVADSENHRVQVF 28
Score = 40.9 bits (97), Expect = 2e-05
Identities = 14/28 (50%), Positives = 18/28 (64%)
Query: 144 LEHPHYIAVSNTNRVIVSDSNNHRVQIF 171
PH +AV + V V+DS NHRVQ+F
Sbjct: 1 FNRPHGVAVDSDGDVYVADSENHRVQVF 28
Score = 40.9 bits (97), Expect = 2e-05
Identities = 14/28 (50%), Positives = 18/28 (64%)
Query: 198 LEHPHYIAVSNTNRVIVSDSNNHRVQIF 225
PH +AV + V V+DS NHRVQ+F
Sbjct: 1 FNRPHGVAVDSDGDVYVADSENHRVQVF 28
Score = 37.4 bits (88), Expect = 2e-04
Identities = 14/28 (50%), Positives = 18/28 (64%)
Query: 245 LKFPRGVAVDDQGYISVGDSGNNRIQIF 272
P GVAVD G + V DS N+R+Q+F
Sbjct: 1 FNRPHGVAVDSDGDVYVADSENHRVQVF 28
>gnl|CDD|219847 pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region. This
family describes a region that is found in proteins
expressed by a variety of eukaryotic and prokaryotic
species. These proteins include various enzymes, such as
senescence marker protein 30 (SMP-30), gluconolactonase
and luciferin-regenerating enzyme (LRE). SMP-30 is known
to hydrolyse diisopropyl phosphorofluoridate in the
liver, and has been noted as having sequence similarity,
in the region described in this family, with PON1 and
LRE.
Length = 245
Score = 41.1 bits (97), Expect = 3e-04
Identities = 24/86 (27%), Positives = 36/86 (41%), Gaps = 15/86 (17%)
Query: 204 IAVSNTNR-VIVSDSNNHRVQIFDV--------NGRVITSFGSEGSEEGQLKFPRGVAVD 254
+A S + + +DS R+ FD N RV F E P G+AVD
Sbjct: 138 LAWSPDGKTLYFADSPTRRIWAFDYDADGGLISNRRVFADFKDGDGE------PDGMAVD 191
Query: 255 DQGYISVGDSGNNRIQIFTPDGQFLR 280
+G + V G ++ + PDG+ L
Sbjct: 192 AEGNVWVARWGGGKVVRYDPDGKLLG 217
Score = 34.9 bits (81), Expect = 0.032
Identities = 22/86 (25%), Positives = 34/86 (39%), Gaps = 12/86 (13%)
Query: 58 PRGIAVGPDNSIV-VADSSNHRVQVCFPHFDLKTNC-----VFLAFT----WPRGIAVGP 107
G+A PD + ADS R+ +D VF F P G+AV
Sbjct: 135 SNGLAWSPDGKTLYFADSPTRRIWAF--DYDADGGLISNRRVFADFKDGDGEPDGMAVDA 192
Query: 108 DNSIVVADSSNHRVQVFQSDGTFVGK 133
+ ++ VA +V + DG +G+
Sbjct: 193 EGNVWVARWGGGKVVRYDPDGKLLGE 218
Score = 29.9 bits (68), Expect = 1.6
Identities = 27/132 (20%), Positives = 39/132 (29%), Gaps = 19/132 (14%)
Query: 62 AVGPDNSIVVADSSNHRVQVCFP------HFDLKTNCVFLAFTWPRGIAVGPDNSIV-VA 114
V PD P D K V T G+A PD + A
Sbjct: 91 KVDPDGRFWFGTMGFDIAPGGEPGALYRLDPDGKVERVLDGITISNGLAWSPDGKTLYFA 150
Query: 115 DSSNHRVQVFQ-SDGTFVGKFGSMGNK------AGQLEHPHYIAVSNTNRVIVSDSNNHR 167
DS R+ F G + N+ P +AV V V+ +
Sbjct: 151 DSPTRRIWAFDYDADG-----GLISNRRVFADFKDGDGEPDGMAVDAEGNVWVARWGGGK 205
Query: 168 VQIFDVNGRVFQ 179
V +D +G++
Sbjct: 206 VVRYDPDGKLLG 217
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 40.4 bits (95), Expect = 7e-04
Identities = 50/273 (18%), Positives = 101/273 (36%), Gaps = 51/273 (18%)
Query: 59 RGIAVGPDNSIVVADSSNHRVQVCFPHFDLKTNCVFLAFT----WPRGIAVGPDNSIVVA 114
+A PD ++ S + ++V +DL+T + R +A D + + +
Sbjct: 13 TCVAFSPDGKLLATGSGDGTIKV----WDLETGELLRTLKGHTGPVRDVAASADGTYLAS 68
Query: 115 DSSNHRVQVFQ-SDGTFVGKFGSMGNKAGQLEHPHYI---AVSNTNRVIVSDSNNHRVQI 170
SS+ ++++ G V H Y+ A S R++ S S + +++
Sbjct: 69 GSSDKTIRLWDLETGECVRTLTG---------HTSYVSSVAFSPDGRILSSSSRDKTIKV 119
Query: 171 FDVNGRVFQSDGTFVGKFGSMGNKAGQLEHPHYI---AVSNTNRVIVSDSNNHRVQIFDV 227
+DV + T G H ++ A S + S S + ++++D+
Sbjct: 120 WDVETGKCLT--TLRG-------------HTDWVNSVAFSPDGTFVASSSQDGTIKLWDL 164
Query: 228 -NGRVITSFGSEGSEEGQLKFPRGVAVDDQGYISVGDSGNNRIQIFTPDGQFLRAFGCWG 286
G+ + + G VA G + S + I+++ C G
Sbjct: 165 RTGKCVATL------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGK-----CLG 213
Query: 287 SGDGEFKGLEGVAVMSNGNILVCDRENHRIQVF 319
+ G G+ VA +G +L E+ I+V+
Sbjct: 214 TLRGHENGVNSVAFSPDGYLLASGSEDGTIRVW 246
Score = 38.9 bits (91), Expect = 0.002
Identities = 23/144 (15%), Positives = 53/144 (36%), Gaps = 23/144 (15%)
Query: 146 HPHYI---AVSNTNRVIVSDSNNHRVQIFDVNGRVFQSDGTFVGKFGSMGNKAGQLEHPH 202
H + A S +++ + S + ++++D+ T G G + +
Sbjct: 8 HTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLR--TLKGHTGPVRD--------- 56
Query: 203 YIAVSNTNRVIVSDSNNHRVQIFDVN-GRVITSFGSEGSEEGQLKFPRGVAVDDQGYISV 261
+A S + S S++ ++++D+ G + + G + VA G I
Sbjct: 57 -VAASADGTYLASGSSDKTIRLWDLETGECVRTL------TGHTSYVSSVAFSPDGRILS 109
Query: 262 GDSGNNRIQIF-TPDGQFLRAFGC 284
S + I+++ G+ L
Sbjct: 110 SSSRDKTIKVWDVETGKCLTTLRG 133
Score = 33.1 bits (76), Expect = 0.14
Identities = 36/178 (20%), Positives = 74/178 (41%), Gaps = 33/178 (18%)
Query: 57 WPRGIAVGPDNSIVVADSSNHRVQVCFPHFDLKTN-CV--FLAFT-WPRGIAVGPDNSIV 112
W +A PD + V + S + +++ +DL+T CV T +A PD +
Sbjct: 137 WVNSVAFSPDGTFVASSSQDGTIKL----WDLRTGKCVATLTGHTGEVNSVAFSPDGEKL 192
Query: 113 VADSSNHRVQVF-QSDGTFVGKFGSMGNKAGQLEHPHYIAVSNTNRVIVSDSNNHRVQIF 171
++ SS+ ++++ S G +G N + +A S ++ S S + ++++
Sbjct: 193 LSSSSDGTIKLWDLSTGKCLGTLRGHENGV------NSVAFSPDGYLLASGSEDGTIRVW 246
Query: 172 DVNGRVFQSDGTFVGKFGSMGNKAGQLEHPHYI---AVSNTNRVIVSDSNNHRVQIFD 226
D+ R + T G H + + A S + + S S + ++I+D
Sbjct: 247 DL--RTGECVQTLSG-------------HTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
>gnl|CDD|220674 pfam10282, Lactonase, Lactonase, 7-bladed beta-propeller. This
entry contains bacterial 6-phosphogluconolactonases
(6PGL)YbhE-type (EC:3.1.1.31) which hydrolyse
6-phosphogluconolactone to 6-phosphogluconate. The entry
also contains the fungal muconate lactonising enzyme
carboxy-cis,cis-muconate cyclase (EC:5.5.1.5) and
muconate cycloisomerase (EC:5.5.1.1), which convert
cis,cis-muconates to muconolactones and vice versa as
part of the microbial beta-ketoadipate pathway.
Structures of proteins in this family have revealed a
7-bladed beta-propeller fold.
Length = 344
Score = 34.5 bits (80), Expect = 0.056
Identities = 27/141 (19%), Positives = 54/141 (38%), Gaps = 22/141 (15%)
Query: 103 IAVGPDNSI---VVADSSNHRVQVFQSDG-----TFVGKFGSMGNKAGQLEHPHYIAVSN 154
+AV PD V + V F+ D T + + + G P +++V
Sbjct: 42 LAVSPDGKTLYAVNETGEDGGVSAFRIDPDTGKLTLLNQVPTGGA------SPCHLSVDP 95
Query: 155 TNRVIVSDSNNH--RVQIFDVN--GRVFQSDGTFVGKFGSMGNKAGQ-LEHPHYIAVSNT 209
R + +N V ++ ++ G + V GS N Q H H + ++
Sbjct: 96 DGRFLFV-ANYGGGSVSVYPLDADGSL-GEASQVVQHEGSGPNPERQEGPHAHSVVLTPD 153
Query: 210 NR-VIVSDSNNHRVQIFDVNG 229
+ ++V D RV+++ ++
Sbjct: 154 GKFLVVPDLGTDRVRVYRLDD 174
>gnl|CDD|234377 TIGR03866, PQQ_ABC_repeats, PQQ-dependent catabolism-associated
beta-propeller protein. Members of this protein family
consist of seven repeats each of the YVTN family
beta-propeller repeat (see TIGR02276). Members occur
invariably as part of a transport operon that is
associated with PQQ-dependent catabolism of alcohols
such as phenylethanol.
Length = 300
Score = 33.1 bits (76), Expect = 0.17
Identities = 25/114 (21%), Positives = 43/114 (37%), Gaps = 21/114 (18%)
Query: 66 DNSIVVADSSNHRVQVCFPHFDLKTNCVFLAFTWPRGIAVGPDNSIV-VADSSNHRVQVF 124
DN+I V D++ V FP PRGI D ++ V S + VQV
Sbjct: 10 DNTISVIDTATLEVTRTFP-----------VGQRPRGITFSKDGKLLYVCASDSDTVQVI 58
Query: 125 QSD-GTFVGKFGSMGNKAGQLEHPHYIAVSNTNRVI-VSDSNNHRVQIFDVNGR 176
G + S P A+ +++ +++ ++ V + D+ R
Sbjct: 59 DPATGEVLHTLPSG-------PDPELFALHPNGKILYIANEDDALVTVIDIETR 105
Score = 28.9 bits (65), Expect = 3.7
Identities = 15/49 (30%), Positives = 19/49 (38%), Gaps = 14/49 (28%)
Query: 43 FKIGSRGSEPGCFTWPRGIAVGPDNSIV-VADSSNHRVQVCFPHFDLKT 90
F +G R PRGI D ++ V S + VQV D T
Sbjct: 27 FPVGQR---------PRGITFSKDGKLLYVCASDSDTVQV----IDPAT 62
>gnl|CDD|236846 PRK11100, PRK11100, sensory histidine kinase CreC; Provisional.
Length = 475
Score = 32.1 bits (74), Expect = 0.35
Identities = 12/32 (37%), Positives = 16/32 (50%), Gaps = 2/32 (6%)
Query: 213 IVSDSNNHRVQIFDVNGRVITSFGSEGSEEGQ 244
I ++RV + D G V+ F SEG GQ
Sbjct: 86 IRKTRVDYRVYVTDAQGIVL--FDSEGRAVGQ 115
Score = 28.3 bits (64), Expect = 6.9
Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 1/30 (3%)
Query: 159 IVSDSNNHRVQIFDVNGRV-FQSDGTFVGK 187
I ++RV + D G V F S+G VG+
Sbjct: 86 IRKTRVDYRVYVTDAQGIVLFDSEGRAVGQ 115
>gnl|CDD|225921 COG3386, COG3386, Gluconolactonase [Carbohydrate transport and
metabolism].
Length = 307
Score = 32.0 bits (73), Expect = 0.36
Identities = 45/216 (20%), Positives = 72/216 (33%), Gaps = 54/216 (25%)
Query: 107 PDNSIVVADSSNHRVQVFQSDGTFVGKFGSMGNKAGQLEHPHYIAVSNTNRVIVSDSNNH 166
I D + +VF S G F G++ + G+L IA + R++ D+
Sbjct: 45 LGGRIHRLDPETGKKRVFPSPGGFSS--GALIDAGGRL-----IACEHGVRLLDPDTGGK 97
Query: 167 RVQIFDV---------NGRVFQSDGTFVGKFGSMGNK------------------AGQLE 199
+ + N V DG FG MG G +
Sbjct: 98 ITLLAEPEDGLPLNRPNDGVVDPDGRIW--FGDMGYFDLGKSEERPTGSLYRVDPDGGVV 155
Query: 200 HP--HYIAVSNT-------NRVIVSDSNNHRVQIFDVNGRVITSFGSEGS-----EEGQL 245
+ + N + V+D+ +R+ +D++ G G E G
Sbjct: 156 RLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPG-- 213
Query: 246 KFPRGVAVDDQGYI-SVGDSGNNRIQIFTPDGQFLR 280
P G+AVD G + G R+ F PDG+ L
Sbjct: 214 -LPDGMAVDADGNLWVAAVWGGGRVVRFNPDGKLLG 248
Score = 28.5 bits (64), Expect = 4.4
Identities = 34/161 (21%), Positives = 52/161 (32%), Gaps = 36/161 (22%)
Query: 52 PGCFTWPRGIAVGPDNSIV-VADSSNHRV-QVCFPHFDLKTN--CVFLAFTWPRGIAVGP 107
T P G+A PD + VAD+ +R+ + F+ F G+ P
Sbjct: 159 DDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGL---P 215
Query: 108 DNSIVVAD--------SSNHRVQVFQSDGTFVGK------------FGSMGNKAGQLEHP 147
D V AD RV F DG +G+ FG G L
Sbjct: 216 DGMAVDADGNLWVAAVWGGGRVVRFNPDGKLLGEIKLPVKRPTNPAFG--GPDLNTL--- 270
Query: 148 HYIAVSNTNRVIVSDSNNHRVQIFDVNGRVFQSDGTFVGKF 188
YI + + + ++ +F + V G +F
Sbjct: 271 -YITSARSGMSRMLTADPLGGGLFSLRLEV---KGLPEPRF 307
>gnl|CDD|197380 cd09971, SdiA-regulated, SdiA-regulated. This model represents a
bacterial family of proteins that may be regulated by
SdiA, a member of the LuxR family of transcriptional
regulators. The C-terminal domain included in the
alignment forms a five-bladed beta-propeller structure.
The X-ray structure of Escherichia coli yjiK (C-terminal
domain) exhibits binding of calcium ions (Ca++) in what
appears to be an evolutionarily conserved site. Sequence
analysis suggests a distant relationship to proteins
that are characterized as containing NHL-repeats. The
latter also form beta-propeller structures, with several
examples known to form six-bladed beta-propellers.
Several of the six-bladed beta-propellers containing NHL
repeats have been characterized functionally, including
members with enzymatic functions that are dependent on
metal ions. No functional characterization is available
for this family of five-bladed propellers, though.
Length = 242
Score = 31.4 bits (72), Expect = 0.48
Identities = 12/60 (20%), Positives = 24/60 (40%), Gaps = 6/60 (10%)
Query: 260 SVGDSGNNRIQIFTPDGQFLRAFGCWGSGDGEFKGLEGVAVMSNGNILVCDRENHRIQVF 319
+V + +++ + DG LR DG F EG+ + +G + D + +
Sbjct: 31 AVTNKPPALVEL-SKDGDVLRRI----PLDG-FSDPEGITYLGDGQFAIVDERLQALLIV 84
>gnl|CDD|222139 pfam13449, Phytase-like, Esterase-like activity of phytase. This
is a repeated domain that carries several highly
conserved Glu and Asp residues indicating the likelihood
that the domain incorporates the enzymic activity of the
PLC-like phospho-diesterase part of the proteins.
Length = 324
Score = 31.1 bits (71), Expect = 0.69
Identities = 20/84 (23%), Positives = 32/84 (38%), Gaps = 14/84 (16%)
Query: 248 PRGVAVDDQGYISVGDSGN------NRIQIFTPDGQFLRAF--------GCWGSGDGEFK 293
P G+A+D G + + G+ ++ F DG+ LR G G
Sbjct: 87 PEGIALDPGGTLWISSEGDRSRGIPPFLRRFDLDGRLLRELPLPAAFLPGPGKQGVRNNL 146
Query: 294 GLEGVAVMSNGNILVCDRENHRIQ 317
G EG+ + +G+ L E Q
Sbjct: 147 GFEGLTLSPDGDTLWVATEAALKQ 170
>gnl|CDD|223663 COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport
and metabolism / Translation, ribosomal structure and
biogenesis].
Length = 152
Score = 30.0 bits (68), Expect = 0.99
Identities = 11/35 (31%), Positives = 18/35 (51%), Gaps = 1/35 (2%)
Query: 91 NCVFLAFTWPRGIAVGPDNSIVVADSSNHRVQVFQ 125
V+ A + P+ A+G I+ NHRV+V+
Sbjct: 102 RVVYGA-SDPKTGAIGSLLDILKDPRLNHRVEVYG 135
Score = 28.4 bits (64), Expect = 3.3
Identities = 9/28 (32%), Positives = 14/28 (50%)
Query: 54 CFTWPRGIAVGPDNSIVVADSSNHRVQV 81
+ P+ A+G I+ NHRV+V
Sbjct: 106 GASDPKTGAIGSLLDILKDPRLNHRVEV 133
>gnl|CDD|218908 pfam06136, DUF966, Domain of unknown function (DUF966). Family of
plant proteins with unknown function.
Length = 308
Score = 29.7 bits (67), Expect = 2.1
Identities = 12/23 (52%), Positives = 13/23 (56%), Gaps = 6/23 (26%)
Query: 142 GQLEHPHYIAVSNTNRVIVSDSN 164
GQLEHPH+I V VS N
Sbjct: 11 GQLEHPHFI------EVHVSSRN 27
Score = 29.7 bits (67), Expect = 2.1
Identities = 12/23 (52%), Positives = 13/23 (56%), Gaps = 6/23 (26%)
Query: 196 GQLEHPHYIAVSNTNRVIVSDSN 218
GQLEHPH+I V VS N
Sbjct: 11 GQLEHPHFI------EVHVSSRN 27
>gnl|CDD|234491 TIGR04183, Por_Secre_tail, Por secretion system C-terminal sorting
domain. Species that include Porphyromonas gingivalis,
Fibrobacter succinogenes, Flavobacterium johnsoniae,
Cytophaga hutchinsonii, Gramella forsetii, Prevotella
intermedia, and Salinibacter ruber average twenty or
more copies of a C-terminal domain, modeled by This
model, associated with sorting to the outer membrane and
covalent modification.
Length = 71
Score = 27.0 bits (60), Expect = 3.0
Identities = 11/47 (23%), Positives = 21/47 (44%), Gaps = 1/47 (2%)
Query: 206 VSNTNRVIVSDSNNHRVQIFDVNGRVITSFGSEGSEEGQL-KFPRGV 251
+++S S +V+I+D++G+V+ S L GV
Sbjct: 6 AGTLIIILLSSSEIAKVEIYDLSGKVVKKTTLNNSNSIDLSNLSSGV 52
>gnl|CDD|226708 COG4257, Vgb, Streptogramin lyase [Defense mechanisms].
Length = 353
Score = 29.1 bits (65), Expect = 3.2
Identities = 20/86 (23%), Positives = 28/86 (32%), Gaps = 16/86 (18%)
Query: 45 IGSRGSEPGCFTW--------PRGIAVGPDNSIVVADSSNHRVQVCFPHFDLKTNCVFL- 95
GS + G P +A PD ++ + H D T V
Sbjct: 43 AGSSPAPDGSSAEFPVPNGSAPFDVAPAPDGAVWFTAQGTGAIG----HLDPATGEVETY 98
Query: 96 ---AFTWPRGIAVGPDNSIVVADSSN 118
+ P GI VGPD S + D+
Sbjct: 99 PLGSGASPHGIVVGPDGSAWITDTGL 124
>gnl|CDD|215254 PLN02460, PLN02460, indole-3-glycerol-phosphate synthase.
Length = 338
Score = 29.0 bits (65), Expect = 3.8
Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 6/55 (10%)
Query: 222 VQIFDVNGRVITSFGSEGSEEGQL-KFPRGVAVDDQGYISVGDSGNNRIQIFTPD 275
V++ +N R + +F + S +L + RG + ++G I VG+SG +FTPD
Sbjct: 252 VELIGINNRSLETFEVDISNTKKLLEGERGEQIREKGIIVVGESG-----LFTPD 301
>gnl|CDD|219247 pfam06977, SdiA-regulated, SdiA-regulated. This family represents
a conserved region approximately within a number of
hypothetical bacterial proteins that may be regulated by
SdiA, a member of the LuxR family of transcriptional
regulators. Some family members contain the pfam01436
repeat.
Length = 249
Score = 28.7 bits (65), Expect = 4.0
Identities = 11/56 (19%), Positives = 21/56 (37%), Gaps = 7/56 (12%)
Query: 266 NNRIQI--FTPDGQFLRAFGCWGSGDGEFKGLEGVAVMSNGNILVCDRENHRIQVF 319
N ++ + +G LR G F EG+ + +G + D + + V
Sbjct: 42 NKPPELVELSLEGDVLRRIPLDG-----FSDPEGIEYIGDGRFAITDERDQALFVV 92
>gnl|CDD|223567 COG0493, GltD, NADPH-dependent glutamate synthase beta chain and
related oxidoreductases [Amino acid transport and
metabolism / General function prediction only].
Length = 457
Score = 28.4 bits (64), Expect = 5.6
Identities = 30/130 (23%), Positives = 41/130 (31%), Gaps = 32/130 (24%)
Query: 170 IFDVNGRVFQSDGTFVGKFGSMGNKAGQLEHPHYIAVSNTNRVIVSDSNNHRVQIFDVNG 229
I + GRV G K G++E Y+ R V + D
Sbjct: 333 IGNEGGRV-------------TGVKFGRVEPGEYVDGWGR-RGPVGVIGTEKTDAADT-- 376
Query: 230 RVITSFGSEGSEEGQLKFPRGVAVDDQGYISVGDSGNNRIQIFTP---------DGQFLR 280
VI + G EG L G+ +D +G I V D + I P G L
Sbjct: 377 -VILAIGFEGDATDGLLLEFGLKLDKRGRIKV-DENLQQTSI--PGVFAGGDAVRGAALV 432
Query: 281 AFGCWGSGDG 290
W +G
Sbjct: 433 V---WAIAEG 439
>gnl|CDD|222750 pfam14433, SUKH-3, SUKH-3 immunity protein. This family belongs to
the SUKH superfamily and functions as immunity proteins
in bacterial toxin systems.
Length = 142
Score = 27.3 bits (61), Expect = 6.7
Identities = 8/33 (24%), Positives = 12/33 (36%)
Query: 158 VIVSDSNNHRVQIFDVNGRVFQSDGTFVGKFGS 190
I N H + D +GRV+ + G
Sbjct: 94 PIGELDNGHSTLLVDESGRVYGLGDGGLWLLGE 126
>gnl|CDD|185138 PRK15216, PRK15216, putative fimbrial biosynthesis regulatory
protein; Provisional.
Length = 340
Score = 27.7 bits (61), Expect = 9.3
Identities = 20/75 (26%), Positives = 37/75 (49%), Gaps = 15/75 (20%)
Query: 103 IAVGPDNSIVVADSSNHRVQV-FQSDGTFVGKFGSMGNKAGQLEHPHYIAVSNTNR---V 158
I +GP++S+ + +++V +QS +M N+ G +E P+ I +S N V
Sbjct: 73 IGMGPNDSVKL------KLKVEWQSSSAI-----NMRNQNGVIEAPYKITISEINSLEAV 121
Query: 159 IVSDSNNHRVQIFDV 173
VS + V I ++
Sbjct: 122 TVSARGGYSVTIDNI 136
>gnl|CDD|215057 PLN00105, PLN00105, malate/L-lactate dehydrogenase; Provisional.
Length = 330
Score = 27.5 bits (61), Expect = 9.7
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 224 IFDVNGRVITSFGSEGSEEGQLKFPRGVAVDDQG 257
+ D+ + FG ++ K PRGVA+D QG
Sbjct: 162 VLDMATSAYSFFGLLEAKTAGKKLPRGVAIDKQG 195
>gnl|CDD|234536 TIGR04312, choice_anch_B, choice-of-anchor B domain. This domain,
about 385 amino acids long, can have either of at least
two types of C-terminal sorting signal. Members from
Shewanella and allies have the rhombosortase target
domain GlyGly-CTERM (TIGR03501), while members of the
Bacteroidetes have the Por secretion system C-terminal
domain (TIGR04183). Most other members lack any
C-terminal extension, but in most of those, the normal
signal sequence is replaced by a lipoprotein signal
sequence. Member sequences show a region of local
similarity to the LVIVD repeat sequence (pfam08309).
Length = 364
Score = 27.7 bits (62), Expect = 9.9
Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 9/70 (12%)
Query: 150 IAVSNTNRVIVSDSNNHRVQIFDVNG-------RVFQSDGTFVGKFGSMGNKAGQLEHPH 202
I V + IVS+++ H +Q+FD+ + F +D + FGS N E +
Sbjct: 84 IKVYQNHAYIVSEASGHGMQVFDLTRLRNVNPPQTFTADAHYTD-FGSAHNIYINEESGY 142
Query: 203 -YIAVSNTNR 211
YI +NT
Sbjct: 143 AYIVGTNTFG 152
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.138 0.423
Gapped
Lambda K H
0.267 0.0717 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,813,734
Number of extensions: 1490499
Number of successful extensions: 1324
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1294
Number of HSP's successfully gapped: 59
Length of query: 319
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 222
Effective length of database: 6,635,264
Effective search space: 1473028608
Effective search space used: 1473028608
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.5 bits)