RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14245
(181 letters)
>gnl|CDD|205965 pfam13792, Sulfate_tra_GLY, Sulfate transporter N-terminal domain
with GLY motif. This domain is found usually at the
N-terminus of sulfate-transporter proteins. It carries a
highly conserved GLY sequence motif, but the function of
the domain is not known.
Length = 83
Score = 117 bits (295), Expect = 7e-35
Identities = 41/78 (52%), Positives = 57/78 (73%)
Query: 54 VPILAWLPKYNVNAAVSDVLAGITVGLTLIPQAIAYASLAGLNPKYGLYSSIFGGVMYIF 113
+PIL WLPKY+ + D++AG+TV L LIPQ +AYA LAGL P YGLY+S ++Y
Sbjct: 1 LPILQWLPKYSRSWLKGDLIAGLTVALVLIPQGMAYALLAGLPPVYGLYASFVPPLIYAL 60
Query: 114 LGTTKQLSVGPTSIMALL 131
G+++ LSVGPT+ ++LL
Sbjct: 61 FGSSRHLSVGPTAAISLL 78
>gnl|CDD|162054 TIGR00815, sulP, high affinity sulphate transporter 1. The SulP
family is a large and ubiquitous family with over 30
sequenced members derived from bacteria, fungi, plants
and animals. Many organisms including Bacillus subtilis,
Synechocystis sp, Saccharomyces cerevisiae, Arabidopsis
thaliana and Caenorhabditis elegans possess multiple
SulP family paralogues. Many of these proteins are
functionally characterized, and all are sulfate uptake
transporters. Some transport their substrate with high
affinities, while others transport it with relatively
low affinities. Most function by SO42- :H+symport, but
SO42- :HCO3- antiport has been reported for the rat
protein (spP45380). The bacterial proteins vary in size
from 434 residues to 566 residues with one exception, a
Mycobacterium tuberculosis protein with 784 residues.
The eukaryotic proteins vary in size from 611 residues
to 893 residues with one exception, a protein designated
"early nodulin 70 protein" from Glycine max which is
reported to be of 485 residues. Thus, the eukaryotic
proteins are almost without exception larger than the
prokaryotic proteins. These proteins exhibit 10-13
putative transmembrane a-helical spanners (TMSs)
depending on the protein. The phylogenetic tree for the
SulP family reveals five principal branches. Three of
these are bacterial specific as follows: one bears a
single protein from M. tuberculosis; a second bears two
proteins, one from M. tuberculosis, the other from
Synechocystis sp, and the third bears all remaining
prokaryotic proteins. The remaining two clusters bear
only eukaryotic proteins with the animal proteins all
localized to one branch and the plant and fungal
proteins localized to the other. The generalized
transport reactions catalyzed by SulP family proteins
are: (1) SO42- (out) + nH+ (out) --> SO42- (in) + nH+
(in). (2) SO42- (out) + nHCO3- (in) SO42- (in) + nHCO3-
(out) [Transport and binding proteins, Anions].
Length = 563
Score = 117 bits (295), Expect = 4e-31
Identities = 53/119 (44%), Positives = 74/119 (62%), Gaps = 9/119 (7%)
Query: 55 PILAWLPKYNVNAAVSDVLAGITVGLTLIPQAIAYASLAGLNPKYGLYSSIFGGVMYIFL 114
P+L WLP Y + D++AG+TVG+ LIPQA+AYA LAGL+P YGLY+S +Y
Sbjct: 1 PVLRWLPHYRLKKFKGDLMAGLTVGILLIPQAMAYAILAGLSPIYGLYTSFVPPFIYALF 60
Query: 115 GTTKQLSVGPTSIMALLC--------LTYTHDTSLEMVAF-LTFLTGLVQLTCGLLSLG 164
GT++ +++GP ++M+LL L Y D +AF LT L G+ Q+ GLL LG
Sbjct: 61 GTSRDIAIGPVAVMSLLLGSVIARVGLQYLFDCDAIRLAFTLTLLAGIFQVILGLLRLG 119
>gnl|CDD|223732 COG0659, SUL1, Sulfate permease and related transporters (MFS
superfamily) [Inorganic ion transport and metabolism].
Length = 554
Score = 107 bits (269), Expect = 2e-27
Identities = 43/117 (36%), Positives = 66/117 (56%), Gaps = 3/117 (2%)
Query: 52 RKVPILAWLPKYNVNAAVSDVLAGITVGLTLIPQAIAYASLAGLNPKYGLYSSIFGGVMY 111
++P L WLP Y + D+LAG+TV +P A+A+A AG+ P+ GLY+SI G++Y
Sbjct: 5 SEIPTLKWLPYYFRSWLRGDLLAGLTVAAVALPLAMAFAIAAGVPPEAGLYASIVAGIIY 64
Query: 112 IFLGTTKQLSVGPTSIMALLC---LTYTHDTSLEMVAFLTFLTGLVQLTCGLLSLGK 165
G ++ L GPT A++ + +T L + T L G+ Q+ GLL LG+
Sbjct: 65 ALFGGSRGLISGPTGAFAVVLAAVIASLVETGLALAFLATLLAGVFQILLGLLRLGR 121
>gnl|CDD|183265 PRK11660, PRK11660, putative transporter; Provisional.
Length = 568
Score = 58.8 bits (143), Expect = 1e-10
Identities = 35/109 (32%), Positives = 59/109 (54%), Gaps = 2/109 (1%)
Query: 58 AWLPKYNVNAAVSDVLAGITVGLTLIPQAIAYASLAGLNPKYGLYSSIFGGVMYIFLGTT 117
W KY D++AGITVG+ IP A+A A +G+ P+YGLY++ G++ G +
Sbjct: 19 CWKEKYTAARFTRDLIAGITVGIIAIPLAMALAIASGVPPQYGLYTAAVAGIVIALTGGS 78
Query: 118 KQLSV-GPTSIMALLCLTYTHDTSLEMVAFLTFLTGLVQLTCGLLSLGK 165
+ SV GPT+ ++ + L + T ++G++ + GL LG+
Sbjct: 79 R-FSVSGPTAAFVVILYPVSQQFGLAGLLVATLMSGIILILMGLARLGR 126
>gnl|CDD|213023 cd10918, CE4_NodB_like_5s_6s, Putative catalytic NodB homology
domain of PgaB, IcaB, and similar proteins which consist
of a deformed (beta/alpha)8 barrel fold with 5- or
6-strands. This family belongs to the large and
functionally diverse carbohydrate esterase 4 (CE4)
superfamily, whose members show strong sequence
similarity with some variability due to their distinct
carbohydrate substrates. It includes bacterial
poly-beta-1,6-N-acetyl-D-glucosamine N-deacetylase PgaB,
hemin storage system HmsF protein in gram-negative
species, intercellular adhesion proteins IcaB, and many
uncharacterized prokaryotic polysaccharide deacetylases.
It also includes a putative polysaccharide deacetylase
YxkH encoded by the Bacillus subtilis yxkH gene, which
is one of six polysaccharide deacetylase gene homologs
present in the Bacillus subtilis genome. Sequence
comparison shows all family members contain a conserved
domain similar to the catalytic NodB homology domain of
rhizobial NodB-like proteins, which consists of a
deformed (beta/alpha)8 barrel fold with 6 or 7 strands.
However, in this family, most proteins have 5 strands
and some have 6 strands. Moreover, long insertions are
found in many family members, whose function remains
unknown.
Length = 157
Score = 39.1 bits (92), Expect = 3e-04
Identities = 21/77 (27%), Positives = 27/77 (35%), Gaps = 13/77 (16%)
Query: 20 LRDSGFEFQ--------VPQLSRDKLEE--LTMKTKLTELLHRKVPILAWLPKYNVNAAV 69
L SG E + LS ++L K +L E L + V A+ P N V
Sbjct: 62 LAASGVEIGSHTHTHPDLTTLSDEELRRELAESKERLEEELGKPVRSFAY-PYGRYNPRV 120
Query: 70 SDVL--AGITVGLTLIP 84
L AG T P
Sbjct: 121 IAALKEAGYKAAFTTDP 137
>gnl|CDD|178256 PLN02650, PLN02650, dihydroflavonol-4-reductase.
Length = 351
Score = 28.3 bits (63), Expect = 2.2
Identities = 9/20 (45%), Positives = 12/20 (60%)
Query: 9 NLKPVHLDNAKLRDSGFEFQ 28
+LK V + KL D GF F+
Sbjct: 286 DLKSVEFSSKKLTDLGFTFK 305
>gnl|CDD|132195 TIGR03151, enACPred_II, putative enoyl-[acyl-carrier-protein]
reductase II. This oxidoreductase of the
2-nitropropane dioxygenase family (pfam03060) is
commonly found in apparent operons with genes involved
in fatty acid biosynthesis. Furthermore, this genomic
context generally includes the fabG 3-oxoacyl-[ACP]
reductase and lacks the fabI enoyl-[ACP] reductase.
Length = 307
Score = 28.2 bits (63), Expect = 2.4
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 9/45 (20%)
Query: 43 KTKLTELLHRKVPIL----AWLPKYNVNAAVSDVLAGITVGLTLI 83
KT+L +LL + PI AW+ ++ AAVS+ AG GL +I
Sbjct: 1 KTRLCDLLGIEYPIFQGGMAWVATGSLAAAVSN--AG---GLGII 40
>gnl|CDD|163482 TIGR03770, anch_rpt_perm, anchored repeat-type ABC transporter,
permease subunit. This protein family is the permease
subunit of binding protein-dependent ABC transporter
complex that strictly co-occurs with TIGR03769. TIGRFAMs
model TIGR03769 describes a protein domain that occurs
singly or as one of up to three repeats in proteins of a
number of Actinobacteria, including Propionibacterium
acnes KPA171202. The TIGR03769 domain occurs both in the
adjacent gene for the substrate-binding protein and in
additional (often nearby) proteins, often with
LPXTG-like sortase recognition signals. Homologous
permease subunits outside the scope of this family
include manganese transporter MntB in Synechocystis sp.
PCC 6803 and chelated iron transporter subunits. The
function of this transporter complex is unknown
[Transport and binding proteins, Unknown substrate].
Length = 270
Score = 27.8 bits (62), Expect = 2.9
Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 9/69 (13%)
Query: 46 LTELLHRKVPILAWLPKYNVNAAVSDVLAGIT------VGLTLIPQAIAYASLAGLNPKY 99
LT+L + P LA+LPK + A +S ++ G+ G+ I A+A+A GL +
Sbjct: 6 LTDLTN---PALAFLPKALLVAVLSSIVCGVVGTHVVLRGMAFIGDAVAHAVFPGLAVAF 62
Query: 100 GLYSSIFGG 108
L S+ G
Sbjct: 63 ALQGSLLLG 71
>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201). This is
a family of coiled-coil proteins from eukaryotes. The
function is not known.
Length = 177
Score = 26.8 bits (60), Expect = 5.1
Identities = 18/48 (37%), Positives = 22/48 (45%), Gaps = 3/48 (6%)
Query: 13 VHLDNAKLRDSGFEFQVPQLSRD---KLEELTMKTKLTELLHRKVPIL 57
+ N KLR G VP L D +E++ K K E L RK IL
Sbjct: 124 LRKQNKKLRQQGGLLHVPALLYDYDKTVEKVEEKRKSVEKLRRKHKIL 171
>gnl|CDD|233659 TIGR01967, DEAH_box_HrpA, ATP-dependent helicase HrpA. This model
represents HrpA, one of two related but uncharacterized
DEAH-box ATP-dependent helicases in many Proteobacteria
and a few high-GC Gram-positive bacteria. HrpA is about
1300 amino acids long, while its paralog HrpB, also
uncharacterized, is about 800 amino acids long. Related
characterized eukarotic proteins are RNA helicases
associated with pre-mRNA processing [Unknown function,
Enzymes of unknown specificity].
Length = 1283
Score = 27.0 bits (60), Expect = 7.3
Identities = 10/29 (34%), Positives = 16/29 (55%), Gaps = 1/29 (3%)
Query: 12 PVHLDNAKLRDSGFEFQVPQLSRDKLEEL 40
P+ L N +L F++ VP L +K+ L
Sbjct: 865 PLPLLN-QLDTEDFDWLVPGLREEKVVAL 892
>gnl|CDD|221300 pfam11898, DUF3418, Domain of unknown function (DUF3418). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria. This domain is typically
between 582 to 594 amino acids in length. This domain is
found associated with pfam07717, pfam00271, pfam04408.
Length = 586
Score = 26.7 bits (60), Expect = 7.5
Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 1/29 (3%)
Query: 12 PVHLDNAKLRDSGFEFQVPQLSRDKLEEL 40
P+ L N L FE+ VP L R+K+ L
Sbjct: 169 PLALLNQ-LPPERFEWLVPGLLREKVTAL 196
>gnl|CDD|233451 TIGR01531, glyc_debranch, glycogen debranching enzymye. glycogen
debranching enzyme possesses two different catalytic
activities; oligo-1,4-->1,4-glucantransferase (EC
2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33). Site
directed mutagenesis studies in S. cerevisiae indicate
that the transferase and glucosidase activities are
independent and located in different regions of the
polypeptide chain. Proteins in this model belong to the
larger alpha-amylase family. The model covers eukaryotic
proteins with a seed composed of human, nematode and
yeast sequences. Yeast seed sequence is well
characterized. The model is quite rigorous; either query
sequence yields large bit score or it fails to hit the
model altogether. There doesn't appear to be any middle
ground [Energy metabolism, Biosynthesis and degradation
of polysaccharides].
Length = 1464
Score = 26.7 bits (59), Expect = 7.9
Identities = 20/80 (25%), Positives = 29/80 (36%), Gaps = 14/80 (17%)
Query: 45 KLTELLHRKVPIL---AWLPKYNVNAAVSDVLAGITVGLTLIPQAIAYASLAGLNPKYGL 101
K + L + IL L +Y+++ A+ +L GI Y LA PK G
Sbjct: 359 KCSSSLATALNILNENLRLYRYDIDVALEQLLGGIK-----------YERLADGGPKQGP 407
Query: 102 YSSIFGGVMYIFLGTTKQLS 121
+ Y F K S
Sbjct: 408 VTVKHPLTTYYFTFKGKDGS 427
>gnl|CDD|240539 cd13134, MATE_like_8, Uncharacterized subfamily of the multidrug
and toxic compound extrusion (MATE) proteins. The
integral membrane proteins from the MATE family are
involved in exporting metabolites across the cell
membrane and are responsible for multidrug resistance
(MDR) in many bacteria and animals. A number of family
members are involved in the synthesis of peptidoglycan
components in bacteria.
Length = 438
Score = 26.3 bits (59), Expect = 9.7
Identities = 11/41 (26%), Positives = 18/41 (43%), Gaps = 4/41 (9%)
Query: 78 VGLTLIPQAIAYASLAGL----NPKYGLYSSIFGGVMYIFL 114
VG +L QA++ A L K +Y S+ ++ I
Sbjct: 128 VGGSLFLQALSLTLSAILRSHGRTKIPMYVSLIMNILNIVG 168
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.138 0.408
Gapped
Lambda K H
0.267 0.0783 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,056,973
Number of extensions: 826233
Number of successful extensions: 820
Number of sequences better than 10.0: 1
Number of HSP's gapped: 816
Number of HSP's successfully gapped: 66
Length of query: 181
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 90
Effective length of database: 6,901,388
Effective search space: 621124920
Effective search space used: 621124920
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 56 (25.2 bits)