RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4843
(277 letters)
>gnl|CDD|216595 pfam01595, DUF21, Domain of unknown function DUF21. This
transmembrane region has no known function. Many of the
sequences in this family are annotated as hemolysins,
however this is due to a similarity to Treponema
hyodysenteriae hemolysin C that does not contain this
domain. This domain is found in the N-terminus of the
proteins adjacent to two intracellular CBS domains
pfam00571.
Length = 182
Score = 105 bits (265), Expect = 6e-28
Identities = 55/195 (28%), Positives = 91/195 (46%), Gaps = 28/195 (14%)
Query: 87 LAIVIIVTCLGFSSLFSGLNLGLMSLNRTDLKIICNTGTEHERKYAKTIIPVREHGNYLL 146
L ++IIV L S+ FS L+SL+R+ L+ + G K A+ ++ + + + LL
Sbjct: 1 LLLLIIVLLLLLSAFFSASETALVSLSRSRLEELAEEG----NKGARRLLKLLANPDRLL 56
Query: 147 CSILLGTEHERKYAKTIIPVREHGNYLLCSILLGNVMVNSTFTILLDDITSGLVAVIGST 206
++LLG L +ILLG + LL L+A + ST
Sbjct: 57 ATLLLGNT-------------------LANILLG-ALATLAAAELLLGSLGVLLATLVST 96
Query: 207 LAIVIFGEISPQAVCSRHGLMIGAKTINVTKVVMILTFPLAYPISKILDWILG----EEI 262
L I++FGEI P+ + R+ I + +V+M L +PL + +SK+ + +L +
Sbjct: 97 LLILVFGEILPKTLARRNPERIALRLAPPLRVLMKLLYPLVWLLSKLSNLLLRLFGVKPE 156
Query: 263 GNVYTRERLKELVKI 277
G T E L+ LV
Sbjct: 157 GPAVTEEELRALVDE 171
>gnl|CDD|224173 COG1253, TlyC, Hemolysins and related proteins containing CBS
domains [General function prediction only].
Length = 429
Score = 70.8 bits (174), Expect = 7e-14
Identities = 41/192 (21%), Positives = 74/192 (38%), Gaps = 27/192 (14%)
Query: 86 WLAIVIIVTCLGFSSLFSGLNLGLMSLNRTDLKIICNTGTEHERKYAKTIIPVREHGNYL 145
L +++I+ + S+ FS L+SL R+ L+ + G K AK + + E N
Sbjct: 4 ILTLLLILLLIALSAFFSAAEFALVSLRRSRLEQLAEEGN----KRAKAALKLIERLNRY 59
Query: 146 LCSILLGTEHERKYAKTIIPVREHGNYLLCSILLG----NVMVNSTFTILLDDITSGLVA 201
L ++ LG L S+LLG + +L S ++
Sbjct: 60 LSTVQLG-------------------ITLVSLLLGAVGEPALAALLEPLLEALGLSAALS 100
Query: 202 VIGSTLAIVIFGEISPQAVCSRHGLMIGAKTINVTKVVMILTFPLAYPISKILDWILGEE 261
T V+FGE+ P+++ R+ + + L +PL + +++I + IL
Sbjct: 101 FAIITFLHVVFGELVPKSIAIRNPEKVALLIAPPLRFFYRLLYPLIWLLNRIANAILRLF 160
Query: 262 IGNVYTRERLKE 273
E L
Sbjct: 161 GVEPVEEEALTS 172
>gnl|CDD|226911 COG4536, CorB, Putative Mg2+ and Co2+ transporter CorB [Inorganic
ion transport and metabolism].
Length = 423
Score = 45.7 bits (109), Expect = 1e-05
Identities = 41/186 (22%), Positives = 76/186 (40%), Gaps = 47/186 (25%)
Query: 84 PFWLAIVIIVTCLGFSSLFSGLNLGLMSLNRTDLKIICNTGTEHERKYAKTIIPVREHGN 143
P W+ I+ I+ + S+ FSG L +LNR L+ H K GN
Sbjct: 1 PTWILIIAIIILIIISAFFSGSETALTALNRYRLR--------HLAK----------QGN 42
Query: 144 YLLCSILLGTEHERKYAKTIIPVREHGNYLLCSILLGNVMVNSTFTILLDDITSGL---- 199
+ AK + + E + L+ +IL+GN +VN IL + + L
Sbjct: 43 --------------RGAKRVEKLLEKPDRLIGTILIGNNLVN----ILASALATILGIRL 84
Query: 200 -------VAVIGSTLAIVIFGEISPQAVCSRHGLMIGAKTINVTKVVMILTFPLAYPISK 252
+A T I++F E+ P+ + + + + + + +++ L PL + ++
Sbjct: 85 YGDAGVAIATGVLTFVILVFAEVLPKTIAALYPERVALPSSFILAILVRLFGPLVWLLNA 144
Query: 253 ILDWIL 258
I +L
Sbjct: 145 ITRRLL 150
>gnl|CDD|132559 TIGR03520, GldE, gliding motility-associated protein GldE. Members
of this protein family are exclusive to the
Bacteroidetes phylum (previously
Cytophaga-Flavobacteria-Bacteroides). GldC is a protein
linked to a type of rapid surface gliding motility found
in certain Bacteroidetes, such as Flavobacterium
johnsoniae and Cytophaga hutchinsonii. GldE was
discovered because of its adjacency to GldD in F.
johnsonii. Overexpression of GldE partially supresses
the effects of a GldB point mutant suggesting that GldB
and GldE interact. Gliding motility appears closely
linked to chitin utilization in the model species
Flavobacterium johnsoniae. Not all Bacteroidetes with
members of this protein family appear to have all of the
genes associated with gliding motility and in fact some
do not appear to express the gliding phenotype.
Length = 408
Score = 43.1 bits (102), Expect = 8e-05
Identities = 42/164 (25%), Positives = 70/164 (42%), Gaps = 30/164 (18%)
Query: 99 SSLFSGLNLGLMSLNRTDLKIICNTGTEHERKYAKTIIPVREHGNYLLCSILLGTEHERK 158
S+L SG + SL+ TDL N E K + +I + + LL +IL+
Sbjct: 8 SALVSGSEVAFFSLSPTDL----NDEEEDNSKKEQIVINLLDRPKKLLATILI------- 56
Query: 159 YAKTIIPVREHGNYLLCSILLGNV--MVNSTFTILLDDITSGLVAVIGSTLAIVIFGEIS 216
A I + LL + L N+ N+ L+ V+ T I++FGEI
Sbjct: 57 -ANNFINI---AIVLLFTSLSDNLFGSFNTELLRF-------LIEVVIVTFLILLFGEIL 105
Query: 217 PQAVCSRHGL---MIGAKTINVTKVVMILTFPLAYPISKILDWI 257
P+ +R+ L A IN ++ + P++ P+ I ++I
Sbjct: 106 PKVYANRNNLKFAKFMAYPIN---ILDKVFSPISLPLRAITNFI 146
>gnl|CDD|178040 PLN02420, PLN02420, endoglucanase.
Length = 525
Score = 28.8 bits (64), Expect = 3.8
Identities = 15/45 (33%), Positives = 26/45 (57%)
Query: 138 VREHGNYLLCSILLGTEHERKYAKTIIPVREHGNYLLCSILLGNV 182
V+ G LL S+LL E ++++K + + ++ LCSIL N+
Sbjct: 294 VKYAGVQLLASMLLKEEKHKQHSKVLQQYKSKADHYLCSILNKNI 338
>gnl|CDD|215860 pfam00328, His_Phos_2, Histidine phosphatase superfamily (branch
2). The histidine phosphatase superfamily is so named
because catalysis centres on a conserved His residue
that is transiently phosphorylated during the catalytic
cycle. Other conserved residues contribute to a
'phosphate pocket' and interact with the phospho group
of substrate before, during and after its transfer to
the His residue. Structure and sequence analyses show
that different families contribute different additional
residues to the 'phosphate pocket' and, more
surprisingly, differ in the position, in sequence and
in three dimensions, of a catalytically essential
acidic residue. The superfamily may be divided into two
main branches.The smaller branch 2 contains
predominantly eukaryotic proteins. The catalytic
functions in members include phytase,
glucose-1-phosphatase and multiple inositol
polyphosphate phosphatase. The in vivo roles of the
mammalian acid phosphatases in branch 2 are not fully
understood, although activity against lysophosphatidic
acid and tyrosine-phosphorylated proteins has been
demonstrated.
Length = 327
Score = 28.2 bits (63), Expect = 5.5
Identities = 16/52 (30%), Positives = 21/52 (40%), Gaps = 14/52 (26%)
Query: 5 YILGRYLRTAYPIDPETVTENTGLIKLSLPILGQSDVIYYMCTKENRTLGTA 56
+ LGRYLR Y GL+ P +Y + NRTL +A
Sbjct: 61 FELGRYLRQRYV----------GLLPDGYP----PSEVYIRSSDSNRTLASA 98
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.140 0.415
Gapped
Lambda K H
0.267 0.0795 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,308,097
Number of extensions: 1377778
Number of successful extensions: 1393
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1381
Number of HSP's successfully gapped: 26
Length of query: 277
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 181
Effective length of database: 6,679,618
Effective search space: 1209010858
Effective search space used: 1209010858
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 58 (26.0 bits)