RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2046
(138 letters)
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 144 bits (366), Expect = 3e-41
Identities = 65/128 (50%), Positives = 85/128 (66%), Gaps = 1/128 (0%)
Query: 12 YKVPDDLVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHYMDIRGWRHLRLDG 71
Y + LV SGK+ LD++LP LK+ RVLIFSQ +LDIL Y+ RG+++ R+DG
Sbjct: 460 YTTGEHLVENSGKMVLLDKLLPKLKERDSRVLIFSQMTRLLDILEDYLMYRGYQYCRIDG 519
Query: 72 ATQVSSRQELIDEYNRD-QDLFAFLLSTKAGGLGINLTAADTVIIHDVDFNPYNDKQAED 130
T R ID +N+ + F FLLST+AGGLGINL AD VI++D D+NP D QA+D
Sbjct: 520 NTGGEDRDASIDAFNKPGSEKFVFLLSTRAGGLGINLATADIVILYDSDWNPQVDLQAQD 579
Query: 131 RCHRVGQT 138
R HR+GQ
Sbjct: 580 RAHRIGQK 587
>gnl|CDD|223627 COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family
[Transcription / DNA replication, recombination, and
repair].
Length = 866
Score = 117 bits (293), Expect = 2e-31
Identities = 66/119 (55%), Positives = 84/119 (70%), Gaps = 3/119 (2%)
Query: 23 GKLKKLDEILPD-LKKNGH--RVLIFSQFIFVLDILGHYMDIRGWRHLRLDGATQVSSRQ 79
GKL+ LDE+L D L + GH +VLIFSQF VLD+L Y+ G +++RLDG+T RQ
Sbjct: 692 GKLQALDELLLDKLLEEGHYHKVLIFSQFTPVLDLLEDYLKALGIKYVRLDGSTPAKRRQ 751
Query: 80 ELIDEYNRDQDLFAFLLSTKAGGLGINLTAADTVIIHDVDFNPYNDKQAEDRCHRVGQT 138
ELID +N D++ FLLS KAGGLG+NLT ADTVI+ D +NP + QA DR HR+GQ
Sbjct: 752 ELIDRFNADEEEKVFLLSLKAGGLGLNLTGADTVILFDPWWNPAVELQAIDRAHRIGQK 810
>gnl|CDD|238034 cd00079, HELICc, Helicase superfamily c-terminal domain; associated
with DEXDc-, DEAD-, and DEAH-box proteins, yeast
initiation factor 4A, Ski2p, and Hepatitis C virus NS3
helicases; this domain is found in a wide variety of
helicases and helicase related proteins; may not be an
autonomously folding unit, but an integral part of the
helicase; 4 helicase superfamilies at present according
to the organization of their signature motifs; all
helicases share the ability to unwind nucleic acid
duplexes with a distinct directional polarity; they
utilize the free energy from nucleoside triphosphate
hydrolysis to fuel their translocation along DNA,
unwinding the duplex in the process.
Length = 131
Score = 85.0 bits (211), Expect = 3e-22
Identities = 36/121 (29%), Positives = 57/121 (47%), Gaps = 2/121 (1%)
Query: 18 LVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHYMDIRGWRHLRLDGATQVSS 77
L VE KL+ L E+L + K G +VLIF +LD L + G + L G
Sbjct: 7 LPVEDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGDGSQEE 66
Query: 78 RQELIDEYNRDQDLFAFLLSTKAGGLGINLTAADTVIIHDVDFNPYNDKQAEDRCHRVGQ 137
R+E++ ++ + L++T GI+L VI +D+ ++P + Q R R GQ
Sbjct: 67 REEVLKDFREGE--IVVLVATDVIARGIDLPNVSVVINYDLPWSPSSYLQRIGRAGRAGQ 124
Query: 138 T 138
Sbjct: 125 K 125
>gnl|CDD|201125 pfam00271, Helicase_C, Helicase conserved C-terminal domain. The
Prosite family is restricted to DEAD/H helicases,
whereas this domain family is found in a wide variety of
helicases and helicase related proteins. It may be that
this is not an autonomously folding unit, but an
integral part of the helicase.
Length = 78
Score = 76.0 bits (188), Expect = 2e-19
Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 2/79 (2%)
Query: 58 YMDIRGWRHLRLDGATQVSSRQELIDEYNRDQDLFAFLLSTKAGGLGINLTAADTVIIHD 117
+ G + RL G R+E+++++ + L++T G GI+L + VI +D
Sbjct: 2 LLRKPGIKVARLHGGLSQEEREEILEDFRNGK--SKVLVATDVAGRGIDLPDVNLVINYD 59
Query: 118 VDFNPYNDKQAEDRCHRVG 136
+ +NP + Q R R G
Sbjct: 60 LPWNPASYIQRIGRAGRAG 78
>gnl|CDD|197757 smart00490, HELICc, helicase superfamily c-terminal domain.
Length = 82
Score = 73.4 bits (181), Expect = 3e-18
Identities = 23/84 (27%), Positives = 40/84 (47%), Gaps = 2/84 (2%)
Query: 53 DILGHYMDIRGWRHLRLDGATQVSSRQELIDEYNRDQDLFAFLLSTKAGGLGINLTAADT 112
+ L + G + RL G R+E++D++N L++T G++L D
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNG--KIKVLVATDVAERGLDLPGVDL 58
Query: 113 VIIHDVDFNPYNDKQAEDRCHRVG 136
VII+D+ ++P + Q R R G
Sbjct: 59 VIIYDLPWSPASYIQRIGRAGRAG 82
>gnl|CDD|240273 PTZ00110, PTZ00110, helicase; Provisional.
Length = 545
Score = 39.8 bits (93), Expect = 2e-04
Identities = 31/122 (25%), Positives = 57/122 (46%), Gaps = 8/122 (6%)
Query: 17 DLVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHYMDIRGWRHLRLDGATQVS 76
+V E K KL +L + ++G ++LIF + D L + + GW L + G +
Sbjct: 355 FVVEEHEKRGKLKMLLQRIMRDGDKILIFVETKKGADFLTKELRLDGWPALCIHGDKKQE 414
Query: 77 SRQELIDEYNRDQDLFAFLLSTKAGGLGINLTAADTVIIHDVDFNPYNDKQAEDRCHRVG 136
R +++E+ + +++T G+++ VI + DF Q ED HR+G
Sbjct: 415 ERTWVLNEFKTGKS--PIMIATDVASRGLDVKDVKYVI--NFDF----PNQIEDYVHRIG 466
Query: 137 QT 138
+T
Sbjct: 467 RT 468
>gnl|CDD|235307 PRK04537, PRK04537, ATP-dependent RNA helicase RhlB; Provisional.
Length = 572
Score = 33.0 bits (75), Expect = 0.033
Identities = 30/140 (21%), Positives = 58/140 (41%), Gaps = 24/140 (17%)
Query: 15 PDDLVVESGKLKKL-----------DEILPDL-----KKNGHRVLIFSQFIFVLDILGHY 58
P+ LVVE+ + +E L + G R ++F ++ +
Sbjct: 217 PEKLVVETETITAARVRQRIYFPADEEKQTLLLGLLSRSEGARTMVFVNTKAFVERVART 276
Query: 59 MDIRGWRHLRLDGATQVSSRQELIDEYNRDQDLFAFLLSTKAGGLGINLTAADTVIIHDV 118
++ G+R L G R+ L++ + + Q L++T G+++ V +D+
Sbjct: 277 LERHGYRVGVLSGDVPQKKRESLLNRFQKGQ--LEILVATDVAARGLHIDGVKYVYNYDL 334
Query: 119 DFNPYNDKQAEDRCHRVGQT 138
F+ AED HR+G+T
Sbjct: 335 PFD------AEDYVHRIGRT 348
>gnl|CDD|234096 TIGR03031, cas_csx12, CRISPR-associated protein Cas9/Csx12, subtype
II-B/NMENI. Members of this family of CRISPR-associated
(cas) protein are found, so far, in CRISPR/cas loci in
Wolinella succinogenes DSM 1740, Legionella pneumophila
str. Paris, and Francisella tularensis, where the last
probably is an example of a degenerate CRISPR locus,
having neither repeats nor a functional Cas1. The
characteristic repeat length is 37 base pairs and period
is about 72. One region of this large protein shows
sequence similarity to pfam01844, HNH endonuclease
[Mobile and extrachromosomal element functions, Other].
Length = 802
Score = 31.5 bits (71), Expect = 0.12
Identities = 11/38 (28%), Positives = 20/38 (52%), Gaps = 2/38 (5%)
Query: 61 IRGWRHLRLDGATQVSSRQELIDEYNRDQDLFAFLLST 98
++ WRHL+ D + + + LI + + QD + L T
Sbjct: 285 LKNWRHLKGDQESL--AVRNLIQQLKQKQDYISILEKT 320
>gnl|CDD|218255 pfam04769, MAT_Alpha1, Mating-type protein MAT alpha 1. This
family includes Saccharomyces cerevisiae mating type
protein alpha 1. Mat alpha 1 is a transcription
activator which activates mating-type alpha-specific
genes. MAT alpha 1 and MCM 1 bind cooperatively to PQ
elements upstream of alpha-specific genes. Alpha 1
interacts in vivo with STE12, linking expression of
alpha-specific genes to the alpha-pheromone (pfam04648)
response pathway.
Length = 200
Score = 28.9 bits (65), Expect = 0.70
Identities = 11/51 (21%), Positives = 20/51 (39%), Gaps = 10/51 (19%)
Query: 57 HYMDIRGWRHLRLDGATQVSSR----------QELIDEYNRDQDLFAFLLS 97
Y+++ GW + + T R Q + Y ++DL F+ S
Sbjct: 125 AYLELLGWSLTQNELGTLTLERWTAPKQQRLGQPMNQLYLTEKDLITFVQS 175
>gnl|CDD|223194 COG0116, COG0116, Predicted N6-adenine-specific DNA methylase [DNA
replication, recombination, and repair].
Length = 381
Score = 28.4 bits (64), Expect = 1.0
Identities = 16/69 (23%), Positives = 28/69 (40%), Gaps = 9/69 (13%)
Query: 2 ALKQTLDCAKYKVPDDLVVESGKLK-----KLDEILPDLKKNGHRVLIFSQFIFVLDILG 56
L + D A+ + D +V+ + K +D PD++ N L +D G
Sbjct: 103 HLFTSPDIAR--IVKDAIVDRFRRKYGRRPSVDLDGPDVRINVE--LDKDTATLGIDTTG 158
Query: 57 HYMDIRGWR 65
+ RG+R
Sbjct: 159 DSLHKRGYR 167
>gnl|CDD|187835 cd09704, Csx12, CRISPR/Cas system-associated protein Cas9. CRISPR
(Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; Very large protein
containing McrA/HNH-nuclease related domain and a
RuvC-like nuclease domain; signature gene for type II.
Length = 804
Score = 28.4 bits (63), Expect = 1.2
Identities = 14/38 (36%), Positives = 19/38 (50%), Gaps = 2/38 (5%)
Query: 61 IRGWRHLRLDGATQVSSRQELIDEYNRDQDLFAFLLST 98
I G H + DGA S ++L +E + L FLL T
Sbjct: 285 ILGEDHDKKDGAKY--SYKDLCNELKQKVGLIDFLLET 320
>gnl|CDD|212061 cd11492, SLC5sbd_NIS-SMVT, Na(+)/iodide (NIS) and
Na(+)/multivitamin (SMVT) cotransporters, and related
proteins; solute binding domain. NIS (encoded by the
SLC5A5 gene) transports I-, and other anions including
ClO4-, SCN-, and Br-. SMVT (encoded by the SLC5A6 gene)
transports biotin, pantothenic acid and lipoate. This
subfamily also includes SMCT1 and -2. SMCT1(encoded by
the SLC5A8 gene) is a high-affinity transporter of
various monocarboxylates including lactate and pyruvate,
short-chain fatty acids, ketone bodies, nicotinate and
its structural analogs, pyroglutamate, benzoate and its
derivatives, and iodide. SMCT2 (encoded by the SLC5A12
gene) is a low-affinity transporter for short-chain
fatty acids, lactate, pyruvate, and nicotinate. This
subgroup belongs to the solute carrier 5 (SLC5)
transporter family.
Length = 522
Score = 27.8 bits (63), Expect = 1.7
Identities = 17/49 (34%), Positives = 22/49 (44%), Gaps = 17/49 (34%)
Query: 10 AKYKVPDDLVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHY 58
AKY D L +G + K D++LP FV+DILGH
Sbjct: 291 AKYHDCDPL--TAGLISKPDQLLP---------------YFVMDILGHL 322
>gnl|CDD|173933 cd08174, G1PDH-like, Glycerol-1-phosphate dehydrogenase-like.
Glycerol-1-phosphate dehydrogenase-like. The proteins
of this family have not been characterized. The protein
sequences have high similarity with that of
glycerol-1-phosphate dehydrogenase (G1PDH). G1PDH plays
a role in the synthesis of phosphoglycerolipids in
Gram-positive bacterial species. It catalyzes the
reversibly reduction of dihydroxyacetone phosphate
(DHAP) to glycerol-1-phosphate (G1P) in a
NADH-dependent manner. Its activity requires Ni++ ion.
This family is bacteria specific.
Length = 331
Score = 28.0 bits (63), Expect = 1.8
Identities = 11/32 (34%), Positives = 16/32 (50%)
Query: 15 PDDLVVESGKLKKLDEILPDLKKNGHRVLIFS 46
P L + G + +L E+L D N RV + S
Sbjct: 1 PLVLDIGEGAIARLGELLSDRNPNFGRVAVVS 32
>gnl|CDD|212048 cd11478, SLC5sbd_u2, Uncharacterized bacterial solute carrier 5
subfamily; putative solute-binding domain. SLC5 (also
called the sodium/glucose cotransporter family or solute
sodium symporter family) is a family of proteins that
co-transports Na+ with sugars, amino acids, inorganic
ions or vitamins. Prokaryotic members of this family
include Vibrio parahaemolyticus glucose/galactose
(vSGLT), and Escherichia coli proline (PutP) and
pantothenate (PutF) cotransporters. One member of the
SLC5 family, human SGLT3, has been characterized as a
glucose sensor and not a transporter. This subfamily
belongs to the solute carrier 5 (SLC5) transporter
family.
Length = 495
Score = 27.2 bits (61), Expect = 2.9
Identities = 17/49 (34%), Positives = 27/49 (55%), Gaps = 8/49 (16%)
Query: 91 LFAFLLSTKAGGLGINLTAADTVIIHDVDFNPYNDKQAEDRCH--RVGQ 137
L A +S AG N++A +TV +D+ + Y K A D+ H +VG+
Sbjct: 332 LLAAFMSGMAG----NVSAFNTVFTYDI-YQTYIRKDASDK-HYLKVGR 374
>gnl|CDD|221566 pfam12416, DUF3668, Cep120 protein. This family includes the
Cep120 protein which is associated with centriole
structure and function.
Length = 334
Score = 26.9 bits (60), Expect = 3.3
Identities = 11/44 (25%), Positives = 18/44 (40%), Gaps = 7/44 (15%)
Query: 18 LVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHYMDI 61
L V + L++++P K S F F +LG D+
Sbjct: 190 LSVTLKSAENLEQLVPSSHKQNE-----SGFFFYYSLLG--NDV 226
>gnl|CDD|225650 COG3108, COG3108, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 185
Score = 26.6 bits (59), Expect = 3.7
Identities = 11/47 (23%), Positives = 20/47 (42%), Gaps = 6/47 (12%)
Query: 25 LKKLDEILPDLKKNG-HRV--LIFSQFIFVLDILGHY---MDIRGWR 65
L L+ +L D ++N R+ +F + + GH G+R
Sbjct: 66 LSGLNRLLRDWRQNEVVRMDPRLFDLVYQLKTLEGHRRPVQVTSGYR 112
>gnl|CDD|213304 cd05938, hsFATP2a_ACSVL_like, Fatty acid transport proteins (FATP)
including hsFATP2, hsFATP5, and hsFATP6, and similar
proteins. Fatty acid transport proteins (FATP) of this
family transport long-chain or very-long-chain fatty
acids across the plasma membrane. At least five copies
of FATPs are identified in mammalian cells. This family
includes hsFATP2, hsFATP5, and hsFATP6, and similar
proteins. Each FATP has unique patterns of tissue
distribution. These FATPs also have fatty acid CoA
synthetase activity, thus playing dual roles as fatty
acid transporters and its activation enzymes. The hsFATP
proteins exist in two splice variants; the b variant,
lacking exon 3, has no acyl-CoA synthetase activity.
FATPs are key players in the trafficking of exogenous
fatty acids into the cell and in intracellular fatty
acid homeostasis.
Length = 535
Score = 27.0 bits (60), Expect = 4.1
Identities = 15/46 (32%), Positives = 22/46 (47%), Gaps = 3/46 (6%)
Query: 1 MALKQTLDCAKYKVPDDLVVESGKLKKLDEILPDLKKNGHRVLIFS 46
+L C +V LV + L+ ++EILP L+ G RV S
Sbjct: 66 GSLLHCFRCCGARV---LVADPELLEAVEEILPALRAMGVRVFYLS 108
>gnl|CDD|219361 pfam07277, SapC, SapC. This family contains a number of bacterial
SapC proteins approximately 250 residues long. In
Campylobacter fetus, SapC forms part of a
paracrystalline surface layer (S-layer) that confers
serum resistance.
Length = 220
Score = 26.8 bits (60), Expect = 4.3
Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 1/43 (2%)
Query: 6 TLDCAKYKVPDDLVVESGKLKKL-DEILPDLKKNGHRVLIFSQ 47
TLD K + ++ KL L DE L +L K+G+ LI++
Sbjct: 168 TLDGEKRNLNGFYTIDEEKLNALSDEALLELHKSGYLPLIYAH 210
>gnl|CDD|224626 COG1712, COG1712, Predicted dinucleotide-utilizing enzyme
[General function prediction only].
Length = 255
Score = 26.6 bits (59), Expect = 5.1
Identities = 11/50 (22%), Positives = 20/50 (40%), Gaps = 5/50 (10%)
Query: 2 ALKQTLDCAKYKVPD-----DLVVESGKLKKLDEILPDLKKNGHRVLIFS 46
+ + DLVVE+ + + E +P + K G V++ S
Sbjct: 42 LEASVGRRCVSDIDELIAEVDLVVEAASPEAVREYVPKILKAGIDVIVMS 91
>gnl|CDD|237343 PRK13304, PRK13304, L-aspartate dehydrogenase; Reviewed.
Length = 265
Score = 26.1 bits (58), Expect = 6.7
Identities = 14/34 (41%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 14 VPD-DLVVESGKLKKLDEILPDLKKNGHRVLIFS 46
V D DLVVE + ++E++P +NG V+I S
Sbjct: 59 VEDVDLVVECASVNAVEEVVPKSLENGKDVIIMS 92
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.141 0.418
Gapped
Lambda K H
0.267 0.0781 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,256,687
Number of extensions: 675795
Number of successful extensions: 771
Number of sequences better than 10.0: 1
Number of HSP's gapped: 765
Number of HSP's successfully gapped: 36
Length of query: 138
Length of database: 10,937,602
Length adjustment: 87
Effective length of query: 51
Effective length of database: 7,078,804
Effective search space: 361019004
Effective search space used: 361019004
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 54 (24.5 bits)