RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16019
(121 letters)
>gnl|CDD|217409 pfam03178, CPSF_A, CPSF A subunit region. This family includes a
region that lies towards the C-terminus of the cleavage
and polyadenylation specificity factor (CPSF) A (160
kDa) subunit. CPSF is involved in mRNA polyadenylation
and binds the AAUAAA conserved sequence in pre-mRNA.
CPSF has also been found to be necessary for splicing of
single-intron pre-mRNAs. The function of the aligned
region is unknown but may be involved in RNA/DNA
binding.
Length = 318
Score = 30.6 bits (70), Expect = 0.13
Identities = 20/131 (15%), Positives = 46/131 (35%), Gaps = 23/131 (17%)
Query: 2 TSVLPTMWLGSQNGSVFV---HSAVSQWR-RCLHSIQLKDSVLNIVHVQGRVVCALADGS 57
T+ +++G ++V + + + +H ++K +V + QGR++
Sbjct: 48 TAFDLGEDPAARSGRIYVFEIIEPETNRKLKLVHKTEVKGAVTALCEFQGRLLAGQG-QK 106
Query: 58 VAIFRRGPDG-----QWDLSKYHTVTLGLPHHSVRSLAAV---YNKVW-CGYK---NKIH 105
+ ++ G D D + V+L + V V GY ++
Sbjct: 107 LRVYDLGKDKLLPKAFLDTPITYVVSL----KVFGNRIIVGDLMKSVTFLGYDEEPYRLI 162
Query: 106 VV--DPKSLDV 114
+ D + V
Sbjct: 163 LFARDTQPRWV 173
>gnl|CDD|221303 pfam11902, DUF3422, Protein of unknown function (DUF3422). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria, archaea and
eukaryotes. Proteins in this family are typically
between 426 to 444 amino acids in length.
Length = 419
Score = 30.6 bits (70), Expect = 0.15
Identities = 18/72 (25%), Positives = 26/72 (36%), Gaps = 8/72 (11%)
Query: 1 MTSVLPTMWLGSQNGSVF--VHSAVSQWRRCLHSIQLKDSVLNIVHVQGRVVCALADGSV 58
+LP WL G + VH V S L + + + +ADG+
Sbjct: 102 ALELLPQDWLAELPGQLIVAVHLEVEPAPDPEESPLLIRAAFDGESL---AGSGVADGAA 158
Query: 59 AI---FRRGPDG 67
+ FR PDG
Sbjct: 159 VVATDFRIDPDG 170
>gnl|CDD|130398 TIGR01331, bisphos_cysQ, 3'(2'),5'-bisphosphate nucleotidase,
bacterial. Sulfate is incorporated into
3-phosphoadenylylsulfate, PAPS, for utilization in
pathways such as methionine biosynthesis. Transfer of
sulfate from PAPS to an acceptor leaves adenosine
3'-5'-bisphosphate, APS. This model describes a form
found in bacteria of the enzyme 3'(2'),5'-bisphosphate
nucleotidase, which removes the 3'-phosphate from APS to
regenerate AMP and help drive the cycle [Central
intermediary metabolism, Sulfur metabolism].
Length = 249
Score = 28.6 bits (64), Expect = 0.63
Identities = 13/28 (46%), Positives = 18/28 (64%), Gaps = 1/28 (3%)
Query: 51 CALADGSVAIF-RRGPDGQWDLSKYHTV 77
C +A+GS I+ R GP G+WD + H V
Sbjct: 189 CLVAEGSADIYPRLGPTGEWDTAAGHAV 216
>gnl|CDD|183200 PRK11562, PRK11562, nitrite transporter NirC; Provisional.
Length = 268
Score = 28.3 bits (63), Expect = 0.98
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 1 MTSVLPTMWLGSQNGSVFV 19
M ++LP WLG+ GSVFV
Sbjct: 101 MWAILPQTWLGNLVGSVFV 119
>gnl|CDD|187887 cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Zn-finger domain containing protein, distant homologs of
Cas8 proteins; signature gene for I-C subtype; also
known as Csd1 family.
Length = 569
Score = 28.0 bits (63), Expect = 1.1
Identities = 7/41 (17%), Positives = 14/41 (34%)
Query: 65 PDGQWDLSKYHTVTLGLPHHSVRSLAAVYNKVWCGYKNKIH 105
D L T + +++A + K GY+ +
Sbjct: 275 EDPFAALFDPPDDTKPDDAATTQAVALLLKKPITGYRADLD 315
>gnl|CDD|240114 cd04801, CBS_pair_M50_like, This cd contains two tandem repeats of
the cystathionine beta-synthase (CBS pair) domains in
association with the metalloprotease peptidase M50. CBS
is a small domain originally identified in cystathionine
beta-synthase and subsequently found in a wide range of
different proteins. CBS domains usually come in tandem
repeats, which associate to form a so-called Bateman
domain or a CBS pair which is reflected in this model.
The interface between the two CBS domains forms a cleft
that is a potential ligand binding site. The CBS pair
coexists with a variety of other functional domains. It
has been proposed that the CBS domain may play a
regulatory role, although its exact function is unknown.
Length = 114
Score = 27.2 bits (61), Expect = 1.4
Identities = 20/95 (21%), Positives = 35/95 (36%), Gaps = 10/95 (10%)
Query: 20 HSAVSQWRRCLHSIQLKDSVLNIVHVQGRVVCALADGSVAIFRRGPDGQWDLSKYHTVTL 79
H + ++ R + + +V +GR V + S+A R P QW + V
Sbjct: 9 HLTLREFVR-EYVLGSNQRRFVVVDNEGRYVGII---SLADLRAIPTSQWAQTTVIQVM- 63
Query: 80 GLPHHSVR-----SLAAVYNKVWCGYKNKIHVVDP 109
V SLA V + +++ VV+
Sbjct: 64 TPAAKLVTVLSEESLAEVLKLLEEQGLDELAVVED 98
>gnl|CDD|178120 PLN02504, PLN02504, nitrilase.
Length = 346
Score = 27.0 bits (60), Expect = 2.9
Identities = 17/50 (34%), Positives = 25/50 (50%), Gaps = 6/50 (12%)
Query: 58 VAIFRRGPDGQWDLSKYHTVTLGLPHHSVRSLAAVYNKVWCGYKNKIHVV 107
+AI R P G+ D KYH + +P V LAA+ K K+++V
Sbjct: 79 LAIGDRSPKGREDFRKYHASAIDVPGPEVDRLAAMAG------KYKVYLV 122
>gnl|CDD|192248 pfam09322, DUF1979, Domain of unknown function (DUF1979). Members
of this family of functionally uncharacterized domains
are found in various Oryza sativa mutator-like
transposases.
Length = 58
Score = 25.2 bits (55), Expect = 3.1
Identities = 14/48 (29%), Positives = 23/48 (47%), Gaps = 6/48 (12%)
Query: 62 RRGPDGQWDLSKYHTVTLGLPHHSVRSLAAVYNKVWCGYKNKIHVVDP 109
R GP G DLS + G+ + RS ++ N + G++ +DP
Sbjct: 17 RYGPTGV-DLSDFIVSERGIDRPAERSFPSIKNWLMRGFR-----IDP 58
>gnl|CDD|227204 COG4867, COG4867, Uncharacterized protein with a von Willebrand
factor type A (vWA) domain [General function prediction
only].
Length = 652
Score = 26.5 bits (58), Expect = 4.0
Identities = 9/19 (47%), Positives = 10/19 (52%)
Query: 54 ADGSVAIFRRGPDGQWDLS 72
A + RG DGQW LS
Sbjct: 355 ALVNQGFLDRGSDGQWRLS 373
>gnl|CDD|106970 PHA00661, PHA00661, hypothetical protein.
Length = 734
Score = 26.4 bits (58), Expect = 4.5
Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 44 HVQGRVVCALADGSVAIFRRGPDGQWDL-SKYHTVTLGLP 82
H++G+ V LADG+V R DG L + TV +GLP
Sbjct: 587 HLEGKTVSILADGAVHPQRVVTDGAITLDVEAGTVHIGLP 626
>gnl|CDD|227379 COG5046, MAF1, Protein involved in Mod5 protein sorting
[Posttranslational modification, protein turnover,
chaperones].
Length = 282
Score = 26.1 bits (57), Expect = 5.6
Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 2/46 (4%)
Query: 29 CLHSIQLKDSVLNIVHVQ--GRVVCALADGSVAIFRRGPDGQWDLS 72
L+ LK+ ++ IV Q R + AI R + + LS
Sbjct: 60 KLNDQNLKELMITIVVYQMEYRSTNSDKKLFKAIENRCQEDLFALS 105
>gnl|CDD|224139 COG1218, CysQ, 3'-Phosphoadenosine 5'-phosphosulfate (PAPS)
3'-phosphatase [Inorganic ion transport and metabolism].
Length = 276
Score = 25.7 bits (57), Expect = 6.2
Identities = 9/21 (42%), Positives = 14/21 (66%), Gaps = 1/21 (4%)
Query: 51 CALADGSVAIFRR-GPDGQWD 70
C +A+G+ I+ R GP +WD
Sbjct: 200 CLVAEGAADIYPRFGPTMEWD 220
>gnl|CDD|225013 COG2102, COG2102, Predicted ATPases of PP-loop superfamily [General
function prediction only].
Length = 223
Score = 25.7 bits (57), Expect = 6.8
Identities = 8/33 (24%), Positives = 11/33 (33%)
Query: 74 YHTVTLGLPHHSVRSLAAVYNKVWCGYKNKIHV 106
+ T+ L P R Y K W G +
Sbjct: 183 FETLVLDAPLFKKRIELVEYEKEWDGEWGYFEI 215
>gnl|CDD|167709 PRK04128, PRK04128,
1-(5-phosphoribosyl)-5-[(5-
phosphoribosylamino)methylideneamino]
imidazole-4-carboxamide isomerase; Provisional.
Length = 228
Score = 25.5 bits (56), Expect = 7.0
Identities = 12/25 (48%), Positives = 16/25 (64%), Gaps = 8/25 (32%)
Query: 100 YKNKIHVVD--------PKSLDVIR 116
Y +KIHVVD PK+LDV++
Sbjct: 42 YVDKIHVVDLDGAFEGKPKNLDVVK 66
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.324 0.136 0.439
Gapped
Lambda K H
0.267 0.0850 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,048,090
Number of extensions: 502125
Number of successful extensions: 529
Number of sequences better than 10.0: 1
Number of HSP's gapped: 529
Number of HSP's successfully gapped: 18
Length of query: 121
Length of database: 10,937,602
Length adjustment: 84
Effective length of query: 37
Effective length of database: 7,211,866
Effective search space: 266839042
Effective search space used: 266839042
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 53 (24.0 bits)