RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16019
         (121 letters)



>gnl|CDD|217409 pfam03178, CPSF_A, CPSF A subunit region.  This family includes a
           region that lies towards the C-terminus of the cleavage
           and polyadenylation specificity factor (CPSF) A (160
           kDa) subunit. CPSF is involved in mRNA polyadenylation
           and binds the AAUAAA conserved sequence in pre-mRNA.
           CPSF has also been found to be necessary for splicing of
           single-intron pre-mRNAs. The function of the aligned
           region is unknown but may be involved in RNA/DNA
           binding.
          Length = 318

 Score = 30.6 bits (70), Expect = 0.13
 Identities = 20/131 (15%), Positives = 46/131 (35%), Gaps = 23/131 (17%)

Query: 2   TSVLPTMWLGSQNGSVFV---HSAVSQWR-RCLHSIQLKDSVLNIVHVQGRVVCALADGS 57
           T+        +++G ++V       +  + + +H  ++K +V  +   QGR++       
Sbjct: 48  TAFDLGEDPAARSGRIYVFEIIEPETNRKLKLVHKTEVKGAVTALCEFQGRLLAGQG-QK 106

Query: 58  VAIFRRGPDG-----QWDLSKYHTVTLGLPHHSVRSLAAV---YNKVW-CGYK---NKIH 105
           + ++  G D        D    + V+L        +   V      V   GY     ++ 
Sbjct: 107 LRVYDLGKDKLLPKAFLDTPITYVVSL----KVFGNRIIVGDLMKSVTFLGYDEEPYRLI 162

Query: 106 VV--DPKSLDV 114
           +   D +   V
Sbjct: 163 LFARDTQPRWV 173


>gnl|CDD|221303 pfam11902, DUF3422, Protein of unknown function (DUF3422).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria, archaea and
           eukaryotes. Proteins in this family are typically
           between 426 to 444 amino acids in length.
          Length = 419

 Score = 30.6 bits (70), Expect = 0.15
 Identities = 18/72 (25%), Positives = 26/72 (36%), Gaps = 8/72 (11%)

Query: 1   MTSVLPTMWLGSQNGSVF--VHSAVSQWRRCLHSIQLKDSVLNIVHVQGRVVCALADGSV 58
              +LP  WL    G +   VH  V        S  L  +  +   +       +ADG+ 
Sbjct: 102 ALELLPQDWLAELPGQLIVAVHLEVEPAPDPEESPLLIRAAFDGESL---AGSGVADGAA 158

Query: 59  AI---FRRGPDG 67
            +   FR  PDG
Sbjct: 159 VVATDFRIDPDG 170


>gnl|CDD|130398 TIGR01331, bisphos_cysQ, 3'(2'),5'-bisphosphate nucleotidase,
           bacterial.  Sulfate is incorporated into
           3-phosphoadenylylsulfate, PAPS, for utilization in
           pathways such as methionine biosynthesis. Transfer of
           sulfate from PAPS to an acceptor leaves adenosine
           3'-5'-bisphosphate, APS. This model describes a form
           found in bacteria of the enzyme 3'(2'),5'-bisphosphate
           nucleotidase, which removes the 3'-phosphate from APS to
           regenerate AMP and help drive the cycle [Central
           intermediary metabolism, Sulfur metabolism].
          Length = 249

 Score = 28.6 bits (64), Expect = 0.63
 Identities = 13/28 (46%), Positives = 18/28 (64%), Gaps = 1/28 (3%)

Query: 51  CALADGSVAIF-RRGPDGQWDLSKYHTV 77
           C +A+GS  I+ R GP G+WD +  H V
Sbjct: 189 CLVAEGSADIYPRLGPTGEWDTAAGHAV 216


>gnl|CDD|183200 PRK11562, PRK11562, nitrite transporter NirC; Provisional.
          Length = 268

 Score = 28.3 bits (63), Expect = 0.98
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 1   MTSVLPTMWLGSQNGSVFV 19
           M ++LP  WLG+  GSVFV
Sbjct: 101 MWAILPQTWLGNLVGSVFV 119


>gnl|CDD|187887 cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Zn-finger domain containing protein, distant homologs of
           Cas8 proteins; signature gene for I-C subtype; also
           known as Csd1 family.
          Length = 569

 Score = 28.0 bits (63), Expect = 1.1
 Identities = 7/41 (17%), Positives = 14/41 (34%)

Query: 65  PDGQWDLSKYHTVTLGLPHHSVRSLAAVYNKVWCGYKNKIH 105
            D    L      T      + +++A +  K   GY+  + 
Sbjct: 275 EDPFAALFDPPDDTKPDDAATTQAVALLLKKPITGYRADLD 315


>gnl|CDD|240114 cd04801, CBS_pair_M50_like, This cd contains two tandem repeats of
           the cystathionine beta-synthase (CBS pair) domains in
           association with the metalloprotease peptidase M50.  CBS
           is a small domain originally identified in cystathionine
           beta-synthase and subsequently found in a wide range of
           different proteins. CBS domains usually come in tandem
           repeats, which associate to form a so-called Bateman
           domain or a CBS pair which is reflected in this model.
           The interface between the two CBS domains forms a cleft
           that is a potential ligand binding site. The CBS pair
           coexists with a variety of other functional domains.  It
           has been proposed that the CBS domain may play a
           regulatory role, although its exact function is unknown.
          Length = 114

 Score = 27.2 bits (61), Expect = 1.4
 Identities = 20/95 (21%), Positives = 35/95 (36%), Gaps = 10/95 (10%)

Query: 20  HSAVSQWRRCLHSIQLKDSVLNIVHVQGRVVCALADGSVAIFRRGPDGQWDLSKYHTVTL 79
           H  + ++ R  + +        +V  +GR V  +   S+A  R  P  QW  +    V  
Sbjct: 9   HLTLREFVR-EYVLGSNQRRFVVVDNEGRYVGII---SLADLRAIPTSQWAQTTVIQVM- 63

Query: 80  GLPHHSVR-----SLAAVYNKVWCGYKNKIHVVDP 109
                 V      SLA V   +     +++ VV+ 
Sbjct: 64  TPAAKLVTVLSEESLAEVLKLLEEQGLDELAVVED 98


>gnl|CDD|178120 PLN02504, PLN02504, nitrilase.
          Length = 346

 Score = 27.0 bits (60), Expect = 2.9
 Identities = 17/50 (34%), Positives = 25/50 (50%), Gaps = 6/50 (12%)

Query: 58  VAIFRRGPDGQWDLSKYHTVTLGLPHHSVRSLAAVYNKVWCGYKNKIHVV 107
           +AI  R P G+ D  KYH   + +P   V  LAA+        K K+++V
Sbjct: 79  LAIGDRSPKGREDFRKYHASAIDVPGPEVDRLAAMAG------KYKVYLV 122


>gnl|CDD|192248 pfam09322, DUF1979, Domain of unknown function (DUF1979).  Members
           of this family of functionally uncharacterized domains
           are found in various Oryza sativa mutator-like
           transposases.
          Length = 58

 Score = 25.2 bits (55), Expect = 3.1
 Identities = 14/48 (29%), Positives = 23/48 (47%), Gaps = 6/48 (12%)

Query: 62  RRGPDGQWDLSKYHTVTLGLPHHSVRSLAAVYNKVWCGYKNKIHVVDP 109
           R GP G  DLS +     G+   + RS  ++ N +  G++     +DP
Sbjct: 17  RYGPTGV-DLSDFIVSERGIDRPAERSFPSIKNWLMRGFR-----IDP 58


>gnl|CDD|227204 COG4867, COG4867, Uncharacterized protein with a von Willebrand
           factor type A (vWA) domain [General function prediction
           only].
          Length = 652

 Score = 26.5 bits (58), Expect = 4.0
 Identities = 9/19 (47%), Positives = 10/19 (52%)

Query: 54  ADGSVAIFRRGPDGQWDLS 72
           A  +     RG DGQW LS
Sbjct: 355 ALVNQGFLDRGSDGQWRLS 373


>gnl|CDD|106970 PHA00661, PHA00661, hypothetical protein.
          Length = 734

 Score = 26.4 bits (58), Expect = 4.5
 Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 1/40 (2%)

Query: 44  HVQGRVVCALADGSVAIFRRGPDGQWDL-SKYHTVTLGLP 82
           H++G+ V  LADG+V   R   DG   L  +  TV +GLP
Sbjct: 587 HLEGKTVSILADGAVHPQRVVTDGAITLDVEAGTVHIGLP 626


>gnl|CDD|227379 COG5046, MAF1, Protein involved in Mod5 protein sorting
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 282

 Score = 26.1 bits (57), Expect = 5.6
 Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 2/46 (4%)

Query: 29  CLHSIQLKDSVLNIVHVQ--GRVVCALADGSVAIFRRGPDGQWDLS 72
            L+   LK+ ++ IV  Q   R   +      AI  R  +  + LS
Sbjct: 60  KLNDQNLKELMITIVVYQMEYRSTNSDKKLFKAIENRCQEDLFALS 105


>gnl|CDD|224139 COG1218, CysQ, 3'-Phosphoadenosine 5'-phosphosulfate (PAPS)
           3'-phosphatase [Inorganic ion transport and metabolism].
          Length = 276

 Score = 25.7 bits (57), Expect = 6.2
 Identities = 9/21 (42%), Positives = 14/21 (66%), Gaps = 1/21 (4%)

Query: 51  CALADGSVAIFRR-GPDGQWD 70
           C +A+G+  I+ R GP  +WD
Sbjct: 200 CLVAEGAADIYPRFGPTMEWD 220


>gnl|CDD|225013 COG2102, COG2102, Predicted ATPases of PP-loop superfamily [General
           function prediction only].
          Length = 223

 Score = 25.7 bits (57), Expect = 6.8
 Identities = 8/33 (24%), Positives = 11/33 (33%)

Query: 74  YHTVTLGLPHHSVRSLAAVYNKVWCGYKNKIHV 106
           + T+ L  P    R     Y K W G      +
Sbjct: 183 FETLVLDAPLFKKRIELVEYEKEWDGEWGYFEI 215


>gnl|CDD|167709 PRK04128, PRK04128,
           1-(5-phosphoribosyl)-5-[(5-
           phosphoribosylamino)methylideneamino]
           imidazole-4-carboxamide isomerase; Provisional.
          Length = 228

 Score = 25.5 bits (56), Expect = 7.0
 Identities = 12/25 (48%), Positives = 16/25 (64%), Gaps = 8/25 (32%)

Query: 100 YKNKIHVVD--------PKSLDVIR 116
           Y +KIHVVD        PK+LDV++
Sbjct: 42  YVDKIHVVDLDGAFEGKPKNLDVVK 66


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.324    0.136    0.439 

Gapped
Lambda     K      H
   0.267   0.0850    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,048,090
Number of extensions: 502125
Number of successful extensions: 529
Number of sequences better than 10.0: 1
Number of HSP's gapped: 529
Number of HSP's successfully gapped: 18
Length of query: 121
Length of database: 10,937,602
Length adjustment: 84
Effective length of query: 37
Effective length of database: 7,211,866
Effective search space: 266839042
Effective search space used: 266839042
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 53 (24.0 bits)