RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10121
         (548 letters)



>gnl|CDD|216059 pfam00685, Sulfotransfer_1, Sulfotransferase domain. 
          Length = 254

 Score =  148 bits (375), Expect = 5e-41
 Identities = 94/317 (29%), Positives = 118/317 (37%), Gaps = 99/317 (31%)

Query: 64  DDDVWVCSFPKTGTTWTQEMVWCIANDLDFEAAKEIL----PARFPFLELTPLFDYRNNP 119
           DDDV + ++PK+GTTW QE++  I N  DFE  +E        R PFLE   LF      
Sbjct: 1   DDDVLIVTYPKSGTTWLQEILSLIPNRGDFEKFEEPHLFNPHNRSPFLEWYDLF------ 54

Query: 120 NLDAPDFEENSVVHIQNLKGRRFIKAHLPLTPLFDYRNNPNLDAPDFEENSVVHIQNLKG 179
                                                               V +  L  
Sbjct: 55  -------------------------------------------VIFDVAEGPVRLNALPS 71

Query: 180 RRFIKAHLPFKLLPKKLQSGTTNAKIIYVTRNPKDTCVSYYHH--CHLMEGYRGDFDDFL 237
            R IK HLP  LLPK L     NAKIIY+ RNPKD  VSYYH      +    G   +F+
Sbjct: 72  PRIIKTHLPLHLLPKSLWD--PNAKIIYLVRNPKDVAVSYYHFFRMAKILKAPGTPFEFV 129

Query: 238 KLFLNDAGNFSARLALARLFPQPDSFFTPIL-------IKKFLF------PQDLGSIITQ 284
           + FLN   N               S+F  +            LF       +D    I +
Sbjct: 130 EDFLNGKVNCG-------------SYFDHVKGWWKLRPPGNILFLRYEDLKKDPRGEIKK 176

Query: 285 VATHLDKSLTDDQV-----HLSFESMKSNPATNYEFAIDFNKENKLIDDKFCAGKFMRSG 339
           +A  L   LT++++     HLSFE+MK NP  NY          KL   +     F R G
Sbjct: 177 IAEFLGIPLTEEELDKIVKHLSFENMKGNPCLNYS---------KLPKHEV--SPFFRKG 225

Query: 340 QVGGWKAVMTPEIAEHV 356
            VG WK   T E AE  
Sbjct: 226 LVGDWKNYFTVEQAEKF 242



 Score =  115 bits (291), Expect = 3e-29
 Identities = 49/130 (37%), Positives = 69/130 (53%), Gaps = 12/130 (9%)

Query: 416 VCFAPYWDHVLEFWAVAKKRDNVLFIKYEDMKKDLGSIITQVATHLDKSLTDDQVDILKQ 475
           V    Y+DHV  +W   +   N+LF++YED+KKD    I ++A  L   LT++++D + +
Sbjct: 137 VNCGSYFDHVKGWWK-LRPPGNILFLRYEDLKKDPRGEIKKIAEFLGIPLTEEELDKIVK 195

Query: 476 HLSFESMKSNPATNYEFAIDFNKENKLIDDKFCAGKFMRSGQVGGWKAVMTPEIVEQFDP 535
           HLSFE+MK NP  NY          KL   +     F R G VG WK   T E  E+FD 
Sbjct: 196 HLSFENMKGNPCLNYS---------KLPKHEV--SPFFRKGLVGDWKNYFTVEQAEKFDE 244

Query: 536 WTRTKTKGSD 545
             + K KG+ 
Sbjct: 245 IYQEKMKGTG 254


>gnl|CDD|177822 PLN02164, PLN02164, sulfotransferase.
          Length = 346

 Score = 61.5 bits (149), Expect = 3e-10
 Identities = 39/138 (28%), Positives = 69/138 (50%), Gaps = 18/138 (13%)

Query: 418 FAPYWDHVLEFW-AVAKKRDNVLFIKYEDMKKDLGSIITQVATHLDKSLTDDQ-----VD 471
           + PY DHVL +W A  +  D +LF+KYE M+ D    + ++A  +    T ++     V+
Sbjct: 213 YGPYLDHVLGYWKAYQENPDRILFLKYETMRADPLPYVKRLAEFMGYGFTAEEEEKGVVE 272

Query: 472 ILKQHLSFESMKSNPATNYEFAIDFNKENKLIDDK---FCAGKFMRSGQVGGWKAVMTPE 528
            + +  SFE++K+         ++ NK  K  +D+   +    + R G+VG W+  +TPE
Sbjct: 273 KVVKLCSFETLKN---------LEANKGEKDREDRPAVYANSAYFRKGKVGDWQNYLTPE 323

Query: 529 IVEQFDPWTRTKTKGSDF 546
           +  + D     K KG+  
Sbjct: 324 MAARIDGLMEEKFKGTGL 341



 Score = 58.0 bits (140), Expect = 5e-09
 Identities = 79/331 (23%), Positives = 125/331 (37%), Gaps = 95/331 (28%)

Query: 63  RDDDVWVCSFPKTGTTWTQEMVWCIANDLDFEAAKEILPARFPFLELTPLFDYRNNPNLD 122
           R +D  VCS+PKTGTTW + + + IAN   F+ +   L  R P  E  P           
Sbjct: 78  RPNDFLVCSYPKTGTTWLKALTFAIANRSRFDDSSNPLLKRNPH-EFVP----------- 125

Query: 123 APDFEENSVVHIQNLKGRRFIKAHLPLTPLFDYRNNPNLDAPDFEENSVVHIQNLKGRRF 182
                              +I+   P  P  D   +                   KG   
Sbjct: 126 -------------------YIEIDFPFFPSVDVLKD-------------------KGNTL 147

Query: 183 IKAHLPFKLLPKKL-QSGTTNAKIIYVTRNPKDTCVSYYHHCHLMEGYRG-------DFD 234
              H+P+ LLP  + +SG    K++Y+ R+PKDT +S +   H     +G        FD
Sbjct: 148 FSTHIPYGLLPDSVVKSG---CKMVYIWRDPKDTFISMWTFLHKERSQQGPLNSLEESFD 204

Query: 235 DFLKLFLNDAGNFSARLALARLFPQ-PDSFFTPILIKKF-LFPQDLGSIITQVATHLDKS 292
            F +            L   + + + PD     IL  K+     D    + ++A  +   
Sbjct: 205 MFCRGLSVYGPYLDHVLGYWKAYQENPDR----ILFLKYETMRADPLPYVKRLAEFMGYG 260

Query: 293 LTDDQ---------VHL-SFESMKSNPATNYEFAIDFNKENKLIDDK---FCAGKFMRSG 339
            T ++         V L SFE++K+         ++ NK  K  +D+   +    + R G
Sbjct: 261 FTAEEEEKGVVEKVVKLCSFETLKN---------LEANKGEKDREDRPAVYANSAYFRKG 311

Query: 340 QVGGWKAVMTPEIAEHVSDETEIGKLLRSKF 370
           +VG W+  +TPE+A        I  L+  KF
Sbjct: 312 KVGDWQNYLTPEMA------ARIDGLMEEKF 336


>gnl|CDD|234328 TIGR03721, exospore_TM, BclB C-terminal domain.  This domain occurs
           as the C-terminal region in a number of proteins that
           have extensive collagen-like triple helix repeat
           regions. Member domains are predicted by TmHMM to have
           four or five transmembrane helices. Members are found
           mostly in the Firmicutes, but also in Acanthamoeba
           polyphaga mimivirus. Members include spore surface
           glycoprotein BclB from Bacillus anthracis, a protein of
           the exosporium. The exosporium is an additional
           outermost spore layer, lacking in B. subtilis and most
           other spore formers, consisting of a basal layer and,
           above it, a nap of fine filaments.
          Length = 165

 Score = 32.2 bits (73), Expect = 0.37
 Identities = 19/64 (29%), Positives = 25/64 (39%), Gaps = 17/64 (26%)

Query: 244 AGNFSARLALARL-----------FPQPDSFFTPI------LIKKFLFPQDLGSIITQVA 286
           A  FSA  ALA L            P P + FTP+      L      P  +GS  + + 
Sbjct: 65  AAYFSATAALALLGPVTITAQLYIAPAPSNVFTPVPGAPVTLTPPITGPIAVGSTASGII 124

Query: 287 THLD 290
           T L+
Sbjct: 125 TGLN 128


>gnl|CDD|150335 pfam09634, DUF2025, Protein of unknown function (DUF2025).  This
          protein is produced from gene PA1123 in Pseudomonas. It
          contains three alpha helices and six beta strands and
          is thought to be monomeric. It appears to be present in
          the biofilm layer and may be a lipoprotein.
          Length = 106

 Score = 29.4 bits (66), Expect = 1.6
 Identities = 15/43 (34%), Positives = 17/43 (39%), Gaps = 10/43 (23%)

Query: 48 EYYVNFAEDIINMDVRDDDVWVCSFPKTGTTWTQEMVWCIAND 90
           Y V F+ED   MDV DD +   S          E VW    D
Sbjct: 27 RYIVRFSEDSFGMDVADDSITPTS----------EFVWASVRD 59


>gnl|CDD|232980 TIGR00450, mnmE_trmE_thdF, tRNA modification GTPase TrmE.  TrmE,
           also called MnmE and previously designated ThdF
           (thiophene and furan oxidation protein), is a GTPase
           involved in tRNA modification to create
           5-methylaminomethyl-2-thiouridine in the wobble position
           of some tRNAs. This protein and GidA form an
           alpha2/beta2 heterotetramer [Protein synthesis, tRNA and
           rRNA base modification].
          Length = 442

 Score = 29.8 bits (67), Expect = 4.7
 Identities = 14/39 (35%), Positives = 21/39 (53%)

Query: 439 LFIKYEDMKKDLGSIITQVATHLDKSLTDDQVDILKQHL 477
           L  K E ++K L  ++ QV  ++D    DD+ D L Q L
Sbjct: 143 LDQKIEAIRKSLLQLLAQVEVNIDYEEDDDEQDSLNQLL 181


>gnl|CDD|237890 PRK15055, PRK15055, anaerobic sulfite reductase subunit A;
           Provisional.
          Length = 344

 Score = 29.2 bits (66), Expect = 5.2
 Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 7/60 (11%)

Query: 478 SFE-----SMKSNPATNYEFAIDFNKENKLIDDKFCAGKFMRSGQVGGWKAVMTPEIVEQ 532
           SFE     SM +N   NY  A+ F+++  L+D K    +F +  Q GG +   TPE VE+
Sbjct: 138 SFENCFCVSMGTNKTDNYSAAVRFDEDGVLVDIK--DEEFEKYFQGGGEEEEFTPEFVEE 195


>gnl|CDD|240743 cd12297, RRM2_Prp24, RNA recognition motif 2 in fungal
           pre-messenger RNA splicing protein 24 (Prp24) and
           similar proteins.  This subfamily corresponds to the
           RRM2 of Prp24, also termed U4/U6
           snRNA-associated-splicing factor PRP24 (U4/U6 snRNP), an
           RNA-binding protein with four well conserved RNA
           recognition motifs (RRMs), also termed RBDs (RNA binding
           domains) or RNPs (ribonucleoprotein domains). It
           facilitates U6 RNA base-pairing with U4 RNA during
           spliceosome assembly. Prp24 specifically binds free U6
           RNA primarily with RRMs 1 and 2 and facilitates pairing
           of U6 RNA bases with U4 RNA bases. Additionally, it may
           also be involved in dissociation of the U4/U6 complex
           during spliceosome activation. .
          Length = 78

 Score = 27.1 bits (61), Expect = 5.9
 Identities = 16/41 (39%), Positives = 21/41 (51%), Gaps = 8/41 (19%)

Query: 67  VWVCSFPKTGTTWTQEMVWCIANDLDFEAAKEILPARFPFL 107
           +WV +FP    ++ Q  +     DL FE   EIL  RFP L
Sbjct: 3   LWVTNFP---PSFDQSDI----RDL-FEQYGEILSIRFPSL 35


>gnl|CDD|130549 TIGR01485, SPP_plant-cyano, sucrose-6F-phosphate phosphohydrolase. 
           This model describes the sucrose phosphate
           phosphohydrolase from plants and cyanobacteria (SPP).
           SPP is a member of the Class IIB subfamily (TIGR01484)
           of the Haloacid Dehalogenase (HAD) superfamily of
           aspartate-nucleophile hydrolases. SPP catalyzes the
           final step in the biosynthesis of sucrose, a critically
           important molecule for plants. This model is limited to
           plants and cyanobacteria. However, a closely related
           group of sequences from bacteria and archaea (TIGR*****)
           may prove to catalyze the same reaction. If so, the
           SPP-subfamily model (TIGR01482) containing both of these
           groups should be promoted to equivalog and the two
           smaller models retired. Sucrose phosphate synthase
           (SPS), the prior step in the biosynthesis of sucrose
           contains a domain which exhibits considerable similarity
           to SPP albeit without conservation of the catalytic
           residues. The catalytic machinery of the synthase
           resides in another domain. It seems likely that the
           phosphatase-like domain is involved in substrate
           binding, possibly binding both substrates in a
           "product-like" orientation prior to ligation by the
           synthase catalytic domain.
          Length = 249

 Score = 28.6 bits (64), Expect = 7.6
 Identities = 14/46 (30%), Positives = 22/46 (47%)

Query: 256 LFPQPDSFFTPILIKKFLFPQDLGSIITQVATHLDKSLTDDQVHLS 301
           L PQPD    P  +  FL P+    +I Q+   L ++  D ++  S
Sbjct: 109 LKPQPDLEQRPHKVSFFLDPEAAPEVIKQLTEMLKETGLDVKLIYS 154


>gnl|CDD|218442 pfam05116, S6PP, Sucrose-6F-phosphate phosphohydrolase.  This
           family consists of Sucrose-6F-phosphate phosphohydrolase
           proteins found in plants and cyanobacteria.
           Sucrose-6(F)-phosphate phosphohydrolase catalyzes the
           final step in the pathway of sucrose biosynthesis.
          Length = 247

 Score = 28.4 bits (64), Expect = 8.6
 Identities = 12/50 (24%), Positives = 22/50 (44%)

Query: 253 LARLFPQPDSFFTPILIKKFLFPQDLGSIITQVATHLDKSLTDDQVHLSF 302
              L  QP+    P  +  FL P+   +++ ++   L+K   D +V  S 
Sbjct: 104 FPGLTLQPEEEQRPHKVSYFLDPEAAPAVLKELEQLLEKRGLDVKVIYSS 153


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.138    0.432 

Gapped
Lambda     K      H
   0.267   0.0624    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 28,394,989
Number of extensions: 2785585
Number of successful extensions: 2138
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2129
Number of HSP's successfully gapped: 28
Length of query: 548
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 446
Effective length of database: 6,413,494
Effective search space: 2860418324
Effective search space used: 2860418324
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (27.5 bits)