RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy306
         (190 letters)



>gnl|CDD|112130 pfam03301, Trp_dioxygenase, Tryptophan 2,3-dioxygenase. 
          Length = 346

 Score =  169 bits (429), Expect = 6e-52
 Identities = 78/132 (59%), Positives = 97/132 (73%), Gaps = 1/132 (0%)

Query: 15  LDAESESLQSCQIADYKKRKEMFQTIFDEDIHASLLSRGDRSLSHKALQGAILITMYRDE 74
                E LQ    A+Y+K  E+  +  DE  H  LLS+G+R LSH+ALQGA++I  YRDE
Sbjct: 216 YPIPDEVLQRDLTAEYEKAPEVLASWLDEKRHDYLLSKGERRLSHRALQGALMIYFYRDE 275

Query: 75  PRFHLPSQILSALMDIDCCTAKWRHNHVLMVQRMIGSQHLGTGGTSGYQYLRSTLSDRYK 134
           PRF  P Q+L ALMDID    KWR+NHV+MVQRMIGS+  GTGG+SGY YLRST+SDRYK
Sbjct: 276 PRFSQPYQLLEALMDIDSLFTKWRYNHVVMVQRMIGSK-TGTGGSSGYAYLRSTVSDRYK 334

Query: 135 IFLDLFNVSSFL 146
           +F DLFN+S++L
Sbjct: 335 VFPDLFNLSTYL 346


>gnl|CDD|226014 COG3483, TDO2, Tryptophan 2,3-dioxygenase (vermilion) [Amino acid
           transport and metabolism].
          Length = 262

 Score = 65.6 bits (160), Expect = 3e-13
 Identities = 31/82 (37%), Positives = 47/82 (57%), Gaps = 3/82 (3%)

Query: 65  AILITMYRDEPRFHLPSQILSALMDIDCCTAKWRHNHVLMVQRMIGSQHLGTGGTSGYQY 124
           A  + +YRD  R+    ++   L+D++    +WR  HV  V+R+IG +  GTGGTSG  Y
Sbjct: 184 AAWLEVYRDPQRYWELYELAEKLVDLEDSFRRWRFRHVTTVERIIGFKR-GTGGTSGVGY 242

Query: 125 LRSTLSDRYKIFLDLFNVSSFL 146
           LRS L     +F +L+ V + L
Sbjct: 243 LRSML--DVVLFPELWTVRTLL 262


>gnl|CDD|188272 TIGR03036, trp_2_3_diox, tryptophan 2,3-dioxygenase.  Members of
           this family are tryptophan 2,3-dioxygenase, as confirmed
           by several experimental characterizations, and by
           conserved operon structure for many of the other
           members. This enzyme represents the first of a two-step
           degradation to L-kynurenine, and a three-step pathway
           (via kynurenine) to anthranilate plus alanine [Energy
           metabolism, Amino acids and amines].
          Length = 264

 Score = 55.0 bits (133), Expect = 2e-09
 Identities = 31/105 (29%), Positives = 54/105 (51%), Gaps = 6/105 (5%)

Query: 44  DIHASLLSRGDRSLSHKALQG--AILITMYRDEPRFHLPSQILSALMDIDCCTAKWRHNH 101
            I A +L R D +  ++      A  + +YRD   +    ++   L+D++    +WR  H
Sbjct: 164 PIPAEVLER-DWTQPYEPNPSVEAAWLEVYRDPDTYWELYELAEKLVDLEDLFRQWRFRH 222

Query: 102 VLMVQRMIGSQHLGTGGTSGYQYLRSTLSDRYKIFLDLFNVSSFL 146
           +  V+R+IG +  GTGG+SG  YL+  L     +F +L+ V + L
Sbjct: 223 LTTVERIIGFK-RGTGGSSGVAYLKKMLDVV--LFPELWTVRTEL 264


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 30.0 bits (68), Expect = 0.93
 Identities = 12/44 (27%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 1   MQGRDKQPKLQIDYLDAESESLQSCQIADYKKRKEMFQTIFDED 44
            + R  + + +ID L AE E L+  +I + +KR++     + E 
Sbjct: 320 AEERLAKLEAEIDKLLAEIEELER-EIEEERKRRDKLTEEYAEL 362


>gnl|CDD|226481 COG3972, COG3972, Superfamily I DNA and RNA helicases [General
           function prediction only].
          Length = 660

 Score = 28.3 bits (63), Expect = 2.6
 Identities = 10/58 (17%), Positives = 23/58 (39%), Gaps = 5/58 (8%)

Query: 6   KQPKLQIDYLDAESESLQS-CQIADYKKRKEMFQTIFDEDIHASLLSRGDRSLSHKAL 62
               + + + DA    ++   Q    ++R  +  T+  E + A  L     S++H  +
Sbjct: 605 DISVIDLIHRDASRREIKFLKQDKVRRER--LLSTLGSELLIAEDLDA--ESMAHVRM 658


>gnl|CDD|240750 cd12304, RRM_Set1, RNA recognition motif in the Set1-like family of
           histone-lysine N-methyltransferases.  This subfamily
           corresponds to the RRM of the Set1-like family of
           histone-lysine N-methyltransferases which includes Set1A
           and Set1B that are ubiquitously expressed vertebrates
           histone methyltransferases exhibiting high homology to
           yeast Set1. Set1A and Set1B proteins exhibit a largely
           non-overlapping subnuclear distribution in euchromatic
           nuclear speckles, strongly suggesting that they bind to
           a unique set of target genes and thus make non-redundant
           contributions to the epigenetic control of chromatin
           structure and gene expression. With the exception of the
           catalytic component, the subunit composition of the
           Set1A and Set1B histone methyltransferase complexes are
           identical. Each complex contains six human homologs of
           the yeast Set1/COMPASS complex, including Set1A or
           Set1B, Ash2 (homologous to yeast Bre2), CXXC finger
           protein 1 (CFP1; homologous to yeast Spp1), Rbbp5
           (homologous to yeast Swd1), Wdr5 (homologous to yeast
           Swd3), and Wdr82 (homologous to yeast Swd2). The genomic
           targeting of these complexes is determined by the
           identity of the catalytic subunit present in each
           histone methyltransferase complex. Thus, the Set1A and
           Set1B complexes may exhibit both overlapping and
           non-redundant properties. Both Set1A and Set1B contain
           an N-terminal RNA recognition motif (RRM), also termed
           RBD (RNA binding domain) or RNP (ribonucleoprotein
           domain), an N- SET domain, and a C-terminal catalytic
           SET domain followed by a post-SET domain. In contrast to
           Set1B, Set1A additionally contains an HCF-1 binding
           motif that interacts with HCF-1 in vivo. .
          Length = 93

 Score = 26.5 bits (59), Expect = 3.6
 Identities = 9/31 (29%), Positives = 12/31 (38%), Gaps = 10/31 (32%)

Query: 168 HHPRNREH--------ADSK--KNCVKHLNG 188
            HP+  +H           K  K CV+ LN 
Sbjct: 36  FHPKTNKHLGLARVVFDSVKSAKRCVEKLNQ 66


>gnl|CDD|225544 COG2998, TupB, ABC-type tungstate transport system, permease
           component [Coenzyme metabolism].
          Length = 280

 Score = 27.1 bits (60), Expect = 5.4
 Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 2/34 (5%)

Query: 27  IADYKKRKEMFQTIFDEDIHASLLSRGDRSLSHK 60
           I D K  KE F+ I +E   A  +SRGD S +  
Sbjct: 123 IKDAKNGKEAFEKIAEEK--AKFVSRGDNSGTDS 154


>gnl|CDD|182669 PRK10714, PRK10714, undecaprenyl phosphate
           4-deoxy-4-formamido-L-arabinose transferase;
           Provisional.
          Length = 325

 Score = 27.4 bits (61), Expect = 5.5
 Identities = 12/37 (32%), Positives = 20/37 (54%)

Query: 151 YIPVLSGTLKSQLSATPHHPRNREHADSKKNCVKHLN 187
           +IP+L+ T   +    P H   RE  DSK + ++ +N
Sbjct: 181 FIPILANTFARRAIEIPVHHAEREFGDSKYSFMRLIN 217


>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
          Length = 900

 Score = 27.5 bits (62), Expect = 5.6
 Identities = 20/76 (26%), Positives = 26/76 (34%), Gaps = 27/76 (35%)

Query: 9   KLQIDYLDAESESLQSCQIADYKKRKEMFQTIFD-----EDIHASLLSRG----DRSLSH 59
            +QID L             DY + KE    I D     E+ +   L RG    +R    
Sbjct: 378 DMQIDELKE-----------DYPELKENRDYILDIVDLEEEKYRETLERGKRIVERLAKK 426

Query: 60  KALQGAI----LITMY 71
           K     I    LI +Y
Sbjct: 427 K---EEIPLDDLIELY 439


>gnl|CDD|185194 PRK15294, PRK15294, putative fimbrial outer membrane usher protein
           SthC; Provisional.
          Length = 845

 Score = 27.0 bits (59), Expect = 7.1
 Identities = 13/46 (28%), Positives = 25/46 (54%)

Query: 29  DYKKRKEMFQTIFDEDIHASLLSRGDRSLSHKALQGAILITMYRDE 74
           DY   +++  T  D   +  + +R D SLS   + G+I +T+Y ++
Sbjct: 473 DYYALEDVLDTYSDNSHYDHVRNRTDLSLSQDIIYGSISLTLYNED 518


>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase.  The model describes
           alanine--tRNA ligase. This enzyme catalyzes the reaction
           (tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
           pyrophosphate + AMP) [Protein synthesis, tRNA
           aminoacylation].
          Length = 851

 Score = 27.0 bits (60), Expect = 8.4
 Identities = 11/53 (20%), Positives = 21/53 (39%), Gaps = 9/53 (16%)

Query: 28  ADYKKRKEMFQTIFDEDIHASL--LSRG----DRSLSH---KALQGAILITMY 71
            + K+++E  + I + +       L RG    ++ L     K L G     +Y
Sbjct: 340 PELKEKQEFVEQILELEEERFAKTLERGLRLFEKLLKKLGKKELDGEDAFKLY 392


>gnl|CDD|216277 pfam01065, Adeno_hexon, Hexon, adenovirus major coat protein,
           N-terminal domain.  Hexon is the major coat protein from
           adenovirus type 2. Hexon forms a homo-trimer. The 240
           copies of the hexon trimer are organised so that 12 lie
           on each of the 20 facets. The central 9 hexons in a
           facet are cemented together by 12 copies of polypeptide
           IX. The penton complex, formed by the peripentonal
           hexons and base hexon (holding in place a fibre), lie at
           each of the 12 vertices. The N and C-terminal domains
           adopt the same PNGase F-like fold although they are
           significantly different in length.
          Length = 488

 Score = 26.4 bits (59), Expect = 9.7
 Identities = 9/21 (42%), Positives = 13/21 (61%), Gaps = 1/21 (4%)

Query: 121 GYQYLRSTLSDRYKIFLDLFN 141
            YQY+ + L DR + F  L+N
Sbjct: 280 SYQYMLADLMDRSRYF-SLWN 299


>gnl|CDD|178207 PLN02597, PLN02597, phosphoenolpyruvate carboxykinase [ATP].
          Length = 555

 Score = 26.7 bits (59), Expect = 10.0
 Identities = 13/50 (26%), Positives = 21/50 (42%), Gaps = 18/50 (36%)

Query: 139 LFNVSSFLLPKQYI-PV-----------------LSGTLKSQLSATPHHP 170
           LF++  +L+P + I  +                 LSGT K+ LS  P+  
Sbjct: 220 LFSLMHYLMPMRGILSLHSGCNMGKDGDVALFFGLSGTGKTTLSTDPNRY 269


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.134    0.399 

Gapped
Lambda     K      H
   0.267   0.0596    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,376,154
Number of extensions: 838068
Number of successful extensions: 670
Number of sequences better than 10.0: 1
Number of HSP's gapped: 666
Number of HSP's successfully gapped: 20
Length of query: 190
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 99
Effective length of database: 6,901,388
Effective search space: 683237412
Effective search space used: 683237412
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 56 (25.6 bits)