RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5736
         (346 letters)



>gnl|CDD|235198 PRK04011, PRK04011, peptide chain release factor 1; Provisional.
          Length = 411

 Score = 36.0 bits (84), Expect = 0.024
 Identities = 20/72 (27%), Positives = 32/72 (44%), Gaps = 8/72 (11%)

Query: 218 LMESLEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTEIKTKLSELN 277
           + E L K  +  +C N  CG      K ++   +  EK C  C S  ++ E +  + EL+
Sbjct: 318 ISEDLRKDRVTYKCPN--CGYE--EEKTVKRREELPEKTCPKCGSELEIVEEEDIIEELS 373

Query: 278 EMFYRGIEQMNT 289
           E+     EQ  T
Sbjct: 374 EL----AEQSGT 381


>gnl|CDD|216155 pfam00856, SET, SET domain.  SET domains are protein lysine
           methyltransferase enzymes. SET domains appear to be
           protein-protein interaction domains. It has been
           demonstrated that SET domains mediate interactions with
           a family of proteins that display similarity with
           dual-specificity phosphatases (dsPTPases). A subset of
           SET domains have been called PR domains. These domains
           are divergent in sequence from other SET domains, but
           also appear to mediate protein-protein interaction. The
           SET domain consists of two regions known as SET-N and
           SET-C. SET-C forms an unusual and conserved knot-like
           structure of probably functional importance.
           Additionally to SET-N and SET-C, an insert region
           (SET-I) and flanking regions of high structural
           variability form part of the overall structure.
          Length = 113

 Score = 32.5 bits (74), Expect = 0.098
 Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 4/51 (7%)

Query: 138 GVGIYPTVSLFNHDCYP-AVTRYFN---GKNIIVKALRPLKPKEVVAENYG 184
             G+       NH C P    R+     G  I+V+ALR +KP E +  +YG
Sbjct: 63  ATGLGNVARFINHSCEPNCEVRFVFVNGGDRIVVRALRDIKPGEELTIDYG 113


>gnl|CDD|214614 smart00317, SET, SET (Su(var)3-9, Enhancer-of-zeste, Trithorax)
           domain.  Putative methyl transferase, based on outlier
           plant homologues.
          Length = 124

 Score = 29.6 bits (67), Expect = 1.1
 Identities = 14/49 (28%), Positives = 22/49 (44%), Gaps = 4/49 (8%)

Query: 146 SLFNHDCYPAVTRYF----NGKNIIVKALRPLKPKEVVAENYGLVFSRK 190
              NH C P     F        I++ ALR +KP E +  +YG  ++ +
Sbjct: 76  RFINHSCEPNCELLFVEVNGDDRIVIFALRDIKPGEELTIDYGSDYANE 124


>gnl|CDD|146106 pfam03302, VSP, Giardia variant-specific surface protein. 
          Length = 397

 Score = 30.7 bits (69), Expect = 1.2
 Identities = 16/62 (25%), Positives = 21/62 (33%), Gaps = 10/62 (16%)

Query: 209 CRACVENWPLMESLEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTE 268
           C  C   + L     K      C  +NC       K      ++V   CE CNS + LT 
Sbjct: 1   CDECKPGYELSADKTKCTSSAPCKTENC-------KACSNDKREV---CEECNSNNYLTP 50

Query: 269 IK 270
             
Sbjct: 51  TS 52


>gnl|CDD|225633 COG3091, SprT, Zn-dependent metalloprotease, SprT family [General
           function prediction only].
          Length = 156

 Score = 28.9 bits (65), Expect = 2.2
 Identities = 14/55 (25%), Positives = 24/55 (43%), Gaps = 13/55 (23%)

Query: 176 KEVVAENYGLVFSRKHLIDRQKVLSARYWFECKCRACVENWPLMESLEKYPIRIR 230
           K ++ +  GL F R H  + Q V    Y + C+C+             ++ +RIR
Sbjct: 89  KLLMQQVLGLRFCRTHQFEVQSVRRTTYPYRCQCQ-------------QHYLRIR 130


>gnl|CDD|193497 cd03875, M28_Fxna_like, M28 Zn-peptidase Endoplasmic reticulum
           metallopeptidase 1.  Peptidase family M28; Endoplasmic
           reticulum metallopeptidase 1 (ERMP1; Felix-ina, FXNA or
           Fxna peptidase; KIAA1815) subfamily. ERMP1 is a
           multi-pass membrane protein located in the endoplasmic
           reticulum membrane. In humans, Fxna may play a crucial
           role in processing proteins required for the
           organization of somatic cells and oocytes into discrete
           follicular structures, although which proteins are
           hydrolyzed has not yet been determined. Another member
           of this cluster is the 24-kDa vacuolar protein (VP24)
           which is probably involved in the formation of
           intravacuolar pigmented globules (cyanoplasts) in highly
           anthocyanin-containing vacuoles; however, the biological
           function of the C-terminal region which includes the
           putative transmembrane metallopeptidase domain is
           unknown.
          Length = 307

 Score = 29.5 bits (67), Expect = 2.7
 Identities = 9/39 (23%), Positives = 16/39 (41%), Gaps = 4/39 (10%)

Query: 148 FNHDCYPAVTRYFNGKNIIVKALRPLKPKEVVAENYGLV 186
            +       + YF G NI+V+    +  K   +E   L+
Sbjct: 64  SDFLGGGMTSVYFEGTNIVVR----ISGKNNGSEGALLL 98


>gnl|CDD|227506 COG5179, TAF1, Transcription initiation factor TFIID, subunit TAF1
           [Transcription].
          Length = 968

 Score = 28.9 bits (64), Expect = 4.6
 Identities = 20/81 (24%), Positives = 37/81 (45%)

Query: 246 LEPSAKKVEKKCESCNSTSDLTEIKTKLSELNEMFYRGIEQMNTSCFREAVESLTKFSDQ 305
           L+  AK  +K+ E  +    L E+K K +  N+ +Y  +++ + S  R  + +L      
Sbjct: 333 LKRQAKPKKKQQEVFSDDKPLNELKGKFNISNDKYYGILKKTHQSKVRNTLSTLGIQHSL 392

Query: 306 IHELVVPPYKLASLAHEALRN 326
               + PP+   +   E LRN
Sbjct: 393 PALRLDPPFYKNNHTKEELRN 413


>gnl|CDD|173900 cd00674, LysRS_core_class_I, catalytic core domain of  class I
           lysyl tRNA synthetase.  Class I lysyl tRNA synthetase
           (LysRS) catalytic core domain. This class I enzyme is a
           monomer which aminoacylates the 2'-OH of the nucleotide
           at the 3' of the appropriate tRNA. The core domain is
           based on the Rossman fold and is responsible for the
           ATP-dependent formation of the enzyme bound
           aminoacyl-adenylate. It contains the characteristic
           class I HIGH and KMSKS motifs, which are involved in ATP
           binding. The class I LysRS is found only in archaea and
           some bacteria and has evolved separately from class II
           LysRS, as the two do not share structural or sequence
           similarity.
          Length = 353

 Score = 28.8 bits (65), Expect = 4.9
 Identities = 18/66 (27%), Positives = 26/66 (39%), Gaps = 11/66 (16%)

Query: 210 RACVENWPLMESLEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTS-DLTE 268
           R   E W        YP    C    CG+   TV+  +  A  V  KCE  +  + D+  
Sbjct: 159 RELQETW--------YPFMPYCEK--CGKDTTTVEAYDAKAGTVTYKCECGHEETVDIRT 208

Query: 269 IKTKLS 274
            + KL+
Sbjct: 209 GRGKLT 214


>gnl|CDD|216284 pfam01074, Glyco_hydro_38, Glycosyl hydrolases family 38 N-terminal
           domain.  Glycosyl hydrolases are key enzymes of
           carbohydrate metabolism.
          Length = 269

 Score = 28.3 bits (64), Expect = 5.0
 Identities = 12/57 (21%), Positives = 20/57 (35%), Gaps = 9/57 (15%)

Query: 10  IWDHITGSRLISWRPDRGSWARNSARCTALTLLNE---------HDDSLLLTGYDDG 57
           IW    GS +++       +    A+  A  LL++          +  L   G  DG
Sbjct: 161 IWRGPDGSEILTHMLPFDYYPTYGAQFRADDLLDQAKKYADKTRTNHVLWPFGDGDG 217


>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
          Length = 3164

 Score = 28.8 bits (64), Expect = 6.3
 Identities = 21/107 (19%), Positives = 30/107 (28%), Gaps = 17/107 (15%)

Query: 207  CKCRACVENWPLMESLEKYPIRIRC--------SNDNCGQIIATVKKLEPSAKKVEKKCE 258
            C+  +  E W  M+S     IR             D  G I           K+      
Sbjct: 2726 CELSSEFEKWRRMKSKYLCAIRYMLMSSEWILDHEDRSGFIHRLDVSFLLRTKR------ 2779

Query: 259  SCNSTSDLTEIKTKLSELNEMFYRGIEQMNTSCFREAVESLTKFSDQ 305
                 S L E K     L+     G + ++ SC R   +       Q
Sbjct: 2780 ---FVSTLLEDKNYRQVLSSCSLYGNDVISHSCDRFDRDVYRALKHQ 2823


>gnl|CDD|237234 PRK12854, PRK12854, glucose-6-phosphate 1-dehydrogenase;
           Provisional.
          Length = 484

 Score = 28.5 bits (64), Expect = 6.9
 Identities = 9/17 (52%), Positives = 13/17 (76%)

Query: 163 KNIIVKALRPLKPKEVV 179
           KN + +++RPL P EVV
Sbjct: 270 KNKVFRSMRPLDPAEVV 286


>gnl|CDD|235164 PRK03824, hypA, hydrogenase nickel incorporation protein;
           Provisional.
          Length = 135

 Score = 27.0 bits (60), Expect = 8.6
 Identities = 10/23 (43%), Positives = 10/23 (43%)

Query: 208 KCRACVENWPLMESLEKYPIRIR 230
           KCR C   W L E  E     IR
Sbjct: 72  KCRNCGNEWSLKEVKESLDEEIR 94


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 27.7 bits (62), Expect = 9.3
 Identities = 26/100 (26%), Positives = 42/100 (42%), Gaps = 13/100 (13%)

Query: 238 QIIATVKKLEPSAKKVEKKCESCNSTSDLT----EIKTKLSELNEMFYRGI-------EQ 286
           +++  +K+L    +  +K  E      +L     E+K K  E++E             E+
Sbjct: 135 ELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEE 194

Query: 287 MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRN 326
           M    F EA + L K +D++HE  V   K     HE  RN
Sbjct: 195 MIK-LFEEA-DELRKEADELHEEFVELSKKIDELHEEFRN 232


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.133    0.407 

Gapped
Lambda     K      H
   0.267   0.0717    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,021,828
Number of extensions: 1582749
Number of successful extensions: 1403
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1399
Number of HSP's successfully gapped: 24
Length of query: 346
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 248
Effective length of database: 6,590,910
Effective search space: 1634545680
Effective search space used: 1634545680
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.5 bits)