RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14464
         (175 letters)



>gnl|CDD|221172 pfam11701, UNC45-central, Myosin-binding striated muscle assembly
           central.  The UNC-45 or small muscle protein 1 of
           C.elegans is expressed in two forms from different
           genomic positions in mammals, as a general tissue
           protein UNC-45a and a specific form Unc-45b expressed
           only in striated and skeletal muscle. All members carry
           up to three amino-terminal tetratricopeptide repeat
           (TPR) domains towards their N-terminal, a UCS domain at
           the C-terminal that contains a number of Arm repeats
           pfam00514 and this central region of approximately 400
           residues. Both the general form and the muscle form of
           UNC-45 function in myotube formation through cell
           fusion. Myofibril formation requires both GC and SM
           UNC-45, consistent with the fact that the cytoskeleton
           is necessary for the development and maintenance of
           organised myofibrils. The S. pombe Rng3p, is crucial for
           cell shape, normal actin cytoskeleton, and contractile
           ring assembly, and is essential for assembly of the
           myosin II-containing progenitors of the contractile
           ring. Widespread defects in the cytoskeleton are found
           in null mutants of all three fungal proteins. Mammalian
           Unc45 is found to act as a specific chaperone during the
           folding of myosin and the assembly of striated muscle by
           forming a stable complex with the general chaperone
           Hsp90. The exact function of this central region is not
           known.
          Length = 155

 Score = 41.1 bits (97), Expect = 5e-05
 Identities = 17/32 (53%), Positives = 22/32 (68%), Gaps = 1/32 (3%)

Query: 11  IIKQGVDILKRLYM-SKNENIRVRALVGLCKL 41
           I K G++ LK LY  SK++ I+V A V LCKL
Sbjct: 124 ISKNGLEWLKDLYKNSKDDEIKVLAAVVLCKL 155


>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats.  This family includes multiple
           HEAT repeats.
          Length = 88

 Score = 28.1 bits (63), Expect = 0.77
 Identities = 24/84 (28%), Positives = 33/84 (39%), Gaps = 19/84 (22%)

Query: 19  LKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPD 78
           L+ L    +  +R  A   L +LG                  +   A    L +P  DP+
Sbjct: 5   LEALLSDPDPEVRAAAARALGELGD----------------PEALPALLELLKDP--DPE 46

Query: 79  IRRWAAEGLAYLTLDAEVKEALIE 102
           +RR AAE L  L  D E   AL+E
Sbjct: 47  VRRAAAEALGKLG-DPEALPALLE 69


>gnl|CDD|173083 PRK14620, PRK14620, NAD(P)H-dependent glycerol-3-phosphate
           dehydrogenase; Provisional.
          Length = 326

 Score = 29.4 bits (66), Expect = 0.96
 Identities = 12/39 (30%), Positives = 25/39 (64%)

Query: 6   HSITSIIKQGVDILKRLYMSKNENIRVRALVGLCKLGSM 44
           ++  ++I +G++ +K LY +KN +I +  L+G   LG +
Sbjct: 208 NAHAAVITKGMNEIKTLYSAKNGSIDLNTLIGPSCLGDL 246


>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
           40 amino acid long tandemly repeated sequence motif
           first identified in the Drosophila segment polarity gene
           armadillo; these repeats were also found in the
           mammalian armadillo homolog beta-catenin, the junctional
           plaque protein plakoglobin, the adenomatous polyposis
           coli (APC) tumor suppressor protein, and a number of
           other proteins. ARM has been implicated in mediating
           protein-protein interactions, but no common features
           among the target proteins recognized by the ARM repeats
           have been identified; related to the HEAT domain; three
           consecutive copies of the repeat are represented by this
           alignment model.
          Length = 120

 Score = 28.4 bits (64), Expect = 0.97
 Identities = 24/121 (19%), Positives = 44/121 (36%), Gaps = 11/121 (9%)

Query: 15  GVDILKRLYMSKNENIRVRALVGLCKLGSMGGSDASIRPFADGSTRKLAEACRRFLVN-- 72
           G+  L  L  S +EN++  A   L  L    G++ +I+   +             LV   
Sbjct: 8   GLPALVSLLSSSDENVQREAAWALSNL--SAGNNDNIQAVVEAG-------GLPALVQLL 58

Query: 73  PARDPDIRRWAAEGLAYLTLDAEVKEALIEDKPALAALVDVASSGKPACTYGVVTTLVNL 132
            + D ++ + A   L  L    E  + ++ +   +  LV++  S            L NL
Sbjct: 59  KSEDEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNL 118

Query: 133 C 133
            
Sbjct: 119 A 119


>gnl|CDD|219630 pfam07901, DUF1672, Protein of unknown function (DUF1672).  This
           family is composed of hypothetical bacterial proteins of
           unknown function.
          Length = 277

 Score = 28.3 bits (63), Expect = 2.0
 Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 6/53 (11%)

Query: 125 VVTTLVNLCNAYDKQEIIPEMIELAK--FAKHHIPEDHE----LDDPDFVTKR 171
           VVTTL +  + + K   + ++I+L++    K +IPE+      L      TKR
Sbjct: 208 VVTTLFSTKDNFTKDNTVDDVIDLSEELKKKKNIPENTTITLQLSKNKINTKR 260


>gnl|CDD|234952 PRK01402, hslO, Hsp33-like chaperonin; Reviewed.
          Length = 328

 Score = 28.0 bits (63), Expect = 2.5
 Identities = 9/16 (56%), Positives = 11/16 (68%)

Query: 158 EDHELDDPDFVTKRLL 173
           ED EL DP   ++RLL
Sbjct: 243 EDDELIDPTVSSERLL 258


>gnl|CDD|225819 COG3280, TreY, Maltooligosyl trehalose synthase [Carbohydrate
           transport and metabolism].
          Length = 889

 Score = 28.2 bits (63), Expect = 2.8
 Identities = 16/67 (23%), Positives = 22/67 (32%)

Query: 39  CKLGSMGGSDASIRPFADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAEVKE 98
             LG++      I        R+     R  L      P+I     E LA    D E  +
Sbjct: 180 FLLGNLNAILERIAAVPSTRERETQAQFRAALAEILATPNIAACLDECLARFNADPEQLD 239

Query: 99  ALIEDKP 105
           AL E + 
Sbjct: 240 ALHERQH 246


>gnl|CDD|237248 PRK12890, PRK12890, allantoate amidohydrolase; Reviewed.
          Length = 414

 Score = 27.9 bits (63), Expect = 3.2
 Identities = 26/76 (34%), Positives = 30/76 (39%), Gaps = 18/76 (23%)

Query: 55  ADGSTRKLAEACRRFLVNPARDPDIRRWAAEGLAYLTLDAEVKEALIEDKPALAALVDVA 114
            DG+T  LAEA RR   +P   P   R      A+L       E  IE  P L A     
Sbjct: 157 DDGTT--LAEALRRIGGDPDALPGALRPPGAVAAFL-------ELHIEQGPVLEA----- 202

Query: 115 SSGKPACTYGVVTTLV 130
             G P    GVVT + 
Sbjct: 203 -EGLP---IGVVTAIQ 214


>gnl|CDD|234447 TIGR04046, MSMEG_0569_nitr, flavin-dependent oxidoreductase,
           MSMEG_0569 family.  Members of this protein family
           belong to a conserved seven-gene biosynthetic cluster
           found sparsely in Cyanobacteria, Proteobacteria, and
           Actinobacteria. Distant homologies to characterized
           proteins suggest that members are enzymes dependent on a
           flavinoid cofactor.
          Length = 400

 Score = 27.7 bits (62), Expect = 3.5
 Identities = 16/43 (37%), Positives = 26/43 (60%), Gaps = 2/43 (4%)

Query: 75  RDPDIRRWAAEGLA-YLTLDAEVKEALIEDKPALAALVDVASS 116
           RD D+RR+A EG+  Y  L+ +++E  +   P L A +D A +
Sbjct: 244 RDIDLRRFALEGMQLYGRLE-DIREGQLRFAPDLKANLDGADA 285


>gnl|CDD|199916 cd11307, M35_Asp_f2_like, Peptidase M35 domain of Asp f2, a major
           allergen from Aspergillus fumigatus, and related
           proteins; non catalytic.  In this domain subgroup the
           unique zinc-binding motif (the aspzincin motif,
           characteristic of the M35 deuterolysin family, and
           defined as the "HEXXH + D" motif: two His ligands and
           Asp as third ligand), is poorly conserved and may not
           bind Zinc. Members of this subgroup also lack a key
           conserved Tyr residue which acts as a proton donor
           during metallopeptidase catalysis. These include Asp f2,
           a major allergen from Aspergillus fumigatus, which
           reacts with serum from patients with ABPA (allergic
           bronchopulmonary aspergillosis), and pH-regulated
           antigen 1 (PRA1) from Candida albicans, which has a role
           in fungal morphogenesis and perhaps in the host-parasite
           interaction during candidal infection. No protease
           activity has been detected for Asp f2 to date. This
           subgroup also includes Saccharomyces cerevisiae Zps1p.
           The expression of the Zsp1 gene is increased in response
           to zinc deficiency; it is a target of the Zap1p
           transcription factor.
          Length = 179

 Score = 26.9 bits (60), Expect = 4.9
 Identities = 10/27 (37%), Positives = 17/27 (62%), Gaps = 3/27 (11%)

Query: 133 CNAYDKQEI---IPEMIELAKFAKHHI 156
           CNA  ++++   + E +ELA  AK H+
Sbjct: 2   CNATQRRQLSSALDETVELAAHAKDHL 28


>gnl|CDD|225532 COG2985, COG2985, Predicted permease [General function prediction
           only].
          Length = 544

 Score = 27.4 bits (61), Expect = 5.4
 Identities = 11/31 (35%), Positives = 12/31 (38%)

Query: 103 DKPALAALVDVASSGKPACTYGVVTTLVNLC 133
           D PALA   D A +  PA  Y     L    
Sbjct: 500 DPPALAFANDAARTDAPALGYATTYPLAMFL 530


>gnl|CDD|146808 pfam04361, DUF494, Protein of unknown function (DUF494).  Members
          of this family of uncharacterized proteins are often
          named Smg.
          Length = 155

 Score = 26.4 bits (59), Expect = 6.4
 Identities = 16/46 (34%), Positives = 20/46 (43%), Gaps = 8/46 (17%)

Query: 33 RALVGLCKLGSMGG--------SDASIRPFADGSTRKLAEACRRFL 70
          +AL  L  L  +          SD+SIR + D    KL   CR FL
Sbjct: 42 KALDWLENLAELQDESPYLAAASDSSIRIYTDEEQEKLDTECRGFL 87


>gnl|CDD|233007 TIGR00522, dph5, diphthine synthase.  Alternate name: diphthamide
           biosynthesis S-adenosylmethionine-dependent
           methyltransferase. This protein participates in the
           modification of a specific His of elongation factor 2 of
           eukarotes and Archaea to diphthamide. The protein was
           characterized in Saccharomyces cerevisiae and designated
           DPH5 [Protein fate, Protein modification and repair].
          Length = 257

 Score = 26.7 bits (59), Expect = 6.5
 Identities = 22/79 (27%), Positives = 32/79 (40%), Gaps = 9/79 (11%)

Query: 83  AAEGLAYL-TLDAEVKEALI-EDKPALAALVDVASSGKPACTYGVVTTLVNLCNAYDKQE 140
             EGL  L   + + K   I  D  A+   +  A SGKP      +  L N    YD  E
Sbjct: 177 IGEGLENLLEEEEKRKTGAITPDTYAVV--IARAGSGKPVVKCDKIENLKN----YDFGE 230

Query: 141 IIPEMIELAKFAKHHIPED 159
            +  ++ LAK   H +  +
Sbjct: 231 PLHCLVVLAKTL-HFMEFE 248


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 26.9 bits (60), Expect = 6.7
 Identities = 7/18 (38%), Positives = 12/18 (66%)

Query: 137 DKQEIIPEMIELAKFAKH 154
            K+E++ E+I  +KF K 
Sbjct: 174 SKKEVMKEVIAKSKFYKA 191


>gnl|CDD|236373 PRK09076, PRK09076, enoyl-CoA hydratase; Provisional.
          Length = 258

 Score = 26.4 bits (59), Expect = 8.2
 Identities = 20/60 (33%), Positives = 26/60 (43%), Gaps = 8/60 (13%)

Query: 14 QGVDILKRLYMSKNENIRVRALVGLC----KLGSMGGSDASIRPFADGSTRKLAEACRRF 69
            +  LK+L +  N +  V ALV +     K  S G   A +  FADG      E  RRF
Sbjct: 29 DSLQALKQLVLELNADKDVYALV-ITGDGEKFFSAG---ADLNLFADGDKAVAREMARRF 84


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.137    0.396 

Gapped
Lambda     K      H
   0.267   0.0738    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,275,233
Number of extensions: 872852
Number of successful extensions: 938
Number of sequences better than 10.0: 1
Number of HSP's gapped: 935
Number of HSP's successfully gapped: 44
Length of query: 175
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 85
Effective length of database: 6,945,742
Effective search space: 590388070
Effective search space used: 590388070
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 56 (25.3 bits)