RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4702
         (138 letters)



>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
           40 amino acid long tandemly repeated sequence motif
           first identified in the Drosophila segment polarity gene
           armadillo; these repeats were also found in the
           mammalian armadillo homolog beta-catenin, the junctional
           plaque protein plakoglobin, the adenomatous polyposis
           coli (APC) tumor suppressor protein, and a number of
           other proteins. ARM has been implicated in mediating
           protein-protein interactions, but no common features
           among the target proteins recognized by the ARM repeats
           have been identified; related to the HEAT domain; three
           consecutive copies of the repeat are represented by this
           alignment model.
          Length = 120

 Score = 41.9 bits (99), Expect = 8e-06
 Identities = 23/97 (23%), Positives = 42/97 (43%)

Query: 25  DTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSA 84
           D      AA AL+ L++ +      ++E   +    Q+L   D  V   AL  + N+ + 
Sbjct: 20  DENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALWALRNLAAG 79

Query: 85  GKELAERVLSTELMEILMALTLLEGEEVRKLALQALA 121
            ++    VL    +  L+ L     E+++K A  AL+
Sbjct: 80  PEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALS 116



 Score = 29.2 bits (66), Expect = 0.40
 Identities = 18/65 (27%), Positives = 29/65 (44%)

Query: 56  VDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKL 115
           +     +L+  D  VQ  A   + N+ +   +  + V+    +  L+ L   E EEV K 
Sbjct: 9   LPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKA 68

Query: 116 ALQAL 120
           AL AL
Sbjct: 69  ALWAL 73


>gnl|CDD|227396 COG5064, SRP1, Karyopherin (importin) alpha [Intracellular
           trafficking and secretion].
          Length = 526

 Score = 32.6 bits (74), Expect = 0.049
 Identities = 22/100 (22%), Positives = 42/100 (42%)

Query: 17  LFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALV 76
           L  L    D E    A  A++ L+         +L+        ++L+H    +Q  AL 
Sbjct: 248 LAKLIYSRDPEVLVDACWAISYLSDGPNEKIQAVLDVGIPGRLVELLSHESAKIQTPALR 307

Query: 77  IVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLA 116
            VGNI++   +  + +++   ++   +L     E +RK A
Sbjct: 308 SVGNIVTGSDDQTQVIINCGALKAFRSLLSSPKENIRKEA 347



 Score = 31.8 bits (72), Expect = 0.11
 Identities = 22/90 (24%), Positives = 43/90 (47%)

Query: 15  RFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRA 74
           RF+  +   +       AA AL  + S +T    ++++   V +F Q+L+  +  V+ +A
Sbjct: 118 RFVEFMDEIQRDMLQFEAAWALTNIASGTTQQTKVVVDAGAVPLFIQLLSSTEDDVREQA 177

Query: 75  LVIVGNILSAGKELAERVLSTELMEILMAL 104
           +  +GNI    +   + VL    +E L+ L
Sbjct: 178 VWALGNIAGDSEGCRDYVLQCGALEPLLGL 207


>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat.  The HEAT repeat family is
           related to armadillo/beta-catenin-like repeats (see
           pfam00514). These EZ repeats are found in subunits of
           cyanobacterial phycocyanin lyase and other proteins and
           probably carry out a scaffolding role.
          Length = 55

 Score = 29.0 bits (65), Expect = 0.16
 Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 70  VQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALAK 122
           V+  A + +G +   G EL    +  EL+  L+ L   + +EVR+ A  AL +
Sbjct: 3   VREAAALALGALAGGGPELLRPAV-PELLPALLPLLKDDDDEVREAAAWALGR 54


>gnl|CDD|233841 TIGR02388, rpoC2_cyan, DNA-directed RNA polymerase, beta'' subunit.
            The family consists of the product of the rpoC2 gene, a
           subunit of DNA-directed RNA polymerase of cyanobacteria
           and chloroplasts. RpoC2 corresponds largely to the
           C-terminal region of the RpoC (the beta' subunit) of
           other bacteria. Members of this family are designated
           beta'' in chloroplasts/plastids, and beta' (confusingly)
           in Cyanobacteria, where RpoC1 is called beta' in
           chloroplasts/plastids and gamma in Cyanobacteria. We
           prefer to name this family beta'', after its organellar
           members, to emphasize that this RpoC1 and RpoC2 together
           replace RpoC in other bacteria [Transcription,
           DNA-dependent RNA polymerase].
          Length = 1227

 Score = 29.4 bits (66), Expect = 0.55
 Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 8/76 (10%)

Query: 26  TETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAG 85
           T+  C  AG +  + S    +  +L+E+N   +   I A P V          G+++ AG
Sbjct: 860 TQILCKEAGVVQGIDSGGESIRRLLVERNSDRLKVNIKAKPVVKT--------GDLVVAG 911

Query: 86  KELAERVLSTELMEIL 101
            ELA+ V + E  EI 
Sbjct: 912 DELAKGVKAEESGEIE 927


>gnl|CDD|183129 PRK11425, PRK11425, PTS system N-acetylgalactosamine-specific
           transporter subunit IIB; Provisional.
          Length = 157

 Score = 29.1 bits (65), Expect = 0.56
 Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 4/56 (7%)

Query: 63  LAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQ 118
           L H  VGVQ         +L A  E+AE  +   LME+++A    EG  VR   LQ
Sbjct: 13  LIHGQVGVQWVGFAGANLVLVANDEVAEDPVQQNLMEMVLA----EGIAVRFWTLQ 64


>gnl|CDD|220906 pfam10926, DUF2800, Protein of unknown function (DUF2800).  This is
           a family of uncharacterized proteins found in bacteria
           and viruses. Some members of this family are annotated
           as being Phi APSE P51-like proteins.
          Length = 363

 Score = 28.8 bits (65), Expect = 0.93
 Identities = 15/50 (30%), Positives = 21/50 (42%), Gaps = 3/50 (6%)

Query: 80  NILSAGKELAERVLS-TELMEILMALTLLEG--EEVRKLALQALAKGEEY 126
           N+     E     L+  EL E+L    LLE   ++V   AL     G+E 
Sbjct: 233 NLAKYDFEDPTLDLTDEELAELLEKADLLEKWAKDVEAYALDEARNGKEV 282


>gnl|CDD|215356 PLN02659, PLN02659, Probable galacturonosyltransferase.
          Length = 534

 Score = 28.8 bits (64), Expect = 1.1
 Identities = 13/34 (38%), Positives = 21/34 (61%), Gaps = 2/34 (5%)

Query: 5  YLILFYCSQVRFLFLLTCEE--DTETACAAAGAL 36
          Y +LF+   +RF+F+L+  +  D ET C+  G L
Sbjct: 40 YSLLFFTFLLRFVFVLSTVDTIDGETKCSTLGCL 73


>gnl|CDD|130153 TIGR01081, mpl,
          UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-
          diaminopimelate ligase.  Alternate name: murein
          tripeptide ligase [Cell envelope, Biosynthesis and
          degradation of murein sacculus and peptidoglycan].
          Length = 448

 Score = 27.1 bits (60), Expect = 3.7
 Identities = 17/58 (29%), Positives = 25/58 (43%), Gaps = 12/58 (20%)

Query: 45 PVCSMLLEKNWVDIFK-----QILAHPDVGVQHRALVIVGNILSAGKELAERVLSTEL 97
          P  S  LE   ++I +     Q+   PD       LV++GN +  G    E VL+  L
Sbjct: 35 PPMSTQLEAQGIEIIEGFDAAQLEPKPD-------LVVIGNAMKRGNPCVEAVLNLNL 85


>gnl|CDD|215132 PLN02238, PLN02238, hypoxanthine phosphoribosyltransferase.
          Length = 189

 Score = 26.5 bits (59), Expect = 4.6
 Identities = 15/61 (24%), Positives = 28/61 (45%), Gaps = 6/61 (9%)

Query: 70  VQHRALVIVGNILSAGKELAERVLSTELMEILMA----LTLLEGEEVRKLALQALAKGEE 125
           V+ + +++V +I+  G  L+   L   L     A      LL+    RK+  + +  G+E
Sbjct: 95  VKGKHVLLVEDIVDTGNTLSA--LVAHLEAKGAASVSVCALLDKRARRKVKYELVGDGKE 152

Query: 126 Y 126
           Y
Sbjct: 153 Y 153


>gnl|CDD|235110 PRK03188, PRK03188, 4-diphosphocytidyl-2-C-methyl-D-erythritol
           kinase; Provisional.
          Length = 300

 Score = 26.4 bits (59), Expect = 5.0
 Identities = 12/26 (46%), Positives = 14/26 (53%)

Query: 25  DTETACAAAGALAMLTSVSTPVCSML 50
            T  A   AGALA + S S P C+ L
Sbjct: 234 RTLRAGEEAGALAGIVSGSGPTCAFL 259


>gnl|CDD|226991 COG4644, COG4644, Transposase and inactivated derivatives, TnpA
           family [DNA replication, recombination, and repair].
          Length = 323

 Score = 26.3 bits (58), Expect = 6.4
 Identities = 24/123 (19%), Positives = 43/123 (34%), Gaps = 11/123 (8%)

Query: 8   LFYCSQVRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPD 67
           L +    RF   L    D +        LA+L S+      ++  K + ++ K  ++   
Sbjct: 140 LRHLLGFRFAPRLKDIADQKLYIPEKNELALLPSIIGGSNIIIEIKQYDNMLKDAISIKV 199

Query: 68  VGVQHRALVIVGNILSAGKELAE------RVLSTELMEILMALTLLEGEEVRKLALQALA 121
                 A++   N  S     A+      R+  T     +     L  E+ R+  L+ L 
Sbjct: 200 GNADPSAILRRLNRASRQHPTAKALLELGRIEKT-----IFLCNYLSDEDFRRRILEGLN 254

Query: 122 KGE 124
            GE
Sbjct: 255 VGE 257


>gnl|CDD|183563 PRK12508, PRK12508, putative monovalent cation/H+ antiporter
           subunit B; Reviewed.
          Length = 139

 Score = 25.3 bits (56), Expect = 7.9
 Identities = 14/46 (30%), Positives = 21/46 (45%), Gaps = 2/46 (4%)

Query: 33  AGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIV 78
             AL +L    T + SMLL  N++     +L    V  QH  + +V
Sbjct: 71  MAALGVLIYGGTGIASMLLGGNFLSYD--VLIFDSVTGQHLGIFLV 114


>gnl|CDD|219797 pfam08326, ACC_central, Acetyl-CoA carboxylase, central region.
           The region featured in this family is found in various
           eukaryotic acetyl-CoA carboxylases, N-terminal to the
           catalytic domain (pfam01039). This enzyme (EC:6.4.1.2)
           is involved in the synthesis of long-chain fatty acids,
           as it catalyzes the rate-limiting step in this process.
          Length = 707

 Score = 26.1 bits (58), Expect = 8.2
 Identities = 12/62 (19%), Positives = 26/62 (41%), Gaps = 2/62 (3%)

Query: 58  IFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLAL 117
           +   +L+H    V+ +  +++  +      L    +   L + L  L+ L      K+AL
Sbjct: 192 VVDIVLSHS--RVKAKNKLVLALLDQLVYPLLPSTVPASLRDALSRLSSLNSRAYAKVAL 249

Query: 118 QA 119
           +A
Sbjct: 250 KA 251


>gnl|CDD|218602 pfam05478, Prominin, Prominin.  The prominins are an emerging
           family of proteins that among the multispan membrane
           proteins display a novel topology. Mouse prominin and
           human prominin (mouse)-like 1 (PROML1) are predicted to
           contain five membrane spanning domains, with an
           N-terminal domain exposed to the extracellular space
           followed by four, alternating small cytoplasmic and
           large extracellular, loops and a cytoplasmic C-terminal
           domain. The exact function of prominin is unknown
           although in humans defects in PROM1, the gene coding for
           prominin, cause retinal degeneration.
          Length = 807

 Score = 26.1 bits (58), Expect = 8.2
 Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 3/37 (8%)

Query: 100 ILMALTL---LEGEEVRKLALQALAKGEEYGIIRRPG 133
            LM + L   L G  V  L  + LA  E + ++  PG
Sbjct: 476 FLMLVVLALFLVGGNVYTLVCEPLANNELFQVLDTPG 512


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.136    0.385 

Gapped
Lambda     K      H
   0.267   0.0786    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,170,413
Number of extensions: 659734
Number of successful extensions: 685
Number of sequences better than 10.0: 1
Number of HSP's gapped: 683
Number of HSP's successfully gapped: 34
Length of query: 138
Length of database: 10,937,602
Length adjustment: 87
Effective length of query: 51
Effective length of database: 7,078,804
Effective search space: 361019004
Effective search space used: 361019004
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 54 (24.5 bits)