RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6422
         (182 letters)



>gnl|CDD|221573 pfam12430, ABA_GPCR, Abscisic acid G-protein coupled receptor.
           This domain family is found in eukaryotes, and is
           typically between 177 and 216 amino acids in length.
           This family is part of the abscisic acid (ABA) G-protein
           coupled receptor. ABA is a stress hormone in plants.
          Length = 99

 Score = 94.6 bits (236), Expect = 9e-26
 Identities = 50/91 (54%), Positives = 67/91 (73%), Gaps = 3/91 (3%)

Query: 22  LTILQTFFYAISSSKSS--NIIVLVLAQIMGMYFVSSVLLMRMNMPPEYRSIISKVLG-D 78
           LT L +FF A+    SS  N++VL LA+IMG+YF+S+VLL+R N+P EYRSII+ +LG D
Sbjct: 8   LTTLSSFFRALPGLLSSSSNLLVLFLAEIMGIYFISTVLLLRSNLPAEYRSIITDILGSD 67

Query: 79  LQFNFYHRWFDVIFLLSALSSIVFLPFSYVH 109
           L+F FY RWFD IFLLSAL + + L  ++  
Sbjct: 68  LEFRFYDRWFDKIFLLSALLTAIGLYLAHKS 98



 Score = 38.3 bits (90), Expect = 3e-04
 Identities = 20/43 (46%), Positives = 31/43 (72%), Gaps = 2/43 (4%)

Query: 135 LLSGFGAFFYAISSSKSS--NIIVLVLAQIMGMYFVSSVDILR 175
           +L+   +FF A+    SS  N++VL LA+IMG+YF+S+V +LR
Sbjct: 7   VLTTLSSFFRALPGLLSSSSNLLVLFLAEIMGIYFISTVLLLR 49


>gnl|CDD|221626 pfam12537, DUF3735, Protein of unknown function (DUF3735).  This
           domain family is found in eukaryotes, and is
           approximately 70 amino acids in length. There is a
           conserved LSG sequence motif. There is a single
           completely conserved residue G that may be functionally
           important.
          Length = 71

 Score = 44.9 bits (107), Expect = 5e-07
 Identities = 16/25 (64%), Positives = 21/25 (84%)

Query: 117 TIEQGISRIGVIGVTVMALLSGFGA 141
            +E  +SR+ VIGVT+MA+LSGFGA
Sbjct: 5   YLENLLSRVSVIGVTIMAILSGFGA 29


>gnl|CDD|224819 COG1907, COG1907, Predicted archaeal sugar kinases [General
           function prediction only].
          Length = 312

 Score = 30.8 bits (70), Expect = 0.34
 Identities = 9/36 (25%), Positives = 14/36 (38%)

Query: 131 TVMALLSGFGAFFYAISSSKSSNIIVLVLAQIMGMY 166
              A  S +G   Y I  S+ +  +V  L  I+   
Sbjct: 257 AYGAGQSSWGPTVYGIVDSREAGSVVRKLIDILLEE 292


>gnl|CDD|200431 TIGR04180, EDH_00030, NAD dependent epimerase/dehydratase,
           LLPSF_EDH_00030 family.  This clade within the NAD
           dependent epimerase/dehydratase superfamily (pfam01370)
           is characterized by inclusion of its members within a
           cassette of seven distinctive enzymes. These include
           four genes homologous to the elements of the neuraminic
           (sialic) acid biosynthesis cluster (NeuABCD), an
           aminotransferase and a nucleotidyltransferase in
           addition to the epimerase/dehydratase. Together it is
           very likely that these enzymes direct the biosynthesis
           of a nine-carbon sugar analagous to CMP-neuraminic acid.
           These seven genes form the core of the cassette,
           although they are often accompanied by additional genes
           that may further modify the product sugar. Although this
           cassette is widely distributed in bacteria, the family
           nomenclature arises from the instance in Leptospira
           interrogans serovar Lai, str. 56601, where it appears as
           the 30th gene in the 91-gene lipopolysaccharide
           biosynthesis cluster.
          Length = 297

 Score = 30.0 bits (68), Expect = 0.63
 Identities = 23/76 (30%), Positives = 36/76 (47%), Gaps = 18/76 (23%)

Query: 65  PPEYRSIISKVLGDLQFNFYHR----WFDVIFLLSALSSIVFLPFSYVHP------NI-- 112
           PPE +  I  V GD++     R      DV+F L+AL +I   P+SY+ P      N+  
Sbjct: 44  PPEVKDKIEVVTGDIRDPDSVRKAMKGCDVVFHLAALIAI---PYSYIAPDSYVDTNVTG 100

Query: 113 ---FLQWTIEQGISRI 125
               LQ   + G+ ++
Sbjct: 101 TLNVLQAARDLGVEKV 116


>gnl|CDD|188999 cd06461, M2_ACE, Peptidase family M2 Angiotensin converting enzyme
           (ACE).  Peptidase family M2 Angiotensin converting
           enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc
           dependent dipeptidase that catalyzes the conversion of
           the decapeptide angiotensin I to the potent vasopressor
           octapeptide angiotensin II, by removing two C-terminal
           amino acids. There are two forms of the enzyme in
           humans, the ubiquitous somatic ACE and the
           sperm-specific germinal ACE, both encoded by the same
           gene through transcription from alternative promoters.
           Somatic ACE has two tandem active sites with distinct
           catalytic properties, whereas germinal ACE, the function
           of which is largely unknown, has just a single active
           site. Recently, an ACE homolog, ACE2, has been
           identified in humans that differs from ACE; it
           preferentially removes carboxy-terminal hydrophobic or
           basic amino acids and appears to be important in cardiac
           function. ACE homologs (also known as members of the M2
           gluzincin family) have been found in a wide variety of
           species, including those that neither have a
           cardiovascular system nor synthesize angiotensin. ACE is
           well-known as a key part of the renin-angiotensin system
           that regulates blood pressure and ACE inhibitors are
           important for the treatment of hypertension.
          Length = 562

 Score = 29.8 bits (68), Expect = 0.70
 Identities = 10/17 (58%), Positives = 10/17 (58%), Gaps = 1/17 (5%)

Query: 92  FLLS-ALSSIVFLPFSY 107
           FLL  AL  I FLPF  
Sbjct: 401 FLLKMALDKIAFLPFGL 417


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 28.5 bits (63), Expect = 2.5
 Identities = 18/52 (34%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 31  AISSSKSSNIIVLVLAQIMGMYFVSSVLLMRMNMPPEYRSI-ISKVLGDLQF 81
           A  S   S I + VL +    YF +  LL         R+I ISK LG + F
Sbjct: 319 AEDSLSKSEINIDVLKEAKMFYFNTHSLLDPTMRSTTLRAIKISKKLGGVIF 370


>gnl|CDD|201774 pfam01401, Peptidase_M2, Angiotensin-converting enzyme.  Members of
           this family are dipeptidyl carboxydipeptidases (cleave
           carboxyl dipeptides) and most notably convert
           angiotensin I to angiotensin II. Many members of this
           family contain a tandem duplication of the 600 amino
           acid peptidase domain, both of these are catalytically
           active. Most members are secreted membrane bound
           ectoenzymes.
          Length = 595

 Score = 27.6 bits (61), Expect = 5.2
 Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 10/58 (17%)

Query: 54  VSSVLLMRMNMPPEYRSI--ISKVLGDLQFNFYHRWFDVIFLLS-ALSSIVFLPFSYV 108
           +  VL + ++ P    SI  +S    D +        D+ FL+  AL  I F+PFSY+
Sbjct: 385 IGDVLALSVSTPKHLHSINLLSSEGNDYES-------DINFLMKMALDKIAFIPFSYL 435


>gnl|CDD|204929 pfam12460, MMS19_N, NER and RNAPII transcription protein n
           terminal.  This domain family is found in eukaryotes,
           and is approximately 60 amino acids in length. MMS19 is
           required for both nucleotide excision repair (NER) and
           RNA polymerase II (RNAP II) transcription.
          Length = 64

 Score = 25.2 bits (56), Expect = 5.5
 Identities = 11/35 (31%), Positives = 17/35 (48%), Gaps = 2/35 (5%)

Query: 148 SSKSSNIIVLVLAQIMGMYFVSSVDILREAAVQRL 182
            +    ++ LV  + +G Y  S   I+R  AVQ L
Sbjct: 25  ENGKLTLLDLV--ESLGEYLTSEDPIIRAKAVQLL 57


>gnl|CDD|220381 pfam09753, Use1, Membrane fusion protein Use1.  This entry is of a
           family of proteins all approximately 300 residues in
           length. The proteins have a single C-terminal
           trans-membrane domain and a SNARE [soluble NSF
           (N-ethylmaleimide-sensitive fusion protein) attachment
           protein receptor] domain of approximately 60 residues.
           The SNARE domains are essential for membrane fusion and
           are conserved from yeasts to humans. Use1 is one of the
           three protein subunits that make up the SNARE complex
           and it is specifically required for Golgi-endoplasmic
           reticulum retrograde transport.
          Length = 251

 Score = 26.7 bits (59), Expect = 7.6
 Identities = 7/30 (23%), Positives = 18/30 (60%)

Query: 33  SSSKSSNIIVLVLAQIMGMYFVSSVLLMRM 62
            +SK+S   + ++  ++   F+S VL +++
Sbjct: 221 HASKASKCFLWLMIFLVMCIFISMVLFIKI 250


>gnl|CDD|187543 cd05232, UDP_G4E_4_SDR_e, UDP-glucose 4 epimerase, subgroup 4,
           extended (e) SDRs.  UDP-glucose 4 epimerase (aka
           UDP-galactose-4-epimerase), is a homodimeric extended
           SDR. It catalyzes the NAD-dependent conversion of
           UDP-galactose to UDP-glucose, the final step in Leloir
           galactose synthesis. This subgroup is comprised of
           bacterial proteins, and includes the Staphylococcus
           aureus capsular polysaccharide Cap5N, which may have a
           role in the synthesis of UDP-N-acetyl-d-fucosamine. This
           subgroup has the characteristic active site tetrad and
           NAD-binding motif of the extended SDRs. Extended SDRs
           are distinct from classical SDRs. In addition to the
           Rossmann fold (alpha/beta folding pattern with a central
           beta-sheet) core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids. Extended SDRs are a
           diverse collection of proteins, and include isomerases,
           epimerases, oxidoreductases, and lyases; they typically
           have a TGXXGXXG cofactor binding motif. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 303

 Score = 26.5 bits (59), Expect = 7.7
 Identities = 16/77 (20%), Positives = 32/77 (41%), Gaps = 6/77 (7%)

Query: 10  DLIALFLLELEPLTILQTFFYAISSSKSSNIIVLVLAQIMG----MYFVSSVLLMRMNMP 65
           D I L +   +      TF  +     S+  +V  + + +G    +  V + LL      
Sbjct: 206 DAIYLCISLPKAAN--GTFLVSDGPPVSTAELVDEIRRALGKPTRLLPVPAGLLRFAAKL 263

Query: 66  PEYRSIISKVLGDLQFN 82
              R++I ++ G LQ++
Sbjct: 264 LGKRAVIQRLFGSLQYD 280


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.331    0.142    0.409 

Gapped
Lambda     K      H
   0.267   0.0756    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,499,808
Number of extensions: 918453
Number of successful extensions: 1854
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1819
Number of HSP's successfully gapped: 208
Length of query: 182
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 91
Effective length of database: 6,901,388
Effective search space: 628026308
Effective search space used: 628026308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 56 (25.3 bits)