RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1563
         (152 letters)



>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies).  Leucine rich
           repeats are short sequence motifs present in a number of
           proteins with diverse functions and cellular locations.
           These repeats are usually involved in protein-protein
           interactions. Each Leucine Rich Repeat is composed of a
           beta-alpha unit. These units form elongated non-globular
           structures. Leucine Rich Repeats are often flanked by
           cysteine rich domains.
          Length = 43

 Score = 34.4 bits (80), Expect = 0.001
 Identities = 11/41 (26%), Positives = 19/41 (46%)

Query: 65  LEELWISYNFIEKTKGIGSLKKLKVLYMCHNSVKEWGELNK 105
           LE L +S N I     + +L  L+ L +  N + +   L+ 
Sbjct: 3   LETLDLSNNQITDLPPLSNLPNLETLDLSGNKITDLSPLSN 43



 Score = 34.0 bits (79), Expect = 0.002
 Identities = 13/39 (33%), Positives = 16/39 (41%)

Query: 21 TKLSLSTNLIEKISGLMSLKKLRILALGRNMIKTFTGLE 59
            L LS N I  +  L +L  L  L L  N I   + L 
Sbjct: 4  ETLDLSNNQITDLPPLSNLPNLETLDLSGNKITDLSPLS 42



 Score = 27.5 bits (62), Expect = 0.49
 Identities = 13/42 (30%), Positives = 15/42 (35%), Gaps = 1/42 (2%)

Query: 40 KKLRILALGRNMIKTFTGLEPLGDCLEELWISYNFIEKTKGI 81
            L  L L  N I     L  L   LE L +S N I     +
Sbjct: 1  TNLETLDLSNNQITDLPPLSNL-PNLETLDLSGNKITDLSPL 41



 Score = 24.8 bits (55), Expect = 4.1
 Identities = 11/38 (28%), Positives = 19/38 (50%), Gaps = 3/38 (7%)

Query: 85  KKLKVLYMCHNSVKEWGELNKINDCPVLEDLVFCGNPI 122
             L+ L + +N +    +L  +++ P LE L   GN I
Sbjct: 1   TNLETLDLSNNQIT---DLPPLSNLPNLETLDLSGNKI 35


>gnl|CDD|227223 COG4886, COG4886, Leucine-rich repeat (LRR) protein [Function
           unknown].
          Length = 394

 Score = 35.3 bits (81), Expect = 0.008
 Identities = 25/81 (30%), Positives = 37/81 (45%), Gaps = 2/81 (2%)

Query: 21  TKLSLSTNLIEKISGLMS-LKKLRILALGRNMIKTFTGLEPLGDCLEELWISYNFIEKTK 79
             L LS N +  +  L+S L  L  L L  N I        L   LEEL +S N I +  
Sbjct: 166 KNLDLSFNDLSDLPKLLSNLSNLNNLDLSGNKISDLPPEIELLSALEELDLSNNSIIELL 225

Query: 80  -GIGSLKKLKVLYMCHNSVKE 99
             + +LK L  L + +N +++
Sbjct: 226 SSLSNLKNLSGLELSNNKLED 246



 Score = 32.6 bits (74), Expect = 0.052
 Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 1/78 (1%)

Query: 23  LSLSTNLIEKISGLMS-LKKLRILALGRNMIKTFTGLEPLGDCLEELWISYNFIEKTKGI 81
           L LS N I ++   +S LK L  L L  N ++           LE L +S N I     +
Sbjct: 214 LDLSNNSIIELLSSLSNLKNLSGLELSNNKLEDLPESIGNLSNLETLDLSNNQISSISSL 273

Query: 82  GSLKKLKVLYMCHNSVKE 99
           GSL  L+ L +  NS+  
Sbjct: 274 GSLTNLRELDLSGNSLSN 291


>gnl|CDD|188086 TIGR00819, ydaH, p-Aminobenzoyl-glutamate transporter family.  The
           p-Aminobenzoyl-glutamate transporter family includes two
           transporters, the AbgT (YdaH) protein of E. coli and
           MtrF of Neisseria gonorrhoea. AbgT is apparently cryptic
           in wild type cells, but when expressed on a high copy
           number plasmid, or when expressed at higher levels due
           to mutation, it allows utilization of
           p-aminobenzoyl-glutamate as a source of p-aminobenzoate
           for p-aminobenzoate auxotrophs. p-Aminobenzoate is a
           constituent of and a precursor for the biosynthesis of
           folic acid [Hypothetical proteins, Conserved].
          Length = 525

 Score = 29.9 bits (67), Expect = 0.57
 Identities = 21/84 (25%), Positives = 31/84 (36%), Gaps = 6/84 (7%)

Query: 15  KGPNPNTKLSLSTNLIEKISGLMSLKKLRILALGRNMIKTFTGLEPLGDCLEELWISYNF 74
             P    K      LI  +  L + + + I     + IK F+G  PLG  L  L +    
Sbjct: 55  DNPPVGAKGRADDGLIHVVKLLDADEGIHIFL--PHTIKNFSGFAPLGAILALL-LGAGI 111

Query: 75  IEKTKGIGSLKKLKVLYMCHNSVK 98
            EK    G +  L      H++ K
Sbjct: 112 AEK---SGLIPALMRKLASHSNAK 132


>gnl|CDD|233049 TIGR00608, radc, DNA repair protein radc.  The genes in this
          family for which the functions are known have an as yet
          porrly defined role in determining sensitivity to DNA
          damaging agents such as UV irradiation. This family is
          based on the phylogenomic analysis of JA Eisen (1999,
          Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
          replication, recombination, and repair].
          Length = 218

 Score = 27.8 bits (62), Expect = 2.3
 Identities = 14/62 (22%), Positives = 26/62 (41%), Gaps = 3/62 (4%)

Query: 29 LIEKISGLMSLKKLRILALGRNMIKTFTGLEPLGDCLEELWISYNFIEKTKGIGSLKKLK 88
          L+  I    + K L +L+L + ++  F   + LG  L         +    GIG  K ++
Sbjct: 19 LLAIILRTGTPKGLDVLSLSKRLLDVFGRQDSLGHLLS---APPEELSSVPGIGEAKAIQ 75

Query: 89 VL 90
          + 
Sbjct: 76 LK 77


>gnl|CDD|219962 pfam08668, HDOD, HDOD domain. 
          Length = 195

 Score = 27.2 bits (61), Expect = 3.0
 Identities = 10/31 (32%), Positives = 16/31 (51%)

Query: 39 LKKLRILALGRNMIKTFTGLEPLGDCLEELW 69
          L+ +R L L  ++ + F G   LG  L+  W
Sbjct: 66 LRTVRNLVLSLSVKRIFRGTPSLGFDLKGFW 96


>gnl|CDD|217741 pfam03806, ABG_transport, AbgT putative transporter family. 
          Length = 502

 Score = 27.3 bits (61), Expect = 3.5
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 50 NMIKTFTGLEPLG 62
          NM+K FT   PLG
Sbjct: 70 NMVKNFTSFAPLG 82


>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat. 
          Length = 60

 Score = 25.2 bits (56), Expect = 4.1
 Identities = 15/60 (25%), Positives = 23/60 (38%), Gaps = 3/60 (5%)

Query: 65  LEELWISYNFIEKTKG--IGSLKKLKVLYMCHNSVKEWGELNKINDCPVLEDLVFCGNPI 122
           L+ L +S N +          L  LKVL +  N++         +  P L  L   GN +
Sbjct: 2   LKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTS-ISPEAFSGLPSLRSLDLSGNNL 60



 Score = 24.4 bits (54), Expect = 8.3
 Identities = 17/60 (28%), Positives = 24/60 (40%), Gaps = 11/60 (18%)

Query: 21 TKLSLSTNLIEKISG--LMSLKKLRILALGRNMIKT-----FTGLEPLGDCLEELWISYN 73
            L LS N +  I       L  L++L L  N + +     F+GL      L  L +S N
Sbjct: 3  KSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLP----SLRSLDLSGN 58


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
           (vWA) domain [General function prediction only].
          Length = 4600

 Score = 26.9 bits (59), Expect = 6.2
 Identities = 23/92 (25%), Positives = 38/92 (41%), Gaps = 13/92 (14%)

Query: 27  TNLIEKISGLMSLKKLRILALGR--NMIKTFTGLEPLGDCLEELWISYNFIEKTKGIGSL 84
           T +I+ ++      KL ++   +   M     G +P+      L +   FI+  +G  S+
Sbjct: 478 TTMIQYLAL-KLHFKLTVINKSQQTEMSDDLGGYKPINGSTLGLPLHERFIDIFEGTFSM 536

Query: 85  KK-LKVLYMCH---------NSVKEWGELNKI 106
           KK +K + M           N  K W E NKI
Sbjct: 537 KKNIKFICMEQRVSVPVCWKNKEKLWWEANKI 568


>gnl|CDD|217421 pfam03200, Glyco_hydro_63, Mannosyl oligosaccharide glucosidase.
           This is a family of eukaryotic enzymes belonging to
           glycosyl hydrolase family 63. They catalyze the specific
           cleavage of the non-reducing terminal glucose residue
           from Glc(3)Man(9)GlcNAc(2). Mannosyl oligosaccharide
           glucosidase EC:3.2.1.106 is the first enzyme in the
           N-linked oligosaccharide processing pathway.
          Length = 775

 Score = 26.6 bits (59), Expect = 7.0
 Identities = 17/42 (40%), Positives = 20/42 (47%), Gaps = 10/42 (23%)

Query: 58  LEPLGDCL---EELWISYNFIEKTKGIGSLKKLKVLYMCHNS 96
           LE L D +   EELW  Y       G+ SL K   LYM  N+
Sbjct: 647 LEKLLDLIRDPEELWSDY-------GLRSLSKSSPLYMTRNT 681


>gnl|CDD|238064 cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease
           inhibitor (RI)-like subfamily. LRRs are 20-29 residue
           sequence motifs present in many proteins that
           participate in protein-protein interactions and have
           different functions and cellular locations. LRRs
           correspond to structural units consisting of a beta
           strand (LxxLxLxxN/CxL conserved pattern) and an alpha
           helix. This alignment contains 12 strands corresponding
           to 11 full repeats, consistent with the extent observed
           in the subfamily acting as Ran GTPase Activating
           Proteins (RanGAP1).
          Length = 319

 Score = 26.2 bits (58), Expect = 7.1
 Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 6/48 (12%)

Query: 35  GLMSLKKLRILALGRNMI------KTFTGLEPLGDCLEELWISYNFIE 76
            L SLK L +L LG N +         + L      L  L +S N I 
Sbjct: 216 TLASLKSLEVLNLGDNNLTDAGAAALASALLSPNISLLTLSLSCNDIT 263


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.138    0.403 

Gapped
Lambda     K      H
   0.267   0.0818    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,800,010
Number of extensions: 709224
Number of successful extensions: 409
Number of sequences better than 10.0: 1
Number of HSP's gapped: 405
Number of HSP's successfully gapped: 28
Length of query: 152
Length of database: 10,937,602
Length adjustment: 88
Effective length of query: 64
Effective length of database: 7,034,450
Effective search space: 450204800
Effective search space used: 450204800
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 55 (24.8 bits)