RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11336
         (194 letters)



>gnl|CDD|233514 TIGR01658, EYA-cons_domain, eyes absent protein conserved domain.
           This domain is common to all eyes absent (EYA) homologs.
           Metazoan EYA's also contain a variable N-terminal domain
           consisting largely of low-complexity sequences.
          Length = 274

 Score =  168 bits (428), Expect = 2e-52
 Identities = 48/135 (35%), Positives = 76/135 (56%), Gaps = 20/135 (14%)

Query: 54  LDRVFIWDLDETIIIFHSLLTGSYATKHG--KCPQTVIQLGYKMEEMIFGVADNHFFFNE 111
            + V++WD+DET+I+ HSLL GSYA      K  +  +++G + EEMI  + D HFF+ E
Sbjct: 1   PENVYVWDMDETLILLHSLLNGSYAESFNGSKDHKRGVEIGRRWEEMILEICDTHFFYEE 60

Query: 112 VEDCDQVHIDDISTDDNGQDLNGYNFSADGFRASVLNVPSGPPGLCVASGVRGGVDWMRK 171
           +E+C++  +DD+ + D+G+DL+ Y F  DGF        +    L             RK
Sbjct: 61  IEECNEPFLDDVRSYDDGKDLSRYEFKTDGFS-------TPTDDLN-----------KRK 102

Query: 172 LAFRYRKIKDIYNNY 186
           LA+R+R + +IY   
Sbjct: 103 LAYRHRAVAEIYEKG 117


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 33.3 bits (76), Expect = 0.072
 Identities = 9/48 (18%), Positives = 11/48 (22%)

Query: 1   MSPYLEVGGSTSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPA 48
            SP  E   +           A A   + A  P          P   A
Sbjct: 426 RSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVA 473


>gnl|CDD|218803 pfam05904, DUF863, Plant protein of unknown function (DUF863).
           This family consists of a number of hypothetical
           proteins from Arabidopsis thaliana and Oryza sativa. The
           function of this family is unknown.
          Length = 766

 Score = 30.4 bits (68), Expect = 0.60
 Identities = 15/46 (32%), Positives = 19/46 (41%)

Query: 9   GSTSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPAGDSRIL 54
           G      RGR+ S +  N     +P P    PCPS   P  + R L
Sbjct: 705 GRRKGGPRGRRRSVTIINEEVEVSPPPAPAPPCPSLAQPMANDRSL 750


>gnl|CDD|221717 pfam12694, MoCo_carrier, Putative molybdenum carrier.  The
          structure of proteins in this family contain central
          beta strands with flanking alpha helices. The structure
          is similar to that of a molybdenum cofactor carrier
          protein.
          Length = 145

 Score = 29.5 bits (67), Expect = 0.66
 Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 7/40 (17%)

Query: 59 IWDLDETIIIFH------SLLTGSYATKHGKCPQTVIQLG 92
          + D D T+I+        + LT  +A KHGK P  V+ L 
Sbjct: 61 VRDSDGTLILTRGELSGGTALTVEFARKHGK-PLLVVDLD 99


>gnl|CDD|226439 COG3926, zliS, Lysozyme family protein [General function prediction
           only].
          Length = 252

 Score = 29.0 bits (65), Expect = 1.2
 Identities = 12/51 (23%), Positives = 19/51 (37%)

Query: 37  IPNPCPSPGSPAGDSRILDRVFIWDLDETIIIFHSLLTGSYATKHGKCPQT 87
           +P P P P    G + + D       D        LL+G Y  ++   P +
Sbjct: 176 VPQPAPFPNGGQGKAVVDDAPSTAPADLATGATGGLLSGLYDLQNQLSPLS 226


>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA.  Members
           of this protein share two defining regions. One is a
           histidine/serine-rich cluster, typically
           H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
           the context of a pair of radical SAM proteins, HxsB and
           HxsC, and a fourth protein HxsD. The system is predicted
           to perform peptide modifications, likely in the
           His-Xaa-Ser region, to produce some uncharacterized
           natural product.
          Length = 186

 Score = 28.7 bits (64), Expect = 1.4
 Identities = 12/42 (28%), Positives = 15/42 (35%)

Query: 10  STSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPAGDS 51
           S SS       +  + +  S  T T   P P PS     G S
Sbjct: 61  SHSSHSSHYSGAGGSYSVPSGDTSTYSYPVPSPSYSPSPGSS 102


>gnl|CDD|226918 COG4544, COG4544, Uncharacterized conserved protein [Function
           unknown].
          Length = 260

 Score = 28.6 bits (64), Expect = 1.8
 Identities = 13/39 (33%), Positives = 15/39 (38%), Gaps = 6/39 (15%)

Query: 16  RGRKSSASASNASSAGT------PTPIIPNPCPSPGSPA 48
           R R + A A    +A        P P  P P P  G PA
Sbjct: 168 RRRAAEAEAFGQPTAAVSRWRISPRPSEPLPVPGVGRPA 206


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of AF4
            (Proto-oncogene AF4) and FMR2 (Fragile X E mental
            retardation syndrome) nuclear proteins. These proteins
            have been linked to human diseases such as acute
            lymphoblastic leukaemia and mental retardation. The
            family also contains a Drosophila AF4 protein homologue
            Lilliputian which contains an AT-hook domain. Lilliputian
            represents a novel pair-rule gene that acts in
            cytoskeleton regulation, segmentation and morphogenesis
            in Drosophila.
          Length = 1154

 Score = 28.7 bits (64), Expect = 2.5
 Identities = 15/40 (37%), Positives = 20/40 (50%), Gaps = 9/40 (22%)

Query: 19   KSSASASNASS-------AGTPTPIIPNPCPSPGSPAGDS 51
            K+S+  + A S        GTP+P+ PN  PSP S  G  
Sbjct: 1028 KNSSKVAQAPSPWVGSKSTGTPSPVSPN--PSPASSVGSQ 1065


>gnl|CDD|236394 PRK09169, PRK09169, hypothetical protein; Validated.
          Length = 2316

 Score = 28.1 bits (63), Expect = 3.2
 Identities = 12/40 (30%), Positives = 16/40 (40%)

Query: 8    GGSTSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSP 47
            GGS  + GR R      + A+      P I     +PGS 
Sbjct: 1149 GGSRLTPGRTRVFELLLAGAAKRYGRLPAIRYADTAPGSA 1188


>gnl|CDD|179460 PRK02693, PRK02693, apocytochrome f; Reviewed.
          Length = 312

 Score = 28.0 bits (63), Expect = 3.3
 Identities = 16/39 (41%), Positives = 21/39 (53%), Gaps = 9/39 (23%)

Query: 7   VGGSTSSRGRGR------KSSASASNASSAGTPTPIIPN 39
           VGG+   RGRG+      KS+ +   AS+AGT T I   
Sbjct: 178 VGGN---RGRGQVYPTGEKSNNNVFTASAAGTITSIETG 213


>gnl|CDD|118685 pfam10157, DUF2365, Uncharacterized conserved protein (DUF2365). 
          This is a family of conserved proteins found from
          nematodes to humans. The function is unknown.
          Length = 149

 Score = 27.5 bits (61), Expect = 3.3
 Identities = 12/37 (32%), Positives = 18/37 (48%)

Query: 18 RKSSASASNASSAGTPTPIIPNPCPSPGSPAGDSRIL 54
          R SS+ +S  S++  P  +     PSP  P  D  +L
Sbjct: 18 RLSSSRSSTPSASMGPITLTRGFLPSPNIPPIDPSVL 54


>gnl|CDD|234030 TIGR02834, spo_ytxC, putative sporulation protein YtxC.  This
           uncharacterized protein is part of a panel of proteins
           conserved in all known endospore-forming Firmicutes
           (low-GC Gram-positive bacteria), including
           Carboxydothermus hydrogenoformans, and nowhere else
           [Cellular processes, Sporulation and germination].
          Length = 276

 Score = 27.6 bits (62), Expect = 4.3
 Identities = 7/49 (14%), Positives = 22/49 (44%), Gaps = 3/49 (6%)

Query: 93  YKMEEMIFGVADNHFFFNEVEDCDQVH---IDDISTDDNGQDLNGYNFS 138
           +K +E+I  + +  ++F + E+ +Q+       ++ +        Y  +
Sbjct: 74  HKEDELILKIIEESYYFTDQEEIEQILAIANSILTGERKDDPSEIYKMN 122


>gnl|CDD|117486 pfam08919, F_actin_bind, F-actin binding.  The F-actin binding
          domain forms a compact bundle of four antiparallel
          alpha-helices, which are arranged in a left-handed
          topology. Binding of F-actin to the F-actin binding
          domain may result in cytoplasmic retention and
          subcellular distribution of the protein, as well as
          possible inhibition of protein function.
          Length = 179

 Score = 27.4 bits (60), Expect = 4.7
 Identities = 13/39 (33%), Positives = 16/39 (41%), Gaps = 2/39 (5%)

Query: 23 SASNASSAGTPTPIIPNP--CPSPGSPAGDSRILDRVFI 59
            S A   GTP   +P P   PSP   A  ++     FI
Sbjct: 14 PQSTAKPVGTPPSPVPLPSTSPSPSKMANGTQPSSAAFI 52


>gnl|CDD|140329 PTZ00308, PTZ00308, ethanolamine-phosphate cytidylyltransferase;
           Provisional.
          Length = 353

 Score = 27.4 bits (61), Expect = 5.3
 Identities = 12/36 (33%), Positives = 20/36 (55%), Gaps = 6/36 (16%)

Query: 102 VADNHFFFNEVEDCDQ------VHIDDISTDDNGQD 131
           V + + +   +ED ++      VH DDIS D NG++
Sbjct: 78  VVEGYPYTTRLEDLERLECDFVVHGDDISVDLNGRN 113


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 27.4 bits (61), Expect = 5.8
 Identities = 14/51 (27%), Positives = 21/51 (41%)

Query: 2   SPYLEVGGSTSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPAGDSR 52
           S         SS     + S+S+S +SS+ +      +P PSP      SR
Sbjct: 305 SGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSR 355



 Score = 27.1 bits (60), Expect = 7.4
 Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 5/48 (10%)

Query: 10  STSSRGRGRKSSASASNASS-----AGTPTPIIPNPCPSPGSPAGDSR 52
           ++SS    R+SS+S++++SS     A       P+  PSP  P   + 
Sbjct: 314 ASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPAD 361


>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein. 
          Length = 346

 Score = 27.4 bits (61), Expect = 5.8
 Identities = 16/40 (40%), Positives = 20/40 (50%), Gaps = 10/40 (25%)

Query: 13  SRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPAGDSR 52
            RGR R SS S+S ++S G          PS GS   +SR
Sbjct: 152 FRGRSRSSSRSSSRSNSRG----------PSRGSSRNNSR 181


>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566).  Family of
           related proteins that is plant specific.
          Length = 313

 Score = 27.2 bits (60), Expect = 6.0
 Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 4/51 (7%)

Query: 7   VGGSTSSR----GRGRKSSASASNASSAGTPTPIIPNPCPSPGSPAGDSRI 53
             G  SSR     RG  S+A++ N S A   T    +P  S  + + D   
Sbjct: 73  SRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRRTTSSDLSS 123


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 27.0 bits (59), Expect = 7.6
 Identities = 10/41 (24%), Positives = 14/41 (34%)

Query: 8   GGSTSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPA 48
               S    G   S  A +  ++G    I P P     +PA
Sbjct: 437 THPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPA 477


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
          domain family is found in eukaryotes, and is typically
          between 125 and 136 amino acids in length.
          Length = 133

 Score = 25.9 bits (57), Expect = 8.8
 Identities = 17/42 (40%), Positives = 21/42 (50%), Gaps = 2/42 (4%)

Query: 10 STSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPAGDS 51
          ST   G G  SS S++ A  AG      PN  P+P +P  DS
Sbjct: 1  STGGDGTGGVSSGSSAPAPPAGPGPG--PNAPPAPAAPGVDS 40


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 26.7 bits (59), Expect = 9.4
 Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 4/43 (9%)

Query: 13  SRGRGRKSSASA-SNASSAGTPTPIIPNP---CPSPGSPAGDS 51
           S  RG +++A+A   A+ A  P P  P      P+P + A   
Sbjct: 564 SSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATG 606


>gnl|CDD|216368 pfam01213, CAP_N, Adenylate cyclase associated (CAP) N terminal. 
          Length = 313

 Score = 26.7 bits (59), Expect = 9.5
 Identities = 11/44 (25%), Positives = 18/44 (40%)

Query: 4   YLEVGGSTSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSP 47
           +   G + S +G    + ++    SS+    P  P P P P  P
Sbjct: 202 HHTTGLTWSKKGPVAAAKSALPAVSSSAPSAPPPPPPPPPPSVP 245


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.137    0.425 

Gapped
Lambda     K      H
   0.267   0.0757    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,992,094
Number of extensions: 919461
Number of successful extensions: 955
Number of sequences better than 10.0: 1
Number of HSP's gapped: 946
Number of HSP's successfully gapped: 26
Length of query: 194
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 102
Effective length of database: 6,857,034
Effective search space: 699417468
Effective search space used: 699417468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (25.3 bits)