RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8702
         (194 letters)



>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain. 
          Length = 26

 Score = 38.5 bits (90), Expect = 6e-05
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 43 NLKTHLLTHTDHKPYECNSCGKVFRR 68
          NL+ H+ THT  KPY+C  CGK F  
Sbjct: 1  NLRRHMRTHTGEKPYKCPVCGKSFSS 26


>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type.  The C2H2 zinc finger
          is the classical zinc finger domain. The two conserved
          cysteines and histidines co-ordinate a zinc ion. The
          following pattern describes the zinc finger.
          #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
          be any amino acid, and numbers in brackets indicate the
          number of residues. The positions marked # are those
          that are important for the stable fold of the zinc
          finger. The final position can be either his or cys.
          The C2H2 zinc finger is composed of two short beta
          strands followed by an alpha helix. The amino terminal
          part of the helix binds the major groove in DNA binding
          zinc fingers. The accepted consensus binding sequence
          for Sp1 is usually defined by the asymmetric
          hexanucleotide core GGGCGG but this sequence does not
          include, among others, the GAG (=CTC) repeat that
          constitutes a high-affinity site for Sp1 binding to the
          wt1 promoter.
          Length = 22

 Score = 33.1 bits (76), Expect = 0.005
 Identities = 11/22 (50%), Positives = 14/22 (63%)

Query: 58 ECNSCGKVFRRNCDLRRHALTH 79
          +C  CGK F R  +L+RH  TH
Sbjct: 1  KCPDCGKSFSRKSNLKRHLRTH 22


>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger. 
          Length = 23

 Score = 31.7 bits (72), Expect = 0.015
 Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 57 YECNSCGKVFRRNCDLRRHALTH 79
          Y C  CGKVF+    LR H  TH
Sbjct: 1  YRCPECGKVFKSKSALREHMRTH 23


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family
          contains a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 31.1 bits (70), Expect = 0.029
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 57 YECNSCGKVFRRNCDLRRHALTH 79
          ++C  CGK F     L+RH   H
Sbjct: 1  FKCPLCGKSFSSKDALKRHLRKH 23


>gnl|CDD|109943 pfam00906, Hepatitis_core, Hepatitis core antigen.  The core
           antigen of hepatitis viruses possesses a carboxyl
           terminus rich in arginine. On this basis it was
           predicted that the core antigen would bind DNA. There is
           some experimental evidence to support this.
          Length = 182

 Score = 32.1 bits (73), Expect = 0.11
 Identities = 19/43 (44%), Positives = 20/43 (46%), Gaps = 14/43 (32%)

Query: 136 GRRSRSRTPSPPLRSHSPISIFRRHQSPSPIGIYRRHSQSPQP 178
           GR  R RTPSP           RR +S SP    RR SQSP  
Sbjct: 152 GRSPRRRTPSP-----------RRRRSQSP---RRRRSQSPSS 180


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 31.4 bits (71), Expect = 0.35
 Identities = 18/82 (21%), Positives = 30/82 (36%), Gaps = 5/82 (6%)

Query: 103  PPAPSPTGSQSSEDIEIERTPSEERSLEPS-DMLGRRSRSRTPSPPLRSHSPISIFRRHQ 161
            PP+PSP  ++          P E    +P+   + R  R+R      ++ SP    RR  
Sbjct: 2628 PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRA 2687

Query: 162  SPSPIG----IYRRHSQSPQPA 179
            +   +G    +       P P 
Sbjct: 2688 ARPTVGSLTSLADPPPPPPTPE 2709


>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
          Length = 580

 Score = 30.1 bits (68), Expect = 0.73
 Identities = 20/101 (19%), Positives = 28/101 (27%), Gaps = 5/101 (4%)

Query: 92  ESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLRSH 151
               S   D+L P              I   P   R  E +        +    PP  + 
Sbjct: 44  SEATSEDDDDLYPPRETGSGGGVATSTIYTVPRPPRGPEQTLDKPDSLPASRELPPGPTP 103

Query: 152 SPISIFRRHQSPSPIGIYRRHSQSPQPAHSSSIPFRHHSPI 192
            P   FR   SP         S SP+  +  +      +PI
Sbjct: 104 VPPGGFRGASSPRL----GADSTSPRFLYQVN-FPVILAPI 139


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 30.1 bits (68), Expect = 0.77
 Identities = 20/99 (20%), Positives = 29/99 (29%), Gaps = 6/99 (6%)

Query: 88  SSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPP 147
           S+ + +    +        S + S SSE       P  E  L       R +    P+  
Sbjct: 216 SASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPL------PRPAPITLPTRI 269

Query: 148 LRSHSPISIFRRHQSPSPIGIYRRHSQSPQPAHSSSIPF 186
             +        R    S     R  S SP P+   S P 
Sbjct: 270 WEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPA 308


>gnl|CDD|177288 PHA00616, PHA00616, hypothetical protein.
          Length = 44

 Score = 27.4 bits (61), Expect = 0.85
 Identities = 9/29 (31%), Positives = 15/29 (51%), Gaps = 1/29 (3%)

Query: 57 YECNSCGKVFRRNCDLRRHAL-THAVGDV 84
          Y+C  CG +FR+  ++  H L  H    +
Sbjct: 2  YQCLRCGGIFRKKKEVIEHLLSVHKQNKL 30


>gnl|CDD|221734 pfam12722, Hid1, High-temperature-induced dauer-formation protein. 
           Hid1 (high-temperature-induced dauer-formation protein
           1) represents proteins of approximately 800 residues
           long and is conserved from fungi to humans. It contains
           up to seven potential transmembrane domains separated by
           regions of low complexity. Functionally it might be
           involved in vesicle secretion or be an inter-cellular
           signalling protein or be a novel insulin receptor.
          Length = 813

 Score = 29.3 bits (66), Expect = 1.4
 Identities = 17/89 (19%), Positives = 33/89 (37%), Gaps = 4/89 (4%)

Query: 72  LRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEP 131
           LR   +   + ++  +SS  E   S +  +   + +P+    S    +   P E +SL  
Sbjct: 562 LRHRKVFDQLRNLILDSSQEEDERSNQSASGSLSDNPSNDNDSRSPSLSEVPEENKSLAI 621

Query: 132 ----SDMLGRRSRSRTPSPPLRSHSPISI 156
                      S S   +PP  +  P+ +
Sbjct: 622 TDDFDPASRENSTSEAAAPPSVNSVPLQL 650


>gnl|CDD|218276 pfam04814, HNF-1_N, Hepatocyte nuclear factor 1 (HNF-1), N
           terminus.  This family consists of the N terminus of
           homeobox-containing transcription factor HNF-1. This
           region contains a dimerisation sequence and an acidic
           region that may be involved in transcription activation.
           Mutations and the common Ala/Val 98 polymorphism in
           HNF-1 cause the type 3 form of maturity-onset diabetes
           of the young (MODY3).
          Length = 177

 Score = 28.3 bits (63), Expect = 1.8
 Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 15/83 (18%)

Query: 72  LRRHALTHAVGDVPPESSLAESIGSLKRDNL------------PPAPSPTGSQSSEDIEI 119
           L + AL  A+G++ PE  + ES+ +L+  NL                  +  ++SED E 
Sbjct: 21  LSKEALIQALGELEPEPEVGESVITLQHLNLDKGESCHALPNGLGEGRLSEDETSEDGED 80

Query: 120 ERTP---SEERSLEPSDMLGRRS 139
           +R P    E   L P +   +R+
Sbjct: 81  DRPPPILKELERLSPEEAAHQRA 103


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 28.6 bits (64), Expect = 1.8
 Identities = 14/36 (38%), Positives = 17/36 (47%), Gaps = 3/36 (8%)

Query: 114 SEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLR 149
             D E E    EE  L+ SDM    S++RT    LR
Sbjct: 46  DTDYEDE---DEELYLDESDMGDSDSKTRTTVRNLR 78


>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509).  This
           family consists of several uncharacterized viral
           proteins from the Marek's disease-like viruses. Members
           of this family are typically around 400 residues in
           length. The function of this family is unknown.
          Length = 377

 Score = 28.5 bits (63), Expect = 2.7
 Identities = 23/87 (26%), Positives = 32/87 (36%), Gaps = 13/87 (14%)

Query: 99  RDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLRSHSPISIFR 158
               PP P P+G   S      RTP+   ++   D     +R   PSPP           
Sbjct: 196 AAQRPPQPGPSGQNRSP-----RTPTLS-NVRVLDAPVATNRGEAPSPPRTDTL------ 243

Query: 159 RHQSPSPIGIYRRHSQSPQPAHSSSIP 185
               P+  G  R  +++P P  SS  P
Sbjct: 244 -DPDPAIAGPSRAVNRTPSPRPSSPPP 269


>gnl|CDD|215194 PLN02340, PLN02340, endoglucanase.
          Length = 614

 Score = 28.3 bits (63), Expect = 2.8
 Identities = 15/58 (25%), Positives = 27/58 (46%), Gaps = 4/58 (6%)

Query: 138 RSRSRTPSPPLRSHSPIS-IFRRHQSPSPIGIYRRHSQSPQPAHSSSIPFRHHSPIKF 194
           RS      P L  ++P+  +F + QS S          S +P  ++S P +  +P++F
Sbjct: 470 RSNYEQTEPTLSGNAPLVGLFAKLQSASETADA---YGSYKPTPNTSSPKQSGAPVEF 524


>gnl|CDD|199820 cd10495, MH2_R-SMAD, C-terminal Mad Homology 2 (MH2) domain in
           receptor regulated SMADs.  The MH2 domain is located at
           the C-terminus of the SMAD (small mothers against
           decapentaplegic) family of proteins, which are signal
           transducers and transcriptional modulators that mediate
           multiple signaling pathways. The MH2 domain is
           responsible for type I receptor interaction,
           phosphorylation-triggered homo- and
           hetero-oligomerization, and transactivation. It is
           negatively regulated by the N-terminal MH1 domain.
           Receptor regulated SMADs (R-SMADs) include SMAD1, SMAD2,
           SMAD3, SMAD5 and SMAD9. SMAD1 plays an essential role in
           bone development and postnatal bone formation through
           activation by bone morphogenetic protein (BMP) type 1
           receptor kinase. SMAD2 regulates multiple cellular
           processes, such as cell proliferation, apoptosis and
           differentiation, while SMAD3 modulates signals of
           activin and TGF-beta. SMAD5 is involved in BMP signal
           modulation, possibly playing a role in the pathway
           involving inhibition of hematopoietic progenitor cells
           by TGF-beta. SMAD9 (also known as SMAD8) can mediate the
           differentiation of mesenchymal stem cells into
           tendon-like cells by inhibiting the osteogenic pathway.
          Length = 182

 Score = 27.7 bits (62), Expect = 3.4
 Identities = 22/69 (31%), Positives = 30/69 (43%), Gaps = 12/69 (17%)

Query: 35  LLTNV----LTSNLKTHL-----LTHTDHKPY-ECNSCGKVF--RRNCDLRRHALTHAVG 82
           LL+NV       N + H+     L +   + Y EC S   +F   RNC+LR       V 
Sbjct: 43  LLSNVNRNATIENTRRHIGRGVHLFYVGGEVYAECLSDSAIFVQSRNCNLRHGFHPATVC 102

Query: 83  DVPPESSLA 91
            +PP  SL 
Sbjct: 103 KIPPGCSLK 111


>gnl|CDD|234570 PRK00009, PRK00009, phosphoenolpyruvate carboxylase; Reviewed.
          Length = 911

 Score = 27.8 bits (63), Expect = 4.6
 Identities = 13/33 (39%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 73  RRHALTHAVGDVPPESSLAESIGSLKRDNLPPA 105
           RR    HA G  P   SLAE++  LK   + P 
Sbjct: 93  RRRRE-HASGSQPQPGSLAETLRRLKAAGVSPE 124


>gnl|CDD|144655 pfam01141, Gag_p12, Gag polyprotein, inner coat protein p12.  The
           retroviral p12 is a virion structural protein. p12 is
           proline rich. The function carried out by p12 in
           assembly and replication is unknown. p12 is associated
           with pathogenicity of the virus.
          Length = 85

 Score = 26.0 bits (57), Expect = 5.4
 Identities = 19/81 (23%), Positives = 32/81 (39%), Gaps = 4/81 (4%)

Query: 76  ALTHAVGDVPPESSLAESIGS----LKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEP 131
           ALT ++   PP+  +    G     L  ++ PP   P  S  + + + E   +   + +P
Sbjct: 2   ALTPSIKPKPPKPQVLPDSGGPLIDLLTEDPPPYRGPGPSPPAGEGDEEEAATTSEAPDP 61

Query: 132 SDMLGRRSRSRTPSPPLRSHS 152
           S M  R    R P     + S
Sbjct: 62  SPMASRLRGRREPPVADSTTS 82


>gnl|CDD|235357 PRK05177, minC, septum formation inhibitor; Reviewed.
          Length = 239

 Score = 26.8 bits (60), Expect = 6.0
 Identities = 16/54 (29%), Positives = 24/54 (44%), Gaps = 9/54 (16%)

Query: 101 NLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPS----PPLRS 150
            +PPA    G + + D+EI   P EE +  P       + +  PS     P+RS
Sbjct: 94  GMPPAL--VGGRPAGDVEI---PEEEPAAPPPAPAAPEAPAAVPSLVIEEPVRS 142


>gnl|CDD|201791 pfam01428, zf-AN1, AN1-like Zinc finger.  Zinc finger at the
          C-terminus of An1, a ubiquitin-like protein in Xenopus
          laevis. The following pattern describes the zinc
          finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C
          Where X can be any amino acid, and numbers in brackets
          indicate the number of residues.
          Length = 43

 Score = 24.7 bits (54), Expect = 6.6
 Identities = 8/24 (33%), Positives = 10/24 (41%), Gaps = 3/24 (12%)

Query: 56 PYECNSCGKVFRRNCDLRRHALTH 79
          P++C  CG  F   C   R    H
Sbjct: 13 PFKCRFCGGTF---CLKHRLPEDH 33


>gnl|CDD|235290 PRK04351, PRK04351, hypothetical protein; Provisional.
          Length = 149

 Score = 26.4 bits (59), Expect = 6.7
 Identities = 9/18 (50%), Positives = 11/18 (61%), Gaps = 3/18 (16%)

Query: 57  YECNSCGKVFRRNCDLRR 74
           YEC SCG+ + R    RR
Sbjct: 113 YECQSCGQQYLRK---RR 127


>gnl|CDD|227785 COG5498, ACF2, Predicted glycosyl hydrolase [Cell envelope
           biogenesis, outer membrane].
          Length = 760

 Score = 27.1 bits (60), Expect = 6.9
 Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 3/48 (6%)

Query: 131 PSDMLGRRSRSRTPSPPLRSH--SPISIFRRHQSPSPIGIYRRHSQSP 176
            S  + +   S    PP+ S   +P S FR  Q P P    R  S  P
Sbjct: 43  HSVSVPKGD-SNVEFPPILSGTINPPSTFRSVQHPVPPPGVRSDSTGP 89


>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777).  This is
           a family of eukaryotic proteins of unknown function.
           Some of the proteins in this family are putative nucleic
           acid binding proteins.
          Length = 158

 Score = 26.4 bits (58), Expect = 7.1
 Identities = 19/42 (45%), Positives = 21/42 (50%), Gaps = 4/42 (9%)

Query: 124 SEERSLEPSDMLGRRSRSRTPSPPLRSHSPISIFRRHQSPSP 165
             ERS        RRSRSR+P    RS SP    RRH+S S 
Sbjct: 27  RRERSRSRERDRRRRSRSRSPHRSRRSRSP----RRHRSRSR 64


>gnl|CDD|215346 PLN02643, PLN02643, ADP-glucose phosphorylase.
          Length = 336

 Score = 26.6 bits (59), Expect = 8.8
 Identities = 13/46 (28%), Positives = 18/46 (39%), Gaps = 5/46 (10%)

Query: 145 SPPLRSHSPIS----IFRRHQSPSPIGIYRRHSQSPQPAHSSSIPF 186
              LR   P++    IF   +   P     +  Q+P   HSS  PF
Sbjct: 1   MAELR-KDPVTNRWVIFSPARGKRPTDFKSKSPQNPNGNHSSGCPF 45


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 26.9 bits (59), Expect = 9.1
 Identities = 26/94 (27%), Positives = 33/94 (35%), Gaps = 19/94 (20%)

Query: 104 PAPSPTGSQSSEDIEIERTPSEERSLEPSD-MLGRRSRSRTP--------SPPLRSH--- 151
           P   P   +   D   E   S++      D   GRR R R+P        SPP R     
Sbjct: 230 PTSDPEDEEDELDDVEEVIESDDHFFLDLDGERGRRRRKRSPRTSPKRFRSPPPRKARGR 289

Query: 152 SPISIFRRHQSPSPIGIYRRHSQSPQPAHSSSIP 185
           SP  + R    P          +SP P H+S  P
Sbjct: 290 SPRRLIRSPPPP-------GRLRSPPPLHASDSP 316


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.129    0.387 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,585,664
Number of extensions: 849921
Number of successful extensions: 735
Number of sequences better than 10.0: 1
Number of HSP's gapped: 716
Number of HSP's successfully gapped: 86
Length of query: 194
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 102
Effective length of database: 6,857,034
Effective search space: 699417468
Effective search space used: 699417468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 56 (25.5 bits)