RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11539
         (530 letters)



>gnl|CDD|217322 pfam03022, MRJP, Major royal jelly protein.  Royal jelly is the
           food of queen bee larvae, and is responsible for the
           high reproductive ability of the queen. Major royal
           jelly proteins make up around 90% of larval jelly
           proteins. This family also the sequence-related yellow
           protein of drosophila which controls pigmentation of the
           adult cuticle and larval mouth parts.
          Length = 285

 Score =  220 bits (562), Expect = 6e-68
 Identities = 98/269 (36%), Positives = 150/269 (55%), Gaps = 35/269 (13%)

Query: 240 ISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEG 299
           +SV+R++VD+CDRLW++D+G+ N L   +Q+CPPK++VFDL T+ L++++ LP       
Sbjct: 1   VSVYRIAVDECDRLWVLDSGIVNTLQPPKQICPPKLLVFDLATDKLLKRFELPADVAKGN 60

Query: 300 SLFSNIVTEVVED-CDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILD 358
           S   N+V +V +D CD  F Y+ D    GLIVYD   + S+R+ H   YP+P    + + 
Sbjct: 61  SRLVNLVVDVGDDTCDDTFVYIADAKGRGLIVYDLADDKSWRVEHNTFYPDPDFGKFTIA 120

Query: 359 NLKFRWVDGIFGMAISPELSGYKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSM 418
              F+  DGIFG+A+SP                                 D R +YFH +
Sbjct: 121 GESFQLDDGIFGLALSP------------------------------RTPDGRTLYFHPL 150

Query: 419 SSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFKNTQASASAINSNGVMFYNLVTKH 478
           +S R Y V T  LRN + + N++  ++ F  LG R  NTQ++A A++ NGV+F+ LV ++
Sbjct: 151 ASTRLYSVPTEVLRNETNWGNNAQYED-FKDLGDR--NTQSTALAVDPNGVLFFGLVGQN 207

Query: 479 SVGCWNTKTKVYLPQTQDIVQTSRDILNF 507
           +VGCWNT +  Y      +V  + D L F
Sbjct: 208 AVGCWNT-STPYSRANLGVVARNSDTLQF 235


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 35.3 bits (81), Expect = 0.067
 Identities = 16/90 (17%), Positives = 22/90 (24%), Gaps = 2/90 (2%)

Query: 58  RPQS--QNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPA 115
            PQ      +E+     +  P       MA   P     P+       QP          
Sbjct: 173 APQVSGLYPEESPYQPQSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGG 232

Query: 116 RPSTDQTKSFYPSVPSNGHPSVPSNAYPSS 145
           RP++       P           S   PS 
Sbjct: 233 RPNSGFPSGQQPPPSQGQEGYGYSGPPPSK 262



 Score = 29.9 bits (67), Expect = 3.1
 Identities = 20/94 (21%), Positives = 29/94 (30%), Gaps = 4/94 (4%)

Query: 59  PQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKN---NNQPPNTKSNAGPA 115
            QSQ    ++ P S   P      P  P+   +P  P   Q       QP   +    P 
Sbjct: 88  QQSQAPPSHQYP-SQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQ 146

Query: 116 RPSTDQTKSFYPSVPSNGHPSVPSNAYPSSSHFQ 149
            P   Q +S         +P   + + P  S   
Sbjct: 147 VPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLY 180


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 34.6 bits (79), Expect = 0.16
 Identities = 21/110 (19%), Positives = 30/110 (27%), Gaps = 22/110 (20%)

Query: 58  RPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPA----------MGQKNNNQPPN 107
           + Q Q +  N      Q         +   G    +                    QP  
Sbjct: 355 QQQQQQQGGNHPAAHQQQMNQ----QVGQGGQMVALGYLNIQGNQGGLGANPMQQGQPGM 410

Query: 108 TKSNAGPARPSTDQT--KSFYPSVPSNGHP------SVPSNAYPSSSHFQ 149
             S +   +  T+Q+  +   PSVPS G P      SV     PS     
Sbjct: 411 MSSPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALM 460



 Score = 32.7 bits (74), Expect = 0.55
 Identities = 28/122 (22%), Positives = 34/122 (27%), Gaps = 22/122 (18%)

Query: 45  YTNNFLNVDLKKLRPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQ 104
           Y N   N       P  Q +    M   +  P V T + M      QP  P+ G   +  
Sbjct: 388 YLNIQGNQGGLGANPMQQGQPG-MMSSPSPVPQVQTNQSMPQPP--QPSVPSPGGPGSQP 444

Query: 105 PPNTKSN-----------------AGPARPSTDQTKSFYPSVPSN--GHPSVPSNAYPSS 145
           P +                       PA   T Q     P  P N  G  SV S A P  
Sbjct: 445 PQSVSGGMIPSPPALMPSPSPQMSQSPASQRTIQQDMVSPGGPLNTPGQSSVNSPANPQE 504

Query: 146 SH 147
             
Sbjct: 505 EQ 506


>gnl|CDD|145501 pfam02389, Cornifin, Cornifin (SPRR) family.  SPRR genes (formerly
           SPR) encode a novel class of polypeptides (small proline
           rich proteins) that are strongly induced during
           differentiation of human epidermal keratinocytes in
           vitro and in vivo. The most characteristic feature of
           the SPRR gene family resides in the structure of the
           central segments of the encoded polypeptides that are
           built up from tandemly repeated units of either eight
           (SPRR1 and SPRR3) or nine (SPRR2) amino acids with the
           general consensus XKXPEPXX where X is any amino acid.
          Length = 153

 Score = 32.7 bits (74), Expect = 0.22
 Identities = 15/61 (24%), Positives = 19/61 (31%), Gaps = 2/61 (3%)

Query: 83  PMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVPSNGHPSVPSNAY 142
              P  P +   P        +P   K    P      +    +P VP  GH  VP   Y
Sbjct: 67  TKVP-EPGRTKVPEPCCTKVPEPGYPKV-PEPGSTKVPEPGPCHPKVPEPGHTKVPEPGY 124

Query: 143 P 143
           P
Sbjct: 125 P 125



 Score = 29.2 bits (65), Expect = 3.5
 Identities = 10/54 (18%), Positives = 15/54 (27%)

Query: 90  SQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVPSNGHPSVPSNAYP 143
            +P    + Q  N + P   +   P    T   +     VP      VP     
Sbjct: 22  KEPCHTKVPQPGNTKIPEPGNTKVPEPGCTKVPEPGCTKVPEPCSTKVPEPGRT 75


>gnl|CDD|135173 PRK04654, PRK04654, sec-independent translocase; Provisional.
          Length = 214

 Score = 33.2 bits (75), Expect = 0.25
 Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 3/70 (4%)

Query: 70  PVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSV 129
           PV   AP +   +P+AP+ P Q + PA        P      + PA P +          
Sbjct: 141 PVPAPAPVIAQAQPIAPA-PHQTLVPAPHDTIVPAPHAAHLPSAPATPVS--VAPVDAGT 197

Query: 130 PSNGHPSVPS 139
            ++  PS P+
Sbjct: 198 SASPTPSEPT 207


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 33.5 bits (76), Expect = 0.32
 Identities = 25/101 (24%), Positives = 33/101 (32%), Gaps = 13/101 (12%)

Query: 62  QNEKENRMPVSTQAPAVTTYRPMAPS-GPSQPMRPAMGQKNNNQPPN----TKSNAGPAR 116
           Q   + + P S Q P      P AP   PS    P  G     QP          +  + 
Sbjct: 170 QQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISA 229

Query: 117 PSTDQTKSFYPSVP--------SNGHPSVPSNAYPSSSHFQ 149
           PS    +   P  P         +  P  PS+ +P SSH  
Sbjct: 230 PSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHG 270



 Score = 31.2 bits (70), Expect = 1.7
 Identities = 19/91 (20%), Positives = 31/91 (34%), Gaps = 7/91 (7%)

Query: 56  KLRPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPA 115
           +L PQSQ  +     V  Q P +T  + + P   + P   +       Q P  +      
Sbjct: 407 QLMPQSQPLQ----SVPAQPPVLTQSQSLPPKASTHP--HSGLHSGPPQSPFAQHPFTSG 460

Query: 116 -RPSTDQTKSFYPSVPSNGHPSVPSNAYPSS 145
             P+     S   S P+    +   +  P S
Sbjct: 461 GLPAIGPPPSLPTSTPAAPPRASSGSQPPGS 491


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 33.1 bits (76), Expect = 0.35
 Identities = 14/84 (16%), Positives = 19/84 (22%), Gaps = 2/84 (2%)

Query: 64  EKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTK 123
              +  P S+         P  PS PS    P       +  P     +           
Sbjct: 182 ASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGP- 240

Query: 124 SFYPSVPSNGHPSVPSNAYPSSSH 147
              P  P        S A P+   
Sbjct: 241 -AAPPPPPVQQVPPLSTAKPTPPS 263


>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
           (LMP2) protein.  This family consists of several
           Gammaherpesvirus latent membrane protein (LMP2)
           proteins. Epstein-Barr virus is a human Gammaherpesvirus
           that infects and establishes latency in B lymphocytes in
           vivo. The latent membrane protein 2 (LMP2) gene is
           expressed in latently infected B cells and encodes two
           protein isoforms, LMP2A and LMP2B, that are identical
           except for an additional N-terminal 119 aa cytoplasmic
           domain which is present in the LMP2A isoform. LMP2A is
           thought to play a key role in either the establishment
           or the maintenance of latency and/or the reactivation of
           productive infection from the latent state. The
           significance of LMP2B and its role in pathogenesis
           remain unclear.
          Length = 489

 Score = 33.3 bits (76), Expect = 0.38
 Identities = 9/56 (16%), Positives = 16/56 (28%), Gaps = 4/56 (7%)

Query: 83  PMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVPSNGHPSVP 138
           P+   GP     P   + ++N    +   +   RP             +  H   P
Sbjct: 8   PLGAGGPRSHGGPDGDEGDSNPYYPSSFGSSWDRP----GPPVPEDYDAPSHRPPP 59


>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle. 
          Length = 519

 Score = 32.9 bits (74), Expect = 0.41
 Identities = 20/81 (24%), Positives = 30/81 (37%), Gaps = 4/81 (4%)

Query: 74  QAPAVTTYRPMAPSGPSQPMR----PAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSV 129
           + PA        PSG S  ++    PA        PP T   AGPA     + ++     
Sbjct: 36  EMPAAKNPPQAKPSGKSPQVKAASAPAKESPQKGAPPVTPGKAGPAAAQAGEEEAKSSEE 95

Query: 130 PSNGHPSVPSNAYPSSSHFQV 150
            S+     P+ A  ++S  Q 
Sbjct: 96  ESDSEGETPTAATLTTSPAQA 116


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 33.0 bits (75), Expect = 0.52
 Identities = 29/99 (29%), Positives = 36/99 (36%), Gaps = 20/99 (20%)

Query: 59  PQSQNEKENRMPVSTQAPAVTTYRPMAPSG------PSQPMRPAMGQKNNNQPPNTKSNA 112
           P+S +E    M     AP  T  R  A         PS P R A        P   K+N 
Sbjct: 80  PKSSSE----MEGEMLAPQATVGRDEATPSIAMENTPSPPRRTA-----KITPTTPKNNY 130

Query: 113 GPARPSTDQTKSFYPSVPSNGHPSVPSNAYPSSSHFQVV 151
            P    T++ K   P+ PS        N Y S+S  Q V
Sbjct: 131 SPTAAGTERVKEDTPATPSRA-----LNHYISTSGRQRV 164


>gnl|CDD|226868 COG4461, LprI, Uncharacterized protein conserved in bacteria,
           putative lipoprotein [Function unknown].
          Length = 185

 Score = 31.4 bits (71), Expect = 0.71
 Identities = 8/39 (20%), Positives = 8/39 (20%), Gaps = 6/39 (15%)

Query: 70  PVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNT 108
             S   P V  Y                 Q  N   P T
Sbjct: 102 DPSIDKPLVLEYGCSTQD------GALTAQFYNQFDPKT 134


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 32.1 bits (73), Expect = 0.73
 Identities = 16/66 (24%), Positives = 18/66 (27%), Gaps = 9/66 (13%)

Query: 83  PMAPSGPSQPMRPA-MGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVP-SNGHPSVPSN 140
           PM P GP +P   A M       P     NA    P            P     P     
Sbjct: 425 PMGPGGPLRPNGLAPMNA--VRAPSRNAQNAAQKPPMQP-----VMYPPNYQSLPLSQDL 477

Query: 141 AYPSSS 146
             P S+
Sbjct: 478 PQPQST 483



 Score = 30.2 bits (68), Expect = 3.4
 Identities = 14/94 (14%), Positives = 23/94 (24%), Gaps = 6/94 (6%)

Query: 57  LRPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPAR 116
              Q        M  +   P      P+ P+G +     A+   + N     +       
Sbjct: 409 FNGQPLGWPRMSMMPTPMGPG----GPLRPNGLAPM--NAVRAPSRNAQNAAQKPPMQPV 462

Query: 117 PSTDQTKSFYPSVPSNGHPSVPSNAYPSSSHFQV 150
                 +S   S       S  S    +    QV
Sbjct: 463 MYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQV 496


>gnl|CDD|147322 pfam05078, DUF679, Protein of unknown function (DUF679).  This
           family contains several uncharacterized plant proteins.
          Length = 169

 Score = 31.1 bits (71), Expect = 0.82
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 180 NNLPLGIGIWRSTIFLSFPKWKAGIPF 206
            NLPLG+G+  S +F+ FP  + GI +
Sbjct: 143 TNLPLGVGVVSSFVFMVFPTTRHGIGY 169


>gnl|CDD|234974 PRK01722, PRK01722, formimidoylglutamase; Provisional.
          Length = 320

 Score = 31.5 bits (72), Expect = 1.2
 Identities = 18/58 (31%), Positives = 19/58 (32%), Gaps = 15/58 (25%)

Query: 430 DLRNSSRYVNSS------------DIDEYFHY--LG-SRFKNTQASASAINSNGVMFY 472
           DLRN      SS                 FHY  +G SR  NTQA        GV   
Sbjct: 156 DLRNLEDGGPSSGTPFRQLLEYCDAQIRGFHYACIGVSRASNTQALWEEAKELGVTVV 213


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 31.7 bits (71), Expect = 1.2
 Identities = 19/86 (22%), Positives = 35/86 (40%), Gaps = 3/86 (3%)

Query: 64  EKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTK 123
           E  N    +T+    TT +  AP  P++  +PA     N    N          +T+ T 
Sbjct: 273 ENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNT---NANKTNTNTNTNTNNTN 329

Query: 124 SFYPSVPSNGHPSVPSNAYPSSSHFQ 149
           +  PS  +N + +  +N   +++  Q
Sbjct: 330 TSTPSKNTNTNTNSNTNTNSNTNANQ 355


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 31.8 bits (72), Expect = 1.2
 Identities = 21/84 (25%), Positives = 31/84 (36%), Gaps = 2/84 (2%)

Query: 59   PQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAM-GQKNNNQPPNTKSNAGPARP 117
            P +  +    +P +   PAV    P  P GP++P RP          PP   +   P R 
Sbjct: 2725 PAAARQASPALPAAPAPPAVPA-GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL 2783

Query: 118  STDQTKSFYPSVPSNGHPSVPSNA 141
            +     S   S  S   P  P++ 
Sbjct: 2784 TRPAVASLSESRESLPSPWDPADP 2807



 Score = 30.3 bits (68), Expect = 3.7
 Identities = 17/75 (22%), Positives = 26/75 (34%), Gaps = 2/75 (2%)

Query: 70   PVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPAR-PSTDQTKSFYPS 128
            P     P  T+ +P AP  P  P  P++    +  P        P+R P+        P 
Sbjct: 2824 PAGPLPPP-TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPP 2882

Query: 129  VPSNGHPSVPSNAYP 143
            V     P+V  +   
Sbjct: 2883 VRRLARPAVSRSTES 2897


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 31.4 bits (71), Expect = 1.3
 Identities = 18/79 (22%), Positives = 30/79 (37%), Gaps = 5/79 (6%)

Query: 70  PVSTQAPAVTTYRPMAPSGPSQPMRPAMGQK----NNNQPPNTKSNAGPARPSTDQTKSF 125
           P +   PA    RP+A +  + P R A        +++ PP  +     A P+  Q  + 
Sbjct: 458 PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAA 517

Query: 126 YPSVPSNGHPSVPSNAYPS 144
                +   P  P+ A P 
Sbjct: 518 PAGWVAESIPD-PATADPD 535


>gnl|CDD|191179 pfam05053, Menin, Menin.  MEN1, the gene responsible for multiple
           endocrine neoplasia type 1, is a tumour suppressor gene
           that encodes a protein called Menin which may be an
           atypical GTPase stimulated by nm23.
          Length = 618

 Score = 31.1 bits (70), Expect = 1.8
 Identities = 22/83 (26%), Positives = 26/83 (31%), Gaps = 11/83 (13%)

Query: 64  EKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTK 123
           E   R P           +   PSG   P  P +   NNN   N  +N G  R     T 
Sbjct: 477 EGRRRGPRRES-------KSQEPSGGESP-NPELPANNNNSNSNNNNNNGADRKEAAATT 528

Query: 124 SFYPSVPSNGHPSVPSNAYPSSS 146
               +  SNG         P SS
Sbjct: 529 G-NATTTSNG-SGTSVPL-PVSS 548


>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin.  Amelogenins play a role in
           biomineralisation. They seem to regulate the formation
           of crystallites during the secretory stage of tooth
           enamel development. thought to play a major role in the
           structural organisation and mineralisation of developing
           enamel. They are found in the extracellular matrix.
           Mutations in X-chromosomal amelogenin can cause
           Amelogenesis imperfecta.
          Length = 174

 Score = 29.9 bits (67), Expect = 2.0
 Identities = 22/79 (27%), Positives = 32/79 (40%), Gaps = 6/79 (7%)

Query: 59  PQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPS 118
           PQ Q     ++    Q   +   +PM P     PM P  GQ+ + QPP       P +P+
Sbjct: 58  PQQQQSAHPKLTPHHQLLILPPQQPMMPVPGHHPMVPMTGQQPHLQPPAQH----PLQPT 113

Query: 119 TDQTKSFYPSVPSNGHPSV 137
             Q     P  P++  P V
Sbjct: 114 YGQNPQ--PQQPTHTQPPV 130


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 30.6 bits (68), Expect = 2.1
 Identities = 21/76 (27%), Positives = 31/76 (40%), Gaps = 2/76 (2%)

Query: 71  VSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVP 130
           V+  APA       AP+ P   + PA  +     PP+T+    P  P     K    + P
Sbjct: 107 VTCPAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQ--CPPAPPLPTPKPAPAAKP 164

Query: 131 SNGHPSVPSNAYPSSS 146
              H  +P   YP++S
Sbjct: 165 IFLHNQLPPPDYPAAS 180


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 30.3 bits (69), Expect = 2.7
 Identities = 17/74 (22%), Positives = 23/74 (31%), Gaps = 10/74 (13%)

Query: 66  ENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSF 125
           E     ++  P     RP AP+ P+ P  PA             + A PA  S       
Sbjct: 598 EGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAG----------AAAAPAEASAAPAPGV 647

Query: 126 YPSVPSNGHPSVPS 139
                   H +VP 
Sbjct: 648 AAPEHHPKHVAVPD 661


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 30.4 bits (68), Expect = 3.3
 Identities = 18/111 (16%), Positives = 30/111 (27%), Gaps = 6/111 (5%)

Query: 35  DWEKELFNTNYTNNFLNVDLKKLRPQSQNEKENRM----PVSTQAPAVTTYRPMAPSGPS 90
               ++  T Y   +  +     +P              P + Q P         P+ P 
Sbjct: 652 HQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPP 711

Query: 91  QPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVPSNGHPSVPSNA 141
              +         +PP   +  G ARP         P   + G    P+ A
Sbjct: 712 GRAQRPAAATGRARPP--AAAPGRARPPAAAPGRARPPAAAPGRARPPAAA 760



 Score = 29.3 bits (65), Expect = 7.1
 Identities = 18/74 (24%), Positives = 25/74 (33%), Gaps = 2/74 (2%)

Query: 68  RMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYP 127
           R P   + PA    R   P+  +   RP        +PP   +  G ARP         P
Sbjct: 699 RAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPP--AAAPGRARPPAAAPGRARP 756

Query: 128 SVPSNGHPSVPSNA 141
              + G    P+ A
Sbjct: 757 PAAAPGRARPPAAA 770


>gnl|CDD|238803 cd01569, PBEF_like, pre-B-cell colony-enhancing factor (PBEF)-like.
           The mammalian members of this group of nicotinate
           phosphoribosyltransferases (NAPRTases) were originally
           identified as genes whose expression is upregulated upon
           activation in lymphoid cells. In general, nicotinate
           phosphoribosyltransferase catalyses the formation of
           NAMN and PPi from 5-phosphoribosy -1-pyrophosphate
           (PRPP) and nicotinic acid, this is the first, and also
           rate limiting, reaction in the NAD salvage synthesis.
          Length = 407

 Score = 29.9 bits (68), Expect = 3.4
 Identities = 7/18 (38%), Positives = 11/18 (61%)

Query: 338 SYRLTHPYMYPEPTQSTY 355
           SY+++H   YP  T + Y
Sbjct: 6   SYKVSHRLQYPPGTTAVY 23


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 30.1 bits (68), Expect = 3.5
 Identities = 12/63 (19%), Positives = 20/63 (31%), Gaps = 2/63 (3%)

Query: 86  PSGPSQPMRPAMGQKNNNQPPNTKSNAGPARP--STDQTKSFYPSVPSNGHPSVPSNAYP 143
             GP Q ++P   Q      P+  + A P+    S     S   S           +  P
Sbjct: 372 GRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDP 431

Query: 144 SSS 146
            ++
Sbjct: 432 PAA 434


>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
           protein. 
          Length = 314

 Score = 29.9 bits (67), Expect = 3.6
 Identities = 15/64 (23%), Positives = 21/64 (32%), Gaps = 4/64 (6%)

Query: 62  QNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQ---KNNNQPPNTKSNAGPARPS 118
             +K    P  T   +      +    P QP  P       KN N  P+ K+    A  S
Sbjct: 94  NLKKSVPRPSDTTL-SCANLSSLREPSPGQPPAPPPSVLSGKNANCIPSQKNAPSIAITS 152

Query: 119 TDQT 122
           T  +
Sbjct: 153 TGGS 156


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 30.0 bits (67), Expect = 3.7
 Identities = 28/112 (25%), Positives = 39/112 (34%), Gaps = 9/112 (8%)

Query: 68  RMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYP 127
           R+  ST +P+ T      PSG +    PA    +    P + S     R     T S   
Sbjct: 269 RVGDSTSSPSTTR-----PSGSTTTTTPASSGPSAPGGPGSSSRNAVTR----STDSISG 319

Query: 128 SVPSNGHPSVPSNAYPSSSHFQVVFRWKIMDFVFRDNKQKVNLIRSNKFIPE 179
            +PS G P   +        F V F         R    + N+ +  KFI E
Sbjct: 320 PIPSPGAPRAITGQMGEREMFAVQFLGDFKPKPRRYEGDRTNVAKLKKFIFE 371


>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
          Length = 327

 Score = 29.7 bits (67), Expect = 3.8
 Identities = 14/77 (18%), Positives = 19/77 (24%), Gaps = 4/77 (5%)

Query: 70  PVSTQAPAVTTYRPMAPSGPSQ----PMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSF 125
             ST   A  T  P   +           P           N    A  A P     K  
Sbjct: 170 NTSTLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPK 229

Query: 126 YPSVPSNGHPSVPSNAY 142
             +  +    S P++ Y
Sbjct: 230 SGAASARALSSAPASHY 246


>gnl|CDD|217106 pfam02559, CarD_CdnL_TRCF, CarD-like/TRCF domain.  CarD is a
           Myxococcus xanthus protein required for the activation
           of light- and starvation-inducible genes. This family
           includes the presumed N-terminal domain, CdnL. CarD
           interacts with the zinc-binding protein CarG to form a
           complex that regulates multiple processes in Myxococcus
           xanthus. This family also includes a domain to the
           N-terminal side of the DEAD helicase of TRCF
           (transcription-repair-coupling factor) proteins. TRCF
           displaces RNA polymerase stalled at a lesion, binds to
           the damage recognition protein UvrA, and increases the
           template strand repair rate during transcription. This
           domain is involved in binding to the stalled RNA
           polymerase. The family includes members otherwise
           referred to as CdnL, for CarD N-terminal like,
           whichdiffer functionally from CarD. The TRCF domain
           mentioned above is the RNA polymerase-interacting domain
           or RID.
          Length = 98

 Score = 27.9 bits (63), Expect = 4.4
 Identities = 11/49 (22%), Positives = 18/49 (36%)

Query: 404 KTIRDDQRYMYFHSMSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGS 452
             +  ++R  Y    +     YV   +L    RY+ S D +     LG 
Sbjct: 24  IEVGGEKREYYVLEYADGDKLYVPVDNLDLIGRYIGSKDEEPVLDKLGG 72


>gnl|CDD|152703 pfam12268, DUF3612, Protein of unknown function (DUF3612).  This
           domain family is found in bacteria, and is approximately
           180 amino acids in length. The family is found in
           association with pfam01381.
          Length = 178

 Score = 29.0 bits (65), Expect = 4.6
 Identities = 10/21 (47%), Positives = 14/21 (66%)

Query: 224 PYPNWSYFDDSNCNSLISVFR 244
           PYP+W YFD  +   L +V+R
Sbjct: 5   PYPHWHYFDAYSPGKLKAVYR 25


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 29.8 bits (66), Expect = 5.0
 Identities = 14/85 (16%), Positives = 27/85 (31%), Gaps = 9/85 (10%)

Query: 71  VSTQAPAVTTYRPMAPSGPS---QPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYP 127
            ++  P V      A S P+           Q     P N  +        +  T +   
Sbjct: 525 ATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTT 584

Query: 128 ------SVPSNGHPSVPSNAYPSSS 146
                 S P++  P +PS+++ +  
Sbjct: 585 GQHGTGSSPTSQQPGIPSSSHSTPR 609


>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
           factor N terminal domain.  The N terminus of the PEA3
           transcription factors is implicated in transactivation
           and in inhibition of DNA binding. Transactivation is
           potentiated by activation of the Ras/MAP kinase and
           protein kinase A signalling cascades. The N terminal
           region contains conserved MAP kinase phosphorylation
           sites.
          Length = 336

 Score = 29.5 bits (66), Expect = 5.1
 Identities = 22/76 (28%), Positives = 29/76 (38%), Gaps = 5/76 (6%)

Query: 74  QAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVPSNG 133
           + PA     P  PS P  P+ P    +   QP    SN+ P  P T          P + 
Sbjct: 124 RKPASGFKPPTPPSTPCSPVNPQETVR-QLQPSGPLSNSSPPSPHTPLPNQSPLPPPMS- 181

Query: 134 HPSVPSNAYPSSSHFQ 149
               P ++YPS   FQ
Sbjct: 182 ---SPDSSYPSEHRFQ 194


>gnl|CDD|236407 PRK09198, PRK09198, putative nicotinate phosphoribosyltransferase;
           Provisional.
          Length = 463

 Score = 29.5 bits (67), Expect = 5.5
 Identities = 7/18 (38%), Positives = 9/18 (50%)

Query: 338 SYRLTHPYMYPEPTQSTY 355
           SY+ +H   YP  T   Y
Sbjct: 13  SYKASHRLQYPPGTTYLY 30


>gnl|CDD|217990 pfam04257, Exonuc_V_gamma, Exodeoxyribonuclease V, gamma subunit.
           The Exodeoxyribonuclease V enzyme is a multi-subunit
           enzyme comprised of the proteins RecB, RecC (this
           family) and RecD. This enzyme plays an important role in
           homologous genetic recombination, repair of double
           strand DNA breaks resistance to UV irradiation and
           chemical DNA-damage. The enzyme (EC:3.1.11.5) catalyzes
           ssDNA or dsDNA-dependent ATP hydrolysis, hydrolysis of
           ssDNA or dsDNA and unwinding of dsDNA. This family
           consists of two AAA domains.
          Length = 756

 Score = 29.1 bits (66), Expect = 6.4
 Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 3/65 (4%)

Query: 403 DKTIRDDQRYMYFHSMSSNR-HYYVSTT--DLRNSSRYVNSSDIDEYFHYLGSRFKNTQA 459
           D++ RDD RY++  ++ S R H Y+S     +R++S    S  + E   YL   F     
Sbjct: 631 DRSRRDDDRYLFLEALLSAREHLYISYVGRSIRDNSERPPSVLVSELLDYLAQNFYLGDE 690

Query: 460 SASAI 464
           +  A+
Sbjct: 691 ALKAL 695


>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
          Length = 222

 Score = 28.5 bits (63), Expect = 6.7
 Identities = 19/79 (24%), Positives = 29/79 (36%), Gaps = 5/79 (6%)

Query: 53  DLKKLRP-----QSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPN 107
           DL +L+P     +S+N+K+       +                QP      Q+   QP N
Sbjct: 137 DLNRLKPHMMKTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNN 196

Query: 108 TKSNAGPARPSTDQTKSFY 126
            K  A     S D+T+  Y
Sbjct: 197 NKRKAMEEGLSPDRTRPRY 215


>gnl|CDD|218116 pfam04503, SSDP, Single-stranded DNA binding protein, SSDP.  This
           is a family of eukaryotic single-stranded DNA binding
           proteins with specificity to a pyrimidine-rich element
           found in the promoter region of the alpha2(I) collagen
           gene.
          Length = 293

 Score = 28.9 bits (64), Expect = 7.0
 Identities = 21/78 (26%), Positives = 31/78 (39%), Gaps = 3/78 (3%)

Query: 58  RPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARP 117
           RP       N +P S+ +P   T  P     P  P+ P+     N+   N  +   P  P
Sbjct: 166 RPWPNPISANSIPYSSSSPGEYTGPPGGGGPPGTPIMPSPADSTNSS-DNMYTVMNPVPP 224

Query: 118 STDQTKSFYPSVPSNGHP 135
              QT+S +P  P +  P
Sbjct: 225 G--QTRSNFPMGPGSDGP 240


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 28.9 bits (64), Expect = 7.4
 Identities = 12/55 (21%), Positives = 21/55 (38%)

Query: 55  KKLRPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTK 109
           K+ + Q+  E        T  P+  T +PM  +    P  P + Q   +  P  +
Sbjct: 115 KQEQSQTTTESTTPKTKVTTPPSTNTPQPMQSTKSDTPQSPTIKQAQTDMTPKYE 169


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 29.0 bits (65), Expect = 7.9
 Identities = 10/91 (10%), Positives = 13/91 (14%), Gaps = 1/91 (1%)

Query: 59  PQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPS 118
           PQ     +   P   Q P     +P  P     P           Q  N           
Sbjct: 246 PQFPGLSQQMPPPPPQPPQQQQ-QPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQL 304

Query: 119 TDQTKSFYPSVPSNGHPSVPSNAYPSSSHFQ 149
               +                          
Sbjct: 305 LPLVQQPQGQQRGPQFREQLVQLSQQQREAL 335


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 28.9 bits (64), Expect = 8.7
 Identities = 26/153 (16%), Positives = 52/153 (33%), Gaps = 15/153 (9%)

Query: 3   TSNINMAAPNLETDTMVNA--NPDN----TRMVNPNFVD-WEKELFNTNYTNNFLNVDLK 55
              IN+   N     + N   N D+      + +  F++ WE E  + ++  N  + D  
Sbjct: 20  ADGININGDNKIDKHIENEDFNFDDINSWMPIDDAEFLNSWEDEFSDESFLENKASKDDG 79

Query: 56  KLRPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAG-P 114
            +     +E       +  A      +    S  +  +      K++ +     + AG P
Sbjct: 80  NINLTDTSE-------NGDASKKGHGKSRVRSASAAAILEEDDSKDDMEFKANPNEAGKP 132

Query: 115 ARPSTDQTKSFYPSVPSNGHPSVPSNAYPSSSH 147
            +P  +Q +    S       S  S +  +S H
Sbjct: 133 GKPKGNQGEGLASSSDGKSKASAKSGSKSASKH 165


>gnl|CDD|221509 pfam12287, Caprin-1_C, Cytoplasmic
           activation/proliferation-associated protein-1 C term.
           This family of proteins is found in eukaryotes. Proteins
           in this family are typically between 343 and 708 amino
           acids in length. This family is the C terminal region of
           caprin-1. Caprin-1 is a protein involved in regulating
           cellular proliferation. In mutated phenotypes, the G1
           phase of the cell cycle is greatly lengthened, impairing
           normal proliferation. The C terminal region of caprin-1
           contains RGG motifs which are characteristic of RNA
           binding domains. It is possible that caprin-1 functions
           through an RNA binding mechanism.
          Length = 319

 Score = 28.4 bits (63), Expect = 9.2
 Identities = 23/86 (26%), Positives = 36/86 (41%), Gaps = 8/86 (9%)

Query: 66  ENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTK-SNAGPARPSTDQTKS 124
           E+R+    Q  AV    P+ P     PM     +   + PP  + S+    RP TD    
Sbjct: 48  ESRLS---QPSAV----PVQPEPTQVPMVSPTSEGYTSSPPLYQPSHTAEPRPQTDPIDP 100

Query: 125 FYPSVPSNGHPSVPSNAYPSSSHFQV 150
              S+  N   +  S++ P++S  QV
Sbjct: 101 IQASMSLNSEQTPTSSSLPAASQPQV 126


>gnl|CDD|227364 COG5031, COQ4, Uncharacterized protein involved in ubiquinone
           biosynthesis [Coenzyme metabolism].
          Length = 235

 Score = 28.2 bits (63), Expect = 9.5
 Identities = 14/76 (18%), Positives = 27/76 (35%), Gaps = 8/76 (10%)

Query: 198 PKWKAGIPFTLASFNMNDPSESPILLPYPNWSYFDDSNCNSLISVFRMSVDKCDRLWIMD 257
           P+  A +   L    +  P  S  L+ Y    +        LI+       K + LW  +
Sbjct: 165 PRSVALLAKNLLPLRLYQPELSRQLMDYIPRGWRMGLQAKPLIAQ------KWEELW--E 216

Query: 258 TGVTNILSSIQQLCPP 273
             +  + + +  + PP
Sbjct: 217 KPLEEVRTELNIILPP 232


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.133    0.408 

Gapped
Lambda     K      H
   0.267   0.0569    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,508,874
Number of extensions: 2502132
Number of successful extensions: 2318
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2286
Number of HSP's successfully gapped: 66
Length of query: 530
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 429
Effective length of database: 6,457,848
Effective search space: 2770416792
Effective search space used: 2770416792
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.6 bits)