RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11539
(530 letters)
>gnl|CDD|217322 pfam03022, MRJP, Major royal jelly protein. Royal jelly is the
food of queen bee larvae, and is responsible for the
high reproductive ability of the queen. Major royal
jelly proteins make up around 90% of larval jelly
proteins. This family also the sequence-related yellow
protein of drosophila which controls pigmentation of the
adult cuticle and larval mouth parts.
Length = 285
Score = 220 bits (562), Expect = 6e-68
Identities = 98/269 (36%), Positives = 150/269 (55%), Gaps = 35/269 (13%)
Query: 240 ISVFRMSVDKCDRLWIMDTGVTNILSSIQQLCPPKIMVFDLKTNTLIRKYILPTAQVFEG 299
+SV+R++VD+CDRLW++D+G+ N L +Q+CPPK++VFDL T+ L++++ LP
Sbjct: 1 VSVYRIAVDECDRLWVLDSGIVNTLQPPKQICPPKLLVFDLATDKLLKRFELPADVAKGN 60
Query: 300 SLFSNIVTEVVED-CDHVFAYVNDVFRYGLIVYDFFKNTSYRLTHPYMYPEPTQSTYILD 358
S N+V +V +D CD F Y+ D GLIVYD + S+R+ H YP+P + +
Sbjct: 61 SRLVNLVVDVGDDTCDDTFVYIADAKGRGLIVYDLADDKSWRVEHNTFYPDPDFGKFTIA 120
Query: 359 NLKFRWVDGIFGMAISPELSGYKYKRHPYEYYHYNVHHYNGTNVDKTIRDDQRYMYFHSM 418
F+ DGIFG+A+SP D R +YFH +
Sbjct: 121 GESFQLDDGIFGLALSP------------------------------RTPDGRTLYFHPL 150
Query: 419 SSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGSRFKNTQASASAINSNGVMFYNLVTKH 478
+S R Y V T LRN + + N++ ++ F LG R NTQ++A A++ NGV+F+ LV ++
Sbjct: 151 ASTRLYSVPTEVLRNETNWGNNAQYED-FKDLGDR--NTQSTALAVDPNGVLFFGLVGQN 207
Query: 479 SVGCWNTKTKVYLPQTQDIVQTSRDILNF 507
+VGCWNT + Y +V + D L F
Sbjct: 208 AVGCWNT-STPYSRANLGVVARNSDTLQF 235
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 35.3 bits (81), Expect = 0.067
Identities = 16/90 (17%), Positives = 22/90 (24%), Gaps = 2/90 (2%)
Query: 58 RPQS--QNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPA 115
PQ +E+ + P MA P P+ QP
Sbjct: 173 APQVSGLYPEESPYQPQSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGG 232
Query: 116 RPSTDQTKSFYPSVPSNGHPSVPSNAYPSS 145
RP++ P S PS
Sbjct: 233 RPNSGFPSGQQPPPSQGQEGYGYSGPPPSK 262
Score = 29.9 bits (67), Expect = 3.1
Identities = 20/94 (21%), Positives = 29/94 (30%), Gaps = 4/94 (4%)
Query: 59 PQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKN---NNQPPNTKSNAGPA 115
QSQ ++ P S P P P+ +P P Q QP + P
Sbjct: 88 QQSQAPPSHQYP-SQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQ 146
Query: 116 RPSTDQTKSFYPSVPSNGHPSVPSNAYPSSSHFQ 149
P Q +S +P + + P S
Sbjct: 147 VPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLY 180
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 34.6 bits (79), Expect = 0.16
Identities = 21/110 (19%), Positives = 30/110 (27%), Gaps = 22/110 (20%)
Query: 58 RPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPA----------MGQKNNNQPPN 107
+ Q Q + N Q + G + QP
Sbjct: 355 QQQQQQQGGNHPAAHQQQMNQ----QVGQGGQMVALGYLNIQGNQGGLGANPMQQGQPGM 410
Query: 108 TKSNAGPARPSTDQT--KSFYPSVPSNGHP------SVPSNAYPSSSHFQ 149
S + + T+Q+ + PSVPS G P SV PS
Sbjct: 411 MSSPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALM 460
Score = 32.7 bits (74), Expect = 0.55
Identities = 28/122 (22%), Positives = 34/122 (27%), Gaps = 22/122 (18%)
Query: 45 YTNNFLNVDLKKLRPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQ 104
Y N N P Q + M + P V T + M QP P+ G +
Sbjct: 388 YLNIQGNQGGLGANPMQQGQPG-MMSSPSPVPQVQTNQSMPQPP--QPSVPSPGGPGSQP 444
Query: 105 PPNTKSN-----------------AGPARPSTDQTKSFYPSVPSN--GHPSVPSNAYPSS 145
P + PA T Q P P N G SV S A P
Sbjct: 445 PQSVSGGMIPSPPALMPSPSPQMSQSPASQRTIQQDMVSPGGPLNTPGQSSVNSPANPQE 504
Query: 146 SH 147
Sbjct: 505 EQ 506
>gnl|CDD|145501 pfam02389, Cornifin, Cornifin (SPRR) family. SPRR genes (formerly
SPR) encode a novel class of polypeptides (small proline
rich proteins) that are strongly induced during
differentiation of human epidermal keratinocytes in
vitro and in vivo. The most characteristic feature of
the SPRR gene family resides in the structure of the
central segments of the encoded polypeptides that are
built up from tandemly repeated units of either eight
(SPRR1 and SPRR3) or nine (SPRR2) amino acids with the
general consensus XKXPEPXX where X is any amino acid.
Length = 153
Score = 32.7 bits (74), Expect = 0.22
Identities = 15/61 (24%), Positives = 19/61 (31%), Gaps = 2/61 (3%)
Query: 83 PMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVPSNGHPSVPSNAY 142
P P + P +P K P + +P VP GH VP Y
Sbjct: 67 TKVP-EPGRTKVPEPCCTKVPEPGYPKV-PEPGSTKVPEPGPCHPKVPEPGHTKVPEPGY 124
Query: 143 P 143
P
Sbjct: 125 P 125
Score = 29.2 bits (65), Expect = 3.5
Identities = 10/54 (18%), Positives = 15/54 (27%)
Query: 90 SQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVPSNGHPSVPSNAYP 143
+P + Q N + P + P T + VP VP
Sbjct: 22 KEPCHTKVPQPGNTKIPEPGNTKVPEPGCTKVPEPGCTKVPEPCSTKVPEPGRT 75
>gnl|CDD|135173 PRK04654, PRK04654, sec-independent translocase; Provisional.
Length = 214
Score = 33.2 bits (75), Expect = 0.25
Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 3/70 (4%)
Query: 70 PVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSV 129
PV AP + +P+AP+ P Q + PA P + PA P +
Sbjct: 141 PVPAPAPVIAQAQPIAPA-PHQTLVPAPHDTIVPAPHAAHLPSAPATPVS--VAPVDAGT 197
Query: 130 PSNGHPSVPS 139
++ PS P+
Sbjct: 198 SASPTPSEPT 207
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 33.5 bits (76), Expect = 0.32
Identities = 25/101 (24%), Positives = 33/101 (32%), Gaps = 13/101 (12%)
Query: 62 QNEKENRMPVSTQAPAVTTYRPMAPS-GPSQPMRPAMGQKNNNQPPN----TKSNAGPAR 116
Q + + P S Q P P AP PS P G QP + +
Sbjct: 170 QQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISA 229
Query: 117 PSTDQTKSFYPSVP--------SNGHPSVPSNAYPSSSHFQ 149
PS + P P + P PS+ +P SSH
Sbjct: 230 PSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHG 270
Score = 31.2 bits (70), Expect = 1.7
Identities = 19/91 (20%), Positives = 31/91 (34%), Gaps = 7/91 (7%)
Query: 56 KLRPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPA 115
+L PQSQ + V Q P +T + + P + P + Q P +
Sbjct: 407 QLMPQSQPLQ----SVPAQPPVLTQSQSLPPKASTHP--HSGLHSGPPQSPFAQHPFTSG 460
Query: 116 -RPSTDQTKSFYPSVPSNGHPSVPSNAYPSS 145
P+ S S P+ + + P S
Sbjct: 461 GLPAIGPPPSLPTSTPAAPPRASSGSQPPGS 491
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 33.1 bits (76), Expect = 0.35
Identities = 14/84 (16%), Positives = 19/84 (22%), Gaps = 2/84 (2%)
Query: 64 EKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTK 123
+ P S+ P PS PS P + P +
Sbjct: 182 ASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGP- 240
Query: 124 SFYPSVPSNGHPSVPSNAYPSSSH 147
P P S A P+
Sbjct: 241 -AAPPPPPVQQVPPLSTAKPTPPS 263
>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
(LMP2) protein. This family consists of several
Gammaherpesvirus latent membrane protein (LMP2)
proteins. Epstein-Barr virus is a human Gammaherpesvirus
that infects and establishes latency in B lymphocytes in
vivo. The latent membrane protein 2 (LMP2) gene is
expressed in latently infected B cells and encodes two
protein isoforms, LMP2A and LMP2B, that are identical
except for an additional N-terminal 119 aa cytoplasmic
domain which is present in the LMP2A isoform. LMP2A is
thought to play a key role in either the establishment
or the maintenance of latency and/or the reactivation of
productive infection from the latent state. The
significance of LMP2B and its role in pathogenesis
remain unclear.
Length = 489
Score = 33.3 bits (76), Expect = 0.38
Identities = 9/56 (16%), Positives = 16/56 (28%), Gaps = 4/56 (7%)
Query: 83 PMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVPSNGHPSVP 138
P+ GP P + ++N + + RP + H P
Sbjct: 8 PLGAGGPRSHGGPDGDEGDSNPYYPSSFGSSWDRP----GPPVPEDYDAPSHRPPP 59
>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle.
Length = 519
Score = 32.9 bits (74), Expect = 0.41
Identities = 20/81 (24%), Positives = 30/81 (37%), Gaps = 4/81 (4%)
Query: 74 QAPAVTTYRPMAPSGPSQPMR----PAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSV 129
+ PA PSG S ++ PA PP T AGPA + ++
Sbjct: 36 EMPAAKNPPQAKPSGKSPQVKAASAPAKESPQKGAPPVTPGKAGPAAAQAGEEEAKSSEE 95
Query: 130 PSNGHPSVPSNAYPSSSHFQV 150
S+ P+ A ++S Q
Sbjct: 96 ESDSEGETPTAATLTTSPAQA 116
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 33.0 bits (75), Expect = 0.52
Identities = 29/99 (29%), Positives = 36/99 (36%), Gaps = 20/99 (20%)
Query: 59 PQSQNEKENRMPVSTQAPAVTTYRPMAPSG------PSQPMRPAMGQKNNNQPPNTKSNA 112
P+S +E M AP T R A PS P R A P K+N
Sbjct: 80 PKSSSE----MEGEMLAPQATVGRDEATPSIAMENTPSPPRRTA-----KITPTTPKNNY 130
Query: 113 GPARPSTDQTKSFYPSVPSNGHPSVPSNAYPSSSHFQVV 151
P T++ K P+ PS N Y S+S Q V
Sbjct: 131 SPTAAGTERVKEDTPATPSRA-----LNHYISTSGRQRV 164
>gnl|CDD|226868 COG4461, LprI, Uncharacterized protein conserved in bacteria,
putative lipoprotein [Function unknown].
Length = 185
Score = 31.4 bits (71), Expect = 0.71
Identities = 8/39 (20%), Positives = 8/39 (20%), Gaps = 6/39 (15%)
Query: 70 PVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNT 108
S P V Y Q N P T
Sbjct: 102 DPSIDKPLVLEYGCSTQD------GALTAQFYNQFDPKT 134
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 32.1 bits (73), Expect = 0.73
Identities = 16/66 (24%), Positives = 18/66 (27%), Gaps = 9/66 (13%)
Query: 83 PMAPSGPSQPMRPA-MGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVP-SNGHPSVPSN 140
PM P GP +P A M P NA P P P
Sbjct: 425 PMGPGGPLRPNGLAPMNA--VRAPSRNAQNAAQKPPMQP-----VMYPPNYQSLPLSQDL 477
Query: 141 AYPSSS 146
P S+
Sbjct: 478 PQPQST 483
Score = 30.2 bits (68), Expect = 3.4
Identities = 14/94 (14%), Positives = 23/94 (24%), Gaps = 6/94 (6%)
Query: 57 LRPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPAR 116
Q M + P P+ P+G + A+ + N +
Sbjct: 409 FNGQPLGWPRMSMMPTPMGPG----GPLRPNGLAPM--NAVRAPSRNAQNAAQKPPMQPV 462
Query: 117 PSTDQTKSFYPSVPSNGHPSVPSNAYPSSSHFQV 150
+S S S S + QV
Sbjct: 463 MYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQV 496
>gnl|CDD|147322 pfam05078, DUF679, Protein of unknown function (DUF679). This
family contains several uncharacterized plant proteins.
Length = 169
Score = 31.1 bits (71), Expect = 0.82
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 180 NNLPLGIGIWRSTIFLSFPKWKAGIPF 206
NLPLG+G+ S +F+ FP + GI +
Sbjct: 143 TNLPLGVGVVSSFVFMVFPTTRHGIGY 169
>gnl|CDD|234974 PRK01722, PRK01722, formimidoylglutamase; Provisional.
Length = 320
Score = 31.5 bits (72), Expect = 1.2
Identities = 18/58 (31%), Positives = 19/58 (32%), Gaps = 15/58 (25%)
Query: 430 DLRNSSRYVNSS------------DIDEYFHY--LG-SRFKNTQASASAINSNGVMFY 472
DLRN SS FHY +G SR NTQA GV
Sbjct: 156 DLRNLEDGGPSSGTPFRQLLEYCDAQIRGFHYACIGVSRASNTQALWEEAKELGVTVV 213
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 31.7 bits (71), Expect = 1.2
Identities = 19/86 (22%), Positives = 35/86 (40%), Gaps = 3/86 (3%)
Query: 64 EKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTK 123
E N +T+ TT + AP P++ +PA N N +T+ T
Sbjct: 273 ENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNT---NANKTNTNTNTNTNNTN 329
Query: 124 SFYPSVPSNGHPSVPSNAYPSSSHFQ 149
+ PS +N + + +N +++ Q
Sbjct: 330 TSTPSKNTNTNTNSNTNTNSNTNANQ 355
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 31.8 bits (72), Expect = 1.2
Identities = 21/84 (25%), Positives = 31/84 (36%), Gaps = 2/84 (2%)
Query: 59 PQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAM-GQKNNNQPPNTKSNAGPARP 117
P + + +P + PAV P P GP++P RP PP + P R
Sbjct: 2725 PAAARQASPALPAAPAPPAVPA-GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL 2783
Query: 118 STDQTKSFYPSVPSNGHPSVPSNA 141
+ S S S P P++
Sbjct: 2784 TRPAVASLSESRESLPSPWDPADP 2807
Score = 30.3 bits (68), Expect = 3.7
Identities = 17/75 (22%), Positives = 26/75 (34%), Gaps = 2/75 (2%)
Query: 70 PVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPAR-PSTDQTKSFYPS 128
P P T+ +P AP P P P++ + P P+R P+ P
Sbjct: 2824 PAGPLPPP-TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPP 2882
Query: 129 VPSNGHPSVPSNAYP 143
V P+V +
Sbjct: 2883 VRRLARPAVSRSTES 2897
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 31.4 bits (71), Expect = 1.3
Identities = 18/79 (22%), Positives = 30/79 (37%), Gaps = 5/79 (6%)
Query: 70 PVSTQAPAVTTYRPMAPSGPSQPMRPAMGQK----NNNQPPNTKSNAGPARPSTDQTKSF 125
P + PA RP+A + + P R A +++ PP + A P+ Q +
Sbjct: 458 PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAA 517
Query: 126 YPSVPSNGHPSVPSNAYPS 144
+ P P+ A P
Sbjct: 518 PAGWVAESIPD-PATADPD 535
>gnl|CDD|191179 pfam05053, Menin, Menin. MEN1, the gene responsible for multiple
endocrine neoplasia type 1, is a tumour suppressor gene
that encodes a protein called Menin which may be an
atypical GTPase stimulated by nm23.
Length = 618
Score = 31.1 bits (70), Expect = 1.8
Identities = 22/83 (26%), Positives = 26/83 (31%), Gaps = 11/83 (13%)
Query: 64 EKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTK 123
E R P + PSG P P + NNN N +N G R T
Sbjct: 477 EGRRRGPRRES-------KSQEPSGGESP-NPELPANNNNSNSNNNNNNGADRKEAAATT 528
Query: 124 SFYPSVPSNGHPSVPSNAYPSSS 146
+ SNG P SS
Sbjct: 529 G-NATTTSNG-SGTSVPL-PVSS 548
>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in
biomineralisation. They seem to regulate the formation
of crystallites during the secretory stage of tooth
enamel development. thought to play a major role in the
structural organisation and mineralisation of developing
enamel. They are found in the extracellular matrix.
Mutations in X-chromosomal amelogenin can cause
Amelogenesis imperfecta.
Length = 174
Score = 29.9 bits (67), Expect = 2.0
Identities = 22/79 (27%), Positives = 32/79 (40%), Gaps = 6/79 (7%)
Query: 59 PQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPS 118
PQ Q ++ Q + +PM P PM P GQ+ + QPP P +P+
Sbjct: 58 PQQQQSAHPKLTPHHQLLILPPQQPMMPVPGHHPMVPMTGQQPHLQPPAQH----PLQPT 113
Query: 119 TDQTKSFYPSVPSNGHPSV 137
Q P P++ P V
Sbjct: 114 YGQNPQ--PQQPTHTQPPV 130
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 30.6 bits (68), Expect = 2.1
Identities = 21/76 (27%), Positives = 31/76 (40%), Gaps = 2/76 (2%)
Query: 71 VSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVP 130
V+ APA AP+ P + PA + PP+T+ P P K + P
Sbjct: 107 VTCPAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQ--CPPAPPLPTPKPAPAAKP 164
Query: 131 SNGHPSVPSNAYPSSS 146
H +P YP++S
Sbjct: 165 IFLHNQLPPPDYPAAS 180
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 30.3 bits (69), Expect = 2.7
Identities = 17/74 (22%), Positives = 23/74 (31%), Gaps = 10/74 (13%)
Query: 66 ENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSF 125
E ++ P RP AP+ P+ P PA + A PA S
Sbjct: 598 EGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAG----------AAAAPAEASAAPAPGV 647
Query: 126 YPSVPSNGHPSVPS 139
H +VP
Sbjct: 648 AAPEHHPKHVAVPD 661
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 30.4 bits (68), Expect = 3.3
Identities = 18/111 (16%), Positives = 30/111 (27%), Gaps = 6/111 (5%)
Query: 35 DWEKELFNTNYTNNFLNVDLKKLRPQSQNEKENRM----PVSTQAPAVTTYRPMAPSGPS 90
++ T Y + + +P P + Q P P+ P
Sbjct: 652 HQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPP 711
Query: 91 QPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVPSNGHPSVPSNA 141
+ +PP + G ARP P + G P+ A
Sbjct: 712 GRAQRPAAATGRARPP--AAAPGRARPPAAAPGRARPPAAAPGRARPPAAA 760
Score = 29.3 bits (65), Expect = 7.1
Identities = 18/74 (24%), Positives = 25/74 (33%), Gaps = 2/74 (2%)
Query: 68 RMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYP 127
R P + PA R P+ + RP +PP + G ARP P
Sbjct: 699 RAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPP--AAAPGRARPPAAAPGRARP 756
Query: 128 SVPSNGHPSVPSNA 141
+ G P+ A
Sbjct: 757 PAAAPGRARPPAAA 770
>gnl|CDD|238803 cd01569, PBEF_like, pre-B-cell colony-enhancing factor (PBEF)-like.
The mammalian members of this group of nicotinate
phosphoribosyltransferases (NAPRTases) were originally
identified as genes whose expression is upregulated upon
activation in lymphoid cells. In general, nicotinate
phosphoribosyltransferase catalyses the formation of
NAMN and PPi from 5-phosphoribosy -1-pyrophosphate
(PRPP) and nicotinic acid, this is the first, and also
rate limiting, reaction in the NAD salvage synthesis.
Length = 407
Score = 29.9 bits (68), Expect = 3.4
Identities = 7/18 (38%), Positives = 11/18 (61%)
Query: 338 SYRLTHPYMYPEPTQSTY 355
SY+++H YP T + Y
Sbjct: 6 SYKVSHRLQYPPGTTAVY 23
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 30.1 bits (68), Expect = 3.5
Identities = 12/63 (19%), Positives = 20/63 (31%), Gaps = 2/63 (3%)
Query: 86 PSGPSQPMRPAMGQKNNNQPPNTKSNAGPARP--STDQTKSFYPSVPSNGHPSVPSNAYP 143
GP Q ++P Q P+ + A P+ S S S + P
Sbjct: 372 GRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDP 431
Query: 144 SSS 146
++
Sbjct: 432 PAA 434
>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
protein.
Length = 314
Score = 29.9 bits (67), Expect = 3.6
Identities = 15/64 (23%), Positives = 21/64 (32%), Gaps = 4/64 (6%)
Query: 62 QNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQ---KNNNQPPNTKSNAGPARPS 118
+K P T + + P QP P KN N P+ K+ A S
Sbjct: 94 NLKKSVPRPSDTTL-SCANLSSLREPSPGQPPAPPPSVLSGKNANCIPSQKNAPSIAITS 152
Query: 119 TDQT 122
T +
Sbjct: 153 TGGS 156
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 30.0 bits (67), Expect = 3.7
Identities = 28/112 (25%), Positives = 39/112 (34%), Gaps = 9/112 (8%)
Query: 68 RMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYP 127
R+ ST +P+ T PSG + PA + P + S R T S
Sbjct: 269 RVGDSTSSPSTTR-----PSGSTTTTTPASSGPSAPGGPGSSSRNAVTR----STDSISG 319
Query: 128 SVPSNGHPSVPSNAYPSSSHFQVVFRWKIMDFVFRDNKQKVNLIRSNKFIPE 179
+PS G P + F V F R + N+ + KFI E
Sbjct: 320 PIPSPGAPRAITGQMGEREMFAVQFLGDFKPKPRRYEGDRTNVAKLKKFIFE 371
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 29.7 bits (67), Expect = 3.8
Identities = 14/77 (18%), Positives = 19/77 (24%), Gaps = 4/77 (5%)
Query: 70 PVSTQAPAVTTYRPMAPSGPSQ----PMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSF 125
ST A T P + P N A A P K
Sbjct: 170 NTSTLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPK 229
Query: 126 YPSVPSNGHPSVPSNAY 142
+ + S P++ Y
Sbjct: 230 SGAASARALSSAPASHY 246
>gnl|CDD|217106 pfam02559, CarD_CdnL_TRCF, CarD-like/TRCF domain. CarD is a
Myxococcus xanthus protein required for the activation
of light- and starvation-inducible genes. This family
includes the presumed N-terminal domain, CdnL. CarD
interacts with the zinc-binding protein CarG to form a
complex that regulates multiple processes in Myxococcus
xanthus. This family also includes a domain to the
N-terminal side of the DEAD helicase of TRCF
(transcription-repair-coupling factor) proteins. TRCF
displaces RNA polymerase stalled at a lesion, binds to
the damage recognition protein UvrA, and increases the
template strand repair rate during transcription. This
domain is involved in binding to the stalled RNA
polymerase. The family includes members otherwise
referred to as CdnL, for CarD N-terminal like,
whichdiffer functionally from CarD. The TRCF domain
mentioned above is the RNA polymerase-interacting domain
or RID.
Length = 98
Score = 27.9 bits (63), Expect = 4.4
Identities = 11/49 (22%), Positives = 18/49 (36%)
Query: 404 KTIRDDQRYMYFHSMSSNRHYYVSTTDLRNSSRYVNSSDIDEYFHYLGS 452
+ ++R Y + YV +L RY+ S D + LG
Sbjct: 24 IEVGGEKREYYVLEYADGDKLYVPVDNLDLIGRYIGSKDEEPVLDKLGG 72
>gnl|CDD|152703 pfam12268, DUF3612, Protein of unknown function (DUF3612). This
domain family is found in bacteria, and is approximately
180 amino acids in length. The family is found in
association with pfam01381.
Length = 178
Score = 29.0 bits (65), Expect = 4.6
Identities = 10/21 (47%), Positives = 14/21 (66%)
Query: 224 PYPNWSYFDDSNCNSLISVFR 244
PYP+W YFD + L +V+R
Sbjct: 5 PYPHWHYFDAYSPGKLKAVYR 25
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 29.8 bits (66), Expect = 5.0
Identities = 14/85 (16%), Positives = 27/85 (31%), Gaps = 9/85 (10%)
Query: 71 VSTQAPAVTTYRPMAPSGPS---QPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYP 127
++ P V A S P+ Q P N + + T +
Sbjct: 525 ATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTT 584
Query: 128 ------SVPSNGHPSVPSNAYPSSS 146
S P++ P +PS+++ +
Sbjct: 585 GQHGTGSSPTSQQPGIPSSSHSTPR 609
>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
factor N terminal domain. The N terminus of the PEA3
transcription factors is implicated in transactivation
and in inhibition of DNA binding. Transactivation is
potentiated by activation of the Ras/MAP kinase and
protein kinase A signalling cascades. The N terminal
region contains conserved MAP kinase phosphorylation
sites.
Length = 336
Score = 29.5 bits (66), Expect = 5.1
Identities = 22/76 (28%), Positives = 29/76 (38%), Gaps = 5/76 (6%)
Query: 74 QAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPSTDQTKSFYPSVPSNG 133
+ PA P PS P P+ P + QP SN+ P P T P +
Sbjct: 124 RKPASGFKPPTPPSTPCSPVNPQETVR-QLQPSGPLSNSSPPSPHTPLPNQSPLPPPMS- 181
Query: 134 HPSVPSNAYPSSSHFQ 149
P ++YPS FQ
Sbjct: 182 ---SPDSSYPSEHRFQ 194
>gnl|CDD|236407 PRK09198, PRK09198, putative nicotinate phosphoribosyltransferase;
Provisional.
Length = 463
Score = 29.5 bits (67), Expect = 5.5
Identities = 7/18 (38%), Positives = 9/18 (50%)
Query: 338 SYRLTHPYMYPEPTQSTY 355
SY+ +H YP T Y
Sbjct: 13 SYKASHRLQYPPGTTYLY 30
>gnl|CDD|217990 pfam04257, Exonuc_V_gamma, Exodeoxyribonuclease V, gamma subunit.
The Exodeoxyribonuclease V enzyme is a multi-subunit
enzyme comprised of the proteins RecB, RecC (this
family) and RecD. This enzyme plays an important role in
homologous genetic recombination, repair of double
strand DNA breaks resistance to UV irradiation and
chemical DNA-damage. The enzyme (EC:3.1.11.5) catalyzes
ssDNA or dsDNA-dependent ATP hydrolysis, hydrolysis of
ssDNA or dsDNA and unwinding of dsDNA. This family
consists of two AAA domains.
Length = 756
Score = 29.1 bits (66), Expect = 6.4
Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 3/65 (4%)
Query: 403 DKTIRDDQRYMYFHSMSSNR-HYYVSTT--DLRNSSRYVNSSDIDEYFHYLGSRFKNTQA 459
D++ RDD RY++ ++ S R H Y+S +R++S S + E YL F
Sbjct: 631 DRSRRDDDRYLFLEALLSAREHLYISYVGRSIRDNSERPPSVLVSELLDYLAQNFYLGDE 690
Query: 460 SASAI 464
+ A+
Sbjct: 691 ALKAL 695
>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
Length = 222
Score = 28.5 bits (63), Expect = 6.7
Identities = 19/79 (24%), Positives = 29/79 (36%), Gaps = 5/79 (6%)
Query: 53 DLKKLRP-----QSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPN 107
DL +L+P +S+N+K+ + QP Q+ QP N
Sbjct: 137 DLNRLKPHMMKTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNN 196
Query: 108 TKSNAGPARPSTDQTKSFY 126
K A S D+T+ Y
Sbjct: 197 NKRKAMEEGLSPDRTRPRY 215
>gnl|CDD|218116 pfam04503, SSDP, Single-stranded DNA binding protein, SSDP. This
is a family of eukaryotic single-stranded DNA binding
proteins with specificity to a pyrimidine-rich element
found in the promoter region of the alpha2(I) collagen
gene.
Length = 293
Score = 28.9 bits (64), Expect = 7.0
Identities = 21/78 (26%), Positives = 31/78 (39%), Gaps = 3/78 (3%)
Query: 58 RPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARP 117
RP N +P S+ +P T P P P+ P+ N+ N + P P
Sbjct: 166 RPWPNPISANSIPYSSSSPGEYTGPPGGGGPPGTPIMPSPADSTNSS-DNMYTVMNPVPP 224
Query: 118 STDQTKSFYPSVPSNGHP 135
QT+S +P P + P
Sbjct: 225 G--QTRSNFPMGPGSDGP 240
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 28.9 bits (64), Expect = 7.4
Identities = 12/55 (21%), Positives = 21/55 (38%)
Query: 55 KKLRPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTK 109
K+ + Q+ E T P+ T +PM + P P + Q + P +
Sbjct: 115 KQEQSQTTTESTTPKTKVTTPPSTNTPQPMQSTKSDTPQSPTIKQAQTDMTPKYE 169
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 29.0 bits (65), Expect = 7.9
Identities = 10/91 (10%), Positives = 13/91 (14%), Gaps = 1/91 (1%)
Query: 59 PQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAGPARPS 118
PQ + P Q P +P P P Q N
Sbjct: 246 PQFPGLSQQMPPPPPQPPQQQQ-QPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQL 304
Query: 119 TDQTKSFYPSVPSNGHPSVPSNAYPSSSHFQ 149
+
Sbjct: 305 LPLVQQPQGQQRGPQFREQLVQLSQQQREAL 335
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 28.9 bits (64), Expect = 8.7
Identities = 26/153 (16%), Positives = 52/153 (33%), Gaps = 15/153 (9%)
Query: 3 TSNINMAAPNLETDTMVNA--NPDN----TRMVNPNFVD-WEKELFNTNYTNNFLNVDLK 55
IN+ N + N N D+ + + F++ WE E + ++ N + D
Sbjct: 20 ADGININGDNKIDKHIENEDFNFDDINSWMPIDDAEFLNSWEDEFSDESFLENKASKDDG 79
Query: 56 KLRPQSQNEKENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTKSNAG-P 114
+ +E + A + S + + K++ + + AG P
Sbjct: 80 NINLTDTSE-------NGDASKKGHGKSRVRSASAAAILEEDDSKDDMEFKANPNEAGKP 132
Query: 115 ARPSTDQTKSFYPSVPSNGHPSVPSNAYPSSSH 147
+P +Q + S S S + +S H
Sbjct: 133 GKPKGNQGEGLASSSDGKSKASAKSGSKSASKH 165
>gnl|CDD|221509 pfam12287, Caprin-1_C, Cytoplasmic
activation/proliferation-associated protein-1 C term.
This family of proteins is found in eukaryotes. Proteins
in this family are typically between 343 and 708 amino
acids in length. This family is the C terminal region of
caprin-1. Caprin-1 is a protein involved in regulating
cellular proliferation. In mutated phenotypes, the G1
phase of the cell cycle is greatly lengthened, impairing
normal proliferation. The C terminal region of caprin-1
contains RGG motifs which are characteristic of RNA
binding domains. It is possible that caprin-1 functions
through an RNA binding mechanism.
Length = 319
Score = 28.4 bits (63), Expect = 9.2
Identities = 23/86 (26%), Positives = 36/86 (41%), Gaps = 8/86 (9%)
Query: 66 ENRMPVSTQAPAVTTYRPMAPSGPSQPMRPAMGQKNNNQPPNTK-SNAGPARPSTDQTKS 124
E+R+ Q AV P+ P PM + + PP + S+ RP TD
Sbjct: 48 ESRLS---QPSAV----PVQPEPTQVPMVSPTSEGYTSSPPLYQPSHTAEPRPQTDPIDP 100
Query: 125 FYPSVPSNGHPSVPSNAYPSSSHFQV 150
S+ N + S++ P++S QV
Sbjct: 101 IQASMSLNSEQTPTSSSLPAASQPQV 126
>gnl|CDD|227364 COG5031, COQ4, Uncharacterized protein involved in ubiquinone
biosynthesis [Coenzyme metabolism].
Length = 235
Score = 28.2 bits (63), Expect = 9.5
Identities = 14/76 (18%), Positives = 27/76 (35%), Gaps = 8/76 (10%)
Query: 198 PKWKAGIPFTLASFNMNDPSESPILLPYPNWSYFDDSNCNSLISVFRMSVDKCDRLWIMD 257
P+ A + L + P S L+ Y + LI+ K + LW +
Sbjct: 165 PRSVALLAKNLLPLRLYQPELSRQLMDYIPRGWRMGLQAKPLIAQ------KWEELW--E 216
Query: 258 TGVTNILSSIQQLCPP 273
+ + + + + PP
Sbjct: 217 KPLEEVRTELNIILPP 232
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.133 0.408
Gapped
Lambda K H
0.267 0.0569 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,508,874
Number of extensions: 2502132
Number of successful extensions: 2318
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2286
Number of HSP's successfully gapped: 66
Length of query: 530
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 429
Effective length of database: 6,457,848
Effective search space: 2770416792
Effective search space used: 2770416792
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.6 bits)