RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9595
(232 letters)
>gnl|CDD|128818 smart00545, JmjN, Small domain found in the jumonji family of
transcription factors. To date, this domain always
co-occurs with the JmjC domain (although the reverse is
not true).
Length = 42
Score = 59.2 bits (144), Expect = 3e-12
Identities = 22/43 (51%), Positives = 27/43 (62%), Gaps = 1/43 (2%)
Query: 72 EIMVFRPTYEEFKDFSKFITYMESQGAHKAGLAKVIPPPEWVP 114
EI VF PT EEFKD +I+ + Q A K G+ KV+PP W P
Sbjct: 1 EIPVFYPTMEEFKDPLAYISKIRPQ-AEKYGICKVVPPKSWKP 42
>gnl|CDD|190295 pfam02375, JmjN, jmjN domain.
Length = 34
Score = 57.0 bits (139), Expect = 1e-11
Identities = 18/35 (51%), Positives = 22/35 (62%), Gaps = 1/35 (2%)
Query: 74 MVFRPTYEEFKDFSKFITYMESQGAHKAGLAKVIP 108
VFRPT EEFKD K+I + G K G+ KV+P
Sbjct: 1 PVFRPTMEEFKDPIKYIEKIRPLGE-KYGICKVVP 34
>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
structure and dynamics].
Length = 395
Score = 33.2 bits (76), Expect = 0.10
Identities = 16/80 (20%), Positives = 20/80 (25%), Gaps = 11/80 (13%)
Query: 12 PPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQERKH 71
P KK+T+ K K R V E S E ++Q R
Sbjct: 66 PKRKKQTEKGKKEKKPKVSDRMDLDNENVQLEMLYSISNER-----------EIRQLRFG 114
Query: 72 EIMVFRPTYEEFKDFSKFIT 91
V P I
Sbjct: 115 GSKVQNPHEGARVKNINEIK 134
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 32.2 bits (74), Expect = 0.14
Identities = 15/57 (26%), Positives = 22/57 (38%)
Query: 14 AKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQERK 70
KK +AA + A ++ R P K K K + E + + L V Q K
Sbjct: 118 QAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVK 174
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 32.0 bits (73), Expect = 0.15
Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 4/50 (8%)
Query: 5 SEDFEPSPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGE 54
ED E P + K K T +KP T K K KEK ++
Sbjct: 40 PEDIEIPSPKQPKKKRPTT----PRKPATTKKSKKKDKEKLTEEEKKPES 85
>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
Length = 290
Score = 31.2 bits (71), Expect = 0.34
Identities = 13/37 (35%), Positives = 16/37 (43%), Gaps = 3/37 (8%)
Query: 3 SSSEDFEPSPPAKKKTKAAATGKGAKKKPRTPSKKTK 39
SSSE P AK K K A + K +T K +
Sbjct: 239 SSSEA---KPKAKAKPKTAGIKRPVVKMEKTAEKGGR 272
>gnl|CDD|226382 COG3864, COG3864, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 396
Score = 31.0 bits (70), Expect = 0.54
Identities = 14/73 (19%), Positives = 28/73 (38%), Gaps = 1/73 (1%)
Query: 154 QMTVREYGAMANKPRYATPKHFDYEDLERKYWKNITYVSPIYGADVSGSITDEDVNVWNI 213
QM ++ + K T D + + + + D SGS+TD +++ +
Sbjct: 225 QMYLKRFIGTLPKDYKKTITRKDRRQPYLRGKLSEHLIKIVVAVDTSGSMTDAEIDAA-M 283
Query: 214 NHLGTILDYVNED 226
+ IL N +
Sbjct: 284 TEIFDILKNKNYE 296
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 30.6 bits (69), Expect = 0.59
Identities = 18/55 (32%), Positives = 26/55 (47%), Gaps = 4/55 (7%)
Query: 9 EPSPPAKK--KTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKT 61
+ AKK K KAA T AKK +T +K K K+ A+ ++ K+ K
Sbjct: 218 KAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALK--KAAKAVKKA 270
Score = 29.9 bits (67), Expect = 1.1
Identities = 14/60 (23%), Positives = 19/60 (31%)
Query: 11 SPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQERK 70
K K A K A K + +KK KA G + + K+ K K
Sbjct: 247 KAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAG 306
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 30.6 bits (69), Expect = 0.65
Identities = 16/36 (44%), Positives = 18/36 (50%), Gaps = 1/36 (2%)
Query: 3 SSSEDFEPSPPAKKKTK-AAATGKGAKKKPRTPSKK 37
S SED PP K K K + A KKP+ P KK
Sbjct: 380 SFSEDEAEPPPTKPKPKVSTPAVPAAAKKPKAPKKK 415
Score = 27.9 bits (62), Expect = 5.8
Identities = 15/64 (23%), Positives = 21/64 (32%), Gaps = 7/64 (10%)
Query: 10 PSPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQER 69
+P A K A GK KP +V + A+ + T K K E
Sbjct: 146 VAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAKK-------TQDTSKETTTEKTEG 198
Query: 70 KHEI 73
K +
Sbjct: 199 KTSV 202
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation
factor 3 (eIF3). In yeast it is called Hcr1. The
Saccharomyces cerevisiae protein eIF3j (HCR1) has been
shown to be required for processing of 20S pre-rRNA and
binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 30.0 bits (68), Expect = 0.75
Identities = 16/53 (30%), Positives = 24/53 (45%)
Query: 4 SSEDFEPSPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVK 56
S ++ E ++K K AA K K +K K +EK E G++E E
Sbjct: 36 SWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDT 88
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 30.5 bits (69), Expect = 0.93
Identities = 11/40 (27%), Positives = 16/40 (40%), Gaps = 1/40 (2%)
Query: 1 GASSSEDFEP-SPPAKKKTKAAATGKGAKKKPRTPSKKTK 39
+FEP S +K K +K + PS KT+
Sbjct: 756 QVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQ 795
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 30.1 bits (68), Expect = 1.0
Identities = 19/89 (21%), Positives = 30/89 (33%), Gaps = 9/89 (10%)
Query: 15 KKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQERKHEIM 74
K KK KK K K++ +S + + K+ K+ K+ K +
Sbjct: 53 STKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPK-- 110
Query: 75 VFRPTYEEFKDFSKFITYMESQ----GAH 99
P + F+K E GAH
Sbjct: 111 ---PNEDVDNAFNKIAELAEKSNVYIGAH 136
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 29.5 bits (66), Expect = 1.8
Identities = 26/107 (24%), Positives = 34/107 (31%), Gaps = 7/107 (6%)
Query: 17 KTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQE-RKHEIMV 75
K K KK K K +K +A V+ + E K + + K
Sbjct: 111 KKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEER 170
Query: 76 FRPTYEEF--KDFSKFITYMESQGAHKA----GLAKVIPPPEWVPRK 116
F+ E F K F + I A GL KV E V R
Sbjct: 171 FKEYLENFKRKKFKRKILKEFENAKINASEIVGLEKVKEKIEIVFRH 217
>gnl|CDD|219549 pfam07744, SPOC, SPOC domain. The SPOC (Spen paralogue and
orthologue C-terminal) domain is involved in
developmental signalling.
Length = 109
Score = 28.0 bits (63), Expect = 1.8
Identities = 12/45 (26%), Positives = 21/45 (46%), Gaps = 5/45 (11%)
Query: 72 EIMVFRPTY---EEFKDFSKFITYMESQGAHKAGLAKVIPPPEWV 113
++V + + F + I Y++S+ AG+AKV P V
Sbjct: 60 AVVVLALSPDSESDRAAFDELIDYLQSKQR--AGVAKVGDPGSQV 102
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 29.4 bits (67), Expect = 1.9
Identities = 9/37 (24%), Positives = 12/37 (32%)
Query: 11 SPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAES 47
+ A A KK S+KTK +E
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 28.8 bits (64), Expect = 2.3
Identities = 19/73 (26%), Positives = 27/73 (36%), Gaps = 7/73 (9%)
Query: 11 SPPAKKKTKAAAT--GKGAKKKPRTPS-----KKTKVVKEKAESGVQEEGEVKSTIKTLL 63
+ KK K +T K A K +T S KK V EK ES +E K+ K +
Sbjct: 29 TAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAV 88
Query: 64 PVKQERKHEIMVF 76
+ +
Sbjct: 89 SAEVFEASNKLFK 101
>gnl|CDD|235860 PRK06778, PRK06778, hypothetical protein; Validated.
Length = 289
Score = 28.7 bits (64), Expect = 2.6
Identities = 16/52 (30%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
Query: 11 SPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKSTIKTL 62
SP K A + +KK R ++KT ++K+K+ + E GE+ ++I T+
Sbjct: 76 SPSHPPKPATVAVPEETEKKARDVNEKTALLKKKSAT---ELGELATSINTI 124
>gnl|CDD|220222 pfam09403, FadA, Adhesion protein FadA. FadA (Fusobacterium
adhesin A) is an adhesin which forms two alpha helices.
Length = 126
Score = 27.6 bits (61), Expect = 2.9
Identities = 15/68 (22%), Positives = 30/68 (44%), Gaps = 2/68 (2%)
Query: 36 KKTKVVKEKAESGVQEEGEVKSTIKTLLPVKQERKHEIMVFRPTYEEF-KDFSKFITYME 94
++ K KE AE VQE E + + L +E + + Y++ K + +++ +E
Sbjct: 48 EEEKQEKETAEKEVQELKERQLGREELYAKLKEES-KTRFHKDEYKKLLKRYDEYLNKLE 106
Query: 95 SQGAHKAG 102
+ K
Sbjct: 107 QKITDKEQ 114
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 28.8 bits (64), Expect = 3.0
Identities = 28/97 (28%), Positives = 44/97 (45%), Gaps = 16/97 (16%)
Query: 14 AKKKTKAAATGKGAKKKPRTPS-------KKTKVVKEKAESGVQEEGEVKSTIKTL--LP 64
A+ KTK A K + K PS K +V+KEK V E+ + K + L
Sbjct: 251 ARAKTKGAEGKKEEEPKQEEPSVNQPYLNKYVEVIKEKLGEDVLEDSYINKLSKDVPTLV 310
Query: 65 VKQERKHEIMVFRPTYEE--FKDFSK-----FITYME 94
V+ E+ +E+ +E+ F S+ F+T+ME
Sbjct: 311 VEPEKYYEVAELLRFHEQLAFDYLSELHGTDFVTHME 347
>gnl|CDD|234336 TIGR03734, PRTRC_parB, PRTRC system ParB family protein. A novel
genetic system characterized by six major proteins,
included a ParB homolog and a ThiF homolog, is
designated PRTRC, or ParB-Related,ThiF-Related Cassette.
It is often found on plasmids. This protein family the
member related to ParB, and is designated PRTRC system
ParB family protein.
Length = 554
Score = 28.5 bits (64), Expect = 3.2
Identities = 10/46 (21%), Positives = 11/46 (23%)
Query: 12 PPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESGVQEEGEVKS 57
PA K S K +KA Q VK
Sbjct: 334 KPAAPAAGPGTPAKEKSPAETATSGAAKPAAKKAVPSSQPSNRVKD 379
>gnl|CDD|239002 cd02047, HCII, Heparin cofactor II (HCII) inhibits thrombin, the
final protease of the coagulation cascade. HCII is
allosterically activated by binding to cell surface
glycosaminoglycans (GAGs). The specificity of HCII for
thrombin is conferred by a highly acidic hirudin-like
N-terminal tail, which becomes available after GAG
binding for interaction with the anion-binding exosite I
of thrombin. This subgroup corresponds to clade D of the
serpin superfamily.
Length = 436
Score = 28.6 bits (64), Expect = 3.2
Identities = 22/108 (20%), Positives = 46/108 (42%), Gaps = 15/108 (13%)
Query: 125 IGDMSIPAPICQVVTGKQGLYQQINIQKRQMTVREYGAMANK------PRYATPKHFDYE 178
+G++S+ + ++G + L +QI Q+ R +M N+ P++ K+++
Sbjct: 289 VGNISMLIVVPHKLSGMKTLEKQIT---PQVVERWQKSMTNRTREVVLPKFKLEKNYNLI 345
Query: 179 DLERKYWKNITYVSPIYGADVSGSITDEDVNVWNINHLGTILDYVNED 226
+ K + ++DE + + H GTI VNE+
Sbjct: 346 ----ESLKLMGITDLFTEKGNMAGVSDEKIAIDLFKHQGTIT--VNEE 387
>gnl|CDD|220305 pfam09598, Stm1_N, Stm1. This region is found at the N terminal
of the Stm1 protein. Stm1 is a G4 quadraplex and purine
motif triplex nucleic acid-binding protein. It has been
implicated in many biological processes including
apoptosis and telomere biosynthesis. Stm1 is known to
interact with CDC13, and is known to associate with
ribosomes and nuclear telomere cap complexes.
Length = 66
Score = 26.3 bits (58), Expect = 3.7
Identities = 7/34 (20%), Positives = 11/34 (32%), Gaps = 1/34 (2%)
Query: 7 DFEPSPPAKKKTKAAATGKGAKKKPRTPSKKTKV 40
P+PP K+ K + + P P
Sbjct: 14 SRVPAPPTKEVVKPSTS-SKKADVPGAPPSPAPP 46
>gnl|CDD|211420 cd11579, Glyco_tran_WbsX, Glycosyl hydrolase family 99-like domain
of WbsX-like glycosyltransferases. Members of this
domain family are found in proteins within O-antigen
biosynthesis clusters in Gram negative bacteria, where
they may function as glycosyl hydrolases and typically
co-occur with glycosyltransferase domains. They bear
resemblance to GH71 and the GH99 family of
alpha-1,2-mannosidases and may share a similar cataltyic
site and mechanism. The O-antigens are essential
lipopolysaccharides in gram-negative bacteria's outer
membrane and have been linked to pathogenicity.
Length = 347
Score = 28.3 bits (64), Expect = 3.8
Identities = 26/149 (17%), Positives = 47/149 (31%), Gaps = 37/149 (24%)
Query: 59 IKTLLP-VKQER--KHE----IMVFRPTYEEFKDFSKFITYMESQGAHKAG-----LAKV 106
+ LLP K R K + +++RP E D + + + A +AG L V
Sbjct: 142 FRYLLPYFKDPRYIKIDGKPVFLIYRP--ELIPDPKETLELWR-ELAREAGLPGLYLVAV 198
Query: 107 IPPPEWVPRKSGYSLDSSIGDMSIPAPICQVVTGKQGLYQQINIQKRQMTVREYGAMANK 166
PR G+ D+++ + + +++ K
Sbjct: 199 ASFGIEDPRDLGF--DAAV------------------EFPPHYVVASELSKSRKLLKLKK 238
Query: 167 PRYATPKHFDYEDLERKYWKNITYVSPIY 195
+DY+DL + P Y
Sbjct: 239 GFG--GNVYDYDDLVKYILNRPDPDYPYY 265
>gnl|CDD|222666 pfam14307, Glyco_tran_WbsX, Glycosyltransferase WbsX. Members of
this family are found in within O-antigen biosynthesis
clusters in Gram negative bacteria, where they are
predicted to function as glycosyltransferases.
Length = 345
Score = 27.9 bits (63), Expect = 4.8
Identities = 27/152 (17%), Positives = 49/152 (32%), Gaps = 39/152 (25%)
Query: 59 IKTLLPV-KQER--KHE----IMVFRPTYEEFKDFSKFITYMESQGAHKAGLAKV----- 106
+ LLP K R K + +++RP E+ D +FI + A +AGL +
Sbjct: 141 FRYLLPYFKDPRYIKVDGKPVFLIYRP--EDIPDPKEFIDLWR-ELAKEAGLPGLYLVAV 197
Query: 107 ---IPPPEWVPRKSGYSLDSSIGDMSIPAPICQVVTGKQGLYQQINIQKRQMTVREYGAM 163
PR G+ D ++ P GK+ + + +
Sbjct: 198 INSFEEEYKDPRDLGF-------DAAVEFPPHYANKGKKLPNKYVRLLN----------- 239
Query: 164 ANKPRYATPKHFDYEDLERKYWKNITYVSPIY 195
++ +DY+ + P Y
Sbjct: 240 ---KKFGKGDVYDYDKVVEYILNRPAPDYPYY 268
>gnl|CDD|136022 PRK06728, PRK06728, aspartate-semialdehyde dehydrogenase;
Provisional.
Length = 347
Score = 27.7 bits (61), Expect = 5.5
Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 6/47 (12%)
Query: 41 VKEKAESGVQEEGEVKSTIKTLLPVKQERKHEIMVFR--PTYEEFKD 85
+KE+A+S + E EV+STI LP K+++KH + F P + F D
Sbjct: 172 LKEQAKSILAGE-EVESTI---LPAKKDKKHYPIAFNVLPQVDIFTD 214
>gnl|CDD|218735 pfam05760, IER, Immediate early response protein (IER). This
family consists of several eukaryotic immediate early
response (IER) 2 and 5 proteins. The role of IER5 is
unclear although it play an important role in mediating
the cellular response to mitogenic signals. Again,
little is known about the function of IER2 although it
is thought to play a role in mediating the cellular
responses to a variety of extracellular signals.
Length = 272
Score = 27.6 bits (61), Expect = 6.5
Identities = 16/66 (24%), Positives = 28/66 (42%), Gaps = 10/66 (15%)
Query: 4 SSEDFEPSPPAKKKTKAAATGKG----------AKKKPRTPSKKTKVVKEKAESGVQEEG 53
+ED +P+PPA K K ++G G K+ R + + E ++ E G
Sbjct: 160 QAEDGQPAPPAVCKRKRRSSGYGSDSGEASGFVPCKRARREDFEPESGGESEDAEEMETG 219
Query: 54 EVKSTI 59
+ + I
Sbjct: 220 NISNLI 225
>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
Length = 463
Score = 27.4 bits (60), Expect = 6.9
Identities = 12/34 (35%), Positives = 17/34 (50%)
Query: 2 ASSSEDFEPSPPAKKKTKAAATGKGAKKKPRTPS 35
+ D +PSPPA+ K K +KP+ PS
Sbjct: 181 IPETTDPKPSPPAEDKQKPKVESAPVAEKPKAPS 214
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 27.8 bits (61), Expect = 7.1
Identities = 17/45 (37%), Positives = 23/45 (51%)
Query: 2 ASSSEDFEPSPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAE 46
A +E+ + + AKKK + A AKKK KK K+KAE
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAE 1339
>gnl|CDD|234007 TIGR02776, NHEJ_ligase_prk, DNA ligase D. Members of this protein
family are DNA ligases involved in the repair of DNA
double-stranded breaks by non-homologous end joining
(NHEJ). The system of the bacterial Ku protein
(TIGR02772) plus this DNA ligase is seen in about 20 %
of bacterial genomes to date and at least one archaeon
(Archeoglobus fulgidus). This model describes a central
and a C-terminal domain. These two domains may be
permuted, as in genus Mycobacterium, or divided into
tandem ORFs, and therefore not be identified by This
model. An additional N-terminal 3'-phosphoesterase (PE)
domain present in some but not all examples of this
ligase is not included in the seed alignment for this
model; This model models only the central ATP-dependent
ligase domain and the C-terminal polymerase domain. Most
examples of genes for this ligase are adjacent to the
gene for Ku [DNA metabolism, DNA replication,
recombination, and repair].
Length = 552
Score = 27.3 bits (61), Expect = 8.2
Identities = 8/21 (38%), Positives = 12/21 (57%), Gaps = 1/21 (4%)
Query: 77 RPTYEEFKDFSKFI-TYMESQ 96
T++E K F+K I Y+ Q
Sbjct: 447 TFTWDETKLFAKAIAEYLARQ 467
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 26.9 bits (60), Expect = 8.5
Identities = 12/47 (25%), Positives = 19/47 (40%), Gaps = 3/47 (6%)
Query: 2 ASSSEDFEPSPPAKKKTKAAATGKGAKKKPRTPSKKTKVVKEKAESG 48
+ + A+KK K + + KKK SK K +K+ G
Sbjct: 236 ETEEDVEAAESRAEKKRK---SKEEIKKKKPKESKGVKALKKVVAKG 279
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 27.5 bits (61), Expect = 8.6
Identities = 14/37 (37%), Positives = 18/37 (48%), Gaps = 1/37 (2%)
Query: 10 PSPPAKKKTKAAATGKGAK-KKPRTPSKKTKVVKEKA 45
P P +KKTKA K K +T KK+K +K
Sbjct: 743 PKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKT 779
>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
topoisomerase IA [General function prediction only].
Length = 298
Score = 27.0 bits (60), Expect = 9.2
Identities = 7/24 (29%), Positives = 9/24 (37%)
Query: 16 KKTKAAATGKGAKKKPRTPSKKTK 39
K A A K +T +KK
Sbjct: 260 GPGKKPAKKATAAKAKKTTAKKAA 283
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 26.0 bits (57), Expect = 9.9
Identities = 14/37 (37%), Positives = 17/37 (45%), Gaps = 2/37 (5%)
Query: 1 GASSSEDFEPSPPAKKKTKAAATGKGAKKKPRTPSKK 37
S +D P P K K K + K KKP+ SKK
Sbjct: 85 EPSHPDDHPPEPTEKPKKKKKKSKK--TKKPKKSSKK 119
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.130 0.378
Gapped
Lambda K H
0.267 0.0754 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,806,106
Number of extensions: 1090992
Number of successful extensions: 1263
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1169
Number of HSP's successfully gapped: 98
Length of query: 232
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 138
Effective length of database: 6,768,326
Effective search space: 934028988
Effective search space used: 934028988
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 57 (25.7 bits)