RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14677
(483 letters)
>gnl|CDD|238017 cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic
and prokaryotic proteins. Putative nuclear signalling
domain. FHA domains may bind phosphothreonine,
phosphoserine and sometimes phosphotyrosine. In
eukaryotes, many FHA domain-containing proteins localize
to the nucleus, where they participate in establishing
or maintaining cell cycle checkpoints, DNA repair, or
transcriptional regulation. Members of the FHA family
include: Dun1, Rad53, Cds1, Mek1,
KAPP(kinase-associated protein phosphatase),and Ki-67 (a
human nuclear protein related to cell proliferation).
Length = 102
Score = 92.8 bits (231), Expect = 5e-23
Identities = 33/104 (31%), Positives = 49/104 (47%), Gaps = 12/104 (11%)
Query: 355 RWRLYPFKGDTQLPVLHIHRQSAFLMGRDRKVADIPVDHPSCSKQHAALQYRLTDYTRED 414
RL GD ++ + +GRD DI +D PS S++HA ++Y
Sbjct: 1 VPRLVVLSGDASGRRYYLDPGGTYTIGRDSDNCDIVLDDPSVSRRHAVIRYD-------- 52
Query: 415 GSKGKRIRLYIIDLESANGTFVNNKKIDPKRYVELLEKDVIKFG 458
+ +IDL S NGTFVN +++ P V L + DVI+ G
Sbjct: 53 ----GDGGVVLIDLGSTNGTFVNGQRVSPGEPVRLRDGDVIRLG 92
>gnl|CDD|215951 pfam00498, FHA, FHA domain. The FHA (Forkhead-associated) domain
is a phosphopeptide binding motif.
Length = 67
Score = 65.7 bits (161), Expect = 8e-14
Identities = 29/81 (35%), Positives = 42/81 (51%), Gaps = 14/81 (17%)
Query: 378 FLMGRDRKVADIPVDHPSCSKQHAALQYRLTDYTREDGSKGKRIRLYIIDLESANGTFVN 437
+GR DI +D PS S++HA ++Y R Y+ DL S NGTFVN
Sbjct: 1 VTIGRSPD-CDIVLDDPSVSRRHAEIRYDGGG------------RFYLEDLGSTNGTFVN 47
Query: 438 NKKIDPKRYVELLEKDVIKFG 458
+++ P+ V L + DVI+ G
Sbjct: 48 GQRLGPE-PVRLRDGDVIRLG 67
>gnl|CDD|214578 smart00240, FHA, Forkhead associated domain. Found in eukaryotic
and prokaryotic proteins. Putative nuclear signalling
domain.
Length = 52
Score = 54.1 bits (131), Expect = 7e-10
Identities = 25/61 (40%), Positives = 32/61 (52%), Gaps = 12/61 (19%)
Query: 381 GRDRKVADIPVDHPSCSKQHAALQYRLTDYTREDGSKGKRIRLYIIDLESANGTFVNNKK 440
GR + DI +D PS S++HA + Y R Y+IDL S NGTFVN K+
Sbjct: 4 GRSSEDCDIQLDGPSISRRHAVIVYDGGG------------RFYLIDLGSTNGTFVNGKR 51
Query: 441 I 441
I
Sbjct: 52 I 52
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 49.1 bits (118), Expect = 4e-06
Identities = 21/109 (19%), Positives = 36/109 (33%)
Query: 178 QERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHH 237
Q TE + + R E+ D+ ++ E ER + R D+ R
Sbjct: 151 QPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRRE 210
Query: 238 DRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRG 286
+ R R D + R + R +D R + + + RG
Sbjct: 211 QGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRG 259
Score = 47.6 bits (114), Expect = 1e-05
Identities = 22/138 (15%), Positives = 45/138 (32%), Gaps = 1/138 (0%)
Query: 177 KQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRH 236
+ R + + Q + ++++ + E R + +RQ+ + RR
Sbjct: 135 EAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRRE 194
Query: 237 HDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRD-RGPKKPFCDAS 295
+ +R D R + ERG +D + R + ++ D R +
Sbjct: 195 ERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDD 254
Query: 296 DEYEWGRKDANNEDNKNN 313
E GR+ D
Sbjct: 255 GEGRGGRRGRRFRDRDRR 272
Score = 43.7 bits (104), Expect = 2e-04
Identities = 21/149 (14%), Positives = 49/149 (32%), Gaps = 1/149 (0%)
Query: 127 RKSPKKKHQVVQALRVNDSEAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDR 186
R + +K + + + +EE +E + R ++ R E R
Sbjct: 139 RGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGR 198
Query: 187 HNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDN 246
++R R E+ ++ + R +R+R+ ++ E R +
Sbjct: 199 DGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGR 258
Query: 247 RHDERNSRQQNNERGSQDRRSNQSHDRPQ 275
R R ++ +R + + P+
Sbjct: 259 G-GRRGRRFRDRDRRGRRGGDGGNEREPE 286
>gnl|CDD|236641 PRK10019, PRK10019, nickel/cobalt efflux protein RcnA; Provisional.
Length = 279
Score = 45.2 bits (107), Expect = 3e-05
Identities = 14/44 (31%), Positives = 19/44 (43%)
Query: 164 HHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHR 207
H H H H H HHH + E + HE+ H++D R
Sbjct: 125 HDHDHDHDHEHHHDHGHHHHHEHGATAEEYQDAHERAHANDIKR 168
Score = 40.9 bits (96), Expect = 7e-04
Identities = 17/82 (20%), Positives = 27/82 (32%), Gaps = 19/82 (23%)
Query: 159 EEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERE 218
E+ + HHH H H H + HE H H H +E
Sbjct: 112 GERNWLENMHHHDHDHDHDHD-----------------HEHHHDHGHHHHHEHGATAEEY 154
Query: 219 TKRNRQSHDDEETNERRHHDRN 240
+ ++H ++ +RR R
Sbjct: 155 QDAHERAHANDI--KRRFDGRE 174
>gnl|CDD|177648 PHA03420, PHA03420, E4 protein; Provisional.
Length = 137
Score = 43.1 bits (101), Expect = 4e-05
Identities = 18/64 (28%), Positives = 25/64 (39%), Gaps = 2/64 (3%)
Query: 141 RVNDSEAVENKIKEEPTEEEK--PRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHE 198
R+ D A I+ E P + H H H H+Q+ L DR +L + R
Sbjct: 22 RLLDGRAENQHIQREGGNHRTWDPADYLDRPHHHPHRHQQDDHHLQDRQHLPQQHLQRPH 81
Query: 199 KPHS 202
P S
Sbjct: 82 HPLS 85
Score = 28.0 bits (62), Expect = 6.6
Identities = 10/40 (25%), Positives = 15/40 (37%), Gaps = 3/40 (7%)
Query: 164 HHH---HHHHRHRHHHKQERTELSDRHNLQENQRPRHEKP 200
H H HH + R H Q+ + + Q+ KP
Sbjct: 57 HRHQQDDHHLQDRQHLPQQHLQRPHHPLSPQCQQNVTGKP 96
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 43.7 bits (103), Expect = 1e-04
Identities = 24/137 (17%), Positives = 39/137 (28%), Gaps = 19/137 (13%)
Query: 153 KEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKT 212
+E E EK R R +R ++ D+HR S +
Sbjct: 3 EEPDREREKSRGRDRDRSSERP------------------RRRSRDRSRFRDRHRRSRER 44
Query: 213 EPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHD 272
E R+R+ +D R+ R R+ R R + R + S+
Sbjct: 45 SYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHR-RRLRDRSPSNQ 103
Query: 273 RPQSSSNSHHRDRGPKK 289
+ D P
Sbjct: 104 WRKDDKKRSLWDIKPPG 120
Score = 43.7 bits (103), Expect = 2e-04
Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 5/109 (4%)
Query: 185 DRHNLQENQRPR-HEKPHSSDKHRHSSKTEPSERETKR---NRQSHDDEETNERRHHDRN 240
D +E ++ R ++ SS++ R S+ R+ R R +D +RR +D
Sbjct: 2 DEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSR 61
Query: 241 NRTHDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRGPKK 289
+ R SR R S+ RS + H R + ++ R K
Sbjct: 62 SPRSLRYSSVRRSR-DRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDK 109
Score = 34.1 bits (78), Expect = 0.19
Identities = 18/103 (17%), Positives = 35/103 (33%), Gaps = 6/103 (5%)
Query: 144 DSEAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSD-RHNLQENQRPRHEKPHS 202
+ E + ++ +E + R R RH +ER+ D R + R +
Sbjct: 8 EREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLR 67
Query: 203 SDKHRHS-----SKTEPSERETKRNRQSHDDEETNERRHHDRN 240
R S ++ + R+ D +N+ R D+
Sbjct: 68 YSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKK 110
>gnl|CDD|224630 COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms].
Length = 191
Score = 41.9 bits (98), Expect = 2e-04
Identities = 24/95 (25%), Positives = 43/95 (45%), Gaps = 16/95 (16%)
Query: 369 VLHIHRQSAFLMGRDRKVADIPVDHPSCSKQHAALQYRLTDYTREDGSKGKRIRLYIIDL 428
+ + + +GRD DI +D S++HA L+ + ++ DL
Sbjct: 82 SVIVLGEPVTTIGRD-PDNDIVLDDDVVSRRHAELRREGNEV-------------FLEDL 127
Query: 429 ESANGTFVNNKKIDPKRYVELLEKDVIKFGFSSRE 463
S NGT+VN +K+ + ++ DVI+ G + E
Sbjct: 128 GSTNGTYVNGEKVRQRVLLQD--GDVIRLGGTLAE 160
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 40.7 bits (95), Expect = 0.002
Identities = 28/150 (18%), Positives = 47/150 (31%), Gaps = 21/150 (14%)
Query: 158 EEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQ-RPRHEKPHSSDKH---RHSSKTE 213
E E HHH H H H H++ N +H HS+K+
Sbjct: 252 ELEYSEKGKDHHHSHNHQH----------HSIGINNHHSKHADSKLQTIEVIENHSNKSR 301
Query: 214 PS-----ERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDE--RNSRQQNNERGSQDRR 266
PS + + S + + ++ N+ + ++ N S
Sbjct: 302 PSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVS 361
Query: 267 SNQSHDRPQSSSNSHHRDRGPKKPFCDASD 296
N+S + S S+S RD A+
Sbjct: 362 DNESKQKRASKSSSGARDSKKDASGMSANG 391
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 37.7 bits (88), Expect = 0.014
Identities = 19/85 (22%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 198 EKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNRTH----DNRHDERNS 253
E+ ++ ++ +P ++ +R + ++ + NERR R+NRT +NR + R +
Sbjct: 584 EETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRD-TRDNRTRREGRENREENRRN 642
Query: 254 RQQNNERGSQDRRSNQSHDRPQSSS 278
R+Q ++ ++ R S Q+ ++ +
Sbjct: 643 RRQAQQQTAETRESQQAEVTEKART 667
Score = 35.0 bits (81), Expect = 0.088
Identities = 29/130 (22%), Positives = 52/130 (40%), Gaps = 13/130 (10%)
Query: 146 EAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDK 205
E + KPR ++ R + RT R N +EN+R R + + +
Sbjct: 593 APKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRNRRQAQQQTAE 652
Query: 206 HRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDR 265
R S + E +E+ ++ D+++ RR +R R R+DE+ RQ E + +
Sbjct: 653 TRESQQAEVTEKA-----RTQDEQQQAPRR--ERQRR----RNDEK--RQAQQEAKALNV 699
Query: 266 RSNQSHDRPQ 275
+ Q
Sbjct: 700 EEQSVQETEQ 709
Score = 33.5 bits (77), Expect = 0.31
Identities = 17/54 (31%), Positives = 21/54 (38%), Gaps = 3/54 (5%)
Query: 215 SERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSN 268
ETK Q E R DR +NR D RN R + R ++ RR
Sbjct: 582 GGEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRD-RNER--RDTRDNRTRREG 632
Score = 30.4 bits (69), Expect = 2.5
Identities = 19/93 (20%), Positives = 36/93 (38%), Gaps = 14/93 (15%)
Query: 229 EETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRGPK 288
E ER+ R R +N+R+ NER +D R N++ + + + R+R
Sbjct: 597 EAKPERQQDRRKPR--------QNNRRDRNER--RDTRDNRTRREGRENREENRRNRRQA 646
Query: 289 KPFCDASDEYEWGRKDANNEDNKNNAQQPKQKP 321
+ + E + A + + +Q P
Sbjct: 647 QQQTAETRE----SQQAEVTEKARTQDEQQQAP 675
Score = 28.9 bits (65), Expect = 7.8
Identities = 15/68 (22%), Positives = 25/68 (36%), Gaps = 9/68 (13%)
Query: 249 DERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRGPKKPFCDASDEYEWGRKDANNE 308
++ + + QDRR + ++R RDR ++ D E N
Sbjct: 590 EQPAPKAEAKPERQQDRRKPRQNNR---------RDRNERRDTRDNRTRREGRENREENR 640
Query: 309 DNKNNAQQ 316
N+ AQQ
Sbjct: 641 RNRRQAQQ 648
>gnl|CDD|114045 pfam05297, Herpes_LMP1, Herpesvirus latent membrane protein 1
(LMP1). This family consists of several latent membrane
protein 1 or LMP1s mostly from Epstein-Barr virus. LMP1
of EBV is a 62-65 kDa plasma membrane protein possessing
six membrane spanning regions, a short cytoplasmic
N-terminus and a long cytoplasmic carboxy tail of 200
amino acids. EBV latent membrane protein 1 (LMP1) is
essential for EBV-mediated transformation and has been
associated with several cases of malignancies. EBV-like
viruses in Cynomolgus monkeys (Macaca fascicularis) have
been associated with high lymphoma rates in
immunosuppressed monkeys.
Length = 382
Score = 36.9 bits (85), Expect = 0.018
Identities = 32/141 (22%), Positives = 54/141 (38%), Gaps = 14/141 (9%)
Query: 162 PRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRH--------EKPHSSDKHRHS---- 209
PRH HHH H Q+ T+ S + + RH + P ++ +
Sbjct: 189 PRHSDEHHHDDSLPHPQQATDDSGHESDSNSNEGRHHLLVSGAGDGPPLCSQNLGAPGGG 248
Query: 210 SKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQ 269
P + + + D + T++ D +N T DN + ++ N + + N
Sbjct: 249 PDNGPQDPDNTDDNGPQDPDNTDDNGPQDPDN-TDDNGPQDPDNTADNGPQDPDNTDDNG 307
Query: 270 SHDR-PQSSSNSHHRDRGPKK 289
HD P + S+S D GP K
Sbjct: 308 PHDPLPHNPSDSAGNDGGPPK 328
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 36.4 bits (84), Expect = 0.033
Identities = 26/165 (15%), Positives = 49/165 (29%), Gaps = 18/165 (10%)
Query: 126 KRKSPKKKHQVVQALRVNDSEAVEN---KIKEEPTEEEKPRHHHHHHHRHRHHHKQERTE 182
++K K+K + R EA E K KE+ E++ ++R E
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE-------------PRDREE 162
Query: 183 LSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNR 242
R ++ RP+ + E +R+ R EE + +R
Sbjct: 163 EKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNE--EREKE 220
Query: 243 THDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRGP 287
D + E + + Q ++ + P
Sbjct: 221 EDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASP 265
Score = 33.3 bits (76), Expect = 0.32
Identities = 31/206 (15%), Positives = 66/206 (32%), Gaps = 6/206 (2%)
Query: 153 KEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKT 212
E EEEK + + + +E + + +RP EK +K +
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159
Query: 213 EPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHD 272
E++ +R R + N+ + +E+ + + + + +
Sbjct: 160 REEEKKRERVRAK---SRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEE 216
Query: 273 RPQSSSNSHHRDRGPKKPFCDASDEYEWGRKDANNEDNKNNAQQPKQKPDFGLSGKLTED 332
R + + R+ D S + + +++ K + P+ S K TE
Sbjct: 217 REKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTET 276
Query: 333 ---TNTYNGVVIKYSEPPDARKPKRR 355
T+ S P + KR+
Sbjct: 277 RPRTSLRPPSARPASARPAPPRVKRK 302
>gnl|CDD|222914 PHA02666, PHA02666, hypothetical protein; Provisional.
Length = 287
Score = 36.1 bits (82), Expect = 0.033
Identities = 15/125 (12%), Positives = 36/125 (28%), Gaps = 1/125 (0%)
Query: 164 HHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSE-RETKRN 222
H HHR R + + R + + +R + ++ + + +
Sbjct: 25 HESCFHHRRRANSMESRRKSRPSRQHRSAERTPTTASSLTHENNTAPSRHGKQHSCKASS 84
Query: 223 RQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHH 282
R SH+ T+ +H + H + H + + + + +
Sbjct: 85 RSSHNRGSTSSSHNHHAHRGPHQSAHRRSKHDAVRDTYQPCPQSPETDLYKGRLPGETER 144
Query: 283 RDRGP 287
P
Sbjct: 145 HYETP 149
Score = 32.2 bits (72), Expect = 0.49
Identities = 45/202 (22%), Positives = 66/202 (32%), Gaps = 20/202 (9%)
Query: 197 HEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQ 256
S HR + S +++R S + TH+N +Q
Sbjct: 22 DGHHESCFHHRRRAN---SMESRRKSRPSRQHRSAERTPTTAS-SLTHENNTAPSRHGKQ 77
Query: 257 NNERGSQDRRSNQSHDRPQSSSNSHHRDRGPKKPFCDASDEYEWGRKDANNEDNKNNAQQ 316
++ + S RS+ + SS N HH RGP + + + DA + + Q
Sbjct: 78 HSCKASS--RSSHNRGSTSSSHN-HHAHRGPHQ------SAHRRSKHDAVRDTYQPCPQS 128
Query: 317 PKQKPDFGLSGKLTEDTNTYNGVVIKYSEPPDARKPKRRWRLYPFKGDTQLPVLHIHRQS 376
P+ G TE Y Y P D R + D LP LHI
Sbjct: 129 PETDLYKGRLPGETE--RHYETPDHIYDVPEDVRCAA-----VEPRRDLALPPLHIPSSK 181
Query: 377 AFLMGRDRKVADIPVDHPSCSK 398
R + D P+ H S K
Sbjct: 182 PARRMRPGSMGDFPMKHTSAGK 203
>gnl|CDD|217089 pfam02535, Zip, ZIP Zinc transporter. The ZIP family consists of
zinc transport proteins and many putative metal
transporters. The main contribution to this family is
from the Arabidopsis thaliana ZIP protein family these
proteins are responsible for zinc uptake in the plant.
Also found within this family are C. elegans proteins of
unknown function which are annotated as being similar to
human growth arrest inducible gene product, although
this protein in not found within this family.
Length = 314
Score = 35.8 bits (83), Expect = 0.040
Identities = 12/63 (19%), Positives = 17/63 (26%), Gaps = 7/63 (11%)
Query: 124 NLKRKSPKKKHQVVQALRVNDSEAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTEL 183
+ + H V V + E HHHHH H + + L
Sbjct: 110 HDHGHDGEHSHTVPDEESGAVESNVVH-------HEHGEGPHHHHHEGHEKGESDKESGL 162
Query: 184 SDR 186
R
Sbjct: 163 RVR 165
Score = 34.6 bits (80), Expect = 0.084
Identities = 11/64 (17%), Positives = 13/64 (20%)
Query: 160 EKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERET 219
+ H H H H E S E PH H E
Sbjct: 103 GRGHSHGHDHGHDGEHSHTVPDEESGAVESNVVHHEHGEGPHHHHHEGHEKGESDKESGL 162
Query: 220 KRNR 223
+
Sbjct: 163 RVRD 166
Score = 33.9 bits (78), Expect = 0.17
Identities = 12/62 (19%), Positives = 17/62 (27%)
Query: 164 HHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNR 223
H H H H H + T + E+ HE H H + + R
Sbjct: 104 RGHSHGHDHGHDGEHSHTVPDEESGAVESNVVHHEHGEGPHHHHHEGHEKGESDKESGLR 163
Query: 224 QS 225
Sbjct: 164 VR 165
Score = 33.5 bits (77), Expect = 0.23
Identities = 16/82 (19%), Positives = 21/82 (25%), Gaps = 17/82 (20%)
Query: 131 KKKHQVVQALRVNDSEAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQ 190
K + +D E EE E HH H HHH +
Sbjct: 102 KGRGHSHGHDHGHDGEHSHTVPDEESGAVESNVVHHEHGEGPHHHHHEG----------- 150
Query: 191 ENQRPRHEKPHSSDKHRHSSKT 212
HEK S + +
Sbjct: 151 ------HEKGESDKESGLRVRD 166
Score = 31.9 bits (73), Expect = 0.71
Identities = 11/61 (18%), Positives = 18/61 (29%), Gaps = 4/61 (6%)
Query: 159 EEKPRHHHHHHHRHRHHHKQERTEL----SDRHNLQENQRPRHEKPHSSDKHRHSSKTEP 214
H H H H H H E S+ + + + P H +K ++
Sbjct: 103 GRGHSHGHDHGHDGEHSHTVPDEESGAVESNVVHHEHGEGPHHHHHEGHEKGESDKESGL 162
Query: 215 S 215
Sbjct: 163 R 163
>gnl|CDD|218163 pfam04592, SelP_N, Selenoprotein P, N terminal region. SelP is the
only known eukaryotic selenoprotein that contains
multiple selenocysteine (Sec) residues, and accounts for
more than 50% of the selenium content of rat and human
plasma. It is thought to be glycosylated. SelP may have
antioxidant properties. It can attach to epithelial
cells, and may protect vascular endothelial cells
against peroxynitrite toxicity. The high selenium
content of SelP suggests that it may be involved in
selenium intercellular transport or storage. The
promoter structure of bovine SelP suggest that it may be
involved in countering heavy metal intoxication, and may
also have a developmental function. The N-terminal
region of SelP can exist independently of the C terminal
region. Zebrafish selenoprotein Pb lacks the C terminal
Sec-rich region, and a protein encoded by the rat SelP
gene and lacking this region has also been reported.
N-terminal region contains a conserved SecxxCys motif,
which is similar to the CysxxCys found in thioredoxins.
It is speculated that the N terminal region may adopt a
thioredoxin fold and catalyze redox reactions. The
N-terminal region also contains a His-rich region, which
is thought to mediate heparin binding. Binding to
heparan proteoglycans could account for the membrane
binding properties of SelP. The function of the
bacterial members of this family is uncharcterised.
Length = 238
Score = 34.9 bits (80), Expect = 0.056
Identities = 14/48 (29%), Positives = 18/48 (37%), Gaps = 1/48 (2%)
Query: 153 KEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKP 200
+P E E PR H HHH H H H+ N ++P
Sbjct: 173 VNKPVEAE-PRQDHPHHHSHHEHQGHAHHHPHGHHHPGSNSHSESQQP 219
Score = 34.5 bits (79), Expect = 0.097
Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 4/70 (5%)
Query: 145 SEAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRP---RHEKPH 201
V ++ EP ++ H HH H H HHH E+Q+P + +P
Sbjct: 170 LATVNKPVEAEPRQDHPHHHSHHEHQGHAHHHPHGHHHPGSNS-HSESQQPDPDKPTEPP 228
Query: 202 SSDKHRHSSK 211
S H H+ +
Sbjct: 229 SGLHHHHNHR 238
>gnl|CDD|215173 PLN02304, PLN02304, probable pectinesterase.
Length = 379
Score = 34.1 bits (78), Expect = 0.15
Identities = 9/22 (40%), Positives = 13/22 (59%)
Query: 164 HHHHHHHRHRHHHKQERTELSD 185
HH HHHH H+H ++ + D
Sbjct: 39 HHSHHHHHHKHPDDGKKVSICD 60
Score = 32.5 bits (74), Expect = 0.44
Identities = 8/15 (53%), Positives = 9/15 (60%)
Query: 164 HHHHHHHRHRHHHKQ 178
H HHH H HHH +
Sbjct: 34 PHSIHHHSHHHHHHK 48
Score = 32.1 bits (73), Expect = 0.70
Identities = 9/18 (50%), Positives = 9/18 (50%)
Query: 164 HHHHHHHRHRHHHKQERT 181
H HHH H HHHK
Sbjct: 35 HSIHHHSHHHHHHKHPDD 52
Score = 31.4 bits (71), Expect = 1.2
Identities = 9/21 (42%), Positives = 12/21 (57%)
Query: 164 HHHHHHHRHRHHHKQERTELS 184
HHH HH H H H + ++S
Sbjct: 37 IHHHSHHHHHHKHPDDGKKVS 57
Score = 30.6 bits (69), Expect = 2.1
Identities = 6/16 (37%), Positives = 8/16 (50%)
Query: 164 HHHHHHHRHRHHHKQE 179
+ H HH HHH +
Sbjct: 33 YPHSIHHHSHHHHHHK 48
>gnl|CDD|193258 pfam12782, Innate_immun, Invertebrate innate immunity transcript
family. The immune response of the purple sea urchin
appears to be more complex than previously believed in
that it uses immune-related gene families homologous to
vertebrate Toll-like and NOD/NALP-like receptor families
as well as C-type lectins and a rudimentary complement
system. In addition, the species also produces this
unusual family of mRNAs, also known as 185/333, which is
strongly upregulated in response to pathogen challenge.
Length = 312
Score = 33.5 bits (76), Expect = 0.20
Identities = 14/73 (19%), Positives = 22/73 (30%), Gaps = 1/73 (1%)
Query: 143 NDSEAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHE-KPH 201
N+S + + H R ++ TE HN + RH K
Sbjct: 167 NESSEEDGRPHPHHHRHHGHNHFGRKPFGGRPFGRRNHTEGHQGHNETGDHPHRHHNKTG 226
Query: 202 SSDKHRHSSKTEP 214
D+ R + P
Sbjct: 227 DGDQDRPMFEMRP 239
Score = 30.8 bits (69), Expect = 1.4
Identities = 30/133 (22%), Positives = 50/133 (37%), Gaps = 9/133 (6%)
Query: 153 KEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSS--DKHRHSS 210
+ E +EE+ H HHH H +H R R + N H+ + + HRH +
Sbjct: 166 RNESSEEDGRPHPHHHRHHGHNHF--GRKPFGGRPFGRRNHTEGHQGHNETGDHPHRHHN 223
Query: 211 KTEPSER-----ETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDR 265
KT ++ E + R + + R DR N T + R+ RG
Sbjct: 224 KTGDGDQDRPMFEMRPFRFNPFGRKPFGDRPFDRRNGTEEGSPRRDGHRRPYGNRGRWGE 283
Query: 266 RSNQSHDRPQSSS 278
++ + P + S
Sbjct: 284 NESEEKEHPTTES 296
>gnl|CDD|238501 cd01019, ZnuA, Zinc binding protein ZnuA. These proteins have been
shown to function as initial receptors in the ABC uptake
of Zn2+. They belong to the TroA superfamily of
periplasmic metal binding proteins that share a distinct
fold and ligand binding mechanism. They are comprised
of two globular subdomains connected by a single helix
and bind their specific ligands in the cleft between
these domains. A typical TroA protein is comprised of
two globular subdomains connected by a single helix and
can bind the metal ion in the cleft between these
domains. In addition, these proteins sometimes have a
low complexity region containing a metal-binding
histidine-rich motif (repetitive HDH sequence).
Length = 286
Score = 32.3 bits (74), Expect = 0.44
Identities = 5/25 (20%), Positives = 5/25 (20%)
Query: 153 KEEPTEEEKPRHHHHHHHRHRHHHK 177
E H H H H
Sbjct: 95 ASHGDHEHDHEHAHGEHDGHEEGGL 119
Score = 28.9 bits (65), Expect = 5.7
Identities = 10/39 (25%), Positives = 14/39 (35%), Gaps = 2/39 (5%)
Query: 151 KIKE-EPTEEEKPRHHHHHHHRHRHHHKQERTELS-DRH 187
K+ + + E+ H H H H H E D H
Sbjct: 84 KLIDLKTLEDGASHGDHEHDHEHAHGEHDGHEEGGLDPH 122
Score = 28.9 bits (65), Expect = 6.7
Identities = 6/32 (18%), Positives = 8/32 (25%)
Query: 158 EEEKPRHHHHHHHRHRHHHKQERTELSDRHNL 189
+ K H H H H+ E
Sbjct: 86 IDLKTLEDGASHGDHEHDHEHAHGEHDGHEEG 117
>gnl|CDD|236558 PRK09545, znuA, high-affinity zinc transporter periplasmic
component; Reviewed.
Length = 311
Score = 32.3 bits (74), Expect = 0.50
Identities = 6/26 (23%), Positives = 9/26 (34%)
Query: 151 KIKEEPTEEEKPRHHHHHHHRHRHHH 176
K + ++ H H HHH
Sbjct: 115 KGAHDDHHDDDHDHAGHEKSDEDHHH 140
Score = 31.5 bits (72), Expect = 0.92
Identities = 5/26 (19%), Positives = 6/26 (23%)
Query: 154 EEPTEEEKPRHHHHHHHRHRHHHKQE 179
+ H H HH E
Sbjct: 117 AHDDHHDDDHDHAGHEKSDEDHHHGE 142
Score = 30.7 bits (70), Expect = 1.5
Identities = 5/25 (20%), Positives = 8/25 (32%)
Query: 152 IKEEPTEEEKPRHHHHHHHRHRHHH 176
+K + H H H + H
Sbjct: 114 MKGAHDDHHDDDHDHAGHEKSDEDH 138
Score = 28.4 bits (64), Expect = 8.7
Identities = 3/25 (12%), Positives = 9/25 (36%)
Query: 153 KEEPTEEEKPRHHHHHHHRHRHHHK 177
++ +++ H HH +
Sbjct: 118 HDDHHDDDHDHAGHEKSDEDHHHGE 142
>gnl|CDD|183534 PRK12451, PRK12451, arginyl-tRNA synthetase; Reviewed.
Length = 562
Score = 31.3 bits (71), Expect = 1.2
Identities = 17/59 (28%), Positives = 34/59 (57%), Gaps = 4/59 (6%)
Query: 88 KHIPS--MNLDIRKKSRRKRVIALIQNLTQAQIPNMKRNLKRKSP--KKKHQVVQALRV 142
+H+P + D +K S RK + L++ + + I K+N++ K+P K+K +V + + V
Sbjct: 358 EHVPFGLILKDGKKMSTRKGRVVLLEEVLEEAIELAKQNIEEKNPNLKQKEEVAKQVGV 416
>gnl|CDD|132860 cd07221, Pat_PNPLA3, Patatin-like phospholipase domain containing
protein 3. PNPLA3 is a triacylglycerol lipase that
mediates triacylglycerol hydrolysis in adipocytes and is
an indicator of the nutritional state. PNPLA3 is also
known as adiponutrin (ADPN) or iPLA2-epsilon. Human
adiponutrins are bound to the cell membrane of
adipocytes and show transacylase, TG hydrolase, and PLA2
activity. This family includes patatin-like proteins:
ADPN (adiponutrin) from mammals, PNPLA3 (Patatin-like
phospholipase domain-containing protein 3), and
iPLA2-epsilon (Calcium-independent phospholipase A2)
from Homo sapiens.
Length = 252
Score = 30.5 bits (69), Expect = 1.7
Identities = 20/81 (24%), Positives = 37/81 (45%), Gaps = 14/81 (17%)
Query: 77 YCAFLFSGNTRKHIPSMNLDIRKKSRRKRVIALIQ---NLTQAQIPNMKRNLKRKSPKKK 133
+C SG I + +D+ + + R R I ++ NL++ +++ L+R P
Sbjct: 45 HCVTFLSGLPLDQILQILMDLVRSA-RSRNIGILHPSFNLSK----HLRDGLQRHLPDNV 99
Query: 134 HQVVQA------LRVNDSEAV 148
HQ++ RV+D E V
Sbjct: 100 HQLISGKMCISLTRVSDGENV 120
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 30.6 bits (69), Expect = 2.4
Identities = 29/142 (20%), Positives = 45/142 (31%), Gaps = 21/142 (14%)
Query: 193 QRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERN 252
Q+ R P + ++S + E +R+R+ +D +
Sbjct: 168 QQQRLGFPPRAP---YASPASYA-PEQERDREPYDAGRPEYDQR---------------- 207
Query: 253 SRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRGPKKPFCDASDEYEWGRKDANNEDNKN 312
R ++ R DR DRP+ + H RG P R A
Sbjct: 208 RRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQ 267
Query: 313 NAQQPKQ-KPDFGLSGKLTEDT 333
A P +P L+ K T DT
Sbjct: 268 PAPAPGPGEPTARLNPKYTFDT 289
>gnl|CDD|182628 PRK10663, PRK10663, cytochrome o ubiquinol oxidase subunit III;
Provisional.
Length = 204
Score = 29.7 bits (67), Expect = 2.7
Identities = 6/12 (50%), Positives = 6/12 (50%)
Query: 164 HHHHHHHRHRHH 175
H H H H HH
Sbjct: 8 HATAHAHEHGHH 19
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 30.3 bits (68), Expect = 2.7
Identities = 22/108 (20%), Positives = 34/108 (31%), Gaps = 18/108 (16%)
Query: 183 LSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEE------------ 230
L R L + R R S K + P +R + QS+ DE
Sbjct: 25 LPSRRRLAVSCRARRRGFSSQQKKKRGKNIAPKQRSSNSKLQSNTDENGQLENTSLRTVM 84
Query: 231 ------TNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHD 272
T+ H+R + D ++ QQ N + + Q D
Sbjct: 85 ELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQTNSKDGEQLSDFQLED 132
>gnl|CDD|236005 PRK07376, PRK07376, NAD(P)H-quinone oxidoreductase subunit F;
Validated.
Length = 673
Score = 30.4 bits (69), Expect = 2.8
Identities = 9/36 (25%), Positives = 11/36 (30%)
Query: 141 RVNDSEAVENKIKEEPTEEEKPRHHHHHHHRHRHHH 176
R ND E + + HH H H H
Sbjct: 459 RGNDKELQKELLAAADAPGHHEGHHEEHGEHHHHSG 494
>gnl|CDD|177093 CHL00194, ycf39, Ycf39; Provisional.
Length = 317
Score = 30.0 bits (68), Expect = 2.8
Identities = 14/30 (46%), Positives = 16/30 (53%), Gaps = 4/30 (13%)
Query: 10 ALSTVLHQTVLKCSFSALIRYYQYRCLVRN 39
A T+ Q V + AL YQ RCLVRN
Sbjct: 8 ATGTLGRQIVRQ----ALDEGYQVRCLVRN 33
>gnl|CDD|220096 pfam09052, SipA, Salmonella invasion protein A. Salmonella
invasion protein A is an actin-binding protein that
contributes to host cytoskeletal rearrangements by
stimulating actin polymerisation and counteracting
F-actin destabilising proteins. Members of this family
possess an all-helical fold consisting of eight
alpha-helices arranged so that six long, amphipathic
helices form a compact fold that surrounds a final,
predominantly hydrophobic helix in the middle of the
molecule.
Length = 674
Score = 30.0 bits (67), Expect = 3.1
Identities = 14/70 (20%), Positives = 23/70 (32%), Gaps = 10/70 (14%)
Query: 226 HDDEETNERRHHDRNNRTHDNRHDE--RNSRQQNNERGSQDRRS--------NQSHDRPQ 275
H+D + + R D T+ NS +N+E +Q S S P
Sbjct: 307 HNDNRSYDNRVFDNRGDTYLGGARRHYDNSYHENSENDAQSPTSQTNDLSRNGNSLLSPP 366
Query: 276 SSSNSHHRDR 285
+S +
Sbjct: 367 ASPAAGQHAL 376
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family. This model
represents a subfamily of RNA splicing factors including
the Pad-1 protein (N. crassa), CAPER (M. musculus) and
CC1.3 (H.sapiens). These proteins are characterized by
an N-terminal arginine-rich, low complexity domain
followed by three (or in the case of 4 H. sapiens
paralogs, two) RNA recognition domains (rrm: pfam00706).
These splicing factors are closely related to the U2AF
splicing factor family (TIGR01642). A homologous gene
from Plasmodium falciparum was identified in the course
of the analysis of that genome at TIGR and was included
in the seed.
Length = 457
Score = 29.9 bits (67), Expect = 3.5
Identities = 11/85 (12%), Positives = 24/85 (28%)
Query: 151 KIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSS 210
+ +E R R R + R R + S +++
Sbjct: 3 RDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYYRPR 62
Query: 211 KTEPSERETKRNRQSHDDEETNERR 235
R+ +R+ ++ + T R
Sbjct: 63 GDRSYRRDDRRSGRNTKEPLTEAER 87
>gnl|CDD|224151 COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion
transport and metabolism].
Length = 296
Score = 29.6 bits (67), Expect = 3.7
Identities = 7/22 (31%), Positives = 9/22 (40%)
Query: 162 PRHHHHHHHRHRHHHKQERTEL 183
HH H H H H + + L
Sbjct: 3 GAAHHSHGHDHAHDNPRNERRL 24
>gnl|CDD|204706 pfam11666, DUF2933, Protein of unknown function (DUF2933). This
bacterial family of proteins has no known function.
Length = 56
Score = 26.9 bits (60), Expect = 4.1
Identities = 4/17 (23%), Positives = 4/17 (23%)
Query: 163 RHHHHHHHRHRHHHKQE 179
H H H H
Sbjct: 40 HGGHGGHGGHDSHDDDP 56
>gnl|CDD|220888 pfam10846, DUF2722, Protein of unknown function (DUF2722). This
eukaryotic family of proteins has no known function.
Length = 373
Score = 29.5 bits (66), Expect = 4.4
Identities = 20/104 (19%), Positives = 43/104 (41%), Gaps = 6/104 (5%)
Query: 187 HNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDN 246
H ++ ++ +S +H K E +++ +R + S + + + D+
Sbjct: 254 HKQSKSAPQPSQESMTSFQHIIQWKPESQQKKHRRTKSSSSFGVIDLNSISEASQVNEDD 313
Query: 247 RHDERNSRQQNNERGSQDRRSNQSHDR-----PQSSSNSHHRDR 285
+ +S+++ NE + D S S D +SSS S +R
Sbjct: 314 DPPDSDSKERKNEE-NSDPESTPSDDNDDKTCSESSSRSESPNR 356
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 29.5 bits (66), Expect = 4.9
Identities = 27/147 (18%), Positives = 49/147 (33%), Gaps = 7/147 (4%)
Query: 177 KQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPS------ERETKRNRQSHDDEE 230
KQ+ TE +R QE +R + SSD + SS + S E E + + + ++EE
Sbjct: 394 KQDDTEEEERRKRQERER-QGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEE 452
Query: 231 TNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRGPKKP 290
E + +E + +N + S++ + R
Sbjct: 453 EEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAG 512
Query: 291 FCDASDEYEWGRKDANNEDNKNNAQQP 317
S+ + E + QP
Sbjct: 513 ISRMSEGQQPRGSSVQPESPQEEPLQP 539
>gnl|CDD|220924 pfam10986, DUF2796, Protein of unknown function (DUF2796). This
bacterial family of proteins has no known function.
Length = 161
Score = 28.4 bits (64), Expect = 5.1
Identities = 11/31 (35%), Positives = 12/31 (38%)
Query: 157 TEEEKPRHHHHHHHRHRHHHKQERTELSDRH 187
+ H HH H H HH E SD H
Sbjct: 79 SLFGDHDHDHHDHDHHDHHDDHEHAGHSDFH 109
Score = 28.4 bits (64), Expect = 5.3
Identities = 6/26 (23%), Positives = 8/26 (30%)
Query: 154 EEPTEEEKPRHHHHHHHRHRHHHKQE 179
E + HH H H H +
Sbjct: 77 ESSLFGDHDHDHHDHDHHDHHDDHEH 102
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 28.6 bits (64), Expect = 5.1
Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 3/73 (4%)
Query: 157 TEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSE 216
+ + H H H +HR SD L+ +++ +K H DK R K + E
Sbjct: 105 HIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKER---KKKKKE 161
Query: 217 RETKRNRQSHDDE 229
++ K+ R S +
Sbjct: 162 KKKKKKRHSPEHP 174
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 28.8 bits (64), Expect = 5.4
Identities = 19/111 (17%), Positives = 39/111 (35%), Gaps = 5/111 (4%)
Query: 214 PSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHDR 273
P + N + EE++ R + + + +R+ N E + S+ D
Sbjct: 168 PPAQPDVDNEEEERLEESDGRE--EEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDV 225
Query: 274 PQSSSNSHHRDRGPKK---PFCDASDEYEWGRKDANNEDNKNNAQQPKQKP 321
+ +G ++ E EW + + E +NN + P +K
Sbjct: 226 VDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEESPAKKQ 276
>gnl|CDD|200419 TIGR04168, TIGR04168, TIGR04168 family protein. Members of this
uncharacterized protein family are restricted, in 49 of
50 genomes, to organisms with a family TIGR04167 radical
SAM protein, which occasionally is a selenoprotein.
Length = 269
Score = 28.8 bits (65), Expect = 5.5
Identities = 17/48 (35%), Positives = 19/48 (39%), Gaps = 4/48 (8%)
Query: 167 HHHHRHRHHHKQERTEL-SDRHN---LQENQRPRHEKPHSSDKHRHSS 210
H HHR RH + R + DR L PR K K RH S
Sbjct: 192 HMHHRLRHTKGRLRRTVVRDRQGTVYLNAACVPRIGKDEEGRKLRHFS 239
>gnl|CDD|220184 pfam09332, Mcm10, Mcm10 replication factor. Mcm10 is a eukaryotic
DNA replication factor that regulates the stability and
chromatin association of DNA polymerase alpha.
Length = 346
Score = 28.7 bits (64), Expect = 7.1
Identities = 17/79 (21%), Positives = 32/79 (40%), Gaps = 2/79 (2%)
Query: 87 RKHIPSMNLDIRKKSRRKRVIALIQNLTQAQIPNMKRNLKRKSPKKKHQVVQALRVNDSE 146
+ P L + + + I ++ L +A ++KR KR K+ + + + N S
Sbjct: 123 KSPPPLPKLSAKASAAKLAAILKLKALEKANPNSIKR--KRSESGKRRAIDERVEKNASS 180
Query: 147 AVENKIKEEPTEEEKPRHH 165
A K E+ K R +
Sbjct: 181 AKRQKSPEDEEPAMKKRRY 199
>gnl|CDD|238500 cd01018, ZntC, Metal binding protein ZntC. These proteins are
predicted to function as initial receptors in ABC
transport of metal ions. They belong to the TroA
superfamily of helical backbone metal receptor proteins
that share a distinct fold and ligand binding mechanism.
They are comprised of two globular subdomains connected
by a long alpha helix and bind their specific ligands in
the cleft between these domains. In addition, many of
these proteins possess a metal-binding histidine-rich
motif (repetitive HDH sequence).
Length = 266
Score = 28.5 bits (64), Expect = 7.8
Identities = 8/13 (61%), Positives = 8/13 (61%)
Query: 165 HHHHHHRHRHHHK 177
HHHHHH HH
Sbjct: 94 HHHHHHGEHEHHH 106
>gnl|CDD|237322 PRK13261, ureE, urease accessory protein UreE; Provisional.
Length = 159
Score = 28.0 bits (63), Expect = 8.1
Identities = 6/13 (46%), Positives = 6/13 (46%)
Query: 164 HHHHHHHRHRHHH 176
HH H H H H
Sbjct: 147 AFRHHGHSHDHSH 159
Score = 27.6 bits (62), Expect = 8.9
Identities = 5/13 (38%), Positives = 5/13 (38%)
Query: 164 HHHHHHHRHRHHH 176
HH H H H
Sbjct: 145 GGAFRHHGHSHDH 157
>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
Provisional.
Length = 576
Score = 28.8 bits (64), Expect = 8.2
Identities = 20/107 (18%), Positives = 41/107 (38%), Gaps = 8/107 (7%)
Query: 190 QENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEET---NERRHHDRNNRTHDN 246
N+ + P + R ++ EP + E K + Q+ NER ++N + +
Sbjct: 400 DSNKEVPEDVPMEPEDDRDNNFNEPKKPENKGDGQNEPVIPKPLDNERDQSNKNKQVNPG 459
Query: 247 RHDERNSRQ-----QNNERGSQDRRSNQSHDRPQSSSNSHHRDRGPK 288
R +NNE + + +++ P+ S + D+ K
Sbjct: 460 NRHNSEDRYTRPHGRNNENRNYNNKNSDIPKHPERSEHEQPEDKKKK 506
>gnl|CDD|144541 pfam00985, MSA_2, Merozoite Surface Antigen 2 (MSA-2) family.
Length = 171
Score = 27.9 bits (61), Expect = 8.2
Identities = 15/80 (18%), Positives = 30/80 (37%)
Query: 178 QERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHH 237
Q E + N+Q++ + + P + D S +P + E + +
Sbjct: 41 QANKETQNNSNVQQDSQTKSNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPE 100
Query: 238 DRNNRTHDNRHDERNSRQQN 257
++ H + H RN+ QN
Sbjct: 101 NKGTGQHGHMHGSRNNHPQN 120
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 28.1 bits (62), Expect = 8.6
Identities = 11/56 (19%), Positives = 25/56 (44%)
Query: 217 RETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHD 272
R+ + + + EE ER +R D R D+ ++ E+ +++ + H+
Sbjct: 124 RQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHE 179
>gnl|CDD|227489 COG5160, ULP1, Protease, Ulp1 family [Posttranslational
modification, protein turnover, chaperones].
Length = 578
Score = 28.3 bits (63), Expect = 9.5
Identities = 24/99 (24%), Positives = 36/99 (36%), Gaps = 4/99 (4%)
Query: 200 PHSSDKHRHS--SKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQN 257
PHSS H+HS +K + TKR + +++ D N D
Sbjct: 269 PHSSQIHKHSLSNKPTGLQSLTKRVFIASLGKKSLLISIKDSNISPSDFYQSN-GEETLQ 327
Query: 258 NERGSQDRRSNQSHD-RPQSSSNSHHRDRGPKKPFCDAS 295
+ S+ RRSN+ + N + R K F S
Sbjct: 328 SHAVSELRRSNRFKKGSDPLNFNDNVPFRDAKTDFQTPS 366
>gnl|CDD|216817 pfam01969, DUF111, Protein of unknown function DUF111. This
prokaryotic family has no known function.
Length = 381
Score = 28.2 bits (64), Expect = 10.0
Identities = 9/13 (69%), Positives = 9/13 (69%)
Query: 160 EKPRHHHHHHHRH 172
E H HHHHHRH
Sbjct: 60 ELEEHEHHHHHRH 72
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.131 0.389
Gapped
Lambda K H
0.267 0.0799 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,034,902
Number of extensions: 2287970
Number of successful extensions: 3447
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2940
Number of HSP's successfully gapped: 205
Length of query: 483
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 382
Effective length of database: 6,457,848
Effective search space: 2466897936
Effective search space used: 2466897936
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (27.1 bits)