RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11719
(657 letters)
>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated 2).
NSA2 appears to be a protein required for the
maturation of 27S pre-rRNA in yeast; it has been
characterized in mammalian cells as a nucleolar protein
that might play a role in the regulation of the cell
cycle and in cell proliferation.
Length = 257
Score = 559 bits (1444), Expect = 0.0
Identities = 200/256 (78%), Positives = 227/256 (88%), Gaps = 1/256 (0%)
Query: 335 NEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKK 394
NEYIE HRK +GRRLD+EERKRK+EAREPHKR++KA+ +R +KAKLYNK+R EKIQMKK
Sbjct: 1 NEYIELHRKRHGRRLDHEERKRKKEAREPHKRSKKAQKLRGLKAKLYNKKRYKEKIQMKK 60
Query: 395 KIKAHEEKLTKQKHEKKE-EGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPK 453
IK HEE+ KQK + K EGA+P YLLDR+ QSRAKVLSNMIKQKRKEKAGKW VP+PK
Sbjct: 61 TIKMHEERNVKQKVDDKVPEGAVPAYLLDREGQSRAKVLSNMIKQKRKEKAGKWSVPLPK 120
Query: 454 VRTQADAEVFKIVRSGKTKRKAWKRMITKVCFVGENFTRKPPKFERFIRPMALRFNKAHV 513
VR ++ E+FK+V++GK K+K+WKRM+TKV FVGE FTRKPPK+ERFIRPM LRF KAHV
Sbjct: 121 VRGISEDEMFKVVKTGKRKKKSWKRMVTKVTFVGEGFTRKPPKYERFIRPMGLRFKKAHV 180
Query: 514 THPELKATFYLPIIGVKKNPNSPMYTNLGVITKGTVIEVNISELGLVTQAGKVVWGKYAQ 573
THPELKATF LPIIGVKKNP SP+YT LGVITKGT+IEVN+SELGLVTQ GKVVWGKYAQ
Sbjct: 181 THPELKATFCLPIIGVKKNPQSPLYTQLGVITKGTIIEVNVSELGLVTQGGKVVWGKYAQ 240
Query: 574 VTNNPENDGCINAVLL 589
VTNNPENDGCINAVLL
Sbjct: 241 VTNNPENDGCINAVLL 256
>gnl|CDD|240396 PTZ00388, PTZ00388, 40S ribosomal protein S8-like; Provisional.
Length = 223
Score = 429 bits (1104), Expect = e-148
Identities = 165/222 (74%), Positives = 192/222 (86%), Gaps = 1/222 (0%)
Query: 369 KARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQK-HEKKEEGALPVYLLDRDVQS 427
KA+ +R IKAKL+NK+R EK +MKK IKAHEEK K+K EK +GA+P YLLDR+ +
Sbjct: 1 KAQKLRGIKAKLFNKKRYAEKAEMKKTIKAHEEKDVKEKVPEKVPDGAVPSYLLDREQVN 60
Query: 428 RAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRKAWKRMITKVCFVG 487
RAKVLSNM+KQKRKEKAGKW+VPIPKV+ + E+FK+V+SGK K+K+WKR++ KV FVG
Sbjct: 61 RAKVLSNMLKQKRKEKAGKWDVPIPKVKAMNEDEMFKVVKSGKRKKKSWKRLVNKVTFVG 120
Query: 488 ENFTRKPPKFERFIRPMALRFNKAHVTHPELKATFYLPIIGVKKNPNSPMYTNLGVITKG 547
E+FTRKPPK+ERFIRP LRF KAHVTHPELK TFYL IIGVKKNP S +YT+LGVITKG
Sbjct: 121 EDFTRKPPKYERFIRPTGLRFKKAHVTHPELKTTFYLDIIGVKKNPQSNLYTSLGVITKG 180
Query: 548 TVIEVNISELGLVTQAGKVVWGKYAQVTNNPENDGCINAVLL 589
T+IEVN+SELGLVTQ GKVVWGKYAQVTNNPENDGCINAVLL
Sbjct: 181 TIIEVNVSELGLVTQTGKVVWGKYAQVTNNPENDGCINAVLL 222
>gnl|CDD|211392 cd11380, Ribosomal_S8e_like, Eukaryotic/archaeal ribosomal protein
S8e and similar proteins. This family contains the
eukaryotic/archaeal ribosomal protein S8, a component of
the small ribosomal subunits, as well as the NSA2 gene
product.
Length = 138
Score = 230 bits (589), Expect = 8e-73
Identities = 73/140 (52%), Positives = 91/140 (65%), Gaps = 2/140 (1%)
Query: 450 PIPKVRTQADAEVFKIVRSGKTKRKAWKRMITKVCFVGENFTRKPPKFERFIRPMALRFN 509
P K + +A FK+VR + K TK+ GE FTRK I+P ALR +
Sbjct: 1 PQGKHKRKATGGKFKVVRKKRKYELGRKPANTKLG--GERFTRKVRVRGGNIKPRALRLD 58
Query: 510 KAHVTHPELKATFYLPIIGVKKNPNSPMYTNLGVITKGTVIEVNISELGLVTQAGKVVWG 569
KA+V+HPEL T L I+GVKKNP++ Y GVITKGT+IE ++S LGLVT+ GKVVWG
Sbjct: 59 KANVSHPELGKTKKLKILGVKKNPSNNEYVRRGVITKGTIIETDVSPLGLVTEEGKVVWG 118
Query: 570 KYAQVTNNPENDGCINAVLL 589
YAQVT+ P DGC+NAVLL
Sbjct: 119 GYAQVTSRPGQDGCVNAVLL 138
>gnl|CDD|224918 COG2007, RPS8A, Ribosomal protein S8E [Translation, ribosomal
structure and biogenesis].
Length = 127
Score = 125 bits (317), Expect = 5e-34
Identities = 42/113 (37%), Positives = 51/113 (45%), Gaps = 26/113 (23%)
Query: 477 KRMITKVCFVGENFTRKPPKFERFIRPMALRFNKAHVTHPELKATFYLPIIGVKKNPNSP 536
KR KV G NF + ALR A+V+ PE T IIGV +NP +P
Sbjct: 40 KRKRKKVRTRGGNFKVR-----------ALRAKTANVSDPETGKTKKAKIIGVVENPANP 88
Query: 537 MYTNLGVITKGTVIEVNISELGLVTQAGKVVWGKYAQVTNNPENDGCINAVLL 589
Y +ITKG +IE I YA VT+ P DG +NAVLL
Sbjct: 89 EYVRRNIITKGAIIETEIG---------------YAVVTSRPGQDGVVNAVLL 126
Score = 38.8 bits (91), Expect = 0.001
Identities = 17/55 (30%), Positives = 24/55 (43%), Gaps = 4/55 (7%)
Query: 335 NEYIERHRKLYGRRLDYEERKRKREA-REP--HKRAEKARSVRKIKAKLYNKQRR 386
RK G + +KRK E REP K EK R +K++ + N + R
Sbjct: 3 IWQGRSKRKPTGGKRHQARKKRKFELGREPTETKIGEK-RKRKKVRTRGGNFKVR 56
>gnl|CDD|201659 pfam01201, Ribosomal_S8e, Ribosomal protein S8e.
Length = 129
Score = 121 bits (306), Expect = 1e-32
Identities = 57/255 (22%), Positives = 76/255 (29%), Gaps = 129/255 (50%)
Query: 335 NEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKK 394
N+ I RK G RLD+ +KR
Sbjct: 4 NQGIRSKRKRTGGRLDHHRKKR-------------------------------------- 25
Query: 395 KIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKV 454
K E G P K+ N +KQ R
Sbjct: 26 ---------------KFELGRPPAP---------TKLGKNRVKQVR-------------- 47
Query: 455 RTQADAEVFKIVRSGKTKRKAWKRMITKVCFVGENFTRKPPKFERFIRPMALRFNKAHVT 514
VR G K +A LR +KA+V+
Sbjct: 48 -----------VRGGNRKVRA------------------------------LRLDKANVS 66
Query: 515 HPELKATFYLPIIGVKKNPNSPMYTNLGVITKGTVIEVNISELGLVTQAGKVVWGKYAQV 574
P T + I+GV NP + Y +ITKG +IE ++SELG A+V
Sbjct: 67 WPSEGTTRKVRILGVVYNPANNEYVRRNIITKGAIIETDVSELGK------------ARV 114
Query: 575 TNNPENDGCINAVLL 589
T+ P DG +N VLL
Sbjct: 115 TSRPGQDGRVNGVLL 129
>gnl|CDD|235214 PRK04049, PRK04049, 30S ribosomal protein S8e; Validated.
Length = 127
Score = 61.4 bits (150), Expect = 2e-11
Identities = 32/85 (37%), Positives = 45/85 (52%), Gaps = 15/85 (17%)
Query: 505 ALRFNKAHVTHPELKATFYLPIIGVKKNPNSPMYTNLGVITKGTVIEVNISELGLVTQAG 564
LR N A+V P+ T + I+ V +NP +P Y +ITKG +IE T+ G
Sbjct: 56 LLRANYANVADPKTGKTKKVKILTVVENPANPHYARRNIITKGAIIE---------TELG 106
Query: 565 KVVWGKYAQVTNNPENDGCINAVLL 589
K A+VT+ P DG +NAVL+
Sbjct: 107 K------AKVTSRPGQDGVVNAVLI 125
>gnl|CDD|211394 cd11382, Ribosomal_S8e, Eukaryotic/archaeal ribosomal protein S8e
(RPS8). The eukaryotic/archaeal ribosomal protein S8 is
a component of the small (40S in eukaryotes, 30S in
archaea) ribosomal subunits and interacts tightly with
18S rRNA (16S rRNA in archaea, presumably).
Length = 122
Score = 53.3 bits (129), Expect = 1e-08
Identities = 27/85 (31%), Positives = 39/85 (45%), Gaps = 15/85 (17%)
Query: 505 ALRFNKAHVTHPELKATFYLPIIGVKKNPNSPMYTNLGVITKGTVIEVNISELGLVTQAG 564
ALR + +V+ P T + I+ V NP + +TKG +IE + GL
Sbjct: 53 ALRLDTGNVSWPSEGKTKKVKILDVVYNPANNELVRRNTLTKGAIIET---DRGL----- 104
Query: 565 KVVWGKYAQVTNNPENDGCINAVLL 589
A+VT+ P DG +N VLL
Sbjct: 105 -------ARVTSRPGQDGVVNGVLL 122
>gnl|CDD|129407 TIGR00307, S8e, ribosomal protein S8.e. Archaeal and eukaryotic
ribosomal protein S8. This model could easily have been
split into two models, one for eukaryotic S8 and one for
archaeal S8; eukaryotic forms invariably have in insert
of about 80 residues that archaeal forms of S8 do not
[Protein synthesis, Ribosomal proteins: synthesis and
modification].
Length = 127
Score = 50.2 bits (120), Expect = 1e-07
Identities = 29/85 (34%), Positives = 41/85 (48%), Gaps = 15/85 (17%)
Query: 505 ALRFNKAHVTHPELKATFYLPIIGVKKNPNSPMYTNLGVITKGTVIEVNISELGLVTQAG 564
LR A+V+ PE + I GV +NP + Y VITKG ++E +I
Sbjct: 56 LLRDETANVSDPETGKVKKVEIAGVVENPANNHYVRRNVITKGAIVETDIG--------- 106
Query: 565 KVVWGKYAQVTNNPENDGCINAVLL 589
YA+VT+ P DG +N VL+
Sbjct: 107 ------YARVTSRPGQDGVVNGVLI 125
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 43.9 bits (104), Expect = 3e-05
Identities = 23/72 (31%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Query: 348 RLDYEERK-RKREAREP--HKRAEKARSVRKIKAKLYNK-QRRNEKIQMKKKIKAHEEKL 403
RL+ + K +K E KR EK R + AK K Q++ +K + KKK K +K
Sbjct: 51 RLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKE 110
Query: 404 TKQKHEKKEEGA 415
K+ + EE +
Sbjct: 111 EKEGSKSSEESS 122
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 43.9 bits (103), Expect = 5e-05
Identities = 32/126 (25%), Positives = 59/126 (46%), Gaps = 24/126 (19%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKK---K 395
E R+ RL EE R++E ++ EKA+ + + K +Q E+IQ +K +
Sbjct: 59 ELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEK--QEQEEQERIQKQKEEAE 116
Query: 396 IKAHEE----KLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPI 451
+A EE +L ++KH ++ E ++ R K L ++K+ RK +
Sbjct: 117 ARAREEAERMRLEREKHFQQIE---------QERLERKKRLEEIMKRTRKSEVS------ 161
Query: 452 PKVRTQ 457
P+V+ +
Sbjct: 162 PQVKKE 167
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 42.3 bits (100), Expect = 2e-04
Identities = 30/124 (24%), Positives = 55/124 (44%), Gaps = 22/124 (17%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKA 398
++ RK Y DY R++E E K +K + K +L + + ++K
Sbjct: 48 DKFRKNYKFLDDY----REKEIEELEKALKKTKDSE-EKEELKRTLQSMKS-RLKTLKNK 101
Query: 399 HEEKLTKQKHEKKE-----EGALPVYLLDRDVQ-----------SRAKVLSNMIKQKRKE 442
E+ ++H+K+E EG P YL +++ ++K L +++KRK+
Sbjct: 102 DREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKK 161
Query: 443 KAGK 446
AGK
Sbjct: 162 NAGK 165
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 43.4 bits (103), Expect = 3e-04
Identities = 23/118 (19%), Positives = 46/118 (38%), Gaps = 28/118 (23%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYN----------------------KQRRNEK 389
ER+ +REA ++ EK R V +++A+ K+R+ EK
Sbjct: 163 AEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEK 222
Query: 390 IQMKKKIKAHEE---KLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKA 444
+ +K+ + +E +Q EK+E + R ++L + + E+
Sbjct: 223 EEAEKRRRQKQELQRAREEQIEEKEERLQE---ERAEEEAERERMLEKQAEDEELEQE 277
Score = 38.0 bits (89), Expect = 0.013
Identities = 22/102 (21%), Positives = 45/102 (44%), Gaps = 14/102 (13%)
Query: 347 RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK---------IK 397
R EE++ + + + AE+ R + K +Q EK +MK+ I+
Sbjct: 239 REEQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIE 298
Query: 398 AHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQK 439
EE+ ++ E+ EEG L + + + +A++ +Q+
Sbjct: 299 EKEERRAAEREEELEEGER---LREEEAERQARI--EEERQR 335
Score = 37.6 bits (88), Expect = 0.015
Identities = 25/121 (20%), Positives = 44/121 (36%), Gaps = 24/121 (19%)
Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARS---VRKIKAKLYNKQRRNEKIQM 392
E E+ K ++ EE E R K EK R KI K R E+ +
Sbjct: 115 EAQEKREK---QKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREA 171
Query: 393 KKKIKAHE---------EKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEK 443
+++ + E + + + E++E L D + + K+++KEK
Sbjct: 172 ERRERKEEKEREVARLRAQQEEAEDEREELDEL---RADLYQEEYER------KERQKEK 222
Query: 444 A 444
Sbjct: 223 E 223
Score = 36.4 bits (85), Expect = 0.041
Identities = 24/129 (18%), Positives = 51/129 (39%), Gaps = 28/129 (21%)
Query: 334 HNEYIERHRKLYGRRLD-YEERKRKREAREPHKRAEKARSVRKIKAKLY-------NKQR 385
+ I+ K RR+D E +R + E + E+ R + + + ++R
Sbjct: 28 EKKRIKAEEKEEERRIDEMMEEERLKALAE-EEERERKRKEERREGRAVLQEQIEEREKR 86
Query: 386 R--------NEKIQMK---KKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSN 434
R E+ QM ++I+ +E ++K EK+++ L ++
Sbjct: 87 RQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKK-------LREEID-EFNEERI 138
Query: 435 MIKQKRKEK 443
K++ KE+
Sbjct: 139 ERKEEEKER 147
Score = 35.7 bits (83), Expect = 0.070
Identities = 17/129 (13%), Positives = 49/129 (37%), Gaps = 14/129 (10%)
Query: 352 EERKRKREAREPHKR-AEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
E+++ K E +E +R E R ++ E+ + +++ + L +Q E+
Sbjct: 28 EKKRIKAEEKEEERRIDEMMEEERLKALAEEEER---ERKRKEERREGRAV-LQEQIEER 83
Query: 411 KEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGK 470
++ ++ + +++ + ++ E K E + + + + + F
Sbjct: 84 EKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREK---QKKLREEIDEF------N 134
Query: 471 TKRKAWKRM 479
+R K
Sbjct: 135 EERIERKEE 143
Score = 35.3 bits (82), Expect = 0.082
Identities = 22/112 (19%), Positives = 45/112 (40%), Gaps = 26/112 (23%)
Query: 348 RLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEK----- 402
R D + + +R+ R+ EK + ++ + K ++ R E+I+ K++ E
Sbjct: 205 RADLYQEEYERKERQ----KEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAE 260
Query: 403 ----LTKQKHEKKEEGALPVYLLDRDVQSRAK------VLSNMIKQKRKEKA 444
L KQ +++ E + R K L I++K + +A
Sbjct: 261 RERMLEKQAEDEELEQE-------NAEKRRMKRLEHRRELEQQIEEKEERRA 305
Score = 35.3 bits (82), Expect = 0.093
Identities = 26/129 (20%), Positives = 51/129 (39%), Gaps = 31/129 (24%)
Query: 338 IERHRKLYGRRLDYEERKRKREAR------------EPHKRAEKARSVRKIKA--KLYNK 383
IE K R+ +YEER +E + + EK +K++ +N+
Sbjct: 80 IEEREKR--RQEEYEERL--QEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNE 135
Query: 384 QRRNEKIQMKKKIKAHEEKL---TKQKHEKKEEGALPVYLLDRDVQSRAK-----VLSNM 435
+R K + K++ + E K+ ++K E++EE +R + K L
Sbjct: 136 ERIERKEEEKEREREEELKILEYQREKAEREEE-----REAERRERKEEKEREVARLRAQ 190
Query: 436 IKQKRKEKA 444
++ E+
Sbjct: 191 QEEAEDERE 199
Score = 30.3 bits (69), Expect = 3.7
Identities = 13/52 (25%), Positives = 27/52 (51%), Gaps = 5/52 (9%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKL 403
EE++ +R A +R E+ +++ + +Q R E+ + + +K H E L
Sbjct: 298 EEKEERRAA----EREEELEEGERLREEEAERQARIEEERQ-RLLKEHAEAL 344
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 41.0 bits (97), Expect = 0.002
Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 6/87 (6%)
Query: 332 SLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKA-----RSVRKIKAKLYNKQRR 386
L E E+ KL D + ++EA++ K A+K + +R+++ Y +
Sbjct: 548 KLKEELEEKKEKLQEEE-DKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKA 606
Query: 387 NEKIQMKKKIKAHEEKLTKQKHEKKEE 413
+E I+ +K++ EK K+K ++KE+
Sbjct: 607 HELIEARKRLNKANEKKEKKKKKQKEK 633
>gnl|CDD|225058 COG2147, RPL19A, Ribosomal protein L19E [Translation, ribosomal
structure and biogenesis].
Length = 150
Score = 38.4 bits (90), Expect = 0.003
Identities = 21/71 (29%), Positives = 32/71 (45%), Gaps = 9/71 (12%)
Query: 339 ERH-RKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI------Q 391
+RH +K GRR RK + AR P K R +R ++ +L K R + KI +
Sbjct: 65 KRHAQKKKGRRRGPGSRKGTKGARMPSKERWIKR-IRALRREL-RKLRDDGKIDKHTYRK 122
Query: 392 MKKKIKAHEEK 402
+ + K K
Sbjct: 123 LYRMAKGGAFK 133
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 40.3 bits (95), Expect = 0.003
Identities = 27/101 (26%), Positives = 51/101 (50%), Gaps = 20/101 (19%)
Query: 357 KREAREPHKRA--EKARSVRKIKAKLYN--KQRRNEKIQMKKKIKAHEEKLTKQ------ 406
K+EA K A E V K++A+L K+RRNE ++++++ EE L ++
Sbjct: 42 KKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDK 101
Query: 407 ---KHEKKEEGALPVYLLDR--DVQSRAKVLSNMIKQKRKE 442
EKKE+ L ++ ++ + + L +I ++R+E
Sbjct: 102 KEENLEKKEKE-----LSNKEKNLDEKEEELEELIAEQREE 137
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 40.0 bits (94), Expect = 0.004
Identities = 22/60 (36%), Positives = 37/60 (61%), Gaps = 7/60 (11%)
Query: 347 RRLDYEE-RKRKRE---AREPHKRAE-KARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEE 401
++L EE + R+ E RE R E KA+ ++KIK+K Y++ + EK +K+++K EE
Sbjct: 192 KKLSPEEAKARRAELRKMRELLFREEAKAKRIKKIKSKKYHRVHKKEK--LKEELKEFEE 249
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 39.6 bits (92), Expect = 0.005
Identities = 23/113 (20%), Positives = 44/113 (38%), Gaps = 12/113 (10%)
Query: 355 KRKREAREPHKRAEKAR-----SVRKIKAKL-YNKQRRNEKIQMKKKIKAHEEKLTKQKH 408
K++ E R+ + A+ S K ++ N++R EK Q++ K E K K
Sbjct: 245 KQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEAL--KAKD 302
Query: 409 EKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAE 461
K + + + A+ ++KR+ A + P+V Q +
Sbjct: 303 HKAFDLKQES----KASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSL 351
Score = 30.0 bits (67), Expect = 4.9
Identities = 17/91 (18%), Positives = 34/91 (37%), Gaps = 2/91 (2%)
Query: 356 RKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK--EE 413
++RE++E KRA++ + K +K ++ K +E KQ+ K +
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKP 263
Query: 414 GALPVYLLDRDVQSRAKVLSNMIKQKRKEKA 444
D+ V K + + K+
Sbjct: 264 ADTSSPKEDKQVAENQKREIEKAQIEIKKND 294
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 39.0 bits (91), Expect = 0.007
Identities = 35/149 (23%), Positives = 59/149 (39%), Gaps = 25/149 (16%)
Query: 339 ERHRKLYGRRLDYEERKRK-----REAREPHKRAEKARSVRKIKAKLY--NKQRRNE--- 388
ER ++L RL +E+K++ ++A K+AE+A + AK + +R
Sbjct: 101 ERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAA 160
Query: 389 -KIQMKKKIKAHEEKLTKQKHE-KKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
K + K KA E K E KK+ A + + +A+ + K+K +A K
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEA---KKKAAAEAKK 217
Query: 447 WEVPIPKVRTQADAEVFKIVRSGKTKRKA 475
+A AE + KA
Sbjct: 218 ----------KAAAEAKAAAAKAAAEAKA 236
Score = 34.0 bits (78), Expect = 0.24
Identities = 24/110 (21%), Positives = 42/110 (38%), Gaps = 12/110 (10%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
E+RK+K + + + ++A ++K K+R Q +KK K K ++
Sbjct: 79 EQRKKKEQQQAEELQQKQAAEQERLKQL--EKER--LAAQEQKKQAEEAAKQAALKQKQA 134
Query: 412 EEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAE 461
EE A + A + KR A K K + +A+A
Sbjct: 135 EEAA--------AKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAA 176
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 39.2 bits (91), Expect = 0.007
Identities = 25/134 (18%), Positives = 51/134 (38%), Gaps = 16/134 (11%)
Query: 347 RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQ---RRNEKIQMKKKIKAHEEKL 403
+ L+Y + K K E E + + + + L + + E K++++ EE L
Sbjct: 209 KALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEIL 268
Query: 404 TKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVF 463
+ E KEE + + K+L+ ++ + E K+ + +
Sbjct: 269 AQVLKENKEEEK-----EKKLQEEELKLLAKEEEELKSELL--------KLERRKVDDEE 315
Query: 464 KIVRSGKTKRKAWK 477
K+ S K +K K
Sbjct: 316 KLKESEKELKKLEK 329
Score = 34.9 bits (80), Expect = 0.16
Identities = 17/78 (21%), Positives = 35/78 (44%), Gaps = 1/78 (1%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKA 398
+ + + L+ EE + +E +K EK + +++ + KL K+ K ++ K +
Sbjct: 251 QEEIESSKQELEKEEEILAQVLKE-NKEEEKEKKLQEEELKLLAKEEEELKSELLKLERR 309
Query: 399 HEEKLTKQKHEKKEEGAL 416
+ K K +KE L
Sbjct: 310 KVDDEEKLKESEKELKKL 327
Score = 33.4 bits (76), Expect = 0.47
Identities = 16/94 (17%), Positives = 36/94 (38%), Gaps = 4/94 (4%)
Query: 351 YEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
E+++K+E + + + I + Q K Q KK E ++ +
Sbjct: 166 SREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKA----LEYYQLKEKLE 221
Query: 411 KEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKA 444
EE L + + R +L +++ +++E
Sbjct: 222 LEEENLLYLDYLKLNEERIDLLQELLRDEQEEIE 255
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 39.3 bits (91), Expect = 0.007
Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 6/79 (7%)
Query: 336 EYIERHRKLYGRRLDYEE-RKRKRE---AREPHKRAE-KARSVRKIKAKLYNKQRRNEKI 390
+++ +L +L EE +R R+ RE R E KA+ V KIK+K Y K R+N K
Sbjct: 353 SALKKQEELALNKLSVEEVAERTRQLRFMRELMFREERKAKRVAKIKSKTYRKIRKNRKE 412
Query: 391 QMKKKIKAHEEKLTKQKHE 409
+ + + E L +K E
Sbjct: 413 K-EMALIPKSEDLENEKSE 430
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 37.8 bits (88), Expect = 0.008
Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 14/82 (17%)
Query: 334 HNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMK 393
E E R EERK+ E RE ++ E+ + K K +++ E+
Sbjct: 20 QREAEEEER---------EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREE---- 66
Query: 394 KKIKAHEE-KLTKQKHEKKEEG 414
+ K EE + K +EEG
Sbjct: 67 QARKEQEEYEKLKSSFVVEEEG 88
Score = 31.6 bits (72), Expect = 0.79
Identities = 13/67 (19%), Positives = 31/67 (46%), Gaps = 5/67 (7%)
Query: 352 EERKRKREAREPHKRAEKA-----RSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQ 406
+KR + + +R ++ R RK + +R+ E+ +++ K EE+ K+
Sbjct: 4 GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63
Query: 407 KHEKKEE 413
+ E+ +
Sbjct: 64 REEQARK 70
Score = 31.6 bits (72), Expect = 0.79
Identities = 15/62 (24%), Positives = 33/62 (53%), Gaps = 1/62 (1%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
EE++ +R+ RE + + R + K + K+ E + ++K K EE+ +++ +K
Sbjct: 12 EEKQARRQQREAEEEEREERKKLEEKREGERKEEE-ELEEEREKKKEEEERKEREEQARK 70
Query: 412 EE 413
E+
Sbjct: 71 EQ 72
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 38.3 bits (89), Expect = 0.010
Identities = 25/119 (21%), Positives = 51/119 (42%), Gaps = 19/119 (15%)
Query: 347 RRLDYEERKRKREAR----EPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEK 402
+R + R+++ E R + K+AE+A + K K + + + + K K +A EK
Sbjct: 83 QRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEK 142
Query: 403 LTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAE 461
K++ +K+ E + + K+K E K E + + +A+A+
Sbjct: 143 KAKEEAKKQAE------------EEAKAKAAAEAKKKAAEAKKKAE---AEAKAKAEAK 186
Score = 37.5 bits (87), Expect = 0.019
Identities = 27/127 (21%), Positives = 49/127 (38%), Gaps = 9/127 (7%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKI-- 396
R ++L R + K+ +A + + +K K K K + + + K K
Sbjct: 89 ARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEA 148
Query: 397 --KAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKV 454
+A EE K E K++ A + + ++AK K K K + K + K
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKAEAE--AKAKAE---AKAKAKAEEAKAKAEAAKA 203
Query: 455 RTQADAE 461
+ A+A
Sbjct: 204 KAAAEAA 210
Score = 31.3 bits (71), Expect = 1.7
Identities = 20/112 (17%), Positives = 46/112 (41%), Gaps = 14/112 (12%)
Query: 352 EERKRKREAREPHKRAEKARS--VRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHE 409
++ R ++ ++P + E+ R + + + KQR E+ + K+ E++ +K
Sbjct: 50 QQANRIQQQKKPAAKKEQERQKKLEQQAEEA-EKQRAAEQARQKEL----EQRAAAEKAA 104
Query: 410 KKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAE 461
K+ E A + AK ++ E K E + + + +A+
Sbjct: 105 KQAEQAAKQAEEKQKQAEEAK------AKQAAEAKAKAE-AEAEKKAKEEAK 149
Score = 30.6 bits (69), Expect = 2.9
Identities = 22/109 (20%), Positives = 46/109 (42%), Gaps = 9/109 (8%)
Query: 353 ERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKE 412
E++R+++ + + AEK R A+ ++ ++ +K K E+ KQ EK++
Sbjct: 66 EQERQKKLEQQAEEAEKQR-----AAEQARQKELEQRAAAEKAAKQ-AEQAAKQAEEKQK 119
Query: 413 EGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAE 461
+ + ++K KE+A K K + A+A+
Sbjct: 120 QAEEAKA---KQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAK 165
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 38.5 bits (90), Expect = 0.014
Identities = 27/113 (23%), Positives = 49/113 (43%), Gaps = 20/113 (17%)
Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK 395
E E + KL +Y + R+ E R + E+ + + +L K+ R E++ KKK
Sbjct: 290 EKAEEYIKLSEFYEEYLDELREIEKRL-SRLEEEINGIEERIKELEEKEERLEEL--KKK 346
Query: 396 IKAHEEKLT--KQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
+K E++L +++HE EE AK +++ +K G
Sbjct: 347 LKELEKRLEELEERHELYEE---------------AKAKKEELERLKKRLTGL 384
Score = 37.7 bits (88), Expect = 0.019
Identities = 23/78 (29%), Positives = 39/78 (50%), Gaps = 1/78 (1%)
Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMK-K 394
E +R +L R YEE K K+E E K+ + K++ +L ++ E+I+ +
Sbjct: 349 ELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEIS 408
Query: 395 KIKAHEEKLTKQKHEKKE 412
KI A +L K+ E K+
Sbjct: 409 KITARIGELKKEIKELKK 426
Score = 35.4 bits (82), Expect = 0.099
Identities = 15/83 (18%), Positives = 36/83 (43%), Gaps = 7/83 (8%)
Query: 339 ERHRKLYGRRLDYE---ERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQ---- 391
E + +L L+ R +R E +++ K+K +L +++ ++++
Sbjct: 659 EEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEK 718
Query: 392 MKKKIKAHEEKLTKQKHEKKEEG 414
++++ EK+ K K KE
Sbjct: 719 ALERVEELREKVKKYKALLKERA 741
Score = 34.7 bits (80), Expect = 0.16
Identities = 19/82 (23%), Positives = 43/82 (52%), Gaps = 14/82 (17%)
Query: 332 SLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQ 391
+NEY+E K + L+ EE++ K+ E + K +L ++R E++
Sbjct: 599 PFYNEYLEL--KDAEKELEREEKELKKLEEE----------LDKAFEELAETEKRLEEL- 645
Query: 392 MKKKIKAHEEKLTKQKHEKKEE 413
+K+++ E+K +++++E+ E
Sbjct: 646 -RKELEELEKKYSEEEYEELRE 666
Score = 32.7 bits (75), Expect = 0.73
Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 13/83 (15%)
Query: 346 GRRLDYEERKR---------KREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI----QM 392
GR L E RK KR +E + EK R +RK +L ++ ++ ++
Sbjct: 442 GRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKEL 501
Query: 393 KKKIKAHEEKLTKQKHEKKEEGA 415
+++K EEKL K E+ E+ A
Sbjct: 502 AEQLKELEEKLKKYNLEELEKKA 524
Score = 31.6 bits (72), Expect = 1.5
Identities = 33/125 (26%), Positives = 55/125 (44%), Gaps = 15/125 (12%)
Query: 331 MSLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKAR-SVRKIKAKLYNKQRRNEK 389
S E E KL + EE K + E E K E S RK++ K+ + R E+
Sbjct: 213 SSELPELREELEKLEKEVKELEELKEEIEELE--KELESLEGSKRKLEEKIRELEERIEE 270
Query: 390 IQMKKKIKAHEEK---LTKQKHEKKEEGALPVYLLD-----RDVQSRAKVLSNMIK--QK 439
+KK+I+ EEK L + K + +E L + + R+++ R L I ++
Sbjct: 271 --LKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEE 328
Query: 440 RKEKA 444
R ++
Sbjct: 329 RIKEL 333
Score = 31.2 bits (71), Expect = 2.3
Identities = 17/82 (20%), Positives = 41/82 (50%), Gaps = 3/82 (3%)
Query: 349 LDYEERKRK-REAREPHKRAEKARS-VRKIKAKLYNKQR-RNEKIQMKKKIKAHEEKLTK 405
+ EE ++K E + ++ K + ++ +K +L + + + +++KK+ EE+L +
Sbjct: 515 YNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAE 574
Query: 406 QKHEKKEEGALPVYLLDRDVQS 427
E +E G V L+ ++
Sbjct: 575 LLKELEELGFESVEELEERLKE 596
Score = 30.4 bits (69), Expect = 3.6
Identities = 22/103 (21%), Positives = 40/103 (38%), Gaps = 25/103 (24%)
Query: 335 NEYIERHRKLYGRRLDYEERKRKREAR--------------EPHKRAE--KARS----VR 374
IER K R + EE +++E P R E K +
Sbjct: 175 KRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELE 234
Query: 375 KIKAKLYNKQRRNEKIQMKK-----KIKAHEEKLTKQKHEKKE 412
++K ++ ++ E ++ K KI+ EE++ + K E +E
Sbjct: 235 ELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEE 277
Score = 29.3 bits (66), Expect = 7.6
Identities = 26/128 (20%), Positives = 51/128 (39%), Gaps = 21/128 (16%)
Query: 333 LHNEYIERHRKLYGRRLDYEER---------------KRKREAREPHKRAEKARSVRKIK 377
+ EY++ R++ R EE +R E ++ K EK + +
Sbjct: 301 FYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEER 360
Query: 378 AKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK----EEGALPVYLLDRDVQSRAKVLS 433
+LY + + K + +++K LT +K EK+ E+ + + +R L
Sbjct: 361 HELYEEAKA--KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELK 418
Query: 434 NMIKQKRK 441
IK+ +K
Sbjct: 419 KEIKELKK 426
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 37.5 bits (87), Expect = 0.020
Identities = 19/73 (26%), Positives = 33/73 (45%), Gaps = 4/73 (5%)
Query: 346 GRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKA---HEEK 402
GR+ ++E R+ KRE + K A + ++ +R EK + KK H
Sbjct: 343 GRQSEWEARQAKREGGD-AKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSW 401
Query: 403 LTKQKHEKKEEGA 415
K+K ++K+ A
Sbjct: 402 EAKKKAKEKKANA 414
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 36.3 bits (84), Expect = 0.025
Identities = 19/67 (28%), Positives = 35/67 (52%), Gaps = 3/67 (4%)
Query: 350 DYEERKRKREAREPHKRAEKARSVRKIKAKL-YNKQRRNEKIQ--MKKKIKAHEEKLTKQ 406
D+ +KRKRE + + K R V + + Y R+ + Q +++K K ++K+ +
Sbjct: 139 DWRAQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKVKEA 198
Query: 407 KHEKKEE 413
K +KEE
Sbjct: 199 KRREKEE 205
Score = 30.5 bits (69), Expect = 2.0
Identities = 28/125 (22%), Positives = 55/125 (44%), Gaps = 21/125 (16%)
Query: 333 LHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKL--YNKQRRNEKI 390
L E E + L + E+++++EAR + AE A+++ K+ + + Q+R
Sbjct: 90 LEEEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKR---- 145
Query: 391 QMKKKIKAHEEKLTKQK--HEKKEEGALPV---------YLLDRDVQSRAKVLSNMIKQK 439
K++ KA K K++ E +E V L ++ + + KV K++
Sbjct: 146 --KREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKV--KEAKRR 201
Query: 440 RKEKA 444
KE+
Sbjct: 202 EKEEK 206
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 37.4 bits (86), Expect = 0.030
Identities = 24/117 (20%), Positives = 48/117 (41%), Gaps = 9/117 (7%)
Query: 339 ERHRKLYGRRLDYEERK--RKREAREPHKRAEKARSVRKIKAKLYNKQRRNE----KIQM 392
+ +K ++ E +K K++A E K+AE+A+ + K K +++ + K +
Sbjct: 1281 DELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEE 1340
Query: 393 KKK---IKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
KK E + + E EE A + + +A ++K+K K
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAK 1397
Score = 37.0 bits (85), Expect = 0.042
Identities = 24/97 (24%), Positives = 47/97 (48%)
Query: 347 RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQ 406
++ + +E+K ++ + A+KA ++K +A+ K +K + + KIKA E K +
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE 1740
Query: 407 KHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEK 443
+ +KK E A + + K ++ RKEK
Sbjct: 1741 EDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEK 1777
Score = 36.7 bits (84), Expect = 0.052
Identities = 31/121 (25%), Positives = 54/121 (44%), Gaps = 15/121 (12%)
Query: 339 ERHRKLYGRRLDYEERK----RKREAREPHKRAE--KARSVRKIKAKLYNKQRRNEKIQM 392
E+ ++ E +K +K+EA E K E KA KIKA+ K+ +K +
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDK-KK 1745
Query: 393 KKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIP 452
++ K EE+ K H KKEE ++ + K +I+++ E+ K + +
Sbjct: 1746 AEEAKKDEEEKKKIAHLKKEE--------EKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797
Query: 453 K 453
K
Sbjct: 1798 K 1798
Score = 36.3 bits (83), Expect = 0.070
Identities = 39/149 (26%), Positives = 67/149 (44%), Gaps = 14/149 (9%)
Query: 338 IERHRKLYG--RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI--QMK 393
IE KLY +++ EE K+ EA+ + +KA +K +L K+ +K ++K
Sbjct: 1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELK 1653
Query: 394 K-----KIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRK--EKAGK 446
K KIKA EE ++ +KK E A + D + A+ L ++ +K E K
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKA---EEDEKKAAEALKKEAEEAKKAEELKKK 1710
Query: 447 WEVPIPKVRTQADAEVFKIVRSGKTKRKA 475
K AE +++ + K++A
Sbjct: 1711 EAEEKKKAEELKKAEEENKIKAEEAKKEA 1739
Score = 36.3 bits (83), Expect = 0.073
Identities = 24/128 (18%), Positives = 56/128 (43%), Gaps = 12/128 (9%)
Query: 352 EERKRK----REAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQK 407
+E K+K ++A E K+AE+A+ + K K ++ +E + ++ K ++ K
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAA 1506
Query: 408 HEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVR 467
KK+ + +A + K+ ++A K E + + + + + ++
Sbjct: 1507 EAKKKADEAK----KAEEAKKADEAKKAEEAKKADEAKKAE----EKKKADELKKAEELK 1558
Query: 468 SGKTKRKA 475
+ K+KA
Sbjct: 1559 KAEEKKKA 1566
Score = 35.9 bits (82), Expect = 0.086
Identities = 27/128 (21%), Positives = 50/128 (39%), Gaps = 12/128 (9%)
Query: 348 RLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQK 407
EE K+K A K+AE+ + + K K +++ ++++ K ++ K+
Sbjct: 1370 EKKKEEAKKK--ADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKA 1427
Query: 408 HEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVR 467
EKK+ + AK K+ E+A K E K A+ K +
Sbjct: 1428 EEKKKADEAKKKAEEAKKADEAK--------KKAEEAKKAEEAKKKAEEAKKADEAK--K 1477
Query: 468 SGKTKRKA 475
+ +KA
Sbjct: 1478 KAEEAKKA 1485
Score = 34.7 bits (79), Expect = 0.18
Identities = 28/125 (22%), Positives = 53/125 (42%), Gaps = 22/125 (17%)
Query: 352 EERKRKREARE-PHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
EE K+ E + ++AE+A+ + + + K EK ++ K EE K + K
Sbjct: 1567 EEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELK 1626
Query: 411 KEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGK 470
K E + + KV +K+K E+ K E +++ AE +++ +
Sbjct: 1627 KAE------------EEKKKV--EQLKKKEAEEKKKAE----ELKK---AEEENKIKAAE 1665
Query: 471 TKRKA 475
+KA
Sbjct: 1666 EAKKA 1670
Score = 34.7 bits (79), Expect = 0.18
Identities = 25/128 (19%), Positives = 52/128 (40%), Gaps = 16/128 (12%)
Query: 352 EERKRK----REAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQK 407
+E K+K ++A E K+AE+A+ + K K ++ E+ + K + ++ K+
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEA-KKAEEAKKKAEEAKKADEAKKKA 1479
Query: 408 HEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVR 467
E K+ + AK ++ K+ + K E + +AD +
Sbjct: 1480 EEAKKADEA------KKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE-----AK 1528
Query: 468 SGKTKRKA 475
+ +KA
Sbjct: 1529 KAEEAKKA 1536
Score = 34.7 bits (79), Expect = 0.18
Identities = 25/118 (21%), Positives = 48/118 (40%), Gaps = 8/118 (6%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKA 398
E +K + E++ E K+AE+AR +K K+ + E+ + ++ K
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620
Query: 399 HEEKLTKQKHEKKEEGALPVYLLD--------RDVQSRAKVLSNMIKQKRKEKAGKWE 448
E+L K + EKK+ L + + + K+ + +K +E K E
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678
Score = 34.3 bits (78), Expect = 0.26
Identities = 31/144 (21%), Positives = 56/144 (38%), Gaps = 19/144 (13%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAE---KARSVRKIKAKLYNKQRRNEKIQMKKK 395
K+ L E ++K+ + K AE KA ++K + + K K + K
Sbjct: 1615 AEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674
Query: 396 IKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVR 455
KA E K ++ +K E L ++ + K +K+K E+ K E +++
Sbjct: 1675 KKAEEAKKAEEDEKKAAEA------LKKEAEEAKKA--EELKKKEAEEKKKAE----ELK 1722
Query: 456 TQADAEVFKIVRSGKT----KRKA 475
+ K + K K+KA
Sbjct: 1723 KAEEENKIKAEEAKKEAEEDKKKA 1746
Score = 34.0 bits (77), Expect = 0.35
Identities = 30/147 (20%), Positives = 55/147 (37%), Gaps = 14/147 (9%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIK----AKLYNKQRRNEKIQMKK 394
E +K + E +K EA+ +AE+ R+ +I+ A++ + RR I+ ++
Sbjct: 1221 EDAKKAEAVKKAEEAKKDAEEAK----KAEEERNNEEIRKFEEARMAHFARRQAAIKAEE 1276
Query: 395 KIKAHE----EKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVP 450
KA E E+ K KK E + + K ++ K+K +E K +
Sbjct: 1277 ARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKK--ADEAKKKAEEAKKKADAA 1334
Query: 451 IPKVRTQADAEVFKIVRSGKTKRKAWK 477
K A + +A
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEA 1361
Score = 33.6 bits (76), Expect = 0.46
Identities = 22/81 (27%), Positives = 42/81 (51%), Gaps = 4/81 (4%)
Query: 339 ERHRKLYGRRLDYEERKRKREAR---EPHKRAEKARSVRKIKAKLYNKQRRNE-KIQMKK 394
E +K ++ E +K+ EA+ E K+AE+A+ + K K ++ +E K + ++
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
Query: 395 KIKAHEEKLTKQKHEKKEEGA 415
KA E K ++ +KK + A
Sbjct: 1482 AKKADEAKKKAEEAKKKADEA 1502
Score = 33.2 bits (75), Expect = 0.58
Identities = 24/119 (20%), Positives = 56/119 (47%), Gaps = 21/119 (17%)
Query: 352 EERKRKREAR---EPHKRAEKARSVRKIK----------AKLYNKQRRNEKIQMKKKIKA 398
+E K+ EA+ + K+AE+A+ + K AK ++++ ++++ +++K
Sbjct: 1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKK 1559
Query: 399 HEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQ 457
EEK ++ +K EE D+++ R + ++ R E+ K K++ +
Sbjct: 1560 AEEKKKAEEAKKAEE--------DKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE 1610
Score = 32.8 bits (74), Expect = 0.74
Identities = 40/173 (23%), Positives = 68/173 (39%), Gaps = 37/173 (21%)
Query: 339 ERHRKLYGRRLDYEERKRKREAR--EPHKRAEKARSVRKIK-----AKLYNKQRRNEKI- 390
E RK R EE ++ EAR E K+AE + + K AK ++R NE+I
Sbjct: 1197 EDARKAEAAR-KAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIR 1255
Query: 391 -----------QMKKKIKAHE----EKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNM 435
+ + IKA E ++L K + +KK + A + + + K
Sbjct: 1256 KFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEA-------KKAEEKKKADEAK 1308
Query: 436 IKQKRKEKAGKWEVPIPKVRTQADA------EVFKIVRSGKTKRKAWKRMITK 482
K + +KA + + + + +ADA E K + K + +A
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361
Score = 32.0 bits (72), Expect = 1.3
Identities = 21/98 (21%), Positives = 41/98 (41%), Gaps = 2/98 (2%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIK-AKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
E ++ E R+ + KA RK + A+ ++R+ E+ + + K E ++ +K
Sbjct: 1179 EAARKAEEVRKAEE-LRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKK 1237
Query: 411 KEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWE 448
E A + + R + M R++ A K E
Sbjct: 1238 DAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAE 1275
Score = 31.6 bits (71), Expect = 1.6
Identities = 32/127 (25%), Positives = 56/127 (44%), Gaps = 9/127 (7%)
Query: 352 EERKRKREA--REPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKK-KIKAHEEKLTKQKH 408
EE K+ EA E A++A + + KA+ K++ K + K KA E+K +
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEE-KAEAAEKKKEEAKKKADAAKKKAEEKKKADEAK 1397
Query: 409 EKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRS 468
+K EE L + ++ K ++ K+K +EK K + K A+ K +
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKK--ADEAKKKAEEKK-KADEAKKKAEEAKKADEAK--KK 1452
Query: 469 GKTKRKA 475
+ +KA
Sbjct: 1453 AEEAKKA 1459
Score = 31.6 bits (71), Expect = 1.9
Identities = 28/127 (22%), Positives = 47/127 (37%), Gaps = 8/127 (6%)
Query: 352 EERKRKREAREPHKRAE---KARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKH 408
E EA ++AE K + K KA K+ +K + K KA E+K +
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADEL 1410
Query: 409 EKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRS 468
+K + + + K +K+ E+A K + K AE K +
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADE---AKKKAEEAKKADEAKKKAEEAKKAEEAK--KK 1465
Query: 469 GKTKRKA 475
+ +KA
Sbjct: 1466 AEEAKKA 1472
Score = 31.3 bits (70), Expect = 2.2
Identities = 32/143 (22%), Positives = 53/143 (37%), Gaps = 26/143 (18%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAE---KARSVRKIKAKLYNKQRRNEKIQMKKK 395
E RK R+ + R++ E ++A+ KA + KA K +K KK
Sbjct: 1252 EEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKK--KADEAKKAEEKKKADEAKK 1309
Query: 396 IKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVR 455
KA E K + +K EE AK ++ K+K +E E +
Sbjct: 1310 -KAEEAKKADEAKKKAEE---------------AKKKADAAKKKAEEAKKAAEAAKAEAE 1353
Query: 456 TQADAEVFKIVRSGKTKRKAWKR 478
AD + + K +A ++
Sbjct: 1354 AAADE-----AEAAEEKAEAAEK 1371
Score = 30.5 bits (68), Expect = 3.9
Identities = 27/129 (20%), Positives = 57/129 (44%), Gaps = 9/129 (6%)
Query: 352 EERKR----KREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAH-EEKLTKQ 406
EE+K+ K++A E K+A++ + K K +++ E+ + + K EE
Sbjct: 1388 EEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKAD 1447
Query: 407 KHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIV 466
+ +KK E A + + K K + +KA + + + + +AD +
Sbjct: 1448 EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKAD----EAK 1503
Query: 467 RSGKTKRKA 475
++ + K+KA
Sbjct: 1504 KAAEAKKKA 1512
Score = 29.7 bits (66), Expect = 6.2
Identities = 31/140 (22%), Positives = 59/140 (42%), Gaps = 7/140 (5%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIK-AKLYNKQRRNEKIQMKKKIK 397
E +K + EE ++ EA++ + A KA RK + A+ + R+ E + + +
Sbjct: 1101 EEAKKTETGKA--EEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIAR 1158
Query: 398 AHEEKLTKQKHEKKEEGALPVYLLDRDVQSR-AKVLSNMIKQKRKEKAGKWE--VPIPKV 454
E+ K + +K E A + + R A+ L ++ E A K E +
Sbjct: 1159 KAEDA-RKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEA 1217
Query: 455 RTQADAEVFKIVRSGKTKRK 474
R DA+ + V+ + +K
Sbjct: 1218 RKAEDAKKAEAVKKAEEAKK 1237
Score = 29.7 bits (66), Expect = 6.4
Identities = 20/79 (25%), Positives = 38/79 (48%), Gaps = 5/79 (6%)
Query: 339 ERHRKLYGRRLDYEERKRKREAR----EPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKK 394
E +K + EE K+ EA+ E K+A++A+ + K K ++ ++ E+ +
Sbjct: 1467 EEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA-DEAKKAEEAKKAD 1525
Query: 395 KIKAHEEKLTKQKHEKKEE 413
+ K EE + +K EE
Sbjct: 1526 EAKKAEEAKKADEAKKAEE 1544
>gnl|CDD|236295 PRK08570, rpl19e, 50S ribosomal protein L19e; Reviewed.
Length = 150
Score = 35.2 bits (82), Expect = 0.032
Identities = 20/53 (37%), Positives = 25/53 (47%), Gaps = 3/53 (5%)
Query: 339 ERHRKL-YGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI 390
ERH K GRR RK K+ AR P K R +R ++ L + R KI
Sbjct: 65 ERHEKRKKGRRRGPGSRKGKKGARTPKKERWINR-IRALRRYL-RELRDEGKI 115
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 36.3 bits (84), Expect = 0.051
Identities = 22/89 (24%), Positives = 37/89 (41%), Gaps = 14/89 (15%)
Query: 343 KLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEK-----------IQ 391
+ + +R + E KRKRE ++ KAR R+ +KL + E +
Sbjct: 40 ENWKKRQEEAEAKRKREELR--EKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKK 97
Query: 392 MKKKIKAHE-EKLTKQKHEKKEEGALPVY 419
KK+ K E E+ ++KE+ Y
Sbjct: 98 SKKRQKKKEAERKKALLLDEKEKERAAEY 126
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 36.2 bits (82), Expect = 0.056
Identities = 32/147 (21%), Positives = 70/147 (47%), Gaps = 7/147 (4%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKA 398
++ +K + + E++ ++A E R +A + ++ + +KQ + E+ +++K+ +
Sbjct: 76 KKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEK 135
Query: 399 HEEKLTKQKHEKKEEGALPVYLLDRDVQSR-----AKVLSNMIKQKRKEKAGKWEVPIPK 453
E L K + EKK+E A+ ++ +++ A V ++ +K P PK
Sbjct: 136 KEADLKKLQAEKKKEKAVKAEKAEKAEKTKKASTPAPVEEEIVVKKVANDRSAAPAPEPK 195
Query: 454 VRTQADAEVFKIVR--SGKTKRKAWKR 478
T AE + V+ +GK +K K+
Sbjct: 196 TPTNTPAEPAEQVQEITGKKNKKNKKK 222
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 35.9 bits (84), Expect = 0.072
Identities = 23/101 (22%), Positives = 51/101 (50%), Gaps = 20/101 (19%)
Query: 357 KREAREPHKRAE-KARS-VRKIKAKLYN--KQRRNEKIQMKKKIKAHEEKLTKQKH--EK 410
K+EA K A +A+ + K++ + ++RRNE +++K++ EE L ++ EK
Sbjct: 48 KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEK 107
Query: 411 KEEGALPVYLLDR---------DVQSRAKVLSNMIKQKRKE 442
+EE L + +++ + + L +I+++ +E
Sbjct: 108 REE-----ELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
Score = 32.4 bits (75), Expect = 0.91
Identities = 32/150 (21%), Positives = 70/150 (46%), Gaps = 18/150 (12%)
Query: 341 HRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRN----EKIQMKKKI 396
+K+ ++ E + KR E K AE + ++AK + RN E + + ++
Sbjct: 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNEL 84
Query: 397 KAHEEKLTKQKHE---KKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPK 453
+ E++L QK E +K E + + +++ + K L +++++ K E + +
Sbjct: 85 QKLEKRL-LQKEENLDRKLEL---LEKREEELEKKEKELE-----QKQQELEKKEEELEE 135
Query: 454 VRTQADAEVFKIVRSGKTKRKAWKRMITKV 483
+ + E+ +I SG T +A + ++ KV
Sbjct: 136 LIEEQLQELERI--SGLTAEEAKEILLEKV 163
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 35.4 bits (82), Expect = 0.11
Identities = 28/124 (22%), Positives = 49/124 (39%), Gaps = 12/124 (9%)
Query: 332 SLHNEYIERHRKLYGRRLDYEER-KRKREAREPHKRAEKARS---------VRKIKAKLY 381
L EY E +L R + EE K E R+ K + R++
Sbjct: 354 KLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQE 413
Query: 382 NKQRRNEKI-QMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKR 440
QR +E++ + I E K+ + + E+KE+ AL + + ++ A LS ++
Sbjct: 414 ELQRLSEELADLNAAIAGIEAKINELE-EEKEDKALEIKKQEWKLEQLAADLSKYEQELY 472
Query: 441 KEKA 444
K
Sbjct: 473 DLKE 476
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 35.1 bits (81), Expect = 0.12
Identities = 31/124 (25%), Positives = 58/124 (46%), Gaps = 20/124 (16%)
Query: 337 YIERHRKLYGR--RLDYEERKRKREAREPHKRAEKARS-VRKIKAKLYNKQRRNEKIQMK 393
Y +R +KL RL+ E + KRE E + EK S + + + ++ +K R++ +I+ +
Sbjct: 420 YEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRAR 479
Query: 394 -KKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIP 452
++I E+L K+ EKK+ V L+R +++ RK + P+
Sbjct: 480 DRRI----ERLEKELEEKKKR----VEELERK--------LAELRKMRKLELSGKGTPVK 523
Query: 453 KVRT 456
V
Sbjct: 524 VVEK 527
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 34.5 bits (79), Expect = 0.18
Identities = 17/63 (26%), Positives = 25/63 (39%), Gaps = 1/63 (1%)
Query: 353 ERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKE 412
K+K EA E AEKA++ + KAK K + + + K K K
Sbjct: 191 AAKKKAEA-EAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAA 249
Query: 413 EGA 415
+ A
Sbjct: 250 KAA 252
Score = 33.4 bits (76), Expect = 0.36
Identities = 24/124 (19%), Positives = 56/124 (45%), Gaps = 12/124 (9%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
++ R + + K+ E+ R ++ + K ++ + + K+++ +E+L Q+ +K+
Sbjct: 62 QQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLE--KERLKAQEQQKQ 119
Query: 412 EEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKT 471
E A L++ Q + ++K+ E A K + A+A K + +
Sbjct: 120 AEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAA--------KAKAAAEAAKLKA--AAEA 169
Query: 472 KRKA 475
K+KA
Sbjct: 170 KKKA 173
Score = 33.0 bits (75), Expect = 0.45
Identities = 21/67 (31%), Positives = 29/67 (43%), Gaps = 3/67 (4%)
Query: 352 EERKRKREAREPHKRAEKARSVR---KIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKH 408
E + + A E K+AE+A K KA+ +++ E KA E K K
Sbjct: 158 AEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKA 217
Query: 409 EKKEEGA 415
EKK E A
Sbjct: 218 EKKAEAA 224
Score = 31.8 bits (72), Expect = 1.2
Identities = 30/132 (22%), Positives = 52/132 (39%), Gaps = 22/132 (16%)
Query: 353 ERKRKRE-------AREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTK 405
E++R ++ A+E K+AE+A +++ K +Q R + KKK E K
Sbjct: 99 EQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKK---AEAAKAK 155
Query: 406 QKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAE--VF 463
E + A + AK + + +E K E K + +A+A+
Sbjct: 156 AAAEAAKLKA----------AAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAE 205
Query: 464 KIVRSGKTKRKA 475
K + K KA
Sbjct: 206 KAKAEAEAKAKA 217
>gnl|CDD|238268 cd00481, Ribosomal_L19e, Ribosomal protein L19e. L19e is found in
the large ribosomal subunit of eukaryotes and archaea.
L19e is distinct from the ribosomal subunit L19, which
is found in prokaryotes. It consists of two small
globular domains connected by an extended segment. It is
located toward the surface of the large subunit, with
one exposed end involved in forming the intersubunit
bridge with the small subunit. The other exposed end is
involved in forming the translocon binding site, along
with L22, L23, L24, L29, and L31e subunits.
Length = 145
Score = 33.0 bits (76), Expect = 0.18
Identities = 20/71 (28%), Positives = 29/71 (40%), Gaps = 9/71 (12%)
Query: 339 ERHRKL-YGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI------Q 391
+RH GR RK + AR P K R +R ++ L K R + KI +
Sbjct: 62 KRHEARRKGRHRGPGSRKGTKGARMPSKELWIRR-IRALRRLL-KKLRDSGKIDKHTYRE 119
Query: 392 MKKKIKAHEEK 402
+ K K + K
Sbjct: 120 LYLKAKGNVFK 130
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 34.2 bits (79), Expect = 0.18
Identities = 14/87 (16%), Positives = 32/87 (36%), Gaps = 12/87 (13%)
Query: 334 HNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKA----RSVRKIKAKLYN-KQRRNE 388
N E +R +L K E + R ++K + K++R E
Sbjct: 280 ENLLYENYRT---EKL--SGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLE 334
Query: 389 KI--QMKKKIKAHEEKLTKQKHEKKEE 413
++ + ++ K KL + + + ++
Sbjct: 335 ELEQNLIEERKELNSKLEEIQKKLEDL 361
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 32.7 bits (75), Expect = 0.18
Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 13/70 (18%)
Query: 354 RKRKREARE----PHKRAE------KARSVRKIKAKLYNKQR-RNEKIQMKKKIKAHEEK 402
+ +K+ E R E K + R+ KA+ K++ R E+I+ +K+I+ EE+
Sbjct: 4 KAKKKGVEEVVFDKEARKEYLTGFHKRKQQRRKKAQEEAKEKEREERIEERKRIR--EER 61
Query: 403 LTKQKHEKKE 412
+ + + KE
Sbjct: 62 KQELEKQLKE 71
Score = 30.0 bits (68), Expect = 1.9
Identities = 20/73 (27%), Positives = 39/73 (53%), Gaps = 13/73 (17%)
Query: 348 RLDY----EERK--RKREAREPHKRAEKARSVRKIKAKLYNKQRRNE-KIQMKKKIKAHE 400
R +Y +RK R+++A+E K E+ +I+ + K+ R E K +++K++K +
Sbjct: 20 RKEYLTGFHKRKQQRRKKAQEEAKEKEREE---RIEER---KRIREERKQELEKQLKERK 73
Query: 401 EKLTKQKHEKKEE 413
E L + E +E
Sbjct: 74 EALKLLEEENDDE 86
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 32.0 bits (73), Expect = 0.23
Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 1/63 (1%)
Query: 351 YEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
+E+R KR ++ K EK K + Q E+ K++ K EK+ + H K
Sbjct: 30 WEKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEE-KERYEKMAAKMHAK 88
Query: 411 KEE 413
K E
Sbjct: 89 KVE 91
>gnl|CDD|238706 cd01418, Ribosomal_L19e_A, Ribosomal protein L19e, archaeal. L19e
is found in the large ribosomal subunit of eukaryotes
and archaea. L19e is distinct from the ribosomal subunit
L19, which is found in prokaryotes. It consists of two
small globular domains connected by an extended segment.
It is located toward the surface of the large subunit,
with one exposed end involved in forming the
intersubunit bridge with the small subunit. The other
exposed end is involved in forming the translocon
binding site, along with L22, L23, L24, L29, and L31e
subunits.
Length = 145
Score = 32.7 bits (75), Expect = 0.24
Identities = 19/53 (35%), Positives = 24/53 (45%), Gaps = 3/53 (5%)
Query: 339 ERHRKL-YGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI 390
ERH K GRR RK K+ AR P K +R ++ L + R KI
Sbjct: 62 ERHEKRKKGRRRGPGSRKGKKGARTPKKERWIKT-IRALRRYL-KELRDKGKI 112
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 32.3 bits (74), Expect = 0.29
Identities = 11/62 (17%), Positives = 26/62 (41%), Gaps = 8/62 (12%)
Query: 346 GRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTK 405
GR +D EE++ ++E + K ++ + + K + + ++K EK
Sbjct: 9 GRIIDIEEKREEKEREKEEKERKEEKEKE------WGKG--LVQKEEREKRLEELEKAKN 60
Query: 406 QK 407
+
Sbjct: 61 KP 62
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 33.5 bits (77), Expect = 0.32
Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 16/77 (20%)
Query: 352 EERKRKREAREPHKR-----AEKARS---VRKIKAKLYNKQRRNEKIQMKKKIKA----- 398
E RK ++ P KR AE +S ++K++ KL + + + +K +
Sbjct: 88 EARKLLKKLGVPFKRPDDYFAEMVKSDEHMQKVRKKLLAEAAAKKASEEARKQRELKKFG 147
Query: 399 ---HEEKLTKQKHEKKE 412
EKL ++ EKK+
Sbjct: 148 KQVQVEKLQERAKEKKD 164
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 32.3 bits (73), Expect = 0.35
Identities = 16/61 (26%), Positives = 31/61 (50%), Gaps = 5/61 (8%)
Query: 374 RKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLS 433
RK K + ++++K KKK K E+K +K E+K G +P + +++V ++
Sbjct: 14 RKESGK--TQSQKSDK---KKKEKVSEKKGKSKKKEEKPNGKIPEHEPNQEVTEVEVIIE 68
Query: 434 N 434
Sbjct: 69 K 69
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 33.1 bits (76), Expect = 0.36
Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 16/73 (21%)
Query: 333 LHNEYIERHRKLYGRRLDYEERK-------RKREAREPHKRAEKARSVRKIKAKLYNKQR 385
L E + R+ ++ +E+R+ ++ E E +R EK R K +K+R
Sbjct: 149 LEEEELAELRQ---QQRQFEQRRNAELAETQRLEEAERRRREEKERR------KKQDKER 199
Query: 386 RNEKIQMKKKIKA 398
+ + + +KI A
Sbjct: 200 KQREKETAEKIAA 212
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 33.4 bits (76), Expect = 0.41
Identities = 37/161 (22%), Positives = 53/161 (32%), Gaps = 25/161 (15%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEK------------ARSVRKIKAKLYNKQRR 386
E RK + E K + E RE K K A + K KA KQ+R
Sbjct: 20 EEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALAKQKR 79
Query: 387 --------NEKIQMKKK----IKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSN 434
EK + K K KA L KQK + E ++ A +
Sbjct: 80 EGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQK-REGTEEVTEEEKAAAKAKAAAAAKAK 138
Query: 435 MIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRKA 475
++++ G EV + T + K + K K A
Sbjct: 139 AAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAA 179
Score = 31.5 bits (71), Expect = 1.6
Identities = 31/124 (25%), Positives = 49/124 (39%), Gaps = 17/124 (13%)
Query: 354 RKRKREAREPHKRAEKARSVRKIK--AKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
K+KRE E EKA + K AK +K + +++ EE+ K+K + K
Sbjct: 109 AKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAK 168
Query: 412 EEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKT 471
A +++A L+ KQK E E + + +A A K + K
Sbjct: 169 AAAA---------AKAKAAALA---KQKAAEAGEGTEEVTEEEKAKAKA---KAAAAAKA 213
Query: 472 KRKA 475
K A
Sbjct: 214 KAAA 217
>gnl|CDD|233750 TIGR02151, IPP_isom_2, isopentenyl-diphosphate delta-isomerase,
type 2. Isopentenyl-diphosphate delta-isomerase (IPP
isomerase) interconverts isopentenyl diphosphate and
dimethylallyl diphosphate. This model represents the
type 2 enzyme. FMN, NADPH, and Mg2+ are required by this
form, which lacks homology to the type 1 enzyme
(TIGR02150). IPP is precursor to many compounds,
including enzyme cofactors, sterols, and isoprenoids
[Biosynthesis of cofactors, prosthetic groups, and
carriers, Other].
Length = 333
Score = 33.0 bits (76), Expect = 0.43
Identities = 13/41 (31%), Positives = 21/41 (51%), Gaps = 5/41 (12%)
Query: 503 PMALRFNKAHVTHPELKATFYLPIIGVKK-NPNSPMYTNLG 542
PM + +A + PE TF + V++ PN P+ N+G
Sbjct: 85 PMGVGSQRAALKDPETADTFEV----VREEAPNGPLIANIG 121
>gnl|CDD|223596 COG0522, RpsD, Ribosomal protein S4 and related proteins
[Translation, ribosomal structure and biogenesis].
Length = 205
Score = 32.7 bits (75), Expect = 0.43
Identities = 12/68 (17%), Positives = 25/68 (36%), Gaps = 7/68 (10%)
Query: 346 GRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM-------KKKIKA 398
R +++ +R P + + + + Q R +K+ K+K++A
Sbjct: 2 ARYPGPKKKLSRRLGFNPWLKERLCKERKLPYKPGQHGQERWKKLSDYGLQLREKQKLRA 61
Query: 399 HEEKLTKQ 406
L KQ
Sbjct: 62 FYGVLEKQ 69
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 33.1 bits (76), Expect = 0.49
Identities = 26/111 (23%), Positives = 39/111 (35%), Gaps = 14/111 (12%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHE-- 409
+ + E E K K K QR EK + + + +A EEK K+K
Sbjct: 253 SDDESAWEGFESEYEPINKPVRPKRKTK---AQRNKEKRRKELEREAKEEKQLKKKLAQL 309
Query: 410 ------KKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKV 454
KE + Q + + K KR+ K GK + P P +
Sbjct: 310 ARLKEIAKEVAQKEKARARKKEQRKERGEK--KKLKRR-KLGKHKYPEPPL 357
Score = 32.0 bits (73), Expect = 0.90
Identities = 12/75 (16%), Positives = 34/75 (45%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
E+R+++ E ++ K + + + K K+ ++ +K + +E+ K+K +++
Sbjct: 286 EKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLKRR 345
Query: 412 EEGALPVYLLDRDVQ 426
+ G +V
Sbjct: 346 KLGKHKYPEPPLEVV 360
>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease. The potyviridae
family positive stand RNA viruses with genome encoding a
polyprotein. members include zucchini yellow mosaic
virus, and turnip mosaic viruses which cause
considerable losses of crops worldwide. This family
consists of a C terminus region from various plant
potyvirus P1 proteins (found at the N terminus of the
polyprotein). The C terminus of P1 is a serine-type
protease responsible for autocatalytic cleavage between
P1 and the helper component protease pfam00851. The
entire P1 protein may be involved in virus-host
interactions.
Length = 245
Score = 32.7 bits (75), Expect = 0.49
Identities = 22/114 (19%), Positives = 37/114 (32%), Gaps = 13/114 (11%)
Query: 374 RKIKAKLYNKQRRNEKIQMKKKIKAH-EEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVL 432
K KQ + +I ++K A +K K K E++EE
Sbjct: 11 LLRKEMSKIKQEKKGRIILRKLSPAQVAKKREKLKREEREERQFLQGAYA---------- 60
Query: 433 SNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRKAWKRMITKVCFV 486
S + K + RT K ++ K K+K M K+ ++
Sbjct: 61 SIVSKITPIGTDKVSKTESVSFRTPYYKRTTKKMKKKKKKKKV--VMSDKINYL 112
>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family. This family includes PspA
a protein that suppresses sigma54-dependent
transcription. The PspA protein, a negative regulator of
the Escherichia coli phage shock psp operon, is produced
when virulence factors are exported through secretins in
many Gram-negative pathogenic bacteria and its homologue
in plants, VIPP1, plays a critical role in thylakoid
biogenesis, essential for photosynthesis. Activation of
transcription by the enhancer-dependent bacterial
sigma(54) containing RNA polymerase occurs through ATP
hydrolysis-driven protein conformational changes enabled
by activator proteins that belong to the large AAA(+)
mechanochemical protein family. It has been shown that
PspA directly and specifically acts upon and binds to
the AAA(+) domain of the PspF transcription activator.
Length = 220
Score = 32.4 bits (74), Expect = 0.55
Identities = 24/120 (20%), Positives = 44/120 (36%), Gaps = 14/120 (11%)
Query: 358 REAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGALP 417
R+ + +A +A + + K ++ +K Q KK LTK E
Sbjct: 33 RDMQSELGKARQALAQVIARQKQLERKLEEQKEQAKKLENKARAALTKGNEE-------- 84
Query: 418 VYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKI--VRSGKTKRKA 475
L R+ + L + + + + ++R Q A KI +++ KT KA
Sbjct: 85 ---LAREALAEIATLEKQAEALETQ-LTQQRSAVEQLRKQLAALETKIQQLKAKKTALKA 140
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 33.2 bits (75), Expect = 0.59
Identities = 24/102 (23%), Positives = 54/102 (52%), Gaps = 6/102 (5%)
Query: 345 YGRRLDYEERKRKR-EAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKL 403
YG R+D + +R R E H++A + + + K + + ++ R E+++ ++ + E+L
Sbjct: 426 YGGRVDKDHAERARIEKENAHRKALEMKILEKKRIERLEREER-ERLERERMERIERERL 484
Query: 404 TKQKHEKKEEGALPVYLLDRDVQSR-AKVLSNMIKQKRKEKA 444
+++ E++ L L+RD R + + +++ R EKA
Sbjct: 485 ERERLERER---LERDRLERDRLDRLERERVDRLERDRLEKA 523
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 32.4 bits (74), Expect = 0.63
Identities = 12/78 (15%), Positives = 38/78 (48%), Gaps = 2/78 (2%)
Query: 335 NEYIERHRK-LYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMK 393
++ R + ++ K++ E R+ K+ + ++ +A ++ + E+ +
Sbjct: 272 ESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQEERKEQL 331
Query: 394 KKIKAHEEKLTKQKHEKK 411
K++ EEK ++++E++
Sbjct: 332 AKLRK-EEKEREKEYEQE 348
Score = 32.4 bits (74), Expect = 0.74
Identities = 26/117 (22%), Positives = 52/117 (44%), Gaps = 22/117 (18%)
Query: 359 EAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHE----------EKLTKQKH 408
RE KR +K+ K++ +L +++ E+ + KKK KA+ E++ +Q
Sbjct: 12 TKREKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKFKANPVPASVKLPLYEEIMEQNE 71
Query: 409 EKKEEGAL-----------PVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEV-PIPK 453
E++EE P +R+ Q +A + + + + K++ P+PK
Sbjct: 72 ERREEVREKSKAILLSSQKPFSFYEREEQKKAILPRKLRSSTSEREPKKFKAKPVPK 128
Score = 30.5 bits (69), Expect = 2.8
Identities = 17/63 (26%), Positives = 28/63 (44%), Gaps = 3/63 (4%)
Query: 354 RKRKREAREPHKRAEKARSVRKIKAKLY---NKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
R ++E ++ +K RK K K KQR +K + +E+L K + E+
Sbjct: 279 RTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQEERKEQLAKLRKEE 338
Query: 411 KEE 413
KE
Sbjct: 339 KER 341
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 32.8 bits (75), Expect = 0.63
Identities = 15/82 (18%), Positives = 36/82 (43%), Gaps = 7/82 (8%)
Query: 332 SLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQ 391
L E IE + + +ER + +AR + + +++ + +L +R E+++
Sbjct: 229 ELEQE-IEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELE 287
Query: 392 MKKKIKAHEEKLTKQKHEKKEE 413
K + +L + + E +E
Sbjct: 288 EKIE------RLEELEREIEEL 303
Score = 30.9 bits (70), Expect = 2.9
Identities = 20/128 (15%), Positives = 49/128 (38%), Gaps = 16/128 (12%)
Query: 329 TAMSLHNEYIERHRKLYGRRLDYEERKRKREAREPH-KRAEKARSVRKIKAKLYNKQRRN 387
+L E E+ KL + EE K K + ++ + + ++++K L +
Sbjct: 523 LEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELL---EELR 579
Query: 388 EKIQMKKKIKAHEEKLTK--------QKHEKKEEGALPVYLLD---RDVQSRAKVLSNMI 436
K++++ E+L + ++ + E L L +++ + L +
Sbjct: 580 LLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELES-E 638
Query: 437 KQKRKEKA 444
+K +A
Sbjct: 639 LEKLNLQA 646
Score = 29.3 bits (66), Expect = 7.5
Identities = 19/113 (16%), Positives = 44/113 (38%), Gaps = 18/113 (15%)
Query: 333 LHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM 392
H + + +L L+ E + K EA + E + +R+++ +L E +
Sbjct: 470 EHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELI---ELLELEEA 526
Query: 393 -KKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKA 444
K++++ EKL E +E + K+ +K++ ++
Sbjct: 527 LKEELEEKLEKLENLLEELEEL--------------KEKLQLQQLKEELRQLE 565
>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777). This is
a family of eukaryotic proteins of unknown function.
Some of the proteins in this family are putative nucleic
acid binding proteins.
Length = 158
Score = 31.8 bits (72), Expect = 0.64
Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 13/90 (14%)
Query: 336 EYIERHRKLYGRRLDYEERKRKREAR-EPHKRAEKARSVRKIKA------KLYNKQRRNE 388
+ ER R+ R ER R+R +R R+ ++RS R+ ++ + +++R +
Sbjct: 22 DRRERRRERSRSR----ERDRRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRDRKRERD 77
Query: 389 KIQMKKKIKAHEEKLTKQKHEKK--EEGAL 416
K + K + ++ + ++ E K EE +
Sbjct: 78 KDAREPKKRERQKLIKEEDLEGKSDEEVEM 107
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 32.8 bits (75), Expect = 0.66
Identities = 29/148 (19%), Positives = 55/148 (37%), Gaps = 14/148 (9%)
Query: 342 RKLYGRRLDYEERKRKREAREPHKRA--EKARSVRKIKAKLYNKQRRNEKIQMKKKIKAH 399
L R+ +E +R + + E A K Q+R E + +++++
Sbjct: 23 VLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQRE 82
Query: 400 EEKLTKQKHEKKEEGA----LPVYLLDRD--VQSRAKVLSNMIKQKR---KEKAGKWEVP 450
EE+L +++ + L L +R+ + +R L + KQ AG
Sbjct: 83 EERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQ 142
Query: 451 IPKV---RTQADAEVFKIVRSGKTKRKA 475
K+ A+ E K R K + +A
Sbjct: 143 ARKLLLKLLDAELEEEKAQRVKKIEEEA 170
Score = 32.0 bits (73), Expect = 1.3
Identities = 17/75 (22%), Positives = 32/75 (42%), Gaps = 4/75 (5%)
Query: 333 LHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQ- 391
L N+ ER + L R L+ EE +++ + RK+ KL + + EK Q
Sbjct: 103 LENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQR 162
Query: 392 ---MKKKIKAHEEKL 403
++++ E+
Sbjct: 163 VKKIEEEADLEAERK 177
Score = 30.8 bits (70), Expect = 2.9
Identities = 21/130 (16%), Positives = 44/130 (33%), Gaps = 21/130 (16%)
Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNK--QRRNEKIQMK 393
E + R R + E + +RE ++ E K N+ + +
Sbjct: 59 ELLLRERNQQRQEARREREELQREEERLVQKEE-------QLDARAEKLDNLENQLEERE 111
Query: 394 KKIKAHEEKLTKQKHEKKEE------------GALPVYLLDRDVQSRAKVLSNMIKQKRK 441
K + A E +L + + + E L + LLD +++ I+++
Sbjct: 112 KALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQRVKKIEEEAD 171
Query: 442 EKAGKWEVPI 451
+A + I
Sbjct: 172 LEAERKAQNI 181
>gnl|CDD|201707 pfam01280, Ribosomal_L19e, Ribosomal protein L19e.
Length = 148
Score = 31.4 bits (72), Expect = 0.66
Identities = 16/53 (30%), Positives = 22/53 (41%), Gaps = 3/53 (5%)
Query: 339 ERHRKL-YGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKI 390
+R K GR RK + AR P K R +R ++ L + R KI
Sbjct: 64 KRKEKRRKGRHRGPGSRKGTKGARMPKKELWIRR-IRALRRLL-RELRDAGKI 114
>gnl|CDD|153276 cd07592, BAR_Endophilin_A, The Bin/Amphiphysin/Rvs (BAR) domain of
Endophilin-A. BAR domains are dimerization, lipid
binding and curvature sensing modules found in many
different proteins with diverse functions. Endophilins
are accessory proteins, localized at synapses, which
interact with the endocytic proteins, dynamin and
synaptojanin. They are essential for synaptic vesicle
formation from the plasma membrane. They interact with
voltage-gated calcium channels, thus linking vesicle
endocytosis to calcium regulation. They also play roles
in virus budding, mitochondrial morphology maintenance,
receptor-mediated endocytosis inhibition, and endosomal
sorting. Endophilins contain an N-terminal N-BAR domain
(BAR domain with an additional N-terminal amphipathic
helix), followed by a variable region containing proline
clusters, and a C-terminal SH3 domain. They are
classified into two types, A and B. Vertebrates contain
three endophilin-A isoforms. Endophilin-A proteins are
enriched in the brain and play multiple roles in
receptor-mediated endocytosis. They tubulate membranes
and regulate calcium influx into neurons to trigger the
activation of the endocytic machinery. They are also
involved in the sorting of plasma membrane proteins,
actin filament assembly, and the uncoating of
clathrin-coated vesicles for fusion with endosomes. The
BAR domains of endophilin-A1 and A3 form crescent-shaped
dimers that can detect membrane curvature and drive
membrane bending.
Length = 223
Score = 31.9 bits (73), Expect = 0.69
Identities = 14/23 (60%), Positives = 17/23 (73%), Gaps = 1/23 (4%)
Query: 338 IERHR-KLYGRRLDYEERKRKRE 359
I HR KL GRRLDY+ +KRK+
Sbjct: 130 INHHRKKLEGRRLDYDYKKRKQG 152
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 31.3 bits (72), Expect = 0.79
Identities = 17/74 (22%), Positives = 27/74 (36%), Gaps = 12/74 (16%)
Query: 332 SLHNEYIERHRKLYGRRLDYEERKR-----KREAREPHKRAEKARSVRKIKAKLYNKQRR 386
L+ EY + L LD +E K K+ + K +K + K +L+
Sbjct: 46 DLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPRLF----- 100
Query: 387 NEKIQMKKKIKAHE 400
+ K I A E
Sbjct: 101 --VLDFKGDIDASE 112
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 32.0 bits (73), Expect = 0.82
Identities = 21/85 (24%), Positives = 32/85 (37%), Gaps = 7/85 (8%)
Query: 330 AMSLHNEYIERHRKLYGRRLDYEERKRKREA-REPHKRAEKARSVRKIKAKLYNKQRRNE 388
A H E I+ D E RK E E + ++K + + L N+ R E
Sbjct: 188 AQEYHEEMIK-----LFEEAD-ELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELE 241
Query: 389 KIQMKKKIKAHEEKLTKQKHEKKEE 413
K + K K +++ E KE
Sbjct: 242 KKIKALRAKEKAAKRREKREELKER 266
Score = 30.4 bits (69), Expect = 3.2
Identities = 21/81 (25%), Positives = 41/81 (50%), Gaps = 8/81 (9%)
Query: 333 LHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM 392
LH E++E +K+ L E R + E RE K+ ++ ++AK +RR ++ ++
Sbjct: 212 LHEEFVELSKKI--DELHEEFRNLQNELRELEKK------IKALRAKEKAAKRREKREEL 263
Query: 393 KKKIKAHEEKLTKQKHEKKEE 413
K++ + EK + + EE
Sbjct: 264 KERAEEIYEKFKRGEKLTTEE 284
Score = 30.0 bits (68), Expect = 4.1
Identities = 14/103 (13%), Positives = 43/103 (41%), Gaps = 19/103 (18%)
Query: 374 RKIKAKLYN--KQRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKV 431
RK + K++R+E + ++ ++L + E +E+ ++++
Sbjct: 15 RKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKA--------QELREERDE 66
Query: 432 LSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRK 474
++ +++ ++++ ++ + E+ K R K KR
Sbjct: 67 INEEVQELKEKRD--------EINAKLQ-ELRKEYRELKEKRN 100
Score = 28.5 bits (64), Expect = 9.8
Identities = 22/115 (19%), Positives = 45/115 (39%), Gaps = 13/115 (11%)
Query: 335 NEYIERHRKLYGRRLDYEERKRK--REAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM 392
+E E ++L +R + + ++ +E RE ++ + + L + R EK Q
Sbjct: 65 DEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQ 124
Query: 393 KKKI-KAHEEKLTKQKHEKKEEGALPVYLLDRDVQSR----AKVLSNMIKQKRKE 442
+ E +L ++ E ++E L D K L I + +K+
Sbjct: 125 TSVLTPEEERELVQKIKELRKE------LEDAKKALEENEKLKELKAEIDELKKK 173
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 31.6 bits (72), Expect = 0.89
Identities = 14/60 (23%), Positives = 25/60 (41%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
K E E+ K K K K+++ +K + +K ++ K K+K +KK
Sbjct: 134 TSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 31.6 bits (72), Expect = 0.91
Identities = 24/126 (19%), Positives = 42/126 (33%), Gaps = 27/126 (21%)
Query: 347 RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKL--- 403
+ L EER +K++ + E + +K + KK +
Sbjct: 76 KELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLD 135
Query: 404 ----------TKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPK 453
T Q E E L +R+++ R K+ + K+K K+K K
Sbjct: 136 SPRRKSSRSSTVQNKEATHE-----RLKEREIR-RKKIQAKARKRKEKKK--------EK 181
Query: 454 VRTQAD 459
TQ +
Sbjct: 182 ELTQEE 187
>gnl|CDD|153336 cd07652, F-BAR_Rgd1, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Saccharomyces cerevisiae
Rho GTPase activating protein Rgd1 and similar proteins.
F-BAR domains are dimerization modules that bind and
bend membranes and are found in proteins involved in
membrane dynamics and actin reorganization.
Saccharomyces cerevisiae Rgd1 is a GTPase activating
protein (GAP) with activity towards Rho3p and Rho4p,
which are involved in bud growth and cytokinesis,
respectively. At low pH, S. cerevisiae Rgd1 is required
for cell survival and the activation of the protein
kinase C pathway, which is important in cell integrity
and the maintenance of cell shape. It contains an
N-terminal F-BAR domain and a C-terminal Rho GAP domain.
The F-BAR domain of S. cerevisiae Rgd1 binds to
phosphoinositides and plays an important role in the
localization of the protein to the bud tip/neck during
the cell cycle. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 234
Score = 31.5 bits (72), Expect = 0.96
Identities = 25/108 (23%), Positives = 41/108 (37%), Gaps = 18/108 (16%)
Query: 328 LTAMSLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKA-----RSVRKIKAKLYN 382
L M +E + + E+ RK +E KRAEK + K KA+ +
Sbjct: 88 LNEM--SDELSSLAKTV--------EKSRK-SIKETGKRAEKKVQDAEAAAEKAKARYDS 136
Query: 383 KQRRNEKIQMKKKIKAHEEKLTKQK-HEKKEEGAL-PVYLLDRDVQSR 428
E+++ K + L K + E+ L V D+D S+
Sbjct: 137 LADDLERVKTGDPGKKLKFGLKGNKSAAQHEDELLRKVQAADQDYASK 184
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 30.7 bits (70), Expect = 0.97
Identities = 17/62 (27%), Positives = 30/62 (48%), Gaps = 7/62 (11%)
Query: 357 KREAREPHKRAEKARSVRK----IKAKL-YNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
KR R+ K +K K +K + NKQ + K + K+K + +E+ + K +KK
Sbjct: 68 KRLQRQAAKEVKKPGISTKAQQALKLEHERNKQEK--KKRSKEKKEEEKERKRQLKQQKK 125
Query: 412 EE 413
+
Sbjct: 126 KA 127
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 31.8 bits (71), Expect = 1.00
Identities = 39/153 (25%), Positives = 62/153 (40%), Gaps = 17/153 (11%)
Query: 340 RHRKLYGRRLDYEERKRKREAREPHKR--AEKARSVRKIKAKLYNKQRRNEKIQMKKKIK 397
+ K GR R+ REAR P K + R +R++ K +++ + I + +K
Sbjct: 67 KEAKSMGRHEGAGRREGTREARMPSKELWMRRLRILRRLLRKYREEKKIDRHIYRELYVK 126
Query: 398 AH----------EEKLTKQKHEKKEEGALPVYLLD---RDVQSRAKVLSNMIKQKRK--E 442
A E + K K+EKK+E L L +D Q R K ++++ K E
Sbjct: 127 AKGNVFRNKRNLMEHIHKVKNEKKKERQLAEQLAAKRLKDEQHRHKARKQELRKREKDRE 186
Query: 443 KAGKWEVPIPKVRTQADAEVFKIVRSGKTKRKA 475
+A + + Q A SGK KA
Sbjct: 187 RARREDAAAAAAAKQKAAAKKAAAPSGKKSAKA 219
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 31.1 bits (71), Expect = 1.1
Identities = 40/167 (23%), Positives = 68/167 (40%), Gaps = 41/167 (24%)
Query: 352 EERKRKREAREPHKRAE-----KARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLT-- 404
E+R+RKRE R+ K+ + K +K +A+ K N+ + I+ E +
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEE-VKNEENKSKKKAAPIENAEGNIVFS 67
Query: 405 ------------KQKHEKKEEGALPVY--LLDRDVQSRAKVLSNMIKQKRKE--KAGKWE 448
K +KK++ Y LL + +++R K L + + K E + KW
Sbjct: 68 KVEFADGEQAKKDLKLKKKKKKKKTDYKQLLKK-LEARKKKLEELDEDKAAEIEEKEKWT 126
Query: 449 VPIPKVRTQA-------DAEVFK--IVRSGKTKRKA---WKRMITKV 483
K +A D ++ K + R K K+K+ WK KV
Sbjct: 127 ----KALAKAEGVKVKDDEKLLKKALKRKEKQKKKSKKEWKERKEKV 169
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 32.1 bits (73), Expect = 1.1
Identities = 25/136 (18%), Positives = 58/136 (42%), Gaps = 4/136 (2%)
Query: 341 HRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHE 400
++L + +ER+R ++ + E ++++K + ++ + +KI K+IK+ E
Sbjct: 545 KKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESII-RELKEKKIHKAKEIKSIE 603
Query: 401 EKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAG--KWEVPIPKVRTQA 458
+ L K K K++ P + + ++ K K + G KW V + +R +
Sbjct: 604 D-LVKLKETKQKIPQKPTNFQADKIGDKVRIRYFGQKGKIVQILGGNKWNVTVGGMRMKV 662
Query: 459 DAEVFKIVRSGKTKRK 474
+ + +K
Sbjct: 663 HGSELEKINKAPPPKK 678
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 32.0 bits (73), Expect = 1.1
Identities = 17/111 (15%), Positives = 42/111 (37%), Gaps = 7/111 (6%)
Query: 338 IERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIK 397
IE + + E + + E + E+ + + L ++R E ++++
Sbjct: 269 IEELKS---ELEELREELEELQE-ELLELKEEIEELEGEISLL--RERLEELENELEELE 322
Query: 398 AHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWE 448
E+L K+K E +E L +++ L ++ ++ + E
Sbjct: 323 ERLEEL-KEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLE 372
>gnl|CDD|220103 pfam09074, Mer2, Mer2. Mer2 (Rec107) forms part of a complex that
is required for meiotic double strand DNA break
formation. Mer2 increases in abundance and is
phosphorylated during the prophase phase of cell
division. Blocking double strand break formation results
in delayed dephosphorylation and dissociation of Mer2
from the chromosome.
Length = 190
Score = 31.0 bits (70), Expect = 1.1
Identities = 16/97 (16%), Positives = 38/97 (39%), Gaps = 7/97 (7%)
Query: 391 QMKKKIKAHEEKLTK----QKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
K++ +L + + EE +Y L + ++ + + + +K K G
Sbjct: 55 NNSKELNKTNYQLNELLNSCEASVNEEIKTLIYQLGAQLNNQLEEVVIALSEKVKTINGN 114
Query: 447 ---WEVPIPKVRTQADAEVFKIVRSGKTKRKAWKRMI 480
+VP +V E F I++S + ++ + +
Sbjct: 115 SMMGQVPPSEVERINSDEPFDIIKSFEARQLENNKSV 151
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 32.2 bits (73), Expect = 1.2
Identities = 20/111 (18%), Positives = 44/111 (39%), Gaps = 5/111 (4%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQR--RNEKIQMKKKI 396
+R + R+ +RE + + +++ +++ K N+ E +K K+
Sbjct: 85 KRAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKL 144
Query: 397 KAHEEKLTKQKHEKKEEGALPVYLLDRDV---QSRAKVLSNMIKQKRKEKA 444
A KL +K ++ +E + + D+ Q RA+ +K E
Sbjct: 145 DAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELKLLESELE 195
>gnl|CDD|225407 COG2851, CitM, H+/citrate symporter [Energy production and
conversion].
Length = 433
Score = 31.9 bits (73), Expect = 1.2
Identities = 22/81 (27%), Positives = 29/81 (35%), Gaps = 24/81 (29%)
Query: 226 FLPIYLETGDNRLLL-LTPMVS----ELNPWSIHEPSYRVGPMPQTVSNL---------P 271
LP+Y G N LL L +S + PW GP + S L P
Sbjct: 126 LLPLYKRLGMNPYLLALLATLSAGVMNMLPWG--------GPTARAASVLGVDPAELFVP 177
Query: 272 LQPSQSSGTLWLKAMCDALWL 292
L P Q G + + + A L
Sbjct: 178 LIPIQIIGLVLV--LALAWLL 196
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 31.5 bits (72), Expect = 1.2
Identities = 23/88 (26%), Positives = 30/88 (34%), Gaps = 32/88 (36%)
Query: 338 IERHRKLYGRRLDYEER----KRKREAREPHK--------------RAEKAR-------- 371
IE K ++ EER K RE RE EKAR
Sbjct: 229 IEGPAKK--KKQTKEERIATAKEGREDREKFGSRKGKKDKEGKSTTNKEKARKKNFMMTL 286
Query: 372 ---SVR-KIKAKLYNKQRRNEKIQMKKK 395
VR K K L +KQ+ +++K
Sbjct: 287 HKKRVRGKQKRSLRDKQKVLRAHILRQK 314
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 30.6 bits (69), Expect = 1.2
Identities = 16/58 (27%), Positives = 32/58 (55%)
Query: 338 IERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK 395
E+ +KL ++ ++RKR++ +E KR ++ + K K ++RR +K+ K K
Sbjct: 68 FEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKKLTKKTK 125
Score = 29.9 bits (67), Expect = 2.6
Identities = 17/85 (20%), Positives = 38/85 (44%), Gaps = 12/85 (14%)
Query: 334 HNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMK 393
+ + RK ++ E+++ ++ + + E A+ +Q+ EKI++
Sbjct: 58 QVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAK-----------RQKELEKIELS 106
Query: 394 KKIKAHEEKLTKQKHEKKEEGALPV 418
KK K E + ++K KK + P+
Sbjct: 107 KK-KQKERERRRKKLTKKTKSGQPL 130
Score = 28.7 bits (64), Expect = 5.3
Identities = 17/80 (21%), Positives = 38/80 (47%), Gaps = 7/80 (8%)
Query: 340 RHRKLYGRRLDYE-----ERKRKREAREPHKRAEKARSVRKI--KAKLYNKQRRNEKIQM 392
R +K Y + L+ E E++ + + K K +K+ K ++ +++R ++ +
Sbjct: 33 RLKKEYLKLLEKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKE 92
Query: 393 KKKIKAHEEKLTKQKHEKKE 412
K + EK+ K ++KE
Sbjct: 93 LAKRQKELEKIELSKKKQKE 112
>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase. This domain family is found in
eukaryotes, and is approximately 140 amino acids in
length. The family is found in association with
pfam00069. Polo-like kinase 1 (Plx1) is essential during
mitosis for the activation of Cdc25C, for spindle
assembly, and for cyclin B degradation. This family is
Polo kinase kinase (PKK) which phosphorylates Polo
kinase and Polo-like kinase to activate them. PKK is a
serine/threonine kinase.
Length = 142
Score = 30.4 bits (69), Expect = 1.4
Identities = 24/105 (22%), Positives = 48/105 (45%), Gaps = 13/105 (12%)
Query: 342 RKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEE 401
+ +R+ E++ R + +E K + + +++ KL Q + EK +MK + + E
Sbjct: 50 LRRLPKRIRAEQKTRLKMFKESLK--IEKKELKQEVEKLPRFQEQ-EKKRMKAEKEEQE- 105
Query: 402 KLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
QKH+K+E L + ++ + L + +KR E A
Sbjct: 106 ----QKHQKQERE-----FLAKQEENLEEALQQLQNEKRHELAET 141
>gnl|CDD|219523 pfam07697, 7TMR-HDED, 7TM-HD extracellular. This entry represents
the extracellular domain of the 7TM-HD (7TM Receptors
with HD hydrolase).
Length = 218
Score = 31.1 bits (71), Expect = 1.4
Identities = 19/107 (17%), Positives = 39/107 (36%), Gaps = 17/107 (15%)
Query: 393 KKKIKAHE------EKLTKQKHEKKEEGALPVYLLDRDVQSRAK-----VLSNMIKQKR- 440
I+A E+ T++K ++ E PVY +D +V + + + + K
Sbjct: 8 PVDIRAPRDIEVVDEEATEEKRKEAAESVPPVYDIDPEVTQNLEEEINSLFDEIREVKAS 67
Query: 441 ---KEKAGKWEVPIPKVRTQADAEVFK-IVRSGKTKRKAWKRMITKV 483
EKA + + + Q E + ++ + K + I
Sbjct: 68 AEKAEKADRLK-SLNLSTFQLSDEQWSTLLNADDEDLKLLEDAIRTA 113
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 31.3 bits (71), Expect = 1.6
Identities = 20/114 (17%), Positives = 48/114 (42%), Gaps = 15/114 (13%)
Query: 347 RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQ 406
R++ EER RK + + ++A+ ++ K Y K+ ++ Q ++ +K EE + +Q
Sbjct: 87 ARVEAEER-RKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEELLKMQEESVLRQ 145
Query: 407 KHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADA 460
+ ++ + ++L + +E + E K+ +A
Sbjct: 146 EAMRR--------------ATEEEILEMRRETIEEEAELERENIRAKIEAEARG 185
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 31.3 bits (71), Expect = 1.6
Identities = 20/112 (17%), Positives = 44/112 (39%), Gaps = 5/112 (4%)
Query: 335 NEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKK 394
N E ++ +R Y K+K ++ + + +K + L+N+Q + ++ ++
Sbjct: 42 NTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQ----KKEQKNLFNEQIKQFELALQD 97
Query: 395 KIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
+I E + KE L L + + K L N + K++
Sbjct: 98 EIAKLEALELLNLEKDKELELLEKELDELSKE-LQKQLQNTAEIIEKKRENN 148
>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
Length = 568
Score = 31.6 bits (72), Expect = 1.6
Identities = 19/120 (15%), Positives = 36/120 (30%), Gaps = 7/120 (5%)
Query: 324 DALWLTAMSLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNK 383
+ W +L IE R L L EE K++ + + + +
Sbjct: 423 EEGWKYKSTLQFLRIEIDRLL----LLREEELSKKKTKIHKLNKLRHDLSSLLSSIPEET 478
Query: 384 QRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEK 443
R E + K + KL + + L+ S ++ N + E+
Sbjct: 479 SDRVESEKASKLRSSASTKLNLRSSRSHSKFRDH---LNGSNSSTKELSLNQVDLAGSER 535
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 31.8 bits (72), Expect = 1.6
Identities = 19/59 (32%), Positives = 33/59 (55%), Gaps = 4/59 (6%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEK--ARSVRKIKAKLYNKQRRNEKIQMKKK 395
E+ R+L +L EE +R+R+A E +R E+ A + +AK ++RR + + KK
Sbjct: 253 EKRREL--EKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNLLKK 309
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 31.4 bits (71), Expect = 1.7
Identities = 19/101 (18%), Positives = 39/101 (38%), Gaps = 4/101 (3%)
Query: 347 RRLDYEERKRKREAREPHKRAEKARS-VRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTK 405
+R Y E++++++A R + R + +A + + K + E K K
Sbjct: 290 QRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKN--EAKARK 347
Query: 406 QKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
K ++ A + Q RA ++ ++ KA K
Sbjct: 348 -KEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKK 387
Score = 29.0 bits (65), Expect = 8.0
Identities = 22/115 (19%), Positives = 39/115 (33%), Gaps = 13/115 (11%)
Query: 364 HKRAEKARSVRKIKAKLYNKQR-RNEKIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLD 422
+RA+ ++ KA L+ +R R I + + + K + E
Sbjct: 289 EQRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAE--------- 339
Query: 423 RDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRKAWK 477
++ AK I QKR+ + + R A A + K K+K
Sbjct: 340 ---KNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLI 391
>gnl|CDD|233421 TIGR01453, grpIintron_endo, group I intron endonuclease. This
model represents one subfamily of endonucleases
containing the endo/excinuclease amino terminal domain,
pfam01541 at its amino end. A distinct subfamily
includes excinuclease abc subunit c (uvrC). Members of
pfam01541 are often termed GIY-YIG endonucleases after
conserved motifs near the amino end. This subfamily in
This model is found in open reading frames of group I
introns in both phage and mitochondria. The closely
related endonucleases of phage T4: segA, segB, segC,
segD and segE, score below the trusted cutoff for the
family.
Length = 214
Score = 30.8 bits (70), Expect = 1.7
Identities = 25/85 (29%), Positives = 37/85 (43%), Gaps = 10/85 (11%)
Query: 346 GRRLDY---EERKRKREAREPHKRAEKARSVR---KIKAKLYNKQRRNEKIQMKKKIKAH 399
G L Y EE K K K+ + KAK+ +K + E K H
Sbjct: 92 GSSLGYKHSEETKAKMSKLFSGKKNNPWYGKTHSEETKAKI-SKNKLGENNPFFGK--KH 148
Query: 400 EEKLTKQKHEKKEEGALPVYLLDRD 424
E+ TK+K +KE+GA V + D++
Sbjct: 149 SEE-TKKKISEKEKGAKKVNVYDKN 172
>gnl|CDD|237791 PRK14701, PRK14701, reverse gyrase; Provisional.
Length = 1638
Score = 31.8 bits (72), Expect = 1.7
Identities = 31/138 (22%), Positives = 47/138 (34%), Gaps = 29/138 (21%)
Query: 389 KIQMKKKIKAHEEKLTKQKHEKKEEGALPVYL--------LDRDVQSRAKVLSNMIKQK- 439
+ ++ + L K E L + DV+ +L + K
Sbjct: 414 RFRVDLEDPTIYRILGLLSEILKIEEELKEGIPIEGVLDVFPEDVEFLRSILKDEEVIKK 473
Query: 440 --------RKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRKAWKRMITK-VCFVGENF 490
K++ GK+ + IP VRT A SG+T R + ITK + +
Sbjct: 474 VAERPFVSLKKEEGKYYIEIPDVRTYIQA-------SGRTSR-LFAGGITKGASVLIVD- 524
Query: 491 TRKPPKFERFIRPMALRF 508
P F IR M RF
Sbjct: 525 --DPEIFNALIRQMRFRF 540
>gnl|CDD|224691 COG1777, COG1777, Predicted transcriptional regulators
[Transcription].
Length = 217
Score = 30.9 bits (70), Expect = 1.8
Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 7/69 (10%)
Query: 342 RKLYG-RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHE 400
+G R D EE + E E K + + ++ ++L R E++ +
Sbjct: 88 PNFFGAERFDLEEDDLESERSEVSKLFKSPEGISELISRLLEINREIEELSRAQT----- 142
Query: 401 EKLTKQKHE 409
+L KQ +E
Sbjct: 143 -ELQKQLNE 150
>gnl|CDD|238111 cd00187, TOP4c, DNA Topoisomerase, subtype IIA; domain A';
bacterial DNA topoisomerase IV (C subunit, ParC),
bacterial DNA gyrases (A subunit, GyrA),mammalian DNA
toposiomerases II. DNA topoisomerases are essential
enzymes that regulate the conformational changes in DNA
topology by catalysing the concerted breakage and
rejoining of DNA strands during normal cellular growth.
Length = 445
Score = 31.0 bits (71), Expect = 1.9
Identities = 19/101 (18%), Positives = 39/101 (38%), Gaps = 26/101 (25%)
Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSV-------RKIKAKLYNKQRRNE 388
E+++ ++Y RR +YE K + H +++ I++ K+ E
Sbjct: 323 EFLDHRLEVYTRRKEYELGKAEARL---HILEGLLKAILNIDEVINLIRSSDEAKKALIE 379
Query: 389 KIQMKK--KIKAH--------------EEKLTKQKHEKKEE 413
+++ +I+A EKL K+ E + E
Sbjct: 380 ELEKLGFSEIQADAILDMRLRRLTKLEREKLLKELKELEAE 420
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 31.0 bits (70), Expect = 2.0
Identities = 20/77 (25%), Positives = 37/77 (48%), Gaps = 13/77 (16%)
Query: 340 RHRKLYGRRLDYEERKRKREAREPHKRA---------EKARSVRKIKAKLYNKQRRNEKI 390
R ++L EE+KRK+E E RA E A S+++ K L + +K+
Sbjct: 490 DSRVNAEKQLA-EEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEI---KKL 545
Query: 391 QMKKKIKAHEEKLTKQK 407
+ K+K E ++ +++
Sbjct: 546 EHDLKLKEEECRMLEKE 562
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 31.4 bits (71), Expect = 2.1
Identities = 22/90 (24%), Positives = 31/90 (34%), Gaps = 10/90 (11%)
Query: 333 LHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM 392
L E Y + E K + E EK ++K K L + N+ I
Sbjct: 462 LGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYL-ESEEINKSINE 520
Query: 393 KKKIKAHEEKLT---------KQKHEKKEE 413
KI++ L K KH+K EE
Sbjct: 521 YNKIESARADLEDIKIKINELKDKHDKYEE 550
>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361).
This is a region of 120 amino acids that is conserved in
a family of proteins found from plants to fungi. The
function is not known.
Length = 114
Score = 29.5 bits (67), Expect = 2.2
Identities = 18/75 (24%), Positives = 37/75 (49%), Gaps = 9/75 (12%)
Query: 338 IERHRKL--YGRRLDYEERKRKRE--AREPHK-----RAEKARSVRKIKAKLYNKQRRNE 388
+E+ R L + L+ ER++KR+ A++ H R + R +++++ KL E
Sbjct: 24 VEKERALEALKKELEDAERRKKRKKMAKKYHMVRFFERKKATRRLKQLEKKLEETSDEKE 83
Query: 389 KIQMKKKIKAHEEKL 403
+K+++ E L
Sbjct: 84 IKDLKEQLSKAEVDL 98
Score = 28.3 bits (64), Expect = 5.3
Identities = 18/72 (25%), Positives = 34/72 (47%), Gaps = 11/72 (15%)
Query: 353 ERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM-----------KKKIKAHEE 401
ER K++ R EK R++ +K +L + +RR ++ +M +KK +
Sbjct: 10 ERLLKKKDLPADVRVEKERALEALKKELEDAERRKKRKKMAKKYHMVRFFERKKATRRLK 69
Query: 402 KLTKQKHEKKEE 413
+L K+ E +E
Sbjct: 70 QLEKKLEETSDE 81
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 30.8 bits (70), Expect = 2.2
Identities = 23/95 (24%), Positives = 43/95 (45%), Gaps = 12/95 (12%)
Query: 332 SLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRA-EKARSVRKIK---AKLYNKQRRN 387
L + ER R+ +++ + EA+E + A E A + + AK ++ +
Sbjct: 29 VLSIKEKERLREEEEEQVEQLREEANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTD 88
Query: 388 E------KIQMK--KKIKAHEEKLTKQKHEKKEEG 414
E K+QM+ ++ + EE+L + E K EG
Sbjct: 89 EASSIIEKLQMQIEREQEEWEEELERLIEEAKAEG 123
>gnl|CDD|235353 PRK05157, PRK05157, pyrroloquinoline quinone biosynthesis protein
PqqC; Provisional.
Length = 246
Score = 30.6 bits (70), Expect = 2.3
Identities = 12/40 (30%), Positives = 17/40 (42%), Gaps = 11/40 (27%)
Query: 303 AMCDALWLKAMCDTLWLTTMCDALWLTAMSLHNEYIERHR 342
+AL K D LW +M DAL++ Y+ H
Sbjct: 208 RALEALQFK--LDVLW--SMLDALYM-------AYVLGHI 236
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 29.5 bits (67), Expect = 2.4
Identities = 15/75 (20%), Positives = 38/75 (50%), Gaps = 3/75 (4%)
Query: 339 ERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKA 398
+R +L + + +E K + A++ R+ +K + + ++ + E+I K++KA
Sbjct: 32 QREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRKEKEEEI---KELKA 88
Query: 399 HEEKLTKQKHEKKEE 413
E+L + + +E+
Sbjct: 89 ELEELKAEIEKLEEK 103
Score = 28.3 bits (64), Expect = 5.4
Identities = 15/80 (18%), Positives = 38/80 (47%), Gaps = 10/80 (12%)
Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEK--IQMK 393
+E+ R++ +L + ++ + +R E+ R + +L K+ ++ I+
Sbjct: 1 RLLEKRREMEEVQLALDAKREE------FERREELLKQR--EEELEKKEEELQESLIKFD 52
Query: 394 KKIKAHEEKLTKQKHEKKEE 413
K +K +E K + + + +EE
Sbjct: 53 KFLKENEAKRRRAEKKAEEE 72
>gnl|CDD|214704 smart00512, Skp1, Found in Skp1 protein family. Family of Skp1
(kinetochore protein required for cell cycle
progression) and elongin C (subunit of RNA polymerase II
transcription factor SIII) homologues.
Length = 104
Score = 29.2 bits (66), Expect = 2.4
Identities = 10/49 (20%), Positives = 24/49 (48%), Gaps = 4/49 (8%)
Query: 418 VYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIV 466
V+ ++R+V ++K + MI+ + +P+P V +++ V
Sbjct: 13 VFEVEREVARQSKTIKAMIEDLGVDDENNNPIPLPNVT----SKILSKV 57
>gnl|CDD|172658 PRK14170, PRK14170, bifunctional 5,10-methylene-tetrahydrofolate
dehydrogenase/ 5,10-methylene-tetrahydrofolate
cyclohydrolase; Provisional.
Length = 284
Score = 30.4 bits (68), Expect = 2.6
Identities = 27/82 (32%), Positives = 40/82 (48%), Gaps = 6/82 (7%)
Query: 388 EKIQMKKKIKAHEEKLTKQKHEKKEEGALP---VYLLDRDVQSRAKVLSNMIKQKRKEKA 444
E I KK K +EK+T++ E +EG P V L+ + SR V + KQKR E+A
Sbjct: 3 EIIDGKKLAKEIQEKVTREVAELVKEGKKPGLAVVLVGDNQASRTYVRN---KQKRTEEA 59
Query: 445 GKWEVPIPKVRTQADAEVFKIV 466
G V I + ++ +V
Sbjct: 60 GMKSVLIELPENVTEEKLLSVV 81
>gnl|CDD|185443 PTZ00097, PTZ00097, 60S ribosomal protein L19; Provisional.
Length = 175
Score = 30.0 bits (68), Expect = 2.7
Identities = 35/122 (28%), Positives = 51/122 (41%), Gaps = 20/122 (16%)
Query: 335 NEYIERHRKLYGRRLDYEERKRKREAREPHK-------RA-----EKARSVRKIKAKLYN 382
+ E RK GR +R+ REAR P K R K R+ +KI +Y+
Sbjct: 62 RRFHEAKRK--GRHTGIGKRRGTREARMPTKVLWMRRQRVLRRLLRKYRAAKKIDRHMYH 119
Query: 383 KQRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKE 442
+ K K + E + K K+EK +E + L R RAK + ++ KRK
Sbjct: 120 EFYLKSKGNQFKNKRVLIEAIHKTKNEKVKEKKIQDQLEAR----RAK--AKALRNKRKA 173
Query: 443 KA 444
K
Sbjct: 174 KK 175
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 29.5 bits (67), Expect = 2.7
Identities = 17/68 (25%), Positives = 31/68 (45%), Gaps = 10/68 (14%)
Query: 354 RKRK----REAREPHKRAEKARSVRKIKAKL--YNKQRRNEKI----QMKKKIKAHEEKL 403
RKRK R +R K + +K K Y Q R +K +++KK + ++++
Sbjct: 60 RKRKAGASRNKAAEERRKLKEKKKKKKKELENFYRFQIREKKKEELAELRKKFEEDKKRI 119
Query: 404 TKQKHEKK 411
+ K +K
Sbjct: 120 EQLKAARK 127
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 30.3 bits (68), Expect = 3.0
Identities = 13/73 (17%), Positives = 29/73 (39%), Gaps = 4/73 (5%)
Query: 347 RRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK----IKAHEEK 402
R+ E K E + + E A K K ++ NE + + KA +
Sbjct: 146 RQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDD 205
Query: 403 LTKQKHEKKEEGA 415
+T++ +++++
Sbjct: 206 VTEEDYDEEDNPV 218
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 30.5 bits (69), Expect = 3.0
Identities = 19/79 (24%), Positives = 35/79 (44%), Gaps = 7/79 (8%)
Query: 333 LHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQM 392
L + + K Y ++L +K E E +AK ++ + E+ +
Sbjct: 362 LSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEA-------EAKAKEEKLKQEENEK 414
Query: 393 KKKIKAHEEKLTKQKHEKK 411
K+K +A E+K +QK E+K
Sbjct: 415 KQKEQADEDKEKRQKDERK 433
>gnl|CDD|190741 pfam03763, Remorin_C, Remorin, C-terminal region. Remorins are
plant-specific plasma membrane-associated proteins. In
tobacco remorin co-purifies with lipid rafts. Most
remorins have a variable, proline-rich C-half and a more
conserved N-half that is predicted to form coiled coils.
Consistent with this, circular dichroism studies have
demonstrated that much of the protein is alpha-helical.
Remorins exist in plasma membrane preparations as
oligomeric structures and form filaments in vitro. The
proteins can bind polyanions including the extracellular
matrix component oligogalacturonic acid (OGA). In vitro,
remorin in plasma membrane preparations is
phosphorylated (principally on threonine residues) in
the presence of OGA and thus co-purifies with a protein
kinases(s). The biological functions of remorins are
unknown but roles as components of the
membrane/cytoskeleton are possible.
Length = 112
Score = 28.8 bits (65), Expect = 3.0
Identities = 28/112 (25%), Positives = 49/112 (43%), Gaps = 19/112 (16%)
Query: 353 ERKRKREAREPH-KRAEKARSVRKIKAKL------YNKQRRNEKIQMKKKIKAHEEKLTK 405
+K + E+R + AEKA+ K + + NK++ + ++KK E KL K
Sbjct: 1 SKKSEAESRADAWEEAEKAKINNKYQREEAKIQAWENKKKAKAEAELKKI----ERKLEK 56
Query: 406 QKHEKKEEGALPVYLLDR--DVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVR 455
+K E E+ L ++ +A+ + KR E+ K E K+R
Sbjct: 57 KKAEAAEK------LKNKLAAAHKKAEEKRATAEAKRGEEEAKAEEKAAKIR 102
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 29.7 bits (67), Expect = 3.1
Identities = 16/65 (24%), Positives = 31/65 (47%), Gaps = 10/65 (15%)
Query: 351 YEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK--IKAHEEKLTKQKH 408
Y E K+K++ E A + K+K + KQ+ K + KK K ++K K+
Sbjct: 57 YTEAKKKKK--------ELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDD 108
Query: 409 EKKEE 413
+ +++
Sbjct: 109 KSEKK 113
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 30.1 bits (68), Expect = 3.1
Identities = 12/70 (17%), Positives = 31/70 (44%), Gaps = 2/70 (2%)
Query: 345 YGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRN--EKIQMKKKIKAHEEK 402
Y E K K+E E +KA + + +L + + E+ +++ ++ +
Sbjct: 31 YEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERERLEAEVDEATAE 90
Query: 403 LTKQKHEKKE 412
+ K + E+++
Sbjct: 91 VAKLEEEREK 100
>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit. This
family includes the vacuolar ATP synthase E subunit, as
well as the archaebacterial ATP synthase E subunit.
Length = 195
Score = 29.6 bits (67), Expect = 3.3
Identities = 28/118 (23%), Positives = 42/118 (35%), Gaps = 27/118 (22%)
Query: 352 EERKRKREAREPHKRAEKARSVRK--IKAKLYNKQR------RNEKIQM-----KKKIKA 398
E K+ E E + EK + K I + N+ R R E + K+++
Sbjct: 31 EAEKKIEEIYE---KKEKQAEMEKQIIISNAKNEARLKVLNAREELLDSVFEEAKERLAN 87
Query: 399 HEE----------KLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGK 446
E L KQ K E + V + D + L K++ KEK GK
Sbjct: 88 LSEDKDEYKDLLKDLIKQALVKLGEPKVIVRSREEDEELVKSALDE-AKEEYKEKTGK 144
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 30.6 bits (69), Expect = 3.3
Identities = 24/131 (18%), Positives = 52/131 (39%), Gaps = 6/131 (4%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKH--- 408
E+ K + + + K+ + + K K K++R K + K+K K EE +++
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166
Query: 409 EKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRS 468
E+ + P + ++ K ++K+++ A + P+ + E K
Sbjct: 167 ERVRAKSRPKKPPKKKPPNKKKEPPE--EEKQRQAAREAVKGKPE-EPDVNEEREKEEDD 223
Query: 469 GKTKRKAWKRM 479
GK + M
Sbjct: 224 GKDRETTTSPM 234
Score = 29.9 bits (67), Expect = 4.8
Identities = 22/129 (17%), Positives = 51/129 (39%), Gaps = 10/129 (7%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKK 411
+ + +E K E+ + +K K + ++ ++ K + + K K ++ K+K +K
Sbjct: 95 AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKV 154
Query: 412 EEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKT 471
EE DR+ + + ++ K + K + P K + + E +
Sbjct: 155 EEPR------DREEEKK----RERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAV 204
Query: 472 KRKAWKRMI 480
K K + +
Sbjct: 205 KGKPEEPDV 213
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain. This
presumed domain is found at the N-terminus of Ribosomal
L30 proteins and has been termed RL30NT or NUC018.
Length = 71
Score = 28.0 bits (63), Expect = 3.3
Identities = 15/60 (25%), Positives = 25/60 (41%), Gaps = 17/60 (28%)
Query: 354 RKRKREAREPHKRAEKARSVRKIKAKL--------------YNKQRRNEKIQMKKKIKAH 399
RKR + R KRA+K + + + K Y R+E I++K++ K
Sbjct: 6 RKRNEKLRA--KRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDE-IRLKREAKKK 62
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 28.8 bits (65), Expect = 3.8
Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 6/62 (9%)
Query: 353 ERKRKREAREPHKRAE--KARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEK 410
E++ RE + KR K R + + +R E+ + +K+ + + EK K++ ++
Sbjct: 72 EKELLREKEKKKKRKRPGKKRRIALRLRR----ERTKERAEKEKRTRKNREKKFKRRQKE 127
Query: 411 KE 412
KE
Sbjct: 128 KE 129
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 29.6 bits (67), Expect = 3.9
Identities = 14/44 (31%), Positives = 23/44 (52%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK 395
EE KR E RE RA K+RS + + + ++ E +M+ +
Sbjct: 126 EEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169
>gnl|CDD|214660 smart00434, TOP4c, DNA Topoisomerase IV. Bacterial DNA
topoisomerase IV, GyrA, ParC.
Length = 444
Score = 30.2 bits (69), Expect = 4.1
Identities = 16/108 (14%), Positives = 43/108 (39%), Gaps = 11/108 (10%)
Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK 395
E+++ ++Y RR +Y K + E H ++ I + + + + K+
Sbjct: 330 EFLDHRLEVYTRRKEYLLGKLEAERL--HILEGLFIALSIIDEIIVLIRSSKDLAKEAKE 387
Query: 396 IKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEK 443
+L++ + + +LD ++ K+ ++++ KE
Sbjct: 388 KLMERFELSEIQAD---------AILDMRLRRLTKLEVEKLEKELKEL 426
>gnl|CDD|214429 MTH00139, COX2, cytochrome c oxidase subunit II; Provisional.
Length = 226
Score = 29.7 bits (67), Expect = 4.2
Identities = 16/61 (26%), Positives = 26/61 (42%), Gaps = 4/61 (6%)
Query: 141 MCDALWLKAMCDTLWLTTMCDALW--LTAMCLISLHISSLATNMLSKFKEILDTYLTDKV 198
+ K +L + + +W L A L+ L + SL +L E+ D YLT K
Sbjct: 42 SLSLMSNKFTSRSLLESQEVETIWTVLPAFILLFLALPSL--RLLYLMDEVSDPYLTFKA 99
Query: 199 I 199
+
Sbjct: 100 V 100
>gnl|CDD|206243 pfam14073, Cep57_CLD, Centrosome localisation domain of Cep57. The
CLD or centrosome localisation domain of Cep57 is found
at the N-terminus, and lies approximately between
residues 58 and 239. This region lies within the first
alpha-helical coiled-coil segment of Cep57, and
localises to the centrosome internally to gamma-tubulin,
suggesting that it is either on both centrioles or on a
centromatrix component. This N-terminal region can also
multimerise with the N-terminus of other Cep57
molecules. The C-terminal part, Family Cep57_MT_bd,
pfam06657, is the microtubule-binding region of Cep57.
Length = 178
Score = 29.4 bits (66), Expect = 4.2
Identities = 18/76 (23%), Positives = 35/76 (46%), Gaps = 5/76 (6%)
Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKK 395
E E+ R + +R+++ + + + EK + K +L Q E K
Sbjct: 88 ENAEKERTAVLEQQVSLQREKEHDQMDVQAQLEKLEVLEKEYLRLTRTQSLAET-----K 142
Query: 396 IKAHEEKLTKQKHEKK 411
I+ EEKL +++H++K
Sbjct: 143 IQQLEEKLQEEEHQRK 158
>gnl|CDD|130605 TIGR01542, A118_put_portal, phage portal protein, putative, A118
family. This model represents a family of phage minor
structural proteins. The protein is suggested to be the
head-tail connector, or portal protein, on the basis of
its position in the phage gene order, its presence in
mature phage, its size, and its conservation across a
number of complete genomes of tailed phage that lack
other candidate portal proteins. Several other known
portal protein families lack clear homology to this
family and to each other [Mobile and extrachromosomal
element functions, Prophage functions].
Length = 476
Score = 30.3 bits (68), Expect = 4.3
Identities = 19/91 (20%), Positives = 36/91 (39%), Gaps = 2/91 (2%)
Query: 311 KAMCDTLWLTTMCDALWLTAMSLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKA 370
+ + T L + D + A + I+ ++ Y + DY E KR +
Sbjct: 1 RRVGLTKSLKDIKDHPKINANDEEYKRIDMWKRYY--QGDYAEVHNLNHNGNTVKRRLLS 58
Query: 371 RSVRKIKAKLYNKQRRNEKIQMKKKIKAHEE 401
++ K+ AK NE++++ KA E
Sbjct: 59 MNLPKVAAKKMASLLFNEQVKINVDDKAANE 89
>gnl|CDD|226108 COG3580, COG3580, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 351
Score = 29.7 bits (67), Expect = 4.5
Identities = 16/63 (25%), Positives = 29/63 (46%), Gaps = 14/63 (22%)
Query: 437 KQKRKEKAGKWEVPIPKV----------RTQADAEVFKIVRSGKTKR----KAWKRMITK 482
+KE+AG+ + IP+V T F++V S K+ + K + + ++
Sbjct: 12 HSLKKEEAGRGTIGIPRVLNMYEYYPFFHTFFTELGFRVVLSPKSSKELYEKGIETIPSE 71
Query: 483 VCF 485
VCF
Sbjct: 72 VCF 74
>gnl|CDD|162036 TIGR00784, citMHS, citrate transporter, CitMHS family. This family
includes two characterized citrate/proton symporters
from Bacillus subtilis. CitM transports citrate
complexed to Mg2+, while the CitH apparently transports
citrate without Mg2+. The family also includes
uncharacterized transporters, including a third paralog
in Bacillus subtilis [Transport and binding proteins,
Carbohydrates, organic alcohols, and acids].
Length = 431
Score = 29.8 bits (67), Expect = 4.5
Identities = 19/74 (25%), Positives = 30/74 (40%), Gaps = 13/74 (17%)
Query: 227 LPIYLETGDNRLLLLT-PMVS----ELNPWSIHEPSYRVGPMPQTVSNLPLQPSQSSGTL 281
LP+Y G N ++L M+ + PW GP + +S L L PS+ L
Sbjct: 126 LPLYKRLGMNPMVLAGIAMLGLGIMNMIPWG--------GPTTRAISVLGLDPSEFFVPL 177
Query: 282 WLKAMCDALWLTAM 295
+ L + A+
Sbjct: 178 IPTMLGGILCVIAV 191
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 29.1 bits (66), Expect = 4.5
Identities = 18/71 (25%), Positives = 40/71 (56%), Gaps = 10/71 (14%)
Query: 352 EERKRKREAR--------EPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKL 403
E+RK++++AR E + AE+A++ + + + N+Q R + + +K I A ++L
Sbjct: 23 EKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQ-RQAEAE-QKAIVAQIKQL 80
Query: 404 TKQKHEKKEEG 414
+Q +++G
Sbjct: 81 IEQNRIDRKDG 91
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 30.0 bits (68), Expect = 4.9
Identities = 19/103 (18%), Positives = 43/103 (41%), Gaps = 18/103 (17%)
Query: 351 YEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEK----IQMKKKIKAHEEKLTKQ 406
EE+KR A + A K + + + ++++ K + + ++ + E +L +
Sbjct: 637 LEEQKR---AEAEARTALKQARLDLQRLQ---NEQQSLKDKLELAIAERKQQAETQLRQL 690
Query: 407 KHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEV 449
+ K L Q+ + L + ++ R E+ KW+V
Sbjct: 691 DAQLK--------QLLEQQQAFLEALKDDFRELRTERLAKWQV 725
>gnl|CDD|201249 pfam00472, RF-1, RF-1 domain. This domain is found in peptide
chain release factors such as RF-1 and RF-2, and a
number of smaller proteins of unknown function. This
domain contains the peptidyl-tRNA hydrolase activity.
The domain contains a highly conserved motif GGQ, where
the glutamine is thought to coordinate the water that
mediates the hydrolysis.
Length = 114
Score = 28.3 bits (64), Expect = 5.0
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 348 RLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQR 385
+L E ++KRE +P + ++ R RK K + YN +
Sbjct: 70 KLYEAELQKKREKTKPTRASQVRRGDRKEKIRTYNFPQ 107
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 28.5 bits (64), Expect = 5.2
Identities = 30/138 (21%), Positives = 56/138 (40%), Gaps = 21/138 (15%)
Query: 309 WLKAMCDTLWLTTMCDALWLTAMSLHNEYIERHRKLYGRRLDYEERKRKREAR-EPHKRA 367
+ A L +S + ++ +KL +L E+K KR+ R E +
Sbjct: 2 FADAFAKILASKLPASKRKDPILSRSKKLLKAKKKLKSEKL---EKKAKRQLRAEKRQAL 58
Query: 368 EKARSVRKIKAKLYNKQRRNEKI------QMKKKIKAHEEKLTKQKHEKKEEGALPVYLL 421
EK R V+ + + K+RR K+ ++ ++A + K + KE G
Sbjct: 59 EKGR-VKPVLPEDLEKERRLRKVAQKGVVKLFNAVRAAQ---KKTEKAVKEAGK------ 108
Query: 422 DRDVQSRAKVLSNMIKQK 439
V+ R ++L + K+K
Sbjct: 109 KARVK-REELLKEVSKEK 125
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 30.1 bits (68), Expect = 5.2
Identities = 16/66 (24%), Positives = 27/66 (40%), Gaps = 5/66 (7%)
Query: 351 YEERKRKREAREPHKRAEKARSVRKI---KAKLYNKQRRNEKIQMKKKIKAHEEKLTKQK 407
E R +E +EP + E K ++ L N+++ E+ + IK + K KQ
Sbjct: 1518 IEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNK--KQY 1575
Query: 408 HEKKEE 413
E
Sbjct: 1576 KSNTEA 1581
>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A.
Length = 427
Score = 29.8 bits (68), Expect = 5.3
Identities = 21/96 (21%), Positives = 41/96 (42%), Gaps = 18/96 (18%)
Query: 336 EYIERHRKLYGRRLDYEERKRKREAR--EPHKRA-EKARSVRK-IKAKLYNKQRRNE--- 388
E++E ++Y RR +Y K + E +A K V + I+ + K+ + E
Sbjct: 312 EFLEHRLEVYKRRKEYLLEKLEERLHILEGLLKALNKIDFVIEVIRGSIDLKKAKKELIE 371
Query: 389 -----------KIQMKKKIKAHEEKLTKQKHEKKEE 413
+++++ K EKL K+ E ++E
Sbjct: 372 ELSEIQADYLLDMRLRRLTKEEIEKLEKEIEELEKE 407
>gnl|CDD|107304 cd06309, PBP1_YtfQ_like, Periplasmic binding domain of ABC-type
YtfQ-like transport systems. Periplasmic binding domain
of ABC-type YtfQ-like transport systems. The YtfQ
protein from Escherichia coli is up-regulated under
glucose-limited conditions and shares homology with a
family of pentose/hexose sugar-binding proteins of the
type I periplasmic binding protein superfamily. Members
of this group are predicted to be involved in the
transport of sugar-containing molecules across cellular
and organellar membranes; however their ligand
specificity is not determined experimentally.
Length = 273
Score = 29.5 bits (67), Expect = 5.4
Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 9/45 (20%)
Query: 409 EKKEEGALPVYLLDRDVQSRAKVL------SNMIKQKRKEKAGKW 447
E K G +PV L+DR V + L S+ +++ R +A W
Sbjct: 75 EAKAAG-IPVILVDRGVDVKDDSLYVTFIGSDFVEEGR--RAADW 116
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 29.7 bits (67), Expect = 5.5
Identities = 25/119 (21%), Positives = 35/119 (29%), Gaps = 22/119 (18%)
Query: 356 RKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGA 415
K + K + K K +K ++K KKK K E+K K + E K
Sbjct: 35 SKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG-- 92
Query: 416 LPVYLLDRDVQSRAKVLSNMIKQKRKEKAGKWEVPIPKVRTQADAEVFKIVRSGKTKRK 474
K +K K K + P PK D KI +
Sbjct: 93 --------------------FKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNV 131
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 29.3 bits (66), Expect = 5.5
Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 3/59 (5%)
Query: 354 RKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKE 412
R + E +K +S +K K KLY + E+ + +KK+K E++L Q+ K+
Sbjct: 158 RISQLEKTSLVDEKQKKKSAKK-KRKLYKEL--KERKEREKKLKKVEQRLELQRELMKK 213
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 29.0 bits (66), Expect = 6.1
Identities = 25/97 (25%), Positives = 50/97 (51%), Gaps = 12/97 (12%)
Query: 357 KREAREPHKRA---EKARSVRKIKAKLYN--KQRRNEKIQMKKKIKAHEEKLTKQ--KHE 409
K+EA E K+ E + K++A+ K+RRNE + +K++ EE L ++ E
Sbjct: 44 KKEA-EALKKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLE 102
Query: 410 KKEEGAL----PVYLLDRDVQSRAKVLSNMIKQKRKE 442
KKEE + + ++ + + L +I+++++E
Sbjct: 103 KKEESLEEKEKELAARQQQLEEKEEELEELIEEQQQE 139
>gnl|CDD|223368 COG0291, RpmI, Ribosomal protein L35 [Translation, ribosomal
structure and biogenesis].
Length = 65
Score = 26.8 bits (60), Expect = 6.2
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 11/57 (19%)
Query: 452 PKVRT-QADAEVFKIVRSGKTKRK-AWKRMITKVCFVGENFTRKPPKFERFIRPMAL 506
PK++T + A+ FKI +GK KRK A KR I T+K K +R +R A+
Sbjct: 2 PKMKTKKGAAKRFKITGTGKIKRKHAGKRHI---------LTKKSTKRKRHLRKTAV 49
>gnl|CDD|153298 cd07614, BAR_Endophilin_A2, The Bin/Amphiphysin/Rvs (BAR) domain of
Endophilin-A2. BAR domains are dimerization, lipid
binding and curvature sensing modules found in many
different proteins with diverse functions. Endophilins
are accessory proteins, localized at synapses, which
interact with the endocytic proteins, dynamin and
synaptojanin. They are essential for synaptic vesicle
formation from the plasma membrane. They interact with
voltage-gated calcium channels, thus linking vesicle
endocytosis to calcium regulation. They also play roles
in virus budding, mitochondrial morphology maintenance,
receptor-mediated endocytosis inhibition, and endosomal
sorting. Endophilins contain an N-terminal N-BAR domain
(BAR domain with an additional N-terminal amphipathic
helix), followed by a variable region containing proline
clusters, and a C-terminal SH3 domain. They are
classified into two types, A and B. Endophilin-A
proteins are enriched in the brain and play multiple
roles in receptor-mediated endocytosis. Endophilin-A2
(or endophilin-2) is also referred to as SH3P8 (SH3
domain containing protein 8) or SH3GL1 (SH3 domain
containing Grb2-like protein 1). It localizes to
presynaptic nerve terminals and forms heterodimers with
endophilin-A1 through their BAR domains. Endophilin-A2
binds dynamin 1, synaptojanin 1, and the
beta1-adrenergic receptor cytoplasmic tail through its
SH3 domain.
Length = 223
Score = 29.3 bits (65), Expect = 6.2
Identities = 14/34 (41%), Positives = 23/34 (67%), Gaps = 2/34 (5%)
Query: 338 IERH-RKLYGRRLDYEERKRKREAREPHKRAEKA 370
I+ H +KL GRRLD+ + K+KR+ + P + +A
Sbjct: 130 IQHHLKKLEGRRLDF-DYKKKRQGKIPDEELRQA 162
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 28.0 bits (62), Expect = 6.3
Identities = 15/55 (27%), Positives = 23/55 (41%), Gaps = 6/55 (10%)
Query: 389 KIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQK-RKE 442
K Q+ K+ KA + K K +KK ++ RA + + K RKE
Sbjct: 4 KSQLSKEKKAEKAKAGTAKDKKKWSKK-----KKKEEARRAVTVDEELLDKIRKE 53
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 28.3 bits (64), Expect = 6.7
Identities = 14/79 (17%), Positives = 41/79 (51%), Gaps = 14/79 (17%)
Query: 374 RKIKAKL--YNKQRRNEKIQMKKKIKAHEEKLTKQ--------KHEKKEEGALPVYLLDR 423
+ + +L K+R+ E +++K+++ +EKL K + +K++E + +
Sbjct: 17 KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKE----LQKKVQ 72
Query: 424 DVQSRAKVLSNMIKQKRKE 442
+ Q + + L ++++++E
Sbjct: 73 EFQRKQQKLQQDLQKRQQE 91
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 28.2 bits (63), Expect = 7.2
Identities = 7/31 (22%), Positives = 18/31 (58%)
Query: 383 KQRRNEKIQMKKKIKAHEEKLTKQKHEKKEE 413
K R+ ++ ++K ++A E+ Q ++ E+
Sbjct: 102 KARKRQRKVIRKLLEAEEQLREDQYDDEDED 132
>gnl|CDD|129363 TIGR00261, traB, pheromone shutdown-related protein TraB. traB is
a plasmid encoded gene that functions in the shutdown of
the peptide sex pheromone cPD1 which is produced by the
plasmid free recipient cell prior to conjugative
transfer in Enterococcus faecalis. Once the recipient
acquires the plasmid, production of cPD1 is shut down.
The gene product may play another role in the other
species in the family [Unknown function, General].
Length = 380
Score = 29.0 bits (65), Expect = 7.5
Identities = 20/65 (30%), Positives = 32/65 (49%), Gaps = 7/65 (10%)
Query: 380 LYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKEEGALPVYLLDRDVQSRAKVLSNMIKQK 439
L N Q+ K+ ++ IK E T EK ++ +P+ L+DRD+++ K I
Sbjct: 74 LANFQK---KLGEEQGIKPGSEMKTAI--EKAKKHGIPLILIDRDIETTLKRA--WISIT 126
Query: 440 RKEKA 444
EKA
Sbjct: 127 FFEKA 131
>gnl|CDD|153299 cd07615, BAR_Endophilin_A3, The Bin/Amphiphysin/Rvs (BAR) domain of
Endophilin-A3. BAR domains are dimerization, lipid
binding and curvature sensing modules found in many
different proteins with diverse functions. Endophilins
are accessory proteins localized at synapses that
interacts with the endocytic proteins, dynamin and
synaptojanin. They are essential for synaptic vesicle
formation from the plasma membrane. They interact with
voltage-gated calcium channels, thus linking vesicle
endocytosis to calcium regulation. They also play roles
in virus budding, mitochondrial morphology maintenance,
receptor-mediated endocytosis inhibition, and endosomal
sorting. Endophilins contain an N-terminal N-BAR domain
(BAR domain with an additional N-terminal amphipathic
helix), followed by a variable region containing proline
clusters, and a C-terminal SH3 domain. They are
classified into two types, A and B. Endophilin-A
proteins are enriched in the brain and play multiple
roles in receptor-mediated endocytosis. Endophilin-A3
(or endophilin-3) is also referred to as SH3P13 (SH3
domain containing protein 13) or SH3GL3 (SH3 domain
containing Grb2-like protein 3). It regulates
Arp2/3-dependent actin filament assembly during
endocytosis. It binds N-WASP through its SH3 domain and
enhances the ability of N-WASP to activate the Arp2/3
complex. Endophilin-A3 co-localizes with the vesicular
glutamate transporter 1 (VGLUT1), and may play an
important role in the synaptic release of glutamate.
Length = 223
Score = 28.8 bits (64), Expect = 7.6
Identities = 12/29 (41%), Positives = 20/29 (68%), Gaps = 1/29 (3%)
Query: 342 RKLYGRRLDYEERKRKREAREPHKRAEKA 370
+KL GRRLD+ + K+KR+ + P + +A
Sbjct: 135 KKLEGRRLDF-DYKKKRQGKIPDEEIRQA 162
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 28.8 bits (65), Expect = 8.4
Identities = 12/46 (26%), Positives = 26/46 (56%), Gaps = 6/46 (13%)
Query: 352 EERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIK 397
EE + K+E + K+ E+ + K+ + +QR+ E+ + KK+ +
Sbjct: 282 EEAQEKKEEK---KKEEREAKLAKLSPE---EQRKLEEKERKKQAR 321
>gnl|CDD|221313 pfam11917, DUF3435, Protein of unknown function (DUF3435). This
family of proteins are functionally uncharacterized.
This protein is found in eukaryotes. Proteins in this
family are typically between 435 to 791 amino acids in
length. This family is related to pfam00589 suggesting
it may be an integrase enzyme.
Length = 418
Score = 29.3 bits (66), Expect = 8.4
Identities = 11/66 (16%), Positives = 27/66 (40%), Gaps = 1/66 (1%)
Query: 336 EYIERHRKLYGRRLDYEERKRKREAREPHKRAEKA-RSVRKIKAKLYNKQRRNEKIQMKK 394
E I + L + + K + ++R EK R VR + +L + ++ + + +
Sbjct: 292 ELIRKRDHLKKEIIALYGQVAKAKGTPLYERLEKRRREVRNERQRLRRELKKKIREEFDE 351
Query: 395 KIKAHE 400
+ +
Sbjct: 352 EQPVID 357
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 28.1 bits (63), Expect = 8.7
Identities = 14/54 (25%), Positives = 25/54 (46%), Gaps = 5/54 (9%)
Query: 359 EAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKE 412
E + AE R + KAK ++ R +K +K K + E++ K++ K
Sbjct: 49 EDEDDESTAESKRPEGRKKAK---EKLRRDK--LKAKKEEAEKEKEKEERFMKA 97
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 29.0 bits (66), Expect = 8.7
Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 7/69 (10%)
Query: 332 SLHNEYIERHRKLYGRRLDYEERKRKREAREPHKRAEKARSVRKIKAKLYNKQRRNEKIQ 391
+L+ +Y E +L LD +E K +A++ ++ E + K K +L+ +
Sbjct: 44 NLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRLF-------VLD 96
Query: 392 MKKKIKAHE 400
K I A E
Sbjct: 97 FKGDIDASE 105
>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
coiled-coil domain [Function unknown].
Length = 231
Score = 28.7 bits (64), Expect = 9.6
Identities = 16/62 (25%), Positives = 25/62 (40%), Gaps = 4/62 (6%)
Query: 347 RRLDYEERKRKREAREPHKRAEKARS----VRKIKAKLYNKQRRNEKIQMKKKIKAHEEK 402
+ D EE +++ E R+ EK K KL + + E K+ IK E
Sbjct: 44 TKQDVEELRKETEQRQKELADEKLEVRKQKATKEDLKLLQRFQEEEFRATKEDIKRLETI 103
Query: 403 LT 404
+T
Sbjct: 104 IT 105
>gnl|CDD|237292 PRK13182, racA, polar chromosome segregation protein; Reviewed.
Length = 175
Score = 28.1 bits (63), Expect = 9.6
Identities = 20/85 (23%), Positives = 37/85 (43%), Gaps = 7/85 (8%)
Query: 332 SLHNEYIERHRKLYGRRLDYEERKRKREAREP------HKRAEKARSVRKIKAKLYNKQR 385
S+ E +E RRLD ER+ +++A + R E + +++ KL + +
Sbjct: 84 SVDFEQLEAQLNTITRRLDELERQLQQKADDVVSYQLLQHRREMEEMLERLQ-KLEARLK 142
Query: 386 RNEKIQMKKKIKAHEEKLTKQKHEK 410
+ E I + + E+ K K K
Sbjct: 143 KLEPIYITPDTEPTYEREKKPKRRK 167
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 28.8 bits (65), Expect = 9.8
Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 3/53 (5%)
Query: 364 HKRAEKARSVRKIKAK---LYNKQRRNEKIQMKKKIKAHEEKLTKQKHEKKEE 413
K E+ R +K K L +K +++ K A E++L + +KEE
Sbjct: 182 LKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEE 234
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.133 0.435
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 33,212,973
Number of extensions: 3196491
Number of successful extensions: 3795
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3497
Number of HSP's successfully gapped: 237
Length of query: 657
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 554
Effective length of database: 6,369,140
Effective search space: 3528503560
Effective search space used: 3528503560
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 62 (27.5 bits)