RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6664
(624 letters)
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 384 bits (987), Expect = e-128
Identities = 164/446 (36%), Positives = 240/446 (53%), Gaps = 41/446 (9%)
Query: 35 SELVCRVKYNNTLPDIPFDCKFIAYPF-ESTRFIEYKSTSLERNYKYEVLT--EHDLGVT 91
+ + +VKY N LPD PF K I YPF + RF TSL R K VL +HDLG+
Sbjct: 1 QDYIAKVKYTNNLPDPPFPPKLITYPFPDENRFSPQLITSLYRKTKVTVLIQLDHDLGMP 60
Query: 92 IDLIN---------KDTYAPNYNIQLDPADEKLLEEDILTPQDSKRSRHHARSVSWLRRT 142
+DL+ K Y + N++LDP D LL + R++ ++ VS+LRRT
Sbjct: 61 LDLLKIPGLLQGDTKVYYGDDVNVKLDPEDRILLRDPGADK----RTKTDSKKVSFLRRT 116
Query: 143 EYISTEQTRFQPQVMDKVEAKVGFSIKKNFKEDNLYMDRDSQVKAIEKTFADSKLPIEKH 202
EYISTE RF Q ++ E+KVG ++K+N +E++ Y+DR++Q++AIEKTF D K I KH
Sbjct: 117 EYISTESARFAVQALELTESKVGRNVKQNLREEDEYLDREAQIEAIEKTFEDVKKEILKH 176
Query: 203 YSKPNVTPVEIMPVYPDFKYWKYPCAQVIFDSDPAPVGRPIPAQIEEMSQAMIRGVMDES 262
SK NV VE+ P+ PD QV FD D AP G+ AQ+ + S I G ++
Sbjct: 177 PSKKNVKAVEVWPLLPDTASMDQQFFQVRFDGDAAPDGKEKHAQLTQASSTAIFGPVELE 236
Query: 263 GEQFVAYFLP-------TAETLDKRKLDQLSNLEYTDEQEYEYKMAREYNWNVKSKSCKG 315
E+FV+Y+LP +TL+KR D +Y +++EY++K REY+ VKSK+ K
Sbjct: 237 EEEFVSYYLPDEKDSEILEDTLEKRSDDL---HDYDEDEEYKFKRVREYDMKVKSKATKL 293
Query: 316 YEENYFLIVRD-DAVYYNELETRVRLSKRR-----QKVGAAPNNTRLVVTHRPLNATEFR 369
E F + + VYY L +RV L +RR + + NN +L V R + E +
Sbjct: 294 NELALFFVSDENGVVYYKPLRSRVELRRRRVNDVIRPLVREHNNDQLNVKLRNPSTKESK 353
Query: 370 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
++ R L+P + EE +EDE+EEEE+ + E +E EG + E G ++
Sbjct: 354 MRDKRRARLDPIDFEEVDEDEDEEEEQRSD---------EHEEEEGEDSEEEGSQSREDG 404
Query: 430 KRTSDAEHSSKNSSRASSRSRSRSKS 455
S ++ S + S+A S S S S
Sbjct: 405 SSESSSDVGSDSESKADKESASDSDS 430
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 67.9 bits (166), Expect = 1e-11
Identities = 47/261 (18%), Positives = 81/261 (31%), Gaps = 16/261 (6%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
P + E A + + S
Sbjct: 128 PSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPS 187
Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDK 498
S + S + + + P + S +++S + RSA D A S S+ S+ S
Sbjct: 188 SPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGC 247
Query: 499 ---------------SAASNRSRSGSAASNK-SRSDRSRSGSAGSDRSAASNKSVRSDKS 542
R S + SR + S S+ +RS + + S
Sbjct: 248 GWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGP 307
Query: 543 AASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRS 602
A S+ A S S S+ S + S+S + + + S ++S S SR + S R
Sbjct: 308 APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRK 367
Query: 603 RSASKSGSSAASSRSGSPAGS 623
R S+ ++ +G P
Sbjct: 368 RPRPSRAPSSPAASAGRPTRR 388
Score = 60.2 bits (146), Expect = 3e-09
Identities = 46/201 (22%), Positives = 79/201 (39%), Gaps = 11/201 (5%)
Query: 433 SDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSAR 492
+ A ++ S ASSR + S S + + +A S R +
Sbjct: 155 AGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPIS 214
Query: 493 SDRSDKSAASNRS---RSGSAASNKSRSDRSRSGSAGSDR------SAASNKSVRSDKSA 543
+ S + A RS +G+++S+ S S+ S G + + + + + S
Sbjct: 215 ASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASG 274
Query: 544 ASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSR 603
+ S+R + S+S R RS S S + + A + S S S SR S + S S
Sbjct: 275 WNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRA-SSSSSSSRESSSS-STSSS 332
Query: 604 SASKSGSSAASSRSGSPAGSD 624
S S G++ + S S + S
Sbjct: 333 SESSRGAAVSPGPSPSRSPSP 353
Score = 59.8 bits (145), Expect = 3e-09
Identities = 43/211 (20%), Positives = 72/211 (34%), Gaps = 16/211 (7%)
Query: 414 EGGEREGGGEGEN-DNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGS 472
E G E E +S + S + S + + S S
Sbjct: 243 ESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSS 302
Query: 473 NKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAA 532
S + S R++ S S+R S +++S+ S G+A S RS S S +
Sbjct: 303 PGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADP 362
Query: 533 SNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSR 592
S+ K +R+ S + R R+R+A A R + +G R
Sbjct: 363 SSPR----KRPRPSRAPSSPAASAGRPTRRRARAA-------VAGRARRRDATGRFPAGR 411
Query: 593 SVSRARSNRSRSASKSGSSAASSRSGSPAGS 623
R + + + SG+ A +P+G
Sbjct: 412 P----RPSPLDAGAASGAFYARYPLLTPSGE 438
Score = 43.6 bits (103), Expect = 3e-04
Identities = 44/228 (19%), Positives = 64/228 (28%), Gaps = 13/228 (5%)
Query: 361 RPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 420
P + + E E T E G
Sbjct: 223 APGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRP 282
Query: 421 GGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK 480
G + + + S + S S + S S S S ++ S S S SS S R A
Sbjct: 283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSE--SSRGAA 340
Query: 481 SDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSD 540
RS R S SR+ S + SAG + R
Sbjct: 341 VSPGPSPSRSPSPSR---PPPPADPSSPRKRPRPSRAPSSPAASAGR-------PTRRRA 390
Query: 541 KSAASNRSARSDGSDKSASNRSRSR-SASSQASNNSAASNRSSNKSGS 587
++A + R+ R D + + + R R + AS A SG
Sbjct: 391 RAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGE 438
Score = 34.4 bits (79), Expect = 0.22
Identities = 18/94 (19%), Positives = 36/94 (38%), Gaps = 1/94 (1%)
Query: 433 SDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASS-RGSNKSDRSAKSDRSAKSDRSA 491
S + S + + S K P++ S +AS+ R + + R+A + R+ + D +
Sbjct: 346 SPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATG 405
Query: 492 RSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSA 525
R + + + S A + SG
Sbjct: 406 RFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEP 439
Score = 34.0 bits (78), Expect = 0.27
Identities = 26/152 (17%), Positives = 45/152 (29%), Gaps = 5/152 (3%)
Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
P+ + E E E Q+G + + R G +
Sbjct: 758 FSNPSLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLR-----RSGPAADAA 812
Query: 438 SSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSD 497
S S R S S+S G+A+ +S S+KS +A R+ +
Sbjct: 813 SRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRR 872
Query: 498 KSAASNRSRSGSAASNKSRSDRSRSGSAGSDR 529
+ AA+ + + A + R
Sbjct: 873 RPRPPEPRARPGAAAPPKAAAAAPPAGAPAPR 904
Score = 31.7 bits (72), Expect = 1.5
Identities = 27/144 (18%), Positives = 48/144 (33%), Gaps = 4/144 (2%)
Query: 485 AKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAA 544
S R AR+ S + S K + A R A S+ VR++ +
Sbjct: 738 RYSPRRARARASAWDITDALFSNPSLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFR 797
Query: 545 SNRSARSDGSDKSASNR----SRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSN 600
R G A++R +SRS + + S+ R + +RS ++S
Sbjct: 798 RPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSK 857
Query: 601 RSRSASKSGSSAASSRSGSPAGSD 624
+ + ++ R P
Sbjct: 858 PAAAGGRARGKNGRRRPRPPEPRA 881
Score = 31.7 bits (72), Expect = 1.7
Identities = 24/158 (15%), Positives = 52/158 (32%), Gaps = 4/158 (2%)
Query: 442 SSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAA 501
+RAS+ + + +P+ + A + + + S+ R+ + R
Sbjct: 745 RARASAWDITDALFSNPSLVP-AKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLR 803
Query: 502 SNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNR 561
+ + AAS + +SRS + ++ R +AA ARS S KS
Sbjct: 804 RSGP-AADAASRTASKRKSRSHTPDGGSESSG--PARPPGAAARPPPARSSESSKSKPAA 860
Query: 562 SRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARS 599
+ R+ + + + + +
Sbjct: 861 AGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPA 898
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 66.1 bits (162), Expect = 3e-11
Identities = 39/214 (18%), Positives = 66/214 (30%), Gaps = 7/214 (3%)
Query: 390 EEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRS 449
A + A RQ + + E R + A + S+ ++++
Sbjct: 68 ATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQA 127
Query: 450 RSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGS 509
R R + A+R + A G ++ A R+ ++R+ +R ++ +R +
Sbjct: 128 RERRERGEAARRGAARKAGEGG----EQPATEARADAAERTEEEERDERRRRGDREDRQA 183
Query: 510 AASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASS 569
A R R G G DR R R R G + R R A
Sbjct: 184 EAERGERGRREERGRDGDDRDRRDR---REQGDRREERGRRDGGDRRGRRRRRDRRDARG 240
Query: 570 QASNNSAASNRSSNKSGSRSRSRSVSRARSNRSR 603
+ + G R R R R R
Sbjct: 241 DDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGR 274
Score = 54.5 bits (132), Expect = 1e-07
Identities = 24/195 (12%), Positives = 59/195 (30%), Gaps = 2/195 (1%)
Query: 409 ERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSAS 468
E + G +R + A +++ + + ++ + + +PA R+ + +A+
Sbjct: 57 EARGGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAA 116
Query: 469 SRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSD 528
S A+ R +AR + K+ + A ++ +
Sbjct: 117 EAASAPEAAQARERRER--GEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRR 174
Query: 529 RSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSR 588
R ++ +++ R R D R + + +R +
Sbjct: 175 RGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRD 234
Query: 589 SRSRSVSRARSNRSR 603
R R +R
Sbjct: 235 RRDARGDDNREDRGD 249
Score = 53.4 bits (129), Expect = 2e-07
Identities = 30/151 (19%), Positives = 41/151 (27%), Gaps = 3/151 (1%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
+ E + E + GE E E E E E D +R D E
Sbjct: 122 PEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDR 181
Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDK 498
+ R R +G + +R + D + R R DR D
Sbjct: 182 QAEAERGERGRREE--RGRDGDDRDRRDRREQGDRREERG-RRDGGDRRGRRRRRDRRDA 238
Query: 499 SAASNRSRSGSAASNKSRSDRSRSGSAGSDR 529
NR G + R G DR
Sbjct: 239 RGDDNREDRGDRDGDDGEGRGGRRGRRFRDR 269
Score = 50.3 bits (121), Expect = 2e-06
Identities = 36/187 (19%), Positives = 48/187 (25%), Gaps = 12/187 (6%)
Query: 349 AAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEA 408
AA A + + + P E E Q E E
Sbjct: 74 AAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRER 133
Query: 409 ERQEGEGGEREGGGEGE-NDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSA 467
G R+ G GE E R AE + + R R ++ A+R G
Sbjct: 134 GEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRR 193
Query: 468 SSRGSNKSDRSAKSDRSAK-----------SDRSARSDRSDKSAASNRSRSGSAASNKSR 516
RG + DR + R DR R R D+ A
Sbjct: 194 EERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGD 253
Query: 517 SDRSRSG 523
R G
Sbjct: 254 DGEGRGG 260
Score = 49.9 bits (120), Expect = 4e-06
Identities = 21/165 (12%), Positives = 39/165 (23%), Gaps = 5/165 (3%)
Query: 364 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 423
R +R + E ++ E A+ ++ E E + + G + E
Sbjct: 125 AQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAE 184
Query: 424 GENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDR 483
E R + + R R + + R + R + R
Sbjct: 185 AERGERGRREERGRDGDDRDRRDRREQGDRRE-----ERGRRDGGDRRGRRRRRDRRDAR 239
Query: 484 SAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSD 528
+ D R R R G +
Sbjct: 240 GDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNERE 284
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 52.7 bits (127), Expect = 6e-07
Identities = 38/252 (15%), Positives = 84/252 (33%), Gaps = 29/252 (11%)
Query: 370 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
+ + E E EE+E +E+ + + + + + + ++ E +
Sbjct: 1134 LDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKE-------------KK 1180
Query: 430 KRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDR 489
K+ S A+ S K S +S+ + + + + ++S GS++ D + + KS
Sbjct: 1181 KKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSV 1240
Query: 490 SARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSA 549
+ + S+ S S +++ SD + + S +
Sbjct: 1241 KRLKSKKNNSSKS------SEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDG 1294
Query: 550 RSDGSDKSASNRSRSR----------SASSQASNNSAASNRSSNKSGSRSRSRSVSRARS 599
S+G K +S + + S A + S ++ + SR
Sbjct: 1295 ESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLR 1354
Query: 600 NRSRSASKSGSS 611
+ S S S
Sbjct: 1355 RPRKKKSDSSSE 1366
Score = 38.9 bits (91), Expect = 0.009
Identities = 28/219 (12%), Positives = 72/219 (32%), Gaps = 7/219 (3%)
Query: 339 RLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAE 398
+L K + K + + + + + ++ + ++ ++ +
Sbjct: 1168 KLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQED 1227
Query: 399 ETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSP 458
+ +Q + + ++ + DN++ +SD A R + S P
Sbjct: 1228 DEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPP 1287
Query: 459 AKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSD 518
+ G SN + + + R S +S +A KS++
Sbjct: 1288 PSKRPDGE-----SNGGSKPSSPTKKKVKKRLEGS--LAALKKKKKSEKKTARKKKSKTR 1340
Query: 519 RSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKS 557
++ ++ S R + +SD S+ + + D S+
Sbjct: 1341 VKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDE 1379
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 49.9 bits (119), Expect = 3e-06
Identities = 33/191 (17%), Positives = 71/191 (37%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
E EEE+E+++ +EE++ ++ + E + + + E E+ E E + EK+ +
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDR 160
Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDK 498
+ R R++SR K K + + ++ + + +R +
Sbjct: 161 EEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKE 220
Query: 499 SAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA 558
+ ++ + S +S S S S S S + S++ +
Sbjct: 221 EDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETRPRT 280
Query: 559 SNRSRSRSASS 569
S R S +S
Sbjct: 281 SLRPPSARPAS 291
Score = 29.1 bits (65), Expect = 8.7
Identities = 21/182 (11%), Positives = 54/182 (29%), Gaps = 7/182 (3%)
Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDK 498
SK SS + + + SK PA + + N+S + + ++ + + K
Sbjct: 69 SKLSSDEAVKRVEKGGSKGPAAK---TKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125
Query: 499 SAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA 558
+R A + ++ + + + + ++ A +R +
Sbjct: 126 EEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPN 185
Query: 559 SNRSRSRSAS----SQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAAS 614
+ ++ + N+ + R + +S S+
Sbjct: 186 KKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEI 245
Query: 615 SR 616
SR
Sbjct: 246 SR 247
>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
This family consists of several hypothetical proteins
from Arabidopsis thaliana and Oryza sativa. The function
of this family is unknown.
Length = 564
Score = 48.3 bits (115), Expect = 8e-06
Identities = 40/201 (19%), Positives = 66/201 (32%), Gaps = 10/201 (4%)
Query: 423 EGENDNEKRTSDAEHSSKNSSRAS--SRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK 480
+D + + +S R S S S+S S SP + S N +DR+
Sbjct: 122 AFFSDAVIQVIKRKKASSAPRRGSWDSSSKSASIDSSPTVIGPRPRSFSE-LNLTDRTPA 180
Query: 481 SDRSAKS----DRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKS 536
RS++S + S+ RS GS S S R + + R S S
Sbjct: 181 KVRSSRSELGAPSPSGGTSCPSSSGGRRSSIGSRRLRGSASLRKKVAVLSAPRKPGSRSS 240
Query: 537 VRSDKSAASNRSARS-DGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVS 595
A + SA+S S S +AS +++SS + +
Sbjct: 241 DCKSSPRARSSSAKSPFKSSIQRKATKALSKLSLRASPKD--TSKSSKSEVAPPKKSEAK 298
Query: 596 RARSNRSRSASKSGSSAASSR 616
S++ + + S
Sbjct: 299 VPSSSKKWTDGNVSWDSLPSS 319
Score = 37.5 bits (87), Expect = 0.020
Identities = 41/190 (21%), Positives = 65/190 (34%), Gaps = 24/190 (12%)
Query: 435 AEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSD 494
A S S A + R K+ S +R S+S S S + R +D
Sbjct: 117 AADSLAFFSDAVIQVIKRKKASSAPRRGSWDSSSKSASIDSSPTVIGPRPRSFSELNLTD 176
Query: 495 RSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGS 554
R+ S+RS G+ + + S S SG S S R SA+ + +
Sbjct: 177 RTPAKVRSSRSELGAPSPSGGTSCPSSSGGRRSSIG-----SRRLRGSASLRKKVAVLSA 231
Query: 555 DKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAAS 614
+ +RS S KS R+RS S +++ S + + +
Sbjct: 232 PRKPGSRS------------------SDCKSSPRARS-SSAKSPFKSSIQRKATKALSKL 272
Query: 615 SRSGSPAGSD 624
S SP +
Sbjct: 273 SLRASPKDTS 282
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 44.5 bits (105), Expect = 2e-04
Identities = 31/110 (28%), Positives = 44/110 (40%), Gaps = 9/110 (8%)
Query: 423 EGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKS---DRSA 479
E + + EK S R SR RSR + + R RS SR ++ RS
Sbjct: 4 EPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSP 63
Query: 480 KSDRSAKSDRS-ARSDRSDKSAASN-----RSRSGSAASNKSRSDRSRSG 523
+S R + RS R R +S S R R S ++ + D+ RS
Sbjct: 64 RSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSL 113
Score = 41.8 bits (98), Expect = 0.001
Identities = 32/117 (27%), Positives = 42/117 (35%), Gaps = 6/117 (5%)
Query: 427 DNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAK 486
D E + ++ R+S R R RS+ +S R R SR + + S DR
Sbjct: 2 DEEPDREREKSRGRDRDRSSERPRRRSRDRS---RFRDRHRRSRERSYREDSRPRDRRRY 58
Query: 487 SDRSARSDRSDKSAAS---NRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSD 540
RS RS R S R RS S S + R R S + K D
Sbjct: 59 DSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSLWD 115
Score = 40.3 bits (94), Expect = 0.003
Identities = 31/131 (23%), Positives = 39/131 (29%), Gaps = 19/131 (14%)
Query: 488 DRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNR 547
D +R R R +S + R RSR S DR S + + S +R
Sbjct: 2 DEEPDREREKS-----RGRDRDRSSERPR-RRSRDRSRFRDRHRRSRERSYREDSRPRDR 55
Query: 548 SARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASK 607
S +S S RS R + SRS R R RS S
Sbjct: 56 RRYDSRSPRSLRYSSVRRS-------------RDRPRRRSRSVRSIEQHRRRLRDRSPSN 102
Query: 608 SGSSAASSRSG 618
RS
Sbjct: 103 QWRKDDKKRSL 113
Score = 31.8 bits (72), Expect = 1.1
Identities = 27/112 (24%), Positives = 33/112 (29%), Gaps = 5/112 (4%)
Query: 470 RGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDR 529
+ R DRS++ R DRS RSR S R R R
Sbjct: 6 DREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERS----YREDSRPRDRRRYDSR 61
Query: 530 SAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
S S + +S R RS +R R R S RS
Sbjct: 62 SPRSLRYSSVRRSRDRPRR-RSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRS 112
Score = 31.4 bits (71), Expect = 1.5
Identities = 21/113 (18%), Positives = 35/113 (30%), Gaps = 4/113 (3%)
Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTS 433
R+ + E+ D + E + + +R D + S
Sbjct: 1 RDEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDS 60
Query: 434 DAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAK 486
+ S + SS SR R R +S+S RS R S + + K
Sbjct: 61 RSPRSLRYSSVRRSRDRPRRRSRS----VRSIEQHRRRLRDRSPSNQWRKDDK 109
Score = 30.2 bits (68), Expect = 3.6
Identities = 27/119 (22%), Positives = 37/119 (31%), Gaps = 13/119 (10%)
Query: 448 RSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRS 507
R + KS R R SS + R R RS R D +R R
Sbjct: 3 EEPDREREKS---RGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYRED-----SRPR- 53
Query: 508 GSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRS 566
+ RS S + ++ R +S S R D+S SN+ R
Sbjct: 54 ----DRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDD 108
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 43.2 bits (102), Expect = 4e-04
Identities = 27/76 (35%), Positives = 34/76 (44%), Gaps = 9/76 (11%)
Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
LE + E +DEEEE+EE EE E E EG +E + E + + S E
Sbjct: 379 LEIEDANTERDDEEEEDEEEEE---------EEDEDEGPSKEHSDDEEFEEDDVESKYED 429
Query: 438 SSKNSSRASSRSRSRS 453
S NSS A RS
Sbjct: 430 SDGNSSLAVGYKNDRS 445
>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases [Signal transduction
mechanisms].
Length = 1175
Score = 43.0 bits (101), Expect = 6e-04
Identities = 39/181 (21%), Positives = 64/181 (35%), Gaps = 6/181 (3%)
Query: 437 HSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRS 496
HS +S SS + + SPA S S + SS S K K S
Sbjct: 94 HSPSATSSTSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQKRKNPLLPSSS 153
Query: 497 DKSAASNRSRSGSAASNKSRSD-RSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSD 555
+ + S+ + RSR + ++S++ + + SD S+
Sbjct: 154 THGTHPPIVFTDNNGSHAGAPNARSRKEI-----PSLGSQSMQLPSPHFRQKFSSSDTSN 208
Query: 556 KSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASS 615
+ R S S S S + ++ S S S SN + S+S S + + SS
Sbjct: 209 GFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSSSNSEAMSTSS 268
Query: 616 R 616
+
Sbjct: 269 K 269
Score = 39.5 bits (92), Expect = 0.006
Identities = 48/196 (24%), Positives = 69/196 (35%), Gaps = 9/196 (4%)
Query: 437 HSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRS 496
H +S +S + R S P S S ++S+ N +D S S + S +S
Sbjct: 70 HQIFSSFSSSPKLFQRRNSAGPITHSPSATSSTSSLNSNDGDQFSPASDSLSFNPSSTQS 129
Query: 497 DKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDK 556
K + N S G+ +N S +A S + S GS
Sbjct: 130 RKDSGPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQS 189
Query: 557 SA--SNRSRSRSASSQASNNSAASN------RSSNKSGSRSR-SRSVSRARSNRSRSASK 607
S R + +SS SN + + SSN S S +V R + S AS
Sbjct: 190 MQLPSPHFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASL 249
Query: 608 SGSSAASSRSGSPAGS 623
S+ S S S A S
Sbjct: 250 ISSNITPSSSNSEAMS 265
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 42.5 bits (100), Expect = 6e-04
Identities = 26/196 (13%), Positives = 67/196 (34%), Gaps = 2/196 (1%)
Query: 419 EGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRS 478
+ + N+ TS S ++ + ++ + S ++ S
Sbjct: 38 TTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYK 97
Query: 479 AKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVR 538
+ + + S + + S+ S ++ R+ ++ S ++ S
Sbjct: 98 NLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDS-S 156
Query: 539 SDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRAR 598
+ S + ++ A + + ++ ++S NS N+S S+ S + +
Sbjct: 157 IKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPT-QPNQSNSQPASDDTANQK 215
Query: 599 SNRSRSASKSGSSAAS 614
S+ + S S S+ S
Sbjct: 216 SSSKDNQSMSDSALDS 231
Score = 39.4 bits (92), Expect = 0.005
Identities = 26/184 (14%), Positives = 58/184 (31%)
Query: 432 TSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSA 491
+D + S S+ + ++ S + + +N S++ + + D S
Sbjct: 26 YADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSST 85
Query: 492 RSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARS 551
+ + +++ + S + N + S +S
Sbjct: 86 SDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKS 145
Query: 552 DGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSS 611
S+ S +Q+S A N+ + S + S S + S + ++S S
Sbjct: 146 TNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQ 205
Query: 612 AASS 615
AS
Sbjct: 206 PASD 209
Score = 29.4 bits (66), Expect = 7.7
Identities = 26/179 (14%), Positives = 70/179 (39%), Gaps = 14/179 (7%)
Query: 446 SSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRS 505
S+ + SK S SS+ ++K+D + S++ + + D S + +
Sbjct: 34 STAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIID 93
Query: 506 RSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSR 565
++ + N S+ + N ++ ++ +
Sbjct: 94 ------FIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRN------ 141
Query: 566 SASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAGSD 624
S +++N+S ++ SS K+ + ++S +A + ++ S++ + S ++ + SP +
Sbjct: 142 --SEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQ 198
>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein.
Length = 346
Score = 42.0 bits (99), Expect = 6e-04
Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 7/78 (8%)
Query: 441 NSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSA 500
SR+SSRS SRS SR S S +N +R++ S + A +
Sbjct: 154 GRSRSSSRSSSRS-------NSRGPSRGSSRNNSRNRNSSSPDDLVAAVLAALAKLGFGK 206
Query: 501 ASNRSRSGSAASNKSRSD 518
+ S+ S + KS ++
Sbjct: 207 QKSSSKKPSRVTKKSAAE 224
Score = 35.5 bits (82), Expect = 0.077
Identities = 17/44 (38%), Positives = 22/44 (50%)
Query: 570 QASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAA 613
S +S+RSS++S SR SR SR S S+S AA
Sbjct: 151 GFRGRSRSSSRSSSRSNSRGPSRGSSRNNSRNRNSSSPDDLVAA 194
Score = 33.9 bits (78), Expect = 0.23
Identities = 18/64 (28%), Positives = 30/64 (46%)
Query: 438 SSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSD 497
SS++SSR++SR SR S++ ++ S S + AK + S + R
Sbjct: 159 SSRSSSRSNSRGPSRGSSRNNSRNRNSSSPDDLVAAVLAALAKLGFGKQKSSSKKPSRVT 218
Query: 498 KSAA 501
K +A
Sbjct: 219 KKSA 222
Score = 31.6 bits (72), Expect = 1.2
Identities = 16/40 (40%), Positives = 20/40 (50%), Gaps = 1/40 (2%)
Query: 586 GSRSRSRSVSRARS-NRSRSASKSGSSAASSRSGSPAGSD 624
G R RSRS SR+ S + SR S+ S S S + D
Sbjct: 151 GFRGRSRSSSRSSSRSNSRGPSRGSSRNNSRNRNSSSPDD 190
Score = 31.2 bits (71), Expect = 1.6
Identities = 24/109 (22%), Positives = 45/109 (41%), Gaps = 10/109 (9%)
Query: 502 SNRSRSGSAASNKSRS---DRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA 558
RSRS S +S++S S R S + +R+++S + + AA G K
Sbjct: 153 RGRSRSSSRSSSRSNSRGPSRGSSRNNSRNRNSSSPDDLVAAVLAA----LAKLGFGKQ- 207
Query: 559 SNRSRSRSASSQASNNSAASNRSSNKSGSRS--RSRSVSRARSNRSRSA 605
+ S+ S ++ S AA + + R+ + +V++ R
Sbjct: 208 KSSSKKPSRVTKKSAAEAAKKQLNKPRWKRTPNKGENVTQCFGPRGPGQ 256
>gnl|CDD|215814 pfam00242, DNA_pol_viral_N, DNA polymerase (viral) N-terminal
domain.
Length = 379
Score = 42.1 bits (99), Expect = 7e-04
Identities = 33/143 (23%), Positives = 57/143 (39%), Gaps = 2/143 (1%)
Query: 462 SRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSR 521
++S SR N+ R+ ++ S KSD S +S RSR G A N+ + +
Sbjct: 206 AQSSGILSRSENRRTRNLANNTSRKSDTSRSVGPVRQSQ-IQRSRLGLQA-NQGKLAHGQ 263
Query: 522 SGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
G +GS R + + R S+ ++ + S+S +S + S+ S + S
Sbjct: 264 QGRSGSIRGRKHSTTRRPFGVEPSSSGVTTNRASSSSSCFHQSAVRETAYSSLSTSERHS 323
Query: 582 SNKSGSRSRSRSVSRARSNRSRS 604
S+ RS S +
Sbjct: 324 SSGHAVELRSIPGGSVSSQNAGP 346
Score = 34.0 bits (78), Expect = 0.22
Identities = 24/134 (17%), Positives = 44/134 (32%), Gaps = 8/134 (5%)
Query: 495 RSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGS 554
D+ + S S + N+ + + + S SD S + +S + + G
Sbjct: 199 HGDEPFGAQSSGILSRSENRRTRNLANNTSRKSDTSRSVGPVRQSQIQRSRLGLQANQGK 258
Query: 555 DKSA----SNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGS 610
S R R S+ + S + S S S ++S + S
Sbjct: 259 LAHGQQGRSGSIRGRKHSTTRRPFGVEPSSSGVTTNRASSSSSC----FHQSAVRETAYS 314
Query: 611 SAASSRSGSPAGSD 624
S ++S S +G
Sbjct: 315 SLSTSERHSSSGHA 328
Score = 30.2 bits (68), Expect = 3.4
Identities = 20/121 (16%), Positives = 38/121 (31%), Gaps = 5/121 (4%)
Query: 424 GENDNEKRTSDAEHSSKNSSRASSRSR-----SRSKSKSPAKRSRSGSASSRGSNKSDRS 478
++D + S SR ++ + +S + R R S + R S
Sbjct: 229 RKSDTSRSVGPVRQSQIQRSRLGLQANQGKLAHGQQGRSGSIRGRKHSTTRRPFGVEPSS 288
Query: 479 AKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVR 538
+ + S S+ +S + S S S + S G S+ + +
Sbjct: 289 SGVTTNRASSSSSCFHQSAVRETAYSSLSTSERHSSSGHAVELRSIPGGSVSSQNAGPLL 348
Query: 539 S 539
S
Sbjct: 349 S 349
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 42.2 bits (99), Expect = 8e-04
Identities = 49/186 (26%), Positives = 78/186 (41%), Gaps = 13/186 (6%)
Query: 436 EHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDR 495
H K + S S K+ S +AS + S+K R K+D A S +
Sbjct: 712 GHPYKKGVPPK---PAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRL 768
Query: 496 SDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSD 555
+KS S+ S S S++ + S S++ R ++ NK S +S S+ S +
Sbjct: 769 EEKS--SSCSPSSSSSHHHSSSNK-------ESRKSSRNKEEEMLPSPSSPLSSSSPKPE 819
Query: 556 KSASNRSRSR-SASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAAS 614
+ R R + SS + SA+S +SS+KS S S+ R S+ S+ S +
Sbjct: 820 HPSRKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTEGKGSSTSKEHKGSSGDTPN 879
Query: 615 SRSGSP 620
S P
Sbjct: 880 KASSFP 885
Score = 40.7 bits (95), Expect = 0.002
Identities = 39/198 (19%), Positives = 71/198 (35%), Gaps = 10/198 (5%)
Query: 373 LRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRT 432
L R P ++ E++ + +Q T +E+ +G + + + + +K
Sbjct: 706 LLSRIPGHPYKKGVPPKPAEKDSLSAPKKQTSKTASEKSSSKGKRKH---KNDEEADKIE 762
Query: 433 SDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSR-------GSNKSDRSAKSDRSA 485
S + + SS S S S S K SR S + S S S K + +
Sbjct: 763 SKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRNKEEEMLPSPSSPLSSSSPKPEHPS 822
Query: 486 KSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAAS 545
+ + D S S + S + S++ + S S ++ GS S S + AS
Sbjct: 823 RKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTEGKGSSTSKEHKGSSGDTPNKAS 882
Query: 546 NRSARSDGSDKSASNRSR 563
+ + S R +
Sbjct: 883 SFPVPPLSNGSSKPRRPK 900
Score = 35.7 bits (82), Expect = 0.089
Identities = 37/188 (19%), Positives = 68/188 (36%), Gaps = 13/188 (6%)
Query: 424 GENDNEKRTSDAEHSSKNSSRASSRSRSRSK-----SKSPAKRSRSGSASSRGSNKSDRS 478
+ + S S ++SS+ + + K K +K+ R SS S S S
Sbjct: 723 PAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSS 782
Query: 479 AKSDRSAKSDRSARSDRSDK-------SAASNRSRSGSAASNKSRSDRSRSGSAGSDRSA 531
S K R + ++ ++ +S+ + + + R S S+G SA
Sbjct: 783 HHHSSSNKESRKSSRNKEEEMLPSPSSPLSSSSPKPEHPSRKRPRRQEDTSSSSGP-FSA 841
Query: 532 ASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRS 591
+S KS S + +R GS S ++ S ++AS+ + + R +
Sbjct: 842 SSTKSSSKSSSTSKHRKTEGKGSSTSKEHKGSSGDTPNKASSFPVPPLSNGSSKPRRPKL 901
Query: 592 RSVSRARS 599
R S
Sbjct: 902 VFDDRVHS 909
Score = 31.0 bits (70), Expect = 2.4
Identities = 42/202 (20%), Positives = 75/202 (37%), Gaps = 3/202 (1%)
Query: 421 GGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK 480
G + ++ E K+SS+ R R+ K + + A S R+
Sbjct: 479 GKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGPESGRGKQKSPAQSEAPP-QRRTVG 537
Query: 481 SDRSAKSDR-SARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRS 539
+ K ++ SA +R+ S +S ++ DR ++ + GS + + + S
Sbjct: 538 KKQPKKPEKASAGDERTGLRPESEPGTLPYGSSVQTPPDRPKAATKGSRKPSPRKEPKSS 597
Query: 540 DKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSR-SVSRAR 598
AA R +S S +SS S + SS G+ S+ S + R
Sbjct: 598 VPPAAEKRKYKSPSKIVPKSREFIETDSSSSDSPEDESLPPSSQSPGNTESSKESCASLR 657
Query: 599 SNRSRSASKSGSSAASSRSGSP 620
+ RS+ S + + R SP
Sbjct: 658 TPVCRSSVGSQNDLSKDRLLSP 679
Score = 30.7 bits (69), Expect = 3.3
Identities = 45/209 (21%), Positives = 82/209 (39%), Gaps = 23/209 (11%)
Query: 433 SDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSAR 492
S +E S + +A+ + SR+ S + +ASS GS+ S S+ S+ S+ SD +
Sbjct: 363 SSSEDS--DEEQATEKPPSRNTPPSAPSSNPEPAASSSGSSSS--SSGSESSSGSDSESE 418
Query: 493 SDRSDKSAASNRSRSGS-----AASNKSRSDR------------SRSGSAGSDRSAASNK 535
S SD S + R+ S ++NK + D + S S+ K
Sbjct: 419 SSSSD-SEENEPPRTASPEPEPPSTNKWQLDNWLNKVNPHKVSPAESVSSNPPIKQPMEK 477
Query: 536 SVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVS 595
+ S + + KS+S R + + + +S +S + + R+V
Sbjct: 478 EGKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGPESGRGKQKSPAQSEAPPQRRTVG 537
Query: 596 RARSNRSRSASKSGSSAASSRSGSPAGSD 624
+ + + AS +G R S G+
Sbjct: 538 KKQPKKPEKAS-AGDERTGLRPESEPGTL 565
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 41.8 bits (98), Expect = 0.001
Identities = 66/300 (22%), Positives = 100/300 (33%), Gaps = 20/300 (6%)
Query: 333 ELETRVRLSKRRQKV---GAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEED 389
E E R R + RQ + P+ P A++ E E EEEEEE+
Sbjct: 400 EEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQ--ESEEEESVEEEEEEEEEEEE 457
Query: 390 EEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRS 449
EE+E EE E + E E E G E EG EG+ D E+ DAE + + S S
Sbjct: 458 EEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMS 517
Query: 450 RSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGS 509
+ R S S + S + + +S+ G
Sbjct: 518 EGQQ------PRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSSHTELEGV 571
Query: 510 AASNKSRSDRSRSGSAGSDRSAASNKS-VRSDKSAASNRSARSDGSDKSASNRSRSRSAS 568
A +++ SR ++ N S + + N S + ++ S R R R
Sbjct: 572 ATPVETKISSSRKLPPPPVSTSLENDSATVTSTTRNGNVSPHTPQDEQPPSGRKRKRKEE 631
Query: 569 SQASNNSAASNRSSNKSGSRSR--------SRSVSRARSNRSRSASKSGSSAASSRSGSP 620
++ R N S S S S R + S +++ G P
Sbjct: 632 VESEPLGNQYLRHHNGSEKDGLPAPMDPVTSCSPVADSSTRVDTPSHELVTSSPQTPGDP 691
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 41.1 bits (96), Expect = 0.002
Identities = 37/170 (21%), Positives = 68/170 (40%), Gaps = 5/170 (2%)
Query: 364 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 423
N E R +L P E++E+ EE EEE+ EE +G ++ ++ + ++ G +
Sbjct: 295 NDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEE--EGGLSKKGKKLKKLKGKKNGLD 352
Query: 424 GENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDR 483
++ + SD S S + K K P K S S N +
Sbjct: 353 KDDSDSGDDSD---DSDIDGEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPARPSPES 409
Query: 484 SAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAAS 533
K R A ++ S A+ + + + KS S +S + +S+++
Sbjct: 410 KDKGKRKAANEVSKSPASVPAKKLKTENAPKSSSGKSTPQTFSGSKSSSN 459
Score = 33.4 bits (76), Expect = 0.39
Identities = 34/199 (17%), Positives = 72/199 (36%), Gaps = 12/199 (6%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEG---EGGEREGGGEGENDNEKRTSDAEHSS 439
+ ++ +DE EE+ ++ ER++ E + + E+ E E
Sbjct: 273 DSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEG 332
Query: 440 KNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKS 499
S + + + K K S S+ + S +AK + + + S
Sbjct: 333 GLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKEPKKEEPVDS 392
Query: 500 AASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSAS 559
S+ SG A + D+ + +A S + +++ + KS+S
Sbjct: 393 NPSSPGNSGPARPSPESKDKGKRKAANEV----SKSPASVPA-----KKLKTENAPKSSS 443
Query: 560 NRSRSRSASSQASNNSAAS 578
+S ++ S S+++AA
Sbjct: 444 GKSTPQTFSGSKSSSNAAD 462
Score = 33.0 bits (75), Expect = 0.47
Identities = 36/226 (15%), Positives = 65/226 (28%), Gaps = 10/226 (4%)
Query: 402 QGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRAS----SRSRSRSKSKS 457
G + E GE+ GGG G++ K + + S +S+ K K
Sbjct: 188 NGPAAFGDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKK 247
Query: 458 PAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSD--RSDKSAASNRSRSGSAASNKS 515
AK + +G D A S D R + SD SA+ N +
Sbjct: 248 LAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPE 307
Query: 516 RSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNS 575
+ + K+ + + + K+ ++ S S ++
Sbjct: 308 IPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDI 367
Query: 576 AASNRSS----NKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRS 617
+ S K + V S+ S S + +
Sbjct: 368 DGEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPARPSPESKDKG 413
>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984). This
family of proteins is functionally uncharacterized. This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 393 and 442 amino
acids in length.
Length = 301
Score = 40.1 bits (94), Expect = 0.002
Identities = 54/193 (27%), Positives = 73/193 (37%), Gaps = 28/193 (14%)
Query: 447 SRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAK--SDRSARSDRSDKSAASNR 504
S SRS S+S R SSR S SD KS SA +S RS R KS ++
Sbjct: 57 SHSRSPSRS-----RLHKRKKSSRRSPMSDTLLKSKSSAHLLHHQSTRSHRRSKSGTTSP 111
Query: 505 SRSGSAASNKSRSD----RSRSGSAGSDR---------SAASNKSVRSDKSAASNRSARS 551
+ S+A + R+ + A S R S+ S+ S+ R A
Sbjct: 112 RKPSSSAHRRRNDSEWLLRAGAALASSTREEKGQSWLVKRESSTSLVSEDEEDFEREAAR 171
Query: 552 DGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSS 611
+ S R RS S S + ++ GSRS SR SRA + + +
Sbjct: 172 EREHSSRRASRRGRSGYSTPS-------AALSRRGSRSASRRGSRADLSMT-PLEARRAD 223
Query: 612 AASSRSGSPAGSD 624
A SR G D
Sbjct: 224 AEDSRDTVLLGPD 236
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 40.1 bits (94), Expect = 0.002
Identities = 31/152 (20%), Positives = 51/152 (33%), Gaps = 2/152 (1%)
Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDN 428
L F EEEEEE+E+E ++ + R + E E + +
Sbjct: 169 VNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPS 228
Query: 429 EKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDR--SAK 486
K S + + + S+ RS + S S AK++ G R + S K S
Sbjct: 229 PKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPESTS 288
Query: 487 SDRSARSDRSDKSAASNRSRSGSAASNKSRSD 518
S+ S+ + S S +
Sbjct: 289 QSPSSAEPTSESPQTAGNSSLSSLGDYSDSDE 320
Score = 29.3 bits (66), Expect = 6.7
Identities = 31/186 (16%), Positives = 60/186 (32%), Gaps = 33/186 (17%)
Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTS 433
LE E++E+EEEEEE+ + G ++ +R
Sbjct: 169 VNSMLEALFRREKKEEEEEEEEDEALI----------------KSLSFGPETEEDRRRAD 212
Query: 434 DAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARS 493
D + + + S +SS S + + +++ S+++
Sbjct: 213 DEDSEDD-------------EEDNDNTPSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKA 259
Query: 494 DRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDG 553
++ + R S K ++ S S+A S + S+ S+ D
Sbjct: 260 KKNSRGIPKPRDALSSLVVRK----KAAPESTSQSPSSAEPTSESPQTAGNSSLSSLGDY 315
Query: 554 SDKSAS 559
SD S
Sbjct: 316 SDSDES 321
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 40.8 bits (95), Expect = 0.003
Identities = 19/57 (33%), Positives = 35/57 (61%), Gaps = 3/57 (5%)
Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEA---ERQEGEGGEREGGGEGEND 427
R+ F+ ++E+E+ED+++EE++ EE ++ E + E +E EGGE + E D
Sbjct: 148 RDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVD 204
Score = 29.6 bits (66), Expect = 5.9
Identities = 18/64 (28%), Positives = 32/64 (50%), Gaps = 7/64 (10%)
Query: 370 IQRLRERFLE----PPNEEEEEEDEEEEEEEAEETQQGETTEAERQE---GEGGEREGGG 422
I+R R R L P + +D++E+E+E ++ ++ + E E +E G E E
Sbjct: 132 IKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191
Query: 423 EGEN 426
GE+
Sbjct: 192 GGED 195
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 40.2 bits (95), Expect = 0.003
Identities = 30/138 (21%), Positives = 54/138 (39%), Gaps = 11/138 (7%)
Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSS 443
EEEE +EE + EAEE + E EA E E E + AE +++ ++
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAE--------EEAKAEAAAAAPAEEAAEAAA 90
Query: 444 RASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASN 503
A + +R ++ A R +A + K+ + K + K ++ R K
Sbjct: 91 AAEAAAR---PAEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGG 147
Query: 504 RSRSGSAASNKSRSDRSR 521
+ R + R + +
Sbjct: 148 KGRRRRRGRRRRRKKKKK 165
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 40.4 bits (94), Expect = 0.003
Identities = 32/118 (27%), Positives = 49/118 (41%), Gaps = 4/118 (3%)
Query: 474 KSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAAS 533
++ S +D S D S K +R RS SAA+ D ++ + A
Sbjct: 71 ENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSKDDMEFKANPNEAG 130
Query: 534 NKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRS 591
S+ SDG K+++ +S S+SAS +NS S+ S+ SG S S
Sbjct: 131 KPGKPKGNQGEGLASS-SDGKSKASA-KSGSKSASKHGESNS--SDESATDSGKASAS 184
Score = 40.1 bits (93), Expect = 0.004
Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 2/99 (2%)
Query: 527 SDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSG 586
SD S NK+ + D + ++ + + K +SR RSAS+ A S
Sbjct: 65 SDESFLENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSKDDMEFKA 124
Query: 587 SRSRSRSVSRARSNRSRS--ASKSGSSAASSRSGSPAGS 623
+ + + + + N+ +S G S AS++SGS + S
Sbjct: 125 NPNEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSAS 163
Score = 33.1 bits (75), Expect = 0.51
Identities = 23/130 (17%), Positives = 50/130 (38%), Gaps = 3/130 (2%)
Query: 496 SDKSAASNRSRSGSAASNKS-RSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGS 554
SD+S N++ N + S+ + G +S + S + ++ +
Sbjct: 65 SDESFLENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSKDDMEFKA 124
Query: 555 DKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAAS 614
+ + + + + S++ +S + S S+S S ++ SA+ SG AS
Sbjct: 125 NPNEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESATDSGK--AS 182
Query: 615 SRSGSPAGSD 624
+ G+D
Sbjct: 183 ASVAGIVGAD 192
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 39.8 bits (93), Expect = 0.003
Identities = 47/230 (20%), Positives = 119/230 (51%), Gaps = 14/230 (6%)
Query: 386 EEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRA 445
EE+ +++E+++AEE QQ + E ER + ER E + E+ A K + A
Sbjct: 78 EEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEA 137
Query: 446 SSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRS 505
++++ + +K+K+ A+ R+ +A+ + + ++ + A+++ + K A ++ K+ A +
Sbjct: 138 AAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKK----AAAEAKKKAEAEAAA 193
Query: 506 RSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSR 565
++ + A K+ ++ + +A + + AA+ + K+AA ++A ++K+A+ ++ +
Sbjct: 194 KAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAA----AEKAAAAKAAEK 249
Query: 566 SASSQASN------NSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSG 609
+A+++A+ S +++ K+G ++ A + + SG
Sbjct: 250 AAAAKAAAEVDDLFGGLDSGKNAPKTGGGAKGNGAQGAGAGNGKKGGASG 299
Score = 36.3 bits (84), Expect = 0.048
Identities = 35/213 (16%), Positives = 89/213 (41%), Gaps = 10/213 (4%)
Query: 371 QRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEK 430
+RL++ E +E+++ EE ++A +Q + EA + + + E K
Sbjct: 101 ERLKQLEKERLAAQEQKKQAEEAAKQAAL-KQKQAEEAAAKAAAAAKAKAEAE-----AK 154
Query: 431 RTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRS 490
R + A + ++ + + + K+ + AK+ A + + K+ AK A++ +
Sbjct: 155 RAAAAAKKAAAEAKKKAEAEAAKKAAAEAKK----KAEAEAAAKAAAEAKKKAEAEAKKK 210
Query: 491 ARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSAR 550
A ++ K+AA ++ + AA+ + + + ++++AA+ + D S +
Sbjct: 211 AAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVDDLFGGLDSGK 270
Query: 551 SDGSDKSASNRSRSRSASSQASNNSAASNRSSN 583
+ + + ++ A + AS +
Sbjct: 271 NAPKTGGGAKGNGAQGAGAGNGKKGGASGADID 303
>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566). Family of
related proteins that is plant specific.
Length = 313
Score = 39.5 bits (92), Expect = 0.003
Identities = 31/121 (25%), Positives = 50/121 (41%), Gaps = 1/121 (0%)
Query: 504 RSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSR 563
R+ S S+ S + RS S S+ + S S + N A S + + S
Sbjct: 1 RAASVSSGSTSGDASSPRSSSRRRLSSSFLSTSASS-RPRRLNAPASPPSSSPARNTSSS 59
Query: 564 SRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAGS 623
S S+ +S + R S++ S SR + A N S + + + S++ SRS S
Sbjct: 60 SSFGLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRRTTSS 119
Query: 624 D 624
D
Sbjct: 120 D 120
Score = 36.1 bits (83), Expect = 0.039
Identities = 31/150 (20%), Positives = 62/150 (41%), Gaps = 14/150 (9%)
Query: 463 RSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRS 522
R+ S SS ++ S R S ++AS+R R +A ++ S +R+
Sbjct: 1 RAASVSSGSTSGDA-----SSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARN 55
Query: 523 GSAGSDRSAASNKSVRSDKSAASNRSAR-SDGSDKSASNRSRSRSASSQASNNSAASNRS 581
S+ S + + + S+R S GS +A++ + S + +S + ++S + +R
Sbjct: 56 TSSSSSFGLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRR 115
Query: 582 SNKSGSRSRSRS--------VSRARSNRSR 603
+ S S + V R + S+
Sbjct: 116 TTSSDLSSGNGPSVLSFMADVKRGKKGPSK 145
Score = 36.1 bits (83), Expect = 0.041
Identities = 31/109 (28%), Positives = 45/109 (41%), Gaps = 2/109 (1%)
Query: 430 KRTSDAEHSSKNSSRASSRSRSRSKSKSP--AKRSRSGSASSRGSNKSDRSAKSDRSAKS 487
+ SS S+ ASSR R + SP + +R+ S+SS R + R S
Sbjct: 19 SSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFGLSKQRPSSLSRGRLS 78
Query: 488 DRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKS 536
R R SAA++ + S + AS S SRS S ++ N
Sbjct: 79 SRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRRTTSSDLSSGNGP 127
Score = 35.7 bits (82), Expect = 0.066
Identities = 28/144 (19%), Positives = 52/144 (36%), Gaps = 19/144 (13%)
Query: 431 RTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRS 490
R + S + +S RS SR + S + + S R + + + S S S
Sbjct: 1 RAASVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSS 60
Query: 491 ARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSAR 550
+ + ++ +R R S + S S ++ S+ + AS +
Sbjct: 61 SFGLSKQRPSSLSRGRL------------SSRFVSPSRGSPSAAASLNGSLATASTSGSS 108
Query: 551 SDGSDKSASNRSRSRSASSQASNN 574
S +RSR ++S +S N
Sbjct: 109 S-------PSRSRRTTSSDLSSGN 125
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 40.0 bits (93), Expect = 0.005
Identities = 43/193 (22%), Positives = 62/193 (32%), Gaps = 10/193 (5%)
Query: 364 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 423
N +E I R+ E E E +E + + E EAEE + TEAE E E GE E G E
Sbjct: 679 NESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDE-GEIETGEE 737
Query: 424 GENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDR 483
GE ++ +AE + K + + ++ + A D
Sbjct: 738 GEEVEDEGEGEAE--------GKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDG 789
Query: 484 SAKSDRSA-RSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKS 542
K D A + + S D G A N+
Sbjct: 790 EMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDE 849
Query: 543 AASNRSARSDGSD 555
+ SDG D
Sbjct: 850 KGVDGGGGSDGGD 862
Score = 39.6 bits (92), Expect = 0.005
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
+ EEEEE+EEEEEEE EE ++ E E E +E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 38.8 bits (90), Expect = 0.010
Identities = 17/31 (54%), Positives = 21/31 (67%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAE 409
+ EEEEEE+EEEEEEE EE ++ E E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 37.7 bits (87), Expect = 0.023
Identities = 19/34 (55%), Positives = 21/34 (61%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
E EEEEEE+EEEEEEE EE + E E E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPE 900
Score = 36.9 bits (85), Expect = 0.042
Identities = 18/34 (52%), Positives = 22/34 (64%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
E EEEEEE+EEEEEEE EE E E +++
Sbjct: 872 EEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 36.5 bits (84), Expect = 0.048
Identities = 23/59 (38%), Positives = 30/59 (50%), Gaps = 2/59 (3%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE-GGEREGGGEGENDNEKRTSDAE 436
E P E E E EE E AE+ + ET EGE ER+G EGE + E + +D +
Sbjct: 650 ERPTEAEGENGEESGGE-AEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHK 707
Score = 36.1 bits (83), Expect = 0.070
Identities = 23/69 (33%), Positives = 29/69 (42%), Gaps = 7/69 (10%)
Query: 374 RERFLEPPNEEE---EEEDEEEEEEEAEETQQGETTEAER---QEGEGGEREGGGEGEND 427
R E N EE E E E E E + E +GE AER QEGEG + + +
Sbjct: 651 RPTEAEGENGEESGGEAEQEGETETKGENESEGE-IPAERKGEQEGEGEIEAKEADHKGE 709
Query: 428 NEKRTSDAE 436
E + E
Sbjct: 710 TEAEEVEHE 718
Score = 35.7 bits (82), Expect = 0.078
Identities = 16/31 (51%), Positives = 21/31 (67%)
Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
+ EEE+EEEEEEE EE ++ E E E + E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 34.6 bits (79), Expect = 0.20
Identities = 31/195 (15%), Positives = 49/195 (25%), Gaps = 5/195 (2%)
Query: 381 PNEEEEE---EDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
E E E E EE E Q+GET E E GE +GE + E E
Sbjct: 646 GEEGERPTEAEGENGEESGGEAEQEGETETKGENESE-GEIPAERKGEQEGEGEIEAKEA 704
Query: 438 SSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSD 497
K + A + G D + + K + DR +
Sbjct: 705 DHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEG-EGEAEGKHEVETEGDRKE 763
Query: 498 KSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKS 557
G ++ G D A + A ++
Sbjct: 764 TEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQA 823
Query: 558 ASNRSRSRSASSQAS 572
+ + + +
Sbjct: 824 DDTEVKDETGEQELN 838
Score = 32.7 bits (74), Expect = 0.69
Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 1/59 (1%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ-EGEGGEREGGGEGENDNEKRTSDAE 436
++ E +E E+E QGE + E+ +G GG G E E + E+ + E
Sbjct: 819 SETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEE 877
Score = 32.3 bits (73), Expect = 1.0
Identities = 15/31 (48%), Positives = 20/31 (64%)
Query: 387 EEDEEEEEEEAEETQQGETTEAERQEGEGGE 417
+ +EEEEEEE EE ++ E E E +E E E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 30.7 bits (69), Expect = 3.1
Identities = 15/31 (48%), Positives = 18/31 (58%)
Query: 389 DEEEEEEEAEETQQGETTEAERQEGEGGERE 419
D EEEEEE EE ++ E E E +E E E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 29.6 bits (66), Expect = 7.1
Identities = 14/37 (37%), Positives = 19/37 (51%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
+ + EEEEE EE ++ E E E +E E E E
Sbjct: 855 GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 29.2 bits (65), Expect = 7.7
Identities = 18/98 (18%), Positives = 37/98 (37%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
E E +E+EDE E + + +G+ + E EG G + + +D
Sbjct: 769 ETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEV 828
Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSD 476
+ + ++ ++K K G S G ++ +
Sbjct: 829 KDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEE 866
Score = 29.2 bits (65), Expect = 8.1
Identities = 13/37 (35%), Positives = 18/37 (48%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
+ D + EEE EE ++ E E E +E E E E
Sbjct: 853 DGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 29.2 bits (65), Expect = 9.0
Identities = 21/94 (22%), Positives = 34/94 (36%), Gaps = 7/94 (7%)
Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
L ++ + E E E EE ++ E E E GGE E GE E + E+
Sbjct: 627 LGDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETE-------TKGEN 679
Query: 438 SSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRG 471
S+ A + + + AK + +
Sbjct: 680 ESEGEIPAERKGEQEGEGEIEAKEADHKGETEAE 713
>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9. The KAR9 protein in
Saccharomyces cerevisiae is a cytoskeletal protein
required for karyogamy, correct positioning of the
mitotic spindle and for orientation of cytoplasmic
microtubules. KAR9 localises at the shmoo tip in mating
cells and at the tip of the growing bud in anaphase.
Length = 626
Score = 39.5 bits (92), Expect = 0.005
Identities = 29/181 (16%), Positives = 48/181 (26%)
Query: 441 NSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSA 500
SS SS P S + GS A R + S +K +
Sbjct: 397 PSSSPSSSVIMTPPDSGPGSNVSSRRVGTPGSKSDRVGAVLLRRMNIKPTLASIPDEKPS 456
Query: 501 ASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASN 560
+ + N S R + S + +K + S +S
Sbjct: 457 NISVFEDSETSPNSSTLLRDPPPKKCGEESGHLPNNPFFNKLKLTLSSIPPLSPRQSIIT 516
Query: 561 RSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSP 620
+S+ S+ S S S ++ + S ++S S R G
Sbjct: 517 LPTPSRPASRISSLSLRLGSYSGSIVSPPPYPTLVSRKGAAGLSFNRSVSDIEGERIGRY 576
Query: 621 A 621
Sbjct: 577 N 577
Score = 32.1 bits (73), Expect = 0.92
Identities = 36/179 (20%), Positives = 51/179 (28%), Gaps = 4/179 (2%)
Query: 428 NEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRS--A 485
N K T + K S+ + S + S R S + ++
Sbjct: 442 NIKPTLASIPDEKPSNISVFEDSETSPNSSTLLRDPPPKKCGEESGHLPNNPFFNKLKLT 501
Query: 486 KSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAAS 545
S S R SR S S+ S S SGS S + S + +
Sbjct: 502 LSSIPPLSPRQSIITLPTPSRPASRISSLSLRLGSYSGSIVSPPPYPTLVSRKGAAGLSF 561
Query: 546 NRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRS 604
NRS ++ R A A S SS +S S V + R
Sbjct: 562 NRSVSDIEGER--IGRYNLLPTRIPALPFKAESTTSSRRSSSLPSPTGVIGFPGSVPRF 618
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 39.1 bits (91), Expect = 0.005
Identities = 23/139 (16%), Positives = 62/139 (44%), Gaps = 3/139 (2%)
Query: 428 NEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKS 487
N K+ + + + +AS++++S + K K++ + S++ + K+ +AK+ + AK
Sbjct: 198 NAKKAAKTPAAKSGAKKASAKAKS-AAKKVSKKKAAKTAVSAKKAAKT--AAKAAKKAKK 254
Query: 488 DRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNR 547
++ K+ ++ AA+ ++ + + A K+ K+ A+ +
Sbjct: 255 TAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAK 314
Query: 548 SARSDGSDKSASNRSRSRS 566
+ + K A ++ R+
Sbjct: 315 APKRGAKGKKAKKVTKKRA 333
Score = 32.9 bits (75), Expect = 0.47
Identities = 28/162 (17%), Positives = 59/162 (36%), Gaps = 24/162 (14%)
Query: 456 KSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKS 515
+ AK++ A+ G+ K+ AKS S + A K+A S + + +AA
Sbjct: 196 AANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKA-----AKTAVSAKKAAKTAA---- 246
Query: 516 RSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNS 575
+ A A K+ ++ K AA K A+ + + + +
Sbjct: 247 ----KAAKKAKKTAKKALKKAAKAVKKAA-----------KKAAKAAAKAAKGAAKATKG 291
Query: 576 AASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRS 617
A + + + S++ + A++ + + K R+
Sbjct: 292 KAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKKRA 333
Score = 31.8 bits (72), Expect = 0.93
Identities = 22/147 (14%), Positives = 51/147 (34%), Gaps = 1/147 (0%)
Query: 405 TTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRS 464
T E R G + K + + S+ + +K+ AK++
Sbjct: 184 TREIGRVLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAK 243
Query: 465 GSASSRGSNKSDRSAKSDRSAKS-DRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSG 523
+A + K ++AK+ ++A+ + A+ + + K++ +
Sbjct: 244 TAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKA 303
Query: 524 SAGSDRSAASNKSVRSDKSAASNRSAR 550
+AGS A + R K + + +
Sbjct: 304 AAGSKAKATAKAPKRGAKGKKAKKVTK 330
Score = 30.6 bits (69), Expect = 2.4
Identities = 23/147 (15%), Positives = 59/147 (40%), Gaps = 14/147 (9%)
Query: 431 RTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRS 490
++A+ ++K + S ++ +K+KS AK+ K+ ++A S + A +
Sbjct: 195 GAANAKKAAKTPAAKSGAKKASAKAKSAAKKV--------SKKKAAKTAVSAKKAAKTAA 246
Query: 491 ARSDRSDKSAASNRSRSGSAA------SNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAA 544
+ ++ K+A ++ A + K+ + ++ + + A + K +A
Sbjct: 247 KAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAG 306
Query: 545 SNRSARSDGSDKSASNRSRSRSASSQA 571
S A + + A + + +A
Sbjct: 307 SKAKATAKAPKRGAKGKKAKKVTKKRA 333
Score = 29.1 bits (65), Expect = 7.7
Identities = 22/135 (16%), Positives = 50/135 (37%), Gaps = 5/135 (3%)
Query: 451 SRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAAS-NRSRSGS 509
++ AK +++ S S K ++AK+ SA+ + K+AA + +
Sbjct: 200 KKAAKTPAAKSGAKKASAKAKSAAKKVSKK--KAAKTAVSAKK--AAKTAAKAAKKAKKT 255
Query: 510 AASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASS 569
A ++ ++ +A AA+ + + K+ A+ K+A+ +A +
Sbjct: 256 AKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKA 315
Query: 570 QASNNSAASNRSSNK 584
+ K
Sbjct: 316 PKRGAKGKKAKKVTK 330
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 38.8 bits (90), Expect = 0.006
Identities = 46/228 (20%), Positives = 103/228 (45%), Gaps = 5/228 (2%)
Query: 386 EEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRA 445
E++ +++EE+ AEE + + E ER + ER E + E+ A+ K
Sbjct: 78 EQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQ 137
Query: 446 SSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRS 505
+ ++ + K K+ A ++++ + +++ ++ K++ +AK+ A++ +A
Sbjct: 138 ARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAE 197
Query: 506 RSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSR 565
AA+ K++++ A AA+ + ++K A+ ++ +DK+A+ +
Sbjct: 198 AEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAK----ADKAAAAAKAAE 253
Query: 566 SASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAA 613
++ A+ + SS K + A S S +A SG+S A
Sbjct: 254 RKAAAAALDDIFGGLSSGK-NAPKTGGGAKNASSQGSGAAKNSGASGA 300
Score = 36.5 bits (84), Expect = 0.040
Identities = 35/216 (16%), Positives = 89/216 (41%), Gaps = 7/216 (3%)
Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDN 428
+ +++ E E+E + E+E + QQ + EAE+Q +++ +
Sbjct: 84 KEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAA 143
Query: 429 EKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSD 488
E++ +K ++ A+ + K + +++ + + + K++ AK+
Sbjct: 144 EQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKA- 202
Query: 489 RSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAAS--- 545
+ ++ ++ A ++ +AA K+ +++ ++ + AA+ K+AA+
Sbjct: 203 AAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAALD 262
Query: 546 ---NRSARSDGSDKSASNRSRSRSASSQASNNSAAS 578
+ + K+ + S S A+ NS AS
Sbjct: 263 DIFGGLSSGKNAPKTGGGAKNASSQGSGAAKNSGAS 298
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 38.8 bits (90), Expect = 0.009
Identities = 33/161 (20%), Positives = 62/161 (38%), Gaps = 9/161 (5%)
Query: 427 DNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAK 486
D +R+ + KN++ ++ +++ K K+ K + R S KS K S
Sbjct: 202 DKMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKS----KVQNSHF 257
Query: 487 SDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASN 546
R RS + SG +S + +S D SN ++ N
Sbjct: 258 DVRILRSYTKENKKDEKNVVSGIRSSV---LLKRKSQCLRKDSYVYSNHQKKAKTGDPKN 314
Query: 547 RSARSDGSDKSASNRSRSRS--ASSQASNNSAASNRSSNKS 585
R++GS S ++ + S + S++ SN + +S +
Sbjct: 315 IIHRNNGSSNSNNDDTSSSNHLGSNRISNRNPSSPYKKQTT 355
Score = 37.3 bits (86), Expect = 0.027
Identities = 31/162 (19%), Positives = 64/162 (39%), Gaps = 5/162 (3%)
Query: 430 KRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDR 489
K+T+ + + R S R+ S K+ + +++ + + K DR D ++ R
Sbjct: 191 KQTNVTHTTCYDKMRRSPRNTSTIKN-NTNDKNKEKNKEKDKNIKKDR----DGDKQTKR 245
Query: 490 SARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSA 549
++ + S R NK SG S ++ +R D SN
Sbjct: 246 NSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGIRSSVLLKRKSQCLRKDSYVYSNHQK 305
Query: 550 RSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRS 591
++ D + S++S + S++++ SN+ +R+ S
Sbjct: 306 KAKTGDPKNIIHRNNGSSNSNNDDTSSSNHLGSNRISNRNPS 347
Score = 34.2 bits (78), Expect = 0.25
Identities = 38/183 (20%), Positives = 63/183 (34%), Gaps = 16/183 (8%)
Query: 423 EGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSD 482
N EK D S +SS SS + S SP S S S S
Sbjct: 112 IKANKKEKHNLD----SSSSSSISSSLTNISFFSSPTSIYSCLS-------NSLSSKHSP 160
Query: 483 RSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGS--AGSDRSAASNKSVRSD 540
+ K ++S + S ++ N+ S ++ + S R+ ++ K+ +D
Sbjct: 161 KVIKENQSTHVNISSDNSPRNKEISNKQLKKQTNVTHTTCYDKMRRSPRNTSTIKNNTND 220
Query: 541 KSAASNRSARSD---GSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRA 597
K+ N+ + D + S + Q S+ RS K + VS
Sbjct: 221 KNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGI 280
Query: 598 RSN 600
RS+
Sbjct: 281 RSS 283
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 38.5 bits (90), Expect = 0.009
Identities = 42/200 (21%), Positives = 78/200 (39%), Gaps = 20/200 (10%)
Query: 355 RLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
LV+ HR F + ++ E EE+ EE + E+ +E + R
Sbjct: 302 ELVLPHR-RRRKPFEQPQGKD---EKDLEEKPEEPGPDPEKP-DEGEDDAEQSGPRGHPT 356
Query: 415 GGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNK 474
G + + + +D + SS + + R R + ++ A R SG S S+
Sbjct: 357 PGNDDEKEP----DPQEEADGQGSSTDPAGDIFRIRVLAPPQARA-RGASGRRSRTRSDS 411
Query: 475 SDRSAKSDRSAKSDRSARSDRSDKSAASN-RSRSGSAASNKSR-SDRSRSGSAGS----- 527
R ++ R+ D + ++AA + R+R G+ A ++ R+G AG+
Sbjct: 412 RGRYVRARRNRGPPDDLAVDATLRAAAPHQRARPGAVAVEPEDLREKIRAGRAGNLVIFV 471
Query: 528 ---DRSAASNKSVRSDKSAA 544
S A+ + + K A
Sbjct: 472 VDASGSMAARGRMAAAKGAV 491
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 38.7 bits (90), Expect = 0.010
Identities = 24/123 (19%), Positives = 48/123 (39%), Gaps = 2/123 (1%)
Query: 323 IVRDDAVYYNELETRVRLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPN 382
IV + NELE+ V SK + + N + + + + +I+ R+ +
Sbjct: 2529 IVDNSTYIINELESHV--SKLNELLSYIDNEIKELENEKLKLLEKAKIEESRKERERIES 2586
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
E +E+ +EE+ ++ + + E + E RE G + + T E +
Sbjct: 2587 ETQEDNTDEEQINRQQQERLQKEEEQKAYSQERLNREVSGTDDTNKNHNTGHDESNYGRY 2646
Query: 443 SRA 445
S
Sbjct: 2647 SNK 2649
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 38.1 bits (88), Expect = 0.016
Identities = 28/140 (20%), Positives = 53/140 (37%), Gaps = 1/140 (0%)
Query: 381 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSK 440
P ++E EE E E+ +E++E ++G E +EGE G++ G + ++ T +
Sbjct: 524 PGDKEGEEGEHEDSKESDEPKEGGKP-GETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFP 582
Query: 441 NSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSA 500
+ K K P R S + KS + +S +S + +
Sbjct: 583 KDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRP 642
Query: 501 ASNRSRSGSAASNKSRSDRS 520
+S G + +S
Sbjct: 643 SSPERPEGPKIIKSPKPPKS 662
Score = 32.7 bits (74), Expect = 0.66
Identities = 36/170 (21%), Positives = 50/170 (29%), Gaps = 12/170 (7%)
Query: 381 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSK 440
EEE+ D+ +E E E +EGE GE E E + E E
Sbjct: 497 APIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGG-KPGETKEG 555
Query: 441 NSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSA 500
+ ++ SK P + K + + + K RSA+ KS
Sbjct: 556 EVGKKPGPAKEHKPSKIPTLSKK---PEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSP 612
Query: 501 ASNRS--------RSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKS 542
R S S K R S KS + KS
Sbjct: 613 KLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKS 662
>gnl|CDD|222790 PHA00430, PHA00430, tail fiber protein.
Length = 568
Score = 37.2 bits (86), Expect = 0.024
Identities = 25/134 (18%), Positives = 57/134 (42%), Gaps = 3/134 (2%)
Query: 428 NEKRTSDAEHSSKNSSRASSRSRSR-SKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAK 486
+ S+ N+ ++RSR+ ++++ A+R + S +S + K RS +++
Sbjct: 150 PLGQIKTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSE 209
Query: 487 SDRSARSDRSDKS--AASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAA 544
++R S S A+ S+ + D + +A + + AS + + +AA
Sbjct: 210 ANRFKGYADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHASEVNAANSATAA 269
Query: 545 SNRSARSDGSDKSA 558
+ + R+ A
Sbjct: 270 ATSANRAKQQADRA 283
Score = 36.4 bits (84), Expect = 0.043
Identities = 26/137 (18%), Positives = 60/137 (43%), Gaps = 3/137 (2%)
Query: 473 NKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAA 532
N+S +A+++ + + + R+ ++ N SA + K + ++ ++R
Sbjct: 158 NQSAWNARNEANRSRNEADRARN--QAERFNNESGASATNTKQWRSEADGSNSEANRFKG 215
Query: 533 SNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSR 592
S+ S AA ++ S +A + + +AS+ A++ S + +S + + S +R
Sbjct: 216 YADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHASEVNAANSATAAATSANR 275
Query: 593 SVSRARSNRSRSASKSG 609
+ +A A K G
Sbjct: 276 AKQQA-DRAKTEADKLG 291
Score = 34.9 bits (80), Expect = 0.12
Identities = 22/112 (19%), Positives = 47/112 (41%), Gaps = 4/112 (3%)
Query: 431 RTSDAEHSSKNSSRASSRSRSRSK-SKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDR 489
+ + S S+ + + RS + S S A R + + S S ++ + S +++
Sbjct: 181 QAERFNNESGASATNTKQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAESSSKEANT 240
Query: 490 SARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDK 541
+ D + K+AAS + S + + + + + SA + A +DK
Sbjct: 241 AG--DYATKAAASASAAHASEVNAANSATAA-ATSANRAKQQADRAKTEADK 289
Score = 34.9 bits (80), Expect = 0.14
Identities = 28/135 (20%), Positives = 57/135 (42%), Gaps = 11/135 (8%)
Query: 485 AKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAA 544
A D A K+ + + + A N+SR++ R+ + + S S +
Sbjct: 142 AVDDGDAVPLGQIKTWNQSAWNARNEA-NRSRNEADRARNQAERFNNESGASA----TNT 196
Query: 545 SNRSARSDGSDKSASNRSRSRSASSQASNNSAAS--NRSS---NKSGSRSRSRSVSRARS 599
+ +DGS S +NR + + S +S +A SS N +G + + S + +
Sbjct: 197 KQWRSEADGS-NSEANRFKGYADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAA 255
Query: 600 NRSRSASKSGSSAAS 614
+ S + + ++AA+
Sbjct: 256 HASEVNAANSATAAA 270
Score = 33.7 bits (77), Expect = 0.32
Identities = 15/93 (16%), Positives = 29/93 (31%)
Query: 525 AGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNK 584
A D A +++ +A N ++ S A + S SA + +
Sbjct: 142 AVDDGDAVPLGQIKTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWRS 201
Query: 585 SGSRSRSRSVSRARSNRSRSASKSGSSAASSRS 617
S S + S ++S + + S
Sbjct: 202 EADGSNSEANRFKGYADSMTSSVEAAKGQAESS 234
Score = 31.8 bits (72), Expect = 1.2
Identities = 28/108 (25%), Positives = 53/108 (49%), Gaps = 10/108 (9%)
Query: 519 RSRSGSAGSDRSAASNKSVRSDKSAASNRSARS-DGSDKSASNRSRSRSASSQASNNSAA 577
++ + SA + R+ A+ R++ A N++ R + S SA+N + RS + SN+ A
Sbjct: 155 KTWNQSAWNARNEAN--RSRNEADRARNQAERFNNESGASATNTKQWRSEAD-GSNSEA- 210
Query: 578 SNRSSNKSGSRSRSRSVSRARSNRSR----SASKSGSSAASSRSGSPA 621
NR + S + S ++ ++ S +A + AA+S S + A
Sbjct: 211 -NRFKGYADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHA 257
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 36.9 bits (85), Expect = 0.024
Identities = 19/52 (36%), Positives = 22/52 (42%), Gaps = 1/52 (1%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGE-TTEAERQEGEGGEREGGGEGENDNE 429
E EE + EEEE+EE GE E +E E E GE D E
Sbjct: 178 EEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYE 229
Score = 32.3 bits (73), Expect = 0.68
Identities = 15/51 (29%), Positives = 24/51 (47%)
Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEK 430
PP + + + +EEE EE++ ++ E E G RE E E + E
Sbjct: 168 PPAQPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEG 218
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 37.2 bits (86), Expect = 0.025
Identities = 33/161 (20%), Positives = 63/161 (39%), Gaps = 5/161 (3%)
Query: 462 SRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSR 521
S G S++ S + +KS S + + S + + +
Sbjct: 220 SSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHH 279
Query: 522 SGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
S A S S+KS S+ S +GS ++ SN S + + S + ++ +A + +
Sbjct: 280 SKHADSKLQTIEVIENHSNKSRPSSSST--NGSKETTSNSSSAAAGSIGSKSSKSAKHSN 337
Query: 582 SNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAG 622
NKS S +S S + + S+S S + + R+ +
Sbjct: 338 RNKSNSSPKSHSSA---NGSVPSSSVSDNESKQKRASKSSS 375
Score = 36.1 bits (83), Expect = 0.070
Identities = 55/262 (20%), Positives = 95/262 (36%), Gaps = 33/262 (12%)
Query: 367 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGG---------- 416
EF +Q L++ P E ++E+ EA T E + GG
Sbjct: 191 EFYMQLLQQAL--PKEENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGGNHSLSKSSNS 248
Query: 417 ---EREGGGEGE-NDNEKRTSDAEHSSKNS---------------SRASSRSRSRSKSKS 457
E E +G+ + + N S++SR S S +
Sbjct: 249 QTPELEYSEKGKDHHHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTN 308
Query: 458 PAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRS 517
+K + S S+S+ + +S+KS + + ++S S +S SA + S S + N+S+
Sbjct: 309 GSKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSV-PSSSVSDNESKQ 367
Query: 518 DRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAA 577
R+ S+G+ R + + S S N + S SA R QA A
Sbjct: 368 KRASKSSSGA-RDSKKDASGMSANGTVENCIPENKISTPSAIERLEQDIKKLQAELQQAR 426
Query: 578 SNRSSNKSGSRSRSRSVSRARS 599
N S ++ + +S
Sbjct: 427 QNESELRNQISLLTSLERSLKS 448
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 36.9 bits (86), Expect = 0.028
Identities = 20/43 (46%), Positives = 24/43 (55%)
Query: 372 RLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
R R R PP+ E EE+EEEEEE EE ++ E E E E
Sbjct: 291 RRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 33.9 bits (78), Expect = 0.29
Identities = 19/43 (44%), Positives = 24/43 (55%)
Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ 411
R +R R PP EEEEE+EEE EE EE ++ E E +
Sbjct: 291 RRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 33.1 bits (76), Expect = 0.50
Identities = 20/46 (43%), Positives = 24/46 (52%)
Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
R+ R R PP E E+EEEEEEE E ++ E E ER E
Sbjct: 287 RLPLRRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEE 332
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 35.6 bits (82), Expect = 0.030
Identities = 20/48 (41%), Positives = 30/48 (62%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEK 430
+E++E+DEEEEEE+ EE ++ E E E + E E E E +N + K
Sbjct: 45 DEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLK 92
Score = 30.2 bits (68), Expect = 2.0
Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 3/72 (4%)
Query: 386 EEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRA 445
+E+ E++EEEE E+ ++ E E E E E E E E ++E+ D + K +
Sbjct: 45 DEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDE---EEEEEDEEDNVDLKDIEKKNIND 101
Query: 446 SSRSRSRSKSKS 457
S +++
Sbjct: 102 IFNSTQDDNAQN 113
Score = 28.7 bits (64), Expect = 6.8
Identities = 19/63 (30%), Positives = 35/63 (55%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
E ++EEEE+E+EEE E E + E E +E E + E + ++ +K +D +S
Sbjct: 46 EKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNS 105
Query: 439 SKN 441
+++
Sbjct: 106 TQD 108
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 37.0 bits (85), Expect = 0.036
Identities = 29/206 (14%), Positives = 81/206 (39%)
Query: 386 EEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRA 445
EE + E+ ++AE ++ E + + +E + E E E E+ + + +A
Sbjct: 1215 EEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKA 1274
Query: 446 SSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRS 505
++ K+ K+ + + K+D + K AK A+ + ++ +
Sbjct: 1275 EEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAA 1334
Query: 506 RSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSR 565
+ + + K+ A +D + A+ + + + ++D + K A + ++
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAD 1394
Query: 566 SASSQASNNSAASNRSSNKSGSRSRS 591
A +A + ++ + ++ ++
Sbjct: 1395 EAKKKAEEDKKKADELKKAAAAKKKA 1420
>gnl|CDD|237537 PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional.
Length = 440
Score = 36.4 bits (85), Expect = 0.043
Identities = 29/124 (23%), Positives = 55/124 (44%), Gaps = 20/124 (16%)
Query: 500 AASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA- 558
AA + + A+ +R + +G A S SA + + S A+ +A G ++
Sbjct: 295 AAGGGAAAAGGAAAAARGGAAAAGGASSAYSAGA-----AGGSGAAGVAAGLGGVARAGA 349
Query: 559 -SNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRS 617
+ S R A+S+A A S +SS ++G+RS + + ++A ++ +
Sbjct: 350 SAAASPLRRAASRA----AESMKSSFRAGARSTGGGA---------GGAAAAAAAGAAAA 396
Query: 618 GSPA 621
G PA
Sbjct: 397 GPPA 400
Score = 30.6 bits (70), Expect = 2.4
Identities = 21/105 (20%), Positives = 42/105 (40%), Gaps = 12/105 (11%)
Query: 519 RSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAAS 578
+ G+A + R A+ +A SA S G A+ S + ++ + A
Sbjct: 301 AAAGGAAAAARGGAA--------AAGGASSAYSAG----AAGGSGAAGVAAGLGGVARAG 348
Query: 579 NRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAGS 623
++ R+ SR+ +S+ A +G A + + + AG+
Sbjct: 349 ASAAASPLRRAASRAAESMKSSFRAGARSTGGGAGGAAAAAAAGA 393
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 34.2 bits (79), Expect = 0.044
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGE 404
EE++EE+EEEEE+E E +
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 30.7 bits (70), Expect = 0.54
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQG 403
EE++E+EEEEEE+ E ++
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEEEA 97
Score = 30.7 bits (70), Expect = 0.72
Identities = 12/25 (48%), Positives = 16/25 (64%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQG 403
++EEEE+EEE+EE EE G
Sbjct: 76 AAEEKKEEEEEEEEKEESEEEAAAG 100
Score = 29.5 bits (67), Expect = 1.7
Identities = 13/37 (35%), Positives = 17/37 (45%), Gaps = 7/37 (18%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEG 415
EE+ EEEEEEE +E E+E + G
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKE-------ESEEEAAAG 100
Score = 28.8 bits (65), Expect = 3.1
Identities = 9/29 (31%), Positives = 14/29 (48%)
Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEA 408
EE++EEE EE ++ E+ E
Sbjct: 68 AAAAAAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 28.0 bits (63), Expect = 4.9
Identities = 11/37 (29%), Positives = 16/37 (43%)
Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGG 416
EE++EE EE ++ E +E E G G
Sbjct: 67 AAAAAAAAAAAEEKKEEEEEEEEKEESEEEAAAGLGA 103
>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
Length = 249
Score = 35.8 bits (83), Expect = 0.045
Identities = 14/44 (31%), Positives = 19/44 (43%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN 426
+ EE E++EE E + AE +EGE E EN
Sbjct: 206 DPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNEWEN 249
Score = 33.5 bits (77), Expect = 0.24
Identities = 14/44 (31%), Positives = 19/44 (43%)
Query: 385 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDN 428
+ EE EE+EE AE + A + EG E E +N
Sbjct: 206 DPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNEWEN 249
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 36.4 bits (84), Expect = 0.048
Identities = 16/53 (30%), Positives = 25/53 (47%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSD 434
NE+ E EE E + E + E T+ E +EGE E + E+D ++
Sbjct: 244 NEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTE 296
Score = 32.5 bits (74), Expect = 0.81
Identities = 17/61 (27%), Positives = 28/61 (45%), Gaps = 5/61 (8%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
E ++ EED +EE+ + ++ E +EA R+E EG + E E D + T
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSD-----ESEEDEAEATDGEGEE 274
Query: 439 S 439
Sbjct: 275 G 275
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 36.3 bits (84), Expect = 0.048
Identities = 22/55 (40%), Positives = 26/55 (47%), Gaps = 6/55 (10%)
Query: 365 ATEFRIQRLRE--RFLE----PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG 413
ATE + E FL+ P E EE+EEEEEEE EE + E E G
Sbjct: 375 ATEEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPVPG 429
Score = 33.6 bits (77), Expect = 0.31
Identities = 15/34 (44%), Positives = 18/34 (52%)
Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEA 408
ER+ E EEEEEE+EE E EA + E
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFEV 432
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 36.2 bits (84), Expect = 0.056
Identities = 19/40 (47%), Positives = 25/40 (62%), Gaps = 4/40 (10%)
Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG 413
R ++ P ++EEEE E+EEEEE EE E E E +EG
Sbjct: 338 RCTWVNPEQKDEEEEQEDEEEEEEEE----EPEEPEPEEG 373
Score = 32.0 bits (73), Expect = 1.0
Identities = 18/42 (42%), Positives = 23/42 (54%)
Query: 385 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN 426
E E+EEEEEE E ++ E E E +E E +E G G N
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTGVN 209
Score = 30.0 bits (68), Expect = 4.0
Identities = 20/40 (50%), Positives = 23/40 (57%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGG 422
E EEE+EEEE EA+E +GE E E E E EG G
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTG 207
>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
Length = 2033
Score = 36.4 bits (83), Expect = 0.056
Identities = 57/203 (28%), Positives = 84/203 (41%), Gaps = 29/203 (14%)
Query: 438 SSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSD 497
S+ SS +SS S S S S ++ SRS + S S R A DRS RS R D
Sbjct: 1812 SAGRSSSSSSSSSSSSSSSPSSRPSRSATPSLSPSPSPPRRAPVDRS----RSGRRRERD 1867
Query: 498 KSAASNRSRSGSAASNKSRSDRSRSGSAGSDR------SAASNKSVRSDKSAAS--NRSA 549
+ +A+ A +SR+D S G+A D + + + SA + +RS
Sbjct: 1868 RPSANPFRW---APRQRSRADHSPDGTAPGDAPLNLEDGPGRGRPIWTPSSATTLPSRSG 1924
Query: 550 RSDGSDKSASNRS------------RSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRA 597
D D++ + S SR+ S+ S SN KS +SR R
Sbjct: 1925 PEDSVDETETEDSAPPARLAPSPLETSRAEDSEDSEYPEYSNPRLGKSPPALKSREARRP 1984
Query: 598 RSNRSR--SASKSGSSAASSRSG 618
S + R S+ K+G + S+ S
Sbjct: 1985 SSKQPRRPSSGKNGHTDVSAASA 2007
Score = 31.0 bits (69), Expect = 2.7
Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 6/121 (4%)
Query: 404 ETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSR 463
E + + R+ G R G + ++D D E ++ R R +SR S +RS
Sbjct: 1746 EISPSPRRRDAEGRRFGCRQDDDDG----YDYEGGRESPERVLGRRQSRRDSVPVRRRSG 1801
Query: 464 SGSASSRGSNKSDRSAKSDRSAKSDRSAR-SDRSDKSAASNRSRSGSAASNKSRSDRSRS 522
+ + R + RS+ S S+ S S+ S R +SA + S S S ++ DRSRS
Sbjct: 1802 AANCGGRWMISAGRSSSSSSSSSSSSSSSPSSRPSRSATPSLSPSPSP-PRRAPVDRSRS 1860
Query: 523 G 523
G
Sbjct: 1861 G 1861
Score = 30.6 bits (68), Expect = 3.2
Identities = 24/74 (32%), Positives = 36/74 (48%)
Query: 538 RSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRA 597
RS + R S G S+S+ S S S+SS +S S ++ S + S S R V R+
Sbjct: 1799 RSGAANCGGRWMISAGRSSSSSSSSSSSSSSSPSSRPSRSATPSLSPSPSPPRRAPVDRS 1858
Query: 598 RSNRSRSASKSGSS 611
RS R R + ++
Sbjct: 1859 RSGRRRERDRPSAN 1872
Score = 29.8 bits (66), Expect = 5.9
Identities = 30/96 (31%), Positives = 42/96 (43%), Gaps = 2/96 (2%)
Query: 529 RSAASNKSVRSDKSAASNRSARSDGSDKSASNRSR--SRSASSQASNNSAASNRSSNKSG 586
RS A+N R SA + S+ S S S+S+ S SRSA+ S + + R+
Sbjct: 1799 RSGAANCGGRWMISAGRSSSSSSSSSSSSSSSPSSRPSRSATPSLSPSPSPPRRAPVDRS 1858
Query: 587 SRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAG 622
R R R +N R A + S A S G+ G
Sbjct: 1859 RSGRRRERDRPSANPFRWAPRQRSRADHSPDGTAPG 1894
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 35.9 bits (83), Expect = 0.061
Identities = 18/41 (43%), Positives = 27/41 (65%)
Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
+E E P+EEEEEE EE++EEE + T + E + E ++ E
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69
Score = 29.4 bits (66), Expect = 6.8
Identities = 15/38 (39%), Positives = 22/38 (57%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 420
E E+E +EEEEEE EE ++ E +++E E E
Sbjct: 30 EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEK 67
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 36.3 bits (84), Expect = 0.064
Identities = 40/194 (20%), Positives = 72/194 (37%), Gaps = 11/194 (5%)
Query: 432 TSDAEHSSKNSSRASSRSRSRSKSKSPAK---RSRSGSASSRGSNKSDRSAKSDRSAKSD 488
+SD+ +S+ ++ S S+ +S + ++ + S + RS + D
Sbjct: 567 SSDSVTGGADSTMSAQDSMMASRMRSAVESYAKAHNISNEQATQELASRSTRGSAGGYGD 626
Query: 489 RSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKS--VRSDKSAASN 546
A + G R+ S S G S++S S R D A +
Sbjct: 627 AHAEWG-VGPKILGKGGKLGLGVKGGGRAGIDWSDSDGHSASSSSRSSHDARHDIDAQAT 685
Query: 547 RSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSAS 606
+ R + SD S + S S ++N+A S + S +S + +N +RS
Sbjct: 686 KDFR-EASDYFTSRKV---SESGSHTDNNADSRVDQLSAALNSAKQSYDQYTTNMTRSHE 741
Query: 607 KSG-SSAASSRSGS 619
+ +S S SG
Sbjct: 742 YAEMASRTESMSGQ 755
>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1). This family
consists of several mammalian dentin matrix protein 1
(DMP1) sequences. The dentin matrix acidic
phosphoprotein 1 (DMP1) gene has been mapped to human
chromosome 4q21. DMP1 is a bone and teeth specific
protein initially identified from mineralised dentin.
DMP1 is primarily localised in the nuclear compartment
of undifferentiated osteoblasts. In the nucleus, DMP1
acts as a transcriptional component for activation of
osteoblast-specific genes like osteocalcin. During the
early phase of osteoblast maturation, Ca(2+) surges into
the nucleus from the cytoplasm, triggering the
phosphorylation of DMP1 by a nuclear isoform of casein
kinase II. This phosphorylated DMP1 is then exported out
into the extracellular matrix, where it regulates
nucleation of hydroxyapatite. DMP1 is a unique molecule
that initiates osteoblast differentiation by
transcription in the nucleus and orchestrates
mineralised matrix formation extracellularly, at later
stages of osteoblast maturation. The DMP1 gene has been
found to be ectopically expressed in lung cancer
although the reason for this is unknown.
Length = 514
Score = 35.8 bits (82), Expect = 0.066
Identities = 57/254 (22%), Positives = 109/254 (42%), Gaps = 13/254 (5%)
Query: 368 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER--EGGGEGE 425
R +R R + +E + E+EE+ +++ + ++ E ++ R E G GE
Sbjct: 223 TRSERGNSRMSSAGLKSKESKGEDEEQASTQDSGESQSVEYPSRKFFRKSRISEEDGRGE 282
Query: 426 NDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRS----------GSASSRGSNKS 475
D+ S++N+ + SR SRS+S+ ++ ++S S SS+ ++
Sbjct: 283 LDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQEDSQEVQDPSSESSQEADLP 342
Query: 476 DRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDR-SRSGSAGSDRSAASN 534
+ S+ + +R D D + + + + S +S + D S S S ++ A S
Sbjct: 343 SQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQSTEEQADSE 402
Query: 535 KSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSV 594
+ S S S + S +S S S S++ + + + S + S S+ S
Sbjct: 403 SNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDDSDSQDSSR 462
Query: 595 SRARSNRSRSASKS 608
S+ SN + SAS S
Sbjct: 463 SKEDSNSTESASSS 476
Score = 33.1 bits (75), Expect = 0.50
Identities = 40/211 (18%), Positives = 84/211 (39%), Gaps = 3/211 (1%)
Query: 386 EEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRA 445
EE+ E ++ + ++TE + +++ + D++ SS +
Sbjct: 276 EEDGRGELDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQEDSQEVQDPSSES 335
Query: 446 SSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSAR--SDRSDKSAASN 503
S + S+ S ++ + SRG N + ++ S+ S+ S D S + +
Sbjct: 336 SQEADLPSQENS-SESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQS 394
Query: 504 RSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSR 563
+ SN+S S S + D +++S + ++S ++ +RS S S S
Sbjct: 395 TEEQADSESNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDD 454
Query: 564 SRSASSQASNNSAASNRSSNKSGSRSRSRSV 594
S S S S + S S++ S + ++
Sbjct: 455 SDSQDSSRSKEDSNSTESASSSEEDGQPKNT 485
Score = 30.0 bits (67), Expect = 4.5
Identities = 46/208 (22%), Positives = 79/208 (37%), Gaps = 3/208 (1%)
Query: 337 RVRLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEE 396
+ R+S+ + +NT V + NA + + + +EE + +++ +E ++
Sbjct: 271 KSRISEEDGRGELDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQEDSQEVQD 330
Query: 397 AEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSK 456
E ++ + E E DN T+ HS SS S
Sbjct: 331 PSSESSQEADLPSQENSSESQEEVVSESRGDNPDNTT--SHSEDQEDSESSEEDSLDTPS 388
Query: 457 SPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRS-GSAASNKS 515
S +S A S + S +S S + + S+ + +AS SRS S + S
Sbjct: 389 SSESQSTEEQADSESNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQESQSEQDS 448
Query: 516 RSDRSRSGSAGSDRSAASNKSVRSDKSA 543
RS+ S S S RS + S S S+
Sbjct: 449 RSEEDDSDSQDSSRSKEDSNSTESASSS 476
>gnl|CDD|226365 COG3846, TrbL, Type IV secretory pathway, TrbL components
[Intracellular trafficking and secretion].
Length = 452
Score = 35.9 bits (83), Expect = 0.068
Identities = 30/137 (21%), Positives = 48/137 (35%), Gaps = 24/137 (17%)
Query: 435 AEHSSKNSSRASSRSRSRSKSKSPAKRSRSG-------SASSRGSNKSDRSAKSDRSAKS 487
A +S S+ AS+ + R S S A + +G A + + + S S
Sbjct: 328 ALGTSMASAAASAFASGRKGSGSGAFGTAAGVGDVKSPGAKAAMRTLGRAAGDTGVSVAS 387
Query: 488 DR-----SARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKS 542
SA + KSA + + +A R ++ AG S A
Sbjct: 388 GVGQAPKSAGGSAAGKSAVAKATGVQAAPGWARRMASGQNVFAG--ASTA---------- 435
Query: 543 AASNRSARSDGSDKSAS 559
AA+ R G+D + S
Sbjct: 436 AAAVRGGDGAGNDANPS 452
Score = 34.3 bits (79), Expect = 0.19
Identities = 24/108 (22%), Positives = 36/108 (33%), Gaps = 10/108 (9%)
Query: 522 SGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
+G+ G+ A+ SV + + S S + AS R S S + +
Sbjct: 311 AGARGATAGASLASSVTALGT-----SMASAAASAFASGRKGSGSGAFGTAAGVGDVKSP 365
Query: 582 SNKSGSRSRSR-----SVSRARSNRSRSASKSGSSAASSRSGSPAGSD 624
K+ R+ R VS A S GS+A S G
Sbjct: 366 GAKAAMRTLGRAAGDTGVSVASGVGQAPKSAGGSAAGKSAVAKATGVQ 413
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 35.7 bits (82), Expect = 0.069
Identities = 12/69 (17%), Positives = 20/69 (28%), Gaps = 1/69 (1%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
PN++ + E+ EE + + E E + E E
Sbjct: 157 PSPNQQPSSFLQPSHEDSPEEPEPPTS-EPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQ 215
Query: 439 SKNSSRASS 447
S +A S
Sbjct: 216 SPTPQQAPS 224
Score = 31.1 bits (70), Expect = 1.9
Identities = 14/79 (17%), Positives = 30/79 (37%), Gaps = 3/79 (3%)
Query: 375 ERFLEPPNEEEEEEDE---EEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKR 431
FL+P +E+ EE E E E ++ Q ET + + G + +
Sbjct: 164 SSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAP 223
Query: 432 TSDAEHSSKNSSRASSRSR 450
+ + + + ++ + R
Sbjct: 224 SPNTQQAVEHEDEPTEPER 242
Score = 30.3 bits (68), Expect = 3.0
Identities = 3/51 (5%), Positives = 6/51 (11%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
P + + E+ E
Sbjct: 126 SSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPE 176
Score = 29.6 bits (66), Expect = 5.4
Identities = 8/54 (14%), Positives = 15/54 (27%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRT 432
P+E E + + + E +E E E + + T
Sbjct: 144 PGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSET 197
Score = 29.6 bits (66), Expect = 5.7
Identities = 22/108 (20%), Positives = 30/108 (27%), Gaps = 4/108 (3%)
Query: 339 RLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAE 398
R R++ AA P T R E + E EE E E
Sbjct: 30 REQAGRRRGTAARA----AKPAPPAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKE 85
Query: 399 ETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRAS 446
E QG + + + G E + + S S AS
Sbjct: 86 ERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPAS 133
Score = 29.6 bits (66), Expect = 6.1
Identities = 22/135 (16%), Positives = 41/135 (30%), Gaps = 9/135 (6%)
Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSS 443
E+ E + E AEE++ GE + + G+GG G + S
Sbjct: 63 EQGHRQTESDTETAEESRHGE----KEERGQGGP-----SGSGSESVGSPTPSPSGSAEE 113
Query: 444 RASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASN 503
AS S + SP + S S+ S + + + + +
Sbjct: 114 LASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHED 173
Query: 504 RSRSGSAASNKSRSD 518
+++ D
Sbjct: 174 SPEEPEPPTSEPEPD 188
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 36.1 bits (83), Expect = 0.069
Identities = 23/151 (15%), Positives = 46/151 (30%), Gaps = 13/151 (8%)
Query: 369 RIQRLRER---------FLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
R + LRE+ L D E + E + TQ G EA+ E
Sbjct: 885 RAEELREKRGKLEAHRAMLSGSEANSARRDTMEIQNEQKPTQDGPAAEAQPA----QENT 940
Query: 420 GGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSA 479
E + A+ ++ + + + ++ + + K + A
Sbjct: 941 VVESAEKAVAAVSEAAQQAATAVASGIEKVAEAVRKTVRRSVKKAAATRAAMKKKVAKKA 1000
Query: 480 KSDRSAKSDRSARSDRSDKSAASNRSRSGSA 510
+ ++A +A+ + K A A
Sbjct: 1001 PAKKAAAKKAAAKKAAAKKKVAKKAPAKKVA 1031
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 35.8 bits (83), Expect = 0.082
Identities = 35/236 (14%), Positives = 74/236 (31%), Gaps = 25/236 (10%)
Query: 371 QRLRERFL-EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
+ LR + + +EE+E+ED + EEE+ ++ + + E + + GE + +N
Sbjct: 312 EELRRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENS 371
Query: 430 -----KRTSDAEHSSKNSSRA-------------SSRSRS-RSKSKSPAKRSRSGSASSR 470
K AE K + A S SK R + G +
Sbjct: 372 GLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGE 431
Query: 471 GSNKSDRSAK----SDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAG 526
+S + K + K ++ A + + ++
Sbjct: 432 KEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELD 491
Query: 527 SDRSA-ASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
+ + SV ++ S DK+A+ S++ + + +
Sbjct: 492 EENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLD 547
Score = 33.9 bits (78), Expect = 0.33
Identities = 38/223 (17%), Positives = 71/223 (31%), Gaps = 11/223 (4%)
Query: 342 KRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEE---EDEEEEEEEAE 398
R+K+G + L + +F +QR R E + E EE E E EEE + E
Sbjct: 353 MLRKKLGKLKEGE-DDEENSGLLSMKF-MQRAEARKKEENDAEIEELRRELEGEEESDEE 410
Query: 399 ETQQGETTEAERQE--GEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSK 456
E ++ R++ E GE+E + K + S ++ +
Sbjct: 411 ENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKV 470
Query: 457 SPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSR 516
+ RS A + + + + K+ S + + S A+NK
Sbjct: 471 ANKLLKRSEKAQKE--EEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKIS 528
Query: 517 SDRSRSGSA--GSDRSAASNKSVRSDKSAASNRSARSDGSDKS 557
+ + + + S + D D+
Sbjct: 529 KAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEE 571
>gnl|CDD|152814 pfam12379, DUF3655, Protein of unknown function (DUF3655). This
domain family is found in viruses, and is approximately
70 amino acids in length. The family is found in
association with pfam08716, pfam01661, pfam05409,
pfam06471, pfam08717, pfam06478, pfam09401, pfam06460,
pfam08715, pfam08710.
Length = 70
Score = 32.3 bits (73), Expect = 0.084
Identities = 16/34 (47%), Positives = 22/34 (64%)
Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG 413
PP+EEE+ E+ E EEEE EET + E + +G
Sbjct: 7 PPDEEEDCEEYECEEEEIEETCEHEYGTEDDYKG 40
>gnl|CDD|222011 pfam13257, DUF4048, Domain of unknown function (DUF4048). This
presumed domain is functionally uncharacterized. This
domain family is found in eukaryotes, and is typically
between 228 and 257 amino acids in length.
Length = 242
Score = 34.8 bits (80), Expect = 0.088
Identities = 33/111 (29%), Positives = 52/111 (46%), Gaps = 8/111 (7%)
Query: 421 GGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK 480
G EG N E RT S ++ SR++SRSR R + S + S S++S+G+ S K
Sbjct: 110 GEEGINATESRTVPPPRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTSKGA----TSGK 165
Query: 481 SDRSAKSDRSARS----DRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGS 527
+SA D S S D + + S + S + A N ++ + S+
Sbjct: 166 DSKSADIDVSFWSEFGIDTPGQKSKSPQKASSTPAGNTNQGQSQNAQSSNL 216
Score = 29.0 bits (65), Expect = 6.4
Identities = 23/98 (23%), Positives = 38/98 (38%), Gaps = 8/98 (8%)
Query: 498 KSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSD----- 552
++ RSR + S R + GS S RS+ S+ S + S ++SA D
Sbjct: 120 RTVPPPRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTS-KGATSGKDSKSADIDVSFWS 178
Query: 553 --GSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSR 588
G D + ASS + N+ + +S +
Sbjct: 179 EFGIDTPGQKSKSPQKASSTPAGNTNQGQSQNAQSSNL 216
Score = 29.0 bits (65), Expect = 7.2
Identities = 21/91 (23%), Positives = 35/91 (38%)
Query: 492 RSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARS 551
R S +++RSR + S S SRS ++ S +KS D S S +
Sbjct: 125 PRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTSKGATSGKDSKSADIDVSFWSEFGIDT 184
Query: 552 DGSDKSASNRSRSRSASSQASNNSAASNRSS 582
G + ++ S A + S + S+
Sbjct: 185 PGQKSKSPQKASSTPAGNTNQGQSQNAQSSN 215
>gnl|CDD|237620 PRK14136, recX, recombination regulator RecX; Provisional.
Length = 309
Score = 35.0 bits (80), Expect = 0.089
Identities = 42/198 (21%), Positives = 77/198 (38%), Gaps = 9/198 (4%)
Query: 430 KRTSDAEHSSKNSSRASSRSRSRSKSKSPAKR-SRSGSASSRGSNK-SDRSAKSDRSAKS 487
+R A+ + ++R+ S+ + S G + R + + SD + S A
Sbjct: 3 RRRQGADPQEADHPARAARAGRPHASRETDRTVSGEGRPAGRTATRASDDALVSFEIAAP 62
Query: 488 DRSARSDRS-DKSAASNRSRSGSAASNKSRS-DRSRSGSAGSDRSAASNKSVRSDKSAAS 545
D D S D + R SG + R+ A + SAA D +
Sbjct: 63 DEPFDDDESFDAHDRARRRVSGVGVRDAGAPGGRAADARAANLSSAAKRAEAAGDVYTRT 122
Query: 546 NRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSV---SRARSNRS 602
++ R + ++A S S+ + ++ AA +R+S++ + R++ SR +R+
Sbjct: 123 SQHPRR--TRRAAGPFHSDSSPSASSEDDGAARSRASSRPARSLKGRALGYLSRREYSRA 180
Query: 603 RSASKSGSSAASSRSGSP 620
A K A S S P
Sbjct: 181 ELARKLAPYADESDSVEP 198
Score = 32.3 bits (73), Expect = 0.64
Identities = 26/150 (17%), Positives = 55/150 (36%)
Query: 474 KSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAAS 533
+ + A + R+AR+ R S ++R+ SG + R+ + S AA
Sbjct: 3 RRRQGADPQEADHPARAARAGRPHASRETDRTVSGEGRPAGRTATRASDDALVSFEIAAP 62
Query: 534 NKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRS 593
++ D+S ++ AR S + ++ A + +S ++ +R+
Sbjct: 63 DEPFDDDESFDAHDRARRRVSGVGVRDAGAPGGRAADARAANLSSAAKRAEAAGDVYTRT 122
Query: 594 VSRARSNRSRSASKSGSSAASSRSGSPAGS 623
R R + S+ S+ S +
Sbjct: 123 SQHPRRTRRAAGPFHSDSSPSASSEDDGAA 152
>gnl|CDD|213932 TIGR04319, SerAla_Lrha_rpt, surface protein repeat Ser-Ala-175.
This serine and alanine-rich surface protein repeat,
about 175 amino acids long, occurs up to nine times in
surface proteins of some Lactobacillus strains,
particularly in Lactobacillus rhamnosus. Members
proteins have the N-terminal variant signal sequence
described by TIGR03715 and C-terminal LPXTG signals for
surface attachment by sortase.
Length = 175
Score = 34.1 bits (78), Expect = 0.12
Identities = 44/167 (26%), Positives = 80/167 (47%), Gaps = 5/167 (2%)
Query: 453 SKSKSPAKRSRSGSASSRGSNKSDRS---AKSDRSAKSDRSARSDRSDKSAASNRSRSGS 509
S S A S +AS N+S S S ++ + A S +D SAAS+ + S
Sbjct: 5 SSVASSANAVASSAASRFPDNQSLASLAKTASSANSVTSSYAASASADASAASSLATKVS 64
Query: 510 AASNKSRSDRSR--SGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSA 567
+A+ + S S+ S A + AAS+ + ++ K+A++ S + ++ S + A
Sbjct: 65 SANKAASSAASQANSALAAGNLDAASSYANQASKAASNASSLADKANSAASKALSEALQA 124
Query: 568 SSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAAS 614
SS A+ S+A++ ++ +GS + + S S +AS + A+S
Sbjct: 125 SSAAAIASSAASSAATLAGSLASANDAKSDASAASDAASSASVVASS 171
>gnl|CDD|219627 pfam07897, DUF1675, Protein of unknown function (DUF1675). The
members of this family are sequences derived from
hypothetical plant proteins of unknown function. One
member of this family is annotated as a putative
RNA-binding protein, but no evidence was found to
support this.
Length = 283
Score = 34.6 bits (79), Expect = 0.12
Identities = 33/185 (17%), Positives = 62/185 (33%), Gaps = 9/185 (4%)
Query: 409 ERQEGEGGEREGGGEGENDNE--KRTSD--AEHSSKNSSRASSRSRSRSKSKSPAKRSRS 464
+R G G + +D RT+ E + R +S R ++K KRS
Sbjct: 36 DRSNGGGLDGGDVRTIVSDMFGLMRTTSLPVETEEEWRKRKEMQSLRRLEAKR--KRSEK 93
Query: 465 GSA--SSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRS 522
S+ S +A S + S++ + A++ S G ++S S D
Sbjct: 94 EYNGVSNGDDMDSINAANGGGSGRDKESSKGVETGSPIATSSSSGGPSSSGLSELDVG-V 152
Query: 523 GSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSS 582
S ++ V + + + S +D D + + + S +N
Sbjct: 153 IGGQSGQTKQVRSPVNNILTGTESTSHGTDPVDAETNKSNTTSPTKRSRRPPSVDTNSGG 212
Query: 583 NKSGS 587
S +
Sbjct: 213 EASLN 217
Score = 29.5 bits (66), Expect = 5.0
Identities = 21/125 (16%), Positives = 45/125 (36%), Gaps = 2/125 (1%)
Query: 391 EEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSR 450
EEE + +E Q EA+R+ E + G +D + + S +S
Sbjct: 69 EEEWRKRKEMQSLRRLEAKRKRSE--KEYNGVSNGDDMDSINAANGGGSGRDKESSKGVE 126
Query: 451 SRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSA 510
+ S + + S+ + +S ++ + + + + S+ + A
Sbjct: 127 TGSPIATSSSSGGPSSSGLSELDVGVIGGQSGQTKQVRSPVNNILTGTESTSHGTDPVDA 186
Query: 511 ASNKS 515
+NKS
Sbjct: 187 ETNKS 191
>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
terminal. This domain family is found in eukaryotes,
and is typically between 177 and 207 amino acids in
length. The family is found in association with
pfam00778, pfam02377, pfam00610, pfam00595. The segment
polarity gene dishevelled (dsh) is required for pattern
formation of the embryonic segments. It is involved in
the determination of body organisation through the
Wingless pathway (analogous to the Wnt-1 pathway).
Length = 202
Score = 34.2 bits (78), Expect = 0.14
Identities = 29/101 (28%), Positives = 38/101 (37%), Gaps = 10/101 (9%)
Query: 522 SGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
SGSAGS S S +S+ SNRS ++A R R + S + S
Sbjct: 63 SGSAGSQHSEGS-------RSSGSNRS--DGERSRAADGREGGRKSGGSGSESEHTSRSG 113
Query: 582 SNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAG 622
S +SG R S + S S S +S S G
Sbjct: 114 SRRSGGRRAPSERSGPPPSE-GSVRSSLSHPSSHSSYGAPG 153
Score = 30.7 bits (69), Expect = 1.5
Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 10/93 (10%)
Query: 464 SGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSG 523
+GS S GS +S S +SD + SGS + + SRS RSG
Sbjct: 66 AGSQHSEGS-------RSSGSNRSDGERSRAADGREGGRKSGGSGSESEHTSRSGSRRSG 118
Query: 524 S--AGSDRSAASNKSVRSDKSAASNRSARSDGS 554
A S+RS S S +S+ S+ S+ S
Sbjct: 119 GRRAPSERSGPPP-SEGSVRSSLSHPSSHSSYG 150
Score = 30.7 bits (69), Expect = 1.8
Identities = 20/83 (24%), Positives = 30/83 (36%), Gaps = 7/83 (8%)
Query: 435 AEHSSKNSSRASSRSRSRSKS-----KSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDR 489
++HS + S S+RS + K SGS S S R + R+
Sbjct: 68 SQHSEGSRSSGSNRSDGERSRAADGREGGRKSGGSGSESEHTSRSGSRRSGGRRAPSERS 127
Query: 490 SARSDRSDKSAASNRSRSGSAAS 512
S+ S S+ S S +S
Sbjct: 128 GPPP--SEGSVRSSLSHPSSHSS 148
Score = 29.9 bits (67), Expect = 2.8
Identities = 30/92 (32%), Positives = 43/92 (46%), Gaps = 9/92 (9%)
Query: 499 SAASNRSR-SGSAASNKSRSDRSRS--GSAGSDRSAASNKSVRSDKSAASNRSARSDGSD 555
SA S S S S+ SN+S +RSR+ G G +S S S+ S +R G
Sbjct: 65 SAGSQHSEGSRSSGSNRSDGERSRAADGREGGRKSGGSG----SESEHTSRSGSRRSGGR 120
Query: 556 KSASNRSRSRSASSQASNNSAASNRSSNKSGS 587
++ S RS S+ S S+ S+ SS+ S
Sbjct: 121 RAPS--ERSGPPPSEGSVRSSLSHPSSHSSYG 150
Score = 29.5 bits (66), Expect = 3.5
Identities = 21/81 (25%), Positives = 32/81 (39%), Gaps = 1/81 (1%)
Query: 443 SRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAAS 502
SR+S +RS + A G S ++S+ +++S R A S+RS S
Sbjct: 74 SRSSGSNRSDGERSRAADGREGGRKSGGSGSESEHTSRSGSRRSGGRRAPSERS-GPPPS 132
Query: 503 NRSRSGSAASNKSRSDRSRSG 523
S S + S S G
Sbjct: 133 EGSVRSSLSHPSSHSSYGAPG 153
Score = 29.2 bits (65), Expect = 5.2
Identities = 21/90 (23%), Positives = 35/90 (38%), Gaps = 1/90 (1%)
Query: 437 HSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRS 496
S S+ S SRS ++S +RSR+ G ++S+ +++S S RS
Sbjct: 62 GSGSAGSQHSEGSRSSGSNRSDGERSRAADGREGGRKSGGSGSESEHTSRSG-SRRSGGR 120
Query: 497 DKSAASNRSRSGSAASNKSRSDRSRSGSAG 526
+ + + S S S S G
Sbjct: 121 RAPSERSGPPPSEGSVRSSLSHPSSHSSYG 150
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 34.8 bits (80), Expect = 0.14
Identities = 29/169 (17%), Positives = 61/169 (36%), Gaps = 9/169 (5%)
Query: 445 ASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNR 504
AS+ + + + +A++ + + A A + R D + A
Sbjct: 393 ASAVPAVTAVTGAAGAALAPKAAAAAAA--TRAEAPPAAPAPPATADRGDDAADGDAPVP 450
Query: 505 SRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRS 564
+++ + AS SR D R +D +AS + + AA + R+ +
Sbjct: 451 AKANARASADSRCD-ERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPD 509
Query: 565 RSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAA 613
A + AS A + + +R + + + +A G++AA
Sbjct: 510 ARAPAAASREDAPAAAAPPAPEARPPTPAAAAP------AARAGGAAAA 552
Score = 34.1 bits (78), Expect = 0.25
Identities = 44/213 (20%), Positives = 77/213 (36%), Gaps = 23/213 (10%)
Query: 421 GGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK 480
+ ++ DA +K ++RAS+ SR + P S S SA A
Sbjct: 433 PATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAP----------AS 482
Query: 481 SDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDR-----SRSGSAGSDRSAASNK 535
+ A + +A A + SR D + A AA+
Sbjct: 483 DAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAP 542
Query: 536 SVRSDKSAASNRSARSDG----SDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRS 591
+ R+ +AA+ R+ G SD+ A + ++ A++ A+ A+ R + + +
Sbjct: 543 AARAGGAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRAR 602
Query: 592 RSVSRARSNRSRSASKSGSSAASSRSGSPAGSD 624
+ A N + A + AA SR P D
Sbjct: 603 AATGDAPPNGAARAEQ----AAESRGAPPPWED 631
>gnl|CDD|223046 PHA03328, PHA03328, nuclear egress lamina protein UL31;
Provisional.
Length = 316
Score = 34.3 bits (79), Expect = 0.15
Identities = 18/45 (40%), Positives = 25/45 (55%)
Query: 563 RSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASK 607
R S+S + S +A +R + GSR RSR SR RS+R S +
Sbjct: 7 RRSSSSLRRSRRAARRSRRDGRVGSRGRSRYRSRRRSSRRSSTRR 51
Score = 32.0 bits (73), Expect = 0.76
Identities = 13/63 (20%), Positives = 17/63 (26%)
Query: 433 SDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSAR 492
D +SS SR +R + SR S S RS+ R
Sbjct: 2 DDLALRRSSSSLRRSRRAARRSRRDGRVGSRGRSRYRSRRRSSRRSSTRRAELADTERDR 61
Query: 493 SDR 495
Sbjct: 62 YRA 64
Score = 29.7 bits (67), Expect = 4.6
Identities = 16/52 (30%), Positives = 23/52 (44%)
Query: 565 RSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSR 616
R +SS + A+ RS SR RS R+R SR +S + A +
Sbjct: 7 RRSSSSLRRSRRAARRSRRDGRVGSRGRSRYRSRRRSSRRSSTRRAELADTE 58
Score = 29.7 bits (67), Expect = 5.1
Identities = 16/55 (29%), Positives = 24/55 (43%)
Query: 538 RSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSR 592
S S R+AR D +R RSR S + S+ +++ R+ R R R
Sbjct: 9 SSSSLRRSRRAARRSRRDGRVGSRGRSRYRSRRRSSRRSSTRRAELADTERDRYR 63
>gnl|CDD|223025 PHA03253, PHA03253, UL35; Provisional.
Length = 609
Score = 34.9 bits (80), Expect = 0.15
Identities = 38/140 (27%), Positives = 54/140 (38%), Gaps = 15/140 (10%)
Query: 426 NDNEKRTSDAEHSSKNSS--RASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDR 483
N NE R+S S ++S S RS S ++ ++SR S+ S S+R
Sbjct: 460 NVNEGRSSSRASPSHSTSTIPYSPPQSGRSTPTSILRQRTPIRSNSRSSSVSFSQGDSNR 519
Query: 484 SAKSDRSARSDRS---------DKSAASNRSRS-GSAASNKSRSDRSRSGSAGSDRSA-- 531
S SD + SD S D+ + S S +SN S S SR+ R
Sbjct: 520 SHYSDETNISDYSYPMADLDLEDEEPMEDHPHSPQSISSNNSMSRTSRALQNSQRRRPPT 579
Query: 532 -ASNKSVRSDKSAASNRSAR 550
+ S R +A R R
Sbjct: 580 MFPSSSERQQNNARLRRRTR 599
Score = 31.5 bits (71), Expect = 1.8
Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 3/61 (4%)
Query: 560 NRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARS---NRSRSASKSGSSAASSR 616
N + RS+S + ++S ++ S RS S+ R R+ + SRS+S S S S+R
Sbjct: 460 NVNEGRSSSRASPSHSTSTIPYSPPQSGRSTPTSILRQRTPIRSNSRSSSVSFSQGDSNR 519
Query: 617 S 617
S
Sbjct: 520 S 520
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 35.0 bits (81), Expect = 0.15
Identities = 16/67 (23%), Positives = 35/67 (52%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
EEEE+ ++E+EE+ ++ + E + + + E E + + E+D E+ + E K S
Sbjct: 324 EEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKS 383
Query: 443 SRASSRS 449
+ ++
Sbjct: 384 AESTRSE 390
Score = 30.7 bits (70), Expect = 2.7
Identities = 16/73 (21%), Positives = 34/73 (46%), Gaps = 1/73 (1%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEG-GEREGGGEGENDNEKRTSDAEHSSKN 441
EE+EE+EE+ ++ +E + E E ++ + E E + ++D+E + E +
Sbjct: 319 GEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK 378
Query: 442 SSRASSRSRSRSK 454
+ S+ S
Sbjct: 379 KKKKSAESTRSEL 391
Score = 30.0 bits (68), Expect = 5.3
Identities = 13/42 (30%), Positives = 27/42 (64%)
Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
++ +EEE+EEDE+ ++E+ EE ++ E + +++ E E
Sbjct: 349 VDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
Score = 28.8 bits (65), Expect = 9.6
Identities = 23/95 (24%), Positives = 41/95 (43%), Gaps = 23/95 (24%)
Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEE--------------TQQGETTEAERQEGE 414
R++R+R EEE++E+EE+ +E A++ QGE E E ++G
Sbjct: 278 RLRRMRG-------EEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGV 330
Query: 415 GGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRS 449
E + ++D E+ D + S + S
Sbjct: 331 DD--EDEEDDDDDLEEEEEDVDLSDEEEDEEDEDS 363
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 34.5 bits (80), Expect = 0.16
Identities = 14/21 (66%), Positives = 15/21 (71%)
Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
EEEEEE+EEEEEEE E
Sbjct: 298 AAAAEEEEEEEEEEEEEEPSE 318
Score = 32.9 bits (76), Expect = 0.41
Identities = 14/20 (70%), Positives = 16/20 (80%)
Query: 380 PPNEEEEEEDEEEEEEEAEE 399
EEEEEE+EEEEEE +EE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEE 319
Score = 32.9 bits (76), Expect = 0.43
Identities = 13/19 (68%), Positives = 14/19 (73%)
Query: 379 EPPNEEEEEEDEEEEEEEA 397
E EEEEEE+EE EEEA
Sbjct: 303 EEEEEEEEEEEEEPSEEEA 321
Score = 32.1 bits (74), Expect = 0.75
Identities = 13/21 (61%), Positives = 14/21 (66%)
Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
EEEEEE+EEEEE EE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEE 320
Score = 31.0 bits (71), Expect = 1.7
Identities = 12/20 (60%), Positives = 13/20 (65%)
Query: 379 EPPNEEEEEEDEEEEEEEAE 398
E EEEEEE+E EEE A
Sbjct: 304 EEEEEEEEEEEEPSEEEAAA 323
Score = 29.8 bits (68), Expect = 3.7
Identities = 12/20 (60%), Positives = 13/20 (65%)
Query: 380 PPNEEEEEEDEEEEEEEAEE 399
EEE+EEEEEEE EE
Sbjct: 296 AQAAAAEEEEEEEEEEEEEE 315
Score = 29.8 bits (68), Expect = 4.6
Identities = 12/21 (57%), Positives = 14/21 (66%)
Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
+ EEEEE+EEEEEEE
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPS 317
Score = 29.4 bits (67), Expect = 6.1
Identities = 12/21 (57%), Positives = 13/21 (61%)
Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
EEEE+EEEEEEE E
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEP 316
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 34.8 bits (80), Expect = 0.16
Identities = 21/77 (27%), Positives = 33/77 (42%), Gaps = 6/77 (7%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKN- 441
EEE+ + EEE+ EE + + + + E E E D+E +AE +K
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDE----EAESPAKAE 56
Query: 442 -SSRASSRSRSRSKSKS 457
S R +R + K K
Sbjct: 57 ISKREKARLKELKKQKK 73
Score = 33.6 bits (77), Expect = 0.41
Identities = 16/55 (29%), Positives = 21/55 (38%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
+ EEEDEEE E A + E E + E E E+ + S E
Sbjct: 7 TQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKRE 61
Score = 32.5 bits (74), Expect = 0.76
Identities = 13/53 (24%), Positives = 21/53 (39%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKR 431
E N + EE++EEE E + ++ + E + E E E K
Sbjct: 3 EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKA 55
Score = 31.7 bits (72), Expect = 1.4
Identities = 16/76 (21%), Positives = 32/76 (42%), Gaps = 3/76 (3%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
E + E+E+EEE EA G ++ + E E E +++ + + AE S
Sbjct: 2 EEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAED---EDEDEEDDEEAESPAKAEIS 58
Query: 439 SKNSSRASSRSRSRSK 454
+ +R + + +
Sbjct: 59 KREKARLKELKKQKKQ 74
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 34.6 bits (80), Expect = 0.18
Identities = 14/57 (24%), Positives = 21/57 (36%), Gaps = 3/57 (5%)
Query: 354 TRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAER 410
RLV+ R AT PP E+D++ E+E E+ + E
Sbjct: 247 ARLVLAPR---ATRLPAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300
Score = 30.0 bits (68), Expect = 4.8
Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 1/29 (3%)
Query: 595 SRARSNRSRSASKSGSSAASSRSGSPAGS 623
AR+ R+RSA +G++ +R G P GS
Sbjct: 320 GGARA-RARSAGGAGAAQKGNRRGRPLGS 347
>gnl|CDD|219594 pfam07816, DUF1645, Protein of unknown function (DUF1645). These
sequences are derived from a number of hypothetical
plant proteins. The region in question is approximately
270 amino acids long. Some members of this family are
annotated as yeast pheromone receptor proteins AR781 but
no literature was found to support this.
Length = 191
Score = 33.3 bits (76), Expect = 0.20
Identities = 40/167 (23%), Positives = 65/167 (38%), Gaps = 8/167 (4%)
Query: 451 SRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSA 510
S + S SP R R GS S + R ++ RS +R SD + G
Sbjct: 21 SVASSPSPRDRGRVGSPRSESAFAPARLRRALRSLSPER--GGGSSDSESTDEGELEGVP 78
Query: 511 ASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQ 570
S+ S S S S + +S + SD RSA SDG D + ++
Sbjct: 79 PSSYCVSSSPASSSRKSSSTGSSKRWRLSD--LLLFRSA-SDGKDAFVFDAAKDPLLKYS 135
Query: 571 ASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRS 617
++ A+ + + ++SR S S+ + AS A++R+
Sbjct: 136 PLSSPASPVK---PASAKSRESSASKGKRRGKTVASAHELLYATNRA 179
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 32.7 bits (75), Expect = 0.22
Identities = 9/24 (37%), Positives = 22/24 (91%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGET 405
+EE+EEE+++EE+++ +E+++ E+
Sbjct: 117 DEEDEEEEDDEEDDDEDESEEEES 140
Score = 31.9 bits (73), Expect = 0.39
Identities = 9/25 (36%), Positives = 20/25 (80%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETT 406
++EE+EE+E++EE++ E+ + E +
Sbjct: 116 DDEEDEEEEDDEEDDDEDESEEEES 140
Score = 30.8 bits (70), Expect = 0.90
Identities = 10/20 (50%), Positives = 16/20 (80%)
Query: 383 EEEEEEDEEEEEEEAEETQQ 402
EE++EED++E+E E EE+
Sbjct: 123 EEDDEEDDDEDESEEEESPV 142
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family. This model
represents a subfamily of RNA splicing factors including
the Pad-1 protein (N. crassa), CAPER (M. musculus) and
CC1.3 (H.sapiens). These proteins are characterized by
an N-terminal arginine-rich, low complexity domain
followed by three (or in the case of 4 H. sapiens
paralogs, two) RNA recognition domains (rrm: pfam00706).
These splicing factors are closely related to the U2AF
splicing factor family (TIGR01642). A homologous gene
from Plasmodium falciparum was identified in the course
of the analysis of that genome at TIGR and was included
in the seed.
Length = 457
Score = 34.1 bits (78), Expect = 0.25
Identities = 19/73 (26%), Positives = 28/73 (38%)
Query: 424 GENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDR 483
D E+ + + R SR RSRS+ +S +R R RG ++S + R
Sbjct: 1 RYRDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYYR 60
Query: 484 SAKSDRSARSDRS 496
R DR
Sbjct: 61 PRGDRSYRRDDRR 73
Score = 34.1 bits (78), Expect = 0.25
Identities = 31/131 (23%), Positives = 45/131 (34%), Gaps = 8/131 (6%)
Query: 480 KSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSR-SGSAGSDRSAASNKSVR 538
DR R+ + RSDK +R RS S ++ R DR G G RS + N+ R
Sbjct: 2 YRDRERGRLRN-DTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYYR 60
Query: 539 SDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSN------KSGSRSRSR 592
+ R R G + R + A R + K G +
Sbjct: 61 PRGDRSYRRDDRRSGRNTKEPLTEAERDDRTVFVLQLALKARERDLYEFFSKVGKVRDVQ 120
Query: 593 SVSRARSNRSR 603
+ S RS+
Sbjct: 121 CIKDRNSRRSK 131
Score = 33.7 bits (77), Expect = 0.33
Identities = 14/59 (23%), Positives = 18/59 (30%)
Query: 553 GSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSS 611
R R+ + S + S R R R R R RSRS S +
Sbjct: 1 RYRDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYY 59
Score = 32.6 bits (74), Expect = 0.76
Identities = 22/60 (36%), Positives = 23/60 (38%), Gaps = 3/60 (5%)
Query: 563 RSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAG 622
R R N R S+K RSR RS SR RS R R SRS SP
Sbjct: 1 RYRDRERGRLRND---TRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNR 57
Score = 32.2 bits (73), Expect = 0.86
Identities = 20/77 (25%), Positives = 27/77 (35%), Gaps = 3/77 (3%)
Query: 444 RASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK-SDRSAKSDRSARSDRSDKSAAS 502
R R R R ++ + + + S R S DRS + DR R RS RS
Sbjct: 1 RYRDRERGRLRNDTR-RSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRS-RSRSPNRY 58
Query: 503 NRSRSGSAASNKSRSDR 519
R R + R
Sbjct: 59 YRPRGDRSYRRDDRRSG 75
Score = 30.2 bits (68), Expect = 3.3
Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 1/60 (1%)
Query: 561 RSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSP 620
R R R + N++ S++ +S RSRSR SR R +R + G S + S +
Sbjct: 1 RYRDRE-RGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYY 59
Score = 29.9 bits (67), Expect = 4.7
Identities = 19/80 (23%), Positives = 29/80 (36%), Gaps = 5/80 (6%)
Query: 443 SRASSRSRSRSKS-KSPAKRSRSGSASS---RGSNKSDRSAKSDRSAKSDRSARSDRSDK 498
R R R R+ + +S R RS S R + DR R +S RS +R +
Sbjct: 2 YRDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRS-RSRSPNRYYR 60
Query: 499 SAASNRSRSGSAASNKSRSD 518
R S ++ +
Sbjct: 61 PRGDRSYRRDDRRSGRNTKE 80
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 32.8 bits (75), Expect = 0.29
Identities = 22/48 (45%), Positives = 31/48 (64%)
Query: 367 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
E R++ L ++ E EEEEE+ EEE EEE EE Q+G E E ++G+
Sbjct: 68 EARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGD 115
Score = 28.2 bits (63), Expect = 8.7
Identities = 16/37 (43%), Positives = 22/37 (59%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
E++EE+E EEEE + ++ E E E Q G GE E
Sbjct: 74 LEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEE 110
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 31.7 bits (72), Expect = 0.30
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
E E+EEE+ +EEE+EE EE
Sbjct: 76 EKAEEKEEEKKKEEEKEEEEE 96
Score = 30.5 bits (69), Expect = 0.76
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQG 403
+ +EEE++ EEE+EEE EE G
Sbjct: 77 KAEEKEEEKKKEEEKEEEEEEALAG 101
Score = 30.2 bits (68), Expect = 1.1
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEA 408
EE+ +E+EEE++ EE ++ E EA
Sbjct: 70 AAAAAEEKAEEKEEEKKKEEEKEEEEEEA 98
Score = 29.4 bits (66), Expect = 2.0
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGE 404
E EE+EEE ++EEE+E EE +
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEEEEALA 100
Score = 28.6 bits (64), Expect = 2.9
Identities = 12/37 (32%), Positives = 17/37 (45%)
Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGG 416
EE EE+EEE ++ ++ E E E G G
Sbjct: 68 AAAAAAAEEKAEEKEEEKKKEEEKEEEEEEALAGLGA 104
Score = 27.8 bits (62), Expect = 5.5
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTE 407
E+ EE++EE+++EE +E ++ E
Sbjct: 72 AAAEEKAEEKEEEKKKEEEKEEEEEEALA 100
>gnl|CDD|236394 PRK09169, PRK09169, hypothetical protein; Validated.
Length = 2316
Score = 33.9 bits (78), Expect = 0.31
Identities = 18/92 (19%), Positives = 32/92 (34%), Gaps = 3/92 (3%)
Query: 480 KSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRS 539
K R A + R R + + R+ A S + R R AG+DR S +
Sbjct: 9 KRRRDAAAPADPRPRRRPRLGDAPAPRTARADSGATPRGRPR---AGADREPTSEQLRDY 65
Query: 540 DKSAASNRSARSDGSDKSASNRSRSRSASSQA 571
++ + + D + R ++A
Sbjct: 66 ERWLDRAAAGQLDAQREQQCARLWFLVQQARA 97
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 33.8 bits (77), Expect = 0.33
Identities = 17/66 (25%), Positives = 33/66 (50%), Gaps = 3/66 (4%)
Query: 382 NEEEEEED--EEEEEEEAEETQQGETTEAERQEGEGGEREG-GGEGENDNEKRTSDAEHS 438
++E+ ED EE +E E T+ G ++ E ++GE E + + D + + AE
Sbjct: 4058 DDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAEAD 4117
Query: 439 SKNSSR 444
+N+ +
Sbjct: 4118 EENTDK 4123
Score = 33.4 bits (76), Expect = 0.50
Identities = 25/130 (19%), Positives = 48/130 (36%), Gaps = 4/130 (3%)
Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSD 434
+ E NEE+ + +E EA E + + E E D ++ +D
Sbjct: 3835 DDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVND 3894
Query: 435 AEHSSKNS-SRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARS 493
NS + + ++S SA+ +N+SD +K D + + R
Sbjct: 3895 IPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAA---NNESDLVSKEDDNKALEDKDRQ 3951
Query: 494 DRSDKSAASN 503
++ D+ S+
Sbjct: 3952 EKEDEEEMSD 3961
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 31.2 bits (71), Expect = 0.37
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGE 404
EEEEEE+EEEEEEE E +
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 30.0 bits (68), Expect = 0.96
Identities = 14/21 (66%), Positives = 15/21 (71%)
Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
EEEEEE+EEEEEEE E
Sbjct: 72 AAAAEEEEEEEEEEEEEEEES 92
Score = 29.6 bits (67), Expect = 1.3
Identities = 12/29 (41%), Positives = 15/29 (51%)
Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEA 408
E+EEEEEEE EE ++ EA
Sbjct: 68 AAAAAAAAEEEEEEEEEEEEEEEESEEEA 96
Score = 29.6 bits (67), Expect = 1.4
Identities = 15/25 (60%), Positives = 16/25 (64%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQG 403
EEEEEE+EEEEEE EE G
Sbjct: 75 AEEEEEEEEEEEEEEEESEEEAMAG 99
Score = 27.3 bits (61), Expect = 8.1
Identities = 11/29 (37%), Positives = 15/29 (51%)
Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEA 408
+EEEEEEE EE ++ E+ E
Sbjct: 67 AAAAAAAAAEEEEEEEEEEEEEEEESEEE 95
>gnl|CDD|149635 pfam08646, Rep_fac-A_C, Replication factor-A C terminal domain.
This domain is found at the C terminal of replication
factor A. Replication factor A (RPA) binds
single-stranded DNA and is involved in replication,
repair, and recombination of DNA.
Length = 146
Score = 32.2 bits (74), Expect = 0.37
Identities = 23/88 (26%), Positives = 40/88 (45%), Gaps = 21/88 (23%)
Query: 258 VMDESGEQFVAYFLPTAETLDKRKLDQLSNLEYTDEQEYEY----KMAREYNWNVKSKSC 313
+ D +G+ +V F AE + ++L L+ D+ E+E + REY + ++ K
Sbjct: 60 ISDHTGQIWVTLFNEAAEEILGMSAEELMELKEEDDNEFEKIFQKALFREYVFRLRVK-- 117
Query: 314 KGYEENYFLIVRDDAVYYNELETRVRLS 341
EE YN+ E+RVR +
Sbjct: 118 ---EET-----------YND-ESRVRYT 130
>gnl|CDD|140324 PTZ00303, PTZ00303, phosphatidylinositol kinase; Provisional.
Length = 1374
Score = 33.5 bits (76), Expect = 0.39
Identities = 38/168 (22%), Positives = 65/168 (38%), Gaps = 7/168 (4%)
Query: 460 KRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDR 519
+R R A+ RG+N S + S R A + + S + + SGS A + S
Sbjct: 96 ERLRQAYAARRGANTSGGTVGSGRCGVHGDPAAAAATAGSCGAGGAVSGSGAIHSGVSGI 155
Query: 520 SRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSAS------NRSRSRSASSQASN 573
S A + +++ S SD + +S + SA + AS
Sbjct: 156 PSMCCLNSCSPAGGPVPRKLEETLRETLSENSDFFFRVSSVAYHLDPNPQGASACTAASA 215
Query: 574 NSAASNRSSNKSGSRSRSRSVSRARSNRSRSASK-SGSSAASSRSGSP 620
++ + + G+RS R A + AS + + AA+S +G P
Sbjct: 216 SAGSGVMGNAGVGARSPRRGEGLASAAAGGVASLLTNTIAAASATGPP 263
>gnl|CDD|151621 pfam11179, DUF2967, Protein of unknown function (DUF2967). This
family of proteins with unknown function appears to be
restricted to Drosophila.
Length = 284
Score = 33.0 bits (74), Expect = 0.39
Identities = 35/143 (24%), Positives = 64/143 (44%), Gaps = 1/143 (0%)
Query: 480 KSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRS 539
+S +SA+S ARS S K+ + ++ AS K + G G S V +
Sbjct: 20 QSPKSARSSEKARSHFSFKNYGNEDAKDAGRASGKGTGSGAGVGGGGGSGSGVGGNGVMA 79
Query: 540 DKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSR-SRSVSRAR 598
+A A+ + + + + + R+R A N +A +N + + SR+ S SVS +
Sbjct: 80 AGAADEAADAQEEDAIANRNKKLRARERDPMAGNANANANANGGSNSSRNHISPSVSPSH 139
Query: 599 SNRSRSASKSGSSAASSRSGSPA 621
SNR+ + + + +G+ A
Sbjct: 140 SNRASPKREKKRTTPIASTGAIA 162
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 33.4 bits (77), Expect = 0.41
Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 1/52 (1%)
Query: 367 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 418
E R R LE ++++EE DEE+EE E + + + + + ER
Sbjct: 113 EPRYDDA-YRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRER 163
Score = 30.3 bits (69), Expect = 3.1
Identities = 9/52 (17%), Positives = 28/52 (53%)
Query: 368 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
R + + + +++++E+ +EE+EE+ +++ E + + + + RE
Sbjct: 111 QDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRE 162
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 33.3 bits (76), Expect = 0.44
Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 6/82 (7%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKN 441
+ E++ +DEEEE+ ++E+ Q + + + E G E + N + S
Sbjct: 186 SAEDDSDDEEEEDSDSEDYSQ-----YDGMLVDSSDEEEGEEAPSINYNEDTSESESD-E 239
Query: 442 SSRASSRSRSRSKSKSPAKRSR 463
S S SRS S S+ + S+
Sbjct: 240 SDSEISESRSVSDSEESSPPSK 261
>gnl|CDD|178593 PLN03021, PLN03021, Low-temperature-induced protein; Provisional.
Length = 619
Score = 33.4 bits (75), Expect = 0.45
Identities = 54/244 (22%), Positives = 88/244 (36%), Gaps = 25/244 (10%)
Query: 355 RLVVTHRPLNATEFRIQ-RLRERFLEPPNEEEEEEDEEEE---------EEEAEETQQGE 404
R TH PL +EF + RE EP N E+ E++ Q+
Sbjct: 162 REAPTHYPLGVSEFSDRGESREAHQEPLNTPVSLLSATEDVTRTFAPGGEDDYLGGQRKV 221
Query: 405 TTEAERQEGEGGEREGGGE----GENDNEKRTSDAEHSSKNSSR---ASSRSRSRSKSKS 457
E ++ E GGG G ++ + + +D H + A S R + +S
Sbjct: 222 NVETPKRLEEDPAAPGGGSDYLSGVSNYQSKVTDPTHKGGEAGVPEIAESLGRMKVTDES 281
Query: 458 PAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRS 517
P ++SR G R + RS + D +SD + S + S++G +R
Sbjct: 282 PDQKSRQG----REEDFPTRSHEFDLKKESDINKNS----PARFGGESKAGMEEDFPTRG 333
Query: 518 DRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAA 577
D G D ++ + S R + D+S S AS SA
Sbjct: 334 DVKVESGLGRDLPTGTHDQFSPELSRPKERDDSEETKDESTHETKPSTYTEQLASATSAI 393
Query: 578 SNRS 581
+N++
Sbjct: 394 TNKA 397
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 33.4 bits (76), Expect = 0.46
Identities = 18/67 (26%), Positives = 31/67 (46%), Gaps = 4/67 (5%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKN 441
NE+EE+ + E E +E Q+ E T+ E + GE E + + D+ SD + +
Sbjct: 223 NEQEEQGEGEGEGQEGSAPQESEATDRESESGE----EEMVQSDQDDLPDESDDDSETPG 278
Query: 442 SSRASSR 448
+R
Sbjct: 279 EGARPAR 285
Score = 31.1 bits (70), Expect = 2.3
Identities = 22/82 (26%), Positives = 37/82 (45%), Gaps = 3/82 (3%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN---DNEKRTSDAEHSS 439
EE ++ E E+EE+ ++ Q E + E+ EGEG +EG E+ D E + + E
Sbjct: 201 EEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQ 260
Query: 440 KNSSRASSRSRSRSKSKSPAKR 461
+ S S++ R
Sbjct: 261 SDQDDLPDESDDDSETPGEGAR 282
Score = 30.7 bits (69), Expect = 3.1
Identities = 12/54 (22%), Positives = 24/54 (44%)
Query: 389 DEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
+ EE + E++ E + ++ E +G GEGE ++E + + S
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRES 251
Score = 29.5 bits (66), Expect = 6.1
Identities = 20/65 (30%), Positives = 27/65 (41%), Gaps = 1/65 (1%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
E EE ++ E E+ E+ + TE E QE +G G EG E +D E S
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPTENE-QEEQGEGEGEGQEGSAPQESEATDRESESGEE 256
Query: 443 SRASS 447
S
Sbjct: 257 EMVQS 261
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 33.2 bits (76), Expect = 0.46
Identities = 17/54 (31%), Positives = 21/54 (38%), Gaps = 1/54 (1%)
Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG-EGGEREGGGEGENDNEK 430
L+PP E DE E E E T E+ +E E E E +N K
Sbjct: 195 LDPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAPEEENNK 248
Score = 31.2 bits (71), Expect = 1.5
Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 2/50 (4%)
Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQ--GETTEAERQEGEGG 416
RI++LR + E P EE E+E +E++ + +Q G +A+ G GG
Sbjct: 122 RIRKLRRKAGEEPEEELPLEEEATKEDDLADPRQVLGVYYKAKDPPGAGG 171
>gnl|CDD|236722 PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional.
Length = 456
Score = 32.9 bits (75), Expect = 0.50
Identities = 19/69 (27%), Positives = 29/69 (42%), Gaps = 5/69 (7%)
Query: 397 AEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSK 456
AE Q G +Q G GG +GGG G+ + R + S ++ A SR +K
Sbjct: 384 AEPIQNGR-----QQRGGGGRGQGGGRGQQQGQPRRGEGGAKSASAKPAEKPSRRLGDAK 438
Query: 457 SPAKRSRSG 465
++ R
Sbjct: 439 PAGEQQRRR 447
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 33.1 bits (76), Expect = 0.51
Identities = 14/29 (48%), Positives = 15/29 (51%)
Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQE 412
EEE+EEEEEEE EE E E
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPAPE 431
Score = 32.7 bits (75), Expect = 0.52
Identities = 13/18 (72%), Positives = 14/18 (77%)
Query: 383 EEEEEEDEEEEEEEAEET 400
EEEE+EEEEEEE EE
Sbjct: 403 AAEEEEEEEEEEEEEEEP 420
Score = 32.7 bits (75), Expect = 0.65
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEA 408
EEEEE+EEEEEEE E + A
Sbjct: 404 AEEEEEEEEEEEEEEEPVAEVMMMPA 429
Score = 31.6 bits (72), Expect = 1.3
Identities = 15/36 (41%), Positives = 18/36 (50%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 418
EEEEEE+EEEEEEE + GG +
Sbjct: 406 EEEEEEEEEEEEEEPVAEVMMMPAPEMQLPASGGIK 441
Score = 30.8 bits (70), Expect = 2.4
Identities = 15/28 (53%), Positives = 17/28 (60%)
Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQ 402
ER+ EEEEEE+EEEEE AE
Sbjct: 400 ERWAAEEEEEEEEEEEEEEEPVAEVMMM 427
>gnl|CDD|220710 pfam10351, Apt1, Golgi-body localisation protein domain. This is
the C-terminus of a family of proteins conserved from
plants to humans. The plant members are localised to the
Golgi proteins and appear to regulate membrane
trafficking, as they are required for rapid vesicle
accumulation at the tip of the pollen tube. The
C-terminus probably contains the Golgi localisation
signal and it is well-conserved.
Length = 451
Score = 33.0 bits (76), Expect = 0.52
Identities = 20/79 (25%), Positives = 34/79 (43%), Gaps = 1/79 (1%)
Query: 417 EREGGGEGENDNEKRTSDAEHSSKNSSRASSRSR-SRSKSKSPAKRSRSGSASSRGSNKS 475
R+ E+ + + + S +SS S RS + S S + S S S+ S+KS
Sbjct: 295 GRDSSLSEEDSDSSKREEDSDISSSSSSGSRRSSSTSRSSSSSSSLLSSSSILSKSSDKS 354
Query: 476 DRSAKSDRSAKSDRSARSD 494
S + +KS++ D
Sbjct: 355 KDKRFSLKLSKSEKEESDD 373
Score = 32.3 bits (74), Expect = 0.81
Identities = 29/92 (31%), Positives = 39/92 (42%), Gaps = 6/92 (6%)
Query: 480 KSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRS 539
D S + S S R + S S+ S SGS S+ + S S S S++S S S
Sbjct: 295 GRDSSLSEEDSDSSKREEDSDISSSSSSGSRRSSSTSRSSSSSSSL---LSSSSILSKSS 351
Query: 540 DKSAASNRSARSDGSDKSAS---NRSRSRSAS 568
DKS S + S+K S SRS+
Sbjct: 352 DKSKDKRFSLKLSKSEKEESDDLEEMISRSSK 383
>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
translational repressor [Translation, ribosomal
structure and biogenesis].
Length = 777
Score = 33.2 bits (76), Expect = 0.54
Identities = 40/222 (18%), Positives = 77/222 (34%), Gaps = 23/222 (10%)
Query: 425 ENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK---- 480
+ S +S S+ SR++S S + + + ++S S+ + S
Sbjct: 89 SSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNANSSVTLSSSTASSMFNSNK 148
Query: 481 -----SDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNK 535
+ S + + + AS+ S+ + S S + +
Sbjct: 149 LPLPNPNHSNSATTNQSGSSFINTPASSSSQPLTNLVVSSIKRFPYLTSLSPFFNYLIDP 208
Query: 536 SVRSDKSAASNRSARSDGSDKSASN----RSRSRSASSQASNNSAASNRSSNKSGSRSRS 591
S S ++A + + + S +N S +Q+ N+ N SS+ + S
Sbjct: 209 SSDSATASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQSVENNIILNSSSSINELTSIY 268
Query: 592 RSVSRARSNR----------SRSASKSGSSAASSRSGSPAGS 623
SV R+ R + S+S SA + + SP GS
Sbjct: 269 GSVPSIRNLRGLNSALVSFLNVSSSSLAFSALNGKEVSPTGS 310
Score = 32.8 bits (75), Expect = 0.59
Identities = 37/169 (21%), Positives = 63/169 (37%), Gaps = 8/169 (4%)
Query: 436 EHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDR 495
H SK S +S+ S+ S + + S + + S ++
Sbjct: 17 LHHSKKSPPSSTTSQELMNGNSTPNSFSPIPSKASSSATFTLNLPINNSVNHKITS---- 72
Query: 496 SDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSD 555
S++S R SGS + S S S++ N S S + S S+ G+
Sbjct: 73 ---SSSSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNA 129
Query: 556 KSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRS 604
S+ S S ++S SN N + + S + ++S S S + S S
Sbjct: 130 NSSVTLSSSTASSMFNSNKLPLPNPNHSNSATTNQSGS-SFINTPASSS 177
>gnl|CDD|235562 PRK05687, fliH, flagellar assembly protein H; Validated.
Length = 246
Score = 32.2 bits (74), Expect = 0.55
Identities = 12/47 (25%), Positives = 14/47 (29%)
Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEG 424
PP + E EE EE T E + EG G
Sbjct: 30 PPPPEPAAPPPEPPEPEEVPEEEAAPLTEEELEAIRQQAHEEGFAAG 76
Score = 29.9 bits (68), Expect = 3.5
Identities = 17/51 (33%), Positives = 21/51 (41%), Gaps = 1/51 (1%)
Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG-EGGEREGGGEG 424
E PP E EE EEE E + + +EG G+ EG EG
Sbjct: 34 EPAAPPPEPPEPEEVPEEEAAPLTEEELEAIRQQAHEEGFAAGKAEGRAEG 84
>gnl|CDD|235307 PRK04537, PRK04537, ATP-dependent RNA helicase RhlB; Provisional.
Length = 572
Score = 33.0 bits (75), Expect = 0.56
Identities = 27/164 (16%), Positives = 49/164 (29%), Gaps = 14/164 (8%)
Query: 458 PAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRS 517
P + G + + D R A+ R+A R + S S +
Sbjct: 396 PPRVPVEGEEADDEAG--DSVGTIFREAREQRAAEEQRRGGGRSGPGGGSRSGSVGGG-- 451
Query: 518 DRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAA 577
R AG+D + R + A + +A + A +A+ +A
Sbjct: 452 --GRRDGAGADGKPRPRRKPRVEGEADA--AAAGAETPVVA------AAAAQAPGVVAAD 501
Query: 578 SNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPA 621
R+ K R R V A + + + + +P
Sbjct: 502 GERAPRKRRRRRNGRPVEGAEPVSTPVPAPAAPRKPTQVVATPV 545
Score = 31.5 bits (71), Expect = 1.7
Identities = 23/131 (17%), Positives = 48/131 (36%), Gaps = 3/131 (2%)
Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSS 443
E EE D+E + ++ A ++ GG R G G G +
Sbjct: 402 EGEEADDEAGDSVGTIFREAREQRAAEEQRRGGGRSGPGGGSRSGSVGGGGRRDGAGADG 461
Query: 444 RASSRSRSRSKSKSPAKRSRSGS---ASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSA 500
+ R + R + ++ A + + + A++ +A +R+ + R R+ R + A
Sbjct: 462 KPRPRRKPRVEGEADAAAAGAETPVVAAAAAQAPGVVAADGERAPRKRRRRRNGRPVEGA 521
Query: 501 ASNRSRSGSAA 511
+ + A
Sbjct: 522 EPVSTPVPAPA 532
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 32.6 bits (74), Expect = 0.62
Identities = 10/35 (28%), Positives = 17/35 (48%)
Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 418
+EE ++E E +E+ + E E Q + ER
Sbjct: 348 TDEESEDEVEIDESVIEEVAEMELLEVQVDDLAER 382
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 32.5 bits (74), Expect = 0.64
Identities = 29/187 (15%), Positives = 57/187 (30%), Gaps = 15/187 (8%)
Query: 441 NSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSA 500
S S + + + + G+ R ++ + S KS R R ++ +
Sbjct: 232 KPSMKISPLKKKKTGTLKSSKPEPGTPLKRQTSPASSSQKSRRR----SLQRVLTDERKS 287
Query: 501 ASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASN 560
+S R+ S +SR++ S + S S+ S S+ ++ R S+
Sbjct: 288 SSRRTPSL----LRSRTNSSLIEFLKRESSENLLPSLSSRTSSDLLKNKRLQKRQVDLSD 343
Query: 561 RSRSRSASSQASNNSAASNRS-------SNKSGSRSRSRSVSRARSNRSRSASKSGSSAA 613
SR + + KS S+ + RS + A
Sbjct: 344 SSRQHEEKLKKKQMLNEQKKELKRAISALKKSNRELSSKDIVETAEKRSSQFGQGVQVTA 403
Query: 614 SSRSGSP 620
+
Sbjct: 404 TPAGNRK 410
Score = 32.5 bits (74), Expect = 0.74
Identities = 31/172 (18%), Positives = 54/172 (31%), Gaps = 9/172 (5%)
Query: 430 KRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDR 489
KR + SS+ S R S + + KS ++R+ S S S+ + +
Sbjct: 260 KRQTSPASSSQKSRRRSLQRVLTDERKSSSRRTPSLLRSRTNSSLIEFLKRESSENLLPS 319
Query: 490 SARSDRSD---KSAASNRSRSGSA-ASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAAS 545
+ SD R S + + + + S S
Sbjct: 320 LSSRTSSDLLKNKRLQKRQVDLSDSSRQHEEKLKKKQMLNEQKKELKRAISALK----KS 375
Query: 546 NRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRA 597
NR S ++A RS Q + + A NR + + + S S +
Sbjct: 376 NRELSSKDIVETAEKRSSQFGQGVQVT-ATPAGNRKKDAGLTEATSSSFPSS 426
>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509). This
family consists of several uncharacterized viral
proteins from the Marek's disease-like viruses. Members
of this family are typically around 400 residues in
length. The function of this family is unknown.
Length = 377
Score = 32.3 bits (73), Expect = 0.70
Identities = 25/89 (28%), Positives = 33/89 (37%), Gaps = 1/89 (1%)
Query: 532 ASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRS 591
+ D SA S SA S S + N S + +RS ++SGSR
Sbjct: 290 DNRAGTPDDGSAISEDSAGGSHHTMRRPPHSTSGERRGRRRNRSESRSRSRSRSGSRRYR 349
Query: 592 RSVSRARSNRSRSASKSGSSAASSRSGSP 620
R R R RS S+ + SS S
Sbjct: 350 RRRGRGVPGR-RSESRQDTVLVSSSEASD 377
Score = 29.2 bits (65), Expect = 6.7
Identities = 19/64 (29%), Positives = 27/64 (42%)
Query: 559 SNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSG 618
NR+ + S S +SA + + + S S R NRS S S+S S + S R
Sbjct: 290 DNRAGTPDDGSAISEDSAGGSHHTMRRPPHSTSGERRGRRRNRSESRSRSRSRSGSRRYR 349
Query: 619 SPAG 622
G
Sbjct: 350 RRRG 353
Score = 28.8 bits (64), Expect = 9.0
Identities = 20/63 (31%), Positives = 26/63 (41%), Gaps = 5/63 (7%)
Query: 437 HSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRS 496
HS+ R R+RS S+S RSRS S S R + R RS + S
Sbjct: 319 HSTSGERRGRRRNRSESRS-----RSRSRSGSRRYRRRRGRGVPGRRSESRQDTVLVSSS 373
Query: 497 DKS 499
+ S
Sbjct: 374 EAS 376
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 31.5 bits (72), Expect = 0.71
Identities = 13/37 (35%), Positives = 21/37 (56%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
E EE DEEE+ +E E + + +R+E E ++E
Sbjct: 109 VEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145
Score = 30.7 bits (70), Expect = 1.1
Identities = 14/40 (35%), Positives = 23/40 (57%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 418
E EEE E +E +EEE+ +E + E + +R++ ER
Sbjct: 102 EEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENER 141
Score = 30.7 bits (70), Expect = 1.1
Identities = 17/45 (37%), Positives = 24/45 (53%)
Query: 368 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
+R + + L+ +EEEEE+E E EE EE Q E E E +
Sbjct: 86 WRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAK 130
>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727). This
domain family is found in bacteria and eukaryotes, and
is approximately 100 amino acids in length.
Length = 100
Score = 30.3 bits (69), Expect = 0.74
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGE 404
L P E+E E EEE++++ +E ++ E
Sbjct: 18 LPEPEEDEILELEEEDDDDEDEEEEYE 44
Score = 30.3 bits (69), Expect = 0.78
Identities = 13/37 (35%), Positives = 22/37 (59%), Gaps = 4/37 (10%)
Query: 380 PPNEEEE----EEDEEEEEEEAEETQQGETTEAERQE 412
P EE+E EE+++++E+E EE + + E QE
Sbjct: 19 PEPEEDEILELEEEDDDDEDEEEEYELLASFYHEDQE 55
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 30.4 bits (69), Expect = 0.75
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
E E +E E+EE+EEE EE
Sbjct: 78 EAAAEADEAEEEEKEEEAEEE 98
Score = 29.7 bits (67), Expect = 1.4
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
E +E +EEE+EEEAEE
Sbjct: 77 AEAAAEADEAEEEEKEEEAEE 97
Score = 28.9 bits (65), Expect = 2.5
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
+E EEEE EEE EEE+++
Sbjct: 81 AEADEAEEEEKEEEAEEESDD 101
>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5. Members of this
family of proteins are involved in maintaining Golgi
structure. They stimulate the formation of Golgi stacks
and ribbons, and are involved in intra-Golgi retrograde
transport. Two main interactions have been
characterized: one with RAB1A that has been activated by
GTP-binding and another with isoform CASP of CUTL1.
Length = 509
Score = 32.5 bits (74), Expect = 0.75
Identities = 20/82 (24%), Positives = 34/82 (41%), Gaps = 1/82 (1%)
Query: 541 KSAASNRSARSD-GSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARS 599
K+A R ++ D+ S + S+ + + S SSNK+ SRS + + S
Sbjct: 11 KAAVLQRRKATEEDDDEDLLEVSSTTSSPVGSISWSVRETASSNKARSRSEKWNPDQPGS 70
Query: 600 NRSRSASKSGSSAASSRSGSPA 621
S +SK ++ S S
Sbjct: 71 RVSSPSSKKDGTSRSLSSQVDD 92
>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
Length = 580
Score = 32.0 bits (73), Expect = 0.94
Identities = 38/212 (17%), Positives = 60/212 (28%), Gaps = 27/212 (12%)
Query: 380 PPNEEEEEEDEEEEEEEAEE---------TQQGETTEAERQEGEG--GEREGGGEGENDN 428
P + DE E E T G T+EA ++ + RE G G
Sbjct: 9 PGRWNYDSSDESPEGSRDENFDAERDDFLTPLGSTSEATSEDDDDLYPPRETGSGGGVAT 68
Query: 429 --------EKRTSDAEHSSKNSSRASSRSRSRSKSKSP-AKRSRSGSASSRGSNKSDRSA 479
R + +S AS P R S S
Sbjct: 69 STIYTVPRPPRGPEQTLDKPDSLPASRELPPGPTPVPPGGFRGASSPRLGADSTSPRFLY 128
Query: 480 KSDRSAKSDRSARSDRSDKSAAS--NRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSV 537
+ + S+ S + + SR + S K ++ S
Sbjct: 129 QVNFPVILAPIGESNSSSEELSEEEEHSRPPPSESLKVKNGGKVYPKGFSKHK----THK 184
Query: 538 RSDKSAASNRSAR-SDGSDKSASNRSRSRSAS 568
RS+ S + ++AR GS ++ + S S
Sbjct: 185 RSEFSGLTKKAARKRKGSFVFKPSQLKELSGS 216
>gnl|CDD|224346 COG1429, CobN, Cobalamin biosynthesis protein CobN and related
Mg-chelatases [Coenzyme metabolism].
Length = 1388
Score = 32.4 bits (74), Expect = 0.96
Identities = 21/89 (23%), Positives = 38/89 (42%), Gaps = 1/89 (1%)
Query: 535 KSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSV 594
+ V + AA+ +A + S + S +A S AS+ S+A+ S+ +GS + S
Sbjct: 1277 ELVEAATYAATRYAAFAPASATPGAPESVGTTAVSTASSASSATVTGSD-AGSGADSTGP 1335
Query: 595 SRARSNRSRSASKSGSSAASSRSGSPAGS 623
S + A + + SGS +
Sbjct: 1336 SLGAAGSVTGAGEGYEMTKEAVSGSESTG 1364
Score = 30.8 bits (70), Expect = 2.5
Identities = 25/134 (18%), Positives = 38/134 (28%), Gaps = 12/134 (8%)
Query: 467 ASSRGSNKSDRSAKSDRSAKSDRSARSDRS-----DKSAASNRSRSGSAASNKSRSDRSR 521
A N + + D S + A+ + + AA + +
Sbjct: 1242 AELVAENGVACCHHTCGNPALDEWVLGLVSVPGVPELVEAATYAATRYAAFAPASATPGA 1301
Query: 522 SGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRS 581
S G +A S + SA+S SD + S +A S
Sbjct: 1302 PESVG--TTAVS-----TASSASSATVTGSDAGSGADSTGPSLGAAGSVTGAGEGYEMTK 1354
Query: 582 SNKSGSRSRSRSVS 595
SGS S S S
Sbjct: 1355 EAVSGSESTGMSFS 1368
Score = 30.1 bits (68), Expect = 4.6
Identities = 28/202 (13%), Positives = 49/202 (24%), Gaps = 47/202 (23%)
Query: 371 QRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEK 430
Q + R LE + D+E AE + E G N
Sbjct: 1212 QSITARLLEAARRGLWDADDETLRLLAEVYAE--------LVAENGVACCHHTCGNPALD 1263
Query: 431 RTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRS 490
S + + + + + AS+
Sbjct: 1264 EWVL-----GLVSVPGVPELVEAATYAATRYAAFAPASATPGA----------------P 1302
Query: 491 ARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSAR 550
+ S AS+ S + S+ +G+D + S + S
Sbjct: 1303 ESVGTTAVSTASSASSATVTGSD---------AGSGADSTGP---------SLGAAGSVT 1344
Query: 551 SDGSDKSASNRSRSRSASSQAS 572
G + + S S S+ S
Sbjct: 1345 GAGEGYEMTKEAVSGSESTGMS 1366
Score = 29.7 bits (67), Expect = 6.6
Identities = 14/83 (16%), Positives = 28/83 (33%), Gaps = 12/83 (14%)
Query: 543 AASNRSARSDGSDKSASNRSRS-RSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNR 601
A + S + A+++ + + AS S +
Sbjct: 1260 PALDEWVLGLVSVPGVPELVEAATYAATRYAAFAPASATPGA-PESVGTT---------- 1308
Query: 602 SRSASKSGSSAASSRSGSPAGSD 624
+ S + S SSA + S + +G+D
Sbjct: 1309 AVSTASSASSATVTGSDAGSGAD 1331
>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA. Members
of this protein share two defining regions. One is a
histidine/serine-rich cluster, typically
H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
the context of a pair of radical SAM proteins, HxsB and
HxsC, and a fourth protein HxsD. The system is predicted
to perform peptide modifications, likely in the
His-Xaa-Ser region, to produce some uncharacterized
natural product.
Length = 186
Score = 31.4 bits (71), Expect = 0.99
Identities = 20/79 (25%), Positives = 28/79 (35%), Gaps = 7/79 (8%)
Query: 527 SDRSAASNKSVRSDKSAASNRSA--RSDGSDKSASNRSRSRSASSQASNNSAASNRSSNK 584
S S S+ S S S A + D S S S S S S +S S +
Sbjct: 55 SHSSHRSHSSHSSHYSGAGGSYSVPSGDTSTYSYPVPSPSYSPSPGSSIQS-----LPST 109
Query: 585 SGSRSRSRSVSRARSNRSR 603
+G R +S + + R
Sbjct: 110 TGVRPQSSAENANSEKRKL 128
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 31.1 bits (71), Expect = 1.0
Identities = 17/34 (50%), Positives = 19/34 (55%)
Query: 381 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
NE+ EEE EE E EEA E + E T E E E
Sbjct: 4 KNEQVEEEVEETEVEEAVEDEVEEETVEEESEAE 37
Score = 30.4 bits (69), Expect = 2.0
Identities = 19/55 (34%), Positives = 27/55 (49%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
+E EEE EEE E E + +Q + E E + E ER + + +N KR E
Sbjct: 23 DEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERYLRLQADFENYKRRIQKE 77
Score = 30.0 bits (68), Expect = 2.4
Identities = 12/42 (28%), Positives = 19/42 (45%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 420
E E E EE E+E EE ++ E + ++ + E E
Sbjct: 10 EEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEA 51
Score = 30.0 bits (68), Expect = 2.9
Identities = 13/34 (38%), Positives = 18/34 (52%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
E + EEE +E E EE E+ + ET E E +
Sbjct: 3 EKNEQVEEEVEETEVEEAVEDEVEEETVEEESEA 36
Score = 29.6 bits (67), Expect = 3.7
Identities = 11/35 (31%), Positives = 17/35 (48%)
Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
E EE ++E EEE EE + E + E+ +
Sbjct: 11 EVEETEVEEAVEDEVEEETVEEESEAELLDEEQAK 45
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 31.7 bits (72), Expect = 1.1
Identities = 16/57 (28%), Positives = 21/57 (36%), Gaps = 3/57 (5%)
Query: 365 ATEFRIQRLRERFLEPPNEEEEEED---EEEEEEEAEETQQGETTEAERQEGEGGER 418
E +QR E LE E EED E +E + + E + E E R
Sbjct: 393 EEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEAR 449
Score = 31.7 bits (72), Expect = 1.2
Identities = 16/73 (21%), Positives = 29/73 (39%)
Query: 364 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 423
++ E+ + + E ++ EEEE + E EE E +Q E E + + E
Sbjct: 372 DSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEF 431
Query: 424 GENDNEKRTSDAE 436
+ D + E
Sbjct: 432 EDEDELEEAQPEE 444
Score = 30.5 bits (69), Expect = 3.2
Identities = 16/57 (28%), Positives = 21/57 (36%)
Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
EEEE + E EE E Q E R E E E++ E+ + E
Sbjct: 389 ISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEE 445
Score = 29.0 bits (65), Expect = 9.3
Identities = 17/63 (26%), Positives = 23/63 (36%), Gaps = 1/63 (1%)
Query: 365 ATEFRIQRLRERFLEP-PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 423
E QR E L+ E E E E EE+A + E E ++ + E E
Sbjct: 385 VAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEE 444
Query: 424 GEN 426
E
Sbjct: 445 EEE 447
>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6. The VP6 protein a
minor protein in the core of the virion is probably the
viral helicase.
Length = 322
Score = 31.6 bits (71), Expect = 1.1
Identities = 22/109 (20%), Positives = 47/109 (43%), Gaps = 3/109 (2%)
Query: 367 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN 426
E + ++++ ++ E +E E ++E+ E+ E Q+ +GG+ E E E+
Sbjct: 14 ELKQRQIQINLVDWFESGESKEKEPKDEDGQEQRISDGEGE---QKQKGGKEESDKETED 70
Query: 427 DNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKS 475
+ R + E S +S++ S + K G + +GS +
Sbjct: 71 ASVDRRTHTEVGSGSSAKGSGERADKHADGGDGKTRGGGGDADKGSGAT 119
>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
Length = 290
Score = 31.5 bits (71), Expect = 1.1
Identities = 33/142 (23%), Positives = 64/142 (45%), Gaps = 10/142 (7%)
Query: 449 SRSRSKSKSPAKRSRSGSASSRGSNKSDR------SAKSDRSAKSDRSARSDRSDKSAAS 502
S +R + PA R+R G+ +RG ++++R A++D + S+ +R S
Sbjct: 31 SPARVSFEEPADRARRGAGQARGRSQAERRFHHYDEARADYPYYTGSSSEDERPADPRPS 90
Query: 503 NRSRSGSAASNK--SRSDRSRSGSAGSDRSAASN--KSVRSDKSAASNRSARSDGSDKSA 558
R + AS +R R +GS G R A S + + K A++ R +S +D +A
Sbjct: 91 RRPHAQPEASGPGPARGARGPAGSRGRGRRAESPSPRDPPNPKGASAPRGRKSACADSAA 150
Query: 559 SNRSRSRSASSQASNNSAASNR 580
+ + +A + + + +
Sbjct: 151 LLDAPAPAAPKRQKTPAGLARK 172
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 31.8 bits (72), Expect = 1.2
Identities = 10/45 (22%), Positives = 16/45 (35%)
Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
+ + E E E + E+ EE E+ E E + R
Sbjct: 663 DECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRA 707
Score = 29.8 bits (67), Expect = 4.8
Identities = 14/52 (26%), Positives = 26/52 (50%)
Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
+ E EEE++++ EE + +E + ++E + GE E E+D E
Sbjct: 641 HQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAE 692
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 30.1 bits (68), Expect = 1.2
Identities = 16/54 (29%), Positives = 23/54 (42%)
Query: 429 EKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSD 482
K + + K + S SRS K AK+ R S+SS S+ S S+
Sbjct: 68 RKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRSSSSSSSSSSSSSSSSES 121
>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
family).
Length = 619
Score = 31.8 bits (72), Expect = 1.2
Identities = 28/175 (16%), Positives = 59/175 (33%), Gaps = 3/175 (1%)
Query: 452 RSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAA 511
+ K ++ A++ + S+ + S S S R + + + + R+ + + +
Sbjct: 438 QEKQRAKARKKLANKTSTVMEPPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKS 497
Query: 512 SNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQA 571
S + + + + + + K A++RS +S +N + S SS
Sbjct: 498 SLSQEASLISKSTGNTHKHSTPRRLTTLPKLPAASRS-SKGNLIRSGANGNASSDLSSPG 556
Query: 572 SNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRS--GSPAGSD 624
S NS + S S + S ++ RS P S
Sbjct: 557 SINSKSPEHSVPLVRVFDIHLRASTTKGRHSTPSTNEKKKRLLKRSPLSPPKESV 611
Score = 29.4 bits (66), Expect = 6.6
Identities = 37/135 (27%), Positives = 57/135 (42%), Gaps = 11/135 (8%)
Query: 495 RSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSV-RSDKSAASNRSARSDG 553
R +K A R + A+ S GS S S S S R+D++ SN +
Sbjct: 437 RQEKQRAKARKK---LANKTSTVMEPPYGSTES--SVPSTPSTRRNDRNITSNTPSLKRT 491
Query: 554 SDKSASNRSRSRSASSQASNN---SAASNRSSN--KSGSRSRSRSVSRARSNRSRSASKS 608
+ + S+ S+ S S+++ N + R + K + SRS + RS + +AS
Sbjct: 492 PNLTKSSLSQEASLISKSTGNTHKHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSD 551
Query: 609 GSSAASSRSGSPAGS 623
SS S S SP S
Sbjct: 552 LSSPGSINSKSPEHS 566
>gnl|CDD|215598 PLN03138, PLN03138, Protein TOC75; Provisional.
Length = 796
Score = 31.7 bits (72), Expect = 1.3
Identities = 15/61 (24%), Positives = 27/61 (44%)
Query: 563 RSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPAG 622
S S SS S+ S+RS + R+ S+ + S + S++ S S++ +
Sbjct: 12 ASTSLSSSRPQLSSFSSRSPQSATRSPRASSIKCSASASASSSATSSSASLVANGAVALL 71
Query: 623 S 623
S
Sbjct: 72 S 72
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 31.7 bits (72), Expect = 1.3
Identities = 17/64 (26%), Positives = 28/64 (43%), Gaps = 6/64 (9%)
Query: 376 RFLEPPNEEEEEEDEEEEEEEAEETQQGET------TEAERQEGEGGEREGGGEGENDNE 429
R+ E P E+EEE+E+E++++ + E TE E G + E D +
Sbjct: 130 RYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDED 189
Query: 430 KRTS 433
S
Sbjct: 190 ADLS 193
>gnl|CDD|219726 pfam08143, CBFNT, CBFNT (NUC161) domain. This N terminal domain is
found in proteins of CARG-binding factor A-like
proteins.
Length = 65
Score = 28.6 bits (64), Expect = 1.3
Identities = 13/47 (27%), Positives = 18/47 (38%)
Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGG 421
E+F+E E E+ + E A G + G GG EG
Sbjct: 6 EQFMETKFANENGEEGADGREGAPIEGMGGGPAGAPRPGGGGGAEGD 52
>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal. Members
of this family are found in the N terminal region of
cytochrome B561, as well as in various other putative
uncharacterized proteins.
Length = 559
Score = 31.7 bits (72), Expect = 1.4
Identities = 30/152 (19%), Positives = 54/152 (35%), Gaps = 6/152 (3%)
Query: 460 KRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDR 519
K S + KS ++K+ + S S + +RS S S +
Sbjct: 104 KAKDSQFTVVSQAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFS------ 157
Query: 520 SRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASN 579
S S + SNK S S + S G S+ NRS RS+ ++ +
Sbjct: 158 PSSTIQQSPQLTPSNKPASPSSSYQSPSYSSSLGPVNSSGNRSNLRSSPWALRSSGDKKD 217
Query: 580 RSSNKSGSRSRSRSVSRARSNRSRSASKSGSS 611
++++ + V + + SA K+ +
Sbjct: 218 ITTDEKYLETFLAEVDEEQHMITSSAGKNATP 249
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 31.3 bits (71), Expect = 1.5
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDA 435
E+E++D+ EEEE E ++A +E E GE E E D+ + DA
Sbjct: 232 EDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEME-AAEASADDTPDSDDA 282
Score = 30.6 bits (69), Expect = 2.4
Identities = 15/80 (18%), Positives = 31/80 (38%), Gaps = 1/80 (1%)
Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN-D 427
R+ L + EE DE E + + + + E E + E G + + D
Sbjct: 194 AFARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSED 253
Query: 428 NEKRTSDAEHSSKNSSRASS 447
++ + + E ++ AS+
Sbjct: 254 SDASSEEMESGEMEAAEASA 273
>gnl|CDD|221825 pfam12877, DUF3827, Domain of unknown function (DUF3827). This
family contains the human KIAA1549 protein which has
been found to be fused fused to BRAF gene in many cases
of pilocytic astrocytomas. The fusion is due mainly to a
tandem duplication of 2 Mb at 7q34. Although nothing is
known about the function of KIAA1549 protein, the BRAF
protein is a well characterized oncoprotein. It is a
serine/threonine protein kinase which is implicated in
MAP/ERK signalling, a critical pathway for the
regulation of cell division, differentiation and
secretion.
Length = 684
Score = 31.4 bits (71), Expect = 1.6
Identities = 34/140 (24%), Positives = 51/140 (36%), Gaps = 9/140 (6%)
Query: 427 DNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSR--GSNKSDRSAKSDRS 484
D E T ++ S SS R +S S S S S KS R + +
Sbjct: 359 DAEVPTPKSKSSQDGSSNKKRRRGRKSPSDGD---SEGSSVISNRSSREKSGRPSTTPSV 415
Query: 485 AKSDRSARSDRSDKSAASNRSRSG-SAASNKSRSDR-SRSGSAGSDRSAASNKSVRSDKS 542
+ + + K A + + S+AS DR SR S DRS+ + +
Sbjct: 416 TAQQKPTKEEGRKKPAPPSGTDEQLSSASIFEHVDRLSRPSSDPYDRSSGKIQLIAMQPM 475
Query: 543 AASNRSARSDGS--DKSASN 560
A R + S D++A N
Sbjct: 476 PAPPVPPRFEPSRDDRAAEN 495
Score = 30.7 bits (69), Expect = 2.7
Identities = 25/107 (23%), Positives = 41/107 (38%), Gaps = 4/107 (3%)
Query: 478 SAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSV 537
+ KS S + + R KS + S S SN+S ++S S +A
Sbjct: 364 TPKSKSSQDGSSNKKRRRGRKSPSDGDSEGSSVISNRSSREKSGRPSTTPSVTAQQ---- 419
Query: 538 RSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNK 584
+ K + A G+D+ S+ S + +S +RSS K
Sbjct: 420 KPTKEEGRKKPAPPSGTDEQLSSASIFEHVDRLSRPSSDPYDRSSGK 466
>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
Length = 222
Score = 30.8 bits (69), Expect = 1.7
Identities = 16/75 (21%), Positives = 38/75 (50%)
Query: 370 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
+ RL+ ++ ++ +++E++E++E+ E Q E E Q+ + + + +N
Sbjct: 138 LNRLKPHMMKTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNNN 197
Query: 430 KRTSDAEHSSKNSSR 444
KR + E S + +R
Sbjct: 198 KRKAMEEGLSPDRTR 212
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 30.3 bits (67), Expect = 1.7
Identities = 24/71 (33%), Positives = 32/71 (45%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
+P +E EEEE+ E EEE EE + E E E + E GE E E E + E
Sbjct: 59 DPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPEP 118
Query: 439 SKNSSRASSRS 449
A+ +S
Sbjct: 119 EPEPGAATLKS 129
>gnl|CDD|215556 PLN03064, PLN03064, alpha,alpha-trehalose-phosphate synthase
(UDP-forming); Provisional.
Length = 934
Score = 31.3 bits (71), Expect = 1.7
Identities = 32/102 (31%), Positives = 40/102 (39%), Gaps = 11/102 (10%)
Query: 499 SAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA 558
S A RSRS KS DR SG S RS + N + +S S S KS
Sbjct: 818 SPAIARSRSPDGL--KSSGDRRPSGKLPSSRSNSKNSQGKKQRSLLS--------SAKSG 867
Query: 559 SNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSN 600
N + S + + S + K G S +V R RSN
Sbjct: 868 VNHAASHGSDRRPSPEKIGWSVLDLK-GENYFSCAVGRKRSN 908
Score = 31.3 bits (71), Expect = 1.7
Identities = 20/64 (31%), Positives = 26/64 (40%), Gaps = 1/64 (1%)
Query: 443 SRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRS-AKSDRSARSDRSDKSAA 501
S A +RSRS KS R SG S SN + K RS S +S + + +
Sbjct: 818 SPAIARSRSPDGLKSSGDRRPSGKLPSSRSNSKNSQGKKQRSLLSSAKSGVNHAASHGSD 877
Query: 502 SNRS 505
S
Sbjct: 878 RRPS 881
>gnl|CDD|220776 pfam10483, Hap2_elong, Histone acetylation protein 2. Hap2 is one
of three histone acetyltransferases proteins that, in
yeasts, are found associated with elongating forms of
RNA polymerase II (Elongator). The Haps can be isolated
in two forms, as a six-subunit complex with Elongator
and as a complex of the three proteins on their own. The
role of the Hap complex in transcription is still
speculative, being possibly to keep the HAT activity of
free Elongator in check, allowing histone acetylation
only in the presence of a transcribing polymerase, or
the interaction with Haps might render Elongator
susceptible to modifications thereby altering its
activity.
Length = 280
Score = 30.8 bits (70), Expect = 1.8
Identities = 29/116 (25%), Positives = 43/116 (37%), Gaps = 30/116 (25%)
Query: 333 ELETRVR-LSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQR---LRERFLEPPNE----- 383
+ E R LS+ + N +V+ E R + L E F+ P
Sbjct: 165 DKEALDRSLSEPVFPLNEGLNGVGIVLE------LENRRKSGRSLTETFILSPATHEYES 218
Query: 384 -EEEEEDEEEEEEEAEETQQGETT------EAERQEGEG--------GEREGGGEG 424
+E+ E + E EE E+ + TT E +R E E+ GGGEG
Sbjct: 219 LKEKPEGDAEAAEEDPESLEDLTTFNLGLSEKQRLAREQVELPFFDAQEKLGGGEG 274
>gnl|CDD|222914 PHA02666, PHA02666, hypothetical protein; Provisional.
Length = 287
Score = 31.1 bits (69), Expect = 1.9
Identities = 23/106 (21%), Positives = 45/106 (42%), Gaps = 4/106 (3%)
Query: 489 RSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRS 548
++ R D +S +R R+ S S + + SA + AS+ + ++ + + +
Sbjct: 17 KTMRGDGHHESCFHHRRRANSMESRRKSRPSRQHRSAERTPTTASSLTHENNTAPSRHGK 76
Query: 549 ARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSV 594
S AS+RS S+ +S+N A + RS+ +V
Sbjct: 77 QHS----CKASSRSSHNRGSTSSSHNHHAHRGPHQSAHRRSKHDAV 118
>gnl|CDD|227928 COG5641, GAT1, GATA Zn-finger-containing transcription factor
[Transcription].
Length = 498
Score = 31.0 bits (70), Expect = 1.9
Identities = 29/196 (14%), Positives = 71/196 (36%), Gaps = 2/196 (1%)
Query: 424 GENDNEKRTSDAEHSSKNSSRASSRS--RSRSKSKSPAKRSRSGSASSRGSNKSDRSAKS 481
++N + ++ + ++S+ + + S +GS S R +N +
Sbjct: 218 SSHNNNDSNGENANTESIGNSSASKLTKSWEERPQGRQLLSDAGSLSPRSNNPKSPLLEG 277
Query: 482 DRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDK 541
+ S + + + + RS ++ + S S + S ++ K S +
Sbjct: 278 LMGSTSLQPVSTPKLVLPSDKKRSTLTTSTATPLWRRTSDKSSFSCNASGSALKPPGSKR 337
Query: 542 SAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNR 601
S S+ + N S + + + ++ S + K +++RS ++ + R
Sbjct: 338 PLLPKPDPNSKRSNATCMNCSSTPTNKILSPPTTSNSPGAQVKLPNQTRSTGATKKKITR 397
Query: 602 SRSASKSGSSAASSRS 617
R S + +SS
Sbjct: 398 RRMNSGKIPALSSSMK 413
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 28.7 bits (65), Expect = 2.0
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 379 EPPNEEEEEEDEEEEEEEA 397
EEE++E+EEEEEE+
Sbjct: 63 AAAAEEEKKEEEEEEEEDD 81
Score = 28.0 bits (63), Expect = 3.6
Identities = 10/17 (58%), Positives = 13/17 (76%)
Query: 383 EEEEEEDEEEEEEEAEE 399
EEE+ EEEEEEE ++
Sbjct: 65 AAEEEKKEEEEEEEEDD 81
Score = 28.0 bits (63), Expect = 4.2
Identities = 8/20 (40%), Positives = 11/20 (55%)
Query: 380 PPNEEEEEEDEEEEEEEAEE 399
E+E++EEEE EE
Sbjct: 59 AAAAAAAAEEEKKEEEEEEE 78
Score = 27.2 bits (61), Expect = 6.5
Identities = 10/18 (55%), Positives = 13/18 (72%)
Query: 379 EPPNEEEEEEDEEEEEEE 396
EEE+++EEEEEEE
Sbjct: 62 AAAAAEEEKKEEEEEEEE 79
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 31.2 bits (71), Expect = 2.1
Identities = 27/88 (30%), Positives = 38/88 (43%), Gaps = 8/88 (9%)
Query: 333 ELETRV-RLSKRRQKVGAAPNNTRLVVTHRP-LNATEFRIQRLRERFLEPPNEEEEEEDE 390
E E R+ RL +RR+ + L+ R + R + LRER E E EE+ +
Sbjct: 506 EAEDRIERLEERREDL------EELIAERRETIEEKRERAEELRERAAELEAEAEEKREA 559
Query: 391 EEEEEEAEETQQGETTEAERQEGEGGER 418
E EE E + E E + E ER
Sbjct: 560 AAEAEEEAEEAREEVAELNSKLAELKER 587
>gnl|CDD|236779 PRK10864, PRK10864, putative methyltransferase; Provisional.
Length = 346
Score = 30.9 bits (70), Expect = 2.2
Identities = 11/49 (22%), Positives = 16/49 (32%)
Query: 551 SDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARS 599
D SDK N + + + A R + S+SR S
Sbjct: 20 DDDSDKRTHNPRTGKGGGRPSGKSRADGGRRPARDDRNSQSRDRKWEDS 68
>gnl|CDD|218576 pfam05395, DARPP-32, Protein phosphatase inhibitor 1/DARPP-32.
This family consists of several mammalian protein
phosphatase inhibitor 1 (IPP-1) and dopamine- and
cAMP-regulated neuronal phosphoprotein (DARPP-32)
proteins. Protein phosphatase inhibitor-1 is involved in
signal transduction and is an endogenous inhibitor of
protein phosphatase-1. It has been demonstrated that
DARPP-32, if phosphorylated, can inhibit
protein-phosphatase-1. DARPP-32 has a key role in many
neurotransmitter pathways throughout the brain and has
been shown to be involved in controlling receptors, ion
channels and other physiological factors including the
brain's response to drugs of abuse, such as cocaine,
opiates and nicotine. DARPP-32 is reciprocally regulated
by the two neurotransmitters that are most often
implicated in schizophrenia - dopamine and glutamate.
Dopamine activates DARPP-32 through the D1 receptor
pathway and disables DARPP-32 through the D2 receptor.
Glutamate, acting through the N-methyl-d-aspartate
receptor, renders DARPP-32 inactive. A mutant form of
DARPP-32 has been linked with gastric cancers.
Length = 170
Score = 29.9 bits (67), Expect = 2.2
Identities = 19/88 (21%), Positives = 27/88 (30%), Gaps = 2/88 (2%)
Query: 370 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
+QR+ E L+ EE + EEE+E + G T E G D +
Sbjct: 83 LQRIVESHLQSQGSLEENQASEEEDELDSPDELGYPTGETESTEAQDSEEPGSCPTQDAQ 142
Query: 430 KR--TSDAEHSSKNSSRASSRSRSRSKS 455
H + S S S
Sbjct: 143 SEVGLEGPWHRTAELSGPEQEGDSEEPS 170
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 31.0 bits (70), Expect = 2.4
Identities = 43/269 (15%), Positives = 87/269 (32%), Gaps = 31/269 (11%)
Query: 368 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAE--RQEGEGG--------- 416
++ R + P ++ + E EE + AE + +G G
Sbjct: 1195 KKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAA 1254
Query: 417 -EREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKS 475
E+E E + ++ + S+ S + ++ A R + ++ S S+
Sbjct: 1255 KEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARRAAARKKPLASVSVISDSD 1314
Query: 476 DR--------SAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGS 527
D S K R + + ++ A+ +S +
Sbjct: 1315 DDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKPA 1374
Query: 528 DRSAAS-NKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSG 586
+ S K VR +++ N+ KS S R+ + S+ + + + SS K
Sbjct: 1375 EAIGISPEKKVRKMRASPFNK--------KSGSVLGRAATNKETESSENVSGSSSSEKDE 1426
Query: 587 --SRSRSRSVSRARSNRSRSASKSGSSAA 613
++ R R + S S S +A
Sbjct: 1427 IDVSAKPRPQRANRKQTTYVLSDSESESA 1455
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 30.7 bits (69), Expect = 2.4
Identities = 31/174 (17%), Positives = 72/174 (41%), Gaps = 11/174 (6%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
EE EE++ + + +A + + +Q+ EG E E E K + A +K +
Sbjct: 83 EEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGT--EEVTEEEKAAAKAKAAAAAKAKAA 140
Query: 443 SRASSRSRSRSKSKSP--------AKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSD 494
+ A + + AK + +A ++ + + + A + +
Sbjct: 141 ALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEK 200
Query: 495 RSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRS 548
K+ A+ +++ +AA K ++ + +G +G + + A + K+AA+ R+
Sbjct: 201 AKAKAKAAAAAKAKAAALAKQKASQG-NGDSGDEDAKAKAIAAAKAKAAAAARA 253
Score = 29.9 bits (67), Expect = 4.5
Identities = 39/219 (17%), Positives = 92/219 (42%), Gaps = 11/219 (5%)
Query: 371 QRLRERFLEPPNEEEEEEDEEE--EEEEAEETQQGETTEAERQEGEGGEREGGGEGENDN 428
+ R+R L + E + EEE E+E+A T E ++ +
Sbjct: 20 EEARKR-LVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALAKQK 78
Query: 429 EKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSD 488
+ T + K ++A + + +++K+ + AK+ R G+ K+ AK+ +AK+
Sbjct: 79 REGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKA- 137
Query: 489 RSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRS 548
A + K + ++K ++ ++++ +A ++AA K + AA
Sbjct: 138 -KAAALAKQKREGTEEVTEEEEETDKEKA-KAKAAAAAKAKAAALAK-----QKAAEAGE 190
Query: 549 ARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGS 587
+ +++ + +A+++A + A ++S +G
Sbjct: 191 GTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGD 229
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 30.7 bits (69), Expect = 2.5
Identities = 9/78 (11%), Positives = 20/78 (25%)
Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSS 443
+ + + E + ++ +E + E D K A +
Sbjct: 382 QAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPGDDP 441
Query: 444 RASSRSRSRSKSKSPAKR 461
R + S +R
Sbjct: 442 AEPDGIRRQDDFSSRRRR 459
Score = 29.6 bits (66), Expect = 5.6
Identities = 19/177 (10%), Positives = 46/177 (25%), Gaps = 10/177 (5%)
Query: 388 EDEEEEEEEAEETQQGE-TTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRAS 446
DE E E AE + + GE + + ++ S
Sbjct: 292 VDEALEREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQ 351
Query: 447 SRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSR 506
R + + + S + + + + ++ + + SA + A+
Sbjct: 352 VADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDE 411
Query: 507 SGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSR 563
+ K+ + + ++ + D +S R R
Sbjct: 412 TEPEVPEKA------APIPDPAKP---DELAVAGPGDDPAEPDGIRRQDDFSSRRRR 459
Score = 29.6 bits (66), Expect = 6.1
Identities = 8/74 (10%), Positives = 17/74 (22%), Gaps = 7/74 (9%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
E EE +E + E E+ + +E +
Sbjct: 392 EAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAK-------PDELAVAGPGDDPAEP 444
Query: 443 SRASSRSRSRSKSK 456
+ S+ +
Sbjct: 445 DGIRRQDDFSSRRR 458
>gnl|CDD|240430 PTZ00473, PTZ00473, Plasmodium Vir superfamily; Provisional.
Length = 420
Score = 30.6 bits (69), Expect = 2.5
Identities = 21/104 (20%), Positives = 33/104 (31%), Gaps = 6/104 (5%)
Query: 516 RSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNS 575
G ++ N S+ S R D S S+ + +S
Sbjct: 309 NMGHDSRGPYNANYGGQFNSRSGRTGSSESIRGFTYDSSTTYGG------SSYGTSQTDS 362
Query: 576 AASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGS 619
++ S + S + S S S S++ GSS SS S
Sbjct: 363 TSTYGSRSTFDSSTGGGSQSGGGSTYGGSSTFDGSSRGSSDSFG 406
Score = 29.8 bits (67), Expect = 4.6
Identities = 18/82 (21%), Positives = 32/82 (39%), Gaps = 2/82 (2%)
Query: 545 SNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRS--RSRSVSRARSNRS 602
N + GS +S + S + S+ + S++ GSRS S + ++S
Sbjct: 326 FNSRSGRTGSSESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSGGG 385
Query: 603 RSASKSGSSAASSRSGSPAGSD 624
+ S + SSR S +
Sbjct: 386 STYGGSSTFDGSSRGSSDSFGV 407
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 30.1 bits (68), Expect = 2.5
Identities = 14/45 (31%), Positives = 21/45 (46%)
Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
E NE +EE+ +EE+ E EE ++ E E + E E
Sbjct: 14 EEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEE 58
>gnl|CDD|215448 PLN02834, PLN02834, 3-dehydroquinate synthase.
Length = 433
Score = 30.5 bits (69), Expect = 2.6
Identities = 9/62 (14%), Positives = 23/62 (37%)
Query: 556 KSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASS 615
KS+S + + + S + + + N S S+ ++ + + + S +
Sbjct: 2 KSSSADNSESNTPTVLSRSPSDAFFDQNSSIESSKEGDLTEVIHEKCPVSGANKSEVTKT 61
Query: 616 RS 617
S
Sbjct: 62 AS 63
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 29.7 bits (67), Expect = 2.9
Identities = 17/76 (22%), Positives = 28/76 (36%), Gaps = 3/76 (3%)
Query: 373 LRERFLEPPNEEEEEEDEEEEEEEAEET--QQGETTEAERQEGEGGEREGGGEGENDNEK 430
LRER L+ E EEE + + + + EAE+Q + + + K
Sbjct: 66 LRERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERK-SAK 124
Query: 431 RTSDAEHSSKNSSRAS 446
+ S+AS
Sbjct: 125 DPRGGTQDVVDKSQAS 140
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 30.4 bits (69), Expect = 3.0
Identities = 18/79 (22%), Positives = 33/79 (41%), Gaps = 6/79 (7%)
Query: 383 EEEEEEDEEEEEEEAEETQQG-----ETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
+E + E + +E E EE + + E EG E + GE E+D+E +
Sbjct: 205 KEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESA-WEGFE 263
Query: 438 SSKNSSRASSRSRSRSKSK 456
S R + ++K++
Sbjct: 264 SEYEPINKPVRPKRKTKAQ 282
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 30.4 bits (69), Expect = 3.0
Identities = 19/107 (17%), Positives = 38/107 (35%)
Query: 363 LNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGG 422
L + L + + +E+ E EEE+ E ++ + +AE E E
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60
Query: 423 EGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASS 469
+ E + E + A R+ R + A+ ++ S +
Sbjct: 61 KPAASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAEERKALSTGT 107
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 29.6 bits (67), Expect = 3.1
Identities = 16/33 (48%), Positives = 19/33 (57%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ 411
+EEEE E+EE EEE E +Q E E E Q
Sbjct: 19 AEKSEEEEAEEEEPEEENELEEEQQEIAELEAQ 51
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 29.8 bits (68), Expect = 3.2
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ 411
E E+++++DE+E+E++ EE G E R+
Sbjct: 50 EEDLEDDDDDDEDEDEDDEEEADLGPDPEEARE 82
Score = 29.1 bits (66), Expect = 5.0
Identities = 13/63 (20%), Positives = 21/63 (33%), Gaps = 7/63 (11%)
Query: 367 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN 426
E R+ L F++P E E EE + E + + + + E E
Sbjct: 19 EGRLSDLISGFIDPNAAAAAATAAAIESELDEE-------DLEDDDDDDEDEDEDDEEEA 71
Query: 427 DNE 429
D
Sbjct: 72 DLG 74
Score = 28.3 bits (64), Expect = 9.1
Identities = 8/36 (22%), Positives = 17/36 (47%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
E E ++E+ E+++ ++ + E E E G
Sbjct: 40 TAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGP 75
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 29.5 bits (66), Expect = 3.2
Identities = 26/115 (22%), Positives = 46/115 (40%), Gaps = 10/115 (8%)
Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
+ PP ++EEED + + EE EE ++ + + E E E E + K S +
Sbjct: 12 VRPPPTKDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAPSSSSK 71
Query: 438 SSKN----------SSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSD 482
SS + R S R + +P ++ ++R KS R ++
Sbjct: 72 SSSQETISIPPTPPARRPSRRWDQTGRFPNPTTGAKPTLRAARREYKSWRELRNR 126
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 30.2 bits (68), Expect = 3.6
Identities = 8/29 (27%), Positives = 14/29 (48%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTE 407
P E EEE DE ++ + + + + E
Sbjct: 280 PEPPEPEEEPDEPDQTDPDDGEETDQIPE 308
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell cycle
progression and biogenesis of 60S ribosomal subunits.
The Schistosoma mansoni Mak16 has been shown to target
protein transport to the nucleolus.
Length = 97
Score = 28.3 bits (63), Expect = 3.6
Identities = 16/33 (48%), Positives = 20/33 (60%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
EE EE DEEEEEEE EE + ++ +E E
Sbjct: 57 AEESEENDEEEEEEEEEEDEGEIEYVSDDEELE 89
Score = 27.5 bits (61), Expect = 6.4
Identities = 16/36 (44%), Positives = 19/36 (52%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 417
EE EE++EEEEEE EE +GE E E
Sbjct: 56 EAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEE 91
>gnl|CDD|240237 PTZ00038, PTZ00038, ferredoxin; Provisional.
Length = 191
Score = 29.4 bits (66), Expect = 3.7
Identities = 14/64 (21%), Positives = 26/64 (40%), Gaps = 4/64 (6%)
Query: 557 SASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSR 616
SA S ++S + +S+ S N R+ S+ ++ N SR +S + +S
Sbjct: 28 SADLAFSICSGRRRSSASFISSSASPN----RASCYSLGNSKGNTSRPSSNGVTPGSSPV 83
Query: 617 SGSP 620
Sbjct: 84 RSLL 87
>gnl|CDD|221661 pfam12609, DUF3774, Wound-induced protein. This family of proteins
is found in eukaryotes. Proteins in this family are
typically between 81 and 97 amino acids in length. The
proteins in the family are often annotated as
wound-induced proteins however there is little
accompanying literature to confirm this.
Length = 78
Score = 27.8 bits (62), Expect = 3.7
Identities = 12/40 (30%), Positives = 18/40 (45%)
Query: 504 RSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSA 543
RS +A +N RS S+ S SA++ + K A
Sbjct: 24 RSLHQAAKANARRSASQAGKSSSSSSSASAAAAEVKAKQA 63
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 30.0 bits (68), Expect = 3.8
Identities = 18/52 (34%), Positives = 24/52 (46%), Gaps = 9/52 (17%)
Query: 385 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
+EED EE E+AEE E +E + E E E + D E+ D E
Sbjct: 274 GDEEDLEELLEKAEE---------EEEEDDYSESEDEDEEDEDEEEEEDDDE 316
Score = 29.2 bits (66), Expect = 5.8
Identities = 11/35 (31%), Positives = 23/35 (65%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 417
E+ EEE+EE++ E+E+ + + E E ++ + G+
Sbjct: 284 EKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGD 318
Score = 29.2 bits (66), Expect = 6.8
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
E LE EEEEE+D E E+E EE + E E + + +
Sbjct: 279 LEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 28.8 bits (64), Expect = 3.9
Identities = 11/51 (21%), Positives = 24/51 (47%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
E + +E+ ++ E + E ++ ET EA + + + E E ++E
Sbjct: 77 EAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDE 127
>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
synthase complex, beta subunit. Nomenclature follows
the description for Methanosarcina thermophila. The
CO-methylating acetyl-CoA synthase is considered the
defining enzyme of the Wood-Ljungdahl pathway, used for
acetate catabolism by sulfate reducing bacteria but for
acetate biosynthesis by acetogenic bacteria such as
oorella thermoacetica (f. Clostridium thermoaceticum)
[Energy metabolism, Chemoautotrophy].
Length = 458
Score = 30.2 bits (68), Expect = 4.0
Identities = 12/25 (48%), Positives = 14/25 (56%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTE 407
E +EEE EEEEE E + E E
Sbjct: 402 EVDEEEIEEEEEAMQPEEMEMEGFE 426
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 30.1 bits (68), Expect = 4.0
Identities = 24/99 (24%), Positives = 42/99 (42%), Gaps = 8/99 (8%)
Query: 369 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDN 428
+++L + E++E+E EE E+ E Q+GE A Q E G + N
Sbjct: 294 ELKKLENKL-----EKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKSVRLADFYGN 348
Query: 429 EKRTSDAEHS---SKNSSRASSRSRSRSKSKSPAKRSRS 464
E+ + + S S+N+ R + + +K R S
Sbjct: 349 EEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLS 387
>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and
is typically between 384 and 430 amino acids in length.
Length = 414
Score = 29.8 bits (67), Expect = 4.1
Identities = 39/147 (26%), Positives = 61/147 (41%), Gaps = 16/147 (10%)
Query: 463 RSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRS 522
RS S S R S + S S+R+ SD + + S SG AS +R SR
Sbjct: 20 RSDSVSKRWSAQLPSGLSRGNSFLSNRN-----SDAAPSGTDSLSGRPASRLNREPSSRP 74
Query: 523 GSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA-SNRSRSRSASSQASNNSAASNRS 581
GS+ S+ + + ++ A + +G + + + RSRSAS +NS +R
Sbjct: 75 GSSHSEATIVRQ-AKEGERPATPPEARPDEGFVRPSLPSHPRSRSASV---SNSKDGDRP 130
Query: 582 SNKSGSRSRSRSVSRARSNRSRSASKS 608
S S S+ R S +K+
Sbjct: 131 S------DLPPSPSKTMDPRRWSPTKA 151
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 30.1 bits (67), Expect = 4.1
Identities = 31/172 (18%), Positives = 62/172 (36%), Gaps = 3/172 (1%)
Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTS 433
R+ +P + EE E+E +++ + E E + G + E E+DNE+
Sbjct: 407 RKTRRQPTGKAIAEETSREDELSFDDSDVSTSDENEDVDFTGKKGAINNEDESDNEEVAF 466
Query: 434 DAEHS---SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRS 490
D++ S+ + R S+ KR R+ + S + +S +S
Sbjct: 467 DSDSQFDESEGNLRWKEGLASKLAYSQSGKRGRNIQKIFYDESLSPEECIEEYKGESAKS 526
Query: 491 ARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKS 542
+ SD + + A+ S+ + + + S+ KS
Sbjct: 527 SESDLVVQDEPEDFFDVSKVANESISSNHEKLMESEFEELKKKWSSLAQLKS 578
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 30.0 bits (67), Expect = 4.2
Identities = 41/243 (16%), Positives = 76/243 (31%), Gaps = 31/243 (12%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNS 442
+ EE+D++ EEE+ + T + ++ + G G + + + +
Sbjct: 114 TDSEEDDDDSEEEDNKSTSSKD--GKGSKKTQPGVST--SSGSTTSGTDLNTKQSQTGLG 169
Query: 443 SRASSRSRSRSKSKSPAKRSR-----------SGSASSRGSNKSDRSAKSDRSAKSDRSA 491
+ S + + S+S G A + R+ S
Sbjct: 170 ASGSHAQQDPAVSQSGVVGVPGLGVPGVGVPGGGGAGALPGVGVGRAGVSPGVGVGGLGG 229
Query: 492 RSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARS 551
ASN SR G ++ R R G VR S +S + R
Sbjct: 230 VPG--VGILASNTSREGQTQDDQERDGDGRVIEPG-----VGLPGVRVGDSTSSPSTTRP 282
Query: 552 DGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSR-------SRSVSRARSNRSRS 604
GS + + S S+ S++ N + + S S R+++ R
Sbjct: 283 SGS--TTTTTPASSGPSAPGGPGSSSRNAVTRSTDSISGPIPSPGAPRAITGQMGEREMF 340
Query: 605 ASK 607
A +
Sbjct: 341 AVQ 343
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 26.9 bits (60), Expect = 4.2
Identities = 10/20 (50%), Positives = 14/20 (70%)
Query: 383 EEEEEEDEEEEEEEAEETQQ 402
EE+ E++EE E+EE E Q
Sbjct: 23 EEKREDEEENEDEEEGEEQS 42
Score = 26.9 bits (60), Expect = 4.2
Identities = 13/22 (59%), Positives = 16/22 (72%)
Query: 383 EEEEEEDEEEEEEEAEETQQGE 404
EEE+ EDEEE E+E E +Q E
Sbjct: 22 EEEKREDEEENEDEEEGEEQSE 43
Score = 26.5 bits (59), Expect = 4.6
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 382 NEEEEEEDEEEEEEEAEETQQ 402
E+ E+E+E E+EEE EE +
Sbjct: 23 EEKREDEEENEDEEEGEEQSE 43
>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
Provisional.
Length = 291
Score = 29.8 bits (66), Expect = 4.3
Identities = 43/217 (19%), Positives = 86/217 (39%), Gaps = 15/217 (6%)
Query: 380 PPNEEEEEEDEEEEEEEAEETQQG-ETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
P EE++ED +++ EE ++ G E E E+ G+ G+ G EK+ +E
Sbjct: 71 PLPTEEDDEDVDDDSEEGDDDDGGAEDEEEEKVRGQSGQEGTVALGSGSTEKKLIGSE-- 128
Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSDK 498
K + + S + S S S S G ++D++ + + + + +
Sbjct: 129 -KQTELSISSAESISPSGSRELNVNLTQTEVEGKKETDKNTPAVENPLTTGNGENTLPAG 187
Query: 499 SAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSA 558
N S SR G ++ K+V ++AA+ +S GS+ +
Sbjct: 188 IVEGNPSPPPPQDGIHSREQD------GEGTTSEGQKNVPLPETAATPQSHHDKGSEGTG 241
Query: 559 SNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVS 595
+ + ++ + N+ + + N GS + S + S
Sbjct: 242 ED-----TKATTVTANTTDTTNTQNSDGSTAVSHTTS 273
>gnl|CDD|218908 pfam06136, DUF966, Domain of unknown function (DUF966). Family of
plant proteins with unknown function.
Length = 308
Score = 29.7 bits (67), Expect = 4.4
Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 8/94 (8%)
Query: 533 SNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNN------SAASNRSSNKSG 586
S+ S + A+S + SD NR ++S SS ++ S+
Sbjct: 91 SSSSKGDPEEASS--RKLQEESDTPPVNRRANQSWSSSDLAEYKVYKAEEPADASTQTDD 148
Query: 587 SRSRSRSVSRARSNRSRSASKSGSSAASSRSGSP 620
RSR S + + S SS++ S S SP
Sbjct: 149 RRSRDSSEAESTELSREEISPPSSSSSPSSSSSP 182
>gnl|CDD|177912 PLN02272, PLN02272, glyceraldehyde-3-phosphate dehydrogenase.
Length = 421
Score = 29.8 bits (67), Expect = 4.4
Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 5/60 (8%)
Query: 492 RSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARS 551
RS + +AA+ S S++S D S+ S G S + + S S+ + SARS
Sbjct: 8 RSAATAPAAAARGSDFSSSSS-----DPSKVSSVGFSSSLSFSGSSSGASSSLQSCSARS 62
>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
Length = 869
Score = 30.0 bits (68), Expect = 4.5
Identities = 17/40 (42%), Positives = 20/40 (50%), Gaps = 1/40 (2%)
Query: 378 LEP-PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGG 416
LEP P EE EEE EE EE E + E + G+ G
Sbjct: 828 LEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQLG 867
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 29.7 bits (66), Expect = 4.6
Identities = 17/85 (20%), Positives = 36/85 (42%), Gaps = 4/85 (4%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETT----EAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
E +E E ++E++ EE + E +++R + G E+E E E + A+
Sbjct: 284 IENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADEL 343
Query: 439 SKNSSRASSRSRSRSKSKSPAKRSR 463
++ + R + +P + R
Sbjct: 344 NELEKGIEEKRRQMESATNPILQKR 368
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 29.7 bits (67), Expect = 4.7
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 417
+EE E++ +E+EE+ ++ E E +E E GE
Sbjct: 35 TEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 28.9 bits (65), Expect = 4.7
Identities = 15/56 (26%), Positives = 26/56 (46%)
Query: 381 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
E E +D EE+EE EE + E+++ E + E++ + E + T E
Sbjct: 1 SGSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEE 56
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 29.3 bits (66), Expect = 4.9
Identities = 28/165 (16%), Positives = 49/165 (29%), Gaps = 10/165 (6%)
Query: 433 SDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSG--SASSRGSNKSDRSAKSDRSAKSDRS 490
+ E SK +++ R+ +SK K S K + K+ SA S S
Sbjct: 95 RELEKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPES 154
Query: 491 ARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAAS----N 546
S S +++ + A ++ S S+ KS +S +
Sbjct: 155 --SPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKM 212
Query: 547 R--SARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRS 589
R D D + + S + A G +
Sbjct: 213 RKWDLDGDEDDDAVLDYSAPDANDENADAPEDVEEVDQESWGRGT 257
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 27.7 bits (62), Expect = 5.0
Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 1/50 (2%)
Query: 374 RERFLEPPNEEEEEEDEEEE-EEEAEETQQGETTEAERQEGEGGEREGGG 422
R +LE ++ +E+++E EE +EE E + E E + GGG
Sbjct: 42 RTDYLEVTDKVKEKKEEREEDKEELIERIKKEEETFEDVDLGSAGCGGGG 91
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 29.8 bits (68), Expect = 5.0
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 371 QRLRERFLEPPNEEEEEEDEEEEEEEAEE 399
Q++RE FL+ EEEEE+EEEE E
Sbjct: 517 QKIREEFLQKHKNEEEEEEEEEELPLIPE 545
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 29.0 bits (65), Expect = 5.0
Identities = 13/66 (19%), Positives = 32/66 (48%)
Query: 377 FLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
F E +E++E +E EE+EA + ++ + + E++ E ++ + + K+ +
Sbjct: 91 FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKT 150
Query: 437 HSSKNS 442
+ K
Sbjct: 151 TTKKEE 156
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 30.0 bits (67), Expect = 5.2
Identities = 15/57 (26%), Positives = 23/57 (40%)
Query: 342 KRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAE 398
++ A + L + PL I + +E N EE E +EE E+ E E
Sbjct: 339 EKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDE 395
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 29.3 bits (65), Expect = 5.3
Identities = 28/116 (24%), Positives = 44/116 (37%)
Query: 353 NTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
N L P A + +E + EEEEEE+EEE E E E T+ +
Sbjct: 110 NIGLAALQLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEV 169
Query: 413 GEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSAS 468
G GG GE E+ ++AE + ++ S + + + G+
Sbjct: 170 DHGNGSSGGDNGEEGEEESVTEAEAEGTTVAGPTTTSPNGGFQPTTPPQEVYGTTD 225
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 29.1 bits (65), Expect = 5.4
Identities = 17/49 (34%), Positives = 20/49 (40%), Gaps = 3/49 (6%)
Query: 381 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
P + E E E E+ E+T E E EGG G G G D E
Sbjct: 116 PEAKGEGEGHEPEDPPPEDTPPPPGGEGEV---EGGPSPGPGPGPLDQE 161
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 29.2 bits (65), Expect = 5.7
Identities = 27/112 (24%), Positives = 47/112 (41%), Gaps = 5/112 (4%)
Query: 348 GAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTE 407
G P N R + ++ E R +ER++E EEEE + E E + E ++ + +
Sbjct: 175 GDTPLNVREHLWNKAATEREKRQDE-KERYVE---EEEESDTELEAVTDDSEKEKTKKKD 230
Query: 408 AERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSSRASSRSRSRS-KSKSP 458
E+ G E E ++ + SD + N + R + KS+ P
Sbjct: 231 LEKWLGSDQSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKP 282
Score = 29.2 bits (65), Expect = 6.9
Identities = 19/90 (21%), Positives = 39/90 (43%), Gaps = 6/90 (6%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHS 438
E +E+E EEEEE + E + +E E+ + + E+ G +++ +E
Sbjct: 194 EKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLG------SDQSMETSESE 247
Query: 439 SKNSSRASSRSRSRSKSKSPAKRSRSGSAS 468
+ SS + S +K ++ ++ A
Sbjct: 248 EEESSESESDEDEDEDNKGKIRKRKTDDAK 277
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 29.7 bits (66), Expect = 5.8
Identities = 54/237 (22%), Positives = 82/237 (34%), Gaps = 8/237 (3%)
Query: 373 LRERFLEPPNEEEEEEDEEEEEEEAE-ETQQGETTEAERQEGEGGEREGGGEGENDNEKR 431
L ++ +E EE E E E E E E + E E ER E + ER+ E + R
Sbjct: 455 LEKKRIERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDR 514
Query: 432 TS--DAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKSDR 489
E + +NS S P G+ G SD ++ +S
Sbjct: 515 LERDRLEKARRNSYFLKGMENGLSAGGGPGDGPGVGAGVGAGVGTSD--GRNHSGVRSGI 572
Query: 490 SARSDRSDKSAASNRSRSGSAASNKSR-SDRSRSGSAGSDRSAASNKSVRSDKSAASNRS 548
S + + RSG S + D RSG S R + S+R + RS
Sbjct: 573 HCSIQSSARGGVHDSVRSGVHDSVRGGVHDSMRSGVHDSLRGGVHD-SIRGG-AHDGVRS 630
Query: 549 ARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSA 605
DG KS+ + + N + + ++ S S + + SR A
Sbjct: 631 GAHDGVRKSSGVKVEGMGNARAPPPNQGTHDGNFSRGKDSSDHVSSYKPYAYDSRKA 687
Score = 29.3 bits (65), Expect = 7.0
Identities = 47/219 (21%), Positives = 72/219 (32%), Gaps = 4/219 (1%)
Query: 369 RIQRL-RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEND 427
RI+RL RE E E + E E E E ++ E ER + ERE E D
Sbjct: 459 RIERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLERD 518
Query: 428 NEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAKSDRSAKS 487
++ + K S +G +S G N S + S +S
Sbjct: 519 RLEKARRNSYFLKGMENGLSAGGGPGDGPGVGAGVGAGVGTSDGRNHSGVRSGIHCSIQS 578
Query: 488 DRSARSDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASN- 546
S +S + R G S +S S G ++ VRS A
Sbjct: 579 SARGGVHDSVRSGVHDSVRGGVHDSMRSGVHDSLRGGVHDSIRGGAHDGVRS--GAHDGV 636
Query: 547 RSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKS 585
R + + + R+ + + N S + S + S
Sbjct: 637 RKSSGVKVEGMGNARAPPPNQGTHDGNFSRGKDSSDHVS 675
>gnl|CDD|240184 cd05384, SCP_PRY1_like, SCP_PRY1_like: SCP-like extracellular
protein domain, PRY1-like sub-family restricted to
fungi. The wider family of SCP containing proteins
includes plant pathogenesis-related protein 1 (PR-1),
CRISPs, mammalian cysteine-rich secretory proteins, and
allergen 5 from vespid venom. It has been proposed that
SCP domains may function as endopeptidases. PRY1 is a
yeast protein that is up-regulated in core ESCRT
mutants. This PRY1-like group also contains fruiting
body proteins SC7/14 from Schizophyllum commune.
Length = 129
Score = 28.4 bits (64), Expect = 5.8
Identities = 7/22 (31%), Positives = 12/22 (54%)
Query: 117 EEDILTPQDSKRSRHHARSVSW 138
IL +SKR+ H + ++W
Sbjct: 3 ASSILDAHNSKRALHGVQPLTW 24
>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
Length = 130
Score = 28.3 bits (63), Expect = 5.8
Identities = 12/40 (30%), Positives = 18/40 (45%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 418
EP E EE EE+ E + +G E E +E + +
Sbjct: 12 EPSEPEIEEGAEEDSESNNNDEVKGSIGENEDKEEQEPVQ 51
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 29.2 bits (66), Expect = 5.9
Identities = 13/51 (25%), Positives = 22/51 (43%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTS 433
+E E D E+EEE+ E ++ + E E E E +++ S
Sbjct: 125 KEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKAS 175
>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain. This is a
short domain found in bacterial type II/III secretory
system proteins. The architecture of these proteins
suggest that this family may be functionally analogous
to pfam03958.
Length = 95
Score = 27.7 bits (62), Expect = 5.9
Identities = 20/67 (29%), Positives = 32/67 (47%), Gaps = 13/67 (19%)
Query: 527 SDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSG 586
RS +SN SV S GS S+ + S S S++S +S++S+ + SG
Sbjct: 11 MKRSGSSNTSVTS-------------GSVSSSGSNSSSSSSNSSNGGSSSSSSSGDSSSG 57
Query: 587 SRSRSRS 593
+R + S
Sbjct: 58 TRITTES 64
>gnl|CDD|224304 COG1386, scpB, Chromosome segregation and condensation protein B
[DNA replication, recombination and repair].
Length = 184
Score = 28.8 bits (65), Expect = 6.1
Identities = 14/60 (23%), Positives = 21/60 (35%), Gaps = 8/60 (13%)
Query: 338 VRLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEA 397
+R RR G T + L+ F + L E P EE ++ EE+
Sbjct: 133 IREVGRRDTPGRP---YLYGTTEKFLD--YFGLDSLDE---LPDLEELKDAGLLSEEDLL 184
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 29.0 bits (65), Expect = 6.2
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 374 RERFLEPPNEEEEEEDEEEEEEEAEE 399
+ + LE + +EE+E +EEEEEE EE
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEE 184
>gnl|CDD|224532 COG1617, COG1617, Uncharacterized conserved protein [Function
unknown].
Length = 158
Score = 28.6 bits (64), Expect = 6.2
Identities = 18/73 (24%), Positives = 27/73 (36%), Gaps = 7/73 (9%)
Query: 334 LETRVRLSKRRQ------KVGAAPNNTRLV-VTHRPLNATEFRIQRLRERFLEPPNEEEE 386
+E +RL+ RQ GA +V V F+ LR+ LE
Sbjct: 71 IEILLRLAGTRQIKDAIKLGGAKKGENFIVAVLVDTPKMEGFKCSELRDFVLEEAEVLRI 130
Query: 387 EEDEEEEEEEAEE 399
++E EE E +
Sbjct: 131 TDEELEEAGEEDV 143
>gnl|CDD|214923 smart00937, PCRF, This domain is found in peptide chain release
factors.
Length = 116
Score = 27.8 bits (63), Expect = 6.3
Identities = 12/31 (38%), Positives = 15/31 (48%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
EEE++E E EEE EE + E E
Sbjct: 4 EEEEDDEMRELAEEELEELEAELEEELEELL 34
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 28.0 bits (62), Expect = 6.3
Identities = 23/55 (41%), Positives = 29/55 (52%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAE 436
N +E +DE EEEEE +E + E E E +EGE E G G+ E DAE
Sbjct: 42 NGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAE 96
Score = 27.2 bits (60), Expect = 9.6
Identities = 16/51 (31%), Positives = 26/51 (50%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNE 429
E +E EEEE+ +E++EE E + E E E + + + E+D E
Sbjct: 46 EGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAE 96
>gnl|CDD|234700 PRK00249, flgH, flagellar basal body L-ring protein; Reviewed.
Length = 222
Score = 29.1 bits (66), Expect = 6.4
Identities = 13/62 (20%), Positives = 29/62 (46%), Gaps = 1/62 (1%)
Query: 534 NKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRS 593
++ + KS ++NRS R ++ + + ++ +S+ S GS SRS +
Sbjct: 75 QENTSASKSNSTNRS-RDSSTNFGLPSLFGYLNGLDANASLETSSDNSFKGKGSASRSNT 133
Query: 594 VS 595
++
Sbjct: 134 LT 135
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 28.9 bits (64), Expect = 6.5
Identities = 14/35 (40%), Positives = 23/35 (65%)
Query: 370 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGE 404
+ +L+E E EE++ ++E+EEEE EE +Q E
Sbjct: 203 LSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE 237
>gnl|CDD|227824 COG5537, IRR1, Cohesin [Cell division and chromosome partitioning].
Length = 740
Score = 29.5 bits (66), Expect = 6.6
Identities = 12/48 (25%), Positives = 19/48 (39%)
Query: 425 ENDNEKRTSDAEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGS 472
E+D E + D + + R S + SK SP +S + S
Sbjct: 10 EDDYEDQDDDDYVDTATAKRKSRSRKPMSKENSPDGVLAKDMSSDQKS 57
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 28.9 bits (64), Expect = 7.2
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
ER E +E E++E++E E EE +Q + +E E E + E E
Sbjct: 145 ERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDESETE 189
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 27.7 bits (62), Expect = 7.6
Identities = 9/21 (42%), Positives = 15/21 (71%)
Query: 379 EPPNEEEEEEDEEEEEEEAEE 399
E ++EE +EEEEEE+++
Sbjct: 76 AAAAEAKKEEKKEEEEEESDD 96
>gnl|CDD|131764 TIGR02717, AcCoA-syn-alpha, acetyl coenzyme A synthetase (ADP
forming), alpha domain. Although technically
reversible, it is believed that this group of
ADP-dependent acetyl-CoA synthetases (ACS) act in the
direction of acetate and ATP production in the organisms
in which it has been characterized. In most species this
protein exists as a fused alpha-beta domain polypeptide.
In Pyrococcus and related species, however the domains
exist as separate polypeptides. This model represents
the alpha (N-terminal) domain. In Pyrococcus and related
species there appears to have been the development of a
paralogous family such that four other proteins are
close relatives. In reference , one of these (along with
its beta-domain partner) was characterized as ACS-II
showing specificity for phenylacetyl-CoA. This model has
been constructed to exclude these non-ACS-I paralogs.
This may result in new, authentic ACS-I sequences
falling below the trusted cutoff.
Length = 447
Score = 29.2 bits (66), Expect = 7.8
Identities = 11/20 (55%), Positives = 16/20 (80%)
Query: 605 ASKSGSSAASSRSGSPAGSD 624
S++G+ AASS +G+ AGSD
Sbjct: 240 TSEAGAKAASSHTGALAGSD 259
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 28.6 bits (64), Expect = 8.0
Identities = 15/48 (31%), Positives = 18/48 (37%)
Query: 365 ATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 412
E R + + E+ EE EEEEAEE E E E
Sbjct: 139 VNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPAEESAE 186
>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist. LIN-8
is a nuclear protein, present at the sites of
transcriptional repressor complexes, which interacts
with LIN-35 Rb.Lin35 Rb is a product of the class B
synMuv gene lin-35 which silences genes required for
vulval specification through chromatin modification and
remodelling. The biological role of the interaction has
not yet been determined however predictions have been
made. The interaction shows that class A synMuv genes
control vulval induction through the transcriptional
regulation of gene expression. LIN-8 normally functions
as part of a protein complex however when the complex is
absent, other family members can partially replace LIN-8
activity.
Length = 316
Score = 28.9 bits (65), Expect = 8.0
Identities = 19/100 (19%), Positives = 33/100 (33%), Gaps = 9/100 (9%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAE-------RQEGEGGEREGGGEGENDNEKRTSD 434
+ EE +E +E E EE + E R + N + S
Sbjct: 159 DMEENDEVDEVEVEEVPDDYGANLAVDEPEQSTMSRPQEVKQSVPAQQAPPNPQQPMPSA 218
Query: 435 AEHSSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNK 474
+ S+ S+++S SR S R + + + K
Sbjct: 219 SSESAT--SKSASTSRESSPQPQSPPPRRVPAPTVDSTAK 256
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 28.2 bits (63), Expect = 8.1
Identities = 13/35 (37%), Positives = 22/35 (62%)
Query: 385 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 419
++E+EE EEE +ET+Q + E + + E +RE
Sbjct: 94 SDDEEEETEEESTDETEQEDPPETKTESKEKKKRE 128
>gnl|CDD|223023 PHA03249, PHA03249, DNA packaging tegument protein UL25;
Provisional.
Length = 653
Score = 29.2 bits (65), Expect = 8.2
Identities = 26/135 (19%), Positives = 52/135 (38%), Gaps = 3/135 (2%)
Query: 493 SDRSDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSA---ASNKSVRSDKSAASNRSA 549
S D+ R R+ + ++ + S S+ ++S+ S S+ A R A
Sbjct: 23 SSLEDEVFLDPRPRAPTEDLDRMEAGLSSYSSSSDNKSSFEVVSETDSGSEAEAERGRRA 82
Query: 550 RSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSG 609
G +K+ R+++ + ++ + A+ + + S +S VS S S S G
Sbjct: 83 GMGGRNKATKPSRRNKTTQCRPTSLALATAATMPATPSSGKSPKVSSPPSIPSLSEEDEG 142
Query: 610 SSAASSRSGSPAGSD 624
+ S S +
Sbjct: 143 AERNSGGDDSSHTDN 157
>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
protein [Cell motility and secretion / Intracellular
trafficking and secretion].
Length = 234
Score = 28.5 bits (64), Expect = 8.3
Identities = 17/52 (32%), Positives = 23/52 (44%)
Query: 375 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEN 426
+ LE E+ E +EEE E A + Q E R+ E G + G EG
Sbjct: 34 KAVLEEEELEQALEAKEEELESAAQELQEGIEEGAREGYEEGFQLGYEEGFE 85
>gnl|CDD|235218 PRK04081, PRK04081, hypothetical protein; Provisional.
Length = 207
Score = 28.5 bits (64), Expect = 8.3
Identities = 21/65 (32%), Positives = 33/65 (50%), Gaps = 6/65 (9%)
Query: 546 NRSARSDGSDKSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSA 605
N + + KS RS+++ ++A++ S++S SS KSG S S S S S+
Sbjct: 149 NYQNQRQRTYKSPQAYQRSQNSFNKAASASSSSGSSSKKSGFFGGSSSAS------SSSS 202
Query: 606 SKSGS 610
S GS
Sbjct: 203 SSFGS 207
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 29.0 bits (65), Expect = 8.3
Identities = 18/57 (31%), Positives = 25/57 (43%), Gaps = 1/57 (1%)
Query: 382 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEND-NEKRTSDAEH 437
++E E E+ E EE ++GE E E E E E E G+ + E EH
Sbjct: 66 DDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTEASEQLEEH 122
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 28.5 bits (64), Expect = 8.4
Identities = 18/48 (37%), Positives = 26/48 (54%)
Query: 367 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
E R +R + +EEEE EEE E++ EE ++ E E R+E E
Sbjct: 26 EEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQE 73
>gnl|CDD|219038 pfam06459, RR_TM4-6, Ryanodine Receptor TM 4-6. This region covers
TM regions 4-6 of the ryanodine receptor 1 family.
Length = 217
Score = 28.5 bits (64), Expect = 8.5
Identities = 16/46 (34%), Positives = 21/46 (45%), Gaps = 1/46 (2%)
Query: 380 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQE-GEGGEREGGGEG 424
P E EE + E + ET +GE E G ++EGGG G
Sbjct: 15 PEEEGAEEVEPEGSGRDLPETGEGEEEEVGVDAFGLDMKKEGGGYG 60
>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
UL32. The large phosphorylated protein (UL32-like) of
herpes viruses is the polypeptide most frequently
reactive in immuno-blotting analyses with antisera when
compared with other viral proteins.
Length = 777
Score = 29.1 bits (65), Expect = 8.8
Identities = 35/254 (13%), Positives = 74/254 (29%), Gaps = 19/254 (7%)
Query: 378 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEH 437
LE + E DE+++++ Q + + E G+ ++ + A
Sbjct: 275 LEYDDPGLESTDEDDDDDGDSSLQTFKPLLDLTGSSLWSDDEESGD---EDGDGSGFAPE 331
Query: 438 SSKNSSRASSRSRSRSKSKSPAKRSRSGSASSRGSNKSDRSAK--SDRSAKSDRSARSDR 495
+ S+ + + + S A S + + + ++
Sbjct: 332 PLIKTDSRSNDTLVDLGRGGGSLKLDSVDAPGTSSYLFEPGLSPSPNSGKEMPGILTTEN 391
Query: 496 SDKSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSD 555
D AS S + + + + + +V ++S A + +D D
Sbjct: 392 LDLPLASTDSTEMDPEDKRGGAVKINNSGILAWGLKTPGLAVNDERSIAVSSDGITDVLD 451
Query: 556 -----KSASNRSRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGS 610
+ S+ S S + +A SR R + +S S S
Sbjct: 452 PPSPLRLHSSDKVIDSVSPPSKRRVSAPA---------SRLDDAKRPEVTATPESSGSDS 502
Query: 611 SAASSRSGSPAGSD 624
+S SD
Sbjct: 503 EGGASGREDETSSD 516
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.0 bits (65), Expect = 8.8
Identities = 8/45 (17%), Positives = 19/45 (42%)
Query: 370 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 414
+ + + EE E + +EE+ +E + + E ++ E
Sbjct: 381 YTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKE 425
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 29.3 bits (65), Expect = 9.0
Identities = 29/123 (23%), Positives = 51/123 (41%), Gaps = 9/123 (7%)
Query: 502 SNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNR 561
S R+R + + RS R + ++ R++ +N+ RS S R++ S S S ++
Sbjct: 4 SMRTRRSRGSMSTLRSGRKKQTASPDGRASPTNEDQRS-----SGRNSPSAASTSSNDSK 58
Query: 562 SRSRSASSQASNNSAASNRSSNKSGSRSRSRSVSRARSNRSRSASKSGSSAASSRSGSPA 621
+ S ++ A S KS R R + S R +K + SR SP+
Sbjct: 59 AESTKKPNKKIKEEAT---SPLKSTKRQREKPASDTEEP-ERVTAKKSKTQELSRPNSPS 114
Query: 622 GSD 624
+
Sbjct: 115 EGE 117
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 28.6 bits (64), Expect = 9.1
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 379 EPPNEEEEEEDEEEEEEEAEETQ 401
E +E+EEE+E EEEEE E+
Sbjct: 262 EKELKEDEEEEETEEEEEEEDED 284
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 29.2 bits (65), Expect = 9.1
Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 5/80 (6%)
Query: 377 FLEPPNEEEEEEDEEE--EEEEAEETQQGETTEAERQEGEGGEREGGGEGEND--NEKRT 432
FL +++E +E EEE E E + + + ET E E + E E E END +E+
Sbjct: 922 FLMVGSDDESDESEEEVSEYEASSDDESDETDEDEESD-ESSEDLSEDESENDSSDEEDG 980
Query: 433 SDAEHSSKNSSRASSRSRSR 452
D + ++ S + R
Sbjct: 981 EDWDELESKAAYDSRPGKRR 1000
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 28.9 bits (65), Expect = 9.4
Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 7/89 (7%)
Query: 355 RLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEE-EEAEETQQGETT------E 407
RL H L A E R+ +E E N ++ E++ +EA + E +
Sbjct: 264 RLQHLHGELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLA 323
Query: 408 AERQEGEGGEREGGGEGENDNEKRTSDAE 436
A+R E E E + G + D E+ +D +
Sbjct: 324 ADRSELELLEDQKGAFEDADIEQLQADLD 352
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 29.1 bits (65), Expect = 9.5
Identities = 27/142 (19%), Positives = 55/142 (38%), Gaps = 3/142 (2%)
Query: 440 KNSSRASSRSRSRSKSKSP--AKRSRSGSASSRGSNKSDRSAKSDRSAKSDRSARSDRSD 497
N++ + +S+ K+K+ A G SS +NK ++ S+ +S RS
Sbjct: 489 ANNNSQNIKSKKVVKAKTNNKANLQDVGECSSPPNNKEKNDKQTSTSSSVLKSDRSSIEV 548
Query: 498 KSAASNRSRSGSAASNKSRSDRSRSGSAGSDRSAASNKSVRSDKSAAS-NRSARSDGSDK 556
++ +N + N S++ + + S +V S S N + S+ +
Sbjct: 549 RNPNANVKKLEDTTYNAKFPTVSKNNAYTLVDISTSEDAVNSADDTRSVNITDISESTTI 608
Query: 557 SASNRSRSRSASSQASNNSAAS 578
S + S +S+ +S
Sbjct: 609 SGLSSKFPSSFTSKLQEQIYSS 630
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 28.6 bits (65), Expect = 9.6
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 383 EEEEEEDEEEEEEEAEETQQGETTEAERQE 412
E E+EEEE EE EE ++ E E E
Sbjct: 229 LAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
>gnl|CDD|218190 pfam04651, Pox_A12, Poxvirus A12 protein.
Length = 188
Score = 28.2 bits (63), Expect = 9.8
Identities = 17/55 (30%), Positives = 23/55 (41%)
Query: 525 AGSDRSAASNKSVRSDKSAASNRSARSDGSDKSASNRSRSRSASSQASNNSAASN 579
SD + K V S S+ S R + S K S SRSRS + + + S
Sbjct: 49 NPSDNEVKAGKRVTSASSSKSKRCSTSTSKTKPCSRSSRSRSGAPRRRGTAFGSM 103
>gnl|CDD|239922 cd04476, RPA1_DBD_C, RPA1_DBD_C: A subfamily of OB folds
corresponding to the C-terminal OB fold, the
ssDNA-binding domain (DBD)-C, of human RPA1 (also called
RPA70). RPA1 is the large subunit of Replication protein
A (RPA). RPA is a nuclear ssDNA-binding protein (SSB)
which appears to be involved in all aspects of DNA
metabolism including replication, recombination, and
repair. RPA also mediates specific interactions of
various nuclear proteins. In animals, plants, and fungi,
RPA is a heterotrimer with subunits of 70KDa (RPA1),
32kDa (RPA2), and 14 KDa (RPA3). In addition to DBD-C,
RPA1 contains three other OB folds: DBD-A, DBD-B, and
RPA1N. The major DNA binding activity of RPA is
associated with RPA1 DBD-A and DBD-B. RPA1 DBD-C is
involved in DNA binding and trimerization. It contains
two structural insertions not found to date in other
OB-folds: a zinc ribbon and a three-helix bundle. RPA1
DBD-C also contains a Cys4-type zinc-binding motif,
which plays a role in the ssDNA binding function of this
domain. It appears that zinc itself may not be required
for ssDNA binding.
Length = 166
Score = 28.0 bits (63), Expect = 9.9
Identities = 16/73 (21%), Positives = 29/73 (39%), Gaps = 9/73 (12%)
Query: 258 VMDESGEQFVAYFLPTAETLDKRKLDQLSNLEYTDEQEYEYKMA----REYNWNVKSKSC 313
V D +GE ++ F AE + + ++L L+ D + + + + + V K
Sbjct: 74 VADHTGEAWLTLFDEVAEQIFGKSAEELLELKEEDPDAFPDAIQDLVGKTFLFRVSVK-- 131
Query: 314 KGYEENYFLIVRD 326
EE Y R
Sbjct: 132 ---EETYNDEGRI 141
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 28.4 bits (63), Expect = 9.9
Identities = 13/84 (15%), Positives = 34/84 (40%), Gaps = 2/84 (2%)
Query: 384 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGENDNEKRTSDAEHSSKNSS 443
++EE+ +E + E+ +RQE + ++ + + T+ ++ K
Sbjct: 120 KQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQY--KQDM 177
Query: 444 RASSRSRSRSKSKSPAKRSRSGSA 467
+ + + SK P +++ A
Sbjct: 178 KEKASEQENEDSKEPVEKAERTKA 201
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.305 0.121 0.326
Gapped
Lambda K H
0.267 0.0857 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 29,379,777
Number of extensions: 2803350
Number of successful extensions: 10457
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6918
Number of HSP's successfully gapped: 1064
Length of query: 624
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 521
Effective length of database: 6,369,140
Effective search space: 3318321940
Effective search space used: 3318321940
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 62 (27.4 bits)