RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8411
(385 letters)
>gnl|CDD|215825 pfam00262, Calreticulin, Calreticulin family.
Length = 359
Score = 423 bits (1089), Expect = e-148
Identities = 178/352 (50%), Positives = 219/352 (62%), Gaps = 48/352 (13%)
Query: 8 YQNAWEKEWVYSQHPGKEFGKFVLSAGKFYNDAEKDKGIQTSQDARFYALSKKF-TPFSN 66
+ WEK W+ S+ + GK+ + AGKFY EKDKG+ T DA+FYA+S KF PFSN
Sbjct: 9 DDDLWEKRWIPSKAKKDDDGKWKVEAGKFYGGEEKDKGLVTKDDAKFYAISAKFPKPFSN 68
Query: 67 KDKDLVIQFTVKHEQNIDCGGGYVKVFDCSLEQTNMHGESPYLIMFGPDICGPGTKKVHV 126
K K LV+Q+ VK +Q IDCGG Y+K+ +Q + GE+PY IMFGPDICG TKKVH
Sbjct: 69 KGKTLVVQYEVKLQQGIDCGGAYIKLLSKDFDQKDFSGETPYTIMFGPDICGSDTKKVHF 128
Query: 127 IFN------YKGKNLLINKDIRCKDDVFTHLYTLIVKPDNTYDVLIDNESVQNGNLEDDW 180
IF YK + LI K I D THLYTLI++PDNT+++LID E V++G+L +D+
Sbjct: 129 IFRHKPITGYKEEKHLIKKPISRCDK-LTHLYTLIIRPDNTFEILIDGEVVKSGSLLEDF 187
Query: 181 D--FLPPKKIKDPEAKKPEDWDDRPTIADPNDTKPEDWEK--PEHIPNPDAVKPA----- 231
D F PPK+I DP KKPEDWDDR I DP KPEDW++ PE IP+P+AVKP
Sbjct: 188 DPPFNPPKEIPDPTDKKPEDWDDREKIPDPEAKKPEDWDEDEPEFIPDPNAVKPEGWLED 247
Query: 232 --------------DWDDEMDGEWEAPMIDNP-----------------EYKGVWAPKQI 260
DWDDE DGEWEAPMI NP +YKG W P I
Sbjct: 248 EPEYIPDPDATKPEDWDDEEDGEWEAPMIPNPKCEKACGCGKPPMIKNPKYKGKWKPPMI 307
Query: 261 ENPDYKGVWVHPEIDNPEYSPDPELYYKPEICTIGFDLWQVKSGTIFDNVLI 312
+NP+YKGVW +I NP+Y DP Y I IGF+LWQVKSG +FDN+LI
Sbjct: 308 DNPNYKGVWKPRKIPNPDYFEDPNPYNLEPIGAIGFELWQVKSGILFDNILI 359
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 40.8 bits (95), Expect = 0.001
Identities = 22/113 (19%), Positives = 39/113 (34%), Gaps = 7/113 (6%)
Query: 179 DWDFLPPKKIKDPE-AKKPEDWDDRPTIADPNDTKPEDWEKPEHIPNPDAVKPADWDDEM 237
W +P + +P A PE + + A P E ++P + +
Sbjct: 359 AWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQP--VQPQQPYYAPAAEQPA 416
Query: 238 DGEWEAPMIDNPEYKGVWAPKQIENPDYKGVWVHPEIDNPEYSPDPELYYKPE 290
+ AP + P + +AP E P W E + + P+ Y+ E
Sbjct: 417 QQPYYAPAPEQPAQQPYYAP-APEQPVAGNAWQAEE---QQSTFAPQSTYQTE 465
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 39.1 bits (92), Expect = 0.003
Identities = 19/69 (27%), Positives = 42/69 (60%)
Query: 316 IATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKA 375
+ +KK + I K + EKK +E++ E+++KA K+++++E+ ++ E+E+E +
Sbjct: 402 LTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Query: 376 EEHEHDHDE 384
E E + +E
Sbjct: 462 AEEEKEEEE 470
Score = 37.2 bits (87), Expect = 0.011
Identities = 16/69 (23%), Positives = 37/69 (53%)
Query: 311 LITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEE 370
+T KK + + K + E++ KEK+ + ++ ++++K++ ++E E+EEE
Sbjct: 401 FLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460
Query: 371 TTGKAEEHE 379
+ +E E
Sbjct: 461 EAEEEKEEE 469
Score = 30.7 bits (70), Expect = 1.4
Identities = 17/65 (26%), Positives = 36/65 (55%)
Query: 310 VLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEE 369
I + A+K E+ K + K+K++EEE + E+ + ++++E+ +E+++EE
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469
Query: 370 ETTGK 374
E K
Sbjct: 470 EEKKK 474
>gnl|CDD|217940 pfam04177, TAP42, TAP42-like family. The TOR signalling pathway
activates a cell-growth program in response to
nutrients. TIP41 (pfam04176) interacts with TAP42 and
negatively regulates the TOR signaling pathway.
Length = 335
Score = 37.7 bits (88), Expect = 0.009
Identities = 14/38 (36%), Positives = 24/38 (63%)
Query: 333 GEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEE 370
E++MK + A+A++++ DEDD+ED+DEE
Sbjct: 284 LEQEMKRGGVPQGGGEAAASAEEEEDDEDDEEDDDEET 321
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 36.3 bits (83), Expect = 0.032
Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 4/63 (6%)
Query: 319 AKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDE----EETTGK 374
AKK A+ K A E +K + + E E A+EA A ++K + + + E+ + K
Sbjct: 1327 AKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386
Query: 375 AEE 377
AEE
Sbjct: 1387 AEE 1389
Score = 34.3 bits (78), Expect = 0.14
Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMK-EKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKA 375
A K AE+ K A E +K + +K+ EE +KA+EA K ++ + DE + E KA
Sbjct: 1453 AEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA 1512
Query: 376 EE 377
+E
Sbjct: 1513 DE 1514
Score = 33.2 bits (75), Expect = 0.27
Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 4/70 (5%)
Query: 319 AKKYAEDIWKPAFEGEKKMKEKQDEEE-RKAEEAAAKDKKKDEDDDEDEDE---EETTGK 374
AKK AE+ K A +KK +E + E KAE AA D+ + ++ + E EE K
Sbjct: 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Query: 375 AEEHEHDHDE 384
A+ + +E
Sbjct: 1380 ADAAKKKAEE 1389
Score = 33.2 bits (75), Expect = 0.31
Identities = 24/73 (32%), Positives = 33/73 (45%), Gaps = 4/73 (5%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKE--KQDEEERKAEEAAAK--DKKKDEDDDEDEDEEETT 372
A K A++ K A E +KK K+ EE +KA EAA + DE + +E E
Sbjct: 1311 AEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAE 1370
Query: 373 GKAEEHEHDHDEL 385
K EE + D
Sbjct: 1371 KKKEEAKKKADAA 1383
Score = 32.4 bits (73), Expect = 0.54
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 319 AKKYAEDIWKPAFEGEKKMKEKQDEEE-RKAEEAAAKDKKKDEDDDEDEDE---EETTGK 374
AKK AE+ K A E +K + K+ +E +KAEEA D+ K ++ + DE E K
Sbjct: 1488 AKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKK 1547
Query: 375 AEE 377
A+E
Sbjct: 1548 ADE 1550
Score = 32.4 bits (73), Expect = 0.56
Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 2/73 (2%)
Query: 314 DDIATAKKYAEDIWKPAFEGEKKMKE--KQDEEERKAEEAAAKDKKKDEDDDEDEDEEET 371
D+ A++ AE K E +KK K+ EE++KA+EA K ++ + DE +
Sbjct: 1357 DEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAA 1416
Query: 372 TGKAEEHEHDHDE 384
KA+E + +E
Sbjct: 1417 KKKADEAKKKAEE 1429
Score = 32.0 bits (72), Expect = 0.60
Identities = 24/70 (34%), Positives = 38/70 (54%), Gaps = 3/70 (4%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKE--KQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGK 374
A KK A++ K A E +KK E K ++KA+EA K ++K + D+ + EE K
Sbjct: 1387 AEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAK-K 1445
Query: 375 AEEHEHDHDE 384
A+E + +E
Sbjct: 1446 ADEAKKKAEE 1455
Score = 32.0 bits (72), Expect = 0.61
Identities = 19/65 (29%), Positives = 37/65 (56%), Gaps = 1/65 (1%)
Query: 314 DDIATAK-KYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETT 372
+ A A+ + A D + A E + ++K++E ++KA+ A K ++K + D+ + EE
Sbjct: 1345 AEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDK 1404
Query: 373 GKAEE 377
KA+E
Sbjct: 1405 KKADE 1409
Score = 32.0 bits (72), Expect = 0.63
Identities = 22/66 (33%), Positives = 34/66 (51%), Gaps = 5/66 (7%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKE--KQDEEERKAEEAAAKDKKKDEDD---DEDEDEEET 371
A K A++ K A E +KK E K E ++KA+EA ++ K D+ E+ + +
Sbjct: 1479 AEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE 1538
Query: 372 TGKAEE 377
KAEE
Sbjct: 1539 AKKAEE 1544
Score = 32.0 bits (72), Expect = 0.67
Identities = 21/62 (33%), Positives = 35/62 (56%), Gaps = 3/62 (4%)
Query: 319 AKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDD---DEDEDEEETTGKA 375
AKK A++ K A +K + K+ EE +KA+EA ++ K D+ E++ + + KA
Sbjct: 1495 AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKA 1554
Query: 376 EE 377
EE
Sbjct: 1555 EE 1556
Score = 32.0 bits (72), Expect = 0.74
Identities = 22/71 (30%), Positives = 37/71 (52%), Gaps = 3/71 (4%)
Query: 317 ATAKKYAEDIWKPAFEGEKK---MKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTG 373
A KK AE++ K E + K +K +E+++KAEEA ++ + + + + E E
Sbjct: 1643 AEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702
Query: 374 KAEEHEHDHDE 384
KAEE + E
Sbjct: 1703 KAEELKKKEAE 1713
Score = 31.6 bits (71), Expect = 0.82
Identities = 20/68 (29%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAE 376
A + A+++ K A E +K + K+ EE++KA+EA K ++ + D+ + EE KA+
Sbjct: 1274 AEEARKADEL-KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKAD 1332
Query: 377 EHEHDHDE 384
+ +E
Sbjct: 1333 AAKKKAEE 1340
Score = 31.3 bits (70), Expect = 1.3
Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 7/68 (10%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMK-EKQDEEERKAEEA--AAKDKKKDEDDDEDEDE----E 369
A AKK A++ K A E +K + +K+ EE +KA+EA A++ KK E+ + +E +
Sbjct: 1414 AAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD 1473
Query: 370 ETTGKAEE 377
E KAEE
Sbjct: 1474 EAKKKAEE 1481
Score = 30.5 bits (68), Expect = 2.1
Identities = 22/67 (32%), Positives = 36/67 (53%), Gaps = 6/67 (8%)
Query: 317 ATAKKYAEDIWKPAFEGEK--KMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEE----E 370
A K A++ K A E +K + K+K +E ++KA+EA + K + D+ + EE +
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKAD 1525
Query: 371 TTGKAEE 377
KAEE
Sbjct: 1526 EAKKAEE 1532
Score = 30.1 bits (67), Expect = 2.7
Identities = 22/68 (32%), Positives = 35/68 (51%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAE 376
A KK A++ K A E +K + K+ EE K E A K ++ + DE + + E KA+
Sbjct: 1427 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKAD 1486
Query: 377 EHEHDHDE 384
E + +E
Sbjct: 1487 EAKKKAEE 1494
Score = 29.7 bits (66), Expect = 3.0
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 3/61 (4%)
Query: 319 AKKYAEDIWKPAFEGEKKMKE--KQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAE 376
AKK AE+ K A E +KK +E K+ +E +KA A K + + +E + +E KAE
Sbjct: 1383 AKKKAEEK-KKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAE 1441
Query: 377 E 377
E
Sbjct: 1442 E 1442
Score = 29.7 bits (66), Expect = 3.6
Identities = 24/67 (35%), Positives = 39/67 (58%), Gaps = 4/67 (5%)
Query: 320 KKYAEDIWKPAFEGEKKMKE--KQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
KK A+++ K A +KK E K+ EE++KA+EA K ++ + D+ + EE KAEE
Sbjct: 1404 KKKADEL-KKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK-KAEE 1461
Query: 378 HEHDHDE 384
+ +E
Sbjct: 1462 AKKKAEE 1468
Score = 29.3 bits (65), Expect = 4.6
Identities = 23/65 (35%), Positives = 37/65 (56%), Gaps = 3/65 (4%)
Query: 323 AEDIWKPAFEGEKKMKE--KQDEEERKAEEAAAKDKKKDEDDDE-DEDEEETTGKAEEHE 379
A + K A E +KK +E K +E+E+KA EA K+ ++ + +E + E E KAEE +
Sbjct: 1663 AAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELK 1722
Query: 380 HDHDE 384
+E
Sbjct: 1723 KAEEE 1727
Score = 29.3 bits (65), Expect = 4.9
Identities = 23/68 (33%), Positives = 32/68 (47%), Gaps = 3/68 (4%)
Query: 320 KKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDED---EEETTGKAE 376
KK E + K E +KK +E + EE +AA + KK +ED + E+ EE KA
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAA 1691
Query: 377 EHEHDHDE 384
E E
Sbjct: 1692 EALKKEAE 1699
Score = 28.6 bits (63), Expect = 6.8
Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 4/58 (6%)
Query: 331 FEGEKKMK----EKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
+E EKKMK +K +E + KAEE +++K + + + E E KAEE + +E
Sbjct: 1601 YEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEE 1658
Score = 28.6 bits (63), Expect = 7.0
Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 4/64 (6%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDE---EETTG 373
A AKK A++ K A E +K + K+ EE +KA+EA ++KK D+ + +E E
Sbjct: 1506 AEAKKKADEA-KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKK 1564
Query: 374 KAEE 377
KAEE
Sbjct: 1565 KAEE 1568
Score = 28.6 bits (63), Expect = 7.1
Identities = 31/77 (40%), Positives = 42/77 (54%), Gaps = 12/77 (15%)
Query: 320 KKYAEDIWKPAFEGEKKMKE---KQDEEERKAEE---AAAKDKKKDEDDDEDEDE----- 368
KK AE+ K A E EKK E K+ EE +KAEE A++KKK E+ + E+E
Sbjct: 1674 KKKAEEA-KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKA 1732
Query: 369 EETTGKAEEHEHDHDEL 385
EE +AEE + +E
Sbjct: 1733 EEAKKEAEEDKKKAEEA 1749
Score = 28.6 bits (63), Expect = 8.3
Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 6/74 (8%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKEKQDEEE---RKAEEA---AAKDKKKDEDDDEDEDEEE 370
A K AE++ K E +KK +E + EE KAEEA A +DKKK E+ +DE+E++
Sbjct: 1698 AEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757
Query: 371 TTGKAEEHEHDHDE 384
++ E E
Sbjct: 1758 KIAHLKKEEEKKAE 1771
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 36.0 bits (83), Expect = 0.033
Identities = 17/52 (32%), Positives = 33/52 (63%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHD 383
E E+ ++E+++EEE + EE ++++ ED++E+E+ E G EE E +
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSE 491
Score = 33.7 bits (77), Expect = 0.15
Identities = 15/54 (27%), Positives = 31/54 (57%)
Query: 328 KPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHD 381
+ E E++ +++EEE + EE ++ +++E +DE+E+EE E E +
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 31.8 bits (72), Expect = 0.72
Identities = 15/56 (26%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 329 PAFEGEKKMKEKQDEEERKAEEAAAK-DKKKDEDDDEDEDEEETTGKAEEHEHDHD 383
P+ ++ +E+ EEE + EE + +++ +E++ EDE+EEE E + +
Sbjct: 432 PSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 29.9 bits (67), Expect = 3.0
Identities = 16/52 (30%), Positives = 29/52 (55%), Gaps = 7/52 (13%)
Query: 333 GEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
GE Q+ EE ++ E ++E+++E+E+EEE + EE E + +E
Sbjct: 429 GESPSMASQESEEEESVE-------EEEEEEEEEEEEEQESEEEEGEDEEEE 473
Score = 28.7 bits (64), Expect = 6.3
Identities = 12/44 (27%), Positives = 27/44 (61%)
Query: 342 DEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDEL 385
E A + + +++ +E+++E+E+EEE ++EE E + +E
Sbjct: 429 GESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEE 472
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 36.1 bits (83), Expect = 0.037
Identities = 21/87 (24%), Positives = 47/87 (54%), Gaps = 6/87 (6%)
Query: 295 GFDLWQVKSGTIFDNVLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAK 354
D + +S T D+ + D+ + AE+ +GE K EK + ++ ++
Sbjct: 811 EKDEHEGQSETQADDTEVKDETGEQELNAEN------QGEAKQDEKGVDGGGGSDGGDSE 864
Query: 355 DKKKDEDDDEDEDEEETTGKAEEHEHD 381
+++++E+++E+E+EEE + EE E++
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 34.6 bits (79), Expect = 0.10
Identities = 16/48 (33%), Positives = 25/48 (52%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHE 379
E ++ +E +DE E +AE + + D + E E E E GK +E E
Sbjct: 733 ETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDE 780
Score = 33.8 bits (77), Expect = 0.18
Identities = 14/51 (27%), Positives = 28/51 (54%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
E + + KQDE+ + ++E+++E+E+EEE + EE E + +
Sbjct: 840 ENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEEN 890
Score = 33.0 bits (75), Expect = 0.26
Identities = 13/63 (20%), Positives = 24/63 (38%), Gaps = 6/63 (9%)
Query: 328 KPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDD------DEDEDEEETTGKAEEHEHD 381
+ E E + E+ E + E K + + E + E E E E K +H+ +
Sbjct: 650 ERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGE 709
Query: 382 HDE 384
+
Sbjct: 710 TEA 712
Score = 32.7 bits (74), Expect = 0.39
Identities = 15/53 (28%), Positives = 27/53 (50%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
GE+++ + E ++ E+ D D E+E+EEE + EE E + +E
Sbjct: 832 TGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEE 884
Score = 30.7 bits (69), Expect = 1.6
Identities = 11/38 (28%), Positives = 26/38 (68%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEE 369
G + ++EEE + EE ++++++E+++E+E+EE
Sbjct: 855 GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 30.3 bits (68), Expect = 1.8
Identities = 14/54 (25%), Positives = 24/54 (44%)
Query: 324 EDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
D+ K + E+ EE + EA ++ ++ + E E E ET G+ E
Sbjct: 628 GDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENES 681
Score = 29.6 bits (66), Expect = 4.1
Identities = 21/53 (39%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
EGE + KE + E +AEE + + + E EDE E ET + EE E D E
Sbjct: 696 EGEIEAKEADHKGETEAEEVEHEGETEAEGT-EDEGEIETGEEGEEVE-DEGE 746
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 35.6 bits (82), Expect = 0.038
Identities = 29/70 (41%), Positives = 35/70 (50%), Gaps = 7/70 (10%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKE---KQDEEERKAEEAAAKDKKKDED----DDEDEDEE 369
A AK+ AE K E EKK KE KQ EEE KA+ AA KK E + E + +
Sbjct: 124 AKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183
Query: 370 ETTGKAEEHE 379
E KA+ E
Sbjct: 184 EAKAKAKAEE 193
Score = 34.8 bits (80), Expect = 0.072
Identities = 22/68 (32%), Positives = 31/68 (45%), Gaps = 9/68 (13%)
Query: 319 AKKYAEDIWKPAFEGEK---KMKEKQDEEERKAEEAAAKDK------KKDEDDDEDEDEE 369
A K AE K A E +K + K KQ E + EA A+ K K+ E++ + +
Sbjct: 103 AAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAA 162
Query: 370 ETTGKAEE 377
E KA E
Sbjct: 163 EAKKKAAE 170
Score = 29.8 bits (67), Expect = 2.1
Identities = 23/63 (36%), Positives = 26/63 (41%), Gaps = 3/63 (4%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKE---KQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTG 373
A AKK A + K A K E K EE KA+ AAK K E + E E
Sbjct: 162 AEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAA 221
Query: 374 KAE 376
AE
Sbjct: 222 AAE 224
Score = 29.4 bits (66), Expect = 3.4
Identities = 20/63 (31%), Positives = 28/63 (44%), Gaps = 4/63 (6%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEA---AAKDKKKDEDDDEDEDEEETTG 373
A A+K A++ K E E K + E ++KA EA A + K + EE
Sbjct: 138 AEAEKKAKEEAKKQAEEEAK-AKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKA 196
Query: 374 KAE 376
KAE
Sbjct: 197 KAE 199
Score = 29.0 bits (65), Expect = 4.0
Identities = 20/59 (33%), Positives = 31/59 (52%)
Query: 319 AKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
+ AE K A + K+ +EKQ + E + AA+ K K E + E + +EE +AEE
Sbjct: 96 QRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEE 154
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 34.9 bits (80), Expect = 0.059
Identities = 21/61 (34%), Positives = 34/61 (55%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAE 376
A AKK AE+ K A E + K + +++ +AE AA +K K E + + + E++ AE
Sbjct: 167 AEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAE 226
Query: 377 E 377
E
Sbjct: 227 E 227
Score = 31.1 bits (70), Expect = 0.89
Identities = 18/50 (36%), Positives = 26/50 (52%)
Query: 328 KPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
K A E EK+ + +Q ++E +A +AAA+ KKK E E KA
Sbjct: 118 KQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAA 167
Score = 28.8 bits (64), Expect = 5.7
Identities = 19/78 (24%), Positives = 30/78 (38%), Gaps = 15/78 (19%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKEKQD----EEERKAEEAAAKDKKKDEDDDEDED----- 367
+K AE+ K A +K+ +E+ E+++KAE A AK + +
Sbjct: 114 QEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKA 173
Query: 368 ------EEETTGKAEEHE 379
EE KAE
Sbjct: 174 EEAAKAAEEAKAKAEAAA 191
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 33.9 bits (78), Expect = 0.062
Identities = 19/64 (29%), Positives = 37/64 (57%)
Query: 314 DDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTG 373
++I KK E+ K ++ +K K+K ++++K ++ K +KKDE + ED+ E+ T
Sbjct: 69 EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKS 128
Query: 374 KAEE 377
+E
Sbjct: 129 YSET 132
Score = 32.0 bits (73), Expect = 0.35
Identities = 12/51 (23%), Positives = 32/51 (62%)
Query: 335 KKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDEL 385
+K+K++ +E+++ + KKKD+D D+ +D+++ + ++ + D+L
Sbjct: 72 EKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 34.6 bits (79), Expect = 0.095
Identities = 14/51 (27%), Positives = 30/51 (58%)
Query: 335 KKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDEL 385
+K+ ++ +EE + E+ +D D+D+DED+D+++ E+ + D L
Sbjct: 39 RKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTL 89
Score = 32.3 bits (73), Expect = 0.47
Identities = 18/49 (36%), Positives = 29/49 (59%)
Query: 335 KKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHD 383
K+ +E+ EEE EE D +DEDDD+D+D+E+ + ++ HD
Sbjct: 43 KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHD 91
Score = 31.9 bits (72), Expect = 0.63
Identities = 20/85 (23%), Positives = 41/85 (48%), Gaps = 4/85 (4%)
Query: 297 DLWQVKSGTIFDNVLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDK 356
D W S + + + D + I K E E++ E++D++E ++ +D+
Sbjct: 13 DDWVRGSLDYKEKLTLNDTMKKENA----IRKLGKEAEEEAMEEEDDDEEDDDDDDDEDE 68
Query: 357 KKDEDDDEDEDEEETTGKAEEHEHD 381
D+DDD+++DE+E + H+
Sbjct: 69 DDDDDDDDEDDEDEDDDDSTLHDDS 93
Score = 31.9 bits (72), Expect = 0.67
Identities = 13/51 (25%), Positives = 30/51 (58%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
E +++ E E +A E D++ D+DDD+++++++ E+ E + D+
Sbjct: 35 ENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDD 85
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 33.7 bits (78), Expect = 0.14
Identities = 13/52 (25%), Positives = 26/52 (50%), Gaps = 3/52 (5%)
Query: 322 YAEDIWKPAFEGEKKMKEKQ---DEEERKAEEAAAKDKKKDEDDDEDEDEEE 370
+ E+ + E + ++ + D+ R+ E ++ D DDD DED++E
Sbjct: 331 FIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDE 382
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 32.9 bits (76), Expect = 0.16
Identities = 11/43 (25%), Positives = 20/43 (46%)
Query: 339 EKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHD 381
A E+ ++ ++DDD+DEDE+E + + D
Sbjct: 34 AAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPD 76
>gnl|CDD|220098 pfam09057, Smac_DIABLO, Second Mitochondria-derived Activator of
Caspases. Second Mitochondria-derived Activator of
Caspases promotes apoptosis by activating caspases in
the cytochrome c/Apaf-1/caspase-9 pathway, and by
opposing the inhibitory activity of inhibitor of
apoptosis proteins (XIAP-BIR3). The protein assumes an
elongated three-helix bundle structure, and forms a
dimer in solution.
Length = 234
Score = 33.3 bits (76), Expect = 0.16
Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 316 IATAKKYAEDIWKPAFEGEKKMKEKQDEE-ERKAEEAAAKDKKKDEDDDE 364
+ A+ E++ + + E EKK+ E + EE +R AE A++ D + ED E
Sbjct: 179 LQVAQSQVEEVRQLSKEAEKKLAESKAEEIQRMAEYASSIDLSELEDIPE 228
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 33.3 bits (76), Expect = 0.18
Identities = 12/55 (21%), Positives = 29/55 (52%), Gaps = 2/55 (3%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEET 371
A ++ AE + + E++ ++ +E++ EE +D +++ ED ++ ET
Sbjct: 219 AVERRLAELVEMINWNLEERRRD--LRKEQELEENVERDSDDEDEHGEDSEDGET 271
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 33.4 bits (77), Expect = 0.20
Identities = 15/51 (29%), Positives = 30/51 (58%), Gaps = 3/51 (5%)
Query: 337 MKEKQDEE---ERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
+EE E + ++A ++ D+DD+E ++E+E + K+E+ E D D+
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDD 152
Score = 30.7 bits (70), Expect = 1.3
Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 1/47 (2%)
Query: 339 EKQDE-EERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
E QDE A +D DE+ DE+++E + E+ + D D+
Sbjct: 109 ELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDD 155
Score = 30.7 bits (70), Expect = 1.5
Identities = 11/33 (33%), Positives = 23/33 (69%)
Query: 342 DEEERKAEEAAAKDKKKDEDDDEDEDEEETTGK 374
DEE + +E ++K + ++DDD+D+D++ T +
Sbjct: 130 DEESDEEDEESSKSEDDEDDDDDDDDDDIATRE 162
Score = 30.3 bits (69), Expect = 2.0
Identities = 14/63 (22%), Positives = 32/63 (50%), Gaps = 1/63 (1%)
Query: 309 NVLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDE 368
+ DI + E + A+ ++ + +E + + EE++ + +D+DDD+D+D+
Sbjct: 99 VIFDDGDIEEELQ-DEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDD 157
Query: 369 EET 371
T
Sbjct: 158 IAT 160
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 33.3 bits (76), Expect = 0.20
Identities = 13/50 (26%), Positives = 27/50 (54%)
Query: 335 KKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
K K K ++ +K+++ AK+ +D++E E E+E+ + + DE
Sbjct: 145 KAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDE 194
Score = 31.7 bits (72), Expect = 0.73
Identities = 11/51 (21%), Positives = 28/51 (54%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
KK + K+ ++ EE+ ++D+ K E+ ED+ ++E ++ ++ +
Sbjct: 158 SKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYD 208
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 32.4 bits (74), Expect = 0.21
Identities = 17/68 (25%), Positives = 26/68 (38%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAE 376
K + K A E EKK+ E + E + + A A +E +E+E E E
Sbjct: 119 DKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEE 178
Query: 377 EHEHDHDE 384
+ E
Sbjct: 179 APAEESAE 186
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 33.2 bits (77), Expect = 0.25
Identities = 12/55 (21%), Positives = 25/55 (45%), Gaps = 6/55 (10%)
Query: 326 IWK-PAFEGEKKMKE-----KQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGK 374
W GE++++E E ++ ++ D+D+DE+E+E+E
Sbjct: 154 EWYDRLENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDS 208
Score = 28.6 bits (65), Expect = 7.3
Identities = 11/36 (30%), Positives = 21/36 (58%)
Query: 342 DEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
EE+ + ++ DED++E+EDE + + A+E
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADE 213
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 33.2 bits (76), Expect = 0.26
Identities = 18/58 (31%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 320 KKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
KK++E I P + +K EK+ +E EE ++KK++E+ D++EE + +E
Sbjct: 13 KKHSEFINYPIYLWVEKEVEKEVPDEE--EEEEKEEKKEEEEKTTDKEEEVDEEEEKE 68
Score = 28.6 bits (64), Expect = 5.8
Identities = 19/59 (32%), Positives = 34/59 (57%), Gaps = 9/59 (15%)
Query: 329 PAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEET--TGKAEEHEHDHDEL 385
P E E++ +EK++EEE+ + K++E D+E+E EE+ T K +E + + L
Sbjct: 36 PDEEEEEEKEEKKEEEEKTTD-------KEEEVDEEEEKEEKKKKTKKVKETTTEWELL 87
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 33.1 bits (76), Expect = 0.26
Identities = 12/44 (27%), Positives = 28/44 (63%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
E+ ++E+ +E K + + D+++D+++D+++D EE EE
Sbjct: 51 EESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEE 94
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 32.4 bits (74), Expect = 0.26
Identities = 15/52 (28%), Positives = 29/52 (55%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHD 383
E + K E++ K EA D++ ++D++E+E+EEE ++ + D D
Sbjct: 146 EEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
Score = 31.6 bits (72), Expect = 0.42
Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 4/67 (5%)
Query: 314 DDIATAKKYAEDIWKPAFEGEKKMK----EKQDEEERKAEEAAAKDKKKDEDDDEDEDEE 369
+ + EDI + EKK+K E DEE+ K EE +++++DED D+D+D++
Sbjct: 137 RKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196
Query: 370 ETTGKAE 376
+ AE
Sbjct: 197 DDDYNAE 203
Score = 28.2 bits (63), Expect = 6.1
Identities = 17/53 (32%), Positives = 31/53 (58%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
E E+ + EK E+K +E A+D ++++ DE+E+EEE + + D D+
Sbjct: 144 EEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 33.0 bits (76), Expect = 0.26
Identities = 10/73 (13%), Positives = 27/73 (36%)
Query: 312 ITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEET 371
D A+K K + ++ ++ D DDD+D+++++
Sbjct: 99 ELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDE 158
Query: 372 TGKAEEHEHDHDE 384
++ + + +E
Sbjct: 159 DDDDDDVDDEDEE 171
Score = 32.7 bits (75), Expect = 0.32
Identities = 16/69 (23%), Positives = 28/69 (40%)
Query: 313 TDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETT 372
D Y +DI + + D+ + + D+ DEDDD+D+ ++E
Sbjct: 111 ALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDE 170
Query: 373 GKAEEHEHD 381
K E E +
Sbjct: 171 EKKEAKELE 179
Score = 32.3 bits (74), Expect = 0.51
Identities = 9/77 (11%), Positives = 26/77 (33%)
Query: 308 DNVLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDED 367
D + + + + + + + +++ E+ D D DD+++E
Sbjct: 113 DKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEK 172
Query: 368 EEETTGKAEEHEHDHDE 384
+E + + D
Sbjct: 173 KEAKELEKLSDDDDFVW 189
Score = 31.5 bits (72), Expect = 0.74
Identities = 12/78 (15%), Positives = 34/78 (43%), Gaps = 1/78 (1%)
Query: 308 DNVLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDED 367
D++ DI + +D + + D+++ + ++ D D++D+E ++
Sbjct: 116 DDLNYVKDIDVLNQADDDDDDDDDDDLDD-DDIDDDDDDEDDDEDDDDDDVDDEDEEKKE 174
Query: 368 EEETTGKAEEHEHDHDEL 385
+E +++ + DE
Sbjct: 175 AKELEKLSDDDDFVWDED 192
Score = 31.1 bits (71), Expect = 0.95
Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 5/78 (6%)
Query: 313 TDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEA-----AAKDKKKDEDDDEDED 367
A AKK +D + + EKK +D++ ++ A D D+DDD D+D
Sbjct: 86 AAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDD 145
Query: 368 EEETTGKAEEHEHDHDEL 385
+ + E+ + D D+
Sbjct: 146 DIDDDDDDEDDDEDDDDD 163
Score = 30.3 bits (69), Expect = 2.0
Identities = 14/82 (17%), Positives = 28/82 (34%)
Query: 296 FDLWQVKSGTIFDNVLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKD 355
L Q D+ DD ++ + + E ++++E K E + D
Sbjct: 125 DVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDD 184
Query: 356 KKKDEDDDEDEDEEETTGKAEE 377
D+D+ E + A+
Sbjct: 185 DDFVWDEDDSEALRQARKDAKL 206
Score = 29.2 bits (66), Expect = 4.3
Identities = 13/72 (18%), Positives = 29/72 (40%)
Query: 313 TDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETT 372
D + K AE + + + D + ++ D +DDD D+D+++
Sbjct: 96 LKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDED 155
Query: 373 GKAEEHEHDHDE 384
++ + D D+
Sbjct: 156 DDEDDDDDDVDD 167
Score = 28.8 bits (65), Expect = 5.3
Identities = 14/73 (19%), Positives = 29/73 (39%), Gaps = 1/73 (1%)
Query: 313 TDDIATAKKYAEDIWKPAFEGEKKMKEKQ-DEEERKAEEAAAKDKKKDEDDDEDEDEEET 371
KK A D + Q D+++ ++ D D+DDD+++D+E+
Sbjct: 101 DSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDD 160
Query: 372 TGKAEEHEHDHDE 384
+ E + +
Sbjct: 161 DDDDVDDEDEEKK 173
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 32.4 bits (74), Expect = 0.27
Identities = 23/76 (30%), Positives = 32/76 (42%), Gaps = 3/76 (3%)
Query: 309 NVLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDE 368
+ + E K + + E E+ EEE +EAA + K+D+ D E EDE
Sbjct: 33 QLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEE---KEAANSEDKEDKGDAEKEDE 89
Query: 369 EETTGKAEEHEHDHDE 384
E EE E DE
Sbjct: 90 ESEEENEEEDEESSDE 105
Score = 28.9 bits (65), Expect = 3.2
Identities = 10/51 (19%), Positives = 27/51 (52%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
E+K +++E K + ++ ++E+++EDE+ + K E + + +
Sbjct: 68 EEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESNV 118
>gnl|CDD|220002 pfam08751, TrwC, TrwC relaxase. Relaxases are DNA strand
transferases which function during the conjugative cell
to cell DNA transfer. TrwC binds to the origin of
transfer (oriT) and melts the double helix.
Length = 288
Score = 32.3 bits (74), Expect = 0.31
Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 2/51 (3%)
Query: 334 EKKMKEKQDEEERKAEEAAAKD--KKKDEDDDEDEDEEETTGKAEEHEHDH 382
E+ + E + A++AAA K KDE D E E +A E D
Sbjct: 238 EEALAELGGDATAAAKQAAALATRKAKDEPRDRAELRAEWRARARELGFDP 288
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 32.8 bits (74), Expect = 0.32
Identities = 14/50 (28%), Positives = 23/50 (46%)
Query: 330 AFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHE 379
A K++ E E++++ E A K++ + D DE EE E E
Sbjct: 282 AEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEE 331
Score = 29.3 bits (65), Expect = 4.2
Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 2/48 (4%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHE 379
E + KE+Q EE AE + + + DE +E+E + EE+E
Sbjct: 288 EVSEGDKEQQQEEVENAEAHKEEVQ--SDRPDEIGEEKEEDDENEENE 333
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 32.5 bits (74), Expect = 0.34
Identities = 17/80 (21%), Positives = 40/80 (50%), Gaps = 5/80 (6%)
Query: 300 QVKSGTIFDNVLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKD 359
+VKS T + + K+Y + + + + ++K++ D E +AE A ++K K
Sbjct: 355 EVKSNTDLSGDKRQELL---KEYNKKL-QDYTKKLGEVKDETDASE-EAEAKAKEEKLKQ 409
Query: 360 EDDDEDEDEEETTGKAEEHE 379
E++++ + E+ K + +
Sbjct: 410 EENEKKQKEQADEDKEKRQK 429
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 32.6 bits (75), Expect = 0.35
Identities = 14/72 (19%), Positives = 22/72 (30%)
Query: 296 FDLWQVKSGTIFDNVLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKD 355
D + G + A KK + K + EK+ EK +++ A
Sbjct: 388 HDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAK 447
Query: 356 KKKDEDDDEDED 367
E D D
Sbjct: 448 GPDGETKKVDPD 459
Score = 31.4 bits (72), Expect = 0.74
Identities = 15/55 (27%), Positives = 27/55 (49%), Gaps = 3/55 (5%)
Query: 334 EKKMKEKQDEEERKAEEAA---AKDKKKDEDDDEDEDEEETTGKAEEHEHDHDEL 385
KK+++KQ + E+KAE+ A KKK E + + K + + ++L
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEKL 465
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 32.7 bits (75), Expect = 0.37
Identities = 23/91 (25%), Positives = 40/91 (43%), Gaps = 23/91 (25%)
Query: 287 YKPEICTIGFDLWQVKSGTIFDNVLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEER 346
Y PE+ GFD+ ++ I + L+ D+ E + ++ + DEEE
Sbjct: 195 YCPEL---GFDILEL----IIER-LLKLDV---------------EIQNELDDIDDEEEE 231
Query: 347 KAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
+ D + D D +D+DEEE+ + E
Sbjct: 232 RVLADEDDDDEDDMFDMDDDDEEESDPEVER 262
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 32.2 bits (73), Expect = 0.42
Identities = 12/51 (23%), Positives = 24/51 (47%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDH 382
E+K K+K+ + + ++ K K DD+ ++ + G E E D+
Sbjct: 236 GDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDY 286
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 31.7 bits (72), Expect = 0.68
Identities = 15/61 (24%), Positives = 21/61 (34%), Gaps = 3/61 (4%)
Query: 319 AKKYAEDIWKPAFEGEKKMKE--KQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAE 376
A K A A K+ K+ E K + A KK + + + E E KA
Sbjct: 138 AAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEA-AKKAAAEAKKKAEAEAAAKAA 196
Query: 377 E 377
Sbjct: 197 A 197
Score = 30.2 bits (68), Expect = 2.0
Identities = 19/62 (30%), Positives = 28/62 (45%), Gaps = 1/62 (1%)
Query: 317 ATAKKYAEDIWKPAFEGEK-KMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKA 375
A +K AE+ A K K + + A++AAA+ KKK E + + E KA
Sbjct: 128 ALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKA 187
Query: 376 EE 377
E
Sbjct: 188 EA 189
Score = 30.2 bits (68), Expect = 2.2
Identities = 23/63 (36%), Positives = 29/63 (46%), Gaps = 2/63 (3%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKE--KQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGK 374
A AK AE K A KK K+ E A++AAA+ KKK E + + E K
Sbjct: 143 AAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKK 202
Query: 375 AEE 377
AE
Sbjct: 203 AEA 205
Score = 29.8 bits (67), Expect = 2.6
Identities = 20/65 (30%), Positives = 25/65 (38%), Gaps = 1/65 (1%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAE 376
A AKK AE K E K K E + A +AAA+ K E + E+
Sbjct: 197 AEAKKKAEAEAKKKAAAEAKKKAAA-EAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255
Query: 377 EHEHD 381
E D
Sbjct: 256 AAEVD 260
Score = 29.4 bits (66), Expect = 3.2
Identities = 18/55 (32%), Positives = 24/55 (43%), Gaps = 2/55 (3%)
Query: 323 AEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
A++ K A E K+ KQ + E A +AAA K K E E + KA
Sbjct: 113 AQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAE--AEAKRAAAAAKKAAA 165
Score = 29.0 bits (65), Expect = 4.8
Identities = 22/64 (34%), Positives = 30/64 (46%), Gaps = 5/64 (7%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAK----DKKKDEDDDEDEDEEETT 372
A AKK A + K A E E K + +++ EAAAK KKK E + + + E
Sbjct: 158 AAAKKAAAEAKKKA-EAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAK 216
Query: 373 GKAE 376
KA
Sbjct: 217 KKAA 220
Score = 28.2 bits (63), Expect = 7.1
Identities = 19/72 (26%), Positives = 28/72 (38%), Gaps = 3/72 (4%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMK---EKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTG 373
A AK AE K E +KK +K+ E KA A A + K + +
Sbjct: 191 AAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKA 250
Query: 374 KAEEHEHDHDEL 385
A + + D+L
Sbjct: 251 AAAKAAAEVDDL 262
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 31.9 bits (72), Expect = 0.69
Identities = 12/46 (26%), Positives = 24/46 (52%)
Query: 339 EKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
E + EE +E A+ D + DE D+++E +E + +E+ +
Sbjct: 930 ESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSS 975
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 31.7 bits (72), Expect = 0.69
Identities = 14/41 (34%), Positives = 19/41 (46%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETT 372
E E+ E D E + E AA+ + E D+ DED E
Sbjct: 259 ESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTETPG 299
Score = 29.0 bits (65), Expect = 5.5
Identities = 10/49 (20%), Positives = 22/49 (44%)
Query: 336 KMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
+ + EE+ E+ +ED + +E E + ++EE E + +
Sbjct: 221 ETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATD 269
Score = 28.3 bits (63), Expect = 9.5
Identities = 9/46 (19%), Positives = 22/46 (47%)
Query: 339 EKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
+ EE + + + +D+ + D + +E E + +E+ E D +
Sbjct: 247 SEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESD 292
>gnl|CDD|184060 PRK13453, PRK13453, F0F1 ATP synthase subunit B; Provisional.
Length = 173
Score = 30.6 bits (69), Expect = 0.71
Identities = 14/50 (28%), Positives = 28/50 (56%)
Query: 312 ITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDED 361
I DI A++ + K E ++K+KE Q+E ++ E+A + +++ E
Sbjct: 54 INRDIDDAEQAKLNAQKLEEENKQKLKETQEEVQKILEDAKVQARQQQEQ 103
>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
Length = 931
Score = 31.7 bits (72), Expect = 0.80
Identities = 14/82 (17%), Positives = 21/82 (25%), Gaps = 1/82 (1%)
Query: 190 DPEAKKPEDWDDRPTIADPNDTKPEDWEKPEHIPNPDAVKP-ADWDDEMDGEWEAPMIDN 248
A+ +IA P P + P A +P A P+ +
Sbjct: 57 PEAAEAAPLPAAAESIASPEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPVEPS 116
Query: 249 PEYKGVWAPKQIENPDYKGVWV 270
A + VWV
Sbjct: 117 LAANPFAAAIEWLLGGRWVVWV 138
>gnl|CDD|236558 PRK09545, znuA, high-affinity zinc transporter periplasmic
component; Reviewed.
Length = 311
Score = 31.1 bits (71), Expect = 0.81
Identities = 8/28 (28%), Positives = 13/28 (46%)
Query: 357 KKDEDDDEDEDEEETTGKAEEHEHDHDE 384
K DD D+D + + + +H H E
Sbjct: 115 KGAHDDHHDDDHDHAGHEKSDEDHHHGE 142
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 31.4 bits (71), Expect = 0.85
Identities = 14/45 (31%), Positives = 24/45 (53%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEH 378
+KK K K++ ++RK +E A + + E + E E + E EE
Sbjct: 119 KKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEE 163
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 29.6 bits (67), Expect = 0.90
Identities = 7/23 (30%), Positives = 17/23 (73%)
Query: 348 AEEAAAKDKKKDEDDDEDEDEEE 370
A ++++++E+++E+E EEE
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEE 95
Score = 28.1 bits (63), Expect = 2.8
Identities = 7/23 (30%), Positives = 18/23 (78%)
Query: 348 AEEAAAKDKKKDEDDDEDEDEEE 370
A A ++++++E+++E+E+ EE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEE 94
Score = 27.7 bits (62), Expect = 3.5
Identities = 7/27 (25%), Positives = 18/27 (66%)
Query: 347 KAEEAAAKDKKKDEDDDEDEDEEETTG 373
A E ++++++E+++E+ +EE G
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEAMAG 99
Score = 27.7 bits (62), Expect = 3.8
Identities = 9/28 (32%), Positives = 20/28 (71%)
Query: 348 AEEAAAKDKKKDEDDDEDEDEEETTGKA 375
A AA ++++++E+++E+E+E E A
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 26.9 bits (60), Expect = 6.9
Identities = 7/25 (28%), Positives = 17/25 (68%)
Query: 344 EERKAEEAAAKDKKKDEDDDEDEDE 368
AEE ++++++E+++E E+E
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 30.9 bits (70), Expect = 0.96
Identities = 13/49 (26%), Positives = 23/49 (46%)
Query: 337 MKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDEL 385
M D E +E + D + +ED+D+ +++E+ EE D L
Sbjct: 202 MLSSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSL 250
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 31.2 bits (71), Expect = 0.99
Identities = 10/45 (22%), Positives = 23/45 (51%)
Query: 335 KKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHE 379
+ ++KQ +++ ++ K KK +DDD E + + E++
Sbjct: 9 EVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENK 53
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 31.0 bits (70), Expect = 1.0
Identities = 10/53 (18%), Positives = 23/53 (43%)
Query: 328 KPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEH 380
+F +K ++K+ +E ++ +++ + D EDE +E E
Sbjct: 218 MSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDED 270
Score = 29.0 bits (65), Expect = 4.6
Identities = 18/65 (27%), Positives = 32/65 (49%)
Query: 308 DNVLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDED 367
+V++ D+ A ED + + + + ++E E K +E + KK ED+DEDE+
Sbjct: 251 RDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEE 310
Query: 368 EEETT 372
E
Sbjct: 311 MEIVP 315
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 30.7 bits (70), Expect = 1.1
Identities = 13/46 (28%), Positives = 24/46 (52%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHE 379
EK +K ++E + +A+E + KK++ + + E K EE E
Sbjct: 270 EKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKE 315
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 29.8 bits (68), Expect = 1.2
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKKDED 361
E K EK +++ KA+ A K K K E+
Sbjct: 67 ELKAWEKAEKKAEKAKAKAEKKKAKKEE 94
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 30.4 bits (68), Expect = 1.3
Identities = 12/50 (24%), Positives = 24/50 (48%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHD 383
K K++Q + + K ++K ++++E+E E K EE + D
Sbjct: 194 APKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 29.6 bits (67), Expect = 1.4
Identities = 14/50 (28%), Positives = 26/50 (52%)
Query: 335 KKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
KK K+K+ ++ +K + K+ K ++ DE+EE K EE ++
Sbjct: 93 KKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142
>gnl|CDD|219047 pfam06473, FGF-BP1, FGF binding protein 1 (FGF-BP1). This family
consists of several mammalian FGF binding protein 1.
Fibroblast growth factors (FGFs) play important roles
during fetal and embryonic development. Fibroblast
growth factor-binding protein (FGF-BP) 1 is a secreted
protein that can bind fibroblast growth factors (FGFs) 1
and 2.
Length = 229
Score = 30.2 bits (68), Expect = 1.4
Identities = 15/51 (29%), Positives = 22/51 (43%)
Query: 328 KPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEH 378
K EKK+K+ + K + + K + EDED EE A E+
Sbjct: 161 KEKSSEEKKLKKDSNSGVTKTKATKLPECKPTQAGVEDEDLEEQRKLALEY 211
>gnl|CDD|224739 COG1826, TatA, Sec-independent protein secretion pathway components
[Intracellular trafficking and secretion].
Length = 94
Score = 28.9 bits (65), Expect = 1.5
Identities = 17/58 (29%), Positives = 28/58 (48%), Gaps = 2/58 (3%)
Query: 316 IATAKKYAEDIWKPAFEGEKKMKEKQDEE-ERKAEEAAAKDKKKDEDDDEDEDEEETT 372
I KK A D+ K + E K++E D++ E AE A K++ + ED ++
Sbjct: 36 IREFKKAASDV-KNELDEELKLEELDDKKKELTAELQATKEELDQLASELKEDLKKKA 92
>gnl|CDD|218187 pfam04637, Herpes_pp85, Herpesvirus phosphoprotein 85 (HHV6-7
U14/HCMV UL25). This family includes UL25 proteins from
HCMV, as well as U14 proteins from HHV 6 and HHV7. These
85 kD phosphoproteins appear to act as structural
antigens, but their precise function is otherwise
unknown.
Length = 502
Score = 30.4 bits (69), Expect = 1.6
Identities = 10/40 (25%), Positives = 18/40 (45%), Gaps = 1/40 (2%)
Query: 129 NYKGKNLLINKDI-RCKDDVFTHLYTLIVKPDNTYDVLID 167
Y+ KN N D+ RC + ++L + T D++
Sbjct: 360 FYRAKNPYGNSDLFRCPRNASSYLRRGLRTRPITQDIITT 399
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 29.1 bits (65), Expect = 1.6
Identities = 12/50 (24%), Positives = 22/50 (44%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHD 381
E + + + +A+E D+ DE DE + E E + E+ E +
Sbjct: 81 ADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEADEEEDEEAE 130
Score = 27.2 bits (60), Expect = 7.1
Identities = 17/58 (29%), Positives = 23/58 (39%)
Query: 327 WKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
W A EK AEEA A D +D D+ + D + + EE + DE
Sbjct: 52 WTEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDE 109
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 30.6 bits (69), Expect = 1.7
Identities = 18/71 (25%), Positives = 38/71 (53%), Gaps = 2/71 (2%)
Query: 300 QVKSGTIFDNVLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKD 359
++ G + VL+ + A K + K EG +KE ++++E K EE + + K ++
Sbjct: 241 HLRGGALL--VLVEGVLLKAPKILRHVDKLGIEGWDWLKELKEKKEEKDEEKSEEVKTEE 298
Query: 360 EDDDEDEDEEE 370
D++ +E+E+
Sbjct: 299 VDEEFEEEEKG 309
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 30.6 bits (69), Expect = 1.7
Identities = 16/48 (33%), Positives = 25/48 (52%), Gaps = 2/48 (4%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHE 379
E E ++E D+E+ EA + + E D ED +E+E AE +E
Sbjct: 650 EDEDDLEETDDDEDE--CEAIEDSESESESDGEDGEEDEQEDDAEANE 695
Score = 30.2 bits (68), Expect = 2.2
Identities = 11/57 (19%), Positives = 26/57 (45%), Gaps = 1/57 (1%)
Query: 325 DIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHD 381
D K + E++DE++ + + + + ED + + + + G+ +E E D
Sbjct: 635 DENKSRHQQ-LFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDD 690
Score = 28.7 bits (64), Expect = 7.3
Identities = 13/50 (26%), Positives = 21/50 (42%)
Query: 335 KKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
K + K ++ E +D ++ DDDEDE E ++E D
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDG 682
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 28.2 bits (63), Expect = 1.7
Identities = 12/32 (37%), Positives = 22/32 (68%)
Query: 338 KEKQDEEERKAEEAAAKDKKKDEDDDEDEDEE 369
+ + D+++ ++ KD K D+DDD+DED+E
Sbjct: 49 EPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80
>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22. The
mitochondrial protein translocase family, which is
responsible for movement of nuclear encoded pre-proteins
into mitochondria, is very complex with at least 19
components. These proteins include several chaperone
proteins, four proteins of the outer membrane
translocase (Tom) import receptor, five proteins of the
Tom channel complex, five proteins of the inner membrane
translocase (Tim) and three "motor" proteins. This
family represents the Tom22 proteins. The N terminal
region of Tom22 has been shown to have chaperone-like
activity, and the C terminal region faces the
intermembrane face.
Length = 136
Score = 29.1 bits (66), Expect = 1.7
Identities = 12/43 (27%), Positives = 24/43 (55%)
Query: 342 DEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
++E + + AA K+ ++E DD+DED+ +T + +E
Sbjct: 8 EDETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFENE 50
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 30.2 bits (69), Expect = 1.8
Identities = 7/34 (20%), Positives = 21/34 (61%)
Query: 336 KMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEE 369
++KE + + A ++++++E+++E +EE
Sbjct: 287 ELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 28.3 bits (64), Expect = 7.3
Identities = 14/40 (35%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 338 KEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
K+ DEE ++ A A+ +E+++E+E+EEE +EE
Sbjct: 281 KDALDEELKEVLSAQAQAAAAEEEEEEEEEEEE-EEPSEE 319
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 30.4 bits (69), Expect = 1.8
Identities = 13/44 (29%), Positives = 24/44 (54%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
E+ + E+++ E K E A ++ E + E E++ E +AEE
Sbjct: 522 EELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEE 565
Score = 28.1 bits (63), Expect = 9.9
Identities = 12/54 (22%), Positives = 22/54 (40%)
Query: 313 TDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDE 366
+ ++ A ++ A E + E ++E E EE A + K E + E
Sbjct: 536 RERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIE 589
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 30.5 bits (69), Expect = 1.9
Identities = 19/72 (26%), Positives = 31/72 (43%), Gaps = 13/72 (18%)
Query: 313 TDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETT 372
T T + Y D AF ++ + ER EE DE+++E+EDE+E
Sbjct: 362 TKAKETEQDYILD----AFSA---LEIEDANTERDDEEEE------DEEEEEEEDEDEGP 408
Query: 373 GKAEEHEHDHDE 384
K + + +E
Sbjct: 409 SKEHSDDEEFEE 420
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 30.0 bits (67), Expect = 2.0
Identities = 14/39 (35%), Positives = 25/39 (64%)
Query: 341 QDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHE 379
++ A + A K+ + DED++E+E+EEE + EE+E
Sbjct: 118 LPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENE 156
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 29.8 bits (67), Expect = 2.2
Identities = 11/55 (20%), Positives = 25/55 (45%)
Query: 318 TAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETT 372
A + + +K E+ D+++ E + ++E++ E+E+EEE
Sbjct: 229 AAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDE 283
Score = 29.0 bits (65), Expect = 4.1
Identities = 11/42 (26%), Positives = 26/42 (61%)
Query: 335 KKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAE 376
K + ++++++ + K+ K+DE+++E E+EEE + E
Sbjct: 244 NKKSDDEEDDDQDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285
Score = 27.9 bits (62), Expect = 9.5
Identities = 11/53 (20%), Positives = 28/53 (52%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
EK+ + ++ + + K+KK D+++D+D+D + K + + + +E
Sbjct: 221 YYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEE 273
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 29.0 bits (65), Expect = 2.3
Identities = 14/66 (21%), Positives = 31/66 (46%)
Query: 314 DDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTG 373
++ +K + + K ++K+ +E++ AE+ A K K + + ++TT
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTT 152
Query: 374 KAEEHE 379
K EE +
Sbjct: 153 KKEEGK 158
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 30.2 bits (68), Expect = 2.3
Identities = 16/57 (28%), Positives = 28/57 (49%), Gaps = 1/57 (1%)
Query: 315 DIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEA-AAKDKKKDEDDDEDEDEEE 370
++ T + + K K++ + D EE + +D DED+D+DEDE+E
Sbjct: 848 NLLTVQDKFVKLVKNFISKFKELNDLTDLEEYLISQFQENEDDDADEDEDQDEDEDE 904
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 30.0 bits (68), Expect = 2.4
Identities = 10/58 (17%), Positives = 29/58 (50%)
Query: 320 KKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
+K+ + + E +K KE ++E + K E ++ + +E+ ++ + ++E+
Sbjct: 423 RKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEK 480
>gnl|CDD|146413 pfam03760, LEA_1, Late embryogenesis abundant (LEA) group 1.
Family members are conserved along the entire coding
region, especially within the hydrophobic internal 20
amino acid motif, which may be repeated.
Length = 73
Score = 27.6 bits (61), Expect = 2.4
Identities = 21/63 (33%), Positives = 32/63 (50%), Gaps = 3/63 (4%)
Query: 317 ATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAE 376
+AK+ A ++ A E K K K DE KAE+A A+ K + E + + +E K +
Sbjct: 2 QSAKEKASNMASTAKEKMDKTKAKADE---KAEKATARTKVEKEIAHQRKKAKEAQAKMD 58
Query: 377 EHE 379
HE
Sbjct: 59 LHE 61
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 29.5 bits (66), Expect = 2.6
Identities = 14/51 (27%), Positives = 26/51 (50%), Gaps = 3/51 (5%)
Query: 331 FEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHD 381
F+G E D+E + D+++D DDD++ED ++T + + D
Sbjct: 264 FKGVNYGYESDDDEG---SSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCD 311
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 28.0 bits (63), Expect = 2.8
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 348 AEEAAAKDKKKDEDDDEDEDEEE 370
A AA + K+++E+++E E+ EE
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEE 95
Score = 27.6 bits (62), Expect = 3.7
Identities = 6/22 (27%), Positives = 15/22 (68%)
Query: 347 KAEEAAAKDKKKDEDDDEDEDE 368
A E ++++++E+ +E E+E
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEE 96
Score = 27.6 bits (62), Expect = 4.6
Identities = 8/23 (34%), Positives = 17/23 (73%)
Query: 348 AEEAAAKDKKKDEDDDEDEDEEE 370
A A K ++++E+++++E EEE
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEE 96
Score = 26.8 bits (60), Expect = 7.8
Identities = 11/28 (39%), Positives = 21/28 (75%)
Query: 348 AEEAAAKDKKKDEDDDEDEDEEETTGKA 375
A AAA++KK++E+++E+++E E A
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 26.8 bits (60), Expect = 8.7
Identities = 7/26 (26%), Positives = 16/26 (61%)
Query: 348 AEEAAAKDKKKDEDDDEDEDEEETTG 373
A K+++++E++ E+ +EE G
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEAAAG 100
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 30.0 bits (67), Expect = 3.0
Identities = 14/55 (25%), Positives = 26/55 (47%), Gaps = 1/55 (1%)
Query: 324 EDIW-KPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
E +W +P E + ++K +E+ E+ K+ D ED+D +E + E
Sbjct: 3905 EKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEM 3959
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 28.6 bits (64), Expect = 3.1
Identities = 16/54 (29%), Positives = 24/54 (44%), Gaps = 14/54 (25%)
Query: 336 KMKEKQDEEERKAEEAAAKDKKK--------------DEDDDEDEDEEETTGKA 375
K +EK+ E+ER E A + K K E+D +DE++EE
Sbjct: 40 KAQEKEHEKERAEREEAREKKAKISPNALRGGATAGHGEEDTDDEEDEEDFATP 93
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 29.6 bits (66), Expect = 3.4
Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 320 KKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEET 371
K+ AE+ + E + +K DEE KA++ A D K++ E E E++
Sbjct: 273 KQVAENQKREI-EKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKE 323
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 29.3 bits (65), Expect = 3.4
Identities = 17/54 (31%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKK--DEDDDEDEDEEETTGKAEEHEHDHDEL 385
E++ +K++EE+ EE ++ + +E+ D+DED E+ K + E D +EL
Sbjct: 101 EERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDEKESDAEGDENEL 154
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 27.6 bits (62), Expect = 3.4
Identities = 10/22 (45%), Positives = 19/22 (86%)
Query: 347 KAEEAAAKDKKKDEDDDEDEDE 368
A AAA+++KK+E+++E+ED+
Sbjct: 60 AAAAAAAEEEKKEEEEEEEEDD 81
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 29.4 bits (66), Expect = 3.9
Identities = 11/50 (22%), Positives = 27/50 (54%)
Query: 330 AFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHE 379
A + E + + + ++ ER E+A ++ +D+ ++++E + EE E
Sbjct: 397 ALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEE 446
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 29.1 bits (65), Expect = 4.2
Identities = 14/71 (19%), Positives = 29/71 (40%)
Query: 315 DIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGK 374
+ A++ +D E + ++E+E + E + + E D E +G+
Sbjct: 196 SMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGE 255
Query: 375 AEEHEHDHDEL 385
E + D D+L
Sbjct: 256 EEMVQSDQDDL 266
Score = 28.8 bits (64), Expect = 5.4
Identities = 11/47 (23%), Positives = 23/47 (48%)
Query: 338 KEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
+E+Q E E + +E +A + + D + + EEE ++ D +
Sbjct: 225 QEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESD 271
>gnl|CDD|168145 PRK05639, PRK05639, 4-aminobutyrate aminotransferase; Provisional.
Length = 457
Score = 29.1 bits (65), Expect = 4.2
Identities = 23/61 (37%), Positives = 31/61 (50%), Gaps = 4/61 (6%)
Query: 266 KGVWVHPEIDNPEYSPDPELYYKPEICTIGFDLWQV-KSGTIFDNVL-ITDDIATAKKYA 323
KG+ + EI PDPEL K IC F+L + S +F NV+ IT + K+ A
Sbjct: 373 KGLMIGVEIVKENGKPDPELTGK--ICWRAFELGLILPSYGMFGNVIRITPPLVITKEIA 430
Query: 324 E 324
E
Sbjct: 431 E 431
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 28.5 bits (63), Expect = 4.3
Identities = 16/51 (31%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 334 EKKMKEKQDEEERKAEEAAAK--DKKKDEDDDEDEDEEETT-GKAEEHEHD 381
EK +E+++ EER E + D+++D+++ E E EE+T ++++ EH+
Sbjct: 129 EKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHE 179
Score = 28.5 bits (63), Expect = 5.5
Identities = 18/75 (24%), Positives = 39/75 (52%), Gaps = 2/75 (2%)
Query: 310 VLITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEE 369
++ + I A++ E++ + E E++ +EK DE E + E+ ++++ E+ +D + E
Sbjct: 122 IIRQEQIEKARQEREEL-EERMEWERR-EEKIDEREDQEEQEREREEQTIEEQSDDSEHE 179
Query: 370 ETTGKAEEHEHDHDE 384
E E D D+
Sbjct: 180 IIEQDESETESDDDK 194
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 29.0 bits (65), Expect = 4.7
Identities = 15/92 (16%), Positives = 28/92 (30%), Gaps = 5/92 (5%)
Query: 195 KPEDWDDRPTIADPNDTKPEDWEKPEHIPNPDAVKPADWDDEMDG-----EWEAPMIDNP 249
+P + + P ++PE P A W + G + +P P
Sbjct: 129 RPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAP 188
Query: 250 EYKGVWAPKQIENPDYKGVWVHPEIDNPEYSP 281
E + P P+Y + P++
Sbjct: 189 EQERDREPYDAGRPEYDQRRRDYDHPRPDWDR 220
>gnl|CDD|218053 pfam04383, KilA-N, KilA-N domain. The amino-terminal module of the
D6R/N1R proteins defines a novel, conserved DNA-binding
domain (the KilA-N domain) that is found in a wide range
of proteins of large bacterial and eukaryotic DNA
viruses. The KilA-N domain family also includes the
previously defined APSES domain. The KilA-N and APSES
domains may also share a common fold with the nucleic
acid-binding modules of the LAGLIDADG nucleases and the
amino-terminal domains of the tRNA endonuclease.
Length = 106
Score = 27.3 bits (61), Expect = 4.9
Identities = 11/39 (28%), Positives = 21/39 (53%), Gaps = 2/39 (5%)
Query: 236 EMDGEWEAPMIDNPEYKGVWAPKQIENPDYKGVWVHPEI 274
E+ E I+NP+ V ++ +N +G +VHP++
Sbjct: 44 ELIEELS--KINNPDKLIVIEGRKGKNGRLQGTYVHPDL 80
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 27.4 bits (61), Expect = 5.0
Identities = 14/46 (30%), Positives = 25/46 (54%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHE 379
EK+M+++ +++ KA E KD+K+ E + +E EE E
Sbjct: 31 EKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKE 76
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 28.4 bits (64), Expect = 5.0
Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 4/50 (8%)
Query: 329 PAFEGEKKMKEKQDEEERKAEEAAAKDKKKDED-DDEDEDEEETTGKAEE 377
AFE +M+EK +E E +AE AA + +D D E +
Sbjct: 166 AAFE---RMEEKIEEREARAEAAAELAEGSGDDLDKEFAQAGAQSAVDSR 212
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 28.6 bits (64), Expect = 5.1
Identities = 11/43 (25%), Positives = 21/43 (48%)
Query: 343 EEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDEL 385
++ +E A D ++++DE EE+ EE E + E+
Sbjct: 31 AQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGEV 73
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 28.8 bits (64), Expect = 5.1
Identities = 13/41 (31%), Positives = 22/41 (53%), Gaps = 3/41 (7%)
Query: 342 DEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDH 382
+ EE +E+A +D D+DD + ED E +G++ E
Sbjct: 292 ETEEDDVDESAIED---DDDDSDWEDSVEESGRSSVDEKTM 329
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 28.8 bits (65), Expect = 5.3
Identities = 15/63 (23%), Positives = 26/63 (41%), Gaps = 12/63 (19%)
Query: 334 EKKMKEKQD-----EEERKA-------EEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHD 381
E+ KE+ + E ER ++ +D K+ DD +DE E + + +
Sbjct: 261 EELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGE 320
Query: 382 HDE 384
DE
Sbjct: 321 EDE 323
Score = 28.8 bits (65), Expect = 5.6
Identities = 13/47 (27%), Positives = 30/47 (63%)
Query: 331 FEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
+ + ++E++++ + EE +D+ D++DDE+E+EEE K ++
Sbjct: 336 EDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKK 382
Score = 28.8 bits (65), Expect = 6.8
Identities = 14/44 (31%), Positives = 24/44 (54%)
Query: 342 DEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDEL 385
D+++ +D++++ED +DEDEE+ EE E D D
Sbjct: 309 DDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLS 352
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 29.2 bits (65), Expect = 5.5
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 349 EEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDE 384
E+ + +D +DDDEDEDE++ + +E E + +E
Sbjct: 143 EDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEE 178
Score = 28.5 bits (63), Expect = 7.7
Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 342 DEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHE 379
EE+ + D DED+DED+D+EE + EE E
Sbjct: 141 AEEDMSPRDNFVIDDD-DEDEDEDDDDEEDDEEEEEEE 177
>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
protein; Provisional.
Length = 196
Score = 28.4 bits (63), Expect = 5.6
Identities = 17/57 (29%), Positives = 28/57 (49%), Gaps = 1/57 (1%)
Query: 314 DDIATAKKYAEDIWKPAFEGE-KKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEE 369
DD+ AK+ A+ K A + K+ K E E KA + + + ++DE E +E
Sbjct: 140 DDVVKAKEEAKAAAKAAALAKAKEASLKSAEAELKASKGKSIEVNVIGEEDEGEKDE 196
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 28.1 bits (63), Expect = 5.6
Identities = 12/48 (25%), Positives = 26/48 (54%)
Query: 322 YAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEE 369
E+ K + E + KE+++ EE + ++ +++K+ E+ E EE
Sbjct: 27 EREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEE 74
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 28.7 bits (65), Expect = 6.1
Identities = 9/36 (25%), Positives = 23/36 (63%)
Query: 335 KKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEE 370
K++K ++ EEER+ +E +++ K ++E+ + +
Sbjct: 30 KRIKAEEKEEERRIDEMMEEERLKALAEEEERERKR 65
Score = 28.3 bits (64), Expect = 7.8
Identities = 14/51 (27%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 335 KKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHDHDEL 385
+ KE++ E ER+ E + ++++ + E+E E E + EE E + L
Sbjct: 138 IERKEEEKEREREEELKIL-EYQREKAEREEEREAERRERKEEKEREVARL 187
>gnl|CDD|151958 pfam11521, TFIIE-A_C-term, C-terminal general transcription factor
TFIIE alpha. TFIIE is compiled of two subunits, alpha
and beta. This family of proteins are the C terminal
domain of the alpha subunit of the protein which is the
largest subunit and contains several functional domains
which are important for basal transcription and cell
growth. The C terminal end of the protein binds directly
to the amino-terminal PH domain of p62/Tfb1 (of IIH)
which is involved in the recruitment of the general
transcription factor IIH to the transcription
preinitiation complex. P53 competes for the same binding
site as TFIIE alpha which shows their structural
similarity. Like p53, TFIIE alpha 336-439 can activate
transcription in vivo.
Length = 86
Score = 26.7 bits (59), Expect = 6.1
Identities = 9/32 (28%), Positives = 18/32 (56%)
Query: 341 QDEEERKAEEAAAKDKKKDEDDDEDEDEEETT 372
+ +++ + E A +E+DDED++ EE
Sbjct: 7 ESDDDSPSPEPPAGQSVDEEEDDEDDEFEEVE 38
>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
Length = 382
Score = 28.7 bits (65), Expect = 6.3
Identities = 12/56 (21%), Positives = 22/56 (39%), Gaps = 4/56 (7%)
Query: 316 IATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEET 371
+ AK+ + E E E+ +EEE +A AA + + E ++
Sbjct: 213 VDFAKELVAILLGEEEEEE----EEVEEEEAQASPAAEPATAQAAPAPKQEQQQAP 264
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 27.4 bits (61), Expect = 6.3
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 348 AEEAAAKDKKKDEDDDEDEDEEE 370
AE AA D+ ++E+ +E+ +EE
Sbjct: 77 AEAAAEADEAEEEEKEEEAEEES 99
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 28.6 bits (64), Expect = 6.5
Identities = 14/48 (29%), Positives = 21/48 (43%)
Query: 334 EKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHEHD 381
E++ Q EE EE A + D D+DE E + EE + +
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEE 48
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 28.6 bits (64), Expect = 7.1
Identities = 14/68 (20%), Positives = 22/68 (32%)
Query: 318 TAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
KK E KK + + +EE AK +K E + + E
Sbjct: 49 PPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESE 108
Query: 378 HEHDHDEL 385
E + +E
Sbjct: 109 EEEEPEET 116
>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein. This family consists of mammalian
MRVI1 proteins which are related to the
lymphoid-restricted membrane protein (JAW1) and the IP3
receptor associated cGMP kinase substrates A and B
(IRAGA and IRAGB). The function of MRVI1 is unknown
although mutations in the Mrvi1 gene induces myeloid
leukaemia by altering the expression of a gene important
for myeloid cell growth and/or differentiation so it has
been speculated that Mrvi1 is a tumour suppressor gene.
IRAG is very similar in sequence to MRVI1 and is an
essential NO/cGKI-dependent regulator of IP3-induced
calcium release. Activation of cGKI decreases
IP3-stimulated elevations in intracellular calcium,
induces smooth muscle relaxation and contributes to the
antiproliferative and pro-apoptotic effects of NO/cGMP.
Jaw1 is a member of a class of proteins with
COOH-terminal hydrophobic membrane anchors and is
structurally similar to proteins involved in vesicle
targeting and fusion. This suggests that the function
and/or the structure of the ER in lymphocytes may be
modified by lymphoid-restricted resident ER proteins.
Length = 538
Score = 28.4 bits (63), Expect = 7.6
Identities = 15/42 (35%), Positives = 21/42 (50%)
Query: 332 EGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTG 373
E EKK K+ QD E + EEA + +K ++ E EE
Sbjct: 415 EEEKKTKKLQDLREPEGEEAVERTRKPSLSEEVAETEEWDKE 456
>gnl|CDD|221323 pfam11931, DUF3449, Domain of unknown function (DUF3449). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 181 to 207 amino acids in length. This domain
has two conserved sequence motifs: PIP and CEICG. The
domain carries a zinc-finger domain of the C2H2-type.
Length = 187
Score = 27.6 bits (62), Expect = 7.7
Identities = 10/33 (30%), Positives = 19/33 (57%)
Query: 338 KEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEE 370
K+ EEER+A + + ++ D+D D++E
Sbjct: 28 KQALTEEERQASADESSEDASEDGSDDDSDDDE 60
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 27.4 bits (61), Expect = 7.7
Identities = 9/51 (17%), Positives = 13/51 (25%), Gaps = 1/51 (1%)
Query: 328 KPAFEGEKKMKEKQDEEERKAEEAAAKDKK-KDEDDDEDEDEEETTGKAEE 377
KP+ + + AE K KK E + A
Sbjct: 99 KPSKLELFNAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAA 149
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 27.8 bits (62), Expect = 8.0
Identities = 14/28 (50%), Positives = 16/28 (57%)
Query: 343 EEERKAEEAAAKDKKKDEDDDEDEDEEE 370
EE K EE K K K EDD +D D E+
Sbjct: 24 EEYEKLEELRKKLKAKKEDDSDDTDYED 51
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 26.8 bits (60), Expect = 8.1
Identities = 13/36 (36%), Positives = 18/36 (50%)
Query: 335 KKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEE 370
K K+K+D E AE KD+ K + + E E E
Sbjct: 61 KAKKKKEDAEALIAEVKELKDELKALEAELRELEAE 96
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 27.3 bits (61), Expect = 8.2
Identities = 13/40 (32%), Positives = 24/40 (60%), Gaps = 4/40 (10%)
Query: 337 MKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAE 376
+E + +++ + EE ++ +EDDDEDE EEE + +
Sbjct: 109 SEEDESDDDEEDEE----EEDDEEDDDEDESEEEESPVKK 144
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 26.9 bits (60), Expect = 8.2
Identities = 8/21 (38%), Positives = 16/21 (76%)
Query: 348 AEEAAAKDKKKDEDDDEDEDE 368
A A K++KK+E+++E +D+
Sbjct: 77 AAAEAKKEEKKEEEEEESDDD 97
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 27.3 bits (61), Expect = 8.5
Identities = 15/48 (31%), Positives = 26/48 (54%)
Query: 330 AFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEE 377
E E+ +E +D+E EE D ++DE+ D +E++ T + EE
Sbjct: 10 GEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEE 57
>gnl|CDD|233940 TIGR02583, DevR_archaea, CRISPR-associated protein Cas7/Csa2,
subtype I-A/APERN. CRISPR is a term for Clustered
Regularly Interspaced Short Palidromic Repeats. A number
of protein families appear only in association with
these repeats and are designated Cas (CRISPR associated)
proteins. This model represents one such family,
typified by MJ0381 of Methanococcus jannaschii. This
archaeal clade is a member of the DevR family
(TIGR01875) which includes the DevR protein of
Myxococcus xanthus, a protein whose expression appears
to regulated through a number of means, including both
location and autorepression; DevR mutants are incapable
of fruiting body development. This subfamily is found in
a CRISPR/Cas locus we designate APERN, so the family is
designated Csa2, for CRISPR/Cas Subtype Protein 2
[Mobile and extrachromosomal element functions, Other].
Length = 285
Score = 27.8 bits (62), Expect = 8.7
Identities = 13/28 (46%), Positives = 16/28 (57%)
Query: 326 IWKPAFEGEKKMKEKQDEEERKAEEAAA 353
I P F EK+ E++DE ERK AA
Sbjct: 200 IGVPQFLPEKEGGEEEDERERKRRIEAA 227
>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
peptidyl-prolyl isomerase. This family is only found at
the amino terminus of pfam00254. This domain is of
unknown function.
Length = 124
Score = 27.2 bits (61), Expect = 8.7
Identities = 10/25 (40%), Positives = 16/25 (64%), Gaps = 1/25 (4%)
Query: 334 EKKMKEKQDEE-ERKAEEAAAKDKK 357
+KK++ KQ + E+KAEE A +
Sbjct: 78 QKKLQAKQQAKAEKKAEENKAAGEA 102
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 27.9 bits (62), Expect = 9.1
Identities = 15/41 (36%), Positives = 21/41 (51%)
Query: 318 TAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKK 358
TA K A+ K A + KK + + +KA +AAAK K
Sbjct: 244 TAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKG 284
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 28.1 bits (63), Expect = 9.1
Identities = 14/38 (36%), Positives = 24/38 (63%)
Query: 342 DEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHE 379
D E+ + ++ AAK K+D D++ E++EE + EE E
Sbjct: 131 DSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAE 168
Score = 27.7 bits (62), Expect = 9.7
Identities = 13/38 (34%), Positives = 20/38 (52%)
Query: 342 DEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEHE 379
DEEE+ AK+ +E +EDE+E +AE +
Sbjct: 134 DEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEK 171
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 27.7 bits (62), Expect = 9.3
Identities = 21/65 (32%), Positives = 35/65 (53%), Gaps = 14/65 (21%)
Query: 312 ITDDIATAKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAK-------DKKKDEDDDE 364
IT D+ ++ E + E++ +EK+DEEER+ AAK K+K ++ +
Sbjct: 108 ITSDVRKQLRFLEQL-------EREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQK 160
Query: 365 DEDEE 369
+EDEE
Sbjct: 161 EEDEE 165
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 28.4 bits (63), Expect = 9.4
Identities = 10/39 (25%), Positives = 21/39 (53%)
Query: 336 KMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGK 374
+M+EK DE++ + K+K D+ ++ D ++ K
Sbjct: 16 RMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVK 54
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 28.1 bits (62), Expect = 9.5
Identities = 22/63 (34%), Positives = 29/63 (46%), Gaps = 10/63 (15%)
Query: 319 AKKYAEDIWKPAFEGEKKMKEKQDEEERKAEEAAAKDKKKDEDDDEDEDEEETTGKAEEH 378
AKK E + K E E+K +E E ER+ E K+ E + E E E E KA
Sbjct: 578 AKKREEAVEKAKREAEQKARE---EREREKE-------KEKEREREREREAERAAKASSS 627
Query: 379 EHD 381
H+
Sbjct: 628 SHE 630
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 28.3 bits (64), Expect = 9.5
Identities = 15/47 (31%), Positives = 23/47 (48%), Gaps = 4/47 (8%)
Query: 316 IATAKKYAEDIWKPAFEGEKK----MKEKQDEEERKAEEAAAKDKKK 358
I AKK A++I K + +K +K + E RK A + K+K
Sbjct: 579 IKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEK 625
>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ. Members of
this family are the FliJ protein found, in nearly every
case, in the midst of other flagellar biosynthesis genes
in bacgterial genomes. Typically the fliJ gene is found
adjacent to the gene for the flagellum-specific ATPase
FliI. Sequence scoring in the gray zone between trusted
and noise cutoffs include both probable FliJ proteins
and components of bacterial type III secretion systems.
Length = 141
Score = 26.9 bits (60), Expect = 9.8
Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 4/42 (9%)
Query: 322 YAEDIWKPAFEGEK---KMKEKQDEEERKAEEAAAKDKKKDE 360
+ A K K+KEK+ +E R EEA + K+ DE
Sbjct: 93 AKRERLLEARRELKALEKLKEKKQKEYRA-EEAKREQKEMDE 133
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.135 0.423
Gapped
Lambda K H
0.267 0.0634 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,898,749
Number of extensions: 2095389
Number of successful extensions: 6341
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4926
Number of HSP's successfully gapped: 591
Length of query: 385
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 286
Effective length of database: 6,546,556
Effective search space: 1872315016
Effective search space used: 1872315016
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 60 (27.1 bits)