RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4218
(520 letters)
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 250 bits (640), Expect = 2e-80
Identities = 135/231 (58%), Positives = 165/231 (71%), Gaps = 17/231 (7%)
Query: 160 DYDSKRDRWAGYDPSNHKAIVEHYQKIEEAKRELRADKLDAGVSIDNRGSKEGDEDKDDL 219
DYD+KRDRW GYDPS +K +VE Y+K+EE +++L+A K D D D +D
Sbjct: 3 DYDAKRDRWNGYDPSEYKEVVEEYEKLEELRKKLKAKKEDDSD----------DTDYED- 51
Query: 220 EEDEDKYVDEVDMPGTKVDSKQRITVRNLRIREDTAKYLRNLDPNSAYYDPKTRSMRDNP 279
EDE+ Y+DE DM + DSK R TVRNLRIREDTAKYL NLD NSAYYDPK+RSMRD+P
Sbjct: 52 -EDEELYLDESDMGDS--DSKTRTTVRNLRIREDTAKYLLNLDSNSAYYDPKSRSMRDDP 108
Query: 280 YETVANPEDVDYAGENFVRFSGDTRKHATAQLFAWEAYEKGVDVHVLAEPTKLELLKKEY 339
A+PEDV +AG+NFVR SG+ + Q FAWEA EKG DVH+ A PTKLELL+K+
Sbjct: 109 L---ADPEDVLFAGDNFVRLSGEALEFEELQKFAWEAAEKGGDVHLQANPTKLELLRKKE 165
Query: 340 EEKKEEFKKEVSSSVIDKYGGAEHLQAPPKALLLAQTEDYVEYSRYGKVIK 390
+EKKE+ K + S+++KYGG EHL PPK LLL Q+EDYVEY R GK K
Sbjct: 166 KEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKKKK 216
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 36.2 bits (84), Expect = 0.025
Identities = 18/58 (31%), Positives = 28/58 (48%)
Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
DE+E++ +++ K K KK + EEK K +KEE+ RE + DE
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAE 95
Score = 33.1 bits (76), Expect = 0.25
Identities = 13/49 (26%), Positives = 26/49 (53%)
Query: 440 QDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
+E++E+ +K K K+K KK K K E++ + K+E+ + +
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86
Score = 30.4 bits (69), Expect = 1.8
Identities = 13/50 (26%), Positives = 32/50 (64%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREA 487
E +D+D +DS +++ + K+ +K K + A++ +K ++++E+ +RE
Sbjct: 27 EDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREK 76
Score = 28.5 bits (64), Expect = 8.6
Identities = 18/62 (29%), Positives = 30/62 (48%), Gaps = 4/62 (6%)
Query: 441 DEDEEDSGHKKQ----KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDER 496
+EDEE K + K KK+ K K E+++ A+ + +E +E EE D L
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRL 99
Query: 497 KR 498
++
Sbjct: 100 RK 101
>gnl|CDD|130009 TIGR00934, 2a38euk, potassium uptake protein, Trk family. The
proteins of the Trk family are derived from
Gram-negative and Gram-positive bacteria, yeast and
wheat. The proteins of E. coli K12 TrkH and TrkG as well
as several yeast proteins have been functionally
characterized.The E. coli TrkH and TrkG proteins are
complexed to two peripheral membrane proteins, TrkA, an
NAD-binding protein, and TrkE, an ATP-binding protein.
This complex forms the potassium uptake system. This
family is specific for the eukaryotic Trk system
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 800
Score = 36.9 bits (85), Expect = 0.030
Identities = 20/141 (14%), Positives = 43/141 (30%), Gaps = 3/141 (2%)
Query: 206 NRGSKEGDEDKDDLEEDEDKYVDEVDMPGTKVDSKQRITVRNLRIREDTAKYLRNLDPNS 265
++ D ++ D +D +K + + R NL+ R+ + +
Sbjct: 309 IDRAEACDLEELDRAKDFEKMTYDNWKAHHRKKKNFRPRGWNLKFRKASRFPKDSDRNYE 368
Query: 266 AYYDPKTRS---MRDNPYETVANPEDVDYAGENFVRFSGDTRKHATAQLFAWEAYEKGVD 322
+ + S + P + Y + ++ +T L + +
Sbjct: 369 DNGNHLSASSSFGSEEPSLSSEENLYPTYNKKREDSRHTLSKTMSTNYLSWQPTIGRNSN 428
Query: 323 VHVLAEPTKLELLKKEYEEKK 343
L + K EL EY K
Sbjct: 429 FVGLTKEQKDELGGIEYRALK 449
>gnl|CDD|215323 PLN02598, PLN02598, omega-6 fatty acid desaturase.
Length = 421
Score = 36.0 bits (83), Expect = 0.039
Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 10/54 (18%)
Query: 24 PKKKSREDWRKAKELEEARKAGTA----PAAVDEHGRDINPHIPQYISSA-PWY 72
P K++RE W A +A+ GT PA ++ DI+ HIP +ISS P Y
Sbjct: 305 PFKQARE-WNAA----QAQLNGTVHCDYPAWIEFLCHDISVHIPHHISSKIPSY 353
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 35.1 bits (81), Expect = 0.049
Identities = 17/48 (35%), Positives = 29/48 (60%)
Query: 436 RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
E + E+EE KK+K+ KK KK+KK+K++ E KK++++
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191
Score = 30.1 bits (68), Expect = 1.8
Identities = 19/75 (25%), Positives = 40/75 (53%), Gaps = 1/75 (1%)
Query: 424 GASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
GA GPP + + E E E + +K+ + ++ +KK+K+K+K +++ KE K+E+
Sbjct: 119 GAPDGPPSELGSESET-SEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177
Query: 484 QREADRLLRMDERKR 498
+ ++K+
Sbjct: 178 VEPKGSKKKKKKKKK 192
Score = 29.7 bits (67), Expect = 3.0
Identities = 17/73 (23%), Positives = 41/73 (56%), Gaps = 1/73 (1%)
Query: 428 GPPPGIMNRPEHQ-DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQRE 486
G P G + + + E+++ K +K+ + +++KKEK+K E K ++ KK+++E+
Sbjct: 119 GAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMV 178
Query: 487 ADRLLRMDERKRS 499
+ + ++K+
Sbjct: 179 EPKGSKKKKKKKK 191
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 35.9 bits (82), Expect = 0.063
Identities = 14/61 (22%), Positives = 30/61 (49%), Gaps = 2/61 (3%)
Query: 440 QDEDEEDSGHKKQKKNKKSKKKKK--EKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
E + + + +++K + EK+K +K +A KK+ EE+++AD + E
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEED 1403
Query: 498 R 498
+
Sbjct: 1404 K 1404
Score = 35.1 bits (80), Expect = 0.12
Identities = 15/49 (30%), Positives = 28/49 (57%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
K + K + ++ ++AEEK + A KK+EE +++AD + E K+
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK 1391
Score = 34.3 bits (78), Expect = 0.22
Identities = 19/57 (33%), Positives = 34/57 (59%)
Query: 443 DEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRS 499
+E + +KK +++KKK E +K+AE K K K+ EE ++AD + +E K++
Sbjct: 1480 EEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKA 1536
Score = 34.0 bits (77), Expect = 0.23
Identities = 21/62 (33%), Positives = 31/62 (50%), Gaps = 1/62 (1%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK-EALKKEEEEQREADRLLRMDER 496
E + + EE + KK + KK E +K AEE K E KK+ EE ++AD + E
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
Query: 497 KR 498
+
Sbjct: 1482 AK 1483
Score = 34.0 bits (77), Expect = 0.27
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 2/58 (3%)
Query: 444 EEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEA--LKKEEEEQREADRLLRMDERKRS 499
EE + KK + KK E +K AEE K+A KK E +++AD + +E K++
Sbjct: 1467 EEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKA 1524
Score = 34.0 bits (77), Expect = 0.28
Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 1/57 (1%)
Query: 443 DEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK-EALKKEEEEQREADRLLRMDERKR 498
+E+ + KK +KKK E +K AEEK K + KK+ EE ++AD + E +
Sbjct: 1401 EEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1457
Score = 33.6 bits (76), Expect = 0.33
Identities = 19/62 (30%), Positives = 32/62 (51%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
E + + EE + K KKK ++ K AEE K K+ EE ++AD + +E+K
Sbjct: 1487 EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKK 1546
Query: 498 RS 499
++
Sbjct: 1547 KA 1548
Score = 33.6 bits (76), Expect = 0.34
Identities = 18/57 (31%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 443 DEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK-EALKKEEEEQREADRLLRMDERKR 498
+E+ + +KK ++ KKK E +K+A K K + KK+ EE+++AD + E +
Sbjct: 1388 EEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAK 1444
Score = 33.6 bits (76), Expect = 0.37
Identities = 20/65 (30%), Positives = 36/65 (55%), Gaps = 4/65 (6%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKK----EKEKSAEEKMKEALKKEEEEQREADRLLRM 493
E + +EE+ ++ K + KKK +K + E+K EALKKE EE ++A+ L +
Sbjct: 1651 ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKK 1710
Query: 494 DERKR 498
+ ++
Sbjct: 1711 EAEEK 1715
Score = 33.2 bits (75), Expect = 0.49
Identities = 18/62 (29%), Positives = 29/62 (46%), Gaps = 1/62 (1%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK-EALKKEEEEQREADRLLRMDER 496
E + + KK + KKK E +K AEE K + KK+ EE ++A+ + E
Sbjct: 1409 ELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEE 1468
Query: 497 KR 498
+
Sbjct: 1469 AK 1470
Score = 32.8 bits (74), Expect = 0.49
Identities = 23/59 (38%), Positives = 35/59 (59%), Gaps = 2/59 (3%)
Query: 442 EDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK--EALKKEEEEQREADRLLRMDERKR 498
E E+ +KK +++KKK +K AEEK K EA KK EE++++AD L + K+
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKK 1418
Score = 32.8 bits (74), Expect = 0.52
Identities = 23/63 (36%), Positives = 34/63 (53%), Gaps = 2/63 (3%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK--EALKKEEEEQREADRLLRMDE 495
E + + EE ++ KK + KK E +K AEE K EA KK EE +++AD + E
Sbjct: 1448 EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAE 1507
Query: 496 RKR 498
K+
Sbjct: 1508 AKK 1510
Score = 32.8 bits (74), Expect = 0.56
Identities = 21/75 (28%), Positives = 40/75 (53%), Gaps = 4/75 (5%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK----EALKKEEEEQREADRLLRM 493
E + +EE ++ KK + +KKK E+ K AEE+ K E KK EE++++A+ +
Sbjct: 1624 ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKA 1683
Query: 494 DERKRSYNSMISVKE 508
+E ++ + +
Sbjct: 1684 EEDEKKAAEALKKEA 1698
Score = 32.4 bits (73), Expect = 0.83
Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 2/58 (3%)
Query: 443 DEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEA--LKKEEEEQREADRLLRMDERKR 498
+E KK + KKK E +K AEE K+A LKK +++AD + E K+
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKK 1431
Score = 32.0 bits (72), Expect = 0.90
Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 4/78 (5%)
Query: 442 EDEEDSGHKKQKKNKKSKKKKKEKEKSAEE----KMKEALKKEEEEQREADRLLRMDERK 497
E EE ++ KK + +KKK E+ K AEE K +EA K+ EE++++A+ + +E K
Sbjct: 1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEK 1756
Query: 498 RSYNSMISVKEPTMDSDR 515
+ + +E + R
Sbjct: 1757 KKIAHLKKEEEKKAEEIR 1774
Score = 32.0 bits (72), Expect = 0.90
Identities = 16/57 (28%), Positives = 28/57 (49%)
Query: 443 DEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRS 499
+E + +KK +++KKK +K AEE K A + E + AD +E+ +
Sbjct: 1312 EEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEA 1368
Score = 32.0 bits (72), Expect = 1.1
Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 1/62 (1%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK-EALKKEEEEQREADRLLRMDER 496
E + + EE + KK + KK +E +K AEE K + KK+ EE ++AD + E
Sbjct: 1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEE 1494
Query: 497 KR 498
+
Sbjct: 1495 AK 1496
Score = 31.6 bits (71), Expect = 1.1
Identities = 15/65 (23%), Positives = 38/65 (58%)
Query: 434 MNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRM 493
+ + E +++ + + K +++NK ++ +K + ++K +EA K EE+E++ A+ L +
Sbjct: 1638 LKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKE 1697
Query: 494 DERKR 498
E +
Sbjct: 1698 AEEAK 1702
Score = 31.6 bits (71), Expect = 1.2
Identities = 19/64 (29%), Positives = 35/64 (54%), Gaps = 3/64 (4%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEE---KMKEALKKEEEEQREADRLLRMD 494
E + +E+ ++ KK ++ +KK E K E K +E KKE EE+++A+ L + +
Sbjct: 1666 EAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAE 1725
Query: 495 ERKR 498
E +
Sbjct: 1726 EENK 1729
Score = 30.9 bits (69), Expect = 1.9
Identities = 20/63 (31%), Positives = 36/63 (57%), Gaps = 1/63 (1%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEE-KMKEALKKEEEEQREADRLLRMDER 496
E + DE ++ KK ++KKK +E +K A+E K KK+ +E ++A+ + DE
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527
Query: 497 KRS 499
K++
Sbjct: 1528 KKA 1530
Score = 30.5 bits (68), Expect = 2.7
Identities = 17/63 (26%), Positives = 35/63 (55%)
Query: 436 RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDE 495
+ + + + D K ++ K + KK E+ K A+E K KK+ +E ++A+ L + +E
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE 1562
Query: 496 RKR 498
+K+
Sbjct: 1563 KKK 1565
Score = 30.5 bits (68), Expect = 2.7
Identities = 18/66 (27%), Positives = 31/66 (46%), Gaps = 2/66 (3%)
Query: 436 RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK--EALKKEEEEQREADRLLRM 493
+ + + EE + KK + KK E +K AEE K +A KK+ EE ++A +
Sbjct: 1291 KADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350
Query: 494 DERKRS 499
+ +
Sbjct: 1351 EAEAAA 1356
Score = 30.5 bits (68), Expect = 2.8
Identities = 17/55 (30%), Positives = 31/55 (56%)
Query: 445 EDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRS 499
++ K + K + KK ++ K AEE K K+ EE+++AD L + +E K++
Sbjct: 1506 AEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKA 1560
Score = 30.1 bits (67), Expect = 3.4
Identities = 16/63 (25%), Positives = 32/63 (50%), Gaps = 5/63 (7%)
Query: 442 EDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEE-----EEQREADRLLRMDER 496
++ + + +KK +KKK +E +K+AE EA + EE+ EA + + +
Sbjct: 1318 DEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377
Query: 497 KRS 499
K++
Sbjct: 1378 KKA 1380
Score = 29.0 bits (64), Expect = 7.5
Identities = 22/65 (33%), Positives = 33/65 (50%)
Query: 434 MNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRM 493
+ +D+ + + K +++ KK KKE+EK AEE KE EEE E D RM
Sbjct: 1735 AKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRM 1794
Query: 494 DERKR 498
+ K+
Sbjct: 1795 EVDKK 1799
Score = 29.0 bits (64), Expect = 7.8
Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 4/72 (5%)
Query: 441 DEDEEDSGHKKQKKNKKSKKKKK--EKEKSAEEKMK--EALKKEEEEQREADRLLRMDER 496
DE ++ KK + KK+++KKK E +K AEE K EA KK EE +++AD + E
Sbjct: 1281 DELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEE 1340
Query: 497 KRSYNSMISVKE 508
+ +
Sbjct: 1341 AKKAAEAAKAEA 1352
Score = 28.6 bits (63), Expect = 9.8
Identities = 16/54 (29%), Positives = 29/54 (53%)
Query: 445 EDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
E++ + K + KKK E +K+ E+K + KK+ EE ++AD + E +
Sbjct: 1275 EEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 34.5 bits (80), Expect = 0.064
Identities = 14/46 (30%), Positives = 27/46 (58%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDE 495
+++ K + +K++ EK + E EA++K EEE+R+ + DE
Sbjct: 126 EQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADE 171
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 33.7 bits (78), Expect = 0.086
Identities = 18/40 (45%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKE-EEEQREADRLL 491
K+ KK KKKKKE E E+K K + +E +E+Q+ D L
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAAL 121
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 34.2 bits (79), Expect = 0.16
Identities = 14/59 (23%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 440 QDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
++E+E+ ++++ +++++KK+EK+K E L EEQR+ + R + ++
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKL--SPEEQRKLEEKERKKQARK 322
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 33.0 bits (76), Expect = 0.19
Identities = 15/46 (32%), Positives = 30/46 (65%), Gaps = 4/46 (8%)
Query: 450 KKQKKNKKSKKKKKEKEK----SAEEKMKEALKKEEEEQREADRLL 491
KK KK KK K+K++++ + ++++K+A ++ + E+ E DR L
Sbjct: 15 KKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDREL 60
>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
Length = 613
Score = 33.5 bits (78), Expect = 0.25
Identities = 13/41 (31%), Positives = 27/41 (65%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
K+ ++ K+ K++ +E+ K E++KEAL + ++ R + RL
Sbjct: 478 KEDEEEKEEKEEAEEEFKPLLERLKEALGDKVKDVRLSHRL 518
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 32.7 bits (75), Expect = 0.30
Identities = 15/57 (26%), Positives = 27/57 (47%), Gaps = 5/57 (8%)
Query: 448 GHKKQKKNKKSKKK-KKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYNSM 503
+K K+ +K K+ +KE ++ E + LKK +E+Q E D+R+
Sbjct: 70 DQEKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQMEM----MDDQRELMKMQF 122
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 33.0 bits (76), Expect = 0.31
Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 2/58 (3%)
Query: 462 KKEKEKSAEEKMKEALKKEEEEQREADRLL--RMDERKRSYNSMISVKEPTMDSDRGN 517
KEK AE EA + E+E RE + M+ ++RSY + M+++R
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREK 258
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 33.0 bits (76), Expect = 0.37
Identities = 19/39 (48%), Positives = 27/39 (69%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
KK+KK K KKKE+E+ E++ KE K+EEEE+ E +
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Score = 33.0 bits (76), Expect = 0.39
Identities = 13/40 (32%), Positives = 26/40 (65%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
+K KK +++KKEK+K A K+ ++EEE++++ +
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Score = 32.2 bits (74), Expect = 0.61
Identities = 17/40 (42%), Positives = 25/40 (62%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
KK KK + +KK+E+EK ++K A KK+EEE+ E
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449
Score = 32.2 bits (74), Expect = 0.63
Identities = 11/46 (23%), Positives = 25/46 (54%)
Query: 443 DEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
++ + K+ K KK+++EK++ ++ K+EEEE+ +
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450
Score = 32.2 bits (74), Expect = 0.65
Identities = 16/51 (31%), Positives = 31/51 (60%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
+ +EE KK+ K K++++E+EK +E+ KE ++E EE++E +
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469
Score = 31.8 bits (73), Expect = 1.0
Identities = 17/41 (41%), Positives = 26/41 (63%)
Query: 449 HKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
KK+++ KK KKKK K EE+ +E +K+EEE+ E +
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460
Score = 30.3 bits (69), Expect = 2.5
Identities = 14/48 (29%), Positives = 29/48 (60%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
++++K +K KK K+K EE+ ++ K+EE+E+ E + +E +
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEE 470
Score = 29.5 bits (67), Expect = 4.8
Identities = 17/66 (25%), Positives = 34/66 (51%), Gaps = 1/66 (1%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYNSMISVKEP 509
K +K +K++KK++E++K ++K KKEEEE+ E ++ E + + +E
Sbjct: 411 KIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE-EAEEEKEEE 469
Query: 510 TMDSDR 515
+
Sbjct: 470 EEKKKK 475
Score = 29.5 bits (67), Expect = 4.9
Identities = 14/50 (28%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Query: 441 DEDEED--SGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
E+E + +G KK K K +K EK++ E+K K+ +++ E +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEE 444
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 33.3 bits (76), Expect = 0.40
Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 4/78 (5%)
Query: 432 GIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLL 491
+ E + EE+ ++K KK +K K+++ K + KEA K + +Q +
Sbjct: 3 RTESEAEKKILTEEEL----ERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNV 58
Query: 492 RMDERKRSYNSMISVKEP 509
K+S + + P
Sbjct: 59 PKKSEKKSRKRDVEDENP 76
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 33.0 bits (76), Expect = 0.42
Identities = 13/44 (29%), Positives = 21/44 (47%), Gaps = 4/44 (9%)
Query: 436 RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKK 479
RP+ + S K K +KK K+ EK++ +EK K +
Sbjct: 377 RPKTK----APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416
Score = 28.8 bits (65), Expect = 8.9
Identities = 13/41 (31%), Positives = 19/41 (46%), Gaps = 5/41 (12%)
Query: 449 HKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
K + K K KK K AE+K +KE+E+ + R
Sbjct: 380 TKAPSEKKTGKPSKKVLAKRAEKK-----EKEKEKPKVKKR 415
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 31.7 bits (73), Expect = 0.42
Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 2/33 (6%)
Query: 449 HKKQKK--NKKSKKKKKEKEKSAEEKMKEALKK 479
KK+ K K KK +K K K+ ++K K+ K
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 30.6 bits (70), Expect = 0.95
Identities = 11/32 (34%), Positives = 19/32 (59%)
Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEK 472
D+ E + K +KK +K+K K ++K+ EE
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 32.2 bits (73), Expect = 0.46
Identities = 20/58 (34%), Positives = 28/58 (48%), Gaps = 2/58 (3%)
Query: 429 PPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQRE 486
P PG + P+ D E+D + KK +KKKKE EK A+ K E + + E
Sbjct: 68 PSPG--HPPQTNDTHEKDLALQPPPGGKKKEKKKKETEKPAQGGEKPDQGPEAKGEGE 123
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 32.4 bits (74), Expect = 0.46
Identities = 19/49 (38%), Positives = 29/49 (59%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
K Q K +K K+KKKEKE + EE++ EA + E + +R +E K+
Sbjct: 166 KIQAKARKRKEKKKEKELTQEERLAEAKETERINLKSLERYEEQEEEKK 214
>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
Length = 530
Score = 32.8 bits (75), Expect = 0.55
Identities = 15/49 (30%), Positives = 19/49 (38%)
Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
E E+SG K KK K K EK + A E+E+ A
Sbjct: 3 QEAVEESGEKISKKAAKKAAAKAEKLRREATAKAAAASLEDEDDPLASN 51
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 31.6 bits (72), Expect = 0.56
Identities = 18/73 (24%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 436 RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDE 495
R + ++ +EE+ +K+ + K+ ++K+E+E E+ +E K+EEE + ++ + E
Sbjct: 17 RRQQREAEEEEREERKKLEEKREGERKEEEEL---EEEREKKKEEEERKEREEQARKEQE 73
Query: 496 RKRSYNSMISVKE 508
S V+E
Sbjct: 74 EYEKLKSSFVVEE 86
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 31.1 bits (71), Expect = 0.59
Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 7/40 (17%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
KK +K ++ +K+ EKE A E+ K+EEE REA R
Sbjct: 79 KKNEKEERELRKRAEKE--ALEQ-----AKKEEELREAKR 111
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 32.6 bits (75), Expect = 0.61
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 5/52 (9%)
Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEA----LKKEEEEQREAD 488
E KKQ+K +K K +K+E EK+A +K EA K + E ++ D
Sbjct: 407 SPAERKKLRKKQRKAEK-KAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVD 457
Score = 28.7 bits (65), Expect = 8.6
Identities = 16/50 (32%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 439 HQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
+E+E ++G+ + KK +KK+++ EK AE+ +EA K +++ EA
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQRKAEKKAEK--EEAEKAAAKKKAEAA 442
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 32.3 bits (74), Expect = 0.78
Identities = 15/50 (30%), Positives = 26/50 (52%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRS 499
+K++K ++K+ K K EEK K+ L+K E+ + + E K S
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYS 605
Score = 28.8 bits (65), Expect = 7.8
Identities = 17/84 (20%), Positives = 41/84 (48%), Gaps = 12/84 (14%)
Query: 442 EDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYN 501
ED+ D + +K+ +++KE+++ A ++ K K +++E+++ L ++++ K
Sbjct: 536 EDKPDGPSVWKLDDKEELQREKEEKE-ALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPA 594
Query: 502 SMISVKE-----------PTMDSD 514
+E PT D+D
Sbjct: 595 EFFKRQEDKYSAFDETGLPTHDAD 618
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 31.2 bits (71), Expect = 0.79
Identities = 17/60 (28%), Positives = 33/60 (55%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYNSMISVKEP 509
+K K KK KKKK+K+K ++ K+ ++++E+ D+L + + S +S +P
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKP 141
Score = 30.0 bits (68), Expect = 1.8
Identities = 16/49 (32%), Positives = 25/49 (51%)
Query: 442 EDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
E EE K +KK K KK K + +K ++ K K E+E + + + L
Sbjct: 77 EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDL 125
Score = 29.3 bits (66), Expect = 3.4
Identities = 13/43 (30%), Positives = 23/43 (53%)
Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
++ K +K +KK K +K+ EK AE+K+++ K E
Sbjct: 90 KSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSET 132
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 32.3 bits (74), Expect = 0.80
Identities = 13/37 (35%), Positives = 22/37 (59%)
Query: 454 KNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
+N KKK+ + EEK K+ KKE+ ++ E +R+
Sbjct: 644 RNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKERI 680
Score = 29.2 bits (66), Expect = 6.1
Identities = 13/54 (24%), Positives = 27/54 (50%), Gaps = 3/54 (5%)
Query: 450 KKQKKNKKSKKKKKEK--EKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYN 501
KK K+ +++++++E E + K K+E+E E D L +E++
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGT-KQEQEGSTEEDPSLFSEEKEDPDK 276
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 32.2 bits (73), Expect = 0.86
Identities = 9/39 (23%), Positives = 18/39 (46%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
K++ K K K+ + E + E E ++ E++ D
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGID 836
Score = 31.1 bits (70), Expect = 1.8
Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 3/45 (6%)
Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
K ++K+K KEK + +E E E + E ++ E K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIE--SPSVETEGERCT-IKQREEK 833
>gnl|CDD|136027 PRK06743, PRK06743, flagellar motor protein MotP; Reviewed.
Length = 254
Score = 31.5 bits (71), Expect = 0.90
Identities = 22/75 (29%), Positives = 37/75 (49%), Gaps = 3/75 (4%)
Query: 287 EDVDYAGENFVRFSGDTRKHATAQLFAWEAYEKGVDVHVLAEPTKLELLKKEYEEKKEEF 346
ED++ + FV FS ++KH L + E + VD + + +L L + +E KE
Sbjct: 69 EDLEQLTDLFVDFSKKSKKHG---LLSLEVDGEQVDNPFIQKGIRLMLSGYDEDELKEVL 125
Query: 347 KKEVSSSVIDKYGGA 361
K+V + V + GA
Sbjct: 126 MKDVETEVYELRKGA 140
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 30.5 bits (69), Expect = 1.1
Identities = 13/44 (29%), Positives = 27/44 (61%)
Query: 440 QDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
++ ++E K+ + K++ K KKEK++ E+K+ E L K++
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKST 136
Score = 29.3 bits (66), Expect = 2.5
Identities = 10/39 (25%), Positives = 20/39 (51%)
Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKK 479
+E + K+ K + K+ K K++ E+K K+ +K
Sbjct: 91 FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129
Score = 28.6 bits (64), Expect = 5.5
Identities = 13/30 (43%), Positives = 20/30 (66%)
Query: 458 SKKKKKEKEKSAEEKMKEALKKEEEEQREA 487
S+K++KE K EEK KKE++E++E
Sbjct: 95 SEKEQKEVSKETEEKEAIKAKKEKKEKKEK 124
Score = 28.2 bits (63), Expect = 7.4
Identities = 11/47 (23%), Positives = 26/47 (55%)
Query: 437 PEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
PE ++++++ + ++K KK+K+++K + K A KK +
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKT 138
>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
Endonuclease that cleaves the 5'-overhanging flap
structure that is generated by displacement synthesis
when DNA polymerase encounters the 5'-end of a
downstream Okazaki fragment. Has 5'-endo-/exonuclease
and 5'-pseudo-Y-endonuclease activities. Cleaves the
junction between single and double-stranded regions of
flap DNA.
Length = 338
Score = 31.5 bits (72), Expect = 1.2
Identities = 14/45 (31%), Positives = 28/45 (62%), Gaps = 3/45 (6%)
Query: 454 KNKKSKKKKKEKEKSAEEKMKEALKKEEEE--QREADRLLRMDER 496
K + +++++ +E+ AEEK +EAL+K + E ++ A R R+
Sbjct: 87 KAETLEERREIREE-AEEKWEEALEKGDLEEARKYAQRSSRLTSE 130
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 30.9 bits (70), Expect = 1.2
Identities = 14/38 (36%), Positives = 20/38 (52%)
Query: 451 KQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
+K K KKK+KEK EE+ K ++ E+ E D
Sbjct: 59 PRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENEND 96
Score = 29.3 bits (66), Expect = 3.8
Identities = 17/53 (32%), Positives = 26/53 (49%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYNS 502
+K KKSKKK KEK E+K + K EE + + D + + S ++
Sbjct: 60 RKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASA 112
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 31.6 bits (72), Expect = 1.2
Identities = 17/64 (26%), Positives = 29/64 (45%), Gaps = 4/64 (6%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALK---KEEEEQRE-ADRLLRMDERKRSYNSMIS 505
+ + +K + + KS EE +K A K + EE R+ +RL +DE ++
Sbjct: 78 GRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLE 137
Query: 506 VKEP 509
EP
Sbjct: 138 ELEP 141
>gnl|CDD|187750 cd09233, ACE1-Sec16-like, Ancestral coatomer element 1 (ACE1) of
COPII coat complex assembly protein Sec16. COPII coat
complex plays an important role in vesicular traffic of
newly synthezised proteins from the endoplasmatic
reticulum (ER) to the Golgi apparatus by mediating the
formation of transport vesicles. COPII consists of an
outer coat, made up of the scaffold proteins Sec31 and
Sec13, and the cargo adaptor complex, Sec23 and Sec24,
which are recruited by the small GTPase Sar1. Sec16 is
involved in the early steps of the assembly process.
Sec16 forms elongated heterotetramers with Sec13,
Sec13-(Sec16)2-Sec13. It interacts with Sec13 by
insertion of a single beta-blade to close the
six-bladded beta propeller of Sec13. In the same way
Sec13 interacts with Sec31 and Nup145C, a nuclear pore
protein, all of these contain a structurally related
ancestral coatomer element 1 (ACE1). Sec16 is believed
to be a key component in maintaining the integrity of
the ER exit site.
Length = 314
Score = 31.1 bits (71), Expect = 1.3
Identities = 14/35 (40%), Positives = 17/35 (48%)
Query: 457 KSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLL 491
K K KKK+ K EEK+ E + E E LL
Sbjct: 7 KGKTKKKDVLKWLEEKIAELEENEGYLDLEDKLLL 41
>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain. This family consists
of several ATP synthase E chain sequences which are
components of the CF(0) subunit.
Length = 83
Score = 29.0 bits (65), Expect = 1.3
Identities = 9/47 (19%), Positives = 24/47 (51%)
Query: 432 GIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALK 478
G ++ + ++E+ ++ Q+K + K K+K A+++ K +
Sbjct: 25 GAKHQRSLKKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKSLAE 71
>gnl|CDD|222730 pfam14392, zf-CCHC_4, Zinc knuckle. The zinc knuckle is a zinc
binding motif composed of the the following CX2CX4HX4C
where X can be any amino acid. This particular family is
found in plant proteins.
Length = 49
Score = 28.0 bits (63), Expect = 1.3
Identities = 6/14 (42%), Positives = 9/14 (64%)
Query: 118 CENCGAMTHKKKDC 131
C +CG + H K+C
Sbjct: 34 CFHCGRLGHSDKEC 47
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 31.4 bits (71), Expect = 1.3
Identities = 22/74 (29%), Positives = 40/74 (54%)
Query: 436 RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDE 495
+ + +++ +E+ +K K+ K K+ KEKEK E+K++E +EEE++RE R +
Sbjct: 117 KKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPK 176
Query: 496 RKRSYNSMISVKEP 509
+ KEP
Sbjct: 177 KPPKKKPPNKKKEP 190
Score = 29.1 bits (65), Expect = 6.2
Identities = 17/73 (23%), Positives = 30/73 (41%), Gaps = 1/73 (1%)
Query: 418 ATSSVVGASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK-EA 476
A V P + + K++++ K+ KKKKKEK K + K +
Sbjct: 76 AVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKE 135
Query: 477 LKKEEEEQREADR 489
KE+ +E ++
Sbjct: 136 EAKEKRPPKEKEK 148
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 30.0 bits (68), Expect = 1.4
Identities = 22/85 (25%), Positives = 40/85 (47%), Gaps = 1/85 (1%)
Query: 414 EAGLATSSVVGASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKM 473
G ++ RPE + + +E K K K+ +K+KEKE+ + +
Sbjct: 39 SPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKAL 98
Query: 474 KEALKKEEE-EQREADRLLRMDERK 497
EA K+ E E+++A+ L +E+K
Sbjct: 99 AEAEKERAELEKKKAEAKLMKEEKK 123
Score = 29.3 bits (66), Expect = 2.3
Identities = 20/69 (28%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 429 PPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
PG + E++DED+E + K+ + +K K+K ++K + K +EA K++E+E+R
Sbjct: 38 SSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKL-KAKKEEAEKEKEKEERFMK 96
Query: 489 RLLRMDERK 497
L ++ +
Sbjct: 97 ALAEAEKER 105
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 31.2 bits (72), Expect = 1.4
Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 8/50 (16%)
Query: 451 KQKKNKKSKK--KKKEKEKSAEEKMKEA--LKKE----EEEQREADRLLR 492
+ ++N SK+ + K K + AE + E LK+E E E E + L
Sbjct: 48 QAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAELE 97
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 29.8 bits (67), Expect = 1.4
Identities = 15/35 (42%), Positives = 21/35 (60%)
Query: 437 PEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEE 471
P H D+ + K +KK KKSKK KK K+ S ++
Sbjct: 86 PSHPDDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 29.5 bits (66), Expect = 1.8
Identities = 13/37 (35%), Positives = 21/37 (56%)
Query: 431 PGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEK 467
G + P+ + + KK+KK+KK+KK KK +K
Sbjct: 83 SGEPSHPDDHPPEPTEKPKKKKKKSKKTKKPKKSSKK 119
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 30.9 bits (70), Expect = 1.5
Identities = 18/45 (40%), Positives = 30/45 (66%)
Query: 452 QKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDER 496
QK+NK +K KEK +E+++EAL+ E+EE+ + LL+ +E
Sbjct: 126 QKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEE 170
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 30.5 bits (69), Expect = 1.5
Identities = 13/43 (30%), Positives = 26/43 (60%)
Query: 440 QDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEE 482
ED+ED G +++ + ++ ++E E+S++E KE +K E
Sbjct: 74 NSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTES 116
>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
Length = 432
Score = 30.9 bits (70), Expect = 1.6
Identities = 30/112 (26%), Positives = 51/112 (45%), Gaps = 18/112 (16%)
Query: 413 REAGLATSSVVGASL-------GPPPGIMNRPEHQDEDE----EDSGHKKQKKNKKSKKK 461
R+ G TSS AS G GI+ P+ Q + E +K +
Sbjct: 54 RDDGPETSSGTPASTFADQLKRGVTDGIIGEPQQQQTFQPTPVETQQEEKAVNPFTPQPG 113
Query: 462 KKEKEKS---AEEKMKEALKKEEEEQ--READR--LLRMDERKRSYNSMISV 506
++E+ + +EE+ + LK+E+EEQ RE R + R+ +Y+S ++V
Sbjct: 114 QREERRPTLESEEEWRARLKREQEEQYLRERQRQRMARLQANAAAYDSPLAV 165
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 31.1 bits (70), Expect = 1.7
Identities = 14/68 (20%), Positives = 31/68 (45%), Gaps = 5/68 (7%)
Query: 435 NRPEHQDEDEEDSGHKKQKKNKKSKKK-----KKEKEKSAEEKMKEALKKEEEEQREADR 489
+ + + + D +KQ++ K K KE ++ AE + +E K + E ++ +
Sbjct: 237 DFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEE 296
Query: 490 LLRMDERK 497
L+ + K
Sbjct: 297 ALKAKDHK 304
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 30.8 bits (70), Expect = 1.7
Identities = 12/63 (19%), Positives = 23/63 (36%), Gaps = 4/63 (6%)
Query: 420 SSVVGASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSK----KKKKEKEKSAEEKMKE 475
+S PP + P + H ++ K+K+E +K ++K +
Sbjct: 312 HLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPKRKREGDKKQKKKKSK 371
Query: 476 ALK 478
LK
Sbjct: 372 KLK 374
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 31.0 bits (71), Expect = 1.8
Identities = 19/84 (22%), Positives = 40/84 (47%), Gaps = 7/84 (8%)
Query: 427 LGPPPGIMNRP-EHQDEDEED-----SGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKE 480
LG P I+ + ED+E + ++ ++ + K ++ E EK+KE L+++
Sbjct: 497 LGLPENIIEEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEK 556
Query: 481 EEEQREADRLLRMDERKRSYNSMI 504
+E+ +E + L E ++ I
Sbjct: 557 KEKLQEEEDKLLE-EAEKEAQQAI 579
Score = 30.6 bits (70), Expect = 2.6
Identities = 10/50 (20%), Positives = 31/50 (62%), Gaps = 1/50 (2%)
Query: 449 HKKQKKNKKSKKKKKEKEKS-AEEKMKEALKKEEEEQREADRLLRMDERK 497
++ ++ K+ +++++K AE++ ++A+K+ ++E E + LR ++
Sbjct: 550 KEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKG 599
Score = 29.8 bits (68), Expect = 4.4
Identities = 18/58 (31%), Positives = 36/58 (62%), Gaps = 5/58 (8%)
Query: 440 QDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKE-----EEEQREADRLLR 492
Q +E ++ K+ +K K+ ++KKEK + E+K+ E +KE +E ++EAD +++
Sbjct: 534 QKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIK 591
>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
Length = 292
Score = 30.6 bits (70), Expect = 1.8
Identities = 12/44 (27%), Positives = 29/44 (65%), Gaps = 3/44 (6%)
Query: 454 KNKKSKKKKKEKEKSAEEKMKEALKKEEEE--QREADRLLRMDE 495
K ++ +++++ +E+ AEEK +EA ++ + E ++ A R R+ +
Sbjct: 40 KAEEIEERREVREE-AEEKYEEAKEEGDLEEARKYAQRSSRLTD 82
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 30.2 bits (68), Expect = 1.9
Identities = 15/47 (31%), Positives = 23/47 (48%)
Query: 437 PEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
E + E GH+K+ K KK + K+ K+K E+K K+ E
Sbjct: 128 EETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHP 174
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 29.7 bits (67), Expect = 2.0
Identities = 9/49 (18%), Positives = 22/49 (44%)
Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
DE+EE + ++ + K + K +++ K E+E+ + +
Sbjct: 96 DEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144
Score = 29.3 bits (66), Expect = 2.9
Identities = 13/46 (28%), Positives = 25/46 (54%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
E ++ +EE + +Q+ ++K + KEK+K K K +K + E
Sbjct: 97 EEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142
Score = 28.2 bits (63), Expect = 6.8
Identities = 14/46 (30%), Positives = 22/46 (47%), Gaps = 1/46 (2%)
Query: 435 NRPEHQDEDEEDSGHKKQKKNKKSKKK-KKEKEKSAEEKMKEALKK 479
+ E + ED ++ + ++K K+ K K EKEK E K K
Sbjct: 105 STDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 30.4 bits (69), Expect = 2.0
Identities = 13/41 (31%), Positives = 22/41 (53%), Gaps = 3/41 (7%)
Query: 434 MNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK 474
+R E + + +E+ K+KK K+SK K K+ A+ K
Sbjct: 245 ESRAEKKRKSKEE---IKKKKPKESKGVKALKKVVAKGMKK 282
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 30.8 bits (70), Expect = 2.0
Identities = 11/54 (20%), Positives = 24/54 (44%)
Query: 437 PEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
E + + E + K K ++ KEK + E+ + K+ +++ + RL
Sbjct: 259 WEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARL 312
Score = 30.4 bits (69), Expect = 2.3
Identities = 11/51 (21%), Positives = 26/51 (50%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
+ + E K + K+ K ++ KEK +E +EA ++++ +++ A
Sbjct: 258 AWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQ 308
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association with
pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 28.4 bits (64), Expect = 2.0
Identities = 18/61 (29%), Positives = 25/61 (40%), Gaps = 3/61 (4%)
Query: 420 SSVVGASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKK 479
V L G + + D Q+ +K KKKKK K+KS E ++ KK
Sbjct: 5 GPSVPEQLPNGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILR---KK 61
Query: 480 E 480
E
Sbjct: 62 E 62
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 31.1 bits (71), Expect = 2.0
Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 6/39 (15%)
Query: 454 KNKKSKKKKKEKEKSA-----EEKMKEALKKEEEEQREA 487
+ KK KK EK + ++K KE L+ E+QR
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKE-LQALSEKQRHV 121
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 29.3 bits (66), Expect = 2.2
Identities = 14/30 (46%), Positives = 18/30 (60%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKK 479
K++ K KK KKKKK+K+K K KK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 27.7 bits (62), Expect = 7.3
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 449 HKKQKKNKKSKKKKKEKEKSAEEKMK 474
KQKK KK KKKKK+K+ S + K
Sbjct: 108 KAKQKKQKKKKKKKKKKKTSKKAAKK 133
>gnl|CDD|217611 pfam03552, Cellulose_synt, Cellulose synthase. Cellulose, an
aggregate of unbranched polymers of beta-1,4-linked
glucose residues, is the major component of wood and
thus paper, and is synthesised by plants, most algae,
some bacteria and fungi, and even some animals. The
genes that synthesise cellulose in higher plants differ
greatly from the well-characterized genes found in
Acetobacter and Agrobacterium sp. More correctly
designated as 'cellulose synthase catalytic subunits',
plant cellulose synthase (CesA) proteins are integral
membrane proteins, approximately 1,000 amino acids in
length. There are a number of highly conserved residues,
including several motifs shown to be necessary for
processive glycosyltransferase activity.
Length = 716
Score = 30.9 bits (70), Expect = 2.2
Identities = 11/47 (23%), Positives = 21/47 (44%), Gaps = 1/47 (2%)
Query: 452 QKKNKKSKKKKKEKEKSAEEKMKEALKKEE-EEQREADRLLRMDERK 497
++K KS K K+K+ +E E+ +E E ++ M +
Sbjct: 312 RRKKPKSASKAKKKDSKRKESEAPIFNLEDIDEGDEDEKSSLMSQLS 358
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 30.8 bits (70), Expect = 2.2
Identities = 15/52 (28%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 442 EDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRM 493
+++ + KQ ++ KK +KE +K E K+++ + E EE +A LR
Sbjct: 272 DEKFERDKIKQLASELEKKLEKELKKL-ENKLEK-QEDELEELEKAAEELRQ 321
>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1. MAT1
is an assembly/targeting factor for cyclin-dependent
kinase-activating kinase (CAK), which interacts with the
transcription factor TFIIH. The domain found to the
N-terminal side of this domain is a C3HC4 RING finger.
Length = 200
Score = 30.1 bits (68), Expect = 2.3
Identities = 13/46 (28%), Positives = 30/46 (65%)
Query: 452 QKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
+K+NK S + K + +E++++AL++E+E + E L+ +E++
Sbjct: 74 EKENKDSIMRNKRRLTREQEELEQALEEEKEMKEEKRLHLQKEEQE 119
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 30.4 bits (69), Expect = 2.3
Identities = 11/51 (21%), Positives = 27/51 (52%)
Query: 440 QDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
+ + ++K K ++ EK + ++ K+ KKEE+++++ +RL
Sbjct: 298 KMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERL 348
Score = 28.9 bits (65), Expect = 8.3
Identities = 14/51 (27%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 433 IMNRPEHQDEDEE-DSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEE 482
I+ D + S ++ + ++ K+K+KEK EEK K+ +++ EE
Sbjct: 300 ILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEE 350
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 28.7 bits (65), Expect = 2.8
Identities = 18/36 (50%), Positives = 23/36 (63%), Gaps = 4/36 (11%)
Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
K+ KK KKKK E E AE+K K+ + EQR+AD
Sbjct: 38 KREKKKKKKKSEGEFFAEKKEKKEVS----EQRKAD 69
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 28.2 bits (63), Expect = 2.8
Identities = 17/46 (36%), Positives = 24/46 (52%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDE 495
K+ KK KKKKK+K KS EE + E ++E+ + DE
Sbjct: 14 GKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDE 59
Score = 27.4 bits (61), Expect = 4.9
Identities = 14/32 (43%), Positives = 20/32 (62%)
Query: 451 KQKKNKKSKKKKKEKEKSAEEKMKEALKKEEE 482
K KK KKKKK+K+K+ ++ K+EEE
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEE 44
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap is
involved in recombination. Rap (recombination adept with
plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 29.6 bits (67), Expect = 2.9
Identities = 11/41 (26%), Positives = 21/41 (51%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
K++K +K +K + + E+ + KE LK + +EA
Sbjct: 35 KREKAQEKKRKAEAQAERRELKARKEKLKTRSDWLKEAQAA 75
>gnl|CDD|206687 cd01900, YchF, YchF GTPase. YchF is a member of the Obg family,
which includes four other subfamilies of GTPases: Obg,
DRG, Ygr210, and NOG1. Obg is an essential gene that is
involved in DNA replication in C. crescentus and
Streptomyces griseus and is associated with the
ribosome. Several members of the family, including YchF,
possess the TGS domain related to the RNA-binding
proteins. Experimental data and genomic analysis suggest
that YchF may be part of a nucleoprotein complex and may
function as a GTP-dependent translational factor.
Length = 274
Score = 29.7 bits (68), Expect = 2.9
Identities = 10/29 (34%), Positives = 18/29 (62%)
Query: 451 KQKKNKKSKKKKKEKEKSAEEKMKEALKK 479
+ +K KS K+ ++E EK+KE L++
Sbjct: 142 RLEKKAKSGDKEAKEELELLEKIKEHLEE 170
>gnl|CDD|107228 cd06471, ACD_LpsHSP_like, Group of bacterial proteins containing an
alpha crystallin domain (ACD) similar to Lactobacillus
plantarum (Lp) small heat shock proteins (sHsp) HSP
18.5, HSP 18.55 and HSP 19.3. sHsps are molecular
chaperones that suppress protein aggregation and protect
against cell stress, and are generally active as large
oligomers consisting of multiple subunits. Transcription
of the genes encoding Lp HSP 18.5, 18.55 and 19.3 is
regulated by a variety of stresses including heat, cold
and ethanol. Early growing L. plantarum cells contain
elevated levels of these mRNAs which rapidly fall of as
the cells enter stationary phase. Also belonging to this
group is Bifidobacterium breve (Bb) HSP20 and Oenococcus
oenis (syn. Leuconostoc oenos) (Oo) HSP18.
Transcription of the gene encoding BbHSP20 is strongly
induced following heat or osmotic shock, and that of the
gene encoding OoHSP18 following heat, ethanol or acid
shock. OoHSP18 is peripherally associated with the
cytoplasmic membrane.
Length = 93
Score = 28.2 bits (64), Expect = 2.9
Identities = 12/30 (40%), Positives = 20/30 (66%), Gaps = 3/30 (10%)
Query: 216 KDDLEEDEDKYVDEVDMPGTKVDSKQRITV 245
K D++E +D+Y+ E D+PG K K+ I +
Sbjct: 2 KTDIKETDDEYIVEADLPGFK---KEDIKL 28
>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
[DNA replication, recombination, and repair].
Length = 663
Score = 30.3 bits (69), Expect = 3.1
Identities = 17/52 (32%), Positives = 31/52 (59%)
Query: 433 IMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQ 484
I + + + E++E ++K +K SKK+ ++ K E++MKEA K E E+
Sbjct: 593 IRDILDGEYEEDEYKAKIEKKASKMSKKELEKLIKKLEKEMKEAAKNLEFEE 644
>gnl|CDD|115108 pfam06431, Polyoma_lg_T_C, Polyomavirus large T antigen C-terminus.
Length = 417
Score = 29.9 bits (67), Expect = 3.2
Identities = 14/57 (24%), Positives = 28/57 (49%), Gaps = 1/57 (1%)
Query: 413 REAGLATSSVVGASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSA 469
+E + T S + ++ GI++ +D ++EDSGH +S+ K + +S
Sbjct: 341 KEISMYTFSKMKFNVAMGIGILDWLREEDSEDEDSGHGSS-TESQSQCKSQVSHESG 396
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 30.1 bits (68), Expect = 3.3
Identities = 16/71 (22%), Positives = 27/71 (38%), Gaps = 14/71 (19%)
Query: 438 EHQDEDEEDSGHKKQ--------------KKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
E D ++S ++ K S+ K +EKSA+E+M A EE
Sbjct: 871 EGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELM 930
Query: 484 QREADRLLRMD 494
+R + +
Sbjct: 931 KRAKEYQDKHK 941
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 29.8 bits (68), Expect = 3.5
Identities = 12/27 (44%), Positives = 19/27 (70%), Gaps = 2/27 (7%)
Query: 450 KKQKK--NKKSKKKKKEKEKSAEEKMK 474
KK+ K +K KKK+K++ K+A+ K K
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSK 89
>gnl|CDD|220176 pfam09317, DUF1974, Domain of unknown function (DUF1974). Members
of this family of functionally uncharacterized domains
are predominantly found in various prokaryotic
acyl-coenzyme a dehydrogenases.
Length = 284
Score = 29.8 bits (68), Expect = 3.6
Identities = 14/43 (32%), Positives = 22/43 (51%)
Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDE 495
KK KK+ KK K + EE ++ AL+ + EA+ L +
Sbjct: 231 KKLKKALKKGKLPKLRLEELLEAALEAGVITEEEAELLREAEA 273
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 29.2 bits (66), Expect = 3.6
Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 5/51 (9%)
Query: 446 DSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALK-----KEEEEQREADRLL 491
D G KK+ K ++ KK K+K +S K++ K K EQ ++ RLL
Sbjct: 120 DLGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLEQFQSGRLL 170
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 29.3 bits (66), Expect = 3.8
Identities = 8/39 (20%), Positives = 18/39 (46%), Gaps = 1/39 (2%)
Query: 438 EHQDEDEEDSGHKKQKK-NKKSKKKKKEKEKSAEEKMKE 475
E +D +++ +++K + KK + E K K+
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 30.0 bits (68), Expect = 3.9
Identities = 13/60 (21%), Positives = 26/60 (43%)
Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSY 500
+ EE + + KK ++ +K K K ++ K+ KK + EA + L + +
Sbjct: 105 KKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNFKEK 164
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.8 bits (67), Expect = 3.9
Identities = 10/35 (28%), Positives = 19/35 (54%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEK 472
E + ++E+ + +KK K+ + KEK + E K
Sbjct: 399 EAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433
>gnl|CDD|153286 cd07602, BAR_RhoGAP_OPHN1-like, The Bin/Amphiphysin/Rvs (BAR)
domain of Oligophrenin1-like Rho GTPase Activating
Proteins. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions. This subfamily is
composed of Rho and Rac GTPase activating proteins
(GAPs) with similarity to oligophrenin1 (OPHN1). Members
contain an N-terminal BAR domain, followed by a
Pleckstrin homology (PH) domain, and a Rho GAP domain.
Some members contain a C-terminal SH3 domain.
Vertebrates harbor at least three Rho GAPs in this
subfamily including OPHN1, GTPase Regulator Associated
with Focal adhesion kinase (GRAF), GRAF2, and an
uncharacterized protein called GAP10-like. OPHN1, GRAF
and GRAF2 show GAP activity towards RhoA and Cdc42. In
addition, OPHN1 is active towards Rac. BAR domains form
dimers that bind to membranes, induce membrane bending
and curvature, and may also be involved in
protein-protein interactions. The BAR domains of OPHN1
and GRAF directly interact with their Rho GAP domains
and inhibit their activity. The autoinhibited proteins
are able to bind membranes and tubulate liposomes,
showing that the membrane-tubulation and GAP-inhibitory
functions of the BAR domains can occur simultaneously.
Length = 207
Score = 29.2 bits (66), Expect = 4.0
Identities = 19/52 (36%), Positives = 29/52 (55%), Gaps = 5/52 (9%)
Query: 450 KKQKKNKKSKKKKKEKE-----KSAEEKMKEALKKEEEEQREADRLLRMDER 496
K+Q K +KKK +KE S E+ + + KK+E + +EAD L M+ R
Sbjct: 103 KEQIGGAKEEKKKFDKETEKFCSSLEKHLNLSTKKKENQLQEADAQLDMERR 154
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 30.0 bits (67), Expect = 4.0
Identities = 13/44 (29%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 187 EEAKRELRADKLDAGVSIDNRGSKEGDEDKDDLEEDEDKYVDEV 230
E REL +D GV+ D+ S+ + D ++ +D D+ V+++
Sbjct: 3854 ESEARELESDM--NGVTKDSVVSENENSDSEEENQDLDEEVNDI 3895
>gnl|CDD|214787 smart00721, BAR, BAR domain.
Length = 239
Score = 29.3 bits (66), Expect = 4.1
Identities = 16/43 (37%), Positives = 25/43 (58%)
Query: 449 HKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLL 491
HK +K K +KKK EK AEE++++A ++ EE + L
Sbjct: 155 HKLKKAKKSKEKKKDEKLAKAEEELRKAKQEFEESNAQLVEEL 197
>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator. This
family of trans-activating transcriptional regulator
(TATR), also known as intermediate early protein 1, are
common to the Nucleopolyhedroviruses.
Length = 571
Score = 29.5 bits (66), Expect = 4.6
Identities = 18/76 (23%), Positives = 27/76 (35%), Gaps = 5/76 (6%)
Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSY 500
D + G K K KK +K KK +S +L +EE E + D+ + +
Sbjct: 137 DSSDSSKGKKLVNKPKKRQKYKKATIQS-----PTSLTEEENYNTEICTVAPTDQIAKYF 191
Query: 501 NSMISVKEPTMDSDRG 516
S SD
Sbjct: 192 LMDFSPYLEDAKSDCQ 207
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 27.7 bits (62), Expect = 4.6
Identities = 15/32 (46%), Positives = 22/32 (68%), Gaps = 1/32 (3%)
Query: 457 KSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
K K+KK+E+E+ EE + E +KKEEE + D
Sbjct: 51 KVKEKKEEREEDKEE-LIERIKKEEETFEDVD 81
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 29.9 bits (67), Expect = 4.7
Identities = 19/68 (27%), Positives = 35/68 (51%), Gaps = 8/68 (11%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMK--------EALKKEEEEQREADRLLRMDERKRSYN 501
+KQ NKK KKK++E++ S E+ M E ++ + + + + + K S +
Sbjct: 21 EKQTPNKKDKKKEEEEQLSEEDAMLKGDLELLVERIQDPDIDLQNNSLNMLKEVIKSSTS 80
Query: 502 SMISVKEP 509
SM +V +P
Sbjct: 81 SMTAVPKP 88
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 29.8 bits (67), Expect = 4.9
Identities = 13/49 (26%), Positives = 31/49 (63%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQRE 486
E + E E+ + + +++ + +++++E+EK+A E + K E E++RE
Sbjct: 253 EKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 28.9 bits (65), Expect = 5.1
Identities = 11/48 (22%), Positives = 23/48 (47%), Gaps = 2/48 (4%)
Query: 432 GIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKK 479
G+ + Q ++E + +++KKK++ + K +E E LK
Sbjct: 64 GVKDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKE--MEELKA 109
Score = 28.2 bits (63), Expect = 9.4
Identities = 12/49 (24%), Positives = 26/49 (53%)
Query: 437 PEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQR 485
PE + E S +Q K K+ + KK+KE + +++ + + +E++
Sbjct: 74 PEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122
>gnl|CDD|173500 PTZ00264, PTZ00264, circumsporozoite-related antigen; Provisional.
Length = 148
Score = 28.4 bits (63), Expect = 5.1
Identities = 18/60 (30%), Positives = 27/60 (45%), Gaps = 10/60 (16%)
Query: 447 SGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYNSMISV 506
SG K+KK K K E E + E ++KEEE L+ + ++K Y +V
Sbjct: 29 SGVPKKKKPK---SKSPEPLIDVHELISEMVRKEEE-------LVNVTKKKSKYKLATTV 78
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 29.5 bits (66), Expect = 5.1
Identities = 14/60 (23%), Positives = 27/60 (45%)
Query: 429 PPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
P + + E +E EE+ ++ +KK KK KK K K ++ ++ + + D
Sbjct: 309 PAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDID 368
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 28.0 bits (63), Expect = 5.1
Identities = 18/55 (32%), Positives = 30/55 (54%)
Query: 451 KQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYNSMIS 505
++K+ +K + +K + SAEEK + L+K E+E E + L E K M+S
Sbjct: 25 EKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEAKKMLS 79
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 29.4 bits (66), Expect = 5.2
Identities = 15/48 (31%), Positives = 32/48 (66%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
+K K + ++KKKE++++ E + K+A ++E +Q E +RL +++K
Sbjct: 71 QKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKK 118
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 29.0 bits (65), Expect = 5.4
Identities = 14/36 (38%), Positives = 23/36 (63%)
Query: 452 QKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREA 487
+ K K++ + K + E AE+K KE KK+ EE+ +A
Sbjct: 123 EAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKA 158
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 28.0 bits (63), Expect = 5.6
Identities = 18/54 (33%), Positives = 31/54 (57%), Gaps = 5/54 (9%)
Query: 449 HKKQKKNKKSKKK----KKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
K+ KK S K K E E++ +EK K+ K+++EE++E R L+ ++K
Sbjct: 75 AKEVKKPGISTKAQQALKLEHERNKQEK-KKRSKEKKEEEKERKRQLKQQKKKA 127
>gnl|CDD|236928 PRK11552, PRK11552, putative DNA-binding transcriptional
regulator; Provisional.
Length = 225
Score = 28.9 bits (65), Expect = 5.7
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 56 RDINPHIPQYISSAPWYFGSK 76
RDI Q I++ +YFGSK
Sbjct: 36 RDIAAQAGQNIAAITYYFGSK 56
>gnl|CDD|219526 pfam07701, HNOBA, Heme NO binding associated. The HNOBA domain is
found associated with the HNOB domain and pfam00211 in
soluble cyclases and signalling proteins. The HNOB
domain is predicted to function as a heme-dependent
sensor for gaseous ligands, and transduce diverse
downstream signals, in both bacteria and animals.
Length = 209
Score = 28.7 bits (65), Expect = 5.9
Identities = 14/32 (43%), Positives = 23/32 (71%), Gaps = 3/32 (9%)
Query: 460 KKKKEKEKSAEEKMKEALKKEEEEQREADRLL 491
KK+ EK K +K++EA ++ EEE+++ D LL
Sbjct: 169 KKRLEKLK---DKLEEAHRELEEEKKKTDDLL 197
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 29.2 bits (66), Expect = 6.0
Identities = 15/66 (22%), Positives = 24/66 (36%), Gaps = 21/66 (31%)
Query: 447 SGHKKQKKNKKSKKKKK-------------------EKEKSAEEKMKEALKKEEEEQREA 487
K K + ++K ++ K+ EE KE E ++ EA
Sbjct: 219 RTVKPPPKPPMTPEEKDDEYDQRVRELTFDRRAQPTDRTKTEEELAKEE--AERLKKLEA 276
Query: 488 DRLLRM 493
+RL RM
Sbjct: 277 ERLRRM 282
>gnl|CDD|235364 PRK05201, hslU, ATP-dependent protease ATP-binding subunit HslU;
Provisional.
Length = 443
Score = 29.3 bits (67), Expect = 6.1
Identities = 18/44 (40%), Positives = 24/44 (54%), Gaps = 10/44 (22%)
Query: 455 NKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
KK KK+K K+KEA K EE EA +L+ M+E K+
Sbjct: 208 PKKKKKRKL--------KVKEARKILIEE--EAAKLIDMEEIKQ 241
>gnl|CDD|221864 pfam12932, Sec16, Vesicle coat trafficking protein Sec16
mid-region. Sec16 is a multi-domain vesicle coat
protein. This central region is the functional part of
the molecules and thus is vital for the family's role in
mediating the movement of protein-cargo between the
organelles of the secretory pathway.
Length = 104
Score = 27.6 bits (62), Expect = 6.1
Identities = 13/36 (36%), Positives = 19/36 (52%)
Query: 456 KKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLL 491
K K KKK+ K E K+ E + +R+ D+LL
Sbjct: 60 VKGKSKKKDVIKWLESKIAELENSNDSLKRKEDKLL 95
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 28.9 bits (65), Expect = 6.2
Identities = 11/36 (30%), Positives = 22/36 (61%)
Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
K S KK K+++K+ E K K KK+++++++
Sbjct: 48 KVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEP 83
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 28.0 bits (63), Expect = 6.2
Identities = 11/49 (22%), Positives = 23/49 (46%)
Query: 449 HKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
K+ + K+ KKK+K K ++ ++ +E + R + E+K
Sbjct: 72 EKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKK 120
>gnl|CDD|237847 PRK14879, PRK14879, serine/threonine protein kinase; Provisional.
Length = 211
Score = 28.7 bits (65), Expect = 6.3
Identities = 10/43 (23%), Positives = 18/43 (41%), Gaps = 5/43 (11%)
Query: 318 EKGVDVHVLAEPTKLELLKKEYEEKKEEFKKEVSSSVIDKYGG 360
++ VD+HVL L L+ + + EE + + G
Sbjct: 149 DRAVDLHVL-----LRSLESTHPDWAEELFEAFLEGYREVMGE 186
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 29.3 bits (65), Expect = 6.4
Identities = 13/47 (27%), Positives = 25/47 (53%)
Query: 444 EEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
EE+ H+++ + K +KKK+EKEK + + +E+ A +
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMV 131
>gnl|CDD|236583 PRK09601, PRK09601, GTP-binding protein YchF; Reviewed.
Length = 364
Score = 28.9 bits (66), Expect = 6.4
Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 11/65 (16%)
Query: 451 KQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQ--READRLLRMDERK--RSYNSMISV 506
+ +K K K+ + E EK+ E L EE + R + L +E K +S +++
Sbjct: 146 RLEKKAKGGDKEAKAELELLEKLLEHL---EEGKPARTLE--LTDEEEKLLKSLQ-LLTA 199
Query: 507 KEPTM 511
K P +
Sbjct: 200 K-PVL 203
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 28.9 bits (65), Expect = 6.5
Identities = 13/40 (32%), Positives = 23/40 (57%), Gaps = 5/40 (12%)
Query: 453 KKNKKSKKKKKEKEKSAE-----EKMKEALKKEEEEQREA 487
K NK+ K K KE + E +++K+ + ++E E +EA
Sbjct: 58 KLNKEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEA 97
>gnl|CDD|223325 COG0247, GlpC, Fe-S oxidoreductase [Energy production and
conversion].
Length = 388
Score = 29.1 bits (65), Expect = 6.6
Identities = 16/54 (29%), Positives = 22/54 (40%)
Query: 299 FSGDTRKHATAQLFAWEAYEKGVDVHVLAEPTKLELLKKEYEEKKEEFKKEVSS 352
F +K A + A++ +G D V P LKKEY E E +V
Sbjct: 198 FLERAKKLAKKNIEAFKKLIEGGDPIVTVCPACYGALKKEYPELLGERALKVVD 251
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 29.0 bits (65), Expect = 6.6
Identities = 17/59 (28%), Positives = 20/59 (33%), Gaps = 9/59 (15%)
Query: 424 GASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKK---------KKEKEKSAEEKM 473
G S G +ED G K QKK + K KKE KS E+
Sbjct: 425 GKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKSQEDNN 483
>gnl|CDD|234659 PRK00137, rplI, 50S ribosomal protein L9; Reviewed.
Length = 147
Score = 28.1 bits (64), Expect = 6.8
Identities = 6/32 (18%), Positives = 14/32 (43%)
Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEE 481
K+ + + + K +E + E + E L+
Sbjct: 45 KQLEARRAELEAKAAEELAEAEALAEKLEGLT 76
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 29.1 bits (65), Expect = 7.1
Identities = 9/47 (19%), Positives = 22/47 (46%)
Query: 436 RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEE 482
+ ++E+ ++ + K +++K+EK+ A+ K EE
Sbjct: 124 KESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEER 170
>gnl|CDD|184712 PRK14501, PRK14501, putative bifunctional trehalose-6-phosphate
synthase/HAD hydrolase subfamily IIB; Provisional.
Length = 726
Score = 29.1 bits (66), Expect = 7.3
Identities = 15/32 (46%), Positives = 19/32 (59%), Gaps = 2/32 (6%)
Query: 470 EEKMKEALKKEEEEQREADRLLRMDERKRSYN 501
+K AL+ EEEQRE R+ M ER R Y+
Sbjct: 419 AAAIKRALEMPEEEQRE--RMQAMQERLRRYD 448
>gnl|CDD|189037 cd09867, PIN_FEN1, PIN domain of Flap Endonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Flap endonuclease-1 (FEN1) is
involved in multiple DNA metabolic pathways, including
DNA replication processes (5' flap DNA endonuclease
activity and double stranded DNA 5'-exonuclease
activity) and DNA repair processes (long-patch base
excision repair) in eukaryotes and archaea. Interaction
between FEN1 and PCNA (Proliferating cell nuclear
antigen) is an essential prerequisite to FEN1's DNA
replication functionality and stimulates FEN1 nuclease
activity by 10-50 fold. FEN1 belongs to the
FEN1-EXO1-like family of structure-specific, 5'
nucleases. These nucleases contain a PIN (PilT N
terminus) domain with a helical arch/clamp region (I
domain) of variable length (approximately 45 residues in
FEN1 PIN domains) and a H3TH (helix-3-turn-helix)
domain, an atypical helix-hairpin-helix-2-like region.
Both the H3TH domain (not included here) and the helical
arch/clamp region are involved in DNA binding. Nucleases
within this group also have a carboxylate-rich active
site that is involved in binding essential divalent
metal ion cofactors (Mg2+/Mn2+). FEN1 has a C-terminal
extension containing residues forming the consensus
PIP-box - Qxx(M/L/I)xxF(Y/F) which serves to anchor FEN1
to PCNA.
Length = 261
Score = 28.6 bits (65), Expect = 7.5
Identities = 12/37 (32%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 461 KKKEKEKSAEEKMKEALKKEEEEQRE--ADRLLRMDE 495
K++E+ + AEEK++EA ++ + E+ A R +R+ +
Sbjct: 94 KRRERREEAEEKLEEAKEEGDAEEARKYAKRTVRVTK 130
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 27.7 bits (62), Expect = 7.7
Identities = 12/48 (25%), Positives = 24/48 (50%)
Query: 451 KQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
K+ K K K ++ EK A+ +++ ++ E+ R L E++R
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDLEKER 75
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 29.2 bits (65), Expect = 7.8
Identities = 11/60 (18%), Positives = 30/60 (50%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
E D +E ++++ ++ +KE+E A+ + +++E++ +E + L E +
Sbjct: 238 ERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEE 297
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 29.0 bits (66), Expect = 7.8
Identities = 17/63 (26%), Positives = 38/63 (60%), Gaps = 1/63 (1%)
Query: 434 MNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEE-EQREADRLLR 492
+ + E + +E++ +K + +K +++ ++KEK E+K +E KKEEE E+ ++L
Sbjct: 84 LQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
Query: 493 MDE 495
++
Sbjct: 144 LER 146
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 28.6 bits (64), Expect = 7.9
Identities = 12/39 (30%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 450 KKQKKNKKSK-KKKKEKEKSAEEKMKEALKKEEEEQREA 487
+KQ K+K + ++ ++K++ E+ +E +KK E+ EA
Sbjct: 14 EKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEA 52
>gnl|CDD|216853 pfam02030, Lipoprotein_8, Hypothetical lipoprotein (MG045 family).
This family includes hypothetical lipoproteins, the
amino terminal part of this protein is related to
pfam01547, a family of solute binding proteins. This
suggests this family also has a solute binding function.
Length = 493
Score = 28.8 bits (64), Expect = 8.1
Identities = 8/32 (25%), Positives = 17/32 (53%)
Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQ 484
K ++ K+++K++S E + K +KE
Sbjct: 390 KTGFEADLKRQKKQESFEIQTKYKTEKERNHI 421
>gnl|CDD|235012 PRK02201, PRK02201, putative inner membrane protein translocase
component YidC; Provisional.
Length = 357
Score = 28.8 bits (65), Expect = 8.2
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 461 KKKEKEKSAEEKMKEALKKEEEEQR 485
KKK KE++ K KEALKK+ + Q
Sbjct: 279 KKKNKERTLNVKEKEALKKQNKTQN 303
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 28.5 bits (64), Expect = 8.5
Identities = 15/57 (26%), Positives = 27/57 (47%), Gaps = 2/57 (3%)
Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMD 494
E + D ED K + K K+ ++E S E++ + A ++E E ++E L
Sbjct: 126 EIESSDSEDEEEK--DEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATT 180
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 27.2 bits (61), Expect = 8.5
Identities = 15/46 (32%), Positives = 26/46 (56%), Gaps = 8/46 (17%)
Query: 451 KQKKNKKSK---KKKKEKEKSAE-----EKMKEALKKEEEEQREAD 488
+ ++N+ SK K KK+KE + +++K+ LK E E RE +
Sbjct: 49 QAERNELSKEIGKAKKKKEDAEALIAEVKELKDELKALEAELRELE 94
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for centrosome
separation and maintenance of spindle bi-polarity. TPX2
is a microtubule-associated protein that mediates the
binding of the C-terminal domain of Xklp2 to
microtubules. It is phosphorylated during mitosis in a
microtubule-dependent way.
Length = 57
Score = 26.2 bits (58), Expect = 8.6
Identities = 16/40 (40%), Positives = 27/40 (67%)
Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLR 492
+K + KK +EKEK+ E + +EA +++EE+ EA + LR
Sbjct: 8 EKRAEFDKKLEEKEKALEAEKEEAEARQKEEEEEAIKQLR 47
>gnl|CDD|225995 COG3464, COG3464, Transposase and inactivated derivatives [DNA
replication, recombination, and repair].
Length = 402
Score = 28.6 bits (64), Expect = 9.0
Identities = 21/107 (19%), Positives = 34/107 (31%), Gaps = 17/107 (15%)
Query: 96 LDEWYKRGVDTTSQAKKFRKGACENCGAMTHKKKDCLERPRKVLAKYSNAQIACDEFVQP 155
L+E + A+ RK C CG T ++ R RK+ + E P
Sbjct: 19 LEEVEMGPIIIDLYAEMPRKHRCPECGQRTIRRHG--WRIRKIQD------LPLFEV--P 68
Query: 156 KLIQDYDSKRDRWAGYDPSNHKA----IVEHYQKIEEAKRELRADKL 198
+ + R+ A V+ Q+I + A L
Sbjct: 69 VYLFLR---KRRYKCCRCGKRFAEKVDFVDRAQRITRRWAQAAAIAL 112
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 28.7 bits (64), Expect = 9.3
Identities = 17/55 (30%), Positives = 32/55 (58%), Gaps = 6/55 (10%)
Query: 448 GHKKQKKNKKSKKKKKEKEKSAE------EKMKEALKKEEEEQREADRLLRMDER 496
G +K+KK KKSK+KKK+K + A E ++E ++ E+ L+++++
Sbjct: 682 GSRKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 28.5 bits (64), Expect = 9.5
Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 2/59 (3%)
Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSA--EEKMKEALKKEEEEQREADRLLRMDERK 497
+ ++ K+QK+ K K KK+ +KS + L + E +EA+ ++K
Sbjct: 2 MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 28.3 bits (64), Expect = 9.9
Identities = 17/72 (23%), Positives = 38/72 (52%), Gaps = 13/72 (18%)
Query: 440 QDEDEEDSGHKKQKKNK-------------KSKKKKKEKEKSAEEKMKEALKKEEEEQRE 486
Q+EDE ++ K++K+ K + K+++KE+E+ E K+ E +++ E + E
Sbjct: 109 QEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEE 168
Query: 487 ADRLLRMDERKR 498
+ R + ++
Sbjct: 169 REAERRERKEEK 180
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.309 0.129 0.368
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,778,914
Number of extensions: 2672825
Number of successful extensions: 9566
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7779
Number of HSP's successfully gapped: 789
Length of query: 520
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 419
Effective length of database: 6,457,848
Effective search space: 2705838312
Effective search space used: 2705838312
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 61 (27.1 bits)