RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6893
(988 letters)
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 331 bits (849), Expect = 1e-98
Identities = 154/365 (42%), Positives = 207/365 (56%), Gaps = 40/365 (10%)
Query: 541 DIVDDFWPENEK----EKQLQARITKLSKILSGEVSIDLHLQFLIRSNKTDMLILKNTKE 596
D+ D E + E Q + ILSGE + FL+ N TD L ++K
Sbjct: 249 DLEDSASQEILEILVTELVAQGYDQAVMSILSGEFTKKYLGAFLLEKNNTDFKFLNSSKS 308
Query: 597 SI--RVSICHTATIIANAFMHSGTTSDQFLRDNLEWLARATNWAKLTATASLGVIHRGHE 654
S+ + S H A +AN+FM+ GT++D F R+NL+WL +A+NWAK TATASLGVIH G+
Sbjct: 309 SLARKFSRFHYAVSLANSFMNLGTSNDSFYRNNLDWLGKASNWAKFTATASLGVIHLGNS 368
Query: 655 TEALAHMQSYLPRETGPTSGYYEGGGLYALGLIHANHGAVINDYLLGQVKDATNEM---V 711
++ YLP E +S EGG LYALGLI A G +YLL D +E+ +
Sbjct: 369 NPGYEILKPYLPSEVA-SSRQKEGGALYALGLIKAGFGREDTEYLLEYFLDTEDELTPEL 427
Query: 712 RHGGCLGLGLAAMGSARDDIYEQLKFNLYQDDAVTGEAAGIAMGLVLLGSKSESAIQDMV 771
+G CLG+GL MGSA +IYE+LK L D A+ GEAA MGL++LG+ S AI+DM
Sbjct: 428 AYGVCLGIGLINMGSANREIYEKLKELLKNDRALLGEAAVYGMGLLMLGTWSVEAIEDMR 487
Query: 772 AYAKETQHEKILRGLAVGISFIMYGQLENADDLVRDLCEEKDAILRRSGMYTIAMAYCGS 831
YA ETQHE+I RGL +G + I+YG+ E ADD + +L +K
Sbjct: 488 TYAGETQHERIKRGLGIGFALILYGRQEMADDYINELLYDK------------------- 528
Query: 832 GSNVATKRLLHDAILRRSGMYTIAMAYCGSGSNVATKRLLHVAVSDVNDDVRREAVTALG 891
D+ILR +G++++A+AY G+G+ LLH AVSD NDDVRR AV ALG
Sbjct: 529 -----------DSILRYNGVFSLALAYVGTGNLGVVSTLLHYAVSDGNDDVRRAAVIALG 577
Query: 892 FILFR 896
F+
Sbjct: 578 FVCCD 582
Score = 312 bits (801), Expect = 9e-92
Identities = 164/498 (32%), Positives = 225/498 (45%), Gaps = 133/498 (26%)
Query: 43 LAMSSIQSRSFRAQILRTLVRLYHSLSVPDYVNMAQCWIYLDDPRSVANLLGKLVAGSAD 102
LA++ ++ FR +ILR L + PDY + + +YL+D L+ KLV +
Sbjct: 181 LAITLVEEEGFRKEILRMLAEIGPGKPKPDYFYVIKAVVYLNDAEKAKALIEKLV--KEN 238
Query: 103 DDLMAAQIGFDLYESATQAFLASVLQELKTCAPIPDATHVVVKVPVEGAPTPATPDTEAP 162
D L+ AQ+ FDL +SA+Q L E T
Sbjct: 239 DLLLYAQVAFDLEDSASQEIL-------------------------EILVTELVAQGYDQ 273
Query: 163 TPAPGTESGPSSTTPGPETKTLESLNEKEKQLQARITKLSKILSGEVSIDLHLQFLIRSN 222
+ ILSGE + FL+ N
Sbjct: 274 -------------------------------------AVMSILSGEFTKKYLGAFLLEKN 296
Query: 223 KTDMLILKNTKESI--RVSICHTATIIANAFMHSGTTSDQFLRDNLEWLARATNWAKLTA 280
TD L ++K S+ + S H A +AN+FM+ GT++D F R+NL+WL +A+NWAK TA
Sbjct: 297 NTDFKFLNSSKSSLARKFSRFHYAVSLANSFMNLGTSNDSFYRNNLDWLGKASNWAKFTA 356
Query: 281 TASLGVIHRGHETEALAHMQSYLPRETGPTSGYYEGGGLYALGLIHANHGAVINDYLLGQ 340
TASLGVIH G+ ++ YLP E +S EGG LYALGLI A G +YLL
Sbjct: 357 TASLGVIHLGNSNPGYEILKPYLPSEVA-SSRQKEGGALYALGLIKAGFGREDTEYLLEY 415
Query: 341 VKDATNE-----------------------------------DDAVTGEAAGIAMGLVLL 365
D +E D A+ GEAA MGL++L
Sbjct: 416 FLDTEDELTPELAYGVCLGIGLINMGSANREIYEKLKELLKNDRALLGEAAVYGMGLLML 475
Query: 366 GSKSESAIQDMVAYAKETQHEKILRGLAVGISFIMYGQLENADDLVRDLCEEKDAILRRS 425
G+ S AI+DM YA ETQHE+I RGL +G + I+YG+ E ADD + +L +KD+ILR
Sbjct: 476 GTWSVEAIEDMRTYAGETQHERIKRGLGIGFALILYGRQEMADDYINELLYDKDSILR-- 533
Query: 426 GMYTIAMAYCVSYLYFDLFSDLTYRIVKNRSGMYTIAMAYCGSGSNVATKRLLHVAVSDV 485
+G++++A+AY G+G+ LLH AVSD
Sbjct: 534 -----------------------------YNGVFSLALAYVGTGNLGVVSTLLHYAVSDG 564
Query: 486 NDDVRREAVTALGFILFR 503
NDDVRR AV ALGF+
Sbjct: 565 NDDVRRAAVIALGFVCCD 582
Score = 34.5 bits (79), Expect = 0.33
Identities = 18/64 (28%), Positives = 28/64 (43%), Gaps = 5/64 (7%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
K R K+K EK K+ K E+ + + +K++++ K A I KKK
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREE-----KGIDAPAILNVKKK 846
Query: 958 KKKK 961
K K
Sbjct: 847 KPYK 850
Score = 33.8 bits (77), Expect = 0.63
Identities = 13/59 (22%), Positives = 23/59 (38%)
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
K A++ K+KG + K + + I ++++K KKKK K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850
Score = 30.3 bits (68), Expect = 7.1
Identities = 12/64 (18%), Positives = 28/64 (43%), Gaps = 5/64 (7%)
Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK-----KKKKKKEAEKM 979
+ A++ K+K +K K + + + ++ K++++K KKK+ K+
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKV 851
Query: 980 KKKT 983
T
Sbjct: 852 DNMT 855
Score = 29.9 bits (67), Expect = 9.1
Identities = 13/59 (22%), Positives = 25/59 (42%)
Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
K A + ++K K+K +K K + + ++ K +++ KKKK K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 77.1 bits (189), Expect = 3e-14
Identities = 36/98 (36%), Positives = 50/98 (51%), Gaps = 6/98 (6%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK--KKDAEKMKKKGAEKIGKK 954
KK+ KK + + EAE +K A + KK+ KK A+ KKK EK
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAD 1394
Query: 955 KKKKKKKKKKKK----KKKKKKKKEAEKMKKKTEEAEK 988
+ KKK ++ KKK KK KK+A++ KKK EE +K
Sbjct: 1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432
Score = 74.4 bits (182), Expect = 3e-13
Identities = 34/95 (35%), Positives = 56/95 (58%), Gaps = 5/95 (5%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK---KKDAEKMKKKGAEKIGK 953
+K+ + KK AE+ +K + KK+AE+ +KK A+ KKK K A + K AE
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEEAKKK--ADAAKKKAEEAKKAAEAAKAEAEAAAD 1357
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ + ++K + +KKK++ KK+A+ KKK EE +K
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392
Score = 72.5 bits (177), Expect = 8e-13
Identities = 40/102 (39%), Positives = 63/102 (61%), Gaps = 11/102 (10%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK----KGAEKMKKK---KKDAEKMKKKGAE 949
KK+ KKK A++++K KK+A++ +KK K A++ KKK K A++ KKK E
Sbjct: 1397 KKKAEEDKKK-ADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1455
Query: 950 KIGKKKKKKKKKKKKKK---KKKKKKKKEAEKMKKKTEEAEK 988
++ KKK ++ KK KKK ++ K+A++ KKK EEA+K
Sbjct: 1456 AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKK 1497
Score = 72.5 bits (177), Expect = 1e-12
Identities = 42/103 (40%), Positives = 65/103 (63%), Gaps = 12/103 (11%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK-----KGAEKMKKK---KKDAEKMKKKGA 948
+K+ KKK A+ +KK ++KK+A++ +KK K A+++KK KK A++ KKK
Sbjct: 1370 EKKKEEAKKK-ADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAE 1428
Query: 949 EKIGK---KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
EK KKK ++ KK + KKK ++ K+AE+ KKK EEA+K
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471
Score = 71.7 bits (175), Expect = 2e-12
Identities = 39/97 (40%), Positives = 59/97 (60%), Gaps = 7/97 (7%)
Query: 896 RKKRTRRKK----KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
+K +KK KK AE+ +K + KK+AE+ +K A+K ++ K AE+ KKK E
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEE-- 1468
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK + KKK ++ KK + KK+AE+ KKK +EA+K
Sbjct: 1469 -AKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKK 1504
Score = 71.3 bits (174), Expect = 2e-12
Identities = 40/122 (32%), Positives = 69/122 (56%), Gaps = 13/122 (10%)
Query: 879 NDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKK--EAEKMRKKKGAEKMKKK 936
N+++R+ + R+ + ++ + A++++K ++KKK EA+K +KK A++ KKK
Sbjct: 1251 NEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKK 1310
Query: 937 ---KKDAEKMKKKGAEKIGKK----KKKKKKKKKK---KKKKKKKKKKEAEKMKKKTEEA 986
K A++ KKK AE+ KK KKK ++ KK K + + EAE ++K E A
Sbjct: 1311 AEEAKKADEAKKK-AEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAA 1369
Query: 987 EK 988
EK
Sbjct: 1370 EK 1371
Score = 70.6 bits (172), Expect = 3e-12
Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 19/111 (17%)
Query: 897 KKRTRRKKKKGAEKIRKKKK--------KKKEAEKMRKK----KGAEKMKKK----KKDA 940
K AE +K + KK+A+ +KK K A++ KKK KK A
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA 1407
Query: 941 EKMKKKGAEKIGK---KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++KK A K KKK ++KKK + KKK ++ K+A++ KKK EEA+K
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1458
Score = 70.2 bits (171), Expect = 5e-12
Identities = 37/95 (38%), Positives = 54/95 (56%), Gaps = 6/95 (6%)
Query: 897 KKRTRRKKKKGAEKIRKK---KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
KK+ KKK A+ +KK KK EA K + A++ + ++ AE +KK E K
Sbjct: 1321 KKKAEEAKKK-ADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA--K 1377
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK KKK ++KKK + KK+AE+ KKK +E +K
Sbjct: 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412
Score = 69.8 bits (170), Expect = 6e-12
Identities = 37/106 (34%), Positives = 55/106 (51%), Gaps = 16/106 (15%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK---KKGAEKMKKK----------KKDAEK 942
+ + + + A + ++K EA + +K KK A+ KKK KK AE+
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEE 1402
Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KKK E KK KKK + KKK ++KK+A++ KKK EEA+K
Sbjct: 1403 DKKKADE---LKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKK 1445
Score = 69.8 bits (170), Expect = 6e-12
Identities = 34/103 (33%), Positives = 57/103 (55%), Gaps = 11/103 (10%)
Query: 896 RKKRTRRKKK--KGAEKIRKKKKKKKEAEKMRK----KKGAEKMKKK----KKDAEKMKK 945
+K ++K K AE+ +K + KK+AE+ +K KK AE+ KKK KK AE+ K
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEE-AK 1342
Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K AE + + + + ++K + +K+ E+ KKK + A+K
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385
Score = 69.4 bits (169), Expect = 8e-12
Identities = 35/95 (36%), Positives = 52/95 (54%), Gaps = 6/95 (6%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK---KKGAEKIGK 953
KK+ KKK E + + K EAE AE ++K + AEK K KK A+ K
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEA--AADEAEAAEEKAEAAEKKKEEAKKKADAA-K 1384
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK ++KKK + KKK ++ KK+A+++KK +K
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419
Score = 69.0 bits (168), Expect = 1e-11
Identities = 32/92 (34%), Positives = 50/92 (54%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K+ KKK E +K KK+AE+ +K A K + + E + + +KKK
Sbjct: 1314 AKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ KKK KKK ++KK+A++ KKK EE +K
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405
Score = 68.2 bits (166), Expect = 2e-11
Identities = 40/101 (39%), Positives = 64/101 (63%), Gaps = 10/101 (9%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRK----KKGAEKMKKK---KKDAEKMKKKGAEK 950
K+ +KKK E +K ++ KK+A++++K KK A++ KKK KK A++ KKK E
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEA 1443
Query: 951 IGK---KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KKK ++ KK ++ KKK ++ K+A++ KKK EEA+K
Sbjct: 1444 KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484
Score = 67.9 bits (165), Expect = 3e-11
Identities = 35/92 (38%), Positives = 58/92 (63%), Gaps = 3/92 (3%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
+K+ + KK AE+ +K + KK+AE+ +K + A+K ++ K A++ KKK E KK
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE---AKKA 1485
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ KKK ++ KKK + K+A + KKK +EA+K
Sbjct: 1486 DEAKKKAEEAKKKADEAKKAAEAKKKADEAKK 1517
Score = 64.0 bits (155), Expect = 4e-10
Identities = 40/101 (39%), Positives = 59/101 (58%), Gaps = 11/101 (10%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK----KGAEKMKKK---KKDAEKMKKKGAE 949
KK+ KK A++ +KK ++ K+AE+ +KK K A++ KKK K A++ KKK E
Sbjct: 1437 KKKAEEAKK--ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEE 1494
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK--KKTEEAEK 988
K + KK + KKK + KK +EA+K KK EEA+K
Sbjct: 1495 AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535
Score = 63.2 bits (153), Expect = 7e-10
Identities = 44/100 (44%), Positives = 65/100 (65%), Gaps = 9/100 (9%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRK-----KKGAEKMKKK---KKDAEKMKKKGA 948
KK K K AE+ +K ++ KK+AE+ +K KK AE +KK+ K AE++KKK A
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEA 1712
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E+ KK ++ KK +++ K K ++ KKEAE+ KKK EEA+K
Sbjct: 1713 EEK-KKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751
Score = 60.9 bits (147), Expect = 3e-09
Identities = 37/96 (38%), Positives = 62/96 (64%), Gaps = 7/96 (7%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAE--KMRKKKGAEKMKKKKKDAEKMKK-----KGAE 949
KK+ + KKK AE+ +K ++ KK E K++ + A+K ++ KK AE+ KK K A
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAA 1691
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
+ KK+ ++ KK ++ KKK+ ++KK+AE++KK EE
Sbjct: 1692 EALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEE 1727
Score = 60.9 bits (147), Expect = 3e-09
Identities = 34/94 (36%), Positives = 59/94 (62%), Gaps = 5/94 (5%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK---KKKDAEKMKKKGAEKIGKK 954
+ + KK AE+ +K ++ +K+A + KK+ E K KKK+AE+ KK AE++ K
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK--AEELKKA 1724
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++ K K ++ KK+ ++ KK+AE+ KK EE +K
Sbjct: 1725 EEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKK 1758
Score = 60.2 bits (145), Expect = 5e-09
Identities = 40/105 (38%), Positives = 64/105 (60%), Gaps = 14/105 (13%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRK----KKGAEKMKKK----KKDAEKMKKK--- 946
K +KK AE+ +K ++ K +AE+++K KK E++KKK KK AE++KK
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEE 1658
Query: 947 ---GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
A + KK ++ KKK ++ KK ++ +KK AE +KK+ EEA+K
Sbjct: 1659 NKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703
Score = 58.2 bits (140), Expect = 2e-08
Identities = 33/93 (35%), Positives = 58/93 (62%), Gaps = 4/93 (4%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK--KKDAEKMKKKGAEKIGKKK 955
K+ +KKK + +K+ ++KK+AE+++K + K+K K AE+ KKK E KK
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEE--AKKA 1683
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ +KK + KK+ ++ K+AE++KKK E +K
Sbjct: 1684 EEDEKKAAEALKKEAEEAKKAEELKKKEAEEKK 1716
Score = 58.2 bits (140), Expect = 2e-08
Identities = 34/101 (33%), Positives = 62/101 (61%), Gaps = 10/101 (9%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRK---KKGAEKMKK--KKKDAEKMKK----KG 947
KK+ KKK A++ +K + KK+A++ +K K A++ KK + K A++ KK K
Sbjct: 1489 KKKAEEAKKK-ADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKK 1547
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
A+++ K ++ KK ++KKK ++ KK +++ +K EEA+K
Sbjct: 1548 ADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588
Score = 57.5 bits (138), Expect = 3e-08
Identities = 36/105 (34%), Positives = 61/105 (58%), Gaps = 15/105 (14%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK-----KGAEKMKKK---KKDAEKMKKKGA 948
KK+ KK A++ +KK ++ K+A++ +KK K A++ KK KK A++ KK
Sbjct: 1463 KKKAEEAKK--ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEE 1520
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK-----KTEEAEK 988
K + KK ++ KK + KK ++KK+A+++KK K EE +K
Sbjct: 1521 AKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKK 1565
Score = 57.5 bits (138), Expect = 4e-08
Identities = 33/92 (35%), Positives = 62/92 (67%), Gaps = 1/92 (1%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK-KDAEKMKKKGAEKIGKKKK 956
K+ +KK AE ++K+ ++ K+AE+++KK+ EK K ++ K AE+ K AE+ K+ +
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE 1740
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ KKK ++ KK +++KKK A K++ ++AE+
Sbjct: 1741 EDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEE 1772
Score = 53.2 bits (127), Expect = 8e-07
Identities = 33/99 (33%), Positives = 57/99 (57%), Gaps = 9/99 (9%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK--KKKDAEKMKKKGAEKIGKK 954
++R + +K E +++ A K + + A+++KK +KK A++ KK AE+ K
Sbjct: 1247 EERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKK--AEEKKKA 1304
Query: 955 KKKKKKKKKKK-----KKKKKKKKKEAEKMKKKTEEAEK 988
+ KKK ++ K KKK ++ KK+A+ KKK EEA+K
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343
Score = 52.8 bits (126), Expect = 9e-07
Identities = 35/95 (36%), Positives = 56/95 (58%), Gaps = 6/95 (6%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK---KKDAEKMKKKGAEKIGK 953
+ R K E+ + K ++ K+AE+ + K AE++KK KK E++KKK AE+ K
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIK--AEELKKAEEEKKKVEQLKKKEAEEKKK 1648
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ KK +++ K K ++ KK E KKK EEA+K
Sbjct: 1649 AEELKKAEEENKIKAAEEAKKAEED-KKKAEEAKK 1682
Score = 52.8 bits (126), Expect = 1e-06
Identities = 34/112 (30%), Positives = 60/112 (53%), Gaps = 19/112 (16%)
Query: 896 RKKRTRRKKK--KGAEKIRKKKKKKKEAEKMR-----------------KKKGAEKMKKK 936
RK +K + K AE+ +K ++ K+AE+ R +++ A K ++
Sbjct: 1218 RKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEA 1277
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K E K + +K + KK ++KKK + KKK ++ K+A++ KKK EEA+K
Sbjct: 1278 RKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1329
Score = 52.1 bits (124), Expect = 2e-06
Identities = 32/92 (34%), Positives = 55/92 (59%), Gaps = 6/92 (6%)
Query: 899 RTRRKKKKGAEKIRK--KKKKKKEAEKMRKKKGAEKMKK--KKKDAEKMKKKGAEKIGKK 954
+ + KK A++ +K + KK EA+K + K A++ KK +KK A+++KK AE++ K
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK--AEELKKA 1560
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
++KKK ++ KK ++ K + KK EEA
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEA 1592
Score = 51.3 bits (122), Expect = 3e-06
Identities = 40/99 (40%), Positives = 60/99 (60%), Gaps = 10/99 (10%)
Query: 897 KKRTRRKKKKGAEKIRKK----KKKKKEAEKMRK-KKGAEKMKK--KKKDAEKMKKKGAE 949
KK+ KK A++ +KK KKK EA+K + KK A++ KK + K A++ KK
Sbjct: 1476 KKKAEEAKK--ADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEA 1533
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + KK ++KKK + KK ++ K+AE+ KKK EEA+K
Sbjct: 1534 KKADEAKKAEEKKKADELKKAEELKKAEE-KKKAEEAKK 1571
Score = 50.9 bits (121), Expect = 4e-06
Identities = 35/97 (36%), Positives = 56/97 (57%), Gaps = 8/97 (8%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK----KKKKDAEKMKKKGAEKIG 952
+K+ + KK E +K +EA+K + + E MK +KK AE+ KK KI
Sbjct: 1562 EKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIK 1621
Query: 953 ----KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
KK +++KKK ++ KKK+ ++KK+AE++KK EE
Sbjct: 1622 AEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEE 1658
Score = 50.5 bits (120), Expect = 4e-06
Identities = 28/95 (29%), Positives = 58/95 (61%), Gaps = 2/95 (2%)
Query: 896 RKKRTRRKKK--KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
+K ++K K AE+++K ++KKK E + ++ +K ++A+K ++ E++ K
Sbjct: 1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMK 1599
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++KK K ++ KK ++ K +AE++KK EE +K
Sbjct: 1600 LYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634
Score = 46.7 bits (110), Expect = 8e-05
Identities = 32/93 (34%), Positives = 54/93 (58%), Gaps = 6/93 (6%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK----KKDAEKMKKKGAEKIG 952
K+ KKK AE+ +K ++ KK E+ K K AE+ KK+ KK AE+ KK EK
Sbjct: 1701 AKKAEELKKKEAEEKKKAEELKKA-EEENKIK-AEEAKKEAEEDKKKAEEAKKDEEEKKK 1758
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
KK+++KK ++ +K+K+ E++ ++ E+
Sbjct: 1759 IAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEK 1791
Score = 46.3 bits (109), Expect = 8e-05
Identities = 34/112 (30%), Positives = 58/112 (51%), Gaps = 19/112 (16%)
Query: 896 RKKRTRRKK---------KKGAEKIRK--KKKKKKEAEKMRKKKGAEKMKK--------K 936
+K +KK K A++ +K + KK EA+K +KK A+++KK +
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE 1562
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK AE+ KK +K +K ++ KK ++ + ++ K E+ K K EEA+K
Sbjct: 1563 KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKK 1614
Score = 46.3 bits (109), Expect = 1e-04
Identities = 32/90 (35%), Positives = 51/90 (56%), Gaps = 6/90 (6%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK--KKKDAEKMKKKGAEKIGKKKKKKKKKK 962
KK EK + ++ KK E +K + AE+ KK + + E MK EK K ++ KK ++
Sbjct: 1558 KKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEE 1617
Query: 963 KKKK----KKKKKKKKEAEKMKKKTEEAEK 988
K K KK +++KK+ E++KKK E +K
Sbjct: 1618 AKIKAEELKKAEEEKKKVEQLKKKEAEEKK 1647
Score = 45.9 bits (108), Expect = 1e-04
Identities = 37/99 (37%), Positives = 62/99 (62%), Gaps = 9/99 (9%)
Query: 896 RKKRTRRKKK----KGAEKIRKKK-KKKKEAEKMRKKKGAEKMK--KKKKDAEKMKKKGA 948
+K KK+ K AE+++KK+ ++KK+AE+++K + K+K + KK+AE+ KKK
Sbjct: 1688 KKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE 1747
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
E KK +++KKK KK+++KK +E K K+ E E
Sbjct: 1748 E--AKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEE 1784
Score = 45.1 bits (106), Expect = 2e-04
Identities = 30/104 (28%), Positives = 55/104 (52%), Gaps = 11/104 (10%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKK-------KEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
+ + + + AE+ +K ++ + E EK K + A+K ++ K AE++KK
Sbjct: 1571 KAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEE 1630
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK----KKTEEAEK 988
EK ++ KKK+ ++KKK ++ KK +E K+K K E +K
Sbjct: 1631 EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674
Score = 42.1 bits (98), Expect = 0.002
Identities = 22/90 (24%), Positives = 42/90 (46%), Gaps = 1/90 (1%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
+ K G ++ K K + + + E ++ AE+ KK K + +K +
Sbjct: 1059 KAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAE 1118
Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ KKK + +K ++ ++AE +K EEA K
Sbjct: 1119 EAKKKAEDARKAEEARKAEDA-RKAEEARK 1147
Score = 39.0 bits (90), Expect = 0.018
Identities = 31/108 (28%), Positives = 54/108 (50%), Gaps = 3/108 (2%)
Query: 877 DVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK 936
D +D R + T F + + K AE+ RK ++ KK+AE RK + A K +
Sbjct: 1082 DAKEDNRADEATEEAFGKAEEAKKTETGK--AEEARKAEEAKKKAEDARKAEEARKAEDA 1139
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
+K AE+ +K K + +K + +K ++ +K + K+AE +K E
Sbjct: 1140 RK-AEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEE 1186
Score = 38.6 bits (89), Expect = 0.022
Identities = 25/84 (29%), Positives = 51/84 (60%), Gaps = 1/84 (1%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
+ + + AE++RK ++ +K AE RK + A K ++++K E K + A+K KK ++
Sbjct: 1177 KAEAARKAEEVRKAEELRK-AEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEA 1235
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTE 984
KK ++ KK ++++ E+++K E
Sbjct: 1236 KKDAEEAKKAEEERNNEEIRKFEE 1259
Score = 38.2 bits (88), Expect = 0.025
Identities = 27/93 (29%), Positives = 52/93 (55%), Gaps = 1/93 (1%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
R + R+ + + +K + K+AE RK + K ++ +K AE +K A + +++
Sbjct: 1153 RVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRK-AEDARKAEAARKAEEE 1211
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K ++ +K + KK + K+AE+ KK EEA+K
Sbjct: 1212 RKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKK 1244
Score = 37.8 bits (87), Expect = 0.038
Identities = 24/87 (27%), Positives = 48/87 (55%), Gaps = 5/87 (5%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
KK K K E ++ ++ KK+AE+ +K + KKK +K ++K AE+I +K+
Sbjct: 1722 KKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE---KKKIAHLKKEEEKKAEEI--RKE 1776
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKT 983
K+ +++ ++ +K++ E +K K
Sbjct: 1777 KEAVIEEELDEEDEKRRMEVDKKIKDI 1803
Score = 37.8 bits (87), Expect = 0.038
Identities = 45/145 (31%), Positives = 65/145 (44%), Gaps = 44/145 (30%)
Query: 884 REAVTALGFILFRKKRTRRKKKKGAEKIRK--------KKKKKKEAEKMRKKKGAEKMKK 935
++A A RK RK AE RK +++K +EA K K AE +KK
Sbjct: 1176 KKAEAARKAEEVRKAEELRK----AEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKK 1231
Query: 936 ---KKKDAEKMKK------------------------KGAEKIGKKKK-----KKKKKKK 963
KKDAE+ KK + A K + +K K ++KKK
Sbjct: 1232 AEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKK 1291
Query: 964 KKKKKKKKKKKEAEKMKKKTEEAEK 988
+ KK ++KK+A++ KKK EEA+K
Sbjct: 1292 ADEAKKAEEKKKADEAKKKAEEAKK 1316
Score = 37.0 bits (85), Expect = 0.068
Identities = 27/84 (32%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
+ + K AE RK ++ +K AE++RK + A K + +K E+ K + A K KK +
Sbjct: 1171 KAEDAKKAEAARKAEEVRK-AEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAV 1229
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTE 984
KK ++ KK ++ K+AE+ + E
Sbjct: 1230 KKAEEAKKDAEEAKKAEEERNNEE 1253
Score = 36.3 bits (83), Expect = 0.097
Identities = 22/80 (27%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
K + KK+ E +K ++ KK+ E+ KKK A K+++K AE+++K+ I ++
Sbjct: 1728 NKIKAEEAKKEAEEDKKKAEEAKKDEEE--KKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785
Query: 956 KKKKKKKKKKKKKKKKKKKE 975
++ +K++ + KK K +
Sbjct: 1786 DEEDEKRRMEVDKKIKDIFD 1805
Score = 35.1 bits (80), Expect = 0.24
Identities = 32/104 (30%), Positives = 53/104 (50%), Gaps = 3/104 (2%)
Query: 884 REAVTALGFILFRKKRTRRKKK--KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE 941
R+A A + RK RK + + AE +K + +K AE++RK + K + +K
Sbjct: 1146 RKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARK-AEEVRKAEELRKAEDARKAEA 1204
Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K + K + +K + KK + KK ++ KK+AE+ KK EE
Sbjct: 1205 ARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEE 1248
Score = 30.9 bits (69), Expect = 4.1
Identities = 27/105 (25%), Positives = 53/105 (50%), Gaps = 12/105 (11%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKK--------KEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
RK +KK + A K + +K + ++AE ++ + A K + +K E K +
Sbjct: 1115 RKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAED 1174
Query: 948 AEKI--GKKKKKKKKKKKKKKKKKKKKKKEAEKMK--KKTEEAEK 988
A+K +K ++ +K ++ +K + +K + A K + +K EEA K
Sbjct: 1175 AKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARK 1219
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 66.9 bits (163), Expect = 4e-11
Identities = 53/245 (21%), Positives = 106/245 (43%), Gaps = 22/245 (8%)
Query: 611 NAFMHSGTTSDQFLRDNLEWLARATNWAKLTATASLGVIHRGHETEALAHMQSYLPRETG 670
N ++ G +D + + EW+ + ++A AS+GVI + + L + YL +
Sbjct: 368 NDPINLGYENDSLIPLDDEWIYKCKVPGLISAFASIGVIESWNSDKGLETLDKYLYAD-- 425
Query: 671 PTSGYYEGGGLYALGLIH-ANHG------AVINDYLLGQVKDATNEMVRHGGCLGLGLAA 723
Y + G L +GL A++++YL +++ LGLG A
Sbjct: 426 --ESYRKAGALLGIGLSGLRVFEERPPALALLSNYL-----QSSSSKHVIAAILGLGAAF 478
Query: 724 MGSARDDIYEQLKFNLYQDDAVTGEA--AGIAMGLVLLGSKSE----SAIQDMVAYAKET 777
G+ +++ E L+ ++ D+ A + +G V +G+ + +Q V K
Sbjct: 479 SGTQAEEVLELLQPIMFSTDSPIEVVFFASLTLGSVFVGTCNGDLTSLILQTFVERGKIE 538
Query: 778 QHEKILRGLAVGISFIMYGQLENADDLVRDLCEEKDAILRRSGMYTIAMAYCGSGSNVAT 837
+ R LA+G++ + YG+ + DD+ + + A+ + + Y G+G +
Sbjct: 539 SETQWFRFLALGLASLFYGRKDQVDDVEETIMAIEGALSKHEEILVKGCQYVGTGDVLVI 598
Query: 838 KRLLH 842
+ LLH
Sbjct: 599 QSLLH 603
Score = 39.1 bits (91), Expect = 0.014
Identities = 52/287 (18%), Positives = 99/287 (34%), Gaps = 75/287 (26%)
Query: 249 NAFMHSGTTSDQFLRDNLEWLARATNWAKLTATASLGVIHRGHETEALAHMQSYLPRETG 308
N ++ G +D + + EW+ + ++A AS+GVI + + L + YL +
Sbjct: 368 NDPINLGYENDSLIPLDDEWIYKCKVPGLISAFASIGVIESWNSDKGLETLDKYLYAD-- 425
Query: 309 PTSGYYEGGGLYALGLIH-ANHG------AVINDYL--------------LGQVKDATNE 347
Y + G L +GL A++++YL LG T
Sbjct: 426 --ESYRKAGALLGIGLSGLRVFEERPPALALLSNYLQSSSSKHVIAAILGLGAAFSGTQA 483
Query: 348 DDAV---------------TGEAAGIAMGLVLLGSKSE----SAIQDMVAYAKETQHEKI 388
++ + A + +G V +G+ + +Q V K +
Sbjct: 484 EEVLELLQPIMFSTDSPIEVVFFASLTLGSVFVGTCNGDLTSLILQTFVERGKIESETQW 543
Query: 389 LRGLAVGISFIMYGQLENADDLVRDLCEEKDAILRRSGMYTIAMAYCVSYLYFDLFSDLT 448
R LA+G++ + YG+ + DD+ + + A+ + I + C
Sbjct: 544 FRFLALGLASLFYGRKDQVDDVEETIMAIEGALSKHE---EILVKGC------------- 587
Query: 449 YRIVKNRSGMYTIAMAYCGSGSNVATKRLLHVAVSDVNDDVRREAVT 495
Y G+G + + LLHV D ++ E
Sbjct: 588 ---------------QYVGTGDVLVIQSLLHVKDEFTGDTLKNEEAL 619
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 63.7 bits (155), Expect = 1e-10
Identities = 32/99 (32%), Positives = 53/99 (53%), Gaps = 6/99 (6%)
Query: 896 RKKRTRRKK-----KKGAEKIRKKKK-KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
R+K ++ K AE+ K+ + K+K+AE+ + K+ AE K + +AEK K+ A+
Sbjct: 90 RQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAK 149
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K +++ K K + KKK + KKK + K K E K
Sbjct: 150 KQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188
Score = 59.5 bits (144), Expect = 3e-09
Identities = 36/102 (35%), Positives = 46/102 (45%), Gaps = 10/102 (9%)
Query: 897 KKRTRRKKKKGAE---KIRKKKKKKKEAEKMRKKKGAEKMK----KKKKDAEKMKKKGAE 949
K+ K+K+ E K + K K EAE +K K K + K K A + KKK AE
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170
Query: 950 KIGKKKKKKKKKKKKKKKKK---KKKKKEAEKMKKKTEEAEK 988
K + + K K + K K K K K EA K K E A K
Sbjct: 171 AKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAK 212
Score = 57.2 bits (138), Expect = 2e-08
Identities = 27/89 (30%), Positives = 51/89 (57%), Gaps = 6/89 (6%)
Query: 903 KKKKGAEKIRKKK---KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
+K++ AE+ R+K+ + E + ++ A++ ++K+K AE+ K K A + K K +
Sbjct: 81 EKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAE---AKAKAE 137
Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ +KK K++ KK+ E E K EA+K
Sbjct: 138 AEAEKKAKEEAKKQAEEEAKAKAAAEAKK 166
Score = 56.4 bits (136), Expect = 3e-08
Identities = 37/104 (35%), Positives = 47/104 (45%), Gaps = 11/104 (10%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKK---KKEAEKMRKKKGAEKMKKK----KKDAEKMKKKGA 948
K + + K AE +KK K KK+AE+ K K A + KKK KK AE K A
Sbjct: 124 AKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183
Query: 949 EKIGKKKKKKKKKK----KKKKKKKKKKKKEAEKMKKKTEEAEK 988
E K K ++ K K K K + K EAE EAE+
Sbjct: 184 EAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAER 227
Score = 55.6 bits (134), Expect = 5e-08
Identities = 31/100 (31%), Positives = 61/100 (61%), Gaps = 11/100 (11%)
Query: 898 KRTRRKKKKGAEKIRKKKKK-KKEAEKMRKKKGAE-----KMKKKKKDAEKMKKKGAEKI 951
R +++KK A+K ++++KK +++AE+ K++ AE +++++ + K+ AE+
Sbjct: 53 NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQ--AEQA 110
Query: 952 GKKKKKKKK---KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+ ++K+K + K K+ + K K EAE KK EEA+K
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150
Score = 51.8 bits (124), Expect = 1e-06
Identities = 25/96 (26%), Positives = 52/96 (54%), Gaps = 5/96 (5%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK--KKKKDAEKMKKKGAEK---IG 952
+ ++++K E+ ++ +K++ AE+ R+K+ ++ K K AE+ K+ EK
Sbjct: 63 AKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAE 122
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K K+ + K K + + +KK + E K+ EEA+
Sbjct: 123 EAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKA 158
Score = 42.1 bits (99), Expect = 9e-04
Identities = 20/82 (24%), Positives = 44/82 (53%), Gaps = 1/82 (1%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK-KKK 966
A +I+++KK + E+ R+KK ++ ++ +K + + E + +K K+ +
Sbjct: 52 ANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAA 111
Query: 967 KKKKKKKKEAEKMKKKTEEAEK 988
K+ ++K+K+AE+ K K K
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAK 133
Score = 41.7 bits (98), Expect = 0.002
Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 6/82 (7%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
K + + KK A + +KK + + +A+ K K + K K +A K K +
Sbjct: 156 AKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAK------AKAAAEA 209
Query: 956 KKKKKKKKKKKKKKKKKKKEAE 977
K + + + ++K E
Sbjct: 210 AAKAEAEAAAAAAAEAERKADE 231
Score = 36.0 bits (83), Expect = 0.081
Identities = 16/75 (21%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK--KKK 973
+ R ++ + KK+++ +K ++ AE+ K++ ++ ++K+ +++ K
Sbjct: 45 PGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAA 104
Query: 974 KEAEKMKKKTEEAEK 988
K+AE+ K+ EE +K
Sbjct: 105 KQAEQAAKQAEEKQK 119
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 57.7 bits (140), Expect = 9e-10
Identities = 37/91 (40%), Positives = 57/91 (62%), Gaps = 8/91 (8%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
+ RR++ + E + +K KK+ E E+ ++K+ +KK+KD EK KK A K++KK
Sbjct: 42 RALRRREYERLELMDEKWKKETEDEEFQQKR----EEKKRKDEEKTAKKRA----KRQKK 93
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+KKKKKKK KK KK+E E K E +++
Sbjct: 94 KQKKKKKKKAKKGNKKEEKEGSKSSEESSDE 124
Score = 47.7 bits (114), Expect = 2e-06
Identities = 29/87 (33%), Positives = 49/87 (56%), Gaps = 16/87 (18%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
+K+ + E +K+++KK++ E+ KK A++ KKK +KKKKKKK
Sbjct: 60 KKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKK----------------QKKKKKKKA 103
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK KK++K+ K +E+ + EE E+
Sbjct: 104 KKGNKKEEKEGSKSSEESSDEEEEGEE 130
Score = 44.7 bits (106), Expect = 3e-05
Identities = 23/78 (29%), Positives = 47/78 (60%), Gaps = 3/78 (3%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
F++KR +K+K E+ KK+ K++ +K +KKK +K KK K EK K +E+ +
Sbjct: 68 FQQKREEKKRK--DEEKTAKKRAKRQKKK-QKKKKKKKAKKGNKKEEKEGSKSSEESSDE 124
Query: 955 KKKKKKKKKKKKKKKKKK 972
+++ ++ K+++ + +K
Sbjct: 125 EEEGEEDKQEEPVEIMEK 142
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 61.4 bits (149), Expect = 1e-09
Identities = 26/93 (27%), Positives = 55/93 (59%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+++ +++ K +K +K+K K++ ++ K++ EK K+K+ EK KK + +++
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEE 163
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK+++ + K + KK KKK K K+ EE ++
Sbjct: 164 KKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQ 196
Score = 58.7 bits (142), Expect = 1e-08
Identities = 27/87 (31%), Positives = 49/87 (56%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
++ +KG K K K + K K EK K++ K+ +K KK+ ++ K +K K++
Sbjct: 78 KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEA 137
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+K+ K+K+K+KE + + + E EK
Sbjct: 138 KEKRPPKEKEKEKEKKVEEPRDREEEK 164
Score = 58.4 bits (141), Expect = 1e-08
Identities = 32/93 (34%), Positives = 51/93 (54%), Gaps = 4/93 (4%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK-KGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+K+ +++K K K RK K++ KE ++K K EK ++ +D E+ KK+ + K
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVR---AK 172
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ KK KKK KKK+ E EK ++ EA K
Sbjct: 173 SRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVK 205
Score = 56.0 bits (135), Expect = 6e-08
Identities = 22/94 (23%), Positives = 58/94 (61%), Gaps = 2/94 (2%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
+T+ K+ E ++++K+K++ ++ +KKK ++ K++ K K++ EK K
Sbjct: 87 GPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKK--KEKPKEEPKDRKPKEEAKEKRPPK 144
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K+K+K+KK ++ + ++++K+ E+++ K+ +
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKP 178
Score = 52.6 bits (126), Expect = 8e-07
Identities = 27/91 (29%), Positives = 50/91 (54%), Gaps = 7/91 (7%)
Query: 896 RKKRTRRKKKKGAEKIRKKKK-KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
+ + R+ K++ EK K+K K+KE KK + ++++K E+++ K K
Sbjct: 126 EEPKDRKPKEEAKEKRPPKEKEKEKE-----KKVEEPRDREEEKKRERVRAKSRPK-KPP 179
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
KKK KKK+ +++K+++ E +K K EE
Sbjct: 180 KKKPPNKKKEPPEEEKQRQAAREAVKGKPEE 210
Score = 49.9 bits (119), Expect = 6e-06
Identities = 26/76 (34%), Positives = 42/76 (55%), Gaps = 1/76 (1%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
K + K EA K R +KG K K K K + K +K+K++ K++KKKKK+K
Sbjct: 65 KCAESKLSSDEAVK-RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEK 123
Query: 970 KKKKKEAEKMKKKTEE 985
K++ + K K++ +E
Sbjct: 124 PKEEPKDRKPKEEAKE 139
Score = 48.0 bits (114), Expect = 2e-05
Identities = 23/94 (24%), Positives = 45/94 (47%), Gaps = 4/94 (4%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMR---KKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
KR ++K+K EK + ++ + +E EK R + K K KKK K K+ E+ ++
Sbjct: 140 KRPPKEKEKEKEK-KVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQ 198
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ K K ++ +++++ E K E
Sbjct: 199 AAREAVKGKPEEPDVNEEREKEEDDGKDRETTTS 232
Score = 41.0 bits (96), Expect = 0.003
Identities = 17/67 (25%), Positives = 38/67 (56%)
Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
K+ + ++++K K K A++ + K+++K+K++ K++KKKKKE K +
Sbjct: 69 SKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEP 128
Query: 982 KTEEAEK 988
K + ++
Sbjct: 129 KDRKPKE 135
Score = 41.0 bits (96), Expect = 0.003
Identities = 20/94 (21%), Positives = 46/94 (48%), Gaps = 4/94 (4%)
Query: 896 RKKRTRRKKKKG--AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
K++ ++ ++ E+ ++++ + K K KK KKK+ E+ K++ A +
Sbjct: 147 EKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPP--NKKKEPPEEEKQRQAAREAV 204
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K K ++ ++++K++ K+ E EE E
Sbjct: 205 KGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDE 238
Score = 39.5 bits (92), Expect = 0.008
Identities = 20/99 (20%), Positives = 39/99 (39%), Gaps = 6/99 (6%)
Query: 896 RKKRTRRKKKKGAEKIRKKKK---KKKEAEKMRKKKGAEKMKKKKKDAEKM---KKKGAE 949
+K+ R + K K KKK KKKE + K++ A + K K E +++ E
Sbjct: 162 EEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEE 221
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
GK ++ ++ + ++ + K +
Sbjct: 222 DDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSP 260
Score = 37.9 bits (88), Expect = 0.028
Identities = 18/91 (19%), Positives = 44/91 (48%), Gaps = 1/91 (1%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK-KKKDAEKMKKKGAEKIGKK 954
++ R+++K E++R K + KK +K K E ++ K++ A + KG +
Sbjct: 154 VEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDV 213
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
++++K++ K ++ E +++ E
Sbjct: 214 NEEREKEEDDGKDRETTTSPMEEDESRQSSE 244
Score = 37.2 bits (86), Expect = 0.050
Identities = 17/61 (27%), Positives = 35/61 (57%)
Query: 928 KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K AE + ++++K G++ K K K+ K + K+++K+K++ ++ KKK +E
Sbjct: 65 KCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKP 124
Query: 988 K 988
K
Sbjct: 125 K 125
Score = 31.8 bits (72), Expect = 2.3
Identities = 17/91 (18%), Positives = 33/91 (36%), Gaps = 5/91 (5%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG-----AEKMKKKKKDAEKMKKKGAEK 950
K+ KKK + K+++ +EA K + ++ EK + KD E E
Sbjct: 178 PPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEED 237
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
++ + ++ KK A +
Sbjct: 238 ESRQSSEISRRSSSSLKKPDPSPSMASPETR 268
Score = 30.6 bits (69), Expect = 4.7
Identities = 12/89 (13%), Positives = 40/89 (44%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
KK+ KKK+ E+ ++++ ++ + ++ + ++K++D K ++ + + +
Sbjct: 179 PKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDE 238
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
++ + ++ KK + M
Sbjct: 239 SRQSSEISRRSSSSLKKPDPSPSMASPET 267
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 59.5 bits (144), Expect = 3e-09
Identities = 28/91 (30%), Positives = 38/91 (41%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
K+ + + + K K A+K+ KKK A+ KK A+ K + KK
Sbjct: 200 KKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKA 259
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK K KK KK K A K K +A K
Sbjct: 260 LKKAAKAVKKAAKKAAKAAAKAAKGAAKATK 290
Score = 55.7 bits (134), Expect = 5e-08
Identities = 28/93 (30%), Positives = 38/93 (40%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
K ++ K +K KKK + KK A+ K K A+K KK +K K
Sbjct: 208 AKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAV 267
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK KK K K K +A K K K ++
Sbjct: 268 KKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAG 300
Score = 55.7 bits (134), Expect = 5e-08
Identities = 30/90 (33%), Positives = 36/90 (40%), Gaps = 1/90 (1%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K KK +K K K+A K K A+K KK K A K K +K KK
Sbjct: 217 AKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKA-AKKAKKTAKKALKKAAKAVKKAAKKAA 275
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
K K K K K K +A+K K A
Sbjct: 276 KAAAKAAKGAAKATKGKAKAKKKAGKKAAA 305
Score = 52.2 bits (125), Expect = 6e-07
Identities = 29/87 (33%), Positives = 34/87 (39%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
K+ K A+K K K + K KK +K K K A K K A K K K
Sbjct: 228 KKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAK 287
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTE 984
K K K KKK KK K K +
Sbjct: 288 ATKGKAKAKKKAGKKAAAGSKAKATAK 314
Score = 50.7 bits (121), Expect = 2e-06
Identities = 29/87 (33%), Positives = 37/87 (42%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ + +K K A K KK KK + + K +K KK A KGA K K K
Sbjct: 233 KTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGK 292
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKK 982
K KKK KK K K A+ K+
Sbjct: 293 AKAKKKAGKKAAAGSKAKATAKAPKRG 319
Score = 49.9 bits (119), Expect = 4e-06
Identities = 30/93 (32%), Positives = 39/93 (41%), Gaps = 2/93 (2%)
Query: 897 KKRTRRKKKKGAEKIRKKK--KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
+ KKK + KK K +A K KK + +KK K +K KK A+ K
Sbjct: 222 AAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKA 281
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K K K K K KKK K+A K A+
Sbjct: 282 AKGAAKATKGKAKAKKKAGKKAAAGSKAKATAK 314
Score = 48.7 bits (116), Expect = 7e-06
Identities = 30/87 (34%), Positives = 42/87 (48%), Gaps = 2/87 (2%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K ++ KK + ++K K K+A K + K A K K A K K K +K GKK
Sbjct: 246 AKAAKKAKKTAKKALKKAAKAVKKAAK-KAAKAAAKAAKGAAKATKGKAKAKKKAGKKAA 304
Query: 957 KKKKKKKK-KKKKKKKKKKEAEKMKKK 982
K K K K+ K K+A+K+ KK
Sbjct: 305 AGSKAKATAKAPKRGAKGKKAKKVTKK 331
Score = 46.4 bits (110), Expect = 4e-05
Identities = 30/89 (33%), Positives = 36/89 (40%), Gaps = 6/89 (6%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
A+K K K A+K K + K KK A K A+K K K KK KK
Sbjct: 196 AANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAK-TAVSAKKAAKTAAKAAKKAKK 254
Query: 965 K-----KKKKKKKKKEAEKMKKKTEEAEK 988
KK K KK A+K K +A K
Sbjct: 255 TAKKALKKAAKAVKKAAKKAAKAAAKAAK 283
Score = 46.4 bits (110), Expect = 5e-05
Identities = 31/88 (35%), Positives = 37/88 (42%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
+T KK A+ K KK K+ K KK A+ +KK K A K K A+ K K K
Sbjct: 233 KTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGK 292
Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
K KKK KK K K A
Sbjct: 293 AKAKKKAGKKAAAGSKAKATAKAPKRGA 320
Score = 43.3 bits (102), Expect = 5e-04
Identities = 25/79 (31%), Positives = 32/79 (40%), Gaps = 1/79 (1%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
+ KK K + K K A+K+ KK A K KK K K KK K
Sbjct: 194 VGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAK 253
Query: 971 KK-KKEAEKMKKKTEEAEK 988
K KK +K K ++A K
Sbjct: 254 KTAKKALKKAAKAVKKAAK 272
Score = 38.7 bits (90), Expect = 0.011
Identities = 22/70 (31%), Positives = 28/70 (40%), Gaps = 1/70 (1%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K ++ KK A+ K K +A K K K +K KK K K K
Sbjct: 264 AKAVKKAAKKAAKAAAKAAKGAAKATK-GKAKAKKKAGKKAAAGSKAKATAKAPKRGAKG 322
Query: 957 KKKKKKKKKK 966
KK KK KK+
Sbjct: 323 KKAKKVTKKR 332
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 56.2 bits (136), Expect = 6e-09
Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 7/87 (8%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
++KKK+ AE+I K KKE E+ K+K K KK KK +K K K +K K K +KK
Sbjct: 61 KKKKKELAEEIE---KVKKEYEE--KQKWKWKKKKSKKKKDKDKDKKDDK--KDDKSEKK 113
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+K+ + K + K + E +
Sbjct: 114 DEKEAEDKLEDLTKSYSETLSTLSELK 140
Score = 53.2 bits (128), Expect = 6e-08
Identities = 26/57 (45%), Positives = 36/57 (63%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K+ ++ EK+KK+ EK K KKKK KKKK K K KK K+ +K +KK E+ +
Sbjct: 63 KKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119
Score = 53.2 bits (128), Expect = 6e-08
Identities = 27/70 (38%), Positives = 42/70 (60%), Gaps = 1/70 (1%)
Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK-KKKKKEAEK 978
K +KK+ AE+++K KK+ E+ +K +K KKKK K K KK KK K +KK+ ++
Sbjct: 58 TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKE 117
Query: 979 MKKKTEEAEK 988
+ K E+ K
Sbjct: 118 AEDKLEDLTK 127
Score = 52.8 bits (127), Expect = 9e-08
Identities = 27/81 (33%), Positives = 42/81 (51%), Gaps = 3/81 (3%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
E +KKK+ +E EK++K+ ++ K KK K KKK +K KK KK K +KK
Sbjct: 58 TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSK-KKKDKDK--DKKDDKKDDKSEKKD 114
Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
+K+ + K + K +E
Sbjct: 115 EKEAEDKLEDLTKSYSETLST 135
Score = 46.6 bits (111), Expect = 1e-05
Identities = 23/59 (38%), Positives = 37/59 (62%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ KKKKK+ + +K ++ +K+K K KKKK KKKK K K K+ +K K+E+ ++
Sbjct: 57 YTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Score = 45.5 bits (108), Expect = 3e-05
Identities = 22/95 (23%), Positives = 45/95 (47%), Gaps = 7/95 (7%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
++K +KKK +K + K KK + + +KK ++ + K +D K + + + K
Sbjct: 81 KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140
Query: 956 KKKKKKKK-------KKKKKKKKKKKEAEKMKKKT 983
+K K +K++ + KKEAE+++
Sbjct: 141 PRKYALHKDIYQSRLDRKRRAEVAKKEAEELQNPG 175
Score = 40.1 bits (94), Expect = 0.002
Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 6/89 (6%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
+K+K K +K KKKK + + KK + K +KKD ++ + K E + K +
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKD--KKDDKKDDKSEKKDEKEAEDK-LEDLTKSYSETLSTL 136
Query: 963 KKKKKKKKKKKKEAEKM---KKKTEEAEK 988
+ K +K K+ + +K+ E K
Sbjct: 137 SELKPRKYALHKDIYQSRLDRKRRAEVAK 165
Score = 35.1 bits (81), Expect = 0.079
Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 14/55 (25%)
Query: 948 AEKIGKKKKKK--------------KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
AE KKKKK +K+K K KKKK KKKK+ +K KK ++ +K
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 55.1 bits (133), Expect = 2e-08
Identities = 29/71 (40%), Positives = 40/71 (56%), Gaps = 7/71 (9%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
+ + +EK K +KEAE ++K +K KK+ K +K KK KK+K + K
Sbjct: 130 SESETSEK-ETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKD------KKEKMVEPKG 182
Query: 963 KKKKKKKKKKK 973
KKKKKKKKKK
Sbjct: 183 SKKKKKKKKKK 193
Score = 52.0 bits (125), Expect = 2e-07
Identities = 22/66 (33%), Positives = 40/66 (60%), Gaps = 2/66 (3%)
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
+ + +EK K + + + ++ EK +KK +++ K+KK+KK KK+K + K KKK+
Sbjct: 130 SESETSEKETTAK--VEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187
Query: 976 AEKMKK 981
+K KK
Sbjct: 188 KKKKKK 193
Score = 49.7 bits (119), Expect = 1e-06
Identities = 25/63 (39%), Positives = 39/63 (61%), Gaps = 1/63 (1%)
Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
+ + +EK K + E ++ EK KKKKK+ KK+KK+KK KK+K E + KKK ++
Sbjct: 131 ESETSEKETTAKVEKEAEVEE-EEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189
Query: 986 AEK 988
+K
Sbjct: 190 KKK 192
Score = 48.9 bits (117), Expect = 2e-06
Identities = 27/68 (39%), Positives = 40/68 (58%), Gaps = 10/68 (14%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
+K T K +K AE ++KK+KK+ ++++K+K +K KK+K K KK
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPK----------GSKK 185
Query: 957 KKKKKKKK 964
KKKKKKKK
Sbjct: 186 KKKKKKKK 193
Score = 47.4 bits (113), Expect = 7e-06
Identities = 23/60 (38%), Positives = 39/60 (65%)
Query: 929 GAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
G+E +K+ K++K+ + +KK+KKKKK+ KK+KK+KK KKE K +++ +K
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188
Score = 43.9 bits (104), Expect = 1e-04
Identities = 22/79 (27%), Positives = 36/79 (45%), Gaps = 24/79 (30%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ ++ + E+ ++KKKKK+ ++ ++KK KK
Sbjct: 139 TTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKK------------------------DKK 174
Query: 956 KKKKKKKKKKKKKKKKKKK 974
+K + K KKKKKKKKKK
Sbjct: 175 EKMVEPKGSKKKKKKKKKK 193
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 55.0 bits (133), Expect = 2e-08
Identities = 38/100 (38%), Positives = 60/100 (60%), Gaps = 11/100 (11%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA---EKMK--KKKKDAEKMKKKGAEK 950
KK+ ++KK + ++K + +KK+ E++ + K A EK K K AE +K K EK
Sbjct: 83 LKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEKEKWTKALAKAEGVKVKDDEK 142
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKE------AEKMKKKTE 984
+ KK K+K+K+KKK KK+ K++KE AE+ KK+ E
Sbjct: 143 LLKKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREE 182
Score = 51.9 bits (125), Expect = 3e-07
Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 9/96 (9%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA---EKMK--KKKKDAEKMKKKGA 948
L +KK+ ++ K K + +KKK E + + K A EK K K AE +K K
Sbjct: 83 LKKKKKKKKTDYKQLLKKLEARKKKLEE--LDEDKAAEIEEKEKWTKALAKAEGVKVKDD 140
Query: 949 EKIGKK--KKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
EK+ KK K+K+K+KKK KK+ K++K+K +K ++
Sbjct: 141 EKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAER 176
Score = 51.2 bits (123), Expect = 5e-07
Identities = 29/78 (37%), Positives = 44/78 (56%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
+ K K EK+ KK K+KE +K + KK ++ K+K + + ++K E+ KK+K
Sbjct: 129 LAKAEGVKVKDDEKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRK 188
Query: 957 KKKKKKKKKKKKKKKKKK 974
KK KKKKK KKK +
Sbjct: 189 DDKKNKKKKKAKKKGRIL 206
Score = 50.4 bits (121), Expect = 9e-07
Identities = 37/111 (33%), Positives = 48/111 (43%), Gaps = 18/111 (16%)
Query: 896 RKKRTRRKKKKGAE-KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK-------G 947
R+KR +RK +K + K KKK+ +++E K K KKK E +
Sbjct: 12 RRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSKVEF 71
Query: 948 AEKIGKKKKKKKKKKKKKKKK----------KKKKKKEAEKMKKKTEEAEK 988
A+ KK K KKKKKKKK +KKK E K E EK
Sbjct: 72 ADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEK 122
Score = 44.6 bits (106), Expect = 7e-05
Identities = 33/119 (27%), Positives = 52/119 (43%), Gaps = 27/119 (22%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEA-EKMRKKKGAEKMKKKKKDAEK------------ 942
R++R RKK+K E +K+ +K EA E ++ ++K ++AE
Sbjct: 15 REQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSKVEFADG 74
Query: 943 -MKKKGAEKIGKKKKKKK---------KKKKKK----KKKKKKKKKEAEKMKKKTEEAE 987
KK + KKKKKK + +KKK + K + +E EK K +AE
Sbjct: 75 EQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEKEKWTKALAKAE 133
Score = 42.3 bits (100), Expect = 5e-04
Identities = 31/94 (32%), Positives = 48/94 (51%), Gaps = 14/94 (14%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
+ KK + +KKKKKK + K+ +K++ +KK E++ + A +I +K+K K
Sbjct: 76 QAKKDLKLKKKKKKKKTDY-----KQLLKKLEARKKKLEELDEDKAAEIEEKEKWTKALA 130
Query: 963 K---KKKK------KKKKKKKEAEKMKKKTEEAE 987
K K K KK K+KE +K K K E E
Sbjct: 131 KAEGVKVKDDEKLLKKALKRKEKQKKKSKKEWKE 164
Score = 34.6 bits (80), Expect = 0.15
Identities = 12/40 (30%), Positives = 28/40 (70%)
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E + +++++K++++K +KK+K+K+ K+ E +K E K
Sbjct: 5 EALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVK 44
Score = 34.2 bits (79), Expect = 0.20
Identities = 21/57 (36%), Positives = 33/57 (57%), Gaps = 7/57 (12%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMR-----KKKGAEKMKKKKKDAEKMKKKG 947
++K+ ++ KK+ E+ K +KKK E +K R K+K +K KKKKK K K +
Sbjct: 151 KEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKK--AKKKGRI 205
Score = 33.8 bits (78), Expect = 0.25
Identities = 11/40 (27%), Positives = 28/40 (70%)
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+ ++++K++++K +KK+K+K+ KK+ + K + EE +
Sbjct: 5 EALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVK 44
Score = 32.3 bits (74), Expect = 0.96
Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 5/54 (9%)
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKK-----KKKKKKEAEKMKKKTEEAEK 988
++ +K+ K KK+K+K+ KKK+ +K K ++ +++K E AE
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEG 62
Score = 29.6 bits (67), Expect = 6.3
Identities = 12/57 (21%), Positives = 29/57 (50%), Gaps = 2/57 (3%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ +++K++ K +KK +K + KKK+ +K + ++ K ++ K +K
Sbjct: 9 EQRRRKREQRKARKK--QKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 56.9 bits (138), Expect = 3e-08
Identities = 38/132 (28%), Positives = 56/132 (42%), Gaps = 29/132 (21%)
Query: 872 HVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGA-------------EKIR---KKK 915
HV +N D+ R A T L R RT R +KG KI ++
Sbjct: 316 HV----INFDMPRSADT----YLHRIGRTGRAGRKGTAISLVEAHDHLLLGKIERYIEEP 367
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
K + +++R K A KK K ++K+ K AEK K+K+K K KK+ + K K++
Sbjct: 368 LKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRR- 426
Query: 976 AEKMKKKTEEAE 987
K E
Sbjct: 427 ----KPSGTSEE 434
Score = 44.2 bits (105), Expect = 3e-04
Identities = 15/44 (34%), Positives = 25/44 (56%)
Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K +EK K KK K+ +KK+K+K+K + +K + T+ K
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGK 424
Score = 33.0 bits (76), Expect = 0.84
Identities = 15/34 (44%), Positives = 18/34 (52%), Gaps = 1/34 (2%)
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K +KK K KK K AEK K+K +E K
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEK-KEKEKEKPK 411
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 56.1 bits (136), Expect = 6e-08
Identities = 38/135 (28%), Positives = 76/135 (56%), Gaps = 20/135 (14%)
Query: 864 NVATK--RLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKK--------KKGAEKIRK 913
++A K LH + V RRE + L I+F+ + ++ E +
Sbjct: 350 SIAKKIAEKLHTSKRKV----RREVLPFLS-IIFKHNPELAARLAAFLELTEEEIEFLTG 404
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
KK K+ +K+ EK +KK+++ +K KKK A KK+++++++K+KK+++K++++
Sbjct: 405 SKKATKKIKKI-----VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459
Query: 974 KEAEKMKKKTEEAEK 988
+EAE+ K++ EE +K
Sbjct: 460 EEAEEEKEEEEEKKK 474
Score = 55.3 bits (134), Expect = 9e-08
Identities = 26/71 (36%), Positives = 48/71 (67%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
KK KK EK KK+ EK +KKKK KK+ E+ K+KK+++K++++++ +++K+
Sbjct: 408 ATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467
Query: 973 KKEAEKMKKKT 983
++E +K K+ T
Sbjct: 468 EEEEKKKKQAT 478
Score = 55.3 bits (134), Expect = 1e-07
Identities = 26/71 (36%), Positives = 50/71 (70%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
+K K +KI +K +KK+E EK KKK A KKK+++ E+ K+K E+ +++++ ++
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Query: 961 KKKKKKKKKKK 971
+K+++++KKKK
Sbjct: 465 EKEEEEEKKKK 475
Score = 54.5 bits (132), Expect = 2e-07
Identities = 29/131 (22%), Positives = 61/131 (46%), Gaps = 35/131 (26%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKK--------------------KKKKKEAEKM-------- 924
+L + K R + A+KI +K K + A ++
Sbjct: 337 LLSKTKEKRETRDSIAKKIAEKLHTSKRKVRREVLPFLSIIFKHNPELAARLAAFLELTE 396
Query: 925 -------RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
KK +K+KK + AEK +++ ++ KK KKK+++++++K+KK++E E
Sbjct: 397 EEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456
Query: 978 KMKKKTEEAEK 988
+ +++ EE ++
Sbjct: 457 EEEEEAEEEKE 467
Score = 54.2 bits (131), Expect = 2e-07
Identities = 21/73 (28%), Positives = 46/73 (63%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
T KK K +K +KK E+ ++KK KKK++ E+ +K+ E+ ++++++
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Query: 960 KKKKKKKKKKKKK 972
+++K+++++KKKK
Sbjct: 463 EEEKEEEEEKKKK 475
Score = 51.8 bits (125), Expect = 1e-06
Identities = 19/72 (26%), Positives = 47/72 (65%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
+ KK + +KK+++E ++ +KK A K K+++++ EK KK+ ++ +++ +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 958 KKKKKKKKKKKK 969
++K+++++KKKK
Sbjct: 464 EEKEEEEEKKKK 475
Score = 51.8 bits (125), Expect = 1e-06
Identities = 19/77 (24%), Positives = 49/77 (63%)
Query: 890 LGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
+ F+ KK T++ KK + +K++++KKE +K ++ +++++ +K ++K E
Sbjct: 399 IEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458
Query: 950 KIGKKKKKKKKKKKKKK 966
+ +++K+++++KKKK
Sbjct: 459 EEEAEEEKEEEEEKKKK 475
Score = 51.5 bits (124), Expect = 2e-06
Identities = 20/76 (26%), Positives = 51/76 (67%), Gaps = 1/76 (1%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
L K+ +K KK EK KK++++K+ +K + G +K ++++++ ++ K++ E+
Sbjct: 401 FLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKK-EEEEEEEKEKKEEEKEEEE 459
Query: 953 KKKKKKKKKKKKKKKK 968
++ +++K+++++KKKK
Sbjct: 460 EEAEEEKEEEEEKKKK 475
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 52.5 bits (126), Expect = 7e-08
Identities = 29/64 (45%), Positives = 38/64 (59%), Gaps = 2/64 (3%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
+K++KE K ++K A K KK+KK EK +KK AEK+ KKK K K KK KK K
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKK--EKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTK 153
Query: 975 EAEK 978
+ E
Sbjct: 154 KEEG 157
Score = 50.1 bits (120), Expect = 5e-07
Identities = 26/64 (40%), Positives = 38/64 (59%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
+EK +K+ K+ E ++ K K +K KK+KK AEK+ KK + K K KK KK KK
Sbjct: 95 SEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKK 154
Query: 968 KKKK 971
++ K
Sbjct: 155 EEGK 158
Score = 44.8 bits (106), Expect = 4e-05
Identities = 19/50 (38%), Positives = 32/50 (64%), Gaps = 4/50 (8%)
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++EK +K+ + K+ ++K+ K KK+KK+KK+KK AEK+ KK
Sbjct: 94 ESEKEQKEVS----KETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTT 139
Score = 43.2 bits (102), Expect = 1e-04
Identities = 22/68 (32%), Positives = 35/68 (51%), Gaps = 2/68 (2%)
Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
E + +K+ +++ ++K+ K KK+ EK KK +K KKK K K KK +K
Sbjct: 93 EESEKEQKEVSKETEEKEA--IKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKT 150
Query: 980 KKKTEEAE 987
K EE +
Sbjct: 151 TTKKEEGK 158
Score = 42.4 bits (100), Expect = 2e-04
Identities = 19/58 (32%), Positives = 32/58 (55%), Gaps = 1/58 (1%)
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E+ +K++K+ K ++ E I KK+KK+KK+KK +K KKK + +K
Sbjct: 93 EESEKEQKEVSKETEE-KEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKK 149
Score = 37.0 bits (86), Expect = 0.015
Identities = 19/57 (33%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKK--KKKKKKKKKKKKEAEKMKKKTEEAEK 988
K++K+ +++ ++K A K K+KK+KK+KK +K KKK K + K + K
Sbjct: 97 KEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTK 153
Score = 37.0 bits (86), Expect = 0.017
Identities = 19/62 (30%), Positives = 32/62 (51%), Gaps = 3/62 (4%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK---KKKKKKKKKKKKKEAEKMKKKTEEA 986
+ ++K+ E +K+ + +KK+KK+KK K KKK K K +K KKT
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTK 153
Query: 987 EK 988
++
Sbjct: 154 KE 155
Score = 35.9 bits (83), Expect = 0.039
Identities = 21/54 (38%), Positives = 29/54 (53%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
K T K+ A+K +K+KK+KK AEK+ KKK + K K A K E+
Sbjct: 103 SKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEE 156
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 51.9 bits (125), Expect = 7e-08
Identities = 24/74 (32%), Positives = 45/74 (60%), Gaps = 8/74 (10%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
KK K+ K+ ++KE +K RK+ G KK++ A + E+ ++ +K+K+ +K
Sbjct: 65 KKHNAKVEKELLREKEKKKKRKRPG-----KKRRIA---LRLRRERTKERAEKEKRTRKN 116
Query: 965 KKKKKKKKKKEAEK 978
++KK K+++KE EK
Sbjct: 117 REKKFKRRQKEKEK 130
Score = 43.8 bits (104), Expect = 4e-05
Identities = 15/57 (26%), Positives = 34/57 (59%), Gaps = 1/57 (1%)
Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
KK K++K+ ++ KKK + GKK++ + ++++ K++ +K+K K ++K
Sbjct: 65 KKHNAKVEKELLREKEKKKKRK-RPGKKRRIALRLRRERTKERAEKEKRTRKNREKK 120
Score = 43.0 bits (102), Expect = 7e-05
Identities = 21/71 (29%), Positives = 42/71 (59%), Gaps = 5/71 (7%)
Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
KK K+ K+ EK KKKK+ K+ G ++ + ++++ K++ +K+K+ +K +
Sbjct: 65 KKHNAKVEKELLREKEKKKKR-----KRPGKKRRIALRLRRERTKERAEKEKRTRKNREK 119
Query: 978 KMKKKTEEAEK 988
K K++ +E EK
Sbjct: 120 KFKRRQKEKEK 130
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 54.8 bits (132), Expect = 1e-07
Identities = 32/98 (32%), Positives = 37/98 (37%), Gaps = 7/98 (7%)
Query: 897 KKRTRRKKKKGAEKIRKKKKK--KKEAEKMRKKKGAEKMKKK-----KKDAEKMKKKGAE 949
KK KKK + KK KK+AE K A + KKK KK A KK A
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAA 220
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K K + K +K K AEK AE
Sbjct: 221 AEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAE 258
Score = 52.1 bits (125), Expect = 8e-07
Identities = 28/94 (29%), Positives = 46/94 (48%), Gaps = 1/94 (1%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+++ +++KK AE+ K+ K++ + K A K K + K A+K +
Sbjct: 108 KERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEA 167
Query: 956 KKKKKKKKKKKKKKKKKKK-EAEKMKKKTEEAEK 988
KKK + + KK + KKK EAE K EA+K
Sbjct: 168 KKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKK 201
Score = 50.2 bits (120), Expect = 3e-06
Identities = 31/86 (36%), Positives = 39/86 (45%), Gaps = 7/86 (8%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK-----KKGAEKIGKKKKKKKKKK 962
A+K + KKK EAE KK A + KKK + K KK AE KKK + KK
Sbjct: 160 AKKAAAEAKKKAEAEA--AKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKK 217
Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + K K A + K E+A
Sbjct: 218 KAAAEAKAAAAKAAAEAKAAAEKAAA 243
Score = 49.4 bits (118), Expect = 6e-06
Identities = 36/111 (32%), Positives = 47/111 (42%), Gaps = 19/111 (17%)
Query: 897 KKRTRRKKKKGAEKIRKKK----KKKKEAEKMR----KKKGAEKMKKK------KKDAEK 942
+ K+K AE+ K K K EAE R KK A + KKK KK A +
Sbjct: 123 AAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE 182
Query: 943 MKKKGAEKIGKK-----KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KKK + K KKK + + KKK + KKK AE + A +
Sbjct: 183 AKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAE 233
Score = 49.0 bits (117), Expect = 8e-06
Identities = 27/91 (29%), Positives = 42/91 (46%), Gaps = 4/91 (4%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMR----KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
R K+ E++ +++KK+ E + K+K AE+ K A K K + K K
Sbjct: 102 RLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAK 161
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + KKK + + KK A + KKK E
Sbjct: 162 KAAAEAKKKAEAEAAKKAAAEAKKKAEAEAA 192
Score = 47.5 bits (113), Expect = 2e-05
Identities = 25/95 (26%), Positives = 38/95 (40%), Gaps = 3/95 (3%)
Query: 897 KKRTRRKKKKGAEKIRKKKK---KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
KR KK A + +KK + KK A + +KK AE K +A+K + A+K
Sbjct: 153 AKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAA 212
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ KKK + K K + +K
Sbjct: 213 AEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAA 247
Score = 47.5 bits (113), Expect = 2e-05
Identities = 32/95 (33%), Positives = 41/95 (43%), Gaps = 9/95 (9%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK-----KKKDAEKMKKKGAEKIGKKKKK 957
K K AE R KK A + +KK AE KK KKK + K A + KK +
Sbjct: 146 KAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEA 205
Query: 958 KKKKKKKKKKKKK----KKKKEAEKMKKKTEEAEK 988
+ KKK + KKK K A+ + AEK
Sbjct: 206 EAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEK 240
Score = 44.4 bits (105), Expect = 2e-04
Identities = 31/93 (33%), Positives = 47/93 (50%), Gaps = 7/93 (7%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAE----KMKKKKKDAEKMKKKGAEKIGKKKKKK 958
++K+ AE+ R K+ +K+ +KK AE + K+K AE+ K A K + +
Sbjct: 93 QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAE 152
Query: 959 KKK---KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+ KK + KKK EAE KK EA+K
Sbjct: 153 AKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKK 185
Score = 44.0 bits (104), Expect = 3e-04
Identities = 32/99 (32%), Positives = 54/99 (54%), Gaps = 12/99 (12%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKK-KGAEKMKK--KKKDAEKMKKKGAEKIGKKKKKK 958
+K K AE+ RKKK++++ E +K+ E++K+ K++ A + +KK AE+ K+ K
Sbjct: 71 QKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALK 130
Query: 959 KKKKKKKKKKKK---KKKKEAEKM------KKKTEEAEK 988
+K+ ++ K K K EAE KK EA+K
Sbjct: 131 QKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK 169
Score = 30.5 bits (69), Expect = 4.5
Identities = 16/61 (26%), Positives = 22/61 (36%), Gaps = 4/61 (6%)
Query: 896 RKKRTRRKKKKGAEKIRKK----KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
+KK KKK A + +KK K K AEK K + K A ++
Sbjct: 200 KKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259
Query: 952 G 952
Sbjct: 260 D 260
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 55.0 bits (133), Expect = 1e-07
Identities = 17/91 (18%), Positives = 35/91 (38%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
+++ K E+I++ + KK+ + + K + + +K K K
Sbjct: 25 AKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAK 84
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K KKK K + ++K +KK +
Sbjct: 85 AAAAKAPAKKKLKDELDSSKKAEKKNALDKD 115
Score = 53.1 bits (128), Expect = 5e-07
Identities = 20/95 (21%), Positives = 36/95 (37%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
L ++ ++K KK A K + K KE K + + ++ + + +
Sbjct: 10 LAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDA 69
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ KKK K K K A+K K ++ K
Sbjct: 70 TESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK 104
Score = 50.4 bits (121), Expect = 3e-06
Identities = 20/109 (18%), Positives = 38/109 (34%)
Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
++ +++ + T+ + K+ E +K ++ + + + D
Sbjct: 14 EEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESD 73
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K K A K K KKK K + KK +K+ K K
Sbjct: 74 IPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVK 122
Score = 47.3 bits (113), Expect = 3e-05
Identities = 18/96 (18%), Positives = 34/96 (35%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
L KK+T + + + K +A + K K K A+ KK +
Sbjct: 41 ALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDEL 100
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK +KK K K+ + + + ++ +
Sbjct: 101 DSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDD 136
Score = 47.3 bits (113), Expect = 3e-05
Identities = 15/92 (16%), Positives = 28/92 (30%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K+ + + + + + M K K K K A KK
Sbjct: 35 KEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKK 94
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K + KK +KK + + ++ +
Sbjct: 95 KLKDELDSSKKAEKKNALDKDDDLNYVKDIDV 126
Score = 45.0 bits (107), Expect = 2e-04
Identities = 20/93 (21%), Positives = 34/93 (36%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ + KK +K+ K K K K K+ E KK + +++ + +
Sbjct: 8 AELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTD 67
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ KKK K K A+ KK + E
Sbjct: 68 DATESDIPKKKTKTAAKAAAAKAPAKKKLKDEL 100
Score = 40.7 bits (96), Expect = 0.003
Identities = 21/96 (21%), Positives = 32/96 (33%), Gaps = 20/96 (20%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK--------------- 957
K + +E K + KK A K K K ++ K+ E K ++
Sbjct: 7 KAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDT 66
Query: 958 -----KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KKK K K +A KK +E +
Sbjct: 67 DDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102
Score = 38.4 bits (90), Expect = 0.017
Identities = 14/59 (23%), Positives = 24/59 (40%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + E+ KKK + K K K K++ K+ + KKK E++ +
Sbjct: 3 TASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSG 61
Score = 38.4 bits (90), Expect = 0.018
Identities = 17/123 (13%), Positives = 34/123 (27%), Gaps = 3/123 (2%)
Query: 866 ATKRLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMR 925
TK + A+ + + K + +K K K A+
Sbjct: 33 ITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPA 92
Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
KKK +++ KK KK +K K + + + ++
Sbjct: 93 KKKLKDELDSSKKAE---KKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDD 149
Query: 986 AEK 988
+
Sbjct: 150 DDD 152
Score = 38.4 bits (90), Expect = 0.020
Identities = 15/123 (12%), Positives = 29/123 (23%)
Query: 866 ATKRLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMR 925
K + + + KK+T+ K A K KKK K E + +
Sbjct: 45 KKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK 104
Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K + + K + E + ++
Sbjct: 105 KAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDD 164
Query: 986 AEK 988
+
Sbjct: 165 VDD 167
Score = 35.7 bits (83), Expect = 0.11
Identities = 18/52 (34%), Positives = 24/52 (46%)
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K + E+ KK KK K K K K++ KEA + KKKT E
Sbjct: 2 TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQID 53
Score = 35.7 bits (83), Expect = 0.13
Identities = 17/71 (23%), Positives = 27/71 (38%)
Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
+ K E KK KK A K K KG + K+ + KKK ++ + +
Sbjct: 2 TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSG 61
Query: 978 KMKKKTEEAEK 988
+K + E
Sbjct: 62 MVKDTDDATES 72
Score = 31.1 bits (71), Expect = 3.3
Identities = 14/56 (25%), Positives = 24/56 (42%)
Query: 933 MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
M AE ++ A+K KK K K K K++ K+ E++K + +
Sbjct: 1 MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVL 56
Score = 30.3 bits (69), Expect = 5.2
Identities = 11/69 (15%), Positives = 26/69 (37%)
Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
K + A + + KKK + K ++ K++ K+ + KKK ++ +
Sbjct: 1 MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLS 60
Query: 980 KKKTEEAEK 988
+ +
Sbjct: 61 GMVKDTDDA 69
Score = 30.0 bits (68), Expect = 7.3
Identities = 14/99 (14%), Positives = 36/99 (36%), Gaps = 8/99 (8%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK--------MKKKGA 948
K +KK K KK +KK +K + + + + +
Sbjct: 88 AKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDI 147
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+ + + + ++KKEA++++K +++ +
Sbjct: 148 DDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDD 186
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 54.4 bits (131), Expect = 3e-07
Identities = 31/110 (28%), Positives = 47/110 (42%), Gaps = 7/110 (6%)
Query: 881 DVRREAVT----ALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK 936
+VR EAV + F L +R R K A R+K KK +K K++ K
Sbjct: 707 EVRVEAVNMDERKIDFSLISSERAPRNVGKTA---REKAKKGDAGKKGGKRRQVGKKVNF 763
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
+ D+ +K A+ KK +K KK K +K +A++ KK
Sbjct: 764 EPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813
Score = 37.0 bits (86), Expect = 0.050
Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 14/73 (19%)
Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK---------KKKKKKKKKKKKKE 975
R + K + EK KK A K G K+++ KK + +KK K K KK+
Sbjct: 729 RAPRNVGKTAR-----EKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKD 783
Query: 976 AEKMKKKTEEAEK 988
A K KK + + +K
Sbjct: 784 ARKAKKPSAKTQK 796
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 53.8 bits (129), Expect = 3e-07
Identities = 32/98 (32%), Positives = 55/98 (56%), Gaps = 5/98 (5%)
Query: 896 RKKRTRR-----KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
RKK+ + K K+ AE+ R K+ +K+ + ++K AE+ +K+ + +K +++ A K
Sbjct: 81 RKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARK 140
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++KKK + K K + K K A + KKK EEA K
Sbjct: 141 AAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAK 178
Score = 52.3 bits (125), Expect = 8e-07
Identities = 26/93 (27%), Positives = 42/93 (45%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ K KKK E + ++ K +AE KK AE K + K + + K KK
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKA 221
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ ++K +KKK K +A+K + AE+
Sbjct: 222 EAAAEEKAAAEKKKAAAKAKADKAAAAAKAAER 254
Score = 51.5 bits (123), Expect = 1e-06
Identities = 29/102 (28%), Positives = 44/102 (43%), Gaps = 9/102 (8%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMR------KKKGAEKMKKKKKDAE---KMKKK 946
+ + + K A K++ + KK+AE+ K K KKK +AE +K
Sbjct: 148 KAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKA 207
Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
AE K K +KK + ++K +KKK A K K A
Sbjct: 208 KAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAA 249
Score = 48.8 bits (116), Expect = 9e-06
Identities = 29/105 (27%), Positives = 49/105 (46%), Gaps = 13/105 (12%)
Query: 897 KKRTRRKKKKGAEKIRK---KKKKKKEAEKMRKKKGAEKMK----------KKKKDAEKM 943
+K+ + ++K+ E+ RK ++KKK EA K + A K+K + K AE+
Sbjct: 124 EKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEA 183
Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K KKK + + K +K K + + K + K + EK
Sbjct: 184 KAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEK 228
Score = 48.4 bits (115), Expect = 1e-05
Identities = 29/92 (31%), Positives = 43/92 (46%), Gaps = 5/92 (5%)
Query: 898 KRTRRKKKKGAEKIRKK---KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
K+ + K AE+ + K KK+AE K AEK K + + K +KK AE ++
Sbjct: 170 KKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAA-AEKAKAEAEAKAKAEKK-AEAAAEE 227
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
K +KKK K K K A+ ++K A
Sbjct: 228 KAAAEKKKAAAKAKADKAAAAAKAAERKAAAA 259
Score = 46.5 bits (110), Expect = 4e-05
Identities = 26/93 (27%), Positives = 41/93 (44%), Gaps = 1/93 (1%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+KK+ K K A + K K EA+K ++ + K K KK AE K
Sbjct: 145 QKKKAEAAKAKAAAE-AAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAA 203
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K K + + K K +KK + A + K E+ +
Sbjct: 204 AEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKA 236
Score = 44.9 bits (106), Expect = 1e-04
Identities = 24/87 (27%), Positives = 44/87 (50%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
++K+ E ++ + ++K+ E+ +K AE+ KK + K + A+ + KKK +
Sbjct: 115 EQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAE 174
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K ++ K K EA KKK E K
Sbjct: 175 EAAKAAEEAKAKAEAAAAKKKAEAEAK 201
Score = 44.6 bits (105), Expect = 2e-04
Identities = 25/104 (24%), Positives = 52/104 (50%), Gaps = 11/104 (10%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK-----KKGAEK 950
++K+ +K+ + ++++++ ++A +KKK K +A K+K KK AE+
Sbjct: 116 QQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEE 175
Query: 951 IGKKKKKKKKK------KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K ++ K K KKK + + K ++A+ + +AEK
Sbjct: 176 AAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEK 219
Score = 43.0 bits (101), Expect = 5e-04
Identities = 27/98 (27%), Positives = 54/98 (55%), Gaps = 5/98 (5%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK-KKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
+K +RKKK+ K K+ E E++++ +K K ++++K AE+ +K+ + ++
Sbjct: 75 KKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQ 134
Query: 955 KKKKKK----KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++ +K +KKK + K K EA K+K E +K
Sbjct: 135 EEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKK 172
Score = 41.1 bits (96), Expect = 0.002
Identities = 24/93 (25%), Positives = 58/93 (62%), Gaps = 3/93 (3%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
R + + KKG ++ +KKK+++ AE+++ K+ AE+ + K+ + E++K + +K ++
Sbjct: 66 RIQSQQSSAKKGEQQ--RKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQK-QAEE 122
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K+ + ++K+++++ +K AE+ KK K
Sbjct: 123 AEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAK 155
Score = 38.8 bits (90), Expect = 0.012
Identities = 18/64 (28%), Positives = 28/64 (43%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
KK+ + K AEK + + + K +AEK + EK +KK A K K
Sbjct: 192 AKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKA 251
Query: 956 KKKK 959
++K
Sbjct: 252 AERK 255
Score = 38.4 bits (89), Expect = 0.016
Identities = 18/80 (22%), Positives = 50/80 (62%), Gaps = 2/80 (2%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA--EKIGKKKKKKKKKKKKKKKK 968
+ ++ + + + KK ++ KK+++ AE++K K A ++ K+ +K++ K ++++K+
Sbjct: 60 VVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQ 119
Query: 969 KKKKKKEAEKMKKKTEEAEK 988
++ +K+A+ +K+ EE +
Sbjct: 120 AEEAEKQAQLEQKQQEEQAR 139
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 51.9 bits (125), Expect = 3e-07
Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 7/84 (8%)
Query: 905 KKGAE-KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
KK AE K RKK + K EK +KK AE + + + +E+ K K +K+ KK KKKKKK+K
Sbjct: 111 KKVAEAKARKKMRALKRLEKAKKK--AEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRK 168
Query: 964 KK----KKKKKKKKKEAEKMKKKT 983
K KK K K + +K K
Sbjct: 169 KTYVVAKKGNKGKAGRPKGVKGKY 192
Score = 43.4 bits (103), Expect = 2e-04
Identities = 30/94 (31%), Positives = 40/94 (42%), Gaps = 14/94 (14%)
Query: 902 RKKKKGAEKIRKKKKK---------KKEAEKMR-----KKKGAEKMKKKKKDAEKMKKKG 947
RKK + +++ K KKK E EK + KK +K KK+KK KK
Sbjct: 119 RKKMRALKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRKKTYVVAKKGN 178
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
K G+ K K K K + KK + K KK
Sbjct: 179 KGKAGRPKGVKGKYKVVDPRMKKDLRALKRKAKK 212
Score = 32.3 bits (74), Expect = 0.83
Identities = 23/68 (33%), Positives = 32/68 (47%), Gaps = 2/68 (2%)
Query: 900 TRRKKKKGAEKIRKK--KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
+ R+K K +K+ KK KKKKK + K K K + K K K + KK +
Sbjct: 145 SEREKAKQIKKLYKKAKKKKKKRKKTYVVAKKGNKGKAGRPKGVKGKYKVVDPRMKKDLR 204
Query: 958 KKKKKKKK 965
K+K KK
Sbjct: 205 ALKRKAKK 212
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 52.4 bits (126), Expect = 4e-07
Identities = 28/85 (32%), Positives = 51/85 (60%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
R+K + A K R K KK+ AE ++ E+ KK+ E KK K++++K
Sbjct: 139 RQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQK 198
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEA 986
++++++K++KK++E E+ K+K EEA
Sbjct: 199 REEERRKQRKKQQEEEERKQKAEEA 223
Score = 47.8 bits (114), Expect = 1e-05
Identities = 30/136 (22%), Positives = 64/136 (47%), Gaps = 29/136 (21%)
Query: 882 VRREAVTALGFILFRKKRTRRKK--KKGAEKIRKKKKKKKEAEKMRKKKGAEK----MKK 935
+ R+A A L K+ R+KK K EK +++++K++E ++R++ EK ++
Sbjct: 81 LERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQ 140
Query: 936 KKKDAEKMKK--------KGAEKIGKKKKKKKKK---------------KKKKKKKKKKK 972
K + A K + + A K ++ KK K++++K++
Sbjct: 141 KAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKRE 200
Query: 973 KKEAEKMKKKTEEAEK 988
++ ++ KK+ EE E+
Sbjct: 201 EERRKQRKKQQEEEER 216
Score = 38.9 bits (91), Expect = 0.008
Identities = 16/73 (21%), Positives = 40/73 (54%), Gaps = 7/73 (9%)
Query: 897 KKRTRRKKKKGAEKIRKK-----KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
+ ++ E+ +K+ KK K+ ++ R+++ ++ KK+++ E+ +K+ AE+
Sbjct: 166 SGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQE--EEERKQKAEEA 223
Query: 952 GKKKKKKKKKKKK 964
+K K K++ K
Sbjct: 224 WQKWMKNVKQRPK 236
Score = 38.6 bits (90), Expect = 0.010
Identities = 17/74 (22%), Positives = 42/74 (56%), Gaps = 1/74 (1%)
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK-KKKK 974
+ KE + R+ + A + K A++ KK K+K++++K++++ + +++ K+K
Sbjct: 74 RSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEK 133
Query: 975 EAEKMKKKTEEAEK 988
E ++K ++A K
Sbjct: 134 YEEWCRQKAQQAAK 147
Score = 37.0 bits (86), Expect = 0.034
Identities = 21/91 (23%), Positives = 51/91 (56%), Gaps = 11/91 (12%)
Query: 896 RKKRTRRKKKKGAEK-----IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
K+RT + KK+ AE K ++ +++ KK+ E KK K ++ +++
Sbjct: 146 AKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREE---- 201
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
+++K++KK+++++++K+K ++ + MK
Sbjct: 202 --ERRKQRKKQQEEEERKQKAEEAWQKWMKN 230
Score = 35.1 bits (81), Expect = 0.14
Identities = 19/80 (23%), Positives = 45/80 (56%), Gaps = 4/80 (5%)
Query: 913 KKKKKKKEA-EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK---K 968
K +++ +EA E K A++ KK +K E+ +K+ EK ++ + +++ K+K + +
Sbjct: 80 KLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCR 139
Query: 969 KKKKKKEAEKMKKKTEEAEK 988
+K ++ ++ K +EA +
Sbjct: 140 QKAQQAAKQRTPKHKKEAAE 159
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 52.0 bits (125), Expect = 8e-07
Identities = 27/83 (32%), Positives = 52/83 (62%), Gaps = 7/83 (8%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK---KKGAEKI 951
K R K+K A+ R K+K++KE E+ K++ +++KKK ++K K+ A+K
Sbjct: 268 PINKPVRPKRKTKAQ--RNKEKRRKELEREAKEE--KQLKKKLAQLARLKEIAKEVAQKE 323
Query: 952 GKKKKKKKKKKKKKKKKKKKKKK 974
+ +KK+++K++ +KKK K++K
Sbjct: 324 KARARKKEQRKERGEKKKLKRRK 346
Score = 45.1 bits (107), Expect = 1e-04
Identities = 21/103 (20%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK---------------MKK 945
KK++ E++ +KK +K E R + +E + ++ D + +
Sbjct: 209 AEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEP 268
Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+K K ++ K+K++K+ +++ KE +++KKK + +
Sbjct: 269 INKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLAR 311
Score = 40.5 bits (95), Expect = 0.004
Identities = 19/83 (22%), Positives = 45/83 (54%), Gaps = 3/83 (3%)
Query: 907 GAEKIRKKK-KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
+ + K+K +A++ ++K+ K +++ EK KK ++ + K+ K+ +K+
Sbjct: 266 YEPINKPVRPKRKTKAQRNKEKR--RKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKE 323
Query: 966 KKKKKKKKKEAEKMKKKTEEAEK 988
K + +KK++ E+ +KK + K
Sbjct: 324 KARARKKEQRKERGEKKKLKRRK 346
Score = 33.5 bits (77), Expect = 0.59
Identities = 21/78 (26%), Positives = 36/78 (46%), Gaps = 12/78 (15%)
Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK---------KKKKKK 970
E E + K + K+K K +K+ E + K++K+ KKK K+ +
Sbjct: 265 EYEPINKPV---RPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQ 321
Query: 971 KKKKEAEKMKKKTEEAEK 988
K+K A K +++ E EK
Sbjct: 322 KEKARARKKEQRKERGEK 339
Score = 30.8 bits (70), Expect = 3.6
Identities = 32/120 (26%), Positives = 49/120 (40%), Gaps = 17/120 (14%)
Query: 872 HVAVSDVN---DDVRREAVTALGFILFRKKR-------TRRKKKKGAEKIRKKKKKK--- 918
++ ++DV ++ R E +T G +L K T + +K RKK KK
Sbjct: 11 NIDITDVEQFLEEQRDERIT--GGVLAEKPDEDLFVVDTTGDEATIKKKRRKKLKKPLKS 68
Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
E + R K A +KK+ K KK K +K+ + KK + KK K K
Sbjct: 69 DEILENRSKVPAPSVKKRHVK--KKSKKKKSKQVSRKELLRLKKLARGKKGGKAKLLNAS 126
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 50.8 bits (122), Expect = 1e-06
Identities = 24/74 (32%), Positives = 36/74 (48%), Gaps = 2/74 (2%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKK--KDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
K K E + + K K KK++ +D E + + +K K++ KKKK K+ K
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKG 268
Query: 968 KKKKKKKEAEKMKK 981
K KK A+ MKK
Sbjct: 269 VKALKKVVAKGMKK 282
Score = 48.1 bits (115), Expect = 1e-05
Identities = 21/74 (28%), Positives = 37/74 (50%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
+++ K + + K ++K K++ ++ + + AEK K K++ KKKK K+ K
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKG 268
Query: 969 KKKKKKEAEKMKKK 982
K KK K KK
Sbjct: 269 VKALKKVVAKGMKK 282
Score = 44.6 bits (106), Expect = 1e-04
Identities = 20/76 (26%), Positives = 32/76 (42%), Gaps = 3/76 (3%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
+ K E K K+ +K R+ + + + + + K+K E+I KKK K+
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRA---EKKRKSKEEIKKKKPKE 265
Query: 959 KKKKKKKKKKKKKKKK 974
K K KK K K
Sbjct: 266 SKGVKALKKVVAKGMK 281
Score = 42.7 bits (101), Expect = 5e-04
Identities = 24/99 (24%), Positives = 42/99 (42%), Gaps = 16/99 (16%)
Query: 875 VSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK 934
S + D+ +E + +L F+ K+ K +K++ ++ + +
Sbjct: 200 CSYLPPDLYKELLKSLLIPEFKPLDKYLKE----SKKKKRETEEDVEAAE------SRAE 249
Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
KK+K E++KKK K K+ K K KK K KK
Sbjct: 250 KKRKSKEEIKKK------KPKESKGVKALKKVVAKGMKK 282
Score = 35.0 bits (81), Expect = 0.16
Identities = 17/57 (29%), Positives = 34/57 (59%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K K+++K K++ E + + + +KK+K K++ KKKK KE++ +K + K
Sbjct: 222 PLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAK 278
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 51.3 bits (123), Expect = 2e-06
Identities = 31/92 (33%), Positives = 43/92 (46%), Gaps = 3/92 (3%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
+ + K +K KK+ K E +K + +K KK+KK+ K +G K+G K
Sbjct: 40 STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEP---KSEGETKLGFKTP 96
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK KK KKK K K + K E AEK
Sbjct: 97 KKSKKTKKKPPKPKPNEDVDNAFNKIAELAEK 128
Score = 50.5 bits (121), Expect = 3e-06
Identities = 25/84 (29%), Positives = 36/84 (42%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
K +++ K +K K +K + KK +EK KKK EK K + + K K
Sbjct: 35 SKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFK 94
Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
KK KK KK+ K K +
Sbjct: 95 TPKKSKKTKKKPPKPKPNEDVDNA 118
Score = 43.6 bits (103), Expect = 4e-04
Identities = 23/90 (25%), Positives = 38/90 (42%), Gaps = 6/90 (6%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI-GKKKKKK 958
+ K + + KE + + K K K +K KKK +
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTF-----SEEENKVATTSTKKDKKEDKNNESKKKSE 71
Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KKKKKKK+KK+ K + E + K ++++K
Sbjct: 72 KKKKKKKEKKEPKSEGETKLGFKTPKKSKK 101
Score = 39.7 bits (93), Expect = 0.006
Identities = 21/81 (25%), Positives = 31/81 (38%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
+K K + + K E+ K KK+ K + KKK +
Sbjct: 12 FSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSE 71
Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
KKKKKKKE ++ K + E
Sbjct: 72 KKKKKKKEKKEPKSEGETKLG 92
Score = 36.2 bits (84), Expect = 0.090
Identities = 19/72 (26%), Positives = 29/72 (40%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K KKK +K +KK+KK+ ++E K K KK + K K E +
Sbjct: 61 DKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFN 120
Query: 957 KKKKKKKKKKKK 968
K + +K
Sbjct: 121 KIAELAEKSNVY 132
Score = 34.3 bits (79), Expect = 0.28
Identities = 18/69 (26%), Positives = 28/69 (40%), Gaps = 3/69 (4%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+KK ++KKKK K +K+ K + E + K K KKK K + K
Sbjct: 67 KKKSEKKKKKK---KEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIA 123
Query: 956 KKKKKKKKK 964
+ +K
Sbjct: 124 ELAEKSNVY 132
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 49.7 bits (119), Expect = 2e-06
Identities = 28/109 (25%), Positives = 50/109 (45%), Gaps = 17/109 (15%)
Query: 894 LFRKKRTRRKKKK---GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
L R++R ++KK+ ++ KKKKKK K A + KKK +
Sbjct: 78 LQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSP 137
Query: 951 IGK--------------KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K ++ K+++ ++KK + K +K+KE +K K+ T+E
Sbjct: 138 RRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELTQE 186
Score = 49.3 bits (118), Expect = 3e-06
Identities = 30/115 (26%), Positives = 51/115 (44%), Gaps = 25/115 (21%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK------------- 942
R K+ +R K K ++ KKKKKK K A + KKK +
Sbjct: 83 RLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSS 142
Query: 943 ----MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE------AEKMK--KKTEE 985
++ K A K+++ ++KK + K +K+K+KK+ E++ K+TE
Sbjct: 143 RSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELTQEERLAEAKETER 197
Score = 46.2 bits (110), Expect = 3e-05
Identities = 24/102 (23%), Positives = 39/102 (38%), Gaps = 20/102 (19%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
+K ++ + KKKK+ K K+ KKKKKD K A KKK ++
Sbjct: 74 GEKELQREERLKKKKRVKTKAYKEP---TKKKKKKDPTAAKSPKAAAPRPKKKSERISWA 130
Query: 964 -----------------KKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K+ ++ KE E +KK + +
Sbjct: 131 PTLLDSPRRKSSRSSTVQNKEATHERLKEREIRRKKIQAKAR 172
Score = 42.0 bits (99), Expect = 6e-04
Identities = 17/66 (25%), Positives = 28/66 (42%)
Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
E E ++ + E++KKKK+ K K+ +K KK K K + KKK +
Sbjct: 71 EEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWA 130
Query: 980 KKKTEE 985
+
Sbjct: 131 PTLLDS 136
Score = 38.1 bits (89), Expect = 0.014
Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKK-KKKKKKKKKKEAEKMKKKTEEAEK 988
D E+ +K ++ + KKKK+ K K K+ KKKKKK+ K A +
Sbjct: 69 DDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPR 119
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 51.1 bits (123), Expect = 2e-06
Identities = 21/54 (38%), Positives = 39/54 (72%), Gaps = 1/54 (1%)
Query: 930 AEKMKKK-KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+++K++ K E++K+K + KK+++KK +K+KKKKK+KKK K+ +K +K
Sbjct: 361 GDELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 46.1 bits (110), Expect = 6e-05
Identities = 24/65 (36%), Positives = 40/65 (61%), Gaps = 11/65 (16%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
++++++ K+ E K+K + KKK++ EK +K +KKKKK+KKK KK+K
Sbjct: 361 GDELKEELNKRIE---EIKEKYPKPPKKKRE--EKKPQK------RKKKKKRKKKGKKRK 409
Query: 968 KKKKK 972
KK +K
Sbjct: 410 KKGRK 414
Score = 43.8 bits (104), Expect = 4e-04
Identities = 18/48 (37%), Positives = 30/48 (62%), Gaps = 1/48 (2%)
Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E++ K+ E I +K K KKK+++KK +K+KKK+ K K K + +
Sbjct: 366 EELNKRIEE-IKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412
Score = 40.3 bits (95), Expect = 0.004
Identities = 23/78 (29%), Positives = 34/78 (43%), Gaps = 32/78 (41%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
KR K+K + +KK+++KK +K+K
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKP--------------------------------QKRK 396
Query: 957 KKKKKKKKKKKKKKKKKK 974
KKKK+KKK KK+KKK +K
Sbjct: 397 KKKKRKKKGKKRKKKGRK 414
Score = 33.0 bits (76), Expect = 0.75
Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 1/32 (3%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK 927
+KKR +K +K +K +K+KKK K+ +K +K
Sbjct: 384 KKKREEKKPQKRKKK-KKRKKKGKKRKKKGRK 414
Score = 32.2 bits (74), Expect = 1.2
Identities = 19/43 (44%), Positives = 28/43 (65%), Gaps = 4/43 (9%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK 938
++K + KKK EK +K+KKKK+ RKKKG ++ KK +K
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKK----RKKKGKKRKKKGRK 414
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 48.0 bits (114), Expect = 2e-06
Identities = 31/93 (33%), Positives = 54/93 (58%), Gaps = 1/93 (1%)
Query: 896 RKKRTRRKKKKGAEK-IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
KK TR K K ++ + KK + KKE K+ +K+G +K+ + + K K KK
Sbjct: 12 GKKFTREYKVKEIQRNLTKKARLKKEYLKLLEKEGYAVPEKESAEKQVKSSKEDRKFEKK 71
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
KK +KK+ K++K+++++KE K +K+ E+ E
Sbjct: 72 KKLDEKKEIAKQRKREQREKELAKRQKELEKIE 104
Score = 40.6 bits (95), Expect = 7e-04
Identities = 28/97 (28%), Positives = 60/97 (61%), Gaps = 7/97 (7%)
Query: 894 LFRKKRTRRK-----KKKGAEKIRKKKKKK--KEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
L +K R +++ +K+G K+ +K K +++ RK + +K+ +KK+ A++ K++
Sbjct: 28 LTKKARLKKEYLKLLEKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKRE 87
Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
EK K++K+ +K + KKK+K++++ +K+ KKT
Sbjct: 88 QREKELAKRQKELEKIELSKKKQKERERRRKKLTKKT 124
Score = 40.6 bits (95), Expect = 9e-04
Identities = 24/75 (32%), Positives = 48/75 (64%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
K + K+ K +++ RK +KKKK EK K ++ +++K+ A++ K+ ++ KKK+K
Sbjct: 52 KESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQK 111
Query: 958 KKKKKKKKKKKKKKK 972
++++++KK KK K
Sbjct: 112 ERERRRKKLTKKTKS 126
Score = 37.9 bits (88), Expect = 0.006
Identities = 22/92 (23%), Positives = 51/92 (55%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
+ ++ KK + + K+ ++ +K R KK K+ +K+ A K+ +++ K+
Sbjct: 5 DQNQKKNGKKFTREYKVKEIQRNLTKKARLKKEYLKLLEKEGYAVPEKESAEKQVKSSKE 64
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K +KKKK +KK+ K+ ++ +++ E A++
Sbjct: 65 DRKFEKKKKLDEKKEIAKQRKREQREKELAKR 96
Score = 36.8 bits (85), Expect = 0.017
Identities = 24/89 (26%), Positives = 48/89 (53%), Gaps = 11/89 (12%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K +K + +K+ +KK+ K+ RK + +++K+ A++ K+ EKI KK
Sbjct: 60 KSSKEDRKFEKKKKLDEKKEIAKQ----RK-----REQREKELAKRQKEL--EKIELSKK 108
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K+K++++++KK KK K M + +
Sbjct: 109 KQKERERRRKKLTKKTKSGQPLMGPRIND 137
Score = 29.9 bits (67), Expect = 3.8
Identities = 15/50 (30%), Positives = 32/50 (64%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK 945
+K+ +++K++ EK K++K+ E ++ KKK E+ +++KK +K K
Sbjct: 77 KKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKKLTKKTKS 126
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 48.6 bits (116), Expect = 3e-06
Identities = 24/78 (30%), Positives = 34/78 (43%), Gaps = 7/78 (8%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
FR KR R E + K K K ++ K + K + K+ K K
Sbjct: 7 FRAKRVVRAL----ESEKYKANKDKGNPEIYNK---INSQDKAIEKFKLLIKAQMAERVK 59
Query: 955 KKKKKKKKKKKKKKKKKK 972
K ++KK++KKK KKKK
Sbjct: 60 KLHSQEKKEEKKKPKKKK 77
Score = 39.0 bits (91), Expect = 0.005
Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 7/69 (10%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
L +K K K E K + K EK K+ K + AE++KK +++ +
Sbjct: 16 LESEKYKANKDKGNPEIYNKINSQDKAIEK-------FKLLIKAQMAERVKKLHSQEKKE 68
Query: 954 KKKKKKKKK 962
+KKK KKKK
Sbjct: 69 EKKKPKKKK 77
Score = 32.5 bits (74), Expect = 0.86
Identities = 12/60 (20%), Positives = 27/60 (45%), Gaps = 1/60 (1%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK-TEEAEK 988
A+++ + + + K K + K +K K K + AE++KK ++E ++
Sbjct: 9 AKRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKE 68
Score = 32.1 bits (73), Expect = 0.95
Identities = 13/55 (23%), Positives = 24/55 (43%), Gaps = 3/55 (5%)
Query: 936 KKKD---AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K++D A+++ + + K K K + K + K E K+ K + AE
Sbjct: 2 KRQDFFRAKRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAE 56
Score = 31.3 bits (71), Expect = 1.7
Identities = 15/64 (23%), Positives = 28/64 (43%), Gaps = 1/64 (1%)
Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
R K+ ++ +K A K K E K + K +K K K + + +K+ + +
Sbjct: 8 RAKRVVRALESEKYKANKDKGN-PEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEK 66
Query: 985 EAEK 988
+ EK
Sbjct: 67 KEEK 70
Score = 31.3 bits (71), Expect = 1.8
Identities = 13/65 (20%), Positives = 28/65 (43%)
Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
+R + + K K ++ K + +K K K + ++ KK + +K +KK
Sbjct: 13 VRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKK 72
Query: 984 EEAEK 988
+ +K
Sbjct: 73 PKKKK 77
Score = 29.4 bits (66), Expect = 7.0
Identities = 13/22 (59%), Positives = 14/22 (63%)
Query: 955 KKKKKKKKKKKKKKKKKKKKEA 976
K K KK KK KKKKKKK +
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217
Score = 29.4 bits (66), Expect = 8.2
Identities = 13/23 (56%), Positives = 14/23 (60%)
Query: 952 GKKKKKKKKKKKKKKKKKKKKKK 974
K K KK KK KKKKKKK+
Sbjct: 195 YKGDKSSAKKDKKSKKKKKKKRS 217
Score = 29.0 bits (65), Expect = 9.3
Identities = 16/68 (23%), Positives = 29/68 (42%), Gaps = 3/68 (4%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
K+ + E + K +K + + + K I K K K + ++ KK ++K
Sbjct: 10 KRVVRALESEKYKANKDKGNPEIYNKINSQDK---AIEKFKLLIKAQMAERVKKLHSQEK 66
Query: 975 EAEKMKKK 982
+ EK K K
Sbjct: 67 KEEKKKPK 74
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 50.4 bits (120), Expect = 4e-06
Identities = 24/90 (26%), Positives = 43/90 (47%), Gaps = 5/90 (5%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
+ A+K R + ++K++ K K K+ K + +K+ EK + KK ++
Sbjct: 239 AQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEAL 298
Query: 964 KKKKKK----KKKKKEAEKM-KKKTEEAEK 988
K K K K++ K +EK + K EA+K
Sbjct: 299 KAKDHKAFDLKQESKASEKEAEDKELEAQK 328
Score = 42.3 bits (99), Expect = 0.001
Identities = 19/82 (23%), Positives = 36/82 (43%), Gaps = 1/82 (1%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM-KKKGAEKIGKKK 955
K+ +K+ EK + + KK E K A +K++ K +EK + K E K++
Sbjct: 272 DKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKRE 331
Query: 956 KKKKKKKKKKKKKKKKKKKEAE 977
+ +K K + + + E
Sbjct: 332 PVAEDLQKTKPQVEAQPTSLNE 353
Score = 41.9 bits (98), Expect = 0.001
Identities = 23/101 (22%), Positives = 52/101 (51%), Gaps = 9/101 (8%)
Query: 896 RKKRTRRKKKKGAEKIRKK--------KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
+++ R+K++ A+ + K K+ E +K +K ++KK ++A K K
Sbjct: 245 KQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHK 304
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKE-AEKMKKKTEEAE 987
A + ++ K +K+ + K+ + +KK++ AE ++K + E
Sbjct: 305 AFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVE 345
Score = 39.6 bits (92), Expect = 0.008
Identities = 24/115 (20%), Positives = 54/115 (46%), Gaps = 21/115 (18%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKM------------RKKKGAEKMKK----KKK 938
FR+ T K+++ E ++ ++ K+E +K + A+K + K++
Sbjct: 196 FRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQ 255
Query: 939 DAEKMKKKGAEKIGKKKKK----KKKKKKKKKKKKKKKKKEAEKMK-KKTEEAEK 988
+A+ + K K+ K+ +K++ +K + + KK +EA K K K + ++
Sbjct: 256 EAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQ 310
Score = 36.1 bits (83), Expect = 0.087
Identities = 18/85 (21%), Positives = 39/85 (45%), Gaps = 2/85 (2%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
K+ + R+ +K + E +K + A K K K K + K +EK + K+ + +KK
Sbjct: 272 DKQVAENQKREIEKAQIEIKK--NDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKK 329
Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAE 987
++ + +K K + + + +
Sbjct: 330 REPVAEDLQKTKPQVEAQPTSLNED 354
Score = 35.4 bits (81), Expect = 0.17
Identities = 15/78 (19%), Positives = 40/78 (51%), Gaps = 9/78 (11%)
Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA- 976
KK E +R+ EK ++D +K++ +++ K+ ++ K++ KK+ K +++A
Sbjct: 180 KKVVEALREDN--EKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKAD 237
Query: 977 ------EKMKKKTEEAEK 988
+K + + + ++
Sbjct: 238 FAQDNADKQRDEVRQKQQ 255
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 47.7 bits (114), Expect = 4e-06
Identities = 24/79 (30%), Positives = 41/79 (51%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ ++R K K ++ R+ K+ K+ EK K+G + KK + +K+ K KK
Sbjct: 89 QSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKS 148
Query: 956 KKKKKKKKKKKKKKKKKKK 974
K+ K +KK+KK K+K
Sbjct: 149 KQLDKALEKKRKKNAGKEK 167
Score = 47.7 bits (114), Expect = 4e-06
Identities = 26/77 (33%), Positives = 40/77 (51%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K R + K K E+ K+ KK+E E +++ K +KK + +KKK E K+
Sbjct: 92 KSRLKTLKNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQL 151
Query: 957 KKKKKKKKKKKKKKKKK 973
K +KK+KK K+KK
Sbjct: 152 DKALEKKRKKNAGKEKK 168
Score = 46.9 bits (112), Expect = 9e-06
Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 5/87 (5%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K+ + K + K K K +++E K KK+ E +K+ KK +KK +K+ KKK
Sbjct: 85 KRTLQSMKSR--LKTLKNKDREREILKEHKKQEKELIKEGKK-PYYLKKSEIKKLVLKKK 141
Query: 957 KKKKKKKKKKKK--KKKKKKEAEKMKK 981
+ KK K+ K +KK+KK A K KK
Sbjct: 142 FDELKKSKQLDKALEKKRKKNAGKEKK 168
Score = 46.5 bits (111), Expect = 1e-05
Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 6/89 (6%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRK-----KKGAEKMKKKKKDAEKMKK-KGAEKIGK 953
R K+ + EK KK K +E E++++ K + +K K ++ E +K+ K EK
Sbjct: 60 YREKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELI 119
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
K+ KK KK + KK KK+ +++KK
Sbjct: 120 KEGKKPYYLKKSEIKKLVLKKKFDELKKS 148
Score = 45.3 bits (108), Expect = 3e-05
Identities = 27/86 (31%), Positives = 47/86 (54%), Gaps = 2/86 (2%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
+K E++ K KK K++E+ + K + K + K K + E + + KK++K+ K+
Sbjct: 62 EKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKE 121
Query: 965 KKKKKKKKKKEAEKM--KKKTEEAEK 988
KK KK E +K+ KKK +E +K
Sbjct: 122 GKKPYYLKKSEIKKLVLKKKFDELKK 147
Score = 43.0 bits (102), Expect = 2e-04
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 11/109 (10%)
Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKK--KKKKKEAEKMRKKKGAEKMKKKK 937
DD R + + L K + K + E++++ K + K + E +K+ K
Sbjct: 58 DDYREKEIEEL-----EKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHK 112
Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKK----KKKKKKKKEAEKMKKK 982
K +++ K+G + KK + KK KKK KK K+ K EK +KK
Sbjct: 113 KQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKK 161
Score = 41.1 bits (97), Expect = 7e-04
Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 11/72 (15%)
Query: 896 RKKRTRRK-KKKGAEKIRKKKK----KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
R++ ++ KK+ E I++ KK KK E +K+ KK +++KK K+ + ++KK
Sbjct: 103 REREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKK---- 158
Query: 951 IGKKKKKKKKKK 962
+KK K+KK
Sbjct: 159 --RKKNAGKEKK 168
Score = 29.9 bits (68), Expect = 3.6
Identities = 22/100 (22%), Positives = 44/100 (44%), Gaps = 14/100 (14%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK--------KKKDAEKMKKKGA 948
KK RK+ + +K K + + + K +K +K ++K+ E+++K
Sbjct: 14 KKPVSRKRDIIEVEKKKPKVRDPRFDSLSGKLNLDKFRKNYKFLDDYREKEIEELEKA-- 71
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK K ++K++ K+ + K + +K K E E
Sbjct: 72 ----LKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREI 107
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 49.6 bits (119), Expect = 5e-06
Identities = 20/77 (25%), Positives = 39/77 (50%), Gaps = 1/77 (1%)
Query: 889 ALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
L + + RK K E+ +K K E E+ + + ++ KKK++ K+ K
Sbjct: 247 KLANLKLSPE-VLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSP 305
Query: 949 EKIGKKKKKKKKKKKKK 965
E+ K ++K++KK+ +K
Sbjct: 306 EEQRKLEEKERKKQARK 322
Score = 46.9 bits (112), Expect = 3e-05
Identities = 19/80 (23%), Positives = 41/80 (51%), Gaps = 16/80 (20%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK- 968
K+ + +K +K R++ E+ K + E+ E+ +KK++KKK++++ K
Sbjct: 252 KLSPEVLRK--VDKTREE--EEEKILKAAEEER-----QEEAQEKKEEKKKEEREAKLAK 302
Query: 969 ------KKKKKKEAEKMKKK 982
+K ++KE +K +K
Sbjct: 303 LSPEEQRKLEEKERKKQARK 322
Score = 41.9 bits (99), Expect = 0.001
Identities = 13/55 (23%), Positives = 36/55 (65%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ + +K +++ EKI K +++++++ ++KK++KKK++ K+ K + E ++
Sbjct: 255 PEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQR 309
Score = 40.7 bits (96), Expect = 0.003
Identities = 17/63 (26%), Positives = 40/63 (63%)
Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
+ K+ K +++ E+ K AE+ +++ ++KK++KKK++++ K K + + ++K EE
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEE 313
Query: 986 AEK 988
E+
Sbjct: 314 KER 316
>gnl|CDD|190741 pfam03763, Remorin_C, Remorin, C-terminal region. Remorins are
plant-specific plasma membrane-associated proteins. In
tobacco remorin co-purifies with lipid rafts. Most
remorins have a variable, proline-rich C-half and a more
conserved N-half that is predicted to form coiled coils.
Consistent with this, circular dichroism studies have
demonstrated that much of the protein is alpha-helical.
Remorins exist in plasma membrane preparations as
oligomeric structures and form filaments in vitro. The
proteins can bind polyanions including the extracellular
matrix component oligogalacturonic acid (OGA). In vitro,
remorin in plasma membrane preparations is
phosphorylated (principally on threonine residues) in
the presence of OGA and thus co-purifies with a protein
kinases(s). The biological functions of remorins are
unknown but roles as components of the
membrane/cytoskeleton are possible.
Length = 112
Score = 46.2 bits (110), Expect = 5e-06
Identities = 25/84 (29%), Positives = 42/84 (50%), Gaps = 6/84 (7%)
Query: 909 EKIRKKKKKKKEA----EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
KI K ++++ E +K K ++KK ++ EK K + AEK+ K K KK
Sbjct: 19 AKINNKYQREEAKIQAWENKKKAKAEAELKKIERKLEKKKAEAAEKL--KNKLAAAHKKA 76
Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
++K+ + K E+ K E+A K
Sbjct: 77 EEKRATAEAKRGEEEAKAEEKAAK 100
Score = 36.5 bits (85), Expect = 0.010
Identities = 21/73 (28%), Positives = 34/73 (46%), Gaps = 4/73 (5%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
+K K + ++ K A + KKK K ++KK K +KKK + +K K K
Sbjct: 17 EKAKINNKYQREEAKIQAWENKKKAKAEAELKKIER----KLEKKKAEAAEKLKNKLAAA 72
Query: 973 KKEAEKMKKKTEE 985
K+AE+ + E
Sbjct: 73 HKKAEEKRATAEA 85
Score = 33.5 bits (77), Expect = 0.13
Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 4/50 (8%)
Query: 902 RKKKKGAEKIRKKKKK-KKEAEKMRK---KKGAEKMKKKKKDAEKMKKKG 947
+KK + AEK++ K K+AE+ R K E+ K ++ A K++ G
Sbjct: 56 KKKAEAAEKLKNKLAAAHKKAEEKRATAEAKRGEEEAKAEEKAAKIRATG 105
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 48.5 bits (116), Expect = 7e-06
Identities = 16/77 (20%), Positives = 30/77 (38%)
Query: 907 GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
E K + ++E K +KKK +K KK + + + + + + K
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225
Query: 967 KKKKKKKKEAEKMKKKT 983
K KKK+ + K +
Sbjct: 226 KLKKKRSIAPDNEKSEV 242
Score = 45.8 bits (109), Expect = 5e-05
Identities = 18/71 (25%), Positives = 30/71 (42%)
Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
++E E ++ + E+ KKKKK +K KK + + + EA+
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225
Query: 978 KMKKKTEEAEK 988
K+KKK A
Sbjct: 226 KLKKKRSIAPD 236
Score = 44.3 bits (105), Expect = 1e-04
Identities = 18/70 (25%), Positives = 30/70 (42%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
K + E E+ +KKK +K K KK +A + + + + KK KKK+
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKR 231
Query: 975 EAEKMKKKTE 984
+K+E
Sbjct: 232 SIAPDNEKSE 241
Score = 43.5 bits (103), Expect = 3e-04
Identities = 18/72 (25%), Positives = 29/72 (40%), Gaps = 2/72 (2%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
+ R ++++ +K +KKKKK K+ AE E G KK KK
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNA--TGSSAEATVSSAVPTELSSGAGQVGEAKKLKK 229
Query: 958 KKKKKKKKKKKK 969
K+ +K +
Sbjct: 230 KRSIAPDNEKSE 241
Score = 42.7 bits (101), Expect = 5e-04
Identities = 20/69 (28%), Positives = 31/69 (44%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
K + E+ +KKKKKKK+ K G+ GA ++G+ KK KKK+
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRS 232
Query: 963 KKKKKKKKK 971
+K +
Sbjct: 233 IAPDNEKSE 241
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 47.0 bits (112), Expect = 8e-06
Identities = 23/90 (25%), Positives = 57/90 (63%), Gaps = 13/90 (14%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKM----RKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
KK +K+ K EK + +++++EAE+ RKK ++ ++K++ E +++
Sbjct: 1 KKIGAKKRAKLEEK--QARRQQREAEEEEREERKKLEEKREGERKEEEELEEER------ 52
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+KKK+++++K+++++ +K+++E EK+K
Sbjct: 53 -EKKKEEEERKEREEQARKEQEEYEKLKSS 81
Score = 42.8 bits (101), Expect = 2e-04
Identities = 19/75 (25%), Positives = 50/75 (66%), Gaps = 2/75 (2%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
KK+ K EK +++ E +++++ E+ K + + +K++++ +++++KKK+++++K
Sbjct: 5 AKKRAKLEEKQARRQQREAEEEERE--ERKKLEEKREGERKEEEELEEEREKKKEEEERK 62
Query: 974 KEAEKMKKKTEEAEK 988
+ E+ +K+ EE EK
Sbjct: 63 EREEQARKEQEEYEK 77
Score = 39.7 bits (93), Expect = 0.003
Identities = 19/75 (25%), Positives = 54/75 (72%), Gaps = 5/75 (6%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
KK K+ K+ +K+ ++++++AE+ +++ +K+ ++K++ ++K++++ +++++KK
Sbjct: 1 KKIGAKKRAKLEEKQA----RRQQREAEEEEREERKKL-EEKREGERKEEEELEEEREKK 55
Query: 974 KEAEKMKKKTEEAEK 988
KE E+ K++ E+A K
Sbjct: 56 KEEEERKEREEQARK 70
Score = 32.7 bits (75), Expect = 0.54
Identities = 16/64 (25%), Positives = 38/64 (59%), Gaps = 2/64 (3%)
Query: 883 RREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK 942
+REA + + R ++K E++ ++++KKKE E+ +++ E+ +K++++ EK
Sbjct: 20 QREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKER--EEQARKEQEEYEK 77
Query: 943 MKKK 946
+K
Sbjct: 78 LKSS 81
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 47.6 bits (113), Expect = 2e-05
Identities = 26/82 (31%), Positives = 36/82 (43%)
Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
K K + ++KK A+K K K KK K K A K+ K K K KK
Sbjct: 5 KKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVT 64
Query: 966 KKKKKKKKKEAEKMKKKTEEAE 987
K +K + + E + KKT + E
Sbjct: 65 VKFEKTESVKKESVAKKTVKKE 86
Score = 46.5 bits (110), Expect = 3e-05
Identities = 27/93 (29%), Positives = 39/93 (41%), Gaps = 1/93 (1%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
KK T++K + + KK KE K +K K KK A+ K K + KK
Sbjct: 5 KKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVT 64
Query: 957 KKKKKKKKKKKKKKKKKKEA-EKMKKKTEEAEK 988
K +K + KK+ KK E + + EA
Sbjct: 65 VKFEKTESVKKESVAKKTVKKEAVSAEVFEASN 97
Score = 44.9 bits (106), Expect = 1e-04
Identities = 24/88 (27%), Positives = 35/88 (39%), Gaps = 1/88 (1%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
+++ KK K K KK + + K A K+KK K + KK K K +
Sbjct: 14 EEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKS-VKTTTKKVTVKFEKTES 72
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
KK+ KK KK+ E + K
Sbjct: 73 VKKESVAKKTVKKEAVSAEVFEASNKLF 100
Score = 41.1 bits (96), Expect = 0.002
Identities = 28/101 (27%), Positives = 42/101 (41%), Gaps = 10/101 (9%)
Query: 897 KKRTRRKKKKGAEKIRKKK-------KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
KK +K E + KK K K+A K++K K + K KK + K + +
Sbjct: 16 KKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTK-SVKTTTKKVTVKFEKTESVK 74
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKK--KEAEKMKKKTEEAEK 988
K KK KK+ + + K K K+ KK +EK
Sbjct: 75 KESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLFASEK 115
Score = 40.3 bits (94), Expect = 0.003
Identities = 21/82 (25%), Positives = 32/82 (39%), Gaps = 3/82 (3%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
KK KK+ + +K + K+ K K KK K KK K K
Sbjct: 1 MAETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60
Query: 970 KK---KKKEAEKMKKKTEEAEK 988
KK K ++ E +KK++ +
Sbjct: 61 KKVTVKFEKTESVKKESVAKKT 82
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 48.8 bits (116), Expect = 2e-05
Identities = 25/100 (25%), Positives = 49/100 (49%), Gaps = 1/100 (1%)
Query: 890 LGFILFRKKRTRRKKKKGAEKIRKKKKK-KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
L R + + E+I K++ +KE E + + K ++K+K ++ + K
Sbjct: 233 LKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLL 292
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K ++ K + K +++K ++K KE+EK KK E+ K
Sbjct: 293 AKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELK 332
Score = 46.9 bits (111), Expect = 6e-05
Identities = 27/93 (29%), Positives = 52/93 (55%), Gaps = 2/93 (2%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ ++++ +K E + + K+ KE EK +K + E+ K E+ K K+ ++K
Sbjct: 253 EIESSKQELEKEEEILAQVLKENKEEEKEKKLQ--EEELKLLAKEEEELKSELLKLERRK 310
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++K K+ +K+ KK +KE +K K++ EE EK
Sbjct: 311 VDDEEKLKESEKELKKLEKELKKEKEEIEELEK 343
Score = 45.3 bits (107), Expect = 2e-04
Identities = 28/96 (29%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
IL + + ++++K +K+++++ K E+ K K++++K D E+ K K +EK
Sbjct: 267 ILAQVLKENKEEEK-EKKLQEEELKLLAKEEEELKSELLKLERRKVDDEE-KLKESEKEL 324
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK +K+ KK+K++ ++ +K+ KE E ++ EE E+
Sbjct: 325 KKLEKELKKEKEEIEELEKELKELEIKREAEEEEEE 360
Score = 43.8 bits (103), Expect = 5e-04
Identities = 21/96 (21%), Positives = 53/96 (55%), Gaps = 5/96 (5%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK-----KKKDAEKMKKKGAEKI 951
K+ K+++ ++ K+ K++++ +K+++++ K+ + + +K E+
Sbjct: 257 SKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEK 316
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K+ +K+ KK +K+ KK+K++ +E EK K+ E
Sbjct: 317 LKESEKELKKLEKELKKEKEEIEELEKELKELEIKR 352
Score = 41.1 bits (96), Expect = 0.003
Identities = 24/104 (23%), Positives = 49/104 (47%), Gaps = 8/104 (7%)
Query: 893 ILFRKKRTRRKKKKG--------AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK 944
KK ++K AE+ K +K K E EK K K E+ + ++ K +
Sbjct: 755 KSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEE 814
Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ E+ ++++K K+++ ++ + KE +K++K EE +
Sbjct: 815 AELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELE 858
Score = 41.1 bits (96), Expect = 0.003
Identities = 17/88 (19%), Positives = 46/88 (52%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
K +K + K++K K + E++R + K + + + E++ + EKI +++ ++
Sbjct: 781 EEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELA 840
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K+++K +K +E + ++ E+
Sbjct: 841 LELKEEQKLEKLAEEELERLEEEITKEE 868
Score = 41.1 bits (96), Expect = 0.003
Identities = 18/91 (19%), Positives = 53/91 (58%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
+ + + K E+++ ++K KE E+ +K +K +++++ +E K+ ++K +
Sbjct: 726 DKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTE 785
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K +++K++K K +++E ++++ +E +
Sbjct: 786 KLKVEEEKEEKLKAQEEELRALEEELKEEAE 816
Score = 40.3 bits (94), Expect = 0.006
Identities = 20/88 (22%), Positives = 46/88 (52%), Gaps = 4/88 (4%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
+K K E+ +K + KKE E+ K + + K K+ ++ EK +K E+ +K++K
Sbjct: 742 LEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEE----EKEEKL 797
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K ++++ + +++ + E + E+
Sbjct: 798 KAQEEELRALEEELKEEAELLEEEQLLI 825
Score = 40.0 bits (93), Expect = 0.007
Identities = 20/116 (17%), Positives = 49/116 (42%), Gaps = 8/116 (6%)
Query: 881 DVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEA--------EKMRKKKGAEK 932
++ KK EK K +EA + + +K ++
Sbjct: 691 AKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKE 750
Query: 933 MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++K +K +++ + K+K+ ++++K +K K ++++ EK+K + EE
Sbjct: 751 EEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRA 806
Score = 39.2 bits (91), Expect = 0.012
Identities = 19/112 (16%), Positives = 45/112 (40%), Gaps = 2/112 (1%)
Query: 878 VNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMR-KKKGAEKMKKK 936
+++ + L K + KK E + K+K + E E + ++
Sbjct: 182 ETENLAELIIDLEELKLQELKLKEQAKKA-LEYYQLKEKLELEEENLLYLDYLKLNEERI 240
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E ++ + E K++ +K+++ + K+ K++E EK ++ E
Sbjct: 241 DLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLL 292
Score = 38.8 bits (90), Expect = 0.016
Identities = 21/101 (20%), Positives = 41/101 (40%), Gaps = 1/101 (0%)
Query: 889 ALGFILFRKKRTRRKKK-KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
A G RKK+ R KK + E + + +E + K + K + K K +
Sbjct: 163 AAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLEL 222
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E+ K +++ ++ + E E+++ +E EK
Sbjct: 223 EEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEK 263
Score = 38.4 bits (89), Expect = 0.025
Identities = 22/94 (23%), Positives = 48/94 (51%), Gaps = 2/94 (2%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA-EKMKKKGAEKIGKK 954
R+K +K K +EK KK +K+ ++ + + EK K+ + E +++ + +
Sbjct: 308 RRKVDDEEKLKESEK-ELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQ 366
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K ++ +++ KKK + ++ + K K EE E
Sbjct: 367 EKLEQLEEELLAKKKLESERLSSAAKLKEEELEL 400
Score = 36.1 bits (83), Expect = 0.13
Identities = 11/68 (16%), Positives = 30/68 (44%), Gaps = 3/68 (4%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
E+ R K+ + E A ++K++ + K + +++KK+ ++ ++
Sbjct: 959 EEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDE---LKKERLEEEKKELLREIIEE 1015
Query: 969 KKKKKKEA 976
++ KE
Sbjct: 1016 TCQRFKEF 1023
Score = 34.2 bits (78), Expect = 0.39
Identities = 17/90 (18%), Positives = 50/90 (55%), Gaps = 3/90 (3%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK---KKKDAEKMKKKGAEKIGKKKKKK 958
+++ + EK K+ + K+EAE+ +++ + +K +++ KK +E++ K K
Sbjct: 335 KEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLK 394
Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++ + K +++K+ K E +++ + ++
Sbjct: 395 EEELELKNEEEKEAKLLLELSEQEEDLLKE 424
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 45.4 bits (108), Expect = 2e-05
Identities = 16/68 (23%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG-AEKIGKKKKKKKKKKKKKKKKKKKK 972
K +KK KK E+ ++++ + E++ ++ +++ +K+ K K++K+++ ++K+K
Sbjct: 85 KWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144
Query: 973 KKEAEKMK 980
+ E+MK
Sbjct: 145 EILKEQMK 152
Score = 41.9 bits (99), Expect = 3e-04
Identities = 17/70 (24%), Positives = 42/70 (60%), Gaps = 1/70 (1%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
++ +KK +K +K+K E+ ++ + + +++ E + K+ K K++K+++ ++K+
Sbjct: 84 LKWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQ 143
Query: 971 KKK-KEAEKM 979
K+ KE KM
Sbjct: 144 KEILKEQMKM 153
Score = 37.3 bits (87), Expect = 0.013
Identities = 18/77 (23%), Positives = 48/77 (62%), Gaps = 9/77 (11%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K R + +K G +K K++++++E E E++ ++++ E ++K+ A+ K++K
Sbjct: 85 KWRKKVRKLLGLDKKEKEEEEEEEVEV-------EELDEEEQIDELLEKELAKL--KREK 135
Query: 957 KKKKKKKKKKKKKKKKK 973
+++ ++K+K+ K++ K
Sbjct: 136 RRENERKQKEILKEQMK 152
Score = 31.5 bits (72), Expect = 0.96
Identities = 11/61 (18%), Positives = 36/61 (59%), Gaps = 6/61 (9%)
Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
+K+RK G +K +K++++ E+ ++ + ++++ + +K+ K K+++ + ++
Sbjct: 88 KKVRKLLGLDKKEKEEEEEEE------VEVEELDEEEQIDELLEKELAKLKREKRRENER 141
Query: 982 K 982
K
Sbjct: 142 K 142
Score = 30.3 bits (69), Expect = 2.7
Identities = 14/62 (22%), Positives = 30/62 (48%), Gaps = 20/62 (32%)
Query: 942 KMKKKGAEKIGKKKKKKKKKK--------------------KKKKKKKKKKKKEAEKMKK 981
K +KK + +G KK+K++++ K+ K K++K++E E+ +K
Sbjct: 85 KWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144
Query: 982 KT 983
+
Sbjct: 145 EI 146
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 43.9 bits (104), Expect = 2e-05
Identities = 21/46 (45%), Positives = 30/46 (65%)
Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K+K KG + KKKKKKKKKK K K++ +K+E EK +++ E
Sbjct: 9 KLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKE 54
Score = 42.0 bits (99), Expect = 7e-05
Identities = 20/35 (57%), Positives = 23/35 (65%)
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K KK KKKKKKKKKK K E++ + EE EK
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEK 45
Score = 41.6 bits (98), Expect = 9e-05
Identities = 19/35 (54%), Positives = 21/35 (60%)
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K KK KKKKKKKKKK K KE +K+ EE
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSS 47
Score = 40.9 bits (96), Expect = 2e-04
Identities = 19/36 (52%), Positives = 23/36 (63%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K KK KKKKKKKKKK K + E + +K EE +
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKS 46
Score = 40.1 bits (94), Expect = 3e-04
Identities = 14/44 (31%), Positives = 24/44 (54%)
Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + KKKKKKKKK K K++ +K++E + + + +
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGE 56
Score = 38.9 bits (91), Expect = 0.001
Identities = 21/80 (26%), Positives = 42/80 (52%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
K K ++KKKKKKK+ K +++ EK +++K AE K+G E +K ++++
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDG 70
Query: 963 KKKKKKKKKKKKEAEKMKKK 982
+ ++ ++ K K+
Sbjct: 71 MNLTEAERAFEEAQRKRLKE 90
Score = 34.7 bits (80), Expect = 0.030
Identities = 17/46 (36%), Positives = 27/46 (58%)
Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K+K K + KKKKKKKK K K++ +K+++E + +E E
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGE 56
Score = 33.5 bits (77), Expect = 0.071
Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 14/59 (23%)
Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K +KKKKK KKKKK K K++ +K++++K AE K+ EE
Sbjct: 13 KGKKIDVKKKKK--------------KKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEE 57
Score = 32.8 bits (75), Expect = 0.13
Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 5/81 (6%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
+ + K K K + ++KKK K KKKK +++ E+ K + K+ ++ +
Sbjct: 3 SNVVGGKLKLKGKKIDVKKKK---KKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDE 59
Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
+K ++E + M EAE+
Sbjct: 60 DNEKIEQEEDGMNLT--EAER 78
Score = 29.7 bits (67), Expect = 1.6
Identities = 18/73 (24%), Positives = 40/73 (54%), Gaps = 1/73 (1%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
++KKKK +K + K++ E E+ +K AE K+ ++ E +K E+ G + +
Sbjct: 19 VKKKKKKKKKKNKSKEEVVTEKEE-EEKSSAESDLKEGEEDEDNEKIEQEEDGMNLTEAE 77
Query: 960 KKKKKKKKKKKKK 972
+ ++ ++K+ K+
Sbjct: 78 RAFEEAQRKRLKE 90
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 44.8 bits (106), Expect = 2e-05
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 31/120 (25%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEK-------------MRKKKGAEKMKKKKKDAEKMK 944
K R K ++ I++KK KE + M+ K +K ++ EK K
Sbjct: 1 KSLRSKWRRKMRAIKRKKNAPKELARLKKALGLIDGNVLMKDVKDIATVKPAEEVKEKKK 60
Query: 945 KKGAEKI---------GKKKKKKKKKKKKK---------KKKKKKKKKEAEKMKKKTEEA 986
K+G E K + KK + + +++ KK KK+ EK KKK + A
Sbjct: 61 KEGTESEEDDEKMEVDNKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREKKKKKKKGA 120
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 47.9 bits (114), Expect = 2e-05
Identities = 23/92 (25%), Positives = 37/92 (40%), Gaps = 1/92 (1%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMR-KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
+R K + I + K K A+K + K K + K + K K
Sbjct: 158 ANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNI 217
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KKK K+KK+KK+ +K+++EE
Sbjct: 218 MSSFFKKKTKEKKEKKEASESTVKEESEEESG 249
Score = 45.6 bits (108), Expect = 9e-05
Identities = 16/77 (20%), Positives = 37/77 (48%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
+K K+KK++ E E ++ K ++ + AE G + + + + K ++
Sbjct: 224 KKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDS 283
Query: 972 KKKEAEKMKKKTEEAEK 988
+++ EK K+K + +K
Sbjct: 284 EEETEEKEKEKRKRLKK 300
Score = 45.6 bits (108), Expect = 1e-04
Identities = 22/87 (25%), Positives = 29/87 (33%), Gaps = 1/87 (1%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMR-KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
K A R K K K K A+K + K+ K +G + K+
Sbjct: 153 ALKPTANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPP 212
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK KKK EK +KK
Sbjct: 213 KKSNIMSSFFKKKTKEKKEKKEASEST 239
Score = 41.7 bits (98), Expect = 0.002
Identities = 24/97 (24%), Positives = 45/97 (46%), Gaps = 6/97 (6%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK----KGAEKMKKKKKDAEKMKKKGAEK 950
F KK+T+ KK+K K++ +E R + AE + + E K E+
Sbjct: 221 FFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGER 280
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+++ ++K+K+K+K+ KK E E ++ E
Sbjct: 281 --SDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVP 315
Score = 40.2 bits (94), Expect = 0.004
Identities = 16/92 (17%), Positives = 32/92 (34%), Gaps = 3/92 (3%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEK---MRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
K T +K +G ++ K+ +K M + +KK+K E +
Sbjct: 188 SKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEE 247
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K+ + + + + E E K + E
Sbjct: 248 SGKRDVILEDESAEPTGLDEDEDEDEPKPSGE 279
Score = 38.3 bits (89), Expect = 0.020
Identities = 19/92 (20%), Positives = 32/92 (34%), Gaps = 15/92 (16%)
Query: 902 RKKKKGAEKIRK-----KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K K + K K + K + K+ K K + KK K
Sbjct: 178 VKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPK-KSNIMSSFF---------KKKTK 227
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+KK+KK+ + K++ +E + E E
Sbjct: 228 EKKEKKEASESTVKEESEEESGKRDVILEDES 259
Score = 36.0 bits (83), Expect = 0.090
Identities = 15/80 (18%), Positives = 27/80 (33%), Gaps = 8/80 (10%)
Query: 897 KKRTRRKKKKGAEKIRKKKK-KKKEAEK-------MRKKKGAEKMKKKKKDAEKMKKKGA 948
+++ + R +++ KK+ K +K E + + + K K
Sbjct: 337 EEKEEVTVSPDGGRRRGRRRVMKKKTFKDEEGYLVTKKVYEWESFSEDEAEPPPTKPKPK 396
Query: 949 EKIGKKKKKKKKKKKKKKKK 968
KK K KKKK
Sbjct: 397 VSTPAVPAAAKKPKAPKKKK 416
Score = 32.1 bits (73), Expect = 1.3
Identities = 22/86 (25%), Positives = 35/86 (40%), Gaps = 15/86 (17%)
Query: 913 KKKKKKKEAEK---MRKKKGAEKMKKKK--KDAEK--MKKKGA--------EKIGKKKKK 957
KK+++K+E +++G ++ KKK KD E + KK E K
Sbjct: 334 KKEEEKEEVTVSPDGGRRRGRRRVMKKKTFKDEEGYLVTKKVYEWESFSEDEAEPPPTKP 393
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKT 983
K K KK K +K K+ +
Sbjct: 394 KPKVSTPAVPAAAKKPKAPKKKKQSS 419
Score = 32.1 bits (73), Expect = 1.4
Identities = 19/130 (14%), Positives = 40/130 (30%), Gaps = 41/130 (31%)
Query: 900 TRRKKKKGAEKIRK------KKKKK-------KEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
T K+K+ ++++K + ++ E E+ + + KK+++ E
Sbjct: 287 TEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSP 346
Query: 947 GAEKIGKKKKKKKKK----------------------------KKKKKKKKKKKKKEAEK 978
+ +++ KKK K K K A
Sbjct: 347 DGGRRRGRRRVMKKKTFKDEEGYLVTKKVYEWESFSEDEAEPPPTKPKPKVSTPAVPAAA 406
Query: 979 MKKKTEEAEK 988
K K + +K
Sbjct: 407 KKPKAPKKKK 416
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 47.5 bits (113), Expect = 2e-05
Identities = 21/83 (25%), Positives = 52/83 (62%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
+ +K + + +R ++K+K+ E+ KK+ ++ KKKK+ A K+K++ +K ++
Sbjct: 266 PSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQE 325
Query: 959 KKKKKKKKKKKKKKKKEAEKMKK 981
++K++ K +K++K++E E ++
Sbjct: 326 ERKEQLAKLRKEEKEREKEYEQE 348
Score = 43.2 bits (102), Expect = 5e-04
Identities = 22/103 (21%), Positives = 47/103 (45%), Gaps = 16/103 (15%)
Query: 896 RKKRTRRKKKKGAEKI----RKKKKKKKEAEKMRKKKG------------AEKMKKKKKD 939
R+K+ R KK K+ K +KK++EAE +K K E M++ ++
Sbjct: 14 REKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKFKANPVPASVKLPLYEEIMEQNEER 73
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
E++++K + +K +++++KK +K ++
Sbjct: 74 REEVREKSKAILLSSQKPFSFYEREEQKKAILPRKLRSSTSER 116
Score = 39.0 bits (91), Expect = 0.009
Identities = 19/93 (20%), Positives = 41/93 (44%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
K +R + ++ K + K E + K + K K+ ++ K+
Sbjct: 237 HKSSSRTYLDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKEL 296
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++KKKKK+ K K++ + + +K EE ++
Sbjct: 297 EQRKKKKKEMAPKVKQRFEANDPAQKLQEERKE 329
Score = 37.8 bits (88), Expect = 0.021
Identities = 15/79 (18%), Positives = 48/79 (60%), Gaps = 7/79 (8%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
F + + K+ K ++ ++ +++KK+ ++M A K+K++ + + +K ++ K+
Sbjct: 277 FLRTERKEKEAKEQQEKKELEQRKKKKKEM-----APKVKQRFEANDPAQK--LQEERKE 329
Query: 955 KKKKKKKKKKKKKKKKKKK 973
+ K +K++K+++K+ +++
Sbjct: 330 QLAKLRKEEKEREKEYEQE 348
Score = 37.8 bits (88), Expect = 0.021
Identities = 19/65 (29%), Positives = 38/65 (58%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
++ K ++++ K+R + A+++ + + +M K A++ KKKK+ +KK K KKK
Sbjct: 135 LKDKMQEEELKRKIRVQMRAQELLQSSRLPPRMAKHEAQERLTKKKKRGQKKSKYKKKTF 194
Query: 971 KKKKE 975
K K+
Sbjct: 195 KPKRA 199
Score = 36.3 bits (84), Expect = 0.064
Identities = 19/90 (21%), Positives = 44/90 (48%), Gaps = 11/90 (12%)
Query: 903 KKKKGAEKIRKKKKKKKEA-----EKMRKKKGA------EKMKKKKKDAEKMKKKGAEKI 951
+ K + KK E+ RK+K A ++++++KK ++M K ++
Sbjct: 255 ENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRF 314
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
+K ++++K++ K +K+E E+ K+
Sbjct: 315 EANDPAQKLQEERKEQLAKLRKEEKEREKE 344
Score = 34.8 bits (80), Expect = 0.20
Identities = 14/88 (15%), Positives = 37/88 (42%), Gaps = 4/88 (4%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKK-EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
++R ++K + +K + +KK + + + KK A+ + K
Sbjct: 70 NEERREEVREKSKAILLSSQKPFSFYEREEQKKAILPRKLRSSTSEREPKKFKAKPVPKS 129
Query: 955 KKK---KKKKKKKKKKKKKKKKKEAEKM 979
K K ++++ K+K + + A+++
Sbjct: 130 IYIPLLKDKMQEEELKRKIRVQMRAQEL 157
Score = 33.6 bits (77), Expect = 0.52
Identities = 15/82 (18%), Positives = 36/82 (43%), Gaps = 2/82 (2%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK--KKK 966
E+ R++ ++K +A + +K +++++ + +K +++ KK K K K
Sbjct: 71 EERREEVREKSKAILLSSQKPFSFYEREEQKKAILPRKLRSSTSEREPKKFKAKPVPKSI 130
Query: 967 KKKKKKKKEAEKMKKKTEEAEK 988
K K E+ K+ +
Sbjct: 131 YIPLLKDKMQEEELKRKIRVQM 152
Score = 32.1 bits (73), Expect = 1.4
Identities = 15/33 (45%), Positives = 25/33 (75%)
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K++KKK++KK ++ K ++E K++KK EEAE
Sbjct: 13 KREKKKREKKSIRQSKLEEELNKLEKKEEEAEC 45
Score = 30.9 bits (70), Expect = 3.3
Identities = 12/83 (14%), Positives = 34/83 (40%), Gaps = 6/83 (7%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
+ R+++ ++K + + ++++ + + + +++ KK K K K
Sbjct: 70 NEERREEVREKSKAILLSSQKPFSFYEREEQKKAILPRKLRSSTSEREPKKFKAKPVPKS 129
Query: 969 ------KKKKKKEAEKMKKKTEE 985
K K ++E K K + +
Sbjct: 130 IYIPLLKDKMQEEELKRKIRVQM 152
Score = 29.7 bits (67), Expect = 7.0
Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 15/81 (18%)
Query: 894 LFRKKRTRRKKKKGA---------------EKIRKKKKKKKEAEKMRKKKGAEKMKKKKK 938
L RK R + + ++ E++ KKKK+ ++ K +KK K K
Sbjct: 144 LKRKIRVQMRAQELLQSSRLPPRMAKHEAQERLTKKKKRGQKKSKYKKKTFKPKRAKSIP 203
Query: 939 DAEKMKKKGAEKIGKKKKKKK 959
D EK+ +K +++ +KKK K+
Sbjct: 204 DFEKLHEKFQKQLAEKKKSKR 224
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 44.6 bits (106), Expect = 2e-05
Identities = 18/83 (21%), Positives = 43/83 (51%), Gaps = 7/83 (8%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
+ ++ K+ K EK + +K K ++KK +AEK+ K AE+ K + + +K
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEK---KQEEKKSEAEKLAKMSAEE----KAEYELEK 53
Query: 963 KKKKKKKKKKKKEAEKMKKKTEE 985
+K+ ++ + + ++K + ++
Sbjct: 54 LEKELEELEAELARRELKAEAKK 76
Score = 41.9 bits (99), Expect = 2e-04
Identities = 17/67 (25%), Positives = 33/67 (49%), Gaps = 1/67 (1%)
Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK-KKKKKKKKKKKKKKKKKKEAEKMK 980
E+ + E K K+ K +KK EK + +K K ++K + + +K +KE E+++
Sbjct: 3 EEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELE 62
Query: 981 KKTEEAE 987
+ E
Sbjct: 63 AELARRE 69
Score = 40.7 bits (96), Expect = 5e-04
Identities = 18/78 (23%), Positives = 41/78 (52%), Gaps = 2/78 (2%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K T ++ K K + K +KK+E +K +K KM ++K +++K +++ + +
Sbjct: 6 KTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEK-LAKMSAEEKAEYELEKL-EKELEELEA 63
Query: 957 KKKKKKKKKKKKKKKKKK 974
+ +++ K + KK +K
Sbjct: 64 ELARRELKAEAKKMLSEK 81
Score = 36.9 bits (86), Expect = 0.010
Identities = 22/69 (31%), Positives = 38/69 (55%), Gaps = 5/69 (7%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
K+K EK K+++KK EAEK+ K E K + + EK++K+ E + +++ K +
Sbjct: 19 KEKAKWEK--KQEEKKSEAEKLAKMSAEE---KAEYELEKLEKELEELEAELARRELKAE 73
Query: 963 KKKKKKKKK 971
KK +K
Sbjct: 74 AKKMLSEKG 82
Score = 34.5 bits (80), Expect = 0.072
Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 2/67 (2%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K + +KK EK + +K K + + + + E K +K+ E + ++ + K
Sbjct: 18 AKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEY--ELEKLEKELEELEAELARRELKAEAK 75
Query: 957 KKKKKKK 963
K +K
Sbjct: 76 KMLSEKG 82
Score = 31.5 bits (72), Expect = 0.78
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++K K+ K K+K K +K ++K EAEK
Sbjct: 2 PEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEK 37
Score = 29.5 bits (67), Expect = 3.6
Identities = 10/60 (16%), Positives = 30/60 (50%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ ++ + +KK AEK+ K ++K ++ K + + + + ++K + + + +K
Sbjct: 23 KWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEAKKMLSEKG 82
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 48.1 bits (115), Expect = 2e-05
Identities = 31/120 (25%), Positives = 58/120 (48%), Gaps = 32/120 (26%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKK-----------------------GAEKMK 934
KR ++ K+ EK RK +K+ +E EK+ KK+ E+++
Sbjct: 462 KRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELE 521
Query: 935 KKKKDAEKMKKK---------GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
KK ++ EK+K+K +K +K ++ KKK + +KK + ++E ++ K+ EE
Sbjct: 522 KKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEE 581
Score = 45.4 bits (108), Expect = 1e-04
Identities = 27/109 (24%), Positives = 65/109 (59%), Gaps = 3/109 (2%)
Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
+++ +E L +L + + E++ K +K+ KE E+++++ E+++K+ +
Sbjct: 192 EELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEE--IEELEKELES 249
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E K+K EKI ++ +++ ++ KK+ ++ ++K KE +++K+K EE K
Sbjct: 250 LEGSKRKLEEKI-RELEERIEELKKEIEELEEKVKELKELKEKAEEYIK 297
Score = 44.7 bits (106), Expect = 2e-04
Identities = 27/96 (28%), Positives = 54/96 (56%), Gaps = 4/96 (4%)
Query: 895 FRKKRTRRKKKKG--AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
+ + +K+ E+I +++ KE E+ K++ E++KKK K+ EK ++ E+
Sbjct: 305 YLDELREIEKRLSRLEEEINGIEERIKELEE--KEERLEELKKKLKELEKRLEELEERHE 362
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ K KK++ ++ KK+ EK++K+ EE EK
Sbjct: 363 LYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEK 398
Score = 44.3 bits (105), Expect = 3e-04
Identities = 19/101 (18%), Positives = 57/101 (56%), Gaps = 9/101 (8%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA--------EKMKKKKKDAEKMKKKG 947
+ + + ++ ++I + ++K KE +++++K E+ + ++ EK +
Sbjct: 260 KIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRL 319
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E+I +++ K+ ++K+++ ++ KKK ++++K+ EE E+
Sbjct: 320 EEEINGIEERIKELEEKEERLEELKKKL-KELEKRLEELEE 359
Score = 43.5 bits (103), Expect = 6e-04
Identities = 32/128 (25%), Positives = 62/128 (48%), Gaps = 20/128 (15%)
Query: 875 VSDVNDDVRREAVTALGFILFRKKRTRRKK---KKGAEKIRKKKKKKKEAEKMRKKKGA- 930
+ ++ + R L + + K ++ ++ E + K KK+E E+++K+
Sbjct: 330 IKELEEKEER-----LEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGL 384
Query: 931 --EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK------ 982
EK++K+ ++ EK K++ E+I K + + KK+ K+ KK +E +K K K
Sbjct: 385 TPEKLEKELEELEKAKEEIEEEI-SKITARIGELKKEIKELKKAIEELKKAKGKCPVCGR 443
Query: 983 --TEEAEK 988
TEE K
Sbjct: 444 ELTEEHRK 451
Score = 43.5 bits (103), Expect = 6e-04
Identities = 24/99 (24%), Positives = 60/99 (60%), Gaps = 6/99 (6%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKK-KKEAEKMRKK-KGAEKMKKKKKDAEKMKKKGA----E 949
+ + +K++ E+++KK K+ +K E++ ++ + E+ K KK++ E++KK+ E
Sbjct: 328 ERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPE 387
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+ K+ ++ +K K++ +++ K ++KK+ +E +K
Sbjct: 388 KLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKK 426
Score = 43.1 bits (102), Expect = 7e-04
Identities = 18/92 (19%), Positives = 52/92 (56%), Gaps = 4/92 (4%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMR--KKKGAEKMKKKKKDAEKMKKKGAE--KIGKKKK 956
++ + RK ++K +E E+ KK E++++K K+ +++K+K E K+ + +
Sbjct: 244 EKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYE 303
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ + ++ +K+ + ++E ++++ +E E+
Sbjct: 304 EYLDELREIEKRLSRLEEEINGIEERIKELEE 335
Score = 41.2 bits (97), Expect = 0.003
Identities = 23/97 (23%), Positives = 55/97 (56%), Gaps = 7/97 (7%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
R R + E+I++ ++K++ E+++KK ++++K+ ++ E+ + E+ KK
Sbjct: 315 RLSRLEEEINGIEERIKELEEKEERLEELKKKL--KELEKRLEELEERHEL-YEEAKAKK 371
Query: 956 KKKKKKKKKKK----KKKKKKKKEAEKMKKKTEEAEK 988
++ ++ KK+ +K +K+ +E EK K++ EE
Sbjct: 372 EELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEIS 408
Score = 40.0 bits (94), Expect = 0.007
Identities = 24/105 (22%), Positives = 58/105 (55%), Gaps = 13/105 (12%)
Query: 894 LFRKKRTRRKKKKGAEKIRKK--------KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK 945
L R+++ +K ++ +K ++ ++ +KE E++ KK E+ ++ + E+ +
Sbjct: 614 LEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELR---EEYLE 670
Query: 946 KGAEKIGKKKKKKKKKKKKKKKKK--KKKKKEAEKMKKKTEEAEK 988
E G + + ++ +K++++ KK +K K+E E+ +K +E EK
Sbjct: 671 LSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEK 715
Score = 39.7 bits (93), Expect = 0.009
Identities = 20/105 (19%), Positives = 56/105 (53%), Gaps = 12/105 (11%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG---AEKMKKKKKDAEKMKKKGAEKIG 952
+ + ++K + +K+ + +++ E K ++ G E+++++ K+ E + E
Sbjct: 550 KLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKD 609
Query: 953 KKKKKKKKKKKKKKKKKK---------KKKKEAEKMKKKTEEAEK 988
+K+ ++++K+ KK +++ + +K E+++K+ EE EK
Sbjct: 610 AEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEK 654
Score = 39.3 bits (92), Expect = 0.011
Identities = 21/90 (23%), Positives = 52/90 (57%), Gaps = 5/90 (5%)
Query: 902 RKKKKGAEKIRKK-KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
K+ K E+++++ ++ +KE E + K K+++K ++ E+ ++ ++I + ++K K+
Sbjct: 227 EKEVKELEELKEEIEELEKELESLEGSKR--KLEEKIRELEERIEELKKEIEELEEKVKE 284
Query: 961 KKKKKKKKKK--KKKKEAEKMKKKTEEAEK 988
K+ K+K ++ K + E+ + E EK
Sbjct: 285 LKELKEKAEEYIKLSEFYEEYLDELREIEK 314
Score = 38.5 bits (90), Expect = 0.019
Identities = 18/83 (21%), Positives = 47/83 (56%), Gaps = 3/83 (3%)
Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
K AEK ++++K+ + + K E++ + +K E+++K+ E +KK +++ ++
Sbjct: 608 KDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEE---LEKKYSEEEYEEL 664
Query: 966 KKKKKKKKKEAEKMKKKTEEAEK 988
+++ + +E ++ + EE EK
Sbjct: 665 REEYLELSRELAGLRAELEELEK 687
Score = 38.1 bits (89), Expect = 0.027
Identities = 24/117 (20%), Positives = 53/117 (45%), Gaps = 24/117 (20%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK----KGA--- 948
+K+ T +K +++ + +K K+E E+ K A + KK+ E K K A
Sbjct: 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGK 437
Query: 949 -----------------EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E+ + K+ +K+ K+ ++K++K +KE +++K ++ +
Sbjct: 438 CPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESE 494
Score = 38.1 bits (89), Expect = 0.028
Identities = 20/93 (21%), Positives = 45/93 (48%), Gaps = 3/93 (3%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMR-KKKGAEKM-KKKKKDAEKMKKKGAE-KIG 952
K K++ E ++ + + K+ K + E+ ++ KK+ E++++K E K
Sbjct: 229 EVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKEL 288
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K+K ++ K + ++ + +E EK + EE
Sbjct: 289 KEKAEEYIKLSEFYEEYLDELREIEKRLSRLEE 321
Score = 37.0 bits (86), Expect = 0.057
Identities = 30/125 (24%), Positives = 63/125 (50%), Gaps = 19/125 (15%)
Query: 881 DVRREAVTALGFILFRKKRTRRKKKKGAEKIR--KKKKKKKEAEKM-RKKKGAEKMKKK- 936
+R+E + K+ K K+ AE+++ ++K KK E++ +K + EK+K+K
Sbjct: 477 KLRKELRELEKVLK--KESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKL 534
Query: 937 ----------KKDAEK---MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
KK+ EK +KKK AE K + +++ + K+ ++ + E+++++
Sbjct: 535 IKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERL 594
Query: 984 EEAEK 988
+E E
Sbjct: 595 KELEP 599
Score = 36.2 bits (84), Expect = 0.099
Identities = 15/89 (16%), Positives = 48/89 (53%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
+ +++ + EK K+ + +E K ++K+ E +++ + + ++ + E +K+
Sbjct: 171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEV 230
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
K+ ++ K++ ++ +K+ E+ + K+ E
Sbjct: 231 KELEELKEEIEELEKELESLEGSKRKLEE 259
Score = 35.8 bits (83), Expect = 0.14
Identities = 25/118 (21%), Positives = 56/118 (47%), Gaps = 10/118 (8%)
Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRK---KKKKKKEAEKMRKKKGAEKMKKK 936
+ E A + +KR +K+ E +K ++ ++ E + + ++ +
Sbjct: 622 KKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAE 681
Query: 937 KKDAEKMK---KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA---EKMKK-KTEEAE 987
++ EK + KK EK+ ++ ++++K KK+ +K +K ++ EK+KK K E
Sbjct: 682 LEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKALLKE 739
Score = 34.7 bits (80), Expect = 0.27
Identities = 19/99 (19%), Positives = 49/99 (49%), Gaps = 8/99 (8%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK-----KGAEKIG 952
++ K+ E++ K K+ + E++ K+K E + ++ E + + EK+
Sbjct: 168 GEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLE 227
Query: 953 KKKKKKKKKKKK---KKKKKKKKKKEAEKMKKKTEEAEK 988
K+ K+ ++ K++ +K+ + + K+++K E E+
Sbjct: 228 KEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEE 266
Score = 33.9 bits (78), Expect = 0.46
Identities = 21/90 (23%), Positives = 47/90 (52%), Gaps = 6/90 (6%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRK--KKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ R + + + ++ + + +E EK R+ KK EK+K++ E+ +K +++ K +
Sbjct: 662 EELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEEL---EEREKA-KKELEKLE 717
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K ++ ++ ++K KK K E+ K E
Sbjct: 718 KALERVEELREKVKKYKALLKERALSKVGE 747
Score = 32.3 bits (74), Expect = 1.7
Identities = 22/86 (25%), Positives = 47/86 (54%), Gaps = 7/86 (8%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
K E I++ K++ + EK K+ E +++ K+ EK + E + + + + +
Sbjct: 165 KNLGEVIKEIKRRIERLEKFIKRT--ENIEELIKEKEK---ELEEVLREINEISSELPEL 219
Query: 965 KKK--KKKKKKKEAEKMKKKTEEAEK 988
+++ K +K+ KE E++K++ EE EK
Sbjct: 220 REELEKLEKEVKELEELKEEIEELEK 245
Score = 30.4 bits (69), Expect = 6.3
Identities = 28/131 (21%), Positives = 54/131 (41%), Gaps = 38/131 (29%)
Query: 896 RKKRTRRKKKKGAEKIR----KKKKKKKEAEKMRKKKG------------------AEKM 933
K+ + K +I + K+ KK E+++K KG E
Sbjct: 399 AKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYT 458
Query: 934 KKKKKDAEKMKKKGA--EKIGKKKKKKKKKKKKKKK--------------KKKKKKKEAE 977
+ K+ +++K+ K+ K+ ++ +K KK+ + ++K KK E
Sbjct: 459 AELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLE 518
Query: 978 KMKKKTEEAEK 988
+++KK EE EK
Sbjct: 519 ELEKKAEEYEK 529
Score = 30.4 bits (69), Expect = 6.8
Identities = 18/102 (17%), Positives = 54/102 (52%), Gaps = 9/102 (8%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK----KGAEKMKKKKKDAEKMKKKGAEKI 951
+ + ++ G E + + +++ KE E + K AEK ++++ K ++ +K
Sbjct: 572 LAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKA 631
Query: 952 GKK---KKKKKKKKKKKKKKKKKK--KKEAEKMKKKTEEAEK 988
++ +K+ ++ +K+ ++ +KK ++E E+++++ E +
Sbjct: 632 FEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSR 673
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 47.0 bits (112), Expect = 3e-05
Identities = 23/70 (32%), Positives = 36/70 (51%)
Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
K EK++ K K KK +M K+ K ++ +K + K+KKK+KK+ EK
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEK 340
Query: 979 MKKKTEEAEK 988
KK+ E E+
Sbjct: 341 KKKQIERLEE 350
Score = 46.2 bits (110), Expect = 6e-05
Identities = 23/85 (27%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
EK+++K K K K KK + K + + K+KKK+KKK++
Sbjct: 280 EKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEE 339
Query: 969 KKKKK-----KEAEKMKKKTEEAEK 988
KKKK+ + EK++ + + E+
Sbjct: 340 KKKKQIERLEERIEKLEVQATDKEE 364
Score = 44.6 bits (106), Expect = 2e-04
Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 8/86 (9%)
Query: 906 KGAEKIRKKKKKKKEAEKM-----RKKKGAEKMKKK-KKDAEKMKKKGAEKIGKKKKKKK 959
K EKI+ K + K +KM K+K K ++D EK+ + EK KK+KKK+
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEK--KKEKKKE 338
Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEE 985
+KKKK+ ++ +++ ++ E EE
Sbjct: 339 EKKKKQIERLEERIEKLEVQATDKEE 364
Score = 35.4 bits (82), Expect = 0.12
Identities = 18/64 (28%), Positives = 34/64 (53%), Gaps = 4/64 (6%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
K+ + ++ +K +AE KKK +K +KKKK E+++ E+I K + + K
Sbjct: 307 SDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLE----ERIEKLEVQATDK 362
Query: 962 KKKK 965
++ K
Sbjct: 363 EENK 366
Score = 35.4 bits (82), Expect = 0.13
Identities = 21/74 (28%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK-KDAEKMKKKGAEKIGKKK 955
KR ++ K+K K + E+ +K AE +KKK K E+ KKK E++ ++
Sbjct: 293 LKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERI 352
Query: 956 KKKKKKKKKKKKKK 969
+K + + K++ K
Sbjct: 353 EKLEVQATDKEENK 366
Score = 35.0 bits (81), Expect = 0.20
Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 4/54 (7%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK----KKGAEKMKKKKKDAEKMK 944
+ K R +K AE KKK+KKKE +K ++ ++ EK++ + D E+ K
Sbjct: 313 LKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366
>gnl|CDD|220121 pfam09135, Alb1, Alb1. Alb1 is a nuclear shuttling factor involved
in ribosome biogenesis.
Length = 112
Score = 43.6 bits (103), Expect = 3e-05
Identities = 23/89 (25%), Positives = 37/89 (41%), Gaps = 4/89 (4%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG---AEKIGKKKKK 957
R K+ ++ K E KK+ +K K+ + K +KKG AE + K
Sbjct: 25 PRAKRTDRKRPGILAASKNEGVSK-KKQRKKKPKRLTRKQRKRQKKGLERAEIVTDTLSK 83
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
K+ K + K KK+K + K E+
Sbjct: 84 KRAKSIGRAKYVKKRKATWDDTNAKIEDE 112
Score = 32.1 bits (73), Expect = 0.35
Identities = 16/67 (23%), Positives = 32/67 (47%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
R+ + +K + +K + E + KKK++KKK K+ +K++K+
Sbjct: 7 RRAPSPSIDLDKSSLTSLPRAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKR 66
Query: 972 KKKEAEK 978
+KK E+
Sbjct: 67 QKKGLER 73
Score = 29.0 bits (65), Expect = 4.0
Identities = 17/58 (29%), Positives = 28/58 (48%), Gaps = 4/58 (6%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKK----KKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K + K+ ++ G K KKK++KKK K+ +K+ ++ KK E AE
Sbjct: 18 KSSLTSLPRAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKRQKKGLERAE 75
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 43.8 bits (104), Expect = 4e-05
Identities = 22/84 (26%), Positives = 47/84 (55%), Gaps = 8/84 (9%)
Query: 913 KKKKKKKEAEKMRKKKGAE---KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK--- 966
K+++++KE ++ + + + KK + + +K+ +K+KKKKKKK+ K
Sbjct: 32 KEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKELKNFY 91
Query: 967 --KKKKKKKKEAEKMKKKTEEAEK 988
+ ++ K+ E +++KK EE +K
Sbjct: 92 RFQIRESKRNELAELRKKFEEDKK 115
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 46.9 bits (112), Expect = 4e-05
Identities = 33/126 (26%), Positives = 55/126 (43%), Gaps = 23/126 (18%)
Query: 880 DDVRREAVTALGFILFRK---------------KRTRRKKKKGAEKIRKKKKKKKEAEKM 924
D +R E + LG + K +R + +K+ E+ R +K KK+E
Sbjct: 521 DKLRDEWLPNLGIRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEE--- 577
Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK-----KKKKKKKKKKKKKKKKKKEAEKM 979
+KKK EK++K K + K+ +K + ++ KK++KK KE +K
Sbjct: 578 KKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLPTHDADGEEISKKERKKLSKEYDKQ 637
Query: 980 KKKTEE 985
K EE
Sbjct: 638 AKLHEE 643
Score = 40.4 bits (95), Expect = 0.005
Identities = 16/43 (37%), Positives = 30/43 (69%)
Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K +K +++K++K+ K++K+ +K KK+ EK KK+ E+ EK
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 32.3 bits (74), Expect = 1.3
Identities = 12/44 (27%), Positives = 27/44 (61%)
Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + K K++ +++K++K+ K++K K+KK+ E+ +K
Sbjct: 538 KPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKK 581
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 47.0 bits (112), Expect = 5e-05
Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
Query: 897 KKRTRRKKKKGAEKI--RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
+ + KK KGA+ R+ + K+ K A + + KK E+++++ E+ K
Sbjct: 367 RYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLK 426
Query: 955 KKKKKKKKKKKKKK 968
KKKK+KKK+ +K
Sbjct: 427 SKKKKRKKKEWFEK 440
Score = 38.5 bits (90), Expect = 0.019
Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 19/104 (18%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKM--RKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
K R G E+I+ + K K + R K +K+K K + ++ + E I
Sbjct: 337 LKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYY 396
Query: 955 KKKKKKKKKKK-----------------KKKKKKKKKEAEKMKK 981
+ K +K + K KKKK+K+ E +K
Sbjct: 397 ESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
Score = 30.4 bits (69), Expect = 4.8
Identities = 12/44 (27%), Positives = 25/44 (56%)
Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+K +KI + + +KK +K+ KK + K ++ E ++ E+A
Sbjct: 273 EKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAA 316
Score = 30.4 bits (69), Expect = 5.3
Identities = 24/131 (18%), Positives = 50/131 (38%), Gaps = 20/131 (15%)
Query: 878 VNDDVRREAVTALGFILFRKKRTRRKK-KKGAEKIRKKKKKKKEAEKMRKKKG------A 930
+++ R+ + L L +K KK + EK + ++ ++A + ++KG
Sbjct: 271 LDEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANL 330
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKK-------------KKKKKKKKKKKKKKKEAE 977
+ +++ K G E+I + K K KK K K ++ E +
Sbjct: 331 QLIEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELK 390
Query: 978 KMKKKTEEAEK 988
+ E A+
Sbjct: 391 EAIAYYESAKT 401
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 46.6 bits (111), Expect = 6e-05
Identities = 25/90 (27%), Positives = 35/90 (38%), Gaps = 8/90 (8%)
Query: 897 KKRTRRKKKKGAEKIRKK-KKKKKEAEKMR-------KKKGAEKMKKKKKDAEKMKKKGA 948
+ K +K A K+ K+ +K +KE E+ K + K KK K K
Sbjct: 460 EDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSK 519
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
K K K KKKKKK+K +
Sbjct: 520 LDKAANKISKAAVKVKKKKKKEKSIDLDDD 549
Score = 41.2 bits (97), Expect = 0.003
Identities = 22/106 (20%), Positives = 38/106 (35%), Gaps = 14/106 (13%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEA-------------EKMRKKKGAEKMKKKKKDAEK 942
K + K + +K++++E KK K K K +
Sbjct: 464 EAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKA 523
Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K ++ K KKKKKK+K ++++ K+ EE E
Sbjct: 524 ANKI-SKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDED 568
Score = 37.3 bits (87), Expect = 0.048
Identities = 24/93 (25%), Positives = 42/93 (45%), Gaps = 6/93 (6%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRK------KKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+K KK + K+KK+ E E++ +K A K+ K+ + A+K +++
Sbjct: 437 KKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPW 496
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K KK+ KKK + K+ K + K
Sbjct: 497 LKTTSSVGKSAKKQDSKKKSSSKLDKAANKISK 529
Score = 35.4 bits (82), Expect = 0.18
Identities = 29/107 (27%), Positives = 48/107 (44%), Gaps = 20/107 (18%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE------- 949
K RT + + I ++ + +K+ E+++ KK E+ K + AE
Sbjct: 158 KPRTELESE--IASILEESGLASDEKKLTP---FEELELKKLSPEEAKARRAELRKMREL 212
Query: 950 ---KIGKKKKKKKKKKKK-----KKKKKKKKKKEAEKMKKKTEEAEK 988
+ K K+ KK K KK KK+K K++ KE E++ K EA
Sbjct: 213 LFREEAKAKRIKKIKSKKYHRVHKKEKLKEELKEFEELVKADPEAAL 259
Score = 34.3 bits (79), Expect = 0.36
Identities = 22/107 (20%), Positives = 37/107 (34%), Gaps = 19/107 (17%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE---------KMKKKKKDAEKMKKKG 947
K RRK + + KK K+ K K+ E + + K + K
Sbjct: 418 KNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKR 477
Query: 948 AEKIGKKKKK----------KKKKKKKKKKKKKKKKKEAEKMKKKTE 984
+EK K++++ K K KK+ KK++ K
Sbjct: 478 SEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAA 524
Score = 34.3 bits (79), Expect = 0.41
Identities = 24/108 (22%), Positives = 51/108 (47%), Gaps = 19/108 (17%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK-------------KKGAEKMKKKKKDA 940
L K +R AE RKK++ E E++R+ ++ ++K ++K
Sbjct: 373 LLSMKFMQR-----AEA-RKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFG 426
Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ +K AE KK+ K + K+KK+ ++++ ++ E+ K + +
Sbjct: 427 PENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKL 474
Score = 33.9 bits (78), Expect = 0.46
Identities = 18/92 (19%), Positives = 37/92 (40%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K + KK+ E+ + +++ K + K + +K+++ E++ ++
Sbjct: 444 KNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSV 503
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K KK+ KKK K +A K K
Sbjct: 504 GKSAKKQDSKKKSSSKLDKAANKISKAAVKVK 535
Score = 32.3 bits (74), Expect = 1.5
Identities = 29/106 (27%), Positives = 50/106 (47%), Gaps = 17/106 (16%)
Query: 893 ILFRK-KRTRRKKK---KGAEKIRKKKKKK---KEAEKMRKKKGAEKMKKKKKDAEKMKK 945
+LFR+ + +R KK K ++ KK+K K KE E++ K ++ EK+++
Sbjct: 212 LLFREEAKAKRIKKIKSKKYHRVHKKEKLKEELKEFEELVKADPEA----ALEELEKLER 267
Query: 946 KGA-EKIGKKKKKKKKKKKKKKKKKKKKK-----KEAEKMKKKTEE 985
+ A E++ K K K K K + K K E+ ++ EE
Sbjct: 268 RRAEERMTLKHKNTSKWAKSMLKTGRAKYDEEARKALEEQLRQGEE 313
Score = 32.0 bits (73), Expect = 2.1
Identities = 28/114 (24%), Positives = 47/114 (41%), Gaps = 27/114 (23%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE------------------- 941
+KK G K + ++ M+ + AE KK++ DAE
Sbjct: 353 MLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEEN 412
Query: 942 --------KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+K G E K+ + KK KK+ K + K+KK+ + E+ + EEA+
Sbjct: 413 EEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAK 466
Score = 31.2 bits (71), Expect = 3.0
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
KK+KKK+K+K++ K + KKE K KK
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKL 656
Score = 30.8 bits (70), Expect = 4.5
Identities = 20/66 (30%), Positives = 33/66 (50%), Gaps = 16/66 (24%)
Query: 930 AEKMKKKKKDAEKMKKK---------------GAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
AE K+KK+ E+ K G +K KK+K+K++ K + KK+K+K
Sbjct: 593 AEFEKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRFLTKIEGVKKEKRK 652
Query: 975 EAEKMK 980
+ K+K
Sbjct: 653 DK-KLK 657
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 46.5 bits (110), Expect = 7e-05
Identities = 30/80 (37%), Positives = 39/80 (48%), Gaps = 4/80 (5%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM--KKKGAEKIGK 953
RK R KK A + KKK K+A KK A+K KK A+K KK A+K+ +
Sbjct: 975 RKTVRRSVKKAAATRAAMKKKVAKKAPA--KKAAAKKAAAKKAAAKKKVAKKAPAKKVAR 1032
Query: 954 KKKKKKKKKKKKKKKKKKKK 973
K KK KK +K +K
Sbjct: 1033 KPAAKKAAKKPARKAAGRKA 1052
Score = 43.8 bits (103), Expect = 4e-04
Identities = 33/111 (29%), Positives = 46/111 (41%), Gaps = 4/111 (3%)
Query: 874 AVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKM 933
A + V A A+ + ++ G EK+ + +K + KK A +
Sbjct: 933 AQPAQENTVVESAEKAVAAVSEAAQQAATAVASGIEKVAEAVRKT--VRRSVKKAAATRA 990
Query: 934 KKKKKDAEK--MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
KKK A+K KK A+K KK KKK KK KK +K A K K
Sbjct: 991 AMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAK 1041
Score = 40.4 bits (94), Expect = 0.005
Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 10/87 (11%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+K RR KK A + +K+ KK A+K KK A+K K KK
Sbjct: 974 VRKTVRRSVKKAA------ATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKV--AKKA 1025
Query: 956 KKKKKKKKKKKKKKKKK--KKEAEKMK 980
KK +K KK KK +K A +
Sbjct: 1026 PAKKVARKPAAKKAAKKPARKAAGRKA 1052
Score = 30.3 bits (68), Expect = 6.9
Identities = 22/63 (34%), Positives = 27/63 (42%), Gaps = 3/63 (4%)
Query: 929 GAEKMKK-KKKDAEKMKKKGAEK--IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
G EK+ + +K + KK A KKK KK KK KK KK A K K +
Sbjct: 966 GIEKVAEAVRKTVRRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAKKA 1025
Query: 986 AEK 988
K
Sbjct: 1026 PAK 1028
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 45.4 bits (108), Expect = 7e-05
Identities = 44/156 (28%), Positives = 59/156 (37%), Gaps = 49/156 (31%)
Query: 875 VSDVNDDVRRE---------AVTALGFILFRKKR--TRRKKKKGAE---------KIRKK 914
+ DV DD +RE AV L +K +R AE K+RKK
Sbjct: 65 IEDVEDDFKRELAFYKQALAAVLE-ARKLLKKLGVPFKRPDDYFAEMVKSDEHMQKVRKK 123
Query: 915 ----KKKKKEAEKMRKKKGAEKMKK-----KKKDAEKMKKKGAEKIGKKKKKKK------ 959
KK +E+ RK++ +K K K ++ K KK EKI K KKK+K
Sbjct: 124 LLAEAAAKKASEEARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDEG 183
Query: 960 -------------KKKKKKKKKKKKKKKEAEKMKKK 982
KKK +K + K K K K
Sbjct: 184 DLFDFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAK 219
Score = 44.6 bits (106), Expect = 1e-04
Identities = 21/86 (24%), Positives = 37/86 (43%), Gaps = 6/86 (6%)
Query: 895 FRKKRTRRK--KKKGAEKIRKKKKKKKEAE----KMRKKKGAEKMKKKKKDAEKMKKKGA 948
RK+R +K K+ EK++++ K+KK+ K++KK+ + D
Sbjct: 137 ARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDEGDLFDFLVDDAAKG 196
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKK 974
K +K + K K+K K K
Sbjct: 197 GSKKKGRKGGAARGKPNAKRKAKDAK 222
Score = 42.3 bits (100), Expect = 7e-04
Identities = 29/93 (31%), Positives = 46/93 (49%), Gaps = 19/93 (20%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK--KKG------------- 947
KK +E+ RK+++ KK ++++ +K E+ K+KK EK+K KK
Sbjct: 129 AAKKASEEARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDEGDLFDF 188
Query: 948 ----AEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
A K G KKK +K + K K+K K+A
Sbjct: 189 LVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDA 221
Score = 33.9 bits (78), Expect = 0.32
Identities = 24/101 (23%), Positives = 38/101 (37%), Gaps = 26/101 (25%)
Query: 898 KRTRRKKKKGAEKIRK---------KKKKKKEAEKMRKKKGAEKMKKKKKDAEKM----K 944
K+ ++K+K G ++ K KK+ K +G K+K KDA K K
Sbjct: 170 KKLKKKRKGGGDEGDLFDFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDA-KFGFGGK 228
Query: 945 KKGAEK------------IGKKKKKKKKKKKKKKKKKKKKK 973
K+G++ KK K K K+ K +
Sbjct: 229 KRGSKSNTAESSNDLSGFSVKKMKGGGGAGKGGNKRPGKSR 269
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 44.0 bits (104), Expect = 8e-05
Identities = 23/63 (36%), Positives = 32/63 (50%), Gaps = 4/63 (6%)
Query: 926 KKKGAEKMKKKK----KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
KKK K KK + E K +KK KKKK + K++KKKKK+K+ +K +
Sbjct: 110 KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169
Query: 982 KTE 984
E
Sbjct: 170 SPE 172
Score = 43.7 bits (103), Expect = 1e-04
Identities = 22/64 (34%), Positives = 36/64 (56%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
+ + KKK K +K + + + ++ + K +K KKK + K++KKKKK+KK
Sbjct: 104 MHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKK 163
Query: 971 KKKK 974
KKKK
Sbjct: 164 KKKK 167
Score = 41.4 bits (97), Expect = 6e-04
Identities = 25/75 (33%), Positives = 37/75 (49%), Gaps = 7/75 (9%)
Query: 895 FRKKRTRRKKKKGAEKIRKKK-KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
+R + KKK K +K + + E +G + +KK K KK E +
Sbjct: 101 YRLMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHK------KKKHEDDKE 154
Query: 954 KKKKKKKKKKKKKKK 968
+KKKKK+KKKKKK+
Sbjct: 155 RKKKKKEKKKKKKRH 169
Score = 39.8 bits (93), Expect = 0.002
Identities = 20/59 (33%), Positives = 29/59 (49%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
KK K K + + + E + KK +K K++KKKKK+KKKKKK+
Sbjct: 111 KKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169
Score = 38.3 bits (89), Expect = 0.007
Identities = 18/37 (48%), Positives = 23/37 (62%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
+K KKKK E +K RKKK EK KKKK+ + + G
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHPGVGF 178
Score = 34.0 bits (78), Expect = 0.21
Identities = 19/67 (28%), Positives = 31/67 (46%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
L + ++K K +K R + +E + + K KKK E K++ +K K
Sbjct: 103 LMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEK 162
Query: 954 KKKKKKK 960
KKKKK+
Sbjct: 163 KKKKKRH 169
Score = 32.9 bits (75), Expect = 0.46
Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 1/65 (1%)
Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
M + +K K K K + + E+ + K +KK KKKK + KE +K KK+
Sbjct: 104 MHIQPPKKKHKHKHK-KHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEK 162
Query: 984 EEAEK 988
++ +K
Sbjct: 163 KKKKK 167
Score = 32.5 bits (74), Expect = 0.67
Identities = 13/31 (41%), Positives = 21/31 (67%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE 931
++ KKK E +++KKKKKE +K +K+ E
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKRHSPE 172
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 45.9 bits (110), Expect = 8e-05
Identities = 33/122 (27%), Positives = 62/122 (50%), Gaps = 20/122 (16%)
Query: 886 AVTALGFILFRKKRTRRKKKKGAEKIRKK--KKKKKEAEKMRKKK---GAEKMKKKKKDA 940
V A+ RKK K K AE+ K+ ++ KKEAE ++K+ E++ K + +
Sbjct: 15 VVGAVIGYFVRKKIAE-AKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEF 73
Query: 941 EKMKKKGAEKIGKKKKKKKKKK---KKKKKKKKKKKKEAEK-----------MKKKTEEA 986
EK ++ ++ K +K+ +K+ +K + +K+++E EK ++KK EE
Sbjct: 74 EKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEEL 133
Query: 987 EK 988
E+
Sbjct: 134 EE 135
Score = 37.1 bits (87), Expect = 0.042
Identities = 22/114 (19%), Positives = 62/114 (54%), Gaps = 10/114 (8%)
Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEK---IRKKKKKKKEAEKMRKKKGAEKMKKK 936
++ ++EA L K K + EK R+ + +K E +++ +K+ E + +K
Sbjct: 45 EEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLE-KRLLQKE--ENLDRK 101
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK----KTEEA 986
+ EK +++ +K + ++K+++ +KK+++ ++ +++ +++++ EEA
Sbjct: 102 LELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEA 155
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 46.1 bits (110), Expect = 8e-05
Identities = 22/82 (26%), Positives = 39/82 (47%), Gaps = 9/82 (10%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
++++ E + + +K++KK++ AEK +K + KKK + KK K + KK
Sbjct: 397 EEEEGENGNLSPAE-RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455
Query: 975 E-----AEKMKKKT---EEAEK 988
EK+ + EEA K
Sbjct: 456 VDPDPLGEKLARTEDPLEEAMK 477
Score = 44.2 bits (105), Expect = 3e-04
Identities = 24/87 (27%), Positives = 38/87 (43%), Gaps = 4/87 (4%)
Query: 902 RKKKKGAEKIRKK---KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
K GA +I K K E E+ + G ++KK +K +K +K K++ +K
Sbjct: 374 FKAALGAIEIYLKLHDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKA-EKKAEKEEAEK 432
Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEE 985
KKK + KK K + KK +
Sbjct: 433 AAAKKKAEAAAKKAKGPDGETKKVDPD 459
Score = 37.2 bits (87), Expect = 0.040
Identities = 16/55 (29%), Positives = 28/55 (50%), Gaps = 1/55 (1%)
Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK-KKEAEKMKKKTEEAEK 988
K + ++ + ++KK +KK++K +KK +K + E KKK E A K
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAK 444
Score = 36.8 bits (86), Expect = 0.058
Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 5/59 (8%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
RKK RKK++ AEK + +K++ ++A KKK KK K + KK + +G+K
Sbjct: 411 RKKL--RKKQRKAEK-KAEKEEAEKAAA--KKKAEAAAKKAKGPDGETKKVDPDPLGEK 464
Score = 35.7 bits (83), Expect = 0.13
Identities = 13/55 (23%), Positives = 28/55 (50%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ ++ E + KK +KK++K +KK +K++ +K A+K + + K
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAK 447
Score = 33.8 bits (78), Expect = 0.57
Identities = 15/46 (32%), Positives = 21/46 (45%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
L +K+R KK + E + KKK EA + K + KK D
Sbjct: 414 LRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 44.7 bits (106), Expect = 9e-05
Identities = 25/90 (27%), Positives = 46/90 (51%), Gaps = 2/90 (2%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
L ++ R + + + +K+KK++ K ++K K K++K+ EK KK +++
Sbjct: 148 ELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKL--YKELKERKEREKKLKKVEQRLE 205
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+++ KK K KKKK K K + KK
Sbjct: 206 LQRELMKKGKGKKKKIVKDKDGKVVYKWKK 235
Score = 42.0 bits (99), Expect = 7e-04
Identities = 23/96 (23%), Positives = 41/96 (42%), Gaps = 15/96 (15%)
Query: 879 NDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK 938
+ + ++ K+ R+ K+ E+ ++KK KK +++ ++ E MKK K
Sbjct: 159 ISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLELQR--ELMKKGKG 216
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
KKK K K K K KK++K+
Sbjct: 217 K-------------KKKIVKDKDGKVVYKWKKERKR 239
Score = 33.9 bits (78), Expect = 0.32
Identities = 20/79 (25%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
+ + R + +K ++KK + K K+K K +K+ +K+ KK +++ + +++
Sbjct: 151 DRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKL-KKVEQRLELQRE 209
Query: 965 KKKKKKKKKKEAEKMKKKT 983
KK K KKK+ K K
Sbjct: 210 LMKKGKGKKKKIVKDKDGK 228
Score = 32.3 bits (74), Expect = 0.86
Identities = 20/89 (22%), Positives = 41/89 (46%), Gaps = 13/89 (14%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA--EKIGKKKK-----------KKK 959
+++ + ++ K ++ +KKK +K K E+ ++KK ++
Sbjct: 151 DRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLELQREL 210
Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK K KKKK K K+ + + K +E ++
Sbjct: 211 MKKGKGKKKKIVKDKDGKVVYKWKKERKR 239
Score = 32.0 bits (73), Expect = 1.1
Identities = 25/90 (27%), Positives = 39/90 (43%), Gaps = 12/90 (13%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEK--------MKKKKKDAEKMKKKGAEKIGKKK 955
KKK + K KK++ K K+K AE+ M K D K EK K
Sbjct: 21 KKKDYKLRAKDYHKKEKTLKALKEKAAERNPDEFYFGMISSKTDDGVHILKREEKEVLSK 80
Query: 956 KKKKKKKKKKKK----KKKKKKKEAEKMKK 981
++ K K + K++ + K+ EK+K+
Sbjct: 81 EQVKLLKTQDLNYVRTKRQAEAKKIEKLKE 110
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 46.0 bits (110), Expect = 1e-04
Identities = 24/96 (25%), Positives = 43/96 (44%), Gaps = 9/96 (9%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ R + A + + KK KK+ KKK ++ + K K + G+++
Sbjct: 99 AEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRR 158
Query: 956 KKKKKKKKKKKKKKKKK---------KKEAEKMKKK 982
++KKKKK+K +K ++ + AEKM K
Sbjct: 159 RRKKKKKQKPTEKIPREVVIPETITVAELAEKMAVK 194
Score = 36.0 bits (84), Expect = 0.12
Identities = 15/73 (20%), Positives = 40/73 (54%), Gaps = 7/73 (9%)
Query: 923 KMRKKKGAEKMKKKKKDA-EKMKKKGAEK------IGKKKKKKKKKKKKKKKKKKKKKKE 975
K+R + A+++ K+ EK+K+ G E + +++ +K++ K++ +++ K + +E
Sbjct: 3 KVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEE 62
Query: 976 AEKMKKKTEEAEK 988
A + + E +
Sbjct: 63 AAAAEAEEEAKAE 75
Score = 35.2 bits (82), Expect = 0.22
Identities = 19/75 (25%), Positives = 31/75 (41%), Gaps = 17/75 (22%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD---------------- 939
+ K+ + K+K R K K + + R+++ +K K+K +
Sbjct: 127 KPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPETITVAE 186
Query: 940 -AEKMKKKGAEKIGK 953
AEKM K AE I K
Sbjct: 187 LAEKMAVKAAEVIKK 201
>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease. The potyviridae
family positive stand RNA viruses with genome encoding a
polyprotein. members include zucchini yellow mosaic
virus, and turnip mosaic viruses which cause
considerable losses of crops worldwide. This family
consists of a C terminus region from various plant
potyvirus P1 proteins (found at the N terminus of the
polyprotein). The C terminus of P1 is a serine-type
protease responsible for autocatalytic cleavage between
P1 and the helper component protease pfam00851. The
entire P1 protein may be involved in virus-host
interactions.
Length = 245
Score = 44.6 bits (106), Expect = 1e-04
Identities = 26/98 (26%), Positives = 44/98 (44%), Gaps = 12/98 (12%)
Query: 888 TALGFILFRKKR-TRRKKKKGAEKIRKKKKK--KKEAEKMRKKKGAEKMKKKKKDAEKMK 944
+ L RK+ +++KKG +RK K+ EK+++++ E+ + A +
Sbjct: 5 AKVAERLLRKEMSKIKQEKKGRIILRKLSPAQVAKKREKLKREEREERQFLQGAYASIVS 64
Query: 945 K---------KGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
K E + + K+ KK KKKKKKKK
Sbjct: 65 KITPIGTDKVSKTESVSFRTPYYKRTTKKMKKKKKKKK 102
Score = 41.9 bits (99), Expect = 8e-04
Identities = 22/91 (24%), Positives = 34/91 (37%), Gaps = 6/91 (6%)
Query: 891 GFILFRKKRT----RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
I KK + + A+K K K++++E + + A + K K
Sbjct: 17 SKIKQEKKGRIILRKLSPAQVAKKREKLKREEREERQFLQGAYASIVSKITPIGTDKVSK 76
Query: 947 GA--EKIGKKKKKKKKKKKKKKKKKKKKKKE 975
K+ KK KKKKKKKK +
Sbjct: 77 TESVSFRTPYYKRTTKKMKKKKKKKKVVMSD 107
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 43.5 bits (102), Expect = 1e-04
Identities = 19/92 (20%), Positives = 63/92 (68%), Gaps = 2/92 (2%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKM--KKKKKDAEKMKKKGAEKIGK 953
++++ RR++++ R++ K++ E++R+++ A + ++ ++ EK K+K E+ +
Sbjct: 41 QEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQ 100
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
+++++++ +K+K++ + + ++EAE+M+ + E+
Sbjct: 101 EQEEQERIQKQKEEAEARAREEAERMRLEREK 132
Score = 43.2 bits (101), Expect = 1e-04
Identities = 23/93 (24%), Positives = 64/93 (68%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
R++ RR++++ A + +K K+K E E+ ++++ E+++K+K++AE ++ AE++ ++
Sbjct: 71 REEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMRLER 130
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K ++ ++++ ++KK+ +E K +K+E + +
Sbjct: 131 EKHFQQIEQERLERKKRLEEIMKRTRKSEVSPQ 163
Score = 40.1 bits (93), Expect = 0.002
Identities = 20/95 (21%), Positives = 65/95 (68%), Gaps = 5/95 (5%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
+L K+R R+++ E+ +++++++E +++ +++ + +++ E+ ++ E+
Sbjct: 27 LLAEKRRQAREQR---EQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEE-- 81
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+ ++K++K K+K ++++K++++E E+++K+ EEAE
Sbjct: 82 RAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAE 116
Score = 29.3 bits (65), Expect = 6.1
Identities = 20/74 (27%), Positives = 44/74 (59%), Gaps = 7/74 (9%)
Query: 902 RKKKKGAEKIRKKK-----KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
+++++ E+I+K+K + ++EAE+MR ++ EK ++ + +KK E+I K+ +
Sbjct: 99 KQEQEEQERIQKQKEEAEARAREEAERMRLER--EKHFQQIEQERLERKKRLEEIMKRTR 156
Query: 957 KKKKKKKKKKKKKK 970
K + + KK+ K
Sbjct: 157 KSEVSPQVKKEDPK 170
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 44.6 bits (106), Expect = 1e-04
Identities = 17/49 (34%), Positives = 25/49 (51%)
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
D E+ ++K EK K K KK K K ++K+K K ++ K E E
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86
Score = 42.0 bits (99), Expect = 7e-04
Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 5/59 (8%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
EK ++K K A K K K K K ++K+K K++K++K +E E+ + E AEK
Sbjct: 43 EEKEEEKAKVAAKAKAKK-----ALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEK 96
Score = 38.5 bits (90), Expect = 0.010
Identities = 12/56 (21%), Positives = 31/56 (55%)
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
++ K+++K K K + + K ++K+K K++K++K ++ +E + E+
Sbjct: 42 DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKL 97
Score = 36.2 bits (84), Expect = 0.050
Identities = 14/37 (37%), Positives = 17/37 (45%), Gaps = 3/37 (8%)
Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
EK+K AEK K KKKK K K K + +
Sbjct: 193 EKLK---AEKAAKGGKKKKGKAKAKLNVGGANDDDDD 226
Score = 35.4 bits (82), Expect = 0.10
Identities = 10/36 (27%), Positives = 20/36 (55%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++K+++K K K K KK K + K+K + ++
Sbjct: 42 DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKE 77
Score = 34.2 bits (79), Expect = 0.23
Identities = 11/36 (30%), Positives = 15/36 (41%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K +K K KKKK K K K ++ +
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDG 229
Score = 33.5 bits (77), Expect = 0.45
Identities = 14/66 (21%), Positives = 21/66 (31%), Gaps = 17/66 (25%)
Query: 920 EAEKMRKKKGA------EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
+E ++K EK+K +K G KKKK K K K
Sbjct: 176 SSEDIKKVSSTLNVLINEKLKAEKAAK-----------GGKKKKGKAKAKLNVGGANDDD 224
Query: 974 KEAEKM 979
+ +
Sbjct: 225 DDYDGY 230
Score = 32.3 bits (74), Expect = 0.97
Identities = 9/34 (26%), Positives = 21/34 (61%)
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ ++K+++K K K K KK + ++ E+A++
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74
Score = 31.2 bits (71), Expect = 2.2
Identities = 14/57 (24%), Positives = 32/57 (56%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
K+++ A+ K K KK K+ +K+ A++ K++K E + +++ +K + +K
Sbjct: 45 KEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101
Score = 30.0 bits (68), Expect = 5.5
Identities = 11/58 (18%), Positives = 32/58 (55%), Gaps = 3/58 (5%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEK---MKKKKKDAEKMKKKGAEK 950
+K K K + ++ K ++K++A++ +++KG + + + AEK++ + ++
Sbjct: 47 EEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRKLQE 104
Score = 30.0 bits (68), Expect = 5.8
Identities = 12/60 (20%), Positives = 27/60 (45%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
++K EK + K K + K + EK K++K++ + + + +K + +K
Sbjct: 42 DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 42.8 bits (101), Expect = 1e-04
Identities = 20/34 (58%), Positives = 24/34 (70%)
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
K K+K K+KK+KKKKKKKKKKK + KKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 40.8 bits (96), Expect = 6e-04
Identities = 22/40 (55%), Positives = 25/40 (62%), Gaps = 6/40 (15%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
K+ K+ K KK+ KKKKKKKKKKK KK KKKK
Sbjct: 102 KQVLKEKAKQKKQ------KKKKKKKKKKKTSKKAAKKKK 135
Score = 40.4 bits (95), Expect = 7e-04
Identities = 20/33 (60%), Positives = 26/33 (78%)
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
+ + +K K+KK+KKKKKKKKKKK K+A K KK
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 40.4 bits (95), Expect = 7e-04
Identities = 20/38 (52%), Positives = 28/38 (73%)
Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+K +++ K+K K+KK+KKKKKKKKKKK + KKK
Sbjct: 98 QKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 39.3 bits (92), Expect = 0.002
Identities = 19/33 (57%), Positives = 23/33 (69%)
Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
K+ EK +KK+KKKKKKKKKKK KK K+
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 39.3 bits (92), Expect = 0.002
Identities = 20/33 (60%), Positives = 21/33 (63%)
Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
K K K K+KKKKKKKKKKK KK KKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 38.5 bits (90), Expect = 0.003
Identities = 19/37 (51%), Positives = 27/37 (72%)
Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
K+ K+ ++ K+KK+KKKKKKKKKKK KK + +K
Sbjct: 99 KIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 34.7 bits (80), Expect = 0.060
Identities = 25/62 (40%), Positives = 28/62 (45%), Gaps = 25/62 (40%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
KI K+ K+K K KK+KK KKKKKKKKK KK KK
Sbjct: 99 KIVKQVLKEK-----------AKQKKQKK--------------KKKKKKKKKTSKKAAKK 133
Query: 970 KK 971
KK
Sbjct: 134 KK 135
Score = 34.3 bits (79), Expect = 0.084
Identities = 15/31 (48%), Positives = 24/31 (77%)
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+ K+K K+KK+KKKK+ +K KK +++A K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132
Score = 33.5 bits (77), Expect = 0.19
Identities = 15/32 (46%), Positives = 23/32 (71%)
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+ K+K K+KK+KKKKK+ +K K + A+K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKK 133
Score = 32.4 bits (74), Expect = 0.43
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+ K+K K+KK+KKKKKK+ +K K +K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 31.2 bits (71), Expect = 0.98
Identities = 18/47 (38%), Positives = 31/47 (65%), Gaps = 4/47 (8%)
Query: 892 FILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK 938
F++ +K ++ K+ A K+KK+KK+ +K +KKK ++K KKKK
Sbjct: 93 FLVEEQKIVKQVLKEKA----KQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 30.8 bits (70), Expect = 1.4
Identities = 14/34 (41%), Positives = 20/34 (58%)
Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
K K + K+KK+K+ +K +KKK K KKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 43.8 bits (104), Expect = 1e-04
Identities = 15/52 (28%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
K++ ++ AE+ KK K+K ++++K K K +KK+K A+K + +
Sbjct: 108 AKVQAQR--AEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQ 157
Score = 43.0 bits (102), Expect = 2e-04
Identities = 18/74 (24%), Positives = 42/74 (56%), Gaps = 8/74 (10%)
Query: 915 KKKKKEA-EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
+K+ +EA K++ ++ AE+ KK++ A + +K ++++K K K +KK+K + +
Sbjct: 100 RKQLEEAKAKVQAQR-AEQQAKKREAAGEKEKA------PRRERKPKPKAPRKKRKPRAQ 152
Query: 974 KEAEKMKKKTEEAE 987
K + ++ +E
Sbjct: 153 KPEPQHTPVSDISE 166
Score = 42.2 bits (100), Expect = 5e-04
Identities = 12/53 (22%), Positives = 26/53 (49%)
Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K + ++ A+K +K+K ++++K K K +K+ + +K E
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159
Score = 39.9 bits (94), Expect = 0.003
Identities = 16/72 (22%), Positives = 31/72 (43%), Gaps = 6/72 (8%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
K K + + AE+ KK+ A K+K E+ K K +KK+K + +K + +
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKP------KAPRKKRKPRAQKPEPQHTP 160
Query: 973 KKEAEKMKKKTE 984
+ ++
Sbjct: 161 VSDISELTVGQA 172
Score = 34.9 bits (81), Expect = 0.13
Identities = 10/64 (15%), Positives = 29/64 (45%), Gaps = 1/64 (1%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE-KIGKK 954
++ + KK++ A + K +++++ + +K + +K + +E +G+
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQA 172
Query: 955 KKKK 958
K K
Sbjct: 173 VKVK 176
Score = 34.1 bits (79), Expect = 0.22
Identities = 14/64 (21%), Positives = 28/64 (43%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
++ ++ A+K +K+K + RK K KK+K A+K + + + +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQA 172
Query: 962 KKKK 965
K K
Sbjct: 173 VKVK 176
Score = 33.4 bits (77), Expect = 0.38
Identities = 13/68 (19%), Positives = 28/68 (41%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
+ + + AE+ KK++ E EK +++ K K +K + +K + +
Sbjct: 109 KVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELT 168
Query: 962 KKKKKKKK 969
+ K K
Sbjct: 169 VGQAVKVK 176
Score = 33.4 bits (77), Expect = 0.41
Identities = 13/55 (23%), Positives = 28/55 (50%), Gaps = 6/55 (10%)
Query: 940 AEKMKKKGAEKIGKKKKKKKKKK------KKKKKKKKKKKKEAEKMKKKTEEAEK 988
A K ++ K+ ++ +++ KK K+K ++++K K KK+ A+K
Sbjct: 99 ARKQLEEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQK 153
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 45.5 bits (108), Expect = 2e-04
Identities = 25/88 (28%), Positives = 41/88 (46%), Gaps = 12/88 (13%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
KK ++KK I +KK E E R ++ E + + +++K KK E + ++
Sbjct: 1500 KKINQKKKGF----IPSNEKKSIEIEN-RNQEEKEPAGQGELESDKEKKGNLESVLSNQE 1554
Query: 957 K-------KKKKKKKKKKKKKKKKKEAE 977
K + KK+K KK+ K EAE
Sbjct: 1555 KNIEEDYAESDIKKRKNKKQYKSNTEAE 1582
Score = 44.3 bits (105), Expect = 3e-04
Identities = 18/88 (20%), Positives = 36/88 (40%), Gaps = 4/88 (4%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG-AEKMKKKKKDAEKMKKKGAEKIGKK 954
+KK +K + +I + +++KE + + EK + +K E +
Sbjct: 1505 KKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
KK+K KK+ K + E + K+
Sbjct: 1565 DIKKRKN---KKQYKSNTEAELDFFLKR 1589
Score = 36.6 bits (85), Expect = 0.080
Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 2/56 (3%)
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
MKK ++ K KI ++K KKKKKK+KKK+++ K EK + + EA
Sbjct: 719 GLMKKIFRNW--NGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAEA 772
Score = 36.2 bits (84), Expect = 0.12
Identities = 15/74 (20%), Positives = 34/74 (45%)
Query: 891 GFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
GFI +K++ + + E+ + + E++K +K + ++K+ E+ + K
Sbjct: 1508 GFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIK 1567
Query: 951 IGKKKKKKKKKKKK 964
K KK+ K +
Sbjct: 1568 KRKNKKQYKSNTEA 1581
Score = 33.9 bits (78), Expect = 0.53
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 6/53 (11%)
Query: 941 EKMKKKGAEKIGKKKKKK------KKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
MKK GK + K +K KKKKKK+KKK+++ + K + E AE
Sbjct: 719 GLMKKIFRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771
Score = 32.8 bits (75), Expect = 1.2
Identities = 35/115 (30%), Positives = 46/115 (40%), Gaps = 34/115 (29%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKK------------EAE----KMRKKKG-----AEKM 933
F K R K + + K R KKKKKK E E RK+ +++
Sbjct: 863 FYLKPWHRSKLRSSHKDRMKKKKKKKNDFCFLTVWGMETELPFGSPRKRPSFFEPIFKEL 922
Query: 934 KKKKKDAEKMKKKG--AEKIGKKKKKKKKKKKKKKKK--------KKKKKKEAEK 978
KKK + K KKK KI K++ K K K+ KK K+ KE K
Sbjct: 923 KKKIR---KFKKKYFLVLKILKERTKLFLKVSKETKKWIIKSFLFLKRIIKELSK 974
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 45.1 bits (107), Expect = 2e-04
Identities = 23/88 (26%), Positives = 38/88 (43%), Gaps = 7/88 (7%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE-----KIGKKKKK 957
KK + EAEK R++ +K K + + + K
Sbjct: 218 KKSDNLFTLDSGGSTDDEAEKKRQEV--KKKLKINNVSLDDDSTETPASDYYDVSEMVKF 275
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
KK KKKKKKKKK++K + ++++ + E
Sbjct: 276 KKPKKKKKKKKKRRKDLDEDELEPEAEG 303
Score = 42.8 bits (101), Expect = 0.001
Identities = 23/90 (25%), Positives = 44/90 (48%), Gaps = 7/90 (7%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ +RK+++ EKI K ++K++ K+ K + D + + K K
Sbjct: 47 EEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAW-------LKKSK 99
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K++KKK+ ++KK +KE E+ + T E
Sbjct: 100 KRQKKKEAERKKALLLDEKEKERAAEYTSE 129
Score = 42.1 bits (99), Expect = 0.001
Identities = 22/84 (26%), Positives = 45/84 (53%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
++K++ + K+ E++R+K K +E + K G K + D + K +K K+
Sbjct: 42 WKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKR 101
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEK 978
+KKK+ ++KK +K+K+ A +
Sbjct: 102 QKKKEAERKKALLLDEKEKERAAE 125
Score = 39.4 bits (92), Expect = 0.010
Identities = 21/98 (21%), Positives = 39/98 (39%), Gaps = 19/98 (19%)
Query: 910 KIRKK--------KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
K+R K K + K + A + KK+++ + K+K E K K ++K+
Sbjct: 10 KLRAKLGLKPLPVPGAKPGSTKESRDAAAYENWKKRQEEAEAKRKREELREKIAKAREKR 69
Query: 962 KKKKKKKKKKKKKEAE-----------KMKKKTEEAEK 988
++ K K E + K KK+ ++ E
Sbjct: 70 ERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQKKKEA 107
Score = 39.4 bits (92), Expect = 0.011
Identities = 23/110 (20%), Positives = 48/110 (43%), Gaps = 23/110 (20%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K + KKKK +KKKK++K+ ++ + AE + + K ++ ++ K
Sbjct: 273 VKFKKPKKKK-----KKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPK 327
Query: 957 KKKKK------------------KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+K++ K+++ +KK+KK E + ++ E
Sbjct: 328 KRKEEQEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRPEDIARQIAEERS 377
Score = 39.0 bits (91), Expect = 0.015
Identities = 24/84 (28%), Positives = 44/84 (52%), Gaps = 9/84 (10%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK-IGKKKKK--------KKKKKKK 964
KK++++ K ++++ EK+ K ++ E+ K G K +G+ KK KK++
Sbjct: 43 KKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQ 102
Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
KKK+ ++KK K+K AE
Sbjct: 103 KKKEAERKKALLLDEKEKERAAEY 126
Score = 34.0 bits (78), Expect = 0.40
Identities = 19/84 (22%), Positives = 35/84 (41%), Gaps = 12/84 (14%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK--- 964
++ K KK KKK +K K++K E + AE +G +K ++
Sbjct: 269 VSEMVKFKK--------PKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENA 320
Query: 965 -KKKKKKKKKKEAEKMKKKTEEAE 987
+ KK+K+E E ++ +
Sbjct: 321 RLEDSPKKRKEEQEDDDFVEDDDD 344
Score = 32.4 bits (74), Expect = 1.3
Identities = 18/102 (17%), Positives = 47/102 (46%), Gaps = 9/102 (8%)
Query: 896 RKKRTRRKKKK---------GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
+KK+ ++K++K AE + ++ + + + KK+K++ E
Sbjct: 280 KKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDDFV 339
Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ + K+++ +KK+KK + + A ++ ++ E E+
Sbjct: 340 EDDDDLQASLAKQRRLAQKKRKKLRPEDIARQIAEERSEDEE 381
Score = 29.7 bits (67), Expect = 8.1
Identities = 18/88 (20%), Positives = 32/88 (36%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
K+K KKKK + + K + K ++ E KKK
Sbjct: 174 VEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTD 233
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ +KK+++ KKK + + + E
Sbjct: 234 DEAEKKRQEVKKKLKINNVSLDDDSTET 261
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 42.2 bits (100), Expect = 2e-04
Identities = 26/90 (28%), Positives = 46/90 (51%), Gaps = 8/90 (8%)
Query: 892 FILFR----KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG----AEKMKKKKKDAEKM 943
FIL K K +K K+ +++A K KK G A++ K + + K
Sbjct: 41 FILKHLSTLKFTEAVKVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERNKQ 100
Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
+KK K K+++K++K++ K++KKK K +
Sbjct: 101 EKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 41.5 bits (98), Expect = 3e-04
Identities = 24/80 (30%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD--AEKMKKKGAEKIGKKKKKKKKKK 962
K KK+KK K +++ A+++KK A++ K E+ K++KKK+ K+
Sbjct: 50 KFTEAVKVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERN-KQEKKKRSKE 108
Query: 963 KKKKKKKKKKKKEAEKMKKK 982
KK+++K++K++ + +K K K
Sbjct: 109 KKEEEKERKRQLKQQKKKAK 128
Score = 35.7 bits (83), Expect = 0.029
Identities = 19/75 (25%), Positives = 42/75 (56%), Gaps = 9/75 (12%)
Query: 923 KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK---------KKKKKKKKKKKKKKKKKK 973
K + K+KK + ++++++ A+++ K K + ++ K++KKK+ K+K
Sbjct: 50 KFTEAVKVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEK 109
Query: 974 KEAEKMKKKTEEAEK 988
KE EK +K+ + +K
Sbjct: 110 KEEEKERKRQLKQQK 124
Score = 34.1 bits (79), Expect = 0.11
Identities = 22/81 (27%), Positives = 48/81 (59%), Gaps = 7/81 (8%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
A K+ KK+KK ++++++ A K KK + K A++ K + ++ K++KKK+
Sbjct: 54 AVKVSIKKQKKINPKRLQRQ--AAKEVKKPGISTK-----AQQALKLEHERNKQEKKKRS 106
Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
K+KK++++ K + K ++ +
Sbjct: 107 KEKKEEEKERKRQLKQQKKKA 127
Score = 31.5 bits (72), Expect = 0.74
Identities = 12/34 (35%), Positives = 23/34 (67%), Gaps = 4/34 (11%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG 929
+KKR++ KK++ E+ R+ K++KK+ K +G
Sbjct: 102 KKKRSKEKKEEEKERKRQLKQQKKK----AKHRG 131
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 41.8 bits (99), Expect = 2e-04
Identities = 22/98 (22%), Positives = 55/98 (56%), Gaps = 9/98 (9%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK---------MKKKG 947
K+R + + + R++ ++++E K R+++ +K ++ ++ K K++
Sbjct: 5 KRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRR 64
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
AEK +++KK +K+K+++ K+ K + +E + +K EE
Sbjct: 65 AEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEE 102
Score = 33.0 bits (76), Expect = 0.19
Identities = 19/82 (23%), Positives = 47/82 (57%), Gaps = 9/82 (10%)
Query: 916 KKKKEAEKM------RKKKGAEKMKKKKKDAEKMKKKGAE---KIGKKKKKKKKKKKKKK 966
+K++E E++ ++++ + + K+ E+++KK E + K K K+ + K++
Sbjct: 4 EKRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRR 63
Query: 967 KKKKKKKKEAEKMKKKTEEAEK 988
+ +KK ++E + K+K EE ++
Sbjct: 64 RAEKKAEEEKKLRKEKEEEIKE 85
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 43.6 bits (103), Expect = 2e-04
Identities = 21/110 (19%), Positives = 47/110 (42%), Gaps = 35/110 (31%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKK--------KKKDAEKMKKKGAEKIGK------- 953
E+ R+++K+K+ + R+ + A+ M K + + ++ +K A K K
Sbjct: 106 EENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERKERLVAEA 165
Query: 954 --------------------KKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
+K+K++KKK K+ K+++K++K +
Sbjct: 166 REHFGYWVDPRDPRFQEMLQQKEKEEKKKVKEAKRREKEEKRMAALVAAA 215
Score = 32.5 bits (74), Expect = 0.82
Identities = 14/63 (22%), Positives = 35/63 (55%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
+ +E + +K+K A + ++ + A+ M K + +K+K+++K + K++K++
Sbjct: 102 NQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERKERL 161
Query: 975 EAE 977
AE
Sbjct: 162 VAE 164
Score = 30.5 bits (69), Expect = 3.3
Identities = 16/82 (19%), Positives = 38/82 (46%), Gaps = 13/82 (15%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKG------AEKMKKKKKDAEKMKKKGAEKIGKK 954
+K+K+ + K++K++ + R+ G + ++ + EK +KK K
Sbjct: 142 AQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKK-------K 194
Query: 955 KKKKKKKKKKKKKKKKKKKKEA 976
K+ K+++K++K+ A
Sbjct: 195 VKEAKRREKEEKRMAALVAAAA 216
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 43.4 bits (103), Expect = 2e-04
Identities = 23/59 (38%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+ KK E++KKK+ D ++ + +K +++KKKKKKK KKKK KK A K
Sbjct: 149 LLKK--VERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 40.8 bits (96), Expect = 0.001
Identities = 20/56 (35%), Positives = 28/56 (50%)
Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
K+ E+++KK+ K + +D +K EK KKKK KKKK KK K
Sbjct: 150 LKKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 38.1 bits (89), Expect = 0.010
Identities = 22/63 (34%), Positives = 28/63 (44%), Gaps = 7/63 (11%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
KK E+++KK+ KEAE A KK E+ KKK + KKK KK
Sbjct: 150 LKK-VERLKKKELDIKEAEA------ARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVG 202
Query: 964 KKK 966
K
Sbjct: 203 MKA 205
Score = 30.7 bits (70), Expect = 3.1
Identities = 16/54 (29%), Positives = 25/54 (46%), Gaps = 1/54 (1%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
+R ++K+ E + KKK + KKK K KKK +K+ G + I
Sbjct: 154 ERLKKKELDIKEAEAARDKKKSNNAEEEKKKKK-KKSAKKKKLKKVAAVGMKAI 206
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain. This
presumed domain is found at the N-terminus of Ribosomal
L30 proteins and has been termed RL30NT or NUC018.
Length = 71
Score = 39.9 bits (94), Expect = 3e-04
Identities = 21/54 (38%), Positives = 30/54 (55%), Gaps = 7/54 (12%)
Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK+K EK++ K + KK+ KK +KKK+K K AEK K+ AE+
Sbjct: 4 KKRKRNEKLRAK-------RAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAER 50
Score = 33.4 bits (77), Expect = 0.057
Identities = 19/68 (27%), Positives = 34/68 (50%), Gaps = 3/68 (4%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
KK+K+ E + ++ K K +K K+ K AEK K+ + ++ + + K++ KKK
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKREAKKKG 63
Query: 974 ---KEAEK 978
AE
Sbjct: 64 NFYVPAEP 71
Score = 30.7 bits (70), Expect = 0.53
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK+K+ +K + K+ KK+ + KKK + K
Sbjct: 2 LLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFK 36
Score = 30.3 bits (69), Expect = 0.73
Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 5/49 (10%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK---KGAEKMKKKKKDAEK 942
KKR R +K + K KK+ KK A K ++K K AEK K+ + AE+
Sbjct: 4 KKRKRNEKLR--AKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAER 50
Score = 29.1 bits (66), Expect = 1.8
Identities = 17/55 (30%), Positives = 34/55 (61%), Gaps = 4/55 (7%)
Query: 934 KKKKKDAE----KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
KK+K++ + + KK+ A+K +KKK+K K+ +K K+ + E ++++ K E
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKRE 58
Score = 29.1 bits (66), Expect = 2.1
Identities = 14/39 (35%), Positives = 23/39 (58%), Gaps = 3/39 (7%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE---EAEK 988
KK+K+ +K + K+ KK+ K+A + KK+ AEK
Sbjct: 2 LLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEK 40
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 42.0 bits (99), Expect = 3e-04
Identities = 27/91 (29%), Positives = 41/91 (45%), Gaps = 17/91 (18%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK--------------IGKKKKKKKK 960
K + K K + K A K KKK+ ++ E G+KK K+K
Sbjct: 13 KNEPKWKSKRSELKKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKL 72
Query: 961 KKKKKKKKKK---KKKKEAEKMKKKTEEAEK 988
++ K K KK+ K+K++ E+ K EAEK
Sbjct: 73 RRDKLKAKKEEAEKEKEKEERFMKALAEAEK 103
Score = 41.2 bits (97), Expect = 4e-04
Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
+KK EK+R+ K K K++ + EK K++ K + +K++ + +KKK + K K+E
Sbjct: 65 RKKAKEKLRRDKLKAK--KEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEEK 122
Query: 977 EKMKKKTE 984
+ M T
Sbjct: 123 KIMFADTS 130
Score = 38.1 bits (89), Expect = 0.006
Identities = 12/56 (21%), Positives = 30/56 (53%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K K ++ + K++ ++ K+++ K + +K++ + +KK+AE K E+
Sbjct: 67 KAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEEK 122
Score = 37.7 bits (88), Expect = 0.007
Identities = 23/63 (36%), Positives = 40/63 (63%), Gaps = 4/63 (6%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
RKK K+K +K++ KK E+ +K+K+ E+ K AE +K++ + +KKK + K K+
Sbjct: 65 RKKAKEKLRRDKLKAKK--EEAEKEKEKEERFMKALAE--AEKERAELEKKKAEAKLMKE 120
Query: 972 KKK 974
+KK
Sbjct: 121 EKK 123
Score = 36.6 bits (85), Expect = 0.018
Identities = 25/65 (38%), Positives = 38/65 (58%), Gaps = 6/65 (9%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
RKK K + K K KK+EAEK ++K+ E+ K +AEK + ++ KKK + K
Sbjct: 65 RKKAKEKLRRDKLKAKKEEAEKEKEKE--ERFMKALAEAEKERA----ELEKKKAEAKLM 118
Query: 962 KKKKK 966
K++KK
Sbjct: 119 KEEKK 123
Score = 36.2 bits (84), Expect = 0.025
Identities = 16/46 (34%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 944 KKKGAEKIGKKKKKKKKKKK-KKKKKKKKKKKEAEKMKKKTEEAEK 988
+KK EK+ + K K KK++ K+K+K+++ K + +K+ E EK
Sbjct: 65 RKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEK 110
Score = 33.9 bits (78), Expect = 0.14
Identities = 16/51 (31%), Positives = 31/51 (60%), Gaps = 6/51 (11%)
Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK-KKKKKEAEKMKKKTEE 985
+KK EK+++ K KK++ +K+K+K+++ K EAEK + + E+
Sbjct: 65 RKKAKEKLRRDK-----LKAKKEEAEKEKEKEERFMKALAEAEKERAELEK 110
Score = 33.5 bits (77), Expect = 0.21
Identities = 16/51 (31%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK--KGAEKMKKKKKDAEKMKK 945
+K R K K E+ K+K+K++ K + K +++KKK +A+ MK+
Sbjct: 70 EKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKE 120
>gnl|CDD|216739 pfam01851, PC_rep, Proteasome/cyclosome repeat.
Length = 35
Score = 38.6 bits (91), Expect = 4e-04
Identities = 11/34 (32%), Positives = 21/34 (61%)
Query: 355 AAGIAMGLVLLGSKSESAIQDMVAYAKETQHEKI 388
A +A+GL+ GS +E A++ ++ Y +T +E
Sbjct: 1 GAILALGLIHAGSGNEEALELLLPYLSDTSNEVR 34
Score = 38.6 bits (91), Expect = 4e-04
Identities = 11/34 (32%), Positives = 21/34 (61%)
Query: 749 AAGIAMGLVLLGSKSESAIQDMVAYAKETQHEKI 782
A +A+GL+ GS +E A++ ++ Y +T +E
Sbjct: 1 GAILALGLIHAGSGNEEALELLLPYLSDTSNEVR 34
Score = 36.6 bits (86), Expect = 0.002
Identities = 10/35 (28%), Positives = 12/35 (34%)
Query: 714 GGCLGLGLAAMGSARDDIYEQLKFNLYQDDAVTGE 748
G L LGL GS ++ E L L
Sbjct: 1 GAILALGLIHAGSGNEEALELLLPYLSDTSNEVRA 35
Score = 31.6 bits (73), Expect = 0.082
Identities = 14/35 (40%), Positives = 19/35 (54%), Gaps = 1/35 (2%)
Query: 679 GGLYALGLIHANHG-AVINDYLLGQVKDATNEMVR 712
G + ALGLIHA G + LL + D +NE+
Sbjct: 1 GAILALGLIHAGSGNEEALELLLPYLSDTSNEVRA 35
Score = 28.9 bits (66), Expect = 0.95
Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 1/32 (3%)
Query: 317 GGLYALGLIHANHG-AVINDYLLGQVKDATNE 347
G + ALGLIHA G + LL + D +NE
Sbjct: 1 GAILALGLIHAGSGNEEALELLLPYLSDTSNE 32
Score = 27.0 bits (61), Expect = 3.6
Identities = 11/26 (42%), Positives = 14/26 (53%), Gaps = 1/26 (3%)
Query: 280 ATASLGVIHRGHETE-ALAHMQSYLP 304
A +LG+IH G E AL + YL
Sbjct: 2 AILALGLIHAGSGNEEALELLLPYLS 27
Score = 27.0 bits (61), Expect = 3.6
Identities = 11/26 (42%), Positives = 14/26 (53%), Gaps = 1/26 (3%)
Query: 642 ATASLGVIHRGHETE-ALAHMQSYLP 666
A +LG+IH G E AL + YL
Sbjct: 2 AILALGLIHAGSGNEEALELLLPYLS 27
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 43.7 bits (103), Expect = 4e-04
Identities = 18/93 (19%), Positives = 34/93 (36%), Gaps = 18/93 (19%)
Query: 897 KKRTRRKKK---------------KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE 941
T R+ + + AE I+K + +K K RKK K++ + E
Sbjct: 304 FLWTLRRNRLRMIITPWRAPELHAENAE-IKKTRTAEKNEAKARKK--EIAQKRRAAERE 360
Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
++ E+ + + ++ K KKK
Sbjct: 361 INREARQERAAAMARARARRAAVKAKKKGLIDA 393
Score = 37.9 bits (88), Expect = 0.024
Identities = 16/100 (16%), Positives = 44/100 (44%), Gaps = 18/100 (18%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKK----------------EAEKMRKKKGAEKMKKK-KK 938
++ R+K+K +K ++ E +++K + AEK + K +K
Sbjct: 289 EQRAQYREKQK-EKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARK 347
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
K++ AE+ ++ ++++ + + ++ +A+K
Sbjct: 348 KEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKK 387
Score = 35.2 bits (81), Expect = 0.19
Identities = 19/92 (20%), Positives = 38/92 (41%), Gaps = 13/92 (14%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
K+TR +K A K +KK+ A+ K++ AE+ ++ E+ A + + +
Sbjct: 331 EIKKTRTAEKNEA-----KARKKEIAQ---KRRAAEREINREARQER-----AAAMARAR 377
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
++ K KKK E + + +
Sbjct: 378 ARRAAVKAKKKGLIDASPNEDTPSENEESKGS 409
Score = 31.7 bits (72), Expect = 2.1
Identities = 17/79 (21%), Positives = 41/79 (51%), Gaps = 2/79 (2%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
++K+K+K+A ++ +M A ++ + AE I K + +K + K +KK+
Sbjct: 293 QYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAE-IKKTRTAEKNEAKARKKEIA 351
Query: 971 KKKKEAE-KMKKKTEEAEK 988
+K++ AE ++ ++ +
Sbjct: 352 QKRRAAEREINREARQERA 370
>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
Length = 177
Score = 41.8 bits (99), Expect = 4e-04
Identities = 33/108 (30%), Positives = 46/108 (42%), Gaps = 20/108 (18%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK-----------DAEKMKKKGAE 949
++KK K KKK E++ + K KKK K + + K A+
Sbjct: 1 KKKKSSPKRSKGMAKSKKKTREELDAEARERKRKKKHKGLKSGSRHNEGNTQSKGKGQAQ 60
Query: 950 K----IGKKKK-----KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K IG KK ++K K KKKK K KK K E+ +K E E+
Sbjct: 61 KKDPRIGSKKPIPLGVEEKVKPKKKKPKSKKPKLSPEQELEKLENDER 108
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 43.8 bits (104), Expect = 4e-04
Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 9/78 (11%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK------ 962
+I + KK+ K+ E+ ++ AE +KK+++ EK K K +K KK KK
Sbjct: 95 AQIEELKKELKKLEEKIEQLEAEI-EKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALK 153
Query: 963 --KKKKKKKKKKKKEAEK 978
KK K+K KE +
Sbjct: 154 GLNYKKNFKEKLLKELKS 171
Score = 35.8 bits (83), Expect = 0.15
Identities = 17/72 (23%), Positives = 33/72 (45%), Gaps = 12/72 (16%)
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
++ E E ++ KK+ K E EKI + + + +KK+++ +K K K +
Sbjct: 88 EENIEIEAQIEEL-----KKELKKLE-------EKIEQLEAEIEKKEEELEKAKNKFLDK 135
Query: 976 AEKMKKKTEEAE 987
A K K ++
Sbjct: 136 AWKKLAKKYDSN 147
Score = 34.6 bits (80), Expect = 0.27
Identities = 22/93 (23%), Positives = 36/93 (38%), Gaps = 10/93 (10%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
KK + ++ AE I KK+++ ++A+ K +K+ KK KG
Sbjct: 104 LKKLEEKIEQLEAE-IEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGL------- 155
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK K+K K+ K + E K
Sbjct: 156 --NYKKNFKEKLLKELKSVILNASSLLSLEELK 186
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 43.9 bits (103), Expect = 4e-04
Identities = 21/89 (23%), Positives = 45/89 (50%), Gaps = 1/89 (1%)
Query: 897 KKRTRRKKKKGAEKIRKKK-KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
KK+ R K+ +K + + +E ++ + A K+ KK + ++K A +
Sbjct: 70 KKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVE 129
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
++K +KK +K++K KK ++ E ++E
Sbjct: 130 EEKTEKKVRKRRKVKKMDEDVEDQGSESE 158
Score = 40.0 bits (93), Expect = 0.006
Identities = 27/126 (21%), Positives = 52/126 (41%), Gaps = 28/126 (22%)
Query: 890 LGFILFRKKRTRRKKKKGAEKIRKK--------------------KKKKKEAEKMRKKKG 929
LGF FR +R R + K +K KK + ++ KK
Sbjct: 26 LGFNKFRGERWRLQNKFLLAAGSRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTV 85
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK--------KKKKKKKKKKEAEKMKK 981
E + ++ +E + + A + KK ++ ++K +++K +KK ++ K+KK
Sbjct: 86 VEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKK 145
Query: 982 KTEEAE 987
E+ E
Sbjct: 146 MDEDVE 151
Score = 30.4 bits (68), Expect = 6.2
Identities = 13/54 (24%), Positives = 29/54 (53%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
+RTRRK + + ++K +KK ++ + KK E ++ + ++E + +E
Sbjct: 113 TPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESE 166
>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
This family consists of several eukaryotic ribosome
biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
protein that is essential for the maturation of 25 S
rRNA and the 60 S ribosomal subunit assembly in
Saccharomyces cerevisiae.
Length = 164
Score = 41.9 bits (99), Expect = 4e-04
Identities = 23/79 (29%), Positives = 36/79 (45%), Gaps = 11/79 (13%)
Query: 904 KKKGAEKIRKKKKKKK---EAEKMRKKKGAEKMKKKKKDAE-----KMKKKGAEKIGKKK 955
KKKG I+K+KK K EA + K K + + K E KK
Sbjct: 88 KKKG---IKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKDWIIEVKPNDDPKEDPFAKK 144
Query: 956 KKKKKKKKKKKKKKKKKKK 974
+++KK++ K +K++ K K
Sbjct: 145 RREKKERVAKNEKRELKNK 163
Score = 36.1 bits (84), Expect = 0.033
Identities = 22/86 (25%), Positives = 37/86 (43%), Gaps = 18/86 (20%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK----------- 962
K K + EK KKKG +K KK K ++ + + G K+ K +K
Sbjct: 76 KPKPPTKWEKFAKKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKDWIIEVKPNDD 135
Query: 963 -------KKKKKKKKKKKKEAEKMKK 981
KK+++KK++ K ++ K
Sbjct: 136 PKEDPFAKKRREKKERVAKNEKRELK 161
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 43.4 bits (103), Expect = 4e-04
Identities = 17/93 (18%), Positives = 54/93 (58%), Gaps = 14/93 (15%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
K+ ++K ++ ++ +++ ++KE EK R+++ K+ E +K
Sbjct: 118 EKREKQKKLREEIDEFNEERIERKEEEKEREREEELKI--------------LEYQREKA 163
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++++++ +++++K++K++E +++ + EEAE
Sbjct: 164 EREEEREAERRERKEEKEREVARLRAQQEEAED 196
Score = 41.4 bits (98), Expect = 0.001
Identities = 18/98 (18%), Positives = 60/98 (61%), Gaps = 5/98 (5%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
+++++ R++++ AE+ +K++K++E ++R ++ + ++++ D + E K+
Sbjct: 158 YQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKE 217
Query: 955 KKKKKKKKKKKKKKKKKKKKE-----AEKMKKKTEEAE 987
++K+K++ +K++++K++ ++ EK ++ EE
Sbjct: 218 RQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERA 255
Score = 39.5 bits (93), Expect = 0.006
Identities = 14/95 (14%), Positives = 49/95 (51%), Gaps = 11/95 (11%)
Query: 896 RKKRTRRKKKKGAEKIR----KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
K+ ++++ E R +K+ + +E E+ +K K + +++ E+ I
Sbjct: 245 EKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQ-------I 297
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
+K++++ +++++ ++ ++ ++E + + + EE
Sbjct: 298 EEKEERRAAEREEELEEGERLREEEAERQARIEEE 332
Score = 39.1 bits (92), Expect = 0.010
Identities = 18/97 (18%), Positives = 58/97 (59%), Gaps = 9/97 (9%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK---KKKKDAEKMKKKGAEKIGKKKKK 957
R++++K E+ K++++K+E ++ R+++ EK + +++ + E +++ EK + ++
Sbjct: 215 RKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDEEL 274
Query: 958 KKKKKKKKKKKKKKKKKEAEKM------KKKTEEAEK 988
+++ +K++ K+ + ++E E+ ++ E E+
Sbjct: 275 EQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEE 311
Score = 38.7 bits (91), Expect = 0.011
Identities = 17/82 (20%), Positives = 51/82 (62%), Gaps = 3/82 (3%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
++K+ R + + E+ ++K+++KE E+ + K E ++K AE+ +++ AE+ +K+
Sbjct: 122 KQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREK---AEREEEREAERRERKE 178
Query: 956 KKKKKKKKKKKKKKKKKKKEAE 977
+K+++ + + ++++ + + E
Sbjct: 179 EKEREVARLRAQQEEAEDEREE 200
Score = 38.4 bits (90), Expect = 0.016
Identities = 19/100 (19%), Positives = 66/100 (66%), Gaps = 8/100 (8%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMR---KKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
++R +R++++ E+++++++ + E+++ + + EK +K+KK E++ + E+I +
Sbjct: 81 EEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIER 140
Query: 954 KKKKKKKKKKKKKK-----KKKKKKKEAEKMKKKTEEAEK 988
K+++K+++++++ K ++K +++E + +++ + EK
Sbjct: 141 KEEEKEREREEELKILEYQREKAEREEEREAERRERKEEK 180
Score = 38.4 bits (90), Expect = 0.016
Identities = 17/118 (14%), Positives = 56/118 (47%), Gaps = 25/118 (21%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK--------------KKKKDAE 941
K + ++++ E + +++ K E+ R++K E+ + K++++
Sbjct: 32 IKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEY 91
Query: 942 KMKKKGAEKIG-----------KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ + + E++ + ++K++K+KK +++ + +E + K++ +E E+
Sbjct: 92 EERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKERER 149
Score = 38.4 bits (90), Expect = 0.017
Identities = 20/93 (21%), Positives = 53/93 (56%), Gaps = 3/93 (3%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
RK+ R + E+I +++K+++E + R ++ + + ++ E+ + + EK K+K
Sbjct: 65 RKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQK 124
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K +++ + +++ ++K++E E+ + EE K
Sbjct: 125 KLREEIDEFNEERIERKEEEKERER---EEELK 154
Score = 38.0 bits (89), Expect = 0.019
Identities = 15/82 (18%), Positives = 48/82 (58%), Gaps = 10/82 (12%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK---------KKKKKKK 965
KK+ K EK +++ E M++++ A +++ E+ K+++++ ++++K++
Sbjct: 29 KKRIKAEEKEEERRIDEMMEEERLKALAEEEE-RERKRKEERREGRAVLQEQIEEREKRR 87
Query: 966 KKKKKKKKKEAEKMKKKTEEAE 987
+++ +++ +E E+M + E +
Sbjct: 88 QEEYEERLQEREQMDEIIERIQ 109
Score = 33.3 bits (77), Expect = 0.65
Identities = 16/118 (13%), Positives = 59/118 (50%), Gaps = 29/118 (24%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKE-----------------AEKMRKKKGAEKMKKKKK 938
++ + +K++ ++R ++++ ++ K R+K+ E K++++
Sbjct: 172 ERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEKEEAEKRRRQ 231
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKK-----------KKKKKKKKKKKKEAEKMKKKTEE 985
+++++ E+I +K+++ ++++ +K+ + ++ +++ AEK + K E
Sbjct: 232 -KQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLE 288
Score = 33.0 bits (76), Expect = 0.75
Identities = 20/104 (19%), Positives = 57/104 (54%), Gaps = 20/104 (19%)
Query: 902 RKKKKGAEKIRKKKKKKKEA---------------EKMRKKKGAEKMKKKKKDAEKMKKK 946
++ K + ++++K+KE EK R+++ E+++++++ E +++
Sbjct: 49 EERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERI 108
Query: 947 GAEKIGKKKKKKKKKKKKKK-----KKKKKKKKEAEKMKKKTEE 985
E + ++K++K+KK ++ +++ ++KE EK +++ EE
Sbjct: 109 QEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEE 152
Score = 31.4 bits (72), Expect = 2.2
Identities = 18/83 (21%), Positives = 52/83 (62%), Gaps = 4/83 (4%)
Query: 909 EKIRKKKKKKKEAE--KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK-KKKK 965
E++R+ +K + A+ K R + EK K+ K + ++ +++ E + +++ K + ++++
Sbjct: 4 EELRELNEKLRAAKVNKERDAQIEEK-KRIKAEEKEEERRIDEMMEEERLKALAEEEERE 62
Query: 966 KKKKKKKKKEAEKMKKKTEEAEK 988
+K+K+++++ ++++ EE EK
Sbjct: 63 RKRKEERREGRAVLQEQIEEREK 85
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 43.1 bits (101), Expect = 4e-04
Identities = 33/110 (30%), Positives = 45/110 (40%), Gaps = 6/110 (5%)
Query: 117 SATQAFLASVLQELKTCAPI--PDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSS 174
+ TQA A+ K AP P +T V A TP + EAP P + S+
Sbjct: 23 TTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSST 82
Query: 175 TTPGPETKTLES----LNEKEKQLQARITKLSKILSGEVSIDLHLQFLIR 220
P++ T + +N K K L+A TK S E+ I L IR
Sbjct: 83 KVETPQSPTTKQVPTEINPKFKDLRAYYTKPSLEFKNEIGIILKKWTTIR 132
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 42.5 bits (100), Expect = 5e-04
Identities = 18/89 (20%), Positives = 32/89 (35%), Gaps = 2/89 (2%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
+ + + E+ + + + E K ++ K +K +K K K K K K K
Sbjct: 56 PQPEPEPPEEQPKPPTEPETPPEPTPPKP--KEKPKPEKKPKKPKPKPKPKPKPKPKVKP 113
Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K KK K K A + +
Sbjct: 114 QPKPKKPPSKTAAKAPAAPNQPARPPSAA 142
Score = 40.9 bits (96), Expect = 0.002
Identities = 21/84 (25%), Positives = 32/84 (38%), Gaps = 6/84 (7%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
E+ + + + +E K + + + + +K EK KK K K K K K K
Sbjct: 53 TEEPQPEPEPPEEQPKPPTE---PETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKP 109
Query: 968 K---KKKKKKEAEKMKKKTEEAEK 988
K + K KK K K A
Sbjct: 110 KVKPQPKPKKPPSKTAAKAPAAPN 133
Score = 37.5 bits (87), Expect = 0.024
Identities = 17/74 (22%), Positives = 27/74 (36%), Gaps = 2/74 (2%)
Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK--KKKKK 974
E + + E+ K + + + K K +KK KK K K K K K K
Sbjct: 51 APTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPK 110
Query: 975 EAEKMKKKTEEAEK 988
+ K K ++
Sbjct: 111 VKPQPKPKKPPSKT 124
Score = 34.8 bits (80), Expect = 0.17
Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 1/72 (1%)
Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
K + + + +++ + E K K+K K +KK KK K K K +
Sbjct: 46 AKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPK- 104
Query: 977 EKMKKKTEEAEK 988
K K K + K
Sbjct: 105 PKPKPKVKPQPK 116
Score = 32.5 bits (74), Expect = 0.89
Identities = 15/81 (18%), Positives = 23/81 (28%), Gaps = 8/81 (9%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
+ K+K K ++ K K K K K K K + K K K K
Sbjct: 75 TPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPK--------KPPSKTAA 126
Query: 963 KKKKKKKKKKKKEAEKMKKKT 983
K + + +
Sbjct: 127 KAPAAPNQPARPPSAASASGA 147
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 43.7 bits (104), Expect = 5e-04
Identities = 28/95 (29%), Positives = 50/95 (52%), Gaps = 19/95 (20%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK-------- 954
K AEK++++ ++KKE + + K E+ +K+ + A K KK A++I K+
Sbjct: 541 ALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGG 600
Query: 955 -----------KKKKKKKKKKKKKKKKKKKKEAEK 978
+K+ K +KK+KKKKK+KE ++
Sbjct: 601 YASVKAHELIEARKRLNKANEKKEKKKKKQKEKQE 635
Score = 41.4 bits (98), Expect = 0.003
Identities = 28/96 (29%), Positives = 52/96 (54%), Gaps = 12/96 (12%)
Query: 897 KKRTRRKKKKGAEKIRK----KKKKKKEAEKMRK------KKGAEKMKKKKKDAEKMKKK 946
K + K++ EK K + K +EAEK + KK A+++ K+ + +K
Sbjct: 543 LKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYA 602
Query: 947 G--AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
A ++ + +K+ K +KK+KKKKK+K++ E++K
Sbjct: 603 SVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
Score = 39.4 bits (93), Expect = 0.010
Identities = 19/73 (26%), Positives = 44/73 (60%), Gaps = 4/73 (5%)
Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK----KKE 975
E + ++ AE+ + K+AEK+K++ EK K ++++ K ++ +K+ ++ KKE
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKE 585
Query: 976 AEKMKKKTEEAEK 988
A+++ K+ + +K
Sbjct: 586 ADEIIKELRQLQK 598
Score = 37.1 bits (87), Expect = 0.051
Identities = 33/89 (37%), Positives = 42/89 (47%), Gaps = 19/89 (21%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRK-----------KKGAEKMKKKKKDAE------KM 943
R++ K EK KKKKK+KE ++ K +KG K+A KM
Sbjct: 613 RKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAIVQAGIMKM 672
Query: 944 KKKGA--EKIGKKKKKKKKKKKKKKKKKK 970
K + EKI K KKKKKKK K K K +
Sbjct: 673 KVPLSDLEKIQKPKKKKKKKPKTVKPKPR 701
Score = 34.8 bits (81), Expect = 0.25
Identities = 27/111 (24%), Positives = 43/111 (38%), Gaps = 26/111 (23%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE-------KMKKKG- 947
++ R +K + K + + +K K +KK +K K+K+K E K G
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQ 650
Query: 948 -AEKIGKKKKKK-----------------KKKKKKKKKKKKKKKKEAEKMK 980
E + K+ +K +K KKKKKKK K K +
Sbjct: 651 KGEVLSIPDDKEAIVQAGIMKMKVPLSDLEKIQKPKKKKKKKPKTVKPKPR 701
>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein. This is a novel repeat
in Pneumocystis carinii Major surface glycoprotein (MSG)
some members of the alignment have up to nine repeats of
this family, the repeats containing several conserved
cysteines. The MSG of P. carinii is an important protein
in host-pathogen interactions. Surface glycoprotein A
from Pneumocystis carinii is a main target for the host
immune system, this protein is implicated in the
attachment of Pneumocystis carinii to the host alveolar
epithelial cells, alveolar macrophages, host surfactant
and possibly accounts in part for the hypoxia seen in
Pneumocystis carinii pneumonia (PCP).
Length = 81
Score = 39.3 bits (92), Expect = 5e-04
Identities = 19/77 (24%), Positives = 37/77 (48%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
+ K ++K E + E+ K+ + K+ K+ K K ++ K+ K + KK
Sbjct: 1 KDFKLEEKCKELEEYCEFLEEACPLLKEKCEKLKEKCYKLKKDLKLEELLLKELKGELKK 60
Query: 972 KKKEAEKMKKKTEEAEK 988
KKK + +K+K + +K
Sbjct: 61 KKKCKKALKEKCTKLKK 77
Score = 35.1 bits (81), Expect = 0.016
Identities = 21/61 (34%), Positives = 31/61 (50%), Gaps = 3/61 (4%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
K++ EK+++K KK E++ K + KKKKK KK K+K K KK+
Sbjct: 24 PLLKEKCEKLKEKC---YKLKKDLKLEELLLKELKGELKKKKKCKKALKEKCTKLKKESN 80
Query: 975 E 975
E
Sbjct: 81 E 81
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 40.3 bits (95), Expect = 6e-04
Identities = 24/88 (27%), Positives = 41/88 (46%), Gaps = 24/88 (27%)
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAE-------KMKKKGAEKIGKKKKKKKKKKKKKKKK 968
K+++E ++ K+ E D + +K + + K K+K+KKKKKK
Sbjct: 32 KREEEEKEEAKEARNEP------DEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKK 85
Query: 969 KK-----------KKKKEAEKMKKKTEE 985
K+ KKK+E +++KK EE
Sbjct: 86 KELEDFYRFQIREKKKEELAELRKKFEE 113
Score = 29.9 bits (68), Expect = 2.4
Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 9/50 (18%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKM---------RKKKGAEKMKKK 936
RK +T R++ A+ K+KKKKK+ E +KK+ +++KK
Sbjct: 61 RKGKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEELAELRKK 110
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 43.2 bits (102), Expect = 6e-04
Identities = 18/74 (24%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
Query: 907 GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
G ++ K+ K+ + KK G K + DA + + A++ K+++ +KK+K++
Sbjct: 364 GDKRQELLKEYNKKLQDYTKKLGEVK---DETDASEEAEAKAKEEKLKQEENEKKQKEQA 420
Query: 967 KKKKKKKKEAEKMK 980
+ K+K+++ E+ K
Sbjct: 421 DEDKEKRQKDERKK 434
Score = 42.1 bits (99), Expect = 0.001
Identities = 16/70 (22%), Positives = 40/70 (57%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
K E +++ KK ++ K + E ++ +A+ ++K ++ +KK+K++ + K
Sbjct: 365 DKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424
Query: 965 KKKKKKKKKK 974
+K++K ++KK
Sbjct: 425 EKRQKDERKK 434
Score = 37.9 bits (88), Expect = 0.026
Identities = 17/63 (26%), Positives = 34/63 (53%), Gaps = 4/63 (6%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
KK + K + K + +A + + K K++K E+ +KK E+ + K+K++K +
Sbjct: 376 KKLQDYTKKLGEVKDETDASEEAE----AKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431
Query: 963 KKK 965
+KK
Sbjct: 432 RKK 434
Score = 36.3 bits (84), Expect = 0.081
Identities = 18/76 (23%), Positives = 40/76 (52%), Gaps = 8/76 (10%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
R++ + KK + +K + K E + ++ K K++K E+ +KK +K+
Sbjct: 367 RQELLKEYNKKLQDYTKKLGEVKDETDA--SEEAEAKAKEEKLKQEENEKK------QKE 418
Query: 956 KKKKKKKKKKKKKKKK 971
+ + K+K++K ++KK
Sbjct: 419 QADEDKEKRQKDERKK 434
Score = 35.2 bits (81), Expect = 0.16
Identities = 20/84 (23%), Positives = 43/84 (51%), Gaps = 8/84 (9%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKK--------KKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
K ++ K + K E +K+ KK E + A + + K K++K K++
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQE 410
Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
+ +KK+K++ + +K K++ +E +K
Sbjct: 411 ENEKKQKEQADEDKEKRQKDERKK 434
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 42.7 bits (101), Expect = 7e-04
Identities = 22/81 (27%), Positives = 32/81 (39%), Gaps = 6/81 (7%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEA--EKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
F ++R K E+++K K KK + + K K K KKK A
Sbjct: 317 NFNEERVE----KYIERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGAS 372
Query: 952 GKKKKKKKKKKKKKKKKKKKK 972
K + ++ K KKK KK
Sbjct: 373 AVPKSETSQEAKSSGKKKVKK 393
Score = 36.9 bits (86), Expect = 0.053
Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 1/72 (1%)
Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
K+K + R +K E++KK K + + KK KK K K KKKKKK +
Sbjct: 314 KEKNFNEERVEKYIERLKKAKTKKTQTRLDSFFTA-TKKPIKKSNSKAKLKKKKKKAGAS 372
Query: 977 EKMKKKTEEAEK 988
K +T + K
Sbjct: 373 AVPKSETSQEAK 384
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 43.1 bits (101), Expect = 7e-04
Identities = 35/115 (30%), Positives = 53/115 (46%), Gaps = 8/115 (6%)
Query: 865 VATKRLLHVAVSDVNDD-----VRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKK 919
R HVA + +DD ++E I + + + K K + KK+K
Sbjct: 480 FEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETS---EADKDVNKSKNKKRKVD 536
Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
E E+ +K K K+KK +KMK A+K + + KKKKK+ K+KK KK
Sbjct: 537 EEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKKLDSKK 591
Score = 37.7 bits (87), Expect = 0.032
Identities = 27/76 (35%), Positives = 36/76 (47%), Gaps = 3/76 (3%)
Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
K +E K K K RK E+ KK K K+K K+ KK K KK+++
Sbjct: 514 KYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQK---KLYKKMKYSNAKKEEQ 570
Query: 966 KKKKKKKKKEAEKMKK 981
+ KKKKK+ K KK
Sbjct: 571 AENLKKKKKQIAKQKK 586
Score = 36.6 bits (84), Expect = 0.067
Identities = 22/70 (31%), Positives = 37/70 (52%)
Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
K +E K K K KK+ ++ +++ K+ K+KK KK K KK+++AE
Sbjct: 514 KYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAEN 573
Query: 979 MKKKTEEAEK 988
+KKK ++ K
Sbjct: 574 LKKKKKQIAK 583
Score = 30.0 bits (67), Expect = 7.7
Identities = 15/81 (18%), Positives = 35/81 (43%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
T+R ++ + + + A + E +K+ + E K +E K
Sbjct: 467 ETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVN 526
Query: 959 KKKKKKKKKKKKKKKKEAEKM 979
K K KK+K +++++K+ + +
Sbjct: 527 KSKNKKRKVDEEEEEKKLKMI 547
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 42.8 bits (101), Expect = 8e-04
Identities = 20/63 (31%), Positives = 37/63 (58%), Gaps = 1/63 (1%)
Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK-KKKKKKKKEAE 977
KE EK + E+ K++KK+ E+ E++ ++++K++KKKK KK K+ + +
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88
Query: 978 KMK 980
K K
Sbjct: 89 KTK 91
Score = 33.6 bits (77), Expect = 0.57
Identities = 17/60 (28%), Positives = 34/60 (56%), Gaps = 4/60 (6%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
EK +K+ +E E+ ++ EK ++++K +K ++ E+ ++KKKK KK K+
Sbjct: 27 VEKEVEKEVPDEEEEEEKE----EKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTT 82
Score = 30.1 bits (68), Expect = 6.0
Identities = 14/49 (28%), Positives = 33/49 (67%), Gaps = 1/49 (2%)
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+ E+ +K+ ++ +K K+++ ++++K++KKKK +K+K+ T E E
Sbjct: 38 EEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKT-KKVKETTTEWE 85
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 42.5 bits (100), Expect = 8e-04
Identities = 27/95 (28%), Positives = 42/95 (44%), Gaps = 10/95 (10%)
Query: 897 KKRTRRKK--KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
KK K KK EK +K+++ ++ + R+ K K + AE + +K +
Sbjct: 321 KKYGSGAKHVKKEREKEQKEREGRQSEWEARQAKREGGDAKAGRAAEPTGSRTQQKGDRP 380
Query: 955 KKKKKKKKKKKKKKK--------KKKKKEAEKMKK 981
K+ +KKK KK K KKK KE + K
Sbjct: 381 KRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKANAK 415
Score = 41.7 bits (98), Expect = 0.002
Identities = 39/151 (25%), Positives = 54/151 (35%), Gaps = 30/151 (19%)
Query: 868 KRLLHVAVSDVNDDVRR-----EAVTALGFILFRKKRTRRKKKKGAEK-----------I 911
+R+ + ND + R EA+ L KK K K
Sbjct: 26 RRIKNAEKKSKNDKLARIEEEIEALKTLDLDQLAKKYLFSKLIKVITSKPTKAWAESPEF 85
Query: 912 RKKKKKKK--------EAEKMRKKKGAEKMK------KKKKDAEKMKKKGAEKIGKKKKK 957
R+ K EAE + K K + K K+ + +G +KI + K
Sbjct: 86 RQIINDKSHKANPSRSEAEVVEKLKSLNNVVSRLLNSKPVKNVLESIMEGLDKILGIETK 145
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK K KKK KK KKK+ E K EE +
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDKDDEEESE 176
Score = 37.9 bits (88), Expect = 0.022
Identities = 28/114 (24%), Positives = 48/114 (42%), Gaps = 28/114 (24%)
Query: 896 RKKRT-RRKKKKGAEKIRKKKKK--KKEAEKMRKKKGAEKMKKKKKDAE----------- 941
RK R +R ++ EK K KKE EK E+ +++ + +E
Sbjct: 306 RKNRRGQRARQAIWEKKYGSGAKHVKKEREK-------EQKEREGRQSEWEARQAKREGG 358
Query: 942 -KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM------KKKTEEAEK 988
+ AE G + ++K + K+ +KKK KK + + KKK +E +
Sbjct: 359 DAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKA 412
Score = 34.4 bits (79), Expect = 0.28
Identities = 23/82 (28%), Positives = 32/82 (39%), Gaps = 15/82 (18%)
Query: 910 KIRKKKKKKKE----AEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
++ K K E + K G E KK K K KK KKK+ K+
Sbjct: 118 RLLNSKPVKNVLESIMEGLDKILGIETKAKKGKAK-----------KKTKKSKKKEAKES 166
Query: 966 KKKKKKKKKEAEKMKKKTEEAE 987
K +++ E+E K E AE
Sbjct: 167 SDKDDEEESESEDESKSEESAE 188
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 38.8 bits (91), Expect = 8e-04
Identities = 21/84 (25%), Positives = 42/84 (50%), Gaps = 16/84 (19%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA-----------EKIGKKKKKKKK 960
+ K +KK AE RK + K+ KK+ KK E+I ++ ++KK+
Sbjct: 2 LRYKIEKKVAEHHRKLR-----KEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKR 56
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTE 984
K++++K+++K+ +K +K
Sbjct: 57 KQEEEKERRKEARKAERAEARKRG 80
Score = 36.1 bits (84), Expect = 0.007
Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 23/78 (29%)
Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK----------KKK-----KKKKKKK 970
+KK AE +K +K+A+K K KKKK K++ ++KK+K+
Sbjct: 7 EKKVAEHHRKLRKEAKKN--------PTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQ 58
Query: 971 KKKKEAEKMKKKTEEAEK 988
+++KE K +K E AE
Sbjct: 59 EEEKERRKEARKAERAEA 76
Score = 34.9 bits (81), Expect = 0.021
Identities = 19/62 (30%), Positives = 30/62 (48%), Gaps = 14/62 (22%)
Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK---------KKK-----EAEKMKKKTEEA 986
K++KK AE K +K+ KK K KKKK K++ E +K K++ E+
Sbjct: 4 YKIEKKVAEHHRKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKE 63
Query: 987 EK 988
+
Sbjct: 64 RR 65
Score = 30.3 bits (69), Expect = 0.92
Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 22/88 (25%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD-----AEKMKKKGAEK 950
+K +K +KEA+K K KKKKD + K++ E+
Sbjct: 8 KKVAEH------------HRKLRKEAKKNPTWK-----SKKKKDPGIPNSFPFKEEILEE 50
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
I +KK+K++++K+++K+ +K ++ EA K
Sbjct: 51 IEEKKRKQEEEKERRKEARKAERAEARK 78
Score = 29.9 bits (68), Expect = 1.0
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKM 979
+ + K +KK + +K +KEA+K
Sbjct: 1 RLRYKIEKKVAEHHRKLRKEAKKN 24
Score = 29.1 bits (66), Expect = 2.0
Identities = 9/33 (27%), Positives = 17/33 (51%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
+ K +KK + +K +K+ KK KK ++
Sbjct: 2 LRYKIEKKVAEHHRKLRKEAKKNPTWKSKKKKD 34
Score = 28.8 bits (65), Expect = 3.3
Identities = 16/69 (23%), Positives = 34/69 (49%), Gaps = 16/69 (23%)
Query: 901 RRKKKKGAEK---IRKKKKK----------KKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
RK +K A+K + KKKK K E++ ++ +K K++++ + + +
Sbjct: 14 HRKLRKEAKKNPTWKSKKKKDPGIPNSFPFK---EEILEEIEEKKRKQEEEKERRKEARK 70
Query: 948 AEKIGKKKK 956
AE+ +K+
Sbjct: 71 AERAEARKR 79
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 43.1 bits (102), Expect = 8e-04
Identities = 28/89 (31%), Positives = 44/89 (49%), Gaps = 12/89 (13%)
Query: 909 EKIRKK-KKKKKEAEKMRK-----------KKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
EK+ + +KK+KE EK++ K E ++++++ EK K K K
Sbjct: 1105 EKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKG 1164
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K K +K K KKK+KKKK++ K K
Sbjct: 1165 KASKLRKPKLKKKEKKKKKSSADKSKKAS 1193
Score = 40.4 bits (95), Expect = 0.005
Identities = 27/84 (32%), Positives = 42/84 (50%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
K+ A++ R K K K +A K+RK K +K KKKKK + KK + K+ +K+K
Sbjct: 1150 KEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKL 1209
Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
K KK + ++ EE +
Sbjct: 1210 DDKPDNKKSNSSGSDQEDDEEQKT 1233
Score = 40.0 bits (94), Expect = 0.007
Identities = 24/100 (24%), Positives = 39/100 (39%), Gaps = 11/100 (11%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
+R K K K K +K K + ++ +KKK + KK K+ +++ K K
Sbjct: 1155 EQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPD 1214
Query: 960 KKK-----------KKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK +++K K KK + K KK
Sbjct: 1215 NKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSS 1254
Score = 39.3 bits (92), Expect = 0.014
Identities = 23/88 (26%), Positives = 40/88 (45%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
++K K ++ K K K +K +K +KKKK + K K A +G K+
Sbjct: 1145 EEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSD 1204
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K+K K KK + ++ +E +K
Sbjct: 1205 EKRKLDDKPDNKKSNSSGSDQEDDEEQK 1232
Score = 38.1 bits (89), Expect = 0.030
Identities = 22/93 (23%), Positives = 39/93 (41%), Gaps = 2/93 (2%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K + + ++ + K+ K K +++ + + K+ D E KKK
Sbjct: 1251 SKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKV 1310
Query: 957 KKKKKKKKKKKKKKKKKKE--AEKMKKKTEEAE 987
KK+ + KKKKK ++ A K K KT +
Sbjct: 1311 KKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQ 1343
Score = 37.7 bits (88), Expect = 0.038
Identities = 17/91 (18%), Positives = 31/91 (34%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
+ + K + KK +K + A K KKK +K +KKK
Sbjct: 1277 RVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKK 1336
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K + K+ + + + K K + +
Sbjct: 1337 SKTRVKQASASQSSRLLRRPRKKKSDSSSED 1367
Score = 37.3 bits (87), Expect = 0.050
Identities = 20/93 (21%), Positives = 38/93 (40%), Gaps = 1/93 (1%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
++ ++ K + K +K K +K +KKK + K KK + K + K+K
Sbjct: 1150 KEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKA-SVVGNSKRVDSDEKRK 1208
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K KK ++ + E+ K + + K
Sbjct: 1209 LDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVK 1241
Score = 36.6 bits (85), Expect = 0.079
Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 13/86 (15%)
Query: 901 RRKKKKGAEKIRKK-------------KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
KK+K EK++ ++ +E E++ +K+ A++ + K K K K
Sbjct: 1111 LEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLR 1170
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKK 973
K+ KK+KKKKK K KK
Sbjct: 1171 KPKLKKKEKKKKKSSADKSKKASVVG 1196
Score = 36.6 bits (85), Expect = 0.080
Identities = 25/97 (25%), Positives = 45/97 (46%), Gaps = 4/97 (4%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK----GAEKI 951
K R + KKK +K + K K+A + K + +K+K D + KK G+++
Sbjct: 1167 SKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQE 1226
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++K K KK K+ K KK ++ + E +
Sbjct: 1227 DDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSD 1263
Score = 36.6 bits (85), Expect = 0.081
Identities = 22/93 (23%), Positives = 35/93 (37%), Gaps = 3/93 (3%)
Query: 898 KRTRRKKKKGAEKIRK--KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ + G K KKK KK E +K K +KK A K K K K
Sbjct: 1289 SKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKK-KSEKKTARKKKSKTRVKQASAS 1347
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ + ++ +KKK ++ + + E E
Sbjct: 1348 QSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDED 1380
Score = 36.6 bits (85), Expect = 0.093
Identities = 26/116 (22%), Positives = 48/116 (41%), Gaps = 8/116 (6%)
Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKK-KKKKEAEKMRKKKGAEK------ 932
++ + K +T+ K K + KKK KKKK++ + KK +
Sbjct: 1142 EEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRV 1201
Query: 933 -MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+K+K +K K + G ++ +++K K KK K+ K + K+ E
Sbjct: 1202 DSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDN 1257
Score = 32.3 bits (74), Expect = 1.7
Identities = 19/94 (20%), Positives = 29/94 (30%), Gaps = 3/94 (3%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK---DAEKMKKKGAEKI 951
+K +R K K + + + + K+G K K+
Sbjct: 1235 PKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDG 1294
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K KKK KK+ + A KKK E
Sbjct: 1295 ESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSE 1328
Score = 32.3 bits (74), Expect = 1.8
Identities = 20/88 (22%), Positives = 31/88 (35%), Gaps = 8/88 (9%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
K+ + K ++ + G+ K +K KK E
Sbjct: 1265 LSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGS----KPSSPTKKKVKKRLEGSLAA 1320
Query: 955 KKKKKKKKK----KKKKKKKKKKKEAEK 978
KKKKK +K KKK K + K+ A +
Sbjct: 1321 LKKKKKSEKKTARKKKSKTRVKQASASQ 1348
Score = 31.6 bits (72), Expect = 2.5
Identities = 20/90 (22%), Positives = 31/90 (34%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K+ +K + + K K K A K + + K +
Sbjct: 1241 KRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGS 1300
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
K KKK KK+ + A K KKK+E+
Sbjct: 1301 KPSSPTKKKVKKRLEGSLAALKKKKKSEKK 1330
Score = 31.2 bits (71), Expect = 4.1
Identities = 19/90 (21%), Positives = 30/90 (33%), Gaps = 3/90 (3%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ + K+G K K+ + K + KK +K +
Sbjct: 1258 DEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGS 1317
Query: 956 KKKKKKKKKKKKK---KKKKKKEAEKMKKK 982
KKKKK +KK KKK K ++
Sbjct: 1318 LAALKKKKKSEKKTARKKKSKTRVKQASAS 1347
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 42.7 bits (100), Expect = 0.001
Identities = 29/110 (26%), Positives = 48/110 (43%), Gaps = 19/110 (17%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKM-----------------RKKKGAEKMKKKKK 938
+K + + KK+K E+ + KKK+ + + + + A +
Sbjct: 200 KKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAP 259
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKK--KKKKKKEAEKMKKKTEEA 986
D+E + K AE KK K KKKK++K+K KKKKKK ++
Sbjct: 260 DSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGG 309
Score = 40.4 bits (94), Expect = 0.005
Identities = 37/139 (26%), Positives = 55/139 (39%), Gaps = 51/139 (36%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKG------------------AEKM--------- 933
RR +++ + R+KKK++KE R+ E+M
Sbjct: 88 RRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPENALPSDE 147
Query: 934 ----------------KKKKKDAEKM---KKKGAEKIGKKKKK-----KKKKKKKKKKKK 969
K D+EK+ K + AE +K +KK KK KKK+K
Sbjct: 148 DDKDPNDPYRALDIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEK 207
Query: 970 KKKKKEAEKMKKKTEEAEK 988
K+K+KE +K KKK E K
Sbjct: 208 KEKEKERDKDKKKEVEGFK 226
Score = 31.6 bits (71), Expect = 2.1
Identities = 31/124 (25%), Positives = 49/124 (39%), Gaps = 45/124 (36%)
Query: 903 KKKKGAEKIRKKKKKKKEA--EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK------- 953
+K + AE + +K A +K +K K EK K+K+K+ +K KKK E
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKKSKKPKKKEK-KEKEKERDKDKKKEVEGFKSLLLALDD 234
Query: 954 -----------------------------------KKKKKKKKKKKKKKKKKKKKKEAEK 978
+ ++ KK K KKKK++K+K+E +K
Sbjct: 235 SPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKK 294
Query: 979 MKKK 982
KK
Sbjct: 295 KKKH 298
Score = 31.2 bits (70), Expect = 3.6
Identities = 9/37 (24%), Positives = 26/37 (70%)
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
E+ +++ +K K++KKK++K+K+ ++ + +++E
Sbjct: 87 ERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDE 123
Score = 31.2 bits (70), Expect = 3.6
Identities = 23/112 (20%), Positives = 53/112 (47%), Gaps = 23/112 (20%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK------KGAEKMKKKKKDAE------ 941
+F K+ R K + E K+ +++EA K + K + +K+ +D
Sbjct: 1 IFAKEEQRHSKPRHTEADEKELARRREARKQEQANNPFYIKSSPSPQKRYQDPPGVEHIP 60
Query: 942 --------KMKKKG---AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+K G +++ K +++++ +++ +K K++KKK+E EK ++
Sbjct: 61 VVQIDLSVPLKVPGLPMSDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRR 112
Score = 30.8 bits (69), Expect = 4.5
Identities = 8/34 (23%), Positives = 23/34 (67%)
Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
+++ +++ K K++KKK++K+K+ +++ E
Sbjct: 87 ERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTE 120
Score = 30.0 bits (67), Expect = 8.2
Identities = 9/33 (27%), Positives = 22/33 (66%)
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
K + E+ ++ EK ++KKK++K+K+ +++
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 29.6 bits (66), Expect = 8.6
Identities = 8/38 (21%), Positives = 26/38 (68%)
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K+ ++++ +++ +K K++KKK++K++ + + + E
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTE 120
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 42.9 bits (102), Expect = 0.001
Identities = 11/46 (23%), Positives = 17/46 (36%)
Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
+ K + K KK K +K K++ +EK K T
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKAT 859
Score = 42.1 bits (100), Expect = 0.002
Identities = 15/47 (31%), Positives = 23/47 (48%), Gaps = 3/47 (6%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
AEK +K + A+ K A GKK K +K K++ +K +A
Sbjct: 815 AEKPEKLRYLADAPAKDPA---GKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 40.5 bits (96), Expect = 0.004
Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 3/47 (6%)
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
AEK +K + K KK K +K K+ +K +A
Sbjct: 815 AEKPEKL---RYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 35.9 bits (84), Expect = 0.11
Identities = 7/41 (17%), Positives = 14/41 (34%)
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
EK+ K KK K +K ++ ++ +
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 35.9 bits (84), Expect = 0.12
Identities = 11/49 (22%), Positives = 18/49 (36%), Gaps = 5/49 (10%)
Query: 921 AEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
AEK K + K +K K + +K K++ +K K
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFS-----RKTKQQYVASEKDGKA 858
Score = 34.0 bits (79), Expect = 0.45
Identities = 9/40 (22%), Positives = 16/40 (40%)
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ +K + K KK K + K K++ +EK
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854
Score = 33.2 bits (77), Expect = 0.73
Identities = 11/51 (21%), Positives = 22/51 (43%)
Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
K ++ + A ++ +K +K + K KK A K +KT++
Sbjct: 798 KGRETRAPTVEEALELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQ 848
Score = 32.1 bits (74), Expect = 1.7
Identities = 14/68 (20%), Positives = 24/68 (35%), Gaps = 7/68 (10%)
Query: 895 FRKKRTRR--KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
K R R ++ E + +K +K + K KK A K +K ++
Sbjct: 796 LPKGRETRAPTVEEALELLAEKPEKLRYLADAPAKD-----PAGKKAAVKFSRKTKQQYV 850
Query: 953 KKKKKKKK 960
+K K
Sbjct: 851 ASEKDGKA 858
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 40.8 bits (96), Expect = 0.001
Identities = 24/103 (23%), Positives = 49/103 (47%), Gaps = 7/103 (6%)
Query: 885 EAVTALGFILFRKK-RTRRKKKKGAEKIRKKKKKK-KEAEKMRKKKGAEKMKKKKKDAEK 942
+A + + + + + KK + + + KK+ KE +KM+K+ K K + D +
Sbjct: 22 QAAPKIAIVNLGRIFQESPQAKKVSADLESEFKKRQKELQKMQKE-LKAKEAKLQDDGKM 80
Query: 943 MKKKGAEKIGKKKKKKKK----KKKKKKKKKKKKKKEAEKMKK 981
K + KK+K KK+++ +K ++EAE+ +K
Sbjct: 81 EALSDRAKAEAEIKKEKLVNAFNKKQQEYEKDLNRREAEEEQK 123
Score = 33.9 bits (78), Expect = 0.20
Identities = 14/65 (21%), Positives = 30/65 (46%), Gaps = 1/65 (1%)
Query: 925 RKKKGAEKMK-KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
+ KK + ++ + KK ++++K E K+ K + K + + K + E +K K
Sbjct: 41 QAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVN 100
Query: 984 EEAEK 988
+K
Sbjct: 101 AFNKK 105
Score = 31.2 bits (71), Expect = 1.5
Identities = 20/93 (21%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K+ +++ +K ++++ K+ K ++ KM K + + +K K A KK
Sbjct: 53 FKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRA---KAEAEIKKEKLVNAF----NKK 105
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEE-AEK 988
+++ +K +++ ++++K EK+++ E AEK
Sbjct: 106 QQEYEKDLNRREAEEEQKLLEKIQRAIESVAEK 138
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 42.4 bits (100), Expect = 0.001
Identities = 14/61 (22%), Positives = 31/61 (50%), Gaps = 2/61 (3%)
Query: 917 KKKEAEKMRKK--KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
+ + +R++ K EK+K+K K +K + + ++KKK+KK K +++ +
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394
Query: 975 E 975
Sbjct: 395 N 395
Score = 38.1 bits (89), Expect = 0.018
Identities = 14/57 (24%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
Query: 930 AEKMKKK-KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
++++ +K EK+K+K + K K ++ KK++ + ++KKK+K+A+ ++ +
Sbjct: 339 GISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 36.6 bits (85), Expect = 0.053
Identities = 16/75 (21%), Positives = 29/75 (38%), Gaps = 11/75 (14%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
F + + E++ K+ +K KE K K K K++ K
Sbjct: 332 AFSGEPDGISLR---EELEKRIEKLKEKP--------PKPPTKAKPERDKKERPGRYRRK 380
Query: 954 KKKKKKKKKKKKKKK 968
KK+KK K +++ +
Sbjct: 381 KKEKKAKSERRGLQN 395
Score = 35.8 bits (83), Expect = 0.10
Identities = 14/56 (25%), Positives = 29/56 (51%), Gaps = 3/56 (5%)
Query: 936 KKKDAEKMKK---KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ D +++ K EK+ +K K K K ++ KK++ + K K+K ++E+
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSER 390
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 42.3 bits (99), Expect = 0.001
Identities = 26/112 (23%), Positives = 41/112 (36%), Gaps = 20/112 (17%)
Query: 896 RKKRTRRKKKKGAEKIRKKK-------------------KKKKEAEKMRKKKGA-EKMKK 935
K R +R K K + R K+ K+ K+ K ++ GA K
Sbjct: 201 AKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQEKEMKKIRKWEREAGARLKALA 260
Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K + K K + KKK K+ KK K E + +K ++A+
Sbjct: 261 ALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEKKAIKNAAKDAD 312
Score = 36.2 bits (83), Expect = 0.081
Identities = 25/102 (24%), Positives = 42/102 (41%), Gaps = 14/102 (13%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE---------KMKKKKKDAEKMKKKGAEKI 951
R +K+ K R+K+ K K + R K+ + K K+++ K +K +
Sbjct: 193 RDRKRYSEAKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQEKEMKKIRKWEREA 252
Query: 952 GKKKKKKKKKKKKKKKKKK-----KKKKEAEKMKKKTEEAEK 988
G + K K K + K K + A +KKK +E K
Sbjct: 253 GARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMK 294
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 42.4 bits (98), Expect = 0.001
Identities = 28/84 (33%), Positives = 51/84 (60%), Gaps = 1/84 (1%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
++K A+K +K+ ++ EAE ++ E+ K+ K + E+++K+ +K KK + +KK
Sbjct: 89 EQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKK 148
Query: 963 KKKKKKKKKKKKEAEKMKKKTEEA 986
K+K K +K +K AEK KK + A
Sbjct: 149 KEKAVKAEKAEK-AEKTKKASTPA 171
Score = 41.2 bits (95), Expect = 0.002
Identities = 39/129 (30%), Positives = 66/129 (51%), Gaps = 10/129 (7%)
Query: 870 LLHVAVSDVNDDVRREAVTALGFILFRK---KRTRRKKKKGAEKIRK------KKKKKKE 920
LL + + +D R+EA +L + K + +KKK ++I K K KK
Sbjct: 38 LLTAVLPNSINDQRKEAFASLELTEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKA 97
Query: 921 AEKMRKKKGAEKMKKKKKDAE-KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
AEK ++ AE K+ ++ E K K E+I K+++KK+ KK + +KKK+K +
Sbjct: 98 AEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEK 157
Query: 980 KKKTEEAEK 988
+K E+ +K
Sbjct: 158 AEKAEKTKK 166
Score = 35.0 bits (79), Expect = 0.19
Identities = 24/81 (29%), Positives = 40/81 (49%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
K+ K+ + AE KK+ ++E K K + K+++K +KK AEK +K K
Sbjct: 95 KKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVK 154
Query: 958 KKKKKKKKKKKKKKKKKEAEK 978
+K +K +K KK E+
Sbjct: 155 AEKAEKAEKTKKASTPAPVEE 175
Score = 32.3 bits (72), Expect = 1.5
Identities = 29/103 (28%), Positives = 44/103 (42%), Gaps = 25/103 (24%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK-------- 961
K KK KKK E+E E++ ++ K + + A KK KK K+K
Sbjct: 214 KKNKKNKKKSESEATAAPASVEQVVEQPKVVTEEPHQQAAPQEKKNKKNKRKSESENVPA 273
Query: 962 -----------------KKKKKKKKKKKKKEAEKMKKKTEEAE 987
+ +KK KK KKK E+EK+ ++ +AE
Sbjct: 274 ASETPVEPVVETTPPASENQKKNKKDKKKSESEKVVEEPVQAE 316
Score = 32.0 bits (71), Expect = 1.7
Identities = 30/108 (27%), Positives = 52/108 (48%), Gaps = 18/108 (16%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKM--------KKKKKD-------- 939
+KK + KKK +EK+ ++ + EA K +K + M K++ KD
Sbjct: 293 QKKNKKDKKKSESEKV-VEEPVQAEAPKSKKPTADDNMDFLDFVTAKEEPKDEPAETPAA 351
Query: 940 -AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
E++ + E + +K + KKK KK KKK E+EK+ ++ E+
Sbjct: 352 PVEEVVENVVENVVEKSTTPPATENKKKNKKDKKKSESEKVTEQPVES 399
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 41.7 bits (98), Expect = 0.001
Identities = 20/87 (22%), Positives = 33/87 (37%), Gaps = 11/87 (12%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK---------KKKDAEKMKKKGAEKI 951
R+ K+ G RK++KK +E+ + + K K+ + K A K+
Sbjct: 130 RKPKEPG--LKRKQRKKAQESATSPESSPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKL 187
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEK 978
+K K K KK K+ K
Sbjct: 188 SSTASSGDEKSPKSKAAPKKAGKKMRK 214
Score = 40.5 bits (95), Expect = 0.003
Identities = 23/111 (20%), Positives = 37/111 (33%), Gaps = 23/111 (20%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK---------- 944
+K+ K + E+ +K KK + + K+ K K++KK E
Sbjct: 101 SKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTP 160
Query: 945 -------------KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
K+G + KK K +K K K +K KK
Sbjct: 161 NSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKK 211
Score = 30.5 bits (69), Expect = 4.4
Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 1/70 (1%)
Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE-KIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
KE++K K A + ++ K ++K E K + K+K++KK ++ + +
Sbjct: 99 KESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSS 158
Query: 978 KMKKKTEEAE 987
Sbjct: 159 TPNSSRPSTP 168
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 42.0 bits (99), Expect = 0.001
Identities = 20/85 (23%), Positives = 37/85 (43%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
+ + +K + K+K K+ K KK + + + ++ ++ KKKKKKKKKK
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66
Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAE 987
KKK + + + +
Sbjct: 67 KKKNLGEAYDLAYDLPVVWSSAAFQ 91
Score = 41.6 bits (98), Expect = 0.002
Identities = 23/77 (29%), Positives = 38/77 (49%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
+ K++K++ ++ KK +K KK D + + + + + K+ KKKKKKKKKKK
Sbjct: 8 NEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKK 67
Query: 972 KKKEAEKMKKKTEEAEK 988
KK E +
Sbjct: 68 KKNLGEAYDLAYDLPVV 84
Score = 32.4 bits (74), Expect = 1.4
Identities = 23/76 (30%), Positives = 33/76 (43%), Gaps = 19/76 (25%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK-------------------KKKKKKKKK 972
M + ++ K +K+ ++ G KKK KK KK K+ KKKK
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60
Query: 973 KKEAEKMKKKTEEAEK 988
KK+ +K KK EA
Sbjct: 61 KKKKKKKKKNLGEAYD 76
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 40.5 bits (95), Expect = 0.001
Identities = 23/85 (27%), Positives = 33/85 (38%), Gaps = 4/85 (4%)
Query: 905 KKGA--EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
KGA E + K + + EK K + K K A K AEK K + + +
Sbjct: 90 AKGAFTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEK--KVNEARAEAV 147
Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAE 987
+KK + E + EEAE
Sbjct: 148 AEKKAAEAAAVAAEEAAAAEEEEAE 172
Score = 36.3 bits (84), Expect = 0.035
Identities = 23/69 (33%), Positives = 33/69 (47%), Gaps = 7/69 (10%)
Query: 927 KKGA--EKMKKKKKDAEKMKKKG-----AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
KGA E + K +A K +K+G +K+ K KK KK + +KK + + EA
Sbjct: 90 AKGAFTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAE 149
Query: 980 KKKTEEAEK 988
KK E A
Sbjct: 150 KKAAEAAAV 158
Score = 32.1 bits (73), Expect = 0.87
Identities = 13/42 (30%), Positives = 19/42 (45%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK 944
KK K ++ + KK EAEK + AE + +KK
Sbjct: 117 KKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAV 158
Score = 30.1 bits (68), Expect = 4.3
Identities = 19/63 (30%), Positives = 27/63 (42%)
Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
+ KK + K + + K +K+ K KK K K KK KK A + +KK
Sbjct: 80 LLKKHLLGGVAKGAFTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKV 139
Query: 984 EEA 986
EA
Sbjct: 140 NEA 142
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 38.6 bits (90), Expect = 0.002
Identities = 22/63 (34%), Positives = 25/63 (39%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
K K K + EK K E K K K KK GA + +K K KK KK
Sbjct: 4 KNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDC 63
Query: 973 KKE 975
KK
Sbjct: 64 KKG 66
Score = 37.8 bits (88), Expect = 0.003
Identities = 15/55 (27%), Positives = 24/55 (43%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK K + K+KG + K+ K K K+ K KK + + TE+ +
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKG 56
Score = 37.8 bits (88), Expect = 0.003
Identities = 23/67 (34%), Positives = 30/67 (44%), Gaps = 1/67 (1%)
Query: 909 EKIRKKKKKKKEAE-KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
K K K + KE K KK+ K K K+ A+K + G + +K K KK KK
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61
Query: 968 KKKKKKK 974
KK K
Sbjct: 62 DCKKGCK 68
Score = 34.3 bits (79), Expect = 0.043
Identities = 15/74 (20%), Positives = 27/74 (36%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
KK K + ++K K+ + + K + + + +K K KK KK
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61
Query: 975 EAEKMKKKTEEAEK 988
+ +K K K
Sbjct: 62 DCKKGCKWEGNTCK 75
Score = 33.6 bits (77), Expect = 0.082
Identities = 24/75 (32%), Positives = 34/75 (45%), Gaps = 7/75 (9%)
Query: 903 KKKKGAEKIRKKK---KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
KK K + ++K KK++ K + K+G K KK A + G E +K K KK
Sbjct: 3 KKNKTKTECKEKGCKWDKKEDDGKCKPKEG----KAKKNGAPVTQTAGTETTTEKCKGKK 58
Query: 960 KKKKKKKKKKKKKKK 974
KK KK K +
Sbjct: 59 DKKDCKKGCKWEGNT 73
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 41.4 bits (98), Expect = 0.002
Identities = 16/62 (25%), Positives = 34/62 (54%)
Query: 921 AEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
A++ + +KG ++ + ++MK++ + KK+ K K +KKK+K++ K K K
Sbjct: 30 AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Query: 981 KK 982
+
Sbjct: 90 PR 91
Score = 37.5 bits (88), Expect = 0.026
Identities = 15/61 (24%), Positives = 29/61 (47%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
++KK + ++ E+ K+ K++ + E K +KKK+K++ K K K K
Sbjct: 30 AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Query: 974 K 974
Sbjct: 90 P 90
Score = 33.7 bits (78), Expect = 0.42
Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 5/66 (7%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK-KKDAEKMKKKGAEKIGKKKKK 957
R+K +KG ++ ++ KE M+++ A + KK K K +KK EK K K
Sbjct: 30 AQRKKSQKGELEVTNLNEQYKE---MKEELKAALLDKKELKAWHKAQKK-KEKQEAKAAK 85
Query: 958 KKKKKK 963
K K +
Sbjct: 86 AKSKPR 91
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 40.3 bits (95), Expect = 0.002
Identities = 23/89 (25%), Positives = 47/89 (52%), Gaps = 5/89 (5%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
K K EK+ KK+ ++ + + EK+KK +++ E + E + + K+ +K
Sbjct: 63 ALNKLKTRLEKL-KKELEELKQRIAELQAQIEKLKKGREETE----ERTELLEELKQLEK 117
Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ KK K + +K +K + E+++K EE +
Sbjct: 118 ELKKLKAELEKYEKNDPERIEKLKEETKV 146
Score = 31.4 bits (72), Expect = 1.3
Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEK----MRKKKGAEK-MKKKKKDAEKMKKKGAEK 950
+ +++ + +I K KK ++E E+ + + K EK +KK K + EK +K E+
Sbjct: 77 ELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAELEKYEKNDPER 136
Query: 951 IGKKKKKKKKKK 962
I K K++ K K
Sbjct: 137 IEKLKEETKVAK 148
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 41.3 bits (97), Expect = 0.002
Identities = 19/86 (22%), Positives = 44/86 (51%), Gaps = 3/86 (3%)
Query: 905 KKGAEKIRKKKKKKKEAEKMR---KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
+ AE I KK++ K E+++ +KK E ++ +++ E+ + + K+ +++++
Sbjct: 135 QNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRE 194
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAE 987
K KK K++ E + + E E
Sbjct: 195 SKWAILKKLKRRAELGSQQVQGEALE 220
Score = 36.7 bits (85), Expect = 0.056
Identities = 21/101 (20%), Positives = 47/101 (46%), Gaps = 11/101 (10%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK--KDAEKMKKKGAEKIG 952
R + KKK+ + + +++ +K +K E++K+ + E K + E +
Sbjct: 54 KRAQYRSAKKKE----LSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLN 109
Query: 953 KKKKKK-----KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K K+ K+ + K+ +K+ + AE ++KK E +
Sbjct: 110 LEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKN 150
Score = 35.9 bits (83), Expect = 0.10
Identities = 17/88 (19%), Positives = 41/88 (46%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
K + +K K+ + EK + E K+ + AE ++KK +++ K +
Sbjct: 99 IAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFEN 158
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+KK ++ + +++K E++ + + E
Sbjct: 159 EKKLEESLELEREKFEEQLHEANLDLEF 186
Score = 35.9 bits (83), Expect = 0.10
Identities = 15/95 (15%), Positives = 44/95 (46%), Gaps = 10/95 (10%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM-------KKKGAEKIGKKK 955
K K+ ++ + KE +K + AE ++KK+++ + +KK E + ++
Sbjct: 112 KDKELELLEKELDELSKELQKQLQNT-AEIIEKKRENNKNEERLKFENEKKLEESLELER 170
Query: 956 KK--KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K ++ + + K+ +++ E ++ ++
Sbjct: 171 EKFEEQLHEANLDLEFKENEEQRESKWAILKKLKR 205
Score = 35.2 bits (81), Expect = 0.17
Identities = 13/86 (15%), Positives = 41/86 (47%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
+ + + K + + + K+ K+ + +++++K+ +KK++ K
Sbjct: 90 QFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNK 149
Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++ K + +KK E ++ + E+ E+
Sbjct: 150 NEERLKFENEKKLEESLELEREKFEE 175
Score = 32.1 bits (73), Expect = 1.5
Identities = 13/76 (17%), Positives = 36/76 (47%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
+ +E EK +K A+ KKK+ +++++ + ++K ++ K+ + + +
Sbjct: 39 RILNTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDE 98
Query: 973 KKEAEKMKKKTEEAEK 988
+ E ++ E +K
Sbjct: 99 IAKLEALELLNLEKDK 114
Score = 31.3 bits (71), Expect = 2.7
Identities = 16/95 (16%), Positives = 44/95 (46%), Gaps = 1/95 (1%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA-EKMKKKGAEKIGK 953
R T + +K A + R + + K+ E + ++ KK++K+ + K+ +
Sbjct: 38 SRILNTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQD 97
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K + + +K K+ + +++ + ++E +K
Sbjct: 98 EIAKLEALELLNLEKDKELELLEKELDELSKELQK 132
Score = 30.9 bits (70), Expect = 3.2
Identities = 16/95 (16%), Positives = 42/95 (44%), Gaps = 8/95 (8%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK---- 956
+K+ + KKK+ + E++ +K +K ++ + + ++I K +
Sbjct: 50 EANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQF-ELALQDEIAKLEALELL 108
Query: 957 ---KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K+ + +K+ + K+ ++++ E EK
Sbjct: 109 NLEKDKELELLEKELDELSKELQKQLQNTAEIIEK 143
Score = 30.6 bits (69), Expect = 5.2
Identities = 14/95 (14%), Positives = 43/95 (45%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
L + + +KK + + ++ K+ E + E ++ + +K + +++ +
Sbjct: 66 LSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDE 125
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+ +K+ + + +KK++ + ++ E EK
Sbjct: 126 LSKELQKQLQNTAEIIEKKRENNKNEERLKFENEK 160
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 39.4 bits (93), Expect = 0.002
Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 2/69 (2%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
+KKK + ++ E+ K K+ E E K +K +KK +K K K +KKK
Sbjct: 32 AQKKKGKKGELEITDLNEEYKDLKESLEAALLDKKEL--KAWEKAEKKAEKAKAKAEKKK 89
Query: 974 KEAEKMKKK 982
+ E+ K +
Sbjct: 90 AKKEEPKPR 98
Score = 36.4 bits (85), Expect = 0.026
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK+ K +K +KK +K K +AEK K K EE +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 33.3 bits (77), Expect = 0.23
Identities = 23/93 (24%), Positives = 37/93 (39%), Gaps = 20/93 (21%)
Query: 887 VTALGFILFRKKRTRRKKKKGAEKIR--------KKKKKKKEAEKMRKKKGAEKMKKKKK 938
V A+ ++ +KKK ++ K K+ EA + KK+ K +KK
Sbjct: 18 VVAIIAVVGLIVALAQKKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKK 77
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
K K K +KKK KK++ K +
Sbjct: 78 AE------------KAKAKAEKKKAKKEEPKPR 98
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 41.7 bits (98), Expect = 0.002
Identities = 22/94 (23%), Positives = 41/94 (43%), Gaps = 1/94 (1%)
Query: 896 RKKRTRRKKKKGAEKIRKK-KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
+K + K + + K K+KKK +K+ + K ++++ A + +K
Sbjct: 58 KKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRK 117
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K +K KKK K K+ +K+ +E K
Sbjct: 118 PKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIK 151
Score = 39.4 bits (92), Expect = 0.011
Identities = 20/84 (23%), Positives = 30/84 (35%), Gaps = 8/84 (9%)
Query: 910 KIRKKKKKKKEAE--------KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
K+++KKK KK+ K + + K EK+ KK K
Sbjct: 76 KLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKST 135
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEE 985
KKKKK K + + K +
Sbjct: 136 NKKKKKVLSSKDELIKYDNNKPKS 159
Score = 34.8 bits (80), Expect = 0.26
Identities = 24/86 (27%), Positives = 35/86 (40%), Gaps = 7/86 (8%)
Query: 897 KKRTRRKKKKGAEKIRKK------KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK-GAE 949
KK ++KKK +K+ K +++ M+K K EK+KKK
Sbjct: 73 KKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVN 132
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKE 975
K KKKKK K + K K +
Sbjct: 133 KSTNKKKKKVLSSKDELIKYDNNKPK 158
Score = 31.3 bits (71), Expect = 3.4
Identities = 16/56 (28%), Positives = 22/56 (39%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
F + K+ G I +K K + EK++KK K KKK K K
Sbjct: 96 FFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIK 151
Score = 30.2 bits (68), Expect = 6.5
Identities = 24/93 (25%), Positives = 37/93 (39%), Gaps = 7/93 (7%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKM---RKKKGAEKMKKKKK----DAEKMKKKGAEKIGKKK 955
KK EK+ KK KK + + + +K K +K K KKK D +
Sbjct: 48 INKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFA 107
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K K K +K K+ + K T + +K
Sbjct: 108 GPLAISLMRKPKPKTEKLKKKITVNKSTNKKKK 140
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 41.5 bits (98), Expect = 0.002
Identities = 14/62 (22%), Positives = 22/62 (35%), Gaps = 5/62 (8%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
K+KK+K+ ++ KG K AEK+K G E + K
Sbjct: 766 EKQKKEKESKSELEALKGVGA-----KTAEKLKDAGVETVTDLTAADPDAVAAKVDGVSA 820
Query: 972 KK 973
+
Sbjct: 821 DR 822
Score = 41.5 bits (98), Expect = 0.003
Identities = 15/80 (18%), Positives = 27/80 (33%), Gaps = 5/80 (6%)
Query: 907 GAEKIRKKKKKKKEAE--KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
G +R K E K EK KK+K+ +++ K K +K K
Sbjct: 740 GLPLLRVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEA---LKGVGAKTAEKLKDAG 796
Query: 965 KKKKKKKKKKEAEKMKKKTE 984
+ + + + K +
Sbjct: 797 VETVTDLTAADPDAVAAKVD 816
Score = 38.1 bits (89), Expect = 0.024
Identities = 14/75 (18%), Positives = 23/75 (30%), Gaps = 6/75 (8%)
Query: 889 ALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK-KKKKDAEKMKKKG 947
LG K K+KK E + + K K AEK+K + +
Sbjct: 753 ELGCCNNPKCNYTEKQKKEKESKSELEALKGV-----GAKTAEKLKDAGVETVTDLTAAD 807
Query: 948 AEKIGKKKKKKKKKK 962
+ + K +
Sbjct: 808 PDAVAAKVDGVSADR 822
Score = 30.4 bits (69), Expect = 5.8
Identities = 12/34 (35%), Positives = 19/34 (55%)
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
+K+KK+K+ K + + K K AEK+K E
Sbjct: 765 TEKQKKEKESKSELEALKGVGAKTAEKLKDAGVE 798
>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A. This protein is encoded
by an open reading frame in plasmid borne DNA repeats of
Borrelia species. This protein is known as ORF-A. The
function of this putative protein is unknown.
Length = 285
Score = 40.8 bits (96), Expect = 0.002
Identities = 30/88 (34%), Positives = 38/88 (43%), Gaps = 3/88 (3%)
Query: 904 KKKGAEKIRKK-KKKKKEAEKMRKKKGAEKMKKKKKDAEK--MKKKGAEKIGKKKKKKKK 960
KK+ E I K K+KK E K R +K K EK I KK K++ K
Sbjct: 89 KKECYEIINKYFKEKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIKEESK 148
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK K +K K KK +K KT +
Sbjct: 149 KKNSKNIEKLKIKKYFKKCNFKTNLKKD 176
Score = 36.2 bits (84), Expect = 0.066
Identities = 27/90 (30%), Positives = 40/90 (44%), Gaps = 8/90 (8%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD--------AEKMKK 945
L + K+ + K+K + K K+ E + +KK+ + +K + EK K
Sbjct: 192 LIKNKKLKNKQKTLKNILYNLKDKELEKKGYKKKQIEKFFQKIYEKYKNKPHFIIEKNKY 251
Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
K KI K KK KKKKK KK + K
Sbjct: 252 KDLNKIIGKFKKSFKKKKKNSKKNYENIKI 281
Score = 35.0 bits (81), Expect = 0.14
Identities = 30/98 (30%), Positives = 46/98 (46%), Gaps = 12/98 (12%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE----KMKKKKKDAEKMKKKGAEKIG 952
K+ +++K K EK++ KK KK K KK + K+ K +++ KK K
Sbjct: 144 KEESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQK 203
Query: 953 K--------KKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
K K+ +KK KKK+ +K +K EK K K
Sbjct: 204 TLKNILYNLKDKELEKKGYKKKQIEKFFQKIYEKYKNK 241
Score = 33.9 bits (78), Expect = 0.35
Identities = 28/105 (26%), Positives = 44/105 (41%), Gaps = 14/105 (13%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEK---MRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
F++K+ R KK+ +KK K EK + K K K++ +K K +
Sbjct: 100 FKEKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIKEESKKKNSKNIEKLK 159
Query: 952 GKKKKKK-------KKKKKKKKKKKKKKKK----EAEKMKKKTEE 985
KK KK KK K K K K ++ + +K+K K +
Sbjct: 160 IKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQKT 204
Score = 32.7 bits (75), Expect = 0.74
Identities = 19/67 (28%), Positives = 29/67 (43%), Gaps = 8/67 (11%)
Query: 903 KKKKGAEKIRKK-KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
KKK+ + +K +K K + + +K K KD K+ K + KKKK KK
Sbjct: 223 KKKQIEKFFQKIYEKYKNKPHFIIEKN-------KYKDLNKIIGKFKKSFKKKKKNSKKN 275
Query: 962 KKKKKKK 968
+ K
Sbjct: 276 YENIKIN 282
Score = 32.7 bits (75), Expect = 0.92
Identities = 21/79 (26%), Positives = 35/79 (44%), Gaps = 2/79 (2%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
KKK K EK++ KK +K K + +K K + I ++ K KK K K+K
Sbjct: 146 ESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQKTL 205
Query: 972 KK--KEAEKMKKKTEEAEK 988
K + + + + +K
Sbjct: 206 KNILYNLKDKELEKKGYKK 224
Score = 31.2 bits (71), Expect = 2.6
Identities = 27/81 (33%), Positives = 36/81 (44%), Gaps = 8/81 (9%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK-----KKKKKKK 964
K++ K+K K K K EK KKK EK +K K K K +K K K
Sbjct: 197 KLKNKQKTLKNILYNLKDKELEKKGYKKKQIEKFFQK---IYEKYKNKPHFIIEKNKYKD 253
Query: 965 KKKKKKKKKKEAEKMKKKTEE 985
K K KK +K KK +++
Sbjct: 254 LNKIIGKFKKSFKKKKKNSKK 274
>gnl|CDD|225620 COG3078, COG3078, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 169
Score = 39.4 bits (92), Expect = 0.003
Identities = 29/106 (27%), Positives = 48/106 (45%), Gaps = 16/106 (15%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAE-----KMRKKKGAEKMKKKKKDAEKMKKKGAE- 949
R K+TRR + K +K R+K +++ +AE + +K+KG + E K
Sbjct: 3 RSKKTRRPRSKADKKARRKTREELDAEARDRKRQKKRKGLASGSRHSGGNENSGNKQQNQ 62
Query: 950 ----KIGKKKK------KKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
+IG KK +K K+ K K +K K +AE + +E
Sbjct: 63 KKDPRIGSKKPIPLGVTEKVTKQHKPKSEKPKLSPQAELELLENDE 108
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 38.7 bits (91), Expect = 0.003
Identities = 20/79 (25%), Positives = 41/79 (51%), Gaps = 7/79 (8%)
Query: 891 GFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA-----EKMKKKKKDAEKMKK 945
G+ + +RK K ++++K KE +K +KK+ + +KKK++ +++K
Sbjct: 51 GWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKKELENFYRFQIREKKKEELAELRK 110
Query: 946 KGAEKIGKKKKKKKKKKKK 964
K E KK+ ++ K +K
Sbjct: 111 KFEED--KKRIEQLKAARK 127
Score = 36.0 bits (84), Expect = 0.021
Identities = 23/89 (25%), Positives = 39/89 (43%), Gaps = 26/89 (29%)
Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAE------KMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
K++E K K +E D + + +K + K ++++K K+KKKKK
Sbjct: 32 KREEEAKEEAKARSEP------DEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKK 85
Query: 971 K--------------KKKEAEKMKKKTEE 985
K KK+E +++KK EE
Sbjct: 86 KKELENFYRFQIREKKKEELAELRKKFEE 114
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 38.8 bits (91), Expect = 0.003
Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 4/75 (5%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE----KI 951
RK+ ++ KG + +++K+ +E EK + K A + D E +++ + +
Sbjct: 30 RKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDEELKEQERWDDPMAQF 89
Query: 952 GKKKKKKKKKKKKKK 966
+KKK+K KK K
Sbjct: 90 LRKKKEKTDKKGKPL 104
Score = 38.4 bits (90), Expect = 0.004
Identities = 22/96 (22%), Positives = 47/96 (48%), Gaps = 19/96 (19%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG---AEKMKKKKKDAEKMKKKG------ 947
+ +K++ ++ K++K++KE ++ KG E+ +K+ ++ EK K K
Sbjct: 10 RIIDIEEKRE-EKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYAD 68
Query: 948 AEKIGKKKKKKKK---------KKKKKKKKKKKKKK 974
E ++ K++++ +KKK+K KK K
Sbjct: 69 DEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGKPL 104
Score = 35.7 bits (83), Expect = 0.037
Identities = 16/77 (20%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
K + EK +K+ ++ K++K++ EK KG + +++K+ ++ +K K K +
Sbjct: 7 KSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARY 66
Query: 973 K--KEAEKMKKKTEEAE 987
++ ++ K+ E +
Sbjct: 67 ADDEDYDEELKEQERWD 83
Score = 34.6 bits (80), Expect = 0.077
Identities = 14/53 (26%), Positives = 35/53 (66%), Gaps = 2/53 (3%)
Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ D E+ +++ ++ K++K++K++K+K+ K +K+E EK ++ E+A+
Sbjct: 10 RIIDIEEKREE--KEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKN 60
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 41.4 bits (97), Expect = 0.003
Identities = 21/64 (32%), Positives = 33/64 (51%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
+KK E E RKKK EK K+K+ K +K A+ + ++ KK +KK +K
Sbjct: 9 EKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRK 68
Query: 974 KEAE 977
++ E
Sbjct: 69 RDVE 72
Score = 39.1 bits (91), Expect = 0.015
Identities = 20/64 (31%), Positives = 36/64 (56%), Gaps = 4/64 (6%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
KK E++ +KKKK+++A K+K +K+K +K+A+ + G KK +KK
Sbjct: 10 KKILTEEELERKKKKEEKA----KEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65
Query: 963 KKKK 966
+K+
Sbjct: 66 SRKR 69
Score = 38.3 bits (89), Expect = 0.024
Identities = 21/70 (30%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK--KKKKKKEAE 977
E+E +K E++++KKK EK K+K +K+ +K+ K K + ++ KK +
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64
Query: 978 KMKKKTEEAE 987
K +K+ E E
Sbjct: 65 KSRKRDVEDE 74
Score = 34.1 bits (78), Expect = 0.44
Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 2/65 (3%)
Query: 913 KKKKKKKEAEKMRKK--KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
KK ++E E+ +KK K EK KK K A+K K + KK +KK +K+
Sbjct: 10 KKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKR 69
Query: 971 KKKKE 975
+ E
Sbjct: 70 DVEDE 74
Score = 33.7 bits (77), Expect = 0.54
Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 3/57 (5%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
+ RKKKK + K+ KK K A+K K A+ ++ D + KK +K K+
Sbjct: 16 EELERKKKKEEKAKEKELKKLKAAQKEAK---AKLQAQQASDGTNVPKKSEKKSRKR 69
Score = 33.3 bits (76), Expect = 0.86
Identities = 15/43 (34%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKK-KKKEAEKMKKKTEEAE 987
E +KK +++ ++KKKK++K K+KE +K+K +EA+
Sbjct: 2 SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAK 44
Score = 32.6 bits (74), Expect = 1.5
Identities = 16/50 (32%), Positives = 28/50 (56%)
Query: 933 MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+ + +AEK E KKKK++K K+K+ KK K +K+ K++ +
Sbjct: 1 GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQ 50
Score = 31.4 bits (71), Expect = 2.8
Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 3/57 (5%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAE-KMRKKKGAEKMKKKKKDAEKMKKKGAE 949
L RKK+ K K+ K K K +KEA+ K++ ++ ++ KK +K +K+ E
Sbjct: 18 LERKKKKEEKAKEKELK--KLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVE 72
Score = 30.6 bits (69), Expect = 4.9
Identities = 12/35 (34%), Positives = 20/35 (57%)
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++KKKK++K K+K+ KK K + E K +
Sbjct: 19 ERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQAS 53
Score = 30.3 bits (68), Expect = 7.2
Identities = 14/36 (38%), Positives = 26/36 (72%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ ++KKKK++K K+K+ KK K A+K K +A++
Sbjct: 16 EELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQ 51
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 40.4 bits (95), Expect = 0.003
Identities = 21/87 (24%), Positives = 57/87 (65%), Gaps = 3/87 (3%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
T ++++ +KI++ +K+ ++A+K ++ EK+K+ K + +++KKK A +I +K ++
Sbjct: 129 TPEEERELVQKIKELRKELEDAKKALEEN--EKLKELKAEIDELKKK-AREIHEKIQELA 185
Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEEA 986
+ ++ ++ K +EA++++K+ +E
Sbjct: 186 NEAQEYHEEMIKLFEEADELRKEADEL 212
Score = 37.0 bits (86), Expect = 0.043
Identities = 21/112 (18%), Positives = 56/112 (50%), Gaps = 20/112 (17%)
Query: 895 FRKKRT--RRKKKKGAEKIRKKKKK----KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
+KK K ++ A + ++ ++ +EA+++RK+ A+++ ++ + K +
Sbjct: 170 LKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKE--ADELHEEFVELSKKIDELH 227
Query: 949 EKIGKKKKK------------KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E+ + + K+K K+++K+++ K+ AE++ +K + EK
Sbjct: 228 EEFRNLQNELRELEKKIKALRAKEKAAKRREKREELKERAEEIYEKFKRGEK 279
Score = 36.2 bits (84), Expect = 0.063
Identities = 25/119 (21%), Positives = 52/119 (43%), Gaps = 26/119 (21%)
Query: 896 RKKRTRRKKK-----KGAEKIRKKKKKKKEAEKMRKK---------KGAEKMKKKKKDAE 941
RK+ KK K E + + KK+A ++ +K + E+M K ++A+
Sbjct: 144 RKELEDAKKALEENEKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIKLFEEAD 203
Query: 942 KMKKKG------AEKIGKKKKKKKKKKKKKKK------KKKKKKKEAEKMKKKTEEAEK 988
+++K+ ++ KK + ++ + + KK K + EK K+ E+ E+
Sbjct: 204 ELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKAAKRREKREE 262
Score = 34.3 bits (79), Expect = 0.26
Identities = 24/118 (20%), Positives = 55/118 (46%), Gaps = 22/118 (18%)
Query: 893 ILFRKKRTRRKKKKGAEKI-RKKKKKKKEAEKMRKK---------------KGAEKMKKK 936
L ++ R ++K+ R K ++E E++ KK + ++++K+
Sbjct: 87 ELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKE 146
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK------KEAEKMKKKTEEAEK 988
+DA+K ++ + K + + KKK ++ +K ++ + E+M K EEA++
Sbjct: 147 LEDAKKALEENEKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADE 204
Score = 32.7 bits (75), Expect = 0.78
Identities = 19/84 (22%), Positives = 45/84 (53%), Gaps = 7/84 (8%)
Query: 903 KKKKGAEKIRKKKKK-KKEAEKMRKKKGA--EKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
K + A+++RK+ + +E ++ KK E+ + + + +++KK K+K K+
Sbjct: 197 KLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKAAKR 256
Query: 960 KKK----KKKKKKKKKKKKEAEKM 979
++K K++ ++ +K K EK+
Sbjct: 257 REKREELKERAEEIYEKFKRGEKL 280
Score = 32.3 bits (74), Expect = 1.1
Identities = 19/100 (19%), Positives = 51/100 (51%), Gaps = 7/100 (7%)
Query: 895 FRKKRTRRKK-KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
++ + +R + +++RK+ ++ KE G K +++ E+++KK +
Sbjct: 71 VQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSI-KSLEREIERLEKKQQTSVLT 129
Query: 954 KKKKKK-----KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++++ K+ +K+ + KK +E EK+K+ E ++
Sbjct: 130 PEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDE 169
Score = 32.0 bits (73), Expect = 1.5
Identities = 21/116 (18%), Positives = 52/116 (44%), Gaps = 31/116 (26%)
Query: 901 RRKKKKGAEKIRKKKKKKKE-----------AEKMRKK------KGAEKMKKKKK-DAEK 942
R ++ + E++++ K+K+ E ++++K G +++ + +
Sbjct: 61 REERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLE 120
Query: 943 MK-----------KKGAEKIGKKKKKKKKKKKKKKKKKKKK--KKEAEKMKKKTEE 985
K ++ +KI + +K+ + KK ++ +K K K E +++KKK E
Sbjct: 121 KKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKARE 176
Score = 31.6 bits (72), Expect = 1.7
Identities = 17/97 (17%), Positives = 47/97 (48%), Gaps = 7/97 (7%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE----KMKKKGAEKI 951
++ R++ + AEK + K +E +R+K + ++ + + E K K+
Sbjct: 28 KRDELRKEASELAEKRDELNAKVRE---LREKAQELREERDEINEEVQELKEKRDEINAK 84
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ +K+ ++ K+K+ + + + ++++ E EK
Sbjct: 85 LQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEK 121
Score = 31.6 bits (72), Expect = 1.9
Identities = 18/94 (19%), Positives = 44/94 (46%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
K K+K ++ ++ K+K++ + + AEK + ++++K E ++
Sbjct: 5 LDKLDELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREER 64
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ ++ ++ K+K+ + K E K+ E EK
Sbjct: 65 DEINEEVQELKEKRDEINAKLQELRKEYRELKEK 98
Score = 30.8 bits (70), Expect = 3.7
Identities = 22/113 (19%), Positives = 50/113 (44%), Gaps = 25/113 (22%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAE------KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
K ++ EK ++ ++++ E K ++ + K+++ +K+ ++K+K E
Sbjct: 47 NAKVRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGG 106
Query: 955 KKKKKKKKKKKK-------------------KKKKKKKKEAEKMKKKTEEAEK 988
+ K +++ ++ +K K+ +KE E KK EE EK
Sbjct: 107 RSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEK 159
>gnl|CDD|148820 pfam07432, Hc1, Histone H1-like protein Hc1. This family consists
of several bacterial histone H1-like Hc1 proteins. In
Chlamydia, Hc1 is expressed in the late stages of the
life cycle, concomitant with the reorganisation of
chlamydial reticulate bodies into elementary bodies.
This suggests that Hc1 protein plays a role in the
condensation of chromatin during intracellular
differentiation.
Length = 123
Score = 38.2 bits (88), Expect = 0.003
Identities = 30/84 (35%), Positives = 40/84 (47%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
RT K + K+ +K+ K E + KKK A+K K K +K K A+K KK
Sbjct: 35 RTDSIKLEKIAKLYRKESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAKK 94
Query: 959 KKKKKKKKKKKKKKKKEAEKMKKK 982
K K K KK K+ +KM KK
Sbjct: 95 AKAVKAKPATKKTAAKKVKKMSKK 118
Score = 34.7 bits (79), Expect = 0.059
Identities = 29/75 (38%), Positives = 33/75 (44%), Gaps = 2/75 (2%)
Query: 902 RKKKKGAEKIRKKKKK--KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
RK+ AEK KKK KK + + K A K KK A KK A K KK
Sbjct: 49 RKESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKKTA 108
Query: 960 KKKKKKKKKKKKKKK 974
KK KK KK+ K
Sbjct: 109 AKKVKKMSKKRSTAK 123
Score = 32.4 bits (73), Expect = 0.34
Identities = 29/85 (34%), Positives = 42/85 (49%), Gaps = 6/85 (7%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK---MKKKGAEKIGKKKKKKKKKKKK 964
AEK K ++ + ++ +K A+ +K+ AEK MKKK A+K K K KK K
Sbjct: 23 AEKGNKAAAQRVRTDSIKLEKIAKLYRKESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKA 82
Query: 965 KKKK---KKKKKKEAEKMKKKTEEA 986
KK KK +A K K T++
Sbjct: 83 PAKKAAAAPAKKAKAVKAKPATKKT 107
Score = 30.1 bits (67), Expect = 2.5
Identities = 26/76 (34%), Positives = 34/76 (44%), Gaps = 2/76 (2%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK-KKKKKKKKKKKKKKKKKKK 972
K +K +A R + + K++K K K K AEK G KKK KK K K KK
Sbjct: 22 KAEKGNKAAAQRVRTDSIKLEKIAKLYRKESIK-AEKSGLMKKKPAKKAPKAAKAPKKAA 80
Query: 973 KKEAEKMKKKTEEAEK 988
K A+K + K
Sbjct: 81 KAPAKKAAAAPAKKAK 96
Score = 28.9 bits (64), Expect = 5.0
Identities = 26/77 (33%), Positives = 34/77 (44%), Gaps = 6/77 (7%)
Query: 894 LFRKKRTRRKK-----KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE-KMKKKG 947
L+RK+ + +K KK A+K K K K+A K KK A KK K + K K
Sbjct: 47 LYRKESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKK 106
Query: 948 AEKIGKKKKKKKKKKKK 964
KK KK+ K
Sbjct: 107 TAAKKVKKMSKKRSTAK 123
>gnl|CDD|218585 pfam05424, Duffy_binding, Duffy binding domain. This domain is
found in Plasmodium Duffy binding proteins. Plasmodium
vivax and Plasmodium knowlesi merozoites invade human
erythrocytes that express Duffy blood group surface
determinants. The Duffy receptor family is localised in
micronemes, an organelle found in all organisms of the
phylum Apicomplexa.
Length = 276
Score = 40.0 bits (94), Expect = 0.003
Identities = 26/60 (43%), Positives = 34/60 (56%), Gaps = 5/60 (8%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKK-----KKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++KK EK+KKK EK K KK K KK KK KKKE +K KKK ++ +K
Sbjct: 199 RERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKK 258
Score = 38.9 bits (91), Expect = 0.010
Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 915 KKKKKEAEKMRKK-KGAEKMKKKKKDAEKMK---KKGAEKIGKKKKKKKKKKKKKKKKKK 970
+++KK+ EK++KK K + K K K K KK + I KKK+ K+KKK KK K
Sbjct: 199 RERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWI-DNKKKEWDKQKKKYKKYK 257
Query: 971 KKKKEAEKMKKKTEEAEK 988
K+ + K + A++
Sbjct: 258 KRNNKKNYSDIKDKYAKE 275
Score = 36.5 bits (85), Expect = 0.053
Identities = 23/77 (29%), Positives = 36/77 (46%), Gaps = 9/77 (11%)
Query: 902 RKKKKGAEKIRKK-KKKKKEAEKMRKKKG--------AEKMKKKKKDAEKMKKKGAEKIG 952
R++KK EK++KK K+KK +K + + KKK+ +K KKK +
Sbjct: 199 RERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKK 258
Query: 953 KKKKKKKKKKKKKKKKK 969
+ KK K K K+
Sbjct: 259 RNNKKNYSDIKDKYAKE 275
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 39.6 bits (93), Expect = 0.004
Identities = 26/92 (28%), Positives = 50/92 (54%), Gaps = 12/92 (13%)
Query: 878 VNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK 937
+ DVR++ L F L + +R +K+ E+ R + K +E+ R K+ A++M +K
Sbjct: 108 ITSDVRKQ----LRF-LEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEM--QK 160
Query: 938 KDAEKMKKKGAE-----KIGKKKKKKKKKKKK 964
++ E+M+ + A IG +KKKK++ +
Sbjct: 161 EEDEEMRHRAANATALAAIGGRKKKKRRITMR 192
Score = 35.4 bits (82), Expect = 0.097
Identities = 11/49 (22%), Positives = 32/49 (65%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+ ++++ EK ++ E++ + K + ++ + K+K K+ +K+E E+M+ +
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 40.6 bits (95), Expect = 0.004
Identities = 22/51 (43%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
K G K +KK K KKK A GK K KKK K K KK KK+
Sbjct: 738 KYAGTPKTPYEKK--TKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGP 786
Score = 37.1 bits (86), Expect = 0.057
Identities = 15/35 (42%), Positives = 15/35 (42%)
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
EK K KKK K K K KKK K K K
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKK 782
Score = 35.6 bits (82), Expect = 0.16
Identities = 16/50 (32%), Positives = 21/50 (42%)
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
DA K G K +KK K KKK K K K + + K + +K
Sbjct: 733 DAVITKYAGTPKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKK 782
Score = 34.8 bits (80), Expect = 0.29
Identities = 14/44 (31%), Positives = 16/44 (36%)
Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K K +K K K K KKK K K KK + T
Sbjct: 750 KTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPA 793
Score = 32.9 bits (75), Expect = 1.1
Identities = 11/35 (31%), Positives = 15/35 (42%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
K KKK + K + K +K K K K+ G
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGP 786
Score = 32.1 bits (73), Expect = 1.7
Identities = 21/90 (23%), Positives = 35/90 (38%), Gaps = 11/90 (12%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA------EKMKKKGAEKIGKKKK 956
KK K +K K K + K + K ++K KK+ GAE +G+ +
Sbjct: 749 KKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPALAAMIGAEPVGRGEA 808
Query: 957 KKK-----KKKKKKKKKKKKKKKEAEKMKK 981
KK K+ + + KK K++
Sbjct: 809 TKKVWDYIKEHGLQSPENKKLIIPDSKLQG 838
>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
CCCH-type Zn-finger protein [General function prediction
only].
Length = 299
Score = 40.1 bits (93), Expect = 0.004
Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 9/74 (12%)
Query: 924 MRKKKGAEKMKKKKKDAEK-MKKKGAEK-IGKKKKKKKKKKKKKKK-------KKKKKKK 974
M KK A+K ++ K A + M+K+ +K G K K + K + K K++ +KK
Sbjct: 1 MPPKKMAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKK 60
Query: 975 EAEKMKKKTEEAEK 988
E +M++K E EK
Sbjct: 61 EKMRMEEKRREPEK 74
Score = 39.7 bits (92), Expect = 0.006
Identities = 17/77 (22%), Positives = 38/77 (49%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
+K+++ K+A + +K+ +K K K + K + K++ +KK+K + ++
Sbjct: 8 KKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKKEKMRMEE 67
Query: 972 KKKEAEKMKKKTEEAEK 988
K++E EK + K
Sbjct: 68 KRREPEKQVIRAGVDPK 84
Score = 33.5 bits (76), Expect = 0.44
Identities = 25/103 (24%), Positives = 43/103 (41%), Gaps = 17/103 (16%)
Query: 897 KKRTRRKKKKGAEK---IRKKKKKKKEAEKMRK---KKGAEKMKKKKKDAEKMKKKGAEK 950
KK TR +K+ +K ++ K + K +++ E+++KK+K + K++ EK
Sbjct: 15 KKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKKEKMRMEEKRREPEK 74
Query: 951 ----IGKKKK-------KKKKKKKKKKKKKKKKKKEAEKMKKK 982
G K K K K K+EA K +K
Sbjct: 75 QVIRAGVDPKTVVCALFLNKTCAKGDACKFAHGKEEARKTEKP 117
>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7. TMA7 plays
a role in protein translation. Deletions of the TMA7
gene results in altered protein synthesis rates.
Length = 63
Score = 36.0 bits (83), Expect = 0.005
Identities = 22/57 (38%), Positives = 30/57 (52%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
KKK + +K+KK+ E + K+K E+ K K+ A K K KG G KK KK
Sbjct: 7 KKKPLKAPKKQKKELDEEDLAFKQKQKEEAKALKELAAKAKGKGPLGGGGIKKSGKK 63
Score = 28.7 bits (64), Expect = 2.3
Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 2/57 (3%)
Query: 914 KKKKKKEAEKMRKKKGAEKM--KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
KKK K +K +K+ E + K+K+K+ K K+ A K K KK KK
Sbjct: 7 KKKPLKAPKKQKKELDEEDLAFKQKQKEEAKALKELAAKAKGKGPLGGGGIKKSGKK 63
Score = 26.7 bits (59), Expect = 9.7
Identities = 16/44 (36%), Positives = 24/44 (54%), Gaps = 8/44 (18%)
Query: 947 GAEKIGKKKKKKKKK--------KKKKKKKKKKKKKEAEKMKKK 982
G +K K KK+KK+ K+K+K++ K K+ A K K K
Sbjct: 6 GKKKPLKAPKKQKKELDEEDLAFKQKQKEEAKALKELAAKAKGK 49
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 37.4 bits (87), Expect = 0.005
Identities = 23/75 (30%), Positives = 48/75 (64%), Gaps = 2/75 (2%)
Query: 897 KKRTRRKKKKGAEKIRKKK-KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+KR ++ ++ A K R+K+ K +KEAE+ R+ + ++ + K++ E+ +K A K+ KK
Sbjct: 31 EKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKERYEKM-AAKMHAKK 89
Query: 956 KKKKKKKKKKKKKKK 970
++ K+++K+ K K
Sbjct: 90 VERLKRREKRNKLLK 104
Score = 37.0 bits (86), Expect = 0.006
Identities = 24/86 (27%), Positives = 50/86 (58%), Gaps = 7/86 (8%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
KK R K K + + R +K+ +++A K R EK K +K+AE+ ++ + I +++
Sbjct: 18 KKAFRPKSKLTSWEKRMEKRLEQQAIKAR-----EKELKDEKEAERQRRI--QAIKERRA 70
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKK 982
K++K++ +K K K+ E++K++
Sbjct: 71 AKEEKERYEKMAAKMHAKKVERLKRR 96
Score = 32.0 bits (73), Expect = 0.41
Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 8/63 (12%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
L +K R+++ A K R+ K EK R +K A KM KK E++K++ EK K
Sbjct: 50 LKDEKEAERQRRIQAIKERRAAK----EEKERYEKMAAKMHAKK--VERLKRR--EKRNK 101
Query: 954 KKK 956
K
Sbjct: 102 LLK 104
Score = 30.1 bits (68), Expect = 1.6
Identities = 16/82 (19%), Positives = 44/82 (53%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
K KG K K +K+A + + K + + + +K+ ++ K +++ +K+ +++++
Sbjct: 2 KASKGMRVNGKNWKTEKKAFRPKSKLTSWEKRMEKRLEQQAIKAREKELKDEKEAERQRR 61
Query: 963 KKKKKKKKKKKKEAEKMKKKTE 984
+ K+++ K+E E+ +K
Sbjct: 62 IQAIKERRAAKEEKERYEKMAA 83
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 40.4 bits (94), Expect = 0.005
Identities = 20/72 (27%), Positives = 31/72 (43%), Gaps = 2/72 (2%)
Query: 903 KKKKGAEKIRKKKKKK--KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
KK A + K + + K E+ +KK+ KK A+K +K G EK+ KK
Sbjct: 541 YKKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSS 600
Query: 961 KKKKKKKKKKKK 972
K ++ K
Sbjct: 601 AFFAKLQETKAV 612
Score = 37.7 bits (87), Expect = 0.032
Identities = 25/82 (30%), Positives = 34/82 (41%), Gaps = 9/82 (10%)
Query: 909 EKIRKKKK--KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
E+I KK K +K + E R K E+ KKK+K K K KK++K K+K
Sbjct: 538 EEIYKKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKA-------VTKAKKERKIGKEK 590
Query: 967 KKKKKKKKEAEKMKKKTEEAEK 988
KK + K E
Sbjct: 591 VDGVAKKSSSAFFAKLQETKAV 612
Score = 34.2 bits (78), Expect = 0.37
Identities = 17/62 (27%), Positives = 28/62 (45%), Gaps = 3/62 (4%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
R K + + RKKK+K++ A+K K E+ K+K KK + K ++ K
Sbjct: 554 LDRTDKNR---ERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSAFFAKLQETK 610
Query: 959 KK 960
Sbjct: 611 AV 612
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 37.2 bits (86), Expect = 0.005
Identities = 17/49 (34%), Positives = 23/49 (46%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
K K KK K G K KKK KKKKK++ ++ E++ K
Sbjct: 1 MAPKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDK 49
Score = 29.5 bits (66), Expect = 2.5
Identities = 14/57 (24%), Positives = 29/57 (50%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
K K +K +K K K +KK +K K++ + A + ++ +KI K+ +++
Sbjct: 2 APKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIRKEVPRER 58
Score = 29.1 bits (65), Expect = 3.6
Identities = 12/37 (32%), Positives = 18/37 (48%)
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
+ K+KK +K K K KKK +K K++ A
Sbjct: 3 PKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRA 39
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many hypothetical
proteins.
Length = 98
Score = 37.1 bits (86), Expect = 0.006
Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 2/56 (3%)
Query: 935 KKKKDAEKMKKKGAEKI--GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K++K EK K+ + I KK+KKKKK+ K + K KK +K KKK E E
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEF 56
Score = 35.9 bits (83), Expect = 0.013
Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
K++K+ EK K+ + KK +K K+ K KK +KK KKK + + +
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPE-- 58
Query: 976 AEKMKKKTEE 985
E +++TE+
Sbjct: 59 -ESKRRRTED 67
Score = 33.6 bits (77), Expect = 0.086
Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 4/61 (6%)
Query: 932 KMKKKKKDAEKMK----KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K +KK + +K +K KKKK+ K + K KK +KK + + + E
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60
Query: 988 K 988
K
Sbjct: 61 K 61
Score = 29.0 bits (65), Expect = 3.2
Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 5/76 (6%)
Query: 914 KKKKKKEAEKMRK----KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
K++KK E R +K KKKKK K + A K +KK KKK + + ++
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKK-GQKKDKKKDEFPEFPEE 59
Query: 970 KKKKKEAEKMKKKTEE 985
K+++ + + TEE
Sbjct: 60 SKRRRTEDGLPIYTEE 75
Score = 28.2 bits (63), Expect = 6.4
Identities = 14/38 (36%), Positives = 19/38 (50%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
+KKKKK+ A+ R K + KK KK E + K
Sbjct: 24 RKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESK 61
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 39.9 bits (93), Expect = 0.006
Identities = 16/58 (27%), Positives = 32/58 (55%)
Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
+K+ + A+ ++ +D+E+ +++ E+ G KK KK KK K KK K +++
Sbjct: 300 REDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSD 357
Score = 37.6 bits (87), Expect = 0.032
Identities = 24/95 (25%), Positives = 43/95 (45%), Gaps = 5/95 (5%)
Query: 888 TALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
TA GF L K + ++ + + +K +G + K + ++
Sbjct: 174 TANGFQLMMMKAAKNGPAAFGDEDEETEGEKGG-----GGRGKDLKIKDLEGDDEDDGDE 228
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
++K G+ ++K KKKKKK K KKK + +K K+
Sbjct: 229 SDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263
Score = 37.6 bits (87), Expect = 0.032
Identities = 18/110 (16%), Positives = 48/110 (43%), Gaps = 17/110 (15%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG-----------------AEKMKKKKK 938
+KK+ + KKK + + K+ +A++ G + +++ K
Sbjct: 244 KKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDK 303
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ ++ K + + ++ +++K +++ KK K+ +K+K K +K
Sbjct: 304 LSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDK 353
Score = 35.3 bits (81), Expect = 0.16
Identities = 17/90 (18%), Positives = 32/90 (35%), Gaps = 21/90 (23%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAE---KMKKKKKDAEKMKKKGAEKIGKKKKK- 957
R+ K E K + ++ E + +++ E + KK K +K+K K
Sbjct: 300 REDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSG 359
Query: 958 -----------------KKKKKKKKKKKKK 970
KK+K+ KK++
Sbjct: 360 DDSDDSDIDGEDSVSLVTAKKQKEPKKEEP 389
Score = 34.6 bits (79), Expect = 0.27
Identities = 16/71 (22%), Positives = 28/71 (39%), Gaps = 4/71 (5%)
Query: 907 GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
G + ++ K E K + + +++E+ K + + KK KK KK K KK
Sbjct: 294 GNDPEEREDKLSPEIPA----KPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKN 349
Query: 967 KKKKKKKKEAE 977
K +
Sbjct: 350 GLDKDDSDSGD 360
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 37.2 bits (87), Expect = 0.006
Identities = 15/82 (18%), Positives = 41/82 (50%), Gaps = 9/82 (10%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK-KKKKKKK 966
EK+RK+ + +++ + E+ ++ + + E+++ + + + K+ K KKKK
Sbjct: 15 KEKLRKRGVDVLDVDELLEL--DEERRELQVELEELQAE------RNELSKEIGKAKKKK 66
Query: 967 KKKKKKKKEAEKMKKKTEEAEK 988
+ + E +++K + + E
Sbjct: 67 EDAEALIAEVKELKDELKALEA 88
Score = 33.7 bits (78), Expect = 0.090
Identities = 16/89 (17%), Positives = 41/89 (46%), Gaps = 10/89 (11%)
Query: 894 LFRKK-RTRRKKKKGAEKI----RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
++K R R +++ ++++ + E E+++ ++ + K+ K KKK
Sbjct: 13 AVKEKLRKRGVDVLDVDELLELDEERRELQVELEELQAER-----NELSKEIGKAKKKKE 67
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
+ + K+ K + K + + ++ EAE
Sbjct: 68 DAEALIAEVKELKDELKALEAELRELEAE 96
Score = 30.3 bits (69), Expect = 1.7
Identities = 15/63 (23%), Positives = 33/63 (52%), Gaps = 9/63 (14%)
Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
D+ RRE + + + ++ + +++I K KKKK++AE + + +++K + K
Sbjct: 35 DEERRE-------LQVELEELQAERNELSKEIGKAKKKKEDAEALIAE--VKELKDELKA 85
Query: 940 AEK 942
E
Sbjct: 86 LEA 88
>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
domain [Chromatin structure and dynamics].
Length = 211
Score = 38.7 bits (90), Expect = 0.007
Identities = 19/85 (22%), Positives = 34/85 (40%), Gaps = 5/85 (5%)
Query: 897 KKRTRRKKKKGAEKIRKKK---KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
++R +R+K++ +K+ K + K+R K K E+ K
Sbjct: 128 RERYQREKEEYNKKLPNKAPIGPFIENEPKIRPKVEGPS--PDKALVEETKIISKAWSEL 185
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEK 978
+ KKKK K KK K++ +
Sbjct: 186 DESKKKKYIDKYKKLKEEYDSFYPE 210
Score = 33.3 bits (76), Expect = 0.41
Identities = 22/91 (24%), Positives = 33/91 (36%), Gaps = 11/91 (12%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK------ 962
EK K+ + E+ +++K K K + KI K + K
Sbjct: 116 EKEPYYKEANSDRERYQREKEEYNKKLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEET 175
Query: 963 KKKKKK-----KKKKKKEAEKMKKKTEEAEK 988
K K + KKKK +K KK EE +
Sbjct: 176 KIISKAWSELDESKKKKYIDKYKKLKEEYDS 206
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 40.2 bits (94), Expect = 0.007
Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 3/79 (3%)
Query: 896 RKKRTR---RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
R+K R R + A+K+ ++ KK +K KK + ++ + M+ + ++
Sbjct: 1179 REKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVV 1238
Query: 953 KKKKKKKKKKKKKKKKKKK 971
K K + KKK K+K
Sbjct: 1239 KPKGRAGAKKKAPAAAKEK 1257
Score = 39.1 bits (91), Expect = 0.016
Identities = 24/99 (24%), Positives = 41/99 (41%), Gaps = 13/99 (13%)
Query: 901 RRKKKKGAEKIRKKKKK----------KKEAEKMRKK--KGAEKMKKKKKDAEKMKKKGA 948
R K E ++K K +KE +K+ K+ K E +K ++ A + + A
Sbjct: 1135 RDKLNIEVEDLKKTTPKSLWLKDLDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAA 1194
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+K+ ++ KK KK KK + + E E E
Sbjct: 1195 KKV-SRQAPKKPAPKKTTKKASESETTEETYGSSAMETE 1232
Score = 35.6 bits (82), Expect = 0.17
Identities = 21/89 (23%), Positives = 37/89 (41%), Gaps = 10/89 (11%)
Query: 909 EKIRKKKKKKKEAEKMRKK-------KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
E + K E E ++K K + ++K+ +K K E K ++ +
Sbjct: 1130 ELCADRDKLNIEVEDLKKTTPKSLWLKDLDALEKELDKLDKEDAKAEEAREKLQRAAARG 1189
Query: 962 KKK--KKKKKKKKKKEA-EKMKKKTEEAE 987
+ KK ++ KK A +K KK E+E
Sbjct: 1190 ESGAAKKVSRQAPKKPAPKKTTKKASESE 1218
Score = 35.2 bits (81), Expect = 0.24
Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 14/83 (16%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
KK +R+ KK A +K KK E+E + G+ M + + AE +K KG
Sbjct: 1195 KKVSRQAPKKPAP--KKTTKKASESETTEETYGSSAM-ETENVAEVVKPKG--------- 1242
Query: 957 KKKKKKKKKKKKKKKKKKEAEKM 979
+ KKK K+K+E +++
Sbjct: 1243 --RAGAKKKAPAAAKEKEEEDEI 1263
Score = 34.8 bits (80), Expect = 0.31
Identities = 25/89 (28%), Positives = 37/89 (41%), Gaps = 5/89 (5%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
R KKKG K KK + KK+G ++ +K +M K AE IG +KK +K
Sbjct: 1329 RLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEM-LKPAEAIGISPEKKVRK 1387
Query: 962 KKKKKKKKKK----KKKEAEKMKKKTEEA 986
+ KK + K + +E
Sbjct: 1388 MRASPFNKKSGSVLGRAATNKETESSENV 1416
Score = 32.5 bits (74), Expect = 1.5
Identities = 16/92 (17%), Positives = 37/92 (40%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ + R K ++ A + KK + +K + KK + + G+ + +
Sbjct: 1174 KAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETEN 1233
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+ K K + KKK A++ +++ E +
Sbjct: 1234 VAEVVKPKGRAGAKKKAPAAAKEKEEEDEILD 1265
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 39.6 bits (93), Expect = 0.008
Identities = 17/90 (18%), Positives = 24/90 (26%), Gaps = 3/90 (3%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ K E + K + K + + K A K
Sbjct: 522 EELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAKVEAVDEVKFL 581
Query: 956 KKKKKKKKKK---KKKKKKKKKEAEKMKKK 982
KKKK K KK K + KK+
Sbjct: 582 KKKKGILDAGAFESTLKDKKNKVLPEAKKR 611
>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated 2).
NSA2 appears to be a protein required for the
maturation of 27S pre-rRNA in yeast; it has been
characterized in mammalian cells as a nucleolar protein
that might play a role in the regulation of the cell
cycle and in cell proliferation.
Length = 257
Score = 38.8 bits (91), Expect = 0.008
Identities = 28/89 (31%), Positives = 44/89 (49%), Gaps = 18/89 (20%)
Query: 894 LFRKKRTRR------KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
L RK+ RR K+KK A R+ K+ K+A+K+R K K+ KK+ EK++
Sbjct: 6 LHRKRHGRRLDHEERKRKKEA---REPHKRSKKAQKLRGLKA--KLYNKKRYKEKIQM-- 58
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
KK K +++ K+K K + A
Sbjct: 59 -----KKTIKMHEERNVKQKVDDKVPEGA 82
Score = 36.8 bits (86), Expect = 0.039
Identities = 23/78 (29%), Positives = 40/78 (51%), Gaps = 11/78 (14%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM---------KKKGAEKIGKKKKKKK 959
E RK+ ++ + E+ ++KK A + K+ K A+K+ KK+ EKI + KK
Sbjct: 5 ELHRKRHGRRLDHEERKRKKEAREPHKRSKKAQKLRGLKAKLYNKKRYKEKI--QMKKTI 62
Query: 960 KKKKKKKKKKKKKKKEAE 977
K +++ K+K K E
Sbjct: 63 KMHEERNVKQKVDDKVPE 80
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 39.5 bits (93), Expect = 0.008
Identities = 14/84 (16%), Positives = 46/84 (54%), Gaps = 5/84 (5%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM---KKKGAEKIGKKKKKKKKKKKK 964
EK++K+ + +K+ + E+ +K ++ E++ + + +++IG+ K+ + ++
Sbjct: 15 REKLKKRGGDALDVDKLLELD--EERRKLLRELEELQAERNELSKEIGRALKRGEDDAEE 72
Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
+ K+ K++ ++++ +E E
Sbjct: 73 LIAEVKELKEKLKELEAALDELEA 96
Score = 38.3 bits (90), Expect = 0.017
Identities = 10/89 (11%), Positives = 38/89 (42%), Gaps = 5/89 (5%)
Query: 894 LFRKK-RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
R+K + R +K+ + +++++ ++ E ++ + ++++ +
Sbjct: 13 AVREKLKKRGGDALDVDKLLELDEERRK----LLRELEELQAERNELSKEIGRALKRGED 68
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
++ + K+ K+K K+ + E +
Sbjct: 69 DAEELIAEVKELKEKLKELEAALDELEAE 97
Score = 32.6 bits (75), Expect = 1.1
Identities = 12/86 (13%), Positives = 36/86 (41%), Gaps = 9/86 (10%)
Query: 883 RREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK 942
R + +L + R+ ++ E ++ + KE +K+ + DAE+
Sbjct: 21 RGGDALDVDKLLELDEERRKLLRELEELQAERNELSKE--------IGRALKRGEDDAEE 72
Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKK 968
+ + +++ +K K+ + + + +
Sbjct: 73 LIAEV-KELKEKLKELEAALDELEAE 97
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 38.0 bits (89), Expect = 0.009
Identities = 19/83 (22%), Positives = 47/83 (56%), Gaps = 4/83 (4%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
KK A+K +K+K+K++ K +K + + K+ AE+ K + AE+ ++ ++++ + +
Sbjct: 14 KKKAKKAKKEKRKQR---KQARKGADDGDDELKQAAEEAKAEKAER-DRELNRQRQAEAE 69
Query: 965 KKKKKKKKKKEAEKMKKKTEEAE 987
+K + K+ E+ + ++ E
Sbjct: 70 QKAIVAQIKQLIEQNRIDRKDGE 92
Score = 35.3 bits (82), Expect = 0.078
Identities = 13/35 (37%), Positives = 23/35 (65%)
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KKK KK KK+K+K++K+ +K + + + AE+
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEE 48
Score = 34.5 bits (80), Expect = 0.14
Identities = 18/45 (40%), Positives = 28/45 (62%), Gaps = 11/45 (24%)
Query: 953 KKKKKKKKKKKKKKKKKKKK---------KKEAEKMKKKTEEAEK 988
KKK KK KK+K+K++K+ +K K+ AE + K E+AE+
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAE--EAKAEKAER 56
Score = 33.7 bits (78), Expect = 0.25
Identities = 17/63 (26%), Positives = 34/63 (53%), Gaps = 14/63 (22%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKK--------KKKKKKKKKKKKKKEAEKMKKKTEE 985
KKK K A+K K+K ++K+ +K K+ ++ K +K +++ E +++ E
Sbjct: 14 KKKAKKAKKEKRK------QRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAE 67
Query: 986 AEK 988
AE+
Sbjct: 68 AEQ 70
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 35.8 bits (83), Expect = 0.009
Identities = 16/40 (40%), Positives = 21/40 (52%)
Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
K A+ + KKKKKKKKK + K KK AE + +
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56
Score = 35.4 bits (82), Expect = 0.014
Identities = 17/49 (34%), Positives = 21/49 (42%), Gaps = 6/49 (12%)
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K +A+ +KKK KKKKKKK + K KK E E
Sbjct: 17 KTEADLVKKK------KKKKKKKAEDTAATAKAKKATAEDVSEGAMFME 59
Score = 34.3 bits (79), Expect = 0.040
Identities = 13/35 (37%), Positives = 17/35 (48%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
KKKKKKKKKK + K + + +E A
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56
Score = 34.3 bits (79), Expect = 0.040
Identities = 13/32 (40%), Positives = 17/32 (53%)
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KKKKKKKKKK + A+ K E+ +
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKATAEDVSE 53
Score = 33.5 bits (77), Expect = 0.057
Identities = 17/38 (44%), Positives = 20/38 (52%), Gaps = 1/38 (2%)
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+G K + KKKKKKKKKK + A K K AE
Sbjct: 14 LGPKTEADLVKKKKKKKKKKAEDTAATA-KAKKATAED 50
Score = 33.5 bits (77), Expect = 0.066
Identities = 16/36 (44%), Positives = 18/36 (50%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + KKKKKKKKKK + K KK T E
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVS 52
Score = 30.0 bits (68), Expect = 1.1
Identities = 13/48 (27%), Positives = 18/48 (37%), Gaps = 4/48 (8%)
Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K + KK KKKKKK + K KK ++ + E
Sbjct: 17 KTEADLVKKKK----KKKKKKAEDTAATAKAKKATAEDVSEGAMFMEG 60
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 38.9 bits (91), Expect = 0.010
Identities = 28/84 (33%), Positives = 39/84 (46%), Gaps = 4/84 (4%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK-- 960
K KK K K E M KK A + K KKK +K+ KK K KK
Sbjct: 219 KYKKQFSKYIKAGVGPDSLEDMYKK--AHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLT 276
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTE 984
K++K + K KK + E+++KK +
Sbjct: 277 GKQRKARVKAKKAQRRERLQKKIK 300
Score = 38.1 bits (89), Expect = 0.016
Identities = 36/101 (35%), Positives = 43/101 (42%), Gaps = 15/101 (14%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK------------KDAEKMKKKGAEK 950
K AE R + K AE M++ K + K KK E M KK
Sbjct: 188 KDSYDAEVHRDRIFGKHVAEYMKQLKEEDPDKYKKQFSKYIKAGVGPDSLEDMYKKAHAA 247
Query: 951 IGKKKKKKKKKKKKKKK---KKKKKKKEAEKMKKKTEEAEK 988
I K KKKKKKKKK KK K KK K +K +A+K
Sbjct: 248 IRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKK 288
Score = 30.0 bits (68), Expect = 5.6
Identities = 21/61 (34%), Positives = 33/61 (54%), Gaps = 9/61 (14%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
R ++ KKK KKKKKK +K + KK K +K + A+K +++ E++ KK
Sbjct: 249 RANPSKVKKK-------KKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRR--ERLQKKI 299
Query: 956 K 956
K
Sbjct: 300 K 300
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 39.3 bits (92), Expect = 0.011
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 6/89 (6%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
K K + AE+I K K ++ K RKK K +D E+ KK+ E I KK K
Sbjct: 156 KGEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNK-----QDLEEYKKR-LEAIKKKDIK 209
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
+ +K K++ K KKE + K+K + A
Sbjct: 210 NPLELQKIKEEFDKLKKEGKADKQKIKSA 238
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 39.4 bits (92), Expect = 0.012
Identities = 26/87 (29%), Positives = 44/87 (50%), Gaps = 3/87 (3%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK-KKKKKK 961
+K+ +K +K KE EK+ KK E+ ++ K+ E+ KK EK ++ K KK+
Sbjct: 523 LEKELEQKNEHLEKLLKEQEKL--KKELEQEMEELKERERNKKLELEKEAQEALKALKKE 580
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ ++ K+KK K K E+ K
Sbjct: 581 VESIIRELKEKKIHKAKEIKSIEDLVK 607
Score = 38.6 bits (90), Expect = 0.016
Identities = 24/96 (25%), Positives = 46/96 (47%), Gaps = 8/96 (8%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK--------MKK 945
L ++ +K + EK+ K+++K K+ + ++ E+ + KK + EK +KK
Sbjct: 520 LSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKK 579
Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
+ I + K+KK K K+ K + K + K K
Sbjct: 580 EVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKI 615
Score = 38.6 bits (90), Expect = 0.019
Identities = 26/91 (28%), Positives = 48/91 (52%), Gaps = 4/91 (4%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K K K EK++K+ +++ E K R++ +++K+ ++A K KK E I ++ K
Sbjct: 530 KNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELK 589
Query: 957 KKKKKKKKKKKK----KKKKKKEAEKMKKKT 983
+KK K K+ K K K+ + + +K T
Sbjct: 590 EKKIHKAKEIKSIEDLVKLKETKQKIPQKPT 620
Score = 36.7 bits (85), Expect = 0.071
Identities = 25/97 (25%), Positives = 48/97 (49%), Gaps = 15/97 (15%)
Query: 901 RRKKKKGAEK------IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
+ K G K I K +KE E+ E ++K K+ EK+KK+ +++ +
Sbjct: 501 QAKTFYGEFKEEINVLIEKLSALEKELEQK-----NEHLEKLLKEQEKLKKELEQEMEEL 555
Query: 955 KKKKKKKKKKKKK----KKKKKKKEAEKMKKKTEEAE 987
K++++ KK + +K K KKE E + ++ +E +
Sbjct: 556 KERERNKKLELEKEAQEALKALKKEVESIIRELKEKK 592
Score = 32.5 bits (74), Expect = 1.3
Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 9/85 (10%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE-KIGKKKKKKKKKKK 963
++ E +++ KK E E K+ E +K KK+ E + ++ E KI K K+ K +
Sbjct: 550 QEMEELKERERNKKLELE----KEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDL 605
Query: 964 KKKKKKKKKKKEAEKMKKKTEEAEK 988
K K+ K+K + K +A+K
Sbjct: 606 VKLKETKQKIPQ----KPTNFQADK 626
Score = 32.5 bits (74), Expect = 1.5
Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 10/87 (11%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK---------K 961
I ++ K K EK+ +K+ E+ K + EK+ K+++K KK K
Sbjct: 498 IIEQAKTFYGEFKEEINVLIEKLSALEKELEQ-KNEHLEKLLKEQEKLKKELEQEMEELK 556
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++++ KK + +K+ E +K +E E
Sbjct: 557 ERERNKKLELEKEAQEALKALKKEVES 583
Score = 30.6 bits (69), Expect = 5.9
Identities = 20/86 (23%), Positives = 41/86 (47%), Gaps = 8/86 (9%)
Query: 910 KIRKKKKKKK------EAEKMRKKKGAEKMK--KKKKDAEKMKKKGAEKIGKKKKKKKKK 961
K R++ KK + EA K KK+ ++ K+KK + + K E + K K+ K+K
Sbjct: 556 KERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKI 615
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAE 987
+K + K + +++ ++ +
Sbjct: 616 PQKPTNFQADKIGDKVRIRYFGQKGK 641
Score = 29.8 bits (67), Expect = 9.5
Identities = 27/96 (28%), Positives = 42/96 (43%), Gaps = 10/96 (10%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK----KKKDAEKMKKKGAEKIGKKKKK 957
+ KK E I ++ K+KK K ++ K E + K K+K +K A+KIG K +
Sbjct: 575 KALKKEVESIIRELKEKKI-HKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDKVRI 633
Query: 958 KKKKKKKKKKKKKKKKKEAE-----KMKKKTEEAEK 988
+ +K K + K +MK E EK
Sbjct: 634 RYFGQKGKIVQILGGNKWNVTVGGMRMKVHGSELEK 669
>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit. This family includes
prefoldin subunits that are not detected by pfam02996.
Length = 106
Score = 36.1 bits (84), Expect = 0.012
Identities = 19/103 (18%), Positives = 55/103 (53%), Gaps = 13/103 (12%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAE-------------KMRKKKGAEKMKKKKKD 939
++ + ++ +++ + A++I++ + + KE E K+ K G +K+ K++
Sbjct: 3 LINKFQQLQQQLQLLAQQIKQLETQLKEIELVLEELELLDEDTKVYKLIGDVLVKQDKEE 62
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
++ ++ E + K+ K +K+ +K +K+ ++ K+E K+ +
Sbjct: 63 VKEELEERKETLEKEIKTLEKQLEKLEKELEELKEELYKLFGQ 105
>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
Length = 212
Score = 37.8 bits (88), Expect = 0.013
Identities = 20/87 (22%), Positives = 38/87 (43%), Gaps = 4/87 (4%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK- 957
R +K KKG I K++ K+ ++ ++ ++ K + K +
Sbjct: 126 RKGKKPKKGFAGIPADTSKEEVVALPVKQNKNSEVIPFQRTPKREKARVITKEERAFNAY 185
Query: 958 ---KKKKKKKKKKKKKKKKKEAEKMKK 981
++ K K KK+KKK+A++ KK
Sbjct: 186 RTLRQAKLNAKFVGKKEKKKQAKEKKK 212
Score = 33.2 bits (76), Expect = 0.51
Identities = 22/93 (23%), Positives = 37/93 (39%), Gaps = 11/93 (11%)
Query: 891 GFILFRKKRTRRKK----------KKGAEKIRKKKKKKKEAEKMRKKKGAEK-MKKKKKD 939
+LF +K + KK K+ + K+ K E ++ EK K++
Sbjct: 120 KLVLFPRKGKKPKKGFAGIPADTSKEEVVALPVKQNKNSEVIPFQRTPKREKARVITKEE 179
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
+ + K KK+KKK+ K+KKK
Sbjct: 180 RAFNAYRTLRQAKLNAKFVGKKEKKKQAKEKKK 212
>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020). This
family of fungal proteins is conserved towards the
C-terminus of HMG domains. The function is not known.
Length = 49
Score = 34.5 bits (79), Expect = 0.014
Identities = 16/48 (33%), Positives = 22/48 (45%)
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+ KKK E ++ K + + KKK KKK KE K K + E
Sbjct: 1 NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDAYIDDE 48
Score = 31.0 bits (70), Expect = 0.19
Identities = 13/46 (28%), Positives = 23/46 (50%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
++KKK E ++ + + + KKK KKK K+ K+K +
Sbjct: 1 NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDAYID 46
Score = 29.5 bits (66), Expect = 0.85
Identities = 15/48 (31%), Positives = 24/48 (50%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
+ KKK ++A + K + + KKK KKK K+ K+K + E
Sbjct: 1 NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDAYIDDE 48
Score = 26.8 bits (59), Expect = 7.5
Identities = 13/35 (37%), Positives = 20/35 (57%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
R++KKK +EA R K + + KKK +K K+
Sbjct: 2 RERKKKWREANSERNKDNDLRSRVKKKAKKKFGKE 36
>gnl|CDD|218481 pfam05177, RCSD, RCSD region. Proteins contain this region include
C.elegans UNC-89. This region is found repeated in
UNC-89 and shows conservation in prolines, lysines and
glutamic acids. Proteins with RCSD are involved in
muscle M-line assembly, but the function of this region
RCSD is not clear.
Length = 101
Score = 36.0 bits (82), Expect = 0.014
Identities = 22/85 (25%), Positives = 42/85 (49%), Gaps = 5/85 (5%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
+ ++ + E+ + +KK+K EK+ K K K + +K E++ KK+K
Sbjct: 22 KTEEDRATEEAKSPEKKEKSPEKVD-----GKPKSPTKKEKSPEKSATEEVKSPTKKEKS 76
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEE 985
+K ++K KKE + K+K+ E
Sbjct: 77 PEKVEEKPASPTKKERVQKKEKSPE 101
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 39.0 bits (90), Expect = 0.014
Identities = 21/86 (24%), Positives = 44/86 (51%), Gaps = 2/86 (2%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
KR+ + E + ++K K R K K +++K+ E +++ E+IGKK+K
Sbjct: 992 DKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEK 1051
Query: 957 KKKKKKKKKKKK--KKKKKKEAEKMK 980
+++++ +K K+ KK +K +
Sbjct: 1052 EREQRIRKTIHDNYKEMAKKRLKKKR 1077
Score = 34.3 bits (78), Expect = 0.41
Identities = 14/80 (17%), Positives = 38/80 (47%), Gaps = 10/80 (12%)
Query: 913 KKKKKKKEAEKMRKKKGAEK----------MKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
K+ + ++ E +K++ KD E+ ++ + + K+++ KK+K
Sbjct: 992 DKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEK 1051
Query: 963 KKKKKKKKKKKKEAEKMKKK 982
+++++ +K ++M KK
Sbjct: 1052 EREQRIRKTIHDNYKEMAKK 1071
Score = 33.2 bits (75), Expect = 0.82
Identities = 16/68 (23%), Positives = 40/68 (58%), Gaps = 1/68 (1%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+++ K + E+I+ +++K++ E +++ K E KK+K+ ++++K + +
Sbjct: 1011 CREKHEIKDRIVKERIKDQEEKERM-ESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMA 1069
Query: 956 KKKKKKKK 963
KK+ KKK+
Sbjct: 1070 KKRLKKKR 1077
Score = 30.5 bits (68), Expect = 6.6
Identities = 12/79 (15%), Positives = 41/79 (51%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
+ ++ + K +K + +++ + K + ++I K++ K +++K++ + +
Sbjct: 980 SLPREIESKLPLDKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQ 1039
Query: 970 KKKKKEAEKMKKKTEEAEK 988
+ K++E K +K+ E+ +
Sbjct: 1040 RAKEEEIGKKEKEREQRIR 1058
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 35.7 bits (83), Expect = 0.015
Identities = 9/34 (26%), Positives = 24/34 (70%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
K+++K +++++ K+ K K+++ EK++K + A
Sbjct: 4 KEEEKAQREEELKRLKNLKREEIEEKLEKIKKVA 37
Score = 32.2 bits (74), Expect = 0.30
Identities = 14/53 (26%), Positives = 28/53 (52%), Gaps = 5/53 (9%)
Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K++K+ EK +++ E K+ K K+++ ++K +K KK A E +
Sbjct: 1 KERKEEEKAQRE-EEL----KRLKNLKREEIEEKLEKIKKVAGLRGADLSEED 48
Score = 31.1 bits (71), Expect = 0.67
Identities = 10/32 (31%), Positives = 23/32 (71%), Gaps = 2/32 (6%)
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKD--AEKMKK 945
K++KE EK ++++ +++K K++ EK++K
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEKLEK 32
Score = 30.7 bits (70), Expect = 1.0
Identities = 14/45 (31%), Positives = 23/45 (51%), Gaps = 10/45 (22%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA 940
RK+ + +++ E R K K++E E EK++K KK A
Sbjct: 3 RKEEEKAQRE--EELKRLKNLKREEIE--------EKLEKIKKVA 37
Score = 28.4 bits (64), Expect = 5.0
Identities = 8/32 (25%), Positives = 23/32 (71%)
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K++K+++K +++++ K+ + K ++ E+ EK
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEKLEK 32
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 36.4 bits (84), Expect = 0.016
Identities = 15/22 (68%), Positives = 16/22 (72%)
Query: 954 KKKKKKKKKKKKKKKKKKKKKE 975
K KKKKKK KK KK KK KK+
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 36.0 bits (83), Expect = 0.017
Identities = 15/23 (65%), Positives = 15/23 (65%)
Query: 949 EKIGKKKKKKKKKKKKKKKKKKK 971
EK KKKKK KK KK KK KK
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKKD 120
Score = 36.0 bits (83), Expect = 0.022
Identities = 15/22 (68%), Positives = 15/22 (68%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKK 974
K KKKKKK KK KK KK KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 33.7 bits (77), Expect = 0.12
Identities = 14/21 (66%), Positives = 16/21 (76%)
Query: 958 KKKKKKKKKKKKKKKKKEAEK 978
K KKKKKK KK KK KK ++K
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKK 119
Score = 32.9 bits (75), Expect = 0.21
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 956 KKKKKKKKKKKKKKKKKKKEAE 977
K KKKKKK KK KK KK + +
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 32.2 bits (73), Expect = 0.39
Identities = 13/22 (59%), Positives = 14/22 (63%)
Query: 957 KKKKKKKKKKKKKKKKKKEAEK 978
K KKKKKK KK KK KK +
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 30.2 bits (68), Expect = 1.8
Identities = 12/22 (54%), Positives = 13/22 (59%)
Query: 960 KKKKKKKKKKKKKKKEAEKMKK 981
K KKKKKK KK KK + K
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 29.5 bits (66), Expect = 3.5
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 959 KKKKKKKKKKKKKKKKEAEKMK 980
K KKKKKK KK KK K++ K
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 29.5 bits (66), Expect = 3.9
Identities = 13/22 (59%), Positives = 14/22 (63%)
Query: 961 KKKKKKKKKKKKKKEAEKMKKK 982
K KKKKKK KK KK + KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 29.1 bits (65), Expect = 4.9
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 965 KKKKKKKKKKEAEKMKKKTEE 985
K KKKKKK K+ +K KK +++
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKK 119
>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
Length = 801
Score = 38.6 bits (90), Expect = 0.016
Identities = 18/63 (28%), Positives = 29/63 (46%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
K+ K +E E R+K + + + KK A+++ KK KKK+ KK K
Sbjct: 570 KELKDEEVEAYREKFAGSQADRAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGK 629
Query: 974 KEA 976
+A
Sbjct: 630 SKA 632
Score = 32.1 bits (73), Expect = 2.1
Identities = 14/57 (24%), Positives = 25/57 (43%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
+ KK A++++KK +K KK G K + E + ++G K +K
Sbjct: 599 EAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAEREIDVSRLDIRVGLIVKAEK 655
Score = 29.7 bits (67), Expect = 9.4
Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 5/60 (8%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
R+K G++ R + AE KK A+++KKK KK+G + G K K +
Sbjct: 581 REKFAGSQADRAAR-----AEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAE 635
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 38.1 bits (89), Expect = 0.017
Identities = 37/153 (24%), Positives = 59/153 (38%), Gaps = 22/153 (14%)
Query: 833 SNVATKRLLHDAILRRSGMYTIAMAYCGSGSNVATKRLLHVAVSDVNDDVRREAVTALGF 892
S +AT R+L A + I G + A L D + E V A
Sbjct: 175 SELATTRILTPADFAK-----IQELRLEKGVDKALGGKLKRRDKDAPERHSDELVDADDI 229
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEK---MRKKKGAE------KMKKKKKDAEKM 943
KK+ K+ E+I K+ +++ EK + KK E K K +KK+
Sbjct: 230 EGPAKKK----KQTKEERIATAKEGREDREKFGSRKGKKDKEGKSTTNKEKARKKNFMMT 285
Query: 944 KKKGAEKIGKKKKK---KKKKKKKKKKKKKKKK 973
K + GK+K+ K+K + ++KK
Sbjct: 286 LHKKRVR-GKQKRSLRDKQKVLRAHILRQKKGG 317
Score = 37.7 bits (88), Expect = 0.023
Identities = 25/109 (22%), Positives = 44/109 (40%), Gaps = 22/109 (20%)
Query: 896 RKKRTRRKKKKGAEK---------IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
K+R + ++ +++ KKKK+ KE K+G E +K K K+
Sbjct: 208 LKRRDKDAPERHSDELVDADDIEGPAKKKKQTKEERIATAKEGREDREKFGSRKGKKDKE 267
Query: 947 GAEKIGKKKKKKK-------------KKKKKKKKKKKKKKKEAEKMKKK 982
G K+K +KK K+K+ + K+K + + KK
Sbjct: 268 GKSTTNKEKARKKNFMMTLHKKRVRGKQKRSLRDKQKVLRAHILRQKKG 316
Score = 30.0 bits (68), Expect = 6.2
Identities = 11/89 (12%), Positives = 40/89 (44%), Gaps = 11/89 (12%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG-----------KKKKK 957
E+ + + KK + + + ++ + +++ + +K+ A ++ K ++
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPADFAKIQEL 194
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
+ +K K K K+++ + ++ ++E
Sbjct: 195 RLEKGVDKALGGKLKRRDKDAPERHSDEL 223
>gnl|CDD|131972 TIGR02926, AhaH, ATP synthase archaeal, H subunit. he A1/A0 ATP
synthase is homologous to the V-type (V1/V0, vacuolar)
ATPase, but functions in the ATP synthetic direction as
does the F1/F0 ATPase of bacteria. The hydrophilic A1
"stalk" complex (AhaABCDEFG) is the site of ATP
generation and is coupled to the membrane-embedded
proton translocating A0 complex. It is unclear precisely
where AhaH fits into these complexes.
Length = 85
Score = 35.2 bits (81), Expect = 0.018
Identities = 22/84 (26%), Positives = 47/84 (55%), Gaps = 2/84 (2%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
+ KK E + ++ +E K R + E+ ++ ++AE+ K E+I K+ +++ +
Sbjct: 2 EEIKKAEEDAEELIEEAEEERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEEEIE- 60
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEE 985
K+ +K +++ +KE E MK K +E
Sbjct: 61 -KEAEKIREEGEKEIEAMKSKAKE 83
Score = 32.5 bits (74), Expect = 0.17
Identities = 21/68 (30%), Positives = 40/68 (58%), Gaps = 4/68 (5%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
+K++ AE + ++ +EAE+ K G E +K+ +++ EK AEKI ++ +K+ +
Sbjct: 21 ERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEEEIEKE----AEKIREEGEKEIEA 76
Query: 962 KKKKKKKK 969
K K K+K
Sbjct: 77 MKSKAKEK 84
Score = 27.9 bits (62), Expect = 6.8
Identities = 19/74 (25%), Positives = 37/74 (50%), Gaps = 2/74 (2%)
Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKK--KEAEKMRKKKGAEKMKKKK 937
+++ EA + + R+ + AE+ K ++ KEAE+ +K+ + ++ +
Sbjct: 12 EELIEEAEEERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEEEIEKEAEKIREEGE 71
Query: 938 KDAEKMKKKGAEKI 951
K+ E MK K EKI
Sbjct: 72 KEIEAMKSKAKEKI 85
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 37.0 bits (86), Expect = 0.019
Identities = 20/79 (25%), Positives = 43/79 (54%), Gaps = 7/79 (8%)
Query: 917 KKKEAEKMRKKKGAEK-MKKKK---KDAEKMKKKGAEKIGKKKKKKKKKKKKK---KKKK 969
K + +K+ ++KG EK + +K+ ++ + + KI + +++ K+KK K+
Sbjct: 102 KDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEI 161
Query: 970 KKKKKEAEKMKKKTEEAEK 988
+ K EA K+K++ +AE
Sbjct: 162 ARLKSEASKIKQELVDAEL 180
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 38.5 bits (90), Expect = 0.020
Identities = 20/100 (20%), Positives = 58/100 (58%), Gaps = 9/100 (9%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRK--KKGAEKMKKKKKDAEKMKKKGAEK----- 950
+ + +K++ K+ + ++ + E + + +K EK++++ + AE+ ++ AE
Sbjct: 168 SKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELEL 227
Query: 951 --IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K K+ +K+ ++ +++ + ++E E+++++ EEAEK
Sbjct: 228 ALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEK 267
Score = 36.6 bits (85), Expect = 0.085
Identities = 16/96 (16%), Positives = 55/96 (57%), Gaps = 4/96 (4%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
+L ++ K ++ AEK + ++ K E ++ K+K+ +K+ E+++ E++
Sbjct: 194 LLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELE----EELS 249
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ +++ ++ +++ ++ +K+ ++ ++++ EE E+
Sbjct: 250 RLEEELEELQEELEEAEKEIEELKSELEELREELEE 285
Score = 34.3 bits (79), Expect = 0.35
Identities = 15/94 (15%), Positives = 47/94 (50%), Gaps = 3/94 (3%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
+ R ++ + K++ ++ E E + E++++ + + E+++++ E K
Sbjct: 276 LEELREELEELQEELLELKEEIEELEGEISLLR---ERLEELENELEELEERLEELKEKI 332
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K++ ++++ ++ ++ AE + K E EK
Sbjct: 333 EALKEELEERETLLEELEQLLAELEEAKEELEEK 366
Score = 31.6 bits (72), Expect = 3.0
Identities = 12/60 (20%), Positives = 37/60 (61%), Gaps = 1/60 (1%)
Query: 929 GAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
G K K++K++AE+ ++ E + ++ + ++ +K+ +K +++ ++AE+ ++ E +
Sbjct: 166 GVSKYKERKEEAERKLERTEENL-ERLEDLLEELEKQLEKLERQAEKAERYQELKAELRE 224
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 36.5 bits (85), Expect = 0.024
Identities = 17/79 (21%), Positives = 49/79 (62%), Gaps = 4/79 (5%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK---KKGAEKIGKKKKKKK 959
++ K A+ + + ++ K+EA+ + + +++++ ++ A ++ KK AE+I ++ K +
Sbjct: 38 RQAKIADDLAEAERLKEEAQAL-LAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEA 96
Query: 960 KKKKKKKKKKKKKKKEAEK 978
+++ ++ K+ + + EAEK
Sbjct: 97 EEELERIKEAAEAEIEAEK 115
Score = 33.4 bits (77), Expect = 0.26
Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 9/80 (11%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAE-KMKKKKKDAEKMKKK--------GAEKIGKKK 955
KK AE+I ++ K + E E R K+ AE +++ +K+ A + + AEK+ KK
Sbjct: 82 KKEAEQIAEEIKAEAEEELERIKEAAEAEIEAEKERALEELRAEVAELAVAIAEKLLGKK 141
Query: 956 KKKKKKKKKKKKKKKKKKKE 975
+ +K + +
Sbjct: 142 VDEAAQKDLIDAFIAELGEN 161
Score = 33.4 bits (77), Expect = 0.27
Identities = 16/98 (16%), Positives = 45/98 (45%), Gaps = 5/98 (5%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKK-----KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
R K + E+ ++ +++ ++A+K ++ E + +++ E++K+ +
Sbjct: 51 RLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEAAEAE 110
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
I +K++ ++ + + + E KK E A+K
Sbjct: 111 IEAEKERALEELRAEVAELAVAIAEKLLGKKVDEAAQK 148
Score = 31.5 bits (72), Expect = 0.95
Identities = 16/79 (20%), Positives = 44/79 (55%), Gaps = 8/79 (10%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA-EKMKKKGAEKIGKKKKKKKKKKKKKKK 967
E+ K EAE+++++ A+ + + + E+ +++ +E I ++ KK+ ++ +
Sbjct: 37 ERQAKIADDLAEAERLKEE--AQALLAEYEQELEEAREQASEII----EQAKKEAEQIAE 90
Query: 968 KKKKK-KKEAEKMKKKTEE 985
+ K + ++E E++K+ E
Sbjct: 91 EIKAEAEEELERIKEAAEA 109
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 36.2 bits (84), Expect = 0.024
Identities = 25/90 (27%), Positives = 54/90 (60%), Gaps = 13/90 (14%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
+ ++ KKKG E++ K+ +KE G K K++++ KK E+ K+K++
Sbjct: 1 KKKKAKKKGVEEVVFDKEARKEY-----LTGFHKRKQQRR------KKAQEEA--KEKER 47
Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++ +++K+ ++++K+E EK K+ +EA K
Sbjct: 48 EERIEERKRIREERKQELEKQLKERKEALK 77
Score = 34.2 bits (79), Expect = 0.11
Identities = 21/69 (30%), Positives = 42/69 (60%), Gaps = 12/69 (17%)
Query: 879 NDDVRREAVTALGFILFRK-KRTRRKKKKGAEKIRKKKKKKKEAE--KMR--KKKGAEKM 933
+ + R+E +T F K K+ RRKK + E+ ++K+++++ E ++R +K+ EK
Sbjct: 16 DKEARKEYLTG-----FHKRKQQRRKKAQ--EEAKEKEREERIEERKRIREERKQELEKQ 68
Query: 934 KKKKKDAEK 942
K++K+A K
Sbjct: 69 LKERKEALK 77
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 36.0 bits (83), Expect = 0.025
Identities = 14/58 (24%), Positives = 17/58 (29%), Gaps = 3/58 (5%)
Query: 127 LQELKTCAPIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTL 184
L P+ T + P PTP T A PAP + P P
Sbjct: 82 LTPPAPPEPVTPPT---AQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQ 136
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 36.5 bits (85), Expect = 0.025
Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K E ++ K I K+K K+ ++KK + EK++KK E E
Sbjct: 86 KSSEELVQMKK--AISALKQKIKRDSAERKKAASSGSRAIEKLRKKLAEQE 134
Score = 31.5 bits (72), Expect = 1.3
Identities = 11/47 (23%), Positives = 22/47 (46%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
E ++ KK + +K + +KK + +K +KK ++EA
Sbjct: 89 EELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLAEQEA 135
Score = 30.4 bits (69), Expect = 3.2
Identities = 13/44 (29%), Positives = 18/44 (40%), Gaps = 3/44 (6%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
KK A K K K ++KK + +K +KK E E
Sbjct: 94 MKKAISALKQKIKRDSA---ERKKAASSGSRAIEKLRKKLAEQE 134
Score = 29.6 bits (67), Expect = 5.0
Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 2/43 (4%)
Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
+E +M+K A K K K+ AE+ KK A + +K +KK
Sbjct: 89 EELVQMKKAISALKQKIKRDSAER--KKAASSGSRAIEKLRKK 129
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends
to by Lys/Arg-rich, and is followed by a
membrane-spanning region. This is followed by an acidic
low-complexity region of variable length and a
well-conserved C-terminal domain of two tandem regions
matched by pfam05036 (Sporulation related repeat), found
in several cell division and sporulation proteins. The
role of FtsN as a suppressor for other cell division
mutations is poorly understood; it may involve cell wall
hydrolysis [Cellular processes, Cell division].
Length = 298
Score = 37.4 bits (86), Expect = 0.026
Identities = 23/97 (23%), Positives = 39/97 (40%), Gaps = 4/97 (4%)
Query: 896 RKKRTRRKKKKGA--EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
R K A E+ + +K+ AEK +K + +K + EK+ K E K
Sbjct: 127 ADMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQK 186
Query: 954 KK--KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K K+ + + K + K + K K + EK
Sbjct: 187 QKALPKQTAETQSNSKPIETAPKADKADKTKPKPKEK 223
Score = 35.1 bits (80), Expect = 0.15
Identities = 22/80 (27%), Positives = 35/80 (43%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
K EKI K K+ K+ +K K+ AE K K A+K K K+K ++
Sbjct: 168 KTPVETEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAPKADKADKTKPKPKEKAERA 227
Query: 963 KKKKKKKKKKKKEAEKMKKK 982
+ K++AE ++ K
Sbjct: 228 AALQCGAYANKEQAESVRAK 247
Score = 34.3 bits (78), Expect = 0.31
Identities = 16/86 (18%), Positives = 40/86 (46%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
++++ E+++ + ++ + ++ +K+ AEK +K +K + +K
Sbjct: 117 EQRQLLEQMQADMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKI 176
Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K+ K+K+K K +T+ K
Sbjct: 177 ASKVKEAKQKQKALPKQTAETQSNSK 202
Score = 31.2 bits (70), Expect = 2.9
Identities = 19/76 (25%), Positives = 30/76 (39%), Gaps = 2/76 (2%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
KK + + +K E EK+ K ++ K+K+K K + + K K
Sbjct: 156 KKPQKARTAEAQKTPVETEKIASK--VKEAKQKQKALPKQTAETQSNSKPIETAPKADKA 213
Query: 964 KKKKKKKKKKKEAEKM 979
K K K K+K E
Sbjct: 214 DKTKPKPKEKAERAAA 229
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region. This
domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 34.8 bits (81), Expect = 0.027
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 3/50 (6%)
Query: 939 DAEKMKKKGAEKIGK---KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
D E+M KK I +KKK+KK+ K + K+ KK E+ K + +
Sbjct: 43 DIEEMYKKAHAAIRADPSRKKKEKKEVKAESKRYNAKKLTLEQRKARVAQ 92
Score = 27.9 bits (63), Expect = 6.9
Identities = 9/33 (27%), Positives = 16/33 (48%)
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
+K KK+ K + K+ KK +++K K
Sbjct: 61 RKKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93
>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain. This family
contains a C-terminal presumed domain in Sas10 which
hash been identified as a regulator of chromatin
silencing.
Length = 76
Score = 34.5 bits (80), Expect = 0.028
Identities = 15/64 (23%), Positives = 29/64 (45%), Gaps = 7/64 (10%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
TR++KK + + KK+KK + +++ K ++ + E+ G E G K +
Sbjct: 20 TRKRKKD--RRNPRVKKRKKYEKAVKRLKS---QRRVVR--EEDGGYGGETTGIKTNVVR 72
Query: 960 KKKK 963
K
Sbjct: 73 SVKL 76
Score = 28.4 bits (64), Expect = 3.6
Identities = 12/43 (27%), Positives = 25/43 (58%)
Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K KG + KK ++ + KK+KK +K K+ ++++ + E+
Sbjct: 14 EKNKGLTRKRKKDRRNPRVKKRKKYEKAVKRLKSQRRVVREED 56
Score = 28.0 bits (63), Expect = 4.5
Identities = 15/63 (23%), Positives = 29/63 (46%), Gaps = 11/63 (17%)
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
K+K + ++ K KG + +KK + + KK+KK +K K+ K +++
Sbjct: 5 GKRKISYQIEKNKGLTRKRKKD-----------RRNPRVKKRKKYEKAVKRLKSQRRVVR 53
Query: 976 AEK 978
E
Sbjct: 54 EED 56
Score = 27.2 bits (61), Expect = 8.2
Identities = 16/61 (26%), Positives = 30/61 (49%), Gaps = 3/61 (4%)
Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K+K + +++K K K KK + + KK+KK +K K+ K +++ E+ E
Sbjct: 6 KRKISYQIEKNKGLTRKRKKD--RRN-PRVKKRKKYEKAVKRLKSQRRVVREEDGGYGGE 62
Query: 986 A 986
Sbjct: 63 T 63
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 38.0 bits (89), Expect = 0.028
Identities = 26/98 (26%), Positives = 42/98 (42%), Gaps = 9/98 (9%)
Query: 899 RTRRKKKKGAE--KIRKKKK-----KKKEAEKMRKKKGAEKMKKKKKDA--EKMKKKGAE 949
R ++KK AE K R + + ++K A + R KK AE K KDA + + A+
Sbjct: 439 RAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAK 498
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K + K + +EA K + + +AE
Sbjct: 499 KAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAE 536
Score = 37.6 bits (88), Expect = 0.036
Identities = 12/83 (14%), Positives = 31/83 (37%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
R+ R R+K + +K + + +K ++K KK A + A
Sbjct: 457 ARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDN 516
Query: 955 KKKKKKKKKKKKKKKKKKKKEAE 977
++ +K + + ++ ++
Sbjct: 517 SAVIAAREARKAQARARQAEKQA 539
Score = 31.5 bits (72), Expect = 2.4
Identities = 16/82 (19%), Positives = 33/82 (40%), Gaps = 5/82 (6%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
R+K R + K A + R K K A + + K KK + + K +
Sbjct: 464 REKAAREARHKKAAEARAAKDKDAVAAALARVKA-----KKAAATQPIVIKAGARPDNSA 518
Query: 956 KKKKKKKKKKKKKKKKKKKEAE 977
++ +K + + ++ +K+A
Sbjct: 519 VIAAREARKAQARARQAEKQAA 540
Score = 30.7 bits (70), Expect = 5.0
Identities = 23/105 (21%), Positives = 38/105 (36%), Gaps = 27/105 (25%)
Query: 910 KIRKKKKKKKEAE--------------------KMRKKKGAEKMKKKKKDA-------EK 942
+IR +++KK+AE + R KK AE K KDA K
Sbjct: 437 EIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVK 496
Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
KK A + K + ++ +K +A + + + A
Sbjct: 497 AKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQAAA 541
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 37.9 bits (89), Expect = 0.029
Identities = 25/101 (24%), Positives = 43/101 (42%), Gaps = 7/101 (6%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
KK+ K K EK K K++ + +K+++ EK+ K+ E + K
Sbjct: 138 HLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKL 197
Query: 955 KKKKKKKKKKKKKKKKK-------KKKEAEKMKKKTEEAEK 988
+ +KKK+ K K+ K+E MK+ EE +
Sbjct: 198 EAEKKKQSKNFDPKEGPVQIGKKIDKEEITPMKEINEEERR 238
Score = 34.8 bits (81), Expect = 0.26
Identities = 15/78 (19%), Positives = 35/78 (44%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
+ ++KK K + + G K+ + D+++ +K + ++ +K K+ + K
Sbjct: 136 RDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMK 195
Query: 968 KKKKKKKEAEKMKKKTEE 985
K + +KK+ K E
Sbjct: 196 KLEAEKKKQSKNFDPKEG 213
Score = 30.2 bits (69), Expect = 6.4
Identities = 22/83 (26%), Positives = 36/83 (43%)
Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
K +I + KKK K+ K+ K D E K + + +K+++ +K K
Sbjct: 129 KVNNEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAK 188
Query: 966 KKKKKKKKKEAEKMKKKTEEAEK 988
+ + KK EAEK K+ K
Sbjct: 189 EALEAMKKLEAEKKKQSKNFDPK 211
>gnl|CDD|224237 COG1318, COG1318, Predicted transcriptional regulators
[Transcription].
Length = 182
Score = 36.4 bits (84), Expect = 0.029
Identities = 22/96 (22%), Positives = 41/96 (42%), Gaps = 4/96 (4%)
Query: 894 LFRKKRTRRKKKKGAEK----IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
L R ++T R KG K +R+ +K KE + EK++K+ + +
Sbjct: 71 LGRTEQTVRNHLKGETKAGQLVRETYEKLKEGGLDAVEVEIEKLEKEGLKIRWAVEVLKK 130
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
G+ K+ ++K K + K + E++ K E
Sbjct: 131 IKGEHFPMDKELLEEKLKGEVIKGEPLEEVIDKLEY 166
Score = 36.4 bits (84), Expect = 0.030
Identities = 17/90 (18%), Positives = 38/90 (42%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
+ T K K+G + + +K E E ++ + E +KK K + M K+ E+ K +
Sbjct: 93 RETYEKLKEGGLDAVEVEIEKLEKEGLKIRWAVEVLKKIKGEHFPMDKELLEEKLKGEVI 152
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K + ++ K + + + + +
Sbjct: 153 KGEPLEEVIDKLEYPVESPADLLHEIKIHL 182
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 35.6 bits (83), Expect = 0.032
Identities = 23/85 (27%), Positives = 51/85 (60%), Gaps = 7/85 (8%)
Query: 905 KKGAEKIRKKKKK-KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
K +++ K+ KK + E EK+ K+ ++K K+ +K +E ++KK+K+ +KK
Sbjct: 17 KAAQKQLEKEFKKRQAELEKLEKE-----LQKLKEKLQKDAATLSEAA-REKKEKELQKK 70
Query: 964 KKKKKKKKKKKEAEKMKKKTEEAEK 988
++ ++K++K + + K++ EE +K
Sbjct: 71 VQEFQRKQQKLQQDLQKRQQEELQK 95
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 35.3 bits (82), Expect = 0.033
Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 15/85 (17%)
Query: 916 KKKKEAEKMRKKKGAE-----------KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
+++E ++ RK+ E +K E+ K E+ +K+KKKKKKK+
Sbjct: 32 AREEERKRERKRLRNEPDEDGFVTVVRGGRKGPAAGEEAGKAAEEEKKEKEKKKKKKKEL 91
Query: 965 KK----KKKKKKKKEAEKMKKKTEE 985
+ + ++KKK+E + KK EE
Sbjct: 92 EDFYRFQLREKKKEEQADLLKKFEE 116
Score = 30.3 bits (69), Expect = 2.1
Identities = 24/82 (29%), Positives = 41/82 (50%), Gaps = 14/82 (17%)
Query: 891 GFILF-RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEK--------MKKKKKDAE 941
GF+ R R + A K +++KK+KE +K +KK E +KKK++
Sbjct: 52 GFVTVVRGGRKGPAAGEEAGKAAEEEKKEKEKKKKKKK---ELEDFYRFQLREKKKEEQA 108
Query: 942 KMKKKGAEKIGKKKKKKKKKKK 963
+ KK E K++ +K K+K+
Sbjct: 109 DLLKKFEED--KERVEKMKEKR 128
Score = 29.2 bits (66), Expect = 4.8
Identities = 27/98 (27%), Positives = 47/98 (47%), Gaps = 15/98 (15%)
Query: 896 RKKRTRRKKKKGAEK--------IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
++R R +K+ E +R +K E+ K EK +K+KK K KKK
Sbjct: 34 EEERKRERKRLRNEPDEDGFVTVVRGGRKGPAAGEEAGKAAEEEKKEKEKK---KKKKKE 90
Query: 948 AEKIGKKKKKKKKKKK----KKKKKKKKKKKEAEKMKK 981
E + + ++KKK++ KK ++ K++ E K K+
Sbjct: 91 LEDFYRFQLREKKKEEQADLLKKFEEDKERVEKMKEKR 128
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 36.6 bits (85), Expect = 0.035
Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 4/69 (5%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE-KIGKKKKKKKKKKKKKK 966
E +RKK+K+KKE K++KK+ + +K E + K E +G+ + + + KK
Sbjct: 158 LELLRKKEKEKKEQLKIQKKQ---SLLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKK 214
Query: 967 KKKKKKKKE 975
KK K K +E
Sbjct: 215 KKAKSKYEE 223
Score = 31.6 bits (72), Expect = 1.4
Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 5/63 (7%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK-KKGAEKIGKKKKKKK 959
R+K+K+ E+++ +KK+ + K G E + K K+ + + E KKKK
Sbjct: 162 RKKEKEKKEQLKIQKKQS----LLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKKKKA 217
Query: 960 KKK 962
K K
Sbjct: 218 KSK 220
>gnl|CDD|217970 pfam04220, YihI, Der GTPase activator (YihI). YihI activates the
GTPase activity of Der, a 50S ribosomal subunit
stability factor. The stimulation is specific to Der as
YihI does not stimulate the GTPase activity of Era or
ObgE. The interaction of YihI with Der requires only the
C-terminal 78 amino acids of YihI. A yihI deletion
mutant is viable and shows a shorter lag period, but the
same post-lag growth rate as a wild-type strain. yihI is
expressed during the lag period. Overexpression of yihI
inhibits cell growth and biogenesis of the 50S ribosomal
subunit. YihI is an unusual, highly hydrophilic protein
with an uneven distribution of charged residues,
resulting in an N-terminal region with high pI and a
C-terminal region with low pI.
Length = 169
Score = 36.2 bits (84), Expect = 0.036
Identities = 30/96 (31%), Positives = 43/96 (44%), Gaps = 25/96 (26%)
Query: 902 RKKKKGAEKIRKKKKKK---------KEAEKMRKKKG-------AEKMKKKKKDAEKMKK 945
RK K K+ K KKK +E ++ +K+KG E+ + +K+ KK
Sbjct: 1 RKSGKNGPKLAPKGKKKTRYELDQEARERKRKKKRKGLKSGSRHNEESESQKQKGAAQKK 60
Query: 946 KGAEKIGKKKK-------KKKKKKKKKKKKKKKKKK 974
+IG KK K K +K+ KKKK KK K
Sbjct: 61 D--PRIGSKKPVPLIVEFKVKPEKQAPKKKKVKKPK 94
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 35.6 bits (82), Expect = 0.037
Identities = 23/114 (20%), Positives = 52/114 (45%), Gaps = 10/114 (8%)
Query: 884 REAVTALGFILFR--------KKRTRRKKKKGAEKIRKKKKKKK-EAEKMRKKKGAEKMK 934
R V G++L R K + + +K+ AE+ ++KK K +R A
Sbjct: 19 RIIVIVGGYLLIRPYFIKLGAKAQEKEHEKERAEREEAREKKAKISPNALRGGATAGH-G 77
Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ D E+ ++ A + KK +K+++K +K + E + + + ++ ++
Sbjct: 78 EEDTDDEEDEEDFATPSAVPQWGKKARKRQRKVIRKLLEAEEQLREDQYDDEDE 131
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region covers
both the Brf homology II and III regions. This region is
involved in binding TATA binding protein.
Length = 95
Score = 34.5 bits (80), Expect = 0.038
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
+ ++++++ K K GKKKKK+K KKK+ +
Sbjct: 26 KDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64
Score = 30.7 bits (70), Expect = 0.83
Identities = 11/35 (31%), Positives = 21/35 (60%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
++K+ K+K + KKKKK++A+K + + A
Sbjct: 33 EEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPAS 67
Score = 29.5 bits (67), Expect = 2.1
Identities = 14/46 (30%), Positives = 23/46 (50%), Gaps = 3/46 (6%)
Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
E+ ++K ++ K + KKKKK+K KKK++ EA
Sbjct: 30 EEQEEKELKQ---KADEGNNSGKKKKKRKAKKKRDEAGPASTAAEA 72
Score = 29.1 bits (66), Expect = 2.6
Identities = 14/53 (26%), Positives = 29/53 (54%), Gaps = 6/53 (11%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK------KGAEKMKKKKKDAEKM 943
+K ++K +G +KKKK+K + ++ + +KM +KKK ++K+
Sbjct: 34 EKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVKKMLQKKKFSKKI 86
>gnl|CDD|150240 pfam09495, DUF2462, Protein of unknown function (DUF2462). This
protein is highly conserved, but its function is
unknown. It can be isolated from HeLa cell nucleoli and
is found to be homologous with Leydig cell tumour
protein whose function is unknown.
Length = 82
Score = 34.3 bits (79), Expect = 0.038
Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKK---KKKKKKKKEAEKMKKK 982
K+K KG + KKK+K KK K KK + +A+K+KK
Sbjct: 7 KVKAKGPAAVKAGKKKQKGPKKAGPRVIKPKKAQAVQAQKLKKV 50
Score = 30.5 bits (69), Expect = 0.87
Identities = 18/53 (33%), Positives = 23/53 (43%)
Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
K KG +K KK + KK G I KK + + +K KK K EK
Sbjct: 9 KAKGPAAVKAGKKKQKGPKKAGPRVIKPKKAQAVQAQKLKKVLTSGLNKSTEK 61
Score = 28.5 bits (64), Expect = 4.1
Identities = 18/63 (28%), Positives = 32/63 (50%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
KK+ + KK G I+ KK + +A+K++K + K +K +G ++ K +
Sbjct: 20 KKKQKGPKKAGPRVIKPKKAQAVQAQKLKKVLTSGLNKSTEKLIASRASEGHLELLKGVR 79
Query: 957 KKK 959
KKK
Sbjct: 80 KKK 82
>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
Length = 158
Score = 35.9 bits (83), Expect = 0.040
Identities = 16/82 (19%), Positives = 34/82 (41%), Gaps = 7/82 (8%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK-------KKKKKKKK 966
KK+ + E ++KK E++ K AE K + + KK ++
Sbjct: 67 KKEFESAVESLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSN 126
Query: 967 KKKKKKKKEAEKMKKKTEEAEK 988
+ K ++ +++ K+ + A K
Sbjct: 127 ETKLTQESSSDESPKEVKLATK 148
Score = 34.7 bits (80), Expect = 0.090
Identities = 15/87 (17%), Positives = 32/87 (36%), Gaps = 8/87 (9%)
Query: 907 GAEKIRKKKK-----KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
E ++KK K K + K + ++ KK E+ ++ + K
Sbjct: 73 AVESLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDT---IPNHLNEEVSNETK 129
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ + K+ K KT++ +K
Sbjct: 130 LTQESSSDESPKEVKLATKNKTKKHDK 156
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 36.6 bits (85), Expect = 0.042
Identities = 16/83 (19%), Positives = 42/83 (50%), Gaps = 4/83 (4%)
Query: 908 AEKIRKKKK--KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
+I++ +K KK +AE K E ++ + +D E + +I ++ +++ K+ ++
Sbjct: 23 EPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEI--QEIRERIKRAEE 80
Query: 966 KKKKKKKKKEAEKMKKKTEEAEK 988
K K ++E + + + A++
Sbjct: 81 KLSAVKDERELRALNIEIQIAKE 103
Score = 33.9 bits (78), Expect = 0.28
Identities = 20/116 (17%), Positives = 49/116 (42%), Gaps = 11/116 (9%)
Query: 884 REAVTALGFILFRKKRTRRKKKKGAEKIRKKKKK-KKEAEKMRKK--KGAEKMKKKKKDA 940
R+A+ L + + E + + + + E +++R++ + EK+ K +
Sbjct: 30 RKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDER 89
Query: 941 EKMKKKGAEKIGKKKKKKK--------KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E +I K++ ++ +K +K+ + K+ E+++K EAE
Sbjct: 90 ELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEA 145
Score = 31.2 bits (71), Expect = 2.1
Identities = 14/87 (16%), Positives = 39/87 (44%), Gaps = 6/87 (6%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
R++ K AE+ K ++E + + K + + E ++ ++ +K +K+
Sbjct: 72 RERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDE------LAELMEEIEKLEKE 125
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K++ ++ +K + + + EE
Sbjct: 126 IEDLKERLERLEKNLAEAEARLEEEVA 152
Score = 29.6 bits (67), Expect = 7.6
Identities = 14/96 (14%), Positives = 41/96 (42%), Gaps = 3/96 (3%)
Query: 893 ILFRKKRTRRKKKKGAEKIR-KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
R + ++ +K +K + + + K E + + E ++ + E ++ E+I
Sbjct: 18 EKDRLEPRIKEIRKALKKAKAELEALNKALEALEIEL--EDLENQVSQLESEIQEIRERI 75
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+ ++K K +++ + + + K + + E E
Sbjct: 76 KRAEEKLSAVKDERELRALNIEIQIAKERINSLEDE 111
>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
Length = 381
Score = 37.0 bits (86), Expect = 0.043
Identities = 25/82 (30%), Positives = 37/82 (45%), Gaps = 2/82 (2%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
R KKK A++ +KK K + A K K +K A ++K K +K K ++
Sbjct: 302 RPAKKKPAKRRVQKKNPLKNRRVRLRLNPAAK--KLRKLAVLAQEKATAKGAQKVKNRRA 359
Query: 961 KKKKKKKKKKKKKKEAEKMKKK 982
+KK KK + K K K K
Sbjct: 360 RKKAKKARLAKVAKALYKSIKA 381
Score = 34.7 bits (80), Expect = 0.24
Identities = 21/79 (26%), Positives = 36/79 (45%), Gaps = 4/79 (5%)
Query: 914 KKKKKKEAEKMRKKKGAEK----MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
+ KKK A++ +KK K + A+K++K K K +K K ++ +K
Sbjct: 302 RPAKKKPAKRRVQKKNPLKNRRVRLRLNPAAKKLRKLAVLAQEKATAKGAQKVKNRRARK 361
Query: 970 KKKKKEAEKMKKKTEEAEK 988
K KK K+ K ++ K
Sbjct: 362 KAKKARLAKVAKALYKSIK 380
Score = 34.7 bits (80), Expect = 0.25
Identities = 21/85 (24%), Positives = 34/85 (40%), Gaps = 10/85 (11%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K RR +KK K R+ + + A K +K +K A+ K K
Sbjct: 307 KPAKRRVQKKNPLKNRRVRLRLNPAAKKLRKLAVLAQEKAT----------AKGAQKVKN 356
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKK 981
++ +KK KK + K K + +K
Sbjct: 357 RRARKKAKKARLAKVAKALYKSIKA 381
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 37.2 bits (87), Expect = 0.044
Identities = 19/100 (19%), Positives = 53/100 (53%), Gaps = 3/100 (3%)
Query: 889 ALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKK---GAEKMKKKKKDAEKMKK 945
+G++L ++ ++ R ++ KKEAE ++K+ E++ K + + E+ K
Sbjct: 13 IIGYLLRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELK 72
Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
+ ++ + +++ ++++ +K + K+ E ++KK +E
Sbjct: 73 ERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKE 112
Score = 33.7 bits (78), Expect = 0.49
Identities = 28/113 (24%), Positives = 56/113 (49%), Gaps = 7/113 (6%)
Query: 883 RREAVTALGFILFRKKRTRRKKKKGAEKIRKKK--KKKKEAEKMRKKKGAE--KMKKKKK 938
+R A LG KR + KK AE ++K+ + K+E K+R + E + + + +
Sbjct: 20 KRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQ 79
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKK---KKKKKKKKKKKEAEKMKKKTEEAEK 988
E+ + E + +K + KK++ KK+K+ K+K ++ +++ EE
Sbjct: 80 RLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIA 132
Score = 32.6 bits (75), Expect = 1.3
Identities = 23/103 (22%), Positives = 54/103 (52%), Gaps = 6/103 (5%)
Query: 889 ALGFILFR--KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
+G I+ +KR KK AE K+ +EA+K + E + + K++ K++ +
Sbjct: 9 IVGLIIGYLLRKRIAEKKLGSAE--ELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAE 66
Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKK-KKEAEKMKKKTEEAEK 988
E+ K+++ + ++ +++ ++++ ++ E + KK E EK
Sbjct: 67 -LERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEK 108
Score = 29.9 bits (68), Expect = 7.7
Identities = 22/111 (19%), Positives = 57/111 (51%), Gaps = 4/111 (3%)
Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
++ ++EA T L K K + E+ K+++ + + + R + E + +K +
Sbjct: 39 EEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMES 98
Query: 940 AEKMKK---KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT-EEA 986
+K ++ K +++ K+K +K+++ ++ ++++E E++ T EEA
Sbjct: 99 LDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEEA 149
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 37.1 bits (85), Expect = 0.045
Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 9/68 (13%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
KK G +KI +K+K+ + + + G ++K KK+ AEK G ++++ +
Sbjct: 564 KKTGGMKKIMDEKRKRLKNNIEQTQDGKPELKIKKRKAEK---------GDQRQELDRIV 614
Query: 963 KKKKKKKK 970
K K+ K
Sbjct: 615 KSIKRSGK 622
Score = 35.1 bits (80), Expect = 0.20
Identities = 27/135 (20%), Positives = 49/135 (36%), Gaps = 28/135 (20%)
Query: 876 SDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK 935
SD ++ E + + K + EK+ +K ++++E K R+ +KK
Sbjct: 402 SDDGENGEMEDTFTSHLPASNESESDDKLETTIEKLDRKLRERQENRKERQ------LKK 455
Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK----------------------K 973
K D++ K E I KK KK K ++ K++ K
Sbjct: 456 TKDDSDVDLKDKKESINKKNKKGKHAIERTAASKEELELIKADDEDDEQLDHFDMKSILK 515
Query: 974 KEAEKMKKKTEEAEK 988
E K +K ++
Sbjct: 516 AEKFKKNRKLKKKAS 530
Score = 31.3 bits (70), Expect = 3.4
Identities = 23/99 (23%), Positives = 39/99 (39%), Gaps = 20/99 (20%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK------ 950
K + ++++ K R+ KK K +++ K K KK KK +++ A K
Sbjct: 436 KLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERTAASKEELELI 495
Query: 951 --------------IGKKKKKKKKKKKKKKKKKKKKKKE 975
+ K +K KK +K KKK +E
Sbjct: 496 KADDEDDEQLDHFDMKSILKAEKFKKNRKLKKKASNLEE 534
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association with
pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 33.8 bits (78), Expect = 0.048
Identities = 17/50 (34%), Positives = 30/50 (60%), Gaps = 8/50 (16%)
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK------KKKEAEKMKKK 982
E ++ A K+ + ++ ++KKKKKKK KKK +KK E+M+++
Sbjct: 20 GEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK--EQMRRR 67
Score = 33.0 bits (76), Expect = 0.097
Identities = 16/69 (23%), Positives = 30/69 (43%), Gaps = 5/69 (7%)
Query: 859 CGSGSNVATKRLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKK 918
G L + ++ + V I R +R ++KKKK +K ++ +K
Sbjct: 4 GGPSVPEQLPNGLGEEGEEDDEQIDASKVR---RISQRNRRKKKKKKKLKKKSKEWILRK 60
Query: 919 KEAEKMRKK 927
K E+MR++
Sbjct: 61 K--EQMRRR 67
Score = 31.5 bits (72), Expect = 0.37
Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 1/48 (2%)
Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE-KMKKK 982
E + K ++ ++ ++KKKKKKK KKK E ++KK
Sbjct: 14 NGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61
Score = 30.3 bits (69), Expect = 0.90
Identities = 16/47 (34%), Positives = 30/47 (63%), Gaps = 7/47 (14%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
K ++ +++ R+KKKKKK K++KK ++KK E+M+++G
Sbjct: 29 ASKVRRISQRNRRKKKKKK---KLKKKSKEWILRKK----EQMRRRG 68
Score = 28.8 bits (65), Expect = 2.8
Identities = 17/45 (37%), Positives = 30/45 (66%), Gaps = 2/45 (4%)
Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
+A K+R+ + KKKKK +K+KKK E I +KK++ +++ K+
Sbjct: 28 DASKVRRISQRNRRKKKKK--KKLKKKSKEWILRKKEQMRRRGKE 70
>gnl|CDD|223625 COG0551, TopA, Zn-finger domain associated with topoisomerase type
I [DNA replication, recombination, and repair].
Length = 140
Score = 35.2 bits (81), Expect = 0.048
Identities = 20/74 (27%), Positives = 30/74 (40%), Gaps = 1/74 (1%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
KG ++K + K K +K K +K K G+ K+ +KK+KKK +
Sbjct: 67 GKGLLVLKKGRFGKNFLGCSNYPKC-RFTEKPKPKEKKCPKCGSRKLVEKKEKKKGVFLE 125
Query: 965 KKKKKKKKKKEAEK 978
K K E K
Sbjct: 126 CSNKPCCLKVENLK 139
Score = 33.7 bits (77), Expect = 0.14
Identities = 24/113 (21%), Positives = 38/113 (33%), Gaps = 22/113 (19%)
Query: 896 RKKRTRRKKKKGA---EKIRKKKKKKKEAEKMRKK-----KGAEKMKKKKKD-------- 939
KK + G K + +K AEK K KG +KK +
Sbjct: 28 VKKFGKYGIFLGCSNYPKCDYYEPEKAIAEKTGVKCPKCGKGLLVLKKGRFGKNFLGCSN 87
Query: 940 ------AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
EK K K + +K +KK+KKK + + +K + +
Sbjct: 88 YPKCRFTEKPKPKEKKCPKCGSRKLVEKKEKKKGVFLECSNKPCCLKVENLKN 140
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 36.4 bits (84), Expect = 0.050
Identities = 15/38 (39%), Positives = 23/38 (60%)
Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
++KK + KK+ + ++ K+KKKKK KKKK K
Sbjct: 67 ERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPK 104
Score = 33.7 bits (77), Expect = 0.30
Identities = 16/38 (42%), Positives = 19/38 (50%), Gaps = 2/38 (5%)
Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
KKG + KK+ +D K KKKK KKKK K
Sbjct: 69 KKGYQAGKKETEDW--FSPNQENKQKKKKKDKKKKSPK 104
Score = 30.2 bits (68), Expect = 4.6
Identities = 16/34 (47%), Positives = 22/34 (64%), Gaps = 5/34 (14%)
Query: 945 KKGAEKIGKKKKKK----KKKKKKKKKKKKKKKK 974
KKG + GKK+ + ++ K+KKKKK KKKK
Sbjct: 69 KKGYQA-GKKETEDWFSPNQENKQKKKKKDKKKK 101
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 37.4 bits (86), Expect = 0.050
Identities = 23/108 (21%), Positives = 42/108 (38%), Gaps = 20/108 (18%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK----- 955
RR+ E+ RK +K++ + + EK + +K + +GA K
Sbjct: 573 RREVFPVVEQRRKLAPRKRKEDFVTPSTSLEKSMDRILHGQKKRAEGAVVFEKPLEATEN 632
Query: 956 ---------------KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KKK ++ ++ K+ KKK E E ++ +EK
Sbjct: 633 FNPWLDRKMRRIKRIKKKAYRRIRRDKRLKKKMPEEENTQENHLGSEK 680
Score = 32.8 bits (74), Expect = 1.0
Identities = 16/97 (16%), Positives = 37/97 (38%), Gaps = 6/97 (6%)
Query: 897 KKRTRRKK--KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK----KKKDAEKMKKKGAEK 950
+K R +K + +K EA + KM++ KKK ++++ K
Sbjct: 603 EKSMDRILHGQKKRAEGAVVFEKPLEATENFNPWLDRKMRRIKRIKKKAYRRIRRDKRLK 662
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+++ ++ +KK+ +K+ E +
Sbjct: 663 KKMPEEENTQENHLGSEKKRHGGVPDILLKEIEVEDD 699
Score = 31.6 bits (71), Expect = 2.8
Identities = 20/123 (16%), Positives = 58/123 (47%), Gaps = 4/123 (3%)
Query: 866 ATKRLLHVAVSDVNDDVRREAVTALGF---ILFRKKRTRRKKKKGAEKIRKKKKKKKEAE 922
A K+ +A++ ++ + E L F ++FR++R ++ K K +K +K ++ +
Sbjct: 354 ALKKQEELALNKLSVEEVAERTRQLRFMRELMFREERKAKRVAKIKSKTYRKIRKNRKEK 413
Query: 923 KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+M +E ++ +K + + ++ ++ K +K ++ + ++ + +K
Sbjct: 414 EMALIPKSEDLENEKSEEARALERMTQR-HKNTSSWTRKMLERASHGEGTREAVNEQIRK 472
Query: 983 TEE 985
+E
Sbjct: 473 GDE 475
Score = 30.4 bits (68), Expect = 5.3
Identities = 20/76 (26%), Positives = 36/76 (47%), Gaps = 2/76 (2%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK--GAEKIGKKKKKKKKKKKKKKKKKKKK 972
KK+ E K E++ ++ + M++ E+ K+ K K K +K +K +K+
Sbjct: 353 SALKKQEELALNKLSVEEVAERTRQLRFMRELMFREERKAKRVAKIKSKTYRKIRKNRKE 412
Query: 973 KKEAEKMKKKTEEAEK 988
K+ A K + E EK
Sbjct: 413 KEMALIPKSEDLENEK 428
>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 158 and 187 amino
acids in length. This family includes the PcfB protein.
Length = 137
Score = 34.9 bits (81), Expect = 0.050
Identities = 19/59 (32%), Positives = 33/59 (55%), Gaps = 3/59 (5%)
Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG-AEKIGKKKKKKKKKKKKKKKKKKKKKK 974
K K+A+ + ++ KK E+ +K +K+ K K+++ KKK++K K K KKK
Sbjct: 81 KAKDADVI--NAAFKEFTAKKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137
Score = 32.6 bits (75), Expect = 0.31
Identities = 34/131 (25%), Positives = 61/131 (46%), Gaps = 37/131 (28%)
Query: 889 ALGFILFRKKRTRRKKKKGAEKIRKKKKK-----------------KKEAEK------MR 925
AL +L K+ +++ G + ++K K+ K+EA+K ++
Sbjct: 7 ALKAVLKDIKKQKQQPPHGKQSLKKLLKQGRGLSNIEITDENLKKFKREAKKYGVDYAVK 66
Query: 926 KKKGAEKMKK----KKKDAEKM----KKKGAEKIGKKKKKKK------KKKKKKKKKKKK 971
K K K K KDA+ + K+ A+K+ K++ +K KK K+++ KKK
Sbjct: 67 KDKNNGPPKYLVFFKAKDADVINAAFKEFTAKKLKKEQDEKPSVRKKLKKFKEEQAAKKK 126
Query: 972 KKKEAEKMKKK 982
++K+ K KKK
Sbjct: 127 ERKDKVKNKKK 137
>gnl|CDD|179522 PRK02998, prsA, peptidylprolyl isomerase; Reviewed.
Length = 283
Score = 36.5 bits (84), Expect = 0.051
Identities = 36/144 (25%), Positives = 70/144 (48%), Gaps = 9/144 (6%)
Query: 853 TIAMAYCGSGSNVATKRLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIR 912
+A++ CGS NV T ++ ++ +++ ++R++ + + + K K K E+ +
Sbjct: 16 VLALSACGSSDNVVTSKVGNITEKELSKELRQKYGESTLYQMVLSKALLDKYKVSDEEAK 75
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKD--AEKMKKKGA-EKIGKKKKKKKKKKKKKKKKK 969
K+ ++ K+ K E++ K +D EKMK + A EK K +K K K +
Sbjct: 76 KQVEEAKDKMGDNFKSTLEQVGLKNEDELKEKMKPEIAFEKAIKATVTEKDVKDNYKPEM 135
Query: 970 K------KKKKEAEKMKKKTEEAE 987
K K +K A+++K+K E
Sbjct: 136 KVSHILVKDEKTAKEVKEKVNNGE 159
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 37.2 bits (87), Expect = 0.051
Identities = 25/87 (28%), Positives = 41/87 (47%), Gaps = 18/87 (20%)
Query: 918 KKEAEKMRKKKGAEKMK---KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
E E+M K AE+ KK+K+ + K + E + +K K++ K + KKK +
Sbjct: 503 DDEIERMVKD--AEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVE 560
Query: 975 EA-------------EKMKKKTEEAEK 988
EA E+++ KTEE +K
Sbjct: 561 EAIEWLKEELEGEDKEEIEAKTEELQK 587
>gnl|CDD|234465 TIGR04092, LTA_DltD, D-alanyl-lipoteichoic acid biosynthesis
protein DltD. Members of this protein family are DltD,
part of the DltABCD system widely distributed in the
Firmicutes for D-alanylation of lipoteichoic acids. The
most common form of LTA, as in Staphylococcus aureus,
has a backbone of polyglycerolphosphate.
Length = 383
Score = 36.8 bits (86), Expect = 0.052
Identities = 31/109 (28%), Positives = 46/109 (42%), Gaps = 25/109 (22%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKK------KKKKEAEKMRKKKGAEKMKKKKK------DAE 941
L +K KK +K+ + +K+ + K K +K+KK K D E
Sbjct: 180 LLKKLYANGKKLSKFDKVYLEPYYRNLLEKEDDLFSRFKLKYKKKIKKAAKKLPETYDWE 239
Query: 942 KMKKKGAEKIGKKKK------------KKKKKKKKKKKKKKKKKKEAEK 978
++ KK AEK GKK+ K+ KKK KK K +K + K
Sbjct: 240 ELDKK-AEKEGKKRTTNNPFGIDNKYYNKRIKKKLKKLKGSQKNFDYLK 287
Score = 36.0 bits (84), Expect = 0.083
Identities = 27/104 (25%), Positives = 44/104 (42%), Gaps = 9/104 (8%)
Query: 894 LFRKKRTRRKKKKGAEKIRK--KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
L K + K+ A+++ + K K E + KK A K K D ++ +
Sbjct: 148 LKNPKISTETKRYAAKRLLELSVVKSDKILENLLKKLYANGKKLSKFDKVYLEPYYRNLL 207
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEA------EKMKKK-TEEAEK 988
K+ + K K KKK KK K+ E++ KK +E +K
Sbjct: 208 EKEDDLFSRFKLKYKKKIKKAAKKLPETYDWEELDKKAEKEGKK 251
Score = 35.7 bits (83), Expect = 0.12
Identities = 18/79 (22%), Positives = 36/79 (45%), Gaps = 3/79 (3%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
+ +K+ KK + +K+ + + +K+ D + + K KK KK KK +
Sbjct: 178 ENLLKKLYANGKKLSKFDKVYLEPYYRNLLEKEDD---LFSRFKLKYKKKIKKAAKKLPE 234
Query: 965 KKKKKKKKKKEAEKMKKKT 983
++ KK ++ KK+T
Sbjct: 235 TYDWEELDKKAEKEGKKRT 253
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 37.0 bits (85), Expect = 0.056
Identities = 15/58 (25%), Positives = 43/58 (74%), Gaps = 2/58 (3%)
Query: 929 GAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
+ K+ KK+++A + K+ AE+ K +++++++K+K+K++++++++EAE+ K + +
Sbjct: 573 ASSKLAKKREEAVEKAKREAEQ--KAREEREREKEKEKEREREREREAERAAKASSSS 628
Score = 33.1 bits (75), Expect = 0.85
Identities = 16/61 (26%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 886 AVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK-KGAEKMKKKKKDAEKMK 944
A T L F + +K+++ EK +++ ++K E+ R+K K E+ ++++++AE+
Sbjct: 563 ARTDLYFTPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAA 622
Query: 945 K 945
K
Sbjct: 623 K 623
>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
Length = 209
Score = 35.9 bits (83), Expect = 0.058
Identities = 19/66 (28%), Positives = 23/66 (34%), Gaps = 1/66 (1%)
Query: 920 EAEKMRKKKGAEKMKKKKKD-AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
A+ K K + K A K K E I KK KK K K +
Sbjct: 143 AAQAAAAPKAPAKPRAKSPRPAAKAAPKPTETITAKKAKKTAAAPKPTADKTATPAKKTT 202
Query: 979 MKKKTE 984
KKKT+
Sbjct: 203 KKKKTK 208
Score = 33.6 bits (77), Expect = 0.29
Identities = 18/63 (28%), Positives = 20/63 (31%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
K K R K K K E + K A+K K K KK KKK
Sbjct: 146 AAAAPKAPAKPRAKSPRPAAKAAPKPTETITAKKAKKTAAAPKPTADKTATPAKKTTKKK 205
Query: 974 KEA 976
K
Sbjct: 206 KTK 208
Score = 29.4 bits (66), Expect = 7.3
Identities = 16/67 (23%), Positives = 18/67 (26%), Gaps = 3/67 (4%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
K K R K + K E + KK K KK
Sbjct: 145 QAAAAPKAPAKPRAKSPRPAAKAA---PKPTETITAKKAKKTAAAPKPTADKTATPAKKT 201
Query: 960 KKKKKKK 966
KKKK K
Sbjct: 202 TKKKKTK 208
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 36.2 bits (84), Expect = 0.059
Identities = 18/47 (38%), Positives = 25/47 (53%)
Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K +K E + K+ K KKK KKK+K K AE K+ +AE+
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAER 51
Score = 35.4 bits (82), Expect = 0.093
Identities = 21/74 (28%), Positives = 35/74 (47%), Gaps = 1/74 (1%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
E + KK+K ++E K+ K KK K K+ K AE K+ +K +++ + K+
Sbjct: 1 ELLLKKRKARQELAVQVAKQAKAK-KKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRL 59
Query: 969 KKKKKKEAEKMKKK 982
KK K + K
Sbjct: 60 AKKPGKFYVPAEHK 73
Score = 34.3 bits (79), Expect = 0.23
Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 13/70 (18%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK-------------KKKKKKKKEAEK 978
+KK+K E + + KKK KKK+K K ++ + K+ A+K
Sbjct: 3 LLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKK 62
Query: 979 MKKKTEEAEK 988
K AE
Sbjct: 63 PGKFYVPAEH 72
Score = 33.9 bits (78), Expect = 0.34
Identities = 19/67 (28%), Positives = 32/67 (47%), Gaps = 6/67 (8%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
KK+K +++ + K+ +A+K KK K K K AE K E +++ + K+
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKK---KRKIYFKRAESFVK---EYRKAEREIIRLKR 58
Query: 963 KKKKKKK 969
KK K
Sbjct: 59 LAKKPGK 65
Score = 32.7 bits (75), Expect = 0.73
Identities = 18/73 (24%), Positives = 35/73 (47%), Gaps = 1/73 (1%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
+L +K++ R++ K K KKK + ++ K AE K+ + AE+ + +++
Sbjct: 2 LLLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAER-EIIRLKRLA 60
Query: 953 KKKKKKKKKKKKK 965
KK K + K
Sbjct: 61 KKPGKFYVPAEHK 73
Score = 31.9 bits (73), Expect = 1.1
Identities = 20/81 (24%), Positives = 35/81 (43%), Gaps = 7/81 (8%)
Query: 884 REAVTALGFILFRKKRTRRKKK---KGAEKI----RKKKKKKKEAEKMRKKKGAEKMKKK 936
AV +KK ++K+K K AE RK +++ +++ KK G + +
Sbjct: 12 ELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGKFYVPAE 71
Query: 937 KKDAEKMKKKGAEKIGKKKKK 957
K ++ KG I K +K
Sbjct: 72 HKLVFVIRIKGINGIPPKPRK 92
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 36.4 bits (83), Expect = 0.070
Identities = 22/73 (30%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 916 KKKKEAEKMR---KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
+ KKEAE R K G E ++++K ++ +K E I + K +++K+K +++K+
Sbjct: 132 QAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQ 191
Query: 973 KKEAEKMKKKTEE 985
K E EK K
Sbjct: 192 KTEQEKQKTSNIA 204
Score = 35.3 bits (80), Expect = 0.18
Identities = 23/87 (26%), Positives = 45/87 (51%), Gaps = 6/87 (6%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
R K G E ++++K ++E +K K+ + K + EK K + ++K+K +++
Sbjct: 143 RANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTE------QEKQKTEQE 196
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+K K E E+ K+KTE ++
Sbjct: 197 KQKTSNIANKNAIELEQEKQKTENEKQ 223
Score = 33.3 bits (75), Expect = 0.57
Identities = 22/83 (26%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 897 KKRTRRKKKKGAEKIR-KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+K + K+K E I + K E EK + ++ +K +++K+ + K A ++ ++K
Sbjct: 156 QKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEK 215
Query: 956 KKKKKKKKKKKKKKKKKKKEAEK 978
+K + +K+ K++K KEAE+
Sbjct: 216 QKTENEKQDLIKEQKDFIKEAEQ 238
Score = 31.0 bits (69), Expect = 3.5
Identities = 16/77 (20%), Positives = 41/77 (53%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
E+ +++K+K E E + K +++K+ E+ K+K ++ K K + +++
Sbjct: 155 EQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQE 214
Query: 969 KKKKKKEAEKMKKKTEE 985
K+K + E + + K+ ++
Sbjct: 215 KQKTENEKQDLIKEQKD 231
>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752). This
domain family is found in eukaryotes, and is typically
between 140 and 163 amino acids in length.
Length = 148
Score = 34.7 bits (80), Expect = 0.071
Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 3/67 (4%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK---KKKKKKKKKK 971
K K+ + E + + GA A K K+ A K+ + +KK+ K ++ +KK+KKK
Sbjct: 42 KAKRLQDEVLGVEAGASAPAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQHQKKQKKK 101
Query: 972 KKKEAEK 978
KK+E
Sbjct: 102 KKEEEND 108
Score = 28.5 bits (64), Expect = 10.0
Identities = 21/77 (27%), Positives = 34/77 (44%), Gaps = 10/77 (12%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK---DAEK-MKKKGAEKI 951
R K RK K+ EK R K ++ +K +KKK E+ + D EK + +
Sbjct: 70 RDKEMARKVKEYNEKKRGKSLVEQHQKKQKKKKKEEENDDPSRRPFDREKDL------AV 123
Query: 952 GKKKKKKKKKKKKKKKK 968
G K K ++++ K
Sbjct: 124 GGKISKAQRRELINKAA 140
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 36.5 bits (84), Expect = 0.072
Identities = 26/95 (27%), Positives = 43/95 (45%), Gaps = 4/95 (4%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAE----KMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
+ K K K A ++K++ +E K K A K K K K++G E++
Sbjct: 94 KAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEV 153
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
+++++ K+K K K K K A K+K EA
Sbjct: 154 TEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEA 188
Score = 35.0 bits (80), Expect = 0.20
Identities = 25/100 (25%), Positives = 42/100 (42%), Gaps = 16/100 (16%)
Query: 904 KKKGAEKIRKK-----KKKKKEAEKMR-------KKKGAEKMKKKKKDAEKMKKKGAEKI 951
K++ A + K K+K++ E++ K K A K K K K++G E++
Sbjct: 60 KRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEV 119
Query: 952 GKKKKKKKKKKK----KKKKKKKKKKKEAEKMKKKTEEAE 987
+++K K K K K K+K + EE E
Sbjct: 120 TEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEE 159
Score = 33.0 bits (75), Expect = 0.84
Identities = 25/104 (24%), Positives = 41/104 (39%)
Query: 885 EAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK 944
E VT + K K K A ++K++ +E + ++ EK K K A K K
Sbjct: 117 EEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAK 176
Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + + ++ +++K K K K A K K K
Sbjct: 177 AAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAAALAK 220
Score = 33.0 bits (75), Expect = 0.88
Identities = 22/104 (21%), Positives = 36/104 (34%), Gaps = 13/104 (12%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
+ +R+ + + K K K A + K A +K++ E +++ K K K
Sbjct: 109 AKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAK 168
Query: 958 KKKKKKKKKKKKKKKK-------------KEAEKMKKKTEEAEK 988
K K K+K +E K K K A K
Sbjct: 169 AAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAK 212
Score = 31.9 bits (72), Expect = 1.8
Identities = 26/96 (27%), Positives = 46/96 (47%), Gaps = 5/96 (5%)
Query: 896 RKKRTRRKKKKGAEKIR--KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
+ ++K++G E++ +K K +A K K A K+K++ E++ ++ E +
Sbjct: 104 KAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKE 163
Query: 954 KKKKKKKK--KKKKKKKKKKKKKEA-EKMKKKTEEA 986
K K K K K K+K EA E ++ TEE
Sbjct: 164 KAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEE 199
Score = 31.5 bits (71), Expect = 2.4
Identities = 17/85 (20%), Positives = 36/85 (42%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ ++K++G E++ +++++ + + K A K K +K + G +
Sbjct: 138 KAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTE 197
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMK 980
++K K K K K K K K
Sbjct: 198 EEKAKAKAKAAAAAKAKAAALAKQK 222
Score = 30.3 bits (68), Expect = 4.9
Identities = 26/104 (25%), Positives = 43/104 (41%)
Query: 885 EAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK 944
A A L ++KR ++ E+ K+K K +A K K A K+K +A +
Sbjct: 133 AAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGT 192
Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ E+ K K K K K K+K ++ +E K
Sbjct: 193 EEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDSGDEDAK 236
>gnl|CDD|219392 pfam07382, HC2, Histone H1-like nucleoprotein HC2. This family
contains the bacterial histone H1-like nucleoprotein HC2
(approximately 200 residues long), which seems to be
found mostly in Chlamydia. HC2 functions in DNA
condensation, although it has been suggested that it
also has other roles.
Length = 187
Score = 35.5 bits (81), Expect = 0.075
Identities = 30/90 (33%), Positives = 35/90 (38%), Gaps = 2/90 (2%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
KK RK KK KK A+K KK K KK K K A+K+ KK
Sbjct: 41 KKPAARKTVAKKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKKAVAK--KATAKKVAAKKV 98
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
KK KK KK K+A K +
Sbjct: 99 VAKKTVAKKAAAKKPAAKKAVAKKAVARKP 128
Score = 35.1 bits (80), Expect = 0.093
Identities = 34/98 (34%), Positives = 45/98 (45%), Gaps = 5/98 (5%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM----KKKGAEKI 951
+KKR+ +K +RK KK A+K +K A K +K K KK A+K
Sbjct: 5 QKKRSSKKTAAKKAAVRKPAAKKAAAKKTVVRKVAAKKPAARKTVAKKTVAAKKPAAKKA 64
Query: 952 GKKK-KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK KK KK KK KK A+K+ K A+K
Sbjct: 65 AKKAVAKKVVAKKPVAKKAVAKKATAKKVAAKKVVAKK 102
Score = 32.8 bits (74), Expect = 0.55
Identities = 34/87 (39%), Positives = 38/87 (43%), Gaps = 7/87 (8%)
Query: 907 GAEKIRKKKK---KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK--KKKKK 961
GA+K R KK KK K KK A K +K A KK A K KK KK
Sbjct: 3 GAQKKRSSKKTAAKKAAVRKPAAKKAAAKKTVVRKVA--AKKPAARKTVAKKTVAAKKPA 60
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK KK KK A+K K A+K
Sbjct: 61 AKKAAKKAVAKKVVAKKPVAKKAVAKK 87
Score = 32.4 bits (73), Expect = 0.66
Identities = 35/92 (38%), Positives = 43/92 (46%), Gaps = 7/92 (7%)
Query: 904 KKKGAEKIRKKK---KKKKEAEKMRKKKGAEKMKKKKKDAEKM--KKKGAEKIGKKKKKK 958
+K A+K +K KK A+K KK A+K KK A+K KK A+K KK
Sbjct: 36 RKVAAKKPAARKTVAKKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKKAVAKKATAKKVAA 95
Query: 959 KK--KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK KK KK KK A+K K A K
Sbjct: 96 KKVVAKKTVAKKAAAKKPAAKKAVAKKAVARK 127
Score = 32.1 bits (72), Expect = 0.83
Identities = 28/92 (30%), Positives = 33/92 (35%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
+K +K +RK KK A K KK K K A K K
Sbjct: 21 RKPAAKKAAAKKTVVRKVAAKKPAARKTVAKKTVAAKKPAAKKAAKKAVAKKVVAKKPVA 80
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK KK KK KK A+K K A+K
Sbjct: 81 KKAVAKKATAKKVAAKKVVAKKTVAKKAAAKK 112
Score = 31.7 bits (71), Expect = 1.2
Identities = 32/83 (38%), Positives = 40/83 (48%), Gaps = 4/83 (4%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMR--KKKGAEKMKKKKKDAEKM--KKKGAEKIGK 953
K+T KK A+K KK KK K KK A+K KK A+K+ KK A+K
Sbjct: 51 KKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKKAVAKKATAKKVAAKKVVAKKTVAKKAAA 110
Query: 954 KKKKKKKKKKKKKKKKKKKKKEA 976
KK KK KK +K K+A
Sbjct: 111 KKPAAKKAVAKKAVARKPAAKKA 133
>gnl|CDD|153342 cd07658, F-BAR_NOSTRIN, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase
TRaffic INducer (NOSTRIN). F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Nitric Oxide Synthase TRaffic
INducer (NOSTRIN) is expressed in endothelial and
epithelial cells and is involved in the regulation,
trafficking and targeting of endothelial NOS (eNOS).
NOSTRIN facilitates the endocytosis of eNOS by
coordinating the functions of dynamin and the
Wiskott-Aldrich syndrome protein (WASP). Increased
expression of NOSTRIN may be correlated to preeclampsia.
NOSTRIN contains an N-terminal F-BAR domain and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
The F-BAR domain of NOSTRIN is necessary and sufficient
for its membrane association and is responsible for its
subcellular localization.
Length = 239
Score = 35.8 bits (83), Expect = 0.080
Identities = 23/94 (24%), Positives = 41/94 (43%), Gaps = 3/94 (3%)
Query: 898 KRTRRKKKKGAEKIRKKKKK---KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
K + +E+I+ KKK +E EK++ + K K+ KK AE K+
Sbjct: 113 KAAKLLTDWRSEQIKVKKKLHGLARENEKLQDQVEDNKQSCTKQKMLNKLKKSAEVQDKE 172
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K + K+KK ++ + K + E + E
Sbjct: 173 DEKLEAKRKKGEESRLKAENEYYTCCVRLERLRL 206
Score = 33.1 bits (76), Expect = 0.59
Identities = 20/85 (23%), Positives = 43/85 (50%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
+ KKK R+ +K + + E ++ +KM K K + +++ K EK+ K+KK ++
Sbjct: 127 KVKKKLHGLARENEKLQDQVEDNKQSCTKQKMLNKLKKSAEVQDKEDEKLEAKRKKGEES 186
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEA 986
+ K + + E+++ + E A
Sbjct: 187 RLKAENEYYTCCVRLERLRLEWESA 211
>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain. This region has been
identified as the binding site of the SRP72 protein to
SRP RNA.
Length = 57
Score = 32.4 bits (74), Expect = 0.082
Identities = 15/29 (51%), Positives = 20/29 (68%)
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
A+K KK +K G ++KKKK KKK+K K
Sbjct: 2 AKKPKKIAKKKPGTEQKKKKSAKKKRKPK 30
Score = 32.0 bits (73), Expect = 0.12
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
K +KI KKK ++KKKK KKK+K K
Sbjct: 3 KKPKKIAKKKPGTEQKKKKSAKKKRKPK 30
Score = 28.6 bits (64), Expect = 1.9
Identities = 14/28 (50%), Positives = 18/28 (64%)
Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
A+K K KKK ++KKKK KKK+K
Sbjct: 1 EAKKPKKIAKKKPGTEQKKKKSAKKKRK 28
Score = 27.8 bits (62), Expect = 4.5
Identities = 13/28 (46%), Positives = 16/28 (57%)
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
KK + KKK + KKK KKK+K K
Sbjct: 3 KKPKKIAKKKPGTEQKKKKSAKKKRKPK 30
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 34.3 bits (79), Expect = 0.090
Identities = 27/107 (25%), Positives = 48/107 (44%), Gaps = 17/107 (15%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD---AEKMKKKGAE 949
IL R K+ + KKK + K +KK ++R +K K + K + K++
Sbjct: 23 ILSRSKKLLKAKKKLKSE----KLEKKAKRQLRAEKRQALEKGRVKPVLPEDLEKERRLR 78
Query: 950 KIGKK----------KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
K+ +K +KK +K K+ KK + K E +K+ ++E
Sbjct: 79 KVAQKGVVKLFNAVRAAQKKTEKAVKEAGKKARVKREELLKEVSKEK 125
Score = 31.9 bits (73), Expect = 0.49
Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 4/53 (7%)
Query: 932 KMKKKKKDAEKMKK----KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
K+ K A K K + + + KKK K +K +KK K++ + +K K
Sbjct: 8 KILASKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK 60
Score = 31.2 bits (71), Expect = 0.98
Identities = 13/53 (24%), Positives = 23/53 (43%), Gaps = 2/53 (3%)
Query: 913 KKKKKKKEAEKM--RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
K K + + R KK + KK K + + K K + K++ +K + K
Sbjct: 12 SKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVK 64
Score = 30.0 bits (68), Expect = 2.3
Identities = 15/53 (28%), Positives = 24/53 (45%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
A+ K K+ + + KK K KKK K +K +KK K + +K+
Sbjct: 3 ADAFAKILASKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKR 55
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 34.6 bits (79), Expect = 0.090
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
M++ E + K++K++ K + + K KKKK+K +KK K KKKE EK K
Sbjct: 1 MKETSYEEALAKQRKESGKTQSQ--------KSDKKKKEKVSEKKGKSKKKE-EKPNGKI 51
Query: 984 EEAE 987
E E
Sbjct: 52 PEHE 55
Score = 31.9 bits (72), Expect = 0.89
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
MK+ E+ K++K+ K + +K KKKK+K +EK K ++ EK
Sbjct: 1 MKETSYEEALAKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEK 46
Score = 31.1 bits (70), Expect = 1.2
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 10/49 (20%)
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
K++KE+ K + +K KKKK+ K +EK GK KKK++K K
Sbjct: 12 KQRKESGKTQSQKS----DKKKKE------KVSEKKGKSKKKEEKPNGK 50
Score = 28.8 bits (64), Expect = 9.6
Identities = 15/37 (40%), Positives = 24/37 (64%), Gaps = 4/37 (10%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
K++K + K + +K KK+ EK+ +KKG K KKK+
Sbjct: 12 KQRKESGKTQSQKSDKKKKEKVSEKKG----KSKKKE 44
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 34.3 bits (79), Expect = 0.091
Identities = 25/92 (27%), Positives = 49/92 (53%), Gaps = 3/92 (3%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA-EKMKKKGAEKIGKKK 955
K + +KK+ RK+ +K K + + AE+ K++++ A E+ +K EK G +
Sbjct: 27 IKAAKEEKKQAKLSERKENRKPKYIGSLLEA--AERRKREREIAEERKLQKEREKEGDEF 84
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K+K KK+ ++ ++ E+ +K+ EE E
Sbjct: 85 ADKEKFVTSAYKKQLEENRKLEEEEKEREELE 116
Score = 33.2 bits (76), Expect = 0.19
Identities = 15/88 (17%), Positives = 42/88 (47%), Gaps = 14/88 (15%)
Query: 912 RKKKKKKKEAEK--------------MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
+K +K+ ++A + ++ K +K K + E K K + + ++
Sbjct: 1 KKARKEAQKALEEDPSIYDYDEVYDSIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAER 60
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
+K++++ +++K +K++E E + +E
Sbjct: 61 RKREREIAEERKLQKEREKEGDEFADKE 88
Score = 29.7 bits (67), Expect = 3.5
Identities = 12/56 (21%), Positives = 27/56 (48%), Gaps = 9/56 (16%)
Query: 941 EKMKKKGAEKIGKKKKKKKKKKK-------KKKKKKKKKKKEA--EKMKKKTEEAE 987
+ +K EK K ++K+ +K + +++K+++E E+ +K E E
Sbjct: 25 DSIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREKE 80
Score = 29.3 bits (66), Expect = 4.3
Identities = 15/64 (23%), Positives = 32/64 (50%), Gaps = 10/64 (15%)
Query: 934 KKKKKDAEKMKKKGA---------EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
KK +K+A+K ++ + I K K++KK+ K ++K+ +K K + + E
Sbjct: 1 KKARKEAQKALEEDPSIYDYDEVYDSI-KAAKEEKKQAKLSERKENRKPKYIGSLLEAAE 59
Query: 985 EAEK 988
++
Sbjct: 60 RRKR 63
>gnl|CDD|203489 pfam06644, ATP11, ATP11 protein. This family consists of several
eukaryotic ATP11 proteins. In Saccharomyces cerevisiae,
expression of functional F1-ATPase requires two proteins
encoded by the ATP11 and ATP12 genes. Atp11p is a
molecular chaperone of the mitochondrial matrix that
participates in the biogenesis pathway to form F1, the
catalytic unit of the ATP synthase.
Length = 250
Score = 35.4 bits (82), Expect = 0.096
Identities = 27/91 (29%), Positives = 37/91 (40%), Gaps = 8/91 (8%)
Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK--KKKKKKK 963
K K+ +K K+ R KK + +KK+ + K G K K K K K
Sbjct: 3 KYRSKLLQKAKESGLEFIERLKKALKDKIEKKEFSAKKPPTGPSKQASKFKTLKPPKPAD 62
Query: 964 KKKKKKKKKKK-----EAEKMKKKT-EEAEK 988
KKK K K + EK+K + EE E
Sbjct: 63 KKKPFDKPFKPLSSILDVEKIKDLSAEEIEF 93
>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
module that binds membranes and detects membrane
curvature. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions including organelle
biogenesis, membrane trafficking or remodeling, and cell
division and migration. Mutations in BAR containing
proteins have been linked to diseases and their
inactivation in cells leads to altered membrane
dynamics. A BAR domain with an additional N-terminal
amphipathic helix (an N-BAR) can drive membrane
curvature. These N-BAR domains are found in amphiphysins
and endophilins, among others. BAR domains are also
frequently found alongside domains that determine lipid
specificity, such as the Pleckstrin Homology (PH) and
Phox Homology (PX) domains which are present in beta
centaurins (ACAPs and ASAPs) and sorting nexins,
respectively. A FES-CIP4 Homology (FCH) domain together
with a coiled coil region is called the F-BAR domain and
is present in Pombe/Cdc15 homology (PCH) family
proteins, which include Fes/Fes tyrosine kinases, PACSIN
or syndapin, CIP4-like proteins, and srGAPs, among
others. The Inverse (I)-BAR or IRSp53/MIM homology
Domain (IMD) is found in multi-domain proteins, such as
IRSp53 and MIM, that act as scaffolding proteins and
transducers of a variety of signaling pathways that link
membrane dynamics and the underlying actin cytoskeleton.
BAR domains form dimers that bind to membranes, induce
membrane bending and curvature, and may also be involved
in protein-protein interactions. The I-BAR domain
induces membrane protrusions in the opposite direction
compared to classical BAR and F-BAR domains, which
produce membrane invaginations. BAR domains that also
serve as protein interaction domains include those of
arfaptin and OPHN1-like proteins, among others, which
bind to Rac and Rho GAP domains, respectively.
Length = 194
Score = 35.1 bits (81), Expect = 0.10
Identities = 20/73 (27%), Positives = 40/73 (54%), Gaps = 7/73 (9%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
KK KE +K RKK + K + D + ++K K +KKKK K + +++ ++ K
Sbjct: 89 LKKDLKEIKKRRKK-----LDKARLDYDAAREK--LKKLRKKKKDSSKLAEAEEELQEAK 141
Query: 974 KEAEKMKKKTEEA 986
++ E+++++ E
Sbjct: 142 EKYEELREELIED 154
Score = 30.5 bits (69), Expect = 2.7
Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK K+ +K +KK +K ++K KK +KKKK K AE ++ E EK
Sbjct: 90 KKDLKEIKKRRKK-LDKARLDYDAAREKLKKLRKKKKDSSKLAEAEEELQEAKEK 143
>gnl|CDD|236172 PRK08173, PRK08173, DNA topoisomerase III; Validated.
Length = 862
Score = 36.2 bits (84), Expect = 0.10
Identities = 20/89 (22%), Positives = 28/89 (31%), Gaps = 20/89 (22%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA-----EKMKKKKKDAEKMKKKGAE 949
F+ RT R K A +++ G E + K A+K K
Sbjct: 788 FKSSRTGRNFK---------------AYLVKQPDGKIGFEFEPREPKAAAAKKTAAKATA 832
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
K +K KK KK KK +
Sbjct: 833 AAATKAEKAAAKKAPAKKTAAKKTAARKT 861
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 36.1 bits (83), Expect = 0.10
Identities = 23/73 (31%), Positives = 30/73 (41%), Gaps = 3/73 (4%)
Query: 117 SATQAFLASVLQELKTCA-PIPDATHVVVK-VPVEGAPTPATPDTEAP-TPAPGTESGPS 173
S T LA ++ L T A P + VPV + +TP AP P S PS
Sbjct: 587 SDTAEALAGAIETLLTQASAQPAGLSLPAPAVPVNASTPASTPPPLAPQEPPQPGTSAPS 646
Query: 174 STTPGPETKTLES 186
T P+ K + S
Sbjct: 647 LETSLPQQKPVLS 659
>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
[Translation, ribosomal structure and biogenesis].
Length = 335
Score = 35.7 bits (83), Expect = 0.10
Identities = 18/89 (20%), Positives = 36/89 (40%), Gaps = 9/89 (10%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRK----KKG--AEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
KKG I K E+++ KKG + +KK K + ++K ++G +
Sbjct: 10 AKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERK---EVGALINE 66
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
KK+ + + + + A ++ E
Sbjct: 67 LKKEVEDAITELTPELEAAGLWERLAFEK 95
Score = 33.0 bits (76), Expect = 0.74
Identities = 14/65 (21%), Positives = 26/65 (40%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
+ + KKG KK K + RK+ GA + KK+ + + + E ++
Sbjct: 31 KVKYLGKKGVLTDLLKKLGKLSPLEERKEVGALINELKKEVEDAITELTPELEAAGLWER 90
Query: 959 KKKKK 963
+K
Sbjct: 91 LAFEK 95
>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
region. Members of this family have very varied
localisations within the eukaryotic cell. pinin is known
to localise at the desmosomes and is implicated in
anchoring intermediate filaments to the desmosomal
plaque. SDK2/3 is a dynamically localised nuclear
protein thought to be involved in modulation of
alternative pre-mRNA splicing. memA is a tumour marker
preferentially expressed in human melanoma cell lines. A
common feature of the members of this family is that
they may all participate in regulating protein-protein
interactions.
Length = 131
Score = 34.0 bits (78), Expect = 0.11
Identities = 22/98 (22%), Positives = 49/98 (50%), Gaps = 19/98 (19%)
Query: 897 KKRTRR----------KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
KKR RR K + ++ K+K++ E E +K E+ K+++ E+++K+
Sbjct: 6 KKRNRRMFGLLLGTLQKFSQEESRLTSKEKRRAEIE----QKLEEQEKQER---EELRKE 58
Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKKEA--EKMKKK 982
E ++++K+ + +K ++K + +K +E E
Sbjct: 59 KRELFEERRRKQLELRKLEQKMEDEKLQETWHEHNLAL 96
Score = 30.9 bits (70), Expect = 1.4
Identities = 18/94 (19%), Positives = 45/94 (47%), Gaps = 2/94 (2%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK--KGAEKIGKK 954
+ T ++K++ + + ++++K+E E++RK+K ++++K E K K ++ ++
Sbjct: 28 SRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKMEDEKLQE 87
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ + K K + + T E EK
Sbjct: 88 TWHEHNLALANFIRTKTKPHLYYRPWRLTPETEK 121
>gnl|CDD|201540 pfam00992, Troponin, Troponin. Troponin (Tn) contains three
subunits, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). this Pfam contains members of
the TnT subunit. Troponin is a complex of three
proteins, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). The troponin complex
regulates Ca++ induced muscle contraction. This family
includes troponin T and troponin I. Troponin I binds to
actin and troponin T binds to tropomyosin.
Length = 131
Score = 34.1 bits (79), Expect = 0.11
Identities = 20/71 (28%), Positives = 29/71 (40%), Gaps = 6/71 (8%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
+ + + E+ EK+ KK K+ E +KKK + GK KK KK +K K
Sbjct: 55 HARIDRLDEERYDI---EEKVAKKDKEIEDLKKKVNDLRGKFKKPTLKKVRKSADAMLKA 111
Query: 973 ---KKEAEKMK 980
K M
Sbjct: 112 LLGSKHKVSMD 122
>gnl|CDD|217011 pfam02388, FemAB, FemAB family. The femAB operon codes for two
nearly identical approximately 50-kDa proteins involved
in the formation of the Staphylococcal pentaglycine
interpeptide bridge in peptidoglycan. These proteins are
also considered as a factor influencing the level of
methicillin resistance.
Length = 408
Score = 35.7 bits (83), Expect = 0.11
Identities = 15/58 (25%), Positives = 32/58 (55%)
Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+ EK++K +KD K+++K + KKK K + +++ +K+ EA+++ K
Sbjct: 243 YLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAK 300
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 36.2 bits (84), Expect = 0.11
Identities = 20/97 (20%), Positives = 39/97 (40%), Gaps = 13/97 (13%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG--------- 947
+ R KK AEK+ + K+ E+ K+ E+ +++ A
Sbjct: 864 INQYRAKKLDTAEKLEELYILAKK-EEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENG 922
Query: 948 ---AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
EK+ K+K ++K K K K++ + K ++
Sbjct: 923 TELVEKLAKEKSLREKNKDDWKAKEEVEAKLKAFFRR 959
>gnl|CDD|192718 pfam11176, DUF2962, Protein of unknown function (DUF2962). This
eukaryotic family of proteins has no known function.
Length = 155
Score = 34.2 bits (79), Expect = 0.12
Identities = 15/73 (20%), Positives = 30/73 (41%), Gaps = 23/73 (31%)
Query: 910 KIRKKKKKKKEA--EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
K+ K KKK+ K RK A+++ + + K +KKK +
Sbjct: 1 KVTKNIAKKKKVLHPKSRK---------------------AKQLNRATLRDDKLEKKKAE 39
Query: 968 KKKKKKKEAEKMK 980
+ +K+ E +++
Sbjct: 40 RAEKRSLELDRLF 52
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 34.8 bits (80), Expect = 0.13
Identities = 17/33 (51%), Positives = 21/33 (63%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
KKKK +KKK KK KKKKKE E++ + E
Sbjct: 51 SKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAE 83
Score = 34.0 bits (78), Expect = 0.23
Identities = 23/68 (33%), Positives = 38/68 (55%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
+I KK KK+ RKKK +K KKKKK+ E++ + AE++ ++ ++ KK +
Sbjct: 44 EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELP 103
Query: 970 KKKKKEAE 977
K + AE
Sbjct: 104 KVQNTAAE 111
Score = 34.0 bits (78), Expect = 0.24
Identities = 17/35 (48%), Positives = 20/35 (57%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K KKKK +KKK KK KKK+ EK + AE
Sbjct: 49 KTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAE 83
Score = 33.6 bits (77), Expect = 0.26
Identities = 16/39 (41%), Positives = 22/39 (56%)
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+I KK KKKK +KKK KK K+ +K K++ E
Sbjct: 44 EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAA 82
Score = 33.6 bits (77), Expect = 0.31
Identities = 18/39 (46%), Positives = 22/39 (56%)
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K KKKK +KKK KK KKKKK++ E + EE
Sbjct: 49 KTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSD 87
Score = 32.1 bits (73), Expect = 1.0
Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
D E KK ++K K +KKK KK KKKKK+K++ E ++ ++E+
Sbjct: 42 DVEIPSKKTSKK-KKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEE 90
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 35.7 bits (82), Expect = 0.13
Identities = 20/85 (23%), Positives = 46/85 (54%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
++K K+ + + K+ K + ++ +K+ AE+ K+KK++ E + A+ +++ +
Sbjct: 471 KQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAES 530
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEE 985
K+ K+ + + KK +K K EE
Sbjct: 531 LKQAKQDLEMEIKKLEHDLKLKEEE 555
Score = 34.9 bits (80), Expect = 0.26
Identities = 22/95 (23%), Positives = 47/95 (49%), Gaps = 3/95 (3%)
Query: 896 RKKRTRRKKKKGAEKI---RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
++ ++ + AEK KK+KK++E R A +++ ++ K K+ E
Sbjct: 483 KRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEI 542
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
KK + K K+++ + +K+ +E K ++ +E E
Sbjct: 543 KKLEHDLKLKEEECRMLEKEAQELRKYQESEKETE 577
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 35.7 bits (83), Expect = 0.14
Identities = 16/44 (36%), Positives = 28/44 (63%)
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
E MK K+ ++KK KI ++K KK++KK+K K+++K+
Sbjct: 635 ELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
Score = 34.6 bits (80), Expect = 0.30
Identities = 15/78 (19%), Positives = 32/78 (41%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
KK K+ E E+ ++ E + K +++G+ + ++K+ K +
Sbjct: 222 FTKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLD 281
Query: 971 KKKKEAEKMKKKTEEAEK 988
K + EK ++ EK
Sbjct: 282 KLEILKEKKDEELFWFEK 299
Score = 31.5 bits (72), Expect = 2.8
Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 2/78 (2%)
Query: 902 RKKKKGAEKIRKKKK-KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
KK K + ++++ E E + KG ++ ++ + E EK K +
Sbjct: 223 TKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTE-EDPSLFSEEKEDPDKTEDLD 281
Query: 961 KKKKKKKKKKKKKKEAEK 978
K + K+KK ++ EK
Sbjct: 282 KLEILKEKKDEELFWFEK 299
Score = 31.1 bits (71), Expect = 3.5
Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 7/65 (10%)
Query: 931 EKMKKKK----KDAEKMKKKGAEK--IGKKKK-KKKKKKKKKKKKKKKKKKEAEKMKKKT 983
++++K +E MK K + I KKK+ K ++K KK++KK+K + K++
Sbjct: 621 DRIEKTLFFSFDISELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKERI 680
Query: 984 EEAEK 988
E AE
Sbjct: 681 EIAET 685
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 35.6 bits (82), Expect = 0.14
Identities = 18/83 (21%), Positives = 34/83 (40%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
+ KK I+ +KK ++ + KK + K + KKK + K
Sbjct: 369 ENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNNK 428
Query: 963 KKKKKKKKKKKKEAEKMKKKTEE 985
++ KK++ +KE K +K +
Sbjct: 429 EQDIKKEELLEKEFVKKSEKIPK 451
>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein. YkyA is a
family of proteins containing a lipoprotein signal and a
hydrolase domain. It is similar to cell wall binding
proteins and might also be recognisable by a host immune
defence system. It is thus likely to belong to pathways
important for pathogenicity.
Length = 205
Score = 34.6 bits (80), Expect = 0.14
Identities = 25/101 (24%), Positives = 51/101 (50%), Gaps = 15/101 (14%)
Query: 903 KKKKGAEKIRKK-----KKKKKEAEKMRK--KKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+++K E+ +KK KK+++ +++ + K E +KK A K + + K+K
Sbjct: 30 EQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQALANVDKREKLLKKEK 89
Query: 956 KKKKKKKKKKKKKK--------KKKKKEAEKMKKKTEEAEK 988
+ +K +++ K K KK KK+A+++ K +E K
Sbjct: 90 ESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVMKERYK 130
Score = 32.3 bits (74), Expect = 0.94
Identities = 22/84 (26%), Positives = 44/84 (52%), Gaps = 7/84 (8%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
KK EKI + KK KK+A+++ + MK++ K +K+ K + + +K+ + K K
Sbjct: 103 KKYIEKI-EDKKLKKKAKQL-----VKVMKERYKSYDKLYKAYKKALNLEKELYEYLKDK 156
Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
K+ +K + + + E+ +K
Sbjct: 157 DLTLKELDEK-IKAVNQSYEKIQK 179
Score = 31.5 bits (72), Expect = 1.6
Identities = 26/93 (27%), Positives = 48/93 (51%), Gaps = 5/93 (5%)
Query: 901 RRKKKKGAEK--IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK-MKKKGA--EKIGKKK 955
+R+K EK I K +++ K A+K +K +K+KKK K K MK++ +K+ K
Sbjct: 80 KREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVMKERYKSYDKLYKAY 139
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK +K+ + K K +++ +K + +
Sbjct: 140 KKALNLEKELYEYLKDKDLTLKELDEKIKAVNQ 172
Score = 30.4 bits (69), Expect = 3.3
Identities = 24/83 (28%), Positives = 40/83 (48%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
+K A K++K + EK +K E+ K KK EK++ K +K K+ K
Sbjct: 66 AIKKLSDQALANVDKREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVM 125
Query: 960 KKKKKKKKKKKKKKKEAEKMKKK 982
K++ K K K K+A ++K+
Sbjct: 126 KERYKSYDKLYKAYKKALNLEKE 148
Score = 30.0 bits (68), Expect = 4.4
Identities = 27/93 (29%), Positives = 46/93 (49%), Gaps = 13/93 (13%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI---GKKKKKKKK 960
KK E I ++ +K E EK E+ +KK ++ EK +++ ++I GK + K
Sbjct: 14 GKKPEESIYEQLEKAAEQEKP-----FEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIK 68
Query: 961 KKKKK-----KKKKKKKKKEAEKMKKKTEEAEK 988
K + K++K KKE E ++K EE +
Sbjct: 69 KLSDQALANVDKREKLLKKEKESIEKSEEEFKS 101
Score = 29.2 bits (66), Expect = 7.1
Identities = 18/71 (25%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA--EKIGKKKKKKKK 960
K KK A+++ K K++ ++ K + + +K+ E +K K +++ +K K +
Sbjct: 113 KLKKKAKQLVKVMKERYKSYDKLYKAYKKALNLEKELYEYLKDKDLTLKELDEKIKAVNQ 172
Query: 961 KKKKKKKKKKK 971
+K +K KKK
Sbjct: 173 SYEKIQKAKKK 183
>gnl|CDD|216134 pfam00816, Histone_HNS, H-NS histone family.
Length = 92
Score = 32.6 bits (75), Expect = 0.14
Identities = 15/64 (23%), Positives = 26/64 (40%), Gaps = 8/64 (12%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG--AEKIGKKKKKKKKKKKKKK 966
I K +K+ + A K + +++ E + G E++ KKK KKK+
Sbjct: 5 ALIEKLEKEIEAARKAERAAALAEIR------ELAAEYGLTLEELLGGAAKKKAKKKRAP 58
Query: 967 KKKK 970
K
Sbjct: 59 VPAK 62
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 34.0 bits (78), Expect = 0.15
Identities = 13/55 (23%), Positives = 29/55 (52%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ E+ +++ ++ ++ + K + K+KKK++ K + EK K KTE +
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146
Score = 34.0 bits (78), Expect = 0.17
Identities = 13/50 (26%), Positives = 25/50 (50%)
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E+ + ++ + K + K+KKK++ K K +KE K + K + K
Sbjct: 101 TEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150
Score = 32.4 bits (74), Expect = 0.58
Identities = 16/71 (22%), Positives = 35/71 (49%), Gaps = 4/71 (5%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
KI ++ + E+ +++ ++ +++ K + K KK++ K K +K+K K
Sbjct: 86 KIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEK----KKREVPKPKTEKEKPKT 141
Query: 970 KKKKKEAEKMK 980
+ KK + K K
Sbjct: 142 EPKKPKPSKPK 152
>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
All proteins in this family for whcih functions are
known are G-T or G-U mismatch glycosylases that function
in base excision repair. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University). Used 2pf model [DNA metabolism,
DNA replication, recombination, and repair].
Length = 328
Score = 35.0 bits (80), Expect = 0.15
Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 5/87 (5%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK---MKKKGAEKIGKKKK 956
+++ G ++ +EA K RK+K + + +K KK G K+K+
Sbjct: 35 EEQEETSGVPAKAPTQEPSEEAPKFRKRKPRSNEPYRPVEPKKPSDSKKSGKSTKSKEKQ 94
Query: 957 KK--KKKKKKKKKKKKKKKKEAEKMKK 981
+K K K K+K + EAE + K
Sbjct: 95 EKITDKFKVKRKVDRFNGVSEAELLTK 121
>gnl|CDD|212010 cd11620, HR1_PKC-like_2_fungi, Second Protein kinase C-related
kinase homology region 1 (HR1) Rho-binding domain of
fungal Protein Kinase C-like proteins. This subfamily
is composed of fungal PKC-like proteins including Pkc1p
from Saccharomyces cerevisiae, and Pck1p and Pck2p from
Schizosaccharomyces pombe. The yeast PKC-like proteins
play a critical role in regulating cell wall
biosynthesis and maintaining cell wall integrity. They
contain two HR1 domains, C2 and C1 domains, and a kinase
domain. This model characterizes the second HR1 domain.
HR1 domains are anti-parallel coiled-coil (ACC) domains
that bind small GTPases from the Rho family. The HR1
domains of Pck1p and Pck2p interact with GTP-bound Rho1p
and Rho2p.
Length = 72
Score = 31.9 bits (73), Expect = 0.16
Identities = 17/54 (31%), Positives = 29/54 (53%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
K++ +K+ K+ EKM + AE K+ DAE + + +KI KK K+ +
Sbjct: 17 KLQVEKQYKEGIEKMARLYQAEGDKRSIADAENKRVESEQKIQLLKKALKRYQD 70
>gnl|CDD|225468 COG2916, Hns, DNA-binding protein H-NS [General function prediction
only].
Length = 128
Score = 33.6 bits (77), Expect = 0.16
Identities = 15/69 (21%), Positives = 31/69 (44%), Gaps = 2/69 (2%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG--AEKIGKKKKKKKKKKKKK 965
EK + ++++E E + E+ +K E + K G E++ + K ++K K+
Sbjct: 24 LEKEEQVVQERQEEEAAAIAEIEERQEKYGTIRELLIKDGITPEELLNTEVTKPRRKAKR 83
Query: 966 KKKKKKKKK 974
+ K K
Sbjct: 84 AARPAKYKY 92
>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 594
Score = 35.4 bits (82), Expect = 0.16
Identities = 20/103 (19%), Positives = 34/103 (33%), Gaps = 24/103 (23%)
Query: 903 KKKKGAEKIRKKKKKKKEAEK--MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK- 959
K +KI+K KE E ++ ++ EK+ K + EK+ K + I + + K
Sbjct: 474 AVLKLLQKIKKNNDFIKEFETDALKLQEILEKVDSKSEKLEKISAK-IDNIKELFDESKM 532
Query: 960 --------------------KKKKKKKKKKKKKKKEAEKMKKK 982
K K K+ K K +
Sbjct: 533 SILQEILQPALHHIELNIARIYKLNIKDKEDFLNKLYIKAHLE 575
Score = 35.0 bits (81), Expect = 0.23
Identities = 18/86 (20%), Positives = 35/86 (40%), Gaps = 6/86 (6%)
Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
KG ++ K+ +K +K + K + + EK K +KI K K+ +
Sbjct: 437 KGTIELPFKEVCEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDA 496
Query: 966 KKKKK------KKKKEAEKMKKKTEE 985
K ++ K ++ EK+ K +
Sbjct: 497 LKLQEILEKVDSKSEKLEKISAKIDN 522
Score = 35.0 bits (81), Expect = 0.24
Identities = 26/114 (22%), Positives = 43/114 (37%), Gaps = 21/114 (18%)
Query: 888 TALGFILFRKKRTRRKKKKGAEKIRKKKKKK-----------KEAEKMRKKKGAEKM--K 934
T + LFR+K E I K K+ K ++ K+ EK+ K
Sbjct: 403 TDSVWNLFREKFE--------EDIANYKDKEFYNCTEGGARIKGTIELPFKEVCEKLLDK 454
Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K K+ K K +K KK K+ + +A K+++ E+ +
Sbjct: 455 DKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQEILEKVDS 508
Score = 31.2 bits (71), Expect = 3.0
Identities = 15/68 (22%), Positives = 33/68 (48%), Gaps = 4/68 (5%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK- 971
++K K +K++K + +K+ + DA K ++ EK+ K +K +K K K+
Sbjct: 471 QEKAVLKLLQKIKKNN--DFIKEFETDALK-LQEILEKVDSKSEKLEKISAKIDNIKELF 527
Query: 972 KKKEAEKM 979
+ + +
Sbjct: 528 DESKMSIL 535
Score = 30.8 bits (70), Expect = 4.5
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 15/85 (17%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
K+ EK+ K K K + + + + + K +K KK K
Sbjct: 445 KEVCEKLLDKDKNKPFIKLI--------LLSNNSQEKAVLKL------LQKIKKNNDFIK 490
Query: 965 KKKKKKKKKKEA-EKMKKKTEEAEK 988
+ + K +E EK+ K+E+ EK
Sbjct: 491 EFETDALKLQEILEKVDSKSEKLEK 515
>gnl|CDD|217010 pfam02387, IncFII_repA, IncFII RepA protein family. This protein
is plasmid encoded and found to be essential for plasmid
replication.
Length = 279
Score = 35.0 bits (81), Expect = 0.16
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 886 AVTALGFILF----RKKRTRRKKKKGAEKIRKKKKKKK-----EAEKMRKKKGAEK--MK 934
+T L F+L K + RK++ E + KKK EA + K+ E+
Sbjct: 152 TLTPLFFMLLGISEEKLNSARKQQLEWENKKLKKKGLIPLTLDEARRRAKEFHIERAFSY 211
Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+ ++ A K++ A K+ K +K +KK K+ K E KM K
Sbjct: 212 RTERKAFGKKRRRARKLAKLDEKDIRKKILNALVKEYSKGELTKMGLK 259
>gnl|CDD|222331 pfam13707, RloB, RloB-like protein. This family includes the RloB
protein that is found within a bacterial restriction
modification operon. This family includes the AbiLii
protein that is found as part of a plasmid encoded phage
abortive infection mechanism. Deletion within abiLii
abolished the phage resistance. The family includes some
proteins annotated as CRISPR Csm2 proteins.
Length = 176
Score = 34.3 bits (79), Expect = 0.16
Identities = 12/58 (20%), Positives = 24/58 (41%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
E + A+K+ KKG KK K +++ ++ I K+ +K ++
Sbjct: 104 EDTTAALSRDAYAKKLLLKKGLPDYKKGKNLFKELLERLDTAIQNAKRLRKDNEEVNP 161
Score = 29.2 bits (66), Expect = 7.3
Identities = 23/131 (17%), Positives = 38/131 (29%), Gaps = 36/131 (27%)
Query: 892 FILFRKKRTRRKKKKGAEKIRKKKKKK--KEAEKMRKKKGAE------------KMKKKK 937
F + K + +EA+K+RKK E KKK
Sbjct: 15 FEHLKDLLRLSTLKIKVVPAGGGSPESLVEEAKKIRKKNANEYDEVWCVFDRDDFKKKKL 74
Query: 938 KDAEKMKKKGAEKIG----------------------KKKKKKKKKKKKKKKKKKKKKKE 975
+A K K+ + + KK KK KK K
Sbjct: 75 DEAIKKAKENGINLILSNPCFELWLLLHFEDTTAALSRDAYAKKLLLKKGLPDYKKGKNL 134
Query: 976 AEKMKKKTEEA 986
+++ ++ + A
Sbjct: 135 FKELLERLDTA 145
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 35.7 bits (82), Expect = 0.16
Identities = 18/89 (20%), Positives = 41/89 (46%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ +R+ R K K+K + + KK + K+ K+++EK K + + +
Sbjct: 203 KMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRIL 262
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
+ K+ KK +K + + +K+K++
Sbjct: 263 RSYTKENKKDEKNVVSGIRSSVLLKRKSQ 291
Score = 35.4 bits (81), Expect = 0.21
Identities = 17/72 (23%), Positives = 37/72 (51%), Gaps = 6/72 (8%)
Query: 915 KKKKKEAEKMRKKKGA------EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
+ K+ ++++K+ +KM++ ++ +K +K +K K+K K KK +
Sbjct: 180 RNKEISNKQLKKQTNVTHTTCYDKMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDG 239
Query: 969 KKKKKKEAEKMK 980
K+ K+ +EK K
Sbjct: 240 DKQTKRNSEKSK 251
Score = 31.9 bits (72), Expect = 1.9
Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 5/69 (7%)
Query: 925 RKKKGAEKMKKKKKDA-----EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
K+ +++KK+ +KM++ K K K+K K+K K KK+ +
Sbjct: 181 NKEISNKQLKKQTNVTHTTCYDKMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGD 240
Query: 980 KKKTEEAEK 988
K+ +EK
Sbjct: 241 KQTKRNSEK 249
Score = 30.0 bits (67), Expect = 7.7
Identities = 18/86 (20%), Positives = 35/86 (40%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
KK+T +K+R+ + + K EK K+K K+ +K + + +K
Sbjct: 190 KKQTNVTHTTCYDKMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEK 249
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKK 982
K + + + KE +K +K
Sbjct: 250 SKVQNSHFDVRILRSYTKENKKDEKN 275
>gnl|CDD|233421 TIGR01453, grpIintron_endo, group I intron endonuclease. This
model represents one subfamily of endonucleases
containing the endo/excinuclease amino terminal domain,
pfam01541 at its amino end. A distinct subfamily
includes excinuclease abc subunit c (uvrC). Members of
pfam01541 are often termed GIY-YIG endonucleases after
conserved motifs near the amino end. This subfamily in
This model is found in open reading frames of group I
introns in both phage and mitochondria. The closely
related endonucleases of phage T4: segA, segB, segC,
segD and segE, score below the trusted cutoff for the
family.
Length = 214
Score = 34.7 bits (80), Expect = 0.17
Identities = 20/91 (21%), Positives = 31/91 (34%), Gaps = 15/91 (16%)
Query: 896 RKKRTRRKKKK----GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
K + K K + K K +E+ + K + K K
Sbjct: 96 GYKHSEETKAKMSKLFSGKKNNPWYGKTHSEETKAK-----ISKNKLGE------NNPFF 144
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
GKK ++ KKK +K+K KK +K
Sbjct: 145 GKKHSEETKKKISEKEKGAKKVNVYDKNNGL 175
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 33.1 bits (76), Expect = 0.17
Identities = 14/54 (25%), Positives = 16/54 (29%), Gaps = 8/54 (14%)
Query: 134 APIPDATHVVVKVPVEG--------APTPATPDTEAPTPAPGTESGPSSTTPGP 179
P+ D VV P PA P + P PG PS P
Sbjct: 41 PPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPA 94
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 33.9 bits (78), Expect = 0.17
Identities = 19/97 (19%), Positives = 42/97 (43%), Gaps = 6/97 (6%)
Query: 898 KRTRRK--KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM----KKKGAEKI 951
R R K ++ K +K+ K E++ + +M + + +++
Sbjct: 40 ARVERMEEIKNLKYELIMNDHLNKRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKEL 99
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ + ++ KK + K KKE ++++KK +E EK
Sbjct: 100 EEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEK 136
Score = 32.8 bits (75), Expect = 0.36
Identities = 17/78 (21%), Positives = 41/78 (52%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
K+ + ++ ++++ E + +++ + K + +E + ++ KK + K KK
Sbjct: 63 KRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEANGDSEGLLEELKKHRDKLKK 122
Query: 965 KKKKKKKKKKEAEKMKKK 982
++K+ +KK E EK +KK
Sbjct: 123 EQKELRKKLDELEKEEKK 140
Score = 31.6 bits (72), Expect = 0.84
Identities = 16/58 (27%), Positives = 32/58 (55%), Gaps = 1/58 (1%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
+ E + KK+ E + E++KK +K+ K++K+ +KK + +K++KKK
Sbjct: 85 NEMLAELQDQLKKELEEANGDSEGLLEELKKH-RDKLKKEQKELRKKLDELEKEEKKK 141
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 34.5 bits (80), Expect = 0.17
Identities = 15/61 (24%), Positives = 32/61 (52%), Gaps = 7/61 (11%)
Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
+ ++K+ GA +K D + + + +K+KKK K++KK K+E +K+++
Sbjct: 70 KILKKEGGAIIKDLEKCDFTPIYEY-------FEAEKEKKKAMSKEEKKAIKEEKDKLEE 122
Query: 982 K 982
Sbjct: 123 P 123
Score = 29.5 bits (67), Expect = 7.3
Identities = 13/59 (22%), Positives = 25/59 (42%), Gaps = 10/59 (16%)
Query: 895 FRKKRTRRKKKKGAE-------KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
FRK + + I + + +KE +K K EK K K++ +K+++
Sbjct: 68 FRKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSK--EEK-KAIKEEKDKLEEP 123
>gnl|CDD|237034 PRK12278, PRK12278, 50S ribosomal protein L21/unknown domain fusion
protein; Provisional.
Length = 221
Score = 34.4 bits (79), Expect = 0.18
Identities = 10/48 (20%), Positives = 13/48 (27%)
Query: 929 GAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
GA+ K + AE K KK+ K A
Sbjct: 105 GADASGVKAATGAGKVEVAAEAAPAKAKKEAAPKAAPAPAAAAAPPAA 152
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 35.2 bits (81), Expect = 0.19
Identities = 15/45 (33%), Positives = 19/45 (42%)
Query: 138 DATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETK 182
+A V V P PT A P PTPAP T ++ P +
Sbjct: 357 EALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKE 401
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 34.0 bits (78), Expect = 0.19
Identities = 11/78 (14%), Positives = 27/78 (34%), Gaps = 6/78 (7%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
L R++ K++ AE+ + + +K+ + + +K K
Sbjct: 65 LLRERE--LKEEAEAEEEGEVDASPDAGAVAGESSA----DRKEAEQQGAAQKRKSCRDK 118
Query: 954 KKKKKKKKKKKKKKKKKK 971
++K K + + K
Sbjct: 119 ERKSAKDPRGGTQDVVDK 136
Score = 32.8 bits (75), Expect = 0.50
Identities = 10/73 (13%), Positives = 32/73 (43%), Gaps = 4/73 (5%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
+ ++++ K+EAE + ++ + A++ +++ +K+K + K
Sbjct: 63 HALLRERELKEEAEA----EEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDK 118
Query: 969 KKKKKKEAEKMKK 981
++K K+ +
Sbjct: 119 ERKSAKDPRGGTQ 131
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 34.9 bits (79), Expect = 0.20
Identities = 22/79 (27%), Positives = 42/79 (53%), Gaps = 3/79 (3%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
K ++K+++ AE++ K+ K E+ R K ++++K++KD E+ +++ A K+K
Sbjct: 146 KNEKKKERQLAEQLAAKRLKD---EQHRHKARKQELRKREKDRERARREDAAAAAAAKQK 202
Query: 958 KKKKKKKKKKKKKKKKKEA 976
KK KK K A
Sbjct: 203 AAAKKAAAPSGKKSAKAAA 221
Score = 34.2 bits (77), Expect = 0.37
Identities = 24/95 (25%), Positives = 53/95 (55%), Gaps = 1/95 (1%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
IL R R R++KK I ++ K + R K+ + K K+ +K +++ AE++
Sbjct: 101 ILRRLLRKYREEKKIDRHIYRELYVKAKGNVFRNKRNLMEHIHKVKNEKKKERQLAEQLA 160
Query: 953 KKK-KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
K+ K ++ + K +K++ +K++K+ E+ +++ A
Sbjct: 161 AKRLKDEQHRHKARKQELRKREKDRERARREDAAA 195
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 34.9 bits (81), Expect = 0.21
Identities = 17/70 (24%), Positives = 38/70 (54%), Gaps = 4/70 (5%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK-KKGA---EKMKKKKKDAEKMKKKGAEK 950
+ +++ R +++ A +++K+ KE E +R+ K A +K K + K EK++ + AE+
Sbjct: 238 YLEQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEE 297
Query: 951 IGKKKKKKKK 960
++ K
Sbjct: 298 RPVEEGKPLA 307
Score = 30.7 bits (70), Expect = 4.5
Identities = 14/73 (19%), Positives = 28/73 (38%), Gaps = 13/73 (17%)
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
++K E + ++ K+ K+ E + ++ K K KK K + K
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWI-------------RRGKAAASKAKKAKSRIKR 286
Query: 976 AEKMKKKTEEAEK 988
EK++ + E
Sbjct: 287 LEKLEARLAEERP 299
>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361).
This is a region of 120 amino acids that is conserved in
a family of proteins found from plants to fungi. The
function is not known.
Length = 114
Score = 33.0 bits (76), Expect = 0.21
Identities = 30/97 (30%), Positives = 53/97 (54%), Gaps = 10/97 (10%)
Query: 897 KKRTRR-----KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK-KGAEK 950
KK+ R KKK +R +K++ EA K + AE+ KK+KK A+K + E+
Sbjct: 3 KKQIRDIERLLKKKDLPADVRVEKERALEALKKELED-AERRKKRKKMAKKYHMVRFFER 61
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
KK ++ K+ +KK ++ +KE + +K++ +AE
Sbjct: 62 ---KKATRRLKQLEKKLEETSDEKEIKDLKEQLSKAE 95
Score = 31.8 bits (73), Expect = 0.49
Identities = 29/101 (28%), Positives = 48/101 (47%), Gaps = 14/101 (13%)
Query: 867 TKRLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK 926
+RLL D+ DVR E K+R KK E ++KK+KK A+K
Sbjct: 9 IERLL--KKKDLPADVRVE-----------KERALEALKKELEDAERRKKRKKMAKKYHM 55
Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
+ E+ KK + ++++KK E +K+ K K++ K +
Sbjct: 56 VRFFER-KKATRRLKQLEKKLEETSDEKEIKDLKEQLSKAE 95
Score = 31.4 bits (72), Expect = 0.57
Identities = 23/81 (28%), Positives = 47/81 (58%), Gaps = 7/81 (8%)
Query: 915 KKKKKEAEKMRKKKG--AEKMKKKKKDAEKMKKKGAEKIGKKKKKK--KKKKKKK---KK 967
KK+ ++ E++ KKK A+ +K++ E +KK+ + +KK+KK KK + +K
Sbjct: 3 KKQIRDIERLLKKKDLPADVRVEKERALEALKKELEDAERRKKRKKMAKKYHMVRFFERK 62
Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
K ++ K+ EK ++T + ++
Sbjct: 63 KATRRLKQLEKKLEETSDEKE 83
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 34.9 bits (81), Expect = 0.22
Identities = 17/87 (19%), Positives = 46/87 (52%), Gaps = 13/87 (14%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
E + K+ ++E +++ E+++K KK +K+ KK ++ +K+++ +K + K+K
Sbjct: 331 ELKEELKELEEELKELE-----EELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEK 385
Query: 969 --------KKKKKKEAEKMKKKTEEAE 987
+++ K+ E+++ E +
Sbjct: 386 LSEELEELEEELKELKEELESLYSEGK 412
Score = 30.3 bits (69), Expect = 5.7
Identities = 15/94 (15%), Positives = 47/94 (50%)
Query: 890 LGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
+G K ++ ++ +++ ++ +K K+ K KK ++ +K++ + + E
Sbjct: 325 VGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKE 384
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
K+ ++ ++ +++ K+ K++ + E + KT
Sbjct: 385 KLSEELEELEEELKELKEELESLYSEGKISVNKT 418
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 35.1 bits (81), Expect = 0.22
Identities = 18/90 (20%), Positives = 55/90 (61%), Gaps = 4/90 (4%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK- 959
K ++ +IR++ ++ + E++ +K E++++ +++ E+++++ E + K + ++
Sbjct: 660 EEKVEELEAEIRRELQRIENEEQLEEKL--EELEQLEEELEQLREELEELLKKLGEIEQL 717
Query: 960 -KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ + +K + ++ KKE EK++K E E+
Sbjct: 718 IEELESRKAELEELKKELEKLEKALELLEE 747
Score = 34.0 bits (78), Expect = 0.55
Identities = 18/84 (21%), Positives = 47/84 (55%), Gaps = 5/84 (5%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
+ ++K ++ EAE R+ + E ++ ++ E++++ E +++ ++ KK + ++
Sbjct: 657 EELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQ 716
Query: 970 -----KKKKKEAEKMKKKTEEAEK 988
+ +K E E++KK+ E+ EK
Sbjct: 717 LIEELESRKAELEELKKELEKLEK 740
Score = 32.8 bits (75), Expect = 1.2
Identities = 18/92 (19%), Positives = 47/92 (51%), Gaps = 1/92 (1%)
Query: 895 FRKKRTRRKK-KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
R RTR+++ ++ E++++ KKK KE E+ + + +AE ++ E++
Sbjct: 578 LRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELES 637
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
+ +K + + ++ + ++ EK+++ E
Sbjct: 638 ELEKLNLQAELEELLQAALEELEEKVEELEAE 669
Score = 31.3 bits (71), Expect = 3.7
Identities = 11/81 (13%), Positives = 42/81 (51%), Gaps = 1/81 (1%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
++ E K + ++ + E + + +++K+ ++ ++ + E++ + +++ K+
Sbjct: 539 ENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRK-EELEELRERLKE 597
Query: 961 KKKKKKKKKKKKKKEAEKMKK 981
KKK K+ +++ + E ++
Sbjct: 598 LKKKLKELEERLSQLEELLQS 618
Score = 30.1 bits (68), Expect = 7.4
Identities = 18/118 (15%), Positives = 57/118 (48%), Gaps = 4/118 (3%)
Query: 875 VSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA--EK 932
+ +++ + A+ L + + R++ + E + ++K +E E++ ++ E+
Sbjct: 644 LQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREE 703
Query: 933 MKKKKKDAEKMKK--KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++ K ++++ + E + ++ KK+ +K +K + ++ EK+ K A+
Sbjct: 704 LEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALELLEELREKLGKAGLRADI 761
>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1. The
overall function of the full-length Med25 is efficiently
to coordinate the transcriptional activation of RAR/RXR
(retinoic acid receptor/retinoic X receptor) in higher
eukaryotic cells. Human Med25 consists of several
domains with different binding properties, the
N-terminal, VWA, domain, this SD1 - synapsin 1 - domain
from residues 229-381, a PTOV(B) or ACID domain from
395-545, an SD2 domain from residues 564-645 and a
C-terminal NR box-containing domain (646-650) from
646-747. This The function of the SD domains is unclear.
Length = 168
Score = 33.7 bits (76), Expect = 0.23
Identities = 13/55 (23%), Positives = 20/55 (36%), Gaps = 2/55 (3%)
Query: 124 ASVLQELKTCAPIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPG 178
AS L + T +P V+ + + P + PTPA G P +
Sbjct: 116 ASQLSLVTTVSPGSGLAPVLTQQ--QVPPQQPQQPSMVPTPALGGVQPPQPSQQQ 168
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 34.8 bits (80), Expect = 0.24
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+KKKKKKK K+KKKKK +EA K E E+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEE 715
Score = 31.4 bits (71), Expect = 3.3
Identities = 14/23 (60%), Positives = 19/23 (82%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKE 975
+KKKKKKK K+KKKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 30.2 bits (68), Expect = 7.8
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEK 923
RKKKK + KKKKK +EA K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASK 705
>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
Length = 405
Score = 34.8 bits (80), Expect = 0.24
Identities = 21/87 (24%), Positives = 34/87 (39%), Gaps = 10/87 (11%)
Query: 911 IRKKKKKKKEAEKMR---KKKGA-EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
+ KK K A + K A K+ K A +M K KK+K KK+K+
Sbjct: 307 VNPIKKDVKRAPLKKNPLKNLNAMLKLNPYAKTARRMALLAEAARVKAKKEKLAKKRKQV 366
Query: 967 KKKKKKKKEA------EKMKKKTEEAE 987
K++ +A + M ++ E
Sbjct: 367 SKEEAAAIKAAGKAWYKTMISDSDYTE 393
>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
topoisomerase IA [General function prediction only].
Length = 298
Score = 34.3 bits (79), Expect = 0.24
Identities = 19/47 (40%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
AE+ K G K K KK K KK KK K+A K KK ++A
Sbjct: 253 AERRAKGGPGK--KPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKKA 297
Score = 32.8 bits (75), Expect = 0.73
Identities = 18/47 (38%), Positives = 20/47 (42%), Gaps = 1/47 (2%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
AE+ K +K KK KK KK KK K KK KK A
Sbjct: 253 AERRAKGGP-GKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKKAA 298
Score = 32.8 bits (75), Expect = 0.87
Identities = 21/45 (46%), Positives = 23/45 (51%), Gaps = 1/45 (2%)
Query: 921 AEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
AE+ R K G K KK A K KK A+K KK K KK KK
Sbjct: 253 AER-RAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296
Score = 32.0 bits (73), Expect = 1.4
Identities = 16/42 (38%), Positives = 17/42 (40%)
Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
R+ KG K KK KK K KK K KK KK
Sbjct: 255 RRAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296
Score = 29.3 bits (66), Expect = 8.9
Identities = 27/105 (25%), Positives = 37/105 (35%), Gaps = 19/105 (18%)
Query: 903 KKKKGAEKIRKKK--KKKKEAEKMRKKKGAEK--MKKKKKDAEKMKKKGAEKI------- 951
K+ +G K+ + + + K G +K K +A K K I
Sbjct: 191 KRSRGRSAAALKELGTHPEGGKPITVKDGRFGPYVKDGKTNATLPKGKDPASITLEEALE 250
Query: 952 -----GKKKKKKKKKKKKKKKKKKKK---KKEAEKMKKKTEEAEK 988
K KK KK K KK KK A K KT++A K
Sbjct: 251 LLAERRAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAK 295
>gnl|CDD|110514 pfam01517, HDV_ag, Hepatitis delta virus delta antigen. The
hepatitis delta virus (HDV) encodes a single protein,
the hepatitis delta antigen (HDAg). The central region
of this protein has been shown to bind RNA. Several
interactions are also mediated by a coiled-coil region
at the N terminus of the protein.
Length = 194
Score = 33.7 bits (77), Expect = 0.27
Identities = 27/115 (23%), Positives = 57/115 (49%), Gaps = 27/115 (23%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEK-----------MRKKK---GAEKMKKKKKDAEKM--- 943
R+K+++ + +RK KK K+ E+ +RK K GA K+ + D ++
Sbjct: 24 RKKREELEKDLRKANKKIKKLEEENPWLGNVVGIIRKDKDGEGAPPAKRPRTDQMEVDSG 83
Query: 944 --KKKGAEKIGKKKKKKKKKKKKKKKKKKK--------KKKEAEKMKKKTEEAEK 988
K+ A ++++ +++K + KKK+ ++E E++++ TEE E+
Sbjct: 84 PGKRPHAGGFTDQERRDHRRRKALENKKKQLSSGGKHLSREEEEELRRLTEEDEE 138
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is a
small protein module with a triple-stranded beta-sheet
fold. This is a family of WW domain binding proteins.
Length = 78
Score = 31.5 bits (72), Expect = 0.28
Identities = 21/81 (25%), Positives = 47/81 (58%), Gaps = 13/81 (16%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
R+++KK ++ KK K E + R+ K KK+ ++++++ +K+ K +++ K
Sbjct: 11 RKEQKK-----KELKKNKAERQARREA------KLAKKNPDRLQRE-LDKL-KAQEESGK 57
Query: 961 KKKKKKKKKKKKKKEAEKMKK 981
+ +K+K KK +K+ + +KK
Sbjct: 58 LRPVEKQKLKKLEKDLDAVKK 78
Score = 30.7 bits (70), Expect = 0.66
Identities = 11/34 (32%), Positives = 24/34 (70%)
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K++KKK+ KK K +++ ++EA+ KK + ++
Sbjct: 11 RKEQKKKELKKNKAERQARREAKLAKKNPDRLQR 44
Score = 28.4 bits (64), Expect = 4.4
Identities = 14/66 (21%), Positives = 30/66 (45%), Gaps = 9/66 (13%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
KK+ +K K + R E KK ++++++ + ++ G + +K+K
Sbjct: 15 KKKELKKNKAERQARR---------EAKLAKKNPDRLQRELDKLKAQEESGKLRPVEKQK 65
Query: 957 KKKKKK 962
KK +K
Sbjct: 66 LKKLEK 71
Score = 27.2 bits (61), Expect = 9.2
Identities = 8/36 (22%), Positives = 24/36 (66%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK+ KK K +++ +++ K KK ++++++ ++ +
Sbjct: 16 KKELKKNKAERQARREAKLAKKNPDRLQRELDKLKA 51
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 34.1 bits (78), Expect = 0.28
Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 2/48 (4%)
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKK--KKKKKKKKEAEKMKKKTE 984
+A+++KK + K K K KK KKK + K + K +
Sbjct: 143 EAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPD 190
Score = 31.0 bits (70), Expect = 2.6
Identities = 12/41 (29%), Positives = 18/41 (43%)
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
A+++ K KK + K K K KK KKK + +
Sbjct: 144 AKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGID 184
Score = 29.5 bits (66), Expect = 8.8
Identities = 13/54 (24%), Positives = 19/54 (35%), Gaps = 6/54 (11%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K KK + + K K KK KKK + K K K+++
Sbjct: 148 KKPKKGQSPKVPKAP------KPKKPKKKGSVSNRSVKMPGIDPRSKPDWKSQD 195
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 34.8 bits (81), Expect = 0.29
Identities = 17/72 (23%), Positives = 43/72 (59%), Gaps = 4/72 (5%)
Query: 911 IRKKKKKKKEAEKMR----KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
RK+KK K+E + ++ KK ++K ++ EK ++ + K K K+++++++K+
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549
Query: 967 KKKKKKKKEAEK 978
+K K+ +++ +
Sbjct: 550 EKDKETEEDEPE 561
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 34.8 bits (80), Expect = 0.29
Identities = 18/75 (24%), Positives = 34/75 (45%), Gaps = 6/75 (8%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
++K + KE + +K+ ++K K+KKDA K KK K + +K
Sbjct: 183 VKKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKS------KKAKTDRDVSTSTAASQK 236
Query: 971 KKKKEAEKMKKKTEE 985
K K++ +++E
Sbjct: 237 KSSDLESKLEAQSKE 251
Score = 31.7 bits (72), Expect = 2.2
Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 5/68 (7%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKK----GAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
KK K K +++K+E++K K+K G +K KK K D + A K +
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTD-RDVSTSTAASQKKSSDLE 242
Query: 959 KKKKKKKK 966
K + + K
Sbjct: 243 SKLEAQSK 250
>gnl|CDD|138835 PRK12279, PRK12279, 50S ribosomal protein L22/unknown domain fusion
protein; Provisional.
Length = 311
Score = 34.3 bits (78), Expect = 0.29
Identities = 28/142 (19%), Positives = 59/142 (41%), Gaps = 24/142 (16%)
Query: 868 KRLLHVAVSDVNDDVRREAVTALGFILFR--------KKRTRRKKKKGAEKIRKK----- 914
+LL+ A+S+V + + + +++ KRT + K A+++ K+
Sbjct: 48 LKLLNSAISNVQHN--SKDMDPSKLYIYKIVANQGPTMKRTLPRAKGSADQLFKRTTHLE 105
Query: 915 ---------KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
++K+ A K +K K ++ K K K K ++ K +K + +
Sbjct: 106 IVLSDDVNEREKELAAIKAKKSKKPLPVEPKAKVETKKVAKPSKVEVKPVEKDENVDPEL 165
Query: 966 KKKKKKKKKEAEKMKKKTEEAE 987
K++++ K EK + EE
Sbjct: 166 LKREQQVLKVVEKTASQKEEET 187
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 34.5 bits (80), Expect = 0.30
Identities = 19/48 (39%), Positives = 32/48 (66%)
Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++K+ A+ +G ++K+KKK+KKK KKKK K K+ K +EA +
Sbjct: 383 QELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATR 430
>gnl|CDD|177283 PHA00451, PHA00451, protein kinase.
Length = 362
Score = 34.4 bits (79), Expect = 0.30
Identities = 14/67 (20%), Positives = 31/67 (46%), Gaps = 3/67 (4%)
Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
E E + ++ E+ K +K K G ++ +K K++K+ +K + + K+ A +
Sbjct: 235 AEVEAIANQRMIERCKNRK---AKRDPNGTFQVARKAAMKRRKRNRKLRARNAKELAAMR 291
Query: 979 MKKKTEE 985
M+
Sbjct: 292 MEANQIR 298
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 33.8 bits (78), Expect = 0.31
Identities = 16/47 (34%), Positives = 29/47 (61%), Gaps = 3/47 (6%)
Query: 928 KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
K AE+ K K+AEK ++K + KK ++ + KK +K+ +K++ K
Sbjct: 148 KSAER---KLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
Score = 33.1 bits (76), Expect = 0.53
Identities = 14/45 (31%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
K AE+ K+ +K++E ++ +K E+ + KK EK +K K
Sbjct: 148 KSAERKLKEAEKQEEKQEQSPEKKLERSRSSKK-IEKEVEKRQAK 191
Score = 32.7 bits (75), Expect = 0.69
Identities = 20/49 (40%), Positives = 31/49 (63%), Gaps = 1/49 (2%)
Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
K AE+ K K AEK +K+++ +KK ++ + KK +KE EK + K EA
Sbjct: 148 KSAER-KLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEA 195
Score = 29.6 bits (67), Expect = 7.5
Identities = 18/62 (29%), Positives = 30/62 (48%), Gaps = 11/62 (17%)
Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
K AE RK K AEK + +K+ K ++ + KK +K+ +K++ K K
Sbjct: 148 KSAE--RKLKEAEK---------QEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEAK 196
Query: 979 MK 980
+K
Sbjct: 197 LK 198
>gnl|CDD|212008 cd00089, HR1, Protein kinase C-related kinase homology region 1
(HR1) domain that binds Rho family small GTPases. The
HR1 domain, also called the ACC (anti-parallel
coiled-coil) finger domain or Rho-binding domain binds
small GTPases from the Rho family. It is found in Rho
effector proteins including PKC-related kinases such as
vertebrate PRK1 (or PKN) and yeast PKC1 protein kinases
C, as well as in rhophilins and Rho-associated kinase
(ROCK). Rho family members function as molecular
switches, cycling between inactive and active forms,
controlling a variety of cellular processes. HR1 domains
may occur in repeat arrangements (PKN contains three HR1
domains), separated by a short linker region.
Length = 68
Score = 31.1 bits (71), Expect = 0.31
Identities = 14/51 (27%), Positives = 27/51 (52%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
K+ K+ K ++ AE + K K+KK + + K+ EKI K++ ++
Sbjct: 13 KLEKELKIREGAENLLKLYSNPKVKKDLAEVQLNLKESKEKIDLLKRQLER 63
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 33.6 bits (77), Expect = 0.32
Identities = 16/42 (38%), Positives = 22/42 (52%), Gaps = 2/42 (4%)
Query: 949 EKIGKKKKK--KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K KKK+ +K KK KKK K+K E+ KK + +K
Sbjct: 48 PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDK 89
Score = 33.2 bits (76), Expect = 0.43
Identities = 18/48 (37%), Positives = 24/48 (50%)
Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K KK +K KK KKK K+K +++K+ E KTEE E
Sbjct: 48 PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENEN 95
Score = 29.7 bits (67), Expect = 5.5
Identities = 12/46 (26%), Positives = 24/46 (52%), Gaps = 4/46 (8%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK 942
K+ ++K+ +K K+++K K+K E+ KK + D +K
Sbjct: 48 PKQPKKKRPT----TPRKPATTKKSKKKDKEKLTEEEKKPESDDDK 89
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 33.9 bits (78), Expect = 0.32
Identities = 16/108 (14%), Positives = 33/108 (30%), Gaps = 16/108 (14%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK---------------KDAE 941
KK K+ +K+ + E E ++K +K K A
Sbjct: 157 KKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALGEKAAL 216
Query: 942 KMKKKGAEKI-GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ +K AE + K++ K +++ + E+
Sbjct: 217 EEQKALAEAAAAEAAKQEAAAKAAAQEQAALQAAATAAQPSAVTESAS 264
Score = 29.7 bits (67), Expect = 6.9
Identities = 17/83 (20%), Positives = 32/83 (38%), Gaps = 3/83 (3%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
KK E+ ++ KK +E + + K E + + + E + K +K
Sbjct: 147 KKILEQQKEDKKSLEEKQAALEDK-LETLVALQNELETQLNSLNSQ--KAEKNALIAALA 203
Query: 965 KKKKKKKKKKEAEKMKKKTEEAE 987
K+ +K A + +K EA
Sbjct: 204 AKEASALGEKAALEEQKALAEAA 226
>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 486
Score = 34.4 bits (79), Expect = 0.32
Identities = 23/102 (22%), Positives = 48/102 (47%), Gaps = 11/102 (10%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK---KDAEKMKKKGAEKIGKK 954
K +++ K + ++++K++ + +K+ + + K K+AE +++G I +
Sbjct: 365 KEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVE 424
Query: 955 KKKKKKKKKKKKKKKKK--------KKKEAEKMKKKTEEAEK 988
K ++ + K KK KKE EK K++ E EK
Sbjct: 425 KSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVEK 466
Score = 31.3 bits (71), Expect = 2.5
Identities = 17/78 (21%), Positives = 37/78 (47%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
I K + K+E K+ K + + + +KK ++ KK+K+K+K++ + +
Sbjct: 406 ILKNAEIKEEEGKITIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVE 465
Query: 971 KKKKEAEKMKKKTEEAEK 988
K + + K T + +K
Sbjct: 466 KVLELFKGSKIITYKKKK 483
>gnl|CDD|197773 smart00527, HMG17, domain in high mobilty group proteins HMG14 and
HMG 17.
Length = 88
Score = 31.8 bits (72), Expect = 0.33
Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 3/86 (3%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
K+K A + K + K+ + ++ K K + K K A KK K KK K
Sbjct: 2 KRKVAAGEGDDKDEPKRRSARLSAKPAPPKPEPKPKKAAAKKKSEN---VAAKGKKGAKG 58
Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+ ++ K++ AE + K+EEA
Sbjct: 59 KQTEEAGKEENNPAENGETKSEEAPA 84
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 33.3 bits (76), Expect = 0.33
Identities = 19/89 (21%), Positives = 45/89 (50%), Gaps = 6/89 (6%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK----KDAEKMKKKGAEK 950
F K+ +K+ ++ + K KE E+ + + AEK K+ +D + ++K
Sbjct: 31 FIIKK--ADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIA 88
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
++K+++ + K+++ K+KK ++M
Sbjct: 89 QEEEKEEEAEDVKQQEVFSFKRKKPFKEM 117
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 34.4 bits (79), Expect = 0.35
Identities = 24/82 (29%), Positives = 32/82 (39%), Gaps = 1/82 (1%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
A K KKKK K AE + ++DA K K K +K G+ K K
Sbjct: 409 SSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPK-KGKKNQKKGRDKSSKVPSDS 467
Query: 964 KKKKKKKKKKKEAEKMKKKTEE 985
K KK+ K + + EE
Sbjct: 468 KAGGKKESVKSQEDNNNIPPEE 489
Score = 32.9 bits (75), Expect = 1.1
Identities = 18/73 (24%), Positives = 26/73 (35%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
I +K E ++K KKKK K G + ++ KK KK +KK
Sbjct: 397 IGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKG 456
Query: 971 KKKKEAEKMKKKT 983
+ K K
Sbjct: 457 RDKSSKVPSDSKA 469
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell surface
and is extremely large (although apparently lacking in
repeat structure) and is important for the process of
invasion of the RBCs by the parasite. These proteins are
found in P. falciparum, P. vivax and P. yoelii.
Length = 2757
Score = 34.6 bits (79), Expect = 0.38
Identities = 30/87 (34%), Positives = 43/87 (49%), Gaps = 7/87 (8%)
Query: 909 EKIRKKKKKKKEAEKMRK--KKGAEKMKKKKKDAEKMKKKGAE-----KIGKKKKKKKKK 961
E I K K K EA+K K +K E ++ KKD ++ K + K K KK +
Sbjct: 1490 EHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTKKDSEII 1549
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+ K KK EAEK ++K +E +K
Sbjct: 1550 IKEIKDAHKKFILEAEKSEQKIKEIKK 1576
>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
production and conversion].
Length = 108
Score = 32.0 bits (73), Expect = 0.39
Identities = 14/92 (15%), Positives = 41/92 (44%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
KK ++ E + ++ KEA + ++ E ++ +K A+++ ++ E+ ++ +
Sbjct: 13 KKAEISADEEIEEAKEEAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAE 72
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ + +K+ K E + ++ E
Sbjct: 73 EILAEAEKEASAILSKAAEGKVVEAALSEFLA 104
Score = 28.2 bits (63), Expect = 8.7
Identities = 12/78 (15%), Positives = 35/78 (44%), Gaps = 1/78 (1%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
++ E + ++ +EAE+ +K E +++ +++AE+ ++ + +K+
Sbjct: 29 EAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAEEILAE-AEKEASAILS 87
Query: 962 KKKKKKKKKKKKKEAEKM 979
K + K + E +
Sbjct: 88 KAAEGKVVEAALSEFLAI 105
>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
Phosphatase-Interacting Proteins. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Vetebrates contain two
Proline-Serine-Threonine Phosphatase-Interacting
Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
mainly expressed in hematopoietic cells and are involved
in the regulation of cell adhesion and motility.
Mutations in PSTPIPs have been shown to cause
autoinflammatory disorders. PSTPIP1 contains an
N-terminal F-BAR domain, PEST motifs, and a C-terminal
SH3 domain, while PSTPIP2 contains only the N-terminal
F-BAR domain. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 239
Score = 33.6 bits (77), Expect = 0.40
Identities = 25/81 (30%), Positives = 44/81 (54%), Gaps = 13/81 (16%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDA-----EKMKKKGAEKIGKKKKKKKKKKKKK---- 965
+ ++EAEK+ + + +K ++KK + +K KK+ +K K KK ++K ++K
Sbjct: 81 QSLREEAEKLEEFREKQKEERKKTEDIMKRSQKNKKELYKKTMKAKKSYEQKCREKDKAE 140
Query: 966 ----KKKKKKKKKEAEKMKKK 982
K + KEAEK+KKK
Sbjct: 141 QAYEKSSSGAQPKEAEKLKKK 161
Score = 30.5 bits (69), Expect = 3.2
Identities = 29/98 (29%), Positives = 48/98 (48%), Gaps = 12/98 (12%)
Query: 883 RREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEK--MRKKKGAEKMKKKKKDA 940
R EA F K++ RKK + + +++ +K KKE K M+ KK E+ ++K A
Sbjct: 84 REEAEKLEEF--REKQKEERKKTE--DIMKRSQKNKKELYKKTMKAKKSYEQKCREKDKA 139
Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
E+ +K + K+ +K KKK + K AE+
Sbjct: 140 EQAYEK------SSSGAQPKEAEKLKKKAAQCKTSAEE 171
Score = 30.5 bits (69), Expect = 4.0
Identities = 27/94 (28%), Positives = 45/94 (47%), Gaps = 12/94 (12%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
FR+K+ +KK R +K KK+ +K K K + + K ++KD AE+
Sbjct: 93 FREKQKEERKKTEDIMKRSQKNKKELYKKTMKAKKSYEQKCREKD-------KAEQA--- 142
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K + K+ +K KK+A + K EEA+
Sbjct: 143 --YEKSSSGAQPKEAEKLKKKAAQCKTSAEEADS 174
>gnl|CDD|115185 pfam06513, DUF1103, Repeat of unknown function (DUF1103). This
family consists of several repeats of around 30 residues
in length which are found specifically in
mature-parasite-infected erythrocyte surface antigen
proteins from Plasmodium falciparum. This family often
found in conjunction with pfam00226.
Length = 215
Score = 33.3 bits (75), Expect = 0.40
Identities = 20/46 (43%), Positives = 31/46 (67%)
Query: 928 KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
+G ++ K+K D +K K G+EK K+ K+K +K+ K+K KKK KK
Sbjct: 5 EGDKEDVKEKNDGKKDKVIGSEKTQKEIKEKVEKRVKEKCKKKVKK 50
>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
Length = 306
Score = 33.9 bits (77), Expect = 0.40
Identities = 24/109 (22%), Positives = 36/109 (33%), Gaps = 1/109 (0%)
Query: 105 LMAAQIGFDLYESATQAFLASVLQELKTCAPIPDATHVVVKVPVEGAPTPATPDTEAPTP 164
++ Q + LA Q+ + HVV + P + AP A
Sbjct: 32 VIQPQAPVQPGQPGAPQQLAIPTQQPQPVPTSAMTPHVVQQAPAQPAPAAPPAAGAALPE 91
Query: 165 APGTESGPSSTTPGPETKTLESLNEKEKQLQARITKLSKILSGEVSIDL 213
A P+ T G TL E + QL ++ L SG +D
Sbjct: 92 ALEVPPPPAFTPNGEIVGTLAGNLEGDPQLAPSVSYLEAF-SGLDKLDT 139
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 34.3 bits (79), Expect = 0.41
Identities = 29/127 (22%), Positives = 48/127 (37%), Gaps = 22/127 (17%)
Query: 873 VAVSDVNDDVRR-EAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE 931
V S + VRR EAVT I + ++ + K+ K ++ KK E
Sbjct: 679 VKESGIAAGVRRIEAVTGNAAIEYLNEQEDKLKELADIL-------KVTPNELPKK--IE 729
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK------------KKKKKKKKKKEAEKM 979
+ ++ K +K + KI +K + K K + K K K A+++
Sbjct: 730 RFFEEWKALQKELESLKLKIADQKIDELKDKAETINGVKVLVEVVDAKDMKSLKTMADRL 789
Query: 980 KKKTEEA 986
K K A
Sbjct: 790 KSKLGSA 796
Score = 31.2 bits (71), Expect = 3.4
Identities = 21/90 (23%), Positives = 38/90 (42%), Gaps = 10/90 (11%)
Query: 867 TKRLLHVAVSDVNDDVRR--EAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKM 924
+L V +++ + R E AL +K+ K K +KI + K K + +
Sbjct: 713 LADILKVTPNELPKKIERFFEEWKAL-----QKELESLKLKIADQKIDELKDKAETINGV 767
Query: 925 R---KKKGAEKMKKKKKDAEKMKKKGAEKI 951
+ + A+ MK K A+++K K I
Sbjct: 768 KVLVEVVDAKDMKSLKTMADRLKSKLGSAI 797
>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
repair].
Length = 480
Score = 33.9 bits (78), Expect = 0.41
Identities = 20/70 (28%), Positives = 37/70 (52%), Gaps = 4/70 (5%)
Query: 907 GAEKIRKKK----KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
+ K++KK+ + + E E + +K EK+K ++ + EK + AEKI K + +
Sbjct: 274 LSYKLQKKEEQTLRTRIEKELEKAEKSLEKLKGREFNCEKDARIAAEKILKDYSSVEFLE 333
Query: 963 KKKKKKKKKK 972
K KKK++
Sbjct: 334 VDFKSKKKRE 343
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 33.2 bits (76), Expect = 0.44
Identities = 18/101 (17%), Positives = 42/101 (41%), Gaps = 9/101 (8%)
Query: 887 VTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
V F ++ A+K ++ + E K +K+ A +K K
Sbjct: 30 VAYQLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAA-------NSEDKEDKG 82
Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
AEK + ++ +++ +++ ++ + +KE E+ + E E
Sbjct: 83 DAEK--EDEESEEENEEEDEESSDENEKETEEKTESNVEKE 121
>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
Members of the NSCC2 family have been sequenced from
various yeast, fungal and animals species including
Saccharomyces cerevisiae, Drosophila melanogaster and
Homo sapiens. These proteins are the Sec62 proteins,
believed to be associated with the Sec61 and Sec63
constituents of the general protein secretary systems of
yeast microsomes. They are also the non-selective cation
(NS) channels of the mammalian cytoplasmic membrane. The
yeast Sec62 protein has been shown to be essential for
cell growth. The mammalian NS channel proteins has been
implicated in platelet derived growth factor(PGDF)
dependent single channel current in fibroblasts. These
channels are essentially closed in serum deprived
tissue-culture cells and are specifically opened by
exposure to PDGF. These channels are reported to exhibit
equal selectivity for Na+, K+ and Cs+ with low
permeability to Ca2+, and no permeability to anions
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 232
Score = 33.3 bits (76), Expect = 0.44
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEA 976
K K KKK K KK KKK+ K+EA
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKREA 231
>gnl|CDD|219555 pfam07753, DUF1609, Protein of unknown function (DUF1609). This
region is found in a number of hypothetical proteins
thought to be expressed by the eukaryote Encephalitozoon
cuniculi, an obligate intracellular microsporidial
parasite. It is approximately 200 residues long.
Length = 230
Score = 33.2 bits (76), Expect = 0.45
Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 7/63 (11%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK--KKDAEKMKKKGAEKIGK 953
KK+ +KK KG K K+ +R +K EK+K++ K EK K + E+I +
Sbjct: 22 AKKKGGKKKSKGGRHCYKIHKRV-----LRWRKSPEKIKEELDKGSEEKWKGRSIEEIKE 76
Query: 954 KKK 956
+K
Sbjct: 77 QKV 79
>gnl|CDD|237100 PRK12429, PRK12429, 3-hydroxybutyrate dehydrogenase; Provisional.
Length = 258
Score = 33.3 bits (77), Expect = 0.46
Identities = 26/79 (32%), Positives = 31/79 (39%), Gaps = 12/79 (15%)
Query: 307 TGPTSGYYEGGGLYALGLIHANHGA--VINDYLLGQVKDATNEDDAVTGEAAGIAMGLVL 364
TG SG GL + L A GA VI D A G+A G+AM +
Sbjct: 10 TGAASGI----GL-EIALALAKEGAKVVIADLNDEAAAAAAEALQKAGGKAIGVAMDV-- 62
Query: 365 LGSKSESAIQDMVAYAKET 383
E AI + YA ET
Sbjct: 63 ---TDEEAINAGIDYAVET 78
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 34.3 bits (78), Expect = 0.47
Identities = 19/75 (25%), Positives = 33/75 (44%), Gaps = 1/75 (1%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
+K ++E E++ K KK + +K+K G K + K + K K+K+
Sbjct: 918 LEKDQQEKEELISSKETSN-KKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKE 976
Query: 974 KEAEKMKKKTEEAEK 988
E + + EE EK
Sbjct: 977 TELNTVNAQLEECEK 991
Score = 31.9 bits (72), Expect = 1.9
Identities = 20/106 (18%), Positives = 48/106 (45%), Gaps = 7/106 (6%)
Query: 884 REAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK--KGAEKMKKKKKDAE 941
R+ LG + KK KK+ E+++ K+ ++ E + + ++++K +++
Sbjct: 432 RDEKKGLGRTIELKKEILEKKQ---EELKFVIKELQQLEGSSDRILELDQELRKAERELS 488
Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K +K KK+ K + +K +K +K ++M++
Sbjct: 489 KAEKNSL--TETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTT 532
>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17.
Length = 93
Score = 31.3 bits (71), Expect = 0.50
Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 5/88 (5%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
R+ + K A K+ K++ K++ A K + K KK A K KK K KK K
Sbjct: 3 RKGEAKDAAKVTKQEPKRRSARLSAKPAPPKPEPKPKKAAAKKKKPAV-----KGKKGAK 57
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK+ K+++ K++ AE + KTEEA
Sbjct: 58 GKKETKQEEAKEENPAENGETKTEEAPA 85
Score = 29.4 bits (66), Expect = 2.0
Identities = 23/69 (33%), Positives = 32/69 (46%)
Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
K+K EA+ K E ++ + + K E KK KKKK K KK K KKE
Sbjct: 2 KRKGEAKDAAKVTKQEPKRRSARLSAKPAPPKPEPKPKKAAAKKKKPAVKGKKGAKGKKE 61
Query: 976 AEKMKKKTE 984
++ + K E
Sbjct: 62 TKQEEAKEE 70
>gnl|CDD|226059 COG3528, COG3528, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 330
Score = 33.7 bits (77), Expect = 0.51
Identities = 12/41 (29%), Positives = 16/41 (39%), Gaps = 4/41 (9%)
Query: 261 FLRDNLEWLARATNWAKLTATASLGVIHR---GHETEALAH 298
+L L W + W + LGV G E + LAH
Sbjct: 115 YLHTGLNWNSLHPTWMRYFLL-DLGVTGPDALGQEVQNLAH 154
Score = 33.7 bits (77), Expect = 0.51
Identities = 12/41 (29%), Positives = 16/41 (39%), Gaps = 4/41 (9%)
Query: 623 FLRDNLEWLARATNWAKLTATASLGVIHR---GHETEALAH 660
+L L W + W + LGV G E + LAH
Sbjct: 115 YLHTGLNWNSLHPTWMRYFLL-DLGVTGPDALGQEVQNLAH 154
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 33.8 bits (77), Expect = 0.51
Identities = 15/74 (20%), Positives = 23/74 (31%), Gaps = 6/74 (8%)
Query: 113 DLYESATQAFLASVLQELKTCAPIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGP 172
L SA + A V P P E PTP+T + T P +
Sbjct: 189 ALPLSAPRLGPADVFV---PATPRPTPR---TTASPETTPTPSTTTSPPSTTIPAPSTTI 242
Query: 173 SSTTPGPETKTLES 186
++ G + +
Sbjct: 243 AAPQAGTTPEAEGT 256
Score = 31.8 bits (72), Expect = 2.0
Identities = 24/95 (25%), Positives = 33/95 (34%), Gaps = 14/95 (14%)
Query: 99 GSADDDL--MAAQIG-FDLYESATQAFLASVLQELKTCAPIPDATHVVVKVPVEGAPTPA 155
GS D L A ++G D++ AT +T T + T A
Sbjct: 184 GSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIA 243
Query: 156 TP----------DTEAPTPAPGTESGPSSTTPGPE 180
P PTP G E+ P++ TP PE
Sbjct: 244 APQAGTTPEAEGTPAPPTPGGG-EAPPANATPAPE 277
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 33.9 bits (77), Expect = 0.51
Identities = 21/92 (22%), Positives = 48/92 (52%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
K + ++ E + K K + +GA+ K+ + K K++ A ++ K
Sbjct: 161 KPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELK 220
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KK+++++K +++++++K+ E +K EE EK
Sbjct: 221 KKREERRKVLEEEEQRRKQEEADRKSREEEEK 252
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 33.6 bits (77), Expect = 0.52
Identities = 19/87 (21%), Positives = 39/87 (44%), Gaps = 8/87 (9%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE---KMKKKKKDAEKMKKKGAEKIGK 953
+KRT R+ K+R + K + + + E K+ +K G ++ +
Sbjct: 289 QKRTTRR-----VKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDE 343
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMK 980
+ ++ K++ +KK+K KKK K+
Sbjct: 344 DEDDEESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 33.5 bits (78), Expect = 0.53
Identities = 12/99 (12%), Positives = 39/99 (39%), Gaps = 22/99 (22%)
Query: 894 LFRKKRTRRKKKKGAEKI----RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
++ +R +++ ++++ + E E+++ ++ ++
Sbjct: 13 AVKEALAKRGFPLDVDELLELDEERRELQTELEELQAER----------------NALSK 56
Query: 950 KIGKKKKKKKKKKKKKK--KKKKKKKKEAEKMKKKTEEA 986
+IG+ K+K + + K+ K++ K E + E
Sbjct: 57 EIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAE 95
Score = 32.3 bits (75), Expect = 1.4
Identities = 15/72 (20%), Positives = 35/72 (48%), Gaps = 9/72 (12%)
Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
D+ RRE T L + + R +++I + K+K ++AE + + +++K++ K
Sbjct: 34 DEERRELQTELE-----ELQAERNAL--SKEIGQAKRKGEDAEALIAE--VKELKEEIKA 84
Query: 940 AEKMKKKGAEKI 951
E + ++
Sbjct: 85 LEAELDELEAEL 96
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 32.6 bits (75), Expect = 0.53
Identities = 8/49 (16%), Positives = 24/49 (48%)
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E ++ K + KK+K + + + K+ + +A++ + +++ A
Sbjct: 61 LELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALH 109
>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD. This
family consists of several putative bacterial stage IV
sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
is known to be essential for efficient sporulation
although its exact function is unknown.
Length = 383
Score = 33.5 bits (77), Expect = 0.57
Identities = 18/96 (18%), Positives = 36/96 (37%), Gaps = 10/96 (10%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK----MKKKGAE 949
L +K + KK + K + +K K+ ++ +K K
Sbjct: 258 LKTNFEVLTGEKITKHYLSIGGKK------LPLKFRKNEFEKYDKEEDEKPLFLKWKLPI 311
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
I ++ + + K K K++ +K + KKK E+
Sbjct: 312 SIVRETYYEVQDKVVKLTKEEAVEKGKKLAKKKLEK 347
>gnl|CDD|218370 pfam04997, RNA_pol_Rpb1_1, RNA polymerase Rpb1, domain 1. RNA
polymerases catalyze the DNA dependent polymerisation of
RNA. Prokaryotes contain a single RNA polymerase
compared to three in eukaryotes (not including
mitochondrial. and chloroplast polymerases). This
domain, domain 1, represents the clamp domain, which a
mobile domain involved in positioning the DNA,
maintenance of the transcription bubble and positioning
of the nascent RNA strand.
Length = 330
Score = 33.4 bits (77), Expect = 0.59
Identities = 24/93 (25%), Positives = 43/93 (46%), Gaps = 3/93 (3%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ K ++++ KK +KK K E G + + K K+ K GAE I
Sbjct: 121 KGKNSKKRLKKINNLCKKKSICSKCGEDNG---GLKAFEGCGKYQPKISKDGAEAIKALL 177
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + ++ K+K +K +K + KK ++E E+
Sbjct: 178 KNIEIEELKEKLRKLNPEKVLQIFKKISKEDEE 210
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 32.6 bits (75), Expect = 0.60
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
+K+KKK K++KK K++K+K E
Sbjct: 97 EKEKKKAMSKEEKKAIKEEKEKLEEPYG 124
Score = 32.2 bits (74), Expect = 0.95
Identities = 14/54 (25%), Positives = 25/54 (46%), Gaps = 10/54 (18%)
Query: 941 EKMKKKGAEKIGKKKK----------KKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
+ + K+ I K K +++K+KKK K++KK + EK K +
Sbjct: 69 KILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 29.5 bits (67), Expect = 6.5
Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 926 KKKGAEKMKK-KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
K+ +KK K D + + E+ KKK K++KK K++K+K ++
Sbjct: 72 TKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain. This alpha
helical domain is found in a set of bacterial plasmid
replication proteins. The domain is found to the
C-terminus of the primase/polymerase domain. Mutants of
this domain are defective in template binding,
dinucleotide formation and conformation change prior to
DNA extension.
Length = 135
Score = 31.7 bits (72), Expect = 0.64
Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 2/63 (3%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
KK+++ E K K E+M K + K + E++ KK KK ++ K+KKK
Sbjct: 3 KKEEEDSEEDFEKLK--EEMAKYDRFKGKTVEAIREEVCKKIKKSLEELKEKKKAILNTA 60
Query: 974 KEA 976
K
Sbjct: 61 KGV 63
>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
Length = 103
Score = 31.3 bits (71), Expect = 0.66
Identities = 30/83 (36%), Positives = 55/83 (66%), Gaps = 8/83 (9%)
Query: 911 IRKKKKKKKEAE-KMRKKKGAEK--MKKKKKDAEKMKK---KGAEKIGKK--KKKKKKKK 962
I++ K +KEAE ++ K K K ++K K++A+K+++ K AE+ +K +KKKK+ +
Sbjct: 5 IKEIVKAEKEAEERIEKAKEEAKKIIRKAKEEAKKIEEEIIKKAEEEAQKLIEKKKKEGE 64
Query: 963 KKKKKKKKKKKKEAEKMKKKTEE 985
++ KK ++ +KE E++K K EE
Sbjct: 65 EEAKKILEEGEKEIEELKVKAEE 87
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 33.5 bits (77), Expect = 0.66
Identities = 19/108 (17%), Positives = 53/108 (49%), Gaps = 15/108 (13%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE------KMKKKKKDAEKMKKKGAE 949
R + +++ +I +K+ K+ E+ ++ E ++++ K++ EK++ +
Sbjct: 402 RPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELER 461
Query: 950 KIGKKKKKKKKKKK---------KKKKKKKKKKKEAEKMKKKTEEAEK 988
+ + K +K ++ + +K+ ++KKK E++++K E K
Sbjct: 462 FRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRK 509
>gnl|CDD|146145 pfam03357, Snf7, Snf7. This family of proteins are involved in
protein sorting and transport from the endosome to the
vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
play an important role in the degradation of both lipids
and cellular proteins. In order to perform this
degradative function, vacuoles/lysosomes contain
numerous hydrolases which have been transported in the
form of inactive precursors via the biosynthetic pathway
and are proteolytically activated upon delivery to the
vacuole/lysosome. The delivery of transmembrane
proteins, such as activated cell surface receptors to
the lumen of the vacuole/lysosome, either for
degradation/downregulation, or in the case of
hydrolases, for proper localisation, requires the
formation of multivesicular bodies (MVBs). These late
endosomal structures are formed by invaginating and
budding of the limiting membrane into the lumen of the
compartment. During this process, a subset of the
endosomal membrane proteins is sorted into the forming
vesicles. Mature MVBs fuse with the vacuole/lysosome,
thereby releasing cargo containing vesicles into its
hydrolytic lumen for degradation. Endosomal proteins
that are not sorted into the intralumenal MVB vesicles
are either recycled back to the plasma membrane or Golgi
complex, or remain in the limiting membrane of the MVB
and are thereby transported to the limiting membrane of
the vacuole/lysosome as a consequence of fusion.
Therefore, the MVB sorting pathway plays a critical role
in the decision between recycling and degradation of
membrane proteins. A few archaeal sequences are also
present within this family.
Length = 169
Score = 32.2 bits (74), Expect = 0.67
Identities = 15/57 (26%), Positives = 28/57 (49%), Gaps = 4/57 (7%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK----KKKKEAEKMKKKTE 984
++K ++ +K ++ +KI K + + KK KK K K+KK EK + +
Sbjct: 5 SLRKAIRELDKKQESLEKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEKQLDQLD 61
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 33.3 bits (76), Expect = 0.71
Identities = 17/56 (30%), Positives = 19/56 (33%), Gaps = 2/56 (3%)
Query: 135 PIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTLESLNEK 190
P P A HV P P APG + + PGP T LN K
Sbjct: 231 PPPGAGHVHRGGPGPPER-DDAPVVPIRPSAPGPLAAQPAPAPGPGEPT-ARLNPK 284
>gnl|CDD|146016 pfam03179, V-ATPase_G, Vacuolar (H+)-ATPase G subunit. This family
represents the eukaryotic vacuolar (H+)-ATPase
(V-ATPase) G subunit. V-ATPases generate an acidic
environment in several intracellular compartments.
Correspondingly, they are found as membrane-attached
proteins in several organelles. They are also found in
the plasma membranes of some specialised cells.
V-ATPases consist of peripheral (V1) and membrane
integral (V0) heteromultimeric complexes. The G subunit
is part of the V1 subunit, but is also thought to be
strongly attached to the V0 complex. It may be involved
in the coupling of ATP degradation to H+ translocation.
Length = 105
Score = 31.0 bits (71), Expect = 0.72
Identities = 23/71 (32%), Positives = 41/71 (57%), Gaps = 4/71 (5%)
Query: 896 RKKRTRRKK--KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
RK+R +R K K+ AEK ++ + ++EAE K+ AE + + +K++K+ EKI +
Sbjct: 24 RKRRAKRLKQAKEEAEKEIEEYRAQREAE--FKEFEAEHSGSRGELEKKIEKETEEKIDE 81
Query: 954 KKKKKKKKKKK 964
K+ K K+
Sbjct: 82 LKRSFNKNKEA 92
>gnl|CDD|234659 PRK00137, rplI, 50S ribosomal protein L9; Reviewed.
Length = 147
Score = 32.0 bits (74), Expect = 0.72
Identities = 8/31 (25%), Positives = 16/31 (51%)
Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK 936
KG K + ++ + EA+ + AE + +K
Sbjct: 41 KGNLKQLEARRAELEAKAAEELAEAEALAEK 71
Score = 30.8 bits (71), Expect = 1.8
Identities = 7/33 (21%), Positives = 17/33 (51%)
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K+ + ++ + + K ++ EAE + +K E
Sbjct: 44 LKQLEARRAELEAKAAEELAEAEALAEKLEGLT 76
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 32.5 bits (74), Expect = 0.74
Identities = 13/27 (48%), Positives = 14/27 (51%)
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAE 977
KKK K KK K K KK + KKK
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKAREG 116
Score = 32.5 bits (74), Expect = 0.75
Identities = 16/34 (47%), Positives = 19/34 (55%)
Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
+K + KKKK K KK K K KK + KKK E
Sbjct: 82 RKPATTAVEKKKKGKSKKNKLKGKKDEDKKKARE 115
Score = 31.7 bits (72), Expect = 1.1
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
++K A +KKKK K KK K K KK + KK+A
Sbjct: 81 RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKA 113
Score = 31.3 bits (71), Expect = 1.8
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTE 984
KKKK K KK K K KK+ +K K +
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAREG 116
Score = 31.0 bits (70), Expect = 2.2
Identities = 13/27 (48%), Positives = 16/27 (59%)
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKE 975
EK K K KK K K KK + KKK ++
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKAREG 116
Score = 31.0 bits (70), Expect = 2.3
Identities = 14/26 (53%), Positives = 15/26 (57%)
Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEE 985
KKKK K KK K K K+ E KK E
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAREG 116
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 32.2 bits (74), Expect = 0.76
Identities = 24/93 (25%), Positives = 45/93 (48%), Gaps = 8/93 (8%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
KK +K AE ++ K+ + +K ++ E+++KK KD EK +K K
Sbjct: 39 MKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEV--EELRKKLKDYEK------DKQSLKN 90
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + K+ +K+ K K + E + + + E E+
Sbjct: 91 LKARLKELEKELKNLKWESEVLEQRFEKVERER 123
Score = 30.3 bits (69), Expect = 3.6
Identities = 20/74 (27%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
K K+E +M+K E +K + + K+ E + KK +++ ++ +KK K +K K+
Sbjct: 30 KSLKEEIAEMKKN--EEHNEKLMAEIAQENKRLVEPL-KKAEEEVEELRKKLKDYEKDKQ 86
Query: 975 EAEKMKKKTEEAEK 988
+ +K + +E EK
Sbjct: 87 SLKNLKARLKELEK 100
>gnl|CDD|236272 PRK08475, PRK08475, F0F1 ATP synthase subunit B; Validated.
Length = 167
Score = 31.9 bits (73), Expect = 0.76
Identities = 21/76 (27%), Positives = 39/76 (51%), Gaps = 5/76 (6%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
EK+++ K+KK++A K E+ K+K + + KK A + +K +K+ K + K
Sbjct: 67 EKLKESKEKKEDALKK-----LEEAKEKAELIVETAKKEAYILTQKIEKQTKDDIENLIK 121
Query: 969 KKKKKKEAEKMKKKTE 984
++ E E K + E
Sbjct: 122 SFEELMEFEVRKMERE 137
Score = 29.6 bits (67), Expect = 5.0
Identities = 16/50 (32%), Positives = 29/50 (58%), Gaps = 3/50 (6%)
Query: 940 AEKMK---KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
A+ +K K KI K+ ++ ++K K+ K+KK+ K+ E+ K+K E
Sbjct: 44 AKPLKNFYKSRINKISKRLEEIQEKLKESKEKKEDALKKLEEAKEKAELI 93
>gnl|CDD|234385 TIGR03897, lanti_2_LanM, type 2 lantibiotic biosynthesis protein
LanM. Members of this family are known generally as
LanM, a multifunctional enzyme of lantibiotic
biosynthesis. This catalysis by LanM distinguishes the
type 2 lantibiotics, such as mersacidin, cinnamycin, and
lichenicidin, from LanBC-produced type 1 lantibiotics
such as nisin and subtilin. The N-terminal domain
contains regions associated with Ser and Thr
dehydration. The C-terminal region contains a pfam05147
domain, which catalyzes the formation of the lanthionine
bridge [Cellular processes, Toxin production and
resistance].
Length = 931
Score = 33.4 bits (77), Expect = 0.76
Identities = 31/165 (18%), Positives = 50/165 (30%), Gaps = 43/165 (26%)
Query: 269 LARATNWAKLTATASLGVIHRGHETEALAHMQSYLPRET-----GPTSGYY-EGGGLYAL 322
LA T + A +AL ++ YL + + GG +YAL
Sbjct: 608 LAALTGKKRYRDLAR----------KALQPLRKYLETQLRLAKSMGLGAFSGLGGIIYAL 657
Query: 323 GLI--------HANHGAVINDYLLGQVKDATNEDDAVTGEAAGIAMGLVLLGSKSESAI- 373
+ N I + L + D + G AAG + L+ L +
Sbjct: 658 AHLGQLLNDSELLNDAKRILNRLEELIIKDEEFLDLIGG-AAGAILVLLNLYEVTGDPEV 716
Query: 374 -----------------QDMVAYAKETQHEKILRGLAVGISFIMY 401
++ K Q EK L G + G + I +
Sbjct: 717 LELAIACGEHLLKQAVKEEGGIAWKTPQDEKPLTGFSHGAAGIAW 761
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 32.9 bits (75), Expect = 0.77
Identities = 15/50 (30%), Positives = 20/50 (40%), Gaps = 4/50 (8%)
Query: 134 APIPDATHVVVKVP----VEGAPTPATPDTEAPTPAPGTESGPSSTTPGP 179
A + A+ V + +P + AP P P T A PAP P P
Sbjct: 21 AAVAGASAVAIALPATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPP 70
>gnl|CDD|216568 pfam01555, N6_N4_Mtase, DNA methylase. Members of this family are
DNA methylases. The family contains both N-4
cytosine-specific DNA methylases and N-6
Adenine-specific DNA methylases.
Length = 219
Score = 32.4 bits (74), Expect = 0.78
Identities = 11/68 (16%), Positives = 22/68 (32%), Gaps = 9/68 (13%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK-KKKKKKKKKKKKKK 970
K KK +K + +K+KK+G + K + + +K+
Sbjct: 105 SKTKKYYTFNYD--------AIKVPYDEKDKLKKRGDSEPNGKPIGDVWDFPRVQPSEKE 156
Query: 971 KKKKEAEK 978
+K
Sbjct: 157 SGGHPTQK 164
>gnl|CDD|149048 pfam07768, PVL_ORF50, PVL ORF-50-like family. This is a family of
sequences found in both bacteria and bacteriophages.
This region is approximately 130 residues long and in
some cases is found as part of the PVL (Panton-Valentine
leukocidin) group of genes, which encode a member of the
leukocidin group of bacterial toxins that kill
leukocytes by creation of pores in the cell membrane.
PVL appears to be a virulence factor associated with a
number of human diseases.
Length = 118
Score = 31.3 bits (71), Expect = 0.78
Identities = 22/84 (26%), Positives = 44/84 (52%), Gaps = 11/84 (13%)
Query: 910 KIRKKKKK--KKEAEKMRKKK-GAEKMKKKKKDAEKMKK--------KGAEKIGKKKKKK 958
KIR K K + E E M+K ++K+ K ++ + + E KK +
Sbjct: 3 KIRIKPYKFTEFENELMKKYGITPGIVRKRVKGGWELSEALDAPYGMRLKEYREIKKSEN 62
Query: 959 KKKKKKKKKKKKKKKKEAEKMKKK 982
++++K+++ ++K++KEAE +KK
Sbjct: 63 IEQERKERELERKRRKEAELRRKK 86
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 31.2 bits (71), Expect = 0.78
Identities = 14/61 (22%), Positives = 30/61 (49%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
+K+ KE +K + + ++ + K ++K ++ KK+KK + + KK KK
Sbjct: 41 QKQLLKECQKTSQLIHLSQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKK 100
Query: 974 K 974
+
Sbjct: 101 R 101
>gnl|CDD|235263 PRK04243, PRK04243, 50S ribosomal protein L15e; Validated.
Length = 196
Score = 32.2 bits (74), Expect = 0.79
Identities = 11/28 (39%), Positives = 15/28 (53%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEK 923
+K R RKK KG EK+R + + K
Sbjct: 169 KKGRGLRKKGKGTEKVRPSIRANERRGK 196
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 32.8 bits (75), Expect = 0.80
Identities = 16/89 (17%), Positives = 46/89 (51%), Gaps = 9/89 (10%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGA-----EKMKKKKKDA----EKMKKKGAEKIGKKKKKK 958
E+ R + + + ++ +K+ E++ K ++++ E M++ E+I + +++
Sbjct: 104 QEQQRAQYQDELARKRYQKELEQQRRQNEELLKMQEESVLRQEAMRRATEEEILEMRRET 163
Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+++ + +++ + K EAE + EE E
Sbjct: 164 IEEEAELERENIRAKIEAEARGRAKEERE 192
>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
replication, recombination, and repair / Transcription /
Translation, ribosomal structure and biogenesis].
Length = 513
Score = 33.2 bits (76), Expect = 0.81
Identities = 16/84 (19%), Positives = 28/84 (33%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
I F + KK K EK ++K + + + A+ +K + E+ E
Sbjct: 374 ISFVTEEEEVKKLKRIEKRLERKLPSAVLLPLDEPEDAKLLKTTRPGLEEESDISDEIKK 433
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEA 976
K KK + + K
Sbjct: 434 LKSSKKALLRGLGVRFTLSKLLAN 457
Score = 30.5 bits (69), Expect = 5.7
Identities = 6/74 (8%), Positives = 16/74 (21%), Gaps = 12/74 (16%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKM----------RKKKGAEKMKKKKK--DAEK 942
+ + K+ K+ R A+ ++ K
Sbjct: 438 KKALLRGLGVRFTLSKLLANLGKEIPGAGDAVTIDPELERRSPNSADDIEYILKGLSYRA 497
Query: 943 MKKKGAEKIGKKKK 956
++ + K
Sbjct: 498 EERTAKNEAANIKL 511
>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein.
Length = 105
Score = 30.8 bits (70), Expect = 0.82
Identities = 15/35 (42%), Positives = 17/35 (48%)
Query: 933 MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
M KK+ + K A GK KKKK K K K K
Sbjct: 1 MPPKKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35
Score = 30.4 bits (69), Expect = 1.4
Identities = 15/32 (46%), Positives = 16/32 (50%)
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
KK +K K A G K KKKK K K K K
Sbjct: 4 KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35
Score = 28.1 bits (63), Expect = 9.0
Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 2/32 (6%)
Query: 953 KKKKKKKKKKKKKKK--KKKKKKKEAEKMKKK 982
KK++KK K K KKKK K+K K
Sbjct: 4 KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35
Score = 27.7 bits (62), Expect = 9.7
Identities = 13/32 (40%), Positives = 14/32 (43%)
Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
K +K K A K KKKK K K K K
Sbjct: 4 KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35
>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
structure and biogenesis].
Length = 67
Score = 30.0 bits (68), Expect = 0.82
Identities = 14/46 (30%), Positives = 30/46 (65%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
K++K+ E +++ EK +K+K+KK +K+K K+ +K+++ E
Sbjct: 22 KVEKEGILREMKEREFYEKPSEKRKRKKAAARKRKFKRLRKEQQRE 67
Score = 27.7 bits (62), Expect = 5.5
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEK 932
K+++ EK +K+K+KK A + RK K K
Sbjct: 30 REMKEREFYEKPSEKRKRKKAAARKRKFKRLRK 62
Score = 27.3 bits (61), Expect = 7.1
Identities = 11/30 (36%), Positives = 19/30 (63%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEK 923
+ K +RK+KK A + RK K+ +KE ++
Sbjct: 37 FYEKPSEKRKRKKAAARKRKFKRLRKEQQR 66
>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b. This model
describes the F1/F0 ATP synthase b subunit in bacteria
only. Scoring just below the trusted cutoff are the
N-terminal domains of Mycobacterial b/delta fusion
proteins and a subunit from an archaeon, Methanosarcina
barkeri, in which the ATP synthase homolog differs in
architecture and is not experimentally confirmed. This
model helps resolve b from the related b' subunit.
Within the family is an example from a
sodium-translocating rather than proton-translocating
ATP synthase [Energy metabolism, ATP-proton motive force
interconversion].
Length = 147
Score = 31.6 bits (72), Expect = 0.83
Identities = 22/86 (25%), Positives = 51/86 (59%), Gaps = 3/86 (3%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
++KK A+ + ++ KKEA +KK +K+ K +A+++ + A K G + ++ K +
Sbjct: 27 RQKKIADGLASAERAKKEAALA-QKKAQVILKEAKDEAQEIIEN-ANKRGSEILEEAKAE 84
Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
+++++K K + AE ++ + E+A +
Sbjct: 85 AREEREKIKAQARAE-IEAEKEQARE 109
Score = 28.9 bits (65), Expect = 7.1
Identities = 24/82 (29%), Positives = 40/82 (48%), Gaps = 5/82 (6%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
+ R+KK A R KK A +KK + K K A++I + K+ + ++ K
Sbjct: 25 ETRQKKIADGLASAERAKKEAALAQKKAQVILKEAKDEAQEI--IENANKRGSEILEEAK 82
Query: 970 KKKKKEAEKMKKKTE---EAEK 988
+ ++E EK+K + EAEK
Sbjct: 83 AEAREEREKIKAQARAEIEAEK 104
>gnl|CDD|233294 TIGR01159, DRP1, density-regulated protein DRP1. This protein
family shows weak but suggestive similarity to
translation initiation factor SUI1 and its prokaryotic
homologs.
Length = 173
Score = 32.2 bits (73), Expect = 0.83
Identities = 18/57 (31%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
EAE + + E+ K +KD ++K + G KKKKK +K K++ + K+K
Sbjct: 48 TAEAEAVTIGEAQEE--KGEKDLLGIQKAQEAREGGKKKKKLPQKVTIKREPRTKRK 102
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 33.0 bits (75), Expect = 0.86
Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 12/108 (11%)
Query: 884 REAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM 943
+EAV+ G I K + R ++ KE E M ++ A K+KK+ + AEK
Sbjct: 8 KEAVSLFGGIADWKAHLPQ---------RSERVLVKETELMLAQEEANKLKKELEVAEKE 58
Query: 944 KKKGAEKIGKKKKKK---KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + +++ K+ K K +K +K++++ K+++E K + EE E+
Sbjct: 59 KLQVLKELESTKRTVEDLKLKLEKAEKEEQQAKQDSELAKLRAEELEQ 106
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 33.3 bits (76), Expect = 0.88
Identities = 18/70 (25%), Positives = 25/70 (35%)
Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
K K+ + EK+ K + KK + K KK K +K K+ EK
Sbjct: 645 KNNYKVNFQNFLEKIFGGPKHIFAISKKLINEAKIYWKAINNTKKSVVIKPLEKPKKYEK 704
Query: 979 MKKKTEEAEK 988
E EK
Sbjct: 705 TSVSKENLEK 714
Score = 31.3 bits (71), Expect = 3.0
Identities = 16/55 (29%), Positives = 22/55 (40%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
KK + + K +K K EK KK +K + + E G K KKK
Sbjct: 671 KKLINEAKIYWKAINNTKKSVVIKPLEKPKKYEKTSVSKENLEKEIFGDKFTKKK 725
Score = 30.6 bits (69), Expect = 4.6
Identities = 17/86 (19%), Positives = 27/86 (31%), Gaps = 3/86 (3%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK---MKKKGAEKIGKKKKKKK 959
+ E + K KK E + EK + + D E K K K K
Sbjct: 355 LSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKN 414
Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEE 985
K + +++ +KT E
Sbjct: 415 SSFINKTENILTNSPLKDELLEKTTE 440
Score = 30.2 bits (68), Expect = 7.3
Identities = 19/57 (33%), Positives = 23/57 (40%), Gaps = 2/57 (3%)
Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
KK EA+ K K K EK KK EK K+ +K+ K KKK
Sbjct: 671 KKLINEAKIYWKAINNTKKSVVIKPLEKPKK--YEKTSVSKENLEKEIFGDKFTKKK 725
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 32.5 bits (74), Expect = 0.89
Identities = 21/119 (17%), Positives = 42/119 (35%), Gaps = 19/119 (15%)
Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEK----------------IRKKKKKKKEAEK 923
D + + A G IL K T + + + K + +
Sbjct: 73 DGGKCKTYFARGSILCSKSITSISQNTEGRERLLTVSHENGKLQVTYVTHSKVAHEAEIR 132
Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK---KKKKEAEKM 979
+ +K+ A KK E + K+K+K++ + +K K +KK+E ++
Sbjct: 133 NLGDVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRR 191
Score = 32.1 bits (73), Expect = 1.3
Identities = 14/77 (18%), Positives = 37/77 (48%), Gaps = 1/77 (1%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
+K+ G +K +++ + ++ ++ R+KK E+ ++ +D G+ K +
Sbjct: 159 HRKRSGKQKEKRRVEDSQKHKEDRRKK-QEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEP 217
Query: 962 KKKKKKKKKKKKKEAEK 978
K+K++K ++ E
Sbjct: 218 PKEKRQKHHDPERRLEP 234
Score = 31.8 bits (72), Expect = 1.7
Identities = 16/75 (21%), Positives = 37/75 (49%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
K + E + K AEK +K+ + + K + K+K+K++ + +K K+ ++
Sbjct: 124 KVAHEAEIRNLGDVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRR 183
Query: 974 KEAEKMKKKTEEAEK 988
K+ E+ ++ E+
Sbjct: 184 KKQEEKRRNDEDKRP 198
Score = 29.5 bits (66), Expect = 6.8
Identities = 13/51 (25%), Positives = 28/51 (54%)
Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
RKK E K+ K EK + + ++K + ++KK+++K++ + K+
Sbjct: 152 RKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGG 202
Score = 29.5 bits (66), Expect = 8.0
Identities = 19/81 (23%), Positives = 39/81 (48%), Gaps = 4/81 (4%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
EK + +KK ++ RK+ G +K K++ +D++K K E KK+++K++ + K+
Sbjct: 143 FEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHK----EDRRKKQEEKRRNDEDKRP 198
Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
+ +E K
Sbjct: 199 GGGGGSSGGQSGLSTKDEPPK 219
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
binding. This is the middle region of a family of TATA
element modulatory factor 1 proteins conserved in
eukaryotes that contains at its N-terminal section a
number of leucine zippers that could potentially form
coiled coil structures. The whole proteins bind to the
TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells.
Length = 74
Score = 30.0 bits (68), Expect = 0.90
Identities = 20/59 (33%), Positives = 36/59 (61%), Gaps = 1/59 (1%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K+ +K + ++ ++G EK+ KK+ K KK + K K+ +KE ++KKK E+ EK
Sbjct: 4 EKKLAEKDEQIAQLMEEG-EKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEK 61
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 32.7 bits (75), Expect = 0.91
Identities = 20/94 (21%), Positives = 48/94 (51%), Gaps = 6/94 (6%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKK---GAEKMKKKKKDAEKMKKKG---AEKIGKK 954
R+ + + +++ K + + + +K ++ KK + E+++ K +++IGK
Sbjct: 4 RKLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKA 63
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K +KK K ++ KK+ K+ K+E ++ + E
Sbjct: 64 KGQKKDKIEEIKKELKELKEELTELSAALKALEA 97
>gnl|CDD|220400 pfam09785, Prp31_C, Prp31 C terminal domain. This is the C
terminal domain of the pre-mRNA processing factor Prp31.
Prp31 is required for U4/U6.U5 tri-snRNP formation. In
humans this protein has been linked to autosomal
dominant retinitis pigmentosa.
Length = 124
Score = 31.2 bits (71), Expect = 0.91
Identities = 22/87 (25%), Positives = 40/87 (45%), Gaps = 4/87 (4%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK-IGKKKKKKKKK 961
KKK+G + RK K++ E +RK +M+ K++ E +G +
Sbjct: 16 KKKRGGRRFRKMKERFAMTE-LRKA--QNRMEFGKEEEEVGYDFDEGVGLGMLGSGGGGR 72
Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ +K K K K ++KMKK+ + +
Sbjct: 73 IRLSQKDAKTKAKLSKKMKKRLQSSGA 99
>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
Members of this family are the S-layer-forming
halobacterial major cell surface glycoprotein. The
highest scores below model cutoffs are fragmentary
paralogs to actual members of the family. Modifications
include at N-linked and O-linked glycosylation, a
C-terminal diphytanylglyceryl modification, and probable
cleavage of the PGF-CTERM tail.
Length = 782
Score = 32.9 bits (75), Expect = 0.94
Identities = 15/55 (27%), Positives = 19/55 (34%), Gaps = 2/55 (3%)
Query: 138 DATHVVVKVPVEGAPTPATPDTEAPTPAPGTESG-PSSTTPGPETKTLESLNEKE 191
D +V V V P T + T P T ++ T P T T E E
Sbjct: 701 DRVNVEV-VEETERPDTTTSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETT 754
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 33.0 bits (76), Expect = 0.94
Identities = 25/128 (19%), Positives = 55/128 (42%), Gaps = 23/128 (17%)
Query: 873 VAVSDVNDDVRR-EAVTALGFILFRKKRTRRKKKKGA---------EKIR----KKKKKK 918
++ + VRR EAVT + + +++ K+ + K+ + K+ +
Sbjct: 686 ISEEGIAAGVRRIEAVTGEAALEYLQEQEDLLKEIASLLKVPPELLPKVERLLEELKELE 745
Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
KE E+++KK A ++ +A+ + G + + K+ K+ ++ A+
Sbjct: 746 KELERLKKKLAAAELADLLSNAKAEEIGGVKVLA---------KEVDGADMKELREIADD 796
Query: 979 MKKKTEEA 986
+KKK A
Sbjct: 797 LKKKLGSA 804
>gnl|CDD|225949 COG3415, COG3415, Transposase and inactivated derivatives [DNA
replication, recombination, and repair].
Length = 138
Score = 31.6 bits (72), Expect = 0.94
Identities = 11/81 (13%), Positives = 34/81 (41%), Gaps = 1/81 (1%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
K + +R++ E + ++ A++ + ++ E G K +K
Sbjct: 1 MAKPFSNDLRERVVDAVVGEGLSCREAAKRFGVSISTVYRWVRRYRET-GLDLPPKPRKG 59
Query: 963 KKKKKKKKKKKKEAEKMKKKT 983
+ +K +++ + E++++K
Sbjct: 60 RPRKLSEEQLEILLERLREKD 80
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 33.0 bits (75), Expect = 1.0
Identities = 14/46 (30%), Positives = 16/46 (34%), Gaps = 10/46 (21%)
Query: 132 TCAPIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTP 177
AP P A PATP A P T +GP + P
Sbjct: 2736 PAAPAPPAV----------PAGPATPGGPARPARPPTTAGPPAPAP 2771
Score = 33.0 bits (75), Expect = 1.0
Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 1/45 (2%)
Query: 134 APIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPG 178
AP+P + PV A P P T P+ PG++ GP+
Sbjct: 409 APVPASVPTPAPTPVP-ASAPPPPATPLPSAEPGSDDGPAPPPER 452
Score = 31.1 bits (70), Expect = 3.9
Identities = 10/35 (28%), Positives = 14/35 (40%)
Query: 147 PVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPET 181
P+P PDT AP P P + S ++
Sbjct: 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPP 2643
Score = 30.7 bits (69), Expect = 4.9
Identities = 15/51 (29%), Positives = 15/51 (29%), Gaps = 1/51 (1%)
Query: 135 PIPDATHVVVKVPVEGAPTPATPDTE-APTPAPGTESGPSSTTPGPETKTL 184
P P P P P P APT P PS P P L
Sbjct: 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967
Score = 30.3 bits (68), Expect = 8.0
Identities = 11/52 (21%), Positives = 13/52 (25%), Gaps = 1/52 (1%)
Query: 134 APIPDATHVVVKVPVEGAPTPATPDT-EAPTPAPGTESGPSSTTPGPETKTL 184
AP P T AP P +GP P +L
Sbjct: 2740 APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 32.8 bits (76), Expect = 1.0
Identities = 15/59 (25%), Positives = 26/59 (44%), Gaps = 2/59 (3%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
E E+E +K EK K K +K++K +K+ + + KK K ++K
Sbjct: 202 EDENDDSLAADESE--LPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKLREK 258
>gnl|CDD|217318 pfam03011, PFEMP, PFEMP DBL domain. PfEMP1 (Plasmodium falciparum
erythrocyte membrane protein) has been identified as the
rosetting ligand of the malaria parasite P. falciparum.
Rosetting is the adhesion of infected erythrocytes with
uninfected erythrocytes in the vasculature of the
infected organ, and is associated with severe malaria.
PfEMP1 interacts with Complement Receptor One on
uninfected erythrocytes to form rosettes.
Length = 150
Score = 31.5 bits (72), Expect = 1.1
Identities = 24/104 (23%), Positives = 37/104 (35%), Gaps = 25/104 (24%)
Query: 903 KKKKGAEKIR----KKKKKKKEA----------EKMRKKKGAEKMKKKKKDAEKMKKKGA 948
KKKK EKI+ K+ K E + + +K K+ +K+K
Sbjct: 49 KKKKEWEKIKEHFNKQYKIDDGRDEYNVLESFLEDLLFQIDIKKAYGDAKELKKIK---- 104
Query: 949 EKIGKKKKKKKKKKKKKKKKK-------KKKKKEAEKMKKKTEE 985
E + K + K +K+A+K KKK E
Sbjct: 105 ELLKCNGCNNSANSAKNGENKDAIDCLLDHLQKKAKKCKKKHSE 148
Score = 28.8 bits (65), Expect = 7.3
Identities = 21/102 (20%), Positives = 31/102 (30%), Gaps = 21/102 (20%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAE--KMKKKKKDAEKMKKKG------------- 947
KG KKK + EK +KK E K+K+ K+
Sbjct: 25 NNGKGKCCKCICKKKCECVEKWIEKKKKEWEKIKEHFNKQYKIDDGRDEYNVLESFLEDL 84
Query: 948 ------AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
+ G K+ KK K+ K A+ + K
Sbjct: 85 LFQIDIKKAYGDAKELKKIKELLKCNGCNNSANSAKNGENKD 126
Score = 28.8 bits (65), Expect = 8.8
Identities = 18/53 (33%), Positives = 24/53 (45%), Gaps = 13/53 (24%)
Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK---------KKKKEAEKMKK 981
+D+ K +KK + K K K KKK KKKKE EK+K+
Sbjct: 11 EDSIKWRKK----LKHCINNGKGKCCKCICKKKCECVEKWIEKKKKEWEKIKE 59
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 31.6 bits (72), Expect = 1.1
Identities = 9/55 (16%), Positives = 15/55 (27%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K + A+ + KKKK + + AE+
Sbjct: 99 KPSKLELFNAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEE 153
>gnl|CDD|201294 pfam00542, Ribosomal_L12, Ribosomal protein L7/L12 C-terminal
domain.
Length = 68
Score = 29.8 bits (68), Expect = 1.1
Identities = 15/36 (41%), Positives = 22/36 (61%)
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
+G K+ K + K K+ K+EAE++KKK EEA
Sbjct: 26 LGLKEAKDLVEGAPKTIKEGVSKEEAEEIKKKLEEA 61
>gnl|CDD|179302 PRK01558, PRK01558, V-type ATP synthase subunit E; Provisional.
Length = 198
Score = 32.0 bits (73), Expect = 1.1
Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 4/57 (7%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKK----KKKKKKKKKKKKKKKEAEKMKKKTEEA 986
K KK E+ ++ E I + K++ + K +++ K+ K K +KEA K+ EA
Sbjct: 11 KIKKDGLEEAERLANEIILEAKEEAEEIIAKAEEEAKELKAKAEKEANDYKRHALEA 67
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 30.7 bits (70), Expect = 1.1
Identities = 12/33 (36%), Positives = 21/33 (63%), Gaps = 1/33 (3%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK 927
F++++ ++KKK E +KK+KKE + RK
Sbjct: 37 FKREKKKKKKKSEGE-FFAEKKEKKEVSEQRKA 68
Score = 30.3 bits (69), Expect = 1.4
Identities = 11/35 (31%), Positives = 22/35 (62%)
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
K++ +K KKK + +KK+KK+ +++K +K
Sbjct: 38 KREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72
Score = 29.1 bits (66), Expect = 3.2
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 964 KKKKKKKKKKKEAEKMKKKTEEAE 987
K++KKKKKKK E E +K E+ E
Sbjct: 38 KREKKKKKKKSEGEFFAEKKEKKE 61
Score = 29.1 bits (66), Expect = 3.9
Identities = 15/35 (42%), Positives = 24/35 (68%), Gaps = 3/35 (8%)
Query: 955 KKKKKKKKKKKKK---KKKKKKKEAEKMKKKTEEA 986
K++KKKKKKK + +KK+KKE + +K ++A
Sbjct: 38 KREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72
Score = 28.7 bits (65), Expect = 4.8
Identities = 14/35 (40%), Positives = 21/35 (60%), Gaps = 3/35 (8%)
Query: 952 GKKKKKKKKKKKK---KKKKKKKKKKEAEKMKKKT 983
++KKKKKKK + +KK+KK+ E K +K
Sbjct: 38 KREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 32.8 bits (75), Expect = 1.1
Identities = 18/80 (22%), Positives = 29/80 (36%), Gaps = 3/80 (3%)
Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK---GAEKIGKKKKKKKKKK 962
K E K++K + K ++K D + G + K+K +K K
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDL 271
Query: 963 KKKKKKKKKKKKEAEKMKKK 982
K+K KK + M K
Sbjct: 272 KEKAKKLGISIILFDDMTKN 291
Score = 31.6 bits (72), Expect = 2.2
Identities = 18/95 (18%), Positives = 31/95 (32%), Gaps = 14/95 (14%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
L + K K++ K ++KG + + + + + EK+
Sbjct: 209 TLIKSKEININKEEKNNGSNVNNNGNKN--NKEEQKGNDLSNELEDISLGPLEYDKEKLE 266
Query: 953 KKKKKKKKKKK------------KKKKKKKKKKKE 975
K K K+K KK K K K + E
Sbjct: 267 KIKDLKEKAKKLGISIILFDDMTKNKTTNYKIQNE 301
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 32.7 bits (75), Expect = 1.1
Identities = 10/45 (22%), Positives = 16/45 (35%)
Query: 933 MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
K K K K+ EK K++ K K + +K +
Sbjct: 9 SKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADD 53
Score = 30.7 bits (70), Expect = 5.0
Identities = 11/48 (22%), Positives = 23/48 (47%)
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
E + K +K K K+ +++K K++ K A+ +K+ A+
Sbjct: 5 GEPLSKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAAD 52
Score = 30.0 bits (68), Expect = 6.7
Identities = 12/38 (31%), Positives = 17/38 (44%), Gaps = 2/38 (5%)
Query: 896 RKKRTRRKK--KKGAEKIRKKKKKKKEAEKMRKKKGAE 931
KKR + K+ ++ A K K A K R +K A
Sbjct: 13 LKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAA 50
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 32.3 bits (74), Expect = 1.1
Identities = 23/67 (34%), Positives = 39/67 (58%), Gaps = 9/67 (13%)
Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK---KKKKKKKKKKKKKKKKKKKKK 974
K+E EKM KK +KK +K+ K+K EK K + ++++K+ K+ +K KKK
Sbjct: 243 KQEMEKMSKK-----IKKLEKENLAWKRK-WEKSNKALLEMAEERQKRDKEIEKLKKKID 296
Query: 975 EAEKMKK 981
+ EK+ +
Sbjct: 297 KLEKLCR 303
Score = 30.0 bits (68), Expect = 5.4
Identities = 24/64 (37%), Positives = 37/64 (57%), Gaps = 8/64 (12%)
Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK--KKKKKKKKKKKEAEKMKKKTE 984
K+ EKM KK K EK E + K+K +K K + ++++K+ KE EK+KKK +
Sbjct: 243 KQEMEKMSKKIKKLEK------ENLAWKRKWEKSNKALLEMAEERQKRDKEIEKLKKKID 296
Query: 985 EAEK 988
+ EK
Sbjct: 297 KLEK 300
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap is
involved in recombination. Rap (recombination adept with
plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 31.9 bits (73), Expect = 1.1
Identities = 9/37 (24%), Positives = 25/37 (67%)
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
A + ++K ++KK+K + + ++++ K EK+K +++
Sbjct: 31 ALALKREKAQEKKRKAEAQAERRELKARKEKLKTRSD 67
Score = 30.0 bits (68), Expect = 3.8
Identities = 11/49 (22%), Positives = 26/49 (53%), Gaps = 5/49 (10%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
A +K++K ++K + + ++++ K +K+K K + KEA+
Sbjct: 31 ALALKREKA-----QEKKRKAEAQAERRELKARKEKLKTRSDWLKEAQA 74
>gnl|CDD|150406 pfam09727, CortBP2, Cortactin-binding protein-2. This entry is the
first approximately 250 residues of cortactin-binding
protein 2. In addition to being a positional candidate
for autism this protein is expressed at highest levels
in the brain in humans. The human protein has six
associated ankyrin repeat domains pfam00023 towards the
C-terminus which act as protein-protein interaction
domains.
Length = 193
Score = 31.8 bits (72), Expect = 1.2
Identities = 16/73 (21%), Positives = 39/73 (53%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
AEK ++K + E EK + + +K E+ +++ + + ++K ++ KK+++ +K
Sbjct: 108 AEKRQRKTVLELEEEKRKHIRYMKKSDDFTNLLEQERERLKKLLEQEKSQQAKKEQEHRK 167
Query: 968 KKKKKKKEAEKMK 980
++E K+K
Sbjct: 168 LLATLEEELGKLK 180
>gnl|CDD|240396 PTZ00388, PTZ00388, 40S ribosomal protein S8-like; Provisional.
Length = 223
Score = 32.0 bits (73), Expect = 1.2
Identities = 24/84 (28%), Positives = 36/84 (42%), Gaps = 6/84 (7%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK------IGKKKK 956
KK+ AEK KK K EK K+K EK+ + + ++ + + K+K+
Sbjct: 14 NKKRYAEKAEMKKTIKAHEEKDVKEKVPEKVPDGAVPSYLLDREQVNRAKVLSNMLKQKR 73
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMK 980
K+K K K K E E K
Sbjct: 74 KEKAGKWDVPIPKVKAMNEDEMFK 97
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 31.2 bits (71), Expect = 1.2
Identities = 13/54 (24%), Positives = 24/54 (44%)
Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
K+ K + + +K A K K ++++ +K +K+ KK KK E
Sbjct: 72 IKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125
Score = 30.8 bits (70), Expect = 1.4
Identities = 9/51 (17%), Positives = 21/51 (41%)
Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K + E ++K + +K+ K ++++ +K + K K E
Sbjct: 75 GKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125
Score = 30.8 bits (70), Expect = 1.6
Identities = 8/63 (12%), Positives = 23/63 (36%)
Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
+ + E+ + E + ++++ +K +K+ KK KK +
Sbjct: 69 EDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQ 128
Query: 977 EKM 979
++
Sbjct: 129 GQL 131
Score = 29.7 bits (67), Expect = 3.5
Identities = 13/54 (24%), Positives = 25/54 (46%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
K K ++ E +++K +A K K ++ +K +K+ KK KK +
Sbjct: 72 IKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125
Score = 28.9 bits (65), Expect = 5.9
Identities = 8/52 (15%), Positives = 24/52 (46%)
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ D + K + +++K + +K+ K + E+++K + ++K
Sbjct: 69 EDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKK 120
Score = 28.5 bits (64), Expect = 7.8
Identities = 9/34 (26%), Positives = 18/34 (52%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK 938
++ + RK+ K + E++RK + K KK +
Sbjct: 92 EEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125
Score = 28.5 bits (64), Expect = 8.3
Identities = 13/58 (22%), Positives = 24/58 (41%), Gaps = 4/58 (6%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
I+ K + E+ K + + E +K E+ +K +K+ KK KK +
Sbjct: 72 IKVGKPINCRVTVNHTVELTEEEKAEAR-KEALKAYQQEE---LRKIQKRSKKSKKAE 125
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 32.5 bits (74), Expect = 1.2
Identities = 17/79 (21%), Positives = 31/79 (39%), Gaps = 2/79 (2%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
E+ K K KK E + G + KK + +E+ K +K + + K
Sbjct: 42 EQSEKAKPPKKPKEA--SRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPK 99
Query: 969 KKKKKKEAEKMKKKTEEAE 987
K E+E+ ++ E +
Sbjct: 100 SKAPSTESEEEEEPEETPD 118
Score = 32.1 bits (73), Expect = 1.5
Identities = 18/90 (20%), Positives = 32/90 (35%), Gaps = 8/90 (8%)
Query: 907 GAEKIRKKKKKKKEAEKMRKKKGAEKM--------KKKKKDAEKMKKKGAEKIGKKKKKK 958
EK KK KK +A K + + + +K K +K K+ + KK
Sbjct: 9 PDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKS 68
Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + ++K + K + T +
Sbjct: 69 SKPTESSAASSEEKPAKPRKSAESTRSSHP 98
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 32.9 bits (75), Expect = 1.2
Identities = 26/103 (25%), Positives = 46/103 (44%), Gaps = 8/103 (7%)
Query: 885 EAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK 944
+ T L F + K K E+I+ ++K + K K+ + K ++K
Sbjct: 1053 KMKTKLSSFHFNIDIKKYKNPKIKEEIKLLEEKVEALLK--------KIDENKNKLIEIK 1104
Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K E + K+K K+ + KKKK +K ++M+K +E E
Sbjct: 1105 NKSHEHVVNADKEKNKQTEHYNKKKKSLEKIYKQMEKTLKELE 1147
Score = 31.3 bits (71), Expect = 3.3
Identities = 26/93 (27%), Positives = 39/93 (41%), Gaps = 8/93 (8%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGA----EKMKKKKKDAEKMKKKGAEK----IG 952
+ K+KG E K + EKM+ K + +KK K K + K E+ +
Sbjct: 1031 KLIKEKGKEIEEKVDQYISLLEKMKTKLSSFHFNIDIKKYKNPKIKEEIKLLEEKVEALL 1090
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
KK + K K + K K + A+K K K E
Sbjct: 1091 KKIDENKNKLIEIKNKSHEHVVNADKEKNKQTE 1123
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that
may also include truncated forms and/or pseudogenes.
Members are predicted lipoproteins with a conserved
signal peptidase II processing and lipid attachment
site. Note that the name for certain characterized
members, p72, reflects an anomalous apparent molecular
weight, given a theoretical MW of about 61 kDa.
Length = 541
Score = 32.5 bits (74), Expect = 1.2
Identities = 14/75 (18%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 913 KKKKKKKEAEKMRK--KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
K+ +KK E + K + K+ + K K + +KK K K
Sbjct: 32 KQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNK 91
Query: 971 KKKKEAEKMKKKTEE 985
++ ++ + KK ++
Sbjct: 92 EEIEKPKDEPKKPDK 106
Score = 29.8 bits (67), Expect = 7.5
Identities = 20/87 (22%), Positives = 35/87 (40%), Gaps = 6/87 (6%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+++ +KK I+ + K +K K+ +E K + K E
Sbjct: 27 TSSNSKQPEKKPE--IKPNENTPKIPKKPDNKEPSENNNNKSNNENK----DEENPSSTN 80
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+KK K K++ +K K E +K KK
Sbjct: 81 PEKKPDPSKNKEEIEKPKDEPKKPDKK 107
>gnl|CDD|217754 pfam03836, RasGAP_C, RasGAP C-terminus.
Length = 139
Score = 31.1 bits (71), Expect = 1.2
Identities = 13/42 (30%), Positives = 20/42 (47%)
Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
++KK K KK +KK KK K K A+++ +K
Sbjct: 86 STLQKKKKFSKQKFSVKKLQKKHGKKPKFGSYKYSAKQLYEK 127
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 32.8 bits (76), Expect = 1.2
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 16/83 (19%)
Query: 918 KKEAEKMRK--KKGAEKMKKKKKD-------------AEKMKKKGAEKIGKKKKKKKKKK 962
+E E+M K + AE+ KK+K+ EK K+ +K+ +K+K +
Sbjct: 502 DEEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAA 561
Query: 963 KKKKKKKKKKKKEAEKMKKKTEE 985
K+ K+ K + + E +K KTEE
Sbjct: 562 IKELKEALKGE-DKEAIKAKTEE 583
>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090). This
family consists of several bacterial proteins of unknown
function and is known as YqjC in E. coli.
Length = 115
Score = 30.7 bits (70), Expect = 1.2
Identities = 12/49 (24%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 921 AEKMRKKKGAE-KMKKKKKDAEKMKKKG-AEKIGKKKKKKKKKKKKKKK 967
AE+ K + E ++ +++++ ++ + KG A+KI KK++K + + + ++
Sbjct: 66 AERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114
Score = 28.4 bits (64), Expect = 6.5
Identities = 10/46 (21%), Positives = 27/46 (58%)
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
K+++K+++ + +++ E K K +KK++K + + + +EA
Sbjct: 70 AKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAELQEA 115
Score = 28.4 bits (64), Expect = 8.2
Identities = 9/25 (36%), Positives = 16/25 (64%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKK 927
+ K A+KI KK++K EA+ ++
Sbjct: 90 QAKGDADKIEKKQRKLAEAQAELQE 114
>gnl|CDD|227453 COG5124, COG5124, Protein predicted to be involved in meiotic
recombination [Cell division and chromosome partitioning
/ General function prediction only].
Length = 209
Score = 31.8 bits (72), Expect = 1.2
Identities = 25/98 (25%), Positives = 48/98 (48%), Gaps = 6/98 (6%)
Query: 897 KKRTRRKKKKGAEKIRKK-KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
K +T +K +E ++KK ++ K++ +++ EK ++KK E K E + +K+
Sbjct: 73 KSQTLQKLYDSSELLKKKIQEVKQDIATYKEEIDKEKATRRKKFTEGQKNYNREALLEKR 132
Query: 956 KKKKKKKKKKKKKKKKKKKEA-----EKMKKKTEEAEK 988
KK++ + KKK +K + + KKK K
Sbjct: 133 KKEQDEIKKKLNSLQKIEPIRWDAAKIQEKKKKVHLNK 170
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 32.6 bits (74), Expect = 1.2
Identities = 22/104 (21%), Positives = 32/104 (30%), Gaps = 23/104 (22%)
Query: 106 MAAQIGFDLYESATQAFLAS--VLQELKTCAP--IPDATHVVVKV--------------- 146
+ A+ G D Y A Q + S +L + KT I + VVV V
Sbjct: 30 IEAEQGKDAYPVAQQKLIYSGKILSDDKTVKEYKIKEKDFVVVMVSKPKTGTGKVAPPAA 89
Query: 147 ----PVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTLES 186
P+P + AP + S + T T
Sbjct: 90 TPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPE 133
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 31.8 bits (72), Expect = 1.2
Identities = 16/76 (21%), Positives = 31/76 (40%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
RK K+K +E K + + EK+ K++ + K+ K + +++ + K
Sbjct: 95 RKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGK 154
Query: 972 KKKEAEKMKKKTEEAE 987
+E E E A
Sbjct: 155 VHEETEDSYHVEETAS 170
Score = 29.1 bits (65), Expect = 9.1
Identities = 15/96 (15%), Positives = 31/96 (32%), Gaps = 8/96 (8%)
Query: 901 RRKKKKGAEKIRKK----KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
+ R++ K +E E + + K+ EK ++ E + +
Sbjct: 135 DKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVE 194
Query: 957 KKKKKKKKKKK----KKKKKKKEAEKMKKKTEEAEK 988
K ++ K + ++ E K EE K
Sbjct: 195 KAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELAK 230
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 32.3 bits (73), Expect = 1.2
Identities = 22/104 (21%), Positives = 47/104 (45%), Gaps = 1/104 (0%)
Query: 884 REAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM 943
++ +T L L + +R K+ G +RK K ++E + RK A K +K + E M
Sbjct: 108 KQRLTKLTQYLLKARRLALFKEHGEYMVRKPKMNRREKLRERKALNAAKFEKNIEK-ELM 166
Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+ + G ++ K +++K+++ ++ + EE
Sbjct: 167 LRLKSGIYGDTPLNVREHLWNKAATEREKRQDEKERYVEEEEES 210
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 32.6 bits (74), Expect = 1.2
Identities = 13/57 (22%), Positives = 29/57 (50%)
Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
KKG +K KK+ ++ +K K K+K K ++ K + + +++++K+
Sbjct: 716 KKGVPPKPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKS 772
Score = 30.3 bits (68), Expect = 6.0
Identities = 19/86 (22%), Positives = 33/86 (38%), Gaps = 6/86 (6%)
Query: 906 KGAEKIRKKKKK-----KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI-GKKKKKKK 959
K + +K KK+ K +K + K K+K K+ E+ K ++K ++K
Sbjct: 716 KKGVPPKPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSC 775
Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEE 985
KE+ K + EE
Sbjct: 776 SPSSSSSHHHSSSNKESRKSSRNKEE 801
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 31.4 bits (72), Expect = 1.2
Identities = 15/34 (44%), Positives = 25/34 (73%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
+KKKKKKKKK+ + ++KKKK + +K+ ++A
Sbjct: 83 RKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 31.0 bits (71), Expect = 1.6
Identities = 13/35 (37%), Positives = 21/35 (60%)
Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
K+K +K KKK E +KKKK +++K+ +K
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 31.0 bits (71), Expect = 1.6
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+KKKKKKKKK+ + EK KK E+
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEER 110
Score = 30.6 bits (70), Expect = 1.9
Identities = 14/34 (41%), Positives = 25/34 (73%)
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K+KKKKKKKKK+ + ++KK+ + ++ +E +K
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQK 115
Score = 29.8 bits (68), Expect = 4.2
Identities = 16/32 (50%), Positives = 23/32 (71%), Gaps = 2/32 (6%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEA--EKMKKK 982
K+KKKKKKKKK+ + ++KKKK E+ K+
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKED 113
Score = 29.4 bits (67), Expect = 5.7
Identities = 14/31 (45%), Positives = 23/31 (74%)
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
K+KKKKKKKKK+ + ++KKK+ +++ E
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKE 112
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 32.4 bits (74), Expect = 1.2
Identities = 20/110 (18%), Positives = 37/110 (33%), Gaps = 8/110 (7%)
Query: 883 RREAVTALGFI--LFRKKRTRRKKKKGA-EKIRKKKKKKKEAEKMRK-----KKGAEKMK 934
RR V + LFR+++ ++++ E + K E E+ R+ ++
Sbjct: 163 RRADVDVNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEED 222
Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
+ K K KK K+ + K KK K +
Sbjct: 223 NDNTPSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDA 272
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 31.4 bits (72), Expect = 1.3
Identities = 17/83 (20%), Positives = 45/83 (54%), Gaps = 6/83 (7%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
+K + ++ K +E ++ KK ++E E++ K+ + K +A ++K+ E+ ++K
Sbjct: 113 RKALQAEQGK-SELEQEIKKLEEEKEELEKRV---AELEAKLEA--IEKREEEERQIEEK 166
Query: 957 KKKKKKKKKKKKKKKKKKEAEKM 979
+ + KK+ ++ K + E++
Sbjct: 167 RHADEIAFLKKQNQQLKSQLEQI 189
Score = 30.6 bits (70), Expect = 2.3
Identities = 12/46 (26%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
M+K A + + K + +++ KK +++K++ +K +++ K E EK
Sbjct: 112 MRK--ALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEK 155
>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
Length = 866
Score = 32.5 bits (74), Expect = 1.3
Identities = 20/86 (23%), Positives = 28/86 (32%), Gaps = 14/86 (16%)
Query: 134 APIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPE---------TKTL 184
P P AP PA A PAP + P++ P P+ L
Sbjct: 8 KEEPAKEEATPPAPAASAPAPA-----AAAPAPVAAAAPAAAGPRPDDEPFKASDALHAL 62
Query: 185 ESLNEKEKQLQARITKLSKILSGEVS 210
+L K++ Q + L G S
Sbjct: 63 VALKLKKRIDQIEALDSIEDLVGGKS 88
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 32.4 bits (74), Expect = 1.3
Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 6/73 (8%)
Query: 919 KEAEKMRKK---KGAEKMKKKK-KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
K K+ KK K A +KK K A + K+ ++ K+ KK+ K+K+ K
Sbjct: 325 KNDSKLNKKYILKIANDWARKKVKTAYEAIKENRKR--KELALAKKQSSTTKRKENLPKW 382
Query: 975 EAEKMKKKTEEAE 987
+ KK T + E
Sbjct: 383 SNPEYKKTTSQEE 395
Score = 30.1 bits (68), Expect = 6.0
Identities = 24/99 (24%), Positives = 42/99 (42%), Gaps = 6/99 (6%)
Query: 890 LGFILFRKKRTRRKK--KKGAEKI-RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
L ++L + KK K A RKK K EA K +K+ + KK+ K K+
Sbjct: 319 LDYVLLKNDSKLNKKYILKIANDWARKKVKTAYEAIKENRKRKELALAKKQSSTTKRKEN 378
Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKK---KKKEAEKMKKK 982
+ + KK ++++ K + K K+ +K+
Sbjct: 379 LPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGKQKR 417
>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
[Translation, ribosomal structure and biogenesis].
Length = 569
Score = 32.4 bits (73), Expect = 1.3
Identities = 17/82 (20%), Positives = 32/82 (39%), Gaps = 7/82 (8%)
Query: 885 EAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK 944
E G + +++ TR K + ++ K+K ++ K K+ + +
Sbjct: 483 EIRLKNGVAMSKEELTREDKNR-----LRRALKRKRSKA--NLPNVNKRSKRNDVVDTLS 535
Query: 945 KKGAEKIGKKKKKKKKKKKKKK 966
K I +K +KK K KK
Sbjct: 536 KANITVINQKGEKKDVSGKTKK 557
>gnl|CDD|239500 cd03406, Band_7_3, A subgroup of the band 7 domain of flotillin
(reggie) like proteins. This subgroup contains proteins
similar to stomatin, prohibitin, flotillin, HlfK/C and
podicin. Many of these band 7 domain-containing
proteins are lipid raft-associated. Individual proteins
of this band 7 domain family may cluster to form
membrane microdomains which may in turn recruit
multiprotein complexes. Microdomains formed from
flotillin proteins may in addition be dynamic units with
their own regulatory functions. Flotillins have been
implicated in signal transduction, vesicle trafficking,
cytoskeleton rearrangement and are known to interact
with a variety of proteins. Stomatin interacts with and
regulates members of the degenerin/epithelia Na+ channel
family in mechanosensory cells of Caenorhabditis elegans
and vertebrate neurons and participates in trafficking
of Glut1 glucose transporters. Prohibitin may act as a
chaperone for the stabilization of mitochondrial
proteins. Prokaryotic HflK/C plays a role in the
decision between lysogenic and lytic cycle growth during
lambda phage infection. Flotillins have been implicated
in the progression of prion disease, in the pathogenesis
of neurodegenerative diseases such as Parkinson's and
Alzheimer's disease and, in cancer invasion and
metastasis. Mutations in the podicin gene give rise to
autosomal recessive steroid resistant nephritic
syndrome.
Length = 280
Score = 32.1 bits (73), Expect = 1.3
Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 4/74 (5%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG---AEKIGKKKKK 957
R K K E IR+ + EAEK + +K K +K+AE +KK AEK+ + K
Sbjct: 151 RVTKPKIPEAIRRNYELM-EAEKTKLLIAIQKQKVVEKEAETERKKAVIEAEKVAQVAKI 209
Query: 958 KKKKKKKKKKKKKK 971
+K +K+ +K+
Sbjct: 210 LFGQKVMEKETEKR 223
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 31.9 bits (73), Expect = 1.4
Identities = 18/96 (18%), Positives = 50/96 (52%), Gaps = 16/96 (16%)
Query: 908 AEKIRKKKKKKKEAEKM-------RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK--- 957
EK + ++ K EA + ++K+ + M+ +++ ++ K+ EK+ +++K
Sbjct: 202 KEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLA 261
Query: 958 ------KKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+ K +++++ K+ K EAE ++K+ ++ +
Sbjct: 262 EQERMLEHKLQEQEELLKEGFKTEAESLQKEIQDLK 297
Score = 29.9 bits (68), Expect = 6.2
Identities = 17/80 (21%), Positives = 42/80 (52%), Gaps = 9/80 (11%)
Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK----KKKKKKKKKKKK 969
K+K EAE+ + AE + +++ + +K+ + + +++ K+ +K + ++
Sbjct: 201 AKEKAIEAERAK----AEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAER 256
Query: 970 KKKKKEAEKM-KKKTEEAEK 988
+K E E+M + K +E E+
Sbjct: 257 EKLLAEQERMLEHKLQEQEE 276
>gnl|CDD|153337 cd07653, F-BAR_CIP4-like, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Cdc42-Interacting Protein
4 and similar proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. This subfamily is composed of
Cdc42-Interacting Protein 4 (CIP4), Formin Binding
Protein 17 (FBP17), FormiN Binding Protein 1-Like
(FNBP1L), and similar proteins. CIP4 and FNBP1L are
Cdc42 effectors that bind Wiskott-Aldrich syndrome
protein (WASP) and function in endocytosis. CIP4 and
FBP17 bind to the Fas ligand and may be implicated in
the inflammatory response. CIP4 may also play a role in
phagocytosis. Members of this subfamily typically
contain an N-terminal F-BAR domain and a C-terminal SH3
domain. In addition, some members such as FNBP1L contain
a central Cdc42-binding HR1 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 251
Score = 31.8 bits (73), Expect = 1.4
Identities = 21/71 (29%), Positives = 40/71 (56%), Gaps = 4/71 (5%)
Query: 921 AEKMRK---KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
AE + K+ + + +++ +K +G+ K+ +K + K+ +K KK +K KEAE
Sbjct: 85 AENLNSNVCKELKTLISELRQERKKHLSEGS-KLQQKLESSIKQLEKSKKAYEKAFKEAE 143
Query: 978 KMKKKTEEAEK 988
K K+K E+A+
Sbjct: 144 KAKQKYEKADA 154
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 32.5 bits (74), Expect = 1.4
Identities = 22/90 (24%), Positives = 42/90 (46%), Gaps = 7/90 (7%)
Query: 905 KKGAEKIRK-KKKKKKEAEKMRKKKGAE---KMKKKKKDAEKMKKKGAE---KIGKKKKK 957
K+ ++++ KK+KK+E +K+ +++ A M K K K + E K +
Sbjct: 59 KREKARLKELKKQKKQEIQKILEQQNAAIDADMNNKGKGRLKYLLQQTEIFAHFAKGDQS 118
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
KK K + + K E E+ ++ +E E
Sbjct: 119 ASAKKAKGRGRHASKLTEEEEDEEYLKEEE 148
Score = 30.9 bits (70), Expect = 3.8
Identities = 22/98 (22%), Positives = 38/98 (38%), Gaps = 21/98 (21%)
Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK- 964
+ AE K + K+E ++++ K K+KK++ +K+ ++ I K K + K
Sbjct: 47 EEAESPAKAEISKREKARLKELK-----KQKKQEIQKILEQQNAAIDADMNNKGKGRLKY 101
Query: 965 ---------------KKKKKKKKKKEAEKMKKKTEEAE 987
+ KK K K TEE E
Sbjct: 102 LLQQTEIFAHFAKGDQSASAKKAKGRGRHASKLTEEEE 139
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 32.3 bits (74), Expect = 1.4
Identities = 20/106 (18%), Positives = 45/106 (42%), Gaps = 1/106 (0%)
Query: 881 DVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA 940
+ + L + TR + K EK+ K K++ E ++ + E + ++ A
Sbjct: 364 EELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELA 423
Query: 941 E-KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
+ G E + +++K+ K + KK++ K ++ K E+
Sbjct: 424 DLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQ 469
Score = 32.0 bits (73), Expect = 2.2
Identities = 20/91 (21%), Positives = 49/91 (53%), Gaps = 2/91 (2%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAE-KMRKKKGAEKMKKKK-KDAEKMKKKGAEKIGKKKKK 957
++ K G E+ + K+K E E ++ + + K+++ +DAE+ K +I K +
Sbjct: 278 NKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAE 337
Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ +++ ++++K++ K E+ + EE E
Sbjct: 338 IEELEREIEEERKRRDKLTEEYAELKEELED 368
Score = 30.8 bits (70), Expect = 4.3
Identities = 20/96 (20%), Positives = 53/96 (55%), Gaps = 9/96 (9%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKM--RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
RKK+K E++ + ++ + + + K++ E+++++++ AE+ + EK + +
Sbjct: 170 RKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELL 229
Query: 960 KKKKKKKKKKKK-------KKKEAEKMKKKTEEAEK 988
K+K+ +++K+ ++E EK+ ++ E EK
Sbjct: 230 KEKEALERQKEAIERQLASLEEELEKLTEEISELEK 265
Score = 30.4 bits (69), Expect = 6.0
Identities = 19/97 (19%), Positives = 52/97 (53%), Gaps = 4/97 (4%)
Query: 893 ILFRKKRTRRKK----KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
++ +KR + ++ ++ AE+ + K+K+E E K E ++++K+ E+
Sbjct: 191 LIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLE 250
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
E++ K ++ + +K+ ++ ++ ++ +K+K EE
Sbjct: 251 EELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEE 287
Score = 29.7 bits (67), Expect = 9.4
Identities = 15/88 (17%), Positives = 42/88 (47%), Gaps = 3/88 (3%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
++ + EK +K+ ++ E+ K K +K+ + KK E++ ++ ++ +
Sbjct: 819 QKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKK---EELEEELEELEA 875
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ + + KKE ++++ + E E+
Sbjct: 876 ALRDLESRLGDLKKERDELEAQLRELER 903
>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
[DNA replication, recombination, and repair].
Length = 663
Score = 32.6 bits (75), Expect = 1.4
Identities = 21/85 (24%), Positives = 43/85 (50%), Gaps = 4/85 (4%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG----KKKKKKKKKKK 963
A+KI +K + + R++ ++ + +KKK + + + + K K +KK
Sbjct: 554 ADKITDSMQKAIDETERRREIQMAYNEEHGITPQTIKKKIRDILDGEYEEDEYKAKIEKK 613
Query: 964 KKKKKKKKKKKEAEKMKKKTEEAEK 988
K KK+ +K +K++K+ +EA K
Sbjct: 614 ASKMSKKELEKLIKKLEKEMKEAAK 638
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 32.3 bits (74), Expect = 1.4
Identities = 19/83 (22%), Positives = 45/83 (54%), Gaps = 1/83 (1%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKK-KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
+R ++ K G +++ + + E E+ KKK EK+++K+K E++++ E+ +
Sbjct: 287 YRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKE 346
Query: 954 KKKKKKKKKKKKKKKKKKKKKEA 976
K ++ +KK + +K+ +K
Sbjct: 347 LNSKLEEIQKKLEDLEKRLEKLK 369
Score = 31.1 bits (71), Expect = 2.6
Identities = 20/83 (24%), Positives = 45/83 (54%), Gaps = 7/83 (8%)
Query: 909 EKIRKKK---KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
E R +K K +++ A + ++ ++KKK EKI +K+K+ ++ ++
Sbjct: 285 ENYRTEKLSGLKNSGEPSLKEIHEA----RLNEEERELKKKFTEKIREKEKRLEELEQNL 340
Query: 966 KKKKKKKKKEAEKMKKKTEEAEK 988
+++K+ + E+++KK E+ EK
Sbjct: 341 IEERKELNSKLEEIQKKLEDLEK 363
>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
Length = 145
Score = 30.8 bits (70), Expect = 1.4
Identities = 13/29 (44%), Positives = 15/29 (51%)
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
K KK KKK KK K KK + K+K
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKV 29
Score = 30.8 bits (70), Expect = 1.7
Identities = 14/30 (46%), Positives = 17/30 (56%)
Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
K A+K KK KK K KK K KK+K +
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVR 30
Score = 30.4 bits (69), Expect = 1.9
Identities = 18/57 (31%), Positives = 28/57 (49%)
Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
K A+K KKK K A K KKG + +K + + + K + KK K ++K +
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRTSVRFFRPKTLRLPKKPKYPRRVKPAS 57
>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD. YqfD is part of
the sigma-E regulon in the sporulation program of
endospore-forming Gram-positive bacteria. Mutation
results in a sporulation defect in Bacillus subtilis.
Members are found in all currently known
endospore-forming bacteria, including the genera
Bacillus, Symbiobacterium, Carboxydothermus,
Clostridium, and Thermoanaerobacter [Cellular processes,
Sporulation and germination].
Length = 382
Score = 32.3 bits (74), Expect = 1.4
Identities = 23/94 (24%), Positives = 41/94 (43%), Gaps = 11/94 (11%)
Query: 896 RKKRTRRKKKKGAEKIRKK----KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
R+ RT +K + I K KK KE K ++ K K + +I
Sbjct: 259 REVRTGKKIVRYYLNIGGKKIYLKKPPKEFAKYDVEE-------KTKKFGLWRWSLPIEI 311
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K+ + K+K +K K++ +K EK +K+ ++
Sbjct: 312 VKETYYEVKEKVEKVTKEEAIEKAKEKAEKELKK 345
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 31.8 bits (72), Expect = 1.5
Identities = 23/76 (30%), Positives = 45/76 (59%), Gaps = 9/76 (11%)
Query: 922 EKMRKKKGAEKMKKKKKDAEK-------MKK--KGAEKIGKKKKKKKKKKKKKKKKKKKK 972
E++RK + E++K+K ++A+K +K+ K EK + KK++ ++ K + KKKK
Sbjct: 12 EELRKAQSLEELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKKK 71
Query: 973 KKEAEKMKKKTEEAEK 988
K + EK+ + + +K
Sbjct: 72 KFKKEKVDVRVKVVKK 87
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 31.6 bits (72), Expect = 1.5
Identities = 13/43 (30%), Positives = 20/43 (46%)
Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
E+ RK++ A KKK K + K+ + + K K KK
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198
Score = 30.0 bits (68), Expect = 4.2
Identities = 16/46 (34%), Positives = 22/46 (47%), Gaps = 1/46 (2%)
Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK-KKKKKKEAEKMKK 981
K EK K++ A KK K +K K+ +KK K K +KK
Sbjct: 153 AKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198
Score = 29.2 bits (66), Expect = 8.6
Identities = 14/41 (34%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEA-EKMKKKTEEAEK 988
K+ +K+K++ KKK K +K KEA +K K +A
Sbjct: 153 AKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIV 193
>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
Members of this family of small ribosomal proteins
possess one of three conserved blocks of sequence found
in proteins that stimulate the dissociation of guanine
nucleotides from G-proteins, leaving open the
possibility that MRP-S27 might be a functional partner
of GTP-binding ribosomal proteins.
Length = 417
Score = 32.0 bits (73), Expect = 1.5
Identities = 12/79 (15%), Positives = 35/79 (44%), Gaps = 1/79 (1%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
E ++ + K + K + + ++++ EK + E+ + + +K ++KK
Sbjct: 340 GESFQEWRSKLQALAK-VESRSLLSVEERLPTIEKEDLELYEQRQQLWFFENRKLWQRKK 398
Query: 968 KKKKKKKEAEKMKKKTEEA 986
K +++ E + + T +
Sbjct: 399 KLREQADEEYQQRHATRAS 417
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 31.0 bits (70), Expect = 1.5
Identities = 24/99 (24%), Positives = 51/99 (51%), Gaps = 18/99 (18%)
Query: 903 KKKKGAEKIRKKK-----KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
K+ +G E +++ + +EAE+ RK K +K +++ E M++ +K G KKK+
Sbjct: 23 KEDEGDESDAEEEDEEIQEALREAEEERK----AKHRKMEEEREVMRQGIRDKYGIKKKE 78
Query: 958 KKKKKKKKK---------KKKKKKKKEAEKMKKKTEEAE 987
+ +++ + +KKK ++ A + ++ EE E
Sbjct: 79 EDEEEPQAAAEEEEGRLGRKKKTPEELAAEAGEEDEEEE 117
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 32.1 bits (73), Expect = 1.5
Identities = 16/43 (37%), Positives = 25/43 (58%)
Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
K+K E+M + A +I + K + K +K K+ KK+KK EK
Sbjct: 3 KRKIKEEMLQILAPEIYGPRPVKDEAKPRKIKRVKKRKKREEK 45
Score = 30.2 bits (68), Expect = 5.0
Identities = 15/45 (33%), Positives = 25/45 (55%)
Query: 933 MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
M K+K E ++ E G + K + K +K K+ KK+KK+E +
Sbjct: 1 MSKRKIKEEMLQILAPEIYGPRPVKDEAKPRKIKRVKKRKKREEK 45
>gnl|CDD|238706 cd01418, Ribosomal_L19e_A, Ribosomal protein L19e, archaeal. L19e
is found in the large ribosomal subunit of eukaryotes
and archaea. L19e is distinct from the ribosomal subunit
L19, which is found in prokaryotes. It consists of two
small globular domains connected by an extended segment.
It is located toward the surface of the large subunit,
with one exposed end involved in forming the
intersubunit bridge with the small subunit. The other
exposed end is involved in forming the translocon
binding site, along with L22, L23, L24, L29, and L31e
subunits.
Length = 145
Score = 30.8 bits (70), Expect = 1.6
Identities = 28/111 (25%), Positives = 52/111 (46%), Gaps = 13/111 (11%)
Query: 874 AVSDVNDDVRREAVTAL---GFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA 930
+ +V + + R+ + AL G I + K KKG + R K++ +K + R+ G+
Sbjct: 25 RLEEVAEAITRDDIRALIKEGVI-------KAKPKKGISRGRLKERHEKRKKGRRRGPGS 77
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
+K KK A KK+ K + ++ K+ + K K KK ++ + K
Sbjct: 78 ---RKGKKGARTPKKERWIKTIRALRRYLKELRDKGKIDKKTYRKLYRKAK 125
>gnl|CDD|219589 pfam07808, RED_N, RED-like protein N-terminal region. This family
contains sequences that are similar to the N-terminal
region of Red protein. This and related proteins contain
a RED repeat which consists of a number of RE and RD
sequence elements. The region in question has several
conserved NLS sequences and a putative trimeric
coiled-coil region, suggesting that these proteins are
expressed in the nucleus. The function of Red protein is
unknown, but efficient sequestration to nuclear bodies
suggests that its expression may be tightly regulated of
that the protein self-aggregates extremely efficiently.
Length = 238
Score = 31.7 bits (72), Expect = 1.6
Identities = 10/33 (30%), Positives = 18/33 (54%)
Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
KKKKK +K+++ +K+ K + + E K
Sbjct: 1 KKKKKYAYLRKQEENAEKEINPKYRDRARERRK 33
>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777). This is
a family of eukaryotic proteins of unknown function.
Some of the proteins in this family are putative nucleic
acid binding proteins.
Length = 158
Score = 31.0 bits (70), Expect = 1.6
Identities = 12/83 (14%), Positives = 45/83 (54%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
++ RR++ + E+ R+++ + + + R+ + + + + + + + + E+ ++
Sbjct: 23 RRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRDRKRERDKDARE 82
Query: 957 KKKKKKKKKKKKKKKKKKEAEKM 979
KK++++K K++ + K E++
Sbjct: 83 PKKRERQKLIKEEDLEGKSDEEV 105
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 32.2 bits (74), Expect = 1.6
Identities = 11/40 (27%), Positives = 18/40 (45%)
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
EK+ K +K K KKKE E+ ++ +E +
Sbjct: 140 VEKLEKFGEKVGCGVKILYYHSSLKKKEKEEFLERLKEGD 179
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 31.3 bits (71), Expect = 1.6
Identities = 22/94 (23%), Positives = 39/94 (41%), Gaps = 5/94 (5%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG----AEKMKKKKKDAEKMKKKG-AEKIGK 953
R ++K + K+ +K+ K+K+ +K G A+ KKK A+ K+ G A
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGNPIIAKNWDKKKTLAQNYKRLGLAADPNT 61
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+ + K + KK + E+ K E
Sbjct: 62 AARTDLLIQDKPIPELKKGEARIERDDKGNPLDE 95
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 31.6 bits (72), Expect = 1.6
Identities = 10/36 (27%), Positives = 19/36 (52%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K K K K K K KK +++ K+E + ++ + +
Sbjct: 98 KPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFE 133
Score = 29.7 bits (67), Expect = 6.6
Identities = 11/48 (22%), Positives = 19/48 (39%)
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
E K+ K + K K K K KK +++ K++ K +
Sbjct: 83 EPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPAS 130
Score = 29.7 bits (67), Expect = 7.6
Identities = 12/34 (35%), Positives = 17/34 (50%)
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K + K K K K K K KK E+ K++ + E
Sbjct: 93 PKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEP 126
Score = 29.7 bits (67), Expect = 8.0
Identities = 12/35 (34%), Positives = 18/35 (51%)
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
I K + K K K K K K KK +++ ++ K E
Sbjct: 92 IPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEP 126
>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
Length = 290
Score = 31.6 bits (72), Expect = 1.6
Identities = 11/43 (25%), Positives = 17/43 (39%)
Query: 923 KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
K + K +K+ EK +KG K+ +KKK
Sbjct: 246 KAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288
Score = 31.6 bits (72), Expect = 1.9
Identities = 10/42 (23%), Positives = 14/42 (33%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
K K K EK +K + K+ +KKK
Sbjct: 247 AKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288
Score = 31.2 bits (71), Expect = 2.1
Identities = 10/45 (22%), Positives = 16/45 (35%)
Query: 928 KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
K K K K ++ K + K + K+ +KKK
Sbjct: 244 KPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288
Score = 30.9 bits (70), Expect = 3.4
Identities = 10/44 (22%), Positives = 16/44 (36%)
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
K K K K K+ K+ K +K + K+ +K
Sbjct: 243 AKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRK 286
>gnl|CDD|234773 PRK00461, rpmC, 50S ribosomal protein L29; Reviewed.
Length = 87
Score = 29.7 bits (67), Expect = 1.7
Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 21/86 (24%)
Query: 914 KKKKKKEAEKMRKKKGAE-----------------KMKKKKKDAEKMKKKGAEKIGKKKK 956
+KK +E EK+ + AE K+K+ +KD ++ + +++
Sbjct: 6 RKKSVEELEKLVIELKAELFTLRFKNATGSLDQTHKIKEIRKDIARILTI----LNEREL 61
Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKK 982
++K+ K+ KK KK K A K K+
Sbjct: 62 EEKENNKEPKKNTKKAVKLASKAGKQ 87
>gnl|CDD|191748 pfam07464, ApoLp-III, Apolipophorin-III precursor (apoLp-III).
This family consists of several insect
apolipoprotein-III sequences. Exchangeable
apolipoproteins constitute a functionally important
family of proteins that play critical roles in lipid
transport and lipoprotein metabolism. Apolipophorin III
(apoLp-III) is a prototypical exchangeable
apolipoprotein found in many insect species that
functions in transport of diacylglycerol (DAG) from the
fat body lipid storage depot to flight muscles in the
adult life stage.
Length = 155
Score = 31.0 bits (70), Expect = 1.7
Identities = 19/83 (22%), Positives = 43/83 (51%), Gaps = 10/83 (12%)
Query: 914 KKKKKKEAEKMRKK-----KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
++ ++ AE++RK + A++++ K + A + + ++K+ K+ + +K
Sbjct: 69 RQNIEETAEELRKAHPEVEQQAQELRDKLQAAVQSTVQESQKLAKEVASNVEGTNEKLAP 128
Query: 969 KKKKK-----KEAEKMKKKTEEA 986
K K+ K AE+++KK EA
Sbjct: 129 KIKQAYDDFVKNAEEVQKKLHEA 151
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 31.9 bits (73), Expect = 1.8
Identities = 19/49 (38%), Positives = 28/49 (57%)
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E++K+ AEK + +KK +KK KK KK K +EA + EAE+
Sbjct: 75 EEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
Score = 31.1 bits (71), Expect = 2.9
Identities = 16/63 (25%), Positives = 35/63 (55%), Gaps = 2/63 (3%)
Query: 919 KEAEKMRKKKGAEKMK--KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
+E E+ +++ E++K + +K+ + +KKG EK KK KK K ++ ++ + + E
Sbjct: 64 EEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123
Query: 977 EKM 979
+
Sbjct: 124 PEP 126
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 30.7 bits (70), Expect = 1.8
Identities = 11/74 (14%), Positives = 41/74 (55%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
+K+ + K A+K +K+ + + +K ++++K+ + + ++ +K K+++
Sbjct: 24 QKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQE 83
Query: 969 KKKKKKEAEKMKKK 982
++K++E ++ ++
Sbjct: 84 LQQKQQELQQKQQA 97
>gnl|CDD|236078 PRK07726, PRK07726, DNA topoisomerase III; Provisional.
Length = 658
Score = 32.1 bits (74), Expect = 1.8
Identities = 13/40 (32%), Positives = 16/40 (40%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
M K K KM + GK+K KK + KKK
Sbjct: 619 PMLKVKGKNGKMLVCQDRECGKRKNVSKKTNARCPNCKKK 658
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 31.2 bits (71), Expect = 1.9
Identities = 15/52 (28%), Positives = 24/52 (46%), Gaps = 7/52 (13%)
Query: 943 MKKKGAEKIGKKKK-------KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
M K+ I K K +K++KK K++K+A K +K+ E E
Sbjct: 71 MTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEE 122
Score = 30.0 bits (68), Expect = 5.4
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
+K++KK K++KK K +K+K E+
Sbjct: 97 QKEEKKAMSKEEKKAIKAEKEKLEEEYG 124
Score = 29.6 bits (67), Expect = 7.3
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
K +K++KK K++KK K +KE + +
Sbjct: 95 KAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 31.6 bits (72), Expect = 1.9
Identities = 10/33 (30%), Positives = 23/33 (69%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
+ ++ KKK++ + KKKK+ +E + ++KK ++
Sbjct: 84 SLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQ 116
Score = 30.9 bits (70), Expect = 2.9
Identities = 15/44 (34%), Positives = 24/44 (54%)
Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
KD +K EK + ++ KKK++ + KKKKE E++K
Sbjct: 66 KDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKA 109
>gnl|CDD|219911 pfam08572, PRP3, pre-mRNA processing factor 3 (PRP3). Pre-mRNA
processing factor 3 (PRP3) is a U4/U6-associated
splicing factor. The human PRP3 has been implicated in
autosomal retinitis pigmentosa.
Length = 222
Score = 31.2 bits (71), Expect = 1.9
Identities = 15/68 (22%), Positives = 36/68 (52%), Gaps = 3/68 (4%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK-KDAEKMKKKGAEKIGKK 954
KR +R+++ + +K K ++AE++R++ E+ + + ++A+ K A + +K
Sbjct: 11 EAKRKKRRRRGL--QFIEKGKFSQQAEELRREAKLEEEEARISEEAQNAGLKSATDLAEK 68
Query: 955 KKKKKKKK 962
KK+
Sbjct: 69 IPKKELPD 76
>gnl|CDD|234054 TIGR02904, spore_ysxE, spore coat protein YsxE. Members of this
family are homologs of the Bacillus subtilis spore coat
protein CotS. Members of this family, designated YsxE,
are found only in the family Bacillaceae, from among the
endospore-forming members of the Firmicutes branch of
the Bacteria. As a rule, the ysxE gene is found
immediately downstream of spoVID, a gene necessary for
spore coat assembly. The protein has been shown to be
part of the spore coat [Cellular processes, Sporulation
and germination].
Length = 309
Score = 31.7 bits (72), Expect = 2.0
Identities = 25/105 (23%), Positives = 43/105 (40%), Gaps = 18/105 (17%)
Query: 529 IELKEFSLKKLNDIVDDFWPENEKEKQLQARITKLSKILSGEVSIDLHLQFLIRSNKTDM 588
++ EF+ KL D W E KEK+ R ++ G++S+ H + ++T
Sbjct: 171 MQAAEFAQNKLED-----WHEATKEKE-SWRTV----LVHGKLSLS-HFLY----DETRG 215
Query: 589 LILKNTKESIRVSICHTATIIANAFMHSGTTSDQFLRDNLEWLAR 633
N +++ S T +A F S T Q + + EW
Sbjct: 216 GYFINFEKA---SFASPLTDLAAFFRRSFGTYPQPIDEMFEWFYT 257
>gnl|CDD|145004 pfam01632, Ribosomal_L35p, Ribosomal protein L35.
Length = 61
Score = 28.8 bits (65), Expect = 2.0
Identities = 15/50 (30%), Positives = 28/50 (56%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
KMK K A++ K+ + KI ++K K+ KK K+K++ ++ + K
Sbjct: 2 KMKTVKAAAKRFKRTASGKIKRRKAGKRHLLTKKSTKRKRRLRKKVLVSK 51
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 30.4 bits (69), Expect = 2.0
Identities = 13/80 (16%), Positives = 29/80 (36%), Gaps = 2/80 (2%)
Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
I +E ++ K+ + +KK + +K E++ + + + K
Sbjct: 35 DIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKAT 94
Query: 970 KKK--KKEAEKMKKKTEEAE 987
KK+ E E+ +E
Sbjct: 95 KKQLFSSEYEQTSSSSESTS 114
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the parasitophorous
vacuole membrane. Members have an initial hydrophobic,
Phe/Tyr-rich, stretch long enough to span the membrane,
a highly charged region rich in Lys, a second putative
transmembrane region and a second highly charged, low
complexity sequence region. Some members have up to 100
residues of additional C-terminal sequence. These genes
have been shown to be found in the sub-telomeric regions
of both Plasmodium falciparum and P. yoelii chromosomes.
Length = 82
Score = 29.1 bits (66), Expect = 2.0
Identities = 13/29 (44%), Positives = 14/29 (48%)
Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
KK KK KK +KKKK KKK
Sbjct: 25 NNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 28.4 bits (64), Expect = 4.3
Identities = 13/31 (41%), Positives = 15/31 (48%)
Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
+ K K KK KK +KKKK KKK
Sbjct: 23 LNNNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 28.0 bits (63), Expect = 6.0
Identities = 12/25 (48%), Positives = 14/25 (56%)
Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKK 960
KK K KK + + KKKK KKK
Sbjct: 29 AKKKKLKKLKKIDDDLEKKKKNKKK 53
>gnl|CDD|238028 cd00073, H15, linker histone 1 and histone 5 domains; the basic
subunit of chromatin is the nucleosome, consisting of an
octamer of core histones, two full turns of DNA, a
linker histone (H1 or H5) and a variable length of
linker DNA; H1/H5 are chromatin-associated proteins that
bind to the exterior of nucleosomes and dramatically
stabilize the highly condensed states of chromatin
fibers; stabilization of higher order folding occurs
through electrostatic neutralization of the linker DNA
segments, through a highly positively charged carboxy-
terminal domain known as the AKP helix (Ala, Lys, Pro);
thought to be involved in specific protein-protein and
protein-DNA interactions and play a role in suppressing
core histone tail domain acetylation in the chromatin
fiber.
Length = 88
Score = 29.5 bits (67), Expect = 2.0
Identities = 32/93 (34%), Positives = 43/93 (46%), Gaps = 30/93 (32%)
Query: 885 EAVTALGFILFRKKRTRRKKKKGA--EKIRK--KKKKKKEAEKMRK------KKGAEKMK 934
EA+ AL K++KG+ + I+K + K K + E K KKG K K
Sbjct: 13 EAIKAL------------KERKGSSLQAIKKYIEAKYKVDDENFNKLLKLALKKGVAKGK 60
Query: 935 KKKKDAEKMKKKGAE---KIGKKKKKKKKKKKK 964
+ +K GA K+ KKKKKKKK KK
Sbjct: 61 LVQ-----VKGTGASGSFKLSKKKKKKKKPAKK 88
>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
[Translation, ribosomal structure and biogenesis].
Length = 98
Score = 29.6 bits (67), Expect = 2.0
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKK 974
I K +KK+K +KK++ KK K
Sbjct: 75 IPPDKLIRKKRKLPRKKRRPKKPK 98
Score = 29.6 bits (67), Expect = 2.3
Identities = 11/42 (26%), Positives = 20/42 (47%), Gaps = 1/42 (2%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
K +D + G + + K +KK+K +KK++ KK
Sbjct: 57 KPDDRDVKLESLVG-DTLYIPPDKLIRKKRKLPRKKRRPKKP 97
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 31.6 bits (72), Expect = 2.0
Identities = 15/61 (24%), Positives = 45/61 (73%), Gaps = 4/61 (6%)
Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE-AEK 978
E ++R+++ + +++++AE + + E+ ++++++K+++KK+ K++K+++KE AEK
Sbjct: 153 ELAELRQQQ---RQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAEK 209
Query: 979 M 979
+
Sbjct: 210 I 210
Score = 29.6 bits (67), Expect = 7.1
Identities = 14/74 (18%), Positives = 44/74 (59%), Gaps = 7/74 (9%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
E++ + ++++++ E+ R AE + ++ + + +++ EK +++KK+ K++K+++K
Sbjct: 152 EELAELRQQQRQFEQRRN---AELAETQRLEEAE-RRRREEK---ERRKKQDKERKQREK 204
Query: 969 KKKKKKEAEKMKKK 982
+ +K A +
Sbjct: 205 ETAEKIAARAFAQG 218
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 31.7 bits (72), Expect = 2.1
Identities = 11/44 (25%), Positives = 26/44 (59%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
++ K +K+ +K K ++ + + K++K++K+KK KK+
Sbjct: 120 LDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163
>gnl|CDD|235132 PRK03577, PRK03577, acid shock protein precursor; Provisional.
Length = 102
Score = 29.6 bits (66), Expect = 2.1
Identities = 19/57 (33%), Positives = 29/57 (50%)
Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
K KK+ K A + K + A+K K KK++K ++K + KK KK + K K
Sbjct: 40 AKTTHHHKKQHKKAPEQKAQAAKKHHKNKKEQKAPEQKAQAAKKHAKKHSHKTAAKP 96
Score = 28.1 bits (62), Expect = 8.7
Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 1/50 (2%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
KK+ KK +K + K K KK++K A + K + A+K KK K K
Sbjct: 47 KKQHKKAPEQKAQAAKKHHKNKKEQK-APEQKAQAAKKHAKKHSHKTAAK 95
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 31.8 bits (72), Expect = 2.1
Identities = 19/94 (20%), Positives = 42/94 (44%), Gaps = 6/94 (6%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD--AEKMKKKGAEK 950
+ +R +R + K +K+ + +M K EK+ KK++ + + E+
Sbjct: 1 MAYRNRREANINNNDRMQ-EKDDEKQDQKNRMELK---EKVLDKKEEVVTDNVDSPVKEQ 56
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
++ K + KK K++ K+ E K K++ +
Sbjct: 57 SSQENLKIADEVKKSTKEESKQLLEVLKTKEEHQ 90
>gnl|CDD|215822 pfam00257, Dehydrin, Dehydrin.
Length = 137
Score = 30.6 bits (69), Expect = 2.1
Identities = 14/54 (25%), Positives = 22/54 (40%), Gaps = 6/54 (11%)
Query: 925 RKKKG-AEKMKKK-----KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
+KKG EK+K+K + G + ++KK K K K+K
Sbjct: 77 ERKKGIKEKIKEKLPGGHGGKGQATTTTTTGTGGGYGQHGHEEKKGKMDKIKEK 130
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 32.0 bits (72), Expect = 2.1
Identities = 12/34 (35%), Positives = 14/34 (41%)
Query: 147 PVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPE 180
P AP A P AP A + P + TP P
Sbjct: 746 PPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP 779
>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion
protein; Provisional.
Length = 230
Score = 31.1 bits (70), Expect = 2.1
Identities = 12/75 (16%), Positives = 33/75 (44%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
KK ++ K K+ + + ++K + + +K + ++ ++K + + +
Sbjct: 114 KKFLSQFQSTTITKPKKVLINEVLQEKTINQNEEKTSLQNEKVLSPELSEEKSDSELETQ 173
Query: 964 KKKKKKKKKKKEAEK 978
+K + K+KK E
Sbjct: 174 PQKTQLKEKKPSIEH 188
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 30.6 bits (69), Expect = 2.2
Identities = 22/109 (20%), Positives = 49/109 (44%), Gaps = 12/109 (11%)
Query: 875 VSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK 934
+ N+DV+ E +++ ++++ E+I + + + E E + ++ E+ +
Sbjct: 36 IIKENEDVKDE----------KQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDE 85
Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKK--KKKKKKKKKKKKEAEKMKK 981
+ D + ++KK I + + K KK +K KK AE + K
Sbjct: 86 EDNVDLKDIEKKNINDIFNSTQDDNAQNLISKNYKKNEKSKKTAEDIVK 134
>gnl|CDD|185511 PTZ00199, PTZ00199, high mobility group protein; Provisional.
Length = 94
Score = 29.4 bits (66), Expect = 2.2
Identities = 25/89 (28%), Positives = 45/89 (50%), Gaps = 5/89 (5%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK-----KKGAEKIGKKKKKK 958
KK+G +RK K+KKK+ ++ A K+K AE + K +GK +
Sbjct: 3 KKQGKVLVRKNKRKKKDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEA 62
Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K +++K +KK + +K++ + E+AE
Sbjct: 63 WNKLSEEEKAPYEKKAQEDKVRYEKEKAE 91
>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor. The ribosome recycling
factor (RRF / ribosome release factor) dissociates the
ribosome from the mRNA after termination of translation,
and is essential bacterial growth. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis.
Length = 165
Score = 30.5 bits (70), Expect = 2.2
Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 10/79 (12%)
Query: 909 EKIRKKKKK--KKEAEKMR------KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
E+ RK+ K KK AE+ + ++ +K+KK +KD K + K +K+ +K
Sbjct: 89 EERRKELVKQAKKLAEEAKVAIRNIRRDANDKLKKLEKD--KEISEDEVKRAEKEIQKLT 146
Query: 961 KKKKKKKKKKKKKKEAEKM 979
K KK + KKKE E M
Sbjct: 147 DKYIKKIDELLKKKEKEIM 165
Score = 30.5 bits (70), Expect = 2.5
Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 16/74 (21%)
Query: 901 RRKK-----KKGAEK-------IRKKKKKKKEAEKMRKKKGAEK--MKKKKKDAEKMKKK 946
RRK+ KK AE+ IR+ K +K+ K K + +K+ +K+ +K+ K
Sbjct: 91 RRKELVKQAKKLAEEAKVAIRNIRRDANDK--LKKLEKDKEISEDEVKRAEKEIQKLTDK 148
Query: 947 GAEKIGKKKKKKKK 960
+KI + KKK+K
Sbjct: 149 YIKKIDELLKKKEK 162
>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0). Part of the CF(0)
(base unit) of the ATP synthase. The base unit is
thought to translocate protons through membrane (inner
membrane in mitochondria, thylakoid membrane in plants,
cytoplasmic membrane in bacteria). The B subunits are
thought to interact with the stalk of the CF(1)
subunits. This domain should not be confused with the ab
CF(1) proteins (in the head of the ATP synthase) which
are found in pfam00006.
Length = 132
Score = 30.3 bits (69), Expect = 2.3
Identities = 21/94 (22%), Positives = 43/94 (45%), Gaps = 10/94 (10%)
Query: 890 LGFILFRKKRTRRKKKKGAEKIRKK-KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
LG IL +K K AE+ K+ EAE+ ++ + + +A ++ A
Sbjct: 24 LGKILDERKEKIANNIKEAEERLKQAAALLAEAEQ--------QLAQARAEASEIINN-A 74
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+K +K K++ + +K ++ + AE ++K
Sbjct: 75 KKEAQKLKEEILAEAQKDAERLLESARAEIEQEK 108
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 31.0 bits (71), Expect = 2.4
Identities = 16/47 (34%), Positives = 28/47 (59%)
Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
++K+ + + + ++ K+KKKKKKKKKKK++ E M +K
Sbjct: 63 QEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWMCEK 109
Score = 30.7 bits (70), Expect = 4.0
Identities = 15/43 (34%), Positives = 26/43 (60%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
+K+K + E + ++ K+KKKKKKKKKKK++ ++
Sbjct: 64 EKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWM 106
Score = 29.5 bits (67), Expect = 7.6
Identities = 16/42 (38%), Positives = 25/42 (59%)
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
EK K + E ++ + K+KKKKKKKKKKK++ ++
Sbjct: 64 EKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREW 105
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 31.8 bits (72), Expect = 2.4
Identities = 12/44 (27%), Positives = 32/44 (72%)
Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++ EK+ ++++++K++ K K ++++K K+EA K ++ E A++
Sbjct: 6 RRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQR 49
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 31.8 bits (72), Expect = 2.4
Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 3/71 (4%)
Query: 915 KKKKKEAEKMRKKKGA--EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
KK + EK+ KK + +K +KK K+ ++ K + ++K + KK +++
Sbjct: 111 KKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEER 170
Query: 973 KKE-AEKMKKK 982
KE E K+K
Sbjct: 171 FKEYLENFKRK 181
Score = 31.1 bits (70), Expect = 3.3
Identities = 21/84 (25%), Positives = 40/84 (47%), Gaps = 2/84 (2%)
Query: 905 KKGAEKIRKKKKKKKE--AEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
KK + + K + KK+ + +K K ++ K + EK +K + KK +++
Sbjct: 111 KKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEER 170
Query: 963 KKKKKKKKKKKKEAEKMKKKTEEA 986
K+ + K+KK K+ K+ E A
Sbjct: 171 FKEYLENFKRKKFKRKILKEFENA 194
>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase. This domain family is found in
eukaryotes, and is approximately 140 amino acids in
length. The family is found in association with
pfam00069. Polo-like kinase 1 (Plx1) is essential during
mitosis for the activation of Cdc25C, for spindle
assembly, and for cyclin B degradation. This family is
Polo kinase kinase (PKK) which phosphorylates Polo
kinase and Polo-like kinase to activate them. PKK is a
serine/threonine kinase.
Length = 142
Score = 30.4 bits (69), Expect = 2.4
Identities = 22/98 (22%), Positives = 52/98 (53%), Gaps = 18/98 (18%)
Query: 909 EKIRKKKKKKKEAEKMRK--KKGAEKMKKK----KKDAEKMKKKGAE----------KIG 952
E+ + K+ +KE E++ + K+ EK++++ + K + + KI
Sbjct: 16 ERTQLLKRHEKELEQLERQQKRTIEKLEQRQTQELRRLPKRIRAEQKTRLKMFKESLKIE 75
Query: 953 KKKKKKKKKKKKKKKKKKKK--KKEAEKMKKKTEEAEK 988
KK+ K++ +K + ++++KK K E E+ ++K ++ E+
Sbjct: 76 KKELKQEVEKLPRFQEQEKKRMKAEKEEQEQKHQKQER 113
Score = 29.2 bits (66), Expect = 5.2
Identities = 18/83 (21%), Positives = 47/83 (56%), Gaps = 9/83 (10%)
Query: 914 KKKKKKEAEKMRKKKGAEKM---KKKKKDAEK-MKK-----KGAEKIGKKKKKKKKKKKK 964
++++K+ EK+ +++ E K+ + + + +K K +K K++ +K + ++
Sbjct: 32 ERQQKRTIEKLEQRQTQELRRLPKRIRAEQKTRLKMFKESLKIEKKELKQEVEKLPRFQE 91
Query: 965 KKKKKKKKKKEAEKMKKKTEEAE 987
++KK+ K +KE ++ K + +E E
Sbjct: 92 QEKKRMKAEKEEQEQKHQKQERE 114
Score = 28.8 bits (65), Expect = 6.3
Identities = 21/95 (22%), Positives = 49/95 (51%), Gaps = 10/95 (10%)
Query: 883 RREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK 942
R E T L +F++ + + +KK ++ +K + +E EK R K E+ ++K + E+
Sbjct: 58 RAEQKTRL--KMFKE--SLKIEKKELKQEVEKLPRFQEQEKKRMKAEKEEQEQKHQKQER 113
Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
E + K+++ ++ ++ + +K+ + E E
Sbjct: 114 ------EFLAKQEENLEEALQQLQNEKRHELAETE 142
Score = 28.8 bits (65), Expect = 8.2
Identities = 21/88 (23%), Positives = 45/88 (51%), Gaps = 12/88 (13%)
Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
++++ E R K+ + E + R K E +K +KK+ ++ EK+ + ++++K
Sbjct: 43 EQRQTQELRRLPKRIRAEQKT-RLKMFKESLKIEKKELKQEV----EKL---PRFQEQEK 94
Query: 963 KKKKKKKK----KKKKEAEKMKKKTEEA 986
K+ K +K+ K +K+ + K EE
Sbjct: 95 KRMKAEKEEQEQKHQKQEREFLAKQEEN 122
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 31.8 bits (73), Expect = 2.4
Identities = 13/69 (18%), Positives = 20/69 (28%), Gaps = 1/69 (1%)
Query: 134 APIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTT-PGPETKTLESLNEKEK 192
A A AP PA P AP P + ++ P +
Sbjct: 52 AAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAA 111
Query: 193 QLQARITKL 201
++ +T L
Sbjct: 112 AVEDEVTPL 120
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 30.9 bits (70), Expect = 2.5
Identities = 15/63 (23%), Positives = 25/63 (39%), Gaps = 6/63 (9%)
Query: 922 EKMRKKKGAEK--MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK----KKKKKKKKKKE 975
M K A K K K + + E I +K +KK K+ ++ +K +E
Sbjct: 118 SVMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEE 177
Query: 976 AEK 978
E+
Sbjct: 178 EEE 180
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 28.8 bits (65), Expect = 2.5
Identities = 15/66 (22%), Positives = 37/66 (56%), Gaps = 7/66 (10%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
++KK + + M K+K KKK+ +K + K++++++ +K++++K+++K
Sbjct: 1 KQKKDEPVTLKDMLNADLLAKLKAKKKE---LKAEEE----KREEEEEARKREERKEREK 53
Query: 972 KKKEAE 977
K E
Sbjct: 54 NKSFEE 59
>gnl|CDD|237582 PRK14011, PRK14011, prefoldin subunit alpha; Provisional.
Length = 144
Score = 30.1 bits (68), Expect = 2.5
Identities = 17/55 (30%), Positives = 32/55 (58%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
E KK ++ +K KK+G +KI + K+ K +K+ +K+ + ++ +MK KT
Sbjct: 90 IEDFKKSVEELDKTKKEGNKKIEELNKEITKLRKELEKRAQAIEQRQAQMKPKTN 144
>gnl|CDD|203471 pfam06518, DUF1104, Protein of unknown function (DUF1104). This
family consists of several hypothetical proteins of
unknown function which appear to be found largely in
Helicobacter pylori.
Length = 93
Score = 29.2 bits (66), Expect = 2.5
Identities = 14/62 (22%), Positives = 30/62 (48%)
Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
+ K+ G K K+ K K + + K + ++K +K+ +++ KK +A +K+
Sbjct: 30 IDKRLGKMKEKEAKNFKAGFKAAMRKNLSKMSVEDRRKFRKEVREEMNKKVDAMSVKEAR 89
Query: 984 EE 985
E
Sbjct: 90 EL 91
>gnl|CDD|191968 pfam08213, DUF1713, Mitochondrial domain of unknown function
(DUF1713). This domain is found at the C terminal end
of mitochondrial proteins of unknown function.
Length = 34
Score = 27.7 bits (62), Expect = 2.6
Identities = 10/29 (34%), Positives = 18/29 (62%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
+K+++K KK K KK +K+ + K+ K
Sbjct: 6 LRKRRRKMKKHKYKKLRKRTRALRRKLDK 34
>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
dihydrolipoamide acetyltransferase, long form. This
model describes a subset of pyruvate dehydrogenase
complex dihydrolipoamide acetyltransferase specifically
close by both phylogenetic and per cent identity (UPGMA)
trees. Members of this set include two or three copies
of the lipoyl-binding domain. E. coli AceF is a member
of this model, while mitochondrial and some other
bacterial forms belong to a separate model [Energy
metabolism, Pyruvate dehydrogenase].
Length = 546
Score = 31.4 bits (71), Expect = 2.6
Identities = 14/80 (17%), Positives = 29/80 (36%), Gaps = 8/80 (10%)
Query: 142 VVVKVPVEGAPTPATPDT----EAPTPAPGTESGPSSTTPGPETKTLESLNEKEKQLQAR 197
+++ + V G+ TPAT + +P + P + + Q A+
Sbjct: 185 LILTLSVAGS-TPATAPAPASAQPAAQSPAATQPEPAAAPAAAKAQAPAPQQAGTQNPAK 243
Query: 198 I---TKLSKILSGEVSIDLH 214
+ + L+ E +DL
Sbjct: 244 VDHAAPAVRRLAREFGVDLS 263
>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal. This domain
corresponds to the alpha helical C terminal domain of
phospholipase C beta.
Length = 181
Score = 30.5 bits (69), Expect = 2.6
Identities = 26/91 (28%), Positives = 54/91 (59%), Gaps = 2/91 (2%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK-MKKKGAEKIGKKKKKK 958
+RK++ E+I+K + +E + + KK E +K+KK+ +K + +K E+I + K K+
Sbjct: 28 KKRKEQHLKEQIQKLTELAREKQAAQLKKLKEISEKEKKELKKKLDRKRLERIQEAKTKE 87
Query: 959 KKKKKKKKKKKKKKK-KEAEKMKKKTEEAEK 988
K ++++K + + +E + K+ EEA+K
Sbjct: 88 KHAQEREKTEINRSHIQEVVQSIKRLEEAQK 118
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 31.2 bits (71), Expect = 2.6
Identities = 10/30 (33%), Positives = 15/30 (50%)
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
+K+K KK KK KK+K+K +
Sbjct: 222 SPSRKRKAPKKVAKKVAAAKKRKQKRSPPP 251
>gnl|CDD|165528 PHA03270, PHA03270, envelope glycoprotein C; Provisional.
Length = 466
Score = 31.4 bits (71), Expect = 2.7
Identities = 10/29 (34%), Positives = 12/29 (41%)
Query: 151 APTPATPDTEAPTPAPGTESGPSSTTPGP 179
PTP PA +SGP + P P
Sbjct: 51 TPTPTRGKGTPTGPASPPKSGPPKSPPAP 79
>gnl|CDD|117486 pfam08919, F_actin_bind, F-actin binding. The F-actin binding
domain forms a compact bundle of four antiparallel
alpha-helices, which are arranged in a left-handed
topology. Binding of F-actin to the F-actin binding
domain may result in cytoplasmic retention and
subcellular distribution of the protein, as well as
possible inhibition of protein function.
Length = 179
Score = 30.4 bits (68), Expect = 2.7
Identities = 24/79 (30%), Positives = 34/79 (43%), Gaps = 8/79 (10%)
Query: 135 PIPDATHVVVKVPVEGAPTPATPDTEAPTPAP-GTESGPSSTTPGPETKTLESLNEKEKQ 193
P P +T V P P P+T +P+P+ + PSS P T SL K +Q
Sbjct: 12 PKPQSTAKPVGTPPSPVPLPST----SPSPSKMANGTQPSSAAFIPLISTRVSL-RKTRQ 66
Query: 194 LQARITKLSKILSGEVSID 212
RI S ++ V +D
Sbjct: 67 PPERIA--SGKITKGVVLD 83
>gnl|CDD|213527 TIGR00390, hslU, ATP-dependent protease HslVU, ATPase subunit.
This model represents the ATPase subunit of HslVU, while
the proteasome-related peptidase subunit is HslV.
Residues 54-61 of the model contain a P-loop ATP-binding
motif. Cys-287 of E. coli (position 308 in the seed
alignment), studied in MEDLINE:98389714, is Ser in other
members of the seed alignment [Protein fate, Protein
folding and stabilization].
Length = 441
Score = 31.3 bits (71), Expect = 2.8
Identities = 25/96 (26%), Positives = 37/96 (38%), Gaps = 12/96 (12%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGA----------EKMKKKKKDAEKMKKKGAEKIGK 953
K + + ++++ E RKK E K E M G E++
Sbjct: 136 AKNQWGNMEEIQEEESVREAFRKKLREGELDDKEIEIEVSAKMPSGIEIMAPPGMEEMTM 195
Query: 954 KKKKKKKKKKKKKKKKKKKK-KEAEKMKKKTEEAEK 988
+ + KKKK+K K KEA+K EEA K
Sbjct: 196 NLQSLFQNLGPDKKKKRKLKIKEAKKALIA-EEAAK 230
>gnl|CDD|222242 pfam13584, BatD, Oxygen tolerance. This family of proteins carries
up to three membrane spanning regions and is involved in
tolerance to oxygen in in Bacteroides spp.
Length = 488
Score = 31.5 bits (72), Expect = 2.8
Identities = 14/57 (24%), Positives = 23/57 (40%), Gaps = 9/57 (15%)
Query: 886 AVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK--KGAEKMKKKKKDA 940
+ + ++ +KR R + K+A K+ KK K A+K KDA
Sbjct: 439 LLLFVLLLILYRKRRARNAD-------VAGNRTKKANKLAKKRLKEAKKALAGNKDA 488
>gnl|CDD|187861 cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Large proteins, some contain Zn-finger domain; signature
gene for I-A subtype; also known as TM1802 family.
Length = 579
Score = 31.6 bits (72), Expect = 2.8
Identities = 13/59 (22%), Positives = 23/59 (38%), Gaps = 1/59 (1%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K + +K+KK EK + KK+ K K K +K + +K +
Sbjct: 87 TYLLKTTEPLNKKIKK-WFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDIQDLKPG 144
>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106. This
archaebacterial protein family has no known function.
Members are predicted to be integral membrane proteins.
Length = 168
Score = 30.3 bits (69), Expect = 2.8
Identities = 13/65 (20%), Positives = 30/65 (46%), Gaps = 12/65 (18%)
Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
+K E + R+K+ ++ ++ K G K KK +K++++ + +K+
Sbjct: 40 RKMEKYQKREKEIQKRARE------------LRKNGDKLSPKKFEKRQEELMEDQKEMMM 87
Query: 977 EKMKK 981
+ MK
Sbjct: 88 DMMKP 92
Score = 30.3 bits (69), Expect = 3.2
Identities = 10/51 (19%), Positives = 24/51 (47%), Gaps = 3/51 (5%)
Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK---KKKEAEKMKKKT 983
+K + + ++K +K ++ +K K KK +K++ + + E M
Sbjct: 40 RKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKEMMMDMM 90
>gnl|CDD|235202 PRK04023, PRK04023, DNA polymerase II large subunit; Validated.
Length = 1121
Score = 31.4 bits (72), Expect = 2.8
Identities = 16/45 (35%), Positives = 26/45 (57%)
Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
A K K K +K+K G E + + KK+K+++++KKK K K
Sbjct: 250 ALKAPKILKYVDKLKIDGWEWLEELISGKKEKEEEEEKKKIKPKD 294
>gnl|CDD|221805 pfam12848, ABC_tran_2, ABC transporter. This domain is related to
pfam00005.
Length = 85
Score = 28.7 bits (65), Expect = 2.9
Identities = 18/70 (25%), Positives = 36/70 (51%), Gaps = 9/70 (12%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK-KKGAEKMKK---KKKDAEKMKKKGAEK 950
+ +++ R +++ +++K+ KK E + + + A K K+ + K EKM E+
Sbjct: 18 YLEQKEERLEQQEKAYEKQQKEIKKLEEFIDRFRAKASKAKQAQSRIKALEKM-----ER 72
Query: 951 IGKKKKKKKK 960
I K +KK K
Sbjct: 73 IEKPVEKKPK 82
Score = 27.5 bits (62), Expect = 8.6
Identities = 18/72 (25%), Positives = 34/72 (47%), Gaps = 12/72 (16%)
Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
++KE +++K EK +K+ K E+ I + + K K K+ + + K
Sbjct: 20 EQKEERLEQQEKAYEKQQKEIKKLEEF-------I-----DRFRAKASKAKQAQSRIKAL 67
Query: 977 EKMKKKTEEAEK 988
EKM++ + EK
Sbjct: 68 EKMERIEKPVEK 79
>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 379
Score = 31.1 bits (70), Expect = 2.9
Identities = 12/78 (15%), Positives = 27/78 (34%), Gaps = 8/78 (10%)
Query: 919 KEAEKMRKKKGAEKMKKKKKDAE--------KMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
EA ++ G + + ++ + K EK+ +K + K +
Sbjct: 181 GEAPITTRRIGVDGISLSLEETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQA 240
Query: 971 KKKKEAEKMKKKTEEAEK 988
K+E + K + +A
Sbjct: 241 LTKEELDAKKAELSKALA 258
>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
Length = 132
Score = 30.0 bits (68), Expect = 3.0
Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 2/34 (5%)
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEK--MKKK 982
I KKK +K +K+ K ++KK + KE K KKK
Sbjct: 99 IKKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132
Score = 29.6 bits (67), Expect = 3.3
Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 9/57 (15%)
Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
KK K R + + KKKK K +K+ K ++KK + K+K K KKK
Sbjct: 85 KKFEPKYRLIR--LGLIKKKKAGRKARKE-------LKNRRKKVRGKEKTKVSGKKK 132
Score = 29.2 bits (66), Expect = 4.8
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
KKKK +K +K+ K ++KK + EK K +
Sbjct: 100 KKKKAGRKARKELKNRRKKVRGKEKTKVSGK 130
>gnl|CDD|215038 PLN00040, PLN00040, Protein MAK16 homolog; Provisional.
Length = 233
Score = 30.9 bits (70), Expect = 3.0
Identities = 24/101 (23%), Positives = 37/101 (36%), Gaps = 22/101 (21%)
Query: 899 RTRRKKKKGAEKI--RKKKKKKKEAEKMRK-KKGAEKMKKKKKDAEKMKKKG-------- 947
R R+ K EKI +K K+E + K +K A+ K +K+ + K G
Sbjct: 121 RMRKLALKTREKIVTTPRKLLKRERRRESKAQKAAQLEKSIEKELLERLKSGTYGDIYNF 180
Query: 948 -----------AEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
E +++ K K KKK + E E
Sbjct: 181 PSKSYNKVLEMEEVEEAEEELPKSDKNPNSKKKSRVHVEIE 221
>gnl|CDD|227818 COG5531, COG5531, SWIB-domain-containing proteins implicated in
chromatin remodeling [Chromatin structure and dynamics].
Length = 237
Score = 30.9 bits (70), Expect = 3.1
Identities = 24/107 (22%), Positives = 35/107 (32%), Gaps = 5/107 (4%)
Query: 879 NDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK-KKK 937
D + V L F + + R K K +E +RK+K K K
Sbjct: 20 RDTNNEKDVGKLLFSEWTVRLEGRFKDNNDLIRDKFDSLAEEPRVLRKEKYNITRKTTGK 79
Query: 938 KDAEKMKKKGAEKIGK----KKKKKKKKKKKKKKKKKKKKKEAEKMK 980
D K + + + K+ KKKK K EK+K
Sbjct: 80 NDLPKEEDSSLPSSKETENGDTEGKETDKKKKSSTISKNSPSGEKVK 126
>gnl|CDD|233063 TIGR00631, uvrb, excinuclease ABC, B subunit. All proteins in this
family for wich functions are known are DNA helicases
that function in the nucleotide excision repair and are
endonucleases that make the 3' incision next to DNA
damage. They are part of a pathway requiring UvrA, UvrB,
UvrC, and UvrD homologs. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 655
Score = 31.1 bits (71), Expect = 3.1
Identities = 19/81 (23%), Positives = 39/81 (48%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
A+KI +K E + R+K ++ + ++K + + + K+K+ KKKKK
Sbjct: 550 ADKITDSMQKAIEETERRRKIQMAYNEEHGITPQTIRKPIRDILDIELKEKEDAAKKKKK 609
Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
+ +++KK ++ EK
Sbjct: 610 GEDLSDLSKKELKKLIKQLEK 630
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 31.6 bits (72), Expect = 3.1
Identities = 19/101 (18%), Positives = 32/101 (31%), Gaps = 11/101 (10%)
Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD-AEKMKKKGAEKIGKKKKK 957
K K + + KK K ++K KK+ AE K+ GA + K+ K
Sbjct: 960 HEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLK 1019
Query: 958 KKK----------KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K + + K +K+K
Sbjct: 1020 ELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLLENN 1060
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 31.5 bits (72), Expect = 3.2
Identities = 11/39 (28%), Positives = 20/39 (51%), Gaps = 3/39 (7%)
Query: 951 IGKKKKKKKKKKKKKKKKK---KKKKKEAEKMKKKTEEA 986
+KKK KK +K+ + + KK+KE + + +K
Sbjct: 83 FREKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRHV 121
Score = 31.1 bits (71), Expect = 3.8
Identities = 10/34 (29%), Positives = 19/34 (55%)
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
++KKK KK +K+ + + +K K+ +EK
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEK 117
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 30.3 bits (69), Expect = 3.2
Identities = 18/46 (39%), Positives = 23/46 (50%), Gaps = 3/46 (6%)
Query: 944 KKKGAEKIGKKKKKKKK---KKKKKKKKKKKKKKEAEKMKKKTEEA 986
K +GKKKK KK+ KK KKK + K E K K ++A
Sbjct: 114 AKHYGIDLGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKA 159
Score = 29.2 bits (66), Expect = 7.5
Identities = 18/50 (36%), Positives = 26/50 (52%)
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
DA K+ A+ G KKKK KK+++ KK KKK E+ K + +
Sbjct: 105 DAAPFKQWYAKHYGIDLGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNH 154
>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function. This domain
family is found in eukaryotes, and is typically between
96 and 116 amino acids in length.
Length = 106
Score = 29.2 bits (66), Expect = 3.2
Identities = 18/48 (37%), Positives = 21/48 (43%), Gaps = 3/48 (6%)
Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
KKKG +K+ EK + I K KK KKK KK KK
Sbjct: 1 KKKGLKKLDCY---CEKKIFDKIDYIDKLKKDPNIDKKKFKKIIFKKY 45
>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20. This
family consists of several phage minor structural
protein GP20 sequences of around 180 residues in length.
The function of this family is unknown.
Length = 156
Score = 30.0 bits (68), Expect = 3.3
Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 7/81 (8%)
Query: 909 EKIRKKKKKKKEAEKMRKK--KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
K+ K +++ +K K K E +KKK KD E+++K KI K K++ K K++ +
Sbjct: 20 SKLDKANEERDSLKKQLKNRDKQIEDLKKKVKDNEELQK----KIEKLKQQNKTAKEEYE 75
Query: 967 KKKKKKKKE-AEKMKKKTEEA 986
K K A ++ K +A
Sbjct: 76 AKLADTKLNNAIELALKKAKA 96
>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit. This
family includes the vacuolar ATP synthase E subunit, as
well as the archaebacterial ATP synthase E subunit.
Length = 195
Score = 30.4 bits (69), Expect = 3.3
Identities = 19/78 (24%), Positives = 41/78 (52%), Gaps = 4/78 (5%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
AE+IR + +++ E EK + AEK K ++ + K+K AE + K+ K + + K
Sbjct: 10 AEEIRAEAEEEFEIEKAEAVEEAEK---KIEEIYEKKEKQAE-MEKQIIISNAKNEARLK 65
Query: 968 KKKKKKKEAEKMKKKTEE 985
+++ + + ++ +E
Sbjct: 66 VLNAREELLDSVFEEAKE 83
>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986). This
family of proteins has no known function.
Length = 44
Score = 27.6 bits (62), Expect = 3.3
Identities = 14/36 (38%), Positives = 17/36 (47%)
Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
KKK +K+ KK KK K K K K E K+
Sbjct: 4 KKKINQKLKKKAKKANAKLHPSNKPKYISKAERAKL 39
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 31.2 bits (71), Expect = 3.4
Identities = 10/54 (18%), Positives = 20/54 (37%), Gaps = 4/54 (7%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
E+ K+ E+ +K + K K+ + K+ E+ + K K
Sbjct: 525 EEQGTNFGKRNSKERYKK----NEDKIKEFASALGKEILKEEPTENSSKNILKL 574
Score = 30.9 bits (70), Expect = 4.0
Identities = 9/34 (26%), Positives = 17/34 (50%)
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
E+ G K+ K++ KK + K K+ + K+
Sbjct: 525 EEQGTNFGKRNSKERYKKNEDKIKEFASALGKEI 558
Score = 30.1 bits (68), Expect = 8.5
Identities = 8/31 (25%), Positives = 12/31 (38%)
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
K+ K++ KK + K KE K
Sbjct: 528 GTNFGKRNSKERYKKNEDKIKEFASALGKEI 558
>gnl|CDD|153355 cd07671, F-BAR_PSTPIP1, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
Phosphatase-Interacting Protein 1. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Proline-Serine-Threonine
Phosphatase-Interacting Protein 1 (PSTPIP1), also known
as CD2 Binding Protein 1 (CD2BP1), is mainly expressed
in hematopoietic cells. It is a binding partner of the
cell surface receptor CD2 and PTP-PEST, a tyrosine
phosphatase which functions in cell motility and Rac1
regulation. It also plays a role in the activation of
the Wiskott-Aldrich syndrome protein (WASP), which
couples actin rearrangement and T cell activation.
Mutations in the gene encoding PSTPIP1 cause the
autoinflammatory disorder known as PAPA (pyogenic
sterile arthritis, pyoderma gangrenosum, and acne)
syndrome. PSTPIP1 contains an N-terminal F-BAR domain,
PEST motifs, and a C-terminal SH3 domain. F-BAR domains
form banana-shaped dimers with a positively-charged
concave surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 242
Score = 30.7 bits (69), Expect = 3.4
Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 6/93 (6%)
Query: 895 FR-KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
FR +++ +RKK + + +K K + M KK E+ ++ +AE + E+
Sbjct: 93 FRERQKEQRKKYEAVMERVQKSKVSLYKKTMESKKTYEQRCREADEAE----QTFERSSS 148
Query: 954 KKKKKKKKKKKKKKKK-KKKKKEAEKMKKKTEE 985
K+ +K + K K+ + EAE++ K+ E
Sbjct: 149 TGNPKQSEKSQNKAKQCRDAATEAERVYKQNIE 181
>gnl|CDD|233390 TIGR01385, TFSII, transcription elongation factor S-II. This model
represents eukaryotic transcription elongation factor
S-II. This protein allows stalled RNA transcription
complexes to perform a cleavage of the nascent RNA and
restart at the newly generated 3-prime end.
Length = 299
Score = 31.0 bits (70), Expect = 3.4
Identities = 17/89 (19%), Positives = 40/89 (44%), Gaps = 8/89 (8%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKK------KGAEKMKKKKKDAEKMKKKGAEKIGK 953
+ K+ E++ ++ K + K+RK K A+K+ K K + K ++ G
Sbjct: 28 HQLKEFPPTEELLQETKVGVKVNKLRKHPNEDISKLAKKIIKSWKKV--VDKNKSDHPGG 85
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+ K + K++ K +++K+++
Sbjct: 86 NPEDKTTVGESVNSVKQEAKSQSDKIEQP 114
>gnl|CDD|180550 PRK06382, PRK06382, threonine dehydratase; Provisional.
Length = 406
Score = 31.0 bits (70), Expect = 3.4
Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 3/46 (6%)
Query: 790 ISFIMYGQLENADDLVRDLCEEKDAILRRSGMYTIAMAYCGSGSNV 835
+S I+Y +LEN LVR C D R +Y IA A +G N+
Sbjct: 316 MSKIIYKELENLGQLVRIECNIPD---RPGNLYRIANAIASNGGNI 358
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 31.1 bits (71), Expect = 3.4
Identities = 25/86 (29%), Positives = 48/86 (55%), Gaps = 16/86 (18%)
Query: 918 KKEAEKMRK--KKGAEKMKKKKKD-------------AEKMKKKGAEKIGKKKKKKKKKK 962
++E E+M K + AE+ KK+K++ AEK K+ +K+ ++K+K +K
Sbjct: 500 EEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEEKEKIEKA 559
Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K+ K + + E++K KTEE ++
Sbjct: 560 VAELKEALKGE-DVEEIKAKTEELQQ 584
>gnl|CDD|233626 TIGR01897, cas_MJ1666, CRISPR-associated protein, MJ1666 family.
CRISPR is a term for Clustered, Regularly Interspaced
Short Palidromic Repeats. A number of protein families
appear only in association with these repeats and are
designated Cas (CRISPR-Associated) proteins. This model
describes a Cas protein about 400 residues in length,
found mostly in the Archaea but also in Aquifex [Mobile
and extrachromosomal element functions, Other].
Length = 410
Score = 30.8 bits (70), Expect = 3.4
Identities = 18/84 (21%), Positives = 33/84 (39%), Gaps = 3/84 (3%)
Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK---MKKKGAE 949
F ++ + ++ K + K+AE++ K E K + EK + G E
Sbjct: 326 TFFDEEYISVYIEIIKNELDNIKNRSKKAEELALIKATEIYKVSGEGFEKRNFLAHAGFE 385
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKK 973
+ KK+K K + K +K
Sbjct: 386 YNVVEVKKEKDKIYVRYGDKIEKI 409
>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
factor dissociates the posttermination complex, composed
of the ribosome, deacylated tRNA, and mRNA, after
termination of translation. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis. RRF is believed to bind the ribosome at the
A-site in a manner that mimics tRNA, but the specific
mechanisms remain unclear. RRF is essential for
bacterial growth. It is not necessary for cell growth
in archaea or eukaryotes, but is found in mitochondria
or chloroplasts of some eukaryotic species.
Length = 179
Score = 30.3 bits (69), Expect = 3.5
Identities = 14/58 (24%), Positives = 31/58 (53%)
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E+ K+ KDA+K+ ++ I ++ K KK +K+K+ + E +K ++ ++
Sbjct: 104 ERRKELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAEEDLQKLTD 161
Score = 29.5 bits (67), Expect = 6.2
Identities = 18/72 (25%), Positives = 36/72 (50%), Gaps = 6/72 (8%)
Query: 896 RKKRTRRKKKKGAE------KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
RK+ + KK E IR+ K + + K+ +++KK ++D +K+ + +
Sbjct: 106 RKELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAEEDLQKLTDEYIK 165
Query: 950 KIGKKKKKKKKK 961
KI + K K+K+
Sbjct: 166 KIDELLKSKEKE 177
>gnl|CDD|153357 cd07673, F-BAR_FCHO2, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of FCH domain Only 2
protein. F-BAR domains are dimerization modules that
bind and bend membranes and are found in proteins
involved in membrane dynamics and actin reorganization.
The specific function of FCH domain Only 2 (FCHO2) is
still unknown. It contains an N-terminal F-BAR domain
and a C-terminal domain of unknown function named SAFF
which is also present in FCHO1 and endophilin
interacting protein 1. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
Length = 269
Score = 30.8 bits (69), Expect = 3.5
Identities = 29/102 (28%), Positives = 45/102 (44%), Gaps = 19/102 (18%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRK------------KKGAEKMKKKKKDAEKMKKKGA--- 948
+K G E+++ KK K+E + +K E K + E++KK+GA
Sbjct: 98 QKYGEEQVKSHKKTKEEVAGTLEAVQNIQSITQALQKSKENYNAKCLEQERLKKEGATQR 157
Query: 949 --EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
EK K KK + K +K K + E +K TE A+K
Sbjct: 158 EIEKAAVKSKKATESYKLYVEKYALAKADFE--QKMTETAQK 197
>gnl|CDD|237803 PRK14724, PRK14724, DNA topoisomerase III; Provisional.
Length = 987
Score = 31.1 bits (70), Expect = 3.5
Identities = 24/88 (27%), Positives = 28/88 (31%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
F RTRR K ++ K E E K K K A G +
Sbjct: 821 FVSMRTRRAFKAFLAWDKEAGKVNFEFEPRESKFPPRKTAAAKAGAASAAFGGTVAVKAA 880
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
K KK KK K K + KKK
Sbjct: 881 KPAKKAAAKKVAAKTAAAKTPRKAAKKK 908
>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
transcription initiation/nucleotide excision repair
factor TFIIH, subunit TFB3 [Cell division and chromosome
partitioning / Transcription / DNA replication,
recombination, and repair].
Length = 314
Score = 30.8 bits (69), Expect = 3.5
Identities = 16/71 (22%), Positives = 32/71 (45%)
Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
+ K K E+M + + GK+K +K+ K KK++ +++ +E
Sbjct: 123 LIDVSLTEEKVKKYEEMNQDSILNNLERPTPEVMPGKQKNVLQKRMKLKKRQLERQIEEE 182
Query: 977 EKMKKKTEEAE 987
E+M K+ + E
Sbjct: 183 ERMNKEWTKKE 193
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for centrosome
separation and maintenance of spindle bi-polarity. TPX2
is a microtubule-associated protein that mediates the
binding of the C-terminal domain of Xklp2 to
microtubules. It is phosphorylated during mitosis in a
microtubule-dependent way.
Length = 57
Score = 27.7 bits (62), Expect = 3.5
Identities = 12/35 (34%), Positives = 23/35 (65%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
+K+ + KK ++K+K + +K+EAE +K+ EE
Sbjct: 8 EKRAEFDKKLEEKEKALEAEKEEAEARQKEEEEEA 42
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 31.3 bits (70), Expect = 3.6
Identities = 13/33 (39%), Positives = 16/33 (48%)
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
KKKKKKKKK K K+ K + E +
Sbjct: 105 TPSKKKKKKKKKGWFWAKSKQESKTIETEEIID 137
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 27.9 bits (63), Expect = 3.6
Identities = 12/39 (30%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 945 KKGAEKIGKKKKKKKKKK-KKKKKKKKKKKKEAEKMKKK 982
K+ EK G ++ ++++ +K +K+K+KK A K ++K
Sbjct: 20 KRKVEKAGILRELRRREFYEKPSEKRKRKKAAARKRRRK 58
Score = 27.5 bits (62), Expect = 5.0
Identities = 8/20 (40%), Positives = 15/20 (75%)
Query: 956 KKKKKKKKKKKKKKKKKKKE 975
K+K+KK +K+++KK +E
Sbjct: 44 KRKRKKAAARKRRRKKLARE 63
>gnl|CDD|240235 PTZ00032, PTZ00032, 60S ribosomal protein L18; Provisional.
Length = 211
Score = 30.2 bits (68), Expect = 3.6
Identities = 19/65 (29%), Positives = 26/65 (40%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
+ T+R K KG K RK K AE +K E++ KD +K KG +
Sbjct: 51 KLLLTKRSKLKGIPKPRKLHKHGFWAEIFEEKVEREELGNPCKDLDKDIIKGKRRPRLTL 110
Query: 956 KKKKK 960
K
Sbjct: 111 KNTNN 115
Score = 29.4 bits (66), Expect = 6.4
Identities = 10/51 (19%), Positives = 18/51 (35%)
Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ +K + K K+ K K K +K K + E+ E+
Sbjct: 35 PNRKKSAPEQVPTGKNKLLLTKRSKLKGIPKPRKLHKHGFWAEIFEEKVER 85
>gnl|CDD|237512 PRK13800, PRK13800, putative oxidoreductase/HEAT repeat-containing
protein; Provisional.
Length = 897
Score = 31.0 bits (70), Expect = 3.7
Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 4/90 (4%)
Query: 409 DLVRDLCEEKDAILRRSGMYTIAMAYCVSYLYFDLFSDLTYRIVKNRSGMYTIAMAYCGS 468
D VR L + D ++R + + +A C + L + R G A A G+
Sbjct: 778 DAVRALTGDPDPLVRAAALAALAELGCPPDDVAAATAALRASAWQVRQGA---ARALAGA 834
Query: 469 GSNVATKRLLHVAVSDVNDDVRREAVTALG 498
++VA L A++D + DVR+ AV AL
Sbjct: 835 AADVAVP-ALVEALTDPHLDVRKAAVLALT 863
>gnl|CDD|182380 PRK10328, PRK10328, DNA binding protein, nucleoid-associated;
Provisional.
Length = 134
Score = 29.5 bits (66), Expect = 3.8
Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 3/68 (4%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG---AEKIGKKKKKKKKKKKKK 965
EK R K+++E E+ ++++ AE+ +K E MK G E +G + KK+
Sbjct: 31 EKFRVVTKERREEEEQQQRELAERQEKINTWLELMKADGINPEELLGNSSAAAPRAGKKR 90
Query: 966 KKKKKKKK 973
+ + K +
Sbjct: 91 QPRPAKYR 98
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 28.8 bits (65), Expect = 4.1
Identities = 10/39 (25%), Positives = 22/39 (56%)
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
E K K+KK+++++ K++ ++ KKE E + +
Sbjct: 47 EVTDKVKEKKEEREEDKEELIERIKKEEETFEDVDLGSA 85
>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator. This
family of trans-activating transcriptional regulator
(TATR), also known as intermediate early protein 1, are
common to the Nucleopolyhedroviruses.
Length = 571
Score = 31.0 bits (70), Expect = 4.1
Identities = 13/75 (17%), Positives = 26/75 (34%), Gaps = 7/75 (9%)
Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK-------KKKK 972
E + + K K+K + + GKK K KK++K KK ++
Sbjct: 112 LPESLSETKQVTVSKRKGSELDSDSDSSDSSKGKKLVNKPKKRQKYKKATIQSPTSLTEE 171
Query: 973 KKEAEKMKKKTEEAE 987
+ ++ +
Sbjct: 172 ENYNTEICTVAPTDQ 186
>gnl|CDD|216860 pfam02063, MARCKS, MARCKS family.
Length = 296
Score = 30.6 bits (68), Expect = 4.1
Identities = 15/46 (32%), Positives = 19/46 (41%), Gaps = 1/46 (2%)
Query: 139 ATHVVVKVPVEGAPT-PATPDTEAPTPAPGTESGPSSTTPGPETKT 183
A+ K P E P PA+P A TE TP P ++T
Sbjct: 87 ASTEPDKEPAEAEPAEPASPAEAEGEAATSTEKAEDGATPSPSSET 132
>gnl|CDD|185404 PTZ00028, PTZ00028, 40S ribosomal protein S6e; Provisional.
Length = 218
Score = 30.2 bits (68), Expect = 4.2
Identities = 23/91 (25%), Positives = 46/91 (50%), Gaps = 5/91 (5%)
Query: 898 KRTRRKKKKGAEKIRK--KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK-KGAEKIGKK 954
+R RR K A KIRK K + K ++ E K K A K+++ +++ +K
Sbjct: 128 ERPRRLGPKRASKIRKLFNLSKSDDVRKFVVRRKIEGKNKTK--APKIQRLVTPQRLQRK 185
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
++ +K KKK + +++ K+ + +++ E
Sbjct: 186 RRYRKTLKKKVLQSREEAKEYQKLLEEYKNE 216
>gnl|CDD|205286 pfam13105, DUF3959, Protein of unknown function (DUF3959). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 260 amino acids in length.
Length = 242
Score = 30.2 bits (68), Expect = 4.2
Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 7/75 (9%)
Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK--KKKKKKKKKKKKKKK 969
RK+K KE + K+K KK+KK + K++ K+ K K +K K +K
Sbjct: 152 RKEKDDNKENINLFKQK-----KKRKKLSFKIRLPRLPKLKMKLFKFGRKPSNSKDPRKY 206
Query: 970 KKKKKEAEKMKKKTE 984
+ +E M TE
Sbjct: 207 EHNYEENVPMYHMTE 221
>gnl|CDD|238146 cd00238, ERp29c, ERp29 and ERp38, C-terminal domain; composed of
the protein disulfide isomerase (PDI)-like proteins
ERp29 and ERp38. ERp29 (also called ERp28) is a
ubiquitous endoplasmic reticulum (ER)-resident protein
expressed in high levels in secretory cells. It contains
a redox inactive TRX-like domain at the N-terminus. The
expression profile of ERp29 suggests a role in secretory
protein production, distinct from that of PDI. It has
also been identified as a member of the thyroglobulin
folding complex and is essential in regulating the
secretion of thyroglobulin. The Drosophila homolog,
Wind, is the product of windbeutel, an essential gene in
the development of dorsal-ventral patterning. Wind is
required for correct targeting of Pipe, a Golgi-resident
type II transmembrane protein with homology to
2-O-sulfotransferase. ERp38 is a P5-like protein, first
isolated from alfalfa (the cDNA clone was named G1),
which contains two redox active TRX domains at the
N-terminus, like human P5. However, unlike human P5,
ERp38 also contains a C-terminal domain with homology to
the C-terminal domain of ERp29. It may be a
glucose-regulated protein. The function of the
all-helical C-terminal domain of ERp29 and ERp38 remains
unclear. The C-terminal domain of Wind is thought to
provide a distinct site required for interaction with
its substrate, Pipe.
Length = 93
Score = 28.8 bits (65), Expect = 4.2
Identities = 15/66 (22%), Positives = 32/66 (48%), Gaps = 1/66 (1%)
Query: 907 GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
E+ +K K+ EK+++ + K K EK+ +KG + + K+ + ++ +KK
Sbjct: 16 DEERKELLEKVKEAVEKLKEAEAKYA-KYYVKVMEKILEKGEDYVEKELARLERLLEKKG 74
Query: 967 KKKKKK 972
+K
Sbjct: 75 LAPEKA 80
>gnl|CDD|222855 PHA02529, O, capsid-scaffolding protein; Provisional.
Length = 278
Score = 30.5 bits (69), Expect = 4.3
Identities = 14/81 (17%), Positives = 39/81 (48%), Gaps = 1/81 (1%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
EK++K +KKE + M ++ A+ + + AEK+ A ++ + + ++ K +
Sbjct: 175 FEKVKKLFSRKKEDDAMTDEQFADVHEAVEGVAEKIDHTSA-QVATTETAEPPQEGKDTE 233
Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
+ + ++ +K+ ++
Sbjct: 234 ENGETAEQFSALKETLDKLAS 254
>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein. This family
consists of the p50 and variable adherence-associated
antigen (Vaa) adhesins from Mycoplasma hominis. M.
hominis is a mycoplasma associated with human urogenital
diseases, pneumonia, and septic arthritis. An adhesin is
a cell surface molecule that mediates adhesion to other
cells or to the surrounding surface or substrate. The
Vaa antigen is a 50-kDa surface lipoprotein that has
four tandem repetitive DNA sequences encoding a periodic
peptide structure, and is highly immunogenic in the
human host. p50 is also a 50-kDa lipoprotein, having
three repeats A,B and C, that may be a tetramer of
191-kDa in its native environment.
Length = 353
Score = 30.4 bits (68), Expect = 4.3
Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 5/76 (6%)
Query: 918 KKEAEKMRK-KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE- 975
K E ++ KG E++KK + + + I + +KK K K K++ KK K E
Sbjct: 227 KNELTEINSIIKGVEELKKLSHEISEFSNSVKKTISELEKKFKIDDKTNKEEAKKFKNEL 286
Query: 976 ---AEKMKKKTEEAEK 988
A+++ K+ E +K
Sbjct: 287 ENFADQLLNKSHEIDK 302
>gnl|CDD|129586 TIGR00495, crvDNA_42K, 42K curved DNA binding protein. Proteins
identified by This model have been identified in a
number of species as a nuclear (but not nucleolar)
protein with a cell cycle dependence. Various names
given to members of this family have included cell cycle
protein p38-2G4, DNA-binding protein GBP16, and
proliferation-associated protein 1. This protein is
closely related to methionine aminopeptidase, a
cobolt-binding protein [Unknown function, General].
Length = 389
Score = 30.6 bits (69), Expect = 4.3
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMK 980
K+KK+KKK KK K + E E M+
Sbjct: 364 KRKKQKKKAKKASKTGEAATEGETME 389
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 30.7 bits (70), Expect = 4.3
Identities = 17/85 (20%), Positives = 35/85 (41%), Gaps = 6/85 (7%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG---AEKIGKKKKKK 958
RK + K+ + K + EK K K E++ K EKI K+ K+
Sbjct: 46 RKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKEL 105
Query: 959 KKKKKK---KKKKKKKKKKEAEKMK 980
+++ + + K+ +++ + E
Sbjct: 106 EEEISELENEIKELEQEIERLEPWG 130
Score = 30.7 bits (70), Expect = 4.8
Identities = 15/69 (21%), Positives = 30/69 (43%), Gaps = 13/69 (18%)
Query: 522 SLLDDPMIELKEFSLKKLNDIVDDFWPENEKE--------KQLQARITKLSKILS----- 568
+ L + ++ SL++L V++ + EKE +L+ I +L + +
Sbjct: 70 NPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPW 129
Query: 569 GEVSIDLHL 577
G +DL L
Sbjct: 130 GNFDLDLSL 138
>gnl|CDD|219401 pfam07404, TEBP_beta, Telomere-binding protein beta subunit (TEBP
beta). This family consists of several telomere-binding
protein beta subunits which appear to be specific to the
family Oxytrichidae. Telomeres are specialised
protein-DNA complexes that compose the ends of
eukaryotic chromosomes. Telomeres protect chromosome
termini from degradation and recombination and act
together with telomerase to ensure complete genome
replication. TEBP beta forms a complex with TEBP alpha
and this complex is able to recognise and bind ssDNA to
form a sequence-specific, telomeric nucleoprotein
complex that caps the very 3' ends of chromosomes.
Length = 375
Score = 30.5 bits (68), Expect = 4.4
Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 1/63 (1%)
Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
++KG + + K + K KG K GK K K K KK KK + ++K ++
Sbjct: 222 QEKGKDALNKAADHTDVAKVKGGAK-GKGKAAAKSAKGKKLSAKKGDSASSADVRKSVDK 280
Query: 986 AEK 988
K
Sbjct: 281 IVK 283
>gnl|CDD|153294 cd07610, FCH_F-BAR, The Extended FES-CIP4 Homology (FCH) or F-BAR
(FCH and Bin/Amphiphysin/Rvs) domain, a dimerization
module that binds and bends membranes. F-BAR domains
are dimerization modules that bind and bend membranes
and are found in proteins involved in membrane dynamics
and actin reorganization. F-BAR domain containing
proteins, also known as Pombe Cdc15 homology (PCH)
family proteins, include Fes and Fer tyrosine kinases,
PACSINs/Syndapins, FCHO, PSTPIP, CIP4-like proteins and
srGAPs. Many members also contain an SH3 domain and play
roles in endocytosis. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
These tubules have diameters larger than those observed
with N-BARs. The F-BAR domains of some members such as
NOSTRIN and Rgd1 are important for the subcellular
localization of the protein.
Length = 191
Score = 30.0 bits (68), Expect = 4.5
Identities = 30/118 (25%), Positives = 54/118 (45%), Gaps = 21/118 (17%)
Query: 890 LGFILFRKKRTRRKKK--------KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE 941
LG L + R KK+ K +K+ KK KK E+ K ++++ + A
Sbjct: 11 LGLDLLKDLREFLKKRAAIEEEYAKNLQKLAKKFSKKPESGKTSLGTSWNSLREETESAA 70
Query: 942 KMKKKGAEKIGKK--------KKKKKKKKKK-----KKKKKKKKKKEAEKMKKKTEEA 986
+ ++ +EK+ + K+ K++ +KK +K KKK ++ A+ KK EE
Sbjct: 71 TVHEELSEKLSQLIREPLEKVKEDKEQARKKELAEGEKLKKKLQELWAKLAKKADEEY 128
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
function prediction only].
Length = 872
Score = 31.0 bits (70), Expect = 4.5
Identities = 12/48 (25%), Positives = 18/48 (37%)
Query: 140 THVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTLESL 187
T VV+ V T + + T +P S ++TT P T
Sbjct: 802 TSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGGG 849
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 28.7 bits (65), Expect = 4.5
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKE 975
K++++K KKK KK + ++ KK+
Sbjct: 69 KEERRKDKKKYGKKARLREWKKK 91
Score = 27.9 bits (63), Expect = 6.6
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 960 KKKKKKKKKKKKKKKEAEKMKKKT 983
K++++K KKK KK + KKK
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKV 92
>gnl|CDD|185443 PTZ00097, PTZ00097, 60S ribosomal protein L19; Provisional.
Length = 175
Score = 29.7 bits (67), Expect = 4.6
Identities = 19/74 (25%), Positives = 36/74 (48%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
+R+ +K + A+K+ + E K K + K K+ E I K K +K K+KK + + +
Sbjct: 100 LRRLLRKYRAAKKIDRHMYHEFYLKSKGNQFKNKRVLIEAIHKTKNEKVKEKKIQDQLEA 159
Query: 971 KKKKEAEKMKKKTE 984
++ K K+
Sbjct: 160 RRAKAKALRNKRKA 173
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 29.8 bits (67), Expect = 4.6
Identities = 13/71 (18%), Positives = 31/71 (43%), Gaps = 2/71 (2%)
Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
E+M ++ + K K +A+ K ++ K + + +K +++ K K E
Sbjct: 121 MSIVEEMLEENAKKGGKIDKMEADSTDLK--ARLRKAQILLEGLQKNQEELFKLLDKYNE 178
Query: 978 KMKKKTEEAEK 988
++ +E+ K
Sbjct: 179 LREQVQKESSK 189
>gnl|CDD|130217 TIGR01147, V_ATP_synt_G, vacuolar ATP synthase, subunit G. This
model describes the vacuolar ATP synthase G subunit in
eukaryotes and includes members from diverse groups
e.g., fungi, plants, parasites etc. V-ATPases are
multi-subunit enzymes composed of two functional
domains: A transmembrane Vo domain and a peripheral
catalytic domain V1. The G subunit is one of the
subunits of the catalytic domain. V-ATPases are
responsible for the acidification of endosomes and
lysosomes, which are part of the central vacuolar system
[Energy metabolism, ATP-proton motive force
interconversion].
Length = 113
Score = 29.0 bits (65), Expect = 4.7
Identities = 20/67 (29%), Positives = 36/67 (53%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
RK++T+R K+ E ++ +K K++ EK K+ A+ + EK + + KI + K
Sbjct: 26 RKRKTKRLKQAKEEAQKEVEKYKQQREKEFKEFEAKHLGGNGAAEEKAEAETQAKIREIK 85
Query: 956 KKKKKKK 962
K +K K
Sbjct: 86 KAVQKNK 92
Score = 28.3 bits (63), Expect = 8.3
Identities = 21/78 (26%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK-MKKKGAEKIGKKKKKKKKKK 962
+K+ AEK+ + +K+K + K K++ ++++K K+ EK K+ A+ +G ++K +
Sbjct: 15 EKRAAEKVSEARKRKTKRLKQAKEEAQKEVEKYKQQREKEFKEFEAKHLGGNGAAEEKAE 74
Query: 963 KKKKKKKKKKKKEAEKMK 980
+ + K ++ KK +K K
Sbjct: 75 AETQAKIREIKKAVQKNK 92
>gnl|CDD|218611 pfam05499, DMAP1, DNA methyltransferase 1-associated protein 1
(DMAP1). DNA methylation can contribute to
transcriptional silencing through several
transcriptionally repressive complexes, which include
methyl-CpG binding domain proteins (MBDs) and histone
deacetylases (HDACs). The chief enzyme that maintains
mammalian DNA methylation, DNMT1, can also establish a
repressive transcription complex. The non-catalytic
amino terminus of DNMT1 binds to HDAC2 and DMAP1 (for
DNMT1 associated protein), and can mediate
transcriptional repression. DMAP1 has intrinsic
transcription repressive activity, and binds to the
transcriptional co-repressor TSG101. DMAP1 is targeted
to replication foci through interaction with the far N
terminus of DNMT1 throughout S phase, whereas HDAC2
joins DNMT1 and DMAP1 only during late S phase,
providing a platform for how histones may become
deacetylated in heterochromatin following replication.
Length = 175
Score = 29.8 bits (67), Expect = 4.8
Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 2/62 (3%)
Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
+K EA K ++K A+ ++K A++ + + +K++KK KKK +K KE
Sbjct: 10 RKIEARKKEREKKAQDLQKLITAADQQAAGFST--AEARKREKKLPKKKIPQKPGPSKED 67
Query: 977 EK 978
K
Sbjct: 68 SK 69
>gnl|CDD|225651 COG3109, ProQ, Activator of osmoprotectant transporter ProP [Signal
transduction mechanisms].
Length = 208
Score = 29.8 bits (67), Expect = 4.9
Identities = 12/51 (23%), Positives = 25/51 (49%)
Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
RK+ K + A++ ++ KK++EA +K AE+ + K ++
Sbjct: 100 RKQLAEAKARVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPRE 150
>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
This family consists of the N-terminal region of the
prokaryotic fibronectin-binding protein. Fibronectin
binding is considered to be an important virulence
factor in streptococcal infections. Fibronectin is a
dimeric glycoprotein that is present in a soluble form
in plasma and extracellular fluids; it is also present
in a fibrillar form on cell surfaces. Both the soluble
and cellular forms of fibronectin may be incorporated
into the extracellular tissue matrix. While fibronectin
has critical roles in eukaryotic cellular processes,
such as adhesion, migration and differentiation, it is
also a substrate for the attachment of bacteria. The
binding of pathogenic Streptococcus pyogenes and
Staphylococcus aureus to epithelial cells via
fibronectin facilitates their internalisation and
systemic spread within the host.
Length = 447
Score = 30.3 bits (69), Expect = 5.0
Identities = 19/73 (26%), Positives = 42/73 (57%), Gaps = 2/73 (2%)
Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
++K + ++ + E+ ++ K+E E + + + + +D E+++++ E+
Sbjct: 377 YYKKYKKLKRAVEAVEEQ--IEETKEEIEYLESVEAQLENAESLEDLEEIREELIEQGYL 434
Query: 954 KKKKKKKKKKKKK 966
K+KK+KKKKKKKK
Sbjct: 435 KEKKRKKKKKKKK 447
>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain. FH proteins control
rearrangements of the actin cytoskeleton, especially in
the context of cytokinesis and cell polarisation.
Members of this family have been found to interact with
Rho-GTPases, profilin and other actin-assoziated
proteins. These interactions are mediated by the
proline-rich FH1 domain, usually located in front of FH2
(but not listed in SMART). Despite this cytosolic
function, vertebrate formins have been assigned
functions within the nucleus. A set of Formin-Binding
Proteins (FBPs) has been shown to bind FH1 with their WW
domain.
Length = 392
Score = 30.4 bits (69), Expect = 5.0
Identities = 11/77 (14%), Positives = 31/77 (40%), Gaps = 6/77 (7%)
Query: 892 FILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
F F + K ++ KK+E E+ R+KK ++ + ++ + + K++
Sbjct: 320 FKDFNEFLKEFSKA------AEENIKKEEEEEERRKKLVKETTEYEQSSSRQKERNPSMD 373
Query: 952 GKKKKKKKKKKKKKKKK 968
+ ++ ++
Sbjct: 374 FEVERDFLGVLDSLLEE 390
>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 153
Score = 29.4 bits (66), Expect = 5.0
Identities = 17/65 (26%), Positives = 26/65 (40%), Gaps = 3/65 (4%)
Query: 134 APIPDATHVVVKVP---VEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTLESLNEK 190
+ + + P V APTP P AP PA E+ ++ PG + L ++
Sbjct: 46 SAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASAGENVVTAPMPGKILRILVREGQQ 105
Query: 191 EKQLQ 195
K Q
Sbjct: 106 VKVGQ 110
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 30.4 bits (69), Expect = 5.1
Identities = 13/63 (20%), Positives = 34/63 (53%), Gaps = 6/63 (9%)
Query: 915 KKKKKEAEKMRKK-----KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
++++ E + +RK+ K K+K K+DA ++ + +++ K+ +K+ + ++ K
Sbjct: 43 RQRQFELDSLRKEFNKLNKEVAKLKIAKEDATELIAE-TKELKKEITEKEAEVQEAKAAL 101
Query: 970 KKK 972
K
Sbjct: 102 DAK 104
>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family. Members of this family are
short proteins that are rich in aspartate, glutamate,
lysine and arginine. Although the function of these
proteins is unknown, they are found to be ubiquitously
expressed.
Length = 38
Score = 26.8 bits (60), Expect = 5.1
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
G ++ +++ KK KKKKK K + K A K
Sbjct: 4 GNQREQARERNAKKAAKKKKKGAKSQLKAAAKAL 37
>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region. The
defective chorion-1 gene (dec-1) in Drosophila encodes
follicle cell proteins necessary for proper eggshell
assembly. Multiple products of the dec-1 gene are formed
by alternative RNA splicing and proteolytic processing.
Cleavage products include S80 (80 kDa) which is
incorporated into the eggshell, and further proteolysis
of S80 gives S60 (60 kDa).
Length = 407
Score = 30.5 bits (68), Expect = 5.1
Identities = 17/57 (29%), Positives = 21/57 (36%), Gaps = 1/57 (1%)
Query: 123 LASVLQELKTCAPIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGP 179
+ S L +L +P VP AP PA AP PA T + P P
Sbjct: 86 VGSFLAQLPAMPSMPGLLGAAAPVP-APAPAPAAAPPAAPAPAADTPAAPIPDAVQP 141
>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
Provisional.
Length = 418
Score = 30.4 bits (69), Expect = 5.1
Identities = 11/57 (19%), Positives = 23/57 (40%), Gaps = 2/57 (3%)
Query: 147 PVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTLESLNEKEKQLQARITKLSK 203
P APTP P PTP + + P + + +E ++ ++++ +
Sbjct: 145 PKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTPVARADPRETRV--PMSRMRQ 199
>gnl|CDD|191382 pfam05837, CENP-H, Centromere protein H (CENP-H). This family
consists of several eukaryotic centromere protein H
(CENP-H) sequences. Macromolecular
centromere-kinetochore complex plays a critical role in
sister chromatid separation, but its complete protein
composition as well as its precise dynamic function
during mitosis has not yet been clearly determined.
CENP-H contains a coiled-coil structure and a nuclear
localisation signal. CENP-H is specifically and
constitutively localised in kinetochores throughout the
cell cycle. CENP-H may play a role in kinetochore
organisation and function throughout the cell cycle.
This the C-terminus of the region, which is conserved
from fungi to humans.
Length = 106
Score = 28.8 bits (65), Expect = 5.1
Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 6/58 (10%)
Query: 909 EKIRKKKK--KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
+ K++ K+K E + E KKK+ E M+ K E++ K + KK K K
Sbjct: 20 SDLEKERLQLKQKNVELALELL--ELTKKKESWREDMELK--EQLEKLEADLKKSKAK 73
>gnl|CDD|220764 pfam10454, DUF2458, Protein of unknown function (DUF2458). This a
is family of uncharacterized proteins.
Length = 155
Score = 29.5 bits (66), Expect = 5.1
Identities = 9/48 (18%), Positives = 21/48 (43%)
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
++ ++ E+ K E + +K++ +K+ KKK K +
Sbjct: 30 RRLIIEQHRHERQWWKEREALIQKQEARKEDKKKLLPNGTDSKLSSVG 77
>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160. This
family consists of several special lobe-specific silk
protein SSP160 sequences which appear to be specific to
Chironomus (Midge) species.
Length = 758
Score = 30.5 bits (68), Expect = 5.2
Identities = 15/42 (35%), Positives = 22/42 (52%)
Query: 928 KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
K EK K+K EKM+ K +G + K KK+ + + KK
Sbjct: 379 KSGEKKVKRKARLEKMRAKCRRAVGNRMKGSMKKRVRSRAKK 420
>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats. This family includes multiple
HEAT repeats.
Length = 88
Score = 28.1 bits (63), Expect = 5.3
Identities = 25/95 (26%), Positives = 33/95 (34%), Gaps = 13/95 (13%)
Query: 802 DDLVRDLCEEKDAILRRSGMYTIAMAYCGSGSNVATKRLLH-----DAILRRSGMYTIAM 856
+ L+ L + D +R A A G A LL D +RR A
Sbjct: 2 EALLEALLSDPDPEVRA----AAARALGELGDPEALPALLELLKDPDPEVRR----AAAE 53
Query: 857 AYCGSGSNVATKRLLHVAVSDVNDDVRREAVTALG 891
A G A LL + D + VR A +AL
Sbjct: 54 ALGKLGDPEALPALLELLQDDDDAVVRAAAASALA 88
>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
protein. This domain family is found in eukaryotes, and
is typically between 212 and 238 amino acids in length.
The family is found in association with pfam01805. There
are two completely conserved residues (W and H) that may
be functionally important. PRP21 is required for
assembly of the prespliceosome and it interacts with U2
snRNP and/or pre-mRNA in the prespliceosome. This family
also contains proteins similar to PRP21, such as the
mammalian SF3a. SF3a also interacts with U2 snRNP from
the prespliceosome, converting it to its active form.
Length = 230
Score = 30.1 bits (68), Expect = 5.4
Identities = 20/84 (23%), Positives = 33/84 (39%), Gaps = 6/84 (7%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK------KGAEKMKKKKKDAEKMKKKGAEK 950
R+ +K K E+ ++E E ++ K E KK ++MK A +
Sbjct: 85 IYRSLEQKSKALEEPDAAAAPEEEEEMEEEEEDIDEEKKEESAKKSTPPIKEMKIPAAGE 144
Query: 951 IGKKKKKKKKKKKKKKKKKKKKKK 974
K +K K + +K KKK
Sbjct: 145 SRLKIRKDYVPSGNKPRAEKPKKK 168
>gnl|CDD|227313 COG4980, GvpP, Gas vesicle protein [General function prediction
only].
Length = 115
Score = 28.9 bits (65), Expect = 5.5
Identities = 26/103 (25%), Positives = 45/103 (43%), Gaps = 10/103 (9%)
Query: 886 AVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK 945
A AL F K R+K KK + + + + K M K K+ AE +K
Sbjct: 19 AAAALLFAPKSGKELRKKLKKSGDALFELAEDKGTDILMIADK------LSKESAETLKD 72
Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+G G+ K+ KK K+ + + ++ K E E +++ + K
Sbjct: 73 QG----GEIKESVKKWKEDIQPEIERLKSEIEDLQEAISDETK 111
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 29.6 bits (67), Expect = 5.5
Identities = 11/22 (50%), Positives = 14/22 (63%)
Query: 952 GKKKKKKKKKKKKKKKKKKKKK 973
KK K K+ +K + KKKKKK
Sbjct: 84 EVSKKAKGKRSRKNQTKKKKKK 105
Score = 29.6 bits (67), Expect = 5.9
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 951 IGKKKKKKKKKKKKKKKKKKK 971
+ KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 28.8 bits (65), Expect = 8.2
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 954 KKKKKKKKKKKKKKKKKKKKK 974
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
production and conversion].
Length = 194
Score = 29.6 bits (67), Expect = 5.5
Identities = 12/80 (15%), Positives = 40/80 (50%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
+KI ++ +++ E ++ AEK+K++ K + + + +K+ ++++++
Sbjct: 9 KKILREAEEEAEEILEEAREEAEKIKEEAKREAEEAIEEILRKAEKEAERERQRIISSAL 68
Query: 969 KKKKKKEAEKMKKKTEEAEK 988
+ ++K E ++ E +
Sbjct: 69 LEARRKLLEAKEEILESVFE 88
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 30.1 bits (68), Expect = 5.6
Identities = 13/81 (16%), Positives = 47/81 (58%), Gaps = 2/81 (2%)
Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK--KKGAEKIGKKKKKKKKKKKKKK 966
+++ ++ ++ +E + +K+ E + + EK+K ++ A+ + KK + +++ ++ +
Sbjct: 8 QELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLE 67
Query: 967 KKKKKKKKEAEKMKKKTEEAE 987
++ ++E E+++ + +EA
Sbjct: 68 EEAAASEEERERLEAEVDEAT 88
>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein. The PcfJ-like protein family
includes the E. faecalis PcfJ protein, which is
functionally uncharacterized. It is found in bacteria
and viruses, and is typically between 159 and 170 amino
acids in length. There is a conserved HCV sequence
motif.
Length = 169
Score = 29.6 bits (67), Expect = 5.6
Identities = 10/32 (31%), Positives = 20/32 (62%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
+ KK+K+K +K +K+ + +K ++K K E
Sbjct: 38 YQAKKEKEKAEKMRKRYPELEKIFRELKSKFE 69
>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
peptidyl-prolyl isomerase. This family is only found at
the amino terminus of pfam00254. This domain is of
unknown function.
Length = 124
Score = 28.7 bits (65), Expect = 5.7
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
+ +KK A++ K +K AE+ K G + + KK+
Sbjct: 75 QAFQKKLQAKQQAKAEKKAEENKAAGEAFLAENAKKE 111
>gnl|CDD|212012 cd11622, HR1_PKN_1, First Protein kinase C-related kinase homology
region 1 (HR1) Rho-binding domain of Protein Kinase N.
PKN, also called Protein-kinase C-related kinase (PRK),
is a serine/threonine protein kinase that can be
activated by the small GTPase Rho, and by fatty acids
such as arachidonic and linoleic acids. It is involved
in many biological processes including cytoskeletal
regulation, cell adhesion, vesicle transport, glucose
transport, regulation of meiotic maturation and
embryonic cell cycles, signaling to the nucleus, and
tumorigenesis. In some vertebrates, there are three PKN
isoforms from different genes (designated PKN1, PKN2,
and PKN3), which show different enzymatic properties,
tissue distribution, and varied functions. PKN proteins
contain three HR1 domains, a C2 domain, and a kinase
domain. This model characterizes the first HR1 domain of
PKN. HR1 domains are anti-parallel coiled-coil (ACC)
domains that bind small GTPases from the Rho family.
Length = 66
Score = 27.6 bits (62), Expect = 5.7
Identities = 16/41 (39%), Positives = 21/41 (51%), Gaps = 3/41 (7%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
IRK+ K K+ AE +RK A KK E + KK K+
Sbjct: 15 IRKELKIKEGAENLRK---ATTDKKSLAHVESILKKSNRKL 52
>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
function prediction only].
Length = 386
Score = 30.2 bits (68), Expect = 5.8
Identities = 13/39 (33%), Positives = 18/39 (46%), Gaps = 5/39 (12%)
Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
DA K + +K+K +K K KK KK +AE
Sbjct: 348 DAYPKKAEA-----ARKRKGDRKGVSHKAKKGGKKNQAE 381
>gnl|CDD|216061 pfam00687, Ribosomal_L1, Ribosomal protein L1p/L10e family. This
family includes prokaryotic L1 and eukaryotic L10.
Length = 200
Score = 29.5 bits (67), Expect = 5.9
Identities = 13/38 (34%), Positives = 21/38 (55%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
K + A++ K GA+ +G + K+K K KK +K K
Sbjct: 54 KDPEAKAKEAKDAGADVVGGEDLKEKYKIKKGRKLAKD 91
>gnl|CDD|153370 cd07686, F-BAR_Fer, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Fer (Fes related)
tyrosine kinase. F-BAR domains are dimerization modules
that bind and bend membranes and are found in proteins
involved in membrane dynamics and actin reorganization.
Fer (Fes related) is a cytoplasmic (or nonreceptor)
tyrosine kinase expressed in a wide variety of tissues,
and is found to reside in both the cytoplasm and the
nucleus. It plays important roles in neuronal
polarization and neurite development, cytoskeletal
reorganization, cell migration, growth factor signaling,
and the regulation of cell-cell interactions mediated by
adherens junctions and focal adhesions. Fer kinase also
regulates cell cycle progression in malignant cells. It
contains an N-terminal F-BAR domain, an SH2 domain, and
a C-terminal catalytic kinase domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 234
Score = 29.6 bits (66), Expect = 5.9
Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 5/76 (6%)
Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
I+ K++ KK + ++ AE K K + EK+K + K+ K+K K
Sbjct: 100 IKDKQQVKKSYIGVHQQIEAEMYKVTKTELEKLKCSYRQLT-----KEVNSAKEKYKDAV 154
Query: 971 KKKKEAEKMKKKTEEA 986
K KE EK +++ ++A
Sbjct: 155 AKGKETEKARERYDKA 170
>gnl|CDD|225074 COG2163, RPL14A, Ribosomal protein L14E/L6E/L27E [Translation,
ribosomal structure and biogenesis].
Length = 125
Score = 28.9 bits (65), Expect = 5.9
Identities = 12/80 (15%), Positives = 33/80 (41%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
++R K + +K+ + + + + + A + K K A+ ++ K +
Sbjct: 45 VPRRRINIKHLEPTDKVIDLVRGASDEKVKKANEAAGVLAKLDKSAKNLETKKVREPLTD 104
Query: 955 KKKKKKKKKKKKKKKKKKKK 974
++ K K K+++ + K
Sbjct: 105 AERFKVMKLVKEERLQVGKA 124
>gnl|CDD|191868 pfam07839, CaM_binding, Plant calmodulin-binding domain. The
sequences featured in this family are found repeated in
a number of plant calmodulin-binding proteins, and are
thought to constitute the calmodulin-binding domains.
Binding of the proteins to calmodulin depends on the
presence of calcium ions. These proteins are thought to
be involved in various processes, such as plant defence
responses and stolonisation or tuberization.
Length = 115
Score = 28.5 bits (64), Expect = 6.0
Identities = 11/40 (27%), Positives = 21/40 (52%)
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+K ++ KK+ K +K+K K K+ +K+ + EK
Sbjct: 4 KKEERRSKKEDPKPEKEKPKSWSNLKKVILLKRFVKSLEK 43
>gnl|CDD|235516 PRK05582, PRK05582, DNA topoisomerase I; Validated.
Length = 650
Score = 30.2 bits (69), Expect = 6.0
Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 9/62 (14%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
KK KKK EA K +K++ K ++ ++I + KKK K K KKK++K+
Sbjct: 198 KKGKKKFEASFYGYKG--KKIELKNEED-------VKEILAELKKKDFKVSKVKKKERKR 248
Query: 973 KK 974
Sbjct: 249 NP 250
>gnl|CDD|236658 PRK10153, PRK10153, DNA-binding transcriptional activator CadC;
Provisional.
Length = 517
Score = 30.4 bits (69), Expect = 6.0
Identities = 15/76 (19%), Positives = 22/76 (28%), Gaps = 19/76 (25%)
Query: 137 PDATHVVVKVPVEG-------------------APTPATPDTEAPTPAPGTESGPSSTTP 177
D +V VP G A +P P A +ES +
Sbjct: 84 EDNPEYIVTVPKRGYKLVAPVIWYSEEEGEEILASSPPPIPEAVPATAEPSESANAQFPA 143
Query: 178 GPETKTLESLNEKEKQ 193
P T+ + K K+
Sbjct: 144 PPLTRAVCQSPAKSKR 159
>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain.
Length = 225
Score = 29.7 bits (67), Expect = 6.0
Identities = 13/64 (20%), Positives = 32/64 (50%), Gaps = 10/64 (15%)
Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKK----------KKKKKKKKKKKKEAEKMKKKTE 984
++++D ++ A + K++ +K +K +KK+K K + +KEAE+ + +
Sbjct: 118 RERRDLRIEERDEASEKKKEELIEKAQKEIDDFYENYNEKKEKTKAQNRKEAEQFLAERD 177
Query: 985 EAEK 988
+
Sbjct: 178 DFSS 181
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 29.4 bits (67), Expect = 6.1
Identities = 30/122 (24%), Positives = 60/122 (49%), Gaps = 23/122 (18%)
Query: 889 ALGFI---LFRKKRTRRKKKKGAEKIRKK--KKKKKEAEKMRKKK---GAEKMKKKKKDA 940
+GF RK K AE+ K+ ++ KKEAE ++K+ E++ K + +A
Sbjct: 11 VVGFAIGYFVRKSIAE-AKISSAEEEAKQIIEEAKKEAEALKKEALLEAKEEIHKLRAEA 69
Query: 941 EKMKK------KGAEK--------IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
E+ K + EK + +K + +KK++ ++K+K+ +++++K EE
Sbjct: 70 ERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEESLEEKEKELAARQQQLEEKEEEL 129
Query: 987 EK 988
E+
Sbjct: 130 EE 131
>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the heterodimeric topo I from
Leishmania donvanni. Topo I enzymes are divided into:
topo type IA (bacterial) and type IB (eukaryotic). Topo
I relaxes superhelical tension in duplex DNA by creating
a single-strand nick, the broken strand can then rotate
around the unbroken strand to remove DNA supercoils and,
the nick is religated, liberating topo I. These enzymes
regulate the topological changes that accompany DNA
replication, transcription and other nuclear processes.
Human topo I is the target of a diverse set of
anticancer drugs including camptothecins (CPTs). CPTs
bind to the topo I-DNA complex and inhibit re-ligation
of the single-strand nick, resulting in the accumulation
of topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topo I play putative roles in
organizing the kinetoplast DNA network unique to these
parasites. This family may represent more than one
structural domain.
Length = 212
Score = 29.8 bits (67), Expect = 6.2
Identities = 13/28 (46%), Positives = 20/28 (71%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
+K+KKK + K++KK K++K KEAE
Sbjct: 94 EKEKKKSRTKEEKKALKEEKDKEAEPYM 121
>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
Length = 160
Score = 29.2 bits (66), Expect = 6.2
Identities = 21/99 (21%), Positives = 39/99 (39%), Gaps = 20/99 (20%)
Query: 889 ALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
A+G I+ ++ R R +K+ +I E +RK + E K+ +
Sbjct: 71 AVGVIVNKRVRGRIVRKRIHVRI----------EHVRKSRCREDFLKRVAANDA------ 114
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
K + KKK + K++ + + A +K K E
Sbjct: 115 ----IKAEAKKKGELPSTKRQPAQPRPAGFVKPKPTEVV 149
>gnl|CDD|215494 PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic
subunit.
Length = 1044
Score = 30.3 bits (68), Expect = 6.2
Identities = 16/55 (29%), Positives = 25/55 (45%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
+ K K ++K KK +G KK+KKK KK ++ + EE E+
Sbjct: 608 RGKSKKSKKGKKGRRSLLGGLKKRKKKGGGGGSMMGKKYGRKKSQAVFDLEEIEE 662
>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
(Spore_III_AF). This family represents the stage III
sporulation protein AF (Spore_III_AF) of the bacterial
endospore formation program, which exists in some but
not all members of the Firmicutes (formerly called
low-GC Gram-positives). The C-terminal region of these
proteins is poorly conserved.
Length = 185
Score = 29.6 bits (67), Expect = 6.3
Identities = 19/88 (21%), Positives = 45/88 (51%), Gaps = 10/88 (11%)
Query: 908 AEKIRKK-KKKKKEAEKMRKKK----GAEKMKKKKKDAEKMK---KKGAEKIGKKKKKKK 959
A+++ K+ +KK KE ++ K E ++ D +++ K+ +++ K K +
Sbjct: 85 AKQLEKQVEKKLKEEYGVKVKDVEVEIDEDLESNNFDIKEVNVTLKEESKEKQKSKVEPV 144
Query: 960 --KKKKKKKKKKKKKKKEAEKMKKKTEE 985
+ K K+++++ +EAEK+K +
Sbjct: 145 VIDTQTSKPKEEEEESEEAEKIKNFLAD 172
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 30.1 bits (68), Expect = 6.3
Identities = 16/86 (18%), Positives = 37/86 (43%), Gaps = 5/86 (5%)
Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK----- 962
AE+I K K+ +R + AE+ + +E+ ++ +++KK +
Sbjct: 160 AERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSEL 219
Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
+KK ++ + ++K + AE
Sbjct: 220 SADQKKLEELRANESRLKNEIASAEA 245
Score = 30.1 bits (68), Expect = 6.4
Identities = 10/93 (10%), Positives = 30/93 (32%), Gaps = 4/93 (4%)
Query: 896 RKKRTRRKKKKGAEKI----RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
+ ++ ++ RKK + +E +K E+++ + +
Sbjct: 188 TTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAA 247
Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
K ++ + + + + K + K T
Sbjct: 248 AKAREAAAAAEAAAARARAAEAKRTGETYKPTA 280
>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
homology Domain (IMD), a dimerization module that binds
and bends membranes. Inverse (I)-BAR (or IMD) is a
member of the Bin/Amphiphysin/Rvs (BAR) domain family.
It is a dimerization and lipid-binding module that bends
membranes and induces membrane protrusions in the
opposite direction compared to classical BAR and F-BAR
domains, which produce membrane invaginations. IMD
domains are found in Insulin Receptor tyrosine kinase
Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
Brain-specific Angiogenesis Inhibitor 1-Associated
Protein 2-like (BAIAP2L) proteins. These are
multi-domain proteins that act as scaffolding proteins
and transducers of a variety of signaling pathways that
link membrane dynamics and the underlying actin
cytoskeleton. Most members contain an N-terminal IMD, an
SH3 domain, and a WASP homology 2 (WH2) actin-binding
motif at the C-terminus, exccept for MIM which does not
carry an SH3 domain. Some members contain additional
domains and motifs. The IMD domain binds and bundles
actin filaments, binds membranes and produces membrane
protrusions, and interacts with the small GTPase Rac.
Length = 223
Score = 29.6 bits (67), Expect = 6.7
Identities = 17/62 (27%), Positives = 30/62 (48%), Gaps = 2/62 (3%)
Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
+ + + K +KD +K K+ E + K + + KK +KK +K K EK+ + EE
Sbjct: 104 ELDQKVINKFEKDYKKEYKQKREDL--DKARSELKKLQKKSQKSGTGKYQEKLDQALEEL 161
Query: 987 EK 988
Sbjct: 162 ND 163
>gnl|CDD|223747 COG0675, COG0675, Transposase and inactivated derivatives [DNA
replication, recombination, and repair].
Length = 364
Score = 30.0 bits (67), Expect = 6.7
Identities = 11/65 (16%), Positives = 25/65 (38%)
Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
+K+ +R+K + K K K + ++K K+ KK + + +
Sbjct: 212 AQKRLSRKKSRSKRRKKAKLKLARLRERIRNRRKDFHKLAKKLVELADLVGVETLVVEDL 271
Query: 955 KKKKK 959
K++
Sbjct: 272 VKRRS 276
>gnl|CDD|214787 smart00721, BAR, BAR domain.
Length = 239
Score = 29.7 bits (67), Expect = 6.7
Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 8/46 (17%)
Query: 951 IGKKKKKKKKKK--------KKKKKKKKKKKKEAEKMKKKTEEAEK 988
I K +KK ++K K KK KK K+KK+ EK+ K EE K
Sbjct: 136 IKKARKKLERKLLDYDSARHKLKKAKKSKEKKKDEKLAKAEEELRK 181
>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897). This is
a bacterial family of uncharacterized proteins.
Length = 55
Score = 26.8 bits (60), Expect = 6.8
Identities = 8/26 (30%), Positives = 14/26 (53%)
Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKE 975
K + K+KKKK K+ ++ K +
Sbjct: 30 KFPQFDKQKKKKAKRNAEQDAKNSAD 55
>gnl|CDD|233309 TIGR01200, GLPGLI, GLPGLI family protein. This protein family was
first noted as a paralogous set in Porphyromonas
gingivalis, but it is more widely distributed among the
Bacteroidetes. The protein family is now renamed GLPGLI
after its best-conserved motif.
Length = 227
Score = 29.6 bits (67), Expect = 6.8
Identities = 11/35 (31%), Positives = 18/35 (51%), Gaps = 1/35 (2%)
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
A KI + K KK KK +++ +K+K+K
Sbjct: 192 AIKI-ELNPTPIIIIPKPKKGKKISREKFKKIKEK 225
>gnl|CDD|233048 TIGR00605, rad4, DNA repair protein rad4. All proteins in this
family for which functions are known are involved in
targeting nucleotide excision repair to specific regions
of the genome.This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 713
Score = 30.2 bits (68), Expect = 6.9
Identities = 14/84 (16%), Positives = 31/84 (36%), Gaps = 4/84 (4%)
Query: 897 KKRTRRKKKKGAEKIRK--KKKKKKEAEK--MRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
++ + EK+R +K E + + + +K K + +KG K
Sbjct: 172 PLKSAKLSNLIPEKVRLLLHPSVRKSEELPSRSLRGLRKPLVEKLKKCMETWQKGLRKTT 231
Query: 953 KKKKKKKKKKKKKKKKKKKKKKEA 976
K K + + K ++ +K +
Sbjct: 232 KGLLKLLNGGRYSRSKWEEIEKSS 255
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 30.0 bits (67), Expect = 6.9
Identities = 15/38 (39%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK 934
+KR RRK+K + RK +K++ E + KGAE+M+
Sbjct: 388 QKRQRRKEKFWRQHCRKARKEQAE-RRPIPGKGAERMR 424
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 30.3 bits (68), Expect = 7.0
Identities = 15/32 (46%), Positives = 19/32 (59%)
Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K + + K KK KKKK KKKEA K + E+
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQIPPEQ 146
>gnl|CDD|204272 pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4). CRISPR
(Clustered Regularly Interspaced Short Palindromic
Repeats) is a widespread family of prokaryotic direct
repeats with spacers of unique sequence between
consecutive repeats. This protein family, typified by
YPO2462 of Yersinia pestis, is a CRISPR-associated (Cas)
family strictly associated with the Ypest subtype of
CRISPR/Cas locus. It is designated Csy4, for CRISPR/Cas
Subtype Ypest protein 4.
Length = 182
Score = 29.1 bits (66), Expect = 7.3
Identities = 14/58 (24%), Positives = 22/58 (37%), Gaps = 6/58 (10%)
Query: 872 HVAVSDVN---DDVRREAVTALGFILFRKKRTRRKKKKG---AEKIRKKKKKKKEAEK 923
+ +SDV DDV V+ + + RR K+ E+ R + K E
Sbjct: 77 YFHISDVLPVPDDVEYVTVSRVQAKSSPARLRRRLMKRAGLTEEEARARIPKSLEKRL 134
>gnl|CDD|100102 cd00387, Ribosomal_L7_L12, Ribosomal protein L7/L12. Ribosomal
protein L7/L12 refers to the large ribosomal subunit
proteins L7 and L12, which are identical except that L7
is acetylated at the N terminus. It is a component of
the L7/L12 stalk, which is located at the surface of the
ribosome. The stalk base consists of a portion of the
23S rRNA and ribosomal proteins L11 and L10. An extended
C-terminal helix of L10 provides the binding site for
L7/L12. L7/L12 consists of two domains joined by a
flexible hinge, with the helical N-terminal domain (NTD)
forming pairs of homodimers that bind to the extended
helix of L10. It is the only multimeric ribosomal
component, with either four or six copies per ribosome
that occur as two or three dimers bound to the L10
helix. L7/L12 is the only ribosomal protein that does
not interact directly with rRNA, but instead has
indirect interactions through L10. The globular
C-terminal domains of L7/L12 are highly mobile. They are
exposed to the cytoplasm and contain binding sites for
other molecules. Initiation factors, elongation factors,
and release factors are known to interact with the
L7/L12 stalk during their GTP-dependent cycles. The
binding site for the factors EF-Tu and EF-G comprises
L7/L12, L10, L11, the L11-binding region of 23S rRNA,
and the sarcin-ricin loop of 23S rRNA. Removal of L7/L12
has minimal effect on factor binding and it has been
proposed that L7/L12 induces the catalytically active
conformation of EF-Tu and EF-G, thereby stimulating the
GTPase activity of both factors. In eukaryotes, the
proteins that perform the equivalent function to L7/L12
are called P1 and P2, which do not share sequence
similarity with L7/L12. However, a bacterial L7/L12
homolog is found in some eukaryotes, in mitochondria and
chloroplasts. In archaea, the protein equivalent to
L7/L12 is called aL12 or L12p, but it is closer in
sequence to P1 and P2 than to L7/L12.
Length = 127
Score = 28.7 bits (65), Expect = 7.4
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 968 KKKKKKKEAEKMKKKTEEA 986
K+ K+EAE++KKK EEA
Sbjct: 103 KEGVSKEEAEEIKKKLEEA 121
>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
Length = 390
Score = 29.9 bits (68), Expect = 7.5
Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 7/57 (12%)
Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK--KK-----KKKKEAEKMKKKTEEAE 987
KD E ++K+ E K+KKKKKKK++K + KK ++ K +E E
Sbjct: 154 KDLEFVEKRLDELTKKRKKKKKKKEEKVELDVLKKVLEWLEEGKPVRDGDWTDKEIE 210
>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9. CRISPR
(Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; Very large protein
containing McrA/HNH-nuclease related domain and a
RuvC-like nuclease domain; signature gene for type II.
Length = 799
Score = 30.1 bits (68), Expect = 7.5
Identities = 13/58 (22%), Positives = 31/58 (53%), Gaps = 2/58 (3%)
Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
E+ + K +++ K++ + KK K K+K+K K+++ + ++ + +AEK
Sbjct: 302 EEQGETKILSKEEKQELLD--LLFKKNKLTYKQKRKLLGLKEEEIFKGLRYEGLKAEK 357
>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201). This is
a family of coiled-coil proteins from eukaryotes. The
function is not known.
Length = 177
Score = 29.1 bits (66), Expect = 7.5
Identities = 22/88 (25%), Positives = 45/88 (51%), Gaps = 8/88 (9%)
Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
R KK+ EK + K +KE K++ ++ +K++K+ K K++++G
Sbjct: 94 ERLKKELKEKEERLAKLRKELYKVKLER--DKLRKQNK---KLRQQGGL---LHVPALLY 145
Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
K +K ++K+K EK+++K + E
Sbjct: 146 DYDKTVEKVEEKRKSVEKLRRKHKILEM 173
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 29.7 bits (67), Expect = 7.6
Identities = 9/59 (15%), Positives = 28/59 (47%)
Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
+ + ++ ++ K++ K K++K++ + +K+ K+ E++K+K
Sbjct: 331 ATRFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEELKEK 389
>gnl|CDD|227187 COG4850, COG4850, Uncharacterized conserved protein [Function
unknown].
Length = 373
Score = 29.8 bits (67), Expect = 7.6
Identities = 14/39 (35%), Positives = 19/39 (48%), Gaps = 2/39 (5%)
Query: 482 VSDVNDDVRREAVTALGFILFRTLFYVAPACTRSPWPGV 520
+SD++D V+ VT R+L A TR PGV
Sbjct: 165 ISDIDDTVKVTGVTEGPRKAGRSLL--LHALTRQVIPGV 201
>gnl|CDD|204658 pfam11477, PM0188, Sialyltransferase PMO188. PMO188 is a
sialyltransferase from P.multocida. It transfers sialic
acid from cytidine 5'-monophosphonuraminic acid to an
acceptor sugar. It has important catalytic residues such
as Asp141, His311, Glu338, Ser355 and Ser356.
Length = 381
Score = 29.8 bits (67), Expect = 7.6
Identities = 18/54 (33%), Positives = 20/54 (37%), Gaps = 2/54 (3%)
Query: 294 EALAHMQSYLPRE-TGPTSGYYEGGGLYALGLIHANHGAVINDYLLGQVKDATN 346
E A Q L E P S +Y G H N G IND +L D N
Sbjct: 258 EYYAQQQINLLNEAQNPHSPHYIGNNYDLFFKGHPN-GGAINDIILLHFPDMIN 310
Score = 29.8 bits (67), Expect = 7.6
Identities = 18/54 (33%), Positives = 20/54 (37%), Gaps = 2/54 (3%)
Query: 656 EALAHMQSYLPRE-TGPTSGYYEGGGLYALGLIHANHGAVINDYLLGQVKDATN 708
E A Q L E P S +Y G H N G IND +L D N
Sbjct: 258 EYYAQQQINLLNEAQNPHSPHYIGNNYDLFFKGHPN-GGAINDIILLHFPDMIN 310
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 30.2 bits (68), Expect = 7.6
Identities = 8/53 (15%), Positives = 11/53 (20%), Gaps = 1/53 (1%)
Query: 134 APIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTLES 186
A D + P G+ S P+S P
Sbjct: 443 ADGDAPVPAKANARASADSRCDERDAQ-PPADSGSASAPASDAPPDAAFEPAP 494
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 29.1 bits (66), Expect = 7.6
Identities = 11/49 (22%), Positives = 26/49 (53%), Gaps = 3/49 (6%)
Query: 909 EKIRKKKKKKKEAEK--MRKKKGAEKMKKK-KKDAEKMKKKGAEKIGKK 954
E++ + +++ +E + +R E +K+ +++ E+ KK EK K
Sbjct: 18 EELEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAKKYAIEKFAKD 66
>gnl|CDD|227463 COG5134, COG5134, Uncharacterized conserved protein [Function
unknown].
Length = 272
Score = 29.7 bits (66), Expect = 7.7
Identities = 13/88 (14%), Positives = 34/88 (38%), Gaps = 5/88 (5%)
Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
K + + + EK ++ + K + + +++ ++
Sbjct: 118 DPAKAENVEKVPESDAIEALEKQLTQQ--KSEKHNSSAINFIDELN-KRLWSDPFVSSQR 174
Query: 962 KKKKKKKKKK--KKKEAEKMKKKTEEAE 987
+K+ +++KK KK+EA+ + K A
Sbjct: 175 LRKQFRERKKIEKKQEAKDLSLKNRAAL 202
>gnl|CDD|236295 PRK08570, rpl19e, 50S ribosomal protein L19e; Reviewed.
Length = 150
Score = 28.7 bits (65), Expect = 7.8
Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 10/72 (13%)
Query: 874 AVSDVNDDVRREAVTAL---GFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA 930
A+ DV + + RE + L G I + K KKG + R +++ +K + R+ G+
Sbjct: 28 ALEDVAEAITREDIRELIKEGVI-------KAKPKKGISRGRARERHEKRKKGRRRGPGS 80
Query: 931 EKMKKKKKDAEK 942
K KK + +K
Sbjct: 81 RKGKKGARTPKK 92
>gnl|CDD|152325 pfam11889, DUF3409, Domain of unknown function (DUF3409). This
domain is functionally uncharacterized. This domain is
found in viruses. This presumed domain is about 60 amino
acids in length. This domain is found associated with
pfam00271, pfam05550, pfam05578.
Length = 56
Score = 27.1 bits (60), Expect = 7.8
Identities = 22/58 (37%), Positives = 29/58 (50%), Gaps = 14/58 (24%)
Query: 928 KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK-----------KKKKKKKKKKK 974
+GA+K K+KK D KKGA KI K+ +K K K K + KKK K +
Sbjct: 2 EGAKKKKQKKPD---RVKKGAMKITPKESEKDSKTKPPDATIVVEGVKYQVKKKGKVR 56
>gnl|CDD|225281 COG2425, COG2425, Uncharacterized protein containing a von
Willebrand factor type A (vWA) domain [General function
prediction only].
Length = 437
Score = 29.7 bits (67), Expect = 7.9
Identities = 14/97 (14%), Positives = 44/97 (45%), Gaps = 5/97 (5%)
Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
++ + +++G+E + +++ +EK + + +M+ ++ E++ + +
Sbjct: 113 EELLQDLQREGSEDFLEGEREGLLSEKQEEISLSGEMEGILEEVEELISGRLWDMSAGEL 172
Query: 957 KKKKKKKKKK-----KKKKKKKKEAEKMKKKTEEAEK 988
KK + K K + +K E++ ++ E E+
Sbjct: 173 KKGDLELLGKYADFLLKFPELRKILEQLGREDLEIEE 209
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 29.8 bits (68), Expect = 8.0
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 953 KKKKKKKKKKKKKKKKKKKKKKE 975
+K++ KKKKK+KK KK + +K E
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755
>gnl|CDD|225437 COG2882, FliJ, Flagellar biosynthesis chaperone [Cell motility and
secretion / Intracellular trafficking and secretion /
Posttranslational modification, protein turnover,
chaperones].
Length = 148
Score = 28.9 bits (65), Expect = 8.3
Identities = 16/105 (15%), Positives = 42/105 (40%), Gaps = 18/105 (17%)
Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAE-KMRKKKGA-----------------EKMKKKKKD 939
+ R +K+ E+++ + E E + +K + E +++
Sbjct: 26 SKIRSEKENAEEQLKMLSGYRNEYEQNLNEKLKSGVSAAQWQNYQQFISQLEVAIDQQQS 85
Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
+K E+ + ++K+ + K +K K+++K E + + E
Sbjct: 86 QLSKLRKQVEQKREIWQEKQIELKALEKLKERQKTEFLLEENRRE 130
>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2). Surfeit locus
protein 2 is part of a group of at least six sequence
unrelated genes (Surf-1 to Surf-6). The six Surfeit
genes have been classified as housekeeping genes, being
expressed in all tissue types tested and not containing
a TATA box in their promoter region. The exact function
of SURF2 is unknown.
Length = 244
Score = 29.5 bits (66), Expect = 8.5
Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 2/44 (4%)
Query: 939 DAEKMKKKGAEKIGKKKKKKKK--KKKKKKKKKKKKKKEAEKMK 980
D ++M+ + E K+ KK+ KK KK KKK +K
Sbjct: 201 DEDEMEVESPELQQKRSKKQSGSLTKKFKKNHKKKGPFRFSSVK 244
>gnl|CDD|152940 pfam12506, DUF3713, Protein of unknown function (DUF3713). This
family of proteins is found in bacteria. Proteins in
this family are typically between 92 and 1225 amino
acids in length. There is a single completely conserved
residue S that may be functionally important.
Length = 116
Score = 28.2 bits (63), Expect = 8.5
Identities = 16/53 (30%), Positives = 23/53 (43%)
Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
K K +++ K K+ K K K KK K + K K+ EK+K T
Sbjct: 12 KNKVGQEETNTSSDKSTNKLKTKDADVDKLKDKKTKVETKAKELVEKLKVTTA 64
>gnl|CDD|221190 pfam11727, ISG65-75, Invariant surface glycoprotein. This family
is found in Trypanosome species, and appears to be one
of two invariant surface glycoproteins, ISG65 and ISG75.
that are found in the mammalian stage of the parasitic
protozoan. the sequence suggests the two families are
polypeptides with N-terminal signal sequences,
hydrophilic extracellular domains, single trans-membrane
alpha-helices and short cytoplasmic domains. they are
both expressed in the bloodstream form but not in the
midgut stage. Both polypeptides are distributed over the
entire surface of the parasite.
Length = 286
Score = 29.4 bits (66), Expect = 8.7
Identities = 21/98 (21%), Positives = 35/98 (35%), Gaps = 5/98 (5%)
Query: 868 KRLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK 927
K L ++ D + ++ I K + K EK+ + K +K
Sbjct: 46 KDLADKVGNERADYLVKKTEGFAANI----KLALDRVKYYLEKLESRSDSKLSDSDKKKI 101
Query: 928 KGA-EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
K EK K K +A KK + K ++ K+ K
Sbjct: 102 KEICEKAKDKIDEALPEAKKAGREAEKLAEEVKEAAAK 139
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 29.9 bits (68), Expect = 8.8
Identities = 18/79 (22%), Positives = 33/79 (41%), Gaps = 8/79 (10%)
Query: 915 KKKKKEAEKMRKKKG----AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
AE G A ++ + +K+ EK++K+ ++I KK + K ++ +
Sbjct: 794 TAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEI-DRIEKKLSNEGFVAKAPEEVVE 852
Query: 971 K---KKKEAEKMKKKTEEA 986
K K E + K EE
Sbjct: 853 KEKEKLAEYQVKLAKLEER 871
>gnl|CDD|224192 COG1273, COG1273, Ku-homolog [Replication, recombination, and
repair].
Length = 278
Score = 29.6 bits (67), Expect = 8.8
Identities = 17/89 (19%), Positives = 31/89 (34%), Gaps = 27/89 (30%)
Query: 908 AEKIRKKKKKKKEAEKMR-------------KKKGAEKMKKKKKDA----------EKMK 944
A+++ KK + ++ K +G + + KK+ + +K
Sbjct: 194 AKQLIDKKTGTFDPDEYEDRYQEALMELIEAKLEGRKTVAAPKKEPPEPSNVIDLMDALK 253
Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
+ +KK KKK K KKK
Sbjct: 254 AS----LEARKKAGKKKPAAKPAAAAKKK 278
>gnl|CDD|234671 PRK00157, rplL, 50S ribosomal protein L7/L12; Reviewed.
Length = 123
Score = 28.2 bits (64), Expect = 8.9
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 968 KKKKKKKEAEKMKKKTEEA 986
K+ K+EAE++KKK EEA
Sbjct: 98 KEGVSKEEAEEIKKKLEEA 116
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 29.4 bits (67), Expect = 8.9
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKK 971
K E++ KK +K +K KKKKK++K
Sbjct: 185 KQTEELIKKLLEKPEKPKKKKKRRKG 210
>gnl|CDD|216599 pfam01603, B56, Protein phosphatase 2A regulatory B subunit (B56
family). Protein phosphatase 2A (PP2A) is a major
intracellular protein phosphatase that regulates
multiple aspects of cell growth and metabolism. The
ability of this widely distributed heterotrimeric enzyme
to act on a diverse array of substrates is largely
controlled by the nature of its regulatory B subunit.
There are multiple families of B subunits (See also
pfam01240), this family is called the B56 family.
Length = 405
Score = 29.6 bits (67), Expect = 9.1
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
++ G K K+K+ K+KKK+KK+KK +
Sbjct: 373 DECGNKYKEKEAKEKKKEKKRKKTWARIAE 402
>gnl|CDD|153367 cd07683, F-BAR_srGAP1, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Protein 1. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs. srGAP1, also called Rho
GTPase-Activating Protein 13 (ARHGAP13), is a Cdc42- and
RhoA-specific GAP and is expressed later in the
development of CNS (central nervous system) tissues. It
is an important downstream signaling molecule of Robo1.
srGAP1 contains an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 253
Score = 29.3 bits (65), Expect = 9.2
Identities = 17/64 (26%), Positives = 32/64 (50%), Gaps = 3/64 (4%)
Query: 915 KKKKKEAEKMRKK---KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
+ K KEAEK +K + + + + + ++ +KI K K+K++ K + K K K
Sbjct: 154 ESKLKEAEKQEEKQIGRSGDPVFHIRLEDRHQRRSSVKKIEKMKEKRQAKYSENKLKSIK 213
Query: 972 KKKE 975
+ E
Sbjct: 214 ARNE 217
>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
Length = 159
Score = 28.9 bits (65), Expect = 9.3
Identities = 19/83 (22%), Positives = 40/83 (48%), Gaps = 17/83 (20%)
Query: 908 AEKIRKKKKKKKEAEKMRKK-----KGAEKMKKK-----KKDAEKM-------KKKGAEK 950
KI K + +K+A +++ K K A++ KK K AE + + A+
Sbjct: 42 DNKIEKADEDQKKARELKLKNERELKNAKEEGKKIVEEYKSKAENVYEEIVKEAHEEADL 101
Query: 951 IGKKKKKKKKKKKKKKKKKKKKK 973
I ++ K + +++K+K + + K +
Sbjct: 102 IIERAKLEAQREKEKAEYEIKNQ 124
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 29.3 bits (66), Expect = 9.3
Identities = 9/28 (32%), Positives = 13/28 (46%)
Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
+KKK + KKK K KK + +
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLRID 174
>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 137 to 196
amino acids in length.
Length = 149
Score = 28.4 bits (64), Expect = 9.3
Identities = 13/42 (30%), Positives = 19/42 (45%)
Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
+K+ K +K K +K K K K+KK K K+ E
Sbjct: 5 RKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASE 46
>gnl|CDD|223436 COG0359, RplI, Ribosomal protein L9 [Translation, ribosomal
structure and biogenesis].
Length = 148
Score = 28.7 bits (65), Expect = 9.3
Identities = 11/42 (26%), Positives = 19/42 (45%)
Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
K K + ++ K +KK ++ EAE +K+K E
Sbjct: 33 KGLAVPATKGNLKLLEARRAKLEKKAAEELAEAEALKEKLEG 74
>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
Length = 111
Score = 28.0 bits (62), Expect = 9.5
Identities = 22/79 (27%), Positives = 36/79 (45%), Gaps = 4/79 (5%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK-- 970
K++ KE ++ ++ E + K K EK KK E + + +K+ +KK
Sbjct: 18 KEESIDKEIANIKNEQENEIKEAKSKYEEKAKKTEEESLNMYNAALMEARKEAEKKAVEI 77
Query: 971 --KKKKEAEKMKKKTEEAE 987
K K+EA +K K E E
Sbjct: 78 INKAKQEASLIKLKISEDE 96
>gnl|CDD|219226 pfam06911, Senescence, Senescence-associated protein. This family
contains a number of plant senescence-associated
proteins of approximately 450 residues in length. In
Hemerocallis, petals have a genetically based program
that leads to senescence and cell death approximately 24
hours after the flower opens, and it is believed that
senescence proteins produced around that time have a
role in this program. This family extends to the higher
vertebrates where the full-length protein is often a
Spartin, associated with mitochondrial membranes and
transportation along microtubules.
Length = 181
Score = 28.8 bits (65), Expect = 9.6
Identities = 17/78 (21%), Positives = 29/78 (37%), Gaps = 9/78 (11%)
Query: 905 KKGAEKIRKKKKKKKEAEKMRK--KKGAEKMKKKKKDAEKMKK-------KGAEKIGKKK 955
+ G E + K ++ ++ K+ ++KK K K+ K K A +G
Sbjct: 25 QSGGEYLISKISPTEKPVEVSPSTKERLRRVKKLSKATAKVSKSVLNGVGKVAGNVGSSL 84
Query: 956 KKKKKKKKKKKKKKKKKK 973
KKK K K +
Sbjct: 85 ANSVAGKKKGKDGKGEPN 102
>gnl|CDD|227429 COG5098, COG5098, Chromosome condensation complex Condensin, subunit
D2 [Chromatin structure and dynamics / Cell division and
chromosome partitioning].
Length = 1128
Score = 29.9 bits (67), Expect = 9.7
Identities = 30/185 (16%), Positives = 62/185 (33%), Gaps = 34/185 (18%)
Query: 364 LLGSKSESAIQDMVAYAKETQHEKILRGLAVGISFIMYGQ---LENADDLVRDLC----E 416
+G SE D + KE + ++YG+ L N +V + C
Sbjct: 861 RIGGTSEDDFVDAFFFIKEKE--------------LLYGEKSVLSNFKPVVEEGCSNSSR 906
Query: 417 EKDAILRRSGMYTIAMAYCVSYLYFDLFSDLTYRI--------VKNRSGMYTIAMAYCGS 468
D L+ + ++ C+S+ + L ++ + + C +
Sbjct: 907 FSDEELQVAAYLSLYKLMCLSFEFCSEHLPLLITSMEKHPIPRIRANAVVGLGDFLVCFN 966
Query: 469 GSNVATKRLLHVAVSDVNDDVRREAVTALGFILFRTLFYVAPACTRSPWPGVISLLDDPM 528
+ L+ + D + DVRR + + F++ V + + LL D
Sbjct: 967 TTADEHTHYLYRRLGDEDADVRRTCLMTIHFLILAGQLKV-----KGQLGKMALLLTDED 1021
Query: 529 IELKE 533
E+ +
Sbjct: 1022 AEISD 1026
>gnl|CDD|234874 PRK00955, PRK00955, hypothetical protein; Provisional.
Length = 620
Score = 29.5 bits (67), Expect = 9.7
Identities = 17/62 (27%), Positives = 24/62 (38%)
Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
+K + + KK G E + KK +K KK KK+KKK+
Sbjct: 559 RKPENYDLVYEALKKAGREDLIGNGPKCLIPPKKPKKKSNKKNNNNNYSSNNSKKRKKKR 618
Query: 973 KK 974
KK
Sbjct: 619 KK 620
>gnl|CDD|235602 PRK05776, PRK05776, DNA topoisomerase I; Provisional.
Length = 670
Score = 29.6 bits (67), Expect = 9.7
Identities = 16/63 (25%), Positives = 34/63 (53%), Gaps = 1/63 (1%)
Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
TR ++K E IR K ++E + K+ + +++ KK+ +++ ++ A+ +G K K
Sbjct: 540 TRDFEEK-LEMIRTGKATREEVIEEAKETLNKLLEEFKKNKDEIGEELAKALGLIKPVGK 598
Query: 960 KKK 962
K
Sbjct: 599 CKI 601
>gnl|CDD|225058 COG2147, RPL19A, Ribosomal protein L19E [Translation, ribosomal
structure and biogenesis].
Length = 150
Score = 28.4 bits (64), Expect = 9.8
Identities = 20/93 (21%), Positives = 41/93 (44%), Gaps = 14/93 (15%)
Query: 874 AVSDVNDDVRREAVTAL---GFIL---------FRKKRTRRKKKKGAEKIRKKKKKKKEA 921
+ ++ + RE + AL G I R ++ +KKKG + +K K A
Sbjct: 28 EIEEIASAITREDIRALIKDGVIKAKPKKGISRGRARKRHAQKKKGRRRGPGSRKGTKGA 87
Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
K++ ++++ +++ K++ G KI K
Sbjct: 88 RMPSKERWIKRIRALRRELRKLRDDG--KIDKH 118
>gnl|CDD|217451 pfam03249, TSA, Type specific antigen. There are several antigenic
variants in Rickettsia tsutsugamushi, and a
type-specific antigen (TSA) of 56-kilodaltons located on
the rickettsial surface is responsible for the
variation. TSA proteins are probably integral membrane
proteins.
Length = 502
Score = 29.5 bits (66), Expect = 9.8
Identities = 19/72 (26%), Positives = 33/72 (45%), Gaps = 5/72 (6%)
Query: 921 AEKMRKKKGAEKMKKKKKDAEKMK-----KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
A +R G E++ + KD K++ KK EK+ ++++ + KKK+ E
Sbjct: 331 AAAVRLLNGNEQIIQLYKDLVKLQRHAGIKKAMEKLAAQQEEDAGNQGGGDCKKKQGASE 390
Query: 976 AEKMKKKTEEAE 987
K K +E E
Sbjct: 391 KSKEGGKGKETE 402
>gnl|CDD|177553 PHA03185, PHA03185, UL14 tegument protein; Provisional.
Length = 214
Score = 29.2 bits (65), Expect = 10.0
Identities = 14/44 (31%), Positives = 19/44 (43%), Gaps = 1/44 (2%)
Query: 137 PDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPE 180
P + V +P GAP P TP PT P +ES + +
Sbjct: 172 PTSGPVAGGLPGGGAPAPPTPPRPGPTD-PASESDCGRDSRDGD 214
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.132 0.376
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 50,055,462
Number of extensions: 5135822
Number of successful extensions: 32922
Number of sequences better than 10.0: 1
Number of HSP's gapped: 20860
Number of HSP's successfully gapped: 2050
Length of query: 988
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 882
Effective length of database: 6,236,078
Effective search space: 5500220796
Effective search space used: 5500220796
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.5 bits)